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Preface 


Perhaps the hrst issue in introducing an Encyclopaedia of Biophysics is the question 
“What is Biophysics?”. The answer to this has varied considerably over time and 
between different ‘biophysicists’, particularly depending on whether they have come 
to biophysics from physics or physiology. For the present purposes, we have adopted 
the definition given by the Nobel Prize-winning physiologist A. V. Hill (Science 124, 
1233, 1956): Biophysics is “the study of biological function, organization, and 
structure by physical and physicochemical ideas and methods.” - to which one 
would now add methods of mathematical analysis and computer modelling. An 
important feature of Hill’s dehnition is that it includes both physical methods and 
physical ideas. The astonishing developments in physical methods over the last 
hundred years have made them ubiquitous in biological laboratories - and indeed in 
hospitals. However, to quote Hill again “the employment of physical instruments in 
a biological laboratory does not make one a biophysicist - otherwise any user of 

a microscope, a balance.or a pH meter would drop automatically into that class.” 

Beyond simply the use of physical methods, it is the combination of both physical and 
biological ideas, intuitions and experience that makes a Biophysicist. 

The application of physical methods in biology has a very long history. For 
example, Antonie van Leeuwenhoek (1632 -1723) developed improved microscope 
lenses which allowed him to be the hrst to observe and describe single-celled 
organisms - the beginning of microbiology. The 18th century saw much speculation 
about ‘animal electricity’, culminating in the 1780s in Luigi Galvani’s famous 
experiments on the relation of electricity to muscle contraction. The middle of the 
19“ century saw what was probably the hrst explicitly biophysical programme of 
research, in which a group of physiologists (du Bois-Reymond, Ludwig, von Brucke, 
von Helmholtz) proclaimed their intention to ‘reduce physiology to physics and 
chemistry’. Of course the state of physics at the time was such as to make Biophysics 
a very premature venture, but this group did make important discoveries, notably in 
physiological optics and electrophysiology, using physical methods. An associate of 
this group, Adolf Fick - well-known for his law of diffusion - published what is 
probably the hrst biophysics text. Die Medizinische Physik (1856). 

In the 20“ and 2L' centuries there has been a dramatic howering of Biophysics. 
The early part of 20“ century saw the development of physical tools which are now 
familiar in biology - from X-ray diffraction (von Laue, Bragg) to the ultracentrifuge 
(Svedberg) and the electron microscope (Knoll & Ruska) - and this methodological 
development continues apace, notably with single-molecule techniques. The 1920s 
and 1930s saw the beginnings of physicochemical (Cohn, Edsall, Linderstrpm-Lang) 
and structural (Astbury, Bernal, Hodgkin, Perutz) studies of proteins. At the same 
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time, the first Departments or Institutes of Biophysics began to be established. 
In Germany these were commonly focussed on the study of radiation effects on 
organisms, while in the USA they concentrated largely on physiology. Many of 
these later expanded into other areas of Biophysics, and many more Departments of 
Biophysics - and Biophysical Societies - were established in the USA, Europe 
and Israel in the 1940s and 1950s. The power of the Biophysical approach was 
demonstrated unequivocally in the 1950s with the determination of the first three- 
dimensional structure of a protein, the structure of DNA and the Hodgkin-Huxley 
model of the action potential in nerves. 

Over the last 50 years. Biophysics has continued to develop at an astonishing pace. 
Biophysicists study life at every level, from atoms and molecules to cells, organisms, 
and environments. Molecular and Cellular Biophysics: This is perhaps the predom¬ 
inant strand of modern biophysics. It includes, for example, structural, functional and 
simulation studies of macromolecules and macromolecular assemblies of ever- 
increasing complexity and imaging of cells at ever-increasing resolution. Applied 
Biophysics: The applications of biophysical methods are perhaps most notable in the 
area of medical imaging - including X-ray CAT scans, magnetic resonance imaging, 
positron emission tomography and ultrasound scans. Therapeutic applications 
include radiation therapy of increasing sophistication and cardiac defibrillators. 
In addition to medical applications, there are now increasing and exciting applications 
in nanotechnology. Environmental Biophysics: For many years an important area of 
biophysics has been the study of the effects of ionising - and indeed non-ionising - 
radiation on organisms. Another key area of considerable current importance is the 
development of mathematical models of, e.g., heat and mass transfer at the level of 
organisms and ecosystems. 

This Encyclopedia is intended to provide a resource both for biophysicists inter¬ 
ested in approaches outside their immediate sub-discipline and for people coming to 
biophysics from either the physical or biological direction. The emphasis is very 
much on molecular and cellular biophysics, but some discussion of imaging and of 
nanotechnology is included. Just as there is overlap between chemistry and physics, 
so there is overlap between biophysics and biochemistry; our focus is of course on the 
techniques and uses of biophysics, but biochemical context is included where 
appropriate. 

The Encyclopedia consists of two kinds of entries. Systems and Techniques. 

• In the Systems sections, biophysical approaches to particular biological systems or 
problems - from protein structure to membranes, ion channels and receptors - are 
described. These sections, which have an emphasis on the integration of the 
different techniques, therefore provide an entry into Biophysics from the biolog¬ 
ical more than from the technique-oriented physics direction. 

• In the Techniques sections, each of the wide range of methods which fall under the 
heading of Biophysics are explained in detail, together with their strengths and the 
limitations of the information each provides. Experimental techniques covered 
range from diffraction, through a wide range of spectroscopic methods (X-ray, 
optical, EPR, NMR), kinetics, thermodynamics and hydrodynamics, to imaging 
(from electron microscopy to live cell imaging and MRI). The important and 
increasingly powerful computational, modelling and simulation approaches are 
also included. 

Each of the Sections includes concise introductions to the major concepts and 
methods, and outlines of more specific topics, in each case with links to a limited 
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number of carefully selected key reviews and/or papers in the scientific literature. 
Extensive cross-referencing (hyperlinks in the online version) between different 
articles allows access to related topics in a user-friendly manner. 

The online version of the Encyclopedia is intended to develop continuously, both 
by the updating of existing entries and by the addition of new articles. In this way it 
will provide the resources to help both practising biophysicists and newcomers to the 
field to keep pace with the continuing and exciting developments in Biophysics. 

Leicester Gordon C. K. Roberts 

UK 

May 2012 
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Synonyms 
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Introduction 

In his Nobel Prize-winning hypothesis, the biochemist 
Christian Anfinsen postulated that the native, 
biologically active conformation of a polypeptide 
chain represents a kinetically accessible unique free 
energy minimum (Anfinsen 1973). Its simplified 
form, “one sequence-one structure,” has played an 
extremely useful role, helping make sense of the struc¬ 
tural principles of protein function; but there is grow¬ 
ing evidence that it is not always true. Three- 
dimensional (3D) domain swapping is a phenomenon 
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that violates the dogma in a particularly elegant way. It 
is quite ironic that the flagship of 3D domain swapping 
is ribonuclease A (RNase A), the very protein used by 
Anhnsen to work out his dogma. 3D domain swapping 
is a mechanism of protein oligomerization, in which 
a structural element of a protein fold is replaced by an 
identical element from another subunit. Some 
proteins, for instance, RNase A, can assume many 
domain-swapped forms. 3D domain swapping is 
generally perceived as “conformational pathology” 
leading to aberrant structure and aggregation. 
However, there are also indications that it may play 
a positive biological role. 

Historical Background and the Discovery of 
3D Domain Swapping 

The term “3D domain swapping” was coined by 
Eisenberg (Bennett et al. 1994), who discovered this 
phenomenon in the crystal structure of diphtheria 
toxin. However, its existence had been deduced much 
earlier from the observation that enzymatic activity of 
bovine pancreatic RNase A (an enzyme with two 
catalytic histidine residues, an N-terminal one and 
a C-terminal one) could be recovered in mixtures of 
two inactive mutant proteins with partly knocked-out 
active sites (Crestfield et al. 1962, 1963). The predic¬ 
tion was later conhrmed by a crystallographic study 
(Liu et al. 1998), which showed a dimeric molecule 
with two monomer-like folding units, each formed 
from structural elements contributed by both polypep¬ 
tide chains. Specifically, the N-terminal helix, which in 
monomeric RNase A is nested in the fold of the 
C-terminal part of the molecule, switched places with 
the N-terminal helix from the other molecule and 
docked in the “wrong” C-terminal domain. The mutual 
character of this exchange leads to a symmetric, or 
closed, dimer. 

Definitions of Terms 

This is the general scenario of 3D domain swapping: 
a protein domain breaks its contact with other struc¬ 
tural elements and its place is taken by the same 
domain from another molecule, leading to an inter¬ 
laced dimer or higher-order oligomer (Fig. 1). The 


swapped domain could be a complete globular 
domain but in most cases (including RNase A) it can 
be just a simple secondary-structure element, such as 
a terminal a-helix or P-chain. A protein chain capable 
of 3D domain swapping must undergo at least partial 
unfolding, which breaks, and transiently exposes, the 
intramolecular interface between the domains. This 
closed interface is then recreated in an aberrant way 
from fragments of two subunits. Thus, the closed 
interface is preserved in the oligomer. However, the 
oligomer also contains a novel feature, the open inter¬ 
face between the subunits, which is absent in 
the monomer. It is obvious from this description of 
3D domain swapping that the protein chain must be 
capable of assuming two conformations. One is 
called “closed” (in the monomeric form), and the 
other, “open” (in the oligomeric form). The closed 
and open conformations are nearly the same and dif¬ 
fer in only one area, a loop or a similar “hinge” 
element that links the swapped domain with the rest 
of the polypeptide, and whose movement leads to 
the transition. 

The open interface can have more than one form. 
For instance, a closely related bovine seminal ribonu¬ 
clease (BS-RNase) forms native 3D domain-swapped 
dimers (Mazzarella et al. 1993). Although the swapped 
domain is the same N-terminal a-helix as in the above 
RNase A case, the overall organization of the dimers is 
different. RNase A itself has provided even more 
astonishing examples (Fig. 2). First, it was discovered 
that the protein can also form dimers with a 3D swap of 
the C-terminal P-strand, and later the same C-terminal 
swap was found in circular trimers. An even more 
exciting case was inspired by the discovery of 
a linear trimer, in which the central molecule has 
exchanged different domains with its neighbors, creat¬ 
ing the possibility for infinite, open-ended polymeri¬ 
zation (Liu et al. 2002). 

At first regarded as a marginal scientific curiosity, 
3D domain swapping has been now documented in 
more than 50 proteins. It appears that the number of 
cases could be much higher if the right conditions are 
found (pH, denaturant, increased concentration, or 
other environmental factor). In some cases, a slight 
modification of the sequence, even a point mutation, 
can drastically influence the ability of a protein to 
undergo 3D domain swapping. In a strict definition, 
bona fide 3D domain swapping requires experimental 



3D Domain Swapping 


3 


0-9 



3D Domain Swapping, Fig. 1 In preparation for 3D domain 
swapping, a protein molecule (a) must undergo partial unfolding 
(b) via a conformational change of a flexible hinge element, to 
expose the closed interface, which in the monomeric protein is 
buried between the two domains. The separated domains of two 
unfolded molecules can mutually recognize the complementary 
surfaces, recreating the interface in a symmetric, dimeric fashion 


(c). The 3D domain-swapped dimer is not a simple sum of the 
two monomeric molecules, as it contains a unique new feature, 
the open interface. In open-ended 3D domain swapping (d) 
molecule (A) swaps a domain into the core of molecule (B), 
which swaps the same domain in the core of molecule (C), 
etc. In effect, this runaway aggregation leads to a linear polymer 
with two sticky ends 


observation of the same protein chain in a monomeric 
fold and as an oligomer with exchanged domains. 
In many cases, described as quasi 3D domain swap¬ 
ping, we know the structure of the oligomer but have 
only nonstructural evidence about the existence of the 
monomeric form, or the monomer is a slight variant of 
the protein. For example, the case of human cystatin 
C (HCC) belongs to the latter category, because the 
wild-type protein has been crystallized only in 3D 
domain-swapped dimeric form (Janowski et al. 
2001). It is, however, clear that monomeric HCC 
does exist because only in this form the protein can 
function as a physiological inhibitor of papain-like 
cysteine proteases. 

Examples of Possible Biological Roles of 3D 
Domain Swapping 

The growing number of reported 3D domain swap¬ 
ping cases has prompted questions if it might be of 


biological signihcance. As a mechanism of protein 
oligomerization it could be utilized, for example, 
to stabilize the building blocks of virus capsids. 
Another possibility exists for enzymatic proteins, 
which could utilize 3D domain swapping for func¬ 
tional regulation, in particular in allostery, which 
involves multiple active sites. 3D domain swapping 
could also rescue protein function after a deleteri¬ 
ous mutation, as was illustrated with the case of 
RNase A. So far, however, it appears that 3D 
domain swapping is predominantly connected with 
molecular pathology, through its involvement in 
conformational disorders, of which high-order 
amyloid aggregation is perhaps the most important. 
Paradoxically, however, 3D domain swapping can 
also be used by nature to prevent high-order aggre¬ 
gation in physiological milieus, for instance, in 
the vertebrate eye lens where a-crystallins can 
reach astronomically high concentration (>1 g/ml 
in fish eye) and thus are prone to aggregation. 
a-Crystallin aggregates inter alia via 3D domain 
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3D Domain Swapping, Fig. 2 The structure of RNase 
A monomer (a), and its different 3D domain-swapped 
oligomers: two close-ended dimers (b, c), a model of an open- 
ended linear trimer (d), and a close-ended, circular trimer (e). 
The N- and C-termini are labeled. The N-terminal helix {blue) 
and the C-terminal strand (red) that are swapped in the oligomers 
are marked in the monomer (a). Dimer (b) swaps the N-terminal 
helix, while dimer (c) swaps the C-terminal strand. Both swaps 


take place in the linear trimer model (d): the green subunit swaps 
the C-terminal strand with the red subunit and swaps the 
N-terminal helix with the blue subunit. In trimer (e) the swapped 
element is the C-terminal strand. The three subunits of the 
molecule are related by a threefold axis, giving the molecule 
the shape of a propeller (The figure is reproduced with permis¬ 
sion of John Wiley & Sons, Inc. from Protein Science 11(2) 
(2002) p. 376; Copyright © 2002 The Protein Society) 


swapping of a C-terminal tail. The tail has a palin¬ 
dromic sequence (ERTIPITRE) and can bind in its 
docking site in two orientations, thus frustrating 
high-order aggregation. In addition, the hinge loop 


connecting the tail with the body of the protein is 
long and flexible, allowing swaps with different 
partners to maximize polydispersity (Laganowsky 
et al. 2010). 
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3D Domain Swapping, Fig. 3 Dimeric human cystatin 
C (HCC) with 3D swapped domains (a). The red dot indicates 
the location of L68, whose substitution by glutamine leads to 
severe cerebral amyloidosis. The monomeric fold of HCC (b) is 
now known thanks to protein engineering (introduction of 
a disulfide bond to link (32 with [33, or LI stabilization via 
a V57N mutation). The papain-binding epitope of this inhibitory 


molecule is formed by the N-terminus and loops LI and L2. 
Since upon 3D domain swapping the conformation of loop LI is 
changed, HCC dimers, oligomers, and amyloid fibrils have no 
inhibitory activity. The closed interface is formed between the 
a-helix and the gripping (3-sheet; it is present in the monomer 
and (in duplicate) in the 3D domain-swapped dimer. The open 
interface (intersubunit (S-sheet) is present only in the dimer 


3D Domain Swapping and Amyloid Fibril 
Formation 

3D domain swapping has been proposed as an intel¬ 
lectually attractive mechanism for amyloid aggrega¬ 
tion quite early (Klafki et al. 1993; Cohen and 
Prusiner 1998). The first experimental hint linking 
3D domain swapping and amyloid came from crys¬ 
tallographic studies of HCC (Janowski et al. 2001), 
which showed a dimeric assembly formed through 
a domain switch involving the N-terminal part 
(pl-a-P2 element) of the protein (Fig. 3). The open 
monomer, extracted from its dimeric context, looks 
very bizarre and certainly cannot represent a stable 
protein fold. We can safely accept that the protein 
chain assumes this conformation only transiently, on 
its transition from the monomeric to the oligomeric 
form. The open conformation cannot be stable in 
aqueous environment because it has the hydrophobic 
core of the closed interface exposed to solvent. The 
transition involves conformational rearrangement of 
a hairpin loop, which in the monomeric protein is 
necessary for its inhibitory function. Monomeric 
HCC plays a protective role in all body fluids, with 
a particularly high concentration in the cerebrospinal 
fluid. There is a naturally occurring (Icelandic) vari¬ 
ant of HCC, where a single-residue mutation, L68Q, 
leads to a dramatic change of the protein’s behavior. 
L68Q HCC is not only spontaneously converted 
to a dimeric form, but very easily aggregates 


forming insoluble amyloid deposits in the vasculature 
of the central nervous system, leading to lethal cere¬ 
bral hemorrhages. In controlled in vitro conditions 
it is also possible to form amyloid fibrils from 
wt HCC. 

The second case of 3D domain swapping in an 
amyloidogenic protein was reported for the prion 
protein (Knaus et al. 2001). Its 118-226 fragment, 
known from NMR studies to exist in the PrP*^ form 
as a monomer built of three helices (A, B, C) in a 
triangular arrangement, was found by X-ray crystal¬ 
lography to form a dimer with helix C swapped 
between the folding units. The case of PrP is particu¬ 
larly intriguing because helix C at the closed interface 
is linked to helix B by a disulfide bond, requiring 
its rearrangement upon oligomerization. After 
initial controversies regarding the significance of 3D 
domain swapping in prion amyloid fibril formation, 
ingenious experiments with redox control provided 
a convincing argument in favor of the domain¬ 
swapping hypothesis (Lee and Eisenberg 2003). 
Later, 3D domain swapping was found in other 
amyloid-forming proteins, including T7 endonucle¬ 
ase I and P 2 -microglobulin. 

There is also experimental evidence for the actual 
involvement of 3D domain swapping in HCC amyloid 
fibril formation; specifically, abrogation of 3D domain 
swapping in HCC via an engineered S-S bond was 
shown to suppress the protein’s ability to form amyloid 
fibrils. If is nol immediately clear, however, how the 
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domain swapping is involved at the molecular level. 
We need to explain how a phenomenon, known from 
symmetrical, closed-ended situations, can generate an 
open-ended, linear polymer. At least three mechanisms 
can be envisaged. In one of them, the protein would be 
exchanging different domains with its immediate 
neighbors. While this is a possibility with RNase 
A (Fig. 2d), it is not likely to be a universal situation 
because proteins with two swapping termini are rare. 
In another scenario, 3D domain swapping would be 
only utilized for the creation of dimeric molecules, 
which would then aggregate as building blocks of the 
ultimate hbril using a different mechanism, for exam¬ 
ple, intermolecular P-sheet formation. Intellectually, 
this scenario is less satisfactory because it invokes 
two different mechanisms (3D domain swapping and 
intermolecular P-sheet association) to explain one phe¬ 
nomenon (hbril formation). 

In the third scenario, 3D domain swapping would be 
operating in an open-ended, runaway fashion (Fig. Id). 
Here, molecule A swaps a domain in the core of 
molecule B, which swaps a domain in molecule C, 
etc. The two ends of a linear hbril growing in this 
way would be sticky, that is, the hbril could continue 
growing at both ends simultaneously. This open-ended 
scenario has been conhrmed with S-S stabilized (stab) 
HCC, which can be forced to oligomerize and hbrillize 
under reducing conditions (which disrupt the S-S 
bonds). The amyloid aggregates formed from this pro¬ 
tein can be then dissolved (similarly to wt HCC aggre¬ 
gates) at elevated concentration of SDS. However, 
if prior to treatment with SDS the aggregates of 
stab HCC are transferred to oxidizing conditions 
(re-formations of the S-S bonds), they become 
cemented and insoluble even at very high concentra¬ 
tion of SDS (Wahlbom et al. 2007). 

In a more advanced molecular model of amyloid 
hbrils. Nelson and Eisenberg (2006) propose that run¬ 
away 3D domain swapping could be combined with 
the formation of extremely tight endless p-structures 
called steric zippers, created by highly amyloidogenic 
oligopeptide sequences. 

Experimental Control of 3D Domain 
Swapping 

3D domain swapping can be controlled by environ¬ 
mental conditions or by mutations. The environmental 


factors include (in addition to high concentration) low 
pH, gentle heating or mild denaturant, all of which 
promote (partial) denaturation or unfolding, consid¬ 
ered to be a prerequisite of 3D domain swapping. 

The mutations that affect 3D domain swapping can 
occur in the hinge region or in a different fragment of 
the protein sequence. A mutation of a hinge residue 
could stabilize the loop conformation or, conversely, 
can lead to unfavorable (“loaded spring”) interactions 
that can be relaxed only via a conformational change in 
this area. A more intuitive strategy involves manipu¬ 
lation of the hinge loop length. If it is made too 
short (deletion), the chain may not be able to properly 
fold as a monomer and the only rescue would be in a 
3D domain-swapped dimer (or other oligomer). In this 
case, the shorter hinge loop promotes 3D domain 
swapping via an energetic effect because a closed 
monomer would be energetically expensive. It is also 
possible to promote 3D domain swapping by length¬ 
ening the hinge loop (insertion) in order to endow the 
protein chain with so much conformational freedom 
that the complementary domains would recognize each 
other in an intramolecular context only with low prob¬ 
ability. In this case, the 3D domain swapping associa¬ 
tion is driven by a kinetic effect. 

Mutations outside of the hinge region can be intro¬ 
duced at the open interface or at the closed interface. 
The former case is usually connected with an energetic 
effect: we engineer a stronger interaction at the new 
interface, which is not present in the monomer. The 
effect of a mutation in the closed interface is rather of 
a kinetic nature because the consequences of the muta¬ 
tion are the same for the monomer and dimer. It must 
be, therefore, connected with the ease with which the 
transition can be completed. This kind of mutational 
effect is seen in the L68Q variant of HCC, where the 
residue at position 68 is buried in a hydrophobic pocket 
in the protein core. 

Strategic insertion (via protein engineering) of 
disulhde bridges can be used to prevent 3D domain 
swapping by riveting the structural elements that are 
separated in the open monomer. Such an approach was 
successfully used to create the monomer-stabilized 
stab variant of HCC by connecting, through an S-S 
bond, strand (32 (of the “swapping domain”) with 
strand (33 (in the protein core). Through adjustment 
of the redox potential, the S-S bridge can be converted 
to a convenient molecular switch of 3D domain 
swapping. 
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Summary 

3D domain swapping is a mechanism of protein aggre¬ 
gation, in which a structural element of a protein fold is 
replaced hy an identical element from another subunit. 
Some proteins, for instance, RNase A, can assume 
many domain-swapped forms, thus undermining the 
dogma, “one sequence-one structure,” in a particularly 
spectacular way. Completed in a mutual fashion, it is 
a mechanism of protein oligomerization; in an open- 
ended fashion, 3D domain swapping could be a mech¬ 
anism of amyloid fibril formation. The first connection 
between 3D domain swapping and amyloidogenicity 
was found in human cystatin C (HCC), and the second, 
in the prion protein (PrP). In both cases, a disulfide 
bridge (natural in PrP, engineered in HCC) can be used 
for redox control of 3D domain swapping and to dem¬ 
onstrate that it is indeed involved in amyloid fibril 
formation. Paradoxically, 3D domain swapping can 
also be used by nature for prevention of nucleation 
processes that could lead (e.g., in the eye lens) to 
high-order aggregates. Other possible biological roles 
of 3D domain swapping include gain or rescue of 
function and stabilization of oligomeric structures. 
Mutations at the closed interface, or protein core, can 
drastically increase the propensity for 3D domain 
swapping. Mutations in other areas, such as the flexible 
hinge (especially deletions and insertions) can also be 
used to control 3D domain swapping. 

Cross-References 

► Amyloid Formation 

► Domain Swapping 

► Functional Amyloid 

► Oligomeric Proteins 
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Definitions 

Tomogram: Three-dimensional image obtained by 
tomography. 

Tomography: A procedure where images recorded 
at different sample orientations are computationally 
merged to form a tomogram. 

Cryo-electron tomography: A procedure where 
fully hydrated, vitrified samples are imaged by elec¬ 
tron tomography. 

Introduction 

In cryo-electron tomography (cryo-ET), frozen- 
hydrated, vitrified samples are imaged by electron 
tomography. Electron micrographs obtained at differ¬ 
ent sample orientations are computationally merged 
into a tomogram, allowing three-dimensional (3D) 
visualization of biological materials at molecular res¬ 
olution. Conceptually, cryo-ET procedure can be 
divided into steps carried out under cryo-conditions 
(sample preparation and imaging) and the computa¬ 
tional steps (image processing and interpretation), as 
indicated in Eig. 1. 

From Samples to Tomograms 

Cryo-preserved samples are frozen-hydrated, embed¬ 
ded in vitreous water (amorphous ice, a crystal-free 
state of solid water) for electron microscopical imag¬ 
ing. The natural distribution of water is retained by 
rapid cooling (vitrification), which prevents the forma¬ 
tion of ice crystals (Dubochet et al. 1988). Plunge 
freezing is a vitrification method that is often used for 
samples up to 20 pm in thickness: the sample is 
plunged into a cryogen (a liquid below — 135°C with 
good heat-transfer characteristics; e.g., ethane). High- 
pressure freezing provides vitrification of up to 
about 200 pm thick biological samples. Both methods 
induce vitrification within microseconds. Vitrified 
samples thicker than 500 nm are too thick for imaging 
and require a thinning step performed under cryo- 
conditions (i.e., below — 135°C), such as cryo- 
ultramicrotomy. 

Unlike dehydrated samples, hydrated vitreous sam¬ 
ples provide a faithful representation of the native 
structure because water, the main constituent of 



Emerging developments and technologies 


3D Electron Microscopy Based on Cryo-Electron Tomogra¬ 
phy, Fig. 1 Cryo-ET workflow that includes required and alter¬ 
native procedures, as well as selected emerging methods currently 
under active development (Reproduced with permission from 
Vanhecke et al. 201 1) 
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biological material, is retained. Also, chemical fixation 
is avoided and the biological material is observed 
directly, without heavy metal staining. 

In electron tomographic acquisition, a set of 
images, essentially projections, is recorded at different 
spatial orientations of a sample. The single-axis acqui¬ 
sition scheme where a specimen is rotated (tilted) 
around a fixed axis is most commonly used. Physical 
restrictions of the instrument and factors such as the 
sample thickness typically limit the tilt range to ±60° 
or ±70°. This results in an incomplete set of projec¬ 
tions and a wedge-shaped missing information region 
in Fourier space, causing distortions in tomograms 
such as an elongation in the electron beam direction 
(Frank 1992). While dual-axis tilt (around two orthog¬ 
onal axes) and conical tilt scheme (rotations around the 
optical axis at a constant tilt angle) reduce the amount 
of missing information, they are rarely used due to 
technical difficulties. 

The constant angular increment is the most often 
used scheme. The Saxton scheme uses a smaller tilt 
increment for higher tilt angles thus reducing the 
anisotropy of the resolution caused by the increase 
of thickness (Saxton et al. 1984), but it does not 
appear to perform better than the constant increment 
in the presence of noise. The maximum attainable 
isotropic resolution depends on the angular increment 
and sample thickness according to the Crowther’s 
criterion. 

Hydrated vitreous samples are sensitive to electron 
irradiation and require imaging in the low-dose mode. 
The development of automated schemes made possible 
the acquisition of tilt series in this mode by correcting 
the positioning and focusing errors caused by mechan¬ 
ical imperfections of the microscope stage (Koster 
et al. 1997). This is done using additional images that 
cause little additional beam exposure of the feature of 
interest. Automated acquisition requires the immediate 
access to images provided by CCD-based detectors. 
Current acquisition schemes use stage movement cal¬ 
ibration and/or prediction procedures to allow faster 
data acquisition. 

Tilt series alignment corrects for image shifts 
and may correct for errors due to imprecise tilting 
and imperfect tilt axis orientation with respect to 
the optical axis. A straightforward way to align a tilt 
series is based on fiducial markers: high-contrast 
particles, typically gold, which are easily recognized 
in noisy images. Fiducial-less alignment procedures 


rely on the correlation of intrinsic image features 
and are applicable to images with relatively high 
contrast. 

The reconstruction process transforms 2D projec¬ 
tions comprising a tilt series into a tomogram. 
Weighted back-projection is most commonly used; 
it reprojects the projections to the 3D space and 
corrects for the uneven radial distribution of informa¬ 
tion in the Fourier space caused by this reprojec¬ 
tion. Iterative reconstruction algorithms such as 
algebraic reconstruction technique (ART) and simul¬ 
taneous iterative reconstruction technique (SIRT) 
attempt to minimize the difference between tilt 
series projections and reprojections of the 3D image 
(Herman 2010). 

Image Formation and Resolution 

Electrons interacting elastically with the sample are 
well-localized, contain high-resolution information, 
and are used for imaging. Inelastically scattered elec¬ 
trons deposit part of their energy in the sample, thus 
limiting the electron dose that can be applied before 
damaging the specimen. Low angular spread and poor 
localization of inelastic scattering prevent their con¬ 
tribution to high-resolution information. In cryo-ET, 
elastically scattered electrons are often removed by an 
energy filter operated in the zero-loss mode, ensuring 
that only electrons that essentially have not lost 
energy reach the detector (Reimer 1997). 

Phase contrast dominates in electron micrographs 
of samples that were not subjected to heavy-metal 
staining. It is induced by recording images in 
underfocus, which reduces the high-frequency infor¬ 
mation and limits the resolution. Effects of 
defocusing are mathematically described by the 
microscope contrast transfer function (CTE), 
a damped oscillating function of the spatial frequen¬ 
cies (Reimer 1997). The frequency corresponding to 
the first zero of the CTE is commonly translated to 
a resolution limit in tomograms. However, computa¬ 
tional correction of the CTF in projection images 
may recover some of the higher frequencies in a 
tomogram. Another (technical) parameter limiting 
the nominal resolution of tomograms is the pixel 
size at the specimen level. Finite angular increment 
and limited angular range may further limit the reso¬ 
lution and make it anisotropic. 
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Arguably, low signal-to-noise ratio of cryo- 
tomograms presents the strongest limitation on the 
usable resolution. Poor image contrast arising from 
the low atomic number of biological samples and the 
need to use very low electron exposure due to the 
electron irradiation sensitivity of vitrified biological 
samples limit the signal, while limitations in CCD 
camera performance and the poor sampling of electron 
scattering contribute to the noise. 

Interpretation of Tomograms 

The separation of meaningful information from back¬ 
ground is required for the visualization and interpre¬ 
tation of tomograms. Denoising reduces the noise 
while preserving the features of interest. Typically, 
tomograms are visually Inspected by displaying slices 
through the reconstruction, often accompanied by 
manual segmentation of features of interest. This 
visual feature detection is subjective and error- 
prone. Automated segmentation methods have been 
developed for certain applications; they are fast, 
reproducible, and yield good results. Particular struc¬ 
tures can be detected in a tomogram by template 
matching, where the tomogram is searched for occur¬ 
rences of a template of choice (e.g., a macromolecular 
complex) by cross-correlation. This powerful method 
is only applicable where a suitable template exists. 
When multiple copies of a structure of interest (e.g., 
a protein complex or a virus particle) exist, a higher 
resolution of that structure can be obtained by 
aligning and averaging sub-tomograms containing 
the structure. 

Due to its ability to provide 3D images of individ¬ 
ual structures of interest directly, in their natural state 
or environment, cryo-ET is the technique of choice 
for applications to macromolecular complexes with 
unique topologies, such as pleiomorphic viruses. 
Many bacteria and archaea are sufficiently thin to be 
directly studied by cryo-ET upon plunge freezing. 
Numerous studies provided detailed structural 
insights into bacterial membrane organization, inter¬ 
nal compartments, chemoreceptor arrays, flagellar 
motor, or cytoskeleton. While most eukaryotic cells 
are too thick to be studied in toto by cryo-ET without 
applying thinning procedures, the periphery of certain 
adherent eukaryotic cells is directly accessible for 


cryo-ET. This approach has allowed the detailed anal¬ 
ysis of the actin cytoskeleton organization in 
lamellipodia and filopodia, as well as studies of the 
long and thin processes formed by neurons and fibro¬ 
blasts. Cryo-ET of isolated organelles and cellular 
fractions, in some cases augmented with tomography 
of intact cells, revealed the architecture of the nuclear 
pore complex, the organization of polysomes, and 
structural determinants of synaptic vesicle organiza¬ 
tion and release. 

Outlook 

As exciting as cryo-ET is today, we expect that its full 
development is yet to come. Recent progress in 
thinning procedures of vitreous samples, such as 
focused ion beam milling, might ease the limitations 
imposed by specimen thickness, greatly expanding 
the catalog of samples amenable for cryo-ET. Long- 
awaited technical developments such as phase plates, 
direct electron detectors, or further software develop¬ 
ment promise to bring substantial improvements 
in tomogram resolution and interpretation. Finally, 
further integration with other methods, such as cor¬ 
relative light microscopy or proteomics, are expected 
to bring a deeper functional dimension to this 
technique. 
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4-Oxalocrotonate Tautomerase - 
Computational Studies 
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Chemistry, University of Bristol, Bristol, UK 

Definition 

An enzyme that catalyzes the isomerization of 
P,y-unsaturated enones to their a,P-isomers. 

Basic Characteristics 

4-Oxalocrotonate tautomerase is an unusually small 
enzyme, with monomers of only 62 amino acid resi¬ 
dues. Another unusual feature of the enzyme is the use 
of the N-terminal proline as a catalytic base (Fig. 1). 
The enzyme forms part of a metabolic pathway 
in bacteria that serves to turn various aromatic 


hydrocarbons such as toluene, o-xylene, and 1,2,4- 
trimethylbenzene into intermediates of the citric acid 
cycle (Whitman 2002). Bacteria that contain the plas¬ 
mid on which the enzyme is encoded can therefore use 
these compounds as their sole source of energy and 
carbon. 

The reaction catalyzed by 4-oxalocrotonate 
tautomerase has been investigated by a variety of 
QM/MM and QM approaches and has served as 
a test application for the development of new 
methods, such as chain-of-state reaction path optimi¬ 
zation methods (Cisneros et al. 2005) and the quan¬ 
tum mechanics/molecular mechanics minimum free- 
energy path (QM/MM-MFEP) method (Hu et al. 
2008). QM/MM simulations with three different bind¬ 
ing orientations of the substrate in the enzyme active 
site indicated the importance of its orientation on the 
reaction barriers and energies (Tuttle and Thiel 2007). 
Ruiz-Pernia et al. presented a comprehensive QM/ 
MM study of the reaction, using AM1/CHARMM22 
QM/MM molecular dynamics simulations, with den¬ 
sity functional theory (MBWlBK/6-31-i-G(d,p)) 
energy corrections, to obtain free-energy profiles of 
both reaction steps (Ruiz-Pernia et al. 2009). This 
work indicated that the enzyme-substrate system is 
highly flexible and that a dynamic coupling exists 
between the internal rotation around the C2—C3 sin¬ 
gle bond of the substrate and the proton transfer. 
Flexibility of the enzyme-substrate model is therefore 
apparently crucial to model the reaction correctly. 
Ensemble-averaged variational transition state theory 
(VTST) calculations with tunneling corrections were 
further used to predict a modest primary H/D isotope 
effect, which is not significantly influenced by quan¬ 
tum tunneling. 


Pro1 - Prol Pro1 



4-Oxalocrotonate Tautomerase - Computational Studies, hexenedioate (3) through the dienolate intermediate 2- 
Fig. 1 Reaction mechanism of the isomeric conversion of 2- hydroxymuconate (2), as catalyzed by 4-oxalocrotonate 
oxo-4-hexenedioate (4-oxalocrotonate, 1) into 2-oxo-3- tautomerase. The catalytic N-terminal proline is indicated 
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Cross-References 

► Methylamine dehydrogenase (MADH): studies with 
variational transition state theory (VTST) 

► QM/MM Methods 

► Quantum Effects/Tunneling 
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Synonyms 

Coiled coils; Protein structure 

Introduction: Functions, Structures, and 
Sequences 

The a-helical coiled coil is a ubiquitous protein¬ 
folding motif found throughout all proteomes 
(Rackham et al. 2010) and in many proteins with 
diverse functions (Mason and Arndt 2004; Parry 
and Squire 2010). Though they occur within larger 
protein domains, the primary role of coiled coils is 
structural, and specifically to direct and cement 
protein-protein interactions in water-soluble, 
transmembrane, and fibrous proteins (Lupas and 
Gruber 2005; Moutevelis and Woolfson 2009). 
Coiled coils comprise two or more a-helices that 
interact slightly off parallel to form supercoiled 
bundles (Fig. la, b). Folding and association are 
driven by underlying sequence repeats of hydro- 
phobic (H) and polar (P) residues, in which the H- 
type residues are spaced at combinations of 3 and 4 
residues apart. In classical, or canonical, coiled 
coils, the repeat has a 7-residue pattern, HPPHPP, 
the positions of which are often labeled abcdefg. 
This directs the folding of amphipathic a-helices 
that associate via their //, or a/rf faces (Fig. Ic). 
Moreover, such repeats lead to a tight interdigita- 
tion of side chains between adjacent helices known 
as knobs-into-holes (KIH) packing (Fig. Id) 
(Crick 1953). This packing imparts much of the 
stability and specificity to coiled-coil interactions, 
which are both extraordinary given that they are 
often short polypeptides compared with globular 
proteins of comparable robustness. The heptad 
repeat and KIH packing are the accepted hallmarks 
of a-helical coiled-coil protein sequence and 
structure. 
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a-Helical Coiled Coils, 

Fig. 1 A classic dimeric 
coiled-coil structure, (a & b) 
Orthogonal views of two 
supercoiled a-helices. (c) The 
heptad repeat, ahcdefg, plotted 
on a dimer of helical wheels, 
(d) Knobs-into-holes 
interaction showing a side 
chain (dark gray) from one 
helix (not shown) packing into 
a diamond of side chains on 
the partnering helix (light 
gray). Protein images were 
constructed from the 
structures for the leucine- 
zipper region of yeast 
transcriptional activator 
GCN4 (pdb entry: 2zta) 




Sequence-to-Structure Relationships and 
Prediction 

These relatively straightforward structural and sequence 
features of the coiled coil have made it an important 
model for protein-folding studies and in the development 
of sequence-to-stmcture relationships in peptides and 
proteins (Woolfson 2005). In turn, these studies have 
led to successful coiled-coil prediction algorithms 
(Gruber et al. 2006). Indeed, a variety of bioinformatics 
approaches have been used with the favored and most 
successful being based on amino acids profiles for the 
heptad repeat culled from natural sequences (Parry 1982; 
Lupas et al. 1991; Woolfson and Alber 1995; Wolf et al. 
1997), more sophisticated Hidden Markov Models 
(HMM) (Delorenzi and Speed 2002), and, most recently, 
genome-scale predictions using a combined HMM and 
SUPERFAMILY approach (Rackham et al. 2010). 

Box 1: Coiled-Coil Prediction and Analysis on the 
Web 

Front-end prediction of coiled-coil regions: 

http://www.ch.embnet.org/software/ 

COILS_form.html 

http://www.isrec.isb-sib.ch/webmarcoil/ 
webmarcoilC 1 .html 
Oligomer-state prediction: 

http://groups.csail.mit.edu/cb/multicoil/cgi- 

bin/multicoil.cgi 


http://coiledcoils.chm.bris.ac.uk/Scorer/ 

http://supfam.cs.bris.ac.uk/SUPERFAMILY/ 

spiricoil/ 

Coiled-coil structure databases: 

http://coiledcoils.chm.bris.ac.uk/ccplus/ 

search/ 

The GCN4 Leucine Zipper as a Model System 

Perhaps the most significant model system over 
the past two decades is the leucine-zipper peptide 
from the yeast transcriptional activator GCN4 
(Fig. 1) (O’Shea et al. 1991). The wild-type peptide, 
GCN4-pl, has 4.5 canonical heptad repeats, forms 
a parallel dimer coiled coil with well-defined KIH 
interactions. Nonetheless, this structure appears to 
be plastic. Multiple mutations of the a and d sites 
lead to different parallel oligomers: a = He -i- 
d — Leu gives dimeric; a — Leu ■¥ d ~ He, tetramer; 
and a = d— He, trimer (Harbury et al. 1993), whereas, 
mutations at the core-flanking e and g sites access 
offset parallel and antiparallel tetrameric and 
heptameric assemblies (Deng et al. 2006; Liu et al. 
2006). While these switches are not fully understood, 
the mutations at the a and d positions at least can be 
rationalized in terms of distinct KIH interactions 
that accommodate the different amino acid combina¬ 
tions (Fig. 2). 







0-9 


14 


ot-Helical Coiled Coils 


'■ ax) 

Ox 

‘ 6 



a-Helical Coiled Coils, Fig. 2 Diffei ■ent knohs-into-holes 
packing arrangements, (a) Parallel packing at a sites of parallel 
dimers, (b) Perpendicular packing at d sites of parallel dimers. 
Note: these two packing geometries are more or less reversed in 
parallel tetramers. (c) Acute packing at a and d sites of parallel 
trimers. Key: Left-handed side, cartoons with helices viewed 
down the long-helix axis and represented as gray circles; the 


white circles on the circumferences of the gray circles 
represent adjacent Cc atoms, which form the holes; while 
those spurring off the central helix represent the Cp atoms of 
the knob residues. Right-hand side, images from X-ray crystal 
structures determined for the wild-type dimer (a and b) and 
a trimeric (c) mutant of GCN4-pl (From Walshaw and 
Woolfson (2001)) 


Diversity in Coiled-Coil Structures 

More generally, coiled-coil structures in the RCSB 
(Brookhaven) Protein Data Bank (PDB) have been 
surveyed using the SOCKET program (Walshaw and 
Woolfson 2001), which identihes KIH packing 
arrangements. The results have been used to 
populate a relational database of coiled-coil structures, 
CC -I- (Testa et al. 2009), and organized into a Period 
Table of Coiled Coils (Fig. 3) (Moutevelis and 
Woolfson 2009). These resources show that natural 
coiled-coil proteins adopt the architectures observed 
for the engineered GCN4-pl variants and more. Spe¬ 
cifically, as well as what might be termed the classical, 
or simple, coiled coils, natural coiled-coil proteins show 
a range of more complex structures. These have parallel 
and antiparallel arrangements of helices, different num¬ 
bers of helices, and even conglomerates of coiled-coil 
motifs. These latter, more complex structures are best 
considered in terms of an expansion of the helix-helix 


interfaces and KIH interactions, which reflects 
increased hydrophobicity in the underlying sequences; 
in essence, multiple heptad repeats are superimposed in 
a single sequence (Zaccai et al. 201 1). It is worth noting 
here that there are also variations in the underlying 
coiled-coil sequence repeats, so-called noncanonical 
coiled coils, which nonetheless lead to stable and func¬ 
tional structures (Lupas and Gruber 2005). 

Outlook: Better Understanding, Prediction, 
and Design 

The findings for the natural coiled-coil structures, 
along with the aforementioned experiments on 
GCN4, raise key questions in contemporary coiled- 
coil research, namely: What are the limits on 
coiled-coil structure? What sequence-to-structure 
relationships help define and discriminate between 
these different possibilities? How might we use any 



a-Helical Coiled Coils 


15 


0-9 


a-Helical Coiled Coils, 

Fig. 3 A Periodic Table of 
coiled-coil structures. Helices 
are represented as red circles; 
KIH interfaces identified by 
SOCKET are shown as solid, 
black lines; and consolidated 
hydrophobic cores of classical 
coiled coils above dimer are 
shown in gray (From 
Moutevelis and Woolfson 
( 2009 )) 
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improvements in our understanding of coiled-coil 
motifs in prediction, engineering, and design? 

For the higher-order oligomers (pentamers and 
above) and the more-complex coiled-coil structures, at 
present there are too few natural examples to extract 


reliable sequence-to-structure relationships, or “rules” 
to aid protein-structure prediction and design. A route 
forward in this effort, therefore, is the empiricism of de 
novo protein design (Kohn et al. 1997; Woolfson 2005; 
Zaccai et al. 2011). Indeed, a wide variety of new 
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coiled-coil structures, assemblies, and functions have 
been explored through peptide engineering and design. 
Examples include using coiled coils to drive and cement 
other protein-protein interactions and “split-protein” 
biosensors (Ghosh et al. 2000), leucine-zipper-based 
vesicle fusion systems (Marsden et al. 2009), and 
using designed coiled coils as the basis for supramolec- 
ular assemblies, in particular new peptide-based bio¬ 
materials with possible applications in 3D cell culture 
and tissue engineering (Banwell et al. 2009). 

Summary 

The a-helical coiled coil is a ubiquitous protein¬ 
folding domain that is found across all genomes, 
where it occurs in a wide variety of protein structures 
of various functions. Coiled-coil structures comprise 
two or more amphipathic helices that wrap around one 
another in a rope-like fashion to render superhelical 
bundles. Coiled coils have straightforward, indeed, 
hallmark features - notably the heptad sequence repeat 
and knobs-into-holes helix-helix packing - which 
make it a superb model for protein folding, modeling, 
prediction, and design studies. While we have learned 
a lot about the folding, assembly of a-helical coiled- 
coil proteins, much remains to be understood. Conse¬ 
quently, there are considerable opportunities for apply¬ 
ing new prediction methods, and the experimental tests 
of protein design and engineering particularly in the 
area of synthetic biology (Bromley et al. 2008; 
Marsden and Kros 2010). 

Cross-References 

► Heteromeric Versus Homomeric Association of 
Protein Complexes 

► Homology Modeling of Protein Structures 

► Macromolecular Crystallography: Overview 

► Oligomeric Proteins 

► Protein Secondary Structure Prediction in 2012 

► Protein Structure Prediction and Structural 
Annotation of Proteomes 

References 

Banwell EF, Abelardo ES, Adams DJ, Birchall MA, Corrigan A, 
Donald AM, Kirkland M, Serpell LC, Butler ME, Woolfson DN. 


Rational design and application of responsive alpha-helical 
peptide hydrogels. Nat Mater. 2009;8:596-600. 

Bromley EHC, Channon K, Moutevelis E, Woolfson DN. Pep¬ 
tide and protein building blocks for synthetic biology: from 
programming biomolecules to self-organized biomolecular 
systems. ACS Chem Biol. 2008;3:38-50. 

Crick FHC. The packing of a-helices: simple coiled-coils. Acta 
Cryst. 1953;6:689-97. 

Delorenzi M, Speed T. An HMM model for coiled-coil domains 
and a comparison with PSSM-based predictions. Bioinfor¬ 
matics. 2002;18:617-25. 

Deng YQ, Liu J, Zheng Q, Eliezer D, Kallenbach NRLuM. 
Antiparallel four-stranded coiled coil specified by a 3-3-1 
hydrophobic heptad repeat. Structure. 2006;14:247-55. 

Ghosh 1, Hamilton AD, Regan L. Antiparallel leucine zipper- 
directed protein reassembly: application to the green fluores¬ 
cent protein. J Am Chem Soc. 2000;122:5658-9. 

Gruber M, Soding J, Lupas AN. Comparative analysis of coiled- 
coil prediction methods. J Struct Biol. 2006;155:140-5. 

Harbury PB, Zhang T, Kim PS, Alber T. A switch between two-, 
three-, and four-stranded coiled coils in GCN4 leucine zipper 
mutants. Science. 1993;262:1401-7. 

Kohn WD, Mant CT, Hodges RS. Alpha-Helical protein assem¬ 
bly motifs. J Biol Chem. 1997;272:2583-6. 

Liu J, Zheng Q, Deng YQ, Cheng CS, Kallenbach NRLuM. 
A seven-helix coiled coil. Proc Natl Acad Sci USA. 
2006;103:15457-62. 

Lupas AN, Gruber M. The structure of alpha-helical coiled coils. 
Adv Protein Chem. 2005;70:37-78. 

Lupas A, Dyke MV, Stock J. Predicting coiled coils from protein 
sequences. Science. 1991;252:1162M. 

Marsden HR, Kros A. Self-assembly of coiled coils in synthetic 
biology: inspiration and progress. Angew Chem-Int Edit. 
2010;49:2988-3005. 

Marsden HR, Elbers NA, Romans PHH, Sommerdijk N, Kros A. 
A reduced SNARE model for membrane fusion. Angew 
Chem-Int Edit. 2009;48:2330-3. 

Mason JM, Arndt KM. Coiled coil domains: stability, 
specificity, and biological implications. Chembiochem. 
2004;5:170-6. 

Moutevelis E, Woolfson DN. A periodic table of coiled-coil 
protein structures. J Mol Biol. 2009;385:726-32. 

O’Shea EK, Klemm JD, Kim PS, Alber T. X-ray structure of the 
GCN4 leucine zipper, a two-stranded, parallel coiled coil. 
Science. 1991;254:539-44. 

Parry DAD. Coiled-coils in alpha-helix-containing proteins - 
analysis of the residue types within the heptad repeat and 
the use of these data in the prediction of coiled-coils in other 
proteins. Biosci Rep. 1982;2:1017-24. 

Parry DAD, Squire JM. Coiled-coils and fibrous proteins. 
J Struct Biol. 2010;170:18L 

Rackham OJL, Madera M, Armstrong CT, Vincent TL, 
Woolfson DN, Gough J. The evolution and structure predic¬ 
tion of coiled coils across all genomes. J Mol Biol. 
2010;403:480-93. 

Testa OD, Moutevelis E, Woolfson DN. CC plus: a relational 
database of coiled-coil structures. Nucleic Acids Res. 
2009;37:D315-22. 

Walshaw J, Woolfson DN. SOCKET: a program for identifying 
and analysing coiled-coil motifs within protein structures. 
J Mol Biol. 2001;307:1427-50. 



<5 Value Analysis 


17 


0-9 


Wolf E, Kim PS, Berger B. MultiCoil: a progr am for predicting two- 
and three-stranded coiled coils. Protein Sci. 1997;6:1179-89. 

Woolfson DN. The design of coiled-coil structures and assem¬ 
blies. Adv Prot Chem. 2005;70:79-112. 

Woolfson DN, Alber T. Predicting oligomerization states of 
coiled coils. Protein Sci. 1995;4:1596-607. 

Zaccai NR, Chi B, Thomson AR, Boyle AL, Bartlett GJ, 
Bruning M, Linden N, Sessions RB, Booth PJ, Brady RL, 
Woolfson DN. A de novo peptide hexamer with a mutable 
channel. Nat Chem Biol. 2011; 7:935-941. 


a-Ketoglutarate-Dependent 

Hydroxylases 
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► Glutathione 


^ Value Analysis 

Stefano Gianni 

Foundation “Istituto Pasteur-Fondazione Cenci- 
Bolognetti” and Institute of Molecular Biology and 
Pathology, CNR, Department of Biochemical Science 
‘A. Rossi Fanelli’, University of Rome, Sapienza, 
Rome, Italy 

Synonyms 

Protein fluorescent dye labeling 


Definition 

The <D value analysis is an experimental technique that 
allows monitoring structure formation in metastable 
states, such as folding intermediates and transition states. 


Introduction 

The subject of this entry is the application of the so- 
called d) value analysis to unveil the structural features 
of metastable states, such as intermediates and transi¬ 
tion states, in the protein folding reaction. Understand¬ 
ing how proteins achieve their three-dimensional 
structure is a fundamental problem of biophysics and, 
since the first structures of proteins were solved, has 
been the subject of immense interest. Some of this 
seminal work will be described in this chapter and in 
other chapters of this volume. 

The protein folding problem can be seen from two 
different perspectives: the thermodynamic question of 
how the native state is stabilized and the mechanistic 
question on how the folded structure is achieved. 
Following the latter perspective, in analogy with clas¬ 
sical organic chemistry, the best strategy to unravel the 
mechanism of the folding reaction is to isolate all the 
intermediate states and transition states in between 
them and then characterize their structure. But such 
a task is complicated by the elusive nature of interme¬ 
diates and the highly cooperative nature of folding. 

Many of the great advances in protein folding stud¬ 
ies have been achieved by isolating intermediates 
and studying their structure (see also “►Membrane 
Proteins: Folding and Stability”). However, in many 
single-domain proteins, only the fully native and fully 
denatured state may be populated at equilibrium. (Note 
that in this entry, the term “denatured” rather than 
“unfolded” will be used. This denomination stresses 
that denatured states may retain residual structure, as 
discussed in Structure of Denatured States.) In these 
cases, a key role in protein folding studies has been 
played by the description of folding transition states. 
The transition state theory was introduced by Eyring in 
1935 to describe the kinetics of simple chemical reac¬ 
tions (Eyring 1935). Following this view, observed 
kinetics are dependent upon the formation of an acti¬ 
vated complex, which arises from the productive col¬ 
lision between reactants. Fersht and coworkers 
extended the TS theory to protein folding studies and, 
as described in this entry, contributed critically to the 
description of protein folding mechanisms at nearly 
atomic resolution. In fact, by employing the so-called 
O value analysis, structural information may be 
inferred from changes in energetics upon mutagenesis 
(Matouschek et al. 1989,1990; Fersht etal. 1992). This 
method thus offers the tantalizing possibility to 
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describe the structure of metastable states that are 
poorly or never populated in solution. 

While the <1> value analysis is an excellent method to 
map interaction patterns in any kind of conformational 
transitions in proteins (i.e., protein folding, binding, 
catalysis, etc.), for simplicity, I will focus on the 
description of transition states in protein folding stud¬ 
ies. Further reading is suggested to fully explore 
the applicability of such a powerful technique (Fersht 
et al. 1986, 1987; Matouschek et al. 1989, 1990; 
Fersht et al. 1992, 2001). 


Theory 

The transition state of a simple single-step reaction 
may be defined at that particular conformation, along 
the reaction coordinate, that corresponds to the highest 
point on the free-energy profile (Eyring 1935). In the 
case of protein folding, an ideal two-state process is 
very unlikely at a molecular level - proteins are 
dynamic objects that are subject to thermal motions 
and may explore rapid local unfolding events (Daggett 
2006). Yet, even for a reaction involving a series 
of sequential intermediates with various intervening 
transition states, it is an accepted convention to refer 
to the highest energy point on the energy profile as the 
“overall” reaction transition state. This concept high¬ 
lights that while the two-state assumption is a valid 
model to describe much of the experimental data on the 
folding of proteins, the energy landscape of proteins 
may be rough, with many local minima that may 
escape an experimental characterization (Bryngelson 
et al. 1995). 

In two-state systems, the only accessible state to 
experimental studies is the transition state. Because 
the transition state never accumulates, information 
about its structure can only be obtained indirectly. In 
chemistry, a classical technique is used to probe the 
extent of bond formation in transition states (Leffler 
1953). Following the so-called a- or P-value analysis, 
when studying a given reaction, a substituent may be 
added into a nonreacting part of an organic reagent 
while measuring the effect of such substitution on the 
energy of activation and the free energy of equilib¬ 
rium. The P-value normalizes these energy terms and 
reflects the extent of bond making and formation in 
the reaction. Following this methodology, structure 


formation is inferred via perturbation of energetics. In 
the case of proteins, the development of site-directed 
mutagenesis allows performing a similar technique 
(Fersht et al. 1992). By systematically mutating pro¬ 
tein side chains, while probing the effect of the muta¬ 
tion on the folding kinetics and protein stability, it is 
possible in theory to map out interaction patterns in 
protein folding transition states (Fig. 1). In fact, muta¬ 
tions that destabilize the transition state (slowing 
down the folding reaction) do target contacts that 
are formed in its structure. The strength of the con¬ 
tacts is measured by the O value, which normalizes 
the stability loss of the transition state to that of the 
native state. 

When mutating the native state N into N’, the 
experimentally accessible quantities in O value analy¬ 
sis are the kinetic and equilibrium parameters of 
the wild-type and mutant proteins, that is, AGd-n> 
AGd'-n'j AGi>_ts, AGd'-tss AGts-n> AGts'-n'- 
Thus, a change in native stability upon mutation may 
be calculated as: 

AAGn-d — AGn'-d' — AGn-d (1) 

and the change in activation energy is: 

AAGts-d = AGts'-d' — AGts-d (2) 

Quantitatively, the <[) value for folding is defined as 
follows: 


AAGts-d 

AAGn-d 


(3) 


or 


{AGts' - TS — AGd'-d) 
(AGiv'-iv — AGd'-d) 


(4) 


In theory, the presence of the AGd'-d in the 
value complicates the analysis and could potentially 
jeopardize a rigorous interpretation (Matouschek et al. 
1989; Cho and Raleigh 2006). However, inspection of 
Eq. 4 reveals that when the target residue is as unstruc¬ 
tured as in the denatured state, AGts'-ts = AGd'-d 
and the $ = 0; on the other hand, if the residue is as 
structured as in the native state, AGts'-ts = -^^Gn'-n 
and the <1) = 1. Thus, for the two conditions where 
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Fig.1 Schematic depiction of 
the <1> value analysis. Reaction 
profiles are presented for 
a protein mutated in the region 
highlighted with a yellow star. 
(Left) When the transition state 
will be as unstructured as the 
denatured state in the site of 
mutations, its energy will be 
unaffected by the mutation. 
Accordingly, AAGts-d 0 
and 0^0. (Right) If the site 
of mutation is native-like in 
the folding transition state, 
AAGxs—D tends to AAGf,j —d 
and O —> 1. By introducing 
several conservative site- 
directed mutants, it is thus 
possible to dissect the 
structure of the folding 
transition state 




the value of (1> is 0 or 1, all mutations return an inter¬ 
pretable value and the effect of AGd'-d in Eq. 4 
cancels out. 

As highlighted below, also in the case of fractional 
values of d), the experimental determination of d) is 
interpretable for conservative deletion mutations. In 
fact, when a protein is mutated, the free energy of its 
denatured state may be affected by essentially three 
components: AGnoncov^ due to the changes in 
noncovalent interactions; AGreorg> the change in free 
energy arising from the reorganization of stmcture 
upon mutagenesis; and AGsoiv> the change in solvation 
energy upon mutagenesis (Fersht and Sato 2004). 
Importantly, both AGreorg and AGsoiv may be mini¬ 
mized for aliphatic to aliphatic mutations. Mutations 
of a hydrophobic side chain to a hydrophilic side chain 
and vice-versa lead to very large AGreorg and AGsow 
and are not well suited for d) value analysis. Accord¬ 
ingly, the recommended strategy is to mutate buried 
hydrophobic side chains, possibly by introducing 
minor deletions, without altering the stereochem¬ 
istry (i.e., Ile^Val^Ala^Gly; Leu^Ala^Gly; 
Thr^Ser; Phe^Ala^Gly). Additionally, the extent 
of formation of a helices may be probed by performing 
an Ala^Gly scanning approach on surface-exposed 
positions. It is extremely important to avoid mutations 
that introduce interactions within the protein, cause 


stereochemical clashes, or leave charged group with¬ 
out the solvating partner. The use of ► double mutant 
cycle analysis to corroborate the interpretation of some 
values is also recommended (Horovitz 1996). 

Interpretation of Fractional and 
Unusual (E> Values 

As described above, d) will tend to 1 when the site of 
mutation is fully stmctured in the transition state, 
whereas it will tend to 0 when it is as unstructured as 
in the denatured state. On the other hand, fractional <b 
values are more difficult to interpret because they can 
arise from different factors. Over-and-above the 
caveats recalled above, which can be avoided using 
the correct choice of mutations, it is evident that frac¬ 
tional d? may result from a single species of transition 
state containing weakened interactions or from hetero¬ 
geneous parallel pathways involving multiple transi¬ 
tion states, some with the interactions fully formed, 
some fully broken. A critical test to distinguish 
between these two scenarios demands the production 
of many site-directed mutants and has been exempli¬ 
fied for the folding of the chymotrypsin inhibitor 2 
(Fersht et al. 1994). In the case of folding proceeding 
via parallel pathways with two alternative transition 












0-9 


20 


O Value Analysis 


states, a given set of interactions may be present in one 
of the transition states (TSa, of folding pathway a) 
while absent in the other (TSb, of pathway b). In this 
case, destabilizing mutations in TSa would lower the 
dominance of its respective pathway (pathway a), and 
folding would be rerouted through pathway b. When 
probed on many site-directed mutants, this situation 
would complicate the dependence of the folding rate 
constants versus the equilibrium constant (as reflected 
by the so-called Bronsted plot, see (Fersht et al. 1994; 
Itzhaki et al. 1995)), which is then expected to display 
a deviation from linearity. The experimental data on 
the chymotrypsin inhibitor 2 suggest that its folding 
pathway occurs via a transition state that may be 
depicted via a narrow ensemble of structures and frac¬ 
tional 4> tends to result from weakened native-like 
interactions rather than from alternative pathways. 

A small fraction of experimentally determined O 
values have been reported to display an unusual value, 
being either negative or greater than 1. While in some 
cases these values are due to an incorrect mutation, in 
some other cases, they appear genuine. Negative 
values can be observed when the native state is 
destabilized while the transition state is stabilized and 
vice-versa, fl) values higher than 1 may be detected 
when the transition state is more affected by the muta¬ 
tion than the native state. The most accepted models to 
describe the presence of <1> values involve the presence 
of frustration in the native protein (Ventura et al. 2002) 
and/or transient formation of nonnative interactions 
in the transition state (Li et al. 2000). Importantly, 
unusual fl) values may be used to detect local 
misfolding in transient intermediates either en route 
the productive folding pathway, such as in the case of 
the Im7 protein (Capaldi et al. 2002), or acting as off- 
pathway kinetic traps, like in the case of DlpPDZ 
(Gianni et al. 2010). 


error on 4> values A complete description of the equa¬ 
tions describing the error propagation in O values 
analysis may be found in reference (Sato et al. 2006). 
In turn, it is observed that the standard error on 
values, 5(3)), may be defined as: 


5(4') = 4)- 


2 S^{h) S^{a)b^ 


(5) 


where a = + 

andS2(/>)=^ + ^. 

Importantly, at values of 4) ^ 1, the wild type and 
mutant will display the same unfolding rate constants, 
and the error on the 4) value will be lower, S(4>) ^ 
Furthermore, at values of 4) ^ 0, the folding rate 
constants will be identical and S(4>) ^ In conclu¬ 
sion, the extreme values of 4) ^ 1 and 4) ^ 0 will be 
the most reliable and will display the lowest errors. 

Over-and-above the quantitative definition of 
S(4)), a critical parameter to define the reliability 
of 4) values is the magnitude of AAGn-d upon muta¬ 
genesis. On one hand, while the smallest values 
should be more reliable because they represent the 
smallest perturbations of structure, they are more 
sensitive to experimental errors. On the other hand, 
larger AAGn-d values can be measured more accu¬ 
rately but may delete many contacts and contain reor¬ 
ganization of structure. In the latter case, fractional or 
unusual 4> values may be difficult to interpret, 
whereas 4) —> 1 and 4* ^ 0 are always a good measure 
of structure formation. The analysis of a large number 
of variants produced to date suggests that mutants 
displaying a AAGn-d lower than «0.5-0.6 kcal 
mol“' should be excluded from the analysis (Fersht 
and Sato 2004). 


Experimental Limitations and Errors 

Because 4* values are ratios of differences between 
experimental observables, they can potentially be 
very sensitive to errors when such differences are 
small. It is then of critical importance to define the 
errors of 4) values and to establish quantitatively the 
limitations of such technique. Since AAGts-d and 
AAGn-d are not independent variables, it is incorrect 
to use standard propagation methods to calculate the 


Summary 

Understanding the folding of proteins is one of the 
fundamental problems in science. By combining 
experimental and theoretical approaches, we are now¬ 
adays moving toward an atomistic depiction of folding 
landscapes. Within this frame, the 4) value analysis, 
sketched in this short entry, has played a critical role in 
the past two decades. The method, which infers struc¬ 
ture formation from thermodynamic measurements, 
can, in theory, be applied to map interaction patterns 
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in any kind of conformational transitions in proteins 
(such as protein folding, binding, catalysis, etc.) and, in 
protein folding studies, still represents the most pow¬ 
erful experimental approach to characterize the struc¬ 
ture of high-energy metastable states. 

Cross-References 

► Double-Mutant Cycle Analysis 

► Structurally Disordered Proteins 
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A^Aq ATP Synthase 

► ATP Synthases from Archaea: Structure and 
Function 

AAC1, AAC2, AAC3, AAC4 Mitochondrial 
ADP/ATP Carriers 

► Mitochondrial Transport Protein Family 

Aac3p Yeast ADP/ATP Carrier Isoform 3 

► Mitochondrial Transport Protein Family: Structure 
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Ab Initio Molecular Modeling 
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► Absorption Spectroscopy and Transition Polariza¬ 
tions: Pictorial Description 

► Protein Concentration Determination Using Dyes 
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Absorbance Spectroscopy: Overview 

Paul Wormell* and Alison Rodger^ 

'School of Science and Health, University of Western 
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Coventry, UK 

Synonyms 

Absorbance of light; Transmission of light 

Definition 

Most spectroscopic phenomena arise from the interac¬ 
tion of a molecule with one photon at a time. The 
molecule may either absorb or scatter the photon; it 
may also emit a photon. The simplest form of 
spectroscopy is absorption, which measures how 
much light of a given frequency is absorbed by 
a collection of molecules. If a molecule absorbs 
a photon of frequency v, it increases its energy by 

AE = hv = hcl (1) 


G.C.K. Roberts (ed.). Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 
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where h is Planck’s constant, 1 is the wavelength 
of the light, and c is the speed of light. Only light 
of the correct frequency to cause a molecule to 
jump from one energy level to another may be 
absorbed. When molecules absorb photons of visible 
or ultraviolet (UV) light, the magnitude of the energy 
of these photons is (2-12) x 10”*^ J/molecule, 
A = 170-800 nm which is the amount required to 
rearrange the valence electron distribution of a mole¬ 
cule. Thus, UV-visible absorption spectroscopy 
gives information about the bonding in a molecule or 
collection of molecules. 

In practice, in a collection of molecules, the photons 
absorbed by different molecules will be of slightly 
different energies, so what we measure is a curve 
such as the one in Fig. 1 , where the signal that is plotted 
is a measure of the probability that a transition will 
occur at that energy (or wavelength). Such a plot of the 
absorption of light versus X or v is known as an absorp¬ 
tion spectrum. The spectrum of Fig. 1 is the spectrum 
that would be measured for a 1 M solution in a 1 cm 
path length cuvette if any instrument could do it. In 
practice, either path length or concentration 
(or a combination) needs to be reduced by about two 
orders of magnitude. 


Absorbance is a common measure of absorption, 
and is defined in terms of the intensity of incident, Ig, 
and transmitted, I, light (Hollas 2004; Atkins and 
Friedman 2005; Atkins and de Paula 2006): 

A = logio(4//) (2) 

The Beer-Lambert law for the absorbance. A, of 
a sample of concentration C is 

A = eCi (3) 

where i is the length of the sample through which the 
light passes and £ is known as the extinction coeffi¬ 
cient. If C is measured in mol dm“^ and i is measured 
in cm, then e has units of mol“^ dm^ cm“*. e depends 
on the identity of the molecules in the sample, their 
environment, and X. Therefore, A is a function of 
wavelength. The Beer-Lambert law is valid as long as 
the spectrometer can measure I accurately and 
there are no concentration-dependent intermolecular 
interactions. The most common application of 
UV/visible absorption spectroscopy is to determine 
the concentration of a species in solution using the 
Beer-Lambert law. 
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Absorbance Spectroscopy: Quality 
Assurance 

Paul Wormell 

School of Science and Health, University of Western 
Sydney, Penrith, NSW, Australia 


Synonyms 

Calibration; Codes of practice; Standards 

Definition 

Modern spectrophotometers are usually accurate and 
reliable, but this should never be taken for granted, and 
there is a range of protocols that can he used to assess 
such parameters as wavelength accuracy (e.g., using 
holmium perchlorate solution or the sharp emission 
lines from a hydrogen or deuterium lamp), stray radi¬ 
ation, photometric accuracy (e.g., using potassium 
dichromate solution, noting the associated hazards), 
and baseline flatness. This is a routine requirement 
for an accredited testing laboratory, which may, for 
example, be required to comply with AS ISOAEC 


17025-2005, General Requirements for the Compe¬ 
tence of Testing and Calibration Laboratories. It may 
also be essential for a forensic testing laboratory, in 
which all aspects of the laboratory’s testing practice 
may attract intense scrutiny, and for any scientific 
application in which data reliability and measurement 
uncertainty must be ascertained. Eor example, the 
National Association of Testing Authorities Australia 
(NATA) requires accredited chemical testing authori¬ 
ties to “carry out regular, recorded calibration checks 
on all spectrophotometers or automated devices 
employing spectrophotometers or colorimeters”. 
These calibrations must include checks on wavelength 
accuracy, absorbance, linearity, stray light, and 
matching of cells. Calibrations must be carried out in 
accordance with the manufacturer’s instructions and/ 
or relevant codes of practice listed below. National and 
international codes of practice include the following 
standards, and others may be applicable in particular 
jurisdictions (AS refers to Australian Standard; 
ASTM refers to ASTM International, formerly 
known as the American Society for Testing and Mate¬ 
rials (ASTM)): 

AS 3753 recommended practice for chemical analysis 
by ultraviolet/visible spectrophotometry. 

ASTM El31 standard terminology relating to molec¬ 
ular spectroscopy. 

ASTM El 69 standard practices for general 
techniques of ultraviolet-visible quantitative 
analysis. 

ASTM E275 standard practices for describing and 
measuring performance of ultraviolet, visible, and 
near-infrared spectrophotometers. 

ASTM E925 standard practice for monitoring the cal¬ 
ibration of ultraviolet-visible spectrophotometers 
whose spectral slit width does not exceed 2 nm. 
ASTM E958 standard practice for measuring 
practical spectral bandwidth of ultraviolet-visihle 
spectrophotometers. 

In addition to any legal or regulatory requirements 
for testing laboratories, reliable analytical and research 
data require the spectroscopic instrumentation to per¬ 
form to a known and acceptable standard. 

Cross-References 

► Absorbance Spectroscopy: Overview 

► Absorption Spectroscopy: Practical Aspects 
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► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► UV Absorbance Spectroscopy of Biological 
Macromolecules 
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Absorbance Spectroscopy: Spectral 
Artifacts and Other Sources of Error 

Paul Wormell* and Alison Rodger^ 

'School of Science and Health, University of Western 
Sydney, Penrith, NSW, Australia 
^Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

Deviations from Beer-Lamhert Law; Light scattering; 
Solvent absorption; Spectrophotometric error; Stray 
light 


Definition 

Modern spectrophotometers and data-acquisition 
software are usually reliable and easy to use, but the 
spectra that they produce must always be regarded 
critically, as there are many possible sources of error 
and measurement uncertainty. Some of these relate to 
sample purity and identity. Others relate to the quality 
and calibration of the instrument, experimental tech¬ 
nique, choice of sample cells and wavelength ranges, 
correct use of software for data acquisition and analy¬ 
sis, appropriate scan parameters that optimize signal- 
to-noise ratios yet ensure the fidelity of the recorded 
spectrum, and uncertainties associated with data anal¬ 
ysis (e.g., the use of standard curves). A good under¬ 
standing of spectroscopic principles, and proper 
instruction and supervision in the use of spectroscopic 
instrumentation, is essential for reliable spectroscopic 
work. Other factors and artifacts may also affect the 
reliability of absorption, linear dichroism {LD) and 
circular dichroism {CD) spectra, as discussed below. 
These may lead to distorted spectral band shapes, 
deviations from the Beer-Lambert law, and unreliable 
LD and CD data. 

Basic Characteristics 

A number of issues that affect absorbance data reliability 
but may not be apparent upon inspection of a spectrum 
or calibration of an instmment are discussed below. 

Solvent Absorption 

In absorption spectrophotometry, most samples are 
dissolved in a solvent, although some spectra are 
recorded of pure solids or samples that are incorpo¬ 
rated in polymer films. The choice of solvent is often 
determined hy sample solubility and stability. For bio¬ 
molecules, buffers are often used to manage these 
factors, and to ensure that the correct ionic or neutral 
form of the sample is being analyzed. Some solvents 
(e.g., pure water and n-pentane) are effectively trans¬ 
parent at wavelengths down to 200 nm, but others are 
not. A summary of solvent cutoffs in a 1-cm cuvette is 
given in Table 1. Cutoffs in practice depend on the 
purity of the solvent as even small percentages of 
impurity may be present in high molar concentration. 
Buffers are a more complicated issue as it depends on 
the concentration required to ensure buffering 
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Absorbance Spectroscopy: Spectral Artifacts and Other 
Sources of Error, Table 1 Wavelength at which the solvent 
absorbance in a 1-cm path length cuvette is 1 (arranged in order 
of increasing absorbance) according to Honeywell Burdick and 
Jackson. An absorbance of 1 is the effective cutoff for sample 
solutions since most instruments become unreliable between 
absorbances of 2 and 3 


Solvent 

Wavelength at which 

A = 1 in 1 -cm path length 

Acetonitrile UV 

190 

Pentane 

190 

Water 

190 

Hexane UV 

195 

Cyclopentane 

198 

Cyclohexane 

200 

Heptane 

200 

Isopropyl alcohol 

205 

Methanol 

205 

Ethyl alcohol 

210 

n-Propyl alcohol 

210 

Trifluoroacetic acid 

210 

Tetrahydrofuran UV 

212 

n-Butyl alcohol 

215 

1,4-Dioxane 

215 

Ethyl ether 

215 

Iso-octane 

215 

n-Butyl chloride 

220 

Isobutyl alcohol 

220 

Ethylene dichloride 

228 

1,1,2-Trichlorotrifluoroethane 

231 

Dichloromethane 

233 

Chloroform 

245 

Ethyl acetate 

256 

A,A-dimethylformamide 

268 

Dimethyl sulfoxide 

268 

Toluene 

284 

o-Xylene 

288 

o-Dichlorobenzene 

295 

Acetone 

330 


capacity. A summary of some components as provided 
by (Kelly et al. 2005) is given in Table 2. Reducing the 
path length or the concentration of the buffer extends 
the accessible spectral range. 

Although, in principle, a double-beam spectropho¬ 
tometer can correct for a weakly absorbing solvent if it 
is placed in the reference beam, spectroscopic accu¬ 
racy and precision deteriorate as the solvent absor¬ 
bance increases, and stray light (see below) may also 
degrade the absorbance readings. In the worst case, 
both the sample and reference cells are opaque at the 


Absorbance Spectroscopy: Spectral Artifacts and Other 
Sources of Error, Table 2 Absorbances of selected buffer 
components at 1 mM in a 1-cm path length cuvette. Tris = tris 
(hydroxymethyl)aminomethane. (Note: TBS often denotes Tris- 
buffered saline, which is a mixture of 50-mM Tris and 50-mM 
NaCl adjusted to pH 7.6 with HCl, is a mixture of components all 
of which must be taken into account.) HEPES = 2-[4-(2- 
hydroxyethyl)piperazin-l-yl]ethanesulfonic acid. MES = 2-{N- 
morpholino) ethanesulfonic acid (Kelly et al. 2005) 


Component 

180 nm 

190 nm 

200 nm 

210 nm 

NaCl 

>0.5 

>0.5 

0.02 

0 

NaF 

0 

0 

0 

0 

NaC 104 

0 

0 

0 

0 

Boric acid 

0 

0 

0 

0 

Na borate (pH 9.1) 

0.3 

0.09 

0 

0 

Na2HP04 

>0.5 

0.3 

0.05 

0 

NaH 2 P 04 

0.15 

0.01 

0 

0 

Na acetate 

>0.5 

>0.5 

0.17 

0.03 

TrisH2S04 (pH 8.0) 

>0.5 

0.24 

0.13 

0.02 

HEPES/Na"" (pH 7.5) 

>0.5 

>0.5 

0.5 

0.37 

MES/Na (pH 6.0)-l- 

>0.5 

0.29 

0.29 

0.07 


analytical wavelength, producing a very noisy and 
unusable spectrum. It is thus often better to use water 
or air in the reference beam and subtract a buffer 
baseline manually as then one knows how it is affect¬ 
ing the spectrum. 

Light Scattering Samples 

Unless there is no alternative, absorption spectra 
should not be recorded for turbid samples, as the mea¬ 
sured absorbances may be misleading. However, many 
biomacromolecular species are of comparable size to 
the wavelength of light, and as a result they scatter 
light. The situation is worse for linear and circular 
dichroism as such samples may differentially scatter 
the two polarizations of light used. The theory of light 
scattering is complicated, depending not only on size 
regime of the particles but also on their shape. Thus, if 
at all possible, one should avoid it occurring rather than 
trying to correct for it. In general, scattering can be 
reduced either by reducing the size of the particles or 
collecting a high percentage of the scattered photons. 
Three methods that may enable scattered photons to be 
collected (and thus to avoid the problem of scattering) 
are to (1) have a wide-angle photomultiplier tube 
(PMT), (2) place the sample very close to the 
photomultiplier tube, or (3) have a collecting lens 
close to the sample to refocus the scattered light onto 
the PMT. 
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However, it is not always possible to remove light¬ 
scattering artifacts. In such a case, the measured spec¬ 
trum is a combination of true absorbance and scatter¬ 
ing. The scattering contribution is usually apparent as 
a sloping baseline outside absorbance bands. Nordh 
et al. (1986) showed that a simple empirical correction 
can often be subtracted from an observed linear dichro- 
ism spectrum to remove the sloping baseline 

^^Scattering ^ 

The same approach may also work for other 
spectroscopies. 

Stray Light 

In addition to any light scattering from the sample, 
so-called stray light (or stray radiant energy) from 
anywhere in the system may cause additional artifacts 
especially at lower wavelengths where lamps struggle 
for light intensity. In such a case, a signihcant percent¬ 
age of the light that does reach the photomultiplier 
detector may be stray light rather than light of the 
correct wavelength that has passed through the sample. 
However, all photons gets counted as unabsorbed pho¬ 
tons of the monochromator’s nominal wavelength, 
causing the detector to give a misleading indication 
that there is less absorption than there really is. The 
main problem is due to imperfections of the monochro¬ 
mator: long-wavelength stray light bypasses the mono¬ 
chromator. This limits the maximum absorbance at 
which an instrument will give reliable readings. If the 
spectrum is “true,” it will follow the Beer-Lambert 
Law (A = fiCf). A sharp drop off in intensity on the 
low wavelength side of a band is often indicative of 
this problem. 

Absorbance Flattening 

The phenomenon of absorption flattening is 
a suppression of the absorbance signal in regions of 
high absorbance in nonhomogeneous samples causing 
the Beer-Lambert law to break down. The same issues 
apply to linear and circular dichroism. To understand 
this phenomenon, consider the extreme case where 
half the light beam passes through pure solvent and 
half passes through a more dense solution of analytes. 
Half of the incident photons (/o/2) will thus reach the 
PMT directly, while the other half will pass through 
a solution of higher absorbance. The percentage trans¬ 
mittance of the sample is then 


Absorbance Spectroscopy: Spectral Artifacts and Other 
Sources of Error, Table 3 Absorbance flattening effects for 
samples where the light passes through a sample of which half 
has no absorbance and half has the indicated absorbance 


A of clear 
part of the 
solution 

A of dense 
part of 
solution 

A /tP 

^apparent 

inhomogeneous 

solution 

A for the 
analogous 
homogeneous 
solution 

Error 

(%) 

0 

3 

0.30 

1.5 

80 

0 

2 

0.29 

1.0 

71 

0 

1 

0.26 

0.5 

48 

0 

0.5 

0.19 

0.25 

24 


r=ioo%x(':^) (2) 

where A = logio(/o//) is the absorbance of the absorb¬ 
ing half of the sample and logio(/o//o) = 0 is the 
absorbance of the transparent half of the sample. The 
apparent absorbance of the sample is thus significantly 
reduced compared to the true absorbance had the sam¬ 
ple been homogeneous since 

Aapparent = logm 

Thus, for an absorbance in the denser part of 
the sample of 3 or greater, Aapparent is equal to 
logio(2) = 0.301. For intermediate and small absor¬ 
bances, the effect of light leakage is significant as illus¬ 
trated in Table 3. 

An effect analogous to absorption flattening can be 
caused by any arrangement that allows a fraction of the 
light to bypass the sample solution or to pass through 
a shorter path length of sample. Unusual sample 
holders such as capillaries are also prone to this 
problem (Waldron et al. 2010). In some cases, the 
problem of absorption flattening can be reduced to 
acceptable levels simply by diluting the whole solution 
as shown by the lower absorbance lines in Table 3. For 
membrane systems, one somehow has to remove the 
local high concentrations and simple dilution is often 
not sufficient. Mao and Wallace (1984) have outlined 
how this may be done by reducing membrane particle 
size. Unfortunately, the problem cannot always be 
avoided, in which case methods such as those of 
Gordon and Holzwarth (1971) can be used to correct 
measured data. 
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Photodegradation 

While not specifically a spectroscopic artifact, 
photodegradation of biomolecules in a light beam is 
a very common cause of measured spectra not being 
the true spectrum. Fast preliminary scans are the best 
way to identify whether this is a problem. 

Cross-References 
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► Circular Dichroism and Chirality 

► Circular Dichroism Spectroscopy of 
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Absorption Spectroscopy and Transition 
Polarizations: Pictorial Description 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

Absorbance of light; Transmission of light 


Definition 

Absorption can be pictorially viewed as either the 
electric field or the magnetic field (or both) of the 
radiation pushing the molecular electron density from 
a starting arrangement to a higher energy final one 
(Atkins and de Paula 2006; Atkins and Friedman 
2005; Flollas 2004). The electric field is far more 
effective than the magnetic field in achieving the 
push that gets a photon absorbed. The direction of net 
linear displacement of charge is known as the polari¬ 
zation of the transition. The polarization and intensity 
of a transition are summarized by its electric dipole 
transition moment, which is a vectorial property 
having a well-defined direction (the transition polari¬ 
zation) within each molecule and a well-defined length 
(which is proportional to the square root of the absor¬ 
bance). The transition moment may be regarded as an 
antenna by which the molecule absorbs light. Each 
transition thus has its own antenna and the maximum 
probability of absorbing light is obtained when the 
molecular antenna and the electric field of the light 
are parallel. Conversely, the absorption is zero when 
the light polarization and antenna are perpendicular to 
one another. 
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Absorption Spectroscopy to Determine 
Macromolecule Structural Changes 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Definition 

The UV-visible absorption spectrum of a biomolecule 
is the combination of the electronic transitions of all of 
its component parts or chromophores. The spectrum 
thus depends on the electronic structures of its compo¬ 
nent parts, which in turn depends on their environment. 
Thus, e.g., the absorbance of a tryptophan chromo- 
phore that is buried in the hydrophobic core of the 
protein will be at least slightly different from that of 
an exposed tryptophan. Two particularly useful appli¬ 
cations of UV absorption spectroscopy to probe 
structural changes are outlined below. 

Macromolecule Condensation 

Measuring a UV-visible spectrum with an absorption 
spectrometer may be used to follow the condensation 



Absorption Spectroscopy to Determine Macromolecule 
Structural Changes, Fig. 1 Melting curves of calf thymus 
DNA (100 |rM) in the presence of different concentrations of 


of a macromolecule sample into particles, though 
what is actually being probed is scattering of the 
light rather than absorption. A monotonic increase in 
absorbance signal outside the absorption bands of the 
molecules being studied is observed as condensation 
takes place. In the case of DNA, the addition of 
a highly charged DNA-binding ligands (such as 
spermine or [Co(NH 3 ) 6 ]^’^) will effect this change. 
Concomitantly, with the increase in absorbance signal 
above 300 nm, a decrease in the 260-nm DNA absorp¬ 
tion is observed. 

DNA or Protein-Melting Curves: Absorption 
as a Function of Temperature 

If there were no residue-residue interactions and no 
residue-environment interactions in a biomolecule, 
then the UV-visible absorption spectrum would be 
independent of its geometry. In the case of DNA, the 
71-71 stacking interactions between base pairs lower 
the magnitude of the absorbance at 260 nm 
(the hypochromic effect) compared with that of the 
isolated bases and in general changes the absorbance 
at most wavelengths. Thus, we can use the change in 
the UV absorbance signal at a chosen wavelength (usu¬ 
ally 260 nm, though at ^280 nm, A-T base pairs show 
very little change in absorbance, so this wavelength 



Temperautre (°C) 

P-[Fe 2 L 3 ]'*'^ (illustrated) measured at 260 nm in a 1-cm path 
length cuvette (Meistermann et al. 2002) 
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may be used to probe the role of G-C relative to A-T 
base pairs) to follow the disruption of base stacking and 
hence also base pair hydrogen bonding. 

The data from such an experiment is usually illus¬ 
trated as a melting curve or a derivative melting curve 
summarized by the so-called melting temperature, T„, 
(e.g., Fig. 1). is the temperature where the absor¬ 
bance is the average of the duplex and single-stranded 
DNA absorbances. Thermodynamic data relating to the 
stability of the duplex may also be extracted from melt¬ 
ing curves (Marky and Breslauer 1987). An example for 
calf thymus DNA and the stabilizing effect of a cationic 
ligand bound to the DNA are illustrated in Fig. 1. 
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Absorption Spectroscopy to Probe 
Ligand Binding 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

DNA-ligand interactions; Ligand-binding interactions 


Definition 

When a ligand solution is added to a solution of 
biomacromolecules, if it does not bind to the macro¬ 
molecules, then the UV-visible spectrum will simply 
be the sum of the DNA spectrum and the ligand 
spectrum. If the ligand does bind to the macromole¬ 
cules, then the spectrum of the complex will be (at 
least slightly) different from the sum spectrum. One 
should note that the observed spectrum is probably a 
complicated mixture of the spectrum of bound and 
unbound ligands and free and complexed 
macromolecules. 

Sometimes there are changes in the spectrum char¬ 
acteristic of the type of interaction. For example, when 
a planar aromatic molecule binds intercalatively to 
DNA (sandwiched between two base pairs), there is 
usually a characteristic decrease in the ligand absor¬ 
bance signal (hypochromism) of up to 50% and a shift 
to longer wavelength (bathochromic shift) of between 
^2 and 20 nm as illustrated in Fig. 1. The DNA 
spectrum is also affected by any molecule such as 
an intercalator which causes a structural change. 
This makes such spectra a useful probe of DNA-drug 
interactions but renders absorbance useless for concen¬ 
tration determinations unless the perturbed extinction 
coefficients are known. 

Basic Characteristics 

Titrations to Determining Data for Binding 
Constant Calculations 

The term “titration” is used to cover experiments 
where spectra are collected as a function of concentra¬ 
tion, ionic strength, pH, etc. To minimize macromole¬ 
cule consumption and also (perhaps surprisingly) to 
minimize concentration errors, the best way to proceed 
is often to add solution to the cuvette. A simple way to 
avoid dilution effects is to proceed as follows. 
Consider a starting sample that has concentration 
jc M of species X. Each time y cm^ of Y is added, also 
add y cm^ of a 2x M solution of X. The concentration of 
X remains constant at x M. Many variants on this theme 
may be derived. 

Binding Constants 

If the shape of the changes induced into a ligand or 
macromolecule spectrum remains unchanged during 
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Absorption Spectroscopy to Probe Ligand Binding, 
Fig. 1 Absorbance spectrum of an anthracene derivative, B, 
(illustrated) (5 pM) in water in a 1-cm path length cuvette in 
the presence of increasing concentrations of DNA (0-84 pM as 


indicated on Figure). Compound B absorbance intensity 
decreases and red-shifts as the DNA concentration increases. 
DNA has no absorbance above 320 nm (Tan et al. 2006) 


a titration experiment, but the magnitude changes in 
a manner that is proportional to the concentration of 
bound ligand, then we can conclude that the ligands are 
binding in one binding mode or in constant proportions 
in more than one mode (meaning site, orientation, 
sequence, etc.). In such cases, spectroscopic data can 
be used to determine the equilibrium binding constant, 
K. The data must be of very high quality for absorbance 
(or any other spectroscopic data) to be used to deter¬ 
mine K. A simple plot of change in absorbance versus 
either total macromolecule or total ligand concentra¬ 
tion (whichever is being varied) will probably enable 
the quality of the data set to be determined. It should be 
a smooth curve. There are many ways of proceeding 
from such a curve to a value for K. The one outlined 
below is designed to indicate the principles rather than 
necessarily being the best approach. 

We assume that we can think of the binding as 
a simple equilibrium; 


( 1 ) 


where Ly is a free ligand, is a bound ligand, and Syis 
a free site. For DNA, it is often possible to consider it 
as a series of binding sites of n residues (bases or 
basepairs) in size. The total site concentration is then 
>^tot = where [M] is the residue concentration of 

the macromolecule. For proteins, it is usually 
preferable to think in terms of the concentration of 
protein molecules rather than residues (amino acids). 
In this case, Stot = n’[M] for n’ the number of binding 
sites per protein should be used. We may write: 


K = 


nch 

Cf[M] 


( 2 ) 


where c* is the concentration of bound ligand and Cf is 
the concentration of the free ligand. 

There are a large number of methods for determin¬ 
ing K using absorbance data. The simplest is the 
enhancement method. This method is commonly used 
for fluorescence spectroscopy and may also be used to 
interpret absorbance data. We write: 


k + ^ 
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Cto4 = CfAf + CbAb 

— (AoI ^h)^f “I” ^hAh 
Aot (a Af'j 
Ah - Af 

Application of this equation requires knowledge of 
the absorbance of free and bound ligand. Determining the 
latter requires measuring an absorbance spectrum under 
conditions where it is known that all the ligands are 
bound to the macromolecules. K may then be determined 
directly. A more accurate value of K will be achieved if 
the data are used to perform a Scatchard plot (Scatchard 
1949) or fitted directly. The disadvantage of the 
Scatchard plot is it does not weight all data points 
equally. Its major advantage is that the curve should be 
a straight line, so it is obvious to the eye if (Eq. 2) or (3) 
are not appropriate for the system being studied. 

The Scatchard plot is based on rewriting the 
equation for the equilibrium constant as: 
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Definition 


r _ KSf 
Cf ~ [M] 



n 


(4) 


where 


r = 


Ch 

[M] 


(5) 


So, a plot of r/cy versus r has slope —K and y-intercept 
Kin. The jr-intercept occurs where r = n. 

Other methods commonly used with, for example, 
circular dichroism or linear dichroism data may be 
used with normal absorption data if the change in 
absorbance (the absorbance of the DNA-ligand system 
minus the absorbance of a free ligand solution of the 
same ligand concentration) is used in the analysis. 


Cross-References 


UV-visible absorbance is most commonly used to 
determine the concentration of a sample and also to 
give an indication of its purity. It is very easy to collect 
absorbance spectra of biomolecules; however, they are 
not always useful because of some of the issues 
outlined below (Norden et al. 2010; Rodger and Norden 
1997). Nucleic acids (DNA and RNA), proteins, and 
peptides absorb very little light above 300 nm in the 
absence of ligands or prosthetic groups with chromo- 
phores (absorbing units). However, it is usually wise to 
collect absorbance data from about 350 nm. If the 
spectrum is not flat between 350 and 310 nm, then the 
sample includes particles whose size is of the order of 
the wavelength of light; therefore, what is being mea¬ 
sured includes scattering of the incident light rather 
than simply absorption. The extreme of this is when 
an absorbance spectrometer is used to measure cell 
density in a culture using say 450 nm light - this is 
only light scattering despite usually being referred to as 
absorbance. 


► Absorption Spectroscopy: Practical Aspects 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► UV Absorbance Spectroscopy of Biological 
Macromolecules 


Basic Characteristics 

The protocol outlined below is one way of acquiring 
a simple wavelength scan of the absorbance of 
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a solution of biological macromolecules: parameters, 
solvents, and baselines. Consideration should be given 
to artifacts that may arise including the above men¬ 
tioned scattering and absorbance flattening. The pro¬ 
tocol assumes the instrument being used is a double 
beam instrument with sample and reference beam. 
Some modification is required for a single beam 
instrument. 

The procedure to perform a wavelength scan is as 
follows: 

1. Parameters 

The wavelength range will usually be 350-200 nm, 
unless the buffer cuts out the low wavelength end 
(see below) or ligands are included in the sample. 
Only go below 200 nm if the instrument is nitrogen 
purged. 

A data interval of 0.5-1 nm is adequate as the bands 
are all broad. Shorter data intervals will fill up the 
computer memory, though a shorter data interval 
may help any noise reduction analysis. If data are 
being collected to aid interpretation of linear dichro- 
ism spectra, ensure that the data interval is the same 
for absorbance and linear dichroism spectra. 

The bandwidth is the wavelength range of the radi¬ 
ation at any specified wavelength. If data are col¬ 
lected every 0.5 nm, then a bandwidth of 0.5 nm is 
appropriate. A larger value will lead to a greater 
averaging of data than the data interval suggests. Do 
not choose a much smaller bandwidth than data 
interval as the incident light intensity will be 
reduced and the spectroscopic noise will increase 
without improving the quality of your data. 

The data averaging time determines the signal to 
noise ratio and affects the scan speed required to 
produce undistorted spectra. 0.033-0.1 s is usually 
a good choice. Longer times are required if the 
absorbance is very small (less than 0.1 absorbance 
units) or small differences in absorbance are being 
determined (as for DNA melting curves, see below). 
There is no point in using a long averaging time if 
the noise level is not apparent in the spectrum at 
a shorter averaging time. 

2. Solvent or buffer 

Choose the solvent or buffer for your experiment, 
then run a spectrum of the buffer with the reference 
holder empty. Run a spectrum over the desired 
wavelength range to check that the solvent/buffer 
absorbance is not significant over the wavelength 


range of interest. If it is, then you need to change the 
solvent/buffer. 

In choosing your buffer, note that phosphate buffers 
are essentially spectroscopically invisible over the 
wavelength range usually used; however, phosphate 
does interact with some samples especially mem¬ 
branes. Cacodylate and Tris buffer are also trans¬ 
parent down to ^190 nm. Ammonium acetate cuts 
off at ^210 nm. Chloride ions begin absorbing 
above 200 nm, so high salt spectra cannot be col¬ 
lected at the lower wavelength end of the spectrum. 
Higher buffer concentrations may be able to be 
accommodated in shorter path length cuvettes 
(cf. Beer-Lambert law), so if it is not possible to 
dilute or change the buffer, try a smaller path length 
cuvette - though this will also reduce the number of 
analyte molecules in the light beam. 5-mm cuvettes 
will stand in a normal 1-cm cuvette holder; 1-mm 
cuvettes will need spacers to hold them vertical - if 
they are not held vertical, then you will be working 
with a variable path length. Smaller path length 
cuvettes will need special holders. 

3. Cuvettes 

To collect data down to 200 nm, cuvettes (also 
known as cells) should be made of fused silica. 
This material is sometimes referred to as “quartz”; 
hence the “Q” label that appears on some cuvettes. 
Optical glass cuvettes (which may be labeled “G”) 
are not suitable for these applications, although they 
may perform well at visible wavelengths. Stoppers 
should be used whenever possible, especially if the 
solvent is volatile. Cleanliness is important: human 
perspiration contains amino acids and peptides, so 
fingerprint residues absorb at UV wavelengths and 
an apparently clean cuvette can have a significant 
absorbance below 250 nm. Cuvettes should 
therefore be handled by their frosted sides or 
edges, or the top and bottom where the light will 
not be passing through to avoid finger prints on the 
surface. A variety of cleaning methods is available 
(see, e.g., Australian Standard AS 3753-2001 
Recommended practice for chemical analysis by 
ultraviolet/visible spectrophotometry). These are 
of varying rigor. Cuvettes should be wiped with 
a lint-free tissue before use. For double-beam spec¬ 
trometers, matched cuvettes (which may be 
purchased in sets from the manufacturer) are pref¬ 
erable. According to AS 3753-2011, the 
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absorbances of a pair of matched silica cells should 
differ by no more than 0.008 at 240 nm. 

4. The baseline 

There are a number of ways of collecting the base¬ 
line of an absorbance spectrum. The key thing is to 
ensure that the absorbance of the cuvette and buffer 
are subtracted from that of the analyte in the final 
spectrum. Two options are as follows. 

(a) The usual method is to place the solvent/buffer 
in a pair of matched cuvettes in both the refer¬ 
ence holder (usually the rear position) and the 
sample holder and perform a baseline accumu¬ 
lation. This is often stored in the instrument as 
a baseline and automatically subtracted from 
subsequent spectra. It is prudent to check that 
baseline subtraction from a second scan of the 
solvent/buffer spectrum gives a flat spectrum 
with an acceptable signal-to-noise ratio. This 
ratio will affect the measurement uncertainty 
in the absorbance readings from the spectropho¬ 
tometer and hence the precision of any concen¬ 
trations that are calculated using the 
Beer-Lambert law or any other data analysis 
undertaken. The reference cuvette then remains 
filled with the solvent/buffer when sample spec¬ 
tra are collected. 

(b) For low-absorbance samples or high-absorbance 
solvents/buffers, it is generally preferable to 
store an instrument (empty sample holders) 
baseline for automatic subtraction. One then 
collects a spectrum of the solvent/buffer 
referenced against an empty sample compart¬ 
ment and manually subtracts this from subse¬ 
quent sample spectra. This approach then lets 
one see where the solvent/buffer baseline is 
determining spectral features. 

5. The spectrum 

Place the sample in the sample cuvette in the sample 
holder, fill the reference beam with whatever it 
contained when you measured the baseline, and 
record the spectrum and subtract the baseline. 
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Synonyms 

Classes of absorption transitions 

Definition 

A very wide range of organic and inorganic substances 
can be analyzed using absorption spectroscopy. The 
wavelengths, intensities, and fine structure of the 
absorption bands are determined by the structures of 
the neutral or ionic species concerned. For almost all 
substances, UV and visible absorption spectra are 
caused by electronic transitions. A spectrum may con¬ 
tain more than one region of absorption, known as a 
band or band system, as molecules may undergo 
a range of different electronic transitions. 
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Absorption Spectroscopy: Relationship of Transition Type 
to Molecular Structure, Fig. 1 Aromatic absorption 
spectra of tryptophan, tyrosine and phenylalanine scaled to 
give similar long wavelength maximum absorbances. The 
concentrations to achieve these spectra are indicated on the 
figure 

Basic Characteristics 

To a first approximation, electronic transitions may 
be described as involving different combinations 
of orbitals: bonding (e.g., c and % for organic 
molecules), so-called nonbonding (n) and antibonding 
(e.g., (T* and n*) orbitals. A simple bonding-to- 
antibonding promotion model of electronic transitions 
is an oversimplification that does not adequately 
explain the spectra of all compounds - for example, 
benzene, porphyrins (e.g., in heme groups) and 
proteins - but transitions are commonly and usefully 
characterized as being, for example, 7t —> tt* or 
« —> 7t* in character, and detailed assignments 
are possible using combinations of experimental 
spectroscopy and computational chemistry (Atkins 
and Friedman 2005). 

Given the number of possible transitions that may 
occur in the UV/visible absorption spectrum of an 
organic compound or metal complex, many bands are 
composite, with overlapping band systems, often 
including vibrational (also referred to as vibronic) 
fine structure. Figure 1 shows vibrational structure in 
the absorption spectrum of phenylalanine and other 


amino acids with aromatic side chains. The vibronic 
bands of vapor-phase spectra may also show rotational 
fine structure (Flollas 2004). 

At the wavelengths, i, that are commonly used for 
absorption spectrophotometry (1 = 170-800 nm), 
electronic transitions are commonly associated with 
conjugated regions of organic molecules, or with 
metal complexes. The structural components that 
cause absorption are often referred to as chromo- 
phores. Conjugated chromophores can be linear, as in 
P-carotene, or cyclic, as in purine and pyrimidine 
bases, heme groups, and the amino acids that have 
aromatic side chains. The chromophores may consist 
entirely of CFl groups, or may include heteroatoms 
such as nitrogen, sulfur, or oxygen. Typically, the 
longest absorption wavelength (lowest energy) 
increases with the length or, for cyclic systems, the 
size of the chromophore (Atkins and Friedman 2005; 
Atkins and de Paula 2006). 

The color and hence electronic spectrum of 
molecules will change when its structure is changed, 
for example, by lowering the pH to protonate a species. 
This is well illustrated by the color changes that 
occur when compounds such as phenolphthalein and 
Coomassie Blue are protonated, causing changes in the 
extent of 7i-delocalization within the molecules. The 
neutral form of phenolphthalein, which predominates 
at neutral and low pH, is colorless as there is no 
conjugation between the aromatic chromophores 
in the molecule, which absorb at UV wavelengths. 
However, in basic solutions, the molecule is 
deprotonated, leading to extended conjugation in the 
resulting dianion, producing a distinctive pink color 
(Fig. 2). 

Most spectra are dominated by transitions in which 
electrons are redistributed between n and n* orbitals; 
these are known as ti —> 7 t* transitions. However, het¬ 
eroatoms introduce so-called nonbonding or «-orbitals 
or lone pairs to the chromophore. These are cr-type 
orbitals that can give rise to « ^ ti* transitions in an 
absorption spectrum (Hollas 2004). The transitions are 
typically weaker than ti ^ ti* transitions, and are 
often suppressed in hydrogen-bonding solvents such 
as water, but they may be seen when the accompanying 
TI ^ TI* transitions are weak or when they occur in 
a different wavelength range, n ^ n* transitions make 
an appreciable contribution to the absorption spectra of 
some proteins, and may become more prominent in 
circular dichroism (CD) spectra (Norden et al. 2010). 
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Absorption Spectroscopy: 
Relationship of Transition 
Type to Molecular 
Structure, 

Fig. 2 Phenolphthalein at low 
and high 


HO 
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Absorption Spectroscopy: Relationship of Transition Type 
to Molecular Structure, Fig. 3 Absorbance spectrum of the 
orange colored [Ru(phen) 3 ]^'^ 

As discussed above, absorption wavelengths are 
sensitive to the extent of conjugation in a chro- 
mophore. They are also sensitive to interactions 
between neighboring chromophores in the same mol¬ 
ecule (intramolecular), or in neighboring molecules 
(intermolecular). Examples of intramolecular coupling 
between chromophores include n stacking of aromatic 
residues in proteins and nucleic acids, and strong inter¬ 
actions between peptide n —> n* transitions in protein 
Qc-helices. Examples of intermolecular coupling 
include n stacking of ligands bound externally to 
DNA and the effects of intercalation between DNA 
base pairs on the spectroscopy of the ligand. 

Metal complexes often absorb in the range of 
170-800 nm, giving rise to the familiar colors of 


many transition-metal compounds, and contributing 
to the colors of some metalloproteins. The band sys¬ 
tems in these spectra may be assigned to electronic 
transitions that are predominantly metal-centered 
(MC), ligand-centered (LC), or metal-to-ligand 
charge-transfer (MLCT) in character. Eor example, 
[Ru(en) 3 ]^’^ is not strongly coloured, and its absorption 
spectrum comprises metal-centered d-d transitions. 
However, the replacement of the ethylenediamine 
ligands by a 7i-conjugated polypyridine ligand such 
as 2,2'-bipyridine (bpy) or 1,10-phenanthroline 
(phen) causes profound changes to the spectrum, by 
introducing ligand-centered n-n* transitions, and 
strong MLCT transitions associated with the transfer 
of metal c/-electrons to empty ligand 7t*-orbitals. The 
MLCT bands are predominantly responsible for the 
bright colors of complexes such as [Ru(bpy) 3 ]^’^ and 
[Ru(phen) 3 ]^'^ (Campagna et al. 2007) (Eig. 3). It 
should be noted that, for example, the orange color of 
[Ru(phen) 3 ] arises because it absorbs the shorter 
wavelength colors of light and does not absorb the 
red, orange, yellow, and some green wavelengths. 
The spectra of these complexes are sensitive to the 
chemical environment, and this is exploited in DNA 
binding studies. 
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Definition 

An enzyme that catalyzes hydrolysis of the neurotrans¬ 
mitter acetylcholine. 


Basic Characteristics 

Cholinesterases are enzymes that catalyze hydrolysis 
of the important neurotransmitter acetylcholine (ACh), 
necessary to allow cholinergic neurons to return to 
their resting state. Inhibitors of cholinesterases can 
therefore be used in treatment of diseases such as 
Alzheimer’s and Parkinson’s. Theoretical studies of 
the enzyme-catalyzed reaction in these enzymes can 
therefore be important to provide information for drug 
design. Apart from acetylcholinesterase (AChE), 
another type of cholinesterase found in mammals is 
butyrylcholinesterase (BChE). Whereas AChE has 
a very high catalytic efficiency for ACh hydrolysis 
and is mainly found in cholinergic synapses, BChE 
has lower efficiency and is widely distributed in tissues 
and plasma. Both enzymes feature a typical Ser-His- 
Glu catalytic triad, commonly found in serine hydro¬ 
lases. Enzyme-catalyzed acetylcholine hydrolysis 
starts by a nucleophilic attack of the catalytic serine 
onto the carbonyl oxygen of the substrate, followed by 
proton transfer from the histidine and elimination of 
a primary alcohol, which then leads to an acyl enzyme. 
Zhang et al. first studied this reaction step with ah 
initio QM/MM methods (Zhang et al. 2002). Minimum 
energy paths were optimized at the HF/3-21G/ 
AMBER level and subsequent single point MP2/ 
6-31-i-G(d)-AMBER calculations indicated an energy 
barrier consistent with experiment. The catalytic triad 
glutamate was found to provide electrostatic stabiliza¬ 
tion of the transition state of the reaction and the acyl 
enzyme formed. No “low-barrier” hydrogen bonds or 
“charge-relay” mechanisms were involved in this sta¬ 
bilization (as had been previously proposed). Zhou 
et al. calculated free energy prohles of both reaction 
steps, using B3LYP/6-31G(d)-AMBER ff99SB MD 
simulations (Zhou et al. 2010). In the acylation step, 
a short hydrogen bond was found between the histidine 
and the glutamate of the catalytic triad, and the simu¬ 
lations indicated spontaneous transfer between these 
species. Subsequently, QM/MM modeling was used to 
determine the barrier for proton transfer, which was 
indeed low (^2 kcal moU*). The proton was never 
observed to remain in between the residues, however, 
and proton localization on the histidine was favored. 
This indicated that a true “low-barrier” hydrogen bond 
is not involved in catalysis. In line with experiment, the 
barrier for deacylation was calculated to be higher than 
that for acylation. Chen et al. recently also reported 






Action Potentials 


39 


QM/MM free energy profiles at the MP2/6-31+G(d)- 
AMBER level for acylation and deacylation of acetyl¬ 
choline in BChE (Chen et al. 2011). Despite the 
similarity of the AChE and BChE structures and active 
sites, acylation was found to be rate determining in this 
case. This may explain kinetic data that indicate 
a difference between the rates of acetylcholine and 
acetyltft/ocholine hydrolysis by BChE (but not by 
AChE). 

QM/MM simulations have also been applied to 
investigate enzyme-inhibitor interactions in AChE. 
A very high-affinity inhibitor of AChE, 4’y/7-TZ2PA6, 
can be made from its precursor fragments. In solution, 
this reaction results in equal amounts of iyn-TZ2PA6 
(with high affinity) and fl«t;-TZ2PA6 (with lower 
affinity). When the precursors were added to AChE, 
only production of 5y«-TZ2PA6 is observed. QM/MM 
modeling of both the syn- and a«h-TZ2PA6 formation 
in AChE has been reported (Senapati et al. 2006). 
First, minimum energy paths were calculated at 
B3LYP/6-31G(d)-MM level for both reactions, based 
on the AChE-product complexes observed by 
protein crystallography. This indicated that the azide 
and acetylene moieties, which need to be combined to 
form the products, need to lie parallel in order to form 
the syn complex, and antiparallel to form the anti 
complex. Molecular dynamics simulations of the reac¬ 
tant complexes indicated that the enzyme enforces the 
parallel orientation of the precursors, which explains 
the exclusive production of the high-affinity syn- 
TZ2PA6 inhibitor by the enzyme. Another study, 
using QM/MM-FE and molecular mechanical molec¬ 
ular dynamics simulations, explained how, unexpect¬ 
edly, the binding of a positively charged 
trifluoroacetophenone inhibitor to AChE could still 
lead to an acyl enzyme after mutation of the catalytic 
triad histidine (Cheng et al. 2007). Apart from these 
reversible inhibitors of AChE, which may be used for 
treatment of disease, several irreversible inhibitors of 
AChE are also known. These highly neurotoxic com¬ 
pounds, such as the nerve agents sarin and tabun and 
the insecticide paraoxon, attach a phosphorus to the 
catalytic serine. Liu et al. modeled the dephosphoryla¬ 
tion (reactivation) mechanism of paraoxon-methyl 
(dimethylphosphoryl) with AChE in detail, using 
QM/MM-FE calculations after minimum energy path 
determination at the B3LYP/6-31G(d)-AMBER level 
(Liu et al. 2009). Similarly, QM/MM calculations have 
been used to study the reaction mechanisms of AChE 


with the nerve agents tabun (Kwasnieski et al. 2009) 
and sarin (Beck and Hadad 2010). Insights obtained 
from such studies may help in developing treatments 
against poisoning. 
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Actomyosin Cycle 
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Synonyms 

Multiple sequence alignments 

Definition 

The biological significance of sequence alignments is 
estimated by comparing the observed similarity scores 
with their expectation by chance. This score is named 
an E-Value and is used for searching databases. More 
accurate alignments can also be obtained through 
multiple sequence alignment modeling and by incor¬ 
porating structural information. 


Introduction 

The biological significance of a protein sequence 
alignment is difficult to establish when no structural 
information is available. Unfortunately, most protein 
sequences lack an experimentally established structure 
(especially the non-globular membrane-associated 
proteins). The biological meaning of their alignments 
must therefore be established on the basis of sequence 
similarity. This estimation is usually made using 
a statistical framework. Its rationale is that unexpect¬ 
edly high similarities are more likely to be the result of 
natural evolution. This very simple assumption results 
from the commonly accepted notion of divergent evo¬ 
lution, according to which all observed species can be 
traced back to a common ancestor. In this context, 
whenever two biological entities are more similar 
than would be expected by chance, common ancestry 
is assumed. This simple principle - that can be traced 
to Darwin and The Origin of Species - has been used 
across all fields of modem biology for more than 
a hundred years now. It applies to the comparison of 
molecular sequences. 
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Sequence Similarity and Structural 
Similarity: The Twilight Zone 

One of the first results objectively suggesting that 
sequence similarity might be used as a proxy for struc¬ 
tural similarity was established by Sander (Sander and 
Schneider 1991). By comparing all experimental 
3D-structures available at the time, Sander and his 
colleagues did show that whenever two sequences 
have more than 25% identity (defined as the ratio 
between the number of identical aligned residues and 
the total number of residues), these sequences usually 
have the same 3D-Structure. The 25% figure is sfill 
valid and referred to as the twilight zone. It is impor¬ 
tant to point out that in this framework, the percent 
identity is only informative above 25% identity; below 
this figure, an alignment becomes noninformative, 
meaning that the considered sequences are just as 
likely to be homologous or not. From a statistical 
point of view, this observation makes a lot of sense, 
as 20-25% is roughly the level of identity one would 
expect when considering random protein sequences. 


Fast Database Search with Blast and 
Biological Meaning of E-Values 

Evolutionary models used for comparing protein 
sequences are more sophisticated than a mere percent 
identity and the most common practice is to combine 
the Needleman and Wunsch (NW) algorithm 
(Needleman and Wunsch 1970) for global alignments 
or the Smith and Waterman (SW) (Smith and Waterman 
1981) for local ones with a log-odd substitution matrix 
Blosum (Henikoff and Henikoff 1992) and PAM matri¬ 
ces. This procedure has limited merits when considering 
sequences with less than 25% identity, but the SW 
algorithm can easily be used to search sequence data¬ 
bases. One starts with a query sequence that is aligned in 
turn with every other sequence in the database. 
Sequences yielding an alignment to the query with a 
score above a certain threshold are then considered to be 
potential homologues. Of course the main problem is to 
objectively determine whether a score is high enough to 
support homology inference. This problem was solved 
in the framework of another database search algorithm 
named Blast (Altschul et al. 1990). Blast can be 
described as heuristic version of SW. The main 
difference between these two algorithms is that Blast 


uses a rapid pre-hltering procedure in order to identify 
the most promising hits. These selected sequences are 
then aligned to the query in a second step, using an 
algorithm similar to SW. Blast is a heuristic because 
the pre-filtering step depends on common words (k-tup) 
identihcation. In theory, this step may miss very remote 
homologues. This risk, however, is limited, especially 
when using conservative thresholds and largely com¬ 
pensated by the improved efficiency: Blasf is 2-3 orders 
of magnitude faster than SW. Judging from citation 
index. Blast is one of the most widely used softwares 
in Biology. 

Aside from its speed. Blast’s most important 
characteristic is the statistical framework this algo¬ 
rithm uses to estimate the biological signihcance of 
its pairwise alignments. These statistics were devel¬ 
oped by Karlin and Altschul who worked out the 
numerical estimation of the E-Value for a given align¬ 
ment (Karlin and Altschul 1990). Given an alignment 
having a score of x, the E-Value is the number of 
alignments expected to yield by chance a score larger 
than or equal to jc given the size of the database. The 
idea of estimating the E-Value of an alignment was not 
new, but Karlin and Altschul were the first one to show 
that the score of un gapped sequence alignments fol¬ 
lows an extreme value distribution (EVD) and not 
a normal distribution, as previously assumed. This 
Ending led to the systematic use of the following 
formula for the estimation of an alignment E-Value: 

E — Kmne^^^ 


where E is the expected number of high scoring pairs 
(HSPs, ungapped alignments) expected to have a score 
higher than or equal to S, m being the length of the 
considered query sequence and n the database size. 
The value of X depends on the substitution matrix, 
while K is calibrated for the database. In all Blast 
packages, these values are precomputed for a set of 
predefined matrices. The lower its E-value, the more 
likely an alignment is to be biologically meaningful, 
and 10”^ is commonly accepted as a threshold for 
biological significance. It is worth pointing out that 
the Blast EVD formula has only been demonstrated 
for ungapped local alignments and that its use in the 
context of gapped alignment is only based on empirical 
observations. In any case, this formula is certainly the 
most widely used criterion to estimate the biological 
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relevance of a protein sequence alignment and one 
may argue that most automated annotation used today 
in databases relies on this formula. 

Low Complexity Regions and Their Effect on 
the Estimation of Biological Relevance 

An E-Value is merely a statistical quantity. Given the 
average frequency of individual amino acids, and 
given the measured similarity between a query 
sequence and a database, the E-Value gives an estimate 
of how likely one is to encounter an alignment with 
a score at least as good by pure chance. Biologists, 
however, should never forget that the E-value estima¬ 
tion relies on unrealistic premises (as most modeling 
procedures in bioinformatics). The first one is that in 
a database, all sequences contribute the same amount 
of information. This postulate is unlikely to be true, 
since a database is a complex compilation of very 
large protein families (like the kinases, or the immu¬ 
noglobulins and many other large protein families) 
sharing most of their information content. Other very 
small families of unique or quasi-unique proteins 
(i.e., one member at most per genome) contribute 
most of the variation. In theory, this bias works against 
large protein families whose overrepresentation 
results in a null model slightly skewed toward the 
most frequent situations. The only practical -though 
imperfect - way to correct for such bias is to run 
database searches against nonredundant databases. 
Another unrealistic assumption Blast relies on is 
the notion that all proteins should have an average 
amino-acid composition with no enrichment. This is 
not true and Blast is very sensitive to low complexity 
regions (LCRs). LCRs are protein regions where an 
amino acid is overrepresented. Many different kinds 
of such regions exist, including leucine zippers or 
glutamic rich regions. Different LCRs enriched for 
the same amino acid always result in seemingly 
highly signihcant alignment. However, these regions 
may not always be evolutionary or even functionally 
related and the high score they generate will result 
in wrongly assigned homologous relationships. This 
is why the standard Blast procedures involve 
caching the LCR from the sequences before doing the 
database search. This filtering of LCR regions is 
usually done using the seg algorithm(Altschul et al. 
1994). 


The main shortcoming of Blast Is its reliance on 
pairwise alignments to establish homology between 
sequences. As discussed above, it is hard to produce 
accurate pairwise alignments with sequences having 
less than 25% identity. One reason for this is that 
pairwise alignments lack an essential bit of informa¬ 
tion: knowledge of the local constraints imposed on 
each position by evolution. Indeed, it is well known 
that key features in a protein family tend to evolve 
much slower than nonfunctionally (or structurally) 
important features. Eor instance, active sites are highly 
conserved across very long evolutionary distances. 
Ideally, one would like to weight the contribution of 
every position in an alignment according to its 
expected level of conservation. Such a local weighting 
scheme can be applied, but one must first estimate it, 
a task usually carried out by assembling a multiple 
sequence alignment (MSA) of the considered family. 

High Accuracy Homology Modeling Through 
Multiple Sequence Alignments 

A multiple sequence alignment is a data structure 
where more than two sequences are simultaneously 
aligned. The goal of an MSA is to reveal conservation 
patterns across a large number of sequences and it is 
usually considered to be the method yielding the most 
biologically meaningful sequence alignments, espe¬ 
cially when no structural information is available. 
Most methods for MSA computation rely on a proce¬ 
dure known as progressive alignment (Hogeweg and 
Hesper 1984). The first step of the procedure is to 
estimate a guide tree (typically by comparing all the 
sequences two by two). The sequences are then incor¬ 
porated one after the other within the MSA. This pro¬ 
cedure is very fast but does not guaranty optimality, 
which means that the weakest patterns of conservation 
may be missed. This shortcoming is very hard to cir¬ 
cumvent and becomes increasingly problematic when 
the number of sequences grows. Most heuristic pack¬ 
ages for MSA have been designed as an attempt to 
address this problem and help generate more accurate 
MS As. The most widely used aligner is ClustalW 
(Thompson et al. 1994), a standard progressive align¬ 
ment method. While ClustalW is a reasonable choice 
for simple datasets, many independent studies have 
shown that ClustalW does not deliver the best align¬ 
ments, especially when considering distantly related 
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sequences (Thompson et ah). The current state of the 
art tends to distinguish between very fast and approx¬ 
imate aligners like Mafft (Katoh and Toh 2008) and 
slower consistency based aligners like T-Coffee 
(Notredame et al. 2000). 

These aligners are perfectly suited for dealing with 
protein sequences. Yet, one can produce even more 
accurate MSAs by incorporating extra information 
within the sequences, using a newer generation of 
aligners known as template-based aligners (Notredame 
2007). The most useful external information is 
3D-Structure. When enough structural information 
(i.e., at least two sequences) is available, one can use 
it to accurately align very distantly related protein 
sequences. Structural aligners can either be pairwise 
or multiple. DALI (Holm and Sander 1995) a structural 
aligner based on the comparison of internal distances 
between equivalent C-alpha is often considered to be 
one of the most accurate aligners. There are, however, 
many others such as sap (Taylor et al. 1994), for 
pairwise and multiple structure-based sequence align¬ 
ments. Structural aligners are ideal whenever experi¬ 
mental structures are available for all the considered 
sequences. When this is not the case, it is still be 
possible to use structural information thanks to 
methods like 3D-Coffee/Expresso (O’Sullivan et al. 
2004) and Prommals-3D (Pei et al. 2008). These 
methods are able to combine structural and sequence 
information within the same model. They also make it 
possible to combine the output of several alternative 
aligners into one unique model, thus simplifying the 
problem of selection of a specihc structure alignment 
method. It is indeed one of the trademarks of the 
T-Coffee multiple alignment package to be able to 
combine the output of any third party alignment 
method into a single MSA. When structural informa¬ 
tion is not available, the best alternative is homology 
extension. Homology extension relies on building 
a profile for each query sequence. Afterward, the 
progressive algorithm is used to multiply align the 
profiles. Homology extension is now available in 
several packages (T-Coffee, Prommals-3D) and is 
certainly one of the best ways to generate biologically 
meaningful alignments when considering distantly 
related sequences. Unfortunately, the equivalent of an 
E-Value does not exist for MSAs. The most common 
evaluation strategies are therefore empirical and 
involve making sure that key features are properly 
aligned. 


Profile-Based Sequence Alignments 

Accurate MSAs have many important applications in 
biology. These include phylogenetic tree reconstruc¬ 
tion, 3D modeling, and profile search. A profile is 
a mathematical formulation of an MSA where 
each position becomes a model (Gribskov et al. 
1987). This model is then used to quantify the cost 
for aligning any given amino acid to the considered 
position. Sequence-profile and profile-profile align¬ 
ments are often considered to yield the most accurate 
protein sequence alignments, and as such, they have 
become one of the most critical components of auto¬ 
mated annotation pipelines. These procedures rely on 
the existence of precomputed collections of profiles, 
like Pfam, SMART, or Prosite. Given a sequence, 
one can use packages like HMMer (Eddy 2009) to 
align the query against all available profiles and use 
Blast-like statistics to identify the most biologically 
significant hits. 

Other strategies exist to automatically build pro¬ 
files: either by turning an existing MSA into a profile 
with the HMMer package, or by using a special 
version of Blast named PSI-Blast that automatically 
turns the Blast output into an MSA and uses it for 
further database search. Profiles built from single 
sequences have been shown to be one of the best 
ways to identify remote structural relationships and 
to produce the most biologically relevant protein 
sequence alignments. Tools like HH-align (Soding 
2005) make it possible to do systematic profile- 
profile alignments. 

Exhaustive Resources on Protein Sequence 
Aligners 

The production of accurate sequence alignment is one 
of the most important problems in biology. This 
explains why so many efforts have been dedicated to 
this important topic. The algorithms presented here are 
only providing a rough overview of some of the avail¬ 
able alternatives. It is by no means exhaustive. The 
most up-to-date resource on sequence aligners is now 
Wikipedia where the sequence alignment entry reca¬ 
pitulates most of the aligners (multiple and pairwise) 
published over these last three decades: 

http://en.wikipedia.org/wiki/Sequence_alignment_ 
software (Table 1). 
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Alignment of Protein Sequences, Table 1 List of the aligners discussed in the text. The third column indicates their relative 
expected accuracy. Structure-based aligners are expected to be more accurate than sequence-based aligners. This table is by no means 
exhaustive 


Name 

Algorithm 

Seq. 

Struc. 

Profile 

Accuracy 

Web site 

Blast 

Pairwise 

X 



+ 

www.ncbi.nlra.gov 

SW 

Pairwise 

X 



++ 

www.ebi.ac.uk 

Hmmer 

Pairwise 

X 


X 

+++++ 

hinmer.janelia.org 

HH align 

Pairwise 



X 

++++++ 

toolkit.tuebingen.mpg.de.hhalign 

TM-align 

Pairwise 


X 


+++++++ 

zhanglab.ccrab.med.umich.edu/TM-align/ 

Sap 

Pairwise 


X 


+++++++ 

mathbio.nimr.mrc.ac.uk/wiki/SAP 

Dali 

Pairwise 


X 


+++++++ 

ekhidna.biocenter.helsinki.fi/dali server/ 

ClustalW 

Multiple 

X 



+++ 

www.clustal.org 

T-Coffee 

Multiple 

X 



++++ 

www.tcoffee.org 

Prommals-3D 

Multiple 

X 

X 

X 

+++++++++ 

prodata.swmed.edu/promals3d/promals3d.php 

3D-Coffee/Expresso 

Multiple 

X 

X 

X 

+++++++++ 

www.tcoffee.org 


Summary 

A biologically meaningful alignment is an alignment 
that can be used to infer (or discard) a homologous 
relationship between two (or more) protein sequences. 
Homology may also be inferred between a sequence 
and a group of pre-aligned sequences (profile) or 
between two profiles. In the context of protein 
sequence analysis, homology usually implies common 
ancestry and similar 3D fold. The biological signifi¬ 
cance of an alignment is estimated using statistical 
criterions, such as the probability of generating an 
alignment as good or better by chance. Many methods 
exist to produce protein sequence alignments, they 
include fast approximate database search with Blast, 
more accurate modeling using multiple sequence 
aligners and even more accurate models using aligners 
able to incorporate evolutionary and structural infor¬ 
mation. The most biologically meaningful sequence 
alignments are obtained doing structure-structures 
and profile-profile alignments. 

Cross-References 

► Linear Motifs in Protein-Protein Interactions 

► Protein Secondary Structure Prediction in 2012 

► Protein Structure Prediction 
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Definition 

Alkanols are saturated hydrocarbons with one or more 
hydroxyl groups. Lipid hi layers are membranes com¬ 
posed of two flat sheets made of phospholipids with 
a hydrophilic headgroup and two hydrophobic tails 
(Mouritsen 2005). Phospholipids are one of the main 
constituents of biological cell membranes. 

Introduction 

There has been a long-standing interest in the action of 
alkanols on membranes and lipid bilayers, which was 
in particular driven by their anesthetic potency and 
their permeation-enhancing effect in transdermal 
drug delivery. An alkanol chain-length-dependent 
effect was found for both processes, increasing for 
alkanols with up to 6-13 carbon atoms depending on 
the system or membrane and decreasing for longer 
carbon chains. In anesthesia, also a cutoff effect was 
reported for longer alkanols. Up to the cutoff, the 
anesthetic potency of alkanols is well described by 
the Meyer-Overton rule, according to which the anes¬ 
thetic effect correlates with its lipophilicity. This rela¬ 
tion was found to be valid for almost all general 
anesthetics. The interaction mainly of 1-alkanols with 
lipid bilayers has been addressed in numerous experi¬ 
mental, theoretical, and computational studies. 


Insertion of Alkanols into Lipid Bilayers 

In atomic-scale ► molecular dynamics simulations, 
alkanols spontaneously insert within tens of nanosec¬ 
onds (10”"^ s) into phospholipid bilayers and disperse 
on the timescale of 100 ns. They have a chain-length- 
dependent partition coefficient deflned as the ratio of 
the mole fraction of alkanols in the bilayer and the 
mole fraction of alkanols in the surrounding water of 
10 (ethanol) to 10^ (octanol) and above. The alkanoTs 
polar hydroxyl group anchors closely to the phospho¬ 
lipid headgroups by forming hydrogen bonds 
to the lipid glycerol backbone. The carbon tails 
stretch into the hydrophobic core of the bilayer 
(Griepernau et al. 2007) (Eig. 1). Alkanols both modify 
the structural as well as the dynamical properties of 
lipid bilayers. 

Effect of Alkanols on the Structure and 
Dynamics of Lipid Bilayers 

Structurally, partitioning of short-chain alkanols into 
phosphatidylcholine bilayers increases the lipid vol¬ 
ume, while longer alkanols increase the packing den¬ 
sity of lipids and thereby decrease the molar volume 
(Aagaard et al. 2006). The switching occurs approxi¬ 
mately at an alkanol length equal to half the acyl chain 
length. Similarly, the area per lipid is decreased for 
long-chain alkanols and increased for short-chain 
alkanols (Ly and Longo 2004; Griepernau et al. 
2007). For longer alkanols, the ordering of the lipid 
acyl chains is significantly enhanced, thus displaying 
a liquid-ordered-like phase (Fig. 2). Similarly, ► lipid 
diffusion was enhanced for lipid bilayer-ethanol sys¬ 
tems and decreased for alkanols with a chain length 
larger than eight as observed both in continuous 
bleaching experiments as well as in ► molecular 
dynamics simulations (Griepernau et al. 2007; Terama 
et al. 2008). 

► Microcalorimetry showed a stabilizing effect of 
short alkanols on the fluid phase of lipid bilayers; the 
main gel to liquid-crystal phase transition temperature 
(Tm) of DMPC was, e.g., reduced by 2.2 K for 10 vol% 
ethanol in the solvent (see Heimburg 2007 for an 
introduction into membrane thermodynamics). Oppo¬ 
sitely, long-chain alkanols increased Tn,, i.e., they 
stabilize the gel phase of membranes. The transition 
was found to be broadened for all alkanols 
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Alkanols - Effects on Lipid Bilayers, Fig. 1 Snapshots of 
alkanol-DMPC simulation systems after 100 ns with ethanol (a), 
octanol (b), decanol (c), and tetradecanol (d) (128 phospho¬ 
lipids, 72 1-alkanols). The 1-alkanol chains are highlighted in 
green, and their oxygens are represented by red spheres. Water is 
blue, the lipid chains are yellow, the phosphorus atoms are 
magenta, and nitrogen atoms are light blue (Adapted from 
Griepemau et al. 2007) 

(Westerman et al. 1988; Griepemau et al. 2007). Con¬ 
sequently, the compressibility of lipid bilayers is like¬ 
wise affected by the addition of alkanols: For DMPC 
bilayers, it is decreased for the long-chain tetradecanol 
and increased for ethanol resulting in an increased 
bending modulus of the alkanol-bilayer system for 
long-chain alkanols and a decreased bending modulus 
for short-chain alkanols (Griepemau et al. 2007). 
Micropipette aspiration experiments showed further 
a decrease in the bilayer thickness in the presence of 
short-chain alkanols (Ly and Longo 2004). The stmc- 
tural changes induced by alkanol also affect the ion 
permeability of lipid membranes: Their electrical resis¬ 
tance is reduced by orders of magnitude with respect to 
pure membranes for short-chain alkanols (C 2 to C 7 ), 
hexanol or heptanol are more effective than smaller 
alkanols. These membranes showed an increased 
permeability for cations, in particular for potassium, 
as compared to pure lipid bilayers. Large alkanols 


(Cg to Cio) yield similar results, however, requiring 
large molar aUcano 1 -lipid ratios of more than 10 
(Gutknecht and Tosteson 1970) to achieve comparable 
reductions in the resistance as compared to short-chain 
alkanols (e.g., <10~^ M for heptanol). 

The ► lateral pressure profile in the membrane - 
with tension peaks at the water-lipid interphase and in 
the region of the glycerol backbone and pressure max¬ 
ima resulting from the entropic repulsion within the 
hydrophobic core region - is stretched according to the 
observed alkanol-dependent changes in thickness. 
Additionally, the chain repulsion was found decreased 
upon addition of ethanol (Terama et al. 2008), while 
long-chain alkanols increased the pressure extrema 
within the bilayer core (Griepemau and Bdckmann 
2008). 

Effect of Alkanols on Membrane 
Protein/Lipid Bilayer Systems 

Alkanols may alter protein function not only by direct 
interactions with the protein but also indirectly by 
changing the characteristics of lipid bilayers. This 
was demonstrated for the simple channel gramicidin 
(gA) with a gA-based ► fluorescence assay (Ingolfsson 
and Andersen 2011): In the absence of an alkanol 
binding pocket of gA, addition of short-chain alkanols 
(up to nonanol) to vesicles composed of DC 20 :iPC 
lipids and gA increased the ratio between conducting 
and nonconducting gramicidin channels, while longer 
alkanols decreased this ratio, i.e., displaying a cutoff at 
approximately half the acyl chain length. While this 
effect is possibly due to a bilayer-thickness-dependent 
dimer formation of nonconducting gA monomers, 
alkanol-induced changes in the lateral pressure profiles 
of membranes were suggested to shift the conforma¬ 
tional equilibrium K between two protein channel con¬ 
formations with different cross-sectional areas AA 
according to 

+h 

AW = J Ap{z)AA{z)dz 

—h 

with K = Ko exp(—zlVF/ kpT) and the change Ap in the 
lateral pressure prohle upon addition of alkanols (Can¬ 
tor 1997). Thus, apart from direct alkanol-protein 
interactions, the biological action of alkanols, e.g., in 
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Alkanols - Effects on Lipid Bilayers, Fig. 2 Sketch of struc- membrane, while large alkanols induce a tight lipid packing and 
tural changes of a lipid bilayer upon addition of short-chain and may lead to thinning of the bilayer by interdigitation (Adapted 
long-chain 1-alkanols. Small alkanols increase the fluidity of the from Griepemau et al. 2007) 


anesthesia, is likely also due to alkanol-induced 
changes in the structure and lateral pressure of mem¬ 
branes. The cutoff observed in anesthesia is reflected in 
the lipid-chain-length-dependent switching of the 
effect of alkanols of different lengths on the membrane 
structure. Additionally, simulations of alkanol-lipid 
bilayer systems at large external pressure showed that 
the equilibrium between two different (modeled) pro¬ 
tein conformations is shifted in the opposite direction 
as compared to the effect of alkanols on lipid bilayers 
at normal pressure (Griepemau and Bbckmann 2008). 
This effect is consistent with the so-called pressure- 
reversal phenomenon in anesthesia, i.e., the reversal of 
anesthesia by large hydrostatic pressure, lending thus 
support to so-called lipid-mediated modes of 
anesthesia. 

Cross-References 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 

► Fluorescence Recovery After Photobleaching 

► Lipid Bilayer Lateral Pressure Profile 
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Synonyms 

Differential splicing; Splicing 

Definition 

Alternative splicing describes the process that allows 
the generation of different forms of mature mRNA 
(messenger RNA) from the primary gene transcript. 

Introduction 

Why Genes in Pieces? 

In 1978, Walter Gilbert (1978) contradicted the funda¬ 
mental dogma of one gene, one polypeptide chain by 


proposing the idea that differential usage of exons from 
the same gene could lead to functionally different 
proteins. 

Splicing is a regulatory mechanism by which the 
transcribed exons of a gene are combined into mature 
mRNA transcripts. If the splicing process can generate 
transcripts with more than one combination of exons 
this is known as alternative splicing. The resulting 
combination of these alternative splicing events is the 
source of high variability at the posttranscriptional 
level and these different mRNAs products can be 
translated into different protein isoforms, allowing 
the generation of multiple proteins from a single 
gene. An example of the diversity that alternative 
splicing can achieve is the Drosophila Dscam gene, 
which has the potential to encode 38, 016 distinct 
spliced variants, nearly three times the total number 
of genes in Drosophila (Black 2000). Large-scale stud¬ 
ies have started to reveal the importance of alternative 
splicing and confirm these hypotheses. 

The Splicing Mechanism 

The genes of higher eukaryotes are composed of exons 
and introns. The process of splicing removes introns from 
the pre-mRNA, retaining only the exons in the mature 
transcript. Splicing occurs through the mechanism of 
the spliceosome, a large molecular machine made 
out of several small nuclear RNAs (snRNAs) and more 
than 150 proteins. The snRNAs are bound to multiple 
proteins forming small nuclear ribonucleoprotein 
particles (snRNPs). Five snRNPs (called Ul, U2, U4, 
U5, and U6) are in charge of the splicing process and 
account for the main catalytic activity of the 
spliceosome. 

The recognition of an intron in the pre-mRNA 
sequence involves three splicing signals: the donor 
splice site at the 5' intron end, the acceptor splice site 
at the 3' intron end, and the branch point located about 
40 nucleotides upstream of the acceptor site. The donor 
site has an extended intronic consensus sequence 
GTRAGT (R stands for A or G) in mammals. The 
terminal acceptor dinucleotide is usually AG and 
is preceded by a stretch of pyrimidines, which is 
called the polypyrimidine tract. The branch point 
adenosine has the consensus sequence YTRA (Y stands 
for C or T). 

During splicing of an intron, U1 snRNP begins the 
formation of a complex at the splice donor site by 
recognizing specihc base pairings between the 
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snRNA and the mRNA. Later, U2AF binds to the 
polypyrimidine tract and acceptor site and U2 snRNP 
binds to the branch site recruiting portions of the 
spliceosome complex to the splice acceptor site. U4, 
U5, and U6 bridge the gap between these two sites, 
causing the intron to fold, mRNA is cleaved at the 
donor site, and the 5' intron end is ligated to the branch 
point adenosine forming a lariat. This brings the exonic 
regions close, allowing for ligation where the splice 
acceptor is cut and the intron is released. 





Alternative Exon Events 

Alternative splice events (also known as alternative 
exon events AEEs) can be classified by three important 
mechanisms: alternative promoters, alternative 

polyadenylation, and alternative splicing by exon cas¬ 
sette selection. Alternative promoter selection occurs 
when two alternative promoters are utilized, resulting 
in two different mRNAs (Eig. 1). Alternative 
polyadenylation can happen when there is more than 
one possible site for adding the poly(A) tail (Fig. 1). 
This mechanism is used for antibody production 
enabling the recognition of foreign molecules 
with different c-terminals. For these two alternative 
splicing mechanisms, the choice of the promoter or 
tail site, are based on cell-type-specific transcription 
factors. 

Alternative splicing by exon cassette selection is the 
most common mechanism for expressing different pro¬ 
teins from one complex transcription unit. This mech¬ 
anism requires a choice of different splicing sites and 
leads to the inclusion or exclusion of whole exons in 
the final product (Sammeth et al. 2008) as shown in 
Fig. 1 . Alternative processing pathways may be regu¬ 
lated in a cell-type-specific manner or in response to 
different developmental or environmental signals. 
However, the mechanisms involved in the recognition 
of the different possible splice sites are still yet not 
fully understood and it still cannot be fully ruled out 
that the recognition process in splicing may be less 
than 100% watertight in many cases. 

rranj-splicing is a rare mechanism that splices 
together segments from two different primary tran¬ 
scripts. It occurs more often in trypanosomes, nema¬ 
todes, and chloroplasts of plant cells. Same gene trans- 
splicing has also been demonstrated to occur in 
Drosophila. 


d 



Alternative Splicing, Fig. 1 (a) Inclusion or exclusion of one 
or more exons (exon skipping), (b) alternative donor site, 
(c) alternative acceptor site, (d) complex splicing patterns, 
(e) intron retention, (f) mutual exclusive exons 

Regulation of Alternative Splicing 

Splice sites and branch points in higher organisms are 
rather degenerate with many sequences resembling 
real splice sites. Additionally, splicing depends on 
tissue or cell type, developmental stage, or external 
stimulus. These facts suggest that additional signals 
may be involved in the regulation and splicing pro¬ 
cesses. Motifs located in exons and introns, also called 
cA-acting regulatory sites, contribute to the probability 
of promoting (enhancers) and inhibiting (silencers) 
splice junction selection; these are called ESE (exonic 
splicing enhancer), ESS (exonic splicing silencer), ISE 
(intronic splicing enhancer), and ISS (intronic splicing 
silencer. These cN-acting RNA sequence elements 
have corresponding trans-acting RNA-binding pro¬ 
teins and form a splicing code. Most of the enhancer 
motifs are bound by members of the SR protein family 
that are rich in serine and arginine and also contain 
RNA recognition sites. Silencers vary in sequence and 
are usually bound by heterogeneous nuclear ribonu- 
cleoproteins (hnRNPs). Some motifs can behave as an 
enhancer or as a silencer depending on their distance to 
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the splice site. Constitutive and alternative splicing 
is also regulated by additional splicing signals and 
splicing factors that allow pseudo splice sites to be 
distinguished. 

In order to understand how alternative splicing is 
regulated, extensive knowledge about splicing motifs 
and factors is needed. Sometimes precise regulation 
mechanisms are achieved by a combination of differ¬ 
ent splicing factors and proofreading mechanisms con¬ 
trolled in a tissue or stimulus-specific manner (Barash 
et al. 2010). The existence of specific regulatory 
programs that control multiple AS events have been 
characterized, for example, in Drosophila sex determi¬ 
nation (McIntyre et al. 2006) and mammalian synaptic 
transmission (Ule et al. 2005). Different concentration 
of splicing factors depending on the environment and 
this can influence the splicing of many transcripts 
sharing the same targets. Many splicing factors regu¬ 
late themselves by alternatively splicing their own 
mRNA. Phosphorylation status of these splicing fac¬ 
tors changes their subcellular localization so they can 
no longer affect splicing. 

Promoter architecture and transcriptional speed also 
affects the regulation of alternative splicing by kinetic 
effects and differential recruitment of splicing factors. 
The secondary structure of mRNAs also plays a role in 
splicing. 

Chromatin structure apparently also has a role in 
the decision of whether an exon will be alternatively 
or constitutively spliced. In human studies, it has 
been found that there is a correlation between nucle- 
osome occupancy and inclusion levels (Tilgner et al. 
2009). Constitutive exons are frequently included in 
the nucleosome, alternative exons are rarely 
included and introns have the lower occupancy 
rates. Exons are bounded by histones enriched in 
certain modifications. Nucleosome positioning adds 
another level of complexity in eukaryotic splicing 
regulation. 

Genome-Wide Detection Methods 

Alternatively spliced transcripts can be found by doing 
expression sequence tag (EST), DNA microarray- 
based analyses, deep sequencing, and CLIP. 

Available sampled ESTs (currently about eight mil¬ 
lion for human) allows the detection of alternative 
splice events in a genome-wide manner. ESTs and 
mRNAs are aligned to the genome sequence. Aligned 
parts generally correspond to exons as the ESTs 


represent parts of spliced transcripts and the gapped 
regions indicate the position of introns. Because of the 
ambiguity of the exon-exon junction alignments, spe¬ 
cial computer programs are used. Alternative splice 
events are detected by searching exons present in 
some ESTs and excluded in others. Intron retention 
and other splice events can be identified from these 
alignments. ESTs are also used to predict tissue- 
specific and cancer-specific splice variants. The draw¬ 
backs of this technology are the high cost, genomic 
contamination, cloning bias, paralog confusion, and 
low sensitivity for low abundance transcripts. 

In exon array analysis, arrays of DNA fragments 
representing individual exons or certain splice junc¬ 
tions are used (Sugnet et al. 2006). The array is then 
probed with labeled cDNA from tissues of interest. 
The probe cDNAs bind to DNA from the exons that 
are included in mRNAs from the selected tissue, or to 
DNA from the boundary where two exons have been 
joined. The intensity of the different probes is used to 
determine which splice events are present and ratios 
are measured to reveal the presence of particular alter¬ 
natively spliced mRNAs. 

Microarray-based studies found 74% of human 
genes to be alternatively spliced. Microarrays can be 
used to detect and analyze tissue-specific splice events, 
study the regulation of alternative splicing, and to find 
alternative splicing and human disease. Limitations of 
this technology are limited probe coverage, cross¬ 
hybridization artifacts, and complex data analysis 
requirements. 

RNA-Seq deep sequencing technologies are also 
being used to perform genome-wide studies of tran¬ 
script variation (He et al. 2009). Alternative splicing is 
found by mapping RNA-seq reads to exon junctions. 
The reliability of a predicted splice junction is 
determined by the number of reads mapping to the 
junction, the number of mismatches on each mapped 
read, mapping distances to the junction, and mis¬ 
matches positions on the junction. While it is 
a promising technique, RNA-seq data analysis still 
remains a big challenge. 

CLIP (Cross-linking and immunoprecipitation) 
links proteins to RNA molecules in a tissue during 
the splicing using UV radiation. Using specific anti¬ 
bodies, the trans-acting splicing regulatory protein is 
then precipitated. Later, the RNA attached to that 
protein is isolated and cloned revealing the target 
sequences for that protein (Ule et al. 2006). 
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Implications of Alternative Splicing 

The protein isoforms produced by alternative splicing 
can differ in various properties as domain composition, 
ligand-binding affinity, signaling, subcellular localiza¬ 
tion, and protein half-life. The insertion or deletion of 
domains, transmembrane helices, or signal peptides 
can lead to modifications to the protein function and 
localization. Transmembrane protein isoforms that 
lose exons encoding for the transmembrane helices 
can generate soluble isoforms. There is evidence of 
a slight bias toward the generation of protein isoforms 
that will lose complete functional units such as protein 
domains. Alternative splicing regulation has been 
noted to affect different biological processes as syn¬ 
apse development, axon guidance, and T-cell activa¬ 
tion, but given that alternative splicing has been shown 
for 94% of multiple exon human genes, it seems likely 
that alternative splicing will turn up in all processes. 

In order to avoid the production of abnormal pro¬ 
teins by improperly processed mRNA, cells have 
evolved mechanisms for surveillance to degrade 
some transcripts. Nonsense-mediated decay (NMD) 
causes the degradation of mRNA in which there is 
a premature stop codon. NMD detects those premature 
stop codons that have been introduced due to the alter¬ 
ation of the open reading frame of the mRNA at the 3' 
end, often as a result of skipping exons during splicing. 

Errors in splicing are the cause of a number of 
human diseases and many have been very well char¬ 
acterized, for example. Cystic Fibrosis (OMIM: 
219700) or Hutchinson-Gilford Progeria Syndrome 
{OMIM 176670). Changes that happen in the splicing 
machinery can lead to splicing errors in multiple tran¬ 
scripts. Single nucleotide alterations can also occur in 
splice sites or cw-acting splicing regulatory sites 
resulting in splicing errors in the mature mRNAs. It 
has been estimated than half of known human disease- 
causing mutations involve the splicing mechanism 
(Lopez-Bigas et al. 2005). Splicing mutations are also 
the most frequent cause of hereditary diseases and have 
been found to contribute to cancer development 
(Pajares et al. 2007). Proteins encoded by a spliced 
variant that are associated to different diseases could 
permit drug design and become therapeutic targets. 

Evolution of Alternative Splicing 

Alternative splicing increases transcriptomic and pro- 
teomic diversity in higher eukaryotes. Alternative 
splicing is present in species throughout the 


phylogenetic tree. It appears to be more abundant in 
higher eukaryotes than in lower eukaryotes, with more 
genes involved in vertebrates. Plants exhibit low levels 
of AS. There are also a few cases of splicing in bacteria 
and archaea. 

Recent studies on the comparison of different spe¬ 
cies have contributed to a better understanding of how 
intron exon architecture is generated and recognized. 

Alternative splicing architecture varies in eukary¬ 
otes. Intron and exon lengths show splicing recognition 
constraints with long introns and short exons in verte¬ 
brates and short introns and long exons in eukaryotes. 

Alternatively spliced exons can be generated from 
three different evolutionary mechanisms: exon shuf¬ 
fling, exonization of intronic sequences, and transition 
(Keren et al. 2010). In the exon shuffling process, an 
exon is inserted into a gene or duplicated in the same 
gene. Exonization happens when a section of genomic 
sequence becomes an exon, it is generated de novo. 
The transition mechanism is when a constitutive exon 
transforms into an alternative exon. Out of the newly 
generated exons, it can be imagined that only a few 
will eventually acquire function and their inclusion 
level in alternative transcripts may increase. 

Two rival hypotheses have been proposed to 
explain the biological origins of introns, intron early 
and intron late. The introns early hypothesis assumes 
that a mechanism that allowed splicing out regions 
containing stop codons from the primary message 
arose to allow proteins of greater length and greater 
functional diversity. The original short reading frames 
would be today’s exons, while the introns would be the 
splice junctions originally designed to remove delete¬ 
rious stop signals. Introns late hypothesis assumes that 
genes come from short reading frames that evolve by 
duplication and fusion mechanisms. Introns would be 
the result of insertions of foreign DNA into these 
genes. Several observations suggest that eukaryote 
alternative splicing emerged early in evolution: ances¬ 
tral splicing signals are degenerate; complex 
spliceosomes are present in lower eukaryotes; NMD 
pathways exist in animals, fungi, and plants; and sim¬ 
ilarities in alternative splicing patterns are found for 
different eukaryote genomes in genome-wide studies. 

Bioinformatics and Alternative Splicing 

Bioinformatics has a high impact in the field of alterna¬ 
tive splicing with the recent growth of large amounts of 
data generated by EST, cDNA, and genome-sequencing 
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projects. Genome-wide studies on alternative splicing 
are now possible and have estimated that 35-94% of 
human genes are alternatively spliced, with the more 
recent studies showing higher percentages of alternative 
splicing This vast new catalog of alternative splicing has 
been analyzed to characterize the functional impact of 
these splice variants through a variety of approaches 
(Blencowe and Graveley 2008). 

Many of the available bioinformatic methods are 
related with the detection of alternative splicing hy 
matching up EST sequences with mRNA and genomic 
sequences. Alternative splicing prediction accuracy 
depends on the types of alternative transcript. Alterna¬ 
tive donor, acceptor, and exon skipping sites are easier 
to detect and interpret, whereas intron retention is 
generally the most difficult one to predict due to exper¬ 
imental artifacts. Fast sequence alignment tools are 
used to map all mRNA and EST sequences against 
the genome. Recognition algorithms try to distinguish 
between transcript fragments belonging to real 
alternative transcripts from the different artifacts 
such as EST coverage limitation bias, genomic cover¬ 
age assembly errors, genomic contamination, EST 
orientation errors, sequencing errors, or EST 
fragmentation. 

The genomic sequence also provides indispensable 
information for detecting and validating splicing 
events at the transcript level since much of the signal 
is intronic. The genomic sequence also allows the 
resolution of EST clustering and paralog 
distinguishing problems which may cause incorrect 
predictions. Non-EST-based methods can also reliably 
predict exon skipping events using different features 
like conservation, flanking intronic regions, exon and 
intron lengths, and nucleotide composition to predict 
exons. 

Another role of bioinformatics in alternative splic¬ 
ing is the constmction of alternative gene models by 
integrating data from splicing and exons to build plau¬ 
sible isoforms. 

Different bioinformatic algorithms related with 
alternative splicing characterization are being devel¬ 
oped around large-scale genomic projects like 
ENCODE (Anon 2004). These algorithms are being 
used for automatic annotation and prediction of the 
spliceosome. GENCODE (Harrow et al. 2006) is 
a subproject of ENCODE, which seeks to identify all 
protein-coding genes in the ENCODE selected 
regions; manually prepared high-quality annotation 


for transcripts from this project are used as 
a feedback for the further improvement and validation 
of these algorithms. An example of these annotation 
tools is APPRIS (Tress et al. 2008a), which separates 
constitutive and alternative variants combining protein 
structural information, functionally important resi¬ 
dues, and evidence of non-neutral evolution exons 
are used to make automatic annotations of principal 
functional isoforms. 

Recent studies (Brosch et al. 2010; Ezkurdia et al. 
2012; Tress et al. 2008b) are using proteomic data from 
tandem mass spectrometry (MS/MS) experiments for 
the indentification quantification of those peptides that 
uniquely distinguish an alternative isoform to know to 
what extend these proteins are expressed. 

Cross-References 

► Alternative Splicing Regulation; Structural and 
Biophysical Studies 

► Core Splicing Machinery: Structural and 
Biophysical Studies 
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Synonyms 

Alternative pre-mRNA splicing; Splicing 

Definition 

Alternative splicing (AS) is the process hy which mul¬ 
tiple messenger RNAs are synthesized from a single 
gene and thereby can produce different proteins with 
distinct biological functions. 

Alternative Splicing 

Around 1980 it was discovered that protein variants of 
different size can be derived from the same gene. 
Later, it was shown that these alternative protein 


products result from differential inclusion of informa¬ 
tion from a given gene during pre-mRNA splicing. 
This process, called alternative splicing, entails, that 
pre-mRNA transcripts containing multiple exons can 
be spliced into different mRNAs and thus encode 
different proteins with distinct functions. A major con¬ 
sequence of alternative splicing is that the number of 
proteins in metazoan organisms greatly exceeds the 
number of genes. Alternative splicing allows differen¬ 
tial regulation of gene expression depending on tissue, 
developmental stage, gender, or external stimuli 
(Nilsen and Graveley 2010). From genome-wide stud¬ 
ies and computational analyses, it is thought that 
>95% of human multi-exon genes undergo alternative 
splicing. Approximately 70% of alternative splicing 
events affect coding regions leading to amino acid 
changes or truncated protein products. Moreover, it is 
estimated that altered splice site selection is involved 
in 15-50% of human genetic diseases. 

Pre-mRNA splicing requires the recognition of con¬ 
served splice site RNA sequences located at the 5' and 
3' ends of the non-coding introns. The regulation of 
alternative splicing depends additionally, on cA-acting 
RNA sequence motifs alternative splicing as well as on 
fran^-acting protein factors. CA-acting RNA elements 
are found as exonic splicing enhancers (ESE) or 
silencers (ESS) and as intronic splicing enhancers 
(ISE) or silencers (ISS) (Blencowe 2006) (Fig. la). 
These cA-acting RNA sequences can be recognized 
by different proteins, and the combination of several 
interactions as well as the strength of splice sites (i.e., 
binding affinities to the RNA) determines the effi¬ 
ciency of splicing. Exon inclusion usually involves 
the recognition of ESEs by serine-/arginine-rich (SR) 
proteins that activate 5' or 3' splice sites, while exon 
skipping is often associated with the recognition of 
an ESS or ISS by heterogeneous nuclear ribonucleo- 
proteins (hnRNPs) (Fig. la). Importantly, the com¬ 
plexity of alternative splicing regulation is further 
increased by alternative splicing factors that mediate 
protein-RNA and protein-protein interactions with the 
spliceosome. 

Alternative Splicing of the FAS Gene 

About one third of alternative splicing events involves 
the skipping of exons (Blencowe 2006). An important 
and well-studied example is alternative splicing of the 
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Alternative Splicing Regulation: Structural and Biophysi¬ 
cal Studies, Fig. 1 Biophysical characterization of alterna¬ 
tive splicing regulation of the FAS gene (a) Schematic 
representation of alternative splicing of exon 6 (magenta) 
of the FAS gene leads to two different protein isoforms. 


Trans-acting proteins splicing enhancers (SR proteins, 
green) or inhibitors (hnRNP proteins, red) and additional 
alternative splicing factors regulate FAS splicing, (b) The 
crystal structure of SPF45-UHM/SF3bl55-ULM5 (PDB 
code 2PEF1). The extent of NMR chemical shift 
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FAS gene. FAS encodes a trans-membrane protein that 
mediates apoptosis (programmed cell death) upon 
binding to the soluble FAS ligand protein. It was 
found that the activity of FAS is regulated hy alterna¬ 
tive splicing, in particular by skipping of exon 6. Exon 
6 encodes the trans-memhrane region of the FAS 
receptor. Therefore, exon 6 skipping produces soluble 
FAS receptor protein that sequesters the FAS ligand. 
This results in anti-apoptotic signaling and thus oppo¬ 
site functional activity as a result of alternative splicing 
(Fig. la). 

Various factors are implicated in the regulation of 
FAS alternative splicing with different outcomes and 
utilizing distinct mechanisms. The oncogenic RNA- 
hinding protein 5 (RBM5), the splicing factor 45 kDa 
(SPF45), and the poly-pyrimidine-tract binding (PTB) 
protein, all promote exon 6 skipping, yet by distinct 
mechanisms. For example, the OCRE (octamer repeat) 
domain of RBM5 was found to be essential for its 
alternative splicing activity, presumably, via protein- 
protein interactions that OCRE mediates with 
spliceosomal tri-snRNPs proteins. These interactions 
are consistent with RBM5 acting at a late step of 
spliceosome assembly by inhibiting complex 
B formation (Bonnal et al. 2008). Overexpression of 
SPF45 also promotes FAS exon 6 skipping by modu¬ 
lating protein-protein interactions. The so-called 
UHM (U2AF Homology Motif) domain of SPF45 
was shown to recognize linear peptide motifs (UHM 
ligand motifs, ULMs) in constitutive splicing factors 
such as U2 auxiliar factor 65 (U2AF65), splicing factor 
1 (SFl), and splicing factor 3B 155 kDa-subunit 
(SF3bl55). SPF45 UHM-ULM interactions are 
required for the alternative splicing activity of 
SPF45, presumably by competing with UHM-ULM 
interactions of the constitutive splicing factors (Corsini 
et al. 2007). Regulation of FAS splicing by PTB 
involves its binding to an ESS RNA element located 
in exon 6, which interferes with the exon definition 


process and results in skipping of exon 6 (Izquierdo 
et al. 2005). 

In contrast, the TIA-1 protein promotes the inser¬ 
tion of exon 6 by binding to a conserved uridine-rich 
ISE located downstream of the 5' splice site of intron 6. 
The cooperative binding of TIA-1 and UlsnRNP facil¬ 
itates the recognition of the 5' splice site. Thereby, 
TIA-1 enhances the recognition of exon 6 and subse¬ 
quently induces its inclusion (Izquierdo et al. 2005). 

Structural Biology and Biophysical Studies 
for Understanding Mechanisms of AS 

Recent large-scale studies aim at providing a genome¬ 
wide description of alternative splicing events and 
correlate these data with development of diseases 
(Blencowe 2006; Nilsen and Graveley 2010). These 
studies are important toward providing a systems biol¬ 
ogy view of the combinatorial control of alternative 
splicing. However, elucidating the underlying molec¬ 
ular mechanisms requires biophysical and structural 
studies in combination with in vitro and in vivo splic¬ 
ing assays. Crystallography is the most commonly 
used technique to provide structural details of biolog¬ 
ical macromolecules and requires obtaining well- 
diffracting crystals of these systems. Many splicing 
factors are multi-domain proteins, where individual 
structural modules are connected by flexible linkers 
and mediate weak molecular interactions. Crystalliza¬ 
tion of these proteins is challenging or may induce 
non-native domain arrangements governed by crystal 
packing effects. It is therefore important to study these 
complexes using solution techniques such as Nuclear 
Magnetic Resonance (NMR) or Small Angle Scatter¬ 
ing (Madl et al. 2010; Mackereth et al. 2011). NMR is 
especially well suited as it can be used to determine 
three-dimensional structures of proteins but also pro¬ 
vides efficient tools for characterizing internal motion 


a 


< - 

Alternative Splicing Regulation: Structural and Biophysi¬ 
cal Studies, Fig. 1 (continued) perturbations (CSP) monitored 
in NMR titration (see (e)) is indicated by a color gradient from 
white to blue (c) Solution NMR structure of PTB/CUCUCU 
(PDB code 2ADC). Looping out of an exon by RNA binding 
as indicated on the right may lead to skipping of this exon, 
(d) Competitive interactions of the SPF45 and U2AF65 
UFIMs to ULM motifs in constitutive splicing factors contribute 


to FAS alternative splicing, (e) Characterization of protein- 
protein interactions using NMR titrations. A similar set of 
amide signals is affected by the addition of the different 
ULMs, demonstrating a common binding site (see in b). 
Flowever, differential chemical shift perturbations involving 
the U2AF65 ULM ligand (insert on the right, green vs red/hlue 
NMR signals) suggest that structural details of the recognition 
are distinct 
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and mapping molecular interactions in solution 
(Cavanagh et al. 2007; Madl et al. 2010). 

Molecular details of alternative splicing of the FAS 
gene have been provided by high-resolution structures 
using X-ray crystallography and NMR-spectroscopy. 
The crystal structure of the SPF45 UHM domain 
bound to a ULM of SF3bll5 revealed atomic details 
for the recognition of a conserved tryptophan in the 
SF3bl55 ULM peptide by the SPF45 UHM domain 
(Fig. lb) (Corsini et al. 2007). The detailed structural 
information allowed the design of UHM mutations to 
probe the specificity of interactions with different 
ULM motifs. Splicing assays comparing wild type 
and mutant proteins provided a rationale for how com¬ 
petition of UHM-ULM interactions with constitutive 
and alternative splicing factors may regulate FAS alter¬ 
native splicing (Fig. Id). 

An example for the role of protein-RNA interac¬ 
tions in splicing regulation is given by PTB. The solu¬ 
tion structure of the third and fourth RRM (RNA 
recognition motif) domains bound to short RNA 
sequences was determined using NMR spectroscopy 
(Fig. Ic) (Oberstrass et al. 2005). The structure reveals 
how an intramolecular protein-protein interaction of 
the two RRM domains orients their RNA binding 
surfaces in a specific way. This may induce “looping” 
of a bound pre-mRNA harboring two CUCUCU RNA 
elements and may exclude an exon from recognition 
by constitutive splicing factor and thus afford exon 
skipping. 

Structural biology is combined with biophysical 
techniques for determining binding affinities and pro¬ 
viding additional complementary information. Isother¬ 
mal Titration Calorimetry (ITC) is used to probe 
interactions in the nanomolar to the micromolar 
range (Velazquez-Campoy et al. 2004). From thermo¬ 
dynamic parameters, such as binding enthalpy and 
entropy, the dissociation constants (K^) and stoichiom¬ 
etry can be obtained. Surface Plasmon Resonance 
(SPR) is an alternative method that can provide kinetic 
rate constants (on and off-rates) for the binding event 
in addition to Kj values. Notably, ITC and SPR have 
different sample requirements. While ITC needs large 
sample amounts, but is a solution technique, very little 
sample is needed for SPR measurements, but possible 
interference due to the surface immobilization has to 
be considered. The binding affinities of different 
ULMs to the SPF45-UHM were determined by ITC. 
It was found that all ULMs are bound with quite 


similar low micromolar binding affinities suggesting 
that a subtle balance of competing UHM-ULM inter¬ 
actions are required for alternative splicing regulation 
(Corsini et al. 2007) (Fig. Id). 

NMR spectroscopy is a versatile and highly infor¬ 
mative technique for studying protein-ligand interac¬ 
tions and macromolecular complexes in solution 
(Mackereth et al. 2005; Cavanagh et al. 2007; Madl 
et al. 2010). Typically, NMR fingerprint spectra are 
recorded using *^N-isotope-labeled protein, which 
correlate each amide proton with the corresponding 
amide nitrogen in a two-dimensional spectrum. The 
NMR resonance frequencies are very sensitive to the 
electronic environment. Perturbation of the local 
chemical environment of the amide, for example by 
ligand binding, leads to shifts in the NMR frequencies. 
Analysis of such fingerprint spectra provides informa¬ 
tion about the binding affinity and allows mapping the 
binding site by plotting chemical shift perturbations 
(CSPs) onto a three-dimensional structure of the pro¬ 
tein (Fig. lb). The titration of SPF45 with different 
ULM ligands revealed that all ULMs use the same 
binding pocket, but that the recognition of U2AF65 is 
structurally somewhat distinct from the binding to SFl 
and SF3bl55 (Corsini et al. 2007) (Fig. le). Similar 
NMR experiments can be used to map domain-domain 
interfaces in multi-domain proteins and protein com¬ 
plexes. More elaborate NMR approaches can provide 
important information about the quaternary structure 
of these complexes in solution, and thus a molecular 
basis for understanding the regulation of alternative 
splicing (Mackereth et al. 2005; Madl et al. 2010). 

In summary, biophysical techniques and structural 
biology play a crucial role for elucidation molecular 
mechanisms of alternative splicing regulation. Solu¬ 
tion techniques such as NMR are especially important 
for understanding the interplay between structure and 
conformational dynamics in these processes. 
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Synonyms 

aaRS; ARS; EFl Alpha; EF-IA 

Definition 

Aminoacyl-tRNA synthetases (aaRSs) are a group of 
ubiquitously expressed enzymes that catalyze the 
esterification of amino acids to the 3'-terminus of 
tRNAs (Ibba et al. 2005). The resulting aa-tRNAs 
are delivered by elongation factors (EF-Tu in bacte¬ 
ria, EF-IA in archaea and eukaryotes) to the ribo¬ 
some, where accurate codon-anticodon pairing 
between the mRNA and tRNA defines the genetic 
code. Discovered in the 1960s, aaRSs have since 
served as extraordinary models for the studies of 
enzymology, evolution, protein-RNA interactions, 
and translational mechanisms. AaRSs are excellent 
targets for drug discovery, and recent studies show 
that aaRS mutations are associated with multiple 
neurological disorders in humans. 

Basic Characteristics 

Class I and Class II aaRSs. AaRSs catalyze the attach¬ 
ment of amino acids to correct tRNAs in a two-step 
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Aminoacyl-tRNA Synthetases, Fig. 1 Aminoacylation reaction. AaRSs use amino acids, ATP, and tRNAs as substrates to form 
aa-tRNAs in a two-step reaction 


Aminoacyl-tRNA Synthetases, Table 1 Commonly used techniques to study aminoacyl-tRNA synthetases 


Technique 

Purpose 

Description 

In vitro 
transcription 

To prepare large quantities of tRNA 
transcripts 

The reaction contains purified T7 RNA polymerase, DNA template, and 
nucleotide triphosphates; tRNA products are purified with polyacrylamide 
gels or chromatography 

Active site 
titration 

To determine the active fractions of 
aaRSs 

The aaRS is incubated with radiolabeled amino acid and cold ATP to form 
the aaRS:aa-AMP complex, which is bound to nitrocellulose filters and 
quantified with a scintillation counter 

Pyrophosphate 

exchange 

To measure the rates of amino acid 
activation 

The aaRS is mixed with cold amino acid, cold ATP, and [32P] 
pyrophosphate; the formation of [32P] ATP correlates with the rate of amino 
acid activation 

Filter binding 

To measure dissociation constants 
between aaRSs and tRNAs 

The tRNA is first labeled with [32P] ATP; nitrocellulose filters are then used 
to bind the protein:tRNA complex 

Aminoacylation 

To measure formation of aa-tRNAs 

The aaRS is incubated with cold tRNA, ATP, and radiolabeled amino acid to 
form aa-tRNA, which is precipitated with 5% TCA on 3MM Whatman paper 
and scintillation counted. Alternatively, 32P-labeled tRNA can be used with 
cold amino acids, and the aa-tRNA level is qualified using phosphorimaging 

Stopped flow 

To study pre-steady state kinetics 

Fluorescence change is monitored during the reaction process to determine 
the rates of individual steps 

Quench flow 

To study pre-steady state kinetics 

Complements stopped flow assays. The reaction is rapidly quenched, which 
allows rate determination of fast reactions 

X-ray 

crystallography 

To solve the structures of aaRSs 

Particularly useful to understand the mechanisms of aaRS reactions 


reaction (Fig. 1): first, the amino acid is activated with 
ATP to form aa-AMP, which is measurable with active 
site titration and pyrophosphate exchange assays 
(Table 1); next, the activated amino acid is transferred 
from aa-AMP to the 3'-terminus of the tRNA, which is 
routinely determined with aminoacylation assays (Ibba 
and Soil 2000). Based on the active site architecture, 
aaRSs can be divided into two mutually exclusive 
classes (Table 2). The active sites of Class I aaRSs 


are composed of a Rossman dinucleotide-binding fold 
with conserved HIGH and KMSKS motifs; the active 
sites of Class II aaRSs lack the Rossman fold, and 
instead contain an antiparallel (3 fold (Fig. 2). A Class I 
aaRS approaches the tRNA from the minor groove 
side and attaches the amino acid to the 2'-OH of the 
terminal adenosine, while a Class II enzyme 
approaches the tRNA from the major groove and 
attaches the amino acid to the 3'-OH. Despite the 
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Aminoacyl-tRNA Synthetases, Table 2 Two classes of aminoacyl-tRNA synthetases 


Members 

Catalytic site 

tRNA recognition 

Class I 

ArgRS“, CysRS, LeuRS, IleRS, ValRS, MetRS, LysRS C, 
GlnRS, GluRS, TrpRS, TyrRS 

Rossmann dinucleotide¬ 
binding fold 

Minor groove, aa attached 
to 2'-OH 

Class II 

GlyRS, HisRS, ProRS, ThrRS, SerRS, AsnRS, AspRS, LysRS 
II, AlaRS, PheRS, SepRS, PylRS 

Antiparallel (5 sheets 

Major groove, aa attached 
to 3'-OH“ 


^Aminoacyl-tRNA synthetases are denoted by the three-letter amino acid designation, for example, ArgRS refers to arginyl-tRNA 
synthetase 

'’LysRS is the only aaRS identified in both classes 

“An exception is PheRS, which charges the amino acid to the 2'-OH 


Aminoacyl-tRNA 

Synthetases, 

Fig. 2 Representative 
structures of Class I and 
Class II aaRSs complexed 
with tRNA. Escherichia coli 
CysRS (PDB: lUOB) and 
ThrRS (PDB: iQF6) are 
shown in green, and tRNAs 
are shown in orange 


CysRS (Class I) 


ThrRS (Class II) 




structural differences, the two classes of aaRSs 
have evolved convergently to catalyze the same 
aminoacylation reaction that is central to all living 
organisms. 

Specificity and proofreading by aaRSs. Maintaining 
specihcity during aminoacylation is critical for accu¬ 
rate protein synthesis. Cells contain more than 20 
species of amino acids and tRNAs, with each aaRS 
responsible for matching the correct pair of amino acid 
and tRNA. While specific selection of the tRNA by an 
aaRS is achieved through multiple identity elements 
and kinetic proofreading, it is more challenging for an 
aaRS to choose the correct amino acid due to its sim¬ 
plicity. Approximately, half of the aaRSs use an 
editing step to hydrolyze misactivated aa-AMP (pre¬ 
transfer editing) or misacylated tRNA (post-transfer 
editing) (Ling et al. 2009; Mascarenhas et al. 2008). 
For instance, ThrRS misactivates Ser and attaches it to 
tRNA^*''^, but the resulting Ser-tRNA^*''^ is hydrolyzed 
in an editing domain appended to ThrRS. In addition to 


cA-editing domains, free-standing proteins have also 
been shown to edit misacylated tRNAs in trans. The 
editing activity of aaRSs is conserved among the three 
domains of life, and editing dehciencies have been 
shown to cause growth defects in bacteria, mitochon¬ 
drial dysfunction in yeast, and neurodegeneration in 
mammals. 

Indirect pathways of aa-tRNA synthesis. Cognate 
aaRSs have been identified for each of the 22 natural 
amino acids except Sec, yet not all types of cells 
contain the complete set of aaRSs, and several amino 
acids are modified while they are bound to the tRNA 
(Sheppard et al. 2008). For instance, certain 
methanogenic archaea lack CysRS, and most prokary¬ 
otes lack both GlnRS and AsnRS. In these organisms, 
indirect pathways are employed to synthesize relevant 
aa-tRNAs: Cys-tRNA*^^® is transformed by Sep-tRNA: 
Cys-tRNA synthase from Sep-tRNA'^^^ Gln-tRNA®'" 
and Asn-tRNA^™ are converted by amidotransferases 
from Glu-tRNA*^*" and Asp-tRNA^*", which are 
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synthesized by non-discriminating GluRS and AspRS, 
respectively. Two pathways exist for the synthesis of 
Sec-tRNA®®‘^ from Ser-tRNA®®‘^ (formed by SerRS): in 
bacteria, selenocysteine synthase directly converts 
Ser-tRNA®''‘^ to Sec-tRNA®“; in archaea and eukary¬ 
otes, 0-phosphoseryl-tRNA kinase phosphorylates 
Ser-tRNA®®‘^ to form Sep-tRNA®®‘^, which is then 
converted to Sec-tRNA®®‘^ by Sep-tRNA:Sec-tRNA 
synthase. 

AaRS complexes. Most aaRSs are free-standing 
enzymes, but several aaRSs are present in protein 
complexes (Hausmann and Ibba 2008). The most 
extensively studied aaRS complex to date is the mam¬ 
malian multi-synthetase complex (MSC), which is 
composed of nine aaRSs and three scaffold proteins. 
The formation of MSC in the cytoplasm has been 
suggested to sequester aa-tRNAs in the protein synthe¬ 
sis machinery. AaRS complexes have also been iden¬ 
tified in archaea and bacteria. In Methanothermohacter 
thermautotrophicus, LysRS, LeuRS, ProRS, and 
EFIA form a complex to enhance the aminoacylation 
efficiency of participating aaRSs. Non-discriminating 
AspRS and GluRS also interact with amidotransferases 
and tRNAs to form transamidosomes, either to 
enhance the catalytic efficiency or to improve tRNA 
selectivity. In bacteria, ProRS associates with a trans 
editing factor YbaK to increase its specificity for Cys- 
tRNA^™. 

AaRS engineering for unnatural amino acid incor¬ 
poration. The natural protein synthesis machinery uti¬ 
lizes only 22 amino acids, yet aaRSs have been 
engineered to co-translationally incorporate a number 
of unnatural amino acids (Liu and Schultz 2010). To 
date, over 70 unnatural amino acids have been genet¬ 
ically incorporated into proteins using Escherichia 
coli, yeast, or mammalian systems. In such work, an 
orthogonal aaRS:tRNA pair is evolved to specifically 
accept the unnatural amino acid. For instance, 
Methanocaldococcus janna.schii TyrRS has been 
engineered to attach a variety of unnatural amino 
acids to an amber suppresser tRNA^^^ The engineered 
TyrRS does not recognize any natural amino acid or 
tRNA in the host cell, nor is the suppressor tRNA^^' 
recognized by any endogenous aaRS. This ensures that 
only the desired unnatural amino acid is incorporated 
at the specific protein site. Genetically incorporated 
unnatural amino acids have been successfully used 
for fluorescent labeling, post-translational modifica¬ 
tions, and expanding the genetic code of life. AaRS 


engineering is expected to play a significant role in 
future studies of synthetic biology. 

AaRSs and human diseases. AaRSs are housekeep¬ 
ing enzymes, and defects in aaRSs could result in 
cellular dysfunction. In humans, mutations in several 
aaRSs have been strongly associated with human neu¬ 
rological disorders (Antonellis and Green 2008); for 
example, GlyRS and TyrRS mutations have been 
shown to cause Charcot-Marie-Tooth disease, and 
mitochondrial AspRS mutations are associated with 
leukoencephalopathy. Surprisingly, not all disease- 
causing aaRS mutations lead to aminoacylation 
defects, obscuring the mechanism of pathogenesis. It 
is also unclear why aaRS mutations are dispropor- 
tionally associated with neuropathies. Due to the 
essential role of aaRSs and the divergence between 
microbial and human enzymes, aaRSs have been 
targeted for the development of antibacterial and anti¬ 
fungal drugs (Ibba and Soil 2000; Rock et al. 2007). 
Successfully developed aaRS inhibitors include 
mupirocin (antibacterial) and AN2690 (antifungal). 
It is expected that aaRSs will continue to serve as 
promising antimicrobial targets for future drug 
development. 

AaRSs beyond translation. In addition to the essen¬ 
tial role in aminoacylation, many aaRSs contain sec¬ 
ondary functions beyond the scope of protein synthesis 
(Antonellis and Green 2008; Guo et al. 2010). For 
instance, Neurospora crossa mitochondrial TyrRS 
and Saccharomyces cerevisiae LeuRS are involved in 
Group I intron splicing. Mammalian aaRSs are 
extremely rich in appended domains and extra func¬ 
tions: the EMAPII domain of TyrRS is associated with 
angiogenesis; the WHEP domain inhibits the 
angiostatic activity of TrpRS; EPRS (with both 
glutamyl- and prolyl-tRNA synthetase activities) is 
linked to the inflammatory response; and LysRS is 
responsible for the packaging of human immunodefi¬ 
ciency virus (HIV). It is expected that more secondary 
functions of aaRSs will be revealed with our deepening 
understanding of this essential group of enzymes, 
which might provide insights into the pathogenic 
mechanisms of aaRS mutations. 
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Synonyms 

Bioanalytical sensors; Biodetectors; Biodevices; Bio¬ 
sensors; Electrochemical sensors; Enzyme-electrodes 

Definition 

Amperometric biosensors are self-contained 
integrated devices based on the measurement of the 
current resulting from the oxidation or reduction of an 
electroactive biological element providing specific 
quantitative analytical information. 


Introduction 

Determining analyte concentrations is essential 
for a wide range of applications, including 
high-throughput drug screening, disease diagnosis 
and treatment, quality control, and environmental 
monitoring. The research in the field of biosensors, 
amperometric biosensors in particular, was initiated 
by Clark whose study on oxygen electrode was 
published in 1956 (Clark 1956). Since then electro¬ 
chemical biosensors have been widely used to 
determine the analyte concentrations, both in research 
and commercial applications. 

A biosensor comprises of two closely associated 
elements: a biological recognition element and 
a signal conversion unit, the physicochemical trans¬ 
ducer, which eventually produce a readout. The 
biological recognition element determines the degree 
of selectivity or specificity of the biosensor whereas 
the sensitivity of the biosensor is greatly influenced by 
the transducer. Many different biological elements 
including enzymes, antibodies, antigens, receptors, 
organelles, microorganisms, and DNA/RNA can act 
as the recognition element in a biosensor; however, 
due to space limitations only redox enzymes will be 
discussed. 

Biosensors can be classified according to the 
parameter that is measured by the transducer of the 
biological event, these include optical, electrochemi¬ 
cal, acoustic, piezoelectrical, and thermal. Most 
biosensors use electrochemical detection for the trans¬ 
ducer because of the low cost, ease of use, portability, 
and simplicity of construction (Eggins 2002). Electro¬ 
chemistry is a surface technique and offers certain 
advantages for detection in biosensors. It does not 
depend strongly on the reaction volume, and very 
small sample volumes can be used for measurement. 
Moreover, electrochemical detection is not affected by 
sample components such as chromophores, 
fluorophores, and particles that often interfere with 
spectrophotometric detection. The reaction being 
monitored electrochemically results in a measurable 
current (amperometry), a measurable potential 
(potentiometry), or changes in the conductive proper¬ 
ties of the medium (conductometry). 

Among electrochemical biosensors, the most suc¬ 
cessful that have led to commercial devices are the 
ones based on the monitoring of electron transfer, 
therefore belonging to the amperometric category. 
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Amperometric Biosensors, 
Fig. 1 A typical 
chronoamperometry 
experiment showing the 
current-time responses of an 
immobilized enzyme upon 
successive additions of its 
substrate. Inset: The 
corresponding calibration 
curve 



The signal of these biosensors is generated by the 
electron exchange between the biological component 
and the electrode. Generally speaking, amperometric 
biosensors function by the production of a current 
when a potential is applied between two electrodes 
and the analyte undergoes, or is involved in, a redox 
reaction that can be followed by measuring the current 
in an electrochemical cell. The analyte being measured 
or the biomolecule involved with it changes its oxida¬ 
tion state at the electrode. The electron-transfer signal 
is then measured and is proportional to the amount of 
the redox-active species at the electrode. 

In amperometry, changes in current generated by 
the electrochemical oxidation or reduction are 
monitored directly with time while a constant potential 
is maintained at the working electrode with respect to 
a reference electrode (Bartlett 2008). The technique is 
implemented by stepping the potential directly to the 
desired value and then measuring the current, or hold¬ 
ing the potential at the desired value and flowing 
samples across the electrode as in flow injection 
analysis. In all cases, the resulting current is 
proportional to the concentration of the electroactive 
species in the sample. 

The typical electrochemical technique performed 
when working with amperometric biosensors is 


chronoamperometry, in which the current is measured 
as a function of time, when an appropriate constant 
potential is applied. Figure 1 shows a standard curve 
obtained when using an amperometric biosensor, in 
which the current varies upon the addition of 
a particular compound (substrate of a redox enzyme) 
to render a particular product that is electro- 
transformed at the electrode. The current change 
measured is proportional to the amount of 
oxidized/reduced species, which in turn is directly 
proportional to the analyte concentration, depending 
on the assay format. 

Amperometric biosensors have additional selec¬ 
tivity in that the oxidation or reduction potentials 
used for detection are characteristic of the biolog¬ 
ical component or its analyte. Advantageously, the 
fixed potential during amperometric detection 
results in a negligible charging current (the current 
needed to apply the potential to the system), which 
minimizes the background signal. In addition, 
hydrodynamic amperometric techniques can pro¬ 
vide significantly enhanced mass transport to the 
electrode surface, for example, when using rotating 
disk electrodes, or in flow-through conditions 
where the sample solution passes over the station¬ 
ary electrodes. 
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Fig. 2 Schematic 
representation of the three 
different generations of 
amperometric biosensors 
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Three Main Generations of Amperometric 
Biosensors 

Amperometric biosensors can be divided into three 
main categories according to the different mechanism 
of the electronic transport (Fig. 2): 

1. The first generation biosensors are based on 
measurement of the concentration of substrates or 
products of the enzymatic reaction. The decrease in 
the concentration of the substrate or the increase in 
the concentration of the product can be measured by 
amperometry if they are electroactive. In most 
cases, these biosensors function by measuring the 
oxygen reduction or generation of hydrogen 
peroxide and therefore oxidases and dehydroge¬ 
nases are the two main classes of enzymes used in 
this type of biosensors. 

Though it would be desirable to electrochemi- 
cally detect changes in the enzyme substrate (the 
analyte) or in one reaction product, unfortunately, 
this is not always possible. For example, most 


analytes of clinical interest are not natural sub¬ 
strates of a redox enzyme. A variety of strategies 
have been developed to overcome this problem, 
depending on the analyte’s chemical properties. 
The latter strategies are based on the fact that it is 
feasible to transform a non-redox reaction into 
a redox one, through use of coupled enzymatic 
reactions (coupling one reaction where the analyte 
participates to another one, producing a species 
detectable by amperometry). Examples of this 
type of amperometric biosensors include creatine, 
urea, lactate, and pyruvate determination (Belluzo 
et al. 2008). 

2. In the second generation biosensors, a mediator is 
added to the setup which shuttles electrons between 
the redox center of the enzyme and the working 
electrode. Most of the enzymes used in biosensor 
technology have their redox center embedded 
within the core of the protein, so that the active 
site is electrically insulated. One way of tackling 
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this problem is by using redox mediators, which 
shuttle the electrons between the enzyme’s active 
site and the electrode surface. The most well-known 
and widely used mediator is ferrocene and its 
derivatives. The ExacTECH^M biosensor from 
Abbott-Medisense takes advantage of the latter 
strategy, using a ferrocene derivative as mediator 
to monitor glucose blood level (Cass et al. 1984; 
Hill and Sanghera 1990). Mediated electrochemis¬ 
try of some peroxidases has also been reported 
(Sadeghi et al. 1997). Other biosensors using 
soluble mediators to detect infection-marker anti¬ 
bodies, acetylcholine, biological oxygen demand, 
as well as simple molecules such as glutathione, 
L-alanine, pyruvate, lactate, and cholesterol have 
also been reported (reviewed in Chaubey and 
Malhotra 2002). 

3. Third generation biosensors are based on direct 
electron transfer between the enzyme and the elec¬ 
trode and hence bypassing the need for external 
mediators. However, as mentioned earlier, it is 
usually difficult for redox proteins in solution to 
transfer electrons directly to bare electrodes. There 
are several possible reasons for this. Unfavorable 
orientation of protein molecules on electrode sur¬ 
face may not only hinder the electron exchange 
between the electrode and the electroactive center 
of proteins but also increase the distance between 
the redox center and electrodes and thus decrease 
the rate of direct electron transfer. Nevertheless, in 
recent years, direct electron transfer has been 
achieved for many redox proteins and enzymes by 
different immobilization techniques (described in 
following section), surface modification of elec¬ 
trodes, and also protein engineering in order to 
achieve the most favorable orientation of the 
enzyme and also to make electrode surfaces more 
compatible with redox enzymes. 

Electrochemical Setup 

Electrochemical sensors consist of either two or three 
electrodes. A typical three-electrode electrochemical 
cell consists of a working electrode of a conductive 
material, such as platinum, gold, or carbon; a reference 
electrode, usually consisting of silver metal coated 
with a layer of silver chloride (Ag/AgCl) or saturated 
calomel electrode (SCE) based on the reaction 
between mercury/Hg 2 Cl 2 ; and a platinum wire counter 


electrode. The reference electrode is usually further 
removed from the site of the redox reaction in order 
to maintain a stable reference potential. A 
two-electrode system on the other hand only includes 
the working and reference electrodes. If the current 
density is low enough (<pA cm” ), then the reference 
electrode can carry the charge with no adverse effect 
(Ronkainen et al. 2010). Both three- and two-electrode 
systems are used for sensors; however, two electrodes 
are generally preferred due to lower fabrication costs. 
Furthermore, these electrodes can be miniaturized eas¬ 
ily down to dimensions in the order of not only micro¬ 
meters but also nanometers (Wightman and Wipf 
1989). Reducing electrode dimensions not only 
reduces the sample volumes required but also leads to 
higher sensitivity and a more cost-effective fabrication 
processes (Fig. 3). It also allows the integration of 
sensor arrays with a large number of detection sites 
and diverse functionalities, enabling parallel 
biorecognition capability (Yang et al. 2009). The 
resulting mini-biosensor would combine the advan¬ 
tages of miniaturization with improved measurement 
accuracy due to the reduction of external noise sources. 
However, there are challenges in the miniaturization 
and integration of biosensors and their instrumentation 
electronics that still need to be addressed. 

Biological Component of Biosensor: 
Selectivity/Specificity 

The biological component is responsible for the selec¬ 
tivity/specificity of the biosensor and theoretically any 
biological or biologically derived biomolecule can be 
used. As mentioned earlier, different biological com¬ 
ponents including enzymes, antibodies, antigens, 
receptors, organelles, microorganisms, and DNA/RNA 
can be used to achieve the desired selectivity, 
according to the nature of the analyte. Although 
many types of biological elements can be used in 
biosensing devices, electrochemical biosensors pri¬ 
marily use enzymes due to their remarkable specificity 
for their natural substrates. 

Transducer Component of Biosensor: Sensitivity 

Enhancing the sensitivity of any analytical method 
means increasing the signal-to-noise ratio. Two 
obvious routes can be followed: boosting the signal 
or reducing the noise. In amperometric biosensors, an 
enhancement of the signal-to-noise ratio can be 
achieved by increasing the number of electroactive 
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Fig. 3 A microfluidic cell 
integrating a gold working 
electrode and a combined 
screen-printed carbon counter 
and Ag/AgCl reference 
electrodes, (a) Cross section of 
the cell with inlet and outlet 
microchannels, (b) Tapping 
mode atomic force 
microscopy of an immobilized 
enzyme on the gold working 
electrode: 2D image recorded 
in water (left), 3D view of the 
single proteins (center), and 
cross-section profile of single 
protein (right), (c) Overall 
depiction of the amperometric 
detection system 
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detectable analyte-related species and/or by lowering 
the interferences that reach the electrode, thus giving 
rise to background current noise. 

Different ways of achieving the goal of increasing 
the number of detectable species are possible based on 
magnifying the electron-transfer process, recycling the 
active enzyme, increasing the number of active 
enzyme molecules by specific orientations (described 
below), or use of nanomaterials/particles. Usually, in 
the case of enzymes, the substrate should be present in 
excess concentration to keep regenerating the active 
site of the redox enzyme. 

The obvious method to raise the amount of the 
electrochemically detectable species is increasing the 
number of active proteins so as to make the analyte 
capturing step more efficient. Self-assembled mono- 
layers (SAMs) have proved to be useful in biosensor 
design giving rise to the most favorable orientation of 
the proteins on the electrode surfaces for electron 
transfer. 

Another more recent approach involves the use 
of nanomaterials which with their high 
surface-to-volume ratio offer massive number of sites 
that can be chemically functionalized, which in the 
case of amperometric biosensors is translated into 
a surplus of analyte-interacting sites, thus leading to 
sensitivity enhancement. Depending on the nanostruc¬ 
ture composition, several strategies have been 
proposed to modify the nanomaterial surface. For 
example, gold nanoparticles and nanotubes can be 
modified with alkylthiols to which the enzymes of 


interest can be attached. Another nanomaterial, carbon 
nanotubes (CNTs), has also been studied for incorpo¬ 
ration into electrochemical biosensors; however, their 
use was initially hindered due to their insolubility. To 
overcome the latter problem, several strategies have 
been successfully applied such as dispersion in differ¬ 
ent solvents or polyelectrolytes or incorporation in 
composite matrices making them more amenable for 
applications in electrochemical biosensors (Rivas 
et al. 2007). The vast number of recently published 
work involving nanomaterials in electrochemically 
based biosensors demonstrates not only that this is 
one of the most productive fields of research but also 
in the long term may lead to more commercially 
successful devices (Pratibha et al. 2008). 

Methods for Immobilizing Enzymes on 
Electrode Surfaces 

According to the Marcus theory of protein-mediated 
electron transfer (Marcus and Sutin 1985), the rate 
constant of direct electrochemical interaction between 
donor and acceptor is determined by driving force 
between the redox center of the enzyme and electrode, 
reorganizational energy at electron transfer, and 
mostly by the distance between enzyme active site 
and electrode surface. Furthermore, the properties of 
the electrode material and the microenvironment 
around the enzyme may also affect electron transfer. 
In many enzymes, the protein matrix surrounding the 
redox-active center of the enzyme isolates the latter 
center from electric contact with the electrode surface. 











A 


66 


Amperometric Biosensors 


Amperometric Biosensors, 
Fig. 4 Different strategies 
used for immobilizing proteins 
on electrode surfaces 
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Therefore, in order to have an intimate contact between 
the enzyme and the electrode surface, without any 
adverse effects on the structure/function of the protein, 
different immobilization techniques have been 
constructed (Fig. 4). Immobilization methods are con¬ 
sidered successful if the biosensors prepared are stable 
and maintain the selectivity of the enzyme. Although 
in some cases immobilization may alter the conforma¬ 
tion of the enzyme, thereby reducing its activity, many 
methods have been successful. Some immobilization 
methods can even improve enzyme stability by 
minimizing enzyme unfolding. Generally speaking, 
for biosensor applications, the enzyme should have 
a high Vmax (broader range where the signal is 
proportional to substrate concentration for reliable 
quantitation of the analyte) and low Km values 
(maximal catalytic efficiency at low substrate concen¬ 
trations) when immobilized on the electrode. 

The simplest approach to protein immobilization is 
adsorption. Proteins can adsorb spontaneously on 
many electrode surfaces such as metal and silica 
oxides, and carbon electrodes. Although this technique 
is relatively straightforward, it is the least stable of the 
common immobilization methods leading to the for¬ 
mation of randomly oriented layers. The forces linking 
the protein to the electrode are usually weak van der 
Waals forces and hydrogen bonds together with elec¬ 
trostatic, dipole-dipole, or hydrophobic interactions. 
The formation of intermolecular interactions with the 
surface may compete with similar interactions stabi¬ 
lizing the enzyme and in some cases resulting in the 
denaturation of the protein. 

In the second approach, the immediate environment 
around the immobilized enzyme can be carefully 
designed to enhance the enzyme activity. This can be 
achieved by physically entrapping the enzyme in 


a polyelectrolyte or conducting polymer (Emr and 
Yacynych 1995). This immobilization strategy has 
been shown to be successful for membrane-bound 
enzymes such as human cytochromes P450 (Sadeghi 
et al. 2011). In physical immobilization methods, 
the native composition of the enzyme is usually pre¬ 
served due to the absence of any covalent bonds. 

In the third physical immobilization strategy, the 
protein of interest can be included in a layer of SAM. 
SAM formation provides an easy way to functionalize 
electrode surfaces by forming a highly ordered 
unimolecular film. Flexibility to design different head 
groups of monolayers by large numbers of 
electroactive or inactive functional groups makes this 
functionalization strategy especially useful for biosen¬ 
sors applications (Chaki and Vijayamohanan 2002). 
For example, the SAM can be formed using thiol- 
terminated chains immobilized on the surface of 
a noble metal. The protein interacts with the SAM 
layer in a nonspecific way. 

On the other hand, in chemical immobilization 
techniques, functional groups on the enzyme such as 
NH 2 , COOH, OH, and SH (that are not necessary for 
the catalytic activity of the enzyme) are covalently 
linked to the electrode surface. It is obvious that this 
covalent bonding leads to the most stable protein- 
electrode immobilization technique used in ampero¬ 
metric biosensors. If specific and unique amino acid 
groups on the surface of the protein are used for the 
covalent linkage, this will lead to oriented immobili¬ 
zation of the protein. In this case, the protein can be 
immobilized either directly or via a SAM on the 
electrode. This type of oriented immobilization has 
been made possible by the rapid advances in molecular 
biology and especially protein engineering (Ferrero 
et al. 2008; Mak et al. 2010; Sadeghi et al. 2010). 
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Summary 

Amperometric biosensors constitute an interdisciplin¬ 
ary field that is currently one of the most active areas of 
research. Although there is a lot of literature regarding 
studies where these biosensors have been successfully 
devised for measuring specific compounds of interest, 
nevertheless to this day, the translation from lab to 
industry has been very sluggish. The bottleneck in the 
realization of the latter step seems to be mainly due to 
the problems of the stability and activity of the 
enzymes after immobilization on the electrode surface, 
which is of course the fundamental step in the devel¬ 
opment of electrochemical biosensors. Researchers 
continue in their efforts to overcome the dependence 
of enzyme activity/stability on the solution conditions 
such as temperature, pH, ionic strength, and buffer 
composition using protein engineering. 

Finally, advances in the research fields of 
biosensors - protein engineering, nanotechnology, 
and nanomaterials - are expected to greatly help in 
addressing current and future challenges in the devel¬ 
opment of new technologies for amperometric biosen¬ 
sors in clinical diagnostics, high-throughput drug 
screening, quality control, environmental monitoring, 
and security control. 
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► Potentiometric Biosensors 

► Protein Design for Biosensors 
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Definition 

Amyloid fibrils are stable and highly organized fila¬ 
mentous aggregates rich in P-sheet secondary struc¬ 
ture. They typically comprise many copies of a single 
protein and are nanometers in diameter and microns in 
length. 

Introduction 

Proteins are encoded as a linear polymer of amino 
acids, with different proteins having the amino acids 
arrayed in a different linear sequence. The majority of 
proteins condense into well-ordered and compact 
structures with intricate but reproducible topologies 
that are encoded by this sequence of amino acids. 
Different proteins have different folds, hut the same 
sequence of amino acids will typically lead to adoption 
of the same fold. If this process of folding goes wrong, 
however, the result is often the formation of aggre¬ 
gates, one species of which is the highly ordered 
amyloid fibril. 

Amyloid fibrils are filamentous and self-assembling 
protein aggregates. They typically comprise many 
copies of a single protein that has folded or misfolded 
and then aggregated into very a stable and organized 
structure. They represent a configuration that can he 
adopted by all proteins regardless of their amino acid 
composition, a ubiquitous protein fold that is the result 
of interactions between groups in the backbone of the 
polymeric polypeptide chain. The formation of amy¬ 
loid fibrils is implicated in a number of debilitating 
human diseases (Uversky and Fink 2006, 2007; 
Murphy and Tsai 2006; Ovadi and Orosz 2009; 
Ramirez-Alvarado et al. 2010). 

The literature on amyloid and amyloid-like fibrils is 
large and ever-increasing. Where possible, the cited 
references refer to multidisciplinary review articles 
and chapters within books; other chapters within 
these volumes will provide the interested reader with 
a thorough overview of the subject area. 

History 

The term “amyloid” was coined in the mid-nineteenth 
century by Virchow to describe a pathologic structure 
in hrain tissue with an abnormal macroscopic 


appearance that stained blue on addition of iodine, 
indicative of the presence of starch (for a review of 
the history of amyloid, see Sipe and Cohen 2000). 
Subsequent analysis identified the principal compo¬ 
nent of in vivo amyloid deposits to be protein. This 
protein component was found to hind the dye Congo 
red and exhibit “apple-green” birefringence under 
cross-polarized light, and this criterion is still used to 
define the presence of amyloid in vivo (Westermark 
et al. 1999). Electron microscopy studies showed amy¬ 
loid plaques to contain filamentous material, compris¬ 
ing fibrils approximately 6-13 nm in diameter and 
microns in length. It is these fibrils that are the subject 
of intensive study to the present day, and the focus of 
this review. 

Gentle extraction of the fibrillar component of amy¬ 
loid deposits enabled the identification of the protein 
species involved in a variety of clinical syndromes, and 
showed that normal plasma proteins make up some 
forms of pathological amyloid. Since the 1960s, nearly 
30 proteins have been identified as forming amyloid 
deposits associated with both systemic and localized 
diseases (Table 1). In 1971 Glenner and others dem¬ 
onstrated the in vitro formation of fibrils by a purified 
and enzyme-digested immune system serum protein 
(Bence-Jones protein) that displayed identical micro¬ 
scopic and tinctorial properties to fibrils isolated from 
amyloid plaques (Glenner et al. 1971). Glenner also 
demonstrated the formation of fibrils sharing the char¬ 
acteristics of pathologic deposits by two proteins not 
known to be involved in disease (insulin and glucagon) 
and also noted the formation of similar structures by 
the synthetic peptide poly-L-lysine (Glenner et al. 
1974). These advances led to an explosion in 
papers describing the formation of amyloid-like fibrils 
in the test tube, culminating in 1998 in the hypothesis 
that the self-assembly of peptides and proteins into 
amyloid-like fibrils is a fundamental property of the 
polypeptide chain (for a review see Dobson 1999; 
Dobson 2006). 

Nonetheless, there remain important differences 
between the structures formed in vivo as components 
of the amyloidoses (Table 1) and the fibrillar protein 
aggregates studied in vitro, although the terminology 
found in the literature is often confusing. The term 
“amyloid” refers to the pathologic deposits found 
in vivo, and the term “amyloid fibrils” to the protein¬ 
aceous filaments found within these deposits. Filamen¬ 
tous protein aggregates formed in the test tube are 
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Amyloid Formation, Table 1 Human diseases associated 
with the aggregation of protein into amyloid or amyloid-like 
fibrils 


Disease 

Protein 

AL (light chain) amyloidosis 

Monoclonal 
immunoglobulin light 
chain 

AH (heavy chain) amyloidosis 

Immunoglobulin heavy 
chain 

AA amyloidosis 

Serum amyloid A 
protein 

Hemodialysis-related amyloidosis 

P 2 -microglobulin 

Senile systemic amyloidosis/familial 
amyloidotic polyneuropathy 
(hereditary disorder) 

Transthyretin 

Lysozyme amyloidosis 

Lysozyme 

ApoA-I amyloidosis 

ApoA-J 

ApoA-II amyloidosis 

ApoA-II 

ApoA-IV amyloidosis 

ApoA-IV 

Fibrinogen amyloidosis 

Fibrinogen 

Finnish hereditary amyloidosis 

Gelsolin 

Hereditary cerebral hemorrhage with 
amyloidosis 

Cystatin C 

Familial British dementia 

ABri (BRI-PP) 

Familial Danish dementia 

ADan (BRI-PP) 

Sporadic TSE (e.g., Creutzfeld-Jacob 
Disease (CJD)) 

Prion protein 

Hereditary TSEs (fatal familial 
insomnia, Gerstmann-Straussler- 
Scheinker disease, hereditary CJD) 

Prion protein 

Alzheimer’s disease 

Ap protein precursor 

Tauopathies (~20 distinct 
degenerative disorders) 

Tau 

Type-IJ (adult-onset) diabetes 

Islet amyloid 
polypeptide (lAPP) 

Lattice comeal dystrophy 

Kerato-epithelin 

Cardiac atrial amyloidosis 

Atrial natriuretic factor 
(ANF) 

Pituitary prolactinoma 

Prolactin 

Iatrogenic amyloidosis 

Insulin 

Aortic medial amyloidosis 

Lactadherin (Medin) 

Comeal lactoferrin amyloidosis 

Lactoferrin 

Senile seminal vesicle amyloidosis 

Semenogelin I 

Medullary thyroid carcinoma 

Procalcitonin 

Odontogenic (Pindborg) tumors 

Odontogenic 

ameloblast-associated 

protein 

Hypotrichosis simplex of the scalp 

Comeodesmosin 

Amyotrophic lateral sclerosis (ALS)"* 

Superoxide dismutase 

Cutaneous/lichen amyloidosis 

Cytokeratin'’ 


“Unconfirmed as an amyloid disorder 
'’Unconfirmed 


correctly called “amyloid-like fibrils.” Amyloid 
plaques are intra- or extracellular deposits comprising 
protein fibrils made up of many copies of a single 
protein or fragment of a protein, but they also contain 
additional protein cofactors such as Serum Amyloid 
P component (SAP) which binds to and appears to 
stabilize fibrils, apolipoprotein E, inflammatory mole¬ 
cules such as cytokines, the chaperone clusterin, 
abnormal concentrations of some metal ions, and the 
antioxidants SOD-1 and SOD-2. It has also been 
suggested that proteoglycans play an intimate role in 
the formation of the fibrils and/or the structure of the 
plaques (see Snow and Wight 1989). 

The Formation of Fibrils 

As mentioned in the Introduction, most proteins adopt 
a compact and globular conformation with a well- 
defined arrangement of the polypeptide backbone. 
With only a few rare exceptions, the structure formed 
corresponds to the most energetically stable state under 
physiological conditions. Protein folding into the 
correct three-dimensional arrangement involves an 
initial nucleation of structure by a small number of 
amino acids, often distant in sequence and often 
hydrophobic. The formation of these key contacts 
involves a random search with selection on the basis 
of the most stable, lowest energy interactions. Forma¬ 
tion of these key contacts by a small number of amino 
acids then drives the adoption of structure hy the rest of 
the chain through a random search and further mini¬ 
mization of energy. Even large and complex proteins 
appear to fold in this way, although large proteins 
appear to he made up of smaller domains that fold 
independently, after which the global structure clicks 
into place. 

A key exception to this rule of compact globular 
structures and cooperative folding pathways is the 
family of proteins classed as “intrinsically unstruc¬ 
tured” or “natively unfolded” (Csizmok and Tompa 
2009; Uversky et al. 2008). As the name implies, pro¬ 
teins in this family spend a substantial proportion of 
their life cycle in an unfolded or unstructured confor¬ 
mation. Some transiently adopt structure on interacting 
with their binding partner; others appear to function as 
unfolded chains. Importantly, members of this family 
play critical roles in regulation of transcription and 
translation as well as signal transduction. 
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What these two hroad classes of protein - globular 
and natively unfolded - have in common is the require¬ 
ment to adopt a “partially folded” conformation in 
order to misfold/self-assemble into amyloid-like 
fibrils. In the body and in disease, this typically 
requires either a mutation that alters the amino acid 
sequence, consequently destabilizing the native struc¬ 
ture and driving the formation of a partially folded 
structure; or the application of a stress which may in 
turn alter the internal environment of the cell such that 
protein structure is transiently destabilized. Under 
these conditions, exposure of groups that would other¬ 
wise be buried in the protein interior of globular 
proteins, or stabilization of structures that would oth¬ 
erwise have substantial flexibility in intrinsically 
unstructured proteins, may give rise to nonnative 
intermolecular protein-protein interactions, nucleating 
the formation of aggregates. 

The conformation any polypeptide can adopt is 
restricted by steric clashes between atoms in the back¬ 
bone and side chains. These restrictions are summa¬ 
rized in the famous Ramachandran plot, which 
describes all possible polypeptide conformations in 
terms of two backbone dihedral angles, psi and phi. 
The repetition of combinations of psi and phi defines 
local structures known as secondary structural ele¬ 
ments. Two of these, the a-helices (4> = -50-60 
v|/ = —50-60) and the P-strands (cj) = —110-140 
\|/ = H-110-135) predominate in protein structures. In 
folded proteins the hydrogen-bonding potential of 
the polypeptide backbone must also be accommo¬ 
dated. In the a-helix the majority of the backbone 
NH and CO groups are internally hydrogen-bonded. 
In contrast, internal hydrogen bonding within 
extended p-strands is not possible and, so, two or 
more P-strands pair via hydrogen bonds to form 
P-sheets. Amyloid-like fibrils comprise extended 
(potentially infinite) P-sheets in which an extensive 
network of inter- and intramolecular hydrogen bonds 
run parallel to the fibril long axis and perpendicular to 
the polypeptide chain. 

This formation of the p-sheet structure is 
a fundamental property of the hydrogen-bonding 
potential of the polypeptide chain, arising as it does 
from groups in the polypeptide backbone that is 
common to all proteins. Thus, the formation of exten¬ 
sive P-sheet structure competes with the tendency of 
proteins and polypeptides to adopt their native struc¬ 
ture, which is encoded by the precise sequence of 


amino acids. As an illustrative example, the protein 
myoglobin is almost wholly a-helical in its native, 
globular, and functional state, but following partial 
unfolding at high pH and elevated temperatures it 
self-assembles into aggregates demonstrating all of 
the characteristics of amyloid-like fibrils including 
a conformation rich in p-sheet. An alternative example 
is provided by the natively unstructured protein 
a-synuclein. The in vitro aggregation of a-synuclein 
involves the formation of a partially folded intermedi¬ 
ate (Fink 2006) that can be stabilized by high temper¬ 
atures, low pH, or the presence of metal ions. Familial 
mutations in a-synuclein cause a heritable early-onset 
form of Parkinson’s Disease with the aggregation of 
protein(s) within characteristic structures known as 
Lewy Bodies. 

Myoglobin and a-synuclein are just two examples 
of the multitude of proteins that have now been dem¬ 
onstrated to form amyloid-like fibrils in the test tube, 
and regardless of the starting sequence of amino acids 
all of the resulting fibrils share common properties. 
Universal traits include the P-sheet secondary structure 
that defines the amyloid-like fibril, typically detected 
by ► circular dichroism or ► FTIR spectroscopy, 
a “cross-P” structure detected by x-ray fiber diffrac¬ 
tion, binding of the dyes Congo Red or Thioflavin T, 
and evidence for nanometer-diameter fibrils by trans¬ 
mission electron microscopy (TEM) or atomic force 
microscopy (AFM) (Methods in Enzymology 
Volumes 309 (1999), 412 (2006) and 413 (2006)). 

The Kinetics of Fibril Formation 

The kinetics of self-assembly of proteins and polypep¬ 
tides into amyloid-like fibrils almost universally 
exhibit a sigmoidal relationship that has been ascribed 
to a nucleated growth regime in which the formation of 
a critical nucleus during the “lag” time is rate-limiting 
(for a review see Naiki and Gejyo 1999). Evidence for 
the nucleated growth mechanism is provided by the 
elimination of the lag phase following addition of 
preformed oligomers/fibrils (“seeding”) and this phe¬ 
nomenon is well known in vitro. Seeding has also been 
inferred in vivo in susceptible mouse models whereby 
injection or ingestion of amyloid-like material - 
regardless of the polypeptide precursor - triggers 
systemic amyloidosis in susceptible animal models 
(Lundmark et al. 2005). 
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Amyloid Formation, Fig. 1 Hierarchy of amyloid-like fibril 
structure. Individual polypeptide chains are assembled into 
sheets (a), which pack against one another to form a 
protofilament (b). The hydrogen-bonding direction is vertical. 
The vertical distance between strands along the hydrogen¬ 
bonding direction is 4.7 A; the horizontal distance between 


sheets is dictated by the exact composition of the protein and 
the amino acids making up the intersheet interface but is typically 
approximately 10 A. A typical x-ray fiber diffraction pattern 
illustrating these two distances is shown (c). Multiple 
protofllaments form a fibril (d) 


The nucleated growth mechanism underlying fibril 
formation has particular significance in disease 
because it is now widely accepted that it is oligomeric 
intermediates such as the critical nucleus or prefibrillar 
aggregates that are toxic to cells (Glabe 2005). This 
implies, moreover, that aggregation to form mature 
fibrils may have a protective effect since it will serve 
to sequester toxic intermediates into higher-order 
nontoxic structures. Obviously this will not be the 
case in systemic amyloid diseases involving proteins 
present in the body at moderate or high concentrations, 
where the load of amyloid fibrils in the major organs 
may exceed kilograms, drastically compromising 
organ function. The oligomer toxicity hypothesis 
does, however, have significant implications for ther¬ 
apeutic interventions, which in many cases are aimed 
at reversing fibril assembly. Such therapies may in fact 
increase the oligomeric intermediate load and thereby 
exacerbate disease. 

Structure 

The three-dimensional atomic-resolution structure 
of an amyloid-like fibril has yet to be determined. 


Their large size makes fibrils intractable to solution 
NMR techniques, and their noncrystalline and inho¬ 
mogeneous nature precludes x-ray crystallography. 
Recent advances have been made with the application 
of ► solid-state NMR (ssNMR) spectroscopic methods 
to investigate amyloid-like fibril systems, as well as 
recent success in growing microcrystals of small pep¬ 
tide fragments that share some, although not all, of the 
characteristics of the amyloid-like fibril architecture. 
Such experiments have yielded a number of models 
which, although not sufficient to define the fibril struc¬ 
ture unambiguously, nonetheless provide important 
insights into the general properties of these assemblies. 
For a review of some of the structural models derived 
using these methods, the reader is directed to Martin 
et al. (2010) and Makin and Serpell (2005). 

The structure of amyloid-like fibrils is typically 
described in a hierarchical fashion, although it should 
be emphasized that the order in which the higher levels 
of structure are formed is as yet unknown. Mature 
fibrils are typically 6-12 nm in diameter and assem¬ 
bled from a number of strands (“protofilaments”) 
wound together in a helical rope-like or twisted 
ribbon-like arrangement (Fig. 1). The contacts 
between protofilaments making up the fibril are 
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unknown. The protofilaments themselves comprise 
a hydrogen-bonded array of proteins in which the 
polypeptide backbone lies perpendicular to the fibril 
axis. This molecular structure provides an ongoing 
challenge to structural biologists. 

Fibril Networks and Aggregates 

As described previously, amyloid deposits found 
in vivo comprise a number of proteins and polysaccha¬ 
ride components in a dense mass. In vitro amyloid-like 
fibrils typically form hydrogels. Depending on the 
protein species involved and the preparation condi¬ 
tions (pH, temperature, concentration), these gels 
may be thixotropic or remain solid on inversion. The 
elastic modulus of amyloid-like fibril gels vary accord¬ 
ingly, ranging from a few Pa for weak gels, to 4-20 kPa 
for gels assembled from lysozyme and a peptide 
derived from amino acids 16-20 of the Ap peptide. 
In keeping with the polymeric nature of amyloid-like 
fibrils, some systems have also been observed to fur¬ 
ther self-assemble to form spherulites, structures more 
commonly associated with synthetic polymers such as 
polyethylene. In spherulites, amyloid-like fibrils are 
arrayed radially to give spheres 10-150 pm in diame¬ 
ter. Under the polarizing microscope, spherulites show 
a typical “Maltese cross” pattern of light extinction, 
and similar patterns have also been observed in in vivo 
amyloid deposits. 

The Amyloid-Like Fibril 

Low-resolution structural models of amyloid-like 
fibrils have been generated by cryo-^ electron micros¬ 
copy and ► atomic force microscopy methods, and 
these have provided the impetus for much speculation 
on the exact distribution of the polypeptide chain 
within protofilaments. Cryo-electron microscopy 
(cryo-EM) freezes the protein in a thin layer of vitreous 
ice, which not only keeps the sample hydrated and thus 
as similar as possible to the native environment, but 
also prevents any dramatic structural rearrangements 
as a result of radiation damage. Atomic force micros¬ 
copy involves imaging of the fibrils on a surface, but 
samples can be imaged within liquid and thus in 
their hydrated state. These technique are able to 
resolve the individual protofilaments, which coil 


around a common axis or around each other to form 
the fibril (see Rambaran and Serpell (2008), and refer¬ 
ences therein). The structure of model amyloid-like 
fibrils formed from the “SH3” protein domain consists 
of four individual protofilaments winding around a 
hollow core. Amyloid-like fibrils assembled from insu¬ 
lin form diverse structures comprising two, four, or six 
protofilaments, exhibiting variable helical pitch along 
the fibril axis. Amyloid fibrils isolated from the tissues 
of patients with hereditary systemic amyloidosis 
(Jimenez et al. 2001) were substantially more hetero¬ 
geneous than fibrils assembled in vitro, possibly due to 
the presence of extraneous nonprotein material such as 
lipid or glycoproteins. In the same study, ex vivo fibrils 
comprising a mutant form of lysozyme displayed an 
elliptical cross section consisting of six protofilaments, 
once again with a hollow core. 

The Protofilament 

The rope-like construction of amyloid-like fibrils 
reveals individual protofilaments. The most commonly 
used technique to explore the average folding of the 
polypeptide chain within protofilaments is x-ray fiber 
diffraction. The result is low-resolution structural data 
that provide information about the average distance 
between repetitive units of the polypeptide chain mak¬ 
ing up the fibrils. In the majority of cases the degree of 
structural information is limited to two reflections, one 
at 4.7 A in the meridional plane and a further reflection 
at ^10 A in the equatorial plane, and these are charac¬ 
teristic of the “cross-P pattern” that is a diagnostic 
criterion of the amyloid-like fibril. 

The majority of high-resolution atomic-level infor¬ 
mation is available at this level of the structural hier¬ 
archy, with the data coming from solid-state NMR 
methods and x-ray diffraction from microcrystals. 
Solid-state NMR has been used to study the fibrillar 
structures of short peptides ranging from 7 to 11 amino 
acids, the 40^2 amino acid AP peptide implicated in 
Alzheimer’s Disease, and a 71 amino acid region 
derived from the fungal prion protein HET-s. These 
and other proteins are reviewed extensively by Tycko 
(2006). Eurther high-resolution information has been 
derived from p-sheet microcrystals grown from short 
polypeptides. All of the microcrystals characterized to 
date adopt a “steric zipper” structure requiring 
a closely packed interface involving interdigitation of 
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amino acid side chains between P-strands. Both paral¬ 
lel and antiparallel examples have now been found, 
although it should be pointed out that solid-state NMR 
evidence suggests that the microenvironment experi¬ 
enced in the microcrystalline and fibrillar forms of 
these peptides is different. The “steric zipper” interface 
is essentially absent from the database of known pro¬ 
tein structures, suggesting it is a peculiarity of these 
P-sheet rich microcrystals, and possibly also of amy- 
loid-like fibrils. Marshall and Serpell (2010) have 
discussed structural polymorphisms in crystals and 
fibrils in their recent article. 


Prefibrillar Aggregates 

The simplest and smallest structures adopted by 
amyloid-like fibrils are the prefibrillar oligomeric 
aggregates implicated in cytotoxicity. That these 
species share a common structure is inferred from the 
fact that an antibody raised to prefibrillar aggregates of 
the Ap peptide recognizes early-stage prefibrillar 
aggregates formed by a wide range of unrelated 
polypeptides (Glabe 2005). Typically these aggregates 
comprise 15^0 monomers and are spherical, coming 
together to form “strings of beads” commonly referred 
to as protofibrils (which are also cytotoxic). These 
strings of beads sometimes anneal to form annular 
structures. The toxic effects of both oligomeric 
and annular structures are suggested to be due to per¬ 
turbation of the plasma membrane, although details 
remain unclear. Formation of the long-straight fibrils 
comprising multiple protofilaments described above 
heralds the loss of cytotoxicity in the majority of 
examples. 


"Natural" Amyloid 

With the explosion in amyloid-related research came 
the discovery that evolution appears to have selected 
some proteins precisely for their ability to form amy¬ 
loid fibrils, and that fibril formation in some cases 
appears to play a functional role in cells (for a review 
see Fowler et al. 2007). For example, the self-assembly 
of proteins into amyloid-like fibrils on the surface of 
some bacteria enable them to colonize otherwise resis¬ 
tant surfaces. Indeed, E. coli and Salmonella 
spp. dedicate a complete biosynthetic pathway 


comprising transport proteins, putative chaperones, 
and nucleating proteins to the generation of functional 
fibrils. These fibrils must confer a significant advan¬ 
tage to the organism or they would have been selected 
against by evolution. In a contrasting example, the 
self-assembly of the translation termination factor 
Sup35 into amyloid-like fibrils has been proposed as 
an example of a “DNA-free” hereditary mechanism in 
yeast (Shorter and Lindquist 2005). Aggregation of 
Sup35 into amyloid-like fibrils results in an increased 
tendency to read through stop codons and 
a corresponding decrease in translational fidelity, so 
that yeast with the aggregated form of Sup35 are able 
to explore protein sequences that have not been subject 
to selective pressure. As a final example, fibrillar self- 
assembly has also been implicated in the control of 
melanin production in mammalian cells. It is thought 
that amyloid-like fibrils assembled from a fragment of 
the transmembrane protein Pmel7 bind to and pre¬ 
organize melanin precursors along their fibril axis. 
This would enable efficient polymerization of melanin 
in a controlled fashion, and prevent free diffusion of 
cytotoxic monomers in the cell. For further details see 
the section on ► functional amyloid. 


Summary 

Amyloid and amyloid-like fibrils are self-assembled, 
organized, and kinetically stable protein aggregates 
that form via a nucleation-and-growth mechanism fol¬ 
lowing population of a partially folded structure by 
proteins. Amyloid fibril formation is implicated in 
a number of diseases, where the disease phenotype 
may arise from the loss of function of the native pro¬ 
tein, or a toxic gain of function arising from the inher¬ 
ent cytotoxicity of prefibrillar oligomers. Amyloid 
fibrils are rich in inter- and intramolecular (I-sheet 
structure, and typically comprise a number of 
protofilaments winding around a common axis. The 
stability and ubiquitous nature of this multimolecular 
structure has led to its use in nature as a functional 
scaffold. 


Cross-References 

► Amyloid Formation in Bacteria 

► Amyloid Inhibitors 
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► Atomic Force Microscopy 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Electron Microscopy 

► Functional Amyloid 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Magic Angle Spinning Solid-State NMR on 
Proteins 

► Oligomeric Proteins 

► Pressure Effects on Proteins 
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Synonyms 

Amyloids in prokaryotes 

Definition 

Amyloids result from the structural conversion of 
soluble proteins into self-associating conformations 
that form macromolecular insoluble hbrillar aggregates. 
The formation of amyloids is linked to the onset of 
debilitating human disorders like Alzheimer’s or 
Parkinson’s diseases. However, these highly ordered 
protein assemblies also form in simplest organisms 
like yeast or bacteria (de Groot et al. 2009). 
A common limitation of recombinant protein production 
in bacteria is misfolding and the concomitant for¬ 
mation of insoluble protein aggregates known as 
inclusion bodies (IBs). These intracellular aggregates 
are dense, porous, hydrated, apparently amorphous, and 
refractile particles of nearly 1 pm in diameter and are 
commonly localized at the cell poles. The mechanism of 
IBs assembly in Escherichia coli has been characterized 
in detail and shown to be nucleation driven and sequence 
specific. Transmission Electron Microscopy and Atomic 
Force Microscopy allowed visualizing the presence of 
fibrillar stractures coexisting with amorphous material 
inside the bacterial IBs formed by different proteins. 
Circular Dichroism, Infrared Spectroscopy, X-ray 
diffraction, as well as Solid and Liquid State Nuclear 
Magnetic Resonance data converge to indicate that 
bacterial IBs are characterized by the presence of an 
extensive intermolecular p-sheet structure similar at 
the molecular level to the characteristic cross-P-sheet 
structure of amyloid fibrils. Accordingly, bacterial 
aggregates bind with good affinity to amyloid diag¬ 
nostic dyes like Thioflavin-T and Congo red. In IBs, 
ordered aggregated polypeptides can coexist with 
unstructured species or even with functional globular 
domains. In addition to fibrillar material, the presence of 
SDS-resistant oligomeric assemblies has been detected 


in the insoluble fraction of bacteria. Therefore, it is not 
surprising that bacterial aggregates affect cell division 
and aging, can induce cytotoxicity in eukaryotic cells 
and become infectious in the case of IBs formed by 
prion proteins. Overall, the amyloid-like properties of 
bacterial aggregates suggest that these organisms might 
provide a simple but biologically relevant background 
for the understanding and the pharmacological targeting 
of pathologic protein deposition. Interestingly, the 
bacterial ability to construct amyloid structures is not 
restricted to recombinant IBs and several unrelated 
natural bacterial polypeptides able to form functional 
amyloid fibrils have been described. Bacteria exploit 
the mechanical, toxic, and protective properties of the 
macromolecular assemblies formed by these proteins 
for different biological functions such as surface 
adhesion, host invasion, colonization, or cytotoxicity 
(Otzen 2010). 

Cross-References 

► Amyloid Formation 

► Functional Amyloid 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Infrared Spectroscopy of Protein Structure 

► Protein NMR - Introduction 

► Protein Oligomers 
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Definition 

At least 40 diseases are caused by proteins or peptides 
folding incorrectly and aggregating into amyloid fibrils 
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or plaques, including Alzheimer’s Disease, type II dia¬ 
betes, Parkinson’s disease, Huntington’s disease, and 
the spongiform encephalopathies. The best-studied 
amyloid-based disease is Alzheimer’s, characterized 
pathologically by abnormally high levels of brain 
lesions (senile plaques), neurohbrillary tangles in 
dead and dying neurons, and elevated numbers of amy¬ 
loid deposits in the walls of cerebral blood vessels. The 
major component of senile plaques is a small peptide of 
39^3 amino acids called (I-amyloid (AP). In vitro and 
in vivo evidence shows that soluble, oligomeric forms 
of AP have potent neurotoxic activity and are the pri¬ 
mary causes of neuronal injury and cell death, rather 
than the larger fibrils and plaques that are more readily 
visualized. An obvious strategy to treat Alzheimer’s, 
and other amyloidoses, is therefore to interfere with 
amyloid aggregation, and ideally oligomer formation, 
by either breaking up oligomers or making them 
aggregate further into a less toxic form. Current well- 
established AD dmgs are acetylcholinesterase inhibi¬ 
tors and N-methyl-D-aspartate receptor antagonists. 
They can only alleviate symptoms for a limited time. 

Numerous structurally diverse organic compounds 
can inhibit or reduce the aggregation and toxicity of 
AP in vitro. Peptide inhibitors of amyloid formation 
are usually based around self-recognition elements 
within the target peptide, such as KLVFF (16-20) in 
Ap, which can form amyloid in isolation. These can be 
modihed to improve activity and other properties, such 
as resistance to protease degradation, by modifications 
including backbone N-methylation, replacement with 
D-amino acids, or adding additional groups. 

Alternatively, it may be possible to prevent key 
steps within the amyloid cascade that leads to cell 
death, initiated by the key event of aggregation. For 
example, Ap is produced from the amyloid precursor 
protein by cleavage by p- and y-secretase enzymes. 
Secretase inhibitors will thus prevent production of 
Ap, though as these enzymes have multiple substrates, 
side effects from their use is likely to be 
problematic. Alternatively, upregulation of amyloid 
degradation, such as by activation of a-secretase 
which cleaves Ap, can remove the toxic peptide. 
Immunization with AP showed exciting results in 
a transgenic mouse, though a subsequent clinical trial 
had to be halted after a small number of patients 
developed meningoencephalitis. Nevertheless, vacci¬ 
nation remains a promising approach, perhaps using 
fragments of Ap or other modifications. Activated 


microglia, along with a range of inflammatory media¬ 
tors, has been identified in association with the lesions 
of AD, implying that anti-inflammatory agents such as 
nonsteroidal anti-inflammatory drugs could protect 
against the disease. 

It is unclear why Ap oligomers are toxic. Plausible 
mechanisms of toxicity include oxidative stress, metal 
binding, free radical formation, or ion channel forma¬ 
tion, so it may be desirable to interfere with these 
processes, using reducing agents, metal chelators, or 
ion channel inhibitors. Using stem cells to replace lost 
neurons is a possibility for the future, though this 
approach would not remove the toxic material. 
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Introduction 

Amyloid fibrils are filamentous protein aggregates 
most closely associated with human pathologies 
known collectively as “amyloidoses,” notable mem¬ 
bers of which include Alzheimer’s Disease, 
Parkinson’s Disease, and Creutzfeld-Jakob Disease 
(Chiti and Dobson 2006). Despite this connection, 
they have garnered attention for their potential use as 
a biomaterial. Indeed, Nature has exploited the unique 
qualities of amyloid hbrils in a variety of ways from 
structural components of bacterial biofilms 
(► Amyloid Formation in Bacteria), to playing a role 
in the synthesis and sequestration of melanin 
(► Functional Amyloid) (Fowler et al. 2006, 2007). 
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Amyloid fibrils possess several characteristics 
which make them attractive as a class of biomaterials 
(Aggeli et al. 2001). Firstly, amyloid fibrils self- 
assemble. Self-assembly is the process by which 
molecular objects, via cooperative molecular recogni¬ 
tion, form supramolecular structures without external 
input. Self-assembly is ubiquitous in biology - from 
micellization to protein folding. This phenomenon is 
an indispensable tool in any “bottom-up” approach to 
constructing nanomaterials. 

Secondly, they possess unique structural properties. 
Experiments have demonstrated amyloid fibrils pos¬ 
sess a Young’s Modulus around the GPa scale and 
a strength comparable to steel (Knowles and Buehler 
2011). The structural properties of any material pro¬ 
vide an indirect indication of the underlying 
intermolecular forces which collectively give rise to 
its bulk attributes. The distinguishing feature of 
► amyloid (► Amyloid Formation) and amyloid-like 
fibrils, according to high-resolution data gathered from 
x-ray diffraction and NMR experiments, is the pres¬ 
ence of extensive cross P-sheet structure within the 
core of the fibril. This extensive P-sheet structure 
forms an expansive hydrogen-bonding network 
involving the polypeptide backbone which spans the 
length of the fibril. It is this cooperative intermolecular 
hydrogen-bonding network which confers the stability 
and unique material properties to amyloid fibrils 
(Knowles et al. 2007). The fact that amyloid fibrils 
exhibit similar mechanical properties to dragline spi¬ 
der silk, which is known to be one of the strongest and 
most rigid proteinaceous materials in nature, should 
not be surprising; silk and amyloid fibrils both possess 
the common characteristic of expansive hydrogen- 
bonded P-sheet networks (Xu and Lewis 1990). 
While a complete structure of an amyloid or amyloid¬ 
like fibril has yet to be determined, this common struc¬ 
tural motif is always present, regardless of the initial 
protein or polypeptide conformation. Ultimately, amy¬ 
loid fibrils, regardless of the initial structure of the 
constituent protein, generally exhibit typical dimen¬ 
sions of 5-20 nm in diameter and extending to microns 
in length. Furthermore, amyloid fibrils can display 
a spectrum of different morphologies from long 
straight filaments, to tapes, twisted ribbons, and hollow 
tubes. 

Thirdly, since amyloid fibrils can be constructed 
from a diverse range of proteins and polypeptides, the 
conditions by which fibrils form are also expansive. 


Fibrils are known to form in a wide range of tempera¬ 
ture, pH, and solvent conditions. This flexibility in 
experimental settings lifts potential restrictions in 
applying the assembly of amyloid fibrils to 
nanobiotechnological applications. 

Additionally, since amyloid fibrils are constructed 
from amino acid units, the power and relative ease of 
genetic modification and manipulation of bacterial 
expression systems is at the disposal of anyone design¬ 
ing novel amyloid biomaterials (Kobayashi et al. 
2008). These well-understood techniques permit the 
introduction of point mutations in the amino acid 
sequence which opens the possibility to chemically 
functionalize the surface of a fibril or changing its 
electrostatic characteristics (Burkoth et al. 1999; 
Collier and Messersmith 2004; Zhang 2003). Further¬ 
more, given the enormous combinatorial sequence 
space available for creating de novo peptides and pro¬ 
teins, it should one day be possible to design “to order” 
amyloidogenic materials with unique environmental 
responsiveness. The potential ease of modification, 
hybridization, and functionalization vastly enlarges 
the material applications, capabilities, and potential 
usefulness of amyloid as a biomaterial. 

There have been numerous examples of amyloid 
fibrils being used as a biomaterial in the literature; how¬ 
ever, in this entry, three broad types of amyloid-related 
biomaterials will be discussed; their use as components 
and templates for nanowires and nanoelectronics, 
amyloid hydrogels and their application to cell 
scaffolds (► Extracellular Matrix Components) and 
drug delivery technologies, and the utilization of 
amyloid fibrils as architectures for ► light-harvesting 
materials (► Photosynthetic Electron Transport). 

Nanowires and Nanoelectronics 

The emerging field of nanoelectronics has the possi¬ 
bility to usher in the next revolution in computing. 
However, fabricating the necessary architecture for 
such technology is a highly complex problem (Willner 
and Katz 2005). Forming nanowires and other nano¬ 
scale electronic components from a “top-down” 
approach, i.e., lithography, is fraught with difficulty. 
However, by exploiting molecular self-assembly, an 
efficient, controlled, and cost-effective route to creat¬ 
ing nanoscale electronic components is a distinct pos¬ 
sibility. To this end, several groups have created 
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nanowires by using self-assembled amyloid fibrils as 
a template. One of the first successful attempts was 
achieved by Scheibel and coworkers (Scheibel et al. 
2003). In this approach, they used the “NM” region of 
the yeast prion protein Sup35p to form highly stable 
amyloid fibrils 10-nm wide and with lengths ranging 
from 60 to hundreds of microns, depending on exper¬ 
imental conditions. They then genetically modified an 
NM-variant which introduced reactive cysteine resi¬ 
dues to the surface of the fibril. The cysteine residues 
can then be covalently linked to gold nanoparticles. 
The gold nanoparticles act as nucleation sites for fur¬ 
ther metallic deposition via reduction of gold or silver 
from solution onto the fibrils. This method created 
metallically continuous gold or silver nanowires with 
widths of ^100 nm which demonstrated high conduc¬ 
tivities and low resistances (~80 Q). 

Reches and Gazit have also exploited the self- 
assembly of an amyloid-like fibril structure to fabricate 
nanowires (Reches and Gazit 2003). In their approach, 
they employ an aromatic dipeptide, diphenylalanine, 
which readily self-assembles at high concentrations 
into hbrils with a hollow core ^100 nm in diameter. 
They discovered that by adding ionic silver to the 
fibril-containing solution and reducing the silver with 
citric acid, continuous metal wires could be formed 
within the peptide nanotubes. Subsequent proteolytic 
lysis of the peptide “mold” left behind a cast silver 
nanowire almost a micron in length and with a uniform 
diameter ^20 nm. They further combined this tech¬ 
nique and one similar to the approach of Scheibel et al. 
to create multilayered, coaxial nanowire assemblies 
(Gamy et al. 2006). They first assembled nanotubes 
filled with continuous metallic silver by the 
method described above. The peptide used in this 
case had been modified, akin fo the technique used by 
Scheibel et al., with a cysteine residue to allow for the 
decoration of the exterior of the nanotube with gold. 
Subsequent reduction of gold on the surface of the 
nanotube created a novel nanoscopic coaxial wire as 
confirmed by TEM and energy dispersive X-ray 
analysis. 

The ability to create these nanoscale elements is 
important but a means to control and manipulate their 
spatial arrangement is critical in developing any func¬ 
tional nanodevice. Reches and Gazit utilized a rapid 
evaporation process which created “nanoforests” of 
diphenylalanine nanotubes which are uniaxially 


aligned in dense ordered arrays. They have also devel¬ 
oped a technique which allows for horizontal manipu¬ 
lation of nanotubes by hydrophobically linking 
ferrofluidic particles to the exterior of the nanotubes. 
This ferrofluidic coating allows spatial alignment of 
the nanotubes when an external magnetic field is 
applied (Reches and Gazit 2006). More recently, they 
have developed a vapor deposition method which 
creates similarly dense, vertically aligned patterns of 
nanotubes (Adler-Abramovich et al. 2009). 
Ultracapacitors fabricated from these assemblies 
showed enhanced capacitance relative to carbon and 
carbon nanotube-modihed electrodes. Furthermore 
these arrays, when applied to a surface, act as 
a highly hydrophobic coating which can hnd 
a range of applications from microfluidic devices to 
self-cleaning surfaces. Other techniques such as 
dielectrophoresis (Castillo et al. 2008) and top-down 
approaches, e.g., lithography, have also been used to 
organize the spatial arrangement of these structures 
(Rosenman et al. 201 1). 

Amyloid derived nanowires and nanotubes may 
hnd an application as sensitive >■ biosensors 
(► Nanowire Biosensors). By virtue of being biologi¬ 
cally derived materials, these structures can be 
modihed to specihcally target and recognize electro¬ 
chemical, fluorescent, or enzymatically active species 
of interest. Several groups are already developing 
amyloid-like peptide nanotubes and nanowires to act 
as biosensors. Leng et al. have used Sup35 in conjunc¬ 
tion with a fluorescent reporter to detect the presence 
of the pesticide methyl parathion (Leng et al. 2010). 
Viguier et al. have employed amyloid-like peptide 
nanotubes as biorecognition sites on a gold electrode 
which selectively forms complexes with copper ions in 
solution. Their experiments demonstrate the ability to 
measure, with a sensitivity in the micromolar range, 
changes in copper ion concentrations (Viguier et al. 
2011). In another application, peptide nanotubes were 
functionalized with a glucose oxidase and were then 
attached to a gold electrode. By monitoring the produc¬ 
tion of hydrogen peroxide produced in the enzymatic 
reaction between the glucose oxidase and glucose, 
a sensitive determination of the amount of glucose 
present was possible. They also demonstrated the ability 
to detect changes in ethanol and NADH concentrations 
with a similar experimental setup using ethanol 
dehydrogenase (Yemini et al. 2005). Taken together. 
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amyloid-like nanotubes offer many opportunities for 
engineering novel nanoelectronics materials. 

Hydrogels 

Hydrogels are highly hydrated (up to 99% water by 
volume) three-dimensional networks of polymers which 
have found many applications in the cosmetics and bio¬ 
technology sectors (Barhucci 2009). They have proven to 
be of particular interest as a scaffold for tissue engineer¬ 
ing, regeneration, and repair. Additionally, hydrogels are 
classed as a potentially “smart” material - they can 
undergo sol-gel transitions or alter their rheological prop¬ 
erties in response to changes or differences in environ¬ 
mental conditions such as temperature, pH, ionic 
strength, enzymatic activity, and external stresses. This 
property opens the door to potential applications for 
targeted drug delivery or biosensors. 

Several researchers have turned to amyloid-hased 
hydrogels due to their self-assembly and material prop¬ 
erties. As has been stressed, the ability to functionalize, 
modify, or design amyloidogenic proteins or peptides 
is of great utility and offers flexibility to suit a variety 
of biological and engineering challenges. Zhang and 
coworkers have used a p-sheet-rich hhrillar hydrogel 
scaffold to facilitate cell adhesion and tissue regener¬ 
ation (Yanlian et al. 2009). They observed that 
oligopeptides containing a side chain motif of argi- 
nine-glycine-aspartate (RGD) act as a ligand for cell 
adhesion and developed a novel class of functionalized 
peptides with alternating hydrophobic and hydrophilic 
amino acid sequences which mimics this motif. These 
peptides, known as EAK16, RAD 16, and KLD12, 
spontaneously self-assemble into stable macroscopic 
matrices in the presence of physiological concentra¬ 
tions of monovalent salts. They employed this amy- 
loid-like hbrillar matrix to successfully promote cell 
adhesion in diverse cell types such as hbroblasts and 
keratinocytes, stimulated growth of chondrocytes to 
generate a cartilage-like extracellular matrix, and pro¬ 
vided a substrate to promote neurite growth and syn¬ 
apse formation. They even extended this technique by 
utilizing this peptide matrix to return functional vision 
to hamsters with severed optical tracts by encouraging 
axon regeneration and promoting the “knitting” of 
brain tissue together. Amyloid fibrils created from 
TTR( 105-115) directly functionalized with the cell 


adhesion RGD sequence also demonstrated cell attach¬ 
ment specihcity (Gras et al. 2008). Another group 
developed a novel means to trigger a salt-induced 
gelation of a peptide with this similar alternating 
hydrophobic/hydrophilic motif. In their scheme, 
thermo- or photoactive liposomes could be made to 
release salts upon stimulation and thereby inducing 
matrix formation (Collier et al. 2001). 

Other groups have attempted to exploit the respon¬ 
siveness of these amyloid hydrogels to environmental 
changes for developing drug delivery systems and bio¬ 
sensors. There are several criteria for any cell or drug 
delivery system based on a “smart” hydrogel that must 
be met to maximize the chances of success: biocom- 
patibility, biodegradability, robust rheological and 
structural properties, and the ability to sustain the 
viability of the cells or drugs to be delivered. Further¬ 
more, any injectable delivery system must demonstrate 
rapid gelation kinetics to prevent sedimentation and 
maintain a homogeneous distribution of the load. Once 
applied to the area of interest, the delivered hydrogel 
impregnated with the therapeutic load must remain 
localized to the area of interest. Haines-Butterick 
et al. have developed a proof-of-concept injectable 
amyloid-like hydrogel which fulfills these stringent 
criteria (Haines-Butterick et al. 2007). They used 
a 20 amino acid peptide called MAX8 which rapidly 
self-assemhles into a (3-sheet-rich hydrogel in the pres¬ 
ence of the cell growth medium DMEM. A unique 
feature of this amyloid-like peptide hydrogel is that 
under a sufficient applied shear stress, the gel will 
shear-thin. However, the gel quickly recovers rigidity 
after the shear stress ceases. This advantageous rheo¬ 
logical behavior was successfully applied to the deliv¬ 
ery of stem cells by syringe injection. The gel is formed 
within the syringe, encapsulating the stem cells. Upon 
injection, the gel is shear-thinned allowing for homo¬ 
geneous application of the cells to a targeted area. 
Quickly after injection, the mechanical rigidity of the 
gel is regained and the stem cells remain encapsulated, 
maintaining localization. 

Several groups have demonstrated successful 
enzyme entrapment within amyloid hydrogels and, 
importantly, that enzyme activity is maintained in 
a variety of harsh conditions. Pilkington and coworkers 
use hovine insulin amyloid fibrils to create 
a hydrogel which was functionalized by conjugating 
glucose oxidase to the fibrils using glutaraldehyde as a 
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cross-linker. This hydrogel scaffold was then 
incorporated into a polyvinyl alcohol (PVOH) film. 
They showed that the embedded glucose oxidase 
hydrogel conferred antibacterial activity to the PVOH 
film (Pilkingfon et al. 2010). In another example, 
Bhak et al. used curly protein fibrils of a-synuclein 
fo encapsulafe horseradish peroxidase. They demon- 
sfrafed fhe hydrogel protected the enzyme from loss of 
activity when exposed to multiple catalyses and heat 
treatments (Bhak et al. 2010). More recently, Meier 
and Welland have developed a novel wet-spinning 
technique to create macroscopic wires from amyloid 
fibrils derived from hen egg white lysozyme. These 
wires exhibit remarkable stiffness (14 GPa) and tensile 
strength (^300 MPa). They demonstrate their drug 
release capability by incubating the fibers in 
riboflavin-5'-phosphafe (RP), which acts as a model 
drug system. They can then control the release of RP 
by alteration of solution pH (Meier and Welland 2011). 
Furthermore, their technique allows for the creation of 
mineralized matrices of fibers to potentially create 
bone analogues. 

Scaffolds for Light Harvesting 

Another application which can take advantage of the 
self-assembly of amyloid fibrils and fheir resultant 
regularized structures is their use as architectures in 
photoelectric and light-harvesting technologies. The 
peptide fragment of transthyretin, TTR( 105-115), is 
known to readily form very uniform amyloid fibrils. 
Channon and coworkers hybridized these peptides 
with the fluorophore fluorene. They use the propensity 
of TTR( 105-115) self-assembly to drive the 
coassembly of these “guest” molecules with little dis¬ 
ruption to the fibrils themselves (Channon et al. 2009). 
This technique can be extended by the introduction of 
two different fluorophores whose properties are chosen 
to promote resonance energy transfer. Resonance 
energy transfer efficiency is highly sensitive to the 
relative proximity of one fluorophore to the other. 
Consequently, fluorophores hybridized with soluble 
TTR( 105-115) will not exhibit the characteristic 
fluorescence emission from acceptor molecules. How¬ 
ever, upon assembly of TTR(105-115) into amyloid 
fibrils, and in turn the coassembly of the donor and 
acceptor fluorophores, the relative distance will be 
such that resonance energy transfer is achievable. 


This technique allows for photon capture and energy 
transfer to specific locations; such behavior bears close 
resemblance to natural light-harvesting materials in 
photosynthetic organisms. This method has the advan¬ 
tage that the donor/acceptor distances and orientations 
can be controlled simply by the assembly of the protein 
architecture. A similar demonstration of this light¬ 
harvesting technique was demonstrated by Liang and 
coworkers using AP( 16-22) as the assembly scaffold 
(Liang et al. 2008). Recently, Y.S. Choi and coworkers 
incorporated phthalocyanine tetrasulfonate (PcTS) and 
ions into a-synuclein amyloid fibrils. The fibrillar 
matrix they fabricate exhibits marked light-harvesting 
and electron-transfer properties (Choi et al. 2011). 
Interestingly, these hybridized amyloid fibrils tend to 
spontaneously form 2-D arrays with high fibrillar 
alignment. This could be advantageous for increasing 
the effective area of light absorption, conductivity, and 
allows for greater material uniformity. 

Summary 

Amyloid and amyloid-like fibrils hold great promise as 
biomaterials due to their unique physical characteris¬ 
tics. Coupled with their ability to self-assemble in 
a wide range of conditions, their apparent universality 
as a class of supramolecular polypeptide structure, 
their ease of chemical modification and functiona¬ 
lization, the possibility of de novo design of unique 
polypeptide sequences, and their potential biocompat¬ 
ibility, taken as a whole, make amyloid fibrils an excit¬ 
ing avenue of further enquiry. To date, amyloid and 
amyloid-like materials have been used as templates 
and components in nanoelectronic fabrication, as envi¬ 
ronmentally sensitive matrices to promote tissue 
regeneration and targeted drug release, and have pro¬ 
vided the architecture to drive the controlled assembly 
of photo-, electro-, and chemo-active molecules. 
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Synonyms 

Ion channels; Macroscopic ion currents 

Introduction 

Most cells have a range of ion channels within their 
membrane whose activities under a given condition dic¬ 
tate the macroscopic ionic current flowing across the 
membrane. Macroscopic currents can be recorded from 
a variety of cells using techniques such as ► two- 
electrode voltage-clamp, single-electrode switched 
clamp, and ► whole-cell patch clamp. With these tech¬ 
niques, the ionic current flowing through all open chan¬ 
nels present in the cell membrane, in addition to other 
components such as capacitive currents, can be measured. 
The recording and analysis of macroscopic currents gives 
information about the range, relative density, and kinetic 
behavior of channel types present in the membrane of 
a particular cell. For conciseness, this entry will be limited 
to discussion of the ionic component of macroscopic 
currents flowing through ► voltage-gated ion channels. 

A typical procedure used in the investigation of 
voltage-gated ion channels, once the voltage-clamp 
has been established, would be to apply step changes 
in the membrane potential (a pulse protocol) and 
record the resulting macroscopic currents. Following 
a voltage perturbation there will be a large, but ideally 
rapidly diminishing, capacitive current (/c); a smaller 
component termed leak current (7/^) which is 
a combination of ion leakage between the electrode/ 
membrane interface and background activity of volt- 
age-independent channels that may be present; and 
Anally a time-dependent component resulting from 
changes in the open probability of voltage-gated ion 
channels. By convention an inward flow of positive 
ions is termed negative and plotted downward, while 
an outward flow of positive ions is termed positive and 
plotted upward. An illustration of the current obtained 
following a voltage pulse from —80 to 0 mV in 


a hypothetical cell having voltage-dependent Na"^ and 
channels is shown in Fig. la. 

For a cell containing k types of ion channel the total 
ionic current. It, will be 

k 

/r=/c+/L + ^/y (1) 

,;=i 

where Iq is the capacitative current (= Cm.dVIdt), 
the leak current, and /, is the contribution from chan¬ 
nels j. For any particular type of ion channel (j) this 
contribution is given by: 

lj = NjijPoj (2) 

where /V, is the number of channels of type j in 
the membrane of the cell, /, is the single channel current 
of channel j, and Poj is the open probability of channel / 
under the recording conditions of the experiment. 
The single channel current is a function of voltage, 
i = y(y — EfiTv), where V is the )► membrane potential, 
Erev the reversal potential of the channel, and y the 
conductance of the channel; the open probability is 
a complex function of both voltage and time. When 
analyzing macroscopic currents, the initial goal is to 
focus on the channel of interest by eliminating, as far as 
possible, the contribution of all other components to the 
current measured. Once this has been achieved the total 
current approximates to that contributed by the channel 
of interest only (i.e.. It ~ //) and it becomes possible 
to investigate properties such as permeation and ► kinet¬ 
ics of these channels in isolation. 

Isolating the Current of Choice 

Subtraction of capacitative and leak currents. 
Capacitative and leak currents can generally be 
removed using a voltage pulse procedure based on 
the assumption that these currents are related linearly 
to voltage. A common approach is to use the Pin 
method where a test pulse of magnitude P, which 
activates voltage-dependent channels within the mem¬ 
brane, is preceded (or followed) by n leak pulses of 
magnitude Pin which are too small to activate these 
channels. The currents resulting from the leak pulses are 
then summed to give the capacitative and leak current 
trace expected for a test pulse of magnitude P. 
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Analysis of Macroscopic 
Currents, Fig. 1 (a) 

A voltage step from — 80 to 
0 mV induces an initial 
capacitative cuiTent (/c) 
followed by inward and 
outward ionic currents (a leak 
component (/J would also be 
present but is not easily 
obseiwed here), (b) A voltage 
protocol designed to remove 
Ic and II components. 
Summing the cuiTents induced 
by the Pin pulses and 
subtracting from the total 
current isolates the ionic 
current, (c) The voltage-gated 
Na"^ and cuiTents can be 
separated using TTX to 
eliminate the Na"^ current or 
a combination of intracellular 
Cs"^ and external TEA"^ to 
eliminate the current 






This scaled current can then be subtracted from 
the original current to leave only the ionic current com¬ 
ponent (see Fig. lb). 

Elimination of currents through other ion channels. 
Separation of currents flowing through different ion 
channels can be achieved by combinations of ionic 
substitution, pharmacological blocking agents, and 
choosing appropriate pulse protocols. As an example 
consider the case of the simple cell in Fig. 1 that has 


both ► voltage-gated Na"^ and ► voltage-gated 
channels in its membrane. Application of tetrodotoxin 
(TTX), a specific blocker of voltage-gated Na"^ 
channels, effectively eliminates this current, thereby 
enabling the current to be recorded in isolation. 
When using whole-cell recording, a popular method 
used to eliminate currents is to All patch pipettes 
with Cs"^, which does not permeate channels, also 
bathing the cell with the general channel blocker 
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Analysis of Macroscopic 
Currents, Fig. 2 Current- 
voltage curves of voltage¬ 
gated and voltage-gated 
Na"^ cuiTents. Panels (a) and 
(c) show overlaid current 
records over a range of pulse 
potentials (Vp) for and Na"^ 
currents, respectively. The 
relationships between pulse 
potentials and peak currents 
for these and Na"^ currents 
are shown in panels (b) and (d) 
(Panels (c) and (d) are 
reproduced from 
Chiamvimonvat et al. (1995) 
with permission) 


” AVp 
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tetraethylammonium (TEA"^) may be effective 
(Fig. Ic). More complicated cells with several different 
types of channels will require other pharmacological 
agents and pulse protocols designed to avoid activating 
some currents (see Standen et al. 1994). If the current 
under investigation is expressed specihcally in 
a cell line that contains relatively little native ion 
channels, then pharmacological elimination of 
unwanted currents may be unnecessary. 

Current-Voltage Curves 

Once the current of choice has been isolated, an 
informative start is to examine the relation between 
membrane potential and current magnitude by 
generating an I-V curve. Such plots give information 
about the ion(s) likely to carry the current, the voltage 
range over which the channels are active and, 
depending on the current, the reversal potential. 
This is illustrated in Fig. 2 for a voltage-dependent 
current from an arterial smooth muscle 


cell (unpublished observations) and for a 
voltage-dependent Na"^ current from a rat ventricular 
myocyte (Chiamvimonvat et al. 1995). The peak of 
each current plotted against the pulse potential 
that generated the current yields the I-V curves for 
the and Na"^ currents respectively (Fig. 2b, d). 
Examination of the I-V curve for the current 
shows that outward (positive) current only is seen 
which begins to appear at —30 mV and continues to 
increase as the voltage pulses become more positive. In 
contrast, the I-V curve for the Na"^ current shows a clear 
reversal at about -1-55 mV, a bit more negative than the 
Na"^ equilibrium potential (En^), indicating that the 
current here is not carried exclusively by Na"^ ions. 
At —60 mV there is a considerable driving force 
{V-Erev, = —115 mV), so the single channel current 
would be large and negative (inward). Since the 
whole-cell current recorded is very small at —60 mV, 
this implies that the channels responsible for this 
current are mostly closed at this potential {Po is very 
small). As the pulses become more positive, the 
magnitude of the current increases as more channels 
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open until the largest inward current occurs at 
about —15 mV. At this potential, the rate of increase 
in Po has reduced and the decrease in single channel 
current, as driving force declines, now dominates. 

Obtaining Information About 
Channel Activity 

Using certain voltage protocols enables information 
about the relative open probability of the channels 
responsible for the current and about the (scaled) single 
channel current-voltage curve to be extracted. This 
is possible because any change in open probability 
following a voltage perturbation takes a certain 
amount of time, whereas the single channel current 
changes instantaneously. This is illustrated in Fig. 3 
for a hypothetical cell (expressing Kv channels only) 
subjected to a dual pulse protocol. Following the initial 
pulse to VI the single channel current changes 
instantly while the open probability takes time to 
increase. The important feature to note is that because 
the open probability takes time to change, its value at 
the end of the first pulse (VI) is the same as at the 
beginning of the second pulse (V2). The sudden initial 
jump in whole-cell current between V1 and V2 reflects 
the instantaneous change in single channel current 
only, not a change in Po. The decline in current during 
V2, referred to commonly as a tail current, reflects the 
relaxation of Po to a new value; the single channel 
current (i) remains constant. This time dependence of 
open probability can be exploited to obtain informa¬ 
tion regarding the relative activity of the channels 
during voltage pulses to various potentials. 

Calculating Activation Curves Using 
Tail Currents 


0 - 

Vm (mV) 

-40- 



time (ms) 

Analysis of Macroscopic Currents, Fig. 3 Diagram to 
illustrate the changes occurring with time in single channel 
current, i, open probability, Po, and whole-cell current, /, 
following three voltage pulses to different potentials (indicated 
by the blue and red and dashed black traces) in a cell containing 
2,500 Kv channels. Note that maximum Po in this case is roughly 
0.8 which is reached at a potential of +5 mV (red trace). Increas¬ 
ing VI beyond +5 mV does not increase the maximum Po, 
but it does increase the rate at which maximum Po is reached. 
The difference in whole-cell current between the red and 
dashed black trace is due only to the difference in single channel 
current 


VI 


V2 




To obtain a more quantitative measure of how mem- current at V2 is zero, because at —40 mV, these chan- 

brane potential affects channel activity, it is necessary nels are closed (Po = 0). When VI is 0 mV, this opens 

to separate the effect of voltage on Po from that on i, some of the channels and a relatively large tail current is 
and it is possible to do that for certain currents by observed at V2. When V1 is increased from -1-20 mV 
measuring the initial amplitude of tail currents gener- onward, there is an increase in the current, but the 
ated at a chosen voltage (V2) following a series of resulting tail currents do not increase, indicating that at 

voltage steps (VI) as shown in Fig. 4. As discussed in these potentials the channels are maximally activated, 

the previous section, the initial value of the current at A plot of the normalized tail currents against V1 gives 

V2 reflects Po at the end of VI and the single channel the relationship between membrane potential and open 

current at V2. Hence when VI is—40 mV, the initial probability, termed an activation curve (Fig. 4c). 
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Analysis of Macroscopic 
Currents, Fig. 4 Measuring 
the activation curve of 
a smooth muscle Kv current 
using tail current analysis. 

(a) The pulse protocol and 
resulting whole-cell currents. 

(b) Expanded view of the gray 
rectangle showing how the tail 
current amplitudes reach 

a plateau regardless of VI 
becoming increasingly 
positive. The red lines show 
exponential fits to the currents 
and the value of the fits, 
extrapolated to the beginning 
of V2, were used to generate 
the activation curve, (c) Plot of 
the normalized tail current 
against VI potential. The fit of 
Eq. 3 to the data (red line) 
returned values of —13.6 and 
9.0 mV for V 1/2 and k 
respectively 



This plot has a sigmoidal form that can be approximated 
by a Boltzmann distribution described in the equation 
below: 

hail _ o _ 1 

max I,ail max Pg ^ j _(_ g^p ^ 

where V is membrane potential, I,aii the tail 
current magnitude at its beginning, V 112 the membrane 
potential at which activation is half maximal, and k is 
a factor determining the voltage dependence of activa¬ 
tion (see Aidley and Stanfield (1996) and Hille (2001) 
for further explanation of the Boltzmann distribution). 

Measuring Reversal Potential Using 
Tail Currents 

The I-V plot of Fig. 2b and the activation curve of Fig. 4 
indicate that these particular channels are normally 
closed at membrane potentials more negative than 
about —40 mV, which is why reversal of this current 
would not be seen in a conventional I-V plot. In this 
instance, to measure E/^ev it is necessary to force the 


channels to be open at potentials considerably more 
negative than —40 mV. This can be achieved by 
delivering an initial depolarizing pulse to a constant 
value (VI), which causes substantial activation of the 
current, followed by a second voltage step to various 
potentials (V2) as illustrated in Fig. 5. Because 
the channels remain open for a finite time following 
the step from VI to V2 (see Fig. 3), plotting the 
instantaneous tail current amplitude against V2 is 
equivalent to the single channel i-V plot scaled by 
a factor equal to the number of channels open at 
the end of pulse VI. E^ev is then obtained from the 
voltage at which the current is zero (i.e., it crosses the 
x-axis. Fig. 5b). 

Inactivation Curves 

It is clear from the profile of the traces for the Na"^ 
current of Fig. 2 that following an initial increase, the 
current declines despite the constant voltage during the 
pulse, a process termed inactivation. This occurs 
because the initial rise in Po is not maintained, as, 
once in the inactivated state, Na"^ channels are unable 
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Analysis of Macroscopic 
Currents, Fig. 5 Measuring 
EfiEv using tail currents. 

(a) The pulse protocol {top) 
and corresponding simulated 
Kv currents {bottom) at the 
concentrations indicated (out/ 
in), (b) Plots of the 
instantaneous tail current 
amplitudes against V2 
potential. The Ekev values, the 
voltages at which the plots 
cross the abscissa (zero 
current) for both conditions, 
are indicated 





to open despite the depolarized potential. The propor¬ 
tion of channels in the inactivated state, and therefore 
unavailable to open, can accumulate as the holding 
potential is made more positive. This proportion can 
be measured by using the pulse protocol shown in 
Fig. 6a where the test pulse is kept the same (0 in this 
case) while the holding potential (Vh) is altered. As the 
holding potential becomes more positive, the peak cur¬ 
rent elicited by the step to 0 mV decreases and a plot of 
the normalized peak current against holding potential 
yields the steady-state inactivation curve (Fig. 6b). This 
curve can also be approximated by a Boltzmann distri¬ 
bution (Eq. 3), but in this case, the value of k would be 
negative. 

Kinetics of Macroscopic Currents 

Following a depolarizing pulse of sufficient magni¬ 
tude, voltage-gated ion channels undergo several 
conformational changes, eventually leading to their 


opening which is observed as membrane ionic current. 
A simple quantification of the activation process can 
be achieved by fitting the following equation, based on 
the ► Hodgkin-Fluxley model of channel gating, to the 
rise time of the current at different step potentials; 



where /(f) is the current at time t (the beginning of the 
pulse is at f = 0), I^ax the maximum current, x is the 
time constant, and « is a factor, which for Kv currents 
has a value around 4, that is necessary to account for 
the lag in the rise of the current following the voltage 
step (Aidley and Stanheld 1996). The closing of 
the channels following repolarization is termed deac¬ 
tivation which is responsible for the decay of the tail 
currents illustrated in Figs. 4 and 5 and these can be ht 
with an exponential (or sum of exponentials) function, 
the time constant(s) of which gives a measure of the 
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Analysis of Macroscopic 
Currents, Fig. 6 (a) Pulse 
protocol (top) and 
corresponding simulated 
currents (bottom) designed to 
obtain a steady-state 
inactivation curve, (b) Plot of 
the normalized peak current 
against holding potential (Vh) 
gives the steady-state 
inactivation curve. The V 1/2 
value gives the membrane 
potential at which half the 
channels are in the inactivated 
state 
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rate of closing. A more rigorous approach is to fit 
a kinetic scheme directly to the current profiles as has 
been done for the Shaker Kv channel by Zagotta et al. 
(1994). In their analysis of the voltage dependence 
of steady-state Po, single channel kinetics, and 
macroscopic current kinetics, Zagotta et al. (1994) 
proposed that the four Kva subunits undertook two 
conformational steps each before the channel could 
open and a necessary additional closed state results in 
alb state model. Such models are best analyzed using 
the matrix approach described by Colquhoun and 
Hawkes (1995). 

Mean-Variance Plots 

Although macroscopic currents comprise the current 
flowing through many channels simultaneously, it is 
often possible to observe fluctuations in the 
signal resulting from the gating of the channels. 
These fluctuations can be a source of information 
about the unitary current amplitude and the number 
of channels present in a cell. For voltage-gated 
channels, this information may be obtained using 
nonstationary fluctuation analysis. In this method, the 
ensemble mean and variance of a series of current 
traces, resulting from repeated pulses to the same 
potential, are calculated and a mean-variance plot is 
generated with current on the abscissa and variance on 
the ordinate. The data in a mean-variance plot can be 
fitted with the following equation: 

(T^ = il -f/N = iI{l-P„) (5) 


where is the variance, ; the single channel current, 
Po the open probability, and N the number of channels 
present. This approach was used to estimate the 
number and unitary conductance of Na"^ channels in 
frog node of Ranvier by Sigworth (1980). Additional 
information on using fluctuations of macroscopic 
currents to study channel kinetics is given by Grey 
(1994) and Aidley and Stanfield (1996). 

Summary 

Macroscopic current is that measured flowing through 
all the channels in a cell membrane that are active under 
a particular experimental condition. To investigate 
a particular current, it is necessary to design a suitable 
voltage protocol and/or use pharmacological or ionic 
conditions that reduce contamination by other currents 
to a minimum. Recording whole-cell current is the 
method of choice to identify the range of channels 
present in the membrane of a particular cell. Although 
single channel recording can yield more detailed kinetic 
information, under many circumstances, it is beneficial 
to observe the response of a large population of chan¬ 
nels, for example, when measuring the rate of and 
recovery from inactivation of voltage-gated channels. 

Cross-References 

► Automated Patch Clamp: Advantages and 
Limitations 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 
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► Fluorescence Techniques for Studying Ion Channel 
Gating; VCF, FRET, and LRET 

► Kinetics: Overview 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Voltage-Gated Sodium and Calcium Channels 
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Synonyms 

Optical trap; Single-molecule methods; Torque 

Definition 

Angular Optical Trapping is a single-molecule manip¬ 
ulation technique employing laser-based optical trap¬ 
ping technology by which both torque and force can be 
directly exerted and measured on a trapped micropar¬ 
ticle. This method has applications in biopolymer 
physics, molecular motor mechanics, microrheology, 
and cell biology. 

Basic Characteristics 

Techniques allowing for the manipulation of single 
biological molecules have provided great insights 
into the mechanisms by which many cellular processes 
occur. Optical trapping, in particular, has been 
employed for nearly two decades to measure forces 
and monitor displacements with great accuracy and 
precision in a myriad of systems, ranging from cargo- 
transport motor activity to DNA-protein interactions. 
Many motors and biomolecules, however, experience 
torque and twist as well, necessitating the direct mea¬ 
surement of these parameters to fully dissect the 
mechanics of biological processes. Alternate tech¬ 
niques, such as magnetic tweezers, allow for the appli¬ 
cation of twist, but direct measurements of torque at 
high bandwidths have not been traditionally available 
with these methods. 

Applying Twist with Light 

Angular optical trapping is a recently developed tech¬ 
nique that expands the utility of optical trapping. 
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Angular Optical Trapping, 
Fig. 1 An angulai' optical 
trap, (a) Principles of torque 
generation on a birefringent 
particle by linearly polarized 
light, (b) Principles of torque 
detection by monitoring the 
change in an initially linearly 
polarized trapping beam’s 
ellipticity 


Extraordinary Ordinary 



Left circular polarization 


V 


Right circular polarization 


Optics: quarter-wave plate, 
polarizing BSC 



Elliptical output polarization 

Angular momentum 
change in beam 

Torque on particle 


Linear input polarization 


It allows for the simultaneous and direct measurement 
of not only force and displacement, but also torque and 
twist using a single Gaussian laser beam (La Porta and 
Wang 2004). To date, it has been primarily used to 
study the torsional response of DNA-based systems, 
although applying this technology to the study of 
molecular motors or other cellular components is fea¬ 
sible and will no doubt someday provide new dimen¬ 
sions of study into the mechanochemical properties of 
many such substrates. 

While there are multiple ways to achieve angular 
coupling between a spatially symmetric Gaussian opti¬ 
cal beam and its trapped micro-object, the basic prin¬ 
ciples remain the same: (1) the trapping light must 
have a well-dehned and pure polarization state with 
which the micron-scale particle can interact in 
a rotational manner and (2) some property of the par¬ 
ticle’s symmetry must be broken, providing 
a preferential axis with which the polarization of the 
light can couple (Fig. la, b). Angular optical trapping 
designs have employed both circular and linear polar¬ 
ization, each of which can provide some level of con¬ 
trol over the angular position or rotation rate of 
a particle (Friese et al. 1998; La Porta and Wang 
2004). It is generally preferable to use a purely linearly 
polarized beam and rotate the polarization with con¬ 
trollable direction and rates using standard optical 
methods. This allows for robust and flexible manipu¬ 
lation of trapping geometries and can allow the user to 
clamp and measure both angle and torque (Inman et al. 
2010; La Porta and Wang 2004). 


To achieve coupling between laser light and parti¬ 
cle, it is necessary to generate specialized particles 
designed with a preferential axis for rotational cou¬ 
pling. Most commonly, the particle’s optical symmetry 
is broken. This can be achieved by fabricating particles 
out of a birefringent crystalline material, such as 
quartz or calcite. Recently, the high throughput 
nanofabrication of quartz cylinders was demonstrated 
(Deufel et al. 2007). In this design, the extraordinary or 
optic axis is located perpendicular to the cylindrical 
axis, resulting in a stably trapped particle that can be 
readily rotated about its cylindrical axis (Fig. 2). Pro¬ 
tocols for functionalizing quartz are readily available, 
making specific attachment of biomolecules to fixed 
locations on the cylinder’s end surface easily achiev¬ 
able, ultimately allowing for the application of torque 
and force along well-defined and highly reproducible 
geometries (Deufel et al. 2007). The generation of 
trapping particles with cores composed of the calcium 
carbonate mineral vaterite and surface-derivatized for 
attachment to proteins has also been demonstrated 
(Vogel et al. 2009). In addition to optically anisotropic 
particles, the breaking of spatial symmetry by using 
oblong or flattened trapping particles leads to angular 
confinement as well (Oroszi et al. 2006). 

When held in an optical trap with linearly polarized 
light, an optically anisotropic birefringent particle will 
tend to align its more polarizable axis with the direc¬ 
tion of the electric field’s polarization, while at the 
same time experiencing confinement in all three spatial 
dimensions, just as in traditional optical trapping 
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Angular Optical Trapping, 
Fig. 2 Nanofabricated quailz 
cylinders for use in an angular 
optical trap (Deufel et al. 
2007) (a) After fabrication, 
a field of quartz pillars covers 
the surface of a wafer, (b) As 
in a., but with increased zoom, 
(c) After mechanical cleavage 
on a segment of the wafer’s 
surface, some pillars have 
been removed, (d) A single 
quartz cylinder after cleavage. 
One end of eaeh cylinder is 
functionalized for coupling to 
a biomolecule 
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configurations. In quartz, for example, the extraordi¬ 
nary axis is more easily polarized than the two ordinary 
axes and thus alignment of this unique axis with the 
electric field’s polarization results in the most favor¬ 
able trapping state. If the particle is twisted away from 
this configuration, for instance, by the action of an 
attached biological molecule, a restoring torque on 
the particle will be exerted by the light (Fig. la). At 
the same time, a linear displacement from the beam’s 
trapping center along the axial direction results in 
a restoring force exerted on the particle by the light. 
Therefore, application of both torque and force can be 
simultaneously realized with a single trapping 
configuration. 

Measurement of Torque 

In addition to the application of twist to a particle, it 
has been shown that one can directly obtain 
a quantitative measurement of torque by monitoring 
the polarization state of the trapping beam after it has 
interacted with the particle (La Porta and Wang 2004; 
Nieminen et al. 2001). It is common to employ posi¬ 
tion-sensitive detection of the transmitted beam or 
back-focal plane interferometry with standard optical 


tweezers to monitor the change in linear momentum of 
the laser light (thus, directly measuring Ihe force on the 
particle). Similarly, an angular optical trap can be 
equipped with specialized optics to monitor the change 
in the beam’s angular momentum and, thus, determine 
the torque exerted on the particle. Because the trapping 
beam is linearly polarized before it interacts with the 
particle, it carries zero net angular momentum. If the 
interaction between the trapping light and particle 
results in a torque on the particle, and correspondingly 
a torque on the trapping light, the transmitted beam 
will experience a change in angular momentum. This 
deviation will manifest itself as a change in the down¬ 
stream beam’s ellipticity. By employing a quarter- 
wave plate and polarizing beam-splitting cube in con¬ 
junction with power detectors, the resulting change in 
ellipticity can be directly measured at kilohertz rates 
(Fig. lb). In theory, the collection of all photons can be 
directly converted into torque information. In practice, 
however, due to imperfect optics and clipping of the 
beam by unavoidable optical apertures, a calibration 
must be performed in order to extract quantitative 
torque values with real units. The method of analyzing 
Brownian fluctuations of the particle’s angular posi¬ 
tion and exploiting the equipartition theorem is partic¬ 
ularly effective (La Porta and Wang 2004). 
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Angular Optical Trapping, Fig. 3 Use of an angular optical 
trap (Forth et al. 2008). (a) A DNA molecule is fixed between 
a nanofabricated quartz cylinder and a coverslip. The trap is used 


to simultaneously exert force and apply twist, while the mole¬ 
cule’s end-to-end extension and torque are measured, (b) Exam¬ 
ple data as a function of force 


Applications of Angular Optical Trapping 

To date, this emerging technology has been used to 
study the torsional properties of DNA (Fig. 3). 
The ability of the instrument to monitor torque has 
led to the first direct measurements of the torque- 
force relationship for plectoneme formation in 
supercoiled DNA. Additionally, the high bandwidth 
of the optical trapping technique has allowed for the 
determination of the buckling dynamics in overwound 
DNA molecules (Forth et al. 2008). The direct torque 
readout has also led to insights into the twist-stretch 
coupling properties of this helical biopolymer (Sheinin 
and Wang 2009). Future directions will include a 
focus on the characterization of DNA’s mechanical 
properties, as well as the determination of the 
torque-dependent interactions between proteins and 
DNA and the rotational characteristics of important 
motors, such as RNA polymerases, topoisomerases, 
and Fl-ATPase. 

Cross-References 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 


► Magnetic Tweezers 

► Optical Tweezers 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 

► Single-Molecule Methods 
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equilibration conditions, have the halogen atom dis¬ 
posed axially (Holland et al. 1967). More generally, 
this stereoelectronic effect relates to any system 
R-Y-C-Z, where Z is an electronegative atom and 
Y has a lone electron pair, wherein electronic factors 
favor the gauche (synclinal) disposition of R and 
X along the Y-C bond rather than the anti-arrangement 
anticipated from steric considerations. 
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► Nucleic Acid Linear Dichroism 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 


Anomeric Effect in Sugars 

Derek Horton 

Department of Chemistry, American University, 
Washington, DC, USA 

Synonyms 

Carbohydrates; Exo- and endo-anomeric effects; 
Oligosaccharide stereochemistry; a- and (3-glycosides 

Definition 

The anomeric effect in a pyranose sugar or derivative 
thereof, is the term, also referred to as the Edward- 
Lemieux effect (Edward 1955; Lemieux and Chii 
1958), that denotes the tendency of a polar group at 
the anomeric position to favor the axial, rather than the 
equatorial, orientation expected on purely steric 
grounds. Thus the more stable anomers of the 
peracylated glycopyranosyl halides, formed under 


Basic Characteristics 

The effect, as applied to the biophysics of monosac¬ 
charides, influences the favored orientation at the 
anomeric position, and the favored orientation of the 
interglycosidic linkage in oligo- and polysaccharides. 
For the free aldopyranose sugars in aqueous solution, 
the anomeric effect alone would cause the a anomer to 
be favored, but hydrogen-bonding factors related to the 
network structure of water appear to stabilize the 
1-hydroxyl group in equatorial orientation. This factor 
outweighs the anomeric effect, and the net effect is that 
the P anomer of glucopyranose is more stable in 
aqueous solution than the a anomer. 

The anomeric effect has been attributed to a favored 
orientation of the C-Z dipole when it is roughly 
opposed to the net dipole of the lone pairs on the 
Y atom, in contrast to the less favorable parallel orien¬ 
tation of these dipoles when Z is equatorial. Other 
explanations are based on n-u* interactions between 
a lone electron pair on the Y atom and an unoccupied 
antibonding molecular orbital on the C atom. 

An extension of the anomeric effect, termed the 
“exo-anomeric effect” (Lemieux et al. 1979; Tvaroska 
and Bleha 1969; Gold Book 2010), determines the 
favored orientation of rotamers in glycosides, and 
the connecting linkage in oligosaccharides, wherein 
the 0-1-aglycon bond likewise favors that orientation 
in which the dipoles are approximately opposed, rather 
than parallel. These factors establish the favored (p and 
v|/ angles connecting the adjacent residues in oligo- 
and polysaccharides, and thus relate to the favored 
macromolecular conformations of saccharide chains. 

A “reverse anomeric effect,” in which a cationic 
group (quaternary ammonium) at C-1 of an 
aldopyranose derivative reverses the C-Z dipole and 
consequently causes the equatorial orientation to be 
favored, has been suggested, but the generality of the 
concept has been questioned. 
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Anomeric Effect 



Exoanomeric Effect 


Anomeric Effect in Sugars, Fig. 1 


Cross-References 

► Glycan-to-Protein Linkages 

► Glycoproteins 
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Definition 

Brownian motion is typically considered an inescapable 
attribute of small particles in free solution. This random 
jiggling often impedes or prohibits optical studies of the 
behavior of nanometer-sized objects, such as single 
molecules, because such objects quickly run away 
from the observation region. Anti-Brownian traps 
partially suppress Brownian motion by combining sin¬ 
gle-particle position tracking in space with real-time 
feedback to apply forces that counteract the Brownian 
motion, enabling extended study without surface attach¬ 
ment or encapsulation, both of which could perturb the 
object. A powerful single-molecule method, anti- 
Brownian traps have been used to hold particles ranging 
in size from individual fluorescent dyes (Fields and 
Cohen 2011) to human cells (Armani et al. 2006). 
These traps vary in their method of tracking, the source 
of the feedback, and the number of spatial dimensions 
along which the particle is trapped, and often are asso¬ 
ciated with additional imaging and spectroscopic probes 
used to extract information from the trapped object as in 
single-molecule spectroscopy (Fig. 1). 

Tracking 

Thermally agitated solvent molecules undergo random 
collisions with any particle (molecule or colloid) 
immersed in the solution. In the absence of other 
forces, these collisions cause the particle to undergo 
a random walk with a mean-square displacement along 
each axis that increases linearly with time: 

(d^) = 2Dt. 

The diffusion coefficient, D, is related to the prop¬ 
erties of the particle and the solution by the Stokes- 
Einstein formula: 


67177a ’ 

where kg is the Boltzmann constant, T is the absolute 
temperature, 77 is the viscosity of the solution, and a is 
the hydrodynamic radius of the particle (equal to the 
physical radius for a spherical particle). 

Since smaller objects have larger diffusion coeffi¬ 
cients, the tracking system must locate a particle 
quickly and accurately. These two demands (speed 
and accuracy) are at odds with each other because the 



a 


Anti-Brownian Traps, Fig. 1 Feedback principle underlying 
anti-Brownian traps. Optical imaging is used to determine the 
location of an object in solution. This position information is 
used to produce electrokinetic restoring forces by applying volt¬ 
ages to electrodes, only two of which are shown; alternatively, 
translation-based feedback can be used 


uncertainty in localization due to shot noise decreases 
with greater integration time. The optimal integration 
time for the tracking system is identified by balancing 
the competing demands of speed and accuracy (Fields 
and Cohen 2010). 

A variety of tracking systems have been used in anti- 
Brownian traps. The conceptually simplest systems use 
a video camera and particle-tracking software, with 
either fluorescence or bright-fleld imaging (Armani 
et al. 2006; Cohen and Moemer 2005; Cohen and 
Moemer 2006; Cohen and Moemer 2008). Camera- 
based systems are flexible and relatively simple to set 
up, but the feedback bandwidth is often limited by the 
electronics in either the camera or computer. More 
recent designs have performed the tracking entirely in 
hardware. These traps used laser scanning coupled with 
time-resolved detection of fluorescence on a point¬ 
sensor (Fields and Cohen 2011; Cohen and Moemer 
2008; Berglund and Mabuchi 2004; Berglund et al. 
2007; Goldsmith and Moemer 2010; Wang and 
Moemer 2010), or multiple avalanche photodiodes in 
an image plane (Lessard et al. 2007; Cang et al. 2007). 

Feedback 

One must exert forces on a molecule to counter its 
Brownian motion. Electrokinetic feedback, used in 
the ABEL trap (for anti-Brownian ELectrokinetic 
trap, Cohen and Moemer 2005), is performed by 
applying voltages to a sample contained within 
a micro- or nano-fabricated sample cell. These volt¬ 
ages generate electrophoretic and electroosmotic 
forces that together move the particle. The relative 
contributions of electrophoresis and electroosmosis 















A 


96 


Anti-Brownian Traps 


depend on the charge of the particle, the surface charge 
of the trap, and the geometry of the trap. Electrokinetic 
feedback is sufficiently fast to trap individual 
fluorophores in water, but is limited to homogeneous 
solutions of low to moderate ionic strength. Further¬ 
more, fabrication of microfluidic sample cells requires 
use of costly and specialized equipment. 

Translation-based feedback is another means of 
countering Brownian motion. Here, the entire sample 
is translated to keep the trapped particle approximately 
stationary in the laboratory frame of reference. 
Translation-based feedback is typically implemented 
with a piezoelectric translation stage in two or three 
dimensions. This approach allows long-time observation 
in complex environments such as cells, but is typically 
too slow to trap very small objects in aqueous buffer. 
Translation-based feedback does not reposition the par¬ 
ticle relative to objects in its immediate environment, 
because all parts of the sample are translated by the same 
amount. Furthermore, translation feedback provides no 
information on the charge state of the particle. 

Generalizations 

Most anti-Brownian traps use fluorescence and trap 
only one particle at a time. However, tracking systems 
based on two-photon fluorescence (Levi et al. 2005) 
and scattering (Gang et al. 2006) have also been dem¬ 
onstrated. The original anti-Brownian traps held single 
particles within a two-dimensional region, but later 
designs extended the concept to multiple particles 
(Armani et al. 2006) and to three dimensions 
(Berglund et al. 2007; Gang et al. 2006). 

Applications 

Anti-Brownian traps have been used to study the 
polymer dynamics of DNA (Gohen and Moerner 
2007), the photophysics of phycobiliproteins such as 
allophycocyanin (Goldsmith and Moemer 2010), 
orientation-dependent scattering in metallic 
nanostructures (Gang et al. 2008), and the internaliza¬ 
tion of particles by cells (Wells et al. 2008). 

Comparison to Optical Tweezers 

Optical force traps, also known as laser tweezers, are 
widely used to manipulate sub-micron-sized colloidal 
particles for use in biophysics experiments. The trap¬ 
ping strength of laser tweezers is roughly proportional 
to the volume of the object being trapped, so laser 
tweezers are incapable of trapping particles smaller 


than 100 nm in diameter with biologically safe laser 
powers. Anti-Brownian traps outperform laser twee¬ 
zers for trapping particles smaller than 100 nm, while 
typically laser tweezers are more effective for larger 
particles. 


Cross-References 

► Optical Trap 

► Optical Tweezers 

► Single-Molecule Methods 

► Single Particle Tracking 

► Single-Molecule Spectroscopy 
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Basic Characteristics 

NMR provides a unique perspective of protein 
folding and unfolding, enzymatic action, protein-pro¬ 
tein, protein-lipid, or protein-ligand interactions, as 
well as aggregation and fibrillation in both soluble 
and membrane protein systems. In this entry, we 
touch upon two challenges associated with the appli¬ 
cation of NMR to the study of proteins: 
(1) Assigning *'^F NMR spectra and (2) Biosynthetic 
labeling and the structure perturbations which may 
result from fluorinated analogs. 

Many of the successes observed with NMR stud¬ 
ies of protein function are due to the inherent sensitivity 
of the fluorine chemical shift to its local environment; it 
is not uncommon that a given fluorinated residue may 
exhibit a dispersion of 10-20 ppm. The fluorine chemical 
shift is primarily influenced by a large paramagnetic 
term, originating from an unpaired electron, making it 
sensitive to local van der Waals interactions and electro¬ 
static fields (Gerig 1994). The position of a *^F resonance 
relative to the corresponding denatured chemical shift 
position correlates well with burial in proteins and even 
slight differences in the dielectric properties of H 2 O and 
D 2 O manifest themselves as different *'^F chemical shifts 
for water-exposed *^F probes. However, it is difficult 
to predict '®F NMR chemical shifts from known protein 
structures using current ab initio methods and assign¬ 
ment is therefore usually accomplished through 
mutagenesis. 

F Assignment via NOE Contacts to the Backbone 

We consider a 148-residue calcium-binding protein 
(calmodulin, or CaM) as a test case for the feasibility 
of assigning *®F resonances without mutagenesis. In 
this study, we have replaced each of the eight phenyl¬ 
alanines with 3-fluorophenylalanine, (i.e., F12, F16, 
F19, F65, and F68 in the N-terminal domain and F89, 
F92, and F141 in the C-terminal domain, as shown in 
the ribbon diagram in Fig. 1). Phenylalanine residues 
contribute to the stabilization of the hydrophobic core 
and have been implicated in peptide binding and 
activation. The general strategy is to combine tradi¬ 
tional backbone assignment protocols with side chain 
correlation experiments, which couple to the fluorine 
resonances. As shown in Fig. 2 (Kitevski-Leblanc et al. 
2010), the 3-fluorophenylalanine resonance may be 
easily correlated to the H5 resonance via a 
HSQC, which achieves magnetization transfer through 


F65 



Assignment of Resonances in Protein Solution 
State NMR Studies, Fig. 1 Structure of calmodulin (PDB 
file 3CLN) with location of eight phenylalanine residues 
indicated 

a ^10 Hz 3-bond coupling. As shown in Fig. 2, the H§ 
and HP resonances may then be correlated through 
a *®F-edited NOESY-HSQC, which again utilizes an 
INEPT transfer from the H5 to the *®P resonance. 
Einally, the HP resonance is correlated to the n - 1 - 1 
backbone amide through a ^H-'^N NOESY-HSQC. 
The approach requires simply that the protein is *^N 
labeled although an intraresidue NOE peak to the 
phenylalanine backbone is not detected since 
3-fluorophenylalanine is not *^N-labeled. Similar 
assignment experiments have also been performed 
with fluoroaliphatics. In 5-fluoroleucine labeled 
ubiquitin, sequential backbone assignments based on 
TOCSY and NOESY experiments were combined with 
*H-*^E HMQC experiments to obtain assignments 
(Alexeev et al. 2003). 

NMR Assignments Via Scalar Couplings 

Assignments of fluorine side chain resonances may also 
be achieved by magnetization transfer through scalar 
couplings. A multinuclear assignment approach was 
developed using *^C, *^N-enriched-L-3-fluorotyrosine, 
synthesized by direct fluorination of the isotopically 
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Assignment of 

Resonances in Protein 
Solution State NMR 
Studies, Fig. 2 NMR-based 
assignment strategy for 
phenylalanine 92. (a) 

FISQC spectrum. The e, 5, and 
^ proton cross peaks are 
indicated for F92. (b) 'FI-**F 
NOESY-FISQC spectrum. The 
experiment was collected as 
a 2D (i.e., without evolving the 
’^F dimension) and shown are 
the first indirect and direct 
detect dimensions. The cross 
peak highlighted corresponds 
to the HP ofF92. (c) A 2D 
plane corresponding to a 15N 
shift of 117.21 ppm, from a 3D 
NOESY-HSQC 
spectrum. The cross peak 
indicated corresponds to the 
amide proton of aspartic acid 
93 (i.e., the n + 1 residue with 
respect to F92). All 
experiments were run at 37° C 
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Assignment of Resonances in Protein Solution 
State NMR Studies, Fig. 3 NMR-based assignment strategy 
for ''^C/^N-enriched 3-fluorotyrosine labeled calmodulin. 


Evolved nuclei are indicated with blue circles, (a) CT-HCCF- 
COSY (b) (HP)C/1(CyCS)//(5 (c) HNCACB. Experiments were 
run at 37° C 


enriched amino acid (Kitevski-Leblanc et al. 2009). The 
assignment strategy utilized the large and 

scalar couplings as outlined in Fig. 3. The 
correlation of the fluorine atom to the aromatic H§ 


proton was completed using an HCCF-COSY experi¬ 
ment. Connection to the backbone was then established 
via (HP)CP(CsC5)H5 (Yamazaki et al. 1993) and 
FINCACB experiments. In both cases, the assignment 
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Assignment of 

Resonances in Protein 
Solution State NMR 
Studies, Fig. 4 '*F (a) and 

HSQC (b) NMR spectra 
at 3-fluorophenylalanine 
enrichment levels of 100% 
enrichment and 60%. Note that 
the ''^N,'H line widths are 
comparable to the unlabeled 
species at enrichment levels of 
60% or less, while the 19F 
NMR spectra represent eight 
well-resolved Lorentzian lines. 
Fractional enrichment 
significantly improves protein 
stability and magnetization 
transfer schemes such as those 
discussed in this entry 





8.20 7.80 7.40 


ppm 


was completed using amide chemical shifts known 
from the literature, which if unavailable can be 
acquired experimentally with an appropriately labeled 
sample and well-established backbone assignment pro¬ 
tocols (Gardner and Kay 1998). Thus, there are a number 
of options to consider when assigning NMR reso¬ 
nances from biosynthetically labeled proteins. All 
approaches rely on the scalar couplings to the directly 
bound species or 3-bond scalar couplings to 'H 

nuclei. editing is also important in order that the 

NOESY cross peaks can be correlated to the ^^F reso¬ 
nance of interest. 


Fractional Labeling 

Finally, we address a common dilemma encountered in 
*^F NMR studies of proteins - namely, structure per¬ 
turbations which arise from introducing fluorinated 
amino acid analogs. Such perturbations can be easily 
identified by inspecting the backbone *^N,*H HSQC 
spectra as shown in Fig. 4, for 3-fluorophenylalanine- 
enriched calmodulin. Others have reported in detail on 
the cumulative perturbations, in terms of the 
fluorophobic effect, arising from the introduction of 
'®F atoms to proteins. In our studies of calmodulin, we 
noted that by decreasing the labeling level to ~60% 
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random fluorine enrichment, using a combination of 
native and fluorinated amino acids during expression, 
no structure perturbations of the protein were 
observed, as judged by the HSQC and associ¬ 

ated spectra, which were nearly identical to spectra of 
the wild type calmodulin. 

In summary, while mutagenesis may be used to 
assign resonances, modem heteronuclear experi¬ 
ments, which make use of either scalar couplings or 
NOESY-based transfers, may be used to obtain assign¬ 
ments of NMR spectra, without the need for mul¬ 
tiple samples. We also anticipate that in many cases, 
the spectral quality in many NMR protein studies 
may be significantly improved by reducing the ^^F 
enrichment levels to a point where there are no longer 
significant cumulative structure perturbations. This 
point is usually determined by screening 
NMR spectra as a function of ^^F enrichment levels. 

Cross-References 

► In-Cell NMR 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 
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Synonyms 

Simultaneous topography and recognition imaging 
(TREC); Single-molecule methods 

Definition 

High-resolution AFM topography imaging becomes 
a powerful bioanalytic tool when combined with single¬ 
molecule force spectroscopy or with simultaneous 
“topography and recognition imaging” (TREC). This 
new imaging method allows for mapping of specific 
ligand binding sites on biological samples under physio¬ 
logical conditions with nanometer resolution. 
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Atomic Force Microscopy 
(AFM) for Topography and 
Recognition Imaging 
at Single Molecule Level, 
Fig. 1 Principal operation of 
various AFM and its modes. 
Schematic representation of 
(a) commercial atomic force 
microscopy, (b) contact mode 
imaging, and (c) tapping mode 
imaging 




J I 

AFM image 



AFM image 


Introduction 

Atomic force microcopy (AFM) is a version member of 
scanning probe microscopy (SPM) that is extensively 
used in solid-state physics and materials science within 
the life sciences because it can be operated in physio¬ 
logical salt solution. In AFM imaging, a sharp probe tip 
mounted on a microcantilever scans over the specimen 
line by line, whereby the topographic image of the 
sample surface is generated by “feeling” rather than 
“looking.” Tips with high sharpness provide high reso¬ 
lution and cantilevers with low spring constants allow 
precise control of the forces between the tip and sample. 
These forces cause the cantilever to bend or deflect. The 
deflection is monitored by a laser that is focused on the 
backside of the very end of the cantilever, and from there 
it is reflected onto a split photodiode. The movement of 
the laser spot on the photodiode causes an electric signal 
change that is used in a feedback loop for keeping the 
exerted force on the AFM tip constant. Within the 
feedback loop, voltage is applied to a piezoelectric scan¬ 
ner, which moves the tip or the sample up and down. 
The vertical movements of the piezo allow a computer 
to generate a map of surface topography (Fig. la). 

In the last decade, AFM has widely been applied to 
biological samples ranging from single molecules, such 
as proteins and nucleic acid, to macromolecular assem¬ 
blies and whole cells, due to the fact that subnanometer 
resolution is achieved under physiological conditions 
without extensive sample preparation or labeling 


(Horber and Miles 2003; Francis et al. 2009; Muller 
2008). Topographical imaging of these samples can be 
done in various modes in accordance with the experi¬ 
mental requirements, which can be divided into contact 
and dynamic mode AFM imaging. 

Contact Mode Imaging 

In contact mode imaging, the tip is in permanent phys¬ 
ical contact with the sample surface (Fig. lb). The 
scanner traces the tip over the surface and the contact 
forces cause the cantilever to bend upon changes in 
topography. The deflection signal is held at a constant 
value by the feedback loop; thus, constant force is 
exerted on the specimen. The larger the amplitude by 
which the cantilever is bent, the higher is the imaging 
force. With an AFM operated in the contact mode, 
topographic images with a vertical resolution of less 
than 0.1 nm and a lateral resolution of about 0.2 nm 
have been obtained (Bhushan and Marti 2010). Contact 
mode AFM imaging of soft biological samples requires 
delicate adjustment of the force set point, so as to opti¬ 
mize image contrast and reduce damage to the sample. 

Dynamic Mode Imaging 

Force-sensitive samples such as weakly attached cells 
can be pushed away by contact mode imaging and 
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continuous contact with the tip might damage living 
cells (You et al. 2000). Therefore, dynamic force 
microscopy (DFM) imaging methods like tapping 
mode (Putman et al. 1994) and magnetic AC mode 
(MAC mode) AFM (Han et al. 1997) imaging are 
more preferable for imaging weakly attached and/or 
soft cell samples. 

In tapping mode, the AFM tip is oscillated near its 
resonance frequency. The vibration of the cantilever is 
achieved by oscillation of a piezo-actuator. While it scans 
over the surface, it touches the sample only intermittently 
at the end of its downward movements which results in 
an amplitude reduction upon sample contact (Fig. Ic). In 
order to apply a constant force on the sample, the ampli¬ 
tude reduction upon intermittent tip-surface contact is 
held constant in the feedback loop. Since the tip touches 
the sample surface at the very end of its downward 
movement only, the lateral forces acting on the sample 
are greatly reduced and result in less sample damage. 

In magnetic AC mode (MAC mode) AFM, the oscil¬ 
lation of the magnetically coated cantilever is caused by 
an alternating magnetic held generated in a coil beneath 
the sample stage. The cantilever is directly driven by 
a magnetic force, which results in a sinusoidal oscilla¬ 
tion with dehned frequency, and the tuning curve con¬ 
tains only one distinct resonance peak (Han et al. 1996). 
Since the magnetic held drives the cantilever directly in 
this new type of scanning mode, the background reso¬ 
nances are absent and the signal-to-noise ratio is 
improved in comparison to acoustic oscillation modes. 
The amplitude-frequency curve closely follows the 
characteristics of a damped harmonic oscillator. Ampli¬ 
tude and frequency of the cantilever oscillation can 
therefore easily be adjusted by the coil current magni¬ 
tude and frequency. This allows for a sensitive adjust¬ 
ment of oscillation frequency and of the related phase 
response, a prerequisite for high-resolution imaging 
(Raab et al. 1999). MAC mode AFM is a powerful 
imaging technique to reveal membrane features such 
as lamellipodia, cytoskeleton hbers, F-actin hlaments, 
and small globular structures on soft and weakly 
attached cells with a resolution of 5 nm on hxed and 
20 nm on living cells under physiological conditions. 

Atomic Force Spectroscopy 

In live cells, molecular recognition events between 
receptors and their cognate ligands initiate many 


important biological processes, including genome rep¬ 
lication, gene expression, enzymatic activity, immune 
responses, and other cellular processes. A number of 
techniques are presently available to investigate inter- 
and intramolecular interaction forces of biomolecules 
at the single-molecule level. These techniques span 
a measurable force window ranging from entropic 
forces at several femto Newtons (IfN = 10“*^ N) up 
to the rupture of covalent bonds at several nano 
Newtons (1 nN = 10“^ N) (Florin et al. 1994; Lee 
et al. 1994; Hinterdorfer et al. 1996). Since AFM offers 
one of the smallest force sensors, it is currently the only 
technique that allows for mapping and analysis of 
single receptor molecules with a lateral resolution at 
the nanometer (nm) scale. 

In contrast to conventional AFM imaging modes, 
single-molecule force spectroscopy requires an 
upgraded sensor design. Here, a single biological 
ligand molecule has to be bound to the outer apex of 
the AFM tip by using a flexible linker in order to study 
and quantify binding interactions to surface-bound 
receptors. The functionalization of AFM tips with flex¬ 
ibly attached probe molecules can be accomplished 
using different strategies, the choice depending on 
the tip surface material and on the ligand to be bound 
(Ebner et al. 2008). 

The force between a ligand-functionalized AFM tip 
and target surface with attached receptor molecules is 
monitored in a so-called force-distance cycle (Fig. 2) 
by moving the tip down (“trace”) and up (“retrace”) at 
constant vertical speed and at a fixed lateral position. 
During most of the approach phase period (Fig. 2, trace 
points 1-3), the cantilever deflection remains zero 
because neither the tip nor the tip-bound sensor mole¬ 
cule reach the sample surface (Fig. 2, trace point 1). 
After tip-surface contact (Fig. 2, trace point 2), the 
cantilever bends upward, consistent with a repulsive 
force that linearly increases with progressive “negative 
distance” (Fig. 2, trace point 3). Subsequent 
tip-surface retraction (Fig. 2, retrace point 4-5) first 
leads to relaxation of the cantilever deflection until the 
repulsive force drops to zero. Upon further retraction, 
the binding between the ligand on the tip and the 
receptor on the surface exerts a progressive force that 
bends the cantilever downward (Fig. 2, retrace point 
4). Since the ligand is tethered to the tip surface 
through a flexible polymer linker, the shape of 
the retract curve shows parabolic-like (nonlinear) 
characteristics (retrace, point 4), reflecting an increase 
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Atomic Force Microscopy 
(AFM) for Topography and 
Recognition Imaging 
at Single Molecule Level, 
Fig. 2 Schematics of a force- 
distance cycle 



of the spring constant of the linker during extension. 
The downward bending of the retracting cantilever 
continues to rise until the ligand-receptor complex 
finally dissociates at a certain critical force called 
unbinding force, /y. Unbinding of the complex is 
indicated by a sharp spike in the retraction curve that 
reflects an abrupt jump of the cantilever to its resting 
position (retrace, point 5). Specificity of binding is 
usually demonstrated by block experiments in which 
free ligands are added into the solution to bind to the 
receptor sites on the sample surface, or by adding free 
receptor molecules that can attach to the ligand on the 
AFM tip. After such specific block, the retrace curve 
will look very much like the trace curve (Fig. 2, inset), 
i.e., only repulsive forces but no downward deflection 
of the cantilever is observed. 

Recently, recognition force spectroscopy has been 
applied to a variety of biological systems. The interac¬ 
tion of human rhinovirus with lipoprotein receptors 
(which is the first step for an infection) was investi¬ 
gated by force spectroscopy (Rankl et al. 2008). For 
this purpose, the virus was tethered to an AFM canti¬ 
lever using a heterobifunctional crosslinker with two 
amino-reactive termini (Ebner et al. 2008) and the 
receptors were bound to a mica surface using the 
same chemistry (Fig. 3a). Flence it was possible to 
count the number of receptor molecules interacting 
with the virus and to estimate kinetic rates. The 


measured unbinding forces using such samples showed 
four equidistant force levels (Fig. 3b) which were 
attributed to binding of 1-4 receptors to a single 
virus. This explanation was supported by the fact that 
the observed number of receptors bound per virus 
particle increased with longer waiting times of the tip 
at the surface (Fig. 3c, e). The analysis of such wait 
time experiments allowed us to determine the kinetic 
binding rate constant to be 10® M~ s“\ In addition, it 
was found that the characteristic time for virus- 
receptor binding is around 20 ms. By using dynamic 
force spectroscopy, i.e., measurement of unbinding 
force as a function of pulling velocity (Fig. 3d), the 
kinetic dissociation rate constant was determined to be 
0.05 s“*, yielding a dissociation equilibrium constant 
of 24 nM (koff/kon). 


Simultaneous Topography and Recognition 
Imaging 

Identification and localization of specific binding sites 
over biological sample surfaces with high spatial accu¬ 
racy is an important objective in the life sciences. Micros¬ 
copy techniques, such as epi-fluorescence microcopy, 
photo-activated localization microscopy (PALM), 
stimulated emission depletion microscopy (STED), 














A 


106 


Atomic Force Microscopy (AFM) for Topography and Recognition imaging at Single Molecule Level 




r 


“NHS” 

o 


PEG 

HN o 

r 


orH 
laCNBH, 


b 400 


200 


-200 


-400 









20 40 60 

Piezo Movement [nM] 

Contact time 17 ms 


80 




Contact time 31 ms 



Atomic Force Microscopy (AFM) for Topography and Rec¬ 
ognition Imaging at Single Molecule Level, Fig. 3 Molec¬ 
ular recognition force measurements of human rhinovirus and its 
receptor, (a) Schematic representation of the experimental con¬ 
figuration used. The human rhinovirus was tethered to the tip 
using a hetero-bifunctional crosslinker. The receptor was teth¬ 
ered to mica using the same chemistry, (b) Typical force-dis¬ 
tance cycles showing the nonlinear stretching of the tether, 
allowing for discriminating between specific and unspecific rec¬ 
ognition events. Force curves with instantaneous rupture of 
single (red), double (green), triple (blue), and quadruple 
(cyan) bonds are shown, (c) Dynamic force spectra of one 
(solid squares), two (empty squares), and three (solid circles) 
receptor molecules bound to a human rhino virus. The results 


were well represented by a Markov binding model for the double 
bond (green) and triple bond (blue) interaction using parameters 
derived from the single baiTier model fit (red). The eiTor bars 
account for the uncertainty in the determination of the 
spring constant and the uncertainty in finding the most probable 
rupture force, (d-re) Distribution of rupture forces (black line) 
as a function of the contact time. For a short contact 
time (d) most of the ruptures represented single receptors 
bound to a virus with a few double receptor events. Increasing 
the dwell time on the surface resulted in higher force 
rupture events, indicating binding of several receptor molecules. 
The force distributions were fitted using a sum of 
Gaussians (gray), which almost perfectly reproduced the exper¬ 
imental data 


single particle tracking, single dye tracing, or scan¬ 
ning electron microscopy, have the drawbacks of 
limited resolution, lack of topographic information, 
and/or inapplicability under physiological condi¬ 
tions. AFM renders possible high-resolution topo¬ 
graphical images at the nanometer scale, combined 
with single-molecule recognition under ambient 
conditions. The fastest and most straightforward 
method in this respect is the so-called TREC mode. 
The name stands for simultaneous acquisition of 
“Topography and RECognition.” In this AFM 
mode, the surface of a biological specimen is 
scanned with a biofunctionalized tip at regular imag¬ 
ing speed, yielding a map of specific ligand-binding 


sites together with a topographic image (Stroh et al. 
2004a, b; Hinterdorfer and Dufrene 2006). 

The operating principle of TREC is based on the 
MAC mode, in combination with a ligand that is cova¬ 
lently bound to the AFM tip via a flexible PEG linker 
of defined length (~6 nm, Stroh et al. 2004a, b). This 
functionalized cantilever is oscillated close to its reso¬ 
nance frequency while scanning across a surface 
exhibiting binding sites (receptors) for the ligand. In 
recognition imaging, the cantilever oscillation ampli¬ 
tude is divided into two parts (i.e., lower and upper 
parts with respect to the baseline of the oscillation) and 
processed in different paths by using a specially 
designed electronic circuit called TREC box. While 
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Atomic Force Microscopy (AFM) for Topography and Rec¬ 
ognition Imaging at Single Molecule Level, Fig. 4 Princi¬ 
ple of TREC. (a) The cantilever oscillation is divided into two 
parts in the TREC box. While the envelope of the upper part 
yields the recognition image, the lower part provides for the 


topography image, (b) The characteristic oscillation signal of 
a bare {upper panel) and a HyHEL-5 antibody-functionalized 
AFM tip (lower panel) on a sample surface containing lyso¬ 
zyme molecules is shown. Eigures adapted from Stroh et al. 
2004a 


the lower part of the signal is used for generation of the 
topography image, the upper part reflects recognition 
events and gives the recognition image (Fig. 4a). 
Figure 4b shows the results of full scan lines with 
a bare and an antibody-functionalized AFM tip 
(top and bottom panels, respectively). In both panels, 
the lower envelope of the cantilever oscillation 
provides the information of the surface topography. 
The upper envelope of cantilever oscillation reflects 
transient binding to immobilized antigens on the 
sample surface; thus it is only seen in the lower 
panel, measured with an antibody-functionalized tip. 

By using a cantilever with a low Q-factor (the 
Q-factor divided by its resonance frequency represents 
the “memory” (characteristic time constant) of the 
cantilever) the amplitude reduction in the lower part 
of the oscillation (originating from a change in the 
topography) is sufficiently separated in time from 
amplitude reductions in the upper part of the oscilla¬ 
tion (originating from the linker stretching, which only 
occurs during molecular recognition between the 
ligand and the receptor). Consequently, only 
the lower part of the sinusoidal oscillation is fed into 
the feedback loop and is thereby held constant to 
obtain the unbiased surface topography. The upper 
part of the oscillation, solely containing information 
on recognition between ligand and receptor, is 


recorded to generate a recognition image simulta¬ 
neously to the topography image. 

In order to obtain reliable TREC images, three 
important imaging parameters have to be properly 
adjusted; feedback loop, oscillation amplitude, and 
driving frequency. First, the modified “half amplitude” 
feedback loop has to be used to measure the true surface 
topography during TREC imaging. In MAC mode AFM 
with a bare AFM tip, the peak-to-peak value of the 
oscillating amplitude (“full amplitude” -FA) is utilized 
as feedback parameter to obtain the sample topography. 
However, when a ligand-functionalized AFM tip is 
used, both topographical features (at the bottom peaks) 
and molecular recognition events (at the top peaks) 
affect the value of the full amplitude. Using the conven¬ 
tional feedback loop for TREC, therefore, leads to errors 
in the topographical image since the feedback itself 
cannot discriminate between different contributions to 
the amplitude reduction (Preiner et al. 2009). The 
stretching of the polymer linker exhibits a strong 
nonlinear behavior so that for small linker extensions 
the force acting on the cantilever is negligible. It is, 
however, much higher at extensions approaching the 
linker’s contour length. Consequently, there is almost 
no linker force acting on the cantilever during the lower 
part of oscillation. This part is only affected by changes 
in topography, and is therefore used as feedback 
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parameter in the so-called half amplitude (HA) feedback 
loop, yielding the unbiased surface topography. 

The second important TREC imaging parameter is 
the oscillation amplitude. When the ligand on the tip is 
bound to a receptor on the sample during a single line 
scan, a signihcant TREC signal is only generated if this 
ligand-receptor complex persists until the tip has 
moved laterally away from the position of the receptor 
molecule. On the one hand, the oscillation amplitude 
should not be too large since otherwise it would detach 
the ligand from the receptor immediately after binding. 
On the other hand, the amplitude must be large enough 
to sufficiently stretch the linker molecule, in order to 
generate a significant force on the cantilever. These 
two requirements define a small gap of proper linker 
extensions, and therefore three different regimes of 
oscillation amplitudes. In the first regime (I), small 
oscillation amplitudes are applied to the AEM tip. 
The linker does neither stretch nor exert an efficient 
force on the cantilever in the upward swing. Thus, no 
recognition signal is generated. When the amplitude is 
increased to the second regime (II), the linker is effi¬ 
ciently stretched without detaching the ligand from the 
receptor in the upper part of the oscillation, resulting in 
a strong recognition signal. In the last regime (III), the 
oscillation amplitude is further increased, causing the 
ligand to unbind from receptor as soon as the top peak 
of the first oscillation cycle is reached, which follows 
the binding event. Since continuous binding while 
scanning over the receptor molecule is prevented in 
this case, no recognition signals are generated. In con¬ 
clusion, the oscillation amplitude of the AEM tip must 
properly be adjusted to regime (II) in order to suffi¬ 
ciently stretch the linker without rupturing ligand- 
receptor interaction. 

The third important imaging parameter is a properly 
chosen driving frequency. As mentioned before, the 
cantilever has a kind of memory, which is the time it 
needs to “forget” information (e.g., recognition or 
topography information) caused by the damping of 
the environment. To obtain a true recognition image 
without any topographical information or feedback 
artifacts, the cantilever must have lost all of the infor¬ 
mation collected during the lower part of the oscilla¬ 
tion, when the recognition information is measured (at 
the top peak of the oscillation). If this is not the case, 
i.e., when features from the amplitude-error image 
(originating from the finite feedback speed) contribute 
to the recognition image, the time given to the 


cantilever to forget this topographical information is 
too short. Therefore, this time has to be increased, 
which can be done by lowering the excitation fre¬ 
quency. The contribution of the amplitude error to 
the contrast in the recognition image and its depen¬ 
dence on the excitation frequency have been demon¬ 
strated experimentally and explained in detail (Preiner 
et al. 2009). 

Additional TREC measurements were performed 
on the avidin-biotin test system (Ebner et al. 2005). 
In this study, single avidin molecules were adsorbed on 
a mica surface and a biotinylated AEM tip was used to 
acquire a topographic image and a separate map of 
recognition sites from single 2D-scans of a sparse 
layer of avidin molecules. Avidin was immobilized 
on the mica surface via electrostatic adsorption and 
biotin was coupled to the AEM tip by using a flexible 
PEG linker. With this configuration, two independent 
maps were simultaneously acquired, that is, 
a topography image of the immobilized avidin mole¬ 
cules and a map of the corresponding recognition sites 
(Fig. 5a, b, respectively). A comparison of these two 
maps shows that almost all avidin molecules, visible in 
the topography image, were also found in the 
corresponding recognition image with an overall suc¬ 
cess rate of pe ~ 90%. The specificity of the recogni¬ 
tion signal was proven by adding free streptavidin to 
the solution above the sample and repeating the TREC 
scan of the same area. As expected, the topographic 
image (Fig. 5c) remained unaltered, while the dark 
spots in the previous recognition image (Fig. 5b) 
were no longer visible in the new recognition image 
(Fig. 5d) after blocking the tip-linked biotin with the 
free streptavidin. This result clearly shows that, before 
the addition of free streptavidin, the biotinylated AFM 
tip specifically recognized the avidin molecules on the 
mica surface. 

Simultaneous topography and recognition imaging 
can be also used for more complex systems like mem¬ 
branes and cells. In a recent study, bacterial surface 
layer (S-layer) crystals with genetically fused affinity 
tags were investigated by TREC imaging (Tang et al. 
2008a). S-layer proteins form regular two-dimensional 
lattices in vitro, such as in suspension, on lipid films, 
on liposomes, or on solid supports. The latter method 
allows for generation of planar-supported 2D protein 
crystals by a simple bottom-up approach. The S-layer 
protein SbpA of Lysinihacillus sphaericus CCM 2177, 
which self-assembles into a square (p4) lattice 
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at Single Molecule Level, 
Fig. 5 TREC imaging of 
avidin molecules, (a) 
Topography image of avidin 
molecules electrostatically 
adsorbed on mica, (b) 
Simultaneous recognition 
image of avidin molecules in 
the same area, (c) The 
specificity of TREC imaging 
was confirmed by adding free 
streptavidin to the sample to 
block biotin on the AFM tip. 
The topography image 
remained unchanged, whereas 
(d) most binding events 
disappeared in the recognition 
image. Figures are reproduced 
with permission from 
Ebner et al. 2005 



Atomic Force Microscopy 
(AFM) for Topography and 
Recognition Imaging 
at Single Molecule Level, 
Fig. 6 TREC on S-layers. 
Topography (a) and 
recognition (b) images of the 
S-layer proteins rSbpA-Strep- 
tagll and wild-type SbpA 
cocrystallized on a silicon 
chip. Topographic (c) and 
recognition (d) images 
acquired after tip-bound Strep- 
Tactin was blocked by adding 
free Strep-tagll. Figures are 
reproduced with permission 
(License Number 
2567111253467) from 
Tang et al. 2008a 
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Atomic Force Microscopy (AFM) for Topography and Rec¬ 
ognition Imaging at Single Molecule Level, Fig. 7 Recog¬ 
nition images of VE-cadherin on Myend cells. Topography 
image (a), simultaneously recorded with recognition map (b). 
Examples of recognition spots (b+ and b++) taken from (b). 
Recognition areas are depicted by threshold analysis 


(threshold = —1.7 nm) and bordered by white lines. Single 
VE-cadherin c/i-dimers are clearly seen (arrows). After 
blocking with 5 mM EDTA, while the recognition clusters 
practically disappeared (d), the topography (c) remained 
unchanged, indicating that the blocking does not affect mem¬ 
brane topography 


Structure (Huber et al. 2006), is one of the most exten¬ 
sively characterized S-layer proteins. In previous stud¬ 
ies, a fusion protein consisting of SbpA and the short 
affinity peptide Strep-tagll (an eight amino acid pep¬ 
tide and artificial ligand for streptavidin) was prepared. 
It was shown that the fused Strep-tagll, located on the 
exposed face of the protein crystal, did not interfere 
with the self-assemhly properties of the rSbpA-Strep- 
tagll fusion protein (Tang et al. 2008b). In order to 
address Strep-tagll on co-crystallized S-layer proteins 
of SbpA-Strep-tagll with wild-type SbpA protein (with 
a molar ratio of 1:7), the AFM tip was functionalized 
with Strept-Taclm via a PEG linker. In Fig. 6a, the 
typical square lattice symmetry of the S-layer protein 
ShpA with a center-to-center spacing of ^14 nm was 


observed. Recognition events between Sfrepf-Tactin 
and Strep-tagll were identified as dark spots in the 
recognition image (Fig. 6h), reflecting the distribution 
and localization of fusion protein SbpA-Strep-tagll in 
the mixed S-layer lattice. In order to prove the speci¬ 
ficity of the recognition signals, the Sf/epf-Tactin 
functionalized AFM tip was blocked by adding free 
Strep-tagll peptides into the sample solution. Conse¬ 
quently, the dark spots in the recognition image 
(Fig. 6d) were completely abolished, while the topog¬ 
raphy image (Fig. 6c) remained unchanged. This spec¬ 
ificity of the block suggested that the previously 
observed recognition spots originated from a specific 
interaction between Strep-tagll of the chimeric S-layer 
proteins and the Strept-Tactin-functionalized AFM tip. 
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In conclusion, simultaneous topography and recogni¬ 
tion imaging has successfully revealed the location of 
functional groups in the S-layer protein nanoarray at 
nanoscale resolution. 

Moreover, TREC has successfully been exploited to 
identify binding sites of vascular endothelial (VE)- 
cadherins (calcium-dependent-flc//re;ent proteins) on 
microvascular endothelial cells from mouse myocar¬ 
dium (MyEnd) cells and to colocalize their position 
with membrane topographical features (Chtcheglova 
et al. 2007). VE-cadherin is known to play a crucial 
role in calcium-dependent, homophilic cell-to-cell 
adhesion. In this study, VE-cadherin molecules on 
the cell surface were recognized by a magnetically 
coated AFM tip, functionalized with a recombinant 
VE-cadherin-Fc cA-dimer. In order to increase the 
stability of the cell membrane and avoid lateral diffu¬ 
sion of VE-cadherin, cells were gently fixed with glu- 
taraldehyde. The topography image (Fig. 7a) shows 
linear and branched hlaments of the peripheral actin 
belt, with some globular features. Microdomains with 
dimensions from ^10 to ^100 nm were found in the 
recognition image (Fig. 7b), due to a decrease of the 
oscillation amplitude upon physical interaction 
between VE-cadherin on the tip and VE-cadherin mol¬ 
ecules on the cell surface in the presence of Ca^"^ 
cations. Figure 7b+ and b++ shows a closer look at 
the recognition spots and reveal that one to two large 
domains (50-80 nm) were surrounded by smaller 
domains (10-20 nm) or even single molecule spots. 
The specificity of binding was confirmed by the addi¬ 
tion of a calcium chelator (EDTA), leading to the 
disappearance of almost all binding events in the rec¬ 
ognition image (Fig. 7d), whereas no change in the 
topography image was detected (Fig. 7c). 

Summary 

Simultaneous topography and recognition (TREC) 
imaging is a combination of high-resolution AFM 
topography imaging with single-molecule force 
microscopy. This powerful AFM technique not only 
yields fine structural details about topography but also 
senses biochemical composition of native biological 
samples under physiological conditions. TREC shows 
single molecular interactions and thus allows one to 
visualize, identify, and quantify local receptor binding 
sites and assign their locations to topographical 


features of a cell surface. For these reasons, TREC is 
a promising tool for the identification and location of 
receptor binding sites on cells, organelles, and other 
subcellular structures. 


Cross-References 

► Electron Microscopy 

► Fluorescence and FRET in Membranes 

► High-Speed Atomic Force Microscopy (AFM) 

► Single-Molecule Spectroscopy 

► Stimulated Emission Depletion (STED) Microscopy 
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Introduction 

AFM is a technique that can provide a wealth of infor¬ 
mation on lipid membranes. Its great strength is its 


ability to operate under physiological solution condi¬ 
tions with little sample preparation, while resolving 
structures down to the nanometer level. The planar 
nature of the membranes when supported on a solid 
surface makes them ideal samples for high-resolution 
imaging, and the ability of even the most basic AFM to 
discriminate 0.1 nm height differences allows the 
measurement of different phases in phase-separated 
systems using differences in membrane thickness 
and/or mechanical properties. 

Imaging: Structure and Dynamics 

AFM is a label-free technique, not dependent on the 
addition of components that might perturb or alter 
completely the behavior of the membrane under 
study. For this reason, it has been used extensively in 
the study of phase separation in lipid membranes in 
recent years (Connell and Smith 2006; Giocondi et al. 
2010). It Is, therefore, very versatile and powerful, 
while simultaneously being easy to use, with simple 
and rapid sample preparation. Preparing a planar 
supported bilayer is achieved by introducing small 
highly strained unilamellar vesicles of the lipid mix¬ 
ture to a flat substrate, where they spontaneously unroll 
(Reviakine and Brisson 2000; Richter et al. 2003). The 
vesicles may be prepared easily by ultrasonication, or 
if containing delicate functional proteins, by repeated 
extrusion through a filter membrane. Depending on the 
head groups’ chemistry and lipid transition tempera¬ 
ture (i.e., phase) this process can be helped using 
elevated temperatures and/or addition of ions to shield 
repulsive charge. Lipid films may also be transferred to 
a substrate using the Langmuir-Blodgett technique. It 
is now becoming routine to study lipid-protein inter¬ 
actions within a membrane using AFM, particularly 
the localization of proteins within specific areas of the 
membrane (Alessandrini and Facci 2010). 

Extremely high resolution of membrane proteins is 
possible. Sub-nanometer detail of higher-order stoi¬ 
chiometric proteins has been demonstrated (Bippes 
and Muller 2011). For example, OmpF trimers were 
reconstituted into a lipid membrane and assembled 
into 2D crystals and imaged under buffer. AFM could 
resolve short periplasmic loops protruding above the 
bilayer, of only 2-3 amino acids long. The different 
stoichiometries of various toxins such as a-hemolysin 
and cholera toxin have also been imaged with 
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0.5 nm resolution. Although traditionally a slow 
technique (minutes per image), recent advances in 
high-speed AFM have allowed unprecedented time 
resolution of molecular rearrangements in membranes, 
scanning at faster than 1 Hz frame rates (Shibata et al. 
2010 ), for example, following the dynamic processes in 
a photoactivated bacteriorhodopsin mutant. At lower 
lateral resolution, high-speed AFM has also elucidated 
the formation of surface-supported bilayers from the 
vesicle suspension, revealing an intermediate structure 
of highly dynamic cylindrical or tubular membranes on 
the surface as a step between the adsorbed vesicles and 
final bilayer (Giocondi et al. 2010). 

The highest resolution currently possible of lipid 
membranes by AFM has recently been achieved 
(Fukuma 2010) using a high-sensitivity frequency 
modulation (FM) mode together with high-sensitivity 
detectors. This is capable of not only resolving the 
head group structure of solid phase membranes to 
almost atomic resolution, but also detect and image 
the highly ordered water layers adjacent to the head 
groups. As well as laterally resolved images, these 
layers can be detected by operating the FM probe in 
the normal direction, clearly showing a strongly con¬ 
fined single-molecule thick water layer lying directly 
above the head group, together with a second more 
weakly bound and less-ordered layer above. 

As the membrane is in its native state, it is possible 
to follow many other types of dynamic processes. For 
example, the action of various detergents against raft- 
containing lipids revealed a complex dependence upon 
the type of detergent and relation to the cmc (Gamer 
et al. 2008). As detergent extraction is extensively used 
in the separation of lipid rafts and raft-proteins, these 
experiments could prove extremely important in elu¬ 
cidating the real mechanisms underlying the extraction 
process, or whether it is even valid. Raft formation can 
also be studied by preferentially extracting the individ¬ 
ual components of a complex lipid mixture, for exam¬ 
ple, by the use of P-cyclodextrin (Giocondi et al. 
2004). The dynamics of domain formation itself have 
been measured, controlling the size, distribution, and 
shape of phase-separating domains. These processes 
were found to be controlled by factors such as temper¬ 
ature, membrane line tension, and proximity to phase 
boundaries (Longo and Blanchette 2010). 

Real-time imaging has been used to probe the 
dynamics of lipid-anesthetic interaction, showing sev¬ 
eral mechanisms of action, but mainly an increase in 


lipid bilayer disorder together with an interdigitation 
of the opposing leaflets and resultant thinning (Goksu 
et al. 2009). This fundamentally alters the properties of 
the membrane, particularly in terms of permeability, 
and probably has a large effect on the embedded mem¬ 
brane proteins. The interaction of species potentially 
toxic to the membrane, for example, amyloidogenic 
proteins and antimicrobial peptides, is also open to 
study by AFM by directly imaging the membrane 
surface as it is challenged by various species in real 
time. The membrane has been found to catalytically 
promote aggregation of amyloid proteins, which then 
proceeds to alter the membrane permeability and struc¬ 
ture (Relini et al. 2009; Williams et al. 201 1). 

Force Spectroscopy: Mechanics and Function 

In contact mode AFM, the probe is in mechanical 
contact with the membrane and applies a controllable 
force, resulting in some membrane deformation. For 
imaging purposes this would normally be minimized, 
but it can also be used to probe membrane mechanics. 
Although there are several more established tech¬ 
niques for studying membrane mechanics, namely, 
the biomembrane force probe, and the surface force 
apparatus (SFA), these operate over larger length 
scales, typically tens of microns. An AFM will mea¬ 
sure local properties down to the nanometer, or molec¬ 
ular scale, and hence molecular interactions between 
neighboring molecules, and with the solvent. This is 
particularly important if attempting to measure the 
mechanics within a single domain, or in the vicinity 
of a membrane disturbance, perhaps caused by 
a membrane protein, nanoparticle, or self-assembling 
peptide. Such experiments are performed on solid 
supported bilayers, where the probe interacts vertically 
with a planar bilayer, hence eliminating weaker bend¬ 
ing modes. 

There are three methods of force spectroscopy that 
have been applied to membrane nanomechanics. Tak¬ 
ing the most common first, the force on the probe is 
ramped up linearly until the membrane’s elastic limit is 
exceeded, plastic deformation occurs, and the mem¬ 
brane ruptures. The tip jumps suddenly from the upper 
surface of the compressed bilayer, to the underlying 
substrate. The force at which this occurs has been 
found to be characteristic of a particular mixture, or 
phase of the membrane, acting as a molecular 
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fingerprint. The nanomechanics of bilayers of varying 
composition has recently been comprehensively char¬ 
acterized using this method, revealing that both head 
group and the hydrophobic tails contribute to the 
mechanical stability of the membrane (Garcia-Manyes 
et al. 2010). For example, resistance to penetration 
increases proportionately with chain length, and is 
inversely proportional to the number of chain unsatu¬ 
rated bonds, easily understood in terms of the van der 
Waals forces between closely packed chains. The 
effect of head group chemistry is more complex, 
driven by hydration and ion absorption properties, but 
a simple sequence of increasing stability, PA < PE < 
PC < PS < PG, was shown. This technique also 
showed the important effect cholesterol plays in regu¬ 
lating the order, mechanical strength, and permeability 
of membranes, increasing the stability of both liquid 
and solid phases. Surprisingly, the breakthrough force 
was found to be largely independent of tip radius. In 
normal consideration of surface nano-indentation, the 
Hertz model (or similar) is invoked, containing a load 
term (force per unit area). In this situation, we are 
outside the limits of elastic deformation, and into the 
plastic regime, which in this case is dependent only on 
absolute force. 

The second method probes only the elastic regime 
(Das et al. 2010) where the Hertzian type response is 
more valid, and the local modulus can be determined 
from a static analysis of the deformation. However, 
more parameters are taken into account to achieve 
a good fit with experimental results. The two coupled 
leaflets are treated independently, a factor which is 
already known to influence dynamic fluctuations. 
Further contributions to the modulus accounted for 
are head group and tail stretching as the bilayer 
deforms, van der Waals, and electrostatic interactions. 
While the derived function scaled differently to 
Hertzian contact (elastic, two bodies much larger 
than the contact area) as might be expected, it was 
also different to elastic thin films, whether bonded or 
slipping against a solid substrate. The modulus deter¬ 
mined was not one of Young’s compressive modulus, 
but an area compressibility modulus. A test of the 
function proved an excellent fit on the Lj phase of 
a liquid-liquid phase separated ternary system 
containing DOPC, sphingomyelin, and cholesterol, 
producing a realistic modulus. A ht of the Lq phase 
compression curve revealed a small deviation only 


seen in the first 0.5 nm of bilayer compression. This 
was attributed to the stretching of hydrogen bonds, and 
once this had been taken into account an excellent ht 
resulted. 

A third method recognizes that membranes consist 
of dynamic molecules interacting in a lateral direction 
in the plane of the bilayer, qualitatively different to the 
normal direction (Butt et al. 2005). The membrane 
rupture then becomes a statistical process, where an 
increase in pressure exerted by the AFM tip results in 
an increase in probability that it will penetrate. As it is 
a probability, there will not be one hxed force, but 
a range, exactly what is observed if a histogram of 
penetration forces is plotted. This implies that the 
bilayer breakthrough is an activated process. In this 
continuum nucleation theory, the pore formation is 
connected with the line tension in the bilayer, and the 
spreading pressure, associated with the membrane’s 
adsorption to the solid surface. An outcome of this 
treatment is an expression that shows that the yield 
force is dependent upon loading rate, increasing at 
higher approach speeds. In other studies the tip veloc¬ 
ity is normally kept constant (e.g., at 1 pm/s). This is an 
outcome of the 2D fluidity of the bilayer. Although the 
shear modulus at static load is zero (because it is 
a fluid), at higher frequencies an elastic response 
appears, governed by the relaxation time of the bilayer 
molecules. At the extreme, with very high-frequency 
biophysical techniques such as QCM-D, the bilayer 
appears to be 100% elastic in the horizontal plane 
(Melzak et al. 2011). All three methods described 
have proven useful in understanding AFM force data, 
and are often used together, for instance, to show the 
temperature dependence and effect of ionic strength on 
mechanical stability. 

There are other methods for probing surface prop¬ 
erties of the bilayer other than compressing or pene¬ 
trating it. By changing the charge on the AFM probe 
using chemical modification with a functional self- 
assembled monolayer, it is possible to perform titration 
force microscopy, measuring the retraction adhesion 
between the probe and membrane (Garcia-Manyes 
et al. 2006). The adhesion force will be dependent 
upon the ionization state of the lipid molecules with 
respect to pH. Even more information on the DLVO 
electrostatic forces above a bilayer can be seen in the 
approach curve before contact. Finally, all force spec¬ 
troscopies may be carried out in a force mapping mode, 
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where a force curve is taken at each point in a 2D array, 
particularly important when attempting to measure the 
mechanical properties of individual membrane ele¬ 
ments, such as raft-like domains. Until recently these 
type of measurements have been relatively slow, hav¬ 
ing to perform a controlled approach and retract cycle 
at each pixel, but the latest generation of AFM systems 
are capable of gathering force data at high resolution at 
the same time as normal imaging, which make this 
technique much more accessible. The AFM is also 
capable of observing functional membrane proteins 
at work. Experiments in which single-membrane 
proteins are extracted from the bilayer by tethering 
between the end of the probe to a particular group 
on the protein, while measuring the opposing forces, 
probe the interaction between proteins and lipids, 
and the intermolecular forces which stabilize protein 
folding (Bippes and Muller 2011). The proteins 
may even be mechanically unfolded while still in the 
membrane, or ligand binding sites identified (Frederix 
et al. 2009). 

Summary 

It has been demonstrated that AFM is a hugely power¬ 
ful technique for studying the biophysical properties of 
membranes, their structure, mechanical properties, and 
interactions with membrane proteins and other bodies. 
The most important progress is being made in the 
understanding of membrane dynamics at the molecular 
level, where recent advances in very high-resolution 
AFM and very high-temporal-resolution AFM are 
poised to provide new insights. 
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Atomic Force Spectroscopy 

Yalda Javadi and Juilo M. Fernandez 
Columbia University, New York, NY, USA 

Synonyms 

Atomic force microscopy; Single-molecule force spec¬ 
trometry; Single-molecule methods 

Definition 

Atomic force spectroscopy is the study of the 
mechanical properties and dynamics of single proteins 
and individual chemical bonds under force. 

Introduction 

Atomic force spectroscopy has emerged as 
a powerful tool to investigate the forces and motions 
associated with biological molecules and enzymatic 
activity. The most common force spectroscopy 
techniques are ► optical tweezers, ► magnetic 
tweezers, and atomic force microscopy (AFM). In 
this entry, we focus on AFM to describe how 
mechanical forces affect the dynamics and chemis¬ 
try of proteins, at the single molecule level. We 
utilize molecular biology techniques to engineer 
tandem modular proteins, comprised of identical 
repeating units of the protein of interest. These 
polyproteins, when extended by AFM, have unique, 
mechanical properties that serve as a fingerprint 
allowing for an unambiguous probe into the dynam¬ 
ics of a single molecule. In our initial experiments, 
we would extend the protein at constant velocity 
(force-extension mode), revealing the well-known 
and characteristic saw-tooth pattern traces of 
unfolding. Whilst this approach allowed us to 


make observations that were not possible in tradi¬ 
tional bulk experiments, it yielded only qualitative 
results, owing to the vast changes in force, length, 
and loading rate that occurred simultaneously. 
Combating this marked the birth of our pioneering 
force-clamp spectroscopy, which allowed for the 
first time the force dependency of protein unfolding, 
folding, and of chemical reactions to be investi¬ 
gated. From the force dependency, we can extract 
key features and underlying molecular mechanisms 
involved in these reactions, such as the distance to 
the transition state, folding energy landscapes, and 
enzymatic activity. In the following sections, we 
depict a summary of the key attributes of force- 
clamp spectroscopy, highlighting why to date it 
has remained a powerful tool to study the physical 
mechanisms and dynamics of single proteins. 

Atomic Force Microscopy - Force Extension 

In a typical single-molecule AFM experiment, the 
protein molecule is adsorbed onto the substrate and 
the tip of the cantilever and pulled at constant velocity, 
allowing the resulting force to be obtained. 
Nonspecific events that are prevalent at the single 
molecule level, however, present challenges in analyz¬ 
ing the mechanical properties of the protein. There¬ 
fore, a clear and concise fingerprint must be in place to 
explicitly determine the mechanical properties specihc 
to that protein. Such a molecular hngerprint was first 
implemented over a decade ago with the P-sandwich 
127 polyprotein (27th immunoglobulin domain from 
human titin) that consisted of between 8 and 12 tandem 
repeating units (Carrion-Vazquez et al. 1999). These 
polyproteins have been an integral part of single¬ 
molecule AFM experiments by providing certainty 
that all the mechanical parameters that are being mea¬ 
sured are specihcally from the repeating protein 
molecule. 

In force-extension experiments, the polyprotein is 
pulled at constant velocity from the cover slide that is 
mounted on a piezoelectric positioner by a cantilever 
tip. The molecule is extended, generating a restoring 
force that causes the cantilever to bend proportionally 
to the force acting upon it in accordance with Hooke’s 
law. This technique rather eloquently allows spatial 
manipulation of less than a nanometer as well as mea¬ 
surements of forces of just a few piconewtons to be 
measured. The resulting force-extension curve is plot¬ 
ted revealing the distinctive saw-tooth pattern (Fig. 1), 




Atomic Force Spectroscopy 


117 


A 


Atomic Force 
Spectroscopy, 

Fig. 1 Experimental outline, 
(a) The polyprotein is 
stretched from a gold cover 
slide by a cantilever tip at 
constant velocity, (b) Diagram 
of the P-sandwich structure of 
the 127 protein. The 
polyprotein is engineered from 
eight domains, (c) A typical 
force-extension recording 
obtained by stretching an 
(127)8 polyprotein at constant 
velocity (Taken from Alegre- 
Cebollada et al. (2010)) 



which can be fit with the ► worm-like chain (WLC) 
model of polymer elasticity (Marko and Siggia 1995), 
to provide a worthy description of the force-extension 
behavior. The peak force reached before an unfolding 
event (^200 pN for 127) measures the mechanical 
stability of a single protein domain, whilst the contour 
length of the domain is reflected in the spacing distance 
between the peaks (^28 nm for 127) as the polyprotein 
unfolds (Carrion-Vazquez et al. 1999). 

This approach proves sufficient, albeit qualitative, 
for measuring the force-dependent parameters of 
a polyprotein. To obtain quantitative results, we 
developed a technique to address the changes of 
force and loading rates during the cycle of force- 
extension experiments. This “force-clamp” mode 
marked an important milestone in AFM technology, 
not only by permitting for the first time direct 
measurements of the force dependency of the 
unfolding and folding of single proteins, but by also 
allowing complex pulse patterns of forces to be applied 
to proteins. 

Force-Clamp Spectroscopy 

In the force-clamp mode, a polyprotein is pulled 
using a constant and calibrated stretching force 
held constant through the feedback by 
a proportional-integral-derivative (PID) controller. 
By this mechanism, the length of the protein is 
adjusted to keep the stretching force constant. 
Unfolding events are characterized by a step increase 
in the polyprotein length, revealing the characteristic 


staircase trajectory (Fig. 2). From this, unfolding 
kinetics can be readily obtained. 

What revolutionized force-clamp spectroscopy was 
its ability to relate these measurements to directly 
explore the energy landscapes of single proteins. In 
particular, by investigating the force dependency of 
unfolding, we gain insight into the distance to the 
transition state (Ax) and activation energy barrier 
(AG) for the reaction, in accordance with the following 
Arrhenius term; 

r{F) = A exp[—(AG — FAx)/kT] 

where r(F) is the force-dependent rate constant, 
obtained by averaging polyprotein traces (n = >20) 
and fitting the summed trace to a single exponential. 
This is repeated over a range of forces to determine the 
force dependency of the unfolding reaction. A is the 
attempt frequency, necessary for determining AG, 
measured independently for each reaction. Ax is the 
distance to the transition state of a reaction and mea¬ 
sured from the slope of the Arrhenius fit (Fig. 2B). k is 
the Boltzmann constant and T is the temperature. 

In the case for 127, we calculated the distance to 
the transition state to be 2.4 A (Dougan et al. 2008a). 
This distance is the size of a water molecule 
(Hajdu et al. 1976). Indeed, replacing water by larger 
solvent molecules alters the Ax by a value closely 
related to the size of the new solvent molecule 
(Dougan et al. 2008a). The capability of force-clamp 
spectroscopy to attain such a parameter proves most 
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Atomic Force Spectroscopy, Fig. 2 (a) A typical length- 
versus-time recording {top trace) obtained by stretching an 
(127)8 polyprotein at a constant force of 130 pN {bottom trace). 
The protein elongates in steps of 24.5 nm, marking the unfolding 
of individual proteins in the chain, (b) Illustration of the effect of 
mechanical force on a simple two-state energy landscape. The 


reaction is characterized by the position of the transition state, 
Ax, along the reaction coordinate and the height of the activation 
energy barrier, AG. A pulling force applied along the reaction 
coordinate accelerates the rate of the reaction, k{F), exponen¬ 
tially by reducing the size of the activation energy barrier by an 
amount equal to FAv (Adapted from Fernandez et al. (2010)) 


Striking, principally because it allows the molecular 
structure of an extremely transient transition state to 
be defined. Furthermore, the extrapolation of the 
unfolding rates to zero force directly measures 
the height of the activation energy barrier. Evidently, 
the quantification of such parameters, permitted 
by force-clamp spectroscopy, has played a central 
role in moving the field forward by allowing the energy 
landscapes of single proteins to be fully explored. 

Force Dependency of Chemical Reactions 

The effect of mechanical force on the free-energy 
surface that governs a chemical reaction remains 
largely unknown. Historically, mortar and pestle 
methodology have provided the most widely used 
probe to apply force to chemical reactions. More 
recently, techniques such as ultrasound have been 
employed in activating covalent bonds in the reacting 
system (Hickenboth et al. 2007). In these cases, 
however, the magnitude and direction of the applied 
force is not known. In this context, force-clamp exper¬ 
iments excel at providing a directional and calibrated 
probe to activate a chemical reaction, thus, allowing 
for a direct exploration into the dynamics and chemical 
reaction mechanisms of single bonds. 

In our basic strategy, we combine polyprotein 
engineering and force-clamp AFM to investigate 
force-dependent enzymatic catalysis at the single bond 
level. The outlined schematic is presented in Fig. 3. 


In this instance, the 127 polyprotein has two 
cysteine residues incorporated into its sequence 
(G32C, A75C) such that a stable disulhde bond is 
readily formed. Note that the bond is positioned 
within the p-sandwich fold of the 127 protein, essen¬ 
tially separating the protein into two parts: first, the 
unsequestered amino acids that readily unfold and 
extend when a stretching force is applied (11 nm) 
and, second, the 43 amino acids that are trapped 
behind the disulfide bond (13.5 nm). This second 
region can only be extended if the disulhde bond is 
reduced in the presence of a nucleophile, such as 
L-cysteine, DTT, or the reductase enzyme 
thioredoxin (Trx). A representative trace illustrating 
the two respective regions is shown in Fig. 3c. Using 
this combination of polyprotein engineering and 
force-clamp spectroscopy, we explore the mechanism 
by which the substrate disulhde is reduced 
(Wiita et al. 2006; Ainavarapu et al. 2008; Dougan 
2008 a, b). 

We can measure reduction kinetics of disulhde 
bonds as a function of pulling force. Here, a double 
pulse protocol is employed to provide two differing 
forces. The hrst force pulse rapidly unfolds the 
modules of 127 up to the unsequestered region, expos¬ 
ing the disulhde bonds to the solution. In the presence 
of a reducing agent, the second pulse can probe the 
force-dependent reduction rates of the exposed 
disulhdes. The reduction rate at a given force is 
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Cartoon of the unfolding of an 
127ss at a constant stretching 
force in the force-clamp mode. 

(b) Single (I27ss)8 molecules 
are stretched. After unfolding 
(11 nm), the disulphide bond is 
exposed to the solution. On 
disulphide reduction by Trx, 
the residues previously 
trapped behind the disulphide 
bond are immediately 
extended (14 nm). 

(c) Representative trace 
illustrating the unfolding 
events initially, followed by 
protein extension after the 
protein is reduced (marked by 
Mack arrows) 
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Atomic Force Spectroscopy, Fig. 4 (a) Force dependence of 
the reduction rate for a small reducing agent (L-Cys), human 
Trx, and E.coli Trx. Error bars determined by bootstrapping 
methods (Efron 1982). The Trx enzymes catalyze the substrate 


by a Michaelis-Menten mechanism (Alegre-Cebollada et al. 
(2010)). (b) This model describes two modes of Trx catalysis 
(path I in red, path II in Mue), where the catalytic rate constants 
are exponentially dependent on the applied force 


measured by fitting a single exponential to an averaged 
trace consisting of n = >20 individual traces. 

Figure 4 illustrates how disulfide bond reduction is 
sensitive to the force applied and the type of reducing 


agent applied. For L-cysteine, this corresponds to 
a simple Sn 2 chemical reaction, characterized by an 
exponential increase in the reduction rate with the 
pulling force. The Trx enzymes, on the other hand, 
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catalyze the substrate by an alternative mechanism, 
defined by a Michaelis-Menten mechanism that is 
detected when the disulfide is stretched at low forces. 

Here, the enzyme binds to the stretched polyprotein 
and the subsequent structural organization of the 
participating sulfur atoms precedes the chemical 
reaction. At high forces, however, the Trx enzymes 
exhibit two distinctly different chemical behaviors 
depending on their species origin. Eukaryotic-origin 
Trx reduces disulfide bonds through a single-electron 
transfer reaction (SET), whereas bacterial-origin Trx 
shows both Sn 2 and SET (Perez-Jimenez et al. 2009). 
The evolution of the binding groove has been specu¬ 
lated as an important factor for controlling the chem¬ 
istry of Trx catalysis that gives rise to these distinct 
reduction mechanisms. Eurther to its ability in 
distinguishing the chemistry of simple Sn 2 reactions 
from more elaborate pathways, the force-clamp reduc¬ 
tion assay can also reveal fundamental characteristics 
about the single disulfide bond. By applying the simple 
Arrhenius term (described above), we can deduce 
key characteristics of the transition state of this Sn 2 
reaction. For example, in the case where the small 
nucleophile DTT attacks the disulfide bond, we see 
a bond elongation of Ax = 0.34 A at the transition 
state. This is in line with the theoretical calculation of 
^0.37 A that represents the distance to the transition 
state in a simple Sn 2 thiol/disulfide exchange 
(Fernandes and Ramos 2004). Indeed, when alterna¬ 
tive reducing agents are used, the measured distances 
to the transition states, in each case, correspond to 
the physiochemical characteristics of the reactants. 
This demonstrates that the changes observed in the 
distance between the sulfur atoms at the transition 
state are in fact responsible for the force dependency 
of the reaction. 

Folding Trajectory of a Single Protein 

Two conflicting views have been proposed to explain 
how a protein folds from an extended primary 
sequence of amino acids into its three-dimensional 
native fold. On one hand, statistical theories have 
proposed a rough, funnel-like energy landscape, 
which guides the protein to the native state through 
progressively smaller conformational ensembles 
(Onuchic and Wolynes 2004). Meanwhile, by 
a somewhat conflicting view, traditional biophysical 
and biochemical experimentalists envisage protein 
folding as a two-state (all-or-nothing) system, whereby 
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Atomic Force Spectroscopy, Fig. 5 After mechanical 
unfolding of the protein, force quench captures the full folding 
trajectory. The end-to-end length of an ubiquitin polyprotein is 
shown as a function of time. (1) Elastic recoil of the unfolded 
polymer as the force is quenched. This is followed by (2-3) 
a continuous and cooperative collapse trajectory forming (4) an 
ensemble of collapsed states (molten globule states) until native 
contacts are made that define the protein’s final form. There is no 
marked variation in the protein length during this transition. 
A final stretching force confirms the regain of mechanical sta¬ 
bility of the native state. In this folding trajectory, the extended 
quench time results in the final native structure, and not the 
molten globule precursors, being formed (evident from the 
20-nm step sizes from the second unfolding pulse) (Adapted 
from Fernandez and Li (2004)) 

the unfolded and the folded states are separated 
by a single energy barrier (Fersht 1998). In order to 
resolve such an apparent contradiction, single- 
moiecule techniques are best positioned to test exper¬ 
imentally the folding trajectory of a single protein 
during its journey to the functional native state. 
These experiments, when tested in bulk, have mostly 
been conducted on the model protein ubiquitin. There¬ 
fore, we engineered a polyprotein that consisted of 
nine modules of ubiquitin. When combined with 
force-clamp spectroscopy, this allowed, for the first 
time, the protein’s folding landscape to be dissected 
in detail, encompassing all the different stages from the 
highly extended state to the natively folded form as 
a function of end-to-end length over time. 

In the force-quench mode, first an initial force is 
applied (100 pN) that promotes the sequential 
unfolding of each ubiquitin model, marked by step- 
size increments of 20 nm (Fig. 5). As the name sug¬ 
gests, the force is then quenched to a low force (10 pN), 
triggering the collapse of the extended protein 
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Atomic Force 
Spectroscopy, 

Fig. 6 Molten globule states. 
After full collapse, we observe 
that the protein becomes 
segregated into two distinct 
ensembles: The first is 
identified by a fast 
heterogeneous elongation 
made of multiple sized steps 
(Inset); the second 
corresponds to well-defined 
steps of 20 nm that identify 
fully folded proteins. The ratio 
between these two states of the 
protein depends on At. Longer 
values of At favor the native 
ensemble (Taken from Garcia- 
Manyes et al. (2009)) 



At = 1 s 



(Fernandez and Li 2004). This collapse, in contrast to 
the discrete unfolding events observed, is highly coop¬ 
erative and contracts in stages (Berkovich et al. 2010). 
The first slope corresponds to the elastic recoil of the 
unfolded polymer and is followed by a continuous and 
cooperative collapse trajectory leading to an ensemble 
of collapsed structures that are mechanically labile, 
structurally heterogeneous, and necessary precursors 
to the native state. From such an ensemble of collapsed 
conformations, also known as molten globule 
(Baldwin et al. 2010), the folding transition occurs in 
a two-state manner whereby the protein reaches its 
folded length, resulting in a native conformation 
that exhibits mechanical stability. The recovery of 
the native protein’s mechanical stability in our 
experiments is an unambiguous signature of the 
completion of the folding process. 

The individual folding trajectories monitored 
with the force-clamp technique readily show that the 
folding reaction from a highly extended state does 
not follow simple two-state kinetics. In addition, the 
folding trajectories are stochastic in nature. This 
means that even with identical force-quench pulses, 
individual molecules do not respond uniformly and 
display folding trajectories that vary greatly, thus in 
complete accordance with the statistical theories of 
protein folding. 

In this vein, the collapse trajectory cannot be 
described by a simple all-or-nothing transition 


from the unfolded and extended state to the collapsed 
conformation. Instead, the continuous and cooperative 
collapse trajectory reflects the diffusion behavior of the 
protein surfing a continuous energy landscape. Such 
a process can be well described by a diffusion 
Langevin model (Risken 1989). Upon reaching the 
collapsed structure that exhibits partial mechanical 
stability, the protein has to mature over time trying to 
find the right native contact networks that define the 
final three-dimensional fold. We applied a series of 
force-quench pulses to perturb these ensembles that 
form along the folding trajectories and thus gain 
insights into their role in folding (Fig. 6). We captured 
two distinct phases: (i) the molten globule states, char¬ 
acterized by a fast initial extension that unravels the 
collapsed states in a stepwise manner of different 
lengths, and (ii) the native state, marked by the char¬ 
acteristic 20-nm unfolding staircase. Such a rate- 
limiting final transition from the molten globule to 
the native form occurs in a barrier-activated two-step 
process, akin to that sampled in bulk techniques. 

Summary 

Atomic force spectroscopy has the capability to 
uncover the intricate details of protein-based reactions, 
allowing us to sense conformational changes down to 
the sub-Angstrdm scale. The data obtained from the 


























A 


122 


ATP Synthases from Archaea: Structure and Function 


single protein has already had, and continues to have, 
a great impact within the field. We can now deduce the 
distance to the transition state from force dependency 
experiments, which in turn can be related to well- 
defined molecular events. We can study the protein’s 
journey over a complex energy landscape during its 
unfolding/folding trajectory, providing insight that has 
previously been limited by traditional biochemical 
methods. Furthermore, atomic force spectroscopy has 
evolved in recent years to provide a direct view of the 
physical and chemical mechanisms that underpin pro¬ 
tein-based reactions. These developments can be read¬ 
ily combined with simulations and theoretical studies 
and work in synergy to provide a well-rounded view of 
protein dynamics at the single molecule level. 


Cross-References 

► Atomic Force Microscopy 

► Force-Fluorescence Spectroscopy 

► High-Speed Atomic Force Microscopy (AFM) 

► Magnetic Tweezers 

► Optical Tweezers 

► Worm-Like Chain (WLC) Model 
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Synonyms 

AjAo ATP Synthase 


Definition 

AiAo ATP synthases are enzymes for energy conser¬ 
vation in archaea. 
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Introduction 

In 1977 Carl Woese discovered the third domain of 
life, the Archaea. Based on the analyses of 16 S rDNA 
sequences of different organisms he showed that the 
tree of life consists of three domains, the Archaea, the 
Bacteria, and the Eukaryota (Fig. 1). Archaea were 
always considered to live in habitats with extreme 
conditions, for example, high temperature, high pres¬ 
sure, low or high pH, or high salt concentrations. So far 
more than 75 hyperthermophilic bacteria and archaea, 
could be isolated from different habitats showing 
a broad range of diversity. Moreover, in recent years 
it became evident that archaea are also present in high 
number in mesophilic environments such as the ocean 
and terrestric soils. The domain of the Archaea can be 
divided into the kingdoms Euryarchaeota, 
Crenarchaeota, Nanoarchaeota, and Korarchaeota. 
Crenarchaeota are represented by sulfur-dependent 
(hyper-) thermophilic archaea, and the Euryarchaeota 
are phylogenetically extremely diverse and are com¬ 
posed of halophiles, acidophiles, and methanogens. To 
date the kingdom Nanoarchaeota consists of only one 
archaeon called Nanoarchaeum equitans, a tiny coccus 
of 400 nm in diameter that lives in a symbiosis/para¬ 
sitism with the host Ignicoccus hospitalis, which is the 
only known organism where the ATP synthase is local¬ 
ized in the outermost membrane (Kiiper et al. 2010). 
The fourth kingdom, the Korarchaeota, was identified 
in environmental samples by their DNA sequences, but 
so far they could not be cultivated. 

A Unique Class of ATP Synthases in 
Archaea: The A^Aq ATP Synthase 

All ATP synthases derived from a common ancestor 
and developed into the ATP synthases/ATPases known 
to date; the FiFq ATP synthase present in bacteria, the 
AiAo ATP synthase present in all archaea, and the 
ViVo ATPases present in eukarya. Due to lateral 
gene transfer the picture has become more compli¬ 
cated. The tree of life is actually a “bush” of life. 
AjAq ATP synthase genes were transferred to some 
bacteria (like Thermus thermophilus) and some 
archaea have in addition to AiAq also FiFq ATP 
synthase genes. The ATP synthases/ATPases have in 
common a bipartite stmcture, a soluble Ai, Fj, Vj 
domain and a hydrophobic, membrane-bound Ao, Fo, 


Vo domain that are connected by a central and 
a peripheral stalk. AiAq and FiFq ATP synthases 
share a common function, ATP synthesis under phys¬ 
iological conditions, whereas ViVq ATPases can only 
hydrolyze, not synthesize ATP. They were designed by 
nature as efficient, ATP-driven ion pumps. AiAq ATP 
synthases and ViVq ATPases took the same road for 
some time in evolution and thus, their overall subunit 
composition is similar and the catalytic subunits of 
A- and V-ATPases share a relatively high degree of 
similarity. Different again is their regulation. Whereas 
the ViVo ATPase is known to undergo reversible 
dissociation/association of Vi/Vq dependent on the 
carbon supply and energy status of the cell, this is not 
observed in AiAq ATP synthases. The AiAq ATP 
synthases have a more complex subunit composition 
but comparatively little is known about the subunit 
topology, the structure, and function of these enzymes. 

Genetic Organization of Archaeal ATP 
Synthase Genes 

The subunits of the AiAq ATP synthase are encoded 
by nine genes, organized in an operon in most, but not 
all archaea. It starts at the 5 ’ end with atpH followed by 
atpl, atpK, atpE, atpC, atpF, atpA, atpB and atpD 
(Fig. 2). Nearly all Euryarchaeota show this genetic 
organization, except the methanogens and the 
Sulfolohales. The operons of these two groups have 
an additional open reading frame, atpG, at the 3’ end, 
whose function, if any, is not clear. A closer look at 
operons of AiAq ATP synthases from 
Thermococcales, Archaeoglohales, Halohacteria, and 
methanogens reveals that the genes atpl and atpK, 
encoding for the subunits a and c, that make the mem¬ 
brane-embedded Ao domain, are surrounded by the 
genes atpH and atpE, encoding for the hydrophilic 
subunits H and E of the peripheral stalks. The central 
stalk of the AiAq ATP synthase is made by the sub¬ 
units C, F, and D, encoded by the genes atpC, atpF, and 
atpD. The hydrophilic Ai domain consists of three 
pairs of subunits A and B, encoded by the genes atpA 
and atpB, and are arranged alternately. 

Not all archaea have such a conserved gene order in 
an operon. For example, the ATP synthase genes of the 
Crenarchaeota are scattered around the genome, and 
in the genome of N. equitans only the genes atpA, 
atpD, atpK, atpl, and a truncated atpB were detected. 
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ATP Synthases from Archaea: Structure and Function, 
Fig. 1 Tree of life based on 16 S rDNA sequences showing 
cultivated organisms of the three domains of life, Archaea, 

A very interesting question is, whether such a minimal 
ATP synthase, consisting of a head domain with only 
A, and a non-functional subunit B, a central stalk 
represented by the subunit D and an Aq domain, 
build up by a c ring and the stator subunit a, is able to 
synthesize and hydrolyze ATP? If this minimal ATP 
synthase is not functional, then it is possible that 


Bacteria, and Eukaryotes (Courtesy of Dr. Haraid Huber, Uni¬ 
versity of Regensburg, Germany) 


A. equitans receives ATP from its host /. hospitalis, 
where the ATP synthase is localized in the outer mem¬ 
brane. However, if such a minimal ATP synthase is 
functional, it would be the most ancient ATP synthase/ 
ATPase and could give us clues to the evolution of 
ATP synthases/ATPases in the tree of life (Lewalter 
and Muller 2006). 
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ATP Synthases from Archaea: Structure and Function, Fig. 2 Genetic organization of the AjAq ATP synthase genes in 
Methanosarcina mazei G61 


The Domain of Archaeal ATP Synthases 

The Ai domain of the archaeal ATP synthase of 
Methanosarcina mazei Gdl, containing the subunits 
A and B of the head part as well as C, D, and F of the 
central stalk was heterologously overproduced in 
Escherichia coli and purified to homogeneity (Lemker 
et al. 2003). A three-dimensional structure of the Ai 
domain was obtained by electron microscopy of single 
particles leading to the first three-dimensional recon¬ 
struction at a resolution of 3.2 nm (Fig. 3). The head 
domain was 10.2 nm in diameter and 10.8 nm in height, 
consisting of six elongated subunits A 3 and B 3 sur¬ 
rounding a central cavity, followed by a stalk of 
3.0 nm in length at the bottom of the head domain, 
which penetrates inside this cavity (Coskun et al. 
2004). Based on cross-linking data as well as solution 
X-ray scattering data of the subunits C, D, and F, it was 
possible to create a model of the Ai domain of the 
AiAo ATP synthase from M. mazei Gol (Fig. 4) 
revealing that the head domain is consisting of subunits 
A and B and that the central stalk is build up by the 
subunits C, D, and F leading to the subunit topology of 
an A 3 B 3 CDF complex. 

Structures of Single Subunits in the A^Aq 
ATP Synthase 

The AiAo ATP synthase is made of nine subunits. The 
catalytic nucleotide-binding subunit is A (66.0 kDa), 
which consists of four domains: an N-terminal and 
C-terminal domain, a nucleotide-binding domain, and 
a knob-like structure, often described as “spike”. 
Subunit B (51.9 kDa) is made by an N-terminal 
domain, consisting of a six-stranded p-barrel, a 
C-terminal domain consisting of a four a-helical bun¬ 
dle, and an intermediate a-P domain, which is the 
nucleotide-binding domain, consisting of a nine- 
stranded P-sheet surrounded by seven a-helices. 
Subunit C (42.5 kDa) has a funnel-like shape with 



ATP Synthases from Archaea: Structure and Function, 
Fig. 3 Surface representation (A1-A3) of the three- 
dimensional reconstruction of the A, domain of the AjAq ATP 
synthase from M. mazei Gol. The particle is represented as 
rotating around an axis parallel to the specimen support (This 
research was originally published in Coskun et al. 2004) 



ATP Synthases from Archaea: Structure and Function, 
Fig. 4 Subunit topology model of the Aj domain of M. mazei 
Gdl, based on three-dimensional reconstruction, cross-linking 
studies, and solution X-ray scattering data, showing the subunits 
A and B located in the head domain and subunits C, D, and 
F building up the central stalk (This research was originally 
published in Coskun et al. 2004) 

a polar cavity inside and consists of three uniform 
domains, each containing a bundle of four peripheral 
and two central a-helices. So far, no structure exists for 
subunit D, which has a molecular mass of 24.6 kDa. 
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Subunit F (11.9 kDa) has a two-domain structure, 
containing a well-ordered globular N-terminal domain, 
consisting of four (l-sheets and three cx-helices and 
a very flexible C-terminal domain, having 22 residues, 
forming a linker to subunit B. Subunit E (23 kDa) 
consists of two domains, a C-terminal domain with 
P-strands and short a-helices and a N-terminal domain, 
which is highly helical. For the C-terminal domain of 
E from Pyrococcus horikoshii an X-ray structure (PDB 
2DM9) at a resolution of 1.8 A is available, which 
shows that this domain is forming a tight dimer. Solu¬ 
tion X-ray scattering analyses of subunit H (12.2 kDa) 
revealed a “boomerang” shape with two arms of 68 and 
120 A length. Secondary structure prediction programs 
like PSIPRED showed for subunit H two helical 
domains, a short helix at the C-terminal end and 
a bigger N-terminal helix of 60 amino acids, which 
fits excellently with the solution X-ray scattering data 
(Vonck et al. 2009). Furthermore, it was shown for 
T. thermophilus that subunits E and H can form 
a complex (Lee et al. 2010). So far, no structures of 
subunit a (75.5 kDa) and subunit c exist. Subunit a 
contains a hydrophilic N-terminal domain formed by 
360 amino acids and a hydrophobic C-terminal domain 
formed by 300 amino acids. It is predicted that the 
hydrophilic N-terminal domain is made by two long 
a-helices as well as shorter helices, P-strands, and 
loops. A closer look at the amino acid sequence of 
subunit a showed repeating structural elements. At 
four positions in the amino acid sequence there is 
a short predicted a-helix flanked by two P-sheets 
suspecting that the hydrophilic domain of subunit a, 
which has a loop-like structure formed by the two 
long a-helices, is connected with the peripheral stalks 
mediated by the two similar sets of Pap domains. The 
hydrophobic C-terminal domain is predicted to span 
the membrane 7-8 times. 

c Subunits in Archaea 

An outstanding feature of archaea is their great diver¬ 
sity in size and number of ion-translocating residues in 
their rotor subunits (c subunits), c subunits have been 
purified and characterized from a few archaea showing 
that most of them have a c subunit with a molecular 
mass of 8 kDa, containing one hairpin, formed by two 
transmembrane helices, with one ion binding site, 
comparable to the c subunit architecture from bacteria. 


This similarity of c subunits was always considered to 
be the reason for the ability of AiAq ATP synthases to 
synthesize ATP, despite their homology to ViVq 
ATP ases. Important for the function of AiAq ATP 
synthases is the number of ion-translocating residues 
per rotor unit. The ability to synthesize ATP is directly 
dependent on the number of translocated ions per syn¬ 
thesized ATP. According to the equation: 

AGp = —n X F X Ap 

A phosphorylation potential (AGp) of ^50-70 kJ/mol 
is sustained by the use of « = 3^ ions/ATP at 
a physiological electrochemical ion potential of 
— 180 mV (Ap). According to this equation, a rotor 
with 12 ion binding sites and a catalytic Aj domain 
with three A/B pairs and therefore three ATP binding 
sites, gives exactly the number of four ions required for 
ATP synthesis, which is the case in most of the archaea. 
Surprisingly, not all archaea have a Fo-like c subunit. 
For example, Methanothermohacter thermoauto- 
trophicus and Methanocaldococcus jannaschii have 
duplicated and triplicated c subunits with two and 
three hairpins, respectively. Both arose by gene dupli¬ 
cation and triplication, followed by a fusion of genes 
(Muller and Gruber 2003). In M. thermoautotrophicus 
the ion-translocating carboxylate is conserved in helix 
two and four, whereas in M. jannaschii it is only con¬ 
served in helix four and six, in helix two it is substituted 
by a glutamine residue. In the special case of M. 
Jannaschii, the rotor has only 8 ion binding sites 
(assuming 12 hairpin domains per rotor). Apparently, 
2.6 carboxyl groups per catalytic center are sufficient 
for ATP synthesis (Ruppert et al. 1999). A rotor with 
only 6 or 7 ion binding sites as shown for ViVq 
ATP ases gives an ion/ATP stoichiometry of only 
2-2.3, which is too low to allow ATP synthesis at 
a phosphorylation potential (AGp) of ^50-70 kJ/mol 
and a physiological electrochemical ion potential of 
—180 mV (Ap). This is the reason why V iV q ATPases 
only hydrolize ATP. Moreover, the lower the number 
of ions, the greater the ion gradient established. This is 
why ViVq ATPases are splendid ATP-driven ion 
pumps. 

Another interesting example is the c subunit of 
Methanopyrus kandleri. The genome sequence of this 
archaeon revealed that the gene, encoding the c 
subunit, is 13 times the size of the gene encoding 
F-type like c subunit. The predicted molecular mass 
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ATP Synthases from Archaea: Structure and Function, synthase in top view, bottom view, and side view orientation. 
Fig. 5 Three-dimensional reconstruction of the AjAq ATP Scale bar represents 100 A (This research was originally 
synthase from P. furiosus. Surface representation of the ATP published in Vonck et al. 2009) 


of this unusual c subunit is 97.5 kDa and consists of 13 
covalently linked hairpins with 13 ion binding sites. An 
absolutely unexpected result was the finding of dupli¬ 
cated c subunits in Pyrococcus furiosus, Pyrococcus 
abyssi, and P. horikoshii. These c subunits contain two 
hairpins with four transmembrane helices, but only one 
ion binding site. Therefore they are identical to the c 
subunits from eukaryotes, which have the same archi¬ 
tecture. Assuming six to seven monomers per rotor as 
in ViVo ATPases of eukaryotes, the enzyme should 
only hydrolyze ATP, because an ion/ATP stoichiome¬ 
try of only 2.0 is too low for ATP synthesis. But 
recently, it was shown that pyrococci synthesize ATP 
by ion gradient-driven phosphorylation. The structural 
basis for ATP synthesis despite the presence of such 
a c subunit is given below. 

A Closer Look: The A-iAq ATP Synthase 
of P. furiosus 

P. furiosus, the rushing fireball, is a hyperthermophilic 
euryarchaeon with an optimal growth temperature of 
100°C. The AjAq ATP synthase from P. furiosus was 
isolated, visualized by electron microscopy, and 
a three-dimensional reconstruction was made from 
13,000 single particles (Fig. 5). The global structure 
of the AiAo ATP synthase had a resolution of 2.3 nm 
and showed a head domain, a central stalk, two periph¬ 
eral stalks, a collar-like structure, and a hydrophobic 
membrane-embedded domain. To localize the posi¬ 
tions of the different subunits in the global structure, 
known X-ray structures of the subunits were modeled 
into the electron density map (Vonck et al. 2009). The 


head of the AiAq ATP synthase had a strong resem¬ 
blance to the Al domain from M. mazei Gol (Coskun 
et al. 2004) and a pseudo-threefold symmetry. The head 
is connected to the central stalk and shows clear asym¬ 
metry. A central cavity was found in the Ai domain 
surrounded by six densities representing the three A and 
three B subunits. Three densities showed a bulge on the 
top, visible as a spike-like structure. These densities 
could be identified as the catalytic A subunits by model¬ 
ing the X-ray structure of subunit A of P. horikoshii 
(PDB IVDZ) into the three densities. The other three 
densities without the “spikes” were identihed as the 
subunits B by modeling the X-ray structure of subunit 
B of M. mazei Gdl (PDB 2C61) into these three densi¬ 
ties. Two of them are connected with the two peripheral 
stalks (Fig. 6). The central stalk is connected to the head 
domain and descends at an acentric position from the 
head to the Aq domain, inside the ring. The stalk con¬ 
sists of two subunits, a small knob-like structure above 
the collar and a bigger funnel-shaped structure starting 
inside the collar-like structure and going inside the ring. 
The X-ray structure of the C subunit of T. thermophilus 
(PDB 1R5Z) was localized in the map of P. furiosus to 
the lower part of the central stalk. The other subunits of 
the central stalk are D and F. The X-ray structure from 
P. furiosus of subunit F (PDB IQAI) hts into the central 
stalk above the C subunit. Subunit D was not visible 
in the global structure of the AiAq ATP synthase from 
P. furiosus, but is assumed to reside inside the head, as 
has been shown for the equivalent y subunit in FiFo 
ATP synthases. The stator is build up by two peripheral 
stalks and a collar-like structure. Both stalks connect 
the subunits B of the Ai domain with the collar-like 
structure, which is also connected with the Aq domain. 
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ATP Synthases from Archaea: Structure and Function, 
Fig. 6 Fitting of known X-ray structures into the electron den¬ 
sity map of Ai Ao ATP synthase from P.furiosus. (a-f), cut-open 
views of the map. (a-d), vertical views of the map, cut at the 
levels indicated in e. a, fitting of A 3 B 3 ; A from PDB file TVDZ 
(blue) and B from 2C61 (green), b, fitting of the F subunit 
(IQAI) (purple), c, fitting of the C subunit (turquoise) (1R5Z). 


d, fitting of the 20-hairpin E. hirae A-type c ring from 2BL2 
(gold), e and f, two peipendicular views along the membrane 
plane. The EC domain is fitted to the knobs from 2DM9 (pink). 
g, all fitted subunits in the same orientation as f. b, model 
for the stator with the a-subunit in yellow and the EFt dimers 
in pink (This research was originally published in Vonck 
et al. 2009) 


The connection between the peripheral stalks and subunit 
B contains prominent knobs, which are not connected to 
each other (Vonck et al. 2009). Presumably, each 
peripheral stalk consists of the subunits E and H as 
shown for the A-type ATP synthase of T. thermophilus 
where the peripheral stalks are build up by subunits E 
and G (Lee et al. 2010). Subunit G of T. thermophilus is 
equivalent to the subunit H of P.furiosus. 

The collar-like structure runs in a distance of 30 A 
parallel to the Aq domain and is represented by the 
N-terminus of subunit a. The Aq domain shows a clear 
substructure and has an oval shape. The cytoplasmic 
half is hollow and there is a shallow indentation oppo¬ 
site the hole on the extracellular side. These features 
are likely to face to the center of the c ring. Outside of 
the ring another mass is visible, which is connected to 
the collar and contains the hydrophobic domain of 
subunit a. As outlined above, the critical question is 
how the Al Aq ATP synthase of P. furiosus is able to 
synthesize ATP despite its V-type c subunit. The 
answer: by an increase in number of protomers and 
thus size of the c ring. The density in the Aq domain 


nicely fits the Cjo ring of Enterococcus hirae, that has 
the same c subunit architecture as P. furiosus and 
whose structure was solved. In addition, spectroscopic 
analyses gave direct evidence for a Cjo ring in 
P. furiosius. Spectroscopic analyses also revealed 
a molecular mass of 730 kDa +!— 10 kDa for 
the complex that hts well the stoichiometry of 
AsBaCDEjEHzacyo (Vonck et al. 2009). 

The Function of the A^Aq ATP Synthase 

EiEo ATP synthases and ViVq ATPases function as 
a pair of coupled motors, a hydrophilic and 
a membrane-bound, ion-translocating motor in the 
membrane. Ion flow through the membrane-bound 
motor leads to rotation of the rotor. The same principal 
function can be assumed for AiAq ATP synthases. In 
bacteria, the driving force for rotation can be the pro- 
ton- or sodium-motive force. So far, the ion specificity 
has been addressed for only a few archaeal species. 
The enzyme from P.furiosus requires Na"^ for activity. 
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and it contains a conserved Na"^ binding motif in the 
c subunit (Pisa et al. 2007b). The same is true for the 
AiAo ATP synthases from methanogens, indicating 
that they also may use Na"^, but evidence for this is 
still lacking (Muller et al. 1999; Pisa et al. 2007a). 

Summary 

The archaeal ATP synthase (AiAq ATP synthase) is 
a macromolecular complex that contains nine subunits 
and consists of a catalytic A; domain and 
a transmembrane, ion translocating Aq domain. The 
molecular mass of 730 kDa +/— 10 kDa of the whole 
enzyme was determined with LILBID-MS for 
P. fuiiosus, an archaeal model organism. A three- 
dimensional reconstruction was made after electron 
microscopy from negatively stained images. The map 
had a resolution of 2.3 nm and shows the A; and Aq 
domain, connected by two peripheral stalks and 
a central stalk. Both peripheral stalks are connected 
to the Al domain via prominent knobs and one 
peripheral stalk is connected to Aq. Solved X-ray 
structures of the subunits were fitted into the map. On 
the basis of the fitting and the LILBID-MS data, 
a global structure is presented with the stoichiometry 
A 3 B 3 CDE 2 FH 2 aCio. Some AiAq ATP synthases may 
use Na"^ as coupling ion, others H"^. 

Cross-References 

► ATPase Complex that Forms a Proton Gradient 
Across Vacuolar Membrane 

► ATPase: Overview 

► Membrane Proteins: Structure and Organization 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 

► X-Ray Scattering of Lipid Membranes 
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Synonyms 

Complex V; FiFq-ATP synthase; F-type ATPase; 
Ion-driven rotary motor; Molecular motors 

Definition 

Adenosine triphosphate (ATP) synthesizing enzyme 
by usage of the proton- (sodium-) motive force. 




A 


130 


ATP Synthase Structure 


Introduction 

Cells need a constant supply of chemically available 
energy to perform their various cellular activities. 
Adenosine triphosphate (ATP) is the universal energy 
supply in all known life forms. This negatively charged 
compound is impermeable for cell membranes and 
therefore it needs to be constantly produced by the 
cell itself. The FiFq-ATP synthase is a nanosized 
rotary molecular machine, which produces the bulk 
supply of ATP. This enzyme is found in mitochondria, 
chloroplasts, bacteria, and also in the archaeal king¬ 
dom (► ATP Synthases from Archaea: Structure and 
Function). The overall architecture of the enzyme 
comprises two opposing motors, one being the 
membrane-embedded Fq complex and the other the 
water-soluble Fi complex (Fig. la). The origin of 
ATP synthases occurred very early in evolution as 
ATP is the basic energy source for cells. Evolutionary 
considerations suggest that this family of enzymes 
originated from membrane-bound RNA translocases 
(Mulkidjanian et al. 2007) and remarkably, since 
then, the basic principle of these molecular motors 
has been conserved over most stages of cellular 
development. 

The two motors are driven by two different energy 
sources. An electrochemical gradient of ions (proton- 
motive force,/?«?/= 2.3RT/F x ApH -i- A^) drives the 
Fq motor in the direction of ATP synthesis. ATP 
synthase manufactures ATP by the consumption of 
the pmf or, vice versa, when ATP is hydrolyzed, the 
Fi motor drives ion pumping through the Fq complex 
against the membrane gradient. Both Fi and Fq, are 
tightly coupled and can exchange energy with each 
other by a rotating, central stalk structure, which acts 
mechanically as a camshaft. The motors are held 
together by an eccentric peripheral bearing, which is 
responsible for stabilizing their outer static connection. 

The water-soluble Fj portion in the functional core 
is universally made up of the subunits (and stoichiom¬ 
etry) otsPsySe. Fi also harbors three catalytic sites for 
ATP synthesis, located within three P-subunits. Fi 
interacts with the cell membrane-embedded Fq com¬ 
plex. In the most simplified version of the enzyme, 
found in bacteria, Fq comprises three subunits in 
a stoichiometry ab 2 Cn (« = 8 , 10-15). The overall 
stoichiometry of a fully assembled bacterial enzyme 
can therefore be given as a 3 P 3 y 5 Eab 2 Cn (eight different 
proteins) forming a minimum complex of 22 proteins 
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ATP Synthase Structure, Fig. 1 Mosaic structural model 
of an F-type ATP synthase. The subunits are shown in 
different colors, (a) Flolo-enzyme, view horizontally to the 
membrane plane; (b) open (a-subunits omitted) Fj complex 
showing the asymmetric y-subunit protruding into the a 3 P 3 
complex and the nucleotide binding pocket on one of the (3- 
subunits. An ADP molecule is shown bound in this pocket 
(shown in sphere representation), (c) View from the periplasmic 
side of the (bacterial) cell illustrating the position of the rotor 
(blue) with respect to the stator {yellowlhrownlgreen). For the 
mosaic model, the following structures were used from the 
Protein Data Base (PDB-IDs): 1E79 (Gibbons et al. 2000), 
1B9U (Dmitriev et al. 1999), 2WIE (Pogoryelov et al. 2009), 
2A7U (Wilkens et al. 2005) and 3K5B (Lee et al. 2010). The 
model for the a-subunit is shown as a brown ellipsoid (no high- 
resolution structure known). The membrane is indicated as a bar 
in gray tones. The mosaic model shown here was composed of 
bovine {Bos taurus) Fi subcomplex, a bacterial Cn rotor ring 
(n = 15 for Spirulina platensis), an 6-subunit from Escherichia 
coli as well as parts of the outer stalk (bacterial b 2 ) from the 
V-/A-type ATP synthase (Thermus thermophilus) and E. coli 
(membrane part). The nomenclature is according to bacterial 
F-type ATP synthases 

(e.g., in Escherichia coli) with a minimum total molec¬ 
ular mass of approximately 530 kDa. From a mechan¬ 
ical point of view, the enzyme is divided into two 
mechanically interacting motor parts: the rotor consists 
of the subunits ysCn and the stator comprises the sub¬ 
units a 3 P 35 ab 2 . 

Structure of the Complex 

The first structure forming a large part of the Fj 
subcomplex (subunits 013 P 3 y') from bovine heart mito¬ 
chondria was determined to 2.8 A using X-ray crystal¬ 
lography by John E. Walker and his colleagues in 1994 
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(Abrahams et al. 1994). The complex was shown to 
consist of an assembly of three alternating a- and P- 
subunits forming a hexameric headpiece around the 
protruding y-subunit (see also: ► Fi-ATPase: Funda¬ 
mental Properties and Structure). This Fi complex has 
an overall height of 125 A and a diameter of 115 A 
(Fig. lb). The y-subunit forms an asymmetric, right- 
handed coiled-coil of two a-helices. The a- and P- 
subunits share a high primary sequence similarity and 
also have a similar tertiary structure. Each of the three 
a- and p-subunits is composed of three protein 
domains, an N-terminal, six-stranded p-barrel, a cen¬ 
tral domain (a-helices and P-barrel), and a C-terminal 
a-helical domain. The P-subunits accommodate the 
catalytically active centres of the enzyme. In the crys¬ 
tal structure, each of the three P-subunits was resolved 
into three different states, depending on the position of 
the asymmetric y-subunit, resulting in an empty (Pe), 
loose (Pdp, contains ADP and Pj), or tight (Ptp, con¬ 
tains AMPPNP, a non-hydrolyzable ATP analogue) 
conformation. The nucleotide-binding sites of the P- 
subunits are characterized by a common nucleotide¬ 
binding fold, reflected by the so-called Walker 
A (GxxxxGK(T/S)) and Walker B motif 
(R/KxxxGxxxL/VhhhhD) (Walker et al. 1982). These 
motifs, together with a set of other typical nucleotide¬ 
binding sequences, are most prominent motifs and 
found in many ATPases (► CFTR, Ion Channel 
Evolved from ABC Transporter). Eurthermore, 
a conserved amino acid motif in the p-subunits, the 
DELSEED-motif, is involved in contacting the 
s-subunit and can regulate ATP synthase activity, 
depending on the C-terminal a-helix position of the 
£-subunit in the e-ATP free or e-ATP bound state 
(Eeniouk and Yoshida 2008). 

The features of the Ei complex seen in the structure 
are completely in accord with Paul Boyer’s earlier 
prediction of the binding change mechanism (Boyer 
1997). The P-subunits are, at any time-point of enzyme 
operation, in three different conformations and the 
y-subunit, as a rotating entity, causes the conforma¬ 
tions to switch from one to another. The y-subunit is 
intrinsically asymmetric and based on its position 
within the a 3 P 3 headpiece, the conformation of the 
P-subunits is defined. The N-terminal p-barrel region 
is in close contact with subunit y and with the c-subunit 
oligomer from the Eq sector. Determining the Ei struc¬ 
ture together with the binding change mechanism of 
the enzyme was a milestone in ATP synthase research 


and culminated in the Nobel Prize awarded to John E. 
Walker and Paul D. Boyer in 1997. 

The outer stalk of the ATP synthase is made of only 
two subunits (b and §) in bacteria. It is more complex 
and has a different nomenclature in mitochondrial ATP 
synthases. The outer stalk was first visualized by elec¬ 
tron microscopy (Wilkens and Capaldi 1998). The role 
of this structural element in the rotary mechanism is to 
form a stable connection between the stator subunit in 
the membrane (a-subunit) and the a 3 P 3 stator head- 
piece in the Ei complex to ensure that the Ei head 
cannot move with the rotating y-subunit. Detailed 
structural information about the soluble part of the 
outer stalk has recently become available both for the 
more complex bovine mitochondrial (Rees et al. 2009) 
and for a bacterial (Lee et al. 2010) ATP synthase. In 
both cases, the a-helical secondary structures are dom¬ 
inant. The outer stalk of the bacterial ATP synthase 
seems straighter than the more bent bovine one. The 
b-subunits are connected with the §-subunit on the top 
headpiece of the Ej complex. In the membrane Eq 
complex, subunit b interacts with both the a-subunit 
and the rotor c-ring. 

Structure of the Fq Complex 

The Eq complex is the membrane-embedded motor of 
the EiEq-ATP synthase (Fig. la). Its minimal compo¬ 
sition comprises the subunits ab 2 Cn. Subunits a and b 2 
belong to the stator part and contact the c^ rotor from 
outside. Ion translocation through the Fq complex is 
thought to involve the a-subunit and the c-ring and 
couples the downhill ion flow across the membrane 
with torque generation within the Fq rotor (Junge et al. 
2009). The torque produced is transferred via the cen¬ 
tral stalk subunits y and £ into the Fi complex and used 
for ATP synthesis. The vast majority of organisms 
utilize an electrochemical proton gradient for coupled 
ATP synthesis, while a selection of anaerobic bacteria 
rely on Na’^-driven ATP synthases for the production 
of ATP. The preferential selectivity of ATP synthases 
for protons or sodium is determined by the Fq domain, 
essentially by its rotor complex. However, both classes 
of ATP synthase rotors share a similar structural 
assembly and mechanism of action. 

The Fq rotor is comprised of several copies of 
c-subunits, assembled into a ring-shaped complex. 
Each of the c-subunits harbors a conserved carboxylic 
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residue (Asp or Glu), which is the key element of the 
ion-binding site and is located close to the middle of 
the membrane bilayer. The c-subunit is an 8-9 kDa 
highly hydrophobic protein, consisting of two mem¬ 
brane-spanning a-helices and is connected by a short 
polar loop (hairpin). The number of c-subunits defines 
not only the size of the ring, but also the number of ions 
that a c-ring can bind. Each species has a defined 
c-subunit oligomeric composition and has a constant 
number («) of c-subunits per c„ ring; however, the 
stoichiometry («) is varies among different species 
(known to date: n = 8 , 10-15) and allows them to 
adapt the ion-to-ATP ratio of the enzyme to the 
specific bioenergetic demands of the cell. 

The high-resolution structures of the Fq rotors from 
Ilyohacter tartaricus, Spirulina platensis, and Bacillus 
pseudofirmus OF4 (Meier et al. 2005; Pogoryelov et al. 
2009; Preiss et al. 2010) revealed their atomic details, 
including that of their ion-binding sites (Fig. 2a-c). 
The rings are formed by two concentric rings of 
a-helices (11 in /. tartaricus, 13 in 5. pseudofirmus 
OF4, 15 in 5. platensis) forming the inner and outer 
ring of the a-helices in a staggered position. In the 
middle of the membrane, the helices bend toward the 
center of the ring, defining the narrowest part of the 
ring and resulting in an overall hourglass-shaped 
appearance, which is sometimes less pronounced 
and more closely resembles a straight cylinder (e.g., 
B. pseudofirmus OF4). 

The ion-binding site is located within the cleft 
between two adjacent c-subunits at the point where 
the helices kink. The key glutamate of the binding 
site is oriented in the cleft toward a group of ligands, 
together forming a tight hydrogen (or Na"^) bonding 
network around the bound ion (H"^ or Na"^). This ori¬ 
entation of glutamate is thermodynamically highly 
stable in a hydrophobic environment and prevents the 
ion from being released; therefore, this state is referred 
to as the ion-locked conformation (Meier et al. 2005; 
Pogoryelov et al. 2009; Preiss et al. 2010). The binding 
site of the I. tartaricus c-ring, in contrast to the proton¬ 
binding site of the 5. platensis or B. pseudofirmus 
c-ring, preferentially binds Na"^ instead of H"^. In the 
Na"^ bound state, the site involves a total of five ligands 
coordinating the sodium ion (three and four ligands 
more than in the proton binding sites of S. platensis 
and B. pseudofirmus OF4, respectively), including 
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ATP Synthase Structure, Fig. 2 The rotor rings in the Fq 
complex of ATP synthases. View on the three types of ion¬ 
binding sites in F-type ATP synthases: (a) The Na'^-binding 
site of the Ilyohacter tartaricus Cn ring; (b) the tT^-binding 
sites of the Spirulina platensis C 15 ring; and (c) the Bacillus 
pseudofirmus OF4 0,3 ring. The hydrogen-bonding network is 
indicated by dashed lines and the NaTwater molecules are 
shown with yeUowjred spheres, respectively, (d) Microscopic 
model of proton translocation and coupled rotation in the Fq 
complex from 5. platensis. Schematic representation of the dif¬ 
ferent states envisaged by the binding sites as the c-subunits 
(blue) face the lipid membrane or the hydrated interface of the 
a-subunit (yellow). The view of the interface is tilted from 
the membrane plane ^20° toward the reader. The direction 
of the rotation in the figure (indicated by the grey aiTow) is that 
in the ATP synthesis mode; the model is reversible (ATP hydro¬ 
lysis mode) in opposite direction 

a structural water molecule (Meier et al. 2009). The 
Na"^ site is considerably more spacious and can accom¬ 
modate a larger ion (than H"^). The overall arrangement 
of the binding site confers the selectivity for Na"^ ver¬ 
sus (Pogoryelov et al. 2009; Krah et al. 2010). 

The Fq rotor structures mentioned above represent 
the structures present at the c-ring/lipid interface, 
that is, the position of the rotor outside the contact 
region with the stator subunit a. The ion translocation 
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mechanism also involves the a-subunit (Fig. 2d), for 
which high-resolution structural data are not yet avail¬ 
able. The a-subunit is thought to provide two ion 
access pathways to the binding sites of the c-ring 
from either side of the membrane (Junge et al. 2009). 
These two pathways are functionally isolated by an 
electrostatic separator - the conserved positively 
charged arginine residue, which prevents a short- 
circuit between the two pathways. Furthermore, the 
interface between the a-subunit and the c-ring is polar 
and likely to be partially hydrated. This interface pro¬ 
vides energetically favorable conditions for unlocking 
the ion-locked conformation of the binding site and 
therefore facilitates ion exchange (Pogoryelov et al. 
2010). The key premise of the ion translocating mech¬ 
anism is that the ions (IT^ or Na"^) can only be 
exchanged within the polar/hydrated interface of 
subunit a and the c-ring, while in the hydrophobic 
environment of membrane lipids the ion-binding site 
persists only in the ion-locked state, whereas ion 
release is thermodynamically prohibited. 

The mechanism is fully reversible and intrinsically 
stochastic; the direction of the rotation is biased by the 
operational mode of the enzyme, that is, by the applied 
driving force: ATP in hydrolysis mode or the ion- 
motive force in synthesis mode. In the latter, the ions 
from the periplasm are delivered to the binding site via 
the corresponding access pathway in the a-subunit. As 
the c-ring rotates anticlockwise (if viewed from the 
cytoplasmic side), the ions are transported through 
the hydrophobic core of the lipid bilayer, to the oppos¬ 
ing side of the rotor-stator interface, and are released 
via the exit pathway into the cytoplasm. In this way, 
the ion translocation path across the membrane is com¬ 
pleted. It generates torque within the Fq rotor, which is 
further transduced by the central stalk subunits y/s into 
the Fi catalytic domain for completion of the catalytic 
cycle of ATP synthesis. 

Summary 

The structure of the ATP synthase consists of the 
water-soluble Fi and the membrane-embedded Fq 
complex. The Fq and Fi complexes each represent 
a molecular motor, which are able to exchange energy 
by a rotational coupling mechanism. The Fi complex 


with the three catalytic sites on the three P-subunits 
is connected via the y- and e-subunits (central stalk) to 
the membrane-embedded c-oligomer of the Fq com¬ 
plex, and via the outer (or peripheral) stalk S- and 
b 2 -subunits to subunit a. Mechanistically the enzyme 
can be divided into the rotor, which consists of sub¬ 
units yeCn (n = 8, 10-15), and the stator (assembled 
from subunits a 3 P 35 ab 2 ). During ATP synthesis, ions 
(H"^, sometimes Na"^ (Dimroth 1997)) pass by the Fq 
complex, causing the rotor to rotate (Noji et al. 1997). 
Rotation is mechanically transduced into the Fi head- 
piece (a 3 p 3 ) by the inherently asymmetric y-subunit 
and causes conformational changes within each of the 
three P-subunits, which are in three different confor¬ 
mations at any point in time. This process finally leads 
to the formation (and release) of ATP from ADP and P; 
and in sum, this molecular machine converts electrical 
(ion gradient) to mechanical (rotor) to chemical (ATP) 
energy (Junge et al. 2009). The enzyme is also able to 
operate in the opposite direction, to act as an uphill ion 
pump using the energy released by the hydrolysis of 
ATP, similar to the V-type ATPases (► ATPase Com¬ 
plex that Forms a Proton Gradient Across Vacuolar 
Membrane). 

Cross-References 

► ATP Synthases from Archaea: Structure and 
Function 

► ATPase Complex that Forms a Proton Gradient 
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► ATPase: Overview 
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ATPase Complex that Forms a Proton 
Gradient Across Vacuolar Membrane 

► Vacuolar-Type ATPases in Animal and Plant Cells 


ATPase: Overview 

Robert Ishmukhametov 

Department of Physics, University of Oxford, 

Oxford, UK 

Synonyms 

FiFq-ATP synthase; FqFi-ATP synthase; Molecular 
motors; Rotary motor 

Definition 

ATPase is a conventional name for a large group of 
membrane proteins, which couple ATP hydrolysis with 
a vectorial translocation of cations across a lipid 
hilayer. In general, these enzymes have two moieties. 
A moiety exposed into a liquid phase (cytoplasm, 
mitochondrial lumen, etc.) has an ATP-hinding 
site(s), which undergo(es) substantial conformational 
change(s) upon binding of ATP, cleaving it and releas¬ 
ing ADP and Pi. These structural rearrangements 
within the moiety in turn force the protein’s second 
moiety, which is a transmemhrane part, to transport 
ions across a lipid bilayer. 

A superfamily of P-type ATPases represents mono¬ 
subunit proteins with a broad specificity of cations 
pumped, including H"^, Na"^, K"^, Ca^"^, and ions of 
many metals. Overall feature of these enzymes’ cata¬ 
lytic cycle is a transient phosphorylation of an impor¬ 
tant amino-acid residue during intermolecular 
domains’ rearrangements. The most studied enzymes 
of the family are H'^'-ATPase, Na'^'/K'^'-ATPase, and 
Ca^'^'-ATPase, whose structures have been resolved. 

The other three evolutionarily related families of 
ATPases share a lot of common features and are 
sophisticated multi-subunit machines. These unique 
proteins reversibly convert electric energy of ApH"^ 
or ApNa"^ (transmemhrane electrochemical gradient 
of H’’’ or Na’’’) into chemical energy of ATP. Function¬ 
ally, these ATPases are two coaxial rotary motors 
sharing a common shaft. The first (chemical) motor is 
water soluble and has 3 ATP-hinding sites and operates 
via a quite well-understood cyclic “binding change 
mechanism” During the cycle, large conformational 
rearrangements within the complex rotate the shaft 
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stepwise. These conformational changes were charac¬ 
terized structurally as well as by functional single 
molecule studies. The second (electrical) motor is 
a transmembrane turbine rotated by a flux of ions. 
The torque by this motor originates at a moment 
when ions get delivered between the motor subunits. 
The second motor has not been structurally fully char¬ 
acterized so far. Since both motors generate an oppo¬ 
sitely directed torque, an overall direction of rotation 
depends on which motor prevails, what in turn depends 
on cell physiology. 

F-type ATPases, or FiFq-ATP synthases, are the 
best understood and are found in all live organisms 
(excluding some obligatory intracellular parasites) 
where they produce most of cell ATP. Often F-type 
enzymes work only in ATP synthesis but not in ATP 
hydrolysis direction. 

V-type ATPases, or ViVo-ATPases, acidify intra¬ 
cellular compartments {vacuoles, etc.) by ATP hydro¬ 
lysis. Main feature of these enzymes is that their 
chemical motor easily and reversibly dissociates from 
the membrane motor under some physiological 
conditions. 

A-type ATPases, or AiAo-ATPases, are found 
mostly in Archaea and are the least understood but 
structurally the most complicated family. 

Cross-References 

► ATP Synthases from Archaea: Structure and 
Function 

► ATPase Complex that Forms a Proton Gradient 
Across Vacuolar Membrane 

► Fi-ATPase: Fundamental Properties and Structure 

► F-Type ATPase 


ATP-Driven Mechanical Work Performed 
by Molecular Motors 


Alf Mansson 

School of Natural Sciences, Linnaeus University, 
Kalmar, Sweden 


Synonyms 

Molecular motors 


Introduction 


Most biological ► molecular motors perform work by 
utilizing the free energy released in the turnover of 
adenosinetriphosphate (ATP). This free energy, 
AGatp (expressed per molecule in terms of the 
Boltzmarm factor, kT), is attributed to the instability of 
the y-phosphate bond and varies with the concentrations 
of substrate (ATP; or rather MgATP) and product 
(ADP and inorganic phosphate, P;) as follows: 


AGatp 


AGatp° 


-t-kTln 


[ADP] [Pi] 
[ATP] 


where AGatp° ~ —14 kT is the standard free energy. 
Under physiological conditions, [ATP] « 1-10 mM, 
[ADP] « 10 pM, and [P;] « 1 mM, i.e., the concen¬ 
trations are far from equilibrium (due to oxidation of 
food components), a precondition for the high power 
output and efficiency of molecular motors. 

Whereas molecular motors catalyze ATP turnover, 
they differ from other enzymes in that the local struc¬ 
tural changes around the active site are amplified 
through the protein structure into large amplitude 
structural changes (Cross 1997). Here we focus on 
the “linear motors” ► myosins, ► kinesins, and 
► dyneins where the structural changes around the 
active site cause the tip of a lever arm-like component 
to move typically ^10 nm (Rayment et al. 1993a; 
Cross 1997; Vale and Milligan 2000; Howard 2001; 
Spudich 2011) leading to production of force 
and movement in the interaction with linear protein 
tracks. 


Processive and Non-processive Motors 

Effective transportation of single molecular motors 
and their cargoes along protein tracks is possible for 
“processive” motors that generally move several steps 
before detaching. However, more often, motors trans¬ 
port intracellular cargoes or develop forces in teams. 
This behavior is essential for “non-processive” motors 
where each motor domain (head) only spends a small 
fraction of its ATPase cycle attached to the track, 
with significant risk of diffusing away before binding 
to a neighboring site. Most ► kinesins and cytoplas¬ 
mic ► dyneins as well as several ► myosins (e.g., 
myosin V, VII, X) are processive. In contrast, the 
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“conventional” muscular and non-muscular myosin 11, 
as well as myosin I, are generally classified as non- 
processive. 

Myosins 

The ► myosin motors (Sellers 1999) are 
heteromultimeric proteins with heavy chains, light 
chains, etc. The ► ATPase and actin-binding sites in 
the N-terminus of the heavy chain (the motor domain) 
have well-conserved amino acid sequences and 
a similar set of characteristic structural motifs in all 
myosins. Important motifs at the nucleotide-binding 
pocket (Rayment et al. 1993b, 1996) are the switch-I, 
switch-II, and P-loops as well as the relay helix. These 
coordinate nucleotide binding and relay effects of dif¬ 
ferent steps in the ATP turnover, e.g., phosphate 
release, to distant structural changes, e.g., modulated 
actin affinity and altered orientation of the myosin 
neck region (the lever arm). The latter a-helical region 
that is stabilized by different number of light chains 
in different myosins connects the motor domain to the 
C-terminal tail domain. The tail domain is variable in 
structure and amino acid sequence among different 
myosins but generally has a ► coiled-coil region for 
dimerization of two heavy chains. In myosin II, the 
coiled-coil extends for >150 nm, of critical impor¬ 
tance for polymerization, e.g., into the highly ordered 
thick filaments of the heart and skeletal muscle sarco¬ 
meres allowing myosin motors to work efficiently in 
teams. 

Kinesins 

Different ► kinesin motors (Howard 2001; Gennerich 
and Vale 2009) have their motor domain either in the 
N-terminal, the middle, or the C-terminal of the kinesin 
heavy chain. Like in myosins, the motor domain with 
ATPase and microtubule-binding sites is well con¬ 
served. The ATPase site has substantial similarities 
with that of myosin and G-proteins with switch-I, 
switch-II, and P-loop motifs (Kull et al. 1996) as well 
as a central beta-sheet-dominated structure. In analogy 
to myosin, structural changes around the ATP-binding 
site are coupled to the swing of an extension from the 
motor domain, “a neck-linker” and to kinesin- 
microtubule affinity changes. The neck-linker cortnects 


the motor domain to the stalk and positions, upon its 
swing, the partner head of dimeric kinesins for binding 
to the next tubulin heterodimer, 8 nm along the fila¬ 
ment. Other mechanisms apply for monomeric kinesins 
(although these might dimerize in vivo) (Vale 2003). 

Dyneins 

The ► dynein motors, driving ciliar and flagellar beat¬ 
ing and intracellular transport toward the microtubule 
minus end (Vale 2003; Gennerich and Vale 2009; 
Spudich 2011), have different evolutionary origin and 
different mechanisms of chemomechanical energy 
transduction compared to the kinesin and myosin 
motors. Dynein belongs to the AAA"^ ATPases and 
the core of each globular motor domain in the dynein 
dimer is a ring-like structure formed by six domains or 
subunits. Here, four domains bind ATP with one 
domain as main ATPase site. On the opposite side of 
the molecule, several nm from the ATPase site (Carter 
et al. 201 1), an ^15-nm-long coiled-coil domain (“the 
stalk”), terminates in a microtubule-binding domain. 
Both the angle of this extended domain relative to the 
AAA-ring structure and the microtubule affinity vary 
allosterically (by unknown mechanism) during ATP 
turnover, causing synchronized changes in orientation 
and microtubule affinity. 

ATP-Driven Mechanical Work by Molecular 
Motors: Myosin as an Example 

Effective ATP-driven power output by molecular 
motors is achieved by cyclic changes in motor lever 
arm orientation and track affinity (cf. modified Lymn- 
Taylor scheme (Lymn and Taylor 1971), Fig. 1) appro¬ 
priately synchronized by steps in the ATP turnover 
(e.g., release of phosphate) through propagation of 
local structural changes in the active site (Rayment 
et al. 1993a, b, 1996). This general coupling principle 
is typical of any motor, whether processive or non- 
processive or whether of the myosin, kinesin, or dynein 
types. Of particular importance in actomyosin is accel¬ 
eration of phosphate release upon myosin binding to 
actin, the associated power-stroke and subsequent 
detachment of myosin heads from actin upon ATP 
binding to the myosin active site. A critical feature as 
a basis for unidirectional motion is the polarity of the 
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ATP-Driven Mechanical 
Work Performed by 
Molecular Motors, 

Fig. 1 The Lymn-Taylor 
kinetic scheme and modified 
structural model. Length of 
arrows related to the rate of the 
transition 
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microtubule and actin tracks, only allowing motors to 
attach so that structural changes cause unidirectional 
force and motion. Processive motors generally (Vale 
2003) have two motor domains that step sequentially 
along the track. For this process to be effective, coor¬ 
dination of the two heads is essential. This is achieved 
by inter-head communication where tension, imposed 
on one head by the other, alters the rate of key transi¬ 
tions in the ATP turnover. For instance, tension from 
a leading head in a pair of track-attached motor 
domains of myosin V accelerates ADP release from 
a trailing head allowing subsequent ATP binding and 
rapid head detachment from the track (Sellers and 
Veigel 2006). Associated with such detachment, the 
lever arm swing by the leading head is facilitated by 
loss of the resisting load caused by the trailing head. 
A typical feature of processive motors is that each 
motor domain, when isolated from its partner head, 
spends a large fraction of its ATPase cycle time 
attached to its track, i.e., it has a high duty ratio. 

Load-sensitive rate constants for transition between 
distinct biochemical states (with either ATP, ADP, 
and/or Pj at the active site) are critical also for the 
optimized function of non-processive molecular 
motors acting in large teams (Huxley 1957; Huxley 
and Simmons 1971) and indeed for several non-motor 
proteins (Howard 2001). Conversely, different bio¬ 
chemical motor states are associated with different 
mechanical properties and there is also fair amount of 
evidence that a given biochemical state corresponds to 
a given structural state (i.e., with a given equilibrium 
orientation of the lever arm, neck-linker, etc.). A one- 
to-one correspondence between biochemical, mechan¬ 
ical, and structural states is, for instance, illustrated by 
the model in Fig. 2 (adapted from Mansson 2010) 


together with free-energy diagrams (Eisenberg and 
Hill 1978) to take into account the presence of an 
elastic element in the myosin head. 

The collective action of a large ensemble of non- 
processive myosin II molecular motors forms the basis 
for the Fenn effect (increased heat output during active 
shortening) and the force-velocity relationship of 
muscle, i.e., the characteristic automatic adjustment 
of steady-state shortening or lengthening velocity to 
the applied external load. The relevant motor ensemble 
in this connection is the myosin molecules in the thick 
filaments of a half-sarcomere over the muscle cross 
section, interacting cyclically with the ordered ensem¬ 
ble of actin filaments. However, ensembles of molec¬ 
ular motors also need to be considered in the in vivo 
function of processive motors of all kinesin, myosin, 
and dynein types. 

The behavior of a large ensemble of motors can be 
treated by statistical models based on state transition 
probabilities deductible from experimental data 
(Eisenberg and Hill 1978; Mansson 2010). Simulations 
based on models of this type with structural, biochem¬ 
ical, and mechanical states similar to those in Fig. 2 
predict most aspects of muscle contraction. The con¬ 
traction observables are extracted from the model state 
probabilities and from state properties assigned 
(Eisenberg and Hill 1978; Mansson 2010) on basis of 
experimental information. The model assumes that, for 
a motor in a given biochemical state, there is a range of 
strains of the elastic element and thereby a range of 
transition rates between states. In terms of most statis¬ 
tical models, the Eenn effect and the reduced force with 
increased shortening velocity are associated with 
reduced average strain (and thereby reduced force) 
per motor and a reduced number of actin-attached 
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Free energy (kT) 



ATP-Driven Mechanical Work Performed by Molecular 
Motors, Fig. 2 Schematic illustration of biochemical, struc¬ 
tural and mechanical states of key importance for actorayosin 
motor function (cf. Mansson 2010). The free energy of each state 
(differently colored diagrams) is constant for actin-detached 
states (top) or varies parabolically (as a Hookean spring) with 
the distance (x) between the tip of the myosin lever arm position 
and the myosin-binding site. The position of the tip where force 

motors. In this connection, it seems important 
with increased ADP-release rate at low strain 
(cf. processive motors above) and thereby increased 
myosin detachment rate from actin. 

Differences and Similarities Between Motors 

As mentioned above there are appreciable structural 
similarities between the active sites for ATP turnover 
in myosin and kinesin. However, the coupling between 
chemical and mechanical states differs (Howard 2001). 
Thus, for kinesin, ADP release rather than phosphate 
release is appreciably accelerated by track binding. 
Moreover, affinity between kinesin and microtubules 
is high with ATP (unlike with actomyosin) but low 
with products (ADP, Pi) in the active site. Neverthe¬ 
less, actomyosin and microtubule-kinesin are similar 
in that different steps in the turnover of ATP are syn¬ 
chronized to changes in track affinity and orientation 
of the lever arm-like component. Moreover, inter-head 
communication is of key importance in two-headed 
kinesin motors to coordinate the action of the heads 
during processive walking along microtubules 


in the actomyosin-ADP* state is zero defines x = 0 nm. The 
biochemical and structural states start with the myosin ATP state 
at the upper right and ends with the actomyosin state with no 
nucleotide below horizontal axis. Arrows indicate transitions 
between biochemical states. A given biochemical state exists 
with different strains in the elastic element (different positions of 
the lever arm tip) 


(Howard 2001). The dyneins exhibit appreciable struc¬ 
tural differences from myosin and kinesin, and less 
detail is known about chemomechanical energy trans¬ 
duction. However, in similarity to the situation with 
kinesin and myosin, changes in the nucleotide species 
at the active site (Spudich 2011) allosterically change 
the track (microtubule) affinity. 

Some Unanswered Questions 

Brownian Motors Versus Power-Strokes 

In Brownian motors, external energy is used, not to 
generate force but to rectify thermal fluctuations, e.g., 
by structural asymmetry of a molecule or the periodic 
on-off switching of an asymmetric potential. In one 
version of the Brownian motor theme, it has been 
proposed that asymmetric potentials build up along 
polarized motor tracks, e.g., actin filaments, causing 
biased diffusion of motors between stable positions at 
subsequent monomers along the filament. This model 
is challenged by arguments that biased long-range 
diffusion between rather widely spaced actin sites 
would be too slow to account for the rate of force 
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ATP-Driven Mechanical Work Performed by Molecular 
Motors, Fig. 3 Huxley and Simmons (1971) model with an 
independent elastic element and three stable states with 


progressively increased binding affinity between the myosin 
head and actin. Two states shown 


development and the power output during skeletal 
muscle contraction. Moreover, the lack of obligate 
rotation of actin filaments around their long axes when 
propelled by myosin motors argues against biased dif¬ 
fusion of the motors along the helically arranged bind¬ 
ing sites. However, as further considered below, aspects 
of the Brownian motor concept are important also in 
more conventional, so-called, power-stroke models, 
where chemical energy is used explicitly to generate 
work. Thus, in power-stroke models, a biochemical 
change (e.g., release of product) is associated with 
a unidirectional structural change that shifts the part of 
the motor most distal from the track (e.g., the tip of 
a lever arm) a distance relative to the track. Over this 
“power-stroke distance,” the motor performs work 
against external loads. It is important to distinguish the 
power-stroke from the drag stroke where motors pro¬ 
duce force that counteracts the effect of the power- 
stroke, by having been moved (by other motors) into 
a region with compressed elastic elements. 

The Huxley (1957) model is an early realization of 
a power-stroke model with signihcant features in com¬ 
mon with the Brownian motor concept. Here, the myo¬ 
sin elastic element was assumed to be strained by 
thermal motion to reach an attachment point on the 
actin filament where it is captured only if it produces 
force in appropriate direction. The capture is powered 
by binding energy provided by turnover of ATP. 
A more complex power-stroke model, partly inspired 
by structural data (Huxley 1969), was proposed by 
Huxley and Simmons (1971). In this model, the myo¬ 
sin motor domain changes its orientation (cf. Lymn 
and Taylor 1971 ) relative to the actin filament in one or 
several transitions distinct from the actomyosin attach¬ 
ment step with stretching of an elastic element and 
increased affinity between actin and myosin for each 
transition (Fig. 3). Interestingly, this idea is in good 
agreement with conclusions reached on basis of recent 


molecular evidence (Rayment et al. 1993a). The orig¬ 
inal model of Huxley and Simmons (Huxley and 
Simmons 1971) had two stable attached states, where 
the transition from the low-force state to the high-force 
state was associated with straining of the elastic ele¬ 
ment by ^10 nm. However, the authors pointed out 
that more states are needed to account for the high 
power output by muscle. 

The Huxley and Simmons paper was a conceptually 
important breakthrough and their model has several 
attractive features. These include explanation of tran¬ 
sient phenomena in muscle contraction and 
a straightforward incorporation of the main ideas into 
molecular models based on the myosin X-ray structure 
(Rayment et al. 1993a). However, the model also raises 
critical questions. First, the localization of the elastic 
structure has not been unequivocally identified. Sec¬ 
ond, the recently found high stiffness of the myosin 
elastic element (2-3 pN/nm) imposes new constraints. 
Thus, even if subdivided into three individual steps 
(^3 nm each; corresponding to elastic energy: 
>2 X 3^/2 kT, i.e., > 9 kT), such an elastic element 
would not be extended by thermal energy (^ kT) at 
sufficient rate to account for, e.g., fast transient 
phenomena. 

To address this difficulty, a large number of 
mechanical states have been suggested without strong 
support in structural and biochemical data (discussion 
in Mansson 2010). It is therefore of interest to consider 
alternative views (Eisenberg and Hill 1978) where the 
elastic element is strained as a result of a localized 
chemical change (an Eyring-like model (Howard 
2001)) without simultaneous large-scale motions. It 
should be pointed out that also in this case, thermal 
fluctuations are important to provide the activation 
energy for the chemical change but only small ampli¬ 
tude fluctuations are required (cf. Eig. 4). In contrast, 
the original Huxley (Huxley 1957) model can be 
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ATP-Driven Mechanical 
Work Performed by 
Molecular Motors, 

Fig. 4 A low amplitude 
structural change in the motor 
domain close to the ATPase 
site {red arrow) may be 
amplified (by rotation of circle 
with cross) to strain an elastic 
element (tentatively bending 
of the lever arm) 



classified (Howard 2001) as being of the Kramers type, 
i.e., a large-scale diffusional straining of an elastic 
element precedes the chemical change. 

Also the model of Huxley and Simmons (Huxley 
and Simmons 1971) exhibits several characteristics 
of a Kramers model (however see Howard (2001)). 
Presently, molecular and structural studies, e.g., using 
X-ray diffraction, electron microscopy, etc., largely 
focus on the local chemical changes as a basis for 
force generation (leading the thoughts to an Eyring- 
like mechanism). Muscle mechanical studies on the 
other hand are strongly influenced by Huxley and 
Simmons and focus on large-scale structural changes. 
It would be of interest to merge these molecular and 
muscle mechanical views to greater extent. 

Two Motor Domains in Non-processive 
Motors 

Myosin II of muscle is a non-processive molecular 
motor with two motor domains. The evolutionary 
basis for this is unclear if the two heads are not acting 
in sequence as in processive myosin V. That this is not 
the case under, for instance, isometric contraction is 
clear due to steric constraints that prevent the second 
head from attaching if the first head is already bound to 
actin (Vale and Milligan 2000). A possible role of the 
second head has been suggested by experiments indi¬ 
cating that it attaches rapidly during stretch of active 
muscle to prevent overstretching (References in 
Mansson 2010). Additionally, there is evidence that 
the two heads may execute limited processivity, i.e., 
walk few steps in sequence during shortening against 
intermediate loads. This possibility deserves further 
studies due to possible critical roles in the high effi¬ 
ciency and power output of muscle. 


The Role of the Track: Exemplified by Actin 

Apart from allosteric acceleration of the phosphate 
release from the myosin active site, actin filaments 
are often regarded as passive tracks for myosin walk¬ 
ing or ropes for force transmission. However, there are 
alternative views where actin filaments have been 
given key roles in actomyosin motor function (see 
above). More recently, suggestions have been made 
that the actin filaments play important active roles in 
force generation without being the main force¬ 
generating motor component. Thus, actin filaments 
seem to preferentially attain a certain conformation 
during actomyosin-based generation of force and 
movement and if this conformation is prevented so is 
motor function. 

Summary 

Similar overall operating mechanisms can be identified 
among the kinesin, dynein, and myosin motors with 
key importance of the interplay between mechanical 
forces and different steps in the ATP turnover for 
optimized function. In spite of intense studies, there 
are fundamental gaps in the understanding of this 
interplay as exemplified above for actomyosin. 

Cross-References 
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► Dynein Family Classification 

► Dynein Motility: Mechanism 
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Definition 

The automation of the manual patch-clamp technique 
involved an evolution of innovative steps that sought to 
address the time-consuming and labor-intensive 
aspects of this “gold-standard” method for recording 
the behavior and pharmacology of ion channels. It 
encapsulated processes of de-skilling, miniaturization, 
and parallelism, which over time have resulted in the 
development of instruments that have delivered 
the quality and throughput requirements of the drug 
discovery industry. 


Introduction and Historical Perspective 

Ion channels are a diverse family of proteins that 
comprise a membrane-spanning pore, which permits 
the passage of otherwise impermeant ions. The gold- 
standard approach for monitoring the activity of such 
entities in individual cells is the manual patch-clamp 
technique. This versatile voltage-clamp technology, 
which relies upon the formation of giga-ohm seals 
between the electrode/membrane interface, was devel¬ 
oped over 30 years ago (Hamill et al. 1981). This high- 
resistance “giga-seal” enables high-fidelity recording 
with microsecond resolution and, together with the 
application of different recording configurations, pro¬ 
vides a means to monitor directly in real-time either 
ensemble ion channel activity from a single cell (i.e., 
whole-cell recording) or the behavior of single ion 
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channels in attached or isolated membrane patches. 
Consequently, this technique has been the springboard 
for revolutionizing our understanding of the function¬ 
ing and regulation of ion channels and of the critical 
contributions they make to physiological and patho¬ 
physiological processes. Accordingly, this expansion 
in knowledge has identihed discrete ion channels as 
novel pharmacotherapeutic targets as well as the site of 
toxic effects of certain marketed products. 

However, up until relatively recently, exploiting 
this technology to evaluate large numbers of com¬ 
pounds for potentially therapeutic (as well as toxic) 
interactions at ion channels was hampered by the low 
throughput (i.e., low number of data points per day) of 
the manual patch-clamp recording setup. This limita¬ 
tion inspired a number of independent innovators to 
develop automated instruments that retained full 
whole-cell recording capabilities but that markedly 
improved the data-point output that could be achieved 
with a manually operated patch-clamp system. 
Accordingly, the purpose of this entry will be to pro¬ 
vide a brief overview of this evolution, together with 
the advantages and limitations, of the innovative tech¬ 
nological solutions to this key issue of throughput. 

Glass-Pipette-Based Platforms 

The earlier automated electrophysiology platforms 
were principally glass-pipette-based and included 
such systems as the CeNeS Autopatch (Owen and 
Silverthorne 2002), the Flyion (Lepple-Wienhues 
et al. 2003), the RoboPatch system developed by 
Wyeth Research (Vasilyev et al. 2006), the Sophion 
Biosciences Apatchi-1 system (Asmild et al. 2003), 
and the Cytocentrics CytoPatch chip (Stett et al. 2003). 

The CeNeS Autopatch system dispensed with the 
need for a microscope by the use of “interface 
patching.” This approach involved an inverted config¬ 
uration where a suspension of cells was loaded into 
a glass capillary and allowed to sediment to the air- 
solution interface. The cells suspended at this interface 
were lowered onto a regular patch pipette located 
beneath and “blind-patched” to obtain whole-cell 
recordings (see Wang and Li 2003). 

In contrast to this, the Flyion system involved the 
flushing of cell suspensions through a patch pipette 


toward the electrode tip where they formed “giga- 
seals” (see Wood et al. 2004). Suction was then applied 
to the exterior of the pipette tip in order to rupture the 
membrane of the occluding cell and enable whole-cell 
recording. 

Like the Autopatch, the RoboPatch performed 
“blind patching,” although on conventionally orien¬ 
tated cell suspensions. The computer-controlled 
robotic headstage collected a pipette from a rack, 
which it then lowered into a cell suspension. 
Computer-controlled pressure protocols were then 
used to attract a cell to the pipette tip and obtain 
whole-cell recording. 

Unlike these aforementioned systems, the 
Apatchi-1™ system retained the use of optics and 
employed imaging software to drive the operation of 
a micromanipulator robot. This performed the tasks of 
loading a pipette from a rack into the headstage and 
positioning the tip of the pipette onto the membrane of 
the selected cell. Seal formation and whole-cell access 
were then attained by engaging appropriate pressure 
protocols. 

Lastly, the CytoPatch represented a hybrid between 
a glass pipette and planar electrode. The patching 
substrate comprised an inverted quartz glass pipette 
housed beneath a silicon dioxide aperture, which 
were each supplied by separate microfluidics channels 
(see Stett et al. 2003). Cell positioning over the open¬ 
ing was achieved via suction through the planar aper¬ 
ture. This brought the cell into contact with the tip of 
the enclosed quartz pipette and suction protocols were 
applied through this electrode to achieve a “giga-seal” 
and then whole-cell access. 

While these innovative approaches certainly helped 
to alleviate some of the labor-intensive and time- 
consuming aspects of the manual patch-clamp tech¬ 
nique, they were generally serial, rather than parallel 
systems, and so only provided modest improvements 
in throughput. The challenges posed by the prospect of 
controlling and positioning multiple electrodes 
suggested that substantial gains in data output could 
not be realized using glass-pipette-based approaches. 
Instead, it would require the evolution of planar elec¬ 
trodes, a patch substrate that lends itself more readily 
to a high degree of parallelism and miniaturization, to 
deliver these game-changing advances in throughput 
for automated electrophysiology platforms. 
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Planar-Patch Platforms 

Many platforms currently use the planar-patch tech¬ 
nology to achieve electrophysiological recordings with 
varying degrees of throughput and quality. However, 
in each case, as illustrated in Fig. 1, the basic technical 
concept is the same. The patch pipette is replaced with 
a glass or plastic substrate containing a small aperture, 
usually ^1 pm in diameter. A suspension of cells is 
added to this substrate and then suction applied to pull 
the cells down onto this aperture. Enough suction can 
then be applied to obtain a seal and the section of 
membrane over the aperture is ruptured to enable 
whole-cell recordings (Mathes 2006; Clare 2010). 

This technology has been applied to a number of 
automated platforms, not only introducing the ability 
to study ion channels with an increased throughput, but 
also making electrophysiology available to those with 
less technical experience with the traditional manual 
rig. As there are too many platforms utilizing this 
technology to discuss in length, we will focus on 
some key examples in order of increasing throughput. 

The Port-a-Patch from Nanion is a platform close to 
manual patch-clamp recording in terms of throughput 
and quality (by making use of a conventional HEKA 
patch-clamp amplifier). It is essentially a miniaturized 
patch-clamp rig which records a single cell at a time 
without the use of a microscope. Although this 
technology only offers a slightly greater throughput 
than conventional patch-clamp recording, it does 
allow inexperienced users to obtain high-quality 
patch-clamp recordings (Earre et al. 2009). 

The Molecular Devices PatchXpress 7000A (PX) 
was one of the first automated platforms to introduce 
real increases in throughput. The 16-channel electrode 
plate allows independent voltage clamping and current 
recording from each cell simultaneously. The planar 
patch configuration for this platform is achieved using 
a ^ea/Chipie, a glass strip of 16 wells each containing 
a laser-bored pore. Each well is capable of holding up 
to 100 ul of liquid, but the fluidics system and inde¬ 
pendent wash station allow rapid fluid exchange and its 
glass construction reduces the degree of compound 
adhesion. The compound addition pipettor uses dispos¬ 
able tips and therefore removes the risk of contamina¬ 
tion or carryover of compounds between wells. The PX 
is not fully automated and, as such, is not truly a “walk 


away” instrument. Although compound plates and tips 
can be loaded for multiple experiments, ^ea/Chips and 
cells must be added manually at the beginning of each 
experiment. PX technology utilizes conventional 
patch-clamp amplifiers (Axon Multiclamp), which 
allow high-fidelity recording. This permits the plat¬ 
form to be used as a screening tool and also for more 
in-depth pharmacology and mechanistic studies. The 
real advantage of the PX is the ability to implement 
programmable scripts, allowing increased manipula¬ 
tion of voltage protocols for each individual cell. For 
example, scripting can be used to determine when 
inhibition in the presence of channel modulators has 
reached steady state or measure biophysical parame¬ 
ters for voltage-dependence of activation/inactivation. 

An additional example of these higher-throughput 
platforms is the QPatch HT from Sophion. This plat¬ 
form has led the way for increased throughput for true 
giga-seal patch-clamp recording and comprises inde¬ 
pendent, parallel voltage clamping from up to 48 cells, 
coupled with traditional plate-based automation and 
online automated cell preparation, which enables 
unattended operation. Building on the principles of 
planar-patch technology, at the heart of the QPatch is 
the “QPlate” consumable; electrode arrays with 
advanced microfluidics and either 16 or 48 recording 
wells. Although the QPatch amplifiers are less sophis¬ 
ticated than conventional patch-clamp amplifiers, they 
still have the ability to perform series resistance com¬ 
pensation (which reduces voltage errors and increases 
speed of voltage clamp). Due to the costs of the QPatch 
hardware and QPlate consumables, the technology 
remains very much an industry screening tool, primar¬ 
ily used for medium throughput lead development. It 
remains to be seen whether the release of a less expen¬ 
sive eight-channel instrument will make this technol¬ 
ogy more accessible to academic labs. 

As a means of increasing throughput further, 
Sophion have followed the trend of “population patch 
technology” (first developed for the lonWorks Quattro 
as described below) with the launch of the QPatch 
HTX. This modification enables summed-current 
recording from ten cells per well in a 48-channel for¬ 
mat. This offers advantages over single-cell mode, 
most notably for cell lines with low or less homoge¬ 
nous expression levels, e.g., transiently transfected 
cells. The HT and HTX modes have application for 
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Automated Patch Clamp: 
Advantages and 
Limitations, Fig. 1 An 

oblique view schematic 
representation of planar-patch 
technology. Cells are added to 
the well and drawn down over 
a small aperture using negative 
pressure to form a seal 
between cell membrane and 
the substrate. Progression to 
whole-cell access is achieved 
using either negative pressure 
protocols or a perforated-patch 
procedure, which involves the 
addition of a pore-forming 
antibiotic to the intracellular 
solution (ICS; see figure inset). 
During recordings, 
extracellular solutions (ECS) 
are added to and removed from 
the wells for effective bath 
solution exchange 



both voltage- and ligand-gated targets; the combina¬ 
tion of microfluidics and eight pipetting-heads enable 
fast ligand application with liquid exchange rates of 
around 10 ms, which has historically been a challenge 
for automated platforms. Crucially, the data generated 
in both modes show good correlation with conven¬ 
tional patch-clamp recordings (Korsgaard et al. 2009). 

The lonworks Quattro (IWQ) from Molecular 
Devices is capable of recording from 384 wells simul¬ 
taneously and was designed specifically with high- 
throughput support in mind. Again, a planar-patch 
platform, this employs the perforated-patch technique 
to gain electrical access to each cell. The platform 
utilizes PatchPlate technology and is capable of 
performing recordings in two modes. In HT mode, 
recordings are made from a single cell per well. How¬ 
ever, in the “Population Patch Clamp” (PPC) mode, an 
average of up to 64 cells per well is recorded. Currents 
from each cell/well are recorded using conventional 
patch-clamp amplification. However, giga-ohm seals 
are rarely attained with this system (typical seal resis¬ 
tances are in the order of 100 MQ) and, as a result, 
well-to-well variability can be significant with regard 
to current amplitude and stability. The PPC mode 
dramatically reduces this variability by averaging the 
recordings from multiple cells. Moreover, the 
complexity of recording from up to 384 wells simulta¬ 
neously impacts the flexibility of the voltage protocol. 


recording quality, and compound addition. Although 
recording from cells with a very low seal resistance, 
this platform is able to generate electrophysiology data 
of sufficient throughput to support compound library 
screening, a role previously considered to be outside 
the capabilities of an automated electrophysiology 
platform. 

However, a major limitation of the IWQ is that the 
compound delivery system prevents continuous volt¬ 
age control, as the electrode head must be moved to 
allow compound addition. Therefore, voltage control 
is lost for a period in excess of 8 s before clamp is 
reattained. Not only is this an issue for controlling the 
state of voltage-gated channels, it also poses 
a significant problem when it comes to ligand-gated 
channels, where the time course of activation and 
desensitization occur before clamp is regained. 
Despite these limitations, as alluded to above, it is 
being used to drive medicinal chemistry efforts by 
many companies. To some extent, these drawbacks of 
the IWQ have been addressed by Molecular Devices 
with the release of their lonWorks Barracuda system. 
This platform allows the same 384-well throughput as 
the IWQ but with continuous voltage control during 
compound addition and the possibility of onboard cell 
management and plate handling for up to 6 h. Although 
the Barracuda technology is a significant advance over 
the IWQ, in comparison to giga-seal platforms such as 
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the PX and QPatch, the quality of recordings and 
complexity of compound addition are compromised 
to accommodate higher throughput. 

Lateral-Patch Platforms 

Systems based on lateral-patch technologies differ in 
two main ways to those utilizing a planar-patch 
configuration. Firstly, cells are pulled sideways into 
recording positions, rather than downward. Secondly, 
the patch pipette is replaced by a capillary (approxi¬ 
mately 1-2 pm in diameter), rather than a hole, as in 
planar-patch substrates. As illustrated in Fig. 2, cells 
become “trapped” against these capillaries and nega¬ 
tive pressure is applied to obtain a seal and allow 
whole-cell recordings. 

The benefit of currently available lateral-patch 
instruments is that they utilize microfluidics incorpo¬ 
rated into the consumables to allow rapid application 
and full exchange of compounds, whereas planar-patch 
systems have struggled to deliver this capability due to 
the limitations of pipetting systems. Compound 
solutions are added into the consumable prior to 
recordings, allowing for complex compound addition 
protocols as flexible as those possible on manual patch 
setups. 

The Dynaflow system from Cellectricon was one of 
the first platforms to use this technology. Originally 
a fluidics system designed to be used in conjunction 
with a manual patch setup to allow rapid compound 
application and temperature control of solutions. This 
was not designed to increase throughout but to make 
complicated manual patch experiments more simple to 
achieve. This has since developed further and led to the 
release of the Dynaflow HT system, which has utilized 
the fluidics to create a high-throughput screening sys¬ 
tem, which can run 16 experiments per plate, with 6 
cells being recorded from individually for each exper¬ 
iment. As cells are recorded individually, full series 
resistance and capacitance compensation is possible. 
This platform has been designed with high-throughput 
screening in mind, specifically targeting fast acting 
ligand-gated channels. 

The lonFlux system from Fluxion is the next gen¬ 
eration of the lateral-patch technology. The lonFlux 
HT is able to perform 32 experiments per plate, each 
experiment producing two replicates from an ensemble 
recording of up to 20 cells, which greatly reduces 


experimental variability. It is a “loose-seal” platform, 
with average seal resistances being 50-200 MQ per 
cell, which is easily sufficient to measure currents from 
recombinant cell lines. As with many other 
high-throughput electrophysiology systems utilizing 
ensemble recordings, the correction and compensation 
abilities are limited and careful assay optimization is 
required to get the best out of the platform. 

Voltage control is continuous and the compound 
applications are fast, at approximately 50 ms for fluid 
exchange, thus enabling the measurement from 
voltage-gated channels and both fast and slow ligand¬ 
gated ion channels. As the plate microfluidic channels 
are micron-scale, very small volumes of compounds are 
required for the experiments. In addition, the lonFlux 
system is also currently the only electrophysiology 
platform that has internal temperature control. 

In comparison to other automated platforms that are 
floor standing, the lonFlux system also benefits from 
a significantly smaller instrument footprint, and is of 
similar dimensions to standard plate-readers. This 
small size, in combination with the unique possibility 
of incorporating the lonFlux onto fully automated 
robotic platforms, allows more autonomous operation 
and increased throughput. Moreover, due to the 
manufacturing process of the consumables, plate 
costs are lower than for other platforms of similar 
throughput. 

Summary 

Automated electrophysiology platforms have evolved 
from glass pipette through to planar-patch and lateral- 
patch substrate-based systems in little over a decade. 
These advances have brought with them significant 
gains in throughput but this has sometimes come at 
the cost of the quality of the biophysical data that are 
generated. Consequently, none of the current planar- 
and lateral-patch automated electrophysiology plat¬ 
forms are sufficient in isolation to support ion channel 
screening. Currently, a combination of higher- 
throughput, lower-quality “loose-seal” platforms for 
hit identification (e.g., IWQ or lonFlux) with lower- 
throughput, high-quality giga-seal platforms for 
supporting medicinal chemistry (e.g., PX or QPatch) 
provide the best available screen sequence. The ideal 
future instrument would be capable of performing 
high-throughput screening using population patch to 
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Automated Patch Clamp: 
Advantages and 
Limitations, Fig. 2 An aerial 
view schematic representation 
of lateral-patch technology. 
Cells flow across fixed 
capillaries, which are filled 
with intracellular solution 
(ICS). Cells are drawn to the 
capillary apertures using 
negative pressure, which then 
allows seal formation and then 
whole-cell access to be 
obtained (see figure inset). 
Cells are superfused with 
extracellular solutions (ECS) 
and exchanges are made from 
a microfluidic network of 
capillaries 



increase success rates, but also with the flexibility for 
pharmacology confirmation studies in single-cell 
mode, enabling complicated biophysical experiments 
that employ effective series resistance and capacitance 
compensation. The ability to apply voltage- and com¬ 
pound perfusion protocols, which closely mirror those 
utilized on manual patch setups, is also highly desir¬ 
able. The throughput of giga-seal planar-patch plat¬ 
forms has recently been increased to a 96-well 
recording format (Nanion Synchropatch), but rapid, 
high-throughput screening is likely beyond the limits 
of this technology. Following significant improve¬ 
ments in seal resistance and further miniaturization, 
lateral-patch technology has the potential to provide 
both high-throughput and high-quality data at a 
lower cost. In contrast, further development of 
existing giga-seal planar-patch clamp technology 
(by focusing on quality rather than throughput) 
could provide ideal platforms to generate complex 
biophysical data on primary cells or stem cells (cur¬ 
rently supported by manual patch-clamp recording). 
For example, Nanion have developed a giga-seal 
technology capable of performing current-clamp 
recordings (see http://www.nanion.de/images/stories/ 
pdf/Patchliner_CorAtCardiomyocytes.pdf). Over the 
next few years, as automated electrophysiology tech¬ 
nologies become more robust (with higher recording 
success rates) and the recording plates become less 
complex and costly to manufacture, automated 


electrophysiology is likely to expand outside of the 

drug discovery arena and be more widely employed by 

the academic community. 
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Synonyms 

Molecular motors; Rotary motor 

Introduction 

The flagellar motor of bacteria is a rotary device fueled by 
the membrane proton gradient or, in some aflcalophilic 


and marine species, the sodium-ion gradient. Hagellar 
motors can turn very rapidly, attaining speeds of about 
18,000 revolutions per minute when fueled by protons 
and more than 100,000 r.p.m. in the case of sodium- 
driven motors. Physiological smdies indicate that the 
flagellar motor is a tightly coupled device in the sense 
that it uses a constant number of ions to drive each 
revolution (Berg 2003; Sowa and Berry 2008). The 
mechanism of the motor is not yet understood in detail, 
but proteins with key roles in motor rotation have been 
identified and some key features of the mechanism have 
been established, partly as the result of stmctural and 
biochemical studies of the components functioning 
most closely in rotation. Most flagellar motors can rotate 
either clockwise (CW) or counterclockwise (CCW), and 
regulated direction reversals - and the resulting alter¬ 
ations in swimming trajectory - provide the basis for 
directed movement in response to chemical, thermal, 
and other environmental stimuli (Berg 2004; Eisenbach 
and Lengeler 2004). Stmctural and biochemical studies 
have also begun to cast light on the molecular mechanism 
of direction switching. 

Morphology and Protein Composition of the 
Flagellum 

Electron microscopy has been used to study the flagella 
of several species but the purification procedures 
developed in Salmonella are most refined and pres¬ 
ently appear to allow for the best preservation of 
basal-body features. Electron microscopic reconstmc- 
tions show an elaborate basal structure consisting of 
several rings of various sizes mounted on a central rod, 
then (proceeding outward) a curved, flexible hook that 
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been the subject of especially close study. The purifica¬ 
tion of flagella requires detergent-solubilization steps 
that cause dissociation of the stator (the non-rotating 
part), and so the high-resolution reconstructions obtained 
to date are of just the rotor, or rotating part. The stator 
consists of a set of membrane-embedded complexes 
containing the MotA and MotB proteins (or closely 
related proteins that have been given different names 
that include PomA/PomB, MotC/MotD, and MotP/ 
MotS). More recently, electron tomography has been 
used to visualize motors in situ, and features that are 
almost certainly assignable to the stator have been 
described (Chen et al. 2011; Liu et al. 2009). 

The full “parts list” of flagellar components is fairly 
well established, primarily from genetic studies 
(Erhardt et al. 2010; Macnab 2003). About 50 genes 
are involved in the assembly and operation of flagella, 
roughly half of which encode proteins present in the 
mature structure. About a dozen proteins participate in 
the flagellum-specific export process that delivers the 
protein subunits forming the structures outside the 
cytoplasmic membrane (i.e., the rod, hook, and fila¬ 
ment). Several other proteins function to regulate fla¬ 
gellar gene expression, coupling the expression of 
flagellar genes to the appropriate stage in assembly of 
the organelle (Chevance and Hughes 2008). The loca¬ 
tions of most flagellar proteins within the structure are 
also known (Fig. 1), largely through electron micro¬ 
scopic studies of the partially built structures in various 
mutants (Macnab 2003). 


Components Important for Motor Rotation 


Bacterial Flagellar Motor: Biochemical and Structural 
Studies, Fig. 1 Overview of flagellar structure, (a) Electron 
microscopic reconstruction of the lower part of the basal body of 
Salmonella including the C-ring, MS-ring, and lower part of the 
rod (Image from Dennis R. Thomas, with permission), (b) 
Names and approximate locations of flagellar proteins and rela¬ 
tionship of basal-body features to the layers of the cell envelope 
(OM outer membrane, PG peptidoglycan, IM inner membrane) 

functions as universal joint, and finally the relatively 
rigid helical filament that functions as propeller 
(Erhardt et al. 2010; Macnab 2003) (Fig. 1). The inner¬ 
most feature, termed the C-ring for its location in the 
cytosol, contains the proteins that function directly in 
rotation and direction switching. This innermost struc¬ 
ture is accordingly called the switch complex, and has 


Flagellar proteins that function in rotation are few, 
including just the integral membrane proteins MotA 
and MotB that form the stator and the cytoplasmic 
proteins FliG, FliM, and FliN that comprise the switch 
complex (Berg 2003). Structural studies are presently 
most advanced for the switch-complex components. 
Crystal structures have been solved for major parts of 
each switch-complex protein and a model for their 
organization in the rotor has been developed on the 
basis of cross-linking, protein-binding, and mutational 
experiments (Paul et al. 2011) (Fig. 2). In the upper 
(membrane-proximal) part of the switch complex, the 
FliG C-terminal domain (FliGc) exhibits a prominent 
ridge-like feature bearing several conserved charged 
residues that have been shown to interact with charged 
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Bacterial Flagellar Motor: Biochemical and Structural 
Studies, Fig. 2 Structures of the switch-complex proteins and 
a model for their arrangement in the flagellum, (a) Crystal 
structures (Brown et al. 2002, 2005; Lee et al. 2010; Park et al. 
2006). Crystallographic studies have used proteins of the hyper- 
thermophilic species Thermotoga maritima and Aquifex 
aeolicus, which show strong sequence similarity to the proteins 
of the otherwise better-studied E. coli and Salmonella proteins. 
The relative orientations of the three FliG domains cannot be 
determined directly from the crystal structures and are not 
known precisely. Darkened patches on the bottom of FIIGm 
and FliGc indicate regions implicated in binding to FliM. The 
FliM domain shown is from the middle of the protein and 
represents about 60% of the whole protein; positions that 
would make connections to the missing N- and C-terminal 
parts are indicated. The darkened region at the top indicates 
the region involved in binding to FliG. FliM also has a sizable 
C-terminal domain whose structure is not known but that has 
been modeled on the basis of its sequence resemblance to FliN 
(Sarkar et al. 2010b). The FliN protein shown represents roughly 
the C-terminal twothirds of the protein; the N-terminal parts of 
FliN are less strongly conserved and less important for function. 


The donut-shaped assembly shown is a tetramer of FliN 
subunits. The dark region is a highly conserved hydrophobic 
patch that has been implicated in binding to phospho-CheY. 
(b) A model for the arrangement of the switch-complex proteins, 
and the initial events in phospho-CheY-induced switching. Top\ 
Overall arrangement of switch-complex proteins, viewed from 
the side of the basal body. Subscript letters indicate domains; for 
example, FIIGnm refers to the N-terminal and middle domains of 
FliG. Middle: Plan view of switch-complex organization. FliM 
subunits occur in two kinds of structural setting, with roughly 26 
subunits oriented vertically and forming the outer wall of the 
C-ring and a subset of about eight subunits tipped inward to 
interact with the middle domain of FliG. As a result of this 
structural variability, FliG subunits also occur in different set¬ 
tings, some supported by two FliM subunits and others supported 
by just a single FliM subunit. For details and a summary of the 
evidence in support of this model, see Paul et al. (2011). Bottom: 
Initial events in the action of the chemotaxis signaling molecule 
phospho-CheY at the motor. Phospho-CheY is first captured 
by a segment near the N-terminus of FliM, and subsequently 
interacts with a conserved hydrophobic patch on FliN (Sarkar 
et al. 2010a) 
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residues of the stator protein MotA (Yakushi et al. 
2006). Relative movement of the rotor and stator 
must presumably involve forces produced in the vicin¬ 
ity of this FliG-MotA interface. The switch from CCW 
to CW rotation is induced by the signaling protein 
phospho-CheY, which binds to the FliM and FUN pro¬ 
teins in the lower part of the switch and induces small 
relative movement of subunits in this region (Sarkar 
et al. 2010b). Direction switching must also involve 
movements in the upper part of the switch, which have 
not yet been elucidated. A clearer picture of switching 
movements at the rotor-stator interface will further our 
understanding of direction regulation and should also 
cast light on the mechanism of torque generation. 

The largest part of the MotB protein is located in 
the periplasm, and this periplasmic region includes 
a domain that binds to the peptidoglycan, thereby 
anchoring the stator to the cell wall. Crystal structures 
have been reported for this peptidoglycan-binding 
domain of MotB (Kojima et al. 2009; Roujenikova 
2008). Structures have not yet been solved for other 
parts of the stator. The fully functional motor is 
believed to contain at least 11 independently function¬ 
ing stator complexes (Reid et al. 2006), each assem¬ 
bled from four MotA and two MotB subunits. MotA 
has four trans-membrane (TM) segments and MotB 
has one, giving a total of 18 TM segments in the 
complex. The arrangement of these segments has 
been examined by a combination of targeted cross- 
linking and systematic mutational analysis (Kim et al. 
2008) (Fig. 3). An invariant Asp residue of MotB 
(Asp32 in the protein of Escherichia coli) was found 
to be the only conserved, titratable residue critical for 
motor rotation and has been proposed to function 
directly in proton (or sodium-ion) conduction through 
the motor. In the membrane-segment arrangement 
deduced from cross-linking, the two Asp32 residues 
in the stator complex are widely separated, which 
would suggest that they function within two distinct 
proton-conducting channels. Measurements of motor 
torque across a range of rotation speeds have provided 
strong evidence that the motor utilizes an active 
power-stroke mechanism, as opposed to a “Brownian 
ratchet” mechanism based on rectification of thermal 
motions (Berry and Berg 1999). Conformational 
changes in the stator driven by ion binding/dissociation 
at the invariant Asp residue are hypothesized to pro¬ 
vide the power-stroke for rotation. In support of this, 
mutational replacements of Asp32, including charge- 


a 



Bacterial Flagellar Motor: Biochemical and Structural 
Studies, Fig. 3 Key features of the stator, (a) Membrane 
topologies of MotA and MotB, and approximate locations of 
some functionally important residues. Arg90 and Glu98 in 
the cytoplasmic domain of MotA interact with charged 
groups on the rotor. The invariant Asp residue of MotB 
is believed to function in the conduction of protons through 
the stator and in proton-induced conformational changes, 
(b) Overall arrangement of the membrane parts of the 
MotA 4 MotB 2 complex, (c) Arrangement of the TM segments 
in plan view, showing patterns of residue conservation. Black 
circles indicate invariant positions and gray indicates posi¬ 
tions where just two closely related residues occur (shown 
for only the right half of the complex). The invariant Asp 
residue is indicated. The complex is hypothesized to contain 
two distinct proton channels, indicated by shading 
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Bacterial Flagellar Motor: Biochemical and Structural 
Studies, Fig. 4 Hypothesis for the conformational change in 
the stator and a mechanism for linking this conformational change 
to rotation, (a) A cartoon model for the overall structure of the 
stator complexes and the proton-induced conformational change. 
Important features of the model are the extended conformation of 
the MotA cytoplasmic domain, and the large, in-and-out swinging 
motion of this cytoplasmic domain upon proton association/ 
dissociation. Diagrams to the sides are simplified renderings of 
the “two-in, two-out” behavior of the tips of the domains (used in 


panel b). (b) A mechanism for rotation based on this conforma¬ 
tional change. The tip regions of the MotA cytoplasmic domain are 
hypothesized to interact with both the C-terminal and middle 
domains of FliG. Both electrostatic and steric interactions are 
assumed to occur. At each step, the interactions important for 
driving rotation into the next state are indicated, with green 
representing attractive electrostatic interactions and red indicating 
repulsive steric interactions. The direction of rotation is dictated by 
the angle of the FliGc domain, and so direction might be reversed 
by a suitable rotation of this domain 


neutralizing replacements that might mimic the effects 
of protonation, were shown to affect the conformation 
of MotA in a cytoplasmic domain that interacts with 
the rotor (Kojima and Blair 2001). 

As the part of the stator that interacts directly with 
the rotor, the cytoplasmic domain of MotA (Fig. 3) is 
of particular interest for understanding the mechanism. 
While the structure of this domain is not known, sec¬ 
ondary structure predictions, as well as the size of the 


needed power stroke, suggest that it is likely to have an 
extended structure consisting mainly of alpha helices. 
In a recent hypothesis for the motor mechanism, it 
was proposed that this domain undergoes cyclical, 
ion-driven swinging motions that move the tip of the 
domain back and forth, with the two pairs of MotA 
subunits in the complex moving by turns (Fig. 4). 
A motion of this kind appears capable of driving rota¬ 
tion, given suitable placement of the MotA domains 
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above FliG (Fig. 4). The model relies on the demon¬ 
strated electrostatic interactions and additionally invokes 
steric interactions between MotA and the ridge on FliGc- 
The direction of rotation would be dictated by the orien¬ 
tation of the charge-bearing ridge on FliG and could 
presumably be reversed by an appropriate reorientation 
of the ridge. 

Summary and Future Directions 

Recent studies have led to a structural model of the rotor 
that is sufficiently detailed to provide useful constraints 
on the mechanisms of rotation and switching. Stmctural 
information on the stator is equally vital for understand¬ 
ing the mechanism but is presently very limited. Struc¬ 
tural studies of the MotA and MotB proteins are 
accordingly a high priority. Of particular interest are the 
MotA cytoplasmic domain and the nature of any proton- 
induced conformational changes. Crystallographic study 
of the stator may prove challenging not only because 
MotA and MotB are membrane proteins but because 
they are present in a multi-subunit complex that appears 
to be only marginally stable in detergent (Kojima and 
Blair 2004). As with the rotor, progress might depend on 
piecewise structure determination coupled with biochem¬ 
ical and mutational approaches to assemble the parts. 
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Synonyms 

Back focal plane interferometry; Molecular motors; 
Optical trap; Optical tweezers; Rotary motor; Torque 


Definition 


scattered by the particle and unscattered light in the back 
focal plane of the condenser. The fringe positions are 
imaged onto a quadrant photodiode, allowing nanome- 
ter/sub-millisecond precision. The output of the quadrant 
photodiode is used to determine .xy-position coordinates, 
and subsequently angular velocity (cu = dOldt) and 
torque (r = m x drag coefficient). 


B 


Optical Trapping for Hook Stiffness 
Measurements 

The flagellar filaments of cells are tethered to a coverslip 
(using antibodies or the HiC mutant strain), causing the 
cell bodies to rotate about their motors. The motors are 
then locked by addition of a respiratory poison/cross- 
linking agent. The cell body is then trapped by the laser 
and used as a handle to exert torque on the motor. The 
cell is rotated ^ 120° by moving the laser. When the laser 
is shuttered, the cell rebounds with an exponential time 
constant enabling the hook stiffness to be calculated 
{hook stiffness = drag coefficient!time constant). 


Optical tweezers are often used to investigate the bio¬ 
physics of the bacterial flagellar motor. The term, “opti¬ 
cal tweezers,” refers to a microscope setup where a beam 
of laser light is brought to a diffraction-limited focus 
through a high numerical aperture microscope objective. 
The microscope is typically used in two modes, either 
(1) using low laser powers (<50 pW) to detect the 
rotation of the motor using back focal plane interferom¬ 
etry, or (2) using high laser powers (>100 mW) to 
manipulate the motor. Note that since the ^50 nm fla¬ 
gellar motor is too small to be observed directly, either 
a polystyrene bead or the cell body itself is used as 
a marker for rotation. 

Back Focal Plane Interferometry for Torque/ 
Speed Measurements 

Cells are attached to a coverslip using polylysine. Then 
polystyrene beads (0.3-1 pm diameter) are bound to 
their flagellar filaments using antibodies or by using 
a Flic mutant strain that produces a hydrophobic fila¬ 
ment. Back focal plane interferometry is achieved using 
a laser focused by an objective lens in the image plane. 
The position of the bead is determined by analyzing the 
fringes pattern obtained by interference between the light 
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Definition 

The bacterial flagellar motor is an ion-driven rotary 
motor embedded in the cell membrane which drives 
a long helical filament (5-10 [tm long with a diameter 
of ^15 nm) called a flagellum, enabling the cell to 
swim. The flagellum/motor complex self-assembles 
following the coordinated, sequential expression of 
over 50 genes. The final structure is composed of at 
least 13 different proteins; each protein is present in 
multiple copies per motor. 

Swimming/Switching 

The motor is connected to the filament by a short 
flexible hook about 100 nm long. When all the motors 
are rotating in the same direction, hooks allow multiple 
filaments to come together, forming a bundle which 
propels the cell forward. Flagellar motors are also able 
to rotate in the reverse direction, causing the bundle to 
dissociate which sends the cell into a tumble. When 
rotation resumes in the original rotational direction, the 
cell swims in a new direction. The frequency at which 
these switching events occur is controlled by the 
chemotaxis sensory pathway which can sense changes 
in the concentration of nutrients/toxins. By terminating 
runs in the “wrong” direction, cells execute a biased 
random walk; subsequently, cells accumulate in areas 
favorable for their survival. 

Bacterial Flagellar Motor 

The flagellar motor consists of a rotor (45 nm in diameter) 
surrounded by approximately 11 independent stator units 
which are anchored to the peptidoglycan layer of the cell 
wall. The stators transmit ions (either protons or sodium 
ions depending on the bacterial species) from the peri¬ 
plasm to the cytoplasm, causing electrostatic interactions 
at the rotor/stator interface to drive rotation of the rotor in 
^26 steps per revolution (this coincides with the number 
of HiG proteins estimated to be present in the rotor). The 
electrochemical gradient of ions is maintained by elec¬ 
tron transport chains (respiration or photosynthesis). It is 
estimated that approximately 36 ions are transmitted 
through each stator unit per revolution which yields 
a maximum torque of ^1,250 pNnm/rad for proton- 
driven motors. 


Variations Between Bacterial Species 

Different bacterial species have evolved different ways 
of swimming to best fit their environmental conditions. 
Below are outlined a few examples. 

Vibrio alginolyticus: One sodium-driven polar 
flagellum; motors rotate at ^400 Hz; cells swim 
at ^60 pm/s; motors can switch direction. Multiple 
lateral proton-driven motors expressed to allow the 
cell to swarm over hard surfaces. 

E. coin Four to eight randomly located proton- 
driven flagella; motors rotate at ^100 Hz, cells swim 
at ^ 25 pm/s; motors can switch direction 

Rhodobacter sphaeroides: One proton-driven 
flagellum; similar rotation/swimming speeds to 
E. coli, but the motor stops to achieve a cell tumble 
instead of reversing its direction 

Shewanella oneidensis: One single motor able to 
recruit sodium-driven stators or proton-driven stators 
depending on the availability of ions 

Bacillus clausii: Its stator units are able to 
utilize protons or sodium ions depending upon 
availability 
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Definitions 

Bacterial globins are proteins possessing the classical 
globin fold and the highly conserved active site residues 
required for function. Functions include binding of, or 
reaction with, oxygen, nitric oxide, carbon monoxide, or 
other ligands. Bacterial globins are classified into three 
major groups - flavohemoglobins, myoglobin-like (sin¬ 
gle-domain) globins, and truncated globins. NO detoxifi¬ 
cation is a well-characterized function of some. 

Basic Characteristics 

Globins are an ancient superfamily of diverse proteins 
(Vinogradov et al. 2006). The first report of a microbial 
globin was in yeast over half a century ago but, in the past 
20 years, increasing molecular studies have revealed 
details of their stmctures, functions, and the regulation 
of their biosynthesis. There has been a dramatic increase 
in the understanding of bacterial globins and their links 
with bacterial respiration and physiology, pathogenesis, 
and biotechnological opportunities. This increase in 
interest has been fueled by the recognition that a major 
role of certain microbial globins is protection from nitric 
oxide (NO) (Wu et al. 2003). 

The subunits of all hemoglobins consist of 
a polypeptide chain with 6-8 a-helical segments that 
fold around a heme group (Fig. la). The helices con¬ 
stituting the globin fold are labeled A to FI in sequence 
order and the various topological positions within each 
helix are also numbered sequentially. The fifth coordi¬ 
nation site of the central iron atom of the heme is 
occupied by the imidazole ring from a histidine residue 
on the protein. This proximal ligand (F8 in Fig. lb) is 
the residue at position 8 in helix F and so may be 
written HisF8 or His-xy (where xy is the number of 
the residue in the entire linear globin primary structure, 
not in a particular helix). The distal ligand-binding 
pocket occupies the site diametrically opposite the 
proximal histidine. This sixth coordination site is 
available to bind oxygen or another ligand. It is typi¬ 
cally constructed from the B, E and part of the 
G helices. The identity of the BIO residue on the 
B helix and the E7 and Ell residues are important for 
the regulation of ligand binding and discrimination. In 
mammalian globins, the E7 position is almost invari¬ 
ably occupied by a histidine. The HisE7 is believed to 
stabilize the heme-bound dioxygen by H-bonding to it; 


on the other hand, the BIO and Ell positions are 
typically occupied by hydrophobic residues to ensure 
that the chemical integrity of the heme-bound 
dioxygen is preserved. In contrast, the distal residues 
in the hemeproteins performing oxygen chemistry, 
such as peroxidases and oxidases, are much more 
polar. Eor example, the BIO and E7 residues in the 
single domain globin from Campylobacter jejuni 
are occupied by tyrosine and glutamine residues, 
respectively (Fig. lb). The polar nature of the distal 
heme environment in these proteins plays an important 
role in activating the 0-0 bond of the heme-bound 
peroxide molecules. The distal residues of microbial 
hemoglobins are also much more polar than those of 
mammalian globins, suggesting that the structures 
of these hemoglobins are tailored to perform functions 
other than oxygen transport (Lu et al. 2008). 

Globin function is typically defined by reactivity 
toward small ligands, such as O 2 , CO, and NO, which 
bind to the heme distal site. In general, hemoglobins 
displaying a moderate oxygen affinity are involved in 
O 2 transport or sequestration, whereas those exhibiting 
high oxygen affinities appear to be involved in oxygen 
chemistry. To perform these functions, oxygen affinity 
is tightly regulated, and the structural basis underlying 
O 2 affinity has been extensively studied (Capece et al. 
2006). O 2 affinity is modulated by the characteristics 
of both the heme proximal and distal environments. 
Oxygen binding is determined by ligand release and 
association kinetics. The ligand release process can be 
further divided into two main steps. First, the iron- 
ligand bond must be broken and, second, the ligand 
must diffuse into the bulk solvent through the protein 
matrix. In most proteins, the observed dissociation rate 
is determined mainly by the strength of the protein- 
ligand interactions, or ligand migration. Depending on 
the H-bonding network between heme-bound O 2 and 
amino acid residues located in the distal site of the 
heme, oxygen dissociation rates (^off) can vary by 
more than five orders of magnitude. However, ligand 
association rates (^on) are usually determined by the 
presence of tunnels that facilitate ligand migration 
from the solvent towards the heme active site. 

Flavohemoglobins 

At least three classes of bacterial globin are recognized, 
namely, the flavohemoglobins, the single domain 




B 


158 


Bacterial Globins 


Bacterial Globins, 

Fig. 1 Structural features of 
the single domain globin from 
C. jejuni, Cgb. (a) Backbone 
topology of Cgb. A Cot chain 
tracing of Cgb with heme 
cofactor (black). Helices/ 
regions are labeled according 
to conventional globin 
nomenclature, (b) Structure of 
the distal pocket of Cgb 
(Shepherd et al. 2010) 



“myoglobin-like” globins and the trancated globins. All 
three classes are found among bacteria. Members of the 
best understood class, the flavohemoglobins, are distin¬ 
guished by the presence of an N-terminal globin domain 
(a three-on-three a—helical fold similar to myoglobin) 
with an additional C-terminal domain with binding sites 
for flavin adenine dinucleotide (FAD) and nicotinamide 
adenine dinucleotide (phosphate) [NAD(P)H]. Note 
that the term globin-coupled sensor is sometimes used 
to describe chimeric proteins in which a C-terminal 
domain shows clear homology with known signaling 
domains (like the bacterial chemotaxis proteins) but the 
N-terminus possesses only a weak (10%) similarity to 
the globin fold (Gilles-Gonzalez and Gonzalez 2005). 

Widely distributed in bacteria and lower eukaryotes, 
flavohemoglobins confer protection from NO and 
nitrosative stresses by direct consumption of NO (Poole 
and Hughes 2000). They are critical for pathogenicity in 
some species; for example, Escherichia coli and Salmo¬ 
nella mutants lacking Hmp, the best studied such protein, 
are compromised for survival in mouse and human mac¬ 
rophages. In the plant pathogen Erwinia chrysanthemi, 
HmpX not only protects against nitrosative stress but 
also attenuates the host hypersensitive reaction during 
infection by intercepting NO produced by the plant for 
the execution of the hypersensitive cell death program. 
Flavohemoglobins appear to have no direct role in the 
metabolism of oxygen or other gaseous ligands and have 
not been reported in higher animals. Flavohemoglobins 
oxidize NAD(P)H and transfer an electron to the 
N-terminal heme domain via a non-covalently bound 
FAD in the reductase (or FNR, ferredoxin-NADP reduc- 
tase-like) domain. Reduced heme catalyzes the reaction 
between NO and O 2 generating nitrate; that is, Hmp acts 
as a NO-detoxifying enzyme. There remains controversy 


over the reaction mechanism at the heme: either NO 
(denitrosylase mechanism) or O 2 (dioxygenase mecha¬ 
nism) has been claimed to bind first to the heme. 

In accordance with the role of Hmp in limiting 
NO-related toxicity, expression of the protein occurs 
only when NO is present in the cell environment. 
Indeed, hmp gene expression is tightly regulated at 
the transcriptional level by NO-responsive transcrip¬ 
tion factors, notably NsrR and Fnr (Spiro 2007). 
Constitutive Hmp expression and function in the 
absence of NO generates oxidative stress by virtue of 
oxygen reduction by the heme to superoxide anion. 

Single-Domain or "Myoglobin-Like" 
Hemoglobins 

The second class of bacterial globin comprises the single¬ 
domain globins. These also exhibit a three-on-three tx- 
helical fold similar to myoglobin but no separate 
C-terminal domain. This class is typified by the globin 
of Vitreoscilla (named Vgb, VtHb or Vhb), an obligate 
aerobic bacterium that grows in low oxygen environ¬ 
ments. This globin was the first bacterial hemoglobin to 
be crystallized, and the three-dimensional structure (of 
the ferric homodimer) conforms to the classical globin 
fold. However, the region following the C helix is disor¬ 
dered, residues E7-E10 do not adopt the usual a-helical 
conformation, and GlnE7 is located out of the heme 
pocket and does not appear to stabilize the heme iron- 
bound dioxygen through hydrogen bonding. The three- 
dimensional structures of the thiocyanate and imidazole 
derivatives of recombinant ferric Vgb have also been 
determined (Giangiacomo et al. 2001). The quaternary 
structure of Vgb and of its azide, thiocyanate, and 
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Fig. 2 Classification of the 
microbial globin protein 
family. Examples shown are 
E. coli flavohemoglobin 
(Hmp; Ilari et al. 2002), 

C. jejuni single-domain globin 
(Cgb; Shepherd et al. 2010), 
Mycobacterium tuberculosis 
type I truncated globin (HbN; 
Milani et al. 2004), 
Mycobacterium tuberculosis 
type II truncated globin (HbO; 
Ouellet et al. 2007), and 
C. jejuni type III truncated 
globin (Ctb; Nardini et al. 
2006) 


Microbial globins 


FHb sdHb trHb 
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imidazole derivatives is unique in that the protein is 
dimeric in the crystal lattice but association of the two 
subunits is weak. This protein has been implicated in 
redox chemistry and NO detoxification in vivo but the 
mechanism by which the protein is re-reduced after 
a catalytic cycle is obscure. The disordered CD region 
(i.e., the vicinity of the C and D helices) in the crystal 
structure of Vgb is a potential site of interaction with 
the putative FAD/NADH reductase partner. Considerable 
interest has been directed at Vgb because of its possible 
role in facilitating oxygen transport and metabolism. 
Indeed, Vgb is upregulated under microaerobic growth 
conditions and shows promise for biotechnological appli¬ 
cation by enhancing product formation under oxygen 
limitation in heterologous hosts. Interestingly, a chimeric 
protein comprising the Vitreoscilla hemoglobin and a 
flavoreductase domain from a flavohemoglobin (Fhp) 
relieves nitrosative stress in E. coli. 

A more comprehensive molecular genetic view of 
bacterial non-flavo hemoglobins is offered by the 
microaerophilic, foodbome, pathogenic bacterium Cam¬ 
pylobacter jejuni, which is exposed to NO and other 
nitrosating species during host infection. This single 
domain globin, Cgb, is dramatically upregulated by the 
transcription factor NssR in response to nitrosative 
stress. Cgb has been shown to detoxify NO and possess 


a peroxidase-like heme-binding cleft. In marked contrast 
to Vitreoscilla Vgb, there is no evidence to date that Cgb 
functions in oxygen delivery. Cgb can provide an elec¬ 
tronic “push” from the proximal ligand and an electronic 
“pull” from the distal binding pocket, creating a favor¬ 
able environment for the isomerization of a putative 
peroxynitrite intermediate in the NO dioxygenase reac¬ 
tion (Shepherd et al. 2010). 

Truncated Hemoglobins 

The third class comprises the truncated globins, which 
are the most recently discovered and appear widely 
distributed in bacteria, microbial eukaryotes, and 
plants. Instead of the classical 3-over-3 a-helical sand¬ 
wich motif adopted by single-domain globins and the 
heme domains of flavohemoglobins, trHbs adopt a 
2-over-2 a-helical structure and are typically 20 resi¬ 
dues shorter than 3-over-3 globins. Sequence analysis 
of more than 200 trHbs indicates that they can 
be divided into three groups (Fig. 2): I, II, and III 
(sometimes referred as N, O, and P, respectively) 
(Wittenberg et al. 2002). Most studies on trHbs 
have focused on trHb groups I and II, although the 
type-III trHb from the foodborne bacterial pathogen 
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Campylobacter jejuni, Ctb, has recently been structur¬ 
ally and kinetically characterized (Lu et al. 2007). 

Oxygen-binding studies of trHb I from Mycobacte¬ 
rium tuberculosis (Mf-trHbN), an NO dioxygenase, 
show an unusually high O 2 affinity (K^ — 2.3 nM), 
allowing oxygen binding even at low O 2 concentra¬ 
tions. The high oxygen affinity of Mf-trHbN is achieved 
by the presence of two polar residues in the distal site, 
TyrBlO and GlnEll, which form H-bonds with the 
heme-bound O 2 . Another trHb from Mycobacterium 
tuberculosis, Mf-trHbO (a trHb II), also displays a high 
O 2 affinity (ATj ^ 11 nM). Mf-trHbO contains three polar 
residues in the distal cavity, TyrBlO, TyrCDl, and 
TrpG8, which can donate H-bonds to the bound O 2 . 

Like M. tuberculosis, C. jejuni contains two globins, 
Cgb and Ctb, and the latter belongs to trHb group III. 
When the structural gene, ctb, is mutated, the bacterium 
does not show extra sensitivity to nitrosative stress, but, 
when grown under conditions of high aeration, the 
bacterium exhibits lowered respiration rates, suggesting 
a role for Ctb in modulating intracellular O 2 flux. The 
role of this protein in bacterial physiology remains 
obscure. Ctb contains three polar residues in the distal 
cavity; TyrBlO, HisE7, and TrpG8; it is thought that 
HisE7 regulates ligand entry, but the mechanism by 
which the three distal polar residues control oxygen 
reactivity in Ctb remains unclear (Lu et al. 2007, 2008). 
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Synonyms 

Carbohydrates; Glycoconjugates; LPS; Polysaccha¬ 
ride; Polysaccharide glycan 

Definition 

Lipopolysaccharide (LPS) is the major component of the 
external leaflet of the outer membrane of almost all 


Gram-negative bacteria. It is an amphiphilic macromole¬ 
cule consisting of three structural and biosynthetically 
different parts: an 0-specific polysaccharide (OPS) cova¬ 
lently linked to an oligosaccharide (core), in turn, linked 
to a glycolipid moiety (Lipid A). OPS is lacking in some 
wild-type strains. LPS is anchored by its lipid A moiety to 
the outer membrane and the carbohydrate part protrudes 
toward the exterior of the cell. OPS is usually built of 
a regular polysaccharide with repeating units (up to 
40-50) consisting of two to eight monosaccharides. The 
lipid A stmcture consists of a P-(l ^ 6)-linked 2-amino- 

2- deoxy-glucopyranose (GlcN) disaccharide that bears 

3- (7?)-hydroxy fatty acid residues, as ester linked at the 
3 and 3'positions, and amide-linked at 2 and 2' positions, 
which are indicated as primary fatty acid residues (C14:0 
3-OH in Escherichia coli). The 3-OH, in turn, can be 
further esterified by secondary fatty acids typically not 
carrying any other functional group (C14:0 and C12:0 in 
E. coli). The hydroxyl at position 4' of the non-reducing 
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GlcN^* residue (distal unit) and that of the a-anomeric 
position of the reducing GlcN^ residue (proximal unit) are 
generally both linked to charged residues, mostly phos¬ 
phate groups (Fig. 1). 

Both polar heads and acyl residues assortment in lipid 
A may vary in number, type, and distribution and deter¬ 
mine the three-dimensional structure, i.e., the conical 
or cylindrical molecular shape, of lipid A, which is 
correlated to its biological activity, i.e., the binding and 
recognition by proteins of the innate immune system of 
both animals and plants. 

The determination of the primary structure of an LPS 
is carried out separately on lipid A and OPS, chemically 
split in advance, by means of chemical analyses and 
biophysical methodologies (ESI and MALDI Mass 
Spectrometry and ID and 2D NMR). 

The determination of the primary structure of an 
LPS implies the rigorous ascertainment of the follow¬ 
ing points (in parenthesis is also indicated the approach 
of choice): 

(a) Quali/quantitative composition of monosaccha¬ 
rides and fatty acids (GLC) 

(b) Determination of size of the rings of each sugar 
residue (GLC) 

(c) Determination of attachment points of each sugar 
residue (GLC, NMR) 

(d) Determination of the absolute configuration of 
each sugar residue (GLC) 

(e) Determination of the anomeric configuration of 
each sugar residue (NMR) 

(f) Determination of the sequence of monosaccharides 
in the chain (MS, NMR) 

(g) Determination of primary and secondary fatty acid 
location (MS) 

(h) Determinaton of nature and location of 
non-carbohydrate/fatty acids components (MS, 
NMR, GLC) 
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Synonyms 

Carbohydrates; Exoglycan; Exopolysaccharide; Extra¬ 
cellular polysaccharide; Glycan; Polysaccharide 

Definition 

Polysaccharides are linear or branched polymers built 
up exclusively or mainly of monosaccharides. In 
glycoconjugates, a polysaccharide chain(s) is linked 
to a protein/peptide (glycoprotein, proteoglycan, and 
peptidoglycan) or a lipid (lipoglycans, glycolipids). 
The bacterial glycome consists of a wide repertoire of 
cell surface polysaccharides and glycoconjugates, 
including lipopoly saccharides (LPS, endotoxins) of 
Gram-negative bacteria, cell-wall anionic polysaccha¬ 
rides of Gram-positive bacteria, mycobacterial 
lipoglycans, capsular and extracellular polysaccha¬ 
rides (EPS), and S-layer glycoproteins (Seltmann and 
Holst 2002). A rigid protective peptidoglycan layer 
surrounds the cytoplasmic membrane in all bacterial 
groups. Specific surface polysaccharides play an 
important role in bacterial life and, particularly, are 
implicated in recognition and virulence of pathogens. 
They determine the immunospecificity of bacteria, and 
various immunogenic forms of polysaccharides and 
their fragments are used as vaccine components. Bac¬ 
teria often produce also nonspecific polysaccharides, 
such as glycogen, dextran, other glucans and fructans. 
The biosynthesis of bacterial polysaccharides is 
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Bacterial Polysaccharide Structure and Biosynthesis, Fig. 1 Schematic representation of the cell wall of Gram-negative and 
Gram-positive bacteria 


a complex process that involves membrane and soluble 
enzymes, as well as a mandatory transmembrane trans¬ 
location step in cases studied to date. 

Basic Characteristics 

Peptidoglycan or murein has a polysaccharide backbone 
of alternating residues of A'-acetylglucosamine and N- 
acetylmuramic acid bearing a peptide chain of three to 
five amino acids. The peptide chain can be cross-linked 
to that of another strand forming a three-dimensional 
mesh-like layer (Heidrich and Vollmer 2002) (Fig. 1). 
Some archaea have a similar layer of pseudopepti- 
doglycan or pseudomurein, where the sugar residues 
are A^-acetylglucosamine and A^-acetyltalosaminuronic 
acid. 

Gram-negative bacteria are distinguished by the pres¬ 
ence of an outer membrane with an outer leaflet com¬ 
posed mainly of LPS. Three stracturally different 
domains can be distinguished in the LPS molecules: 
a polysaccharide portion (0-chain, 0-antigen) attached 
to a large oligosaccharide called core, which in turn is 
linked to the lipid A typically made of a disaccharide of 
acylated glucosamine residues (Fig. 2). Lipid A anchors 
the LPS into the membrane and is responsible for 
its biological (endotoxic) activities. The O-chain is 
a homoglycan or a heteroglycan built up of oligosaccha¬ 
ride repeats (0-units) that range from di- to octa-saccha- 
rides. Heteropolysaccharides are more widespread and 
are extraordinarily diverse in composition and stracture. 


Typical components of the 0-antigens are hexoses, pen¬ 
toses, hexosamines, their deoxy derivatives, and sugar 
acids (3,6-dideoxyhexoses, diaminohexoses, aldulosonic 
acids, amino- and diamino-hexuronic and -aldulosonic 
acids), as well as some uncommon branched monosac¬ 
charides (Knirel 2009). The 0-antigens consisting of 
two repetitive homo- or hetero-polysaccharide domains 
are known. In LPS with a homopolysaccharide O-chain 
or a heteropolysaccharide having a disaccharide 0-unit, 
a non-repetitive domain may occur at the reducing 
end or/and between the O-chain and the core. Some 
bacteria produce a tmncated LPS that is devoid of any 
0-antigen or have a single O-unit linked to the core-lipid 
A moiety. 

Gram-positive bacteria lack the outer membrane 
and have a much thicker peptidoglycan layer. Their 
cell-wall polysaccharides are linked to peptidoglycan 
(teichoic acids) or to a lipid anchor in the cytoplasmic 
membrane (lipoteichoic acids). These anionic glycans 
are involved in ion exchange, permeability of the cell 
wall to nutrients and antibiotics and control of the 
activity of autolytic enzymes and adhesins. Teichoic 
acids include a polyol (usually glycerol or ribitol) 
phosphate and often also glycosyl moieties either in 
the main chain or/and as lateral groups (Lazarevic et al. 
2002) (Fig. 3). Teichuronic acids or other regular 
non-phosphorylated anionic polysaccharides may 
replace teichoic acids in some Gram-positive bacteria 
or under certain environmental conditions. 

Although mycobacteria stain slightly Gram-positive, 
they possess Gram-negative rather than Gram-positive 
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Bacterial Polysaccharide Structure and Biosynthesis, Fig. 3 Structure of the linkage region between glycerol teichoic acid and 
peptidoglycan in Bacillus suhtilis. R stands for H, a- or (l-D-glucopyranosyl 


Bacterial Polysaccharide 
Structure and Biosynthesis, 
Fig. 4 Structural model of the 
mycolyl arabinogalactan 
complex of Mycobacterium 
tuberculosis. MurAc 
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cell envelope features, i.e., a thin peptidoglycan layer 
and a lipid bilayer outer membrane. Important 
components of their cell wall are two major lipoglycans, 
the mycolyl arabinogalactan complex linked to peptido¬ 
glycan and lipoarabinomannan attached to the mem¬ 
brane via a mannosylphosphatidyl-wyo-inositol anchor 
(Seltmann and Holst 2002). They are characterized by 
various non-repetitive motifs and show little structural 
variability. Arabinogalactan has a main linear galactan 
chain of about 30 o-galactose residues, to which arabinan 


chains are attached. The non-reducing end of the latter 
has a branched domain of six D-arabinose residues, 
which bears long-chain a-alkyl-p-hydroxyacyl (mycolyl) 
groups in clusters of four (Fig. 4). The main chain of 
lipoarabinomannan consists of D-mannose residues and 
has branched regions. A branched arabinan domain 
including 50-70 D-arabinose residues is attached close 
to the non-reducing end of the mannan core. The 
non-reducing end of the arabinan chains may be capped 
with mannose, mannobiose, or mannotriose. 














B 


166 


Bacterial Polysaccharide Structure and Biosynthesis 


Periplasm 



Wzy-dependent pathway 


Und-PP-O-units 


Und-PP-O-antigen 





Cytoplasm 


Polymerization 


ABC-dependent pathway 


Periplasm 



WbdABC 




Initiation 


Elongation 


Termination 



Bacterial Polysaccharide Structure and Biosynthesis, Fig. 5 Schematics of 0-antigen biosynthesis 


Extracellular glycans are found in both Gram¬ 
negative and Gram-positive bacteria. Some of them, 
such as K-antigens, are bound to the cell surface (e.g., 
with the aid of a phosphatidic acid anchor) and form 
a protective capsule, whereas others are released to the 
environment as a slime or a component of biofilm, 
a matrix of polymeric substances encapsulating bacte¬ 
ria and adherent to a living or inert surface (Cescutti 
2009). Like cell-wall polysaccharides, exoglycans are 


built up of oligosaccharide repeating units, which 
sometimes have the same structure as O-units in 
Gram-negative bacteria. They usually have anionic 
character, often including phosphate groups. Some 
zwitterionic exopolysaccharides exhibit peculiar bio¬ 
logical activities, particularly, induce immunomodula¬ 
tory T cell response. 

Bacterial cells may be surrounded by an S-layer, an 
envelope of paracrystalline two-dimensional lattices 
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Bacterial Polysaccharide 
Structure and Biosynthesis, 
Fig. 6 Schematics of LPS 
biosynthesis and export to the 
outer membrane 
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composed of glycoprotein molecules self-aggregated 
by electrostatic or hydrophobic interactions. S-layer 
glycan chains are repetitive long polysaccharides anal¬ 
ogous to the 0-chains or oligosaccharides resembling 
the LPS core (Messner et al. 2009). Typically, they are 
0-linked to serine, threonine or, less common, tyrosine 
and are much more diverse in composition and struc¬ 
ture than glycan chains of eukaryotic glycoproteins. 
S-layers are characteristic also of archaea but in con¬ 
trast to those of bacteria, they have predominantly 
short-chain glycans that are N-linked to asparagine. 

The biosynthesis of 0-antigens (Fig. 5) (and also of 
other bacterial surface polysaccharides, exopolysac¬ 
charides, and cell-wall peptidoglycan, as well as glycans 
for protein glycosylation) starts at the cytosolic 
face of the plasma membrane by the formation of an 
undecaprenyl diphosphate (Und-PP)-linked saccharide 
using a sugar delivered as a nucleoside diphosphate 
(Raetz and Whitfield 2002). Two different families 
of integral membrane proteins designated as 


polyisopreny 1-phosphate Wacetylaminosugar-1 -phos¬ 
phate and hexose-1-phosphate transferases (PNPT and 
PFIPT, respectively) catalyze the initiation reaction 
(Valvano 2003). The subsequent steps in the glucan bio¬ 
synthesis depend on the mode of export of the phosphoi- 
soprenol-linked saccharide across the lipid membrane, 
which can be distinguished into the polymerase (Wzy)- 
dependent and ATP-binding cassette (ABC)-dependent 
pathways. 

In the Wzy-dependent pathway, the 0-unit is assem¬ 
bled from reactions catalyzed by specific glycosyl- 
transferases, which are typically peripheral membrane 
proteins associated with the plasma membrane by ionic 
interactions. Individual 0-units are translocated across 
the membrane by an ATP hydrolysis-independent mech¬ 
anism mediated by the protein Wzx (0-unit flippase). On 
the periplasmic side of the membrane, the translocated 
0-unit is polymerized to a certain length, unique to each 
0-antigen, by the concerted functions of Wzy (O-antigen 
polymerase) and Wzz (O-antigen chain length 
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regulator). In the ABC-dependent pathway, the 
0-antigen is elongated in the cytosol by processive 
glycosyltransferases, and the polymer is translocated 
across the membrane by an ABC transporter (WztAVzm 
or Wzk). ABC transporters are also used in the synthesis 
of capsular polysaccharides (Whitfield 2006) and 
S-layers (Messner et al. 2009). In some cases, a termina¬ 
tion signal that ceases the chain elongation is provided by 
methylation or other terminal sugar modifications that 
couple the polymerization with the ABC transporter. 

The LPS core oligosaccharide is assembled in 
a separate pathway on preformed lipid A at the cytosolic 
face of the plasma membrane, and the resultant 
lipooligosaccharide is translocated to the periplasmic 
face of the membrane by a specialized ABC transporter 
(MsbA). The ligation of the 0-antigen “en bloc” onto 
lipid A-core (Fig. 6) is catalyzed by an oligosaccharyl- 
transferase (WaaL, 0-antigen ligase), resulting in the 
release of Und-PP. WaaL has similarities to oligosac- 
charyltransferases involved in the synthesis of O-linked 
glycoproteins. 

The complete LPS molecule is transported to the 
outer membrane by the LptABCDEFG protein complex 
(Fig. 6) (Ruiz et al. 2009). The export of capsules to 
the outer membrane also requires a distinct complex of 
proteins, which includes an outer membrane secreting 
protein (Whitfield 2006). 
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Synonyms 

Gene repression; Regulation of gene expression 


Repression: The Trp and Lac Systems 

Repression is an important aspect of the regulation of 
gene expression in bacteria that is achieved by the 
interaction of a protein (repressor) with a specific 
DNA sequence (the operator) in the promoter region 
of a polycistronic gene. The affinity of the repressor for 
its operator is modulated by a metabolite that is part of 
the pathway controlled by the gene products, and is thus 
an example of metabolic or nutrient sensing. Repression 
can be either positive or negative as exemplified by the 
lac repressor system (positive control) and the trp 
repressor system (negative control) (Fig. 1). 

The lac repressor is constitutively expressed in 
enteric bacteria and binds tightly to the lac operator. 




Bacterial Repression: Lac and Trp Systems 


169 


B 


1 Lac 1 1 

N 

O 

Q. 

Y 

A 


\ 




88 ^ 


m 


NS DNA 


b Trp R 


\ 


□□ 


004— 

I I 




00 4 — 

I NS DNA ] 


W 


Bacterial Repression: Lac and Trp Systems, Fig. 1 Illustra¬ 
tion of the operons of lac (negative) and tip (positive) control. 
p,o represent the promoter and operator sequences, respectively. 
The promoter is accessed by RNA polymerase only if the oper¬ 
ator is not occupied by a repressor molecule, (a) Lac 1 is the gene 
for the lac repressor, which is transcribed and translated into 
a tetrameric protein that has high affinity for the operator, and 
lower affinity for other (NS) DNA. The normal state of the 
system is repressed, i.e., there is no transcription of the genes 
Z,Y, or A. Only under conditions where lactose must be metab¬ 
olized does the inducer allolactose bind to free repressor mole¬ 
cules and cause a change of state in which the repressor has low 
affinity for the operator, causing its dissociation, and permitting 
transcription/ti'anslation to proceed, (b) EDCBE are the struc¬ 
tural genes for tryptophan biosynthesis. Tip R is the gene for the 
trp repressor, which is a dimer, that has only low affinity for the 
trp operator, and thus under conditions of low tryptophan avail¬ 
ability, transcription/translation is permitted (leading to trypto¬ 
phan production. At high concentration, L-tryptophan binds to 
free repressor, increasing its affinity for the operator, and thus 
shutting down transcription/translation - an example of feed¬ 
back positive control. In addition to trpEDCBA, the trp repressor 
also regulates the trpR, AwH, AroL, and mtr operons) 


shutting down expression of the three lac genes. Only 
when glucose becomes limiting, and lactate is avail¬ 
able, are these genes needed; the lac repressor binds the 
metabolite allolactose, and this complex has a much 
lower affinity for the operator, and thus decreases the 
fraction of operator occupied by repressor, permitting 
transcription (Miiller-Hill 1996). In contrast, the trp 
repressor has only low affinity for its operator so that 
expression of the trp ABCDE genes is permitted in the 
absence of the corepressor L-tryptophan. Thus, the 
product of the pathway forms a negative feedback 
loop. Repression is only part of the regulation, 
however; in the trp system, there are additional levels 
of control at the transcription/translation level includ¬ 
ing attenuation (Yanofsky 2004), and in the lac system. 


the cAMP BP-protein-protein interactions and cou¬ 
pling to metabolism are involved. 

The lac and trp repressors represent two exemplars, 
and their interactions with the allosteric modulators 
and DNA have been extensively studied by genetic, 
structural, and biophysical approaches. Both of these 
proteins use the helix-tum-helix motif for DNA recog¬ 
nition. This motif is very common in DNA binding 
proteins and is not exclusive to bacterial repressors. 
Furthermore, it is not unusual for this motif to be 
extended, such as in the so-called winged helix-turn- 
helix proteins that occur widely (http://www.ebi.ac.uk/ 
interpro/IEntry?ac>/4IPR011991) in all kingdoms. 


B 


Biophysics of Repression 

The role of biophysics is primarily to understand the 
relationships between the structure of the interacting 
components, their energetics, and their kinetics, and 
thus the molecular mechanism of molecular recogni¬ 
tion and biological regulation. 

1. Lac repressor. The lac repressor is a tetramer, and 
the DNA binding domains can be removed either 
genetically or by proteolysis. The small domains 
(51-60 residues) are autonomously folded and 
retain their DNA binding activity and specificity 
(Miiller-Hill 1996). The three-dimensional struc¬ 
ture of the free proteins and in complexes with 
various DNAs has been determined both by high- 
resolution NMR as shown in Fig. 2 and by X-ray 
crystallography (Kercher et al. 1997). 

The DNA binding domains each contain 
a canonical helix-tum-helix motif; the second helix 
of this motif slots into the major groove of the DNA 
and is known as the “recognition helix.” Functional 
groups on the protein side chains make specific inter¬ 
actions with both the phosphate backbone, and with 
functional groups on the exposed edges of the bases, 
which accounts for the sequence specificity. Addi¬ 
tional interactions with the phosphate backbone 
account for the overall affinity. In the intact protein, 
the net affinity is very high, but it is sensitive to 
small changes in the DNA recognition sequences and 
the solution conditions, including the ionic strength. 
It was the latter that made the system attractive 
for detailed biophysical studies to determine the 
origin of sequence selectivity in DNA binding 
proteins. 
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Bacterial Repression: Lac and Trp Systems, Fig. 2 Structure 
of the lac repressor-operator complex determined by NMR 
(From Romanuka et al. 2009): Pdb 2KEI. The protein backbone 
is shown as colored ribbons with the recognition helices in the 
major groves of the DNA, shown as ribbon and in ball and stick 
representation 


2. Trp repressor. The trp repressor is a dimer of 
identical subunits, each containing a classical helix- 
tum-helix DNA binding motif, but unlike the lac 
repressor, these are not autonomously folding 
domains. The structure of the protein in the absence 
and presence of the corepressor L-tryptophan and the 
repressor-operator complex have been studied by 
both X-ray crystallography and NMR (Zhang et al. 
1987; Otwinowski et al. 1988; Zhang et al. 1994). 
Figure 3 shows the X-ray structure of the protein- 
DNA complex. 

The protein itself displays a number of interesting 
features, not least the flexible N-terminal arms, and in 
the absence of corepressor, a partially unfolded DNA 
recognition helix (Zhang et al. 1987). Binding trypto¬ 
phan stabilizes and orients the recognition helix, partly 
accounting for the role of the corepressor in increasing 
DNA binding affinity. Thermodynamic studies have 
indicated that the flexibility of this helix is a major com¬ 
ponent to the high heat capacity change associated with 
DNA and corepressor binding. The system has been a test 
bed for thermodynamic analysis to establish a signature 
for folding-dependent binding and specificity. 

An unanticipated observation on the trp repressor- 
operator complex was that the number of direct contacts 
between the protein and the DNA was quite small, 
and seemingly inadequate to account for its moderate 
sequence specificity (ca. 10,000 fold against non-cognate 
DNA). However, it appears that several interactions were 
mediated by bound water molecules, the so-called indirect 
readout mechanism, which in itself requires additional 


considerations for understanding the thermodynamics of 
the system. Generally, water is released on forming spe¬ 
cific complexes, which should be entropically favorable, 
whereas here the water was retained in the interface 
(Joachimiak et al. 1994). 

3. Free energy parsing: A major goal of biophysics is to 
understand the physical factors that determine the 
affinity and specificity of the protein-DNA interac¬ 
tions. A part of this is to relate specific identifiable 
forces to the structure characteristic of the interacting 
components. Thus the free energy change of the inter¬ 
action in principle can be parsed into several contri¬ 
butions, albeit the terms that are chosen are not unique 
(Lane and Jenkins 2000). 

A common parsing is: 

AG = AG (bonds) -F AG (elec) -f AG (HB) 

-F AG (vdW) -F AG (solv) -F AG(conf) 

Elec Is electrostatics, HB is hydrogen bonds, vdW are 
van der Waals interactions, solv is solvation, and conf 
is conformational energy. These contributions can in 
principle be broken down further into the enthalpy and 
entropy contributions, which also account for thermal 
effects, volume changes or other extrinsic factors. 

As DNA bears a net negative charge and the 
interacting protein may have a net positive charge 
whose value is pH dependent, the free energy of the 
interaction will depend both on pH and ionic strength. 
These together comprise the electrostatic component. 
The binding of repressors to DNA is often strongly 
dependent on ionic strength, indicating the formation of 
a substantial number of salt bridges between the binding 
surface and the negatively charged phosphodiester 
backbone of the DNA. At an ionic strength of 1 M, 
the electrostatic contribution to binding becomes 
zero, where the non-electrostatic contributions can be 
determined. For the lac repressor, the dissociation 
constant = 3 pM under physiological salt conditions but 
rises to 35 nM at 1 M salt (Lin and Riggs 1975), a differ¬ 
ence in favorable free energy of around 24 kJ mol“*. 

It is also common for the interface to be less 
solvated than the surfaces in the uncomplexed mole¬ 
cules so that AG also depends on water activity. The 
conformation of the free components may change 
significantly in forming the complex, which will be 
expressed in the AG(conf) term. This has been shown 
to be associated with a large change in the heat capac¬ 
ity. The binding heat capacity has also been correlated 
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Bacterial Repression: Lac and Trp Systems, Fig. 3 X-ray 
structure of the trp repressor-operator complex (From 
Otwinowski et al. 1988): Pdb ITRO. The all-helical trp repressor 
dimer is shown as connected cylinders in pastel colors. The 
recognition helices access successive major grooves of the 
DNA. Bound tryptophan molecules are shown as space filling 
molecules with blue nitrogen atoms 

with changes in hydration via the solvent accessible 
surface area and/or folding events that are coupled to 
binding (Ha et al. 1989; Spolar and Record 1994). 
However, thermodynamic parameters other than AG 
can be difficult to measure reliably and accurately. 

The large increase in binding affinity for the cognate 
operator site is in part a consequence of additional close 
interactions between the protein and the DNA, especially 
those concerned with sequence recognition, which 
include not only specihc H-bonding contacts, but also 
exact shape complementarity and concomitant changes 
in hydration. In the lac repressor, there have been many 
attempts to define the importance of H-bonds and other 
specific interactions by site-directed mutagenesis, in 
which a single residue at a time is changed for other 
amino acids, and as a corollary, changes are made to the 
DNA sequences to compensate for changes in amino 
acid interactions (Frank et al. 1997). In principle, specific 
H-bonding interactions and/or electrostatic contributions 


from individual groups can be deciphered by suitable 
mutation cycles (Fersht 1999) and investigating the 
effects of pH and ionic strength on the protein-DNA 
affinity. 

Sequence selectivity: To a first approximation, the 
rate of transcription from a particular bacterial pro¬ 
moter is determined by the fraction of the promoter 
that is available to RNA polymerase. As a repressor 
cannot have zero affinity for non-cognate DNA, the 
issue becomes one of the relative affinity between 
repressor and non-cognate sites given a potential site 
concentration ratio of the order one million or so, the 
typically low number of repressor molecules present 
per cell, and the small number of specific target 
sequences. The lac repressor, estimated to be at ca. 
10 molecules per cell on average (approx. 10 nM), 
binds to nonspecific DNA with an affinity of the 
order 10 pM (sites), whereas the affinity for the natural 
palindromic operator site is ca. 3 pM at near physio¬ 
logical salt conditions or > one-million-fold differen¬ 
tial binding capacity. However, the ratio of the number 
of possible sites to the single lac operator target site is 
also of the order one million. The implication is that 
the number of free repressor molecule is essentially 
zero, and that in the absence of an inducer, the operator 
is occupied by the repressor perhaps 90% of the time, 
the remaining repressor molecules are bound to 
nonspecific DNA. In a population of cells, this implies 
that 1 in 10 is derepressed at any given time. As the 
inducer decreases the affinity for the operator DNA 
about 1,000 fold, with no effect on the affinity for 
nonspecific DNA, the repressor redistributes to the 
nonoperator sites. 

The trp repressor has nonspecific affinity for DNA 
in the 10-20 pM range, similar to the lac repressor, and 
affinity for the Trp O operator of the order 1 nM, i.e., 
only 10^-fold selectivity ratio. Furthermore, the other 
four operators recognized by trp R have even lower 
affinity, suggesting that the simple arguments about 
relative affinity and numbers of sites is oversimplified. 
In practice, one must also consider the interactions 
with other proteins, including the availability of RNA 
polymerase and binding kinetics versus initiation rates 
in a rapidly growing culture. 

However, the use of the affinities measured under 
near ideal conditions is complicated by several factors: 
genomic DNA is largely occupied with a wide variety of 
proteins, the very high solute concentration inside bac¬ 
teria (of the order 400 mg/ml) and the presence of 
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multiple operator sites, which via DNA folding can be 
simultaneously occupied by a repressor tetramer. Other 
non-cognate sites that have some elements in common 
with the wt operator will have an affinity between that of 
the operator and the bulk DNA. In other words, the 
model should include the distribution of affinities as 
well as the number of sites actually available. Such 
considerations make modeling of the repression/dere¬ 
pression intrinsically difficult as the appropriate interac¬ 
tions at the cellular level may be stochastic, and the 
relevant affinity and rate constants could be very differ¬ 
ent from those measured in dilute aqueous solution. 
These considerations pose severe challenges to the 
biophysicist. Furthermore, the temporal aspects of 
repression/derepression have received less detailed 
examination. However, in studies of the kinetics of bind¬ 
ing the lac repressor to DNA, it appeared that the second 
order rate constant exceeded the diffusion limited rate 
constant by a large margin that could not be attributed 
only to electrostatics. It was argued that the whole geno¬ 
mic DNA is the initial target and then a repressor mole¬ 
cule would reach the specific site by a process of one 
dimensional diffusion along the contour of the DNA 
(sliding). There are other explanations that take into 
account the three-dimensional structure of closed circu¬ 
lar DNA and the tetrameric nature of the repressor (hop¬ 
ping mechanisms/intersegment transfer). Given that 
genomic DNA is far from naked, some mixture of local 
sliding and the hopping mechanism may operate in vivo. 

Cross-References 

► Coupled Folding and Binding 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► Nucleic Acid NMR - Introduction 

► Protein NMR - Introduction 

► Thermodynamics of Biomolecular Interactions 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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Synonyms 

Electron transport chains in bacteria 

Definitions 

Bacterial respiratory chains provide a biochemical 
scaffold for the shuttling of electrons from organic 
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and inorganic energy sources to a variety of acceptor 
molecules, typically oxygen. This process is coupled to 
the generation of an electrochemical gradient, which 
may he used to generate energy in the form of ATP via 
the FiFo ATPase. 

Introduction 

The aim of this entry is to provide an overview of the 
function, structure, biosynthesis, and evolution of 
respiratory chains in bacteria. Classical ideas of respi¬ 
ratory function describe the generation of an electro¬ 
chemical gradient across the inner membrane via the 
translocation of protons derived from the oxidation of 
reduced nicotinamide adenine dinucleotide (NADH) 
and succinate. This proton motive force may then be 
used by the FiFq ATPase to synthesize ATP. However, 
the mechanism via which this gradient is achieved 
depends upon a number of environmental and genetic 
factors. One common feature of bacterial respiratory 
metabolism is the modular architecture. This enables 
the utilization of the major classes of respiratory com¬ 
ponents (i.e., dehydrogenases, quinones, cytochromes, 
reductases, and terminal oxidases) in different combi¬ 
nations to optimize respiration and energy conservation 
under the prevailing, often variable, environmental 
conditions. One obvious example is the availability of 
oxygen, which can influence the use of dioxygen as 
a terminal electron acceptor (aerobic respiration), as 
opposed to inorganic molecules such as sulfate, carbon 
dioxide, nitrate or nitrite (anaerobic respiration). To 
understand aerobic and anaerobic respiration requires 
knowledge of the respiratory genes of the bacterium, 
their expression levels, availability of electron donors/ 
acceptors, and the affinity of the various respiratory 
complexes for their respective electron donors and 
acceptors. Furthermore, since most of the respiratory 
chain is reversible, the operating redox potential of a 
couple should be close to the mid-point potential of 
this couple, allowing both the reduced and oxidized 
electron carriers to exist at comparable concentra¬ 
tions. Hence, cellular redox poise impinges upon the 
utilization of redox carriers in the respiratory chain. 
This entry provides an overview of the different func¬ 
tional classes of bacterial respiratory chain, using 
Escherichia coli as an example to demonstrate the 
remarkable flexibility and modular nature of these 
respiratory complexes. 


Functions 

Aerobic and anaerobic modes of respiration differ only 
with respect to the terminal electron acceptors that are 
used: dioxygen in aerobic respiration, and a variety of 
inorganic and organic molecules for anaerobic respi¬ 
ration. Respiration-coupled oxidative phosphorylation 
via the proton motive force is significantly more effi¬ 
cient, in terms of energy conservation, than fermenta¬ 
tion. To understand the chemistry underpinning this 
phenomenon, one should consider the redox properties 
of the terminal electron acceptors. The four-electron 
reduction of dioxygen has a redox midpoint of 
-1-815 mV (Eo') and represents a larger thermodynamic 
sink than is achieved via the use of nitrate/nitrite 
(Eo' = 430 mV) or fumarate/succinate (Eo' = 33 mV) 
in anaerobic respiration, or pyruvate/lactate in 
fermentation. However, the solubility of oxygen in 
water is relatively poor, and its kinetic inertness 
necessitates activation by di-heme or heme-Cu 
centers (da Silva and Williams 2001) in bacterial 
terminal oxidases. 

The primary role of bacterial respiratory chains is to 
generate a proton motive force (pmf) for ATP synthe¬ 
sis by the EiEq ATPase. However, not all electron 
transport in bacterial respiratory chains results in the 
generation of a proton motive force: respiratory com¬ 
plexes are also thought to uncouple catabolism from 
ATP synthesis, dissipating an excess of reducing 
power in the respiratory chain when the supply of 
NADH exceeds demand or when excess oxygen 
must be consumed. This kind of electron transport 
may be useful during a transition from starvation to 
normal exponential growth, where a high rate of 
NADH dehydrogenase activity could damage down¬ 
stream respiratory complexes. Certain bacterial respi¬ 
ratory chains generate a sodium electrochemical 
gradient, not a pmf. Hence, bacterial respiratory 
chains can be thought of as highly responsive, modu¬ 
lar systems that can regulate the flux of electrons and 
translocation of protons via the use of a variety of 
electron donors, respiratory complexes, and electron 
acceptors. 

Structural Classes 

Electrons can enter and leave the respiratory chain at 
a number of different points, and individual bacteria 
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donors 

e.g. NADH, succinate, 
formate, lactate 



oxidases oxi 

\ / 


dioxygen 

Bacterial Respiratory Chains, Fig. 1 A generalized model of 
the bacterial respiratory chain (Gennis 1989; Nicholls and 
Ferguson 2002) 


are able to utilize multiple electron donors and accep¬ 
tors, often simultaneously, to provide the respiratory 
flexibility required for the changing environmental 
niches in which they reside. The respiratory flexibility 
in the mammalian mitochondrion is relatively poor, 
having limited options at the level of electron input 
and only a single terminal electron acceptor in cyto¬ 
chrome c oxidase (cytochrome aa^). Electron transport 
in plant mitochondria is slightly more flexible, where 
a number of alternative NADH dehydrogenases and 
two terminal oxidases permit respiratory functions 
beyond the synthesis of ATP: uncoupling of electron 
transport via these alternative respiratory complexes 
can be used to generate heat in thermogenic tissues or 
in flowering in plants. However, one needs to look to 
bacteria and archaea to find the most extreme examples 
of respiratory flexibility. 

Bacterial respiratory chains can comprise multiple 
dehydrogenases, quinone electron carriers, a cyto¬ 
chrome bc\ complex, and a range of quinol oxidases 
and reductase complexes that reduce molecular oxygen 
and various inorganic electron acceptors, respectively. 
Figure 1 depicts a generalized model of a bacterial respi¬ 
ratory chain. Depending on the growth conditions, 
different combinations of these complexes predominate, 
and a range of electron transport routes are available 
to suit the prevailing conditions, the individual 


components often being expressed in response to specific 
environmental stimuli. This flexibility explains why bac¬ 
terial respiratory chains are often described as being 
branched, modular, and inducible. A good example of 
this modular architecture is the respiratory chain of 
E. coli (Fig. 2). Although other bacteria may have 
a greater range of routes from dehydrogenase to oxidase, 
such as Paracoccus denitiificans, a more extensive liter¬ 
ature exists on the structure, function, and regulation of 
these respiratory complexes in E. coli. 

Electron Donors 

Bacteria can be classified by the source of energy 
used to drive electron transport and energy generation. 
Autotrophs derive energy from light (phototrophs) or 
from simple inorganic molecules (lithotrophs), and 
organotrophs derive energy from organic molecules. 
For example, E. coli can perform organotrophic 
respiration via the oxidation of NADH, succinate, 
glycerophosphate, lactate, and formate, whereas 
lithotrophic bacteria may oxidize sulfite, sulfide, 
nitrite, ammonia, carbon dioxide, and ferrous iron as 
an energy source. In addition, E. coli can utilize hydro¬ 
gen as an electron source under anaerobic conditions. 

Dehydrogenases 

Organotrophic bacteria have a wide array of dehydro¬ 
genases that may be used to process different energy 
sources such as NADH, succinate, formate, glycerol-3- 
phosphate, pyruvate, and hydrogen (hydrogenases). 
Often, there may be multiple isoforms of these 
enzymes in individual bacterial species. With regard 
to the oxidation of NADH, E. coli possesses the mem¬ 
brane-spanning NDH-I (similar to complex I - NADH- 
ubiquinone oxidoreductase of mitochondria) that 
translocates two protons upon oxidation of NADH. 
Additionally, NADH is oxidized by a range of other 
dehydrogenases that channel electrons to the quinone 
pool without contributing to the proton gradient 
(NDH-II, WrbA, QOR, YhdH) (Calhoun et al. 1993). 
In E. coli, electrons are also transferred to the qui¬ 
none pool via succinate dehydrogenase, which is 
similar to complex Il-succinate dehydrogenase in 
mitochondria and does not contribute to the proton 
gradient. 
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Bacterial Respiratory 
Chains, Fig. 2 Respiratory 
flexibility in E. coli. UQ 
ubiquinone, MQ 
menaquinone, DMQ 
dimethylmenaquinone 
(Richardson 2000; Gennis and 
Stewart 1996) 
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Quinone Carriers 

Quinones are lipid-soluble molecules that reside in the 
inner membrane, and shuttle electrons and protons 
between various respiratory complexes. Collectively, 
they are referred to as the quinone pool, which can 
consist of a number of quinone species in different 
redox states (Soballe and Poole, 1999). Bacteria com¬ 
monly use ubiquinone, along with related quinones 
such as menaquinone. E. coli uses ubiquinone to shuttle 
electrons to the terminal oxidases during aerobic respi¬ 
ration, and favors naphthoquinones (menaquinone 
and dimethyl menaquinone) to shuttle electrons to 
the various reductases during anaerobic respiration 
(see ► Electron Transport in Strict Anaerobes). 

Cytochrome bci Complex 

The cytochrome hc^ complex is the most common 
of the energy-transducing electron transfer complexes, 
and has been isolated from gram-negative and 
gram-positive bacteria, from organotrophs and 
lithotrophs, from aerobic, anaerobic, and photosyn¬ 
thetic bacteria (Trumpower 1990). Such complexes 
comprise a phylogenetically diverse group of elec¬ 
tron-transferring membrane complexes, similar to the 
mitochondrial Complex III - cytochrome hci complex 
and the cytochrome ftg/complexes of chloroplasts and 
photosynthetic bacteria (see ► Photosynthetic Elec¬ 
tron Transport), all of which couple electron transfer 
to proton translocation. The bacterial hcx complex 
occupies a branch between the quinone pool and cyto¬ 
chrome c-type terminal oxidases (Fig. 1). 


All bacteria that contain a cytochrome hci complex 
appear to contain at least one alternative ubiquinol 
oxidase pathway, which can circumvent the hcy 
complex. This branching of the respiratory chain may 
provide these bacteria with protection against compet¬ 
ing microorganisms which produce toxins that target 
the hci complex, such as antimycin and myxothiazol. 
Some bacteria lack the cytochrome hci complex 
altogether, as is the case with E. coli. 

Terminal Oxidases 

Bacterial terminal oxidases are integral membrane pro¬ 
teins that couple either (a) the two-electron oxidation 
of ubiquinone or (b) the 1-electron oxidation of cyto¬ 
chrome c to the four-electron reduction of molecular 
oxygen to water, often resulting in a contribution to the 
proton gradient (Poole and Cook 2000). Oxidases can 
be sub-grouped into four classes: Class I, cytochrome c 
oxidases containing heme a and Cu (cytochrome aaj,-, 
caa^-, and Oi-types); Class II, cytochrome c oxidases 
containing heme h (cytochrome o- and co-types); Class 
III, quinol oxidases containing heme h and heme d 
(cytochrome /jii-type); Class IV, quinol oxidases 
containing heme h and heme o (cytochrome bo- or 
(jo'-type, where the superscript designates a ligand¬ 
binding heme). E. coli possesses terminal oxidases 
from classes III and IV: two oxidases of the M-type, 
encoded by the cydAB and appBC loci, and 
a Cu-containing oxidase of the cytochrome foo'-type 
encoded by the cyo operon. Both cytochrome hd-\ 
(CydAB) and cytochrome hd-\\ (AppBC) are quinol 
oxidases, each comprising two polypeptides and a low 
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spin heme h, high spin heme h, and heme d. In contrast, 
the cytochrome ho' quinol oxidase comprises a high 
spin heme (cytochrome o), a low spin h-type heme 
(cytochrome h), and copper, resulting in a hinuclear 
heme-copper catalytic site similar to suhunit I of the 
mammalian cytochrome aos system (see ► Cyto¬ 
chrome c Oxidase (Complex IV)). These structural 
differences between cytochrome hd-type and ho'-type 
oxidases are reflected in their mode of action, since 
only cytochrome ho' acts as a proton pump, although 
cytochrome hd-l generates a proton gradient via liber¬ 
ation of protons upon quinol oxidation. It has been 
reported, based on chemostat cultures with intact 
cells, that cytochrome hr/-II-mediated quinol oxidase 
activity is electroneutral and does not contribute to the 
pmf, but this requires confirmation by direct measure¬ 
ment in membranes or with the purified oxidase. 

Reductases 

There is a diverse array of electron acceptors utilized 
by bacteria and archaea during anaerobic respiration, 
including nitrogen oxy-anions and nitrogen oxides, 
organic N-oxides, halogenated organics, metalloid 
oxyanions (e.g., arsenate), transition metals (e.g., Fe 
(III) and Mn(IV), inorganic sulfur compounds, organic 
sulfoxides and sulfonates, nitrogen oxyanions and 
oxides, and organic N-oxides. 

Facultative anaerobes may possess reductases that 
are synthesized in response to a drop in oxygen ten¬ 
sion. For example, E. coli expresses reductases that can 
utilize a range of organic acceptors (fumarate, DMSO, 
TMAO), and a series of reductases that can succes¬ 
sively reduce nitrate, nitrite, and nitric oxide (NO) 
under denitrifying conditions. 

Cytochrome Carriers 

Some cytochromes are mobile electron carriers that can 
shuttle electrons between respiratory complexes. These 
electron carriers are common in facultative anaerobic 
bacteria, and may transport electrons between the 
cytochrome hci complex and cytochrome c oxidases 
and between the quinone pool and various reductases. 
These carriers may transfer electrons between 
membrane-bound redox proteins, as is the case for the 


cytochrome C 550 of Paracoccus denitrificans that 
shuttles electrons between the hci complex and 
the cytochrome aas-type oxidase. Others are soluble 
periplasmic proteins, such as cytochrome C 552 of 
P. aeruginosa and E. coli, which provides the conduit 
between the quinone pool and nitrate reductase. 

Regulation 

The transcriptional regulation of respiratory complex 
components is influenced by a multitude of environ¬ 
mental factors that mediate their actions via a number 
of transcription factors, often eliciting concerted con¬ 
trol over individual promoters. This is exemplified by 
the regulation of two of the NADH dehydrogenases of 
E. coli K12: the proton-translocating NDH-I complex 
and the membrane-bound non-proton translocating 
monomeric NDH-II protein. Significant electron flow 
passes through both NDH-I and NDH-II during glu¬ 
cose-limited aerobic growth, although the NDH-I is 
the preferred NADH dehydrogenase of aerobic and 
nitrate respiration, whereas NDH-II is favored during 
fumarate respiration. Since the affinity for NADH of 
NDH-II is high, it is thought that the function of this 
non-electrogenic flavoprotein is to regulate the NADH 
pool independently of energy generation when the 
capacity of the cell to generate energy exceeds 
demand. The nuo operon, which encodes the subunits 
of the NDH-I, is mainly controlled by O 2 and nitrate 
via ArcA (anaerobic repression) and NarL (nitrate 
activation), although Fnr and IHF also exert weak 
anaerobic repression. The ndh gene, encoding NDH- 
II, is subject to anaerobic repression via the 
tight control of Fnr, and is also influenced by growth 
phase-dependent regulation and nutrient quality via 
Fis and Arr, respectively. 

A similar network of transcriptional control is used 
to modulate the expression of terminal oxidases in 
E. coli. Cytochrome hd-\ has a high affinity for oxygen 
and is induced under microaerobic conditions, whereas 
the lower affinity cytochrome ho' is highly expressed 
under aerobic conditions. Both the hd-l (CydAB) and ho 
oxidases of E. coli are repressed under anoxic conditions 
as components of the Arc and Fnr regulons. Also, the 
catabolite repressor activator (Cra) has been shown to 
control cydAB expression in concert with Arc and Fnr. 
In contrast, the app operon, which encodes the AppBC 
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cytochrome hd-\i quinol oxidase, is induced by carbon 
and phosphate starvation and is under the control of the 
transcription factor AppY, the sigma factor RpoS, and 
the histone-like DNA-binding protein H-NS. 

Evolution 

Through phylogenetic analyses, a number of homolo¬ 
gous respiratory proteins have been shown to exist in 
bacteria and archaea, including nitrate reductase, DMSO 
reductase, and cytochrome hd oxidase. Therefore, it 
seems plausible that these respiratory complexes arose 
early in evolution and that the last common ancestor of 
living organisms was not a simple organism with regard 
to respiratory metabolism. It, therefore, seems likely that 
respiratory flexibility provided a selective advantage 
during this early stage of evolution, which permitted 
the colonization of a range of environmental niches 
(Castresana and Moreira 1999). 

Summary 

The ability of bacteria to colonize a wide range of 
environments and rapidly adapt to changes in condi¬ 
tions is underpinned by the diversity and flexibility of 
their respiratory chains. Knowledge of these respiratory 
adaptations is of great importance for the understanding 
of clinical infections: for example, invading bacteria 
can induce an NO reductase or flavohemoglobin to 
detoxify cytotoxic NO produced by host cells, and can 
also express an NO-insensitive cytochrome hd terminal 
oxidase to facilitate aerobic respiration in the presence 
of NO, a potent inhibitor of cytochrome function 
(Mason et al. 2009). Additionally, knowledge of bac¬ 
terial respiration may be exploited for bioremediation 
purposes (e.g., for the removal of nitrates from agricul¬ 
tural waste water), and a detailed understanding of the 
structural/spectroscopic characteristics of individual 
complexes is crucial for the development of biosensors 
(e.g., for the detection of dissolved gases). With the 
emergence of second and third generation sequencing 
platforms, insightful use of the increasing number of 
available genome sequences of pathogenic bacteria 
demands an understanding of how the potential for 
respiratory flexibility can impact upon both survival 
outside the host and colonization during infection. 


Bekker M, de Vries S, Ter Beek A, Hellingwerf KJ, Teixeira de 
Mattos MJ. Respiration of Escherichia coli can be fully 
uncoupled via the non-electrogenic terminal cytochrome 
M-II oxidase. J Bacteriol. 2009;191:5510-17. 

Calhoun MW, Oden KL, Gennis RB, de Mattos MJT, Neijssel 
OM. Energetic efficiency of Escherichia coli — effects of 
mutations in components of the aerobic respiratory chain. 
J Bacteriol. 1993;175:3020-5. 

Castresana J, Moreira D. Respiratory chains in the last 
common ancestor of living organisms. J Mol Evol. 1999; 
49:453-60. 

Frausto da Silva JJR, Williams RJP (2001) The biological chem¬ 
istry of the elements. The inorganic chemistry of life, 
2nd edn. Oxford University Press, New York 

Gennis RB. Biomembranes: molecular structure and function. 
New York: Springer; 1989. 

Gennis RB, Stewart V. Respiration. In: Niedhart JC, Curtiss R, 
editors. Escherichia coli and Salmonella', cellular and 
molecular biology. Washington, DC: ASM Press; 1996. 
p. 217-61. 

Mason MG, Shepherd M, Nicholls P, Dobbin PS, 
Dodsworth KS, Poole RK, Cooper CE. Cytochrome hd 
confers nitric oxide resistance to Escherichia coli. 
Nat Chem Biol. 2009;5:94-6. 

Nicholls DG, Ferguson SJ. Bioenergetics 3. San Diego: Aca¬ 
demic; 2002. 

Richardson DJ. Bacterial respiration: a flexible process 
for a changing environment. Microbiology. 2000; 146: 
551-71. 

Poole RK, Cook GM. Redundancy of aerobic respiratory chains 
in bacteria? Routes, reasons and regulation. Adv Microb 
Physiol. 2000;43:165-224. 

Soballe B, Poole RK. Microbial ubiquinones: multiple roles in 
respiration, gene regulation and oxidative stress manage¬ 
ment. Microbiology. 1999;145:1817-30. 

Trumpower BE. Cytochrome hci complexes of microorganisms. 
Microbiol Rev. 1990;54:101-29. 




B 


178 


Bacterial Respiratory Oxygen Reductases 


Bacterial Respiratory Oxygen 
Reductases 

Robert B. Gennis 

Department of Biochemistry, University of Illinois, 
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Synonyms 

Cytochrome oxidase; Oxygen reductases; Terminal 
oxidases 


In prokaryotes (Eubacteria or Archaea), the 
enzymes are located in the cytoplasmic membrane. 
For clarity, the superscript “out-in” specifies that the 
gradients are designated hy the values on the outside 
minus the values in the cytoplasm. In organisms which 
grow near neutral pH, the ApH°“'^‘" 0, and the dom¬ 

inant term is the transmembrane voltage, 
Respiratory oxygen reductases all couple the redox 
chemistry catalyzed at the active site to the generation 
of a transmembrane voltage. There are two families of 
respiratory oxygen reductases in prokaryotes: the 
heme-copper oxygen reductases and the hd-family of 
oxygen reductases. 


Definition 


Multiple Oxygen Reductases 


Respiration is the process by which reduced organic or 
inorganic substances are oxidized through a network of 
electron transfer reactions to ultimately reduce an 
exogenous oxidant for the purposes to generate energy 
for cellular processes and to maintain redox balance. 
Aerobic respiration implies that the ultimate oxidant is 
O 2 . Anaerobic respiration implies the use of any oxi¬ 
dant except O 2 , such as nitrate, sulfate, elemental sul¬ 
fur, dimethylsulfoxide, and ferric salts. 


Introduction 

The primary purpose of the aerobic respiratory chain 
is to generate the energy that can be used to support 
cellular functions, largely, but not exclusively, by 
driving the FiEq-ATP synthase to make adenosine 
triphosphate (ATP) from ADP and inorganic phos¬ 
phate (Pi). This is accomplished by coupling the 
redox chemistry of respiration to generating a proton 
motive force. The proton motive force (pm for Ap) 
is defined as the proton electrochemical gradient 
across the membrane divided by the Faraday 
constant (F). 


P F F 

A^out-in ^ 

F 

at 298K :A/7 ™'-“ = aT'™'-” - 59ApH™‘-” 
in units of millivolts 


Most aerobic prokaryotes encode multiple respiratory 
oxygen reductases. For example, E. coli encodes one 
heme-copper oxygen reductase, cytochrome bo 3 , and 
two different bd-type oxygen reductases, cytochrome 
bd-I and cytochrome bd-II. These all catalyze the same 
reaction. 


4e^'-l-4H+-f02 2 H 2 O 

These three enzymes are expressed under different 
growth conditions. Cytochrome bd-I, for example, is 
induced under microaerophilic conditions whereas 
cytochrome ho 3 is expressed when the dissolved O 2 
concentration is high. Cytochrome hd-I has a higher 
affinity for O 2 , but the efficiency of coupling to the 
generation of the transmembrane potential is lower. It 
is not unusual for bacterial species to encode multiple 
heme-copper oxygen reductases (up to 8!) as well as 
multiple bd-type oxygen reductases. 


Heme-Copper Oxygen Reductases 

Redox centers: The mitochondrial cytochrome 
c oxidase is a member of the heme-copper superfamily, 
and most aerobic prokaryotes also contain at least one 
respiratory oxygen reductase in this group. The heme- 
copper superfamily includes both O 2 reductases and 
NO (nitric oxide) reductases. These enzymes all share 
a homologous subunit which contains all the metal 
redox centers. The O 2 reductases all contain a heme 
in which the iron is ligated to two histidines, plus 
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Bacterial Respiratory Oxygen Reductases, Fig. 1 The his¬ 
tidine-tyrosine cross-linked cofactor located at the active site of 
a heme-copper oxygen reductase. Shown are the active-site 
heme and Cub 


a “bimetallic” active site containing a second heme, 
ligated to a single histidine, and a nearby copper (Cub). 
O 2 binds to the open ligation site of the active-site 
heme, and is reduced to 2 H 2 O at this site. The active 
site also contains a cross-linked histidine-tyrosine 
cofactor which is essential for catalytic function 
(Fig. 1). The histidine is ligated to Cub and the tyrosine 
is a proton and electron donor engaged in the catalysis 
of O 2 reduction. Besides subunit I and the two hemes 
and Cub, no subunits or prosthetic groups are common 
to all heme-copper enzymes. Other subunits may be 
involved in enzyme regulation or assembly, and at 
least one of the additional subunits contains the redox 
center(s) required to deliver electrons to the catalytic 
center in subunit I. 

Heme types and notation'. Heme-copper oxygen 
reductases are usually notated by their heme content, 
and the subscript “3” indicates the heme which binds to 
O 2 at the active site. For example, cytochrome aa 3 
indicates that both the low-spin heme (with two histi¬ 
dine ligands) and the active-site heme are chemically 
identical. Heme A. Cytochrome bo 3 contains Heme B at 
the “low-spin” site and Heme O at the active site. 

Subfamilies: There are three major (and several 
minor) families of heme-copper oxygen reductases: the 
A-, B-, and C-families. More than 70% of the sequences 
of heme-copper oxygen reductases encode A-family 
enzymes; about 4% are B-family members and most 
of the remaining sequences are C-family members. 


Most of the B-family oxygen reductases are found 
in extremophiles, such as hyperthermophiles. Both the 
B- and C-families appear to have evolved as adaptations 
to growth under hypoxic (low O 2 ). 

X-ray structures: Structures are available for sev¬ 
eral A-family oxygen reductases and for one member 
each of the B-family (cytochrome ba 3 from Thermus 
thermophilus) and C-family (cytochrome cbb 3 from 
Pseudomonas stuzeri). 

Electron sources: Most of the heme-copper oxygen 
reductases utilize a soluble or membrane-bound cyto¬ 
chrome c as the source of electrons. A relatively small 
number of heme-copper enzymes directly oxidize quinols 
present in the membrane bilayer including, for example, 
the cytochrome bo 3 ubiquinol oxidase from E. coli. 

Mechanism and oxygenated intermediates: 
The heme-copper center is buried within the protein 
(Fig. 2) and the chemistry requires electrons, protons, 
and O 2 . Electrons are delivered via metal centers 
which constitute an “electron wire,” with the immedi¬ 
ate donor to the heme-copper center being the low spin 
heme within subunit I. Protons are delivered by proton¬ 
conducting channels (two channels for the A-family 
enzymes and one channel for the B- and C-family 
enzymes). O 2 diffuses from the lipid bilayer through 
a hydrophobic cavity to the active site. 

A version of the catalytic cycle is shown in Fig. 3. 
The redox state of the heme-copper binuclear center is 
indicated along with associated oxygen species. Starting 
with the oxidized state (ferric heme iron, Fe^"^, and cupric 
Cu|+), two electrons need to be added to form the 
reduced center (ferrous heme Fe^"^, cuprous Cug) before 
O 2 binds to the active-site heme. At this point a rapid 
four-electron reduction of O 2 occurs, in which two elec¬ 
trons are from the heme Fe, forming an oxoferryl species 
(Fe2+ ^ Fe+‘^ = O^^), one electron is from Cub 
(C ug ^ Cug^) and one electron and proton come from 
the tyrosine portion of the His-Tyr cofactor, forming 
a neutral radical (Tyr-OH^Tyr-O*). The rapid four- 
electron reduction of O 2 splits the 0-0 bond and leaves 
each oxygen atom at the valence state of H 2 O, avoiding 
the generation of reactive oxygen species (superoxide, 
peroxide, hydroxyl radical). This oxygenated form of the 
enzyme is called the Pm species. The next electron added 
reduces the radical back to tyrosinate and the final elec¬ 
tron reduces the oxoferryl state back to the ferric heme 
(Fe'^^ = O^^ ^ Fe^+ -|- HO^). The cycle contains four 
one-electron additions to the heme-copper center plus 
one step in which O 2 binds and reacts. 
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Bacterial Respiratory 
Oxygen Reductases, 

Fig. 2 Comparison of an 
A-family oxygen reductase 
(from R. sphaeroides) and a 
C-family oxygen reductase. 
The heme-copper active site is 
buried within subunit I in each 
case. The proton channels are 
different, however, as is the 
“electron wire” directing 
electrons into the active site 




A-family oxygen 
reductase 


H+ 

D-channel 


Periplasmic side 


C-family oxygen 
reductase 


Cytoplasmic side 
K-channel 


H+ 

K-channel 


Bacterial Respiratory 
Oxygen Reductases, 

Fig. 3 A version of the 
catalytic cycle of an A-family 
oxygen reductase, indicating 
the intermediates as well as 
steps of proton uptake and 
release. The subscripts 
indicate whether the protons 
come from the D-channel, the 
K-channel or go into the exit 
channel 



Generating the proton motive force'. Two different 
mechanisms are used to generate the proton motive 
force (Fig. 3): 

1. The electrons and protons originate from different 
sides of the membrane, which is determined by the 
arrangement of the “electron wire” and the proton 
channels. The driving force of the reaction, there¬ 
fore, results in the separation of four charges per O 2 , 
generating a voltage that is positive outside. 

2. The heme-copper oxygen reductases are proton 
pumps. For the A-family of enzymes, one proton 
is transported from the inside (bacterial cytoplasm) 
to the outside concomitant with each of the electron 
transfer steps in the mechanism (Fig. 2). This results 
in the transfer of an additional four charges across 


the membrane per O 2 . Flence, for each O 2 , eight 
charges are driven across the membrane, eight pro¬ 
tons are taken from the inside, and four protons 
appear on the outside. For a cytochrome c oxidase, 
the full reaction can be written as follows: 

4cytc2+ -b ml + 02r^4cytc3+ -b 4H+, -b 2 H 2 O 

The B- and C-family enzymes pump protons with 
a lower stoichiometry (2F1'^ per O 2 ). 

bd-Family of Oxygen Reductases 

Much less is known about the bd-type oxygen reduc¬ 
tases. These enzymes are only found in prokaryotes 
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and have no analogues in eukaryotic organisms. The 
bd-type enzymes are unrelated to the heme-copper 
oxygen reductases, and are often expressed under con¬ 
ditions of low dissolved O 2 . Most of what is known 
about the bd-type oxygen reductases is from studies of 
the cytochrome bd-I from E. coli. There is no X-ray 
structure of any member of this family. Several fea¬ 
tures of what is known follow: 

1. The enzyme is a 1:1 complex of two subunits, each 
of which has multiple transmembrane helices. 

2. The enzyme contains three hemes and no other redox 
centers: heme b 55 g, heme bsgs, and heme d. The 
amino acid ligands to heme b 55 g and heme b 595 are 
located within subunit 1, and it has been proposed that 
the axial ligand to heme d is also within subunit I. 

3. All of the bd-type oxygen reductases appear to 
utilize a quinol as the reductant. There are no 
known cytochrome c oxidases in the bd-family. 

4. The quinol binding site is near the periplasmic side of 
the membrane, so that oxidation of quinol releases 
protons on the outside of the cell. Heme bssg appears 
to be directly involved in quinol oxidation. 

5. Oxygen binds to the ferrous form of heme d and is 
reduced to water at this site. The oxygen chemistry 
is similar to that described above for the heme- 
copper enzymes. Heme bsgs is within about 10 A 
of heme d, so these two hemes form a binuclear 
active site. 

6 . Cytochrome bd is much less sensitive to inhibition 
by cyanide than the heme-copper oxygen 
reductases. 

7. Cytochrome bd is not a proton pump, but does 
generate a proton motive force because the protons 
used to form H 2 O are delivered from the cytoplasm 
whereas the electrons come from quinol oxidized 
near the periplasmic surface. As a result the net 
reaction can be written 

2 QH 2 -f 4H+ -t- O 2 2Q“ -h 4H+, -h 2 H 2 O 

8 . Cytochrome bd has a very high affinity for O 2 and 
also can play a role in scavenging O 2 to protect 
oxygen-sensitive enzymes. 

9. Some members of the bd-family of oxygen reduc¬ 
tases appear to contain a third heme b in place of 
heme d. These variants, with little or no heme d, are 
also known as “cyanide insensitive oxidases.” They 
have not been well characterized. 
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Synonyms 

BH3 mimetics; Small molecule inhibitors of protein- 
protein interactions 

Definition 

Apoptosis, or programmed cell death, is essential for 
normal development, tissue homeostasis, and defense, 
and its dysregulation is strongly associated with dis¬ 
ease. Insufficient apoptosis can result in cancer or 
autoimmune diseases, whereas high levels of apoptosis 
can lead to degenerative diseases, immunodeficiency, 
and infertility. The Bcl-2 family of proteins are critical 
regulators of apoptosis that are functionally organized 
into three groups. The terminal effectors of the family 
(Bak and Bax) directly mediate mitochondrial outer 
membrane permeabilization which is the irreversible 
commitment to apoptosis. The prosurvival Bcl-2-like 
proteins (Bcl-2, Bc1-Xl, Bcl-w, Mcl-l, and Al) bind to 
and inhibit Bax and Bak. The BH3-only proteins 


(Bad, Bim, Bik, Bid, Hrk, Bmf, Noxa, and Puma) 
competitively displace Bcl-2-like proteins, thereby 
releasing Bax and/or Bak. Interactions between differ¬ 
ent family members are not all equivalent in terms of 
binding affinities. For example, all of the prosurvival 
Bcl-2-like proteins bind Bax but only Bc1-Xl, and 
Mcl-l bind Bad with sufficient activity to restrain its 
proapoptosis activity. Similarly, different BH3-only 
proteins can target different subsets of prosurvival 
Bcl-2-like proteins (Lessene et al. 2008). The 
overexpression of prosurvival Bcl-2-like proteins is 
a common feature of cancer cells, allowing them to 
evade apoptosis and therefore bypass a fundamental 
barrier to malignant transformation and provide resis¬ 
tance to standard chemotherapy. A series of 
approaches, as described below, have thus been used 
to target prosurvival Bcl-2-like proteins. 

Basic Characteristics 

Initial approaches used antisense oligonucleotides to 
knockdown the expression of Bcl-2 (Cotter et al. 
1994), but only modest clinical effects of this type of 
drug have been observed. Rather, the focus moved to 
inhibiting overexpressed prosurvival Bcl-2-like pro¬ 
teins by inhibiting Bcl-2-BH3 interactions, which are 
mediated by a hydrophobic groove on the prosurvival 
Bcl-2-like proteins and the a-helices of BH3-only pro¬ 
teins. A series of peptides and peptide mimetics, based 
on the a-helices of BH3-only proteins, were generated, 
some of which showed promise in animal models (see 
review by Lessene et al. 2008), but this approach has 
become overshadowed by the successful development 
of small molecule BH3-mimetics. Previously, it was 
thought that protein-protein interactions would be 
intractable targets for small molecule drug develop¬ 
ment because of the relatively featureless (when com¬ 
pared with enzyme active sites) characteristics of 
protein interaction interfaces. The discovery and fur¬ 
ther development of biologically active small mole¬ 
cule lead compounds that bind to protein surfaces in 
the Bcl-2 family has been an important milestone for 
the inhibition of protein-protein interactions. 

Numerous BI13 mimetics have been characterized, 
many of which have moved into clinical trial. Of 
these, three drugs (ABT-737/Navitoclax, Obatoclax/ 
GX-15-70, and AT-101/R-(-)-gossypol; Fig. 1) are most 
advanced in their development. 
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Bcl-2 Inhibitors, Fig. 1 Structures of small molecule BH3 mimetics that are showing promise in the clinic 


ABT-737/Navitoclax (Abbott Pharmaceuticals/ 
Genentech) was identified using “SAR by NMR,” an 
NMR-based screening method, which was used to 
screen a chemical library for small molecules that 
bound to the BH3-binding pocket of Bc1-Xl. Two 
molecules that hound to proximal but distinct sites 
were identihed and used to design a composite 
molecule that hound Bc1-Xl with nanomolar affinity. 
Structure-based design and parallel synthesis resulted 
in the development of ABT-737, which binds with 
subnanomolar affinity to Bcl-2, Bc1-Xl, and Bcl-w, 
but not the remaining prosurvival Bcl-2-like proteins 
(Oltersdorf et al. 2005). A second-generation mole¬ 
cule, Navitoclax (ABT-263), has essentially the same 
target specihcity and potency as ABT-737, hut is orally 
bioavailable (Tse et al. 2008). 

Obatoclax (GX-15-70; Gemin X Biotechnologies/ 
Cephalon) was developed by lead optimization after 
a screen of a natural products library identified a series 
of proidiosin (a bipyrrole-containing compound) mol¬ 
ecules that could inhibit Bcl-2 interactions. In vitro, 
obatoclax has low micromolar affinity (0.5-7 pM) for 
Bcl-2, Bc1-Xl, Bcl-w, and Mcl-l (reviewed in Lessene 
et al. 2008). 


AT-101 (Ascenta Therapeutics; R-(—)-gossypol) 
is a polyphenolic compound originally isolated from 
cottonseeds. Although racemic gossypol has been used 
in a range of therapeutic contexts, including cancer 
treatment, it is the (—) enantiomer that is more 
effective as a BH3 mimetic (Kitada et al. 2003), binding 
Bcl-2, Bc1-Xl, and Mcl-l with submicromolar affinity 
(Tang et al. 2008). Lead optimization of gossypol has led 
to a number of variants with higher affinities for their 
Bcl-2-like targets which may prove to be even more 
effective therapeutics (reviewed in Lessene et al. 2008). 

All three classes of drags display antitumor activity, 
but whereas ABT-737/Navitoclax appears to have higher 
specihcity for the prosurvival Bcl-2-like targets (which is 
in line with the highest in vitro affinity for these targets), 
both Obatoclax and AT-101 have signihcant antitumor 
activity that is independent of their BH3-mimetic prop¬ 
erties (Konopleva et al. 2008; van Delft et al. 2006). 
Many clinical trials are ongoing, but indications are that 
all three drags have efficacy in the treatment of various 
leukemias, lymphomas, and lung cancers, particularly in 
combination therapies, rendering cancer cells (including 
many chemoresistant cancers) sensitive to other chemo¬ 
therapies or radiotherapy (reviewed in Khaw et al. 2011). 
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Synonyms 

Beer’s law 


Definition 

Tbe Beer-Lambert law (Hollas 2004; Atkins and 
Friedman 2005; Atkins and de Paula 2006) 


A = 8C£ (1) 

is arguably tbe single most useful spectroscopy equa¬ 
tion. It follows from considering wbat happens to 
a beam of photons propagating along the X direction 
as it passes through a uniform sample of absorbing 
molecules. If light of intensity /(X) enters a slice of 
the sample at position X, then the intensity /(X + S X) 
after traveling an infinitesimal distance SX is 

/(X + SX)=I(X)- 

(number of molecules hit by a photon) x 11 

where 11 is the probability that a photon that hits 
a molecule is absorbed. Now 


number of molecules hit by a photon 


= /(X) 


i^slice X CTjyiolecule 

volume of slice 


SX 


= /(X)cc7^,i,rSX 


(3) 


where NsUce is the number of molecules in the slice of 
the sample of width SX, Cj^ojecuie is the cross-sectional 
area of the molecules facing the incident light beam, 
molar is the Corresponding molar quantity, and c is the 
concentration of the analyte. Thus, 


Beer's Law 


/(X + SX) - /(X) 
SX 


I (2f)c’(7molarn 


( 4 ) 


► Beer-Lambert Law Derivation 


In the limit of small SX, this becomes 
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= -/(X)c(T™larn = -I(X)SC (5) 

where e is the wavelength-dependent extinction coef¬ 
ficient. Upon integrating across a sample of path 
length £ this gives the Beer-Lambert law. 

The Beer-Lambert law is not valid if too few 
photons are emitted from the sample for the 
photomultiplier tube to count them accurately. It also 
breaks down when the sample is inhomogeneous or if 
there are concentration-dependent intermolecular 
interactions that affect the spectroscopy. 


Beta-Lactamases: Computational 
Studies 

Ewa Chudyk and Marc Willem van der Kamp 
Centre for Computational Chemistry, School of 
Chemistry, University of Bristol, Bristol, UK 
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Definition 

A group of bacterial enzymes that catalyzes hydrolysis 
of P-lactam antibiotics, causing antibiotic resistance. 


Cross-References 

► Absorbance Spectroscopy: Overview 

► Absorbance Spectroscopy: Spectral Artifacts and 
Other Sources of Error 

► Absorption Spectroscopy and Transition 
Polarizations: Pictorial Description 

► Absorption Spectroscopy: Practical Aspects 

► Absorption Spectroscopy: Relationship of 
Transition Type to Molecular Structure 

► Concentration Determination Using Beer-Lambert 
Law 
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Behavior Near Critical Points 

► Critical Eluctuations in Lipid Mixtures 


BEST 

► Flow NMR 


Basic Characteristics 
Introduction 

In recent years, P-lactam antibiotics, such as penicillins, 
cephalosporins, or carbapenems, have become less 
effective in the treatment of bacterial infections 
(Drawz and Bonomo 2010). One important bacterial 
antibiotic resistance is through their production of 
P-lactamases, enzymes that are capable of hydrolyzing 
P-lactam antibiotics. One strategy to overcome this 
problem is the discovery of new compounds - potential 
drags - that covalently bind and inhibit those bacterial 
enzymes, letting antibiotics work properly; another is to 
develop antibiotics than are less susceptible to break¬ 
down. Understanding the reaction mechanism of hydro¬ 
lysis at the atomic level could help in this. Based on 
sequence similarity, four main classes of P-lactamases 
have been defined: A, B, C, and D. Based ab intio level 
on their proposed reaction mechanisms, they can be 
divided into two main types: metallo-p-lactamases 
(class B) and serine P-lactamases (classes A, C, and D). 

Serine |}-Lactamases 

The class A P-lactamase TEM-1 is the most common 
P-lactamase in Gram negative bacteria such as E. coli and 
K. pneumoniae. The reaction of benzylpenicillin in TEM- 
1 has been modelled with QM/MM methods. Potential 
energy surfaces for the acylation and deacylation steps of 
benzylpenicillin in TEM-1 were obtained by QM/MM 
optimizations at the AM1/CHARMM22 level, corrected 
by single point B3LYP/6-31 G(d) calculations (Hermann 
et al. 2005, 2006). This confirmed the mechanism 
for deacylation, which was reasonably well established 
experimentally, with Glul66 acting as the base. 
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Beta-Lactamases: Computational Studies, Fig. 1 Reaction mechanism of serine fl-lactamases 


The mechanism for the rate-limiting acylation step, which 
is less well established, was found to involve proton 
abstraction by Glul66 from Ser70 (which subsequently 
attacks the P-lactam ring) via a conserved water molecule 
(Fig. 1). Lys73 was involved in stabilizing the TS of 
acylation and was suggested to act as a proton shuttle in 
the subsequent steps. Other workers have modelled path¬ 
ways for acylation of TEM-1 with penicillic acid, using 
the subtractive ONIOM QM/MM method, with geometry 
optimization at the HF/3-21 G-MM level and energy 
calculation at the MP2/6-31 -i- G(d)-MM level (Meroueh 
et al. 2005). They concluded that both Lys73 and Glul66 
could act as the base, with Lys73 perhaps somewhat 
favored. They suggested that the two alternative mecha¬ 
nisms could be competitive, and that serine P-lactamases 
may use different pathways for different substrates. More 
recently, however, high-level QM/MM potential energy 
profiles for acylation with benzylpenicillin, using 
B3LYP/6-31 + G(d) for QM/MM stractural optimization 
and up to SCS-MP2/aug-cc-pVTZ for QM/MM stractural 
energy calculation, supported the mechanism with 
Glul66 as the base. It was further confirmed that proton 


abstraction from Ser70 is concerted with the nucleophilic 
attack of Ser70 on the P-lactam ring. The highest level of 
theory in this study gave a barrier of 4.1 kcal mol~ * for this 
rate-limiting acylation step, somewhat low compared to 
experiment, but reasonable considering that entropy 
effects and multiple enzyme-substrate conformations 
were not included. 

The deacylation of cephalothin from the class 
C p-lactamase P99 and a penicillin binding protein 
(PBP, which is a target for P-lactam antibiotics) have 
also been studied with QM/MM methods (Gherman 
et al. 2004). B3LYP/6-31 G(d)-MM optimizations were 
performed for the acyl-enzymes and some deacylation 
transition state structures, with various different proton¬ 
ation states of the relevant residues. Free energies of 
activation, estimated using vibrational analysis, indi¬ 
cated that deacylation is efficient in P99 and inefficient 
in PBP, as expected. For P99, the calculations support 
a mechanism in which Tyrl50 deprotonates the active 
site serine, with the proton subsequently transferred to an 
unprotonated Lys67 in a concerted fashion. For PBP, 
the equivalent mechanism was not deemed possible. 
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Beta-Lactamases: Computational Studies, Fig. 2 Reaction mechanism of metallo-P-lactamases with one zinc atom 


The difference between P99 and PBP was found to be 
the more favorable electrostatic environment of the 
active site Tyr in P99. 

Metallo-Beta-Lactamases 

Metallo-p-lactamases while growing are still relatively 
rare, but do form a significant threat to human health due 
to their rapid evolution and spread across species as well 
as the difficulty of finding specific inhibitors. Metallo-P- 
lactamases contain either one or two zinc ions, which are 
coordinated by active site residues. Their role in the 
reaction is likely to stabilize the negatively charged 
carbonyl oxygen of the tetrahedral intermediate anion, 
as well as to lower the pK^ of the coordinated water. 
For dizinc metallo-P-lactamases, it is believed that the 
P-lactam ring hydrolysis starts with a nucleophilic attack 
of a zinc-bound hydroxide on the p-lactam carbonyl, 
followed by the cleavage of the C-N bond. The latter 
could either be assisted by protonation of the nitrogen or 
by stabilization of the second zinc ion when present. The 
identify of any proton donor is still not clear. Ffiotein 
X-ray crystallography studies indicate that the carboxyl- 
ate moiety interacts directly with the zinc ion(s). This 
binding mode was verified by QM/MM studies and sub¬ 
sequently used for the modeling of moxalactam hydroly¬ 
sis in the dizinc B3 metallo-P-lactamases LI (Xu et al. 
2007). A QM/MM potential energy surface was obtained 
at the SCC-DFTB/CHARMM22 level along two reaction 
coordinates, one describing the nucleophilic attack of the 
hydroxide and the C-N bond cleavage and the other the 
proton transfer from the (former) hydroxide to Asp 120. 
This indicated that the reaction is essentially stepwise, 
starting with a rate-limiting nucleophilic attack and C-N 
bond cleavage, followed by (almost spontaneous) 
protonation of Asp 120. A free energy profile was also 
obtained at the SCC-DFTB/CHARMM22 level with 


subsequent energy corrections using single-point density 
functional throry B3LYP/CHARMM22 calculations. 
The barrier obtained was in line with experiment (23.5 
and 18.5 kcal mol” *, respectively). The calculated energy 
profile indicates that a reasonably stable intermediate is 
formed, in which the newly formed carboxylate is stabi¬ 
lized by one Zn'^ and the nitrogen leaving group by the 
other. The hydrolysis of the carbapenem antibiotic 
biapenem by monozinc class B2 metallo-p-lactamases 
CphA has also been studied using QM/MM methods 
(Xu et al. 2006). Based on a crystal structure of an 
enzyme-intermediate complex of CphA with abiapenem, 
SCC-DFTB/MM QM/MM MD simulation indicated that 
the Zn^'^ was coordinated by the substrate carboxylate, 
Cys221, His263, and Aspl20. A crystallographically 
observed water molecule formed stable interactions 
with the P-lactam carbonyl and Asp 120. Reaction model¬ 
ing at the SCC-DFTB/CHARMM22 level (validated by 
single-point B3LYP/MM QM/MM energy calculations, 
and QM calculations on a small model at the B3LYP 
level) indicated a concerted SN2-like mechanism is 
likely, with water attacking the carbonyl carbon, C-N 
bond cleavage and Asp 120 accepting the remaining pro¬ 
ton from the nucleophilic water. These also modelled a 
mechanism with Hisl 18 as the base instead of Asp 120, as 
originally suggested, but found it not to be energetically 
competitive. More recently, the full reaction of biapenem 
with CphA was also modeled at the SCC-DFTB/MM 
level (Wu et al. 2010). This indicated that Aspl20 
donates the proton it obtained from the nucleophilic 
water to the former P-lactam nitrogen. As expected, 
the initial SN2-like nucleophilic addition and elimination 
was found to be rate limiting. Interestingly, this work 
further identified the crystallographically observed 
enzyme-intermediate complex as belonging to a minor 
pathway, in contrast to the original suggestion. Overall, 
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these studies on metallo-P-lactamases demonstrate the 
mutual benefit between protein crystallography and 
QM/MM modelling of their reaction mechanisms. 
On the one hand, high-resolution experimental structural 
data on enzyme complexes is essential to set up correct 
models for QM/MM calculations. On the other hand, 
QM/MM modeling is useful to verify or modify mecha¬ 
nistic proposals based on structural (and kinetic) data 

(Fig. 2). 

Cross-References 

► Molecular Dynamics Simulations of Lipids 

► ONIOM 

► QM/MM Methods 
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Biliverdin IX-Beta Reductase: 
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Synonyms 

Biliverdin reductase; Flavin reductase 

Definition 

An enzyme that catalyzes the pyridine-nucleotide- 
dependent reduction of a variety of biologically impor¬ 
tant molecules, such as flavins and biliverdins. 

Basic Characteristics 

Biliverdin reductase is a heme oxygenase that can be 
found in the liver, where it facilitates the conversion of 
biliverdin to bilirubin. Bilirubin is important for mam¬ 
mals, as it acts as a physiologically significant antiox¬ 
idant. Two types of mammalian biliverdin reductases 
exist: biliverdin-IXa reductase and biliverdin-IXp 
reductase. They catalyze the corresponding reduction 
of biliverdin-IXa or -IXP to bilirubin-IXa or -IXp, 
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using NADPH as proton source. The conversion 
involves the reduction of a double bond between the 
second and third pyrrole ring. Biliverdin-IXa reduc¬ 
tase is a potential pharmacological target (e.g., for 
treatment of jaundice in newborns) but no atomic- 
detail enzyme-substrate structure is currently avail¬ 
able. For biliverdin-IXP reductase, structures from 
X-ray crystallography are available with various fla¬ 
vins and Verdins as well as NADP bound (Pereira et al. 
2001). Using this structural information. Smith et al. 
applied ► QM/MM methods to investigate the reaction 
mechanism in this enzyme (Smith et al. 2008). Initially, 
potential energy surfaces were obtained at the semiem- 
pirical AM1/CHARMM22 and PM3/CHARMM22 
levels to investigate the possibility of two alternative 
mechanisms; proton donation by an active-site histidine 
or from bulk solvent. The latter was modeled by treating 
a nearby water as an hydroxonium ion (H 30 '^). The 
potential energy surface with hydroxonium as proton 
donor was corrected by single point B3LYP/6-311-i-G 
(d) calculations and indicated a feasible mechanism. In 
combination with experimental mutagenesis studies, it 
was proposed that the enzyme-catalyzed reaction 
involves an initial protonation of biliverdin to form an 
intermediate species, which is followed by subsequent 
hydride transfer to form bilirubin. The histidine probably 
plays an important role in stabilizing species during the 
reaction, although it is apparently not directly involved 
or required for catalytic activity. 

Cross-References 

► Macromolecular Crystallography: Overview 

► QM/MM Methods 
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Bioanalytical Sensors 

► Amperometric Biosensors 
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Potentials and Propagation of Voltage 
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Synonyms 

Action potentials; Electrical signaling; Flow of 
electrical charges in biological circuits; Membrane 
ionic and capacitative currents; Membrane potential; 
Receptor potentials; Signal conduction; Synaptic 
potentials; Synaptic transmission; Transmembrane 
voltage; Voltage signals 

Definitions 

Bioelectricity is the flow of electrical currents, carried by 
mobile charged ions, across cell membranes and along 
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the exterior and interior ionic environment of cells. The 
membrane potential results from a separation of charge 
(transmembrane voltage) owing to biological mem¬ 
branes limiting diffusion of ions in a controlled manner 
that is largely regulated by ion channels and transporters. 
Transient changes in membrane potential (e.g., receptor 
potentials, synaptic potentials and action potentials) are 
electrical signaling events produced by temporary 
changes in current flow across the cell membrane 
and conducted by currents flowing where differences in 
electrical potential arise. 


Introduction 

Many cellular functions are mediated by electrical 
signaling events, generated by pore-forming mem¬ 
brane proteins known as ion channels. The involve¬ 
ment of channels in signaling by nerve and muscle will 
be summarized in this chapter. The realization that ion 
channels are involved in many cellular processes, 
including the secretion of hormones, the transfer of 
substances across epithelia, and the regulation of cell 
volume, has come with the development of novel bio¬ 
physical methods. These approaches will be described 
in subsequent chapters along with chapters that illus¬ 
trate the extraordinary diversity of ion channel struc¬ 
ture and function. 

Because ions are charged they cannot penetrate the 
lipid bilayer and ion channels are required to facilitate 
diffusion of physiologically important inorganic ions 
such as Na-", Ca^^, Mg^^, R-", CF, and HCOs". 

For all of these ions, concentration gradients exist 
across the cell membranes. Thus, Na"^ is the principal 
cation of the extracellular fluid, and Cl~ and HC 03 “ 
are the dominant anions, while and organic anions 
predominate in cytoplasm. 


close to that predicted by the Nemst equation for 
around —0.1 V (inside, relative to outside the cell). This 
equilibrium potential for {Ej^) is given by: 


R-r 

[K+l ' 


Ek — —. In 
zF 


where R is the gas constant (8.314 J.K“\mol“^), T is 
temperature (293 K at room temperature), F is 
Faraday’s constant (96,500 coulomb.moF*), so that 
^is 25 mV at 20° C. z is the valency of and [A''^]o 
and [A''^]i are its extra- and intracellular concentrations 
(Hille 2001). At this equilibrium potential, the concen¬ 
tration or chemical gradient on K"^, favoring efflux of 
from the cell, is balanced exactly by the electrical 
gradient, favoring influx. Thus, at the equilibrium 
potential, the electrochemical gradient across the cell 
membrane is zero and there is no net flux of K"^. 
Moving the membrane potential in a positive or 
depolarizing direction, away from Ej^, favors efflux 
of K"^. An appropriate form of the Nemst equation 
can be used to predict the equilibrium potential for 
any physiologically important ion. 

For the membrane potential to be exactly equal to 
Ek, the membrane must be perfectly selective for K"^. 
Such perfection is rarely achieved in nature, so the 
resting membrane potential approximates Ek but does 
not quite reach it. In skeletal muscle, for example, 
there is also a high permeability to Cl~. Goldman- 
Hodgkin-Katz (GHK) equations are used to describe 
membrane potentials in terms of the concentrations of 
different ionic species and their permeability 
coefficients (Aidley 1998). 


Action Potentials Are Generated by the 
Opening and Closing (Gating) of Voltage- 
Gated Ion Channels 


Membrane Potentials Result from Selective 
Permeability of the Membrane to Ions 

Ionic gradients generate voltage differences across cell 
membranes, and the sign and amplitude of the voltage 
depends on the ion(s) to which the membrane is perme¬ 
able. This permeability depends in turn on the type of 
channels that are open and their selectivity for specific 
ions. In most cells, opening of potassium channels dom¬ 
inates at rest, and the resting membrane potential comes 


Ionic gradients are used to energize processes including 
the making of ► action potentials in excitable cells. The 
existence of a resting potential (inside negative with 
respect to outside) means that Na"^ ions, with their higher 
external concentration, are far from equilibrium. If Na"^ 
channels open, Na"^ ions move in, releasing the energy 
stored in their electrochemical gradient. In generating an 
action potential, only a few pmol of Na"^ need enter 
across each cm^ of membrane to move the membrane 
potential rapidly toward the equilibrium potential forNa"^. 
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Na"^ channels are gated open by changes in the membrane 
voltage field in the positive, depolarizing direction. 
Channels that are open and closed by changes in mem¬ 
brane potential are known as voltage-gated ion channels 
to distinguish them from ligand-gated ion channels, 
gated by the binding of substances such as neurotrans¬ 
mitters (Aidley and Stanfield 1996). 

Since Na’^-channel opening lets Na"^ enter, and since 
such entry is depolarizing, the process becomes self¬ 
reinforcing so that action potentials are all-or-none in 
nature. Depolarization stops as is approached and 
Na"^ can no longer move in. To permit repolarization, 
Na"^ channels must also shut again, a type of gating 
known as inactivation to distinguish it from the ► acti¬ 
vation that opens channels upon depolarization. Though 
these two classes of gating are not fully independent of 
each other, each involves a different element of molec¬ 
ular machinery. Repolarization is usually aided by the 
opening of ► voltage-gated channels (which acceler¬ 
ate the return to the resting potential) as well as by 
background permeability to and Cl~ (Hille 2001). 

A corollary of the all-or-none nature of action 
potentials Is that a threshold needs to be exceeded for 
the action potential to develop. At threshold, the 
inward movement of Na"^, which depolarizes, just 
equals the outward movement of K"^, which repolarizes 
the membrane. In addition, once an action potential is 
set up, the membrane becomes refractory to further 
stimulus. This refractoriness is initially absolute; 
and becomes relative only upon return to rest, when 
Na"^ channels begin to recover from inactivation. 
The ► relative refractory period is also influenced by 
► voltage- and calcium-activated channels, whose 
opening opposes further depolarizing stimuli. 


Propagation depends on the region of the nerve 
fiber that is momentarily active raising immediately 
adjacent regions to threshold. In the local circuit 
theory, voltage differences allow currents - carried in 
a nonspecific fashion by ions in the cytoplasm and 
extracellular fluid - to flow between active and resting 
regions, the active region being depolarized (Jack et al. 
1975; Koester and Siegelbaum 2000). This process can 
be explained by likening axons to insulated electric 
cables submerged in extracellular fluid. The cytoplasm 
is the conducting core of the cable with electrical 
resistance r,, (units Q.crn”*); it is insulated by 
membrane with resistance r„, (Q.cm) and electrical 
capacity Cm (F.cm“^). The conduction velocity is deter¬ 
mined principally by the resistance of the cytoplasm 
(r,) and the electrical capacity of the membrane (c^), 
owing to the need to change the charge across the lipid 
bilayer. Since r, gets smaller with the increasing vol¬ 
ume of larger nerve fibers, while c„, gets larger with 
increasing surface area, conduction velocity increases 
with -Jdiameter. This square root relationship means 
that even large increments in axon diameter result 
in only modest increases in conduction velocity. 
In myelinated vertebrate nerve fibers, conduction 
velocity increases directly with diameter and faster 
conduction is possible. An insulating myelin sheath, 
periodically interrupted at nodes of Ranvier, lowers 
membrane capacitance. The region between nodes - 
the intemode - has a length (/) equal to lOO.D, where 
D is fiber diameter (Jack et al. 1975; Koester and 
Siegelbaum 2000). 

Biophysical Techniques Have Been 
Instrumental in Ion Channel Research 
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Action Potential Conduction Velocity 
Depends on the Passive Electrical Properties 
of Cells 

In nerve fibers, messages are encoded by action poten¬ 
tial frequency (rather than amplitude) and this coding 
is highly dependent on the refractory properties of the 
membrane. Such signaling requires action potentials to 
propagate often long distances, along nerve fibers from 
one part of the nervous system to another. Propagation 
is very quick (up to 120 m.s“* in the fastest nerve fibers 
at body temperature) and is quicker in bigger nerve 
fibers than in small ones. 


Molecular biology has shown us that even in the produc¬ 
tion of action potentials many channel isoforms are 
involved. Humans have nine genes encoding different 
voltage-gated Na"^ channels, for example, each expressed 
protein doing much the same job in different cell types. 
And there are very many more isoforms of ► channels, 
regulated by a variety of different stimuli, with varied 
functional roles. 

The astonishing diversity of ion channels in humans 
and other organisms is a discovery that was unexpected 
from the early days of electrophysiology. Because ion 
channels are highly selective among ions, they are first 
classified in terms of the ions for which they select: 
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thus Na"^-, K"^-, Cl“-channels and so on. This high 
degree of selectivity is found in spite of the very high 
rates - close to the limits of diffusion - at which 
channels facilitate ionic transfer. Other channels may 
select for monovalent cations (e.g., ► HCN, CNG 
channels). The further classification into channels 
that are gated either by voltage and or by the binding 
of a ligand (classically a neurotransmitter) 
encompasses two major groups of channel families in 
molecular terms: the voltage-gated superfamily, which 
includes Na"^ and channels, and the families of 
Cys-loop receptors, of glutamate receptors and of 
purinergic channels. However, the classification 
fails to describe fully the great diversity seen and has 
been extended to include consideration of other 
structures and dependence on physiological stimuli 
other than voltage or the binding of a neurotransmitter 
(see website and database of the International Union of 
Basic and Clinical Pharmacology Committee on 
Receptor Nomenclature and Drug Classification 
Nomenclature; http://www.iuphar-db.org). 

The story of ion channels began in the 1940s and 
1950s. Then, channels were the province of electro¬ 
physiologists, with their exceptionally elegant (if to 
some arcane) methods. Two things happened around 
1980 to generate the explosion of discovery. One was 
the development of ► patch clamp and particularly of 
whole cell recording, which allowed the discovery 
and study of channels in all cells. The other was the 
application of the methods of molecular biology, 
which have now fully elucidated the great diversity 
of channels. In the present day, structures are being 
elucidated. 

In the first part of this section, chapters will show 
how diverse channels are. In the second, the elegance 
of biophysical methods will be demonstrated, as 
will the work that integrates what is known of molec¬ 
ular species into an understanding of nervous 
action and the spread of the heart beat. The final chap¬ 
ters use ► channels as exemplars to show the 
knowledge derived by biophysicists studying channels 
at work. 

Cross-References 

► Alignment of Protein Sequences 

► Analysis of Macroscopic Currents 


► Automated Patch Clamp: Advantages and 
Limitations 

► Calcium Activation of Channels: RCK 
Domains 

► CLC Channels and Transporters 

► Computational Modeling of Neuronal Networks 

► Cut-Open Oocyte Voltage-Clamp Technique 

► Dynamic Clamp: Synthetic Conductances and Their 
Influence on Membrane Potential 

► ENAC, Degenerins, ASICs, and Related Channels 

► Gap Junction Proteins (Connexins, Pannexins, and 
Innexins) 

► Gating of Potassium Channels by Cyclic 
Nucleotide Binding 

► HCN Channels: Biophysics and Functional 
Relevance 

► Ion Channel Regulation by G-Protein-Coupled 
Receptors: Recent Advances with Optical 
Biosensors 

► Katp and the Sulfonylurea Receptor 

► Mechanosensitivity of Ion Channels 

► Modeling the Heart 

► N-type Inactivation in Potassium Channels 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis 

► Potassium Channels (Kv, Kir KCa and K(2P)) 
Channels 

► Voltage-Gated Potassium Channels, Kv Channels 

► Voltage-Gated Sodium and Calcium Channels 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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Biofabrication 

Gregory F. Payne*, WilliamE. Bentley*, Wei Sun^ and 
Gabor Forgacs^ 

*Fischell Department of Bioengineering, Institute for 
Bioscience and Biotechnology Research, University of 
Maryland, College Park, MD, USA 
^Department of Mechanical Engineering and 
Mechanics, Drexel University, Philadelphia, PA, USA 
^Department of Physics, Biology and Biomedical 
Engineering, University of Missouri, Columbia, 

MO, USA 

Synonyms 

Directed-assembly; Interface engineering; Tissue 
engineering 

Definition 

Biofabrication is an emerging technology that employs 
a broad range of physical, chemical, biological, and/or 
engineering methods to assemble biologically active or 
biologically derived components to generate models, 
systems, devices, interfaces, and products useful for 
a wide range of applications. 

Basic Characteristics 

Biofabrication is a developing discipline that combines 
advances from two major areas that individually expe¬ 
rienced incredible progress over the last half century. 
On one hand, advances in the physical, chemical, and 
engineering sciences led to novel fabrication technol¬ 
ogies that yielded systems such as integrated circuits 
and high-speed color printers. On the other hand, 
advances in the various branches of the life and 
medical sciences led to a better understanding of 
fundamental biological processes and an increasing 
ability to intercede in these processes. In some cases, 
advances in fabrication capabilities enabled advances 
in biology. Eor instance, novel fabrication tools such as 
nucleic acid microarrays allowed for high-throughput 
genomic analysis. In some cases, the use of biological 


principles extended the capabilities of fabricated 
systems. As an example, enzymes confer chemical 
selectivity to home-use blood glucose analyzers. 
Biofabrication aims to enhance and expand the con¬ 
nection between engineering fabrication and biology 
to address a broad range of technological opportuni¬ 
ties/needs (Sun). 

One major focus for biofabrication is to provide a 
production platform for regenerative medicine (Jakab 
et al. 2011). The supply of donated organs for trans¬ 
plantation is currently insufficient to meet demand, and 
this unmet need has provided considerable motivation 
for engineering approaches to create replacements for 
diseased, damaged, or missing organs. Conventional 
thinking in the field of tissue engineering envisioned an 
effective integration of three components: cells that are 
appropriately differentiated or can undergo the neces¬ 
sary differentiation programs, materials that serve as 
scaffolds for cells to attach and develop, and signaling 
cues and morphogens that guide the cell’s develop¬ 
mental program. Biofabrication approaches for regen¬ 
erative medicine often aim to produce tissue-like 
mimics without using scaffolds; rather the inherent 
self-organizing and regenerative capabilities of the 
cells are enlisted to create the three-dimensional 
structure of the multicellular construct. A common 
approach to assemble cells into such tissue-like mimics 
has been to use additive technologies such as 
bioprinting and rapid prototyping that add cells one 
layer at a time into a predefined shape. This approach, 
sometimes called organ printing, can be readily cus¬ 
tomized and automated using computerized robotics. 
An alternative scaffold-free approach that is emerging 
is to mold a nonadhesive material such as agarose, add 
multicellular units to the pre-shaped mold, and incu¬ 
bate the multicellular units to self-assemble into geo¬ 
metric shapes that are constrained by the mold. 

Another major focus for biofabrication is to integrate 
biological components with electronics to exploit com¬ 
plementary capabilities for several broad applications 
(Liu et al. 2010). One set of applications includes 
biosensing, which aims to couple the molecular recog¬ 
nition capabilities of biological systems with the data 
processing capabilities of electronics to enable rapid, 
sensitive, and selective analysis. In some cases, these 
capabilities are extended to high-throughput, high-con- 
tent, and point-of-use analysis. In addition, emerging 
applications are envisioned in which microbial systems 
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are interfaced with electrodes to access the metabolic 
versatility of biological systems to generate electricity 
(e.g., microbial fuel cells) or chemicals (e.g., bioelectro¬ 
chemistry or electrosynthesis). In these applications, the 
goal of biofabrication is to build the bio-electrode inter¬ 
face using biological materials (especially proteins and 
polysaccharides) and biological mechanisms (e.g., self- 
assembly and enzymatic catalysis). A key enabling dis¬ 
covery was the observation that some stimuli-responsive 
polysaccharides such as chitosan and alginate can be 
triggered to self-assemble at an electrode surface in 
response to imposed electrical stimuli. Electrodeposition 
provides a means to assemble biological components at 
specific electrode addresses on chips and within fully 
packaged microfluidic and lab-on-a-chip platforms. 

One of the challenges for biofabrication is bridging 
the fundamental differences between traditional engi¬ 
neering fabrication methods with biological assembly. 
Traditional engineering fabrication methods (e.g., pho¬ 
tolithography and printing) are top-down approaches in 
the sense that external information is imposed to gener¬ 
ate patterns and structures that are often intended to be 
permanent. In contrast, biological structures emerge 
from the bottom-up by the self-organization of labile 
components (e.g., cells) over a hierarchy of length scales 
to create structures (e.g., tissue) that often change with 
time (e.g., to differentiate, adapt, or heal). Thus, efforts to 
adapt existing or develop new fabrication methods for 
biology must accommodate the characteristics of the 
biological components, preserve biological activity in 
the fabricated construct (e.g., of an enzyme biosensor), 
and provide conditions that allow for the expression of 
the inherent dynamical nature of biological systems (e.g., 
the differentiation of stem cells). 

To date, the primary focus of biofabrication has 
been to enlist fabrication technologies to advance the 
study of biology and the practice of medicine. 

Cross-References 

► Polysaccharide Biomaterials 
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Biofluids 

► Infrared Spectroscopy of Cells, Tissues, and 
Biofluids 


Biological Half Life 

► Glycosylation in Metabolism 


Biological Magnetic Resonance Data 
Bank 

► BMRB 


Biologically Active Amyloids 

► Functional Amyloid 


BioMagResBank 

► BMRB 


Biomolecular Modeling 

► Homology Modeling of Protein Structures 

► Protein Folding: Molecular Dynamics Simulations 


Bionanowire Sensors 

► Nanowire Biosensors 
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Biosensors 

► Amperometric Biosensors 

► Solid-Phase Detection Techniques 


Biosynthetic Labeling 

► Assignment of Resonances in Protein Solution 
State NMR Studies 

► Protein NMR Resonances Assignment 

► Stereo-Array Isotope Labeling (SAIL) Method 


Blue Copper Proteins 

► Cupredoxins 


BMP Receptors 

► Dynamics of BMP Receptor Signaling 


BMRB 

Eldon L. Ulrich and John L. Markley 
Department of Biochemistry, University of 
Wisconsin-Madison, Madison, Wl, USA 


Synonyms 

Biological magnetic resonance data bank; 
BioMagResBank 


Definition 

The Biological Magnetic Resonance Data Bank 
(BMRB: http://www.bmrb.wisc.edu) is the major, 
internationally recognized repository for quantitative 
data on biological systems derived from nuclear mag¬ 
netic resonance (NMR) spectroscopy. BMRB offers the 


research and educational communities free access to 
information on the structure, dynamics, chemistry, and 
interactions of biological molecules. The metabolomics 
resource at BMRB contains information used in identi¬ 
fying and quantifying metabolites. BMRB (University 
of Wisconsin-Madison) is a member of the Worldwide 
Protein Data Bank (wwPDB) along with the RSCB-PDB 
(Rutgers University), PDBe (European Bioinformatics 
Institute), and PDBj (Osaka University). 

BMRB’s archive for biological macromolecules 
(peptides, proteins, RNA, and DNA) includes millions 
of assigned *H, and *^N chemical shifts plus smaller 
collections of coupling constants and relaxation param¬ 
eters. The repository accepts a wide range of NMR 
results: raw (time-domain) data, NMR spectral parame¬ 
ters (peak lists, assigned chemical shifts, and coupling 
constants, derived relaxation parameters, Ri, Ruho, Ri, 
cross-relaxation rates, NOEs, and order parameters), and 
other NMR-derived results (H-exchange rates, chemical 
rate constants, binding constants, p/^a values). The 
BMRB NMR Restraints Grid is constmcted from more 
than 6,000 restraints entries associated with stractures 
deposited in the Protein Data Bank (PDB). These data 
sets can be retrieved in their original format or in NMR- 
STAR, CCPN XML, X-PLOR/CNS, and CYANA for¬ 
mats. Time-domain data sets, representing over 700 
individual NMR experiments, are available for more 
than a hundred protein and RNA entries. In many 
cases, these hies include pulse programs and acquisition 
and processing parameters. BMRB has created standard 
hies for macromolecules that combine structures and 
restraints from the wwPDB archive with chemical shift 
and other experimental data from the BMRB archive. 
These hies contain data derived from the structures, such 
as, phi/psi angles, secondary structure, back-calculated 
chemical shifts, and BLAST sequence alignments. The 
hies have links to associated publications containing 
author interpretation of the data, relevant information 
in external databases, and metadata needed to understand 
relationships between the NMR results and the biologi¬ 
cal systems studied. 

The BMRB contains a small, but growing, archive 
of NMR-derived structures of small, biologically 
interesting molecules that are ineligible for wwPDB 
deposition. Users deposit these data through the Small 
Molecule Structure Deposition (SMSDep) system at 
PDBj-BMRB. 

The metaholomics archive at BMRB contains both 
raw (time-domain) and processed (spectra) data from 
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hundreds of compounds. Data from seven or more ID 
('H, and 2D ('H-'H and NMR 

experiments are available for most of the compounds. 
Each compound is represented by a chemical structure, 
assigned chemical shifts, spectral peak lists, and links 
to other useful information. 

Software services available from BMRB include 
data validation tools, a sophisticated data visualization 
system, and a CS-Rosetta structure calculation server. 

BMRB mirror sites are located in Florence, Italy 
(http://bmrb.postgenomicnmr.net/) and Osaka, Japan 
(http://bmrb.protein.osaka-u.ac.jp/). PDBj-BMRB 
is a full data deposition and data annotation site. 
NMR data deposited at the PDBe (EBI) (http://www. 
ebi.ac.uk/) are transferred to the BMRB for inclusion 
in the archive. 

We gratefully acknowledge fruitful interactions 
with our partners in the wwPDB and with members 
of the NMRFAM, CCPN, and Condor teams. BMRB is 
supported by NIEI grant LM05799 from the US 
National Library of Medicine. 


Cross-References 

► Chemical Shift Index 

► CNS (Crystallography and NMR System) 

► Cross-Correlated Relaxation 
► CYANA 

► El/D Exchange 

► J Coupling 

► Kinetic Isotope Effects 

► Kinetics: Relaxation Methods 

► Metabolomics 

► Molecular Dynamics Simulations of Lipids 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR in Drug Discovery - Introduction 

► NMR Methods for Kinetic Analysis 

► NMR-based Structural Proteomics 

► Nucleic Acid NMR - Introduction 

► Protein Folding 

► Protein NMR - Introduction 

► Relaxation Dispersion 

► Residual Dipolar Coupling 

► Solid-State NMR 

► Stereo-Array Isotope Labeling (SAIL) Method 

► Structural Genomics 


Brewster Angle Microscopy and 
Imaging Ellipsometry 

Bernard Desbat and Sabine Castano 
The Laboratory of Chemistry & Biology of 
Membranes & Nanoobjects, Universite de Bordeaux, 
CNRS UMR5248, IPB, Pessac, France 

Synonyms 

Optical analysis of surfaces, interfaces, and thin films 
under microscopic observation 

Definition 

Brewster Angle Microscopy (BAM) is a method, 
which allows to study, through a microscope, the thick¬ 
ness and topography of thin films on a dielectric 
substrate by recording the parallel polarized reflec¬ 
tance of the sample at the Brewster angle of the 
substrate. 

Imaging ellipsometry is a method, which allows to 
study, through a microscope, the thickness, refractive 
index, and topography of thin films on any substrate by 
analyzing the phase difference and the intensity ratio 
of the parallel and perpendicular polarized reflectance 
of the sample. 

Principle 

Brewster Angle Microscopy (BAM) 

BAM appeared at the beginning of the 1990s (Henon 
and Meunier 1991; Honig and Mdbius 1991) and was 
principally developed to study Langmuir hlms. This 
method takes advantage of the reflectance property of 
the p-polarized light (parallel to the incidence plane) 
on a dielectric surface. Indeed, in this case there is one 
angle 0b (Brewster angle), depending only on the 
refraction index of the substrate, for which the reflec¬ 
tion intensity is null. In the case of water (n = 1.333), 
the Brewster angle is 53.13° at a wavelength 
of 532 nm. 

The spreading of a thin molecular him at the water 
surface with a refractive index different from that of 
the water leads to a small rehection intensity which is 
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proportional to the square of the thickness of the thin 
film (Expression 1). 



- Rpi p-polarized reflectance at the X wavelength 

- d: thin film thickness 

- nil thin film refractive index 

- 112 : dielectric substrate refractive index 

The setup of the BAM technique shown in Fig. 1, 
consists of a goniometer with two arms. One arm 
supports the p-polarized laser and the laser beam is 
reflected on the sample (water most of the time). The 
reflected beam is focused with a microscopic lens 
(xlO) on a focal plane of a CCD camera, supported 
by the second arm of the goniometer. With an x 10 
magnification lens, the field of view is around 
500 X 500 pm, the lateral resolution close to 2 pm, 
and the sensitivity to the thickness of the film is 
a few A. 

Due to the lateral resolution, it is possible to 
analyze phase transitions, topography, and domain 
shapes of the Langmuir films versus their surface 
pressure. Indeed, the light is reflected only where 
the molecules are present and the intensity is pro¬ 
portional to the square of the thin film thickness. 
Fig. 2a, b. In addition, the gray levels of the camera 
can be calibrated, so it is possible to evaluate 
the absolute p-polarized reflectance, and compari¬ 
son with a simple optical model using Expression 1 
gives a good estimate of the thickness of the 
monolayer. This determination is only approximate 
because it is necessary to introduce, in Expression 1, 
the refractive index of the film, which is usually not 
precisely known. 

The different steps of a BAM experiment can be 
described as follows: 

1. Adjust the incidence plane perpendicular to the 
water subphase 

2. Focus the camera on the water surface. 

3. Calibrate the gray levels of the camera at the 
various acquisition times of the camera. 

4. Spread the sample on water and record the pictures 
and gray levels at various surface pressures (Fig. 2). 

5. Correct the pictures from the tilt angle observation 
(Fig. 3). 


Brewster Angle Microscopy and Imaging Ellipsometry, 
Fig. 1 Optical setup of BAM 

6. Evaluate the absolute reflectance from the gray 
level and the calibration curve. 

7. Evaluate the thickness of the layer (or domains) 
from the absolute reflectance. 

Example: Cholesterol layer on water surface 
From the gray levels of Fig. 2a, b and using a 
refractive index of 1,49 a thickness of 18 A ± 2 A 
has been determined for a monolayer of cholesterol, 
and 54 A ±3 A for the high-intensity domains 
observed at 45 mN/m (Mascetti et al. 2008). Such 
results demonstrate that the cholesterol undergoes 
a phase transition from monolayer to trilayer at the 
collapse pressure of 40 mN/m. 

Due to the observation of the surface with an inci¬ 
dence angle, the recorded pictures are compressed in 
the direction of observation so the shapes of the 
domains are distorted. To find a normal representation 
of the surface, the images should be multiplied along 
the direction of propagation by a factor l/cos6b, which 
in the case of water is equal to 1.666. The pictures 
before and after correction are presented in Fig. 3a, b, 
respectively. 

Imaging Ellipsometry 

The BAM method is a relatively simple method but 
limited to dielectric substrates and usually to water 
interfaces. Moreover, the evaluation of the thickness 
assumes that the laser power is stable during the exper¬ 
iment (typically a few hours) including the calibration 
step, and this condition may not be perfectly fulfilled. 
Macroscopic ellipsometry does not have these limita¬ 
tions, but to get information at the microscopic level 
the idea is to couple an ellipsometer with a microscopic 
observation like on the BAM system. The setup of the 
Imaging Ellipsometer requires simply a modification 
of the Brewster angle setup, by adding a compensating 
plate after the sample. Fig. 4. 
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Brewster Angle Microscopy 
and Imaging Eliipsometry, 
Fig. 2 BAM images of 
(a) Cholesterol on water, 
surface pressure 
Jt = 30 mN/m, (b) Cholesterol 
on water, surface pressure 
K = 45 mN/m 



Brewster Angle Microscopy 
and Imaging Eliipsometry, 
Fig. 3 Cholesterol layer on 
water, surface pressure 
K = 45mN/m (a) uncorrected 
(b) corrected 



Eliipsometry is a method which uses the change in 
the polarization state of light after reflection to deduce 
the optical properties and thickness of layered samples 
(Azzam and Bashara 1987). Reflection by a plane sur¬ 
face involves the amplitude coefficients rp and r^ 
(Expression 2), which characterize the new polariza¬ 
tion state of the light. 



fp 



rJe 


i3s 


Brewster Angie Microscopy and Imaging Ellipsometry, 
(2) Fig. 4 Optical setup of imaging ellipsometer 


Ep‘, Ep^ Electric held of the incident and reflected 
p-polarization wave 

Es', E/, Electric field of the incident and reflected 
s-polarization wave 


rp and r^ are the amplitudes of the reflection coefficient 
of p- and s-polarization, respectively. 

5p and 5s are the dephasings between the reflected and 
incidence waves of p- and s-polarization, 
respectively. 
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Thickness (nm) 


Thickness (nm) 


Brewster Angle Microscopy and Imaging Ellipsometry, Fig. 5 Variations of ellipsometer angles with thickness of thin film on 
water subphase: (a) 8A = Aq-A (b) 'P 


The ellipsometer is able to give the ratio p of these 
two coefficients; 

p = ^ = tanv|/ 

Ts 

tan\|/ =|rp/rs| and A = (5p — (5s 

'T and A are the two parameters measured by an 
ellipsometer. Indeed, with the setup represented in 
Fig. 4 if the quarter wave plate is fixed at 45°, P and 
A the azimuthal angles of the polarizer and analyzer, 
respectively, which give the null condition, the and 
A parameters can be directly calculated by the 
expression: 

^\,= |A|,A = 90°-2P 

The interpretation of the ellipsometer angles 
requires the comparison with an optical model using 
the optical index and thickness of each layer. The 
model can be very complex when the sample contains 
multiple layers, absorbing or not. However, the model 
can be much more simple when the sample consists 
only in a nonabsorbing isotropic layer on a substrate. 
The variations of the A and 'P angles in this condition 
are presented in Fig. 5 with a refractive index of 1.333 
for the substrate. 

The variations of A, Fig. 5a, are presented as the 
difference with the Aq of the substrate, 5A = Aq-A. 
The curves show that the variation of the 'P angle is 


very weak for ultrathin films, but fortunately the vari¬ 
ations of delta are much more significant. The delta 
curves are quasi linear with the thickness of the 
ultrathin film. Their slopes are very sensitive to the 
refractive index of the films when the substrate has 
a close refractive index. This difficulty is almost 
completely eliminated on silicon, germanium, or 
metallic substrates, which have refractive indices 
very different from those of organic films. 

The adsorption of p-cyclodextrin in solution under 
a cholesterol monolayer confirms the possibility of 
imaging ellipsometry. The cholesterol monolayer is 
homogeneous. Fig. 6a, and its 5A value of 6.6° 
suggests an average thickness of 1.8 nm. When the 
P-cyclodextrin was injected in the subphase many 
domains appear, with various thicknesses. The large 
domains observed in Fig. 6b present a 5A of 29.5° 
characteristic of a layer thickness of 8 nm. This behav¬ 
ior indicates the formation at the interface of columnar 
multimers of P-cyclodextrin induced by the molecules 
of cholesterol in the monolayer (Mascetti et al. 2008). 
The thicknesses and the size of the domains are vari¬ 
able depending on the number of P-cyclodextrins in the 
multimers and can be easily determined by imaging 
ellipsometry. 

Moreover, imaging ellipsometry is much more 
powerful than the BAM method, because with an 
adapted cell, this method has the ability to analyze 
liquid/solid interfaces with a lateral resolution of 
1 pm. Thus, it is possible to follow the modification 
of solid surfaces under liquids in the presence of 
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Broadband Sum Frequency Generation (BB-SFG) 


Brewster Angle Microscopy 
and Imaging Eliipsometry, 
Fig. 6 Ellipsometer images 
of (a) monolayer of 
cholesterol on water surface 
(b) monolayer of cholesterol 
on aqueous solution of 
P-cyclodextrin 



lipids, proteins, DNA, or under the influence of 
various parameters like electric field, laser excitation, 
pH, etc. 

Cross-References 

► Atomic Force Microscopy (AFM) for Topography 
and Recognition Imaging at Single Molecule Level 

► Fatty Acids, Alkanols, and Diacylglycerols 

► Lipid Domains 

► Lipid Organization, Aggregation, and 
Self-assembly 
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► Supported Lipid Bilayers 
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Introduction 

Some channels can be activated by a rise in cyto¬ 
solic Ca^"^ and, in two cases (the channels from the 
archeon Methanohacterium thermoautotrophicum 
(MthK) and the large conductance voltage- and 
Ca^'^-activated (BK) channel), the Ca^'^-binding 
sites are contained within the regulatory domains for 
conductance (RCK). The MthK channel is a tetra- 
mer formed by subunits containing two transmem¬ 
brane domains and a C-terminal domain containing a 
single RCK domain (Jiang et al. 2002). The functional 
MthK channel, however, contains eight RCK domains, 
and the green domains in Fig. la come from the cyto¬ 
plasmic milieu. 

The structure of the RCK domain of a six trans¬ 
membrane domain K"^ channel from E. coli (solved 
at 2.4 A resolution) has a Rossmann-fold topology, 
which is a very common structural motif of 


enzymes and ligand-binding proteins. Rossmann-fold 
secondary structures are organized into two linked 
P-a-p-a-P units (see Fig. lb) and were hrst identihed 
in a number of NAD’^-dependent dehydrogenases. This 
is the type of structure present in the MthK and in the 
Drosophila, mouse, and human Slol channel (com¬ 
monly known as BK; see ► Potassium Channels: 
Their Physiological and Molecular Diversity) 
(Fig. lb). The cloning of the BK channel from Dro¬ 
sophila showed that it is a member of the S4 super¬ 
family encompassing voltage-dependent K"^ (Kv), 
Na"^, and Ca^"^ channels. The gene coding for BK was 
called Slowpoke or Slo (later renamed Slol after the 
cloning and expression of Slol and Slo3 (Salkoff et al. 
2006)). In the case of Kv channels, the channel¬ 
forming protein possesses six transmembrane domains 
(S1-S6) containing the pore-forming domain S5-P-S6, 
and an S4 voltage-sensing element (S1-S4). As Kv 
channels, the BK channel is a tetramer; however, 
unlike Kv channels, the BK channel protein consists 
of seven transmembrane (S0-S6) domains with 
an exoplasmic N-terminus (Fig. Ic). The intracellular 
C-terminal domain, comprising two thirds of the pro¬ 
tein, contains four hydrophobic segments (S7-S10) 
and the Ca^'^-and Mg "^-binding sites. The C-terminus 
of BK channels consists of two tandem 
RCK domains (only RCKl is shown in Fig. lb). The 
RCK domain in the BK channel was initially unveiled 
by MacKinnon’s group (Jiang et al. 2001) 
by multiple sequence alignment of the BK channel 
with prokaryotic K"^ channels and other proteins 
known to possess the RCK domain motif. Based on 
their primary sequence, the C-terminal domains of K"^ 
channels Slol and Slo3 are believed to be structured as 


G.C.K. Roberts (ed.), Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
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Calcium Activation of K'^ Channels: RCK Domains, 
Fig. 1 RCKs in Ca^'^-activated K"^ channels, (a) Structure of 
the open MthK channel (PDB 3LDE; Ye et al. 2010). In this 
representation, one of the subunits has been eliminated for the sake 
of clarity and the RCK domains are represented by colored ovals. 
Blue ovals are RCK domains that are integral parts of the channel¬ 
forming proteins, while green ovals come from the cytosol. All 
eight RCK domains are represented in the figure and the one 
lacking the pore-forming subunit is colored in cyan. Green spheres 
are K"^ ions, (b) Sequence alignments of the RCKs of MthK and of 
RCKl of Drosophila (dSlol), mouse (mSlol), and human (hSlol). 
Primary sequences used were directly taken from the Uniprot 
database (sequence numbers 027564, Q03720, Q08460, 


Q12791, respectively), and aligned using ClustalW. There is 
about 35% identity between these proteins, but they are 
similar regarding secondary structure, (c) The a-subunit of the 
BK channel is formed by 7 transmembrane (S0-S6) segments 
and a large C-terminus. Segments S1-S4 (VSD) have been impli¬ 
cated in voltage sensing whereas S5 and S6 form the conduction 
pore. The C-terminus contains two putative RCK domains within 
which are located the Ca^"^- and Mg^'^-binding sites. Spread in the 
BK cytoplasmatic domain are located the sites for Ca^"^ high 
affinity, D367/M513/E535 (in RCKl; Wu et al. 2010), Q889/ 
D892/D895/D897 (in the calcium bowl in RCK2; Yuan et al. 
2010), and the Ca^"^ low affinity, which also coordinate Mg^"^, 
S337/E374/E399 in the RCKl 


two RCK domains in tandem (Salkoff et al. 2006). actual structural nature of the C-termini of these two 
Unlike Slol, however, Slo2 is activated by internal channels is unknown at present and will not be 
Na"^ and Cl", and Slo3 is activated by protons. The discussed further here. 
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Two RCK Domains Are Present in the 
C-Terminus of the BK Channel 

The sequence and predicted secondary structure of the 
cytoplasmic domain comprised by amino acid residues 
339-516 in BK and which includes the S7 domain are 
homologous to a RCKl domain found in a number of 
channels (Jiang et al. 2001). Homology models 
based on the E. coli six transmembrane channel 
and the Ca^'^-activated channel, MthK, were pro¬ 
posed for the BK RCKl. The existence of a second 
RCK domain (RCK2) contained in the distal part of the 
C-terminus received support from a structure-based 
alignment study of the C-terminus of the BK and 
prokaryotic RCK domains. Circular dichroism experi¬ 
ments, on the other hand, support a structural homol¬ 
ogy of 90% between the crystal structure of MthK’s 
RCK2 and that proposed for BK. The structure pro¬ 
posed for BK’s RCK2 (residues 712-999) includes the 
calcium bowl, and has the characteristic Rossmann- 
fold topology of RCK domains. In addition, the 
putative RCK2 undergoes a Ca^'^-induced change in 
conformation associated with a ot-helix to (J-folded 
structural transition; deletion of the calcium bowl 
impairs this transition (Yusifov et al. 2008). 

The Structure of the BK RCK Domains 
Revealed 

Recently, two groups determined the X-ray structure 
of the human BK C-terminus at 3 A and 3.1 A resolu¬ 
tion, respectively (Wu et al. 2010; Yuan et al. 2010). 
The structures obtained confirmed the presence of two 
tandem RCK domains in each BK a-subunit, and the 
deduced tetrameric structure indicated that the BK 
C-terminus forms a 350 kDa gating ring at the intra¬ 
cellular membrane surface. This study also confirmed 
that the Ca^"^ bowl located within the second (RCK2) 
of the tandem RCK domains forms a Ca^'^-binding site. 
This site is located on the outer perimeter of the gating 
ring, in a region called assembly interface (Fig. 2a). 
Thus, as is the case of the MthK channel, the resulting 
structure of the BK gating ring is an octamer of RCK 
domains; however, unlike the MthK channel where 
each subunit of the tetramer contributes one RCK 
domain and another is assembled from solution, both 
RCK domains in the BK channel are intrinsic to 
the a-subunit. This struSctural difference may cause 


Ca^"^ to bind to different sites within these two gating 
rings. Indeed, in the MthK, Ca^"^ binds on the “flexible” 
interface where a cleft between two RCK domains 
creates the Ca^'^-binding site (Fig. 2b). 

The structures of the BK gating ring, the MthK 
channel, and the chimeric Kvl.2/Kv2.1 channel were 
used to build a complete model of the BK channel. In 
this model, some mutations of functional importance 
are clustered toward the N-terminus of RCKl. 
These amino acids reside on the surface of the 
gating ring that faces the VSD, indicating that Ca^"^ 
binding, besides contributing directly to the energy 
necessary to open the pore, may also modulate the 
voltage sensor. 

Coupling Between Ca^'^ Binding and Pore 
Opening 

One important question is how the gating ring extracts 
the free energy of Ca^"^ binding to open the BK channel 
gates. It has been proposed that the RCK1-S6 coupling 
system is contained in the linkers, which behave as 
a passive spring that applies force to the gates in the 
absence of cytoplasmic Ca^"^ to modulate voltage- 
dependent gating (see ► Fluorescence Techniques for 
Studying Ion Channel Gating: VCF, FRET, and 
LRET). An internal Ca^"^ increase changes this force 
and increases further the BK channel open probability. 
It is important to mention that the crystal structure of 
the MthK gating ring obtained in the absence of Ca^"^ 
versus that of the open MthK channel allows one to 
visualize the open and putative closed conformations 
of the Ca^’^-binding domain. This comparison shows 
that calcium binding to each of the RCK domains 
induces an expansion of the gating ring that in turn 
can exert a lateral force on the pore opening the chan¬ 
nel (Latorre et al. 2010). 

More recently, the likely molecular basis of 
a disease has shed light on the structures involved in 
the allosteric coupling between Ca^"^ binding and BK 
channel opening (Yang et al. 2010). A mutation in the 
neighborhood of the N-terminus of the BK RCKl, 
D434G, has been associated with the human syndrome 
of generalized epilepsy and paroxysmal dyskinesia. 
This mutation is within a region that encompasses the 
secondary structures PA-aC, hence called the AC 
region. This region in the N-terminus of the BK 
RCKl domain enhances BK channel activity in 
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Calcium Activation of K'^ Channels: RCK Domains, 
Fig. 2 The RCK domains in BK and MthK channels form 
gating rings, (a) BK gating ring at 6 A resolution. The ring is 
viewed down the fourfold symmetry axis with RCKl in blue and 
RCK2 in red. Calcium ions are shown as yellow spheres, (b) BK 
RCKl and RCK2 domains of one subunit showing the position 
of the Ca^'^-binding site (calcium bowl) in the RCK2 domain. 
Calcium {orange hall) is coordinated by D892/D895/D897/ 


Q889. (c) The open gating ring structure from the MthK channel 
viewed down the fourfold axis of symmetry. Notice that a Ca^"^ 
binds to the assembly interface in the BK gating ring whereas 
two Ca^"^ ions bind to the flexible interface in the MthK gating 
ring, (d) Left. Location of the Ca^’^-binding site in the MthK 
RCK domain. Right. Amino acid residues involved in Ca^"^ 
binding in the MthK RCK domain. Notice that a water molecule 
is also part of the site (Yuan et al. 2010) 


a manner independent of binding. The results 
indicate that this mutation increases Ca^"^ sensitivity 
by augmenting the flexibility of the AC region, 
strongly suggesting that it is part of the coupling sys¬ 
tem that allows the transformation of chemical energy 
(Ca^"^ binding) to mechanical energy (pore opening). 

Location and Structure of Divalent Binding 
Sites in the BK RCK Domains 

Partial deletion or point mutations of the aspartates 
contained in the calcium bowl in RCK2 produced BK 
channels that were less sensitive to Ca^"^ (Latorre et al. 
2010). Indeed, amino acid residues Q889, D892, D895, 
and D897 in the calcium bowl coordinate the Ca^"^ ion 
(Yuan et al. 2010; Fig. 2a). The fact that disruption of 
the Ca^"^ bowl did not completely eliminate the BK 


channel sensitivity was a clear demonstration that 
there is more than one high Ca^"^ affinity site. The 
second high-affinity Ca^"^ sensor was identified on the 
proximal tail region embedded within the upstream 
RCKl domain including residue D367 (Fig. 3a). The 
two Ca^'^-binding sites activate the BK channel with 
different divalent cation selectivity and Ca^'^-binding 
properties and with very little cooperativity between 
them. More recently, Zhang et al. (2010) were able to 
unveil the structure of the high-affinity Ca^'^-binding 
site contained in the RCKl domain using the known 
C-terminus structure of the BK channel, electrophysi¬ 
ology, mutagenesis, and molecular modeling. They 
found that at this site, Ca^"^ is coordinated by four 
oxygens from the carboxylate groups side chain of 
D367 and E535 and a main chain carbonyl oxygen 
atom of R514. The structure of the RCK obtained in 
the absence of Ca^"^ indicates, however, that Ca^"^ in 
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Calcium Activation of Channels: RCK Domains, 
Fig. 3 Calcium- and Mg^'^-binding sites in BK RCKl domain, 
(a) Structural details of the BK high-affinity Ca^'^-binding site. 
Histine 365 appears to be involved in a pH-dependent activation 
of BK channels. Upon protonation this residue interacts electro¬ 
statically with D367 mimicking the effect of Ca^"^. It is unclear 
whether M513 binds Ca^"^ directly or if it is important in 
maintaining the Ca^’^-binding site structure, (b) Low-affinity 
Ca^"^- and Mg^'^-binding sites, located on the upper plateau 
of the gating ring and very near the membrane surface 
(Wu et al. 2010) 


this site is coordinated by D367, M513, and E535 but 
not by R514 (Fig. 3a) (Wu et al. 2010). Apparently the 
Ca^"^ bowl speeds up channel activation at low Ca^"^ 
concentrations, whereas the second high-affinity site 
contained in RCKl modulates both activation and 
deactivation at [Ca^"^] >10 pM. Surprisingly, the bind¬ 
ing of Ca^"^ to the high-affinity site contained in the 
RCKl domain is voltage-dependent whereas that at the 
Ca^"^ bowl is not. 

Additional research identified a third lower-affinity 
divalent cation sensing domain in the RCKl, which is 
able to bind Mg as well as Ca^"^. Mg is coordinated 
at the interface between the voltage sensor domain and 
the RCKl domain to activate BK channels. Side chains 
of amino acid residues Asp99 (intracellular SO-Sl 
linker) and Asnl72 (intracellular S2-S3 linker) in the 
VSD, and Glu374 and Glu399 in the RCKl domain of 
a different subunit domain form the Mg^'^-binding site 
(Fig. 3b), suggesting a close proximity between these 
two domains. This proximity also enables the electro¬ 
static interaction between the bound Mg and Arg213 
contained in the S4 segment, which in turn affects the 
displacement of the voltage sensor giving an elegant 
mechanistic explanation to the activation of the 
BK channel by Mg The data suggests that 
the VSD domain of one subunit is located on top of 
the RCKl domain from the neighboring subunit, 
implying that the packing of the VSD domain or the 
cytoplasmic domain relative to the pore domain in BK 
channels may differ from that of MthK channels. How¬ 
ever, the structure of the Ca^'^-binding sites contained 


in the RCKl domain of the BK channel was inferred 
from the structure of the gating ring in the absence 
of Ca^-". 

Cross-References 

► Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRFT, and LRFT 

► Potassium Channels: Their Physiological and 
Molecular Diversity 
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Abbreviations 

CaP Calcium Phosphate 

HA Hydroxyapatite 

CHA Carbonated Hydroxyapatite 

TCP Tricalcium Phosphate 

BCP Biphasic Calcium Phosphate 

MBCP Micro Macroporous Biphasic Calcium 
Phosphate 

Introduction 

Although bone tissue possesses the capacity for regen¬ 
erative growth, the bone repair process is impaired in 
many clinical and pathological situations. Large bone 
loss caused by trauma and tumor resection and/or 
aging, require reconstructive surgery and/or 
bone regeneration. At present, bone surgeons have 
three different possibilities when it comes to replacing 
bone. 

Autogenous bone grafts are considered as the gold 
standard for bone replacement, in spite of large pain, 
septic complications, limited amount harvested from 
the iliac crest or other sites. Allogenic bone grafts 
obtained from tissue banks also have limitations 
because of the possible transmission of 
nonconventional agents or viruses and the risk of immu¬ 
nological incompatibility 

Alloplastic Bone substitutes are produced in various 
compositions and shapes (Fig. 1). These biomaterials 
can be used alone to fill bone cavities, serving as 
a scaffold for bone regeneration from the peri-implant 
region. Bone substitutes can also be used to supple¬ 
ment autogenous bone or in combination with bone 
marrow aspirates (Nath and Basu 2009). The ideal 
biomaterial should have a variety of forms and sizes, 
all with sufficient strength for use in load-bearing sites, 
and also be biocompatible, biodegradable and able to 
be substituted by newly formed bone. 


At the turn of the millennium, numerous synthetic 
bone graft materials are available as alternatives to 
autogenous bone for repair, substitution or augmenta¬ 
tion. Synthetic biomaterials include essentially special 
glass ceramics described as bioactive glasses and 
calcium phosphates (calcium hydroxyapatite, HA; 
tricalcium phosphate, TCP; and biphasic calcium 
phosphate, BCP). The review of Daculsi et al. (2009) 
reported the important historical data for CaP 
Bioceramics. Albee (1920) reported the first successful 
application of a calcium phosphate reagent for the 
repair of bone defect in human. More than 50 years 
later, clinical use of a “tricalcium phosphate” prepara¬ 
tion in surgically created periodontal defects in ani¬ 
mals was reported and the use of dense hydroxyapatite 
HA as immediate tooth root replacements. In the early 
1980s, synthetic HA and P-tricalcium phosphate 
P-TCP, became commercially available as bone 
substitute materials for dental and medical applications 
(De Groot 1983). 

The term biphasic calcium phosphate (BCP) 
describes bioceramics, that consisted of a mixture of 
HA and p-TCP (Legeros et al. 2009a). The first studies 
on BCP with varying HA/P-TCP reported by Daculsi 
and LeCeros (2008) demonstrated that the bioactivity 
of these ceramics may be controlled by manipulating 
the HA/p-TCP ratios. Subsequently, focused 
studies on BCP led to the significant increase in man¬ 
ufacture and use of commercial BCP bioceramics 
as bone substitute materials for dental and orthopedic 
applications (LeCeros 1991; Daculsi and LeCeros 
2008). 

There is then a long history on calcium phosphate, 
and especially calcium orthophosphate (Table 1). 
A large review of the chemical processing, 
manufacturing, biological properties, and applications 
was realized recently (Dorozhkin 2010). 

Formation of calcium phosphate, pure HA, pure 
P-TCP, or BCP (mixture of HA and p-TCP) is obtained 
after sintering biologic apatite or synthetic apatite 
obtained either by precipitation or hydrolysis methods. 
The BCP composition (HA/P-TCP ratio) obtained 
after sintering depends on the calcium deficiency of 
the unsintered biologic or synthetic apatite and tem¬ 
perature (LeCeros 1991). Presence of other ions during 
the preparation of the unsintered CDA can also affect 
the HA and P-TCP or BCP sintering. 

Sintering calcium phosphate reagents 
(labeled as “hydroxyapatite” or “calcium phosphate. 
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Calcium Phosphate- 
Derived Biomaterials, 

Fig. 1 Different CaP 
bioceramics currently used in 
orthopedic and dental 
surgeries (Source: 

Biomatlante company, 
Vigneux-de-Bretagne, France) 
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Calcium Phosphate-Derived Biomaterials, Table 1 Different calcium orthophosphate (Adapted from Dorozhkyn (2010)) 


Ca/P atomic 
ratio 

Chemical name 

Acronym 

Formula 

Solubility 
at 25°C, Ks 

Temperature of 
phase stahility” 

0.5 

Monocalcium phosphate anhydrous 

MCPA 

Ca(H2P04)2 

~10^' 

Low 


Monocalcium phosphate 
monohydrate 

MCPM 

Ca(H2P04)2, H 2 O 

-10-' 

Low 

1.0 

Dicalcium phosphate anhydrous 
(monetite) 

DCPA 

CaHP 04 

-10-’ 

Medium 


Dicalcium phosphate dihydrate 
(brushite) 

DCPD 

CaHP 04 , 2 H 2 O 

-10-’ 

Low 

1.33 

Triclinic octacalcium phosphate 

OCPt 

Ca8(P04)4(HP04)2, 5 H 2 O 

-10-'™ 

Low 


Amorphous octacalcium phosphate 

OCPam 

Ca8(P04)4(HP04)2, nH20 

b 

Low 

1.5 

Qt or a! tricalcium phosphate 

a-TCP 

Ca3(P04)2 

- 10 -“ 

High 


P tricalcium phosphate 

P-TCP 

Ca3(P04)2 

- 10 -™ 

Medium 


Amorphous tricalcium phosphate 

TCPam 

Ca9(P04)6, nH20 

b 

Low 

1.5-1.67 

Calcium Deficient Hydroxyapatite 

CDHA 

Cai0_x(HPO4)x(PO4)6-x(OH)2_x 
(0 < X < 1) 

-10“®^ 

Low 

1.67 

Hydroxyapatite 

HA or 

HAp or 
OHAp 

Caio(P04)6(OH)2 

- 10 -'™ 

High 


Oxyapatite 

OA or 

OAp or 
OXA 

Caio(P04)60 

- 10 -™ 

High 


Fluoroapatite 

FA or FAp 

Caio(P04)6F2 

- 10 -'™ 

High 

2.0 

Tetracalcium phosphate 
(hilgenstockite) 

TTCP Or 
TetCP 

Ca 4 (P 04)20 

- 10 -^" 

High 


"“High >1,100°; Medium until 1,100°C; Low < 800°C 

'’Solubility of amorphous calcium phosphate cannot he determined precisely but Ks range is ~10^^^-10^^^ 


tribasic” or “tricalcium phosphate”) above 900°C was 
shown to result in pure HA, pure p-TCP, or BCP. 
BCP ceramic may be also prepared by mechanically 
mixing two types of synthetic apatites or 
commercial calcium phosphate reagents. (LeGeros 
et al. 2009b). 


The Major Properties of Calcium Phosphate 
and Related Bioceramics: Micro- and 
Macrostructure of Calcium Phosphate 

Two physical properties of bioceramics are believed to 
be very important for optimum biological 
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Calcium Phosphate-Derived Biomaterials, Fig. 2 SEM of 

Mbcp® block showing interconnected macroporous structure 


performance, including bioceramic-cell interactions, 
bioceramic resorption, bioceramic-tissue interface, 
and new bone formation. These fundamental physical 
properties are interconnecting macroporosity and 
appropriate microporosity. Macroporosity (Fig. 2) in 
the BCP ceramics is introduced by incorporating 
volatile materials (e.g., naphthalene, hydrogen perox¬ 
ide, or other porogens), which form macropores once 
sintered. The mixture is heated at temperature below 
200° C and followed by sintering at higher tempera¬ 
tures. Microporosity (Fig. 3) is a consequence of the 
temperature and duration of sintering: the higher the 
temperature, the lower the microporosity content and 
the lower the specific surface area (Daculsi et al. 2008). 

Mechanical Properties 

The interest of smart calcium phosphate bioceramics is 
to be replaced by natural bone, without transient loss of 
the mechanical support. This challenge is difficult to 
achieve, as all efficient bioactive calcium phosphate 
bone substitutes are initially too brittle to support high 
bearing conditions. They need to be associated to 
osteosynthesis if the host bone is unable to support 
immediate loading (Cho et al. 2005; Legeros 
et al. 2009a; Nath and Basu 2009). 

It is expected that the pore size and percentage of 
macroporosity of the bioceramic will affect the 
mechanical properties. The preparation method also 
is found to have a significant influence on compressive 
strength. Bioceramics prepared from a single calcium- 



Calcium Phosphate-Derived Biomaterials, Fig. 3 Higher 
magnification of MBCP® block showing crystal size and micro- 
porous structure 


deficient apatite phase is reported to exhibit higher 
compressive strength compared to bioceramics 
prepared by mixing two unsintered calcium phosphate, 
the first resulting in only HA or only P-TCP after 
sintering at 1,200°C (Gauthier et al. 1999). 

The initial mechanical property is not the best 
property for bone ingrowth efficacy, particularly the 
increase of the mechanical property is at the expense of 
the microporosity due to use of high sintering temper¬ 
ature; the limited microporosity reducing resorption 
and bioactive processes. 

Today, the best bone substitutes are represented 
when the bone substitutes are resorbable and adsorb- 
able, with a micro- and macroporous structure. During 
the process of adsorption and bone ingrowth at the 
expense of the bioceramics, secondary mechanical 
property are obtained depending of the site of implan¬ 
tation, spongious bone, or cortical bone. The difficulty 
is to obtain enough mechanical property for machining 
and handling the bone substitute, and the bioactivity. 
The bioactivity is directly related to porosity, and 
higher the total porosity lower will be the mechanical 
property. 

Biological Property 

It is considered according the coefficient of dissolution 
that HA hydroxyapatite Caio(P04)60H2 is the less 
resorbable compared to the tricalcium phosphate 
Ca 3 (P 04 ) 2 . This is not representative of CaP 
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bioceramics. The resorbability depends on the SSA 
(Specihc Surface Area) of the crystals, and the 
nano/p porosity of the biomaterials. The SSA is 
depending on the sintering temperature and lower is 
the SSA (around 0.5-1 m7g) lower is the dissolution. 
Generally, the demonstrated adsorbable CaP 
bioceramics have a SSA around 3-6 m^/g. 

According to basic data, we can have higher 
dissolution/adsorption for very small crystals of HA 
(nanocrystals, high SSA, >5 m^/g) than large TCP 
crystals obtained at high temperature (low 
SSA<1 m^/g). 

We must consider today that all kinds of 
bioceramics (both HA and TCP) can gradually dis¬ 
solve in the body, seeding new bone formation as it 
releases calcium and phosphate ions into the biological 
medium easily absorbed for physiological mineraliza¬ 
tion (Ducheyne et al. 1993; Bohner 2008; Legeros and 
Daculsi 1990). 

The main attractive feature of bioactive bone 
graft materials such as BCP ceramic is their ability 
to form a strong direct bond with the host bone 
resulting in a strong interface compared to bioinert or 
biotolerant materials which form a fibrous interface 
(De Groot 1983; Daculsi 1998; Basu and Nath 2009). 

The formation of this dynamic interface is believed 
to result from a sequence of events involving interac¬ 
tion with cells, formation of carbonate hydroxyapatite 
CHA (similar to bone mineral) by dissolution/precipi¬ 
tation processes (Daculsi et al. 1990). 

The osteoinductive properties of calcium phosphate 
bioceramics are due to the formation of microcrystals 
with Ca/P ratios similar to those of bone apatite crys¬ 
tals observed after implantation. The abundance of 
these crystals is directly related to the initial 
HA/B-TCP ratio in the BCP: the lower the ratio the 
greater the abundance of the microcrystals associated 
with the BCP crystals (Daculsi and Legeros 2008). 
Using high resolution TEM, we demonstrated that 
the formation of these bone apatite-like microcrystals 
after implantation of calcium phosphates (HA, BCP) 
are nonspecific, that is, not related to implantation 
site (osseous or non-osseous sites), subjects of 
implantation, and types of CaP ceramics. The crystals 
growth is due to secondary nucleation and hetero 
epitaxy of biological HA on synthetic HA, TCP, and 
BCP crystals (Daculsi et al. 1990). 

The coalescing interfacial zone of biological apatite 
and residual bioceramics crystals provides a scaffold 


for further bone-cell adhesion and stem cells differen¬ 
tiation in osteogenic lines, and further bone ingrowth 
(Habibovic et al. 2008; Lenihouannen et al. 2005). The 
bone repair or bone regeneration process involves 
dissolution of calcium phosphate crystals and then 
a precipitation of carbonate hydroxyapatite (CHA) 
needle-like crystallites in micropores close to the 
dissolving crystals. The coalescing zone constitutes 
the new biomaterial/bone interface, which includes 
the participation of proteins and CHA crystals origi¬ 
nating from the BCP ceramic crystals. This has been 
described as a coalescing zone and dynamic interface 
(Daculsi and Dard 1994). The association of the chem¬ 
ical nature and the micro-macro structure has been 
described to be key properties required for osteogenic 
and/or osteoinductive synthetic biomaterials. 

Related Calcium Phosphate Bone 
Substitutes 

The need of material for minimally invasive surgery 
(MIS) induced the development of a concept of gran¬ 
ules combination with polymer or calcium phosphate 
cement for injectable/mouldable bone substitutes. 
Different kinds of injectable/moldable bone substitutes 
have been developed, like suspension of granules and 
polymers (De Santis et al. 2009). Non-self-hardening 
injectable biomaterial, are constituted of bioceramics 
granules associated to a hydrosoluble polymer. Such 
injectable bone substitutes are ready to use and able 
to be largely invaded by osteogenic cells. These 
materials are perfectly biocompatible and potentially 
resorbable and, thanks to their initial plasticity, 
they can fit bone defects very easily, without the neces¬ 
sity to elaborate shaping of implantation site. The 
putties do not have mechanical properties like hydrau¬ 
lic bone cement able to have a hardening process. 
However, bone cells are able to invade the spaces 
released by the disappearance of the polymer. Bone 
ingrowth takes place all around and at the expense of 
the resorption of the bioceramic grains. In time, 
mechanical property could be observed due to the 
presence of bone (Daculsi et al. 2010). 

Another concept of injectable bone substitute is 
calcium phosphate cement (Ginebra 2009). These 
materials are a combination of a calcium phosphate 
powder phase mixed before using with a liquid phase, 
the paste can be moulded and implanted and have the 
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ability to set once implanted within the body. How¬ 
ever, these ionic cements cannot be compared to CaP 
bioceramics in spite of HA crystal formation on time. 

At the present time, bioceramics are commercially 
available in blocks, particulates, and customized 
design. The need of material for “granules easy han¬ 
dling” and/or minimally invasive surgery (MIS) 
induced the development of a concept of granules 
combination with polymer or calcium phosphate 
cement for injectable/mouldable bone substitutes. 

Besides the medical and dental applications, BCP 
has a potential for other applications such as delivery 
system for drugs, antibiotics, hormones, carriers for 
growth factors, and scaffolds for tissue engineering. 
It depends on the chemical nature, nano-, micro-, and 
macrostructure, and in some cases combination with 
different kind of polymers. 

Summary 

More than half century has been necessary to develop 
bone substitutes and calcium phosphate bioceramics. 
From the first description from Albee (1920) of 
tricalcium phosphate as a bone substitute, numerous 
publications have proved the efficiency, the safety of 
calcium phosphate bioceramics, and related products. 
The more recent researches and development of cal¬ 
cium phosphate bioceramics to replace natural bone 
grafting has been focused to a better control of the 
process of resorption, absorption, and bone substitution. 
Hydroxyapatite HA is the more stable bioceramics, 
tricalcium phosphate is more soluble. The combination 
of different calcium phosphates is very popular and 
concerned the biphasic calcium phosphate. Depending 
of the basic chemical and structural properties, large 
differences were observed in kinetic of resorption/ 
adsorption, and bone architecture regeneration. 
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Definition 

Capillary electrophoresis is based on the use of narrow 
glass capillaries for the separation of molecules in an 
electric field. 

Basic Characteristics 

Electrophoresis is the separation of charged molecules 
in a fluid under the influence of an applied electrical 
field. Electrophoresis-based methods have many appli¬ 
cations in biochemical analysis since the pioneering 
work of Tiselius (1937) who employed moving¬ 
boundary electrophoresis for the separation of serum 
albumin and globulins. Tiselius was awarded the 
Nobel Prize in Chemistry in 1948 for this contribution. 
Currently, the most widely used electrophoresis 
method is gel electrophoresis for the separation 
and analysis of nucleic acids in molecular biology, 
for protein characterization during purification 
and for protein identification in proteomics 
(Westermeier 2005). 

Capillary electrophoresis (CE) was first described 
by Jorgenson and Lukacs in 1981 and is based on the 
use of narrow (10-200 um i.d.) fused silica (glass) 
capillaries for the separation of molecules in an electric 
field. Separation depends on the electrophoretic mobil¬ 
ity of the molecules which is proportional to their 
charge/size ratios. An additional factor that affects 
the mobility of molecules in CE is the electroosmotic 
flow (EOF) which is a bulk solvent flow directed 
toward the negatively charged cathode. EOF is due to 
the glass capillaries which have negatively charged 
silanol groups on their walls. This attracts positively 
charged ions from the CE buffer solution which form 
a diffuse double electrical layer. Under an applied elec¬ 
tric field, the positively charged ions in the layer migrate 
to the cathode carrying water along with them. EOF is 
the bulk solvent flow to the cathode. The ionic strength 
and pH of the buffer affect the magnitude of EOF. 
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Glass Capillary 



Schematic of a Capillary Electrophoresis System 


Capillary Electrophoresis of Cell Metabolism, Fig. 1 The 

basic instrumental set-up consists of a high voltage power 
supply, two electrodes, two electrolyte buffer reservoirs linked 
by a capillary and a detector 

When EOF is greater than the electrophoretic mobility 
of species in solution, there is a net movement of pos¬ 
itively charged, neutral, and lastly negatively charged 
species toward the cathode. Since all types of molecules 
can be separated by CE including nucleic acids, amino 
acids, peptides, proteins, carbohydrates, drugs, 
metabolites etc., CE finds applications in diverse areas 
such as food and environmental analysis, forensics, 
medical diagnostics, pharmaceutical science, genetic 
typing, and biotechnology (Landers 2008; Unger 
2009; Volpi 201 1). 

The basic instrumentation to perform capillary elec¬ 
trophoresis is relatively simple and requires a high- 
voltage power supply, electrodes, two electrolyte 
buffer reservoirs linked by a capillary and a detector 
(Fig. 1). Online detection methods include UV-visible 
absorption and fluorescence, but nonabsorbing mole¬ 
cules can be derivatized before analysis. CE has numer¬ 
ous advantages including speed, minute sample size, 
high separation efficiencies, low volumes of waste pro¬ 
duction, and it can be automated. The completion of the 
Human Genome Project ahead of schedule and at 
a fraction of the estimated cost was due to commercial 
development of CE-based DNA sequencers. 

Metabolomics is the analysis of low-molecular 
weight endogenous metabolites that are intermediates 
and products of cell metabolism. Due to their chemical 
diversity, no single technique is amenable for all metab¬ 
olites. CE coupled to mass spectrometry (CE-MS) 


is being developed for both the fingerprinting and quan¬ 
titative analysis of metabolites in bacterial, plant, and 
tissue extracts and biological fluids such as plasma and 
urine (Ramautar et al. 201 1). However, cell populations 
are usually heterogeneous, thus increasingly sensitive 
methods are being sought for quantitative single 
cell metabolomics including CE-MS and CE 
with laser-induced fluorescence detection (Landers 
2008; Amantonico et al. 2010; Heinemann 
and Zenobi 2010). Sample processing is also challeng¬ 
ing with respect to preservation of the original 
metabolome which is subject to turnover and alteration 
by enzymes. 
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Carbohydrate Antibiotics 

Derek Horton 

Department of Chemistry, American University, 
Washington, DC, USA 


Synonyms 

Carbohydrates; Carbohydrate-structured microbial 
antimetabolites 


Definition 

Carbohydrate antibiotics refers to microbial metabo¬ 
lites (mainly from Actinomyces species) having 
antibacterial and antifungal properties and which are 
carbohydrates or contain a carbohydrate component 
in their chemical structure. Types include the 
aminoglycosides, macrolides, nucleoside analogs, 
and glycosylated aromatic structures. 


Basic Characteristics 

A revolution in therapies to combat infectious patho¬ 
gens took place in 1937 with the discovery by Fleming 
of penicillin, a microbial metabolite, and the first 
of a class of P-lactam antibiotics. The field of carbo- 
hydrate-structured antibiotics of major clinical utility 
opened subsequently with Waxman’s discovery 
(Waxman et al. 1944) of streptomycin in the fermen¬ 
tation broth of an Actinomyces species. Streptomycin 
was the first example of the class of aminocyclitol 


(also termed aminoglycoside) antibiotics, and it was 
found particularly useful in the treatment of tubercu¬ 
losis, although a high incidence of ototoxicity is 
a limitation in its modem clinical use. 

Elucidation of its chemical stmcture provided a por¬ 
tent of the rich diversity of unusual sugar structures in 
microbial metabolites, as later revealed in 
numerous other antibiotics in this class and such other 
groups as the macrolides (for examlpe erythromycin); 
nucleoside antibiotics such as puromycin; glycosylated 
aromatic structures such as doxorubicin (adriamycin); 
and many others. Structure elucidation of the monosac¬ 
charide components of these antibiotics divulged a pro¬ 
fusion of novel structural types, such as deoxy sugars, 
amino sugars, branched-chain sugars, higher-carbon 
sugars, sugars of unusual stereochemistry, to name just 
a few (Hanessian and Haskell 1970), and provided the 
stimulus for a great deal of innovative work in devel¬ 
oping procedures for their chemical synthesis. 

Streptomycin is a pseudotrisaccharide having an 
amino sugar, A-methylglucosamine as the unusual 
L enantiomer, linked by way of a branched-chain 
sugar component (streptose), in turn connected to 
a guanidinocyclohexanepolyol termed “streptidine,” 
as shown in the accompanying scheme: 


c 




NHMe 


Streptomycin 


The structure was largely elucidated by Wolfrom 
and his group (Lemieux and Wolfrom 1948). This 
pioneering work presaged the subsequent discovery of 
numerous other aminocyclitol antibiotics produced by 
microorganisms, notably the neomycins, kanamycins, 
and gentamicins. The discovery, structure determination. 
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and understanding of the mode of action of these agents 
owe much to the work of the Umezawa brothers 
(Umezawa 1974a, b), who also showed that synthetic 
modification of parent antibiotic structures could over¬ 
come the problem caused by development of drag- 
resistant strains of bacteria. Later work has disclosed 
the role of aminocyclitol-nucleic acid recognition, and 
in particular, ribosomal binding in the mechanism of 
bactericidal action (Wills and Arya 2006). 

Access to carbohydrate antibiotics by chemical 
assembly from their component sugars has demon¬ 
strated exercises in synthetic virtuosity, but in general 
their production through microbial fermentation 
remains the most economically viable approach. 
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Carbohydrate Circular Dichroism 

Kunihiko Gekko 

Hiroshima University, Higashi-Hiroshima, Japan 

Synonyms 

Carbohydrates; Optical activity; Optical rotation 


Definition 

Circular dichroism (CD) is defined as the difference 
between the absorptivity of left- and right-circularly 
polarized light. Chiral molecules containing asymmet¬ 
ric carbon atoms have different optical activities for 
the two circularly polarized components, which results 
in characteristic CD spectra in the visible to vacuum- 
ultraviolet (VUV) region. Since CD is very sensitive to 
the chemical bonds and substituents (i.e., chromo- 
phores) surrounding asymmetric carbon atoms, CD 
spectroscopy is a powerful technique for analyzing 
the stereochemistry and conformation of organic com¬ 
pounds and biomolecules (Fasman 1996; Berova et al. 
2000). Carbohydrates contain hydroxyl and acetal 
groups with high-energy electronic transitions below 
190 nm, which is the short-wavelength limit of con¬ 
ventional CD spectrophotometers in aqueous solution. 
CD measurements have been extended into the VUV 
region below 190 nm by improvements to the optical 
device and cell, and recently by using ► synchrotron 
radiation (SR) as an intense light source (Wallace and 
Janes 2009), allowing direct observation of the 
chiroptical properties of the electronic transitions of 
carbohydrate molecules. 

Basic Characteristics 

CD spectra of carbohydrates can be roughly divided 
into three wavelength regions. The two most common 
substituents, acetamido and carboxyl groups, display 
CD bands associated with the n-tt* transitions at 
200-240 nm and the ti-ti* transitions at 180-200 nm, 
whereas the n-a* transitions of acetal and hydroxyl 
groups produce bands at 140-180 nm. Many CD data 
of carbohydrates are comprehensively discussed in the 
reviews of Johnson (1987) and Stevens (1996). 

Unsubstituted Carbohydrates 

Monosaccharides exist in a complex equilibrium of 
two ► anomeric forms (a and (J) of the hydroxyl 
group at C-1, three staggered conformations of the 
hydroxymethyl group at C-5 {gauche-gauche, 
gauche-tram, and tram-gauche), and two chair con¬ 
formations {*Ci and *C 4 ). The CD spectra of many 
monosaccharides and methyl aldopyranosides have 
been measured down to 165 nm in water and D 2 O, 
and to 140 nm using film samples (Johnson 1987; 
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Carbohydrate Circular Dichroism, Fig. 1 VUVCD spectra of 
monosaccharides (top) and disaccharides (bottom) in aqueous 
solution at 25°C (redrawn from Matsuo and Gekko 2004) 


Stevens 1996). Film CD spectra provide important 
information on the originating orbital and energy 
levels (state assignments), but the CD bands in aque¬ 
ous solution have not been explicitly assigned due to 
the complexity of the equilibrium conformations. 

Figure 1 shows the ► vacuum-ultraviolet circular 
dichroism (VUVCD) spectra of five monosaccharides 
and disaccharides in aqueous solution, which were 
measured down to 160 nm using an SR-VUVCD spec¬ 
trophotometer (Matsuo and Gekko 2004). These 
monosaccharides have very similar structures, but 
they exhibit markedly different spectra. The CD 
bands around 180-160 nm predominantly arise from 
the electronic transitions (n-a*) of ring oxygen atom 
(Listowsky and Englard 1968). Deconvolution of these 
spectra suggests that the anomeric hydroxyl group at 
C-1 and the staggered hydroxymethyl group at C-5 
exert marked effects on electronic transitions of ring 
oxygen atom. The orientation of hydroxyl groups at 
C-2 and C-4 may also make some contributions to the 
spectra. 

The VUVCD spectra of disaccharides qualitatively 
resemble those of the monomer components. However, 


it is apparent that the spectra of the constituent mono¬ 
mers are not additive since the conhguration of the 
anomeric hydroxyl group that links two monomers is 
hxed and the resulting ether linkages (1 —> 2, 1 ^ 3, 
1^4, and 1^6) have different CD. The linkage CD 
may be estimated as the difference between the spectra 
of disaccharide and the constituent monomers. The dif¬ 
ference between the solution and him CD spectra of 
disaccharides suggests that they are affected by hydra¬ 
tion and conformational flexibility. 

There are many VUVCD data for ► polysaccha¬ 
rides, most of which have been reported by Stevens 
and coworkers (Stipanovic and Stevens 1981; Stevens 
1996). All of the studied o-glucans consisting 
of (1 ^ 3)- and (1 ^ 4)-lirLkages exhibit positive 
bands for the a-linkage (e.g., amylose) and 
negative bands for the (l-linkage (e.g., cellulose) at 
164-172 nm, but no such correlation with anomeric 
conhguration is observed for (1 ^ 6) glucans: both 
dextran (a-linkage) and pustulan (P-linkage) show 
a positive band around 167 nm. Some of the gel¬ 
forming polysaccharides display a small negative 
band at 180-190 nm. This band is assigned to the 
ether oxygen atom of the linkage and so may reflect 
the local flexibility of the polysaccharide chain. Aga¬ 
rose solution shows a positive band at 180 nm, which 
decreases in intensity and shifts to longer wavelengths 
due to melting of the gel as the temperature increases. 

Substituted Carbohydrates 

Many biologically important carbohydrates include 
various substituents, such as carboxyl, acetamido, 
and sulfate, with some of them exhibiting absorption 
in the far-UV region. SR-VUVCD measurements in 
aqueous solution reveal that V-acetylglucosamine has 
a negative band around 210 nm, a positive band around 
180 nm, and a shoulder around 190 nm, while 
V-acetylgalactosamine shows two negative bands 
around 210 and 170 nm and a positive band around 
190 nm (Matsuo et al. 2009). o-Glucuronic acid shows 
a positive band around 210 nm and a negative band 
around 185 nm, both of which are strongly pH depen¬ 
dent. The electronic transitions of these chromophores 
are complicatedly influenced by the solvent condi¬ 
tions, the surrounding substituents, and the ring 
conformation. 

Typical substituted polysaccharides are ► glycos- 
aminoglycans, which are linear polymers composed 
of repeating disaccharide units of hexosamine 
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Carbohydrate Circular Dichroism, Fig. 2 VUVCD spectra of 
glycosaminoglycans in aqueous solution at 25° C (redrawn from 
Matsuo et al. 2009) 

(glucosamine or galactosamine) and uronic acid 
(glucuronic or iduronic acid). These disaccharides 
units have similar structures, but glycosaminoglycans 
exhibit very different VUVCD spectra in aqueous 
solution, as shown in Fig. 2 (Matsuo et al. 2009). The 
configuration of the carboxyl group and the number 
and position of sulfate group definitely influence the 
spectra in the VUV region. Theoretically assigning the 
bands in the VUV region is difficult due to the large 
number of overlapping electronic transitions (n-ti*, 
7t-7i*, and n-a*) and the modified ring conformation. 
However, the contributions of the substituents may be 
estimated by comparing the spectra of polysaccharides 
and the constituent monomers. 

Carbohydrates exhibit complex CD spectra due to 
their structural diversity. However, their high sensitiv¬ 
ity to the electronic transitions of chromophores pro¬ 
vides important information on the conformation, 
interaction, and hydration of carbohydrates in solution. 
VUVCD spectrophotometers have been developed 
using SR light sources at many facilities worldwide 
(Wallace and Janes 2009). The accumulation of 
VUVCD data in solution and their theoretical assign¬ 
ments would provide new insight into carbohydrate 
chemistry and structural biology. 

Cross-References 

► Anomeric Effect in Sugars 

► Far UV Protein Circular Dichroism 

► Glycosaminoglycan (GAG) 


► Polysaccharides 

► Synchrotron Radiation 
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Synonyms 

Carbohydrates 


Definition 

Carbohydrate enzymology includes the enzyme- 
catalyzed formation, degradation, and functional mod¬ 
ification of glycosides of glycoproteins and glycolipids 
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and the biosynthesis of polysaccharides, lipopolysac- 
charides, glycosaminoglycans, etc. Biophysical, bio¬ 
chemical, and genetic methods are used to unravel the 
mechanisms involved and have provided information 
for biotherapeutic development. 

Introduction 

In nature, carbohydrates exist as monosaccharide units 
connected to one another and to proteins, lipids, or 
other species (Stick and Williams 2009; Varki et al. 
2009). The degradation of the glycosidic bond 
connecting these monosaccharides is catalyzed by sev¬ 
eral types of enzymes: glycoside hydrolases, which 
result in hydrolysis; carbohydrate lyases, which 
effect cleavage by elimination; and phosphorylases, 
which cleave glycosidic bonds by phosphorolysis. 
The formation of the glycosidic bond is catalyzed by 
several types of enzymes: glycosyltransferases, 
transglycosidases, and phosphorylases. Glycosyl¬ 
transferases use sugar nucleotide (di)phosphates 
or sugar (di)phospholipids as glycosyl donors. 
Transglycosidases are mechanistically related to gly¬ 
coside hydrolases and catalyze the transfer of 
a glycosyl fragment from one glycoside to another. 
Phosphorylases can synthesize glycosidic bonds 
by the reverse of their cleavage mechanism. Co- or 
post-biosynthetically, glycoconjugates are modified 
with a range of adorning groups including sulfates, 
phosphates, acyl groups, and ethers, which utilize 
a series of transferases and hydrolases for their instal¬ 
lation and removal. 

Formation and Cleavage of the Glycosidic 
Bond 

Glycoside Hydrolases and Transglycosidases 

Glycoside hydrolases catalyze the cleavage of glyco¬ 
sidic bonds through C-0 scission of the exocyclic 
acetal bond. Glycoside hydrolases can be classified as 
retaining or inverting enzymes on the basis of whether 
this reaction occurs with retention or inversion of 
the anomeric stereochemistry (Vocadlo and Davies 
2008). After enzymatic cleavage the resulting sugar 
hemiacetal mutarotates to form a mixture of anomeric 
isomers. Glycoside hydrolase specihcity can be 
dehned by the following characteristics: (a) the 


anomeric specificity (a or P); (b) the stereochemical 
outcome (retaining or inverting); (c) whether the 
cleavage occurs within (endo-) or at the non-reducing 
end (exo-) of a sugar chain; and (d) the nature of 
the sugar that is cleaved. For example, an inverting 
exo-p-galactosidase forms single a-o-galactose mono¬ 
mers from cleavage of the non-reducing end of 
P-linked galactoconjugates. 

Inverting glycoside hydrolases operate by a one- 
step reaction in which a catalytic dyad of enzymatic 
carboxylic acid residues act as a general acid and base, 
respectively, to simultaneously deprotonate a water 
molecule and protonate the glycosidic oxygen, leading 
to a hydrolytic substitution with anomeric stereo¬ 
inversion (Fig. la). 

Retaining glycoside hydrolases operate through 
a two-step mechanism involving two groups acting as 
nucleophile and general acid/base (Fig. lb). In the hrst 
step the nucleophile attacks the anomeric center and 
the general acid protonates the glycosidic bond, 
resulting in the formation of a covalent intermediate 
with inversion of anomeric stereochemistry. In 
the second step the former general acid acts as 
a general base to deprotonate a water molecule, 
which attacks the covalent intermediate leading to 
its hydrolysis with a second inversion of anomeric 
stereochemistry. The net outcome of the reaction is 
therefore overall retention of anomeric configuration. 
For both inverting and retaining glycoside hydrolases, 
there is strong evidence from kinetic isotope effect 
studies that the transition states possess substantial 
oxocarbenium ion character with little participation 
by the nucleophile and nucleofuge (Zechel and 
Withers 2000). 

Most commonly, the nucleophile and general acid/ 
base of retaining glycoside hydrolases are enzymic 
carboxylates (aspartate or glutamate). Many (but not 
all) retaining P-A-acetylhexosaminidases, chitinases, 
and hyaluronidases proceed through an alternative 
mechanism wherein the 2-acetamido group of the sub¬ 
strate acts as a nucleophile in a mechanism involving 
substrate-assisted catalysis (Fig. Ic) (Vocadlo and 
Davies 2008). Myrosinases, which are retaining 
P-glucoside hydrolases that catalyze cleavage of 
glucosinolates, naturally occurring oximylsulfate 
thioglycosides, have the typical general acid/base 
glutamic acid mutated to a non-nucleophilic glutamine 
(Zechel and Withers 2000). The glucosinolate sub¬ 
strate possesses an anomeric substituent that is 
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Carbohydrate Enzymology, Fig. 1 Mechanisms of glycoside 
cleavage by glycoside hydrolases, transglycosylases, polysac¬ 
charide lyases, and a-glucan lyase: (a) inverting [l-glycosidase; 
(b) retaining P-glycosidase (Rj = H) and transglycosylase 


(Rj = sugar); (c) retaining W-acetylhexosaminidase using 
anchimeric assistance (R, = H) or transglycosylase 

(Ri = sugar); (d) polysaccharide lyase; (e) a-glucan lyase 


a sufficiently good leaving group that it does not 
require general acid assistance. However, in the second 
step of the reaction, hydrolysis of the glycosyl enzyme, 


it has been found that the co-factor ascorbate functions 
in the role of a general base. In some sialidases, the 
enzymic nucleophile has been shown to be an enzymic 
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tyrosine group rather than a carboxylate (Vocadlo and 
Davies 2008). In this case it has been proposed that 
a neutral tyrosine nucleophile prevents potentially 
destabilizing charge repulsion between the anionic 
substrate and a carboxylate residue. 

Transglycosylases are enzymes that catalyze the 
transfer of glycosyl groups from one glycoside to 
another (Fig. lb, c). These enzymes use essentially 
the same mechanisms as retaining glycoside hydro¬ 
lases. Following formation of a glycosyl enzyme with 
an enzymic aspartate, glutamate or tyrosine, transgly- 
cosylation to another alcohol (most commonly a sugar) 
as a nucleophile in place of water results in the forma¬ 
tion of a new glycoside. 

Lyases 

Carbohydrate lyases are enzymes that cleave glyco- 
sidic linkages through a process of elimination 
(Yip and Withers 2006). There are two main types of 
lyases: polysaccharide lyases that result in cleavage 
of substrates by loss of an acidic proton adjacent 
to a leaving group in glycosaminoglycans and 
other uronate-containing polysaccharides; and the 
starch-degrading a-glucan lyases that cleave substrates 
by elimination of an intermediate glycosyl enzyme. 
Polysaccharide lyases use an enzymic general base to 
deprotonate the proton a to the carboxylate resulting in 
a simple elimination reaction (Fig. Id). a-Glucan 
lyases share sequence homology with starch-degrading 
glycoside hydrolases, and in common with them 
share the first half of a retaining glycoside hydrolase 
mechanism, resulting in the formation of a glycosyl 
enzyme intermediate (Fig. le). In the second step 
an elimination occurs, to afford an unsaturated 
2-hydroxyglycal, which tautomerizes to its keto form, 
1,5-anhydrofructose. 

Glycosyltransferases and Phosphorylases 

Glycosyltransferases catalyze the transfer of glycosyl 
residues from activated sugar-1-phosphate donors to 
various acceptors. The activated donors include 
mono and disaccharide (di)phosphonucleosides 
(so-called Leloir enzymes), carbohydrate (di)phospho- 
lipids (non-Leloir enzymes), and sugar-1-mono- 
and diphosphates (phosphorylases and 
pyrophosphorylases). 

Leloir and non-Leloir glycosyltransferases catalyze 
glycosyltransfer with either retention or inversion of 
anomeric configuration (Lairson et al. 2008). 


Structural and kinetic data for inverting glycosyl¬ 
transferases support a mechanism that proceeds by 
direct displacement through an oxocarbenium-ion- 
like transition state with general base catalysis by 
carboxylate or histidine residues (Fig. 2b). For 
retaining glycosyltransferases two mechanistic alter¬ 
natives are viable possibilities. A two-step mechanism 
involving sequential inversions by an enzymic 
nucleophile that proceeds through a glycosyl enzyme 
intermediate, or an SNi mechanism that involves the 
non-concerted loss of the leaving group and the same- 
side attack by the acceptor group (Fig. 2a). 

Phosphorylases are enzymes that cleave glycosidic 
bonds by reaction with phosphate (Kitaoka and 
Hayashi 2002). This reaction is in practice reversible, 
and so the physiological direction in which this 
reaction occurs may result in either the synthesis or 
cleavage of a glycosidic linkage. Phosphorylases 
may utilize glycoside-hydrolase-like mechanisms, 
involving retention or inversion of configuration 
through the involvement of auxiliary carboxylate 
residues, and the resulting enzymes are structurally, 
and sequence, related to glycoside hydrolases (e.g., 
maltose phosphorylase). Other phosphorylases 
are more closely related to glycosyltransferases. 
One such enzyme, glycogen phosphorylase, requires 
the cofactor pyridoxal phosphate, which is believed 
to be a vestigal mimic of its evolutionary glycosyl- 
transferase precursor nucleoside monophosphate 
(Lairson et al. 2008). Pyrophosphorylases are 
enzymes that cleave glycosidic bonds by reaction 
with pyrophosphate. Like phosphorylases, the physio¬ 
logical direction of this reaction may be in the reverse 
sense. 

Modification of Carbohydrates 

Sulfotransferases and Sulfatases 

Carbohydrate sulfation is a modification that occurs 
after the assembly of the carbohydrate glycan struc¬ 
ture, and is a frequent modification of glycosamino¬ 
glycans such as heparin/heparan sulfate, chondroitin 
sulfate, dermatan sulfate, and keratin sulfate (Varki 
et al. 2009). Sulfate groups are installed by the action 
of sulfotransferases, which catalyze the transfer of 
a sulfuryl group (SOs^) from the reactive donor 
3'-phosphoadenosine-5'-phosphosulfate to a carbohy¬ 
drate alcohol or amine (Chapman et al. 2004) (Fig. 3a). 
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Glycosyltransferases: 



R' = nucleoside, phosphonucleoside, lipid, phospholipid 


Phosphorylases: 



Carbohydrate Enzymology, Fig. 2 Glycoside forming reactions. Reactions catalyzed by (a) retaining and (b) inverting glycosyl¬ 
transferases, and (c) retaining and (d) inverting phosphorylases 


X-ray crystal structures of sulfotransferases support 
a mechanism involving direct in-line transfer of the 
sulfuryl group from PAPS to the acceptor, releasing 
3'-phosphoadenosine-5'-phosphate. 

Carbohydrate desulfation is catalyzed by hydrolytic 
enzymes termed “sulfatases” (Hanson et al. 2004) 
(Fig. 3b). The majority of sulfatases possess a unique 
amino acid residue, formylglycine, which is critical for 
their ability to function as catalysts. Several enzymic 
mechanisms have been proposed for this cleavage 
reaction, none of which has definitive evidence in 
their favor. 

Phosphotransfer and Phosphatases 

Phosphorylation of monosaccharides is common in 
primary metabolism but is a relatively rare modifica¬ 
tion of complex carbohydrates in secondary metabo¬ 
lism. Phosphorylated carbohydrates are found as part 
of the mannose-6-phosphate group used as a lysosomal 
targeting signal (Fig. 3c), in phosphoglycan in proto¬ 
zoa, and in the (lipo)phosphoglycans lipoteichoic acid 
and wall teichoic acid of Gram-positive bacteria. 


Mannose-6-phosphate residues are formed in a two- 
step process through the initial transfer of an A-acetyl- 
D-glucosamine-1-phosphate group (by UDP-A- 
acetylglucosamine phosphotransferase) onto acceptor 
mannose residues, followed by the cleavage of 
the Ai-acetyl-D-glucosamine residue (by A-acetyl- 
glucosamine-1 -phosphodiester a-A-acetylglucosa- 
minidase) (von Figura and Hasilik 1986). 

Acyl Transferases and Esterases 

Carbohydrates can be modified as esters of acetic acid 
or longer chain fatty acids (Stick and Williams 2009). 
Esterification is common in bacterial lipopolysaccha- 
rides such as lipid A (Fig. 3d), and in bacteria of the 
grouping Corynehacterinae such as mycobacteria. It is 
also found as a relatively common modification of 
sialic acid residues (Fig. 3e). Transfer is catalyzed 
by acyltransferases from acetyl-coenzyme A and 
fatty acyl-coenzyme A. Hydrolysis of carbohydrate 
esters is catalyzed by esterases that typically possess 
an a/p hydrolase fold and a conserved catalytic 
triad, Ser-His-Asp, which is common to lyases. 
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Carbohydrate Enzymology, Fig. 3 Biosynthesis and removal of carbohydrate- glycophospholipid; (e) Structure of 9-0-acetyl sialic acid, the preferred ligand for 

modifying groups: (a) Sulfation catalyzed by sulfotransferases; (b) Desulfation cata- influenza C; (f) Structure of 3-O-methylmannose polysaccharide from Mycobacterium 

lyzed by sulfatases; (c) Phosphorylation of mannose residues in N-linked glycoproteins; smegmatis 
(d) Structure of core oligosaccharide glycosylated lipid A, a potently immunogenic 
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Other carbohydrate esterases are metal-ion-depen- 
dent. The antigen 85 complex of mycobacteria can 
catalyze the transfer of fatty acyl groups between 
sugar residues. 

Methylation 

Methylation of carbohydrate hydroxyl groups is wide¬ 
spread in bacteria but comparatively little is known 
about its biological intent. 0-Methyltransferases use 
S-adenosylmethionine as the methyl donor (Stick and 
Williams 2009). The 3-(9-methylmannose-containing 
polysaccharides and 6-(9-methylglucose-contaimng 
lipopolysaccharides of certain mycobacteria form 
stoichiometric complexes with long chain fatty acyl- 
coenzyme A and may have roles in regulating its 
synthesis or transport within the organism (Fig. 3f). 

Cross-References 

► Glycoproteins 

► Glycosaminoglycan (GAG) 

► Glycoside Hydrolases - Computational Studies 

► Glycosylphosphatidylinositol 
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Carbohydrates; Molecular dynamics simulations of 
carbohydrates 

Introduction 

Structural diversity of carbohydrates is enormously 
large and plays the central part in intermolecular inter¬ 
actions regulating a broad spectrum of biological func¬ 
tions. Apart from primary and secondary structures, 
conformation and dynamics are properties that are 
important for our understanding of carbohydrate roles 
in living systems. Whereas 3D molecular structures 
describe only static pictures in the solid state or in 
solution, molecular dynamics analysis provides infor¬ 
mation on molecular motions, both overall and inter¬ 
nal, and the timescales. Knowledge of dynamics 
in solution is particularly important for a detailed 
understanding of intermolecular interactions, such 
as protein-carbohydrate interactions. Weak 
intermolecular interactions are energetically governed 
by two opposing thermodynamic quantities - enthalpy 
and entropy. As molecular flexibility is strong 
entropic drive, dynamics considerably influences car¬ 
bohydrate interactions with other biomolecules. 
In many cases, carbohydrate binding to proteins cannot 
be fully explained considering only static 3D struc¬ 
tures. Dynamics of the molecules in solution thus 
significantly influence the equilibrium association 
constants. Analysis of carbohydrate 3D structure and 
solution dynamics has therefore been of noticeable 
interest to experimental and theoretical chemists 
in recent years. “Dynamics” is used here to 
describe motions at the glycosidic linkages, 
pendant groups (such as hydroxymethyl groups), 
as well as pseudorotation of furanose and pyranose 
rings. 
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NMR Spectroscopy in Analysis of 
Carbohydrate Dynamics 

NMR spectroscopy is the main experimental tool used 
in analysis of carbohydrate dynamics at present 
(Jimenez-Barbero and Peters 2002). Besides conven¬ 
tional one-dimensional methods, two-dimensional 
methods are performed to collect relaxation parame¬ 
ters such as spin-lattice, spin-spin relaxation times and 
NOEs in oligo- and polysaccharides. Proton-detected 
methods based on double INEPT schemes yield suffi¬ 
ciently resolved spectra and the relaxations rates can 
be derived from series of 2D spectra collected with 
various relaxations intervals. In most cases, cross¬ 
correlation between chemical shift anisotropy and 
dipolar relaxation mechanisms should be suppressed 
to achieve higher accuracy of the derived relaxation 
parameters due to the fact that interference effects 
cannot be neglected in carbohydrates in most cases. It 
should be also noted that carbohydrates are constituted 
largely from three different elements only; thus 
data are predominantly collected for analyses. To 
a lesser extent, measurements are also performed 
(Dais and Perlin 1987); however, applications 
are often limited due to the complexity of *H NMR 
saccharide spectra. 

Various models for both isotropic and non-isotropic 
overall motions, with and without the presence of 
internal motions, have been proposed for interpretation 
of relaxation data (Heatley 1986). However, not all 
these models are suitable for studies of the motional 
parameters of saccharides in solution. Too complex 
models, especially those that consider the molecular 
motion as fully non-isotropic, require an extended set 
of relaxation data to compute the motional parameters. 
Apart from overall molecular motions, the presence of 
internal motions can influence the overall molecular 
shape and the molecule may tend to adopt a less 
ordered shape. Substituents, nonbonded interactions, 
hydrogen bonds, and solvents considerably affect 
molecular structures and dynamics. Consequently, 
oligo- and polysaccharides can have complex molecu¬ 
lar shapes and motional properties. Some saccharides 
have relatively well-defined shape, such as helical 
structures (some glycosaminoglycans, alginates), 
some, on other hand, have quite extensive internal 
motions resulting in less ordered molecular shapes, 
e.g., random coil (e.g., amylose). The overall molecu¬ 
lar shape is difficult to determine as it is affected by all 


the above-mentioned phenomena. Thus, the choice of 
the appropriate model for description of saccharide 
motional properties is not straightforward. 

Generally, internal motions can be divided into 
three groups depending on their rate: fast, intermedi¬ 
ate, and slow. East motions range from small-scale 
fluctuations (up to 10-15°) within the energy 
well with frequencies in the order of about 10*^ Hz, 
through rotations of the hydroxymethyl groups or 
motions at the glycosidic linkages at frequencies on 
the picosecond or nanosecond timescales (^10*°-10^ 
Hz) to overall motions at frequencies on the nanosec¬ 
ond timescale. Intermediate timescales cover 
ranges from microseconds to milliseconds and 
are associated with exchange phenomena in oligosac¬ 
charides or overall motions in large polysaccharides or 
glycoproteins in viscous media. However, intermedi¬ 
ate and slow motions (milliseconds up to seconds) 
have not been systematically studied in saccharides. 

The most comprehensive studies using NMR 
have focused on analysis of fast internal motions. 
A number of theoretical approaches enable calculation 
of motional parameters in case of fast internal 
motions (Heatley 1986). In the model-free approach, 
the assumption is made that internal motion is 
fast compared with the overall tumbling. In such 
case, the spectral density functions for an isotropically 
tumbling molecule can be expressed as (Lipari and 
Szabo 1982) 


C 




Sc-H^tO 
1 -|- (toCu)^ 


+ (l — ^C-H^) 


T 

1 -I- (rojf 


where Tq is the overall correlation time of the molecule, 
1/t = I/tq -I- l/te and tg is the effective correlation time 
characterizing the rate of internal motion. 5 c-h^ is the 
order parameter that describes the spatial restriction of 
the internal motion of the C-H relaxation vector. 
Assumption of isotropic motion is usually not correct 
for oligo- and polysaccharides as these types of mole¬ 
cules have a prolonged shape. If the molecular shape 
can be approximated as an axially symmetric prolate 
ellipsoid, the spectral density can be expressed in the 
form 


,/ls(cu) = 5c-H^7an(m) + (l ” ^C-H^) 


1 -f (tea) 


2 ( 1 ) 


where 
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and Tj^ = l/6Dj^ and t|| = 1/6D||. t|| and Tj^ are the 
correlation times for reorientation about the long and 
short axis, respectively; 0 is the angle between the C-H 
relaxation vector and the symmetry axis. 

NMR relaxation experiments performed at different 
magnetic fields and applications of the described 
models enabled calculation of motional parameters in 
various oligosaccharides. For example, in heparin 
pentasaccharide (Fig. 1), two-dimensional double 
INEPT methods were used to determine the Ti 
and T 2 relaxation times for ring carbons (Hricovini 
and Torri 1995). The experimental Tj and T 2 curves 
with the best fit for the A3 (nonreducing end residue), 
II (for Ti), and 13 {T 2 ) carbons from the iduronic acid 
residue are shown in Fig. 2. T 1 and T 2 relaxation times 
were determined from these decays. Theoretical anal¬ 
ysis of the relaxation times showed that the experimen¬ 
tal values can be interpreted only with spectral 
densities corresponding to a symmetric top model 
with internal motion (Eq. 1). Computed values of t|| 
and Tj^ (450 ps and 1.6 ns, respectively), the internal 
motion correlation times (tg) for individual residues, 
and the order parameters for C-H relaxation vectors 
are listed in Table 1. The computed data show that the 
order parameters are different for each residue and 
decrease from the central residue toward both ends of 
the molecule indicating that the terminal residues are 
less restricted in their internal motions. The analysis 
also showed differences in the values for different 
relaxation vectors and that the rate of internal motion is 
on the picosecond timescale. 

Other NMR experimental studies also revealed 
quite pronounced dynamics in oligosaccharides with 


internal motion correlation times varying from about 
50 up to 500 ps (Jimenez-Barbero and Peters 2002; 
Hricovini 2004; Vliegenthart and Woods 2006). Order 
parameters were found predominantly in the interval of 
0.7-0.9 where the lower values were found for 
hydroxymethyl groups or terminal residues. The over¬ 
all molecular motions in oligosaccharides have been 
characterized by symmetric top models with few 
exceptions (mainly for monosaccharides) where the 
model for isotropic motions has been used. It should 
be also mentioned that the conformational behavior at 
different glycosidic linkages depends on the size of 
saccharide. As expected, small oligosaccharides tend 
to be more flexible owing to the reduced steric restric¬ 
tions of neighboring units. In larger branched oligo¬ 
saccharides, both rigid and flexible parts are present in 
the structure. Example of the molecule with such 
dynamics is Man 9 GlcNAc 2 (Woods et al. 1998). 
Molecular conformation of Man 9 GlcNAc 2 is relatively 
well preserved despite the local flexibility and does not 
much differ from the conformation adopted in the 
glycoconjugate from. It seems that the comparable 
structures and motional properties are due to the com¬ 
pensatory motions at the glycosidic linkages and 
together with correlated motions influence the internal 
dynamics. The similar type of motions are present also 
in the trisaccharide, Man(al-3)[Man(al-6)]Manp- 
OMe (Tian et al. 2001). Both types of motions, com¬ 
pensatory and correlated, are possibly due to the 
complex hydrogen bond network between these 
oligosacccharides and solvent. 3D structure and 
dynamics of this glycan, together with the hydrogen 
bond system (partially mediated by water molecules), 
brings essential information on the mechanism of 
interaction with proteins. 

Dynamics is even more complex in glycoproteins 
where the effect of the protein upon internal motions in 
the oligosaccharide part should be considered. NMR 
data collected in oligosaccharide linked to the Asn78 
residue in human chorionic gonadotropin indicated 
that the flexibility of 1-3 and 1-6 linkages in the 
core trimannoside fragment (Man(al-3)[Man(al-6)] 
ManP) differed from that in the aqueous solution 
(Thijssen-van Zuylen et al. 1998). In general, the anal¬ 
ysis of dynamics in glycoproteins showed that even 
relatively small peptides can reduce flexibility of the 
glycan core. The effect is obviously mutual, i.e., the 
saccharide part affects the protein part in 
glycoproteins. 
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Carbohydrate Molecular Dynamics: Oligosaccharides, Polysaccharides, and Glycoconjugates, Fig. 1 Structure of heparin 
pentasaccharide 



Carbohydrate Molecular Dynamics: Oligosaccharides, 
Polysaccharides, and Glycoconjugates, Fig. 2 Experimen¬ 
tal Ti and T 2 curves with the best fit for the A3 carbon 

Theoretical Approaches in Analysis of 
Carbohydrate Dynamics 

Theoretical analysis can also provide information on 
structures and dynamics of carbohydrates in solutions 
as well as on weak intermolecular interactions between 
carbohydrates and other biomolecules. Molecular 
dynamic simulations (MD) and ah initio MD methods 
can be considered as powerful complementary 
methods to experiments in investigating the properties 
of carbohydrates at atomic resolution and contribute to 
the understanding of relationships among structure, 
dynamics, and functions. 

MD is at present the most frequently used theoret¬ 
ical method due to its relative simplicity and atomic 
and temporal resolution. Force fields, such as 



(nonreducing end GlcN,6 S residue), II (ri cuiwe) and 13 
(T 2 curve) carbons from the iduronic acid residue (IdoA2S) 


CHARMM, AMBER, OPES, GLYCAM, CSEE, or 
GROMOS 45 A4, have been applied for MD carbohy¬ 
drate simulations (Csonka et al. 2009). More rigorous 
approaches include quantum mechanics/molecular 
mechanics (QM/MM) methods or ab initio MD simu¬ 
lations. However, as ab initio MD simulations suffer 
from high demands on computing time these methods 
are at present effectively limited to relatively small 
molecules. Solvent is evaluated either through explicit 
representation of the solvent molecules or with contin¬ 
uous models. 

Similarly to experiments, theoretical analysis has 
also been focused on fast internal dynamics in carbo¬ 
hydrate molecules, i.e., internal rotations of OH 
groups, hydroxymethyl groups, and motions at the 
glycosidic linkages. In monosaccharides, MD analysis 
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Carbohydrate Molecular Dynamics: Oligosaccharides, 
Polysaccharides, and Glycoconjugates, Table 1 Computed 
values of the internal motion correlation times (le, picosecond/ 
radian) and the order parameters (Sc-h^) for individual residues 
in heparin pentasaccharide 


Residue 

Te 

>5c-h^ 

GlcN,6 S 

31 

0.72 

GlcA 

32 

0.85 

GlcN,3,6 S 

50 

0.91 

IdoA2S 

28 

0.83 

GlcN,6 S 

15 

0.70 


of internal rotations of OH groups showed that the rate 
of these motions is on a picosecond timescale, typi¬ 
cally up to 100 ps, and that the motions are weakly 
correlated (Krautler et al. 2007). The range of the rate 
of rearrangement among three stable conformers 
{gauche-gauche, gauche-trans, or trans-gauche) of 
the hydroxymethyl group seems larger than that for 
OH groups and span from picoseconds up to nanosec¬ 
ond timescales. However, the conformational distribu¬ 
tions, gauche-gauche, gauche-trans, or trans-gauche, 
depend upon stereoelectronic effects and hydrogen 
bonds and therefore differs in various monosaccha¬ 
rides. In oligosaccharide models of cellulose, e.g., in 
cellobiose (Sarko et al. 1990), the hydroxyl group 
remained in each of these three staggered conformers 
considerable time with the exchange rate within 100 ps 
(Fig. 3). 

Particularly important are alternations in the 
furanose and pyranose ring forms. Pseudorotation can 
considerably influence 3D oligo- and polysaccharide 
3D structures and, consequently, also affect 
intermolecular interactions. Pseudorotation of the 
L-iduronic acid (IdoA) pyranose ring and its 2-0- 
sulfated form has been particularly intensively studied 
due to role of this residue in interactions with proteins. 
The IdoA unit adopts three forms in aqueous solutions, 
namely, "*Ci, *C 4 , and '^Sq (Casu et al. 1986; Mulloy 
and Forster 2000; Rudd et al. 2009). MD simulations 
(Pol-Fachin and Verb 2008) highlighted the impor¬ 
tance of hydrogen bonds that stabilize the IdoA forms 
in oligo- and polysaccharides and indicate that inter¬ 
conversion among these forms is on nanosecond time- 
scale. Such conformational equilibria of IdoA residues 
were in agreement with experimental data in MD sim¬ 
ulations performed for a decasaccharide fragment of 
heparin (Verb and Guimaraes 2004). More rigorous 
analysis using ab initio MD enabled description of 


400 



time (picoseconds) 


Carbohydrate Molecular Dynamics: Oligosaccharides, 
Polysaccharides, and Glycoconjugates, Fig. 3 Trajectories 
of the rotation of the hydroxymethyl group in cellobiose (Sarko 
et al. 1990) 

different conformations (*C 4 , *’^5 and °5'2) in the pyra¬ 
nose forms, simulation of trajectories for the inversion 
of these forms, and calculation of free-energy barriers 
(lonescu et al. 2005). 

Internal motions at the glycosidic linkages are 
important as motions with even small amplitude accu¬ 
mulate and amplify over long chains in polysaccha¬ 
rides and result in extensive intramolecular motions. 
On the other hand, motional properties in “small” 
molecules, such as disaccharides, can also be quite 
pronounced and many disaccharides or trisaccharides 
exhibit considerable internal dynamics. MD simula¬ 
tions showed that flexibility is inherent in most oligo¬ 
saccharides and that the rate of internal motions is on 
pico- and nanosecond timescales (French and Brady 
1990; Jimenez-Barbero and Peters 2002; Vbegenthart 
and Woods 2006). Nevertheless, as already mentioned 
in the previous part, formation of intra- and 
intermolecular hydrogen bonds, together with the sol¬ 
vent effect, are important factors that affect the dynam¬ 
ics of oligo- and polysaccharides. For example, MD 
simulations of methyl-P-cellobioside showed that 
a hydrogen bond between the ring oxygen on the 
nonreducing end and the hydroxyl group (OH-3) at 
the reducing end strongly influences the flexibility at 
the glycosidic linkage as well as the overall molecular 
structure. Apart from intramolecular hydrogen bonds, 
interresidue water bridges also affect molecular flexi¬ 
bility (Christensen et al. 2010). In general, 1-6 disac¬ 
charides are more flexible than 1-2,1-3, or 1^-bnked 
disaccharides due to the existence of additional degree 
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of freedom at the C5-C6 linkage and the lack of 
intramolecular hydrogen bond. 

Dynamics of saccharides has also been analyzed 
using the optimized Rose Zimm local dynamics 
approach (Perico et al. 1999). The analysis of (al-3) 
Glc, (pl-3)Glc, (al-4)Glc, (pl-4)Glc, (al-4)Gal, and 
(Pl-4)Gal polysaccharides with different degree of 
polymerization showed that internal residues are less 
mobile than those on both reducing and nonreducing 
ends of polysaccharide chains and local dynamics 
depends upon the anomeric configuration, the linkage 
type, and the chain length. In flexible polysaccharides, 
(Pl-3)Glc, (al-4)Glc, (pi-4)Glc, (pi-4)Gal, correla¬ 
tion times (ti) of individual residues are on 
a picosecond timescale (varying from 200 ps up to 
about 1 ns) whereas T; values are considerably longer 
(10-600 ns) for polysaccharides with limited internal 
dynamics, such as (al-3)Glc or (al-4)Gal. It should be 
also noted that as the polymer chains grows local 
motions are becoming less dependent upon the chain 
lengths. The above data were found in agreement with 
dynamics inferred from NMR experiments and MD 
simulations. 

Summary 

Experimental and theoretical data indicate that carbo¬ 
hydrates experience quite complex overall and internal 
motions in solutions. In most cases, the overall motions 
are non-isotropic and should be approximated with 
either symmetric top model or, more generally, con¬ 
sidering full anisotropic motions. Timescales of intra¬ 
molecular reorientations are in most cases on 
picoseconds to nanoseconds timescales. This is the 
case for rotations of OH groups, hydroxymethyl 
groups, and dynamics at the glycosidic linkages in 
oligosaccharides or flexible polysaccharides. Slower 
internal dynamics is in stiff polysaccharides, typically 
with diaxially oriented C-O bonds at the glycosidic 
linkages and saccharides having strong intramolecular 
hydrogen bonds. Data also indicate that pseudorotation 
could also be on a nanosecond timescale. However, 
intermediate and slow motions have not yet been fully 
studied either theoretically or experimentally and fur¬ 
ther investigations are needed to cover all timescales of 
carbohydrate dynamics. Such knowledge will contrib¬ 
ute for detailed understanding of interactions between 
carbohydrates and other biomolecules. 
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Introduction 

In the past years, the application of nuclear magnetic 
resonance (NMR) spectroscopy to carbohydrates has 
become a powerful tool for a full understanding of the 
molecular processes in which carbohydrates are 
involved. Determining the three-dimensional structure 
of carbohydrates is crucial since their biological func¬ 
tion originates from their conformation. And NMR 
methods have become indispensable to gain insights 
into these conformational, structural, and dynamic fea¬ 
tures at both the atomic and molecular levels, 
a fundamental requirement for further successful drug 
design. Given the increasing interest in the biological 
roles of carbohydrates, recent advances in NMR can 
offer new insights into the functional understanding of 


carbohydrate interactions, which can overcome the 
challenge of developing new carbohydrate-derived 
therapeutics in the future. 

There are several factors that have made possible 
the recent advances in this topic. On the one hand, the 
development of new methods for the efficient synthesis 
of carbohydrates of increasing size and complexity; 
and on the other hand, the recent advances in NMR 
spectroscopy, both from the hardware and software 
perspectives, such as the access to spectrometers with 
higher and higher magnetic field strengths and the 
development of new NMR techniques, facilitating 
higher sensitivity as well as increased resolution. 
These features have contributed to the characterization 
of very complex oligosaccharides and glycoconjugates 
(Jimenez-Barbero and Peters 2002). In addition, the 
development of small molecules that may mimic the 
structure and function of complex carbohydrates has 
also emerged as potential source for understanding 
carbohydrate recognition and has opened new avenues 
from the structural and conformational viewpoints 
(Ernst and Magnani 2009). 

In contrast to X-ray crystallography, NMR spec¬ 
troscopy allows one to monitor, from the qualitative 
to quantitative perspectives, the intrinsic physical flex¬ 
ibility of carbohydrates, providing information on their 
geometries and dynamics. However, given the inherent 
flexibility of carbohydrates, the combination of the 
NMR parameters with computational chemistry tools 
are usually required. Therefore, a multidisciplinary 
approach is often necessary to gain insight into the 
physical chemistry structural properties of carbohy¬ 
drates. The advantages and limitations of these struc¬ 
tural techniques have been previously reviewed 
(Wormald et al. 2002). 

The present chapter will briefly address some basic 
features of carbohydrate NMR as well as some of the 
recent methods and applications of this technique in 
the rapidly evolving carbohydrate research field, from 
the chemistry to the molecular recognition areas. 

Structure Determination 

NMR spectroscopy methods are able to provide basi¬ 
cally all the key information necessary to completely 
define the primary structure of a glycan molecule. 
The NMR chemical shift, the simplest parameter to 
access, is highly sensitive to the chemical environment 
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that is surrounding each NMR-sensitive atom. For 
instance, the presence of an electronegative group, as 
S 03 ~, in glycosaminoglycans produce a significant 
deshielding of the chemical shift of the directly 
bound protons to the corresponding carbon 0-sulfation 
point in the *H NMR spectra. Due to the limited *H 
chemical shift dispersion in the NMR spectra of carbo¬ 
hydrates, the anomeric proton resonances, found in the 
shift range 4.4-5.5 ppm, can be firstly analyzed and is, 
in principle, the easiest accessible nucleus. The chemi¬ 
cal shifts of these protons, as well as those belonging to 
the rest of the sugar ring, strongly depend on the struc¬ 
ture of the carbohydrate chain and are the basis for the 
concept known as structural reporter group. Some 
empirical rales can be followed for the identification 
and assignment of the *H NMR spectra of oligo and 
polysaccharides: (1) The anomeric protons are found 
more downfield than the remaining proton resonances. 
(2) In pyranose rings, the equatorial proton resonances 
appear downfield than the axial ones. For instance, H-2 
protons of Man moieties resonate at lower field than 
those of Glc rings. For anomeric protons, H-la reso¬ 
nates between 5.0 and 5.8 ppm, while H-ip resonates 
between 4.4 and 5.2 ppm (3) Vicinal coupling constant 
between the anomeric H-1 and the H-2, indicates 
the relative orientation of the two protons; in the case of 
D sugars and P anomers, a large coupling constant 
(7-9 Hz) is observed, whereas for a anomers this J is 
smaller (2-4 Hz). 

A combination of experiments based on interactions 
through bonds (spin-spin coupling) and interactions 
through space (dipole-dipole coupling) are required for 
the determination of the molecule assignment. Classical 
2D NMR homo- and heteronuclear experiments, such as 
homonuclear TOCSY and DQF-COSY, can help in the 
identification of individual spin systems. Given the addi¬ 
tional dispersion in the carbon dimension, heteronuclear 
'H-^^C HSQC or HMQC, together with HMBC exper¬ 
iments, provide important short- and long-range corre¬ 
lations, respectively, the latter often useful for the 
sequence assignment information. 

Despite the enormous structural diversity of oligo¬ 
saccharides, *H and chemical shifts for most 
monosaccharides and variations thereof can be found 
in the literature. In fact, there is detailed information 
for the wide variety of component monosaccharide 
residues, different anomeric configurations, ring 
forms, and for disaccharides and higher oligosaccha¬ 
rides, also for the different inter-glycosidic linkages. 


There are also a variety of sugar databases, containing 
chemical shift and structural features of characterized 
monosaccharide, oligosaccharide, and more complex 
carbohydrates, which can be very helpful for the 
assignment procedure. Some of them are CarbBank, 
Complex Carbohydrate Structure Database (CCSD), 
and SUGARBASE. In particular, CarbBank contains 
branched carbohydrates structures, while 
SUGARBASE is a carbohydrate-NMR database that 
combines CarbBank Complex Carbohydrate Structure 
Data (CCSD) with proton and carbon chemical shift 
values (Van Kuik and Vliegenthart 1993). Further¬ 
more, GlyNest and CASPER are two independent 
approaches to estimate *H and *^C NMR shifts of 
diverse glycans. 

Eor a basic view of NMR background, methodol¬ 
ogy, and routine applications for structural elucidation 
of different molecules, the reader is referred to special¬ 
ist books (i.e., Claridge 2009). Eor specific structural 
characterization of carbohydrates as well as practical 
considerations in this field, the reader can find 
more detailed information in different books (i.e., 
Sinott 2007) and review articles (Duus et al. 2000; 
Bubb 2003). 

Conformation 

The conformational features of biomolecules, includ¬ 
ing carbohydrates, are usually (but not uniquely) char¬ 
acterized by measuring nuclear Overhauser effect in 
different types of experiments, such as NOESY and 
ROESY. The NOE between two nuclei depends on the 
distance between them (oc r“®). Because of the confor¬ 
mational flexibility of carbohydrates around the glyco- 
sidic linkages, the derivation of molecular structure 
from NOE values is not straightforward. In addition 
to the I/;-® dependence of distance, the NOE has 
a complex time dependence, in which the overall 
molecular tumbling may interact with kinetics of the 
conformational exchange involved in the internal 
motion (Neuhaus and Williamson 1989). Moreover, 
the NOE-based approach works well when there are 
sufficient measurable NOEs that are sensitive to 
molecular shape, which is not usually the case for 
oligosaccharide structural analysis. In fact, one of the 
major targets to derive the conformational features of 
carbohydrates is the derivation of the torsion angle 
values, and T', which define the conformation 
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around the 0-glycosidic linkages. Very few NOEs 
possess information on these values. Additionally, vic¬ 
inal proton-proton coupling constants might also pro¬ 
vide crucial information for determining the 3D 
structure of molecules in solution, but they cannot be 
employed to derive the conformation around the gly- 
cosidic torsions. Nevertheless, in flexible structures 
such as carbohydrate molecules, scalar coupling con¬ 
stants may have the advantage that calculation of aver¬ 
age values over an ensemble of conformations is 
simpler than that for NOEs. Recent developments in 
multidimensional NMR have made possible a number 
of new methods for measurements of the 
interglycosidic heteronuclear coupling constants, 
which might be useful for conformational analysis. 
Experiments such as HMQC-NOESY and HMQC- 
TOCSY for the determination of long-range H-C and 
of the HMBC-type for the determination of C-H and 
C-C coupling constants permit access to explore trans- 
0-glycoside scalar couplings to derive the conforma¬ 
tion around pairs. In any case, taking into account 
the existing conformational averaging, it is worth not¬ 
ing that combining the NMR parameters with the data 
obtained from molecular mechanics (MM) and molec¬ 
ular dynamics (MD) simulations may be necessary to 
determine carbohydrate conformation (Vliegenthart 
and Woods 2006). Further examples of new sophisti¬ 
cated NMR methods for carbohydrate NMR assign¬ 
ment as 3D homo- and heteronuclear experiments, 
using *^C-enriched samples, have been carefully 
described (Duus et al. 2000). 

Dynamics and Recognition 

Other NMR techniques, such as *^C- and ^H-based 
NMR relaxation experiments, allow us to deduce 
dynamic parameters through the measurement of 
relaxation parameters, thus revealing important 
dynamic information in oligo- and polysaccharides 
(Jimenez-Barbero and Peters 2002). 

From the molecular recognition perspective, it is 
well known that carbohydrate-protein interactions 
are implicated in a variety of cell-cell and cell-matrix 
recognition events. Along these lines, the great poten¬ 
tial of the application of NMR technologies in the drug 
discovery process is also acknowledged (Zerbe 2003). 
In this context, NMR has become crucial for the study 
of the speciflcity and molecular recognition features of 


different neutral and charged (glycosaminoglycans) 
carbohydrates by carbohydrate-binding proteins 
(lectins, antibodies, and enzymes). These proteins spe- 
ciflcally recognize particular carbohydrate structures, 
usually with well-deflned molecular topographies. In 
these interaction studies, NMR offers some unique 
features that make it an attractive means for monitor¬ 
ing interactions either using single putative ligands or 
even for screening of carbohydrate libraries to identify 
target compounds that bind to speciflc proteins. By 
using NMR, it is also possible to deduce the specificity 
and affinity of binding, association constants, and equi¬ 
librium thermodynamic parameters (Poveda and 
Jimenez-Barbero 1998). 

Intermolecular Interactions 

NMR has the key advantage of being able to detect 
weak intermolecular interactions in the mM range, 
which can be beneficial in the first steps of the discov¬ 
ery procedure. Indeed, NMR is the ideal technique for 
a fragment-based screening protocol (Coles et al. 
2003). Basically, all NMR parameters may serve as 
sensors for detecting the binding of a ligand to 
a protein. Generally speaking, by using NMR, two 
different approaches can be used: receptor-based and 
ligand-based experiments, depending on the signals 
that are chosen to monitor the interaction. 

The employed protocols range from the robust and 
well-known chemical shift mapping approach, from 
the receptor’s viewpoint, to ligand-based techniques 
that monitor changes in the nuclear spin relaxation 
properties of the ligand’s nuclei upon binding, passing 
to measuring the changes in the translational diffusion 
coefficient of the ligand when passing from the free to 
the bound states. 

Starting from the ligand’s perspective, very proba¬ 
bly, the most frequently used NMR strategies are the 
saturation transfer difference experiment (STD) and, to 
a lesser extent, the transferred nuclear Overhauser 
spectroscopy (trNOESY) technique. Provided that the 
dissociation rate is fast in the relaxation timescale, 
these methods can provide detailed structural informa¬ 
tion on the binding characteristics of particular carbo¬ 
hydrate-protein interactions. STD experiment gives 
knowledge of the binding of a ligand to the target 
receptor and, in some favorable cases, defines the 
recognition epitope of the ligand. The information is 
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achieved by selectively saturating several resonances 
that belong to the receptor. The saturation is rapidly 
transferred throughout the whole protein reaching the 
protons at the binding site, which are also saturated. If 
the ligand binds, the saturation is transferred by cross¬ 
relaxation to the binding compound at the ligand- 
receptor interface (Mayer and Meyer 1999) thus 
changing the intensities of the ligand. These changes 
in intensities are easily detected by using difference 
spectroscopy and permit differentiation of binding 
from nonbinding entities. Also, since the first protons 
of the ligand to feel the magnetization transfer are 
those in closer contact with the protein, information 
on the binding epitope can also be achieved. The 
trNOESY experiment not only allows to deduce the 
existence of binding, but also to know the conforma¬ 
tion of the ligand in the bound state, an information 
which is of paramount importance in the drug design 
process. When small molecules bind to large receptor 
proteins, the observed exchange-transferred NOE 
cross-peaks undergo drastic changes in sign (from 
positive to negative) thus providing the required infor¬ 
mation. The observation of negative trNOESY cross¬ 
peaks relies on the existence of very different tumbling 
times Tc for the free (in the picosecond timescale) and 
bound (in the nanosecond timescale) molecules. 
Eurther analysis of these experimental data with 
molecular modeling programs can help to deduce and 
refine the three-dimensional structure of the complex. 

The final aim of the study of any molecular recog¬ 
nition process is to know the structure-activity rela¬ 
tionships (SARs) that regulate the interaction between 
a particular glycan and its protein receptors to allow 
the rational design of molecules that enhance or abol¬ 
ish the interaction, depending on the final aim. Differ¬ 
ent rational ligand design strategies can be envisaged 
that range from the preparation of molecules that are 
pre-organized in their bioactive conformation, or the 
incorporation, within a given scaffold, of additional 
binding sites that can lead to enhancement of the 
affinity and to a reduced entropy cost of the binding 
process. Within this context, an interesting approach in 
the discovery of high-affinity ligands is the “SAR by 
NMR” concept (Shuker et al. 1996). This receptor- 
based approach is based in the chemical shift pertur¬ 
bation analysis of the protein resonances (usually NH 
or CH atom pairs in HSQC-type of experiments) upon 
binding to different ligands. Thus, diverse ligands from 
a library can be identified that bind to two neighboring 


binding sites of a protein. Subsequent chemical linking 
of two of these ligands through the use of the proper 
spacers permit the production of high-affinity ligands. 
As a consequence, glycomimetics with therapeutic 
interest can be successfully designed (Ernst and 
Magnani 2009). 

Carbohydrate-Binding Proteins 

Relaxation measurements also provide valuable infor¬ 
mation on the extent of the motional changes that take 
place at specific amino acid residues of carbohydrate¬ 
binding proteins (from the receptor’s viewpoint) and 
also of the extent of rigidification that takes place at the 
oligosaccharide moieties upon binding to the receptor. 

In the last decades, together with the development 
of methods for isotope labeling of carbohydrates, 
the increasing application of residual dipolar couplings 
(RDCs) and anisotropies in carbohydrates has permit¬ 
ted retrieval of complementary structural (global 
orientational restraints) and dynamic (internal molec¬ 
ular motion) information in anisotropic media. This 
long-range type of information describes the relative 
orientation of remote atoms or molecular fragments, 
important information which is unavailable from J 
coupling- or NOE data-analysis alone. Measurements 
of both one-bond and two-bond and ^H-*H 

RDCs provide conformational information in oligo¬ 
saccharides and glycolipids, in a variety of alignment 
media, such as bicelles, bacteriophage, polyacryl¬ 
amide gels, and others. This approach has also been 
taken for the elucidation of carbohydrate-protein inter¬ 
actions, making possible determination of the relative 
orientations of the ligand and the protein. Advances 
made in alignment, data acquisition and analysis 
methods, and some examples have been reviewed 
(Jimenez-Barbero and Peters 2002; Prestegard et al. 
2004). 

Other Methods 

In addition to the use of standard *H and spectros¬ 
copy, further NMR methods have been developed 
employing other NMR-active nuclei such as *^P and 
^*P. In this context, possesses favorable NMR 
properties: 100% natural abundance, high sensitivity 
(yp/yH = 0.94), and high chemical-shift dispersion. 
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Thus, it is not unexpected that over the last decade, 
fluorinated carbohydrates have been used as NMR 
probes for investigating biologically relevant systems 
(Michalik et al. 2000). 

Most recently, solid-state NMR applications have 
been used for the study of biological molecules that 
were intractable to high-resolution liquid-state NMR 
techniques, based on experiments performed on static 
samples or recorded under magic-angle sample spin¬ 
ning (MAS) conditions (Zerbe 2003). 

Summary 

In the past years, NMR spectroscopy has been demon¬ 
strated to be a robust tool for carbohydrate research. 
The challenge, therefore, is to continue designing new 
NMR methodologies to completely characterize the 
structural, dynamic, and binding properties of a wide 
range of carbohydrates in biological systems, with the 
additional further possibility of developing carbohy¬ 
drate-derived therapeutics in the future. 

Cross-References 

► Bacterial Lipopolysaccharide, OPS, and Lipid A 

► Lectins 

► Molecular Dynamics Simulations of Carbohydrates 

► Polysaccharides: Biophysical Properties 

► Proteoglycans 

► Techniques Applied to Glycan Structure and 
Conformation 
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Synonyms 

Carbohydrates 


Introduction 

The name glucose was coined by Dumas in 1838 for 
the sugar obtained from honey, grapes, starch, and 
► cellulose; 20 years later, its molecular formula of 
C 6 H 12 O 6 was established, and subsequently it was 
shown to be a linear six-carbon aldehyde having 
hydroxyl groups on each of the chain carbon atoms. 
Polarimetric studies led Kekule in 1866 to propose the 
name “dextrose” because glucose is dextrorotatory, 
and the levorotatory “fruit sugar” (Fruchtzucker, 
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fructose) obtained by hydrolysis of cane sugar 
(sucrose) was for some time named “levulose.” The 
French word “cellule” for cell and the “-ose” suffix led 
to the term “cellulose,” long before its structure was 
known. The term “carbohydrate” (French “hydrate de 
Carbone,” German “Kohlenhydrate”) was applied orig¬ 
inally to monosaccharides in recognition of the fact 
that their empirical composition can be expressed as 
C„(Fl20)m. However, the term is now used generically 
in a wider sense. 


Carbohydrate 
Nomenclature, 
Scheme 1 o-glucose, 
Fischer projection 
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Structure and Conformaion 

During the last two decades of the nineteenth century, 
Emil Fischer (Fischer 1909) began his fundamental 
studies on carbohydrates, showing that phenylhydrazine 
reacts with glucose, mannose, and fructose to give the 
same crystalline phenylosazone, and he utilized the 
reaction introduced by Kiliani, the addition of hydrogen 
cyanide to a sugar, to give two isomeric acids. In his 
1890 address to the German Chemical Society (Fischer 
1890), he showed that “Traubenzucker” is a 2,3,4,5,6- 
pentahydroxyhexanal and that “Fruchtzucker” is 
a 1,3,4,5,6-pentahydroxy-2-hexanone. 

The concept of stereochemistry, developed since 
1874 by Van’t Hoff and Le Bel, had a great impact 
on carbohydrate chemistry because it could easily 
explain isomerism. From extensive chemical manipu¬ 
lations conducted with great skill, coupled with 
brilliant reasoning, Fischer correlated the families of 
sugars, and within 10 years, he was able to assign the 
relative configurations of most known sugars and to 
synthesize many additional examples (Fischer 1909). 
He assigned to the dextrorotatory glucose (via the 
derived glucaric acid) the projection having the OH 
group at C-5 pointing to the right (Scheme 1), well 
knowing that there was a 50% chance that this was 
wrong. Much later. Trommel and Bijvoet (1954) 
proved Fischer’s arbitrary assumption to be correct in 
the absolute sense, by using a special X-ray technique. 

Rosanoff (1906) selected the enantiomeric glycer- 
aldehydes as the point of reference; any sugar 
derivable by chain lengthening from what is now 
known as o-glyceraldehyde belongs to the d series, 
a convention still in use (Scheme 2). 

For a sugar having n stereocenters, there are 2" 
isomers, so that there are 77 = 16 aldohexoses, 8 in 
the D series and 8 in the l series. The names of the 


CHO 



Carbohydrate E 

Nomenclature, Scheme 2 = 

D-Glyceraldehyde CH 2 OH 

D-aldoses and their stereochemical relationships are 
illustrated in Scheme 3. These names form the basis 
of the group-configurational designators.- alio, altro, 
gluco, manno, gulo, ido, galacto, and talo for four 
stereocenters; riho, arahino, lyxo, and xylo for 
three stereocenters; and erythro and threo for two 
stereocenters. 

Cyclic Forms 

Toward the end of the nineteenth century, it was real¬ 
ized that the free sugars exist as cyclic hemiacetals or 
hemiketals. Individual sugars react with methanol under 
acid catalysis to give stable products termed methyl 
glycosides, which are mixed full acetals. These have 
a new asymmetric center at the original carbonyl atom 
and were thus isolated as pairs of isomers that 
were designated as the a and P forms, later termed 
anomers. In the d series, the a anomer was defined as 
the one having the more-positive specihc rotation, and 
in the l series, the a anomer has the more-negative 
specihc rotation. The phenomenon of mutarotation, 
discovered in 1846 by Dubranfaut, wherein the specihc 
rotation of a free sugar in water solution changes with 
time, was now interpreted as being due to equilibration 
of the anomeric hemiacetals formed through reaction of 
one of the chain hydroxyl groups with the carbonyl 
group. Emil Eischer assumed the cyclic form to be 
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Carbohydrate Nomenclature, Scheme 3 The D family of aldoses 
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a five-membered ring, but later work showed that the 
six-memhered ring is the common form. The a anomers 
of D-glucose in these two ring forms are here depicted in 
the Fischer projection (Scheme 4). 


Furanose and Pyranose 

In the 1920s, Haworth and his school (Haworth 1929) 
proposed the terms “furanose” and “pyranose” for the 
two forms, and introduced what became known as the 
“Haworth depiction” for writing structural formulas, 
a convention that was soon widely followed 
(Scheme 5). 

The Haworth projection formula presents 
a nominally planar ring with planes of substituents 
above and below the ring, whereas the Haworth con¬ 
formational formula depicts the actual shape of the 
molecule with tetrahedral bond angles (Scheme 6). 
The Mills formula also depicts the ring arbitrarily as 
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a-D-Glucofuranose a-D-Glucopyranose 

Carbohydrate Nomenclature, Scheme 4 Fischer projection 
of Qt-D-glucofuranose and a-D-glucopyranose 

a plane, with substituents respectively above and 
below the plane. All three types of representation 
remain widely in use. 
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Standard Abbreviations 

For graphical representation of the structures of complex 
oligosaccharides and polysaccharides and for encoding 
in electronic databases, the individual sugar residues are 
assigned three-letter abbreviations, for example Glc 
(glucose), Man, Gal, Rib, and Xyl, with appropriate 
modifiers, for instance Man/? (mannopyranose), Fru/ 
(fructofuranose), GlcA (glucuronic acid), and GalNAc 
(A-acetylgalactosamine). 

Inter-residue linkages are designated by position 
locants and connecting arrows, for example (I-D-Gal/?- 
(1 ^4)-a-D-Glc/? (lactose). For polysaccharides, the 
generic suffix “-an” is used rather than the prefix 
“poly” employed with synthetic polymers, thus mannan, 
xylan, carrageenan, xanthan, dextran, and the general 
term glycan. A few nonsystematic traditional names 
remain; for instance, cellulose, starch, inulin, glycogen, 
heparin, and pectin. 

Mono-, Oligo-, and Poly-Saccharides and 
Glycosides 

The borderline between oligosaccharides and polysac¬ 
charides cannot be drawn strictly; however, the 


term “oligosaccharide” is commonly used to refer 
to a defined structure as opposed to a polymer 
of unspecified length, or a homologous mixture. 
The generic term “carbohydrate” includes mono¬ 
saccharides, oligosaccharides, and polysaccharides as 
well as substances derived from monosaccharides by 
reduction of the carbonyl group (alditols), by oxidation 
of one or more terminal groups to carboxylic acids 
(aldonic and uronic acids), or by replacement of one 
or more hydroxyl group(s) by a hydrogen atom (deoxy 
sugars), an amino group (amino sugars), a thiol group, 
or other functional groups. It also includes numerous 
derivatives of these compounds formed by acylation, 
etherification, acetalation, and other reactions. The 
term “sugar” is frequently applied to monosaccharides 
and lower oligosaccharides. There are probably close 
to a million known compounds that conform to these 
definitions. 

Numerous natural products are glycosides, 
constituted of a sugar component (the glycon, fre¬ 
quently D-glucose, but many other sugars may be 
encountered) coupled through the anomeric position 
to an alcohol or phenol (the aglycon), a component 
often of considerable complexity. Examples include 
amygdalin (a cyanogenic glycoside), the anthocyanin 
plant pigments, the digitalis glycosides, and numerous 
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(Kdo) 



antibiotics. Many of these have wide applications in 
medicine. 

Among the higher-carbon monosaccharides, the 
most important examples are the nine-carbon amino 
sugar A-acetylneuraminic acid (5-acetamido-3,5- 
dideoxy-D-g/ycero-a-D-g(a'/actci-non-2-ulopyranosonic 
acid, NeuSAc; Scheme 7), the parent member of the 
family of sialic acids, and the eight-carbon sugar 
3-deoxy-D-»7a«no-oct-2-ulosonic acid, Kdo. The for¬ 
mer is widely distributed as a constituent of 
glycoconjugates in the animal kingdom, as well as in 
microorganisms. 

The system of nomenclature of carbohydrates has 
evolved over the years. Joint work by the lUPAC 
Commission on Nomenclature of Organic Chemistry 
and the lUPAC-IUB Commission on Biochemical 
Nomenclature has led to the current recommendations. 
These were published in 1996 by the two commissions 
(lUPAC-IUBMB Nomenclature of carbohydrates 
1997). The Web version (Nomenclature of carbohy¬ 
drates http://www.chem.qmul.ac.uk/iupac/2carb/) of 
that document includes revisions and corrections 
made since its initial publication. 

Summary 

The chemistry and biochemistry of carbohydrates in 
general is detailed in a four-volume series (Pigman 
and Horton 1972,1980), and a comprehensive treatment 
of polysaccharides can be found in the three volumes 
edited by Aspinall (1982, 1983, 1985). A multiauthored 
treatise on polysaccharides (Dumitriu 2005) covers 
a wide range of aspects of the structural diversity, 
biological relevance, and technological applications of 
polysaccharides. A volume in the series Comprehensive 
Natural Product Chemistry edited by Pinto (1999) 
provides extensive detail on the biosynthesis and 
biological role of carbohydrates. 


Cross-References 
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Synonyms 

Carbohydrate tensides; Carbohydrates; Saccharide 
surfactants 

Definition 

In general, surfactants are substances added to a liquid 
in order to increase its wetting or spreading properties 
due to their ability to lower the surface tension 
of a liquid and the interfacial tension between two 
liquids, or between a liquid and a solid. On dissolving 
surfactants in water at concentrations exceeding the 
critical micellar concentration, they form aggregates 
(micelles). In these micelles, the polar head group 
(hydrophilic part of surfactant) is in contact with 
water while the nonpolar hydrocarbon tail (hydropho¬ 
bic part) is inside the micelle. Carbohydrate surfactants 
are amphiphilic molecules possessing a carbohydrate 
represented by a mono- or oligosaccharide as a hydro¬ 
philic structural part and a long alkyl chain derived from 
fatty acid as a usual hydrophobic part. These two moi¬ 
eties can be linked either directly via a functional group 
(ether, ester, amino, amido, hydrazino, sulfo, etc.) 


to form surfactants with monocatenary structure. Alter¬ 
natively, two identical carbohydrate moieties can be 
connected by a spacer group (hydrophobic or hydro¬ 
philic) to form so called gemini (dimeric) surfactants. 
On the other hand, bolaform structure sugar surfactants 
(bola-amphiphiles) are composed of two or more 
saccharide units at both ends of one (single-chain 
bolaform) or two hydrophobic chains (pseudoma- 
crocyclic or macrocyclic compounds) (see Fig. 1). 
With regard to their low toxicity, favorable biocompat¬ 
ibility, notable dermatological properties, and fast bio¬ 
degradability, carbohydrate surfactants are used in 
several areas, such as personal care products and cos¬ 
metics (soaps, foams), food industry (emulsifiers, stabi¬ 
lizers), biology (purification of membranes proteins), 
glycobiology or immunology, and cleaning and wash¬ 
ing agents (detergents, dispersants). Some of them are 
potential drug delivery components in pharmacy. 
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Definition 

Carbon nanotubes as biomaterials mean that carbon 
nanotubes are with satisfactory biocompatibility and 
can be used to direct some therapeutic or diagnostic 
procedure in animal or human body. 

Basic Characteristics 

The nano-dimensionality of nature has logically given 
rise to the interest in using nanomaterials in biomedical 
field. Carbon nanotubes (CNTs), with a range of 
unique properties, appear suited as a biomaterial and 
is becoming a useful tool for tissue engineering 
(Yokoyama et al. 2005; Li et al. 2009). The results of 
a lot of biomedical investigations on CNTs have indi¬ 
cated that the use of CNTs for biomedical applications 
potentially rewards opportunities to develop the next 
generation of engineered biomaterials. CNTs are 
cylindrical carbon tubes possessing nanometer diame¬ 
ters with much longer lengths (4,100 nm) resulting in 
very large aspect ratios and can adsorb a large amount 
of proteins, which not only benefit cell attachment and 
proliferation, but also induce cell differentiation 
(Li et al. 2009). Also, CNTs have the ability in con¬ 
centrating related irons and making apatite crystallites 
at nanoscale levels form on their surface (Liao et al. 
2007). Eurthermore, electrical conductivity of CNTs 
brings them advantages of serving as biomaterials 
implanted in certain tissues, such as cardiac muscle 
and nerve tissue, which are specialized to propagate 
electrical potentials. 

However, for biological and biomedical applica¬ 
tions, the lack of solubility of CNTs in aqueous 
media has been a major technical problem. The recent 
expansion in methods to chemically modify and 
functionalize CNTs has made it possible to solubilize 
and disperse CNTs in water, thus opening the path for 


their facile manipulation and processing in physiolog¬ 
ical environments (Allen et al. 2008). 

Another noteworthy point is that possible toxic 
effects of CNTs in some cases have become an issue 
of concern for human health. Fortunately, cytotoxicity 
of CNTs can be mitigated by chemical functiona¬ 
lization if they do have the cytotoxicity, which is 
because the main causes of toxicity are the presence 
of residual metal catalyst and the insolubility of the 
material and the functionalized CNTs are instead 
highly soluble in aqueous biological media and exhibit 
notably reduced cellular toxicity in vitro. Furthermore, 
adsorption or attachment of various molecules, growth 
factors, or antigens on CNTs, which can subsequently 
have positive effect on the desired cellular functions or 
therapeutic efficacy, can be made possible by surface 
functionalization. 

Functionalized CNTs for the Delivery of 
Genes and Drugs 

Many different functional groups can be placed at the 
tips and around the side walls of CNTs to make 
functionalized CNTs (f-CNTs). Most of f-CNTs are 
highly soluble in water, which allows the formation of 
supramolecular complexes with biologically relevant 
macromolecules and substrates, based on electrostatic 
interactions. A series of amino acids, fluorescent 
probes, and bioactive peptides can be covalently linked 
to CNTs through amino groups. Most importantly, the 
macromolecular and cationic nature of the f-CNTs can 
help to form supramolecular complexes with plasmid 
DNA. In addition, f-CNTs have high affinity for 
ssDNA, and are functionalized covalently with DNA, 
allowing the formation of supramolecular nanotube- 
based structures by DNA-DNA interactions. The 
plasmid DNA can be condensed by forming toroidal 
clusters, or globular and supercoiled structures on the 
bundles of different diameters of f-CNTs. Gene 
expression offered by the complexes between plasmid 
DNA with f-CNTs was five to ten times higher than 
that of DNA alone (Pantarotto et al. 2004). 

Furthermore, the aqueous solubility and cationic 
surface character of f-CNTs 1 render them potentially 
novel delivery vehicles. Most f-CNTs can be rapidly 
translocated to the cell nucleus. Moreover, f-CNTs are 
instead highly soluble in aqueous biological media and 
exhibit notably reduced cellular toxicity in vitro. 
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Therefore, since functionalized CNTs penetrate easily 
into cells with reduced toxicity, they can be considered 
very interesting and innovative carriers for drug 
delivery. 

CNT-Based Biosensors 

CNTs have been found to be paving the way to new 
and improved sensing devices, in general, and electro¬ 
chemical biosensors, in particular. The diagnosis and 
management of diabetes mellitus requires a tight mon¬ 
itoring of blood glucose levels. The electrochemical 
biosensors for glucose play a leading role in this case. 
The dramatic decrease in the overpotential of hydro¬ 
gen peroxide as well as the direct electron transfer of 
glucose oxidase (GOx) observed at CNT-modified 
electrodes indicate great promise for the biosensing 
of glucose. CNT/Nafion/GOx-coated electrodes, cou¬ 
pling the electrocatalytic detection of hydrogen perox¬ 
ide with the permselectivity of Nation offered a highly 
selective low-potential biosensing of glucose (Wang 
et al. 2003). Moreover, controlled attachment of GOx to 
the CNT sidewall can be achieved through a linking 
molecule. Such immobilization leads to decreased con¬ 
ductivity. The enzyme-coated tube acted as a reversible 
pH sensor and displayed increase in the conductance 
upon adding glucose (while applying a gate voltage 
between a standard reference electrode and the semi¬ 
conducting CNTs). Also, glucose biosensors based on 
CNTs coated with inherently conducting polymer can 
be achieved This will result in the formation of 
a bioactive conducting coaxial sheath around the indi¬ 
vidually aligned CNT, and lead to an attractive low- 
potential detection of the liberated hydrogen peroxide. 

CNT-Based Coatings for Biomedical 
Applications 

The good biocompatibility, in particular, the abilities of 
CNT coatings in inducing cellular functions in vitro, 
stated above, make it easy to understand that CNT 
coatings on the metal biomaterials will improve their 
related properties. Furthermore, the CNTs coating were 
thought to promote the scaffolds osteoconductivity and 
mineralization potential as well as provide not only 
a nanostructured surface but also an electric conductiv¬ 
ity function. Most used technique for CNT-based 


coatings is electrophoretic deposition (EPD). For CNT 
coatings on other material, the CNTs suspension is 
usually prepared by adding to an aqueous solution of 
CNTs, Triton X-100 as an ionic surfactant and iodine 
99.999% as a charger. After centrifugation, the super¬ 
natant suspension was carefully extracted from the cen¬ 
trifuge tube and the remaining suspension was placed in 
a recipient for EPD. CNTs could be coated on the 
metals with excellent packing density without any mac¬ 
roscopic porosity within the film. The release of metal 
ions from metallic implants can be significantly 
reduced. The good bonding strength between the 
CNT-based coatings and the metal substrates could be 
achieved. Addition of CNTs can reinforce the compos¬ 
ite coatings and the CNTs-reinforced coatings are 
expected to be promising for high load-bearing ortho¬ 
pedic implants, such as hip, knee, and shoulder joints. 
CNTs could nurture precipitation and mineralization of 
HA over its surface in CNT-HA coatings. Eurthermore, 
based on their unique properties, CNT scan attach to the 
surfaces of dentin and cementum but not to the surface 
of enamel, suggesting that the teeth coated with CNTs 
should be a possible candidate for dental materials. In 
addition, CNTs can be coated on polymers, bioglasses, 
collagen, etc., to prepare bioactive, electrically conduc¬ 
tive 3D scaffolds for tissue engineering. 

CNT-Reinforced Biomaterials 

Biomaterials should provide adequate mechanical 
strength during the initial healing state; thereby keep 
their structure, which should provide an ideal environ¬ 
ment for the migration and proliferation of cells. Addi¬ 
tionally, satisfactory mechanical properties are needed 
to ensure that, during surgical operations, the bioma¬ 
terials can be properly handled and sutured. In recent 
years, CNTs have become popular for reinforcing 
metals and even polymers. These nanotubes show 
very high mechanical properties (tensile strengths of 
up to 150 GPa) and flexibility (elongations at break 
of up to 15%) as well as a high Young’s modulus (up 
to 1,800 GPa). Their small size of CNTs can reduce the 
possibility of thermal mismatch-induced dislocation 
generation at the matrix/CNT interface. Purification 
and surface modification or functionalization of 
CNTs are the most common strategies used to increase 
the interactions between CNTs and host matrix, 
improve the ability to disperse CNTs within the matrix. 
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decrease filler self-aggregation, optimize alignment 

within the host matrix, and thus improve load transfer. 

Cross-References 

► Extracellular Matrix Components 

► Nanoparticles for Drug and Gene Delivery 
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Physics Center, Donostia-San Sebastian, Euskadi, 
Spain 

Synonyms 

NEB (nudged elastic band); Self-Consistent Charge 
Density Function Tight Binding (SCC-DFTB) 

Definition 

Quantum Mechanics (QM) methods describe the 
electronic structure of molecules by solving the 
Schrddinger equation. 


Molecular Mechanics (MM) methods describe the 
molecular structure without explicit description of 
electrons, using an empirical force held. 

► QM/MM methods combine a quantum mechani¬ 
cal method to characterize the active site of a molecular 
(e.g., enzyme) system with a molecular mechanical 
method to characterize the remaining part of the system. 

An all-atom force held is a molecular mechanics 
potential function, i.e., a set of parameters and functions 
to determine the potential energy of a molecular system, 
in which all atoms are treated explicitly. 

A “cluster model” is a truncated model of a large 
molecular system where only the most relevant atoms 
of the system under study (e.g., catalytic residues, 
substrate, cofactors) are included in the calculation. 

Biological reactions, which otherwise would 
require a very long time to occur, are efficiently accel¬ 
erated by enzymes, which catalyze these reactions with 
very precise mechanisms. Experimental techniques 
such as X-ray crystallography, nuclear magnetic reso¬ 
nance (NMR) studies, and site-directed mutagenesis 
allow for indirect study of the reaction mechanisms. 
However, each of these methods has inherent 
limitations that prevent them from giving a full under¬ 
standing of the nature of reaction in a given enzymatic 
system. In this context, the contribution of computa¬ 
tional chemistry is crucial as it can provide valuable 
complementary information into the system at the 
atomic level. Computations allow the characterization 
of the potential energy surface of the reaction, includ¬ 
ing intermediates, products, and more importantly 
transition states, which are particularly elusive to 
experimental techniques (Lonsdale et al. 2010). 

Here, we review the contribution made by compu¬ 
tational chemistry to understanding of one of 
the most studied ► zinc-containing metalloproteases, 
carboxypeptidase A. The crystal structure of CPA was 
one of the hrst metalloproteins solved, and 
since then CPA has been a prototype protein 
of zinc-containing enzymes. These enzymes 
(angiotensin converting enzyme (ACE) is one of the 
most relevant examples) are involved in several 
diseases. 

CPA is found in the digestive system and catalyzes 
the scission of a C-terminal amino acid residue with an 
aromatic or branched aliphatic side chain from 
a peptide or ester substrate (active site shown in 
Fig. 1). Two main mechanisms have been suggested 
for the reaction catalyzed by this enzyme: in the 
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“water-promoted” mechanism, the zinc-bound water 
molecule is the nucleophile, activated by Glu270 
(acting as a base) for its nucleophilic attack. Since 
Glu270 acts as an acid to provide the necessary proton 
to the peptide bond (or ester bond) in the second step, 
this reaction pathway is also known as general 
acid-general base (GAGB) mechanism. Alternatively, 
in the “anhydride” mechanism, Glu270 directly attacks 
the substrate leading to an acyl-enzyme intermediate. 

Vardi-Kilshtain and Warshel employed the ► empir¬ 
ical valence bond (EVB) approach (Vardi-Kilshtain and 
Warshel 2009), in which the system is described by 
molecular mechanics structures representing resonance 
states corresponding to valence bond structures. 
These basis states are mixed to describe the reactant, 
intermediates, transition, and product states. The group 
of Guo used the self-consistent charge density 
functional tight binding (SCC-DFTB) ► semiempirical 
method to describe the QM region and the 
CHARMM22 all-atom force field to characterize the 
MM region (Xu and Guo 2009) in ► QM/MM 
calculations on CPA. The authors validated the 
QM/MM model by comparing the geometries and 
potential energies on a cluster model at the SCC-DFTB 
and ► density functional theory (DFT) level. In both 
studies, the reaction of a peptide substrate was investi¬ 
gated, and the anhydride pathway was ruled out, 
because the barrier of this pathway was too high in 
one case (Vardi-Kilshtain and Warshel 2009) and the 
acyl intermediate was unstable in the other (Xu and Guo 


2009). Therefore, these studies focused on the GAGB 
mechanism, giving free energy barriers in the range of 
13-18 kcal/mol (depending on the substrate and method 
employed). These values are in very good agreement 
with the available experimental data. Moreover, in both 
studies the free energy surface computed for the 
reaction catalyzed by wild-type CPA was compared 
with the R127A mutant. The mutation of Argl27 led 
to higher activation barriers, confirming the catalytic 
role of this amino acid in the reaction. However, there 
are some differences between the results of these 
studies. While in the work of Xu and Guo the activation 
of the water molecule and the nucleophilic attack are 
concerted in a single step, Vardi-Kilshtain and Warshel 
modeled them as separate steps. Nonetheless, these 
studies provide valuable information about the role 
played by the zinc ion and the amino acids of CPA 
involved in the reaction, especially Argl27. The 
simulations indicate that the zinc ion does not interact 
with the substrate carbonyl group in the reactant but 
instead with the water molecule in order to facilitate 
its deprotonation. Then, the zinc cation stabilizes 
the oxyanion intermediate. Argl27 interacts with the 
carboxylate end of the substrate, but also with the 
scissile peptide carbonyl oxygen atom in the reactant 
as well as in the intermediate. These results indicate no 
significant polarization of the carbonyl group by the 
cation in the reactant complex. Vardi-Kilshtain and 
Warshel interpreted the role of the cation in a slightly 
different way, as they ascribe the catalytic role of 
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the zinc ion to stabilizing the intermediate rather 
than the initial reactant, i.e., stabilizing the hydroxide 
ion and the C-0 oxyanion system. 

A similar conclusion was reported by Szeto et al. 
(2009) in their study on the rate-limiting step of the 
GAGB mechanism catalyzed by CPA, formation of the 
tetrahedral intermediate. A QM/MM scheme was 
employed to characterize the potential energy surface, 
in which a DFT functional was employed to describe 
the QM part. This high level of theory, combined with 
the nudged elastic band (NEB) method, should give an 
accurate description of the reaction pathway, 
especially the transition states. In the initial reactant, 
the substrate carbonyl group interacted with the zinc 
cation. The energy barrier computed (14.6 kcal/mol) 
was in very good agreement with the experimental 
data. These results therefore indicate that the interac¬ 
tion mode of the substrate with the zinc ion in the 
reactant has little effect on the reactivity, and confirm 
that the main role of the cation is to stabilize the 
tetrahedral intermediate oxyanion. 

The group of Guo (Wu et al. 2010) also analyzed the 
reaction of ester substrates catalyzed by CPA. They 
hypothesized that the reason the anhydride mechanism 
does not operate for peptide substrates is the presence 
of a hydrogen bond between Glu270 and the peptide 
bond N atom of the substrate, which reduces the nucle- 
ophilicity of Glu270. The absence of this NH group in 
ester substrates should then make the anhydride 
mechanism feasible. In fact, kinetics experiments indi¬ 
cate that peptide and ester substrates may follow 
different hydrolysis mechanisms (Auld et al. 1984). 
In agreement with this hypothesis, the computed free 
energy profiles showed that the formation of the 
acyl intermediate in the anhydride pathway for an 
ester has a much lower free energy barrier than 
the water-promoted mechanism. However, the 
deacylation step was calculated to be energetically 
much more demanding. These results led the authors 
to propose that at low temperatures, the anhydride 
mechanism can take place for esters, but due to the 
high barrier of the deacylation step, once the acyl 
intermediate is formed the system reverts to the initial 
Michaelis complex. At this stage, once a water 
molecule is introduced into the zinc ion solvation 
shell, CPA is suggested to catalyze the cleavage of 
the ester bond following the GAGB mechanism. 

A particular feature of the reaction catalyzed by 
CPA is that the above-mentioned reaction mechanism 


is apparently followed by oligopeptide substrates, but 
not by dipeptides. It seems that the Gly-Tyr (GY) 
dipeptide is hydrolyzed at low pH but it is an inhibitor 
at higher pH values. This apparent discrepancy was 
investigated by the group of Guo (Wu et al. 2011) 
using the QM/MM scheme employed in their previous 
studies. The authors characterized the anhydride 
mechanism and reported that the protonation state of 
the substrate’s N-terminus is a key factor; protonated 
(and therefore bearing a positive charge) at low pH 
but neutral at more basic conditions. The QM/MM 
molecular dynamics simulations showed that the 
protonation state of the N-terminus determined the 
interaction mode of the substrate with the protein and 
the zinc ion. Moreover, no minimum was located in the 
free energy profile characterized with the charged 
N-terminus. On the other hand, a stable anhydride 
intermediate was found when the N-terminal was in 
the neutral protonation state. The second step of the 
reaction, the deacylation from the anhydride interme¬ 
diate, was determined to be the rate-limiting step. 

Conclusions 

The computational work reviewed here provides better 
understanding of the intriguing chemistry involved 
in the catalytic reaction of Carboxypeptidase A. Of the 
two mechanisms proposed for the reaction catalyzed by 
CPA, the computational works point to a GAGB-type 
mechanism to hydrolyze oligopeptide substrates. In this 
mechanism, Glu270 increases the nucleophility of the 
water molecule and Argl27 stabilizes the intermediate. 
In addition, calculations have analyzed the role played 
by the zinc ion and found its catalytic role to be mainly 
stabilizing the negative charge of the oxyanion interme¬ 
diate. The same mechanism is proposed for ester sub¬ 
strates. On the other hand, a recent study indicates that 
the anhydride mechanism is preferred when the substrate 
is a dipeptide, but only at high pH values, because the 
same substrate inhibits the reaction at low pH due to the 
protonation of its N-terminal at more acidic conditions. 

Cross-References 

► Computational Enzymology 

► Empirical Valence Bond Methods 

► QM/MM Methods 
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Vibrational Spectra 

► Semiempirical Quantum Mechanical Methods 
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Car-Parrinello Molecular Dynamics 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

CPMD 


Definition 

Car-Parrinello molecular dynamics is the ah initio 
molecular dynamics technique first proposed by Car 


and Parrinello (Physical Review Letters, 1985). The 
method combines molecular dynamics simulation and 
density-functional theory: It integrates fictitious wave 
function coefficient dynamics with classical molecular 
dynamics, with the electronic wavefunctions included 
as dynamical variables. An initial converged 
wavefunction is determined, after which the orbitals 
evolve simultaneously with the positional changes of 
the nuclei. The orbital parameters are included in the 
dynamics as variables with fictitious masses. Nuclear 
forces are not exactly correct in the dynamics as the 
electronic wavefunction is not converged, but this 
error is controlled by an appropriate choice of the 
fictitious masses. Car-Parrinello molecular dynamics 
simulations are extremely computationally expensive, 
requiring large amounts of supercomputing time. 
Because the electronic structure methods are able to 
describe bond breaking and forming reactions, the 
method allows, in principle, the direct simulation of 
chemical reactions. Additionally, other limitations of 
molecular mechanics force fields are overcome by the 
method; for example, electronic polarization effects 
are included naturally. Due to the computational 
expense, the practical limitations of the method are 
the size of the systems that can be simulated, and 
the timescale of the simulations. Combination with 
QM/MM approaches is therefore attractive and has 
been applied widely to study enzyme reactions. 

Cross-References 
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► QM/MM Methods 


Carr-Purcell-Meiboom-Gill Spin Echo 

► CPMG 

► Relaxation Dispersion 


Catalases and Catalase-Peroxidases 

► Electron Transfer in Catalases and Catalase- 
Peroxidases 







Cell Sensing with Raman Spectroscopy 


245 


c 


CcP 

► Cytochrome c Peroxidase - Computational Studies 


Cell Sensing with Raman Spectroscopy 

Colin J. Campbell 

EaStCHEM School of Chemistry & Division of 
Pathway Medicine, The University of Edinburgh, 
Edinburgh, UK 

Synonyms 

Raman spectroscopy; Sensor; SERS 

Definition 

Raman spectroscopy is a technique which allows 
vibrational modes of individual molecules to be probed 
optically and is thus rich in chemical information. In 
Raman spectroscopy, the difference in frequency 
between incident photons and scattered photons is 
measured and reported on. 

Basic Characteristics 

The utility of Raman spectroscopy as a tool for the 
study of cell biology is becoming increasingly well 
recognized. The strengths of Raman spectroscopy 
which make it useful for studies of cells are: 

• Most molecules have Raman-active vibrational 
modes and thus generate a Raman spectrum. 

• Raman spectroscopy is sensitive enough to pick up 
small changes in the structure of molecules. 

• Raman spectroscopy does not require a specific exci¬ 
tation wavelength and can therefore be tuned in order 
to be minimally invasive to the cell (e.g., using near 
Infra-Red radiation). 

• The intensity of Raman spectra from molecules 
close to the surface of noble metal nanoparticles 
can be amplified by up to 14 orders. This technique 
is known as surface enhanced Raman spectroscopy 
(SERS) (Lombardi and Birke 2009) and noble 
metal nanoparticles can be introduced into cells 
without causing any damage. 


As a result of the high intensity spectra generated, 
SERS is a very useful technique for measuring Raman 
spectra inside cells. SERS has been used for cellular 
imaging in both bacterial and eukaryotic cells and the 
spectra typically show contributions from a range of 
biomolecules including nucleic acids and proteins (Jarvis 
and Goodacre 2008; Ochsenkuhn et al. 2009; 
Kneipp et al. 2002). However measuring a spectrum of 
everything in the cell generates complicated spectra 
whose interpretation is challenging and sensing often 
benefits from a more focused approach. Generally 
a sensor is a system which detects a particular stimulus 
by changing its properties in a measurable manner. SERS 
sensors typically consist of a reporter molecule which 
changes its conformation in response to a stimulus. The 
change in conformation can be detected via a change in 
the reporter’s Raman spectrum. By coating a metal 
nanoparticle with reporter molecules, the reporter’s 
Raman spectrum is amplified, meaning that SERS 
spectra can be measured from single particles. These 
particles can be considered as sensors that can be 
delivered into cells to measure physiologically interesting 
events in single live cells, in real time (Fig. 1). 

The first type of SERS sensor developed in this 
way was designed for the measurement of pH in living 
cells (Talley et al. 2004; Bishnoi et al. 2006). For the 
measurement of pH, reporter molecules should incor¬ 
porate a labile proton which is sensitive to pH changes 
and a thiol group for attachment to the metal nanopar¬ 
ticle; molecules such as mercaptobenzoic acid or 
thioaniline are ideal candidates and have been used 
(Wang et al. 2008). Since such reporter molecules can 
exist either as a deprotonated or protonated molecule, 
the ratio of protonated to deprotonated molecules can 
be used to calculate pH via the Henderson Hasselbach 
equation. The range of this type of sensor is determined 
by the pKa of the reporter molecule attached to the 
particle and since this varies between molecules, 
a larger range of pHs could be probed by using different 
reporters. This type of pH sensor has been used to 
investigate pH inside bacteria and also to investigate 
pH changes inside endosomes as they develop within 
cells (Wang et al. 2008; Kneipp et al. 2010). 

More recently, it has been reported that sensors can 
be designed to measure redox potentials inside cells 
(Auchinvole et al. 2012). In this case, the reporter 
molecule can exist in either an oxidized or reduced 
state and should incorporate a thiol group for attach¬ 
ment to a metal nanoparticle. Since the reporters are 
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Cell Sensing with Raman 
Spectroscopy, 

Fig. 1 Schematic for 
intracellular SERS sensing 
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designed such that the Raman spectra of the oxidized 
and reduced forms are distinct, the redox potential of 
the cellular environment can be calculated via the 
Nernst equation. These sensors have been delivered 
to cells and have been used to measure changes in 
intracellular redox potential which result either from 
exposure to reactive oxygen species or as a result of 
programmed cell death (apoptosis). 

The previous two examples demonstrate the utility of 
SERS sensors which incorporate a reporter whose struc¬ 
ture changes in response to a change in pH or a change in 
redox potential and whose use has been demonstrated in 
live cells. The following two examples of SERS sensors 
demonstrate the potential for new families of SERS 
sensors to probe biomolecular interactions in cells, 
although their use has not yet been reported in cells. 
Recent work has demonstrated that SERS sensors can 
be developed to report on changes in protein concentra¬ 
tion using aptamer reporters (short nucleic acid 
sequences designed to bind specific proteins) 
(Ochsenkiihn and Campbell 2010). In this case the 
aptamer changes structure on binding its cognate protein 
and the SERS spectrum of the sensor changes as a result 
of this conformational change. Suitably designed 
aptamers could be used to measure changes in protein 
concentration within cells. SERS sensors incorporating 
peptide reporters have also been developed. In one 
example the peptide was chosen because it is a specihc 
substrate for a protease whose digestion is important in 
development of an autoimmune disease. Not only does 
the SERS spectrum of these sensors change when 
a particular protease (Cathepsin D) binds to the peptide, 
but the spectral features corresponding to the peptide 
decrease in intensity as a result of digestion by the 


protease allowing the measurement of protease activity 
(Ochsenkuhn et al. 201 1). 

SERS sensing using reporters whose structure 
change in response to a given stimulus are currently 
being used in a range of biological studies and their use 
is likely to become more widespread as more reporter 
molecules are developed. 

Cross-References 

► Raman Spectroscopy and Microscopy of Cells and 
Tissues 
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Cell-Free Protein Production 

Y aeta Endo 

Cell-Free Science and Technology Research Center, 
Ehime University, Matsuyama, Japan 


Synonyms 

Cell-free expression 


Definition 

Cell-free translation means protein production in a test 
tube using an extract prepared from a living organism. 
It owes its birth to T. Winnick who discovered, more 
than 60 years ago, that homogenates of embryonic and 
tumor tissues had the ability to incorporate amino acids 
into protein. This pioneering work led to the develop¬ 
ment of cell-free expression systems from various 
cells and tissues. The most notable among them was 
the E. coli cell-free system that supported newly born 
life science. It was used in tracer experiments by 


M. W. Nirenberg for decoding genetic codons and 
by G. Zubay for understanding the mechanisms of 
translation. Until the start of this century, however, 
those cell-free systems could not be used for protein 
production because of their instability, which limited 
the duration of translation to less than an hour and 
the resultant yield to trace level detectable only 
with radioactive labels. This shortcoming was partially 
overcome by the principle of continuous flow transla¬ 
tion applied by A. S. Spirin to the E. coli system. This 
principle was adapted by Y. Endo to create a novel 
cell-free system using a robust enzymatic translation 
solution he made from wheat embryos, which main¬ 
tains its stability for weeks. The new wheat system 
proved to be capable of producing as much protein as 
living cells, nearly 1 g per 100 ml of the translation 
solution. Among new cell-free systems, the E. coli and 
wheat systems are characterized by their high produc¬ 
tivity and have provided basic means to produce pro¬ 
tein. The new wheat system has proved to excel in 
producing high-quality multi-domain proteins com¬ 
monly found in eukaryotes. This is attributed to its 
eukaryotic nature of co-translational folding mecha¬ 
nism. The wheat cell-free system has been used for 
sample preparation for high throughput functional and 
structural genomics. Being made from embryos in 
hibernation, the wheat system has very low amino- 
acid metabolism enzyme activity, which suppresses 
isotope label scrambling among amino acids and facil¬ 
itates assignment of amino acid residues in NMR 
HSQC spectra. In addition, the new wheat cell-free 
method has overcome the hurdles of life ethic, 
biohazards, and costs, and has demonstrated its ability 
to produce protein complexes and membrane proteins 
that are difficult to express in living cells. Most of 
known posttranslational modifications including 
ubiquitination can be done in the new wheat system, 
but glycosylation remains to be a challenge. Producing 
functional proteins in cell-free systems requires certain 
key techniques, a generally applicable one being per¬ 
form translation reaction in the presence of intrinsic 
factors such as prosthetic groups or partner molecules. 
The new cell-free systems, one using E. coli cells and 
the other wheat embryos, are expected to become 
viable alternatives for protein production. To facilitate 
sample preparation, fully automatic protein synthe¬ 
sizers have been developed and are commercially 
available. Lately, a reconstructed E. coli cell-free 
system was developed by T. Ueda. It consists of 
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purified components and has been used for the analysis 
of molecular mechanisms of translation and other 
studies in the field of synthetic biology. 


Cellulose 

Alfred D. French 

US Department of Agriculture, Southern Regional 
Research Center, Agricultural Research Service, 

New Orleans, LA, USA 

Synonyms 

P-l,4-glucan; Carbohydrates; Polysaccharides 

Introduction 

Cellulose, the saccharide of cell walls, is the most 
prevalent biomolecule. Besides occurrence in plant 
and algal cell walls, this P-l,4-glucan is also found 
external to cells in bacterial pellicles and in tunics of 
a simple marine animal. Most of the billions of tons 
(Perez and Samain 2010) created each year decom¬ 
poses, but some is harvested. In humans, dietary cel¬ 
lulose is not digested, but animals such as cattle rely on 
bacteria in their digestive systems to break cellulose 
into glucose, providing energy. 

Cellulose is often used in the fibrous form. Fibers 
from various plants (cotton, kapok, milkweed) are seed 
hairs (trichomes) that are complete cells composed 
mostly of cellulose. Domestic cotton fibers are about 
30 mm long. Bast hbers - from under the outer bark of 
stems of plants such as flax (linen), jute, and ramie - 
are obtained initially in the form of “technical fibers.” 
Technical fibers are bundles of “ultimate fibers” that 
are single cells glued together by pectin, pentosans, 
and lignin. The amount of the non-cellulosic material 
removed by “retting” to give finer, stronger fibers 
depends on the final use. Cotton and linen are used 
directly in textiles, whereas lower-cost cellulose is 
obtained from wood by pulping and used for paper 
making and as a dissolving feedstock for making cel¬ 
lophane film and rayon - a regenerated fibrous form. 

Ansleme Payen discovered cellulose in 1838 and 
determined its empirical formula and breakdown into 


glucose, but some 90 years elapsed before the chemical 
and basic physical structures were understood (French 
2000; Zugenmaier 2008). As biosynthesized, and after 
many types of processing, cellulose gives diffraction 
patterns indicating crystalline material, but some 
details are still being resolved. Unraveling its three- 
dimensional structure has been difficult because its 
crystallites are small, fewer than 100 molecules in 
ramie. Noncrystalline cellulose is usually present, as 
are other molecules. 

Crystallites 

Although molecular lengths may exceed 10,000 glu¬ 
cose residues, ramie crystallites (for example) are only 
about 300 glucose residues long (Nishiyama et al. 
2003a). Other sources will give somewhat different 
values. Besides those neutron scattering studies, crys¬ 
tallite length has been assessed by the “leveling off 
degree of polymerization.” When cellulose fibers are 
subjected to acid hydrolysis, the molecular weight of 
the remaining solid drops rapidly at first and then is 
nearly constant until the fibers are completely hydro¬ 
lyzed (Krassig 1996). Other hydrolysis procedures 
result in nanoparticles, the sizes of which differ 
depending on the source (Flabibi et al. 2010). With 
the extreme difference between molecular length and 
crystallite size, a molecule must pass through numer¬ 
ous crystallites with noncrystalline segments in 
between. Although the “fringed micelle” and “fringed 
fibril” models are traditional characterizations 
of moderately crystalline polymers where molecules 
participate in different crystallites, the 
periodic noncrystalline zones of cellulose may instead 
be interruptions of crystallinity that quickly resumes, 
still involving most of the same molecules in 
a microfibril. 

Polymorphy 

Cellulose occurs naturally in four crystalline poly¬ 
morphs called la. Ip, II, and IV. la and ip occur 
together, especially in algal cell walls, with ip domi¬ 
nant in the higher plants that constitute commercial 
sources of cellulose. (Cellulose III has not been dis¬ 
covered as a native form.) The purest la comes from 
the alga Glaucocystis nostochinearum and tunicate 
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Cellulose, Fig. 1 Two 

cellulose chain fragments of 
six glucose residues, passing 
through the monoclinic 
cellulose 1(3 unit cell 
(hydrogen atoms not shown). 
Coordinates were taken from 
Nishiyama et al. (2002). Left: 
A view down the c-axis, with 
the flat ribbon shape evident. 
Right. A view nearly 
perpendicular to the c-axis. 
Roughly one-fourth of the 
illustrated corner chain is 
within the unit cell, but the 
remaining three-fourths would 
be allocated at the other three 
comers, one quarter inside 
each other comer. The center 
chain is displaced 2.72 A 
along the c-axis from the 
comer chains. The unit cell 
dimensions are shown (a and [5 
angles are 90.0°), along with 
the origin (o) and the a-, b-, 
and c-axes. Atoms of a glucose 
residue in each chain are 
labeled 



cellulose from the marine animal Halocynthia roretzi 
is the best 1(3. In retrospect, the need to analyze cellu¬ 
lose I in terms of two different crystalline forms was 
apparent from Infrared studies, but seminal Nuclear 
Magnetic Resonance work by vander Hart and Atalla 
(1984) clarified the situation. Observation by electron 
diffraction of two distinct unit cells in the same fiber 
furthered understanding (Sugiyama et al. 1991). 

Native cellulose II is produced by some mutant 
bacteria and is created by other bacteria in cold condi¬ 
tions. With molecular axes perpendicular to the broad 
exuded ribbon, bacterial cellulose II has morphology 
completely different from the fibrils of bacterial cellu¬ 
lose I. The structure of cellulose IV remains unclear. 
Material in parenchymal or primary cell-wall struc¬ 
tures, identified as IV, could just be smaller crystallites 
of ip (Wada et al. 2004a). IV was originally identified 
as the result of heating cellulose to 240° C in glycerol. 
That could have caused a phase change, but it could 
also be small crystals of ip. Cellulose IV structures 


from heating in glycerol depend on whether cellulose 
I or II was heated, with designations IVj and IVu. 

The recent determinations of cellulose crystal struc¬ 
ture used samples that are purer and more crystalline 
than the ramie fibers often used in early work. Also, 
synchrotron X-ray and neutron diffraction are used 
now. Synchrotron radiation produces more X-ray 
data than laboratory generators. When hydroxyl hydro¬ 
gen atoms are replaced by deuterium, neutron diffrac¬ 
tion can define the hydrogen bonding scheme. 

Cellulose ip (Nishiyama et al. 2002) is shown in 
Fig. 1. Two parallel chains, designated corner and 
center, pass through the unit cell. Cellulose ip differs 
from the less stable cellulose lot (Nishiyama et al. 
2003b) mainly in chain packing. Both have hydro- 
gen-bonded sheets, illustrated in Fig. 2. Each chain 
has intramolecular hydrogen bonds from 03 to 05 
and from 02 to 06. 06 atoms take the otherwise 
unusual tg {trans to 05 and gauche to C4) orientation 
of 06, although a similar orientation and hydrogen 
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Cellulose, Fig. 2 Sheet structure for comer chains in cellulose 
ip. Three cellohexaose segments are shown, along with the unit 
cell. O-H.. .0 hydrogen bonds are shown. In the sheets formed 
by center chain in ip, an added weak hydrogen bond exists. 
Another hydrogen bond scheme, corresponding to a minor dis¬ 
ordered component, also exists. Similar sheets are found for 
cellulose la. Intra-chain bonds include 03-H.. .05 and 
a linkage between 06 and 02, with either being the donor or 
the acceptor. Interchain hydrogen bonds are as shown here, with 
06 donating to 03, or with a bifurcated bond to both 03 and 02. 
Alternatively, 02-H can donate to 06 of the adjacent chain. 
Sheets pass through the comers and centers of the unit cell 
parallel to the h-c plane. Conventional O-H.. .0 hydrogen 
bonds do not connect the sheets, but C-H.. .0 bonds have been 
reported (Nishiyama et al. 2003b) 

bond have now been observed in a small-molecule 
crystal (Yoneda et al. 2008). Figure 2 shows 
intermolecular links between 06-H and 03. In 1(3, 
molecules in these sheets constitute the comers and 
centers of the monoclinic unit cell for a net of two 
chains or four glucose residues. The center sheet is 
displaced 2.72 A upward along the molecular axis 
from the comer sheets. In triclinic la, the 2.81 A shift 
or stagger of adjacent sheets is continuous (up, up, up, 
or 0, 2.81, 5.62, 8.43 A ...) (Yoneda et al. 2008) rather 
than up-back-up-back (0, 2.72, 0, 2.72 A ...), the 
scheme in 1(3. The packing of la and 1(3 is parallel-up, 
i.e.. Cl atoms of all residues all have greater c-axis 


coordinate values than the C4 atoms in the same resi¬ 
due. The inter-sheet relationships differ for parallel-up 
and parallel-down packing, but there is no biological 
signihcance to the “up” or “down” characterization 
that depends on the particular convention used to 
assign crystallographic axes. On the other hand, the 
distinction between parallel and antiparallel packing is 
critical for understanding the mechanisms of biosyn¬ 
thesis. In antiparallel packing, such as found for cellu¬ 
lose 11, both chains in the unit cell are parallel to the 
c-axis, but one will be “up” with Cl atoms of a given 
residue having more positive c-axis coordinates than 
the C4 atoms of the same residue. The other chain will 
be “pointed” in the opposite direction so that its Cl 
coordinates are more negative than the C4 coordinates. 

Native examples of cellulose 11, the lowest-energy 
form, are rare. Usually it is created in the laboratory. 
Cellulose II always results when cellulose is returned 
to the solid state from solution. More perplexing is 
cellulose II created from a solid state substantially 
swollen with concentrated aqueous NaOH (merceriza- 
tion). (The widely used industrial mercerization uses 
warmer, dilute NaOH and results in little if any crystal 
structure conversion.) The latter case has been 
perplexing because cellulose II has antiparallel chains 
(Langan et al. 2001) while cellulose 1 has parallel 
chains. Besides several fiber diffraction results that 
show cellulose II is antiparallel, inference can be 
drawn from powder diffraction studies. Cellulose II, 
methyl cellotrioside, and cellotetraose ethanol solvate 
give similar powder diffraction patterns (Fig. 3). Sin¬ 
gle-crystal studies of the tri- and tetraose unambiguously 
show antiparallel packing, supporting the antiparallel 
results from both rayon and mercerized linen fibers. 
How could half of the molecules in a cotton fiber, with 
a length of thousands of glucose residues, appear to 
suddenly reverse direction? Is it reasonable for bacteria 
to make parallel chain cellulose I at room temperature 
and then antiparallel II at low temperature? 

To meet the need for an explanation of the conver¬ 
sion during mercerization, Okano and Sarko proposed 
“interdigitation” or “interdiffusion” to explain the I ^ 
II conversion with NaOH (Wertz et al. 2010). Initially, 
crystallites in higher plant fibers are individually com¬ 
posed of parallel chains, but neighboring crystallites are 
antiparallel. The chains in the swollen structures are 
thought to diffuse and establish new crystals composed 
of antiparallel chains. These new crystals are complexes 
of NaOH, water, and cellulose, and with rinsing. 
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Cellulose, Fig. 3 Simulated diffraction patterns for cellulose II 
(green), methyl cellotrioside ethanol solvate (blue), and 
cellotetraose hydrate crystal structures (red). Created with the 
Mercury software from the Cambridge Crystallographic Data 
Centre and the published coordinates for the three structures. 
All peak widths were set to 0.5°. The trioside and tetraose lack 
the symmetry of cellulose II and give more complex patterns. 
However, the strong correspondence of three major peaks sup¬ 
ports the idea of nearly identical crystal packing 

eventually result in cellulose II. Conversion with reten¬ 
tion of fiber character works only in the case of cellulose 
in cell walls. Fibers of extracellular bacterial cellulose, 
with all crystallites parallel, do not retain fiber form 
when converfed to cellulose II. When mutant or chilled 
bacteria produce II, there is reduced production of cel¬ 
lulose. In those cases, one idea is that adjacent sites of 
synthesis are not all actively producing cellulose and 
there are gaps between emerging chains. Then, instant 
crystallization into parallel cellulose I does not occur 
and thermodynamically favored II has time to occur. 

“Cellulose III” represents two distinct structures, 
nil and IIIu, that arise from conversions with ammonia 
or amines started with cellulose I or II, respectively. 
Early diffraction diagrams were sufficiently similar 
that the structures were thought to be the same. Unlike 
the irreversible I ^ II conversion, IIIj can return to I. 
Cellulose IIIi has a monoclinic unit cell with a single 
chain so the structure must have parallel packing with 
no stagger. A fundamental structural feature must be 
preserved in the I ^ IIIi conversion that allows and 
guides the return of IIIi to I that is not preserved in the 
I ^ II conversion. The parallel packing of I is the most 
significant feature that is preserved in IIIi. Another 
major feature is the orientation of the 06 group. How¬ 
ever, it is clearly gt (gauche to 05 and trans to C4) in 
both cellulose II and IIIi (Wada et al. 2004b) and tg in 
cellulose I, so that is not a constant feature. Since 06 is 


gt in both nil and II, there must be some other differ¬ 
ence. Cellulose IIIu has a statistical distribution of up 
and down chains (Wada et al. 2009). 

Pure cellulose has essentially twofold screw-axis 
{2i) symmetry in all its polymorphs. Thus the mole¬ 
cules are helices in the mathematical sense with two 
residues per turn. The exception, la, still resembles 2 j 
symmetry (French and Johnson 2007). This symmetry 
means that the x, y and z coordinates of the atoms in the 
second glucose residue in the chain are related to the 
counterpart atoms in the first by operators -x, -y, and 
z + Vi. (The third residue is then specified by x, y 
and z + 1.) These values refer fo coordinates when c 
is the molecular axis. Earlier crystallographic work 
used a convention with h as the molecular axis and an 
acute instead of obtuse monoclinic angle. This is 
a source of confusion, as many studies that focus on 
the degree of crystallinity and polymorph identifica¬ 
tion continue to use the older convention, with major 
peaks labeled 101, 10-1, 002, and 040, while the 
modem determinations of molecular structure use the 
above convention, with Miller indices for the major 
peaks of —110, 110, 200, and 004. 

Molecular Modeling 

Computerized molecular modeling is also adding 
information on cellulose properties. The preferred 
shape of the linkage between just two glucose residues 
is an important predictor of cellulose shape. 
Ramachandran type analyses for this linkage are 
often carried out (Eig.4). Modeling studies of cellobi- 
ose have been carried out with empirical force fields 
(molecular mechanics), both by energy minimization 
as in Eig. 4 and molecular dynamics, by a hybrid 
method, as well as with Hartree Eock (HE) and Density 
Functional (DFT) electronic structure theory (quantum 
mechanics). A highlight from this body of work is the 
prediction of a different conformation in the gas phase 
than is observed in the solid state for cellobiose (Strati 
et al. 2002), with subsequent experimental verification 
(Cocinero et al. 2009). Other modeling work concerns 
calculation of mechanical properties such as modulus. 

Studies of larger systems, either with longer cellu¬ 
lose chains, with explicit solvent water or model crys¬ 
tals are usually handled with far faster molecular 
mechanics methods. They permit models of crystals 
to have realistic cross-sectional size and substantial 
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<I>05'-C1'-C4-C4 



Cellulose, Fig. 4 Ramachandran map for P-cellobiose, com¬ 
puted with MM3(96) and a dielectric constant of 7.5, using the 
hydrogen bonding constants from MM3(92). Also shown is the 
cellobiose molecule, with the definitions of the (() and v]/ torsion 
angles and 06 orientations (gg, gt, and fg). The cyan atoms 
represent alternative orientations for the white hydrogen atoms 
or the red oxygen atoms. The elevated dielectric constant reduces 
the calculated strength of hydrogen bonds, and is meant to com¬ 
pensate for a condensed phase environment. One thousand, eight 
hundred and sixty-three different combinations of the exocyclic 
group orientations were used to make individual maps. The lowest 


energy at each ct),v|/ point, considering all of the individual maps, is 
indicated. The contour lines are shown in magenta at 0.5 kcal/mol 
and in blue (1.0 kcal/mol) and black (in 1.0 kcal/mol increments as 
marked). Orange dots indicate the conformations observed by 
X-ray crystallography in molecules having a cellobiose linkage, 
including derivatives and complexes. Almost all are inside the 
1.0 kcal/mol contour. The gas phase conformation is at approxi¬ 
mately 4> = 60°, vl/= —120°. The dashed diagonal line indicates 
combinations of tj) and v]/ that would lead to cellulose molecules 
with twofold screw-axis conformations. Other calculations yield 
considerably different contours 


length. Those models have tended to twist (Yui and 
Hayashi 2007), a topic of current research. Such model 
crystals have found use in studies of enzyme action to 
convert cellulose into fermentable sugars, an area of 


intense research with the eye to producing fuels from 
biomass (Wertz et al. 2010). Although cellulose can be 
grown in large amounts, it is far more difficult to 
convert into the monosaccharide glucose than is starch. 
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This recalcitrance is widely thought to result from the 
highly crystalline and insoluble nature of cellulose. 

Summary 

This entry has concentrated on the molecular levels of 
cellulose architecture. For one thing, there is a huge 
literature to cover in this area alone and, for another, the 
details of higher levels depend on the source of cellulose. 
But more important, there seems to be substantial confu¬ 
sion among scientists on these fundamental aspects such 
as parallel-up and antiparallel. It can be argued that 
a number of disciplines have been short-changed, espe¬ 
cially electron and atomic force microscopy, and for 
those omissions, I apologize. Cellulose is likely to be 
important to humankind well into the future because of 
its widespread application and renewability. It is also 
likely to remain the subject of considerable research, 
given the difficulty of unlocking its secrets. Whether the 
promises of renewable fuels from cellulosic biomass are 
realized, and whether nanocellulose particles will find 
widespread utilization, is unknown. But with the preva¬ 
lence of cellulose, it is well worth pursuing these targets. 

Cross-References 

► Atomic Force Microscopy 

► Computational Studies of Enzyme Motions 

► Molecular Dynamics Simulations of Carbohydrates 

► Molecular Dynamics Simulations of Lipids 

► Neutron Scattering of Membranes 

► NMR 

► NMR Methods for Kinetic Analysis 

► Polysaccharides: Biophysical Properties 

► Protein Complexes 

► QM/MM Calculations of Vibrational Spectra 

► QM/MM Methods 

► Wide and Small Angle X-Ray Scattering 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 

► X-Ray Scattering of Lipid Membranes 
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Synonyms 

Chloride Channels 


Introduction 

CFTR (cystic fibrosis transmembrane conductance 
regulator) is an integral membrane protein that func¬ 
tions as an epithelial CU channel, which is rendered 
defective by inherited gene mutations in patients with 
cystic fibrosis (Riordan et al. 1989). Its function under¬ 
lies fluid secretion in the airways, sweat ducts, pancre¬ 
atic duct, and vas deferens, and it mediates the 
excessive intestinal water loss in secretory diarrheas 
(Verkman and Galietta 2009). CFTR belongs to the 
large ABC (ATP-binding cassette) transporter super¬ 
family, of which most members are ATP-hydrolyzing 
pumps. CFTR is the sole ABC protein known to func¬ 
tion as an ion channel. Despite this difference in func¬ 
tion, the gates of CFTR channels are opened and closed 
by interactions with ATP similar to those that in other 
ABC proteins result in energetically uphill substrate 
transport. 

CFTR Is an Ion Channel 

A CFTR channel in its open conformation provides 
a pathway for CU ions (or other small anions, such as 
bicarbonate) to cross the cell membrane, down the 
gradient of their electrochemical potential. Because 


the ions moving through the channel carry an electrical 
current across the membrane, the ► patch-clamp 
recording technique (Gibb 1995) can be employed to 
study CFTR function. Once a saline-filled glass pipette 
connected to an electrode is sealed tightly against 
a small patch of membrane, the gating of any CFTR 
channels present in the patch can be accurately 
monitored by observing how the current flowing 
through the membrane patch changes with time. 
If the patch contains few channels, individual events 
reflecting opening and closing of single-channel gates 
can be detected as abrupt changes in the current level 
(Fig. la, black trace, recorded in an excised, inside-out 
patch with one active CFTR channel). 

For quantitative analysis of such current records, 
gating is treated as a stochastic process described by 
a kinetic scheme (which includes the states the channel 
can visit, how those states are connected, and rates of 
transitions between the states; e.g.. Fig. lb inset, 
“► Hidden Markov Modeling”). The hrst task for the 
analysis is collection of all dwell times in the closed 
state (represented by green bars in Fig. la), and of all 
dwell times in the open state (red bars). In general, 
dwell times in a particular state provide information 
about the rates of leaving that state. Simple 
calculation of average dwell times can yield useful 
information, but a more powerful approach is analysis 
of distributions of dwell times (e.g., simulated 
open dwell-time distribution shown in Fig. lb, 
“► Data analysis. Single Molecule Kinetic Methods”; 
Colquhoun and Hawkes 1995; Colquhoun and 
Sigworth 1995). In this analysis, the time axis is 
subdivided into bins, and the ordinate reports the num¬ 
ber of dwell times observed to have a duration that falls 
within the period encompassed by each bin. Dwell 
times in a single conformational state are exponen¬ 
tially distributed, which means that the bins containing 
brief durations have the highest event count, and the 
distribution decays monotonically. This is because the 
probability of leaving a state is the same for each time 
interval spent in that state, so that the chance for 
a channel to survive in a given state declines with 
time. Fitting of dwell-time distributions using 
a statistical procedure called maximum likelihood 
(ML) can give rate estimates for the underlying 
kinetic scheme. For example, in Fig. lb, the red 
curve superimposed on the depicted simulated 
open dwell-time distribution data is the theoretical 
open dwell-time distribution predicted for a C«-0 
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CFTR, an Ion Channel Evolved from ABC Transporter, 
Fig. 1 (a) Single CFTR channel cuiTent trace (black), in 
which upward deflections correspond to channel-opening events 
that allow CP ions to flow across the patch. In this case, anions 
flow from cytoplasmic to extracellular side of the membrane, 
constituting an inward cuiTent; but, for consistency between fig¬ 
ures, in this entry, channel openings are plotted as upward deflec¬ 
tions in all figures, regardless of the actual direction of ion flow 
determined in each case by the CP electrochemical gradient.) The 
current flowing through open channels corresponds to million 
CP ions per second. Although not evident in the traces shown in 


this entry, at a higher time resolution, CFTR can he seen to open 
into “bursts” of openings: clusters of openings separated by short¬ 
lived, “flickery” closures. Duration and frequency of flickery 
closures are not nucleotide dependent. Rather, NBD dimer forma¬ 
tion and dissociation drive initiation and termination of bursts; 
here, for simplicity, we use the terms opening and closing to 
indicate entry and exit from bursts.) Red and green bars illustrate 
length of individual dwell times in open and closed states, respec¬ 
tively. (b) Example of an open dwell-time distribution (yellow 
histogram), and ML fit (red line), for a current trace simulated 
using the simple kinetic scheme in the inset, with koc = 4 s~' 


model that has a closing rate, koc, of 4 s“' (Fig. lb, 
inset; this was the model used to simulate the data). 
According to the ML criterion, this curve is the 
“best fit” to the observed distribution. Thus, given 
a kinetic scheme, ML htting finds values for the rates 
(e.g., rate koc for the open dwell-time distribution in 
Fig. lb) such that the likelihood of obtaining the 
observed distribution is maximized. 

Two main factors influence the gate in a CFTR 
channel: they are ► phosphorylation, principally by 
(cAMP-dependent) protein kinase A (PKA), and 
interaction with ATP (Fig. 2; Gadsby et al. 2006). 
Phosphorylation is a prerequisite for CFTR channel 
gating in response to binding and hydrolysis of ATP. 
By adding PKA and ATP to the cytosolic side of the 


excised patch, channels become phosphorylated; after 
that the current starts fluctuating in a stepwise fashion. 
This pattern reflects fluctuations in the number of 
channels that are open at any point in time. Once the 
channels have been phosphorylated, ATP is sufficient 
and necessary for channel gating: adding ATP can 
activate them, but after washing the ATP away no 
further openings occur (Fig. 2). 

At the molecular level, ATP interacts with CFTR’s 
two cytosolic wucleotide-Z^inding domains (NBDs), 
NBDl in the N-terminal half of CFTR, and NBD2 
near the C-terminus, whereas phosphorylation occurs 
at several serines within the cytosolic “regulatory” (R) 
domain, a region unique to CFTR and absent from 
other ABC proteins. On the other hand, the permeation 
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CFTR, an Ion Channel Evolved from ABC Transporter, 
Fig. 2 Current trace from patch containing at least three 
CFTR channels; the number simultaneously open is indicated 


at the right. Solid lines below the trace signify the presence at the 
cytosolic face of the patch of the specified reagent: red line, 
5 mM ATP; blue line, catalytic subunit of PKA 



CFTR, an Ion Channel Evolved from ABC Transporter, 
Fig. 3 (a) CFTR topology including TMDs {dark gray), 
NBDl {green), NBD2 {blue), and R domain {white). Intracellu¬ 
lar linking loops are shown in magenta, (b) crystal structure of an 
NBD homodimer, from a bacterial ABC transporter, (c) sche¬ 
matic representation of a head-to-tail NBD dimer 

pathway through which Cl~ ions cross the membrane 
is constructed by the two transmembrane domains 
(TMDs, Fig. 3a). 


CFTR's NBDs and Their Role in Channel 
Gating 

Fligh resolution crystal structures have been determined 
for many NBDs, including CFTR’s NBDl and a mod¬ 
ified NBD2. In all ABC proteins,nucleotide interacts 
directly with conserved sequences of residues (Walker 
motifs) in the “head” of an NBD. But the conserved 
ABC signature (LSGGQ-like) sequence occurs in the 
helical “tail” of the NBD, quite distant from where the 


nucleotide is found bound in monomeric crystals. How¬ 
ever, NBDs can interact to form head-to-tail dimers, 
with an ATP molecule buried within each of two com¬ 
posite nucleotide-binding sites created at the dimer 
interface (Fig. 3b-c). The ATP acts as “molecular 
glue,” with the two ATP molecules, and in particular 
their y-phosphates, providing molecular contacts to 
both sides of the composite binding sites and thereby 
making important contributions to dimer stability. 

In CFTR NBDl and NBD2 form an intramolecular 
heterodimer with two distinctly different composite 
binding sites at the interface, referred to as composite 
site 1 and composite site 2, comprising Walker motifs 
in the head of NBDl and head of NBD2, respectively 
(“► Heteromeric Versus Homomeric Association of 
Protein Complexes”). 

Quantitative analysis of how ATP affects CFTR 
channel gating has been carried out by exposing 
excised patches to different test [ATP] and monitoring 
how gating parameters are altered (Fig. 4). 

Closing rate (reciprocal of the mean open dwell 
time) shows no clear dependence on [ATP] over 
a three orders of magnitude concentration range 
(Fig. 4c). In contrast, over the same [ATP] range, the 
opening rate (reciprocal of the mean closed dwell time) 
hrst increases and then saturates (Fig. 4b). The data are 
described by Michaelis-Menten kinetics (or the Hill 
equation with Hill coefficient of 1) with apparent ATP 
affinity ^50 pM. Therefore, at low [ATP], CFTR 
channel opening is rate-limited by an ATP-binding 
step, which means that ATP binding must occur on 
closed channels and precedes channel opening; the Hill 
coefficient of 1 is consistent with a single binding step 
controlling opening. 

ABC proteins do not merely bind ATP, they also 
catalyze ATP hydrolysis (“►ATPase: Overview”). 
Native, wild-type (WT) CFTR, too, is an active 
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CFTR, an Ion Channel 
Evolved from ABC 
Transporter, Fig. 4 [ATP]- 
dependence of CFTR gating, 
(a) current trace from patch 
containing multiple CFTR 
channels, exposed to a test 
concentration of 50 pM ATP, 
in between control exposures 
to 5 mM ATP. Red and green 
bars highlight duration of 
dwell times at conductance 
levels 1 (one open channel) 
and zero (no open channel), 
(b, c) [ATP]-dependence of 
opening rate and closing rate, 
respectively 
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CFTR, an Ion Channel 
Evolved from ABC 
Transporter, Fig. 5 Current 
recordings from patches 
containing hundreds of WT 

(a) or mutant K1250A 

(b) CFTR channels. 
Macroscopic current decay 
upon abrupt removal of ATP 
monitors channel closing rate 
(superimposed blue lines are 
single exponential fits) 


ATP 

b 



ATPase, with a measured turnover rate on the order of 
1 ATP (Li et al. 1996). How this hydrolysis affects 
gating is most readily observed by studying channel 
closing rates in patches containing large numbers (hun¬ 
dreds, or even thousands) of CFTR channels (e.g., 
records in Fig. 5 - compare current scale to those in 
Figs. 1 and 2; “► Analysis of Macroscopic Currents”). 
In such recordings, individual opening and closing 
events are lost in noise. Upon removal of ATP, the 


opening rate falls to almost 0 (see Figs. 2 and 4b), so 
the time course of current decline reflects only the 
charmel closing rate. WT channels close with a time 
constant below 1 s (Fig. 5a). Mutation of a crucial 
catalytic lysine, K1250, in the Walker A motif of 
composite site 2, abolishes ATPase activity of purified, 
reconstituted CFTR (Li et al. 1996), and K1250A 
CFTR channels close extremely slowly (time constant 
tens of seconds. Fig. 5b). Mutation of other catalytic 
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residues in composite site 2 (e.g., E1371, the “catalytic 
glutamate,” and D1370, see below) also slows channel 
closure. These observations suggest that hydrolysis at 
site 2 is required for normal, fast channel closure. 

In contrast, CFTR channels carrying a mutation at 
the equivalent conserved lysine, K464, in the Walker 
A motif of composite site 1, appear to close normally 
and display average open dwell times not different 
from those of WT channels. In fact, biochemical 
experiments suggest that, in WT CFTR, ATP binds 
tightly in composite site 1 but remains there for 
minutes without being hydrolyzed. This lack of activ¬ 
ity is accounted for by several noncanonical substitu¬ 
tions of otherwise conserved key residues on both sides 
of the NBD1-NBD2 heterodimer interface in compos¬ 
ite site 1. Although gating kinetics of K464A mutant 
channels look very similar to those of 
WT, biochemical measurements show that the 
ATPase rate of K464A is only one tenth that of WT 
(Ramjeesingh et al. 1999). This will be considered 
again later (see section “Evidence for 
a nonequilibrium gating scheme for CFTR”). Other 
ABC proteins (e.g., multidrug-resistance-related 
proteins, ► sulfonylurea receptors, transporter 
associated with antigen processing) share with CFTR 
the presence of a noncanonical interfacial 
ATP-binding site, so that only one of their two 
composite sites catalyzes ATP hydrolysis at 
a measurable rate. 

The inference from findings like these is that 
a scheme that relates ATP interactions with the 
NBDs to channel gating must include the following 
features. ATP binds tightly to the Walker motifs at the 
NBDl head (in composite site 1) where it remains 
unhydrolyzed for several gating cycles. Then, after 
ATP binds to the Walker motifs at the NBD2 head of 
composite site 2, that is, once ATP is bound at both 
sites, the channel can open. The open state of the 
channel, with 2 ATP molecules bound, is very stable, 
and rapid closure can occur only after the ATP at 
composite site 2 is hydrolyzed. Figure 6a shows 
a working hypothesis for such a CFTR-gating scheme, 
in terms of channel closed (Ci, C 2 ) and channel 
open (Oi, O 2 ) conformations (only the nucleotide 
bound - and hydrolyzed - at site 2 is indicated; ATP 
bound at site 1, assumed present in all four states, is 
omitted for simplicity). A structural interpretation of 
this scheme can be made in the light of structural 
knowledge recently obtained for other ABC proteins 
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CFTR, an Ion Channel Evolved from ABC Transporter, 
Fig. 6 Working hypothesis linking ATPase and gating cycles 
of CFTR. (a) Proposed gating scheme relating nucleotide present 
at site 2 to channel state (closed states, Ci and C 2 ; open states, 
O] and O 2 ). (b) Structural interpretation of gating scheme in 
A, color coding as in Fig. 3. The small orange balls represent CP 
ions flowing through CFTR’s transmembrane pore 


(Locher 2009): conformational changes in the TMDs 
that open and close the Cl~ ion pathway could 
be coupled to the ATPase cycle via formation 
and dissociation of the NBD1-NBD2 dimer (Fig. 6b). 
Formation of an NBD1-NBD2 tight dimer after bind¬ 
ing of both ATPs could be coupled to opening of the 
channel pore. Then hydrolysis of the ATP at site 2 
would destabilize the NBD dimer, triggering fast 
channel closure. So, while ATP at site 1 remains 
bound, without being hydrolyzed, for many gating 
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cycles, the ATPase cycle at site 2 is coupled to 
opening and closing of the Cl~ permeation pathway. 
This interpretation is supported by experimental 
evidence, obtained using ► double mutant cycle 
analysis, that opening of the CFTR pore corresponds 
to closure of the NBD1-NBD2 dimer interface gap at 
composite site 2 (Gadsby et al. 2006). 

Phosphorylation of the R Domain Relieves 
Gating Inhibition 

R-domain phosphorylation is required for CFTR 
channel activation, and can be considered the major 
regulatory mechanism in vivo because the millimolar 
range of [ATP] in cells is enough to fully activate 
channel gating (Fig. 4). Studies on full-length CFTR 
in intact cells, and on isolated R-domain peptides 
in vitro, have identified at least ten R-domain serines 
that become phosphorylated by PKA, most of them 
occurring within dibasic PKA recognition consensus 
sequences (R/K,R/K,X,S). This large number of target 
serines allows for a graded functional response, and 
CFTR channel activity increases in rough proportion to 
phosphorylation stoichiometry (reviewed in Gadsby 
et al. 2006). 

Either of two kinds of regulatory mechanisms could 
account for the findings. The unphosphorylated 
R domain could inhibit channels that would be active 
in its absence, or the phosphorylated R domain could 
stimulate channels that would be otherwise inactive. 
Experiments show that excision of the R domain 
results in constitutive, phosphorylation-independent, 
channel activity; indeed, cutting CETR in half and 
co-expressing the two segments, TMDl-NBDl and 
TMD2-NBD2, yields “cut AR” CFTR channels 
whose ATP-dependent gating in the absence of PKA 
phosphorylation is comparable to that of phosphory¬ 
lated WT CFTR. Because the stimulation of WT chan¬ 
nels by PKA corresponds to a > 100-fold increase in 
open probability (Pq), while the constitutive Pq of cut 
AR is about half that of fully phosphorylated WT, the 
dominant role of the R domain appears to be inhibition 
of channel activity when the R domain is 
dephosphorylated. A small, <2-fold, stimulation by 
the phosphorylated R domain has also been proposed 
(reviewed in Gadsby et al. 2006). 

How phosphorylation influences R-domain 
structure and its interactions with the rest of CFTR 


remains unclear. The R domain lacks sequence 
homology to any known protein, and CD spectral 
analysis and NMR studies of isolated R-domain 
peptide indicate the absence of stable 3-dimensional 
structure (“► Structurally Disordered Proteins”). 
Nevertheless, phosphorylation of R-domain peptide 
by PKA diminished the a-helical contribution to its 
CD spectrum. Correspondingly, NMR measurements 
revealed stretches of dephosphorylated R-domain 
peptide, near target serines, with distinct helical 
propensity and a tendency to interact with 
isolated NBDl protein; and both characteristics were 
weakened by phosphorylation. But whether such 
interactions between the R domain and NBDl occur 
in intact CFTR and, if so, whether they are modulated 
by phosphorylation is not known (Gadsby et al. 2006; 
Baker et al. 2007). 

Phosphorylation is unlikely to exert its effect 
through mere accumulation of negative charge. 
Although mutation of eight consensus serines to 
negatively charged aspartates or glutamates results in 
some constitutive, PKA-independent, channel activity, 
this constitutive Pq is very low compared to that of 
fully phosphorylated WT, suggesting that negative 
charge per se is a poor mimic of phosphorylation. 
Also, phosphorylation of serines 737 and 768 
decreases Pq, which again argues against a simple 
effect of charge accumulation. 

Similarly, the graded increase of channel Pq with 
phosphorylation stoichiometry is hard to reconcile 
with initial suggestions that the unphosphorylated 
R domain might physically obstruct the pore; and 
phosphorylation has no major effect on ATP binding 
affinity of the NBDs. Because phosphatase-mediated 
dephosphorylation essentially abolishes ATP hydroly¬ 
sis of purified CFTR protein (Li et al. 1996), the cata¬ 
lytic cycle appears to be stalled in unphosphorylated 
CFTR, rather than uncoupled from gating. However, 
because catalytic and pore-gating cycles appear to be 
strictly coupled (Csanady et al. 2010), the “brake” 
could arrest either the NBDs (e.g., by preventing 
NBD dimer formation, see below) or the TMDs (e.g., 
by preventing conformational changes associated with 
pore opening) - just as a seesaw can be stopped by 
immobilizing either end. Interestingly, two 
nonconserved segments in NBDl, both containing 
phosphorylatable serines, had been suggested to 
prevent NBD dimerization in their nonphosphorylated 
states, but deletion of neither segment affected strict 
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phosphorylation dependence of gating. Instead, 
mounting evidence suggests that the “brake” acts on 
the TMDs rather than on the NBDs. First, simple 
severing of covalent linkage between the R domain 
and TMD2 (between residues 835 and 837) disrupts 
strict phosphorylation dependence of gating, yielding 
channels with ^20% of maximal activity in the 
absence of phosphorylation. More importantly, the 
completely ATP-independent gating of truncated 
CFTR channels, lacking the entire NBD2 domain, 
remains strictly dependent on phosphorylation. This 
suggests that the R domain directly interacts with the 
intracellular linking loops of the TMDs, consistent 
with conclusions from low-resolution cryo-EM imag¬ 
ing. By altering the specihc pattern of interactions 
between the R domain and the intracellular linking 
loops and/or CFTR’s N terminal tail, phosphorylation 
likely lowers the energetic cost of the TMD conforma¬ 
tional change associated with pore opening. Once the 
“weight” on the TMD-end of the seesaw has been thus 
diminished, the energy released during 
ATP-dependent NBD dimer formation in WT CFTR 
becomes sufficiently “heavy” to flip the seesaw 
from an open-dimer/closed-pore conformation into 
a tight-dimer/open-pore state (Gadsby et al. 2006; 
Zhang et al. 2009; Wang 2010; Wang et al. 2010). 

Besides PKA, other kinases can phosphorylate the 
R domain, some targeting the same serines as PKA, 
others distinct sets of residues. Phosphorylation of 
serine 686 by protein kinase C (PKC) seems to play 
a permissive role: PKA-dependent CFTR activation 
requires prior phosphorylation of serine 686 by PKC. 
Recently, the “inhibitory” serine 768 (see above) was 
shown to be a substrate of AMP-kinase (AMPK), 
which explains the suppressive effect of AMPK 
activity on whole-cell CFTR currents (Hallows et al. 
2000). Still unclear are functional and physiological 
consequences of CFTR phosphorylation by cyclic 
GMP-dependent protein kinase (PKG), and calcium/ 
calmodulin-dependent protein kinase I (CaM kinase I) 
(Gadsby et al. 2006; King et al. 2009; Kongsuphol 
et al. 2009). 

Evidence for a Nonequilibrium Gating 
Scheme for CFTR 

The cyclic kinetic scheme underlying CFTR gating 
(Fig. 6) is not yet universally accepted. Some 


researchers believe a simple equilibrium kinetic 
scheme best describes CFTR gating (Aleksandrov 
et al. 2009). In an equilibrium scheme (within blue 
dotted line. Fig. 7 top), closing and opening occur 
along the same pathway (i.e., closing is simply reversal 
of the opening transition). Such a scheme predicts 
a monotonically decaying open dwell-time distribution 
(as in Fig. lb). In the nonequilibrium, cyclic, gating 
scheme in Fig. 6a, for most opening events, closing 
occurs through a different pathway than opening. After 
opening (to state Oi ATP), because the nonhydrolytic 
closing pathway (Oi-ATP to Ci ATP) is slow, the 
channel usually proceeds to a second open state. 
It transits through these two sequential open states, 
and only then it closes. This causes a rarity of brief 
open events. Therefore, the nonequilibrium scheme 
predicts a peaked open dwell-time distribution with 
a rising phase, reflecting low event counts in the bins 
corresponding to shortest open dwell times. 

For a given observed dwell-time distribution, two 
alternative ML fits can be obtained: one assuming the 
equilibrium scheme, and another assuming the 
nonequilibrium scheme. The log-likelihood score 
(LL) quantitatively describes the “goodness” of a fit. 
A LLequilibrium. assuming the equilibrium scheme, 
can be calculated as well as a LLnqnequilibriumj 
assuming the nonequilibrium scheme. If the number 
of free parameters is the same in the two schemes being 
compared, the model that yields a higher LL value is 
the one that best describes the data. But if the number 
of free parameters is different, the choice is not as 
simple. More free parameters often allow a better fit, 
even if the extra parameters are not really required. 

In cases in which models have different numbers of 
free parameters, can the significance of a better fit be 
quantified? The answer is yes, if the model with fewer 
parameters is a submodel of the more complex one. 
Since the equilibrium scheme is a submodel of the 
cyclic scheme (Fig. 7 top) the significance of a better fit 
can be quantified. This is done using the log-likelihood 
ratio (ALL), which quantifies the improvement in fit: 
all = LLnonequilibrium — LLequilibrium- The 
distribution of expected ALL can be calculated assum¬ 
ing the simple model is true, and the observed value of 
ALL (ALLobs) can be used to calculate a P value: the 
probability of observing the ALLobs assuming the sim¬ 
ple scheme is true (Csanady 2006). 

Figure 7a shows the WT open dwell-time distribu¬ 
tion (yellow histogram bars). The ML fit assuming the 
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CFTR, an Ion Channel 
Evolved from ABC 
Transporter, Fig. 7 Open 
dwell-time distributions 
(yellow histograms) for WT 
(a), D1370N mutant (b) and 
K464A mutant (c) CFTR 
channels, and ML fits 
assuming either the 
equilibrium scheme (top left 
scheme, dotted blue lines) or 
the nonequilibrium scheme 
(top right scheme, solid red 
lines). For each distribution P 
values are shown, which 
describe significance of 
improvement of fit by using 
the nonequilibrium scheme 
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equilibrium model is the blue dotted line, and the ML 
fit assuming the nonequilibrium model is the red line. 
Using ALLobs the alternative gating schemes can be 
ranked. The cyclic scheme provides a far better fit to 
the observed data: the P value (the probability of 
obtaining the observed distribution, with several of 
the leftmost bins having relatively low counts, assum¬ 
ing that the underlying scheme is the equilibrium one) 
is very low, 2-10“^. Thus, the data are consistent with 
a kinetic scheme in which rate k_i (nonhydrolytic 
closing) is negligibly small (i.e., the ATP-bound open 
state is very stable) so that for most openings the 
channel must hydrolyze ATP before it can close. It 
must go through two sequential steps before closing, 
resulting in a rarity of very brief open events. So, 
for WT CFTR, gating and ATP hydrolysis are strictly 
coupled. 

If a mutation is introduced into CFTR’s catalytic 
composite site 2, at the conserved Walker B motif 
aspartate (D1370N, a mutation that in other ABC 
proteins abolishes hydrolysis), the open dwell-time 
distribution becomes monotonically decaying 
(Fig. 7b). There is no significant improvement using 
the more complex model, and the observed data are 
consistent with the hydrolytic rate kj falling to zero. 
Thus, for this mutant, the cyclic scheme is reduced to 
the equilibrium scheme. 

Introducing the mutation K464A at the crucial 
Walker A motif catalytic lysine in CFTR’s 
noncanonical composite site 1, the open dwell-time 
distribution decays monotonically (Fig. 7c). But very 
short open dwell times are somewhat less frequent than 
would be expected assuming an equilibrium scheme 
(compare leftmost bins of yellow histogram bars to 
blue dotted line in Fig. 7c). If the k_i rate is not 
constrained, the fit can be signihcantly improved by 
using the cyclic model. The P value (the probability of 
obtaining this rarity of brief events by chance, assum¬ 
ing the equilibrium model is true), is 0.027. The data 
are more consistent with a kinetic scheme in which the 
nonhydrolytic closing rate, k_i, is increased (possibly 
linked to a destabilization of the NBD1-NBD2 dimer), 
and the hydrolytic rate ki is reduced. So for K464A 
mutant CFTR channels, in four out of five openings, 
closing occurs through reversal of the opening 
transition: in this mutant, coupling between the 
ATPase and gating cycles is partially lost. 

Using the rates estimated from ML fitting to 
dwell-time distributions for the WT and the 


two mutants, the ratio ki/(ki -i- k_i), which describes 
the proportion of channel openings that are coupled 
to ATP hydrolysis, can be quantified. For WT, the ratio 
suggests >95% coupling. For K464A, the ratio 
suggests ^20% coupling (reduced ki, increased k_i); 
and, because the overall time taken for the whole 
gating cycle is also twofold longer, the result is in 
very good agreement with the reported ~ 10-fold 
slower ATPase turnover (see above; Ramjeesingh 
et al. 1999). For D1370N, coupling is completely 
abolished, as expected for a mutation that drastically 
impairs hydrolysis (Csanady et al. 2010). 

Cl Permeation Pathway 

Although the gating mechanism of CFTR has been 
steadily coming into focus, until recently very little 
was known on the location of the channel gate(s) and 
on the identity of residues that line the hydrophilic 
anion permeation pathway through CFTR. Effects of 
substitutions (cysteines and/or alanines) at several sites 
(including R334, K335, F337, T338 in transmembrane 
helix 6, TM6, and K95, Q98 and P99 in TMl; Fig. 8) 
on single-channel current and channel permeation 
properties suggest that these positions might line 
the channel pore or at least play a role in maintaining 
pore structure in the open-channel state (Ge et al. 2004; 
Bai et al. 2011). Recent development of a functional 
“cys-less” CFTR (Mense et al. 2006), with all 18 
endogenous cysteines substituted by alternative 
amino acids, has facilitated systematic investigation 
of pore structure. After introduction of a unique target 
cysteine at a chosen location in cys-less CFTR, its 
accessibility to, and consequences of modification by, 
thiol-specific channel-permeant or -impermeant 
reagents can be monitored (“► Substituted Cysteine 
Accessibility Method”, SCAM). As a result, structures 
that delineate CFTR’s pore are now beginning to be 
identified (Linsdell 2006; Wang et al. 2011; Norimatsu 
et al. 2012). 

Cysteines introduced at 1331, L333, R334, K335, 
1336, T338 (Fig. 8a-c, orange, yellow) can be modified 
by thiol-specific reagents added to the extracellular solu¬ 
tion, and 1331C, L333C, K335C (Fig. 8a-c, orange) are 
modihed more slowly in the open-channel state, indicat¬ 
ing changes in the accessibility of these residues during 
channel gating (Beck et al. 2008). Moreover, with extra- 
cellularly applied thiol-reactive probes, cysteines at 
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CFTR, an Ion Channel Evolved from ABC Transporter, 
Fig. 8 Anion permeation pathway. Homology model of 
CFTR, built on the “outward facing” crystal structure of 
a bacterial homolog (Savl866, Locher 2009). Only selected 
residues mentioned in text are shown. In TMl: K95 in dark 
green and Q98 in light green. In TM6: 1331, L333, K335 in 
orange; R334, T338 in yellow, T339 and F342 in red', 1344 in 
pink; V345 in purple. InTMI2: NII38 and MI 140 in light blue; 

positions intracellular to F337 (e.g., T339, 1340, S342, 
Fig. 8a-c, red) react only with small reagents capable of 
permeating the channel pore (Alexander et al. 2009). 
Of interest, reactivity of V345C (Fig. 8a-d, purple) to 
thiol-specific reagents applied at the intracellular side of 
the membrane seemed independent of channel gating 
state, whereas I344C (Fig. 8a-d, pink) reacted only in 
the channel open state, interpreted as suggesting that 
some accessibility “barrier” between positions 345 and 
344 might be removed during channel opening (El Hiani 
and Linsdell 2010). Also, alterations of Pq caused by 
covalent modification of cysteines introduced along 
TM6 were similarly interpreted to suggest that part of 
this helix might contribute to the channel gate and move 
relative to other TMs during gating (Bai et al. 2010). On 
TMl, K95C (Fig. 8a-d, dark green) was modified regard¬ 
less of channel state, while Q98C (Fig. 8a-d, light green) 
was accessible only in the open state, implying that K95/ 
Q98 on TMl and 1344A^345 on TM6 might lie at 
a similar depth along the pore axis, a suggestion 
supported by disulfide bridge formation between K95C 
or Q98C and 1344C, associated with marked reduction in 
current (Wang et al. 2011, Fig. 8d). Very recent SCAM 


SI 141 in dark blue, (a, b) View of CFTR model in membrane 
(light gray), with and without NBDs. Domains are color-coded 
as in Figs. 3 and 6; R domain, not present in homologous crystal 
structures, is omitted, (c) Extracellular view of outer mouth of 
pore, (d) Outermost structural elements have been removed from 
c to show transmembrane helices at the level of suggested 
permeation “barrier.” Residues colored with lighter hues are 
accessible to intracellular reagents only in the open-channel state 

scans along TMl2 suggest that it too likely makes sig¬ 
nificant contributions to the surface lining the pore. 
Open-state dependence of accessibility to reagents 
applied from the cytosolic side for N1138C and 
M1140C, but not SI 141, again might imply location of 
a gating “barrier” between the latter two residues (Qian 
etal. 2011, Fig. 8d). 

Despite these steps forward, a clear picture of how an 
anion-selective pathway is formed through CFTR in the 
open-channel conformation, but is interrupted in the 
closed-channel state, has yet to emerge, ftesently, avail¬ 
able crystal structures of homologous ABC exporter 
proteins group into “outward facing” conformations 
with closely juxtaposed NBDs and with TMDs forming 
a cavity open to the extracellular solution, and “inward 
facing” conformations with NBDs very distant from 
each other and with a wide intracellular vestibule. 
Short “coupling helices” (magenta; Fig. 8a-c) at the 
intracellular linking loop ends furthest from the mem¬ 
brane, but positioned parallel to the membrane plane, 
contact the surfaces of the NBDs in a “► domain-swap” 
arrangement and are believed to transmit the conforma¬ 
tional signal generated by NBD dimerization to the 
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TMDs. Based on evidence that CFTR channel opening 
corresponds to the formation of a tight NBD dimer 
(Gadshy et al. 2006; Mense et al. 2006), the expectation 
is that CFTR channel opening also corresponds to the 
flipping of TMDs from “inward facing” to “outward 
facing.” However, in the latter conformation, helices 
corresponding to CFTR’s TMl, TM12, and TM6 are 
modeled to be very distant (in homology models these 
helices are separated by 14 A and 18 A, at the level of 
the putative “barriers,” Fig. 8d), making it difficult to 
envisage how in such a conformation they could form 
a pathway selective for anions. Therefore, the limited 
structures available so far, and the limitations of homol¬ 
ogy models, combine to leave unanswered the question 
of precisely how these TMs contribute to the anion 
selective pathway (but see Norimatsu et al. 2012). 

Summary 

CFTR, encoded by the gene that is mutated in cystic 
fibrosis patients, belongs to the superfamily of ABC 
transporters but is a quite unusual family member. 
First, unlike all other members examined so far, it 
functions as an ion channel: when open, millions of 
Cl~ ions flow through it every second, down their 
electrochemical gradient. CFTR seems to work as 
a broken pump. Instead of securely occluding its 
transported “substrate” in a binding site which never 
has simultaneous access to both faces of the membrane, 
one of the two pump gates is always open, allowing 
some conformations within the CFTR catalytic cycle to 
form a continuous aqueous pathway from extracellular 
solution to cytosol (Gadshy 2009). A first picture of the 
molecular structures lining this permeation pathway is 
beginning to emerge. Second, unlike other ABC trans¬ 
porters, it requires phosphorylation of several serines in 
its unique R domain before it becomes active. But 
CFTR is not only unique as an ABC protein, it is also 
very unusual as an ion channel. What is known about 
the molecular mechanism underlying its ATP- 
dependent gating suggests that gating is strictly coupled 
to an ATP hydrolysis cycle. This behavior is typical of 
active transporters. But is it just a vestige of CFTR’s 
evolution from its ABC transporter ancestors? Or could 
some of the many proteins that interact with CFTR slow 
its ATPase activity, thereby prolonging channel-open 
dwell times and so providing a sensitive means of 
regulating channel activity, analogous to modulation 


of the GTPase activity of G proteins? Or could CFTR 
still be an active transporter of some as yet unknown 
substrate, and the concomitant electrodiffusive flow of 
Cl~ be a para-phenomenon - albeit a physiologically 
vital one - of that transport process? 
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Synonyms 

Rhodopsin 

Channelrhodopsins are directly light-gated ion chan¬ 
nels, or, in other words, rhodopsins with intrinsic ion 
conductance. So far, they have been found only in 
green algae (Chlorophyta and Streptophyta) (Zhang 
et al. 2011). In the alga ChRs mediate intracellular 
signaling via a long C-terminal domain whereas at 
high light intensities, the intrinsic conductance dehned 
by the 7TM-fragment is seen as short-delay currents 
that contribute up to 80% of the total current. ChR 
proteins are located within the algal eye in the over¬ 
laying part of the plasma membrane and are both 
photoreceptors for phototaxis and phobic responses 
(Sineshchekov et al. 2002; Berthold et al. 2008). 
Under physiological conditions, photocurrents are car¬ 
ried primarily by and Ca^"^ whereas the contribution 
of Na"^ is small. This is different for application in host 
cells where in general the extracellular Na"^ concentra¬ 
tion is high. The algal eye contains around 100,000 
ChR which allows depolarization of the plasma mem¬ 
brane that is graded with the light intensity over a wide 
range due to gradual activation of the ChR pool. 
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Channeirhodopsin, Fig. 1 (a) Schematic photocuiTents in 
response to two subsequent light pulses. The initial transient 
current inactivates to a stationary level and is found smaller 
upon repetitive illumination, (b) Simplified scheme of the 
photocycle with the dark state (D470), the desensitized state 
(Des480), and 4 photocycle intermediates. D520 is the 


conducting open state, (c) Cartoon of the ChR 7TM-fragment 
according to Kato et al. (2012) with key residues seen 
in color: voltage sensor El23 (cyan), residues of the access 
channel (magenta), residues of the central gate (blue), 
and the inner gate (orange). The retinal Schiff base is seen in 
yellow 


The natural current is less than 10 fA per channel or 10 
transported ions per absorbed photon. The measured 
currents in living algae reaches up to 40 pA for a half- 
saturating light flash (Harz et al. 1992). In host cells, 
a conductance up to 100 fSi may be reached at high 
ionic conditions (Lin et al. 2009; Feldbauer et al. 
2009). 

The first known and described channelrhodopsins, 
channeirhodopsin-1 (ChRl) channelrhodopsin-2 
(ChR2) were identified in Chlamydomonas 
veinhardtii, a green unicellular alga from temperate 
freshwater environments (Nagel et al. 2002, 2003). 
Both, ChRl and ChR2 have unselective cation con¬ 
ductances including for H"^, Na"^, K"^, and even Ca^"^. 
Detailed characterization of the electric properties has 
been achieved in Xenopus oocytes and HEK cells 
where currents of ChR2 reach values of above 1 pA 
(Nagel et al. 2003). Under most symmetric ion condi¬ 
tions, currents are inward rectifying with large inward 
and small outward currents. Rectification is caused by 
the voltage dependency of transport and only to 
a lower degree by voltage-modulated photocycle 
kinetics. Weak illumination is evoking photocurrents 


that slowly rise and approach a stable stationary level. 
However, strong illumination causes photocurrents 
with a rise time of below one ms that decay in the 
light to a reduced stationary level of about 30% of the 
maximum (Fig. la). This process of inactivation is 
explained in a dual fashion. First, when a bright light 
flash all ChR molecules of a population are sent into 
the photocycle they tend to populate to a significant 
degree late photocycle intermediates like P480 and 
Des480 in case of ChR2 (Fig. lb). Des480 reconverts 
to the dark state only within >20 s which is the reason 
why this state is considered as the desensitized 
state (Des-state). Photochemically activated Des 
approaches a conducting state 02 that slightly differs 
from the 01 respective selectivity and kinetics. 

The photocycle of ChR2 (Fig. lb) is similar to that 
of BR except with different spectral characteristics due 
to variations in the ionic environment near the retinal 
Schiff base, larger conformational changes within the 
protein (Ritter et al. 2008; Radu et al. 2009) and higher 
water content within the proton network (Kato et al. 
2012). Upon light absorption the dark-adapted state, 
absorbing in ChR2 at 470 nm (D470), rapidly converts 
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into the conducting state P520 via two short-lived 
photo-intermediates P500 and P390. P500 returns 
back to the dark state via the late photocycle interme¬ 
diates P480 and Des480 (Ritter et al. 2008; Bamann 
et al. 2008). 

Unlike opsin-derived pumps, the kinetic parameters 
related to channelrhodopsins are determined by two 
factors, the efficiency of light absorption and the life¬ 
time of the resulting conducting state, given that ion 
transport (down the gradients) is coupled to occupancy 
of the conducting state. The TMl-3 of ChRs contains 
key residues governing channel gating and lifetime of 
the conducting state (Fig. Ic). The hydrophilic ion 
entrance channel is dominated by three Glutamates, 
namely, Glu 97, 101, 104 (ChR2-numbering), and 
Lys93. This pore is assumed to allow ions from the 
extracellular bulk phase to freely float into this channel 
before they are stopped by the central gate. The gate 
consists of three residues, Ser63, Glu90, and N258. 
From the intercellular side ion flux is blocked in the 
dark by the inner gate defined by two His (134 and 
265) and two Glu (82 and 83). The space between 
central gate (CG) and inner gate (IG) is water deprived 
and significant conformational changes must occur 
before the vestibule is flooded with water and 
a continuous membrane spanning ion channel is 
established. The CG and IG residues are the key ele¬ 
ments for ion selectivity of the channel and ChR- 
variants with modihed gates show altered Na"^ or 
preference (Kato et al. 2012). 

How is the photoswitch controlling the gating pro¬ 
cess? The light-sensitive retinal moiety is embedded 
into a rather hydrophobic and highly conserved retinal 
binding pocket dehned by three Phe and two Tyr 
residues, and the aldehyde function of retinal links to 
the protein via a protonated retinal Schiff Base 
(RSBH). Two polar residues, C128 and D156 
(ChR2), are contacting the polyene chain of the 
RSBH and are critical for both, ON- and OFF-gating. 
Modification of Cl28 in ChR2, alone or in combina¬ 
tion with D156, dramatically slows down both opening 
and closing of the channel and extends the lifetime of 
the open state by several hundred or even several 
thousand-fold, and enables long-acting inward current 
in response to single pulses of light (step function 
rhodopsins, SFRs). These variants in principle can 
achieve similar maximal current magnitudes as wild- 
type ChR2 but using much lower levels of light (Bemdt 
et al. 2009; Bamann et al. 2010). 


As typical for microbial rhodopsins, in ChR the 
RSBH charge is counter balanced by a complex nega¬ 
tive charge defined by El23 and D253. D253 is in 
closer contact to the RSBH and transiently accepts 
the H"^ from it during the photocycle. In contrast, 
El23 is involved in color tuning and fine-tuning of 
the kinetics (Fig. Ic). It bridges the chromophore 
with the ion pore and senses the membrane voltage. 
As the primary voltage sensor, it slows down the 
photocycle kinetics when the membrane voltage 
approaches positive values. The proximity of the coun¬ 
terion to the RSBH is considered as one of the reasons 
why the absorption of ChRl and ChR2 is significantly 
blue-shifted compared to bacteriorhodopsin. Struc¬ 
tures of the open state(s) or other photocycle interme¬ 
diates are not known yet and await elucidation. 

Channelrhodopsins are widely used in the neurosci¬ 
ences to depolarize cells with light and to study cell¬ 
cell interactions and neuronal networks in tissues or 
living animals. It is the most versatile tool in the new 
research held Optogenetics. 

Cross-References 
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Synonyms 

Lipids 


Introduction 

Lipids are often defined as organic molecules that are 
soluble in organic solvents and poorly soluble in water. 


Thus, they represent a varied group of compounds, and 
their diversity is pivotal for their cellular functions. 
Lipids can be divided into two main groups: the non¬ 
polar lipids (acylglycerols, sterols, free fatty acids, 
hydrocarbons, alcohols, wax, and steryl esters) and 
polar lipids (phosphoglycerides, glycosylglycerides, 
and sphingolipids). Triacylglycerols serve as energy 
stores and, in some organisms, as a reserve of essential 
fatty acids. Polar lipids and sterols are 
important structural components of cell membranes 
where they form bilayers, acting as selectively 
permeable barriers for cells and organelles 
(Gurr et al. 2002). Other lipids (e.g., waxes) contribute 
to surfaces that regulate water balance and also protect 
organisms from noxious environmental conditions. In 
addition to these structural roles, many lipids have 
biologically active functions, e.g., cell-signaling inter¬ 
mediates and enzyme and gene modulators (Gunstone 
et al. 2007). 

Glycerolipids 

Acylglycerols (Glycerides) 

The basic structure of glycerolipids is a glycerol 
(propane-1,2,3-triol) backbone to which are esterified 
hydrophobic acyl groups (fatty acids). Monoacyl- 
glycerols and diacylglycerols are fatty acid mono- 
and diesters of glycerol, respectively. These partial 
glycerides are important intermediates in metabolism. 
Triacylglycerols are fatty acid triesters of glycerol. 
Triacylglycerols are the major compounds of most 
natural storage lipids (Gurr et al. 2002). 

Glycerol Ethers 

Two general types of glycerol ethers are alkyl ethers 
and alk-l-enyl ethers (or plasmalogens). Mono- and 
dialkyl ethers as well as their acyl derivatives have 
been reported to occur naturally. Examples include 
batyl alcohol (5«-l-G-octadecylglycerol) and chimyl 
alcohol (s«-3-(9-tetramethylhexadecylglycerol) as 
monoalkyl ethers and diphytanylglycerol (sn-2,3-di- 
0-teramethylhexadecylglycerol) as a dialkyl ether. 
Similarly to alkyl forms, the mono- and dialk-l-enyl 
ethers of glycerol have been identified in natural sam¬ 
ples, whereas no trialk-1-enyl ethers have been iso¬ 
lated so far. Acylated (mono- and diacyl derivatives) 
alk-l-enyl ethers (or neutral plasmalogens) have been 
also reported (Gunstone et al. 2007). 
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Chemical Diversity of 
Lipids, Fig. 1 Examples of 
the most abundant 
phospholipids 
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Glycerophospholipids 

The structure of glycerophospholipids is characterized 
by a l,2-diacyl-3-phospho-OT-glycerol, or phosphati¬ 
dyl moiety, and a variable headgroup linked to the 
phosphate. Major phosphoglycerides found in nature 
are phosphatidylcholine (PC), phosphatidylethanol- 
amine (PE), phosphatidylglycerol (PG), phosphati- 
dylserine (PS), phosphatidylinositol (PI), and 
diphosphatidylglycerol (DPG) (or cardiolipin). The 
simplest phosphoglyceride, phosphatidic acid, is 
a minor compound in most tissues. Phosphatidylcho¬ 
line is the most abundant phospholipid in animal tis¬ 
sues (up to 50% of the total lipids) and the main 
component of nonchloroplast membranes in plants 
(Fig. 1) (Gurr et al. 2002). Phosphatidylethanolamine 
is widespread and usually the second most abundant 
phospholipid in animal and plant membranes (Fig. 1). 
It is also often common in microbial membranes. In 
animal tissues, phosphatidylethanolamine may exist in 
diacyl, alkylacyl, and alkenylacyl forms. The partly 
methylated derivatives of PE (phosphatidyl-N- 
monomethyl-ethanolamine, phosphatidyl-Af-dimethyl- 
ethanolamine) are found in small amounts in many 
organisms, and A'-acylphosphatidylethanolamine is 
found in plants, fungi, and in several animal tissues 
(Chapman 2004). Phosphatidylglycerol is the major 
phosphoglyceride in photosynthetic tissues and many 
bacteria (Fig. 1). In plants, phosphatidylglycerol is an 
only phospholipid present in significant quantities in the 


thylakoid membranes where it comprises about 10% of 
the total lipids. An unusual fatty acid, A^-trans- 
hexadecenoic acid (16:l(3t)), is found in all eukaryotic 
photosynthetic organisms at the sn-2 position 
(Tremolieres and Siegenthaler 1998). For bacteria, 
phosphatidylglycerol is present in almost all types. 
Moreover, a number of its unusual derivatives have 
been isolated from bacterial sources and the 
Haloarchaea (Roy 2009). Diphosphatidylglycerol (triv¬ 
ial name cardiolipin) has a unique dimeric structure with 
four acyl groups. It is common in bacteria, and it can be 
found in the inner mitochondrial membranes of eukary¬ 
otes. Phosphatidylserine is a widespread but minor lipid 
in eukaryotes, accounting usually for less than 10% of 
the total phospholipids. Its greatest concentration has 
been noted for myelin from brain tissue. Phosphatidy¬ 
linositol is another minor but important lipid, both as 
a membrane constituent and in various phosphorylated 
forms as key molecules in signaling processes in eukary¬ 
otes. The phosphatidylinositol phosphates are found 
with phosphates attached at any position of the inositol 
ring. However, PI-4-phosphate, Pl-4,5-bis phosphate, 
and their 3-phosphates are the most important. 

Glycosylglycerides 

In glycosylglycerides, a mono- or oligosaccharide is 
attached at the sn-3 position of the glycerol backbone. 
In photosynthetic bacteria, cyanobacteria, algae, and 
higher plants, glycosylglycerides (glycolipids) (Fig. 2) 
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glycoglycerolipids 



1,2-diacyl-3-0-{p-D-galactopyranosyl)-sn-glycerol 
(or monogalactosyidiacylglycerol, MGDG) 


OH 



1,2-diacyl-3-0-(a-D-galactopyranosyl)-(1 —► 6)-0-p-D-galactopyranosyl-sn-glycerol 
(or digalactosyidiacylglycerol, DGDG) 



1,2-diacyl-3-0-(6-deoxy-6-sulfo-a-D-glucopyranosyl)-sn-glycerol 
(or sulfoquinovosyidiacylglycerol, SQDG) 


are abundant in photosynthetic membranes. The major 
plastid lipids, galactosylglycerides, have one or two 
galactose molecules linked to the 5n-3 position of the 
glycerol corresponding to monogalactosyidiacyl¬ 
glycerol (MGDG) and digalactosyidiacylglycerol 
(DGDG) (Fig. 2). In addition to these main 
galactolipids, tri- and tetragalactosyldiacylglycerols 
as well as other homologues, with a large number of 
different sugar combinations (e.g., mannose, glucose, 
rhamnose), have been reported for bacteria, algae, and 
higher plants (Kates 1990; Guschina and Harwood 
2006). From animal tissues, various gluco- and 
galactoglycerolipids as well as their alkylacyl, diacyl, 
and sulfated forms have been isolated in small amounts 
(Gunstone et al. 2007). 

Another class of glycosylglyceride, which is pre¬ 
sent in appreciable amounts in both photosynthetic and 
in nonphotosynthetic species, is the plant sulfolipid. 


sulfoquinovosyidiacylglycerol (SQDG) (Fig. 2). In the 
unicellular alga Chlamydomonas reinhardtii, an 
acylated derivative of this sulfolipid, 2'-G-acyl- 
sulfoquinovosyldiacylglycerol, has been found. 
Sulfoquinovosylmonogalactosylglycerol has been iso¬ 
lated from the marine red alga, Gracilaria verrucosa 
(for review on algal lipids, see Guschina and Harwood 
2006, 2009). 

In addition to sulfoquinovosyidiacylglycerol, the 
marine diatom Nitzschia alha has been found to con¬ 
tain a sulfonium analogue of phosphatidylcholine, 
phosphatidylsulfocholine (Guschina and Harwood 
2006). A number of lipids conjugated to taurine 
(ethanolaminesulfonic acid) have been reported for 
many organisms. So membrane lipids of certain bacteria 
and algae contain taurine linked to diacylglucuronyl- 
glycerol. For a review on the isolation, characterization, 
and biochemistry of taurolipids, see Kaya (1992). 
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Chemical Diversity of 
Lipids, Fig. 3 

1,2-diacylglyceryl-3-0- 
4'-(W,W,W-trimethyl)- 
homoserine (DOTS), an 
example of betaine lipids 



Betaine Lipids 

Betaine lipids contain a betaine moiety as a polar group 
which linked to the 5/7-3 position of glycerol by an 
ether bond. There is no phosphorus or carbohydrate 
group in betaine lipids. Three types of betaine lipids 
have been identified: l,2-diacylglyceryl-3-0- 
4'-(A^,Ai^,A^-trimethyl)-homoserine (DOTS) (Fig. 3), 

1.2- diacylglyceryl-3-(9-2'-(hydroxymethyl)-(A^,A',Af- 
trimethyl)-P-alanine (DGTA), and 1,2-diacylglyceryl- 
3-0-carboxy-(hydroxymethyl)-choline (DGCC). Betaine 
lipids are not found in higher plants but are quite 
widely distributed in photosynthetic bacteria, algae, 
ferns, bryophytes, lichens, and some fungi and pro¬ 
tozoans (Dembitsky 1996). 

Diol Lipids 

Lipids of various animals, plants, and microorganisms 
frequently contain minor amounts of diol lipids, lipo¬ 
philic derivatives of ethyleneglycol, and related lower 
dihydric alcohols. Small quantities of diacylpropane- 

1.3- diol, diesters of butane-1,3-diol, and butane- 

1.4- diol are found in mammalian and fish liver, 
mammalian adipose tissues, egg yolk, com seed, and 
yeast (Gunstone et al. 2007). From Actynomyces 
olivaceus, mixed acyl- and alk-l-enyl derivatives of 
simple diols have been isolated. The relatively large 
amounts of these lipids of l-alk-l'-enyl-2-acyl type (up 
to 35-60% of neutral lipids) have been reported for the 
starfish Distolasterias nipon, the mollusk Acmaea 
pallida, and the sea urchin Strongylocentrotus nudus 
as a basic form of reserve lipids (Gunstone et al. 2001). 

SphingoMpids 

Sphingolipids are lipids that contain sphingosine (trans- 
D-e/3'r///'o-l,3-dihydroxy-2-amino-4-octadecene) or a 
related amino alcohol instead of glycerol. Sphingolipids 
are a group of very complex lipids with a range of 


diverse functions. Sphingolipids are involved in many 
human diseases including diabetes, cancers, neurologi¬ 
cal syndromes, infections, and diseases of the cardio¬ 
vascular and respiratory systems (Zheng et al. 2006). In 
plants, they are important for abiotic stress resistance, 
plant development, and plant-pathogen interactions 
(Chen et al. 2010). 

Sphingophospholipids and 
Glycosphingophospholipids 

Sphingomyelin (or ceramide phosphorylcholine) 
is a common lipid in this class and is designated as 
the phosphorylcholine ester of an A-acylsphingosine 
(or ceramide) (Fig. 4). It is a major lipid in certain 
membranes in animal tissues (e.g., plasma membrane). 
Sphingomyelin has only minor importance 
in plants, and it seems absent from microorganisms. 
Ceramide phosphorylinositol, the sphingolipid 
analogue of phosphatidylinositol, is a major 
component of sphingolipids in yeasts, 
e.g., Saccharomyces cerevisiae. Hexose-hexuronic- 
inositolphosphoceramide and A-acetyl-hexosamine- 
hexuronic-inositolphosphoceramide are the major 
polar sphingolipids isolated from leaves of 
Arabidopsis thaliana, soybean, and tomato (Chen 
et al. 2010). Ceramide phosphorylethanolamine, the 
sphingolipid analogue of phosphatidylethanolamine, 
has been reported for insects, bacteria, some pro¬ 
tozoa, snails, marine bivalves, and also for chicken 
and rat liver (Gunstone et al. 2007). Ceramide 
phosphorylglycerol is known as a membrane com¬ 
ponent in anaerobic bacteria of the genus 
Bacteroides and in the oral pathogen 
Porphyromonas gingivalis (Olsen and Jantzen 2001). 

The first glycolipids identified in some plants were 
glycolipids with phytosphingosine and/or 
dihydrosphingosine backbone containing both inositol 
and phosphate. They have been named “phytoglyco- 
sphingolipids,” and more than 20 molecular forms 
have been identified with glucuronic acid, glucos¬ 
amine, and many other carbohydrate moieties. 
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Fig. 4 Sphingomyelin (or 
ceramide phosphorylcholine) 



The analysis of such compounds has been thoroughly 
discussed by Sperling and Heinz (2003). 
Archaebacteria have been shown to contain various 
phosphoglycolipids (Kates 1990). 

Ceramides 

Ceramide is formed by attachment of a fatty acid 
(linked by an amide bond) to the amino group of 
sphingosine or other related amino alcohol. The most 
commonly found sphingosyl alcohols are D-erythro- 
sphinganine (dihydrosphingosine, sphinganine), 
D-erythro-sphingosine (sphingosine, 4-sphingenine), 
C20-dihydrosphingosine (icosasphinganine), and 
4-hydroxy-D-erythro-sphinganine (phytosphingosine, 
4-hydroxysphinganine). Although ceramides are usu¬ 
ally found at low levels in tissues, they play important 
biological roles (Gurr et al. 2002). 

Cerebrosides 

Cerebrosides are glycosides of A-acyl long-chain 
bases (ceramides). Galactose and glucose are the 
monosaccharides usually found. Galactosylceramide 
is the principal glycosphingolipid in brain tissue 
(Gunstone et al. 2007). In general, galactosylceramides 
are present in all nervous tissues, accounting for up to 
2% of the dry weight of gray matter and 12% of white 
matter. Oligodendrocytes contain large amounts of 
these lipids. 

Glucosylceramides are considered as characteristic 
compounds of plants but also found in low amounts in 
animal tissues, such as spleen, nervous tissues, and 
erythrocytes. In plants, they are abundant in photosyn¬ 
thetic tissues (Sperling and Heinz 2003). From 
nonphotosynthetic tissues, monoglycosylceramides 
containing a P-D-mannopyranosyl residue have been 
isolated (Gunstone et al. 2007). 

Oligoglycosylceramides 

Di- and oligoglycosphingolipids (with two or more 
sugar residue linked to a ceramide base) are important 
components of cellular membranes of most eukaryotic 


organisms and bacteria. Lactosylceramide is the most 
abundant diglycosphingolipids (or diosylceramide) 
which is found in most animal tissues at low levels. 
In kidney and pancreas, galabiosylceramide has been 
also found. Diosylceramide with mannose units has 
been reported for some primitive animal species and 
in plants (Gunstone et al. 2007). 

Neutral oligoglycosylceramides containing three and 
up to over 20 sugar units in the chain are characteristic 
of some animal tissues. There are seven main types of 
these compounds based on their structural and biosyn¬ 
thetic relationships. In each type, glucose is the primary 
sugar linked to ceramide with galactose, A-acetylglu- 
cosamine, A-acetylgalactosamine, or mannose as addi¬ 
tional monosaccharides (Gunstone et al. 2007). 

In plants, two series of oligoglycosylceramides with 
either mannosyl or galactosyl residues are synthesized 
by elongation of glucosylceramide. They are located in 
the endoplasmic reticulum, Golgi, tonoplast, and 
plasma membranes. Their functions in plants are not 
well known (Sperling and Heinz 2003). 

Gangliosides 

Gangliosides are glycosphingolipids that contain 
a ceramide linked to a glucosylgalactosylsialic acid 
(P-D-A-acetylneuraminic acid or sialic acid) residue 
via glycosidic linkages to one or more of the monosac¬ 
charide units, i.e., via the hydroxyl group on the posi¬ 
tion 2, or to another sialic acid residue. The level of 
these lipids is around 6% of total lipids in the brain 
tissues. Neuronal membranes contain 10-12% gangli¬ 
osides of the total lipid content. They are also present at 
low levels in all animal tissues but have not been found 
outside the animal kingdom (Gunstone et al. 2007). 

Glycosphingolipid Sulfates 

Glycosphingolipids with a sulfate ester group attached 
to the sugar moiety are termed glycosphingolipid sul¬ 
fates (sulfatides or sulfoglycosphingolipids). The dis¬ 
tribution of these compounds in mammalian tissues as 
well as their fatty acid composition and base 
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composition are usually similar to those of cerebro- 
sides (Gunstone et al. 2007). 

Phosphonolipids 

Phosphono analogues of various phospholipids consist 
of aminoethylphosphonic acid residues which may be 
linked through a phosphorus-carbon bond to either 
a ceramide or diacylglycerol backbone of 
phosphonolipids. Ceramide aminoethyl phosphonate 
(or ceramide ciliatine) was the first phosphono ana¬ 
logue of sphingosylphosphatides which has been iden¬ 
tified in sea anemones. Lately, it has been also detected 
in a number of mollusks, protozoa, bacteria, and in 
bovine brain tissue together with its A^-methyl deriva¬ 
tive. In some bacterial species, phosphonolipids with 
a 1-hydroxy-2-aminoethane residue linked to the phos¬ 
phorus moiety have been also detected. Natural prod¬ 
ucts containing carbon-phosphorus bonds (phosphonic 
and phosphinic acids) and their biosynthesis were 
reviewed by Metcalf and van der Donk (2009). 

Arsenolipids 

Lipid-soluble arsenic compounds were first identified 
from marine organisms over 30 years ago. By now, 
more than 100 naturally occurring arsenolipids have 
been detected in a wide range of organisms, including 
lichens, fungi, freshwater and marine algae, inverte¬ 
brates, fishes, plants, and animals (Dembitsky and 
Levitsky 2004). Many species of bacteria and algae 
seem to be capable of converting arsenic compounds 
into their methylated derivatives, such as trimethylarsine. 
Structures, distribution, the possible metabolism of 
arsenolipids, as well as analytical techniques used for 
their separation, identification, and quantification are 
thoroughly reviewed in Dembitsky and Levitsky (2004). 

Other Esters 

A group termed “other esters” includes a large variety 
of functionally diverse lipids. Wax esters are a typical 
example. The term “wax” is commonly used for esters 
of long-chain fatty acids with long-chain primary alco¬ 
hols, but sometimes it is used for the entire mixture of 
lipids containing waxes as a part. Ester waxes are 


present in both plants and animals where they form 
the water-repellent surface coating, e.g., skin surface 
of animals and the leaf cuticle. Esters of normal alco¬ 
hols with mono- or multibranched fatty acids have 
been found in the preen glands of birds (Gunstone 
et al. 2007). Complex waxes (where either the fatty 
acid or the alcohol component or both has a complex 
structure) have been isolated from some bacteria. Dies¬ 
ters of phthiocerols (C33-C35 branched-chain diols) 
with mycocerosic acids (C29-C32 branched-chain 
acids) are examples of these compounds which have 
been reported for Mycobacterium spp. (Barry et al. 
1998). 

Two types of diester waxes are characteristic of 
animal skin-surface lipids. In the first, a hydroxyl 
group of hydroxy fatty acid is esterified to a normal 
fatty acid whereas its carboxyl group is attached to 
a fatty alcohol. In the second type, waxes consist of 
an alkane a,P-diol in which both hydroxyls are ester¬ 
ified with fatty acids (Gunstone et al. 2007). 

Fatty Acids 

Eatty acids are usually aliphatic monocarboxylic acids. 
Over 1,000 fatty acids are known with different chain 
length, positions, configurations, and types of 
unsaturation. Addition of various functional groups 
along the aliphatic chain brings more structural and 
functional diversity to this group. 

Saturated Fatty Acids 

In most natural lipids, straight- or normal-chain satu¬ 
rated components (even-numbered) account for up to 
10-40% of the total fatty acids. Straight-chain fatty 
acids with 16 and 18 carbon atoms are the most com¬ 
mon naturally occurring compounds in animals and 
plants although odd- and even-numbered homologues 
from 2 C to 36 C carbon atoms have been reported 
from natural samples (Gunstone et al. 2007). In gen¬ 
eral, fatty acids do not exist as free carboxylic acids 
because of their high affinity for many proteins. Table 1 
shows the examples of the most important straight- 
chain saturated fatty acids together with the informa¬ 
tion on their distribution. 

Branch-Chain Fatty Acids 

Branch-chain fatty acids are usually saturated and have 
a methyl group as the branch. In marine animals, 
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Chemical Diversity of Lipids, Table 1 Straight-chain saturated fatty acids 


Number of 
carbon atoms 

Systematic name 

Common name 

Melting point (°C) 

Occurrence 

2 

Ethanoic 

Acetic 

16.7 

Esterified to glycerol in ruminant milk fats in some 
plant triacylglycerols and as alcohol acetates in many 
plants. As a component of platelet-activating factor 

3 

Propanoic 

Propionic 

-22.0 

In the rumen 

4 

Butanoic 

Butyric 

-7.9 

In milk fat of ruminants, in the rumen 

6 

Hexanoic 

Caproic 

-8.0 

In milk fat 

8 

Octanoic 

Caprylic 

12.7 

Minor compound of most animal and plant fats, 
major component of milk, and some seed 
triacylglycerols 

10 

Decanoic 

Capric 

29.6 

Minor compound in many tissues. Major acid in 
milk and some seed triacylglycerols 

12 

Dodecanoic 

Laurie 

42.2 

Widely occurring, major acid in some seed fats, e.g., 
coconut oil and palm kernel oil 

14 

Tetradecanoic 

Myristic 

52.1 

Widespread, occasionally as a major acid 

16 

Hexadecanoic 

Palmitic 

60.7 

The most common saturated fatty acid in animals, 
plants, and microorganisms 

18 

Octadecanoic 

Stearic 

69.6 

Major fatty acid in animal tissues and some fungi, 
usually minor in plants, major only in a few plant 
species (e.g., cocoa butter) 

20 

Eicosanoic 

Arachidic 

75.4 

Usually minor compounds and only occasionally 
significant 

22 

Docosanoic 

Behenic 

80.0 

Minor component in some seed triacylglycerols and 
plant waxes 

24 

Tetracosanoic 

Lignoceric 

84.2 

Widespread as a constituent of sphingolipids, minor 
acid in some seed triacylglycerols and plant waxes 

26 

Hexacosanoic 

Cerotic 

87.7 

Usual as component of plant and insect waxes 

28 

Octacosanoic 

Montanic 

90.9 

Major acid in some plant waxes 


unsaturated branched-chain fatty acids have been 
found, and branches other than methyl are present in 
microbial lipids. Two distinct types, which are often 
found in bacteria, are the wo-methyl-branched fatty 
acids with the branch point on the penultimate carbon 
(one from the end) and anfetlso-methyl-branched fatty 
acids with the branch point on the antepenultimate 
carbon atom (two from the end). 

Monoenoic (Monounsaturated) Fatty Acids 

Straight- or normal-chain (even-numbered) monoenoic 
fatty acids (with one double bond) amount to 
a signihcant proportion of the total fatty acids in most 
natural lipids. The double bond is usually of the cis- or 
Z-configuration, but some fatty acids with trans- or 
£-double bonds are found. The most common 
monoenes are straight-chain compounds with 16 or 18 
carbon atoms. Table 2 shows examples and the occur¬ 
rence of the most abundant monoenoic fatty acids. In 
addition to the normal ethylenic double bonds, some 


fatty acids possess acetylenic bonds. They have been 
found in rare seed oils and some moss species. 

Methylene-Interrupted Polyunsaturated Acids 

Signihcant amounts of polyunsaturated fatty acids 
(PUFA) with methylene-interrupted (with two or 
more double bonds of the cA-conhguration separated 
by a single methylene group) are found in all higher 
organisms. In higher plants, the number of double 
bonds Is usually two or three, but in algae and other 
lower plants, as well as in animals, there can be up to 
six (Table 3). The n-6 and n-3 families are two princi¬ 
pal families of polyunsaturated fatty acids occurring in 
nature and derived biosynthetically from linoleic 
{9-cis, 12-cA-octadecadienoic, or C18:2n-6) and 
a-linolenic (9-cis, 12-cis, 15-cA-octadecatrienoic, or 
C18:3n-3) acids, respectively. Both of these fatty acids 
are synthesized in plants and algae but not usually in 
animals. In animal biology, long-chain metabolites of 
linoleate and linolenate may be very important. The 
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Chemical Diversity of Lipids, Table 2 Monoenoic fatty acids 
No. of Melting 


C atoms 

Systematic name 

Common name 

point (°C) 

Occurrence 

16 

fra/ti-3-hexadecenoic 


53 

Plant leaf tissues, eukaryotic algae, Exclusively as an acyl 
chain of phosphatidylglycerol in chloroplasts 

16 

c;^-5-hexadecenoic 



Bacilli 

16 

cis-1 -hexadecenoic 



Bacteria, algae, higher plants 

16 

c/i-9-hexadecenoic 

Palmitoleic 

1 

Wide-distributed in microorganisms, plants and animals. 
Major components in some seed oils 

18 

c/^-6-octadecenoic 

Petroselenic 

33 

Major acid in seed oil of the Umbelliferae family 

18 

c/^-9-octadecenoic 

Oleic 

16 

Most abundant in plants and animals in both structural and 
storage lipids. Also found in microorganisms 

18 

frani-9-octadecenoic 

Elaidic 

44 

In the rumen and in hydrogenated margarines 

18 

trans -11 -octadecenoic 

Vaccenic 

44 

In rumen fats as a result of hydrogenation of polyunsaturated 
fatty acids 

18 

cis -11 -octadecenoic 

Cis-vaccenic 

15 

Common in bacteria, minor acid in most animal and plant lipids 

20 

cw-ll-eicosenoic 

Gondoic 

24 

In fish oils and rape seed oil 

22 

cis- 13-docosenoic 

Erucic 

34 

Major component of rapeseed oil and oils of Cruciferae family, 


small amounts in animal tissues and fish oil 


functions of arachidonic (C20:4n-6), eicosapentaenoic 
(C20:5n-3), and docosahexaenoic (C22:6n-3) acids 
make them also conditionally essential for humans 
(Cunnane 2003). 

Bis- and Polymethylene-Interrupted Acids 

Fatty acids with his- or polymethylene-interrupted 
double bonds, or a mixture of methylene- and 
polymethylene-separated unsaturation, are found in 
some plants and marine organisms. For example, 
from sponges and some other marine invertebrates, 
a wide range of such fatty acids, the demospongic 
acids, have been isolated. They have hii-methylene- 
interrupted cw-double bonds and chain lengths (both 
odd and even) ranging from C16 to C34. Double bonds 
in positions 5 and 9 are usual, although additional 
bonds may be present in position 7 and/or 9 together 
with methyl branching. 

Conjugated Fatty Acids 

Fatty acids with two or more conjugated double bonds 
are found in some plants and animals. In tissues of 
ruminant animals (and, hence, in meat and dairy prod¬ 
ucts), fatty acids with conjugated diene system were 
detected as intermediates or by-products in the 
biohydrogenation of linoleic acid by microorganisms 
in the rumen (Wahle et al. 2004). The main isomer, 
9-c«,l 1-fran^-octadecadienoic acid, may account up 
to 1% of the total fatty acids of milk fat. The only 
conjugated dienoic fatty acid isolated from plant 


sources (the seed oil of Chilopsis linearis) is reported 
to be fran^-10,fra«y-12-octadecadienoic acid. In con¬ 
trast, fatty acids with conjugated triene systems have 
been detected in various plant species. Tung oil is 
the commercial source of the most widespread fatty 
acid from this group, 9-cis,ll-trans,13-trans- 
octadecatrienoic (tx-eleostearic) acid (Gurr et al. 
2002; Gunstone et al. 2007). 

Cyclic Fatty Acids 

Cyclic fatty acids with a carbon ring alone or at the end 
of the alkyl chain occur naturally in plants, especially in 
certain seed oils and in microorganisms. Cyclopropane 
fatty acids are occasionally reported from marine ani¬ 
mals and may be synthesized by symbiotic bacteria. In 
addition, a variety of carbocyclic structures are formed 
from methylene-interrupted polyenes during food 
processing (Gurr et al. 2002; Gunstone et al. 2007). 

Fatty Acids with Oxygen-Containing Functional 
Groups 

A number of fatty acids and their metabolites have 
oxygen-containing functional groups, most commonly 
a hydroxyl, epoxide, or furan ring. For example, 
ricinoleic acid (7?-12-hydroxy-9-cA-octadecenoic 
acid) accounts for 80-90% of fatty acids in castor oil 
(from Ricinus communis). 2-Hydroxy or a-hydroxy 
acids have been shown to occur in sphingolipids, skin 
lipids, wool wax, bacterial cell wall lipids, and in some 
seed oils. 3-Hydroxy or P-hydroxy acids are present in 
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Chemical Diversity of Lipids, Table 3 

Polyunsaturated fatty acids 


No. of C atoms 

Systematic name 

Common name 

Melting point (°C) 

Occurrence 

Dienoic acids 

18 

cis,cis-6,9- 

Octadecadienoic 

Petroslinoleic 

-11 

Minor acid in animal tissues 

18 

cis,cis-9,l2- 

Linoleic 

-5 

Major acid in algae and plant tissues 





Essential components for mammals 

Trienoic acids (methylene interrupted) 

16 

All-cw-7,10,13- 
hexadecatrienoic 



Algae and higher plants 

18 

All-cA-6,9,12- 

y-Linolenic 


Minor component of animal tissues, found in 


octadecatrienoic 



a few seed oils (e.g., evening primrose oil) 

18 

All-cw-,9,12,15- 

octadecatrienoic 

a-Linolenic 

-11 

Higher plants and algae, component of 
galatosyldiacylglycerols 

Trienoic acids (conjugated) 

18 

cis-9, trans-ll,trans- 
13-Octadecatrienoic 

Eleostearic 

49 

Occasional in some seed oils, especially tung oil 

Tetraenoic acids 

16 

All-cA-4,7,10,13- 

hexadecatetraenoic 



In some algae, e.g., Chlorella spp. 

20 

All-cw-5,8,11,14- 

Arachidonic 

-49.5 

Most abundant PUFA of animal phospholipids. 


eicosatetraenoic 



Common in some algae and bryophytes 

Pentaenoic acids 

20 

All-cw-5,8,11,14,17- 

Timnodonic 


In marine algae and fish oils as a major 


eicosapentaenoic 



component, ubiquitous component of animal 
phospholipids 

22 

All-cw-7,10,13,16,19- 
docosapentaenoic 



In marine organisms, ubiquitous component of 
animal phospholipids 

Hexaenoic acids 

22 

All-cis-4,7,10,13,16,19 

docosahexaenoic 

- Clupanodonic 


In animals as phospholipid component, in fish 
and some algae 


bacterial lipids. Vernolic acid (or cw-12,13-epoxy- 
octadec-cw-9-enoic acid) was the first naturally occur¬ 
ring epoxy fatty acids isolated from the seed oil of 
Vernonia anthelmintica. Furanoid fatty acids were 
first detected in hsh oils, but they originate from pho¬ 
tosynthetic organisms. They are now isolated from 
a wide range of organisms including yeasts, algae, 
marine bacteria, as well as plants (seeds, leaves, and 
fruits) (Gurr et al. 2002). 

Terpenes 

Terpenes (also known as terpenoids and isoprenoids) 
are probably the largest and widespread group of nat¬ 
ural products. Approximately 25,000 terpene struc¬ 
tures have been reported. A building block of 
terpenes is the hydrocarbon isoprene (methylbuta- 
1,3-diene, named hemiterpene, with five carbon 


atoms). Terpenes therefore have molecular formulas 
(C 5 Flg)n and can be classified according to the number of 
isoprene units. Terpenoids represent a group of modified 
ferpenes where methyl groups are moved or removed, or 
oxygen atoms added (Holstein and Hohl 2004). 

Monoterpenes 

Monoterpenes consist of two isoprene units, and they 
may be linear (acyclic) or contain rings. 
Monoterpenoids can be produced from monoterpenes 
through oxidation and rearrangement reactions. Exam¬ 
ples of monoterpenes are menthol, camphor, nerol, 
pinene, limonene, and geraniol. Monoterpenes (together 
with sesquiterpenes) are the main compounds of essen¬ 
tial oils with a number of allelochemical functions. 

Sesquiterpenes 

Sesquiterpenes are formed by condensing of fhree 
isoprene molecules, and they are the most 
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diverse group of isoprenoids including several acyclic, 
mono-, bi-, and tetracyclic derivatives. Two common 
acyclic sesquiterpenes, farnesol and nerolidol, are 
isolated from essential oils of various sources. 
Abscisic acid is an example of cyclic terpenes. 
It plays an important role in plants in the 
regulation of water exchanges as well as in plant 
development. 

Diterpenes 

Diterpenes are composed for four isoprene units with 
a molecular formula C 20 H 32 , and they are derived from 
geranylgeranyl pyrophosphate. They are predomi¬ 
nantly fungal and plant origin, isolated from resins 
and gummy exudates, e.g., abietic acid from pine and 
cafestol and kahweol from coffee beans. Many 
diterpenes possess very important biological function¬ 
alities, including vitamin A activity (retinol), plant 
growth and germination regulators (e.g., gibherellin), 
reproduction hormones in fungi (e.g., trisporic acid), 
disease resistance molecules, or phytoalexins (e.g., 
casbene and podocarpic acid), some anticancer agents 
(e.g., Taxol), and natural cannabinoids. Oxygenated 
acyclic diterpenes are found in geranylgeraniol or 
phytol as a part of chlorophyll and the building blocks 
of vitamin E and K. 

Sesterpenes 

Sesterpenes consist of five isoprene subunits, and they 
are derived from geranylfamesol pyrophosphate. They 
have been isolated from insect waxes and several spe¬ 
cies of fungi. They are common compounds of marine 
sediments. Various unsaturated and branched 
sesterpenes, haslenes, were determined in diatom 
algae. Marine sponges synthesize a number of 
sesterpenes with unusual skeleton types which possess 
antimicrobial activities. 

Triterpenes 

Triterpenes contain six isoprene units and have a C30 
skeleton structure. They are synthesized by the cycli- 
zation of squalene, and more than 20,000 triterpenes 
are known in nature. They may occur either in a free 
form or as glycosides (saponins) or other derivatives 
(Volkman 2005). According to their skeletons, 
triterpenoids can be divided into acyclic, mono-, bi-, tri- 
, tetra-, and pentacyclic triterpenes. The most studied 
triterpenes are tetracyclic triterpenes (e.g., protostanes, 
cycloartanes, dammaranes, and euphanes) and 
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pentacyclic triterpenes (e.g., ursanes, gammaceranes, 
lupanes, oleananes, and hopanes) (Sheng and Sun 2011). 

Steroids 

Steroids are derivatives of triterpenes with 26 or more 
carbon atoms with a specific and characteristic 
rearrangement of four cycloalkane rings which are 
joined together. 

Plant and animal tissues contain large amount of 
steroids whereas bacteria synthesize only small 
amounts of these compounds. Cholesterol (cholest-5- 
en-3P-ol) is the most abundant sterol in animal tissues 
and some fungi (Fig. 5). It plays a vital role in mem¬ 
branes in the lipid metaholism in general being 
a precursor in biosynthesis of bile acids, vitamin D, 
and steroid hormones. In addition to cholesterol, 
a number of other sterols are present in animal tissues 
in small amounts. Most of these sterols are intermedi¬ 
ates in the pathway from lanosterol, which has 
been first found in wool wax, to cholesterol. 
Further examples of animal sterols include 7- 
dehydrocholesterol from the skin, desmosterol from 
the brain tissues, lathosterol from human serum, and 
cholestanol and coprastanol which are formed by 
microbial biohydrogenation of cholesterol in the 
intestines. 

Plant sterols are rather different from sterols of 
animals. The basic sterol structure for other sterol 
derivatives is defined as 5a-cholesten-3p-ol. The 
group of plant sterols includes campesterol, p-sitos- 
terol, stigmasterol, and A5-avenasterol as the more 
abundant compounds. Yeasts and fungi contain ergos- 
terol as a main sterol. Prokaryotic organisms, such as 
bacteria and blue-green algae, do not usually contain 
sterols found in plants and animals, but many species 
contain pentacyclic triterpenoids called the hopanoids. 
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They are based on a hopane skeleton with 
a cyclopentane E-ring and are structurally highly 
diverse. In prokaryotic cell, the hopanoids are also 
important membrane constituents (Volkman 2003). 

Tetraterpenes 

Tetraterpenes consist of eight isoprene units, thus hav¬ 
ing a C40 skeleton. Carotenoids, naturally occurring 
pigments, represent an important subgroup of 
tetraterpenoids. They are predominant in the chloro- 
plasts and chromoplasts of photosynthetic algae and 
plants but are also found in some fungi, bacteria, and 
one species of aphid. Over 600 carotenoids have been 
identified. They may be acyclic (seco-carotenoids) or 
cyclic (mono- or bi, alicyclic aryl). Xanthophylls are 
a large subclass of carotenoids containing oxygen in 
the form of a hydroxyl, methoxyl, carbonyl, oxo, for¬ 
myl, or epoxy group. In photosynthetic organisms, 
carotenoids are involved in several aspects of photo¬ 
synthesis, notably light absorption and energy transfer 
to the reaction center complex as well as protection of 
the photosynthetic apparatus from photodamage. In 
humans, a-carotene, P-carotene, and P-cryptoxanthin 
are provitamin A carotenoids which can be converted 
by the body to retinol (vitamin A), and these and other 
carotenoids (e.g., lutein, zeaxanthin, lycopene) can act 
as antioxidants (Fig. 6). 

Polyisoprenoids 

Polyisoprenoids contain multiple isoprene units. 
Natural polymers, such as rubber, gutta-percha, and 
chicle, are examples of polyisoprenoids. Rubber 
consists of c;s-1,4-polyisoprene molecules that confer 
stretchiness, whereas gutta-percha is the tran^-isomer, 
and not elastic. Chicle, used to make chewing gum, is 
a polymer with both cis- and f;a«j-linkages. 
Polyisoprenols (polyisoprenoid alcohols) are minor 
but ubiquitous membrane compounds of most living 
organisms from bacteria, plants, and mammals. Their 
chain length varies from 11 isoprene units in 


eubacteria to 20 in pigs and humans. In plants, 
the usual range is from 8 to 22 units. In Arahidopsis 
and many other higher plants, solanyl pyrophosphate, 
which is the most common mani-polyisoprenol with 
nine isoprene units, is essential in the biosynthesis of 
plant quinones (Swiezewska and Danikiewicz 2005). 

Summary 

Lipids are a particularly diverse group of molecules 
because they are defined by their solubility properties 
rather than by their chemistry. Nevertheless, many 
lipids contain fatty acids as part of their structure. 
This section describes the diversity of lipids in terms 
of different classes which make up the main membrane 
or storage constituents. Some of these molecules are 
also (or give rise to) signaling molecules. 

Cross-References 
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Synonyms 

CSI 

Definition 

The chemical shift index or CSI is a simple graphical 
method that can be used to display and identify the type, 
location, and extent of secondary structures (helices, 
beta strands, and random coil regions) in polypeptides 
using only backbone chemical shift information. Orig¬ 
inally developed for interpreting *Ha chemical shifts, 
the chemical shift index exploits the well-known ten¬ 
dency of ^Ha shifts to be shifted upheld of their random 
coil values in helices and downheld of their random coil 
values in beta strands. To facilitate the identihcation of 
secondary structures in polypeptides from these shift 
tendencies, the chemical shift index employs a residue- 
specihc “noise hlter” to generate a visually pleasing, 
three-state or ternary (— 1,0, J- 1 ) index of chemical shift 
values. More specihcally, if the upheld 'Ha shift 
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(relative to a residue-specific random coil value) of 
a given residue is greater than 0.1 ppm then that residue 
is given a value of —1. Likewise, if the downfield ^Ha 
shift of a given residue is greater than 0.1 ppm then that 
residue is given a value of -i-l. If a residue’s chemical 
shift is not shifted sufficiently upheld or downheld, it is 
given a value of 0. When this ternary index is plotted 
as a bar graph, simple visual inspection allows the 
identihcation of helices (clusters of — 1 values), beta 
strands (clusters of -i-l values), and random coil regions 
(clusters of 0 values). Using only *Htx chemical 
shifts and simple clustering rules, the CSI is typically 
80-85% accurate in the identihcation of secondary 
structures. A consensus CSI method that hlters 
upheld/downheld chemical shift changes in *^Ca, 
*^CP, and *^C’ atoms in a similar manner to *Ha shifts 
has also been developed. The consensus CSI combines 
the CSI plots from backbone 'H and '^C shifts to 
generate a single CSI plot. It is typically 85-90% accu¬ 
rate. Proper implementation of the consensus CSI 
requires careful chemical shift referencing and the 
appropriate choice of “random coil” chemical shift 
values. As a general rule, the CSI method is more 
accurate at identifying helices than beta strands. Like¬ 
wise, the CSI method tends to work better for identify¬ 
ing longer (>6 residues) secondary structures than very 
short (<4 residue) secondary structures. Since its orig¬ 
inal description in 1992, the CSI has been used to 
characterize the secondary structure of thousands of 
peptides and proteins. Its popularity is largely due to 
the fact that it is easy to understand and can be 
implemented without the need for a specialized com¬ 
puter program. Nevertheless, a number of commonly 
used data processing programs (NMRView) and struc¬ 
ture generation web servers (CS23D, RCI, PANAV) 
have incorporated CSI into their protocols. Several 
improvements to the CSI method have also been pro¬ 
posed over the past 10-15 years. These include: 
(1) a probabilistic approach to secondary structure iden¬ 
tification (PSSI), (2) a method that combines secondary 
structure predictions from chemical shift and sequence 
(PsiCSI), (3) a secondary structure identihcation 
approach that uses prespecihed chemical shift patterns 
(PLATON), (4) a prediction method that employs sta¬ 
tistically derived chemical shift/structure potentials 
(PECAN), and (5) a two-dimensional cluster analysis 
method known as 2DCSi. The performance of these 
newer methods is generally slightly better (2^%) 
than the original CSI method. 
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Introduction 

Membrane ► glycosphingolipids (GSLs) comprise 
over 400 molecular species according to carbohydrate 
structure, and this is increased by at least hve- to 
tenfold due to heterogeneity in the ceramide, primarily 
the fatty acid moiety, characteristic of all GSLs. 

The essentially unknown mechanism by which 
some GSLs accumulate in cholesterol-enriched 
microdomains or “lipid rafts,” to impinge an ever- 
increasing number of human diseases, suggests that 
the future synthesis of GSL analogues and mimics 
may have wide reaching impact in cell biology and 
pathology. The demonstration that the membrane lipid 
environment, particularly cholesterol (Yahi et al. 2010; 
Lingwood et al. 2011), can affect the carbohydrate 
conformation of the membrane-embedded GSL sug¬ 
gests GSL function could be conformation dependent. 
Thus, conformationally restricted GSL mimics may be 
the most useful. 

The functions of individual GSLs have, for the most 
part, remained an enigma. The notable exceptions in 
terms of the receptor function, GMl ganglioside for 
cholera toxin (CT) and globotriaosyl ceramide (Gb3) 
for verotoxin (VT, Shiga toxin), do not provide exam¬ 
ples of endogenous cellular function, but these ligands 
have provided the specific probes to investigate such 
function. It follows that when the function of these, and 
eventually, the GSLs for which there is no convenient 
bioactive probe, is ascertained, the generation of spe¬ 
cific GSL analogues will provide a means to manipu¬ 
late function. 

CT and VT show that these receptor GSLs can be 
asymmetrically distributed in heterogeneous 
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cholesterol-enriched lipid microdomains in model and 
cell membranes (Kaiser et al. 2009). It is this restricted 
cell membrane domain localization that likely plays 
the key role in GSL function since these domains can 
represent the foci for a variety of trans-membrane 
signaling pathways. Other GSLs can impinge on the 
lipid raft distribution of GMl, for example (Togayachi 
et al. 2010), and the heterogeneous nature of the GSL 
ceramide moiety, which is key for the GSL-cholesterol 
interaction (Mahfoud et al. 2009), will also play 
a central, but as yet largely unknown, role in assigning 
GSL membrane distribution. 

GSL Structures 

GSLs are divided into neutral and acidic, sialic acid- 
containing gangliosides. The neutral GSLs are further 
subdivided according to their core sugar sequences. 
Glucosyl ceramide is converted first to lactosyl 
ceramide by a (31-4 galactosyl transferase (lactosyl 
ceramide synthase), and lactosyl ceramide provides 
the common precursor for all the ganglioside and 
neutral GSL subgroups. The four main neutral GSL 
series are gloho series in which LacCer is substituted 
with al-4galactose, lacto series in which LacCer 
is substituted with pi-3 N acetylglucosamine pi-3 
galactose, neolacto series in which LacCer is 
substituted with pl-3 acetylglucosamine pi-4 galac¬ 
tose, and ganglio series in which LacCer is substituted 
with pl-3 acetylgalactosamine. The last is the gala 
series which is based on galactosyl instead of glucosyl 
ceramide and comprises galabiosyl ceramide and 
3'sulfogalactosyl ceramide only. 

The carbohydrate synthesis necessary for assembly 
of oligosaccharides for GSL and other glycoconjugate 
mimics is a distinct field of expertise which has been 
recently reviewed (Lepenies et al. 2010) and will not 
be considered further here. Rather, the ability to deriv- 
atize and functionalize native GSLs will be reviewed. 

GSL Role in Human Disease 

Pharmacological inhibition of GSL biosynthesis has 
identified a significant role for GSLs in many human 
diseases. The majority of GSLs (over 90%) are based 
on glucosyl ceramide, generated by the action of 
glucosyl ceramide synthase adding a glucose in 


P-linkage to the primary hydroxyl of ceramide. Regu¬ 
lation of glucosyl ceramide synthase activity is there¬ 
fore central to the regulation of total cellular GSL 
content. This enzyme has been the focus for the gen¬ 
eration of inhibitors able to prevent glycolipid biosyn¬ 
thesis (Lee et al. 1999; Mellor et al. 2004). However, 
these inhibitors result in the inhibition of all (glucosyl 
ceramide based) GSLs. Since disease pathology is 
mostly based on a single GSL species, more selective 
approaches for the depletion of single or subsets of 
GSLs are likely to decrease off-target effects. 

Nevertheless, glucosyl ceramide synthase inhibitors 
have shown GSLs to play a key role in many important 
human diseases. In cell and animal models of disease, 
such inhibition of GSL synthesis results in the reversal 
of disease symptoms. These diseases include type 2 
diabetes (Aerts et al. 2007), arthrosclerosis (Bietrix 
et al. 2010), hepatic dyslipidemia (Bijl et al. 2009), 
obesity (Yew et al. 2010), polycystic kidney disease 
(Natoli et al. 2010), asthma (Karman et al. 2010), and 
cystic fibrosis (Norez et al. 2009). It is, however, 
unlikely that the aberrant GSL synthesis responsible 
for the symptoms in each case is the same. In addition, 
specific GSLs have been shown to play a central role in 
pathology of infection disease, most notably GMl in 
cholera (Heyningen 1974) and Gb3 in verotoxin (Shiga 
toxin)-induced hemolytic uremic syndrome 
(Lingwood et al. 1987). Thus, preventing all GSL 
synthesis may be an unnecessarily heavy-handed 
approach to reversal of symptoms. However, at pre¬ 
sent, the selective pharmacological shutdown of the 
offending GSL species in any given pathology is not 
yet possible. If pharmacologically selective inhibitors 
towards the various cellular tyrosine kinases can be 
generated (Hartmann et al. 2009), it would seem that 
understanding the possible means to derivatize 
a specific GSL to generate a means to selectively 
inhibit the glycosyl transferases responsible for the 
synthesis of a single or subset of GSLs should be 
eminently feasible. Indeed, a simple concept would 
be to provide an alternative deoxy substrate analogue 
in which the acceptor hydroxyl function is missing. 

The fact that glucosyl ceramide synthase uses cer¬ 
amide, a highly bioactive molecule in its own right, as 
a substrate means that glucosyl ceramide synthase can 
regulate the cellular ceramide pool and thereby have 
effects independent of the nature of the GSLs made. 
Indeed, glucosyl ceramide synthase is associated 
with tumor cell drug resistance due to its depletion of 
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(cytotoxic) ceramide pools (Gouaze-Andersson and 
Cabot 2006). 

The fatty acid heterogeneity of GSLs has been long 
appreciated, but its biosynthetic basis has only become 
recently apparent with the identification of a series of 
fatty acid selective ceramide synthases. The role of 
fatty acid heterogeneity in ceramide signaling function 
has been previously largely ignored since many effects 
have been based on studies of the cellular effects of 
exogenous C2 ceramide, which likely circumvents 
functional subtleties between native ceramide fatty 
acid isoforms. The selective knockouts of different 
ceramide synthases generate remarkably different phe¬ 
notypes in mice indicating that the different fatty acid 
isoforms of ceramide or the GSLs into which they are 
converted have distinct properties (Ben-David and 
Futerman 2010). Although, as yet no ceramide 
synthase selective for hydroxy fatty acid acyl-CoA 
has been described, 2-hydroxy fatty acids can have 
a strong influence on GSL receptor function 
(Stromberg and Karlsson 1990; Tang et al. 2001) and 
the carbohydrate conformation of the GSL in which 
they are found (Calander et al. 1988; Yahi et al. 2010). 
The interaction between cholesterol and GSLs, which 
forms a basis for their membrane domain formation 
(Lingwood and Simons 2010), can alter GSL carbohy¬ 
drate from a membrane perpendicular to a membrane 
parallel conformation (Yahi et al. 2010; Lingwood 
et al. 2011) and thereby reduce exogenous ligand bind¬ 
ing (Mahfoud et al. 2010). This effect is ceramide fatty 
acid dependent (Mahfoud et al. 2009; Lingwood et al. 
2011) and indicates significant potential to alter GSL 
function using ceramide-modified mimics. Indeed, the 
model GSL membrane carbohydrate “thickness” is an 
inverse function of the cholesterol concentration 
(Lingwood et al. 201 1), implying a spectrum of inter¬ 
mediate carbohydrate conformations, providing scope 
for conformationally restricted analogues. This prop¬ 
erty is encompassed within the “aglycone” effect 
(Lingwood 1996) regulating GSL function. 

Chemical Modification of GSLs 

The functional derivatization of GSLs largely involves 
one of three areas: (1) substitution of the fatty acid 
within the ceramide moiety, (2) oxidation of the double 
bond within the sphingosine moiety of ceramide, or 
(3) modification of the carbohydrate moiety itself. 


GSL Ceramide Fatty Acid Substitution 

Substitutions of the sphingosine amino function of 
lysoGSLs is a common means of derivatization within 
the GSL lipid moiety. The reaction requires the acti¬ 
vation of the carboxyl function of the species to be 
coupled and the use of a carbodiimide-based catalyst. 
The coupling efficiency however remains low. We 
have designed a new coupling procedure with an effi¬ 
ciency of >98% using benzotriazole-1-yl-oxy-tris- 
(dimethylamino)-phosphonium hexafluorophosphate 
(BOP) as the means of carboxyl activation (Kamani 
et al. 2011). Under anhydrous conditions, a highly 
reactive ionic adduct with the carboxyl function is 
generated, driven by salt formation, (Fig. 1), and quan¬ 
titative coupling reaction proceeds smoothly at 
—70°C. The derivatized GSL product is readily iso¬ 
lated, and problems due to carbodiimide-generated 
toxic (to cells) coproducts are avoided. 

Neutral GSLs 

The lipid heterogeneity of natural GSLs, though 
largely ignored in terms of function, is nevertheless 
a problem in studying the function of a single GSL. 
Studies to replace ceramide fatty acid mixes to gener¬ 
ate homogeneous fatty acid isoforms of GSLs have 
been described (Sonnino et al. 1985; Zhai et al. 2006; 
Lingwood et al. 201 1). 

Fluorescent derivatives. C6 nitrobenzyoxadiazol 
(NBD) coupling to the amino function of lysoglyco- 
sphingolipids has provided a widely used means to 
generate fluorescent GSL analogues to monitor intracel¬ 
lular trafficking. These studies led to the development of 
similar fluorescent GSL analogues using borodidepur- 
omethanediflouride (BODIPY) which allows better 
quantitation since the emission spectmm is concentra¬ 
tion dependent and accumulation in cells, for example, 
in lipid rafts, can be readily detected. BODIPY GSL 
analogues, particularly that of lactosyl ceramide, were 
used as probes of the abnormal GSL trafficking charac¬ 
teristic of cells of lysosomal storage disease patients. 
These fluorescent analogues were also used in fluores¬ 
cent resonance energy transfer experiments to study 
GSL-binding ligands and membrane GSL clustering. 

Adamantyl derivatives. The fatty acid of several 
neutral GSLs has been exchanged with an a 
adamantane frame to generate water/membrane- 
soluble GSL mimics (Lingwood et al. 2006). Partition 
studies with sulfatide and with Gb3 derivatives showed 
that, unlike the parental GSL, the adamantyl derivative 
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Glycosphingolipids, 

Fig. 1 Fatty acid carboxyl 
group is activated with BOP 
and coupled to deacyl GSL 
at —70“ C at high yield 



preferentially partitions into the aqueous phase of 
a water/ether system. This solubility characteristic is 
in part due to the inability of the adamantane structure 
to interact laterally to form lamellar structures in aque¬ 
ous media. The increased water solubility of 
adamantylGSLs is surprising. AdamantylGSLs are 
only slightly more polar than their GSL counterpart, 
as monitored by TLC separation. Adamantane itself is 
one of the most hydrophobic structures. The head-to- 
tail ratio of adamantylGSLs is significantly altered, 
and the sphingosine acyl chain may interact with the 
adamantane frame to generate a more compact hydro- 
phobic domain to minimize disturbance of the organi¬ 
zation of water (Chandler 2002). Adamantane 
conjugation is more commonly used as a means to 
increase membrane permeability of hydrophilic 
species. 

Unlike the lipid-free GSL carbohydrate, 

adamantylGSLs retain the receptor binding of the 


parental species (Lingwood et al. 2006). Thus, the 
“aglycone effect” (Lingwood 1996) of the lipid moiety 
and membrane environment on membrane GSL pre¬ 
sentation for ligand binding is in part mimicked by the 
adamantan fame. For example, HIV gpl20 binds to 
adaGb3>> Gb3 monolayers, and only Gb3 monolayer 
binding can be enhanced by cholesterol (Mahfoud 
et al. 2002). 

AdamantylGSLs show high bioactivity and are 
taken up by cells (Park et al. 2009; Kamani et al. 
2011). Indeed, adamantyl glucosyl ceramide and 
adamantyl galactosyl ceramide are taken up by cells 
and subvert the endogenous metabolic pathways by 
acting as inhibitors or alternative substrates of specific 
glycosyl transferases/hydrolases within cells (Kamani 
et al. 2011), indicating traffic into both Golgi and 
lysosomes. Because of their solubility characteristics, 
any cellular adamantylGSL products generated are 
readily lost into the medium. Thus, such alternative 
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products “bleed off’ a specific branch of the target cell 
GSL anaholism to selectively deplete cells of a GSL 
subset. 

In this way, adamantylGSL analogues represent 
a more selective approach to the targeted regulation 
of GSL metaholism. Currently, the only means for 
pharmacological modulation of GSL synthesis is the 
use of one of several (Lee et al. 1999; Mellor et al. 
2004) inhibitors of glucosyl ceramide synthase. 
Though effective in many biological scenarios, this 
approach has two drawbacks: (1) it prevents the syn¬ 
thesis of all glucosyl ceramide-hased GSLs and (2) it 
does not affect the (minor) subset of GSLs 
biosynthesized from galactosyl ceramide. Since single 
GSL species can mediate cellular effects, 
a pharmacological approach to selective GSL deple¬ 
tion is still required. 

Gangliosides 

Fluorescent analogues. To generate ceramide- 
derivatized analogues of gangliosides, it is necessary 
to deacylate the ceramide without deacetylating the 
sialic acid or to protect one amino function generated 
and coupled to the other. The means for the selective 
deacylation and deacetylation of GM3 ganglioside 
have been established (Valiente et al. 2001). 
Swartzman more recently made novel NBD- 
derivatized ceramides within the gangliosides GMl 
and GM2 (Schwarzmann et al. 2005). This was 
achieved by first simultaneously deacetylation of the 
sialic acid and deacylation of the ceramide. The amine 
of the sphingosine was then selectively Fmoc protected 
and the sialic acid reacetylated with acetic anhydride. 
After removing the Fmoc protecting group with 
piperidine, 2-azido octodecanoacid was coupled to 
the sphingosine amino function. The azido moiety 
was reduced with hydrogen sulfide to generate the 
2- amino function. This was then coupled to NBD 
fluoride to generate a novel fluorescent ganglioside 
analogue in which the fluorophore was in close prox¬ 
imity to the carbohydrate head group to allow more 
efficient FRET to ganglioside binding ligands. 

The selected deacylation of the ceramide moiety of 
gangliosides has been previously reported and has 
been used to generate gangliosides of single fatty 
acid chain length (Sonnino et al. 1985). We have 
found, however, that such reactions with neutral 
GSLs result in significant deacetylation of 
A-acetylamino sugars, in addition to ceramide 


cleavage. In order to generate derivatives of the sphin¬ 
gosine selectively, we developed procedures for the 
selective reacetylation of amino sugars or the long- 
chain sphingoid base without protection (Lingwood 
et al. 2007), largely by solvent manipulation. Acetyla¬ 
tion in aqueous buffer only derivatized the carbohy¬ 
drate-based amino function. 

Photolahile analogues. Photolabile derivatives of 
GM3, GMl, and GDlb ganglioside have been made 
by fatty acid exchange (Mauri et al. 2004), whereby the 
photolabile nitrophenyl azido function is coupled to 
the fluorenyl-protected amino group of aminodecanoic 
acid after coupling to lysoganglioside. This generates 
a probe capable of “transmembrane” cross-linking to 
protein species within the cytosolic plasma membrane 
leaflet and to identify cytosolic candidates for associ¬ 
ation with ganglioside enriched lipid rafts. 

Sphingosine Oxidation 

Oxidative cleavage of the sphingosine double bond of 
GSLs by ozone to generate the glycoceramidic acid 
was the first universal method for the chemical immo¬ 
bilization of GSLs on solid supports (Laine et al. 
1974). Formation of the ozonide across the double 
bond and its subsequent acid cleavage was also used 
as an effective means to couple to protein carriers to 
generate potent anti-GSL immune response (Helling 
et al. 1994; Livingston and Ragupathi 2006). 

The procedure for the oxidative cleavage of the 
sphingosine double bond of GSLs was simplified by 
the introduction of the use of KMn 04 /NaI 04 in an inert 
solvent (Mylvaganam et al. 1999) which enabled ana¬ 
lytical cleavage of small GSL samples. This also 
allowed the identification of the position of any addi¬ 
tional double bonds within the GSL fatty acid 
(Mylvaganam et al. 1999). 

Ozonolysis has been recently used as a means to 
couple GSLs to gold nanoparticles to facilitate mass 
spectrometric structural profiling of cellular GSLs 
(Nagahori et al. 2009). 

Modification of GSL Carbohydrate 

Sialyl Modification of Gangliosides 
The readily available functional groups within sialic 
acid ensured that gangliosides were the first GSL spe¬ 
cies to be chemically modified (Handa and Nakamura 
1984). The increased susceptibility of sialic acid to 
periodate oxidation was also an early chemical modi¬ 
fication of gangliosides (Buk and High 1986). 
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Considerably more functional significance has been 
ascribed to gangliosides compared to neutral GSLs, 
and this is reflected in their chemical derivatives. 

GM3 lactone as a cancer vaccine. Changes in 
plasma membrane glycolipids are typical of neoplastic 
transformation and have long been considered tumor- 
associated antigens. GM3 ganglioside is particularly 
upregulated in forms of metastatic melanoma (Tardif 
et al. 1996). GM3, together with its lactone metabolite, 
an acid-promoted internal ester of the sialic acid car¬ 
boxyl group, is considered a potential immunogen to 
induce immune elimination of this tumor type. 

Since the GM3 lactone is unstable, a thioether ana¬ 
logue of the terminal sialyl galactose lactone has been 
generated. Even though the sialic moiety was replaced 
by a mannose derivative, lacking the glycerol and 
amino functions, comparison of the three-dimensional 
plot with that of sialyl a2-3galactose lactone showed 
close structural superimposition, and the thioether was 
subsequently shown to induce a strong anti-GM3 
immune response in mice (Arcangeli et al. 2010). 
Although a liposomal format was considered appropri¬ 
ate for immunization (Ristori et al. 2009), coupling of 
this thio sialyl galactose mimic to KLH proved to be 
the effective immunization route. The antibodies gen¬ 
erated specifically bound GM3 but not GMl ganglio¬ 
side and bound to human melanoma cell lines 
expressing GM3. 

aGalactosyl Ceramide Analogues 
The discovery that a galactosyl ceramide is selectively 
bound by CD Id on antigen-presenting cells to activate 
the natural killer T cell subset (Morita et al. 1995; 
Shimosaka 2002) to effect broad-based antitumor and 
anti-infectious responses (Bendelac et al. 2006), has 
lead to much interest in generating aGalCer analogues. 
aGalCer is a marine sponge GSL, not found in mam¬ 
mals and a search for a mammalian (Mattner 2005 
#4953) or other GSLs (Zhang et al. 2008) capable of 
similar binding is still underway. IsoGb3 (gala 1-3 gal 
pi-4Glc) and Gb3(galal-4 gal pi-4Glc) sugars in a 
ceramide linkage are effective (Yin et al. 2009). At 
least eight procedures for a GalCer synthesis have 
been published, the latest being Veerapen et al. 
(2009). A 6' acetamide derivative improved solubility 
(Liu et al. 2006). The 6 azido, 6 deoxy aGalCer ana¬ 
logue was synthesized (Jervis et al. 2011) as 
a convenient precursor for generation of species of 
increased bioactivity. However, this species was 


already generated from azidogalactose as an interme¬ 
diate in the synthesis of 6"fluorophore or biotinylated 
derivatives (Liu et al. 2006). These bulky substituents 
indicate considerable tolerance of primary hydroxyl 
substitution for CD Id binding. The hydrolase resistant 
C-glycoside analogue of aGalCer shows a marked 
increase in activity (Fujii et al. 2006). The 
thiogalactoside analogue has been made from the 
galactosyl thiol (Dere and Zhu 2008), but its biological 
activity has yet to be reported. 

Primary Pyranose Hydroxyl Oxidation 
Despite the fact that galactose oxidase has been 
described for more than 30 years as an enzymatic 
means to oxidize the primary hydroxyl group of galac- 
tose/galactosamine-terminating glycoconjugates, 
a chemical means for the selective oxidation of the 
C6 hydroxyl of galactose or other hexose-terminating 
glycoconjugates had not been reported prior to our 
recent study (Yonekawa et al. 2009). The primary 
hydroxyl group was first tritylated and residual hydroxyl 
functions benzylated. After selective removal of the 
trityl group, the 6 keto function was generated via oxi¬ 
dation with iodoxybenzoic acid. Acetylation could not 
substitute for benzyl protection since the acetyl migra¬ 
tion within the sugar ring was observed after removal of 
the trityl group. The 6 keto function provides the starting 
material for a variety of potential novel glycoconjugate 
analogues and, dependent on solubility, is applicable to 
other hexose-containing glycoconjugates. 

H2O2 Aminosugar Peeling 

A novel means of selectively removing the terminal 
N-acetyl hexosamine of GSLs was discovered after 
basic deacetylation of such species (Lingwood and 
Nutikka 1994). Treatment of the aminohexose- 
containing glycoconjugates with H 2 O 2 resulted in 
a peeling reaction in which the terminal sugar was 
completely removed with ^30% loss of the n-1, n-2 
sugars. The mechanism was proposed to involve a free 
radical reaction in which an oxidized 2 ’NH 2 function 
forms a cyclic intermediate involving the adjacent 
anomeric linkage which is cleaved with the release of 
NO. If this reaction occurred in cells, it might therefore 
provide a novel carbohydrate-based source of NO. 

This reaction was successfully used to make new 
heavy atom standards of the Forssman GSL, to allow 
the mass spectrometric quantitation of this membrane 
GSL (Gerl et al. in press). 
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Concluding Remarks 

The lipid heterogeneity and potential for differential 
carbohydrate conformation of membrane glycosphin¬ 
golipids generates not only a large potential for GSL 
function but also an opportunity for the development of 
specific chemical analogues able to mimic or inhibit 
each specific function. 

Cross-References 

► Functional Roles of Lipids in Membranes 

► Glycosphingolipids 

► Lipid Domains 

► Lipid Trafficking in Cells 

► Sphingolipids and Gangliosides 
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Synonyms 

Spin labeling 


Definition 

Spin labeling is a methodology to introduce stable 
paramagnetic centers into macromolecules (proteins, 
nucleic acids, polymers) enabling the application of 
electron paramagnetic resonance (EPR) spectroscopy 
to study structure and dynamics of these molecules. 
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Introduction 

Since the seminal work of Wayne L. Hubbell 
and coworkers (Altenbach et al. 1989) site-directed 
spin labeling (SDSL) in combination with electron 
paramagnetic resonance (EPR) spectroscopy (► EPR 
Spectroscopy: General Principles) has become an effi¬ 
cient tool for investigations of the structural and 
dynamic properties of biomolecules. This methodol¬ 
ogy can be applied to soluble molecules like proteins 
and nucleic acids (DNA and RNA), as well as to 
membrane proteins (Membrane Biophysics, Proteins). 
Remarkably, the size and complexity of the system 
under investigation is almost arbitrary. Continuous 
wave (cw) EPR (► EPR Spectroscopy: General Prin¬ 
ciples) spectroscopy of the spin-labeled molecules 
yields information about the nitroxide side chain 
mobility, solvent accessibility, the polarity of its 
immediate environment, and intra- or intermolecular 
distances between two nitroxides or a single nitroxide 
and another paramagnetic center in the system. Pulse 
EPR (► EPR Spectroscopy: General Principles) tech¬ 
niques are emerging, which stretch the range of mea¬ 
surable interspin distances up to 8 nm, thereby 
matching nicely the distance range of interest in bio¬ 
molecules or biomolecular assemblies. The analysis of 
the EPR data (Dynamics and Water Accessibility 
of Spin-Labeled Side Chains) for a series of spin- 
labeled protein variants allows modeling of the protein 
structures (► Protein Secondary Structure Prediction 
in 2012) with a spatial resolution at the level of the 
backbone fold (for reviews see, Bordignon and 
Steinhoff 2007; Eajer 2000). Substrate or light-induced 
conformational changes can be followed with sub¬ 
millisecond time resolution using time-resolved cw 
detection or characterized by trapping the molecules in 
activated or intermediate states (Altenbach et al. 2008; 
Steinhoff et al. 1994). 

This chapter gives an overview about the basic 
approaches as well as recent progress in SDSL 
methodology, focusing on spin labeling of proteins. 

Spin Labels 

In general, spin labels are characterized by a stable 
paramagnetic center, usually a nitroxide moiety carry¬ 
ing an unpaired electron, and a functional group used 
for site-specific coupling of the label to the protein of 


interest. Isotope-labeled nitroxide compounds with 
*'^N exchanged by are used for specialized appli¬ 
cations. The corresponding EPR spectra are character¬ 
ized by a two-line spectrum (nuclear spin 16) instead of 
a three-line spectrum of the 14N (nuclear spin 1). The 
lines of a spectrum are well separated from the '"'N 
lines so that both labels can be used simultaneously in 
a single experiment. Also pH-sensitive spin probes 
have been used to label the thiol group and to obtain 
information about the local pH with high spatial 
resolution (Smirnov et al. 2004). 

Eor special purposes, if the spin label has to be 
insensitive toward reducing conditions which might 
be required for investigations on redox enzymes, 
redox-insensitive paramagnetic centers like Gd^"^, 
coupled to the protein via a chelating moiety, may be 
introduced (Potapov et al. 2010). 

Site-Directed Spin Labeling of Proteins 

Spin labels are introduced at selected sites within 
proteins usually by cysteine substitution mutagenesis 
followed by modihcation of the sulfhydryl group with 
a nitroxide reagent. One of the most commonly used 
spin label is the methanethiosulfonate spin label 
(MTSSL) (Fig. la). Alternatively, nonsense suppressor 
methodology or solid-phase peptide synthesis can be 
used to introduce the desired unpaired electron into 
proteins. 


Spin Labeling of Cysteine Replacement 
Mutants 

This method utilizes the reactivity of the sulfhydryl 
group of cysteine residues engineered into the protein 
applying site-directed mutagenesis. This approach 
usually requires that the protein possesses only cyste¬ 
ine residues at the desired sites and that additional 
native cysteines can be replaced by serines or alanines. 
Among the various spin labels available, the (1-oxyl- 
2,2,5,5-tetramethylpyrroline-3-methyl) methanethio¬ 
sulfonate spin label (MTSSL) (Berliner et al. 1982) is 
most often used in SDSL studies due to its sulfhydryl 
specificity and its small molecular volume, similar to 
that of a tryptophane side chain (Fig. la). This 
spin label is bound to the protein by formation of 
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Chemistry of Spin Labeling, 
Fig. 1 Spin labeling of 
cysteines, (a) Reaction of the 
methanethiosulfonate spin 
label (MTSSL) with the 
sulfhydryl group of a cysteine 
side chain, generating the 
spin-label side chain R1. 

(b) Flexible bonds within the 
R1 side chain are indicated. 

(c) Chemical structure of the 
MTS-4-oxyl spin label. 

(d) Chemical structure of 
a Gd^"^ spin label bound to 

a protein, (e) Reaction of the 
maleimide spin label 
N-(l-Oxyl-2,2,6,6- 
tetramethyl-4-piperidinyl) 
maleimide with a cysteine side 
chain, (f) Reaction of the 
iodoacetamide spin label 
N-(l-oxyl-2,2,6,6- 
tetramethyl-4-piperidinyl) 
iodoacetamide (JAA6) with 
a cysteine side chain 



e 



b 



C 


protein backbone 



a disulfide bond to the sulfhydryl group of the cysteine. 
(The resulting spin-label side chain is commonly 
abbreviated Rl.) The link between the piperidine- 
oxyl moiety and the protein backbone renders 
the Rl side chain flexible (see Fig. lb), what mini¬ 
mizes disturbances of the native fold of the protein. 
Furthermore, these unique dynamic properties of the 
Rl side chain provide detailed structural information 
from the shape of its room temperature EPR spectrum. 
Nevertheless, the large conformational space accessi¬ 
ble for the spin-label side chain (the distance between 


the CP atom of the side chain and the NO group, where 
the unpaired electron is located, varies between 5 and 
8 A, depending on the conformation) does not allow 
relating the data obtained directly to the properties of 
the native side chain replaced by Rl, and the flexibility 
of Rl may result in large distance distributions 
between two spin labels rather than a sharp 
defined distance. This circumstance can be encoun¬ 
tered by the study of a number of different spin-labeled 
sites and eventually of conformational searching 
approaches. 





290 


Chemistry of Spin Labeling 


Besides MTSSL, a variety of other different 
nitroxide radical compounds are commercially 
available, like the l-oxyl-2,2,5,5-tetramethyl, 2,5- 
dihydro-lH-pyrrol-3-carboxylic acid (2-methanethio- 
sulfonyl-ethyl) amide (MTS-4-oxyl) spin label 
(Fig. Ic), comprising longer or sterically more 
demanding linkers. Also other functional groups than 
methanethiosulfonate for formation of the disulfide 
bond can be used. One possible route is the activation 
of the cysteine sulfhydryl group with 5,5'-dithiobis 
(2-nitrobenzoic acid) (DTNB, Ellman’s reagent) and 
subsequent formation of the disulhde linkage with 
thiol-functionalized compounds, which is used to link 
lanthanide-binding tags (Su et al. 2006) or Gd^"^ spin 
labels site specifically to proteins. 

Alternatives to the methanethiosulfonate and thiol- 
functionalized spin labels, which lead to formation of 
a disulfide bond, are maleimide-functionalized spin- 
label compounds (Griffith and McConnell 1966) 
(Fig. le). Their advantage compared to MTS spin 
labels is the robustness of the linkage toward reducing 
agents like dithiothreitol (DTT), which lead to 
immediate release of spin-label side chains bound via 
disulhde bonds. However, this method suffers from 
two major drawbacks. First, the maleimide spin 
labels are sterically much more demanding compared 
to the MTSSL and, second, depending on pH, 
maleimides may also react with primary amines. 
Whereas at pH 7, the maleimide group is much more 
reactive toward cysteines than amines, at pH values 
above 7.5 the reactivity toward primary amines 
increases. Additionally, at pH > 7.5 the maleimide 
group may undergo hydrolysis before, leading to an 
open unreactive maleamic acid form, or after sulfhy¬ 
dryl coupling. 

Another also widely used alternative which is less 
sensitive to pH are iodoacetamide spin labels (Fig. If). 
A comprehensive and detailed overview on functional 
groups used for linkage of spin labels to cysteine side 
chains including protocols for chemical synthesis can 
be found in Esmann et al. (1993). 

Spin Labeling by Peptide Synthesis 

If spin labeling via cysteines is not applicable, because 
of the presence of native cysteine side chains which are 
essential for protein structure and/or function, or the 
spin label has to be attached as close to the protein 



Chemistry of Spin Labeling, Fig. 2 Spin labeling by peptide 
synthesis, (a) TOAC amino acid spin label, (a) Chemical 
structure, (b) Three-dimensional structure of the spin label 
incorporated into an a-helix 

backbone as possible to study specifically backbone 
dynamics (Proteins, ► Protein Dynamics), spin labels 
can also be introduced by peptide synthesis. A large 
variety of spin-label building blocks (mainly a-, P-, 
and y-amino acids) for Boc- or Fmoc-based step-by- 
step peptide synthesis on a solid support (► Solid 
Phase Peptide Synthesis, SPPS) are available. The by 
far most popular one is the paramagnetic a-amino acid 
TOAC (4-amino-1 -oxyl-2,2,6,6,-tetramethyl-piperi- 
dine-4-carboxylic acid). It is characterized by only 
one degree of freedom, which is the conformation of 
the six-membered ring (Pig. 2a, b). The nitroxide moi¬ 
ety is rigidly coupled to the peptide backbone, thereby 
providing direct information about the orientation of 
secondary structure elements. It has been used to study 
the secondary structure of small peptides in liquid 
solution and has also been successfully incorporated 
into the a-melanocyte stimulating hormone without 
loss of function. However, there is evidence that 
a-helical structures might be distorted due to the pres¬ 
ence of the rigid spin-labeled amino acid. See Klare 
and Steinhoff (2009) for a review. 

Besides TOAC, several N-Boc-protected paramag¬ 
netic amino acids with side chains of different polarity 
and flexibility have been synthesized using O’Donnell 
synthesis (see Balog et al. 2004). 

In general, the chemical synthesis of proteins with 
incorporated unnatural spin-labeled amino acids relies 
on the ability to produce the constituent peptides, 
typically by SPPS. Although polypeptides consisting 
of more than 150 amino acids can be synthesized 
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nowadays through improvements in peptide chemistry, 
the incorporation of spin-labeled amino acids is still 
challenging due to the delicate nature of the nitroxide 
moiety. As for instance the nitroxide can be readily 
protonated under acidic conditions, synthesis 
conditions have to be identified which are chemically 
compatible in terms of retaining the oxidation state of 
the nitroxide group. This problem could be overcome 
by employing Fmoc-based synthesis strategies either 
on the solid phase in combination with HF cleavage or 
in solution. 


Spin Labeling by Chemical Ligation 
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Aiming at the incorporation of spin labels into large 
proteins, especially membrane proteins, SPPS has to 
be combined with recombinant techniques. The 
expressed protein ligation (FPL), also named intein- 
mediated protein ligation (IPL) technique, can be used 
to semisynthesize proteins from recombinant and 
synthetic fragments, thereby extending the size and 
complexity of the protein targets (Blaschke et al. 
2000). The underlying chemical ligation of two poly¬ 
peptide fragments requires an N-terminal cysteine on 
one and a C-terminal thioester moiety on the other 
fragment. After rearrangement through an S ^ N 
acyl shift, a native peptide bond is formed (Fig. 3). 
The reaction can be performed also in the presence of 
other unprotected cysteine residues because of 
a reversible reaction preceding an irreversible step. 
Using this methodology, a spin-labeled Ras binding 
domain (RBD) has been synthesized, showing 
a stable paramagnetic center detected by EPR (Becker 
et al. 2005). 


peptide 1 


-C. 




\ 
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Chemistry of Spin Labeling, Fig. 3 Spin labeling by chemi¬ 
cal ligation. Native and expressed protein ligation (IPL, EPL). 
The chemical ligation occurs between a C-terminal thioester 
(peptide 1) and a sulfhydryl group of an N-terminal cysteine 
(peptide 2). After rearrangement through an N ^ S acyl shift 
a peptide bond is formed 


probes and also spin labels to biomolecules (Fig. 4) 
(Kalai et al. 2009). 


Spin Labeling Using Nonsense Suppressor 
Methodology 


Spin Labeling Using Click Chemistry 

The basic concept of “click chemistry” has been intro¬ 
duced by Kolb, Finn, and Sharpless in 2001 (Kolb et al. 
2001). Its basis is the highly selective formation of 
a carbon-heteroatom bond under mild conditions with 
high yield. Its modular concept renders it a favorable 
tool for introducing labels into biomolecules. 
An example is the 1,3-dipolar cycloaddition of organic 
azides with alkynes in the presence of Cu(I) (copper- 
catalyzed azide-alkyne 1,3-dipolar cycloaddition - 
CuAAC) which can be used to attach fluorescent 


Spin-label amino acids have been incorporated in 
proteins by employing nonsense suppressor methodol¬ 
ogy, e.g., by utilizing the amber suppressor tRNA 
chemically aminoacylated with the desired spin-label 
amino acid (Mendel et al. 1995; Shafer et al. 2004). 
Although this strategy might prove generally applica¬ 
ble in the future using unique transfer RNA(tRNA)/ 
aminoacyl-tRNA-synthetase pairs, only few laborato¬ 
ries are currently equipped to apply this methodology 
successfully. One example is the incorporation of the 
genetically encoded unnatural amino acid /7-acetyl- 
L-phenylalanine (p-AcPhe) that has been subsequently 
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Chemistry of Spin Labeling, 
Fig. 4 Spin labeling by click 
chemistry, (a) 1,3-dipolar 
cycloaddition of organic 
azides, (b) Paramagnetic 
azides and acetylenes usable 
for click chemistry spin 
labeling 



biomolecule 





Chemistry of Spin Labeling, Fig. 5 Spin labeling using 
nonsense suppressor methodology. The genetically encoded 
amino acid p-acetyl-L-phenylalanine (1) is reacted with 


a hydroxylamine-functionalized spin-label compound (2). 
yielding the ketoxime-linked spin-labeled protein (3) 


reacted with a hydroxylamine reagent to result in 
a ketoxime-linked spin-label side chain (Fig. 5) 
(Fleissner et al. 2009). 

Summary 

Site-directed spin labeling of biomolecules allows site- 
specific attachment of stable paramagnetic probes for 
use in EPR spectroscopy. A variety of spin-label com¬ 
pounds with different dynamic properties are available 
to address specific questions about the system under 
investigation. Different coupling schemes, via disul¬ 
fide bonds, redox-insensitive sulfur-carbon or carbon- 
carbon bonds, can be used to attach spin labels to the 
biomolecule. Furthermore, peptide synthesis, also in 
combination with expressed protein ligation, or the 
nonsense suppressor methodology can be used to 


incorporate unnatural spin-functionalized amino 
acids into polypeptides. 

Cross-References 

► EPR Spectroscopy: General Principles 

► Peptide Synthesis on the Ribosome - Computational 
Studies 
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► Protein Secondary Structure Prediction in 2012 
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Synonyms 

Chemoreceptor arrays/clusters; Methyl-accepting che¬ 
motaxis protein (MCP) arrays/clusters 

Definition 

Chemotaxis receptor arrays are clusters of transmem¬ 
brane signaling proteins that employ a high degree of 
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cooperativity to direct bacterial motion in response to 
changes in the concentration of extracellular ligands. 
This sensing and the resulting biasing of movement 
toward favorable environments are known as 
chemotaxis. 

Basic Characteristics 

Bacteria need to sense and respond to changes in their 
environment in order to survive. In the case of external 
chemical stimuli, motile bacteria such as Escherichia 
coli sense temporal changes as they move rather than 
spatial gradients along their length (Eisenbach 2011). 
In order to respond to variations in ligand concentra¬ 
tion at sequential time points, bacterial chemosensory 
systems must (a) react to changes on a sub-second 
timescale, (b) possess short-term memory to compare 
current levels to that of 1-2 s earlier, and (c) gradually 
reset the state of the system so that it can continue to 
respond (Sourjik and Armitage 2010). 

Signal Transduction in Bacterial Chemotaxis 

Direct interaction between chemoreceptor arrays and 
flagella is prevented by the restricted localization of the 
former to the bacterial cell poles. Thus, communication 
between these two components requires a signal trans¬ 
duction pathway which, in the canonical E. coli system, 
involves clustered chemoreceptors that phosphorylate 
the cytoplasmic histidine autokinase CheA in response 
to changes in external ligand concentration. This results 
in control of the flagellar motor through the modulation 
of the intracellular response regulator CheY. 

Unlike E. coli, which has a single chemotaxis 
signaling pathway, many other bacterial species, such 
as Rhodohacter sphaeroides, have two or more 
potentially interacting pathways. In addition to using 
chemoreceptor arrays located in the inner membrane, 
R. sphaeroides can detect chemoattractants or their 
metabolites intracellularly using cytoplasmic chemore¬ 
ceptors. Evidence suggests that larger bacterial species 
(longer than 20 pm) such as spirochaetes utilize elec¬ 
trical signals for chemotaxis to overcome the short 
range of enzyme-mediated pathways (Eisenbach 2011). 

Chemoreceptor Structure and Conformational 
States 

Chemoreceptors, also known as methyl-accepting 
chemotaxis proteins (MCPs), contain four regions 


with distinct functions: a ligand-binding periplasmic 
domain, transmembrane helices, a HAMP signaling 
domain (found in histidine kinases, adenylyl cyclases, 
methyl-accepting chemotaxis proteins, and phospha¬ 
tases), and a long cytoplasmic domain. The cytoplas¬ 
mic domain consists of methylation sites for 
adaptational modifications, a flexible coiled-coil 
bundle, and a protein interaction region used 
to modulate CheA activity. Prom various bacterial 
species, seven major receptor classes have 
been distinguished based on seven residue (heptad) 
insertions or deletions that occur in the flexible 
bundle region. In order to maintain the coiled-coil 
register and packing, these heptads are symmetrically 
placed on either side of the hairpin turn on 
the cytoplasmic end of the receptor (Hazelbauer 
et al. 2008). 

E. coli chemoreceptors form trimers of receptor 
homodimers in vivo as demonstrated by cysteine- 
directed cross-linking studies. Nanodisc-segregation 
of homodimers has established that a trimer-of-dimers 
is the basic functional unit necessary for kinase 
activation even though individual homodimers are 
sufficient for ligand binding, transmembrane 
signaling, and adaptational modification (Hazelbauer 
et al. 2008). Subsequent cryo-electron tomography 
data suggests that the trimer-of-dimer arrangement is 
conserved among other bacterial species from six dif¬ 
ferent taxonomic groups (Briegel et al. 201 1). 

Cryo-electron tomography of intact E. coli 
cells revealed that trimers-of-dimers of the serine che¬ 
moreceptor, Tsr, exist in an equilibrium between 
two conformational states that differ primarily in 
the structure of their HAMP domains. Addition 
of the serine ligand shifts the equilibrium toward the 
expanded state, while methylation favors the compact 
conformation (Pig. 1). 

Adaptation 

Adaptation is a feature that resets the system back to 
pre-stimulus levels, thereby allowing the cell to 
respond to future concentration changes. In E. coli, 
the mechanism for adaptation involves methylation 
and demethylation of four glutamate residues on the 
chemoreceptors by the methyltransferase CheR and 
the methylesterase CheB, respectively. Methylation 
by the constitutively active CheR counters the effects 
of ligand binding while CheB provides negative feed¬ 
back of the signaling pathway when the receptor is in 



Chemotaxis Receptor Arrays 


295 


c 


Chemotaxis Receptor 
Arrays, Fig. 1 Chemotaxis 
receptor arrays consist of 
trimers-of-dimers arranged in 
a hexagonal lattice in the plane 
of the membrane. Trimers-of- 
dimers exist in an equilibrium 
between two conformational 
states with ligand binding and 
methylation favoring the 
expanded and compact states, 
respectively 



the unbound state. In addition, adaptation is facilitated 
by the substrate specificity of the adaptation enzymes - 
CheR preferentially methylates inactive receptors, 
while CheB preferentially demethylates active 
receptors. 

The methylation level lags behind the changes 
in receptor activity by about a second in a typical 
gradient. This provides short-term memory that is 
required for the system to compare current attractant 
levels with that of 1-2 s earlier, while also gradually 
refreshing the memory as a cell moves. This methyla¬ 
tion lag is an essential element for the temporal sensing 
used in chemotaxis (Hazelbauer et al. 2008; Sourjik 
and Armitage 2010). 

Chemoreceptor Arrays 

Chemoreceptors form ternary signaling complexes 
with CheA and the adapter protein CheW, which 
appear to stabilize trimer arrangements when present 
in the appropriate stoichiometric quantities. Extended 
arrays consist of receptors perpendicular to the 
membrane, with CheA and CheW forming a “base 
plate” parallel to the membrane along the cytoplasmic 
ends of the receptors (Zhang et al. 2007). Variations in 
the distance from the membrane to the base plate have 
been observed with cryo-electron tomography for 
different receptor classes based on the number of hep- 
tad sequences they contain in the flexible bundle of the 
cytoplasmic domain. These membrane-associated 
patches vary in size, are generally circular or ellipsoid 
in shape, and cluster primarily, but not exclusively, to 
the cell poles. Such clusters of chemotaxis proteins 
have been observed in all nine phylogenetically 
diverse bacterial species that have been studied 
(Gestwicki et al. 2000). 


Cryo-electron tomographic studies of chemorecep¬ 
tor arrays from various bacterial species have demon¬ 
strated that these complexes consist of distinct, dense 
clusters with hexagonally packed trimers-of-dimers 
(Fig. 1). One proposed arrangement is that a ring 
consisting of three CheA regulatory domains alternat¬ 
ing with three CheW proteins is surrounded by a hex¬ 
agonal lattice of six trimers of receptor dimers (Briegel 
et al. 2012). 

Cooperativity and Signal Amplification 

The higher order array organization and its allowance 
for interactions between neighboring receptors permit 
the chemotaxis system to achieve notable sophistica¬ 
tion including high sensitivity, wide dynamic range, 
extensive cooperativity, and precise adaptation with 
a limited number of components (Hazelbauer et al. 
2008). For instance, E. coli are able to sense 
and respond to small changes, on the order of a few 
molecules, in chemoattractant concentration. The 
E. coli signaling complex has the ability to amplify 
the effect of chemoattractant binding, an attribute often 
referred to as gain, as much as 36-fold when initiating 
the downstream signal. 

Trimers-of-dimers in chemoreceptor arrays display 
cooperativity by propagating changes in activity to adja¬ 
cent chemoreceptors. This cooperativity is dependant on 
the relative composition of the different chemoreceptor 
molecules in the cluster, as was measured by in vivo 
fluorescence resonance energy transfer measurements 
of sensitivity to attractant stimuli resulting from varia¬ 
tions in the composition of chemoreceptor clusters dem¬ 
onstrated a high level of cooperativity similar to that of 
allosteric proteins (Sourjik and Berg 2004). Cryo- 
electron tomography has been used to show that nutrient 
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richness of growth media can influence sensitivity as 
much as 30% by modulating later packing densities 
within a receptor array. The experimentally determined 
data regarding modifications in receptor packing was 
used to develop a testable, predictive computational 
model for the cooperativity and sensitivity of the chemo- 
taxis response (Khursigara et al. 2011). However, other 
cryo-electron tomography data have also been reported 
which suggest that there are no significant changes 
in structural packing based on growth media or 
adaptation state and that the observed behavioral 
changes may result from small, nanoscale changes 
(Briegel et al. 201 1). 

Clustering raises local concentrations of all of the 
chemotaxis pathway components and is necessary for 
maintaining the efficiency of the adaptation enzymes 
CheR and CheB. Close packing of the receptors allows 
the adaptation enzymes to modify adjacent receptors 
while tethered to their binding site on a particular 
homodimer. This gives the enzymes access to methyla- 
tion sites in a neighborhood of about six receptors, 
thereby allowing them to interact with 48 rather than 
8 methylation sites (Li and Hazelbauer 2005). These 
methylation neighborhoods contribute to signal amplifi¬ 
cation and prevent enzyme saturation, resulting in 
high precision adaptation to a wide range of stimulus 
strengths. 
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Chlorophylls and Light-Harvesting 
Complexes 
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Synonyms 

Light-harvesting complexes; Outer antenna complexes 

Definition 

Chlorophylls (Chi) are pigment molecules that absorb 
light in the visible region of the electromagnetic spec¬ 
trum and they are responsible for light absorption in all 
oxygenic photosynthetic organisms. 

Light-harvesting complexes (Lhc) are chlorophyll- 
carotenoid-protein complexes which perform the first 
step of the photo synthetic process (light harvesting). 

Introduction 

The first steps of photosynthesis are the capture 
(absorption) of sunlight energy and the transfer of the 
resulting excitation energy to the reaction center, 
where it is used to promote charge separation. In higher 
plants and green algae these processes are performed 
by several light-harvesting complexes associated with 
► Photosystem I (PSI) and ► Photosystem II (PSII). 
These complexes are integral membrane proteins coor¬ 
dinating chlorophylls and carotenoids, which are the 
pigments responsible for light absorption and excita¬ 
tion energy transfer (EET). They increase the absorp¬ 
tion cross section of the Photosystems. It is important 
to realize that the maximum irradiance in the photo- 
synthetically active region (400-700 nm) naturally 
occurring at the Earth’s surface is around 2,000 pE 
m~^ s“^ (1 Einstein = 1 mole of photons), which 
means that a chlorophyll molecule (Chi) absorbs 
a maximum of 10 photons s“* (Blankenship 2002). 
A large number of Chls (up to 230) transferring exci¬ 
tation energy to one reaction center is thus a cheap 
solution to strongly increase the yield of the photosyn¬ 
thetic process. In addition, in high light conditions 


these complexes are able to dissipate the energy 
absorbed in excess, thus protecting the system from 
photodamage. 

Chlorophylls: Chemical Structure and 
Spectroscopic Properties 

The first building block needed to assemble an efficient 
light-harvesting system is a pigment that absorbs light 
with a high extinction coefficient (absorption cross 
section) over a wide range of the solar spectrum. 
Most of the photosynthetic organisms use chlorophyll 
molecules (Grimm et al. 2006). Chlorophylls are tet- 
rapyrrole derivates that coordinate a Mg atom in the 
center of the macrocycle (dimensions 10 A x 10 A). 
Several types of Chls exist in nature; chlorophylls a-d 
and bacteriochlorophylls a-g, which differ for the sub¬ 
stituents on the macrocycle. Higher plants and green 
algae contain chlorophylls a and h, having either 
a methyl (Chi a) or a formyl (Chi h) group at the C7 
position (Fig. 1). In addition to the macrocycle, which 
determines the spectroscopic properties of the mole¬ 
cule, these chlorophylls also contain a long hydrocar¬ 
bon chain (phytol chain) which confers them 
a hydrophobic character. The absorption spectrum of 
the chlorophylls (Fig. 2) is characterized by two major 
absorption bands (originating from ti-ti* transitions) in 
the red (Q bands) and blue region (B bands) of the 
electromagnetic spectmm. Green photons are absorbed 
less efficiently, conferring to the chlorophylls their typi¬ 
cal green color. The electronic transitions corresponding 
to the visible absorption bands differ concerning the 
magnitude and orientation of their transition dipole 
moments: In all Chls the lowest energy electronic transi¬ 
tion (Qy band - 664 nm for a Chi a and 646 nm for Chi h 
in acetone) is polarized along the y-axis of the molecule 
(see Fig. 1). It should be noticed that at whatever wave¬ 
length the Chi is excited, it always relaxes (in around 
100 fs) to its first excited state and from there it transfers 
excitation energy to a neighboring pigment. 

For efficient light harvesting, pigments should not 
be significantly quenched, in order to have a long 
excited state lifetime (the time the molecule spends 
on average in the excited state after absorption of 
a photon). The longer this time is, the higher the 
probability to deliver the captured energy to the reac¬ 
tion center via the process of EET between pigments. 
The excited-state lifetime (t) of an isolated chlorophyll 
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Chlorophylls and Light-Harvesting Complexes, Fig. 1 Struc¬ 
tures of Chi a and h 



Chlorophylls and Light-Harvesting Complexes, Fig. 2 Absor¬ 
ption spectra of Chi a {blue) and Chi h {green) in acetone 


molecule depends on the rate constants (k) of the 
various de-excitation processes: 

T = (k) ^ = (kp + kic + kisc) * 


Chlorophylls and Light-Harvesting Complexes 



Chlorophylls and Light-Harvesting Complexes, Fig. 3 Model 
of the structure of LHCII: red, apoprotein; blue, Chi a; green, 
Chi 6; orange, lutein; yellow, neoxanthin; magenta, violaxanthin 


These processes are: (1) internal conversion 
(kic = 0.02 ps“*); (2) intersystem crossing, leading 
to a triplet state (kisc = 0.125 ps~^); and (3) ► fluores¬ 
cence (kp = 0.06 ps“ ), leading to an excited-state 
lifetime of 5 ns for Chi a in solution. Whereas the 
relatively long excited-state lifetime of the Chls 
makes them good molecules for light harvesting, 
their high probability to form triplets represents 
a potential danger for the photosynthetic organism. 
A chlorophyll in its triplet state can react with molec¬ 
ular oxygen (a triplet in its ground state), forming 
singlet oxygen, a ► reactive oxygen species that can 
oxidize the membranes, ultimately leading to the death 
of the organism. To deal with this problem, Nature uses 
carotenoid molecules that can quench Chi triplets and 
act as oxygen scavengers (Frank et al. 1999). Both for 
efficient EET and triplet quenching the pigments need 
to be positioned at the correct distance and geometry 
with respect to each other, which requires the presence 
of an additional building block: the protein. 

The Role of the Protein Scaffold 

In oxygenic photosynthetic organisms Chls are bound to 
proteins via the coordination of their central magnesium 
to nucleophilic residues (e.g., histidines but also water 
molecules or peptide backbones). The structure of the 
major light-harvesting complex of Photosystem II, 
LHCII (Liu et al. 2004), is shown in Eig. 3. This 
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Chlorophylls and Light-Harvesting Complexes, Fig. 4 Model 
of the structure of the higher plants PSII supercomplex showing 
the organization of the Lhcb complexes (From Caffarri et al. 
2009). (Left) A dimeric core complex (magenta) is suiTounded 
by Lhcb complexes: four LHCII trimers (red), two Lhcb4 


(orange), two LhcbS (cyan), and two Lhcb6 (yellow), Chls a 
are in blue and Chls h in green. (Right) the same model but only 
showing the organization of the pigments: blue, Chi a; green Chi 
h; in red are the Chls of the reaction center. For clarity the phytol 
chains of the Chls are omitted 


complex is composed of three transmembrane helices, it 
has a molecular weight of 25 kDa and it coordinates 14 
Chi molecules (8 Chls a and 6 Chi h) and four caroten¬ 
oids, for a total of 15 kDa of pigments. LHCII is a mem¬ 
ber of the Lhc (light-harvesting complex) family which 
is composed of a large number of gene products, which 
share a very similar structure and act as antenna 
for both Photosystems in plants and algae (it should be 
mentioned that prokaryotic organisms use different 
antenna systems) (Green et al. 2003). 

The role of the protein scaffold is to organize the 
pigments to favor efficient excitation energy transfer 
between them, avoiding (or regulating, see below) 
singlet quenching (which can occur when, e.g., the 
pigments are very close together), and to favor triplet 
quenching (by maintaining carotenoids and chloro¬ 
phylls in close proximity). In addition, the protein 
environment modifies the properties of the individual 
pigments (via interactions with the surrounding resi¬ 
dues but also with neighboring pigments), tuning their 
excited-state energy but also their ► redox potential. 
The end result is not only that the Chls a which are 
associated with photosynthetic proteins have their low¬ 
est electronic transition tuned between 660 and 


710 nm, but also that they can act as electron donors 
or acceptors in the reaction center of PSI and PSII. 

Excitation Energy Transfer in the Light-Harvesting 
Complexes 

The energy transfer dynamics in several antenna com¬ 
plexes have been studied extensively, showing that 
EET between individual pigments coordinated by 
these complexes occurs on a time scale from 100 fs 
to a few picoseconds (Novoderezhkin et al. 2005), and 
thus it is extremely efficient. The mechanism of EET in 
these complexes has not been fully clarified yet. The 
average distance between individual Chls coordinated 
to Lhc is around 10-11 A, with some of the pigments 
which are strongly coupled, leading to mixing of their 
excited states, while other pigments are only “weakly” 
coupled. This implies that to theoretically describe the 
functioning of the system both localized (weak inter¬ 
actions) and delocalized (strong interactions) excita¬ 
tions need to be taken into account. Typically, EET 
occurs via excitations that are partly delocalized over 
several pigments at the same time, also called photo¬ 
synthetic excitons (van Amerongen et al. 2000). In 
LHCII, a modified Redfield theory was successfully 





300 


Cholesterol 


applied to describe both steady-state and time-resolved 
spectroscopic properties (Novoderezhkin et al. 2005). 

In vivo Lhcs are associated with PSI and PSII core 
complexes. In Fig. 4, the organization of the antenna 
system in the PSII of higher plants is shown. To reach 
the reaction center excitation energy thus needs to be 
transferred from one complex to another. It has been 
shown that both in the case of PSI-LHCI and PSII- 
LHCII the time an excitation in average needs to reach 
the reaction center is extremely short (often tens of ps) 
allowing for very high efficiency (Renger 2008). 

Non-photochemical Quenching 

In addition to their role in light harvesting, Lhc anten¬ 
nas are involved in photoprotection. In high light con¬ 
ditions, when the harvested energy exceeds the 
photosynthetic capacity and thus cannot be used for 
charge separation, a large part of the absorbed energy 
is dissipated as heat at the level of the antenna, in 
a regulatory process known as non-photochemical 
quenching, thus protecting the reaction center against 
photodamage. Although the precise identity of the 
quencher and the quenching mechanism(s) have not 
been fully elucidated yet, it has been suggested that the 
antenna complexes are able to “switch” from a light¬ 
harvesting conformation to a quenched conformation, 
thus regulating the singlet excited-state levels in the 
photosynthetic apparatus (Muller et al. 2001). 

Cross-References 

► Fluorescence 

► Photosynthetic Electron Transport 

► Photosystem I 

► Photosystem II 

► Reactive Oxygen Species 

► Redox Potential 
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Chorismate Mutase - Computational 
Studies 
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Chemistry, University of Bristol, Bristol, UK 

Synonyms 

Chorismate pyruvate mutase 

Definition 

Chorismate mutase is an enzyme that catalyzes the 
Claisen rearrangement of chorismate to prephenate, 
an important branch point in the biosynthesis of tyro¬ 
sine and phenylalanine. 

Introduction 

Chorismate mutase (EC 5.4.99.5) catalyzes the Claisen 
rearrangement of chorismate to prephenate (Fig. 1), 
a rare example of a hiochemically catalyzed pericyclic 
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Chorismate Mutase - Computational Studies, Fig. 1 The 

reaction of chorismate to form prephenate showing (a) the 
diequatorial form of chorismate (the global minimum in 

reaction. The mechanism of the reaction is the same in 
solution and in the enzyme, and the enzymatic reaction 
does not involve any covalent interaction between the 
enzyme and the substrate, so it is also an ideal model 
system to investigate catalysis. At 25°C an activation 
free energy of A+G =15.4 kcal/mol has been found for 
Bacillus subtilis chorismate mutase (BsCM), com¬ 
pared to A^G = 24.5 kcal/mol for the uncatalyzed 
reaction in aqueous solution (Kast et al. 1996), a rate 
acceleration by the enzyme of 10°. The enzymic reac¬ 
tion is part of the shikimate pathway for the biosynthe¬ 
sis of aromatic amino acids in plants, fungi, and 
bacteria, playing an important role in balancing the 
amount of phenylalanine and tyrosine produced, com¬ 
pared to tryptophan which can also be produced from 
chorismate (Fig. 2). As a consequence, chorismate 
mutase is a potential target for the development of 
new herbicides, fungicides, and antibiotics. 

Chorismate mutases have been identified in two 
structurally distinct forms: helical AroQ proteins 
(Lee et al. 1995), such as Escherichia coli (EcCM) 
and yeast (ScCM); and AroH forms, such as BsCM 
(Chook et al. 1993), which have a pseudo-a/P-barrel 
fold (see Fig. 3). There is little sequence similarity 
between AroH (BsCM) and AroQ forms (EcCM or 
ScCM), but the similarity of the kinetic parameters 
suggests that their function is similar. The active sites 
in these different chorismate mutases have a common 
motif in that two arginine residues form bidentate 
interactions with the carboxylate groups of chorismate, 
for example, Arg7 and Arg63 in BsCM, where Arg7 is 
from the binding subunit and Arg63 is from the adja¬ 
cent subunit (BsCM is a homo-trimer). BsCM also has 
another arginine residue, Arg90, which stabilizes the 
ether oxygen of chorismate: This residue is not con¬ 
served in the other CMs but there is a positively 
charged lysine instead (see Fig. 3). From experimental 
mutagenesis studies, a positively charged hydrogen 
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solution), (b) the reactive diaxial form of chorismate, (c) the 
chair-like transition state for the pericyclic reaction, and (d) the 
product prephenate 

bond donor in the vicinity of the breaking bond is 
thought to be important to the catalytic activity of 
CMs (Kienhofer et al. 2003). 

Chorismate mutase has been the subject of many 
computational studies over the years both from the 
point of view of addressing mechanistic questions 
and testing developments in QM/MM methods 
(reviewed in Ranaghan and Mulholland 2010). There 
has been considerable debate over the origin of catal¬ 
ysis in this enzyme, centring on the dominance of 
either transition state stabilization or conformational 
effects in catalysis. Important computational studies 
that have furthered understanding of catalysis in this 
enzyme and aided the development of techniques for 
modeling enzymatic reactions are discussed below. 

Transition State Stabilization 

The first QM/MM modeling of this enzyme at the 
AM1/CHARMM22 level indicated that stabilization 
of the transition state in the enzyme is important and 
suggested that substrate destabilization or strain may 
also play a role (Lyne et al. 1995). Subsequent studies 
at the semiempirical, DFT, and ah initio levels of QM/ 
MM theory also indicated that transition state stabili¬ 
zation is important in this enzyme. Claeyssens et al. 
(2011) have compared the enzyme-catalyzed reaction 
to the uncatalyzed reaction in solution using DFT QM/ 
MM methods (B3LYP/6-31 G(d)/CHARMM27) using 
adiabatic mapping techniques to generate pathways 
along a reaction coordinate. For the chorismate to 
prephenate reaction, the reaction coordinate (rc) was 
defined as the difference in the distances of the break¬ 
ing carbon-oxygen bond and the forming carbon- 
carbon bond {rc = r/(C2-013) - <7(C4-C14), see 
Fig. 1). Multiple starting structures from lower level 
QM/MM MD simulations were used to generate the 
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Chorismate Mutase - Computational Studies, Fig. 2 Part of the shikimate pathway showing the importance of chorismate as 
a metabolic node in the biosynthesis of aromatic amino acids 


pathways giving an average potential energy barrier of 
11.3 (±1.8) kcal/mol in the enzyme and 17.4 (±1.9) 
kcal/mol in solution. This potential energy barrier for 
the reaction in solution is from a diaxial conformation 
rather than a diequatorial conformation which is the 
global minimum in solution (see Fig. 1). This diaxial/ 
diequatorial energy difference was estimated to be 
0.9-3.6 kcal/mol based on experimental data and 
other computational studies, increasing the barrier in 
solution to 18.3-21.0 kcal/mol. These potential energy 
barriers are in excellent agreement with the experi¬ 
mental enthalpy barriers of 12.7 (±0.4) kcal/mol in 
the enzyme and 20.7 (±0.4) kcal/mol in solution. By 
comparing the energy prohles in the enzyme/solution 
environment and in the gas phase, it was found that the 
transition state is stabilized significantly more than the 
reactant in the enzyme (^7 kcal/mol), whereas there is 
only a small amount of transition state stabilization in 
solution (^1 kcal/mol). The barrier height of the 


individual profiles also correlated strongly with the 
amount of transition state stabilization, indicating 
that it is the determining factor for reactivity in the 
enzyme (see Fig. 4a). There are small differences 
between the reaction pathways in the enzyme and in 
solution which arise due to the carboxylate groups of 
the substrate being closer together in the enzyme. This 
destabilizes the enzyme-bound substrate compared to 
that in solution but this distortion is constant through¬ 
out the reaction (see Fig. 4b). Arg90 was identified as 
the residue making the largest contribution to the tran¬ 
sition state stabilization with other significant contri¬ 
butions from Arg7 and Glu78. 

Conformational Effects in Catalysis 

Much of the debate about catalysis in chorismate 
mutase has focused on the importance of 
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Chorismate Mutase - 
Computational Studies, 

Fig. 3 Different forms of 
chorismate mutases (shown in 
cartoon representation) in 
complex with Bartlett’s 
transition state analogue 
(shown as spheres) (Bartlett 
and Johnson 1985): (a) AroH 
from B. suhtilis (PDB ID 
2CHT (Chook et al. 1993)); 
(b) AroQ form from E. coli 
(PDB ID lECM (Lee et al. 
1995)); (c) the active site of 
BsCM showing the transition 
state analogue interacting with 
Arg90, Arg7, and Arg63 
(in stick form); (d) the active 
site of EcCM showing Lys39, 
Arg28, and Argl 1 (in stick 
form) 



conformational effects. As the reaction proceeds 
through a chair-like transition state, it has been pro¬ 
posed that the enzyme preferentially binds the 
pseudodiaxial form of chorismate from solution (see 
Fig. 1). However, it is thought that the proportion of 
chorismate in the reactive pseudodiaxial form is rela¬ 
tively low in solution, so it has been proposed that the 
enzyme binds the more abundant conformers of 
chorismate and then converts them to a diaxial form 
in the active site. Guo et al. (2001) identified several 
conformations of chorismate from gas phase and PCM 
studies, which were then docked into the active site of 
ScCM. QM/MM MD simulations at the SCCDFTB 
level of QM/MM theory were performed on different 
chorismate-ScCM complexes. The C15-013-C2-C3 


dihedral was used to monitor structural transitions 
with reactive chair-like conformations having a value 
of 70° whereas this dihedral angle lies between —30° 
and —70° in nonreactive conformations. Nonreactive 
starting conformations of chorismate were found to 
convert to chair-like conformations within 50 ps at 
300 K, whereas chair-like structures were stable in 
the enzyme but unstable in solution even in the pres¬ 
ence of guanidinium ions. 

Bruice and co-workers (Hur and Bruice 2003) have 
controversially proposed that catalysis in chorismate 
mutase (and many other enzymes) is due to the ability 
of the enzyme to bind what they termed “near attack 
conformers” (NACs) of the substrate, and that almost 
no stabilization of the transition state is necessary for 
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Chorismate Mutase - Computational Studies, Fig. 4 (a) 

The correlation between barrier height and transition state stabi¬ 
lization showing the importance of transition state stabilization in 
catalysis by the enzyme and (b) the in absolute in vacuo energies 
of structures from conversion of chorismate to prephenate in the 



enzyme and in solution from the B3LYP/6-31 G(d)/ 

CHARMM27 pathways generated by Claeyssens et al. (2011), 
showing that the absolute energy of the substrate is higher in the 
enzyme due to the closer proximity of the carboxylate groups, but 
that this difference is constant throughout the path 


catalysis. Several different definitions of a NAC have 
been suggested, such as conformations in which the 
reacting atoms are at van der Waals distance (e.g., 
a carbon-carbon distance <3.7 A in chorismate 
mutase) or the distance criterion with additional 
terms relating to the angle of attack. The “NAC effect” 
was originally proposed based on MM MD simulations 
only, where NAC populations were counted to esti¬ 
mate the mole fraction in the enzyme and in solution. 
The calculation of potential catalytic contributions 
through unrestrained molecular dynamics simulations 
in this way has been criticized as being unreliable 
(Strajbl et al. 2003): It is known that populations 
of unfavorable (high-energy) states will be 
underestimated generally in typical unbiased molecu¬ 
lar dynamics simulations. 

In contrast to the “NAC proposal,” QM/MM model¬ 
ing of the enzyme reaction and mutagenesis studies 
indicate significant transition state stabilization in 
chorismate mutase. Jorgensen and co-workers 
(Repasky et al. 2003) have used QM/MM Monte 
Carlo/Free Energy perturbation (MC/FEP) techniques 
to study solvent effects on the reaction of chorismate in 
chorismate mutase. Their results showed that 82%, 
57%, and 1% of conformers were “NACs” in water. 


methanol, and in the gas phase, respectively. They 
found that the conversion of non-NACs to NACs pro¬ 
vides no contribution to the relative reaction rates in 
water versus methanol and that better TS stabilization 
by water was the origin of the rate enhancement in 
water compared to methanol. 

It is more generally useful to ask what the catalytic 
benefit is of forming the substrate conformation bound 
to the enzyme rather than trying to define “NACs” for 
which no unique definition exists. This question has 
been investigated using semiempirical AMI/ 
CHARMM22 QM/MM simulations involving exten¬ 
sive free energy perturbation molecular dynamics 
methods (Ranaghan and Mulholland 2004). It was 
found that forcing the conformation of chorismate in 
solution into the significantly different conformation 
found in the enzyme “costs” 3.8^.6 kcal/mol. A sim¬ 
ilar estimate of 5 kcal/mol was calculated using empir¬ 
ical valence bond methods (Strajbl et al. 2003). The 
agreement between these results, obtained with differ¬ 
ent theoretical methods, is encouraging and suggests 
that they are reliable. The difference between the 
experimental activation barriers for the reaction in 
the enzyme and in solution is 9.1 kcal/mol. The cata¬ 
lytic benefit of the binding of chorismate to the enzyme 
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in a more reactive conformation thus cannot be the sole 
factor in catalysis. Further stabilization of the transi¬ 
tion state relative to the bound substrate (or reactant) 
must, therefore, be involved. 

High-Level QM/MM Calculations and Tests of 
QM/MM Methods 

Claeyssens et al. (2006) have used high-level QM/MM 
calculations to provide activation enthalpies for the 
conversion of chorismate to prephenate to within 
chemical accuracy. Geometries from the 16 reaction 
paths generated at the B3LYP/6-31 G(d)/ 

CHARMM27 level were used to perform single point 
QM/MM energy calculations using high-level QM/ 
MM methods, including the effects of thermal and 
zero-point energy corrections (at 300 K). Activation 
enthalpies of 28.3 (±2.1), 10.2 (±1.8), 9.5 (±1.0), and 
13.1 (±1.1) kcal/mol were calculated at the HF, 
B3LYP, LMP2, and LCCSD(TO) QM/MM levels, 
respectively. The numbers in parentheses indicate the 
standard deviation of the 16 calculated paths. The L in 
these acronyms indicates that local approximations 
were used, TO is an approximate triples correction, 
and CCSD denotes coupled-cluster theory with single 
and double excitations. The effect of the size of the QM 
region was tested: An extended QM region (including 
Arg7, Arg63, Glu78, and Arg90 sidechains and three 
water molecules in addition to the substrate) only 
changed the B3LYP/MM barrier by a very small 
amount, around 0.7 kcal/mol. DFT and LMP2 methods 
overestimated the effects of electron correlation quite 
significantly, giving barriers that are 3-5 kcal/mol 
lower than the experimental value. Only the LCCSD 
(TO) results were found to be in close agreement with 
experiment. The accuracy provided by CCSD(TO) 
methods, which were previously only feasible for 
small molecules in the gas phase, is now achievable 
for enzyme systems within a QM/MM scheme. This 
advance in accuracy means that, in the best cases, 
calculations can predict enzyme mechanisms and acti¬ 
vation energies with quantitative accuracy. 

Most QM/MM studies of enzymes are carried out 
using a invariant point charge representation for the 
MM region. Illingworth et al. (2008) have tested the 
effect of MM polarization on the conversion of 
chorismate to prephenate. The results showed that MM 
polarization is a short-range effect with 75% of the 


polarization energy arising from residues within 5 A of 
the substrate and 93% of the polarization energy arising 
from residues within 10 A of the substrate. The most 
significant induced charges are observed on residues in 
the region of the carboxylate groups of chorismate, par¬ 
ticularly Arg90. This work demonstrated that the inclu¬ 
sion of MM polarization in QM/MM calculations is 
feasible, but had little effect on the calculated reaction 
barrier. 


Summary 

The reaction of chorismate to form prephenate cata¬ 
lyzed by chorismate mutase has been the subject of 
many experimental and theoretical studies over the 
past three decades. Catalysis by this enzyme is due to 
a combination of conformational effects and transition 
state stabilization. The affinity of the enzyme is higher 
for the transition state than for the reactant (or prod¬ 
uct), and so both catalytic contributions can be under¬ 
stood as arising from the complementarity of the 
enzyme to the transition state. 

Cross-References 

► QM/MM Methods 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 
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Definition 

Chromatin fibers are nucleosomal arrays bound 
to other nuclear proteins. Higher order chromatin 


structure refers to folded and oligomeric nucleosomal 
arrays/chromatin fibers in vitro, and to levels of 
condensation beyond the 10 nm fiber in chromosomes 
in vivo. 

Introduction 

In eukaryotes, genomic DNA is assembled with spe¬ 
cialized proteins into a dynamic macromolecular com¬ 
plex called chromatin (Wolffe 2000). This hierarchical 
packaging system compacts and organizes an entire 
eukaryotic genome (estimated to be ^2 m in length) 
into a single nucleus (^10 pm in diameter). In addi¬ 
tion, a growing body of evidence has directly linked 
DNA packaging by chromatin to regulation of nuclear 
processes such as gene expression, DNA replication, 
DNA repair, mitosis, and meiosis (Turner 2001; 
Laurent 2006). Our current understanding of chroma¬ 
tin structure and function originates from over 
100 years of cell biological, biochemical, and biophys¬ 
ical studies that have identihed both the DNA and 
protein components of chromatin and the mechanistic 
relationships between nuclear function and the 
dynamic nature of chromatin structure. 

Chromatin (originally called “nuclein”) was first 
identified by Friedrich Miescher in 1868 (van Holde 
1989). Preparations of nuclei from white blood cells 
were shown to contain both an acidic, phosphorus-rich 
substance now known as DNA and many basic proteins. 
Approximately half the proteins extracted from cell 
nuclei belonged to a single class, now known as the 
histones (Turner 2001). First described by Albrecht Kos- 
sel in 1884, the histones are relatively small proteins that 
have molecular weights of ^10-30 kDa, and contain 
a high percentage of basic amino acids (van Holde 
1989). In time, five distinct families of histone proteins 
were identified based on their unique amino acid com¬ 
position and sequence, charge, solubility, and size. 
These include the core histones, H2A, H2B, H3, H4 
and the linker histone, HI. However, it was not realized 
for 90 years after their discovery that the core histones 
assemble with DNA into a disk-like structure known as 
the nucleosome, which is the fundamental building 
block of chromatin (van Holde 1989; Wolffe 2000). 

The Nucleosome 

The nucleosome is the subunit of chromatin. It consists 
of 147 base pair (bp) DNA wrapped nearly twice 
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Chromatin Structure and Dynamics, Fig. 1 Crystal 
structure of the nucleosome core particle consisting of 146 bp 
base pair of DNA and histone octamer containing H2A {yellow), 
H2B {red), H3 {blue), and H4 (green) (histone amino-terminal 
domains (NTDs) are absent) 

around an octamer (see below) of core histone proteins 
(Fig. 1) (Zlatanova and Leuba 2004). Core histones 
contain two evolutionarily conserved domains: 
a structured carboxyl-terminal “histone fold” dimer¬ 
ization domain and a disordered amino-terminal 
domain (NTD) or “tail” domain. The histone fold 
domains of the histones occupy the inner space of the 
nucleosome and hind nucleosomal DNA, whereas the 
histone tail domains extend outside the nucleosome 
core and away from the DNA gyres. 

Early chemical cross-linking of the nucleosome 
core particle found that the histone octamer consists 
of an H3-H4 tetramer and two H2A-H2B dimers 
(Kornherg and Thomas 1974). In 1993, X-ray crystal¬ 
lographic analysis of the histone octamer at ^3.1 A 
resolution orientated the histone subunits into 
a complex with pseudo-dyad symmetry consisting 
of two heterodimers of H2A-H2B associated on 
either side of the central H3-H4 tetramer 
(Moudrianakis and Arents 1993). In 1997, the crystal 
structure of the nucleosome at 2.8 A gave 
a detailed view of both the path of DNA around the 
histone octamer and the domain architecture of the 
histones with respect to DNA (Luger et al. 1997) 
(Fig. 1). 


Nucleosomal Arrays, Chromatin Fibers, and 
Their Structures 

The nucleoprotein complex formed when core histone 
octamers are spaced repetitively at 20-60 bp intervals 
along a DNA molecule is called a nucleosomal array 
(Hansen 2002). When chromosomal proteins (e.g., linker 
histones - see below) are bound to nucleosomal arrays 
they are called chromatin fibers, or just chromatin. 

In vitro, nucleosomal arrays and chromatin fibers 
can adopt primary, secondary, and tertiary structures 
depending on the solution conditions (Woodcock and 
Dimitrov 2001) (Hansen 2002) (Fig. 2). Our current 
understanding of the dynamics of nucleosomal arrays 
and chromatin fibers can be directly attributed to 
the development over the last 25 years of length and 
composltionally defined model systems assembled 
from pure components (Hansen 2002). In this 
experimental approach, nucleosomal arrays are 
reconstituted from recombinant or native core histone 
proteins and an engineered DNA template consisting 
of tandem repeats of a nucleosome positioning 
sequence, such as the natural Lytechinus 5S rDNA 
(Zlatanova and Leuba 2004) or the SELEX-derived 
“601” DNA (Lowary and Widom 1998). A nucleo¬ 
some positioning sequence is a high affinity binding 
site that leads to the predictable deposition of 
positioned histone octamers onto DNA. 

Primary structure refers to a completely unfolded 
“beads-on-a-string” nucleosomal array or chromatin 
fiber. Importantly, primary structure is observed 
in vitro only under very low salt conditions (Hansen 
2002). By contrast, under physiological salt conditions 
nucleosomal arrays and chromatin fibers condense via 
a series of hierarchical nucleosome-nucleosome inter¬ 
actions into its higher order secondary and tertiary 
chromatin structures. Secondary structure results 
from local folding of the array or fiber into 30 nm 
diameter structures (see below). Tertiary structure 
refers to cooperative oligomerization of short nucleo¬ 
somal arrays and chromatin fibers to form large 
macromolecular assemblies. In vivo, 10 nm diameter 
chromatin fibers are successively folded, twisted, and 
coiled into 200-400 nm diameter interphase chroma¬ 
tids (van Holde 1989). This hierarchical packaging 
of DNA is mediated by intrinsic nucleosome- 
nucleosome and nucleosome-DNA interactions as 
well as specific non-nucleosomal proteins (discussed 
below). 
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Chromatin Structure and 
Dynamics, Fig. 2 Schematic 
representation of primary, 
secondary, and tertiary 
structural states of chromatin 
in vitro in solution 


Primary chromatin structure 



Secondary chromatin structure 




Tertiary chromatin structure 


Intrinsic Dynamics of Nucleosomal Arrays 

Nucleosomal arrays in vitro are not static entities 
but rather are dynamic protein-DNA complexes 
that intrinsically equilibrate between their primary, 
secondary, and tertiary structures in solution (Hansen 
2002) (Fig. 2). From a technical perspective, analytical 
ultracentrifugation (Hansen 2002), microscopy 
(Woodcock and Ghosh 2010), and single-molecule 
force spectrometry (Kruithof et al. 2009) have the 
resolution and the suitability for distinguishing 
between primary and secondary structures, while 
differential centrifugation is used to probe tertiary 
structure (Gordon et al. 2005; Shogren-Knaak et al. 
2006). The formation of folded and oligomeric nucle¬ 
osomal arrays requires cations (e.g., Mg^"^), indicating 
that there is an electrostatic-based component that 
contributes to the mechanism of the intrinsic conden¬ 
sation pathway. However, cation-dependent DNA 
charge neutralization is not the only mechanism at 
play. In addition, the NTDs of the core histone proteins 
are absolutely necessary for the formation of higher 
order secondary and tertiary chromatin structure under 
physiological conditions in vitro. Nucleosomal arrays 
in which the NTDs have been removed fail to condense 
in physiological salt concentrations, even in the 
presence of linker histones (Hansen 2002). Whereas 
cations screen DNA charge, the mechanism of action 


of the NTDs is less clear. It has been proposed that the 
H4 NTD is the primary mediator of folding and that it 
does so by binding to a specihc region on the surface of 
neighboring nucleosomes (Zlatanova and Leuba 2004; 
Woodcock and Ghosh 2010). In contrast, formation of 
tertiary chromatin structures involves the independent 
and additive action of all four core histone NTDs 
(Gordon et al. 2005). The mechanism of intrinsic con¬ 
densation process is clearly a complicated one, in which 
separate contributions from cations and the core histone 
NTDs work together to mediate formation of folded and 
oligomeric higher order chromatin structures. 

Of note, the core histone NTDs are the sites of exten¬ 
sive posttranslational modihcations in vivo. These mod- 
ihcations have been referred to as the “histone code,” 
and include lysine acetylation and methylation, arginine 
methylation, serine phosphorylation, and many others 
(Turner 2001; Laurent 2006). Chromatin-modifying 
enzymes (e.g., histone acetyltransferases) target specihc 
NTD residues for modihcation. Of the known modih¬ 
cations, specihc lysine acetylation (e.g., H4K16Ac) has 
been shown to shift the equilibrium to less condensed 
states (Shogren-Knaak et al. 2006; Szerlong et al. 2010). 
At this point, very few other posttranslational modih¬ 
cations have been tested for their effects on higher order 
chromatin structure. We anticipate that specihc 
modihcation on many strategic NTD residues will be 
shown to inhuence chromatin hber dynamics. 
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Chromatin Structure and 
Dynamics, Fig. 3 Schematic 
representation of three 
different models for the 30 nm 
chromatin fiber, (a) Two-start 
helical ribbon, (b) Two-start 
crossed-linker. (c) One-start 
solenoid fiber. Basic 
nucleosome arrangements are 
shown for each model 


b c 

Two-start crossed-linker One-start solenoid 




Zigzag 


Zigzag Direct contact 



Chromatin Condensation and Non- 
Nucleosomal Proteins 

In vivo nucleosomal arrays are commonly associated 
with non-nucleosomal proteins. The most prevalent 
non-nucleosomal proteins in most eukaryotic organ¬ 
isms are the linker histones, such as HI or H5. Nucle- 
osomes containing bound linker histone and linker 
DNA have been called chromatosomes (van Holde 
1989), although this term is no longer widely used. 
Linker histones are fundamentally different than the 
core histones in that they have a winged helix domain 
that binds nucleosomes and a long disordered 
C-terminal domain (CTD) that binds the short stretches 
of DNA connecting neighboring nucleosomes, 
called “linker DNA.” The association of linker his¬ 
tones at the DNA entry/exit site of the nucleosome 
protects an additional 20 bp of DNA from nuclease 
digestion and effectively stabilizes a more compacted 
state, in part by altering the orientation of linker DNA 
with respect to the nucleosome (Wolffe 2000; 
Turner 2001). When linker histones are bound to 
nucleosomal arrays in vitro they shift the equilibrium 
toward the higher order states, effectively 
stabilizing intrinsic secondary and tertiary chromatin 
structures. 

Whereas linker histones function through the intrinsic 
condensation pathway, there is a diverse class of 
non-nucleosomal proteins that can bind and condense 
chromatin fibers into specific secondary and tertiary 
structures in the absence of salt, i.e., they bypass the 
intrinsic folding pathway to facilitate the formation of 


unique secondary and tertiary chromatin stractures. 
These proteins have been termed chromatin architectural 
proteins and also include Methyl CpG Binding Protein 
2 (MeCP2), Poly-ADP Ribose Polymerase (PARP), 
Heterochromatin Protein 1 (HPl), polycomb, and Silent 
Information Regulator 3 (Sir3p) (Wolffe 2000; Turner 
2001; Woodcock and Ghosh 2010). Presumably, 
chromatin architectural proteins assemble distinct 
secondary and tertiary chromatin stmctures at localized 
regions of the genome for specific regulatory function(s). 
For example, Sir3p in budding yeast functions as 
a gene repressor through the establishment and (or) 
maintenance of transcriptionally silent chromatin at 
telomeres and the silent mating loci in budding yeast 
chromosomes (Turner 2001). 

The 30 nm Fiber: A Secondary Chromatin 
Structure of Special Interest 

The 30 nm fiber is a superhelical structure that forms 
when nucleosomes in an array or fiber interact locally 
in cis, leading to extensive folding of the fiber. 
The structure of the 30 nm fiber has fascinated and 
puzzled researchers since the 1970s. Proposed models 
for the 30 nm fiber include the two-start helical ribbon 
model, the two-start crossed-linker model, and the one- 
start solenoid model (van Holde 1989; Zlatanova and 
Leuba 2004) (Fig. 3). Two main distinctions between 
the two-start and the one-start solenoid chromatin 
fibers are their respective organization of nucleosomes 
and the path of DNA within the fiber. 
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The two-start fibers consist of nucleosomes folded 
into a zigzag arrangement. The zigzag arrangement is 
stabilized by nonsequential nucleosome-nucleosome 
stacking interactions (i ± 2). The existence of the two- 
start fiber is supported experimentally by both the 
crystal structure of a tetra-nucleosome at 9 A resolu¬ 
tion (Schalch et al. 2005) and disulfide cross-linking 
experiments from compacted reconstituted arrays 
(Dorigo et al. 2004). Both studies found nucleosome- 
nucleosome stacking interactions consistent with 
a two-start fiber. 

The one-start solenoid fiber does not adopt a zigzag 
but forms from nucleosome-nucleosome stacking 
interactions that occur between consecutive nucleo¬ 
somes (i ± 1). The model of the one-start solenoid 
fiber is supported experimentally from topological 
analysis of compacted reconstituted fibers using elec¬ 
tron-microscopy measuring fiber length, diameter, and 
nucleosome packing density. 

Importantly, heteromorphic chromatin fibers 
consisting of both the n ± 1 and n ± 2 nucleosome- 
nucleosome stacking interactions were observed 
in folded reconstituted nucleosomal arrays using EM- 
assisted nucleosome interaction capture (EMANIC) 
together with computer modeling (Grigoryev et al. 
2009). Importantly these studies also conclude 
that linker DNA length, salt concentration, and linker 
histone directly affect nucleosome stacking 
orientation. 

In conclusion, the 30 nm fiber is as an enigmatic 
chromatin structure that represents the most folded 
conformation that is observed after short-range nucleo¬ 
some-nucleosome interactions in cis in vitro and in situ. 
After nearly 40 years of investigation, there still is no 
consensus as to the structure of the 30 nm fiber, and there 
is no indication that this situation will change soon. 
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Circular Dichroism and Chirality 
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Department of Chemistry, University of Warwick, 
Coventry, UK 


Synonyms 

Differential circularly polarized light spectroscopy 


Definition 

Molecules that cannot be superposed on their mirror 
images are known as “chiral,” from the Greek word 
Xetp. Examples of chiral molecules are given in 
Fig. 1. A molecule is achiral (i.e., not chiral) if it has 
a reflection plane or inversion center, so its mirror image 
can be rotated round to look exactly like the starting 
molecule. Circular dichroism (CD) is the difference in 
absorption of left and right circularly polarized light 


(Cantor and Schimmel 1980; Craig and Thirunama- 
chanrdan 1984; Michl and Thulstrup 1986; Rodger 
and Norden 1997; Barron 2004; Norden et al. 2010). 


CD=Ae-Ar = AA (1) 


Solutions of chiral molecules give CD signals at 
wavelengths where they absorb light whereas solutions 
of achiral molecules have zero CD. Furthermore, if the 
sample contains equal proportions of the two mirror 
images, then the net CD signal will also be zero. 
However, if there is an excess of one enantiomer, 
then there will be a nonzero CD signal wherever 
the molecule absorbs light. Molecules from biological 
sources are usually enantiomerically pure, so 
we expect to see CD spectra for, e.g., proteins and 
nucleic acids. 

CD is particularly useful for studying solutions of 
chiral molecules such as those in Fig. 1. A chiral mol¬ 
ecule is one that cannot be superposed on its mirror 
images. The signal that is measured contains informa¬ 
tion about the asymmetry or handedness or helicity of 
the system. 


c 


Circular Dichroism and 
Chirality, Fig. 1 Chiral 
molecules: (a) R-fi-equatorial 
bromoadamantanone, 

(b) benzophenone, 

(c) A-[Ru(l,10- 
phenanthroline) 3 ]^’^, 

(d) B-DNA, and (e) human 
leukocyte antigen protein 
where red denotes a-helices 
and yellow denotes P-strands 
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Circular Dichroism and Chirality, Fig. 2 CD spectrum of the tris-chelate metal complex A-[Co(ethylenediamine)3]^'^. 
^£490 nm = —1-89 moP* dm^ cm^* 


Basic Characteristics 

The idea behind the definition of CD is that because 
a chiral molecule has no reflection plane, any 
rearrangement of its electrons will not have one either. 
So if the electron distribution of a molecule changes 
when light of a given wavelength is incident on it, the 
electrons may be thought of as moving in some kind of 
helix. Since for circularly polarized light the electric 
field vectors trace out helices, the interactions between 
a chiral molecule and left- and right-handed photons 
are different, and so we can measure a CD spectrum by 
changing the frequency (or wavelength) of light inci¬ 
dent on the sample as illustrated in Fig. 2. Even in an 
“obviously” helical molecule, different electronic tran¬ 
sitions involve electron redistributions of different 
handedness, and so any one molecule will have both 
positive and negative CD signals. The challenging task 
is then to relate the helical motions of the electrons to 
the arrangement of the atoms and bonds in space. 

CD is now a routine tool in many laboratories. The 
most common applications include proving that a chiral 
molecule has indeed been synthesized or resolved into 
pure enantiomers and probing the structure of biological 
macromolecules, in particular determining the secondary 
structure content of proteins. It is also useful for probing 
the binding of molecules to a chiral molecule as the CD 
spectrum is perturbed by the interaction. In particular, if 


an achiral molecule binds to a chiral molecule, an induced 
CD signal will appear in the absorption bands of the 
achiral molecule. Most CD applications involve ran¬ 
domly oriented samples. CD measurements on aniso¬ 
tropic (i.e., not randomly oriented) samples require 
meticulous care including correction for sample imper¬ 
fections as well as instrument optics (Davidsson and 
Norden 1976; Norden and Seth 1985). 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Circular Dichroism Spectroscopy: Units 

► DNA-Ligand Circular Dichroism 

► Far UV Protein Circular Dichroism 

► Near UV Protein CD 

► Oriented Circular Dichroism Spectroscopy 

► Protein Circular Dichroism Analysis 

► Protein Folding 

► Protein Structure Prediction 
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Circular Dichroism Signals: Qualitative 
Description of Origins 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

Optical activity; Spectroscopy of chiral molecules 


Definition 

One can use circular dichroism (CD) spectroscopy as 
an empirical technique without understanding its 
underlying theory. However, it is helpful to have at 
least a pictorial understanding of the origin of CD 
before attempting to interpret data. This entry contains 
a description of why the handedness of a molecule 
(also called its absolute conhguration) relates to the 
sign, or sign pattern, of its CD spectrum. 

An electronic transition occurs because either the 
electric or magnetic held (or both) of the electromag¬ 
netic radiation “pushes” the electrons to a new station¬ 
ary state. The effect of the electric held is to cause 
a linear rearrangement of the electrons called the elec¬ 
tric dipole transition moment of the transition, which 
we denote by the vector fi. The direction in which fi 
points is the same as the transition polarization—it is 
the direction in which the electrons are pushed during 
the transition. In contrast to the linear effect of the 


Circular Dichroism Signals: 

Qualitative Description of 
Origins, Fig. 1 The chiral 
dimethyl derivative of 
benzophenone 

electric held, the magnetic held induces a circular 
rearrangement of electron density. The net circulation 
of charge is the magnetic dipole transition moment, m. 
The vector pt is known as a polar vector: It has 
magnitude and direction, m describes rotation about 
an axis and is known as an axial vector. Its direction is 
such that when viewed from its tip the rotation is 
anticlockwise. The key feature that gives rise to 
a nonzero CD spectrum is that the electron redistribu¬ 
tion that happens during a transition is helical. This is 
mathematically expressed by the transition having 
/«. 7 ^ 0 and m ^ 0. 

Basic Characteristics 
Introduction 

The mathematical expression for CD is that the signal 
is proportional to the scalar product between the elec¬ 
tric dipole transition moment (/«,) and the magnetic 
dipole transition moment (m): 

CD (X fjt m (1) 

This is the Rosenfeld equation which tells us that 
the CD is positive if yu, and m are parallel and negative 
if they are antiparallel. To understand the origin of CD 
signals, we consider a simple chiral system made up of 
two identical units. If two achiral chromophores are 
arranged in some chiral fashion relative to one another, 
such as in the case of the benzophenone illustrated in 
Fig. 1 where two toluene molecules are linked 
together, they can exhibit circular dichroism. 

Exciton Coupling Between Two Monomers 

When the two transitions on different identical chro¬ 
mophores are brought into a chiral arrangement with 
respect to one another, two new “exciton” transitions 
(Fig. 2b(ii)) will appear in the spectrum: one at higher 
energy (shorter wavelength) and one at lower energy 
(longer wavelength) than the transition in the isolated 
(monomeric) chromophores (Fig. 2b(i)). The one at 
higher energy corresponds to a repulsive (destabilizing) 
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Circular Dichroism Signals: 
Qualitative Description of 
Origins, Fig. 2 (a) Diagram 
illustrating exciton coupling 
between two chirally aiTanged 
monomeric chromophores 
with angle between the two 
transition moments of ~36°. 
The net dipole is shown to the 
right, (b) The associated 
absorption and CD spectra: (/) 
Monomer absorbance; (ii) 
dimer absorbance; (Hi) dimer 
CD exciton bands, (iv) net CD 
spectrum. For fuller 
explanation see text, (c) Right- 
hand-thumb rule: if fingers are 
aligned with the electron flow, 
then the thumb shows the 
direction of the axial magnetic 
dipole moment vector 
(Figure from ref. Norden et al 
(2010) with permission) 



arrangement of the electric dipole transition moments 
relative to one another, the one at lower energy 
corresponds to an attractive (stabilizing) arrangement 
(Norden et al. 2010; Berova et al. in press). 

We may express the new transitions resulting from 
the coupled transitions as the sum or difference of the 
two monomeric electric dipole transition moments. 


/u,(dimer) 


1 

72 


[/t(monomer 1) ± /u,(monomer 2)] 


( 2 ) 


Here the factor of 1/V2 arises to ensure the total 
absorption intensity is conserved. However, as a result 
of the + and - signs in Eq. 2, the absorption intensities 
of the two transitions may differ from each other. In the 
extreme case, if the monomeric transition moments are 
parallel, the plus combination will be a transition that 
contains all the absorption intensity of the two mono¬ 
mers while the minus combination will have zero 
absorption (the two antiparallel monomeric dipoles 
exactly canceling each other). In the spectrum, this 
extreme case will look like a shift of the monomer 


absorption band to either shorter or longer wavelength 
depending on whether the dipole combination is, 
respectively, repulsive or attractive. However, if the 
monomeric transitions are oriented at any other angles 
than parallel, antiparallel, or orthogonal to each other, 
the new transitions of the dimer will both have non¬ 
vanishing (generally different) absorption intensities 
and be oriented in different directions. 

The magnetic dipole moment of the combined sys¬ 
tem is the circular component of the motion of electron 
density that occurs when one follows first one /jl and 
then the other in either the plus or minus combination. 
The direction of any component of the total magnetic 
dipole transition moment that is collinear with the total 
electric dipole transition moment /Lt(dimer) is deter¬ 
mined by the right-hand-thumb rule (align one’s right 
hand thumb along the electric dipole transition 
moment direction and curl ones fingers around that 
direction. Fig. 2): Is there any circulation of charge in 
a plane perpendicular to /u.(dimer)? 

The CD of the dimer is now given by Eq. 1 with 
/u,(dimer) being the total electric dipole transition 
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moment of the dimer and /M(dimer) being the magnetic 
moment component as determined above. Figure 2 
illustrates the process. A chiral dimer is illustrated in 
Fig. 2a. This dimer might, for example, be two identi¬ 
cal DNA bases stacked on top of each other in a right- 
handed helix (with 36° dihedral angle between), but it 
could also represent general cases of chirally twisted 
bi-aryls etc. Figure 2b is a book-keeping diagram 
showing (from the top) (i) the absorption of the isolated 
monomeric transitions, (ii) the absorption bands of the 
two new (exciton) transitions, (iii) the deduced CD 
components at higher and lower energy, and (iv) the 
expected net CD spectrum. The derivation of Fig. 2b 
follows from the step-by-step process given below 
(Norden et al. 2010): 

1. Draw the directions of the monomeric transition 
moments and put heads on the arrows (the choice 
of direction at this stage does not matter) (Fig. 2a). 

2. Put a minus charge at the tail and plus charge at the 
head of each arrow. Determine whether, with the 
geometry of the dimer system being considered, the 
arrangement is repulsive or attractive. If, as in 
Fig. 2a, there is a small angle (36°) between the 
dipoles, we conclude a repulsive arrangement is 
illustrated. Thus, we have the case of the higher 
energy (shorter wavelength, solid curve) relative 
to the monomer transition in Fig. 2b(ii). 

3. Form the sum of the monomeric transition dipole 
vectors, /t(dimer), as illustrated by the longer arrow 
to the right of Fig. 2a. (This is thus the polarization 
of the short-wavelength exciton transition.) 

4. Look for circulation of charge in plane perpendicu¬ 
lar to /t(dimer) as shown in Fig. 2c using the right- 
hand-thumb rule. Your thumb points in the opposite 
direction from the yu,(dimer). 

5. [JL and m are thus antiparallel and the CD is 
negative. 

6. Now change the direction of one of the monomer 
dipole arrows in Fig. 2a. This combination is an 
attractive one (longer wavelength), with shorter 
/u,(dimer), as shown by the smaller (dashed) absorp¬ 
tion band to the right of the monomer transition on 
the second line of Fig. 2b(ii). In this case m is 
parallel with /u,(dimer), so the CD is positive. 
While [JL is shorter than in the case of the plus 
combination, m is bigger. Thus, the two combina¬ 
tions end up with opposite signs but the same mag¬ 
nitude, and give rise to the symmetric bisignate CD 
pattern shown in Fig. 2b(iv). 
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Chiral Spectroscopy - Absorbance of circularly polar¬ 
ized light 


Circular dichroism {CD) spectroscopy is now more 
widely used to study biological molecules than 
nonbiological ones (Berova et al. 2000; Norden et al. 
2010). This is despite the fact that the molecules 
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Circular Dichroism Spectroscopy: Units 


concerned are typically DNAs and proteins which are 
large and that detailed structural analysis of their CD 
spectra is not possible (though work is in progress by 
both fitting algorithms (Wallace and Janes 2009) and 
calculations to enhance what can be deduced from the 
data). 

The CD of biological macromolecules is mostly 
measured to investigate one of four things: 

1. The conformation of the macromolecule itself 

2. Changes in the conformation of the macromolecule 

3. Interactions of the macromolecule with small mol¬ 
ecules, especially achiral ones whose induced CD is 
due solely to their interaction with the 
macromolecule 

4. The interactions between the macromolecule and 
other chiral molecules, including DNAs and 
proteins 

The data analysis that is performed for such systems 
is usually qualitative or empirical in nature. The attrac¬ 
tions of CD include the following: 

• The experiments are simple and quick to perform. 

• CD is uniquely sensitive to the asymmetry of 
a system. 

• CD experiments generally deal with the solution 
phase. 

• By changing the path length from 10 pm even up to 
10 cm, a wide dynamic range of concentrations can 
be used. 

• CD may use much lower concentrations than are 
required for other structural techniques such as 
nuclear magnetic resonance spectroscopy (NMR), 
thereby also avoiding solubility problems. 

• The intrinsic timescale of CD measurements 
(10“*^ s for absorption of a UV photon) is much 
shorter than that for NMR which uses radio waves 
(timescale 10“^ s). 

• A CD spectrum may be collected in a few minutes 
and single wavelength measurements require only 
a few milliseconds. 

• CD data can be collected on any sized molecule. It 
has neither an upper nor a lower limit. 

• CD data can be collected on both rigid and flexible 
systems. 
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The CD analogue of the Beer-Lambert Law is 

CD = AA(absorbance units) = Ai— A,- 
= Aeci 

However, most CD spectropolarimeters, although they 
measure differential absorbance (at least at differential 
absorbance signals below ^0.1) (Norden et al. 2010; 
Rodger and Norden 1997), largely for historical rea¬ 
sons, give a CD spectrum in units of ellipticity, 6, in 
millidegrees, versus X, rather than AA versus X. The 
ellipticity, 6, is obtained from the ratio of the minor and 
major axes of the ellipse traced out by the electric field 
vector of the elliptically polarized light when it 
emerges from the chiral sample onto which linearly 
polarized light was incident (Rodger and Norden 1997; 
Barron 2004; Norden et al. 2010). The conversion 
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between AA and millidegrees is (Barron 2004; Norden 
et al. 2010) 


CD = AA(absorbance units) = Ai — A,- 
47r0(degrees) ^(degrees) 

^ 180 In 10 ^ 32.982 (2) 

0(millidegrees) 

32,982 


It is best practice to express absorbance (and hence 
CD) intensity in terms of molar concentration and path 
length. The units of the extinction coefficient for absor¬ 
bance spectroscopy, s, are mol“^ cm“* dm^. The CD 
analogue, Ae, Eq. 1, has the same units. With 
biomacromolecules such as DNA and proteins, it is 
often convenient to use concentration in terms of the 
residues (nucleic acids (or sometimes base pairs) or 
amino acids) rather than whole molecules. Thus, for 
example, a 5 pM solution of the 129-residue protein 
lysozyme has a residue concentration of 645 pM. 
(Strictly speaking for CD one should perhaps work 
with the concentration of amide bonds, so use 128 
and 640 pM.) This enables the shape and intensity of 
the CD of molecules of different lengths (especially 
with DNA the length is often unknown) to be com¬ 
pared more easily. The mean residue extinction coef¬ 
ficient also has units of mol“^ cm” dm^. For 
biomolecules concentration is also commonly 
expressed as mg/mL (or equivalently mg/cm^, g/L, 
and g/dm^) making the units of e: mg”* cm^. 

Another commonly used unit for reporting CD 
intensity is molar ellipticity Mq'- 


Me 


[6\M 

loo 


1000 


(3) 


where M is molar mass in g, 6 is the ellipticity in 
degrees, C is concentration in moles dm”^, i is path 
length in cm, and [0] is the specific ellipticity: 




0 

Cgd 


(4) 


where Cg is sample concentration in g cm”^ and d is 
path length in dm (defined for the convenience of old- 
fashioned polarimeter measurements of oco where the 
path length was typically 10 cm and the concentration 
reported in g cm”^). (It should be noted that often the 


notation [0] is used instead of Mg leading to confusion 
as its dehnitions in Eqs. 4 and 5 would not be the 
same.) We then write 


Me = 3298(£f - e,) = 3298AS (5) 


which has a disconcerting factor of 10 difference from 
Eq. 2 due to the dm to cm conversion that intruded 
along the way. With proteins, an extremely 
useful empirical check on intensity magnitudes is 
to note that a 100% a-helical protein would have 
As ^ 12 mol”* cm”* dm^ and Mg ^ 40,000 
degrees”* cm^ dmol”*, approximately a factor of 
3333 between them. 
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Citrate Synthase - Computational 
Studies 

Marc Willem Van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

Citrate condensing enzyme; Condensing enzyme 

Definition 

An enzyme that catalyzes a condensation and 
a hydrolysis reaction to form citrate and CoA from 
acetyl-CoA, oxaloacetate, and water. 

Basic Characteristics 

Citrate synthase (or ^/-Citrate synthase, EC 2.3.3.1, 
previously EC 4.1.3.7) is a crucial enzyme for 
almost all living cells, due to its important role in the 
citric acid cycle (also tricarhoxylic-acid cycle or Krebs 
cycle), the final common pathway in the oxidation of 
fuel molecules. In addition, certain fungi (e.g., yeast) 
and many plants also have a second citrate synthase 
(CS) enzyme, which allows them to grow on 
acetate via the glyoxylate pathway. Apart from its 
omnipresence and importance in nature, it is also 
the archetypal enzyme for catalyzing a condensation 
reaction and was named “the condensing enzyme” 
upon its discovery. CS catalyzes two different 

reactions to convert acetyl-coenzyme A (acetyl-CoA) 
and oxaloacetate to citrate and CoA: First, it performs 
a Claisen-type condensation to form citryl-CoA, 
an intermediate that is stable in solution; then, 

it hydrolyzes citryl-CoA to form citrate and CoA. 
As a result of its importance and early discovery, 
citrate synthase has been widely studied. Early 

mechanistic studies indicated that citrate formation 
occurs by adding the CH 2 -COO“ group to the Si-face 
of oxaloacetate, and that condensation (not hydrolysis) 
is rate-limiting in the reaction catalyzed by 

the mesophilic enzyme (Eggerer 1965). The first 


crystal structures of CS revealed the conformation of 
the active site, leading to the first detailed proposal 
of the reaction mechanism (Remington 1992). 
Several high-resolution crystal structures, from 
a variety of organisms and complexed with various 
substrates and inhibitors, are currently available. 
Mutagenesis and kinetics studies of citrate synthases 
from several organisms have also been performed. 
In spite of these extensive experimental studies, 
questions about the enzyme mechanism and means of 
catalysis remain, prompting several computational 
modeling studies, in particular with hybrid 
quantum mechanical/molecular mechanical (QM/ 
MM) methods. These computational studies have pro¬ 
vided proof that elements of the originally proposed 
mechanisms, often still included in biochemistry 
textbooks (Berg et al. 2006; Voet and Voet 2011), 
are incorrect. 

Conformational Change Triggered by 
Oxaloacetate Binding 

The initial structures obtained by Remington et al. 
revealed that the enzyme, a homodimer with 
monomers consisting of one large and one small 
domain, can exist in two distinct states; an “open” 
and a “closed” form (Fig. 1) (Remington 1992). 
Daidone et al. studied the conformational change 
between these forms by computational modeling 
(Daidone et al. 2004). Principal component analysis 
of molecular dynamics simulations was used to filter 
out the main modes of collective motion (the so- 
called essential dynamics). As indicated by the crys¬ 
tal structures, closure involves a large change in 
position of the small domain, which rotates by 
about 18°, and a shift of a-helix 328-341 (residue 
numbering based on pig citrate synthase) along its 
own axis. The simulations indicated a considerable 
energetic barrier between the open and closed form 
in the absence of substrates, most likely related to the 
shift of a-helix 328-341. When oxaloacetate binds in 
the binding cleft between the large and the 
small domain, the enzyme can reach the closed 
conformation, in line with crystallographic evidence. 
A salt-bridge interaction between oxaloacetate and 
Arg329 is likely to be involved in facilitating closure 
upon oxaloacetate binding (Daidone et al. 2004). 
When the enzyme is in the closed conformation, 
acetyl-CoA can bind, which shields the active 
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Citrate Synthase - 
Computational Studies, 

Fig. 1 The open {left, PDB 
code 5CSC) and closed {right, 
PDB code ICSI) forms of 
citrate synthase, with the large 
domains in green (one 
monomer) and orange (the 
other monomer) and the small 
domains in light blue. In the 
closed form, oxaloacetate is 
shown in cyan and red spheres 



Citrate Synthase - 
Computationai Studies, 

Fig. 2 The three reaction 
steps involved in the formation 
of citrate by citrate synthase: 
proton abstraction, 
condensation, and hydrolysis. 
Modeling of the reaction with 
QM/MM methods has been 
crucial in revealing these 
mechanistic details. Residues 
important for catalysis are 
labeled 
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site completely from bulk solvent. This ordered 
sequential binding of the two substrates and the con¬ 
formational change upon oxaloacetate binding pre¬ 
vent the enzyme from hydrolyzing acetyl-CoA, 
which would destroy the high-energy thioester- 
linkage in this compound. 


The Reaction Mechanism: Formation of an Enolate 
Intermediate and Arginine Acting as an Acid 

Once the substrates (oxaloacetate and acetyl-CoA) are 
bound, the conversion to citrate can start. It has been 
established that this happens in three separate reaction 
steps (Remington 1992) (Fig. 2). The first is 
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Citrullination 


abstraction of the a-proton from acetyl-CoA by 
Asp375, creating the enolate form. It was long believed 
that this was converted into an enol (e.g., by proton 
transfer from His274), but several computer modeling 
studies (in particular using high-level QM/MM 
methods) indicated that the enolate is the true transient 
intermediate species in the enzyme reaction (e.g., 
Mulholland et al. 2000; Van der Kamp et al. 2010). 
The enolate form is stabilized by electrostatic interac¬ 
tions in the enzyme active site, which include conven¬ 
tional hydrogen bonds from His274 and a conserved 
water molecule to the enolate oxygen (Fig. 2); no “low- 
barrier hydrogen bonds” are involved (Mulholland 
et al. 2000). 

When the enolate intermediate is formed, the 
carbonyl carbon of oxaloacetate can undergo a 
nucleophilic attack. Citryl-CoA is formed as an inter¬ 
mediate, which requires proton donation. Initially, 
it was suggested that His320 donated the proton, 
but high-level QM/MM studies indicate that 
donation by Arg329 is most likely (Van der 
Kamp et al. 2008) (Fig. 2). This unusual role of an 
arginine as proton donor probably prevents overstabi¬ 
lization of the citryl-CoA intermediate and may 
trigger opening of the enzyme active site 
{vide supra), which is likely to be important for hydro¬ 
lysis. Through its involvement in catalysis, Arg329 
thereby is proposed to provide a mechanism for cou¬ 
pling condensation and hydrolysis in citrate synthase, 
and for coupling the chemical and conformational 
changes during the catalytic cycle (Van der Kamp 
et al. 2008). 

Citryl-CoA subsequently undergoes hydrolysis to 
form citrate and CoA-SH. Asp375 is implicated 
to play a role in this step, but the precise mechanism 
for this step is as yet unknown. The breaking 
of the thioester-linkage is energetically very favorable, 
which helps to drive the reaction in the 
forward direction, making it possible for the citric 
acid cycle to turn over, even with the typically 
low concentration of oxaloacetate in vivo (Voet and 
Voet 2011). 
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Overview 

Chloride ions are physiologically relevant in most organ¬ 
isms, and consequently a multitude of different chloride¬ 
transporting transmembrane proteins have evolved. The 
present entry describes a particular class of anion¬ 
transporting proteins, the family of CLC proteins. 

Research on CLC proteins began with the identifi¬ 
cation of the first member, the Cl~ channel CLC-0 of 
the electric organ of Torpedo (Miller and Richard 
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1990). The cloning of the Torpedo channel and of the 
human CLC isoforms (Jentsch et al. 1990, 2005; 
Jentsch 2008) rendered CLCs accessible to the tools 
of molecular biology and established a link between 
CLC dysfunction and genetic diseases. In mammals 
(Fig. 2), 9 CLC genes code for the plasma membrane 
localized Cl~ channels (CLC-l, CLC-2, CLC-Ka, and 
CLC-Kb) and the CP/H"^ antiporters (CLC-3, CLC-4, 
CLC-5, CLC-6, and CLC-7) localized in the mem¬ 
brane of intracellular endosomes/lysosomes (Zifarelli 
and Pusch 2007; Jentsch 2008). 

All members characterized so far have a common 
structural architecture. This similar scaffold allows 
transport mechanisms that are classically viewed as 
noncompatible. CLC channels passively conduct Cl~ 
ions according to their electrochemical driving force, 
whereas secondary active CLC antiporters are anion/ 
exchangers, in which the transport of anions across 
the membrane is coupled to the countertransport of 
protons with a defined 2:1 stoichiometry (Accardi 
and Miller 2004; Zifarelli and Pusch 2007). 



CLC Channels and Transporters, Fig. 1 Structure of CLC- 
Cm (pdb entry 30RG) viewed from the membrane plane (extra¬ 
cellular side above and cytoplasmic side below). The two subunits 
are shown in green and blue. The residue E210, the gating gluta¬ 
mate, is colored in red, chloride anions are shown in magenta 


Functional Properties of CLC Channels 
and Transporters 

Single-channel measurements of CLC-0 showed the 
presence of bursts of activity comprising three 
equidistant levels of conductance separated by long 
closed intervals. This was interpreted in terms of cur¬ 
rent mediated by a two-pore protein characterized by 
the presence of two separate but not independent 
voltage-dependent gating process called “fast” or 
protopore gating and “slow” or common gating. The 
protopore gate regulates the gating state (open vs 
closed) of the individual subunits independently one 
from the other (Miller and Richard 1990). Opening of 
the fast gate is favored by acidic pH^xt and by increas¬ 
ing [Cl~]ext, which act on a conserved glutamate 
residue called the “gating glutamate.” In fact, 
mutations of this glutamate lead to a complete loss of 
gating and to a constitutively open phenotype in all 
CLCs. In addition, mutation of this residue produces 
a loss of anion/H"^ coupling in CLC transporters 
(Zifarelli and Pusch 2007). 

The common gate, defined also as slow because in 
CLC-0 it is several times slower than the protopore 
gate, controls simultaneously the activity of both 
pores of the dimer. The molecular mechanism of 


the common gate is still unclear. Gating of other CLC 
channels has different properties but most CLC 
charmels are dependent on voltage, [Cl~] and pH. 
Moreover, several CLC transporters are characterized 
by a strong outward rectification whose origin is 
unknown (Zifarelli and Pusch 2007). 

Crystal Structures of CLC Proteins 

Structure of the Membrane Domain 

The structures of the bacterial homolog CLC-ec 1 from 
Escherichia coli and of the eukaryotic CLC-Cm from 
the alga Cyanidioschyzon merolae, have been 
determined by X-ray crystallography (Dutzler et al. 
2002; Feng et al. 2010) (Fig. 1). In contrast with 
CLC-ec 1, CLC-Cm presents a long cytoplasmic 
domain (Fig. 1). The other structural features are in 
general quite similar in spite of a modest sequence 
identity (<25%). 

The transmembrane part of the protein is formed by 
two subunits of triangular shape that assemble as 
a homodimer. Each monomer contains an independent 
transport pathway comprising three anion-binding 
sites, Sext, Seen- ^nd Sint. Anions bound at Seen are 
coordinated by conserved serine and tyrosine residues 
(S107 and Y445 in CLC-ec 1). This site is important for 
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CLC Channels and 
Transporters, Fig. 2 The 

mammalian CLC proteins. For 
each of the nine mammalian 
CLCs, the possible association 
with a beta subunit, the 
function as channel or 
transporter, and associated 
diseases or the phenotype of 
murine knockout models are 
summarized 
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anion selectivity and the proton coupling mechanism. 
The serine residue, conserved in all CP/H"^ 
exchangers, is substituted by a proline in the 
N 03 ~/H’^ antiporter AtCLC-a (Zifarelli and Pusch 
2009a). Exchanging the WT serine with a proline 
turns CLC-5 from a Cl~/ to a N 03 ~/H'^ antiporter and 
confers N 03 ~ over Cl~ selectivity to CLC-0 and 
CLC-ecl (Zifarelli and Pusch 2009b). Polyatomic 
anions, like N 03 ~ and SCN“, are transported by 
CLC-5 and CLC-ecl, but with much less coupling 
with countertransport compared to Cl~. The mech¬ 
anism of this uncoupling is still not clear but it is 
thought that N 03 ~ and SCN“ induce a certain degree 
of slippage (Zifarelli and Pusch 2007). The three 
structures available show different conformations 
of the “gating glutamate,” which occupies Seen 
(structure of CLC-Cm, Fig. 1), Sext (structure of WT 
CLC-ecl), or can be displaced from the pore toward 
the extracellular space (structure of E148Q mutant of 
CLC-ecl); this directly confirms its role in gating. 

The Proton Glutamate 

The normal transport activity of CLC exchangers 
requires protonation of the gating glutamate from 


the intracellular side. An intracellularly exposed glu¬ 
tamate residue (E203 in CLC-ecl), conserved in all 
mammalian and bacterial CLC exchangers, seems to 
mediate this function (Accardi et al. 2005) and has 
therefore been defined as the “proton glutamate.” 
Surprisingly, the corresponding residue is a threonine 
in the CLC-Cm exchanger. Substitution of the proton 
glutamate in CLC-ecl led to an uncoupling of Cl~ 
and movements, whereas in CLC-5 it produced 
a complete inhibition of transport (Jentsch 2008). 
However, the route of proton movement is still not 
clear. The lack of transport activity of CLC-5 mutants 
in which the proton glutamate (E268 in CLC-5) was 
substituted by alanine, allowed the detection of gating 
currents of CLC proteins for the first time. The gating 
currents have been ascribed to intrinsic protein confor¬ 
mational changes rather than to movement of anions in 
the protein (Smith and Lippiat 2010). 

CBS Domains 

In addition to the membrane embedded region, all 
eukaryotic and several prokaryotic CLC also have 
a cytosolic C-terminal domain marked by the presence 
of two CBS domains, CBSl and CBS2, originally 
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discovered in the enzyme cystathionine-P-synthetase 
(Bateman 1997). Tandem pairs of these domains 
constitute nucleotide-binding modules that regulate 
the activity of several unrelated protein families. 
Functional analysis and the presence of disease- 
causing mutations in this region in several CLC 
indicate that this domain is important for correct 
assembly and gating properties (Jentsch 2008). The 
structure of CLC-Cm shows the relative position of 
transmembrane core and C-terminal region (Feng et al. 
2010) (Fig. 1). It indicates that CBS2, interacting with 
several transmembrane helices, might transmit 
rearrangements of the C-terminal domain to the 
transmembrane core and influence pore and gating 
properties. This is of possible relevance for the 
functional effects of nucleotides demonstrated for 
CLC-1, CLC-5, and AtCLC-a (Jentsch 2008; Zifarelli 
and Pusch 2009a). 

Physiological Roles and Genetic Diseases 

CLC Proteins in Model Organisms 

CLC proteins are present in all phyla. Relatively, little 
is known about their function in bacteria, archaea, and 
fungi. The model bacterium E. coli possesses (at least) 
two CLC homologs which seem to contribute to acid 
resistance, which helps ingested E. coli to survive in 
the acid environment of the stomach (Zifarelli and 
Pusch 2007). The CP/JT^ exchange activity has 
been proposed to fulfill the role of a shunting leak 
Cl~ conductance. 

A single yeast homolog, the CP/IT^ antiporter 
Gefl, is indirectly involved in iron metabolism/ 
transport. Inactivation of Gefl leads to impaired 
growth in low iron. The Gefl transporter is localized 
in intracellular structures but its precise role in iron 
transport is unclear (Jentsch et al. 2005). 

Caenorhahditis elegans is a popular model organ¬ 
ism. Its genome encodes 6 CLC proteins. The role of 
one of these, CLH-3, has been investigated in detail 
(Strange 2003). This Cl~ channel has 
properties which are similar to those of the mammalian 
CLC-2. It is regulated by protein kinases and plays 
a role in the egg-laying process of the nematode 
(Strange 2003). 

Best understood are the physiological roles of the 
mammalian and some plant CLC-proteins. Research 
on mammalian CLCs (Fig. 2) was driven by 



T-tubule 



CLC Channels and Transporters, Fig. 3 Schematic drawing 
of a muscle fiber with sarcolemmal localization of CLC-1 


their involvement in various genetic diseases and the 
phenotype of murine knockout models (Jentsch et al. 
2005; Jentsch 2008). 

The Muscle Cl“ Channel CLC-1 and Myotonia 

CLC-1 is a low conductance voltage-dependent Cl~ 
channel which is almost exclusively expressed in the 
plasma membrane of skeletal muscle fibers (Lueck 
et al. 2010) (Fig. 3). In these fibers, the Cl~ equilibrium 
potential is close to the resting potential (^—80 mV) 
and the large resting Cl~ conductance stabilizes the 
membrane potential and contributes to the repolariza¬ 
tion of the action potential. Mutations in CLC-1 lead to 
dominant and recessive myotonia (also called 
Thomsen and Becker type myotonia, respectively) 
(Pusch 2002). Dominant myotonia is often caused by 
a dominant negative effect of the mutated subunit on 
the voltage-dependence of the heterodimeric channel 
complexes, reducing the open probability at 
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physiologically relevant voltages (Pusch 2002). 
The CLC-1 channel is also very sensitive to a variety 
of small organic acid molecules (e.g., 9-anthracene 
carboxylic acid, 9-AC, and p-chlorophenoxy- 
propionic acid, CPP) that bind in the pore of the 
channel and strongly interact with its gating machinery 
(Pusch et al. 2007). 

The Ubiquitous Plasma Membrane CLC-2 Channel 

CLC-2 is a plasma membrane Cl“ channel that is 
activated by strong hyperpolarization. It is rather 
ubiquitous with predominant expression in certain 
epithelial cells, neurons, and astrocytes (Jentsch et al. 
2005; Jentsch 2008). The channel is very sensitive to 
extracellular pH with inhibition both at alkaline as well 
as acidic pH (Niemeyer et al. 2009). Most insights into 
the role of CLC-2 came from knockout studies (Jentsch 
et al. 2005; Jentsch 2008). CLC-2 KO mice are blind 
due to retinal degeneration and males are infertile due 
to testicular degeneration (Jentsch et al. 2005; Jentsch 
2008). Both effects have been ascribed to a role of 
CLC-2 in ionic homeostasis in regions with narrow 
extracellular spaces. Additionally, CLC-2 knockout 
leads to vacuolation in the white matter of the brain 
and spinal cord (Jentsch 2008). 

Kidney and Inner Ear Specific Epithelial CLC-K 
Channels 

CLC-Ka and CLC-Kb are two very similar (>80% 
identity) plasma membrane localized Cl~ ion 
channels. They are mostly localized on the basolateral 
membrane in epithelial cells in the kidney and in the 
inner ear, where they contribute to the transepithelial 
NaCl reabsorption and to the production of endo- 
lymph, respectively (Jentsch et al. 2005; Jentsch 
2008) (Fig. 4). CLC-Ka and CLC-Kb obligatorily 
associate with the small transmembrane protein 
barttin, which is essential for their expression in the 
plasma membrane. No other CLC associates with 
barttin. Mutations in CLC-Kb and also mutations in 
barttin lead to Bartter’s syndrome, a salt-wasting 
nephropathy. Mutations in barttin additionally lead to 
deafness (Jentsch 2008). Knockout of CLC-Ka in 
mice leads to a loss of the ability to concentrate 
the urine and consequently to severe water loss 
(Jentsch et al. 2005; Jentsch 2008). CLC-K channels 
are potentially interesting drug targets that are modu¬ 
lated by a variety of organic and inorganic ligands 
(Pusch et al. 2007). 



CLC Channels and Transporters, Fig. 4 Basolateral locali¬ 
zation of CLC-K/barttin channels in various types of epithelial 
cells. CLC-K channels are involved in vectorial transepithelial 
chloride transport in collaboration with various other transport 
proteins depicted schematically in the figure 

The Endosomal CLC-3, CLC-4, and CLC-5 CC/H"^ 
Antiporters 

The three proteins CLC-3, CLC-4, and CLC-5 are 
quite homologous to each other, and all three are 
CK/H"^ antiporters that are expressed mostly in 
endosomes (Fig. 5). CLC-4 and CLC-5 can be reliably 
expressed in the plasma membrane in heterologous 
expression systems despite their normally intracellular 
localization. On the other hand, contradictory results 
on the functional properties of CLC-3 have been 
reported (Jentsch 2008). 

The functional properties of CLC-4 and CLC-5 are 
very similar. In heterologous expression systems, both 
are extremely outwardly rectifying, but strictly 
coupled 2 Cl~/1 H"^ antiporters. 

Many functions have been ascribed to CLC-3 
(Jentsch 2008) but most remain to be ascertained. 
From knockout studies, it emerged that CLC-3 is 
particularly important in the nervous system because 
its inactivation leads to the loss of the hippocampus 
(Jentsch 2008). 

CLC-5 is mostly expressed in the kidney 
with a predominant localization in subapical 
endosomes in the proximal tubule. The CP/H"^ 
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antiporter is essential for receptor-mediated and fluid 
phase endocytosis (Zifarelli and Pusch 2007; Jentsch 
2008). Mutations of the X-chromosomal CLC-5 lead to 
Dent’s disease, which is characterized by tubular 
proteinuria, hypercalciuria, calcium nephrolithiasis, 
nephrocalcinosis, and chronic renal failure. The 
primary defect is an impaired endocytosis in the prox¬ 
imal tubule, leading directly to low molecular weight 
proteinurea (Jentsch et al. 2005; Jentsch 2008). Most 
secondary effects are caused indirectly by the altered 
endocytosis of proteins and hormones that are involved 
in Ca^"^ homeostasis. Substitution of the WT CLC-5 
gene in mice with an uncoupling mutation (E211A) 
that retains Cl~ transport but is incapable of 
countertransport is phenotypically very similar to 
a complete loss of function (Novarino et al. 2010). 
Thus, the coupling of the Cl~ and IT’' transport is 
essential for physiological function. A possible role 
of CLC-5 (and of all intracellular mammalian CLCs) 
is to increase the intravesicular Cl~ concentration 
exploiting the H'’' gradient provided by the V-type Tf’’ 
ATPase. 

The Neuronal Late-Endosomal/Lysosomal CLC-6 
cr/H^ Antiporter 

CLC-6 is a CP/H’’’ exchanger that is almost exclu¬ 
sively found in late endosomes and/or lysosomes of 
central and peripheral neurons (Fig. 5). Its knockout in 
mice leads to mild neuronal ceroid lipofuscinosis 
(NCL) (Jentsch 2008). 

The Ubiquitous Lysosomal CLC-7 Cr/H’’’ 
Antiporter 

CLC-7 is a CP/H’’’ exchanger that is expressed rather 
ubiquitously. In most cells it is found in lysosomes 
(Fig. 5). Knockout of CLC-7 in mice and mutations 
of CLC-7 in humans lead to two major phenotypes: 
neurogeneration and osteopetrosis (Jentsch 2008). 
Osteopetrosis is characterized by dense and fragile 
bones and derives from a defective osteoclast- 
mediated bone resorption. CLC-7 localizes to the 
special lysosome-derived ruffled membrane of osteo¬ 
clasts, which attaches to bone and mediates its resorp¬ 
tion. The neurodegenerative phenotype is related 
to lysosomal dysfunction in neurons. CLC-7 
associates with a small single-span transmembrane 
protein called Ostml. Association of CLC-7 with 
Ostml is required for the stability of both proteins 
(Jentsch 2008). 



CLC-3: endosomes / late endosomes 
CLC-4: endosomes (?) 

CLC-5: endosomes 

CLC-6: late endosomes / lysosomes 

CLC-7/Ostml: lysosomes 

J 

CLC Channels and Transporters, Fig. 5 Localization of the 
antiporters CLC-3, -4, -5, -6, and -7 in endosomal/lyso- 
somal membranes 


Plant CLCs 

The genome of the model plant Arahidopsis thaliana 
encodes 7 CLC proteins named AtClC-a to AtClC-g, 
most of which seem to be expressed in intracellular 
membranes. AtCLC-a and AtCLC-b are vacuolar 
NOs^/H’’' antiporters that are involved in vacuolar 
nitrate storage (Zifarelli and Pusch 2009a). The differ¬ 
ence in anion preference of these antiporters compared 
to the mammalian and bacterial CP/H’’' antiporters 
could be pinpointed to a single residue difference in 
the selectivity Alter (proline in AtCLCa vs serine in 
CLC-5 and ec-CLC-1) (Zifarelli and Pusch 2009a). 
AtCLCc, bearing a serine at the corresponding 
position, is involved in vacuolar CP transport during 
stomatal opening (dossier et al. 2010). 

Summary 

The physiological functions of CLC proteins fall into 
two classes. Plasma membrane CLCs are passive CP 
channels involved in the regulation of the membrane 
potential and CP transport. The intracellular anion/H’’' 
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antiporters appear to be important in regulation of 
intralumenal anion concentration. 

Cross-References 

► Analysis of Macroscopic Currents 

► Automated Patch Clamp: Advantages and 
Limitations 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 

► Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET 

► Kinetics: Overview 

► Membrane Proteins: Structure and Organization 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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Synonyms 

Crystallographic refinement; Low-resolution diffrac¬ 
tion data 
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Definition 

The program Crystallography and NMR System 
(CNS) was the first advanced software system in struc¬ 
tural biology that made use of a modular, multilevel 
approach to computing, utilizing a high-level symbolic 
structure determination language (Brunger et al. 1998). 
Many algorithms were moved from the source code into 
a symbolic structure determination language. The high- 
level CNS computing language allows definition of sym¬ 
bolic target functions, data structures, procedures, and 
modules. The compiled CNS program, written in For- 
tran77, acts as an interpreter for the high-level CNS 
language and includes hard-wired functions for the effi¬ 
cient processing of computing-intensive tasks. Methods 
and algorithms are therefore more clearly defined, and 
easier to adapt to new and challenging problems. The 
result is a multilevel system that provides maximum 
flexibility to the user. The CNS language provides 
a common framework for nearly all of the computational 
procedures required for structure determination. 
A comprehensive set of crystallographic procedures for 
phasing, density modification, and refinement has been 
implemented in this language. The CNS language per¬ 
mits the design and execution of nearly any numerical 
task in X-ray crystallographic structure determination 
using a minimal set of “hard-wired” functions and rou¬ 
tines. Task files consist of CNS language statements and 
module invocations. The task files, which can also be 
accessed through an HTML graphical interface, are avail¬ 
able to carry out these procedures. Recent developments 
include improved treatment of bulk solvent (Brunger 
2007), the DEN method to obtain more accurate crystal¬ 
lographic models at low resolution (Schroder et al. 2010), 
and docking calculations to obtain models of macromo¬ 
lecular complexes based on single molecule FRET- 
derived distances (Choi et al. 2010; Vrljic et al. 2010). 

Cross-References 

► Macromolecular Crystallography: Overview 

► NMR 
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CoAT Mitochondrial CoA Transporter 

► Mitochondrial Transport Protein Eamily 


Codes of Practice 

► Absorbance Spectroscopy: Quality Assurance 


Coherent Anti-Stokes Raman Scattering 
Spectroscopy 

► Nonlinear Raman Spectroscopy: Coherent Anti- 
Stokes Raman Scattering (CARS) 


Coherent Diffraction Imaging (CDI) or 
Coherent Diffractive Imaging 

► Macromolecular Crystallography: Future of FELs 
for Structural Biology 


Coiled Coils 

► a-Helical Coiled Coils 


Colloidal & Macromolecular Dynamics 

► Dynamics of Colloids and Macromolecules 
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Combined Assignment and Dynamics Algorithm for NMR Applications 


Combined Assignment and Dynamics 
Algorithm for NMR Applications 

► CYANA 


Comparative Protein Structure Modeling 

► Homology Modeling of Protein Structures 


Complement Factor H and Its Ligands: 

A Multidisciplinary Approach to 
Interactions and Modeling 

Stephen J. Perkins, Ruodan Nan, Keying Li and 
Sanaullah Khan 

Department of Structural and Molecular Biology, 
University College London, London, UK 

Synonyms 

Protein-protein interactions; Protein-carbohydrate 
interactions 

Definition 

A combination of X-ray and neutron scattering, ana¬ 
lytical ultracentrifugation, constrained modeling and 
surface plasmon resonance methods is described in 
order to elucidate the solution structure of complement 
Factor H and its interactions with its major ligands. 

Introduction: Multivalent Weak Interactions 
of Factor H 

In the complement system of innate immunity, C3 is 
the central complement protein in plasma, being nor¬ 
mally present at 1.2 mg/ml (6.4 pM). When this is 
converted to C3b by one of three activation pathways, 
the small C3a domain is cleaved to leave the major 
active C3b protein. C3b is able to attack pathogenic 
bacteria by triggering the pathway that leads ultimately 


to the formation of the membrane attack complex that 
lyses these bacteria. C3 (189 kDa) is comprised of 
a compact arrangement of 13 domains, including 
a thioester domain (TED), eight macroglobulin (MG) 
domains, and four others. It is essential to control 
the excessive generation of C3b in order to 
prevent undesired human host cell damage. 
Factor H (154 kDa) is the main complement regulatory 
protein in plasma that controls C3b degradation 
in association with the protease Factor I. The 
Factor H concentration in plasma ranges between 
0.235 and 0.81 mg/mL (1.5-5.3 pM). Factor H is com¬ 
prised of 20 short complement regulator (SCR) 
domains, formed in an extended arrangement that is 
partially folded back upon itself, each domain being 
about 61 residues in size, and joined by linkers of 
lengths 3-8 residues. 

The three functionally important regions in Factor 
H are centered on SCR-1/4, SCR-6/8, and SCR-19/20 
(Fig. 1). These bind in a multivalent manner with weak 
pM dissociation constant Kq values to C3b and also to 
other ligands including Factor H itself, heparin, 
C-reactive protein, and zinc (Perkins et al. 2010a). 
All three regions bind to C3b, the second and third 
regions bind to heparin and C-reactive protein, and 
have the self-association sites for Factor H, and the 
central region binds to zinc (Nan et al. 201 1). Bacterial 
evasion strategies involve proteins that bind to Factor 
H, including the pathogen-expressed Protein M and 
Factor H-binding protein (fHbp: Fig. 1). Because the 
Ko values are pM in value (Table 1), this means that 
full complex formation in vitro will not be achieved at 
physiological concentrations and buffers. In the com¬ 
plex environment of the hundreds of proteins found in 
plasma, the major functional interactions of Factor 
H become a balance between unbound and bound pro¬ 
tein. Too little or too much C3b regulation leads to 
disease. Polymorphisms in Factor H have been geneti¬ 
cally associated with age-related macular degeneration 
(AMD), the most common cause of blindness in the 
Western world. C-terminal mutations in Factor H are 
associated with atypical hemolytic uremic syndrome 
(aHUS) which causes renal failure in young children. 

Introduction: Multidisciplinary Approaches 

A comprehensive molecular understanding of Factor 
H-ligand interactions is presently incomplete. 
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Complement Factor 
H and Its Ligands: 

A Multidisciplinary 
Approach to Interactions 
and Modeling, Fig. 1 The 
ligand binding sites in the 
20-SCR domain structure of 
Factor FI. (a) Scattering 
modeling best fit structure of 
Factor FI from 2009. The SCR 
domains are numbered from 1 
to 20. Note, however, that no 
specific proximity 
arrangement between 
nonadjacent SCR domains in 
Factor FI is implied by this 
view, only that the SCR 
domains are generally folded 
back. The eight 
oligosaccharide chains are 
shown in green outlines, (b) 
The SCR domains that bind to 
bacterial M-protein and fHbp, 
to heparin, sucrose octasulfate 
and sialic acid, to zinc, to 
C-reactive protein, to Factor 
H itself and to C3b are 
indicated by lines labeled to 
identify the ligands 
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Complement Factor H and Its Ligands: A Multidisciplinary Approach to Interactions and Modeling, Table 1 Summary of the 
major factor H interactions and their dissociation constant values 


Factor H ligand 

Major outcome 

Methods” 

Kd value 

Self-association 

Dimer of SCR-6/8 

AUC 

40 pM 


Dimer of SCR-16/20 

AUC 

16 pM 


Oligomers of Factor H 

AUC 

28 pM 

Zinc 

Large Factor H-zinc oligomers 

AUC; SAXS 

10 pM 

C3 and C3u 

Self-association 

AUC 

Not available 

C3d 

Self-association (dimer)'’ 

AUC 

360 pM (50 mM NaCl) 


Self-association (trimer assumed) 


0.07 pM^ 

C-reactive protein 

Factor H-CRP 

SPR 

4.2 pM 


SCR-6/8 (Tyr402)-CRP 

SPR 

3.9 pM 


SCR-6/8 (His402)-CRP 

SPR 

11.9 pM 


SCR-16/20-CRP 

SPR 

15.3 pM 

Heparin 



Not available 


’^AUC analytical ultracentrifugation, SAX'S small angle X-ray scattering, SPR surface plasmon resonance 

'’If C3d is analyzed in terms of only a monomer-dimer equilibrium, the apparent value is 20 pM in 50 mM NaCl. This C3d self¬ 
association is removed in 137 mM NaCl 
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Biophysical analyses are complicated by the follow¬ 
ing: (1) the multivalency of binding, meaning that 1:1 
and 2:1 complexes may be formed (Fig. 1); (2) different 
ligands can bind simultaneously to Factor H (Fig. 1); 
and (3) incomplete complex formation. The use of 
a single biophysical approach based on structural 
biology or molecular interaction technology will 
not provide a complete explanation of the 
Factor H interactions with its ligands, given that each 
of these approaches has strengths and limitations. 
A multidisciplinary strategy offers a more complete 
answer to these issues, and the combination of four 
methods is summarized in this entry: 

1. Modern analytical ultracentrifugation will 
identify the presence of single or multimeric 
Factor H species and their associations in a wide 
concentration range in solution from sedimentation 
coefficient distribution analyses c(s), and their Kq 
values. The sedimentation coefficients of these spe¬ 
cies provide shape information (Cole et al. 2008). 

2. Scattering offers three-dimensional views of the 
molecular Factor FI solution structure and the com¬ 
plexes that form with its ligands. The advent of 
more sensitive X-ray detectors and much improved 
radiation damage control means that Factor H can 
now be studied over a comparatively wide concen¬ 
tration range (Perkins et al. 2008). 

3. Constrained modeling offers a better understanding 
of the Factor H scattering curves and sedimentation 
coefficients in terms of molecular structures derived 
from homology (“comparative”) models or experi¬ 
mentally derived crystal and NMR structures 
(Fig. 2), compared to the commonly used “ab initio” 
methods, and Kq values can be obtained (Perkins 
et al. 2008, 2011). Homology modeling provides 
insights into the genetic polymorphisms and muta¬ 
tions in Factor H (Saunders et al. 2007). 

4. Surface plasmon resonance studies the molecular 
interaction parameters (on-rates and off-rates; Kq 
values) between an immobilized protein and its 
interaction partner in solution (Tanious et al. 
2008). The current generation of surface plasmon 
resonance instruments is able to address lower Fac¬ 
tor H and ligand concentrations with greater instru¬ 
mental stability. 

For these experimental methods, it is essential that 
the protein and ligand concentrations span the Ko values 
of their interactions by at least one order of magnitude 
above and below the Kd value. Because the Kq value 
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Complement Factor H and Its Ligands: A Multidisciplinary 
Approach to Interactions and Modeling, Fig. 2 Flow 
diagram of the constrained modeling algorithm used for 
multidomain proteins. The atomic coordinates of the SCR-6/8 
domains of Factor FI are converted into spheres using a 3-D grid 
of cubes. A Debye sphere (gray) is assigned to each cube if 
a sufficient number of atoms is present. The resulting unhydrated 
model with 160 spheres is convened to a hydrated model by 
adding 56 spheres to its surface (black) The neutron scattering 
curve measured in heavy water buffer is calculated from the 
unhydrated model, while the X-ray scattering curve (as shown) 
and the sedimentation coefficient are calculated from the 
hydrated model. The automation of this modeling cycle leads 
to the scattering parameters for about 10,000 models. The filter¬ 
ing and elimination of poor-fit models identifies the best-fit 
SCR-6/8 structures (Taken from Perkins et al. 2008 with 
permission) 


corresponds to 50% complex formation, sufficient data 
on the unbound species and their complexes in the 
mixture is needed to characterize these equilibria fully. 

Other review articles present a basic outline of our 
multidisciplinary approach (Perkins et al. 2009), 
detailed analytical ultracentrifugation methods (Cole 
et al. 2008), detailed X-ray and neutron scattering and 
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Complement Factor H and Its Ligands: A Multidisciplinary 
Approach to Interactions and Modeling, Fig. 3 Ultracentri¬ 
fugation c(s) distribution analyses of full-length Factor H and 
three recombinant fragments. The c(s) peaks correspond to sta¬ 
ble species. The monomer is denoted by 1, the dimer hy 2, and so 
on. In (a-c), it can he seen that SCR-1/5 forms only monomer, 
SCR-6/8 forms monomer and a small amount of dimer, and 
SCR-16/20 forms almost equal amounts of monomer and 


dimer. For full-length Factor Ft, boundary scans measured at 
5.92 mg/mL and 0.17 mg/mL are shown in red, where the 
perturbations in (d) show the existence of more rapidly 
sedimenting species. The boundary fits are shown in black. In 
(f), the five c(s) analyses for Factor Ft between 0.17 mg/mL 
(bottom) and 5.92 mg/mL (top) revealed the monomer and 
a series of oligomers labeled from 2 to 7 (Taken from Perkins 
et al. 2011 with permission) 


constrained scattering modeling (Perkins et al. 2008), 
the combination of ultracentrifugation and scattering 
(Perkins et al. 2011), and the applications of these 
methods to Factor H (Perkins et al. 2010a, b). 

Self-association of Factor H 

The self-association of Factor H was first 
biophysically revealed in our 1991 study where X-ray 
and neutron scattering at comparatively high 
Factor H concentrations of 2 mg/mL (13 pM) showed 
that the Factor H molecular weight was almost double 
its sequence-expected value. Electron microscopy and 
size-exclusion gel filtration in 1992 by others. 


however, suggested monomers. Our 2001 study at 
lower Factor H concentrations then showed that Factor 
H was largely monomeric (Perkins et al. 2010a). 

The existence of Factor H dimers was clarified 
when the SCR-6/8 and SCR-16/20 recombinant 
fragments showed monomer and dimers by ultracen¬ 
trifugation c(s) analyses (Fig. 3). Sedimentation equi¬ 
librium showed that the dissociation constant Kq 
values were 40 pM and 16 pM respectively 
(Table 1). This prompted a reinvestigation of hetero¬ 
zygous Factor H in 2008. The “daisy-chaining” of 
these separate dimer sites resulted in dimers and 
smaller amounts of oligomers up to heptamers that 
were observed as well-resolved peaks in size- 
distribution analyses. Sedimentation equilibrium 
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Complement Factor H and Its Ligands: A Multidisciplinary 
Approach to Interactions and Modeling, Fig. 4 The 

constrained scattering modeling of Factor H between 2001 and 
2010. The experimental X-ray scattering curves (red) in 2001, 

2008, and 2010 are compared with each other. The Q ranges used 
to determine the Rg, Rxs-i, and Rxs -2 values are also indicated. 
The Factor Ft concentration was 3.6 mg/mL in 2001 and 0.4 mg/ 
mL in 2009 and 2010. The black solid lines correspond to the 
fitted curve from each modeling analysis. The agreement 
between the experimental and calculated curves as Q increases 
is improved from 2001 to 2010. The inset shows the 2009 model, 
also shown in Figs. 1 and 6 

showed a reversible monomer-dimer equilibrium with 
a Ko of 28 pM but only at low concentration (Table 1). 
At typical plasma Factor H concentrations, about 10- 
15% of Factor H will be dimeric if no other factors 
influence this equilibrium. 

Other experiments support self-association. By 
scattering, dilution series showed a concentration 
dependence of the radius of gyration Rg value. The 
constrained scattering modeling of Factor H to yield 
molecular structures was performed at 0.4 mg/mL in 

2009, where the curve at 0.4 mg/mL does not differ 
signihcantly from that expected at zero concentration. 
Constrained modeling showed an extended but par¬ 
tially folded-back solution arrangement for the 20 
SCR domains in Factor H, and the quality of the 
curve fits has improved in the past decade (Fig. 4). 
Variations of the NaCl concentration and pH in scat¬ 
tering and ultracentrifugation experiments showed that 
Factor H oligomers prevailed in a wide range of buffer 
conditions. Other evidence using surface plasmon res¬ 
onance and mass spectrometry experiments also indi¬ 
cate that Factor H self-associates (Nan 2010). 

The biomedical significance of fhe constrained 
molecular model of Factor H from scattering was to 
provide many of the first structural explanations for the 


effect of genetic polymorphisms and mutations in Fac¬ 
tor H (Saunders et al. 2007). Interactive views of these 
models in terms of movable molecular displays that 
display each mutation are available on our website 
http://www.fh-hus.org. For example, the homology 
model for SCR-7 showed that the AMD-risk polymor¬ 
phism Tyr402His is located at the exposed surface of 
SCR-7, while another AMD-risk polymorphism 
Val62Ile is also surface-exposed in SCR-l. These 
explanations have been supported by subsequent crys¬ 
tal and NMR structures. 


Factor H Oligomers with Zinc 

The presence of transition metals causes Factor H to 
form oligomers. Experimentally, this was first 
observed using zinc titrations in X-ray scattering in 
1991. The results explained the inhibition of Factor 
H regulatory activity in C3b breakdown in the presence 
of zinc. A more extensive follow-up study employed 
both ultracentrifugation c(s) and scattering analyses 
(Perkins et al. 2010a). This looked at the effect of 
zinc and copper on Factor H, as well as other transition 
metals. The largest effect was seen with zinc, followed 
by copper, and followed with other transition metals. 
No well-defined oligomer peaks were observed in c(s) 
plofs, unlike for native Factor H (Fig. 3). Instead 
a broad distribution of polydisperse supra-large oligo¬ 
mers was seen with zinc. The zinc concentration 
dependence of this oligomer formation indicated an 
approximate Factor H-zinc Kq value of about 10 pM 
(Table 1). Given that millimolar amounts of zinc are 
found in drusen, it is possible that Factor H-zinc olig¬ 
omers may be biologically relevant to drusen deposits 
in the retina if sufficient bioavailable zinc is present. 
Recent work with Factor H localised its major zinc 
binding site in the SCR-6/8 region (Fig. 1; Nan et al. 
2011 ). 

Interaction Between Factor H and the 
C3 Proteins 

During C3 activation, the thioester group in the TED 
domain is slowly hydrolyzed to C3u. The presence of 
C3u mediates the rapid conversion of more C3 to 
functionally active C3b through a complex formed 
between C3u and factor B. C3u shows functional 
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similarities to C3b. Biophysical studies of the Factor 
H-C3u interaction required knowledge of the solution 
properties and structures of C3 and C3u. The C3 and 
C3b crystal structures were all obtained using low salt 
buffer. It was not known whether the crystal structures 
correspond to similar domain arrangements and oligo¬ 
meric state in physiological buffer conditions. 

Ultracentrifugation revealed several c(s) peaks for 
C3 and C3u that showed that two different dimeriza¬ 
tion events were present in addition to the predominant 
monomer. A comparatively fast monomer-dimer 
equilibrium was observed in 50 mM NaCl as a 
concentration-dependent single major peak at 
7.6-8.4 S, which is in fact a reaction boundary. A sec¬ 
ond invariant minor peak at 11.2-11.5 S was attributed 
to a slowly exchanging dimer. On extrapolation to zero 
concentration, the different s° 2 o,w values of 7.85 S for 
C3 and 7.44 S for C3u indicated a conformational 
change between the C3 and C3u monomer. 

X-ray scattering also indicated concentration- 
dependent Rq values and two different monomer con¬ 
formations for C3 and C3u. Solution conditions known 
from the ultracentrifuge data that resulted in only 
monomers were used to obtain scattering data for use 
in constrained modeling. The C3 crystal structure fitted 
the C3 scattering curve well. For C3u, constrained 
modeling was needed in order to optimize what turned 
out to be an extended arrangement of the TED domain 
relative to the rest of the C3u structure. The importance 
of this result is to show that the conformational vari¬ 
ability of the TED domain would enable activated C3 
to bind better to antigenic targets. 

The interaction between Eactor H and C3u is cur¬ 
rently under investigation, while Eactor H does not 
interact with C3. At present, only the interaction 
between C3d (which corresponds to the TED domain: 
35 kDa) and Eactor H has been studied. The c(s) 
analyses of C3d in low salt revealed three species 
that were assigned to monomer, dimer, and trimer 
forms, while only monomer was seen in physiological 
salt. Given that each of Eactor H and C3d formed 
multimers, all three methods of ultracentrifugation, 
scattering, and surface plasmon resonance in two 
NaCl buffers were used to unravel the major features 
of this interaction. Thus multiple peaks in the c(s) plots 
of protein mixtures showed that multimeric complexes 
were formed when C3d was mixed with either the 
SCR-16/20 fragment or intact Eactor H. The simplest 
explanation of this was based on the postulate of 



Sedimentation coefficient (S) 

Complement Factor H and Its Ligands: A Multidisciplinary 
Approach to Interactions and Modeling, Fig. 5 Ultracentri¬ 
fugation c(s) size-distribution analyses of Factor H, C-reactive 
protein, and their 1:1 mixture. Factor FI at 1.0 mg/mL showed 
monomer (peak 1) and small amounts of dimer, trimer, 
and tetramer (peaks 2^). C-reactive protein at 1.2 mg/mL 
showed pentamer (P) and decamer (D). The 1:1 mixture of 
Factor H and C-reactive protein showed new peaks A, B, and C 
at different S values from those of peaks 1, 2, 3, 4, P and D. 
Unlike Fig. 3, peaks A, B, and C correspond to reaction bound¬ 
aries, because the equilibrium rate between Factor FI and CRP 
and their complexes is comparable with the rate of sedimentation 
of these proteins in the ultracentrifuge 

monomer-dimer equilibria, in which a series of alter¬ 
nating C3d dimers and SCR-16/20 dimers interacted 
with each other. Erom the biological standpoint, 
multimeric Factor H-C3d complexes offer an addi¬ 
tional means of complement regulation during chronic 
inflammation when C3 levels become elevated. 


Interaction Between Factor H and C-reactive 
Protein 

C-reactive protein (CRP: 115 kDa) is formed as 
a pentameric ring. Its plasma concentration increases 
rapidly from 800 ng/mL (7 nM) to 0.5 mg/mL (4.4 pM) 
following a stimulus. CRP binds to damaged host cells 
in addition to other targets such as bacterial pathogens. 















334 


Complement Factor H and Its Ligands: A Multidisciplinary Approach to Interactions and Modeling 


Complement Factor 
H and Its Ligands: 

A Multidisciplinary 
Approach to Interactions 
and Modeling, Fig. 6 The 
solution structures for Factor 
H and heparin are drawn to the 
same scale as crystal structures 
for C3h and CRP. The FH 
structural model from 2009 is 
shown in cyan, with the C3b- 
binding SCR-1/4 domains 
shown in blue, and the 
heparin-binding SCR-7 and 
SCR-19/20 domains in red. 
Oligosaccharides are shown in 
outline. C3b is shown in red 
for the TED and CUB 
domains, and in blue for the 
ten MG1-MG8, C345C, and 
LNK domains. The CRP 
pentamer and the heparin dp36 
structures are shown in red 



Importantly, 2 mM is present in plasma to stabi¬ 
lize CRP. As for C3 and C3u, knowledge of its oligo¬ 
meric state is required to study its interactions. 
Oligomer formation was resolved by the combination 
of ultracentrifugation, X-ray scattering, constrained 
modeling, and surface plasmon resonance. The c(s) 
analyses showed two peaks that each corresponded to 
pentameric CRP and a reaction boundary between 
pentameric and decameric CRP (red; Fig. 5). X-ray 
scattering was not affected by reaction boundaries, 
and the directly measured proportions of CRP 
pentamer and decamer by scattering gave an averaged 
Kd value of 21 pM in 140 mM NaCl, 2 mM Ca^-". 
Surface plasmon resonance of soluble CRP bound to 
immobilized CRP gave a similar Kd value of 23 pM 
(Table 1). It was also found that the removal of Ca^"^ 
had measurable effects on CRP stability and 
denaturation. 

Eleven prior studies of the Factor H-CRP interac¬ 
tion had concluded that the two proteins either 
interacted with each other or they did not. The different 
outcomes resulted from the various buffers and protein 
concentrations in use, and also whether CRP was dena¬ 
tured or not. Here, ultracentrifugation c(s) plots 
showed that adding CRP to Factor H reduced the 
amounts of CRP decamer and FH oligomers present. 


while two new species were seen (pink; Fig. 5). This 
outcome was confirmed from surface plasmon reso¬ 
nance using both immobilized Factor H and 
immobilized CRP, which gave a Kq of 4 pM for the 
interaction (Table 1). 

The removal of the Factor H oligomers after adding 
CRP showed that CRP had blocked both Factor H self¬ 
dimerization sites at SCR-6/8 and SCR-16/20. Further 
surface plasmon resonance showed that each of 
SCR-6/8 and SCR-16/20 was bound to CRP. Further¬ 
more a large difference was seen in the binding of the 
SCR-6/8 (Tyr402) and SCR-6/8 (His402) allotypes to 
native CRP with Kq values of 4 pM (Tyr402) and 
12 pM (His402) (Table 1). This difference in allotype 
binding suggests that the Factor H-CRP interaction 
is involved with age-related macular degeneration 
(Perkins et al. 2010a, b). 

Interaction Between Factor H and Heparin 

Factor H binds to heparin in at least two major sites. 
However, the solution structure of heparin was not 
well understood. Given that heparin is a readily avail¬ 
able analogue of heparan sulfate that occurs on human 
host cell surfaces, the heparin solution structure is 
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important to know for Factor H interaction studies. The 
crystal structures for small heparin fragments of sizes 
dp4-dpl0 were too small to be useful. However, hep¬ 
arin fragments of sizes dp6-dp36 could be purified 
(i.e., 3-18 iduronate and glucosamine disaccharide 
units). Thus, ultracentrifugation and constrained 
scattering modeling were applied to identify their 
molecular structures in solution (Fig. 6). 

Ultracentrifugation revealed that the fragments 
were monodisperse from the observation of single 
peaks in the c(s) analyses. The sedimentation coeffi¬ 
cients and scattering Rq values increased with heparin 
size. By scattering, the greater sensitivity of the Rq 
values to the conformation revealed that the heparin 
solution structures became progressively more bent 
with increased size. This was confirmed from the 
constrained scattering modeling of conformationally 
randomized heparin structures. The best-fit solution 
structures showed moderate flexibility in mildly bent 
structures. Interestingly similar angular orientations 
between the iduronate and glucosamine rings were 
seen in both the solution and crystal states. It was 
concluded that heparin has a preformed conformation 
that is ready for binding to proteins such as Factor H. 
This study has permitted the follow-up study of the 
Factor H-heparin complex, which is now under 
investigation. 

Summary 

The Factor H interactions are not straightforward to 
work with. A multidisciplinary and biophysical strat¬ 
egy based on four complementary techniques offers 
a means of unraveling the complexity caused by 
multivalency, multiple binding, and weak associations 
involving Factor H self-association, zinc, C3u, CRP, 
and heparin (Fig. 6). The outcome of biophysical stud¬ 
ies with Factor H is summarized in Table 1. One 
noticeable conclusion is that, in consistency with the 
2-6 pM concentrations of C3 and Factor H in plasma, 
all the Kd values measured so far are also in pM 
ranges, that is, they are comparable with the plasma 
concentrations of C3 and Factor H. For example, for 
Factor H itself, the self-association Ko value of 28 pM 
means that 85-90% monomer and 10-15% oligomers 
will be normally present. Another noticeable outcome 
of Table 1 is that the analyses often revealed that 
multimeric free proteins or multimeric complexes are 


formed. The complexes do not correspond to simple 
1;1 complexes. 

The merit of the combination of ultracentrifugation, 
scattering, constrained modeling and surface plasmon 
resonance, sometimes supported by mass spectrometry 
(Nan 2010), is that the methods give comparable and 
complementary results. Thus different views of the 
solution shapes and structures are provided by ultra¬ 
centrifugation, scattering, and constrained modeling. 
The main interaction parameters in terms of Kq values 
are provided by all four methods, some corresponding 
to solution measurements, and others to surface mea¬ 
surements where local concentration effects may be 
important. An understanding of the molecular interac¬ 
tions that take place between Factor H and its ligands 
will impact directly on our understanding of both 
AMD and aHUS diseases. Examples of these biomed¬ 
ical insights were noted above and show that biophys¬ 
ics methods will play an important role in unraveling 
Factor H mechanisms. 

Cross-References 

► HYDFIT and Related Packages for Linear 
Molecules 

► Neutron Scattering of Membranes 

► Sedimentation Equilibrium Analytical 
Ultracentrifugation 

► Sedimentation Velocity Analytical 
Ultracentrifugation 

► Small Angle Neutron Scattering (SANS) Software 

► Solid-Phase Detection Techniques 

► Wide and Small Angle X-Ray Scattering 

► X-Ray Scattering of Lipid Membranes 

References 

Cole JL, Lary JW, Moody TP, Laue TM. Analytical ultracentri¬ 
fugation: sedimentation velocity and sedimentation equilib¬ 
rium. Methods Cell Biol. 2008;84:143-211. 

Nan R. [PhD thesis]. Self-association of complement Factor H in 
the presence and absence of metals. London: University of 
London; 2010. 

Nan R, Farabella I, Schumacher F, Miller A, Gor J, Martin ACR, 
Jones DT, Lengyel I, Perkins SJ. Localisation of the 
major zinc-binding site of complement Factor H to the 
SCR-6/8 domains: possible implications for age-related 
macular degeneration. J Mol Biol. 2011;408:714—35. 

Perkins SJ, Okemefuna AI, Fernando AN, Bonner A, Gilbert HE, 
Furtado PB. X-ray and neutron scattering data and their 



336 


Complex V 


constrained molecular modelling. Methods Cell Biol. 
2008;84:375-423. 

Perkins SJ, Okemefuna AI, Nan R, Li K, Bonner A. Constrained 
solution scattering modelling of human antibodies and com¬ 
plement proteins reveals novel biological insights. J R Soc 
Interface. 2009;6:S679-96. 

Perkins SJ, Nan R, Okemefuna AI, Li K, Khan S, Miller A. 
Multiple interactions of complement factor H with its ligands 
in solution: a progress report. Adv Exp Med Biol. 
2010a;703:25^7. 

Perkins SJ, Okemefuna AI, Nan R. Unravelling protein-protein 
interactions between complement factor H and C-reactive 
protein by a multidisciplinary strategy. Biochem Soc Trans. 
2010b;38:894-900. 

Perkins SJ, Nan R, Li K, Khan S, Abe Y. Analytical ultracentri¬ 
fugation combined with X-ray and neutron scattering: exper¬ 
iment and modeling. Methods. 2011;54:181-99. 

Saunders RE, AbaiTategui-Garrido C, Fremeaux-Bacchi V, 
Goicoechea de Jorge E, Goodship THJ, Lopez Trascasa M, 
Noris M, Ponce Castro IM, Remuzzi G, Rodriguez de 
Cordoba S, Sanchez-Corral P, Zipfel PE, Perkins SJ. The 
interactive factor H - atypical haemolytic uraemic syndrome 
mutation database and website: Update and integration of 
membrane cofactor protein and factor I mutations with struc¬ 
tural models. Hum Mutat. 2007;28:222-34. 

Tanious FA, Nguyen B, Wilson WD. Biosensor-surface plasmon 
resonance methods for quantitative analysis of biomolecular 
interactions. Methods Cell Biol. 2008;84:53-77. 


Complex V 

► ATP Synthase Structure 


Composition Fluctuations 

► Critical Fluctuations in Lipid Mixtures 


Composition-Gradient Multiangle Light 
Scattering 

Daniel Some 

Wyatt Technology Corporation, Santa Barbara, 

CA, USA 

Synonyms 

Concentration-gradient multiangle light scattering 


Definition 

Characterization of the equilibrium and kinetic prop¬ 
erties of interacting macromolecules in solution, by 
means of static light-scattering analysis performed 
over a series of sample compositions or concentrations 
and possibly at multiple scattering angles. 

Introduction 

Composition-gradient multiangle light scattering 
(CG-MALS) is a biophysical technique for characteriz¬ 
ing macromolecular interactions in solution, without 
recourse to labeling or immobilization. The detectors 
employed are similar to those of ► multiangle light 
scattering from separated samples: a multiangle 
light-scattering photometer, often in combination with 
a solution-based concentration detector such as 
a differential refractometer or UVWis absorption spec¬ 
trophotometer. Instead of a continuously flowing frac¬ 
tionation apparatus, CG-MALS utilizes a multi-syringe 
pump system, programmed to provide stop-flow injec¬ 
tions at a series of well-defined concentrations or com¬ 
positions. At each composition, the sample in the 
detector cells is permitted sufficient time to complete 
any ongoing reaction. Data acquired at each step, after 
equilibration, is analyzed to determine equilibrium prop¬ 
erties including association stoichiometry and binding 
affinity (Ka) in the case of specific complex formation 
and/or thermodynamic parameters such as virial coeffi¬ 
cients in the case of nonspecific interactions. The reac¬ 
tion rates at each composition may also be analyzed to 
determine kinetic parameters such as the Michaelis 
constant (k|y[) or association/dissociation rate constants 

(ko n? koff)- 

Conceptually, CG-MALS is closely related to 
► analytical ultracentrifugation - sedimentation equi¬ 
librium: it is an “absolute” technique based entirely on 
first principles and a rigorous thermodynamic founda¬ 
tion. There are no matrix or foreign tag interactions, and 
the choice of solvents is almost unrestricted (though the 
solutions must be purified and particulates removed via 
filtration). Since static light-scattering analysis 
provides a measure of weight-averaged molar mass, 
CG-MALS goes beyond the stoichiometric ratio to 
determine full complex stoichiometry. Systems under¬ 
going self-interaction, hetero-association, or even 
simultaneous reversible self- and hetero-association 
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may be quantified. Unlike sedimentation equilibrium, 
a typical automated CG-MALS run may complete in 
less than an hour, unless the reaction times of the system 
are particularly slow. Reaction kinetics may be 
observed and quantified, subject to the limitations of 
the instrumentation. Interaction analysis is not limited 
to macromolecules but may be extended to the behavior 
of larger objects such as protein uptake by microgel 
particles (Smith and Lyon 201 1), and the same instru¬ 
mentation may serve to automate the study of additional 
phenomena such as nanoparticle degradation kinetics 
(Smith etal. 2011). 

Characterization of biomacromolecular interactions 
by means of static light scattering has been known and 
well-understood since the inception of polymer 
analysis by MALS (Burchard and Cowie 1972). 
Traditional CG-MALS was labor-intensive, required 
copious sample volumes, and was quite limited in 
sensitivity, restricting the range of addressable interac¬ 
tions. The technique became viable for widespread use 
upon two key developments: the introduction of 
modem, highly sensitive multiangle laser-scattering 
instramentation adapted for flowing solutions 
(Wyatt 1993) and the combination of this instmmenta- 
tion with an automated stop-flow composition-gradient 
device and analysis algorithm (Attri and Minton 
2005a, b; Kameyama and Minton 2006). Analysis of 
reversible association via CG-MALS was extended 
well into the upper concentration range where signifi¬ 
cant thermodynamic non-ideality is present, in order to 
address very weak interactions (Minton 2007). 


CG-MALS Theory for Thermodynamically 
Ideal Systems 

The thermodynamically ideal limit implies an ensemble 
of point-like, non-interacting particles, akin to the “ideal 
gas law” familiar from elementary chemistry. For 
biomacromolecules in solution, this condition is typi¬ 
cally met when the concentration of macromolecules is 
below ^1 g/L, though a lower concentration may be 
required if the solvent ionic strength is relatively low 
and the molecules highly charged, or if the macromolec- 
ular conformation is extended. This condition of ideality 
is also known as “the dilute limit.” 

In the most common configuration, an incident 
beam of light with vacuum wavelength Aq and linear 
polarization propagates through the solution in 


a uniform direction (Teraoka 2002; van Holde et al. 
1998; Young 1981). Photometric detectors are placed 
within the plane that contains the incident beam and is 
perpendicular to the polarization vector, at one or 
more azimuthal angles 9 relative to the direction 
of propagation of the beam. The “ideal gas law” 
for static light scattering from a solution comprising 
multiple, non-interacting species i, presented in 
Eq. 1, is simply a sum of terms, each of which corre¬ 
sponds to the scattered intensity from an individual 
species. 



( 1 ) 


Here R(9) represents the “excess Rayleigh ratio” 
measured at each detector angle 9. The total Rayleigh 
ratio is the ratio of the radiant intensity scattered by 
a unit volume of the solution, to the intensity of inci¬ 
dent light. The excess Rayleigh ratio is obtained by 
subtracting from the total Rayleigh ratio that portion 
due to scattering by the solvent, resulting in a measure 
of the light scattered by the solute alone. K is a system- 
dependent constant which includes Aq and the solvent 
refractive index «o- c,, and dn/dCj refer to the molar 

mass, mass/volume concentration, and refractive 
increment of each species i, respectively. P(9, /g ,) is 
the angular dependence of the scattered light vs. azi¬ 
muth 0, as given by Eq. 2. 


P{9,r,) = l 


2 

3aI 



( 2 ) 


In Eq. 2, the quantity is the molecular 

mean square radius defined as J r^dm/^ dm where 
the mass integral is taken over the entire molecule; 
the square root of (^>'1^ is often called the radius of 
gyration, or fg. In many cases of interest, e.g., proteins 
and polysaccharides smaller than a few hundreds of 
kDa, Vg is so much smaller than Aq/wo that Pwl and 
angular dependence may be ignored. Such molecules 
are known as “isotropic scatterers.” 

Specific binding of biomacromolecules such as 
antibodies and antigens are characterized by an equi¬ 
librium dissociation constants Kq = HKj^ usually well 
below 1 g/L (equivalent to 100 pM for a 10 kDa 
molecule, or 10 pM for a 100 kDa molecule), well 
within the dilute limit. Upon reaching equilibrium, 
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the solution is considered to consist of a mixture of 
several discrete macromolecular species: free mono¬ 
mers and one or more complexes. This solution can be 
analyzed in terms of ideal light scattering using Eq. 1 
and an additional set of equations pertinent to the 
precise association model (Burchard and Cowie 
1972; Attri and Minton 2005a, b). 

For a self-associating system, we denote the oligo¬ 
mer order by j. The equilibrium between oligomers and 
free monomer is described by the ► law of mass action 
Eq. 3. The set of equations describing the association 
model is completed with the conservation of mass 
Eq. 4. Hence, the full set of equations used in 
CG-MALS analysis of reversible self-association of 
a single monomer to form one or more oligomers 
includes Eqs. 1, 3, and 4: one version of Eq. 3 for 
each oligomeric species. 


A-b A-b 




(3) 


[A]'" = (4) 

j 


For a hetero-associating system, we denote the 
monomeric species by A and B, and the complex 
stoichiometry by j and k. The mass action equation 
for each jk complex is given by Eq. 5, and the conser¬ 
vation of mass equations for A and B, by Eq. 6 . 
In combination with Eq. 1, these constitute a full 
set of equations with which to analyze CG-MALS 
data. 


A'A 

A + B-b... -^^A,'B,t; 




[AyB,] 

[Af[B]^ 


(5) 


CG-MALS Theory for Thermodynamically 
Non-ideal Systems 

At concentrations above ^ 1 g/L, even in the absence 
of specific complex formation, biomacromolecules 
begin to experience the effects of relatively weak, 
nonspecific interactions due to several physical and 
chemical effects (Neal et al. 1998). These include 
hard-core (Van der Waals) repulsion; electrostatic 
repulsion arising from net molecular charge; attractive 
monopole-dipole, dipole-dipole, and higher-order elec¬ 
trostatic interactions arising from charge distributions 
across the molecules; and surface hydrophobic patches, 
hydrogen bonds, and others. The combination of all 
these contributions leads to an intermolecular potential 
W(R,0,(l>) that depends on both molecular separation R 
and relative orientation angles (0,<P) as well as solvent 
conditions, e.g., pH and ionic strength. This potential 
lends itself to thermodynamic treatment by means of 
statistical mechanics, e.g., defining a pair correlation 
function from which various thermodynamic parame¬ 
ters may be derived (see ► Thermodynamics of Bio- 
molecular Interactions). 

A full and rigorous analysis of multicomponent static 
light scattering, assuming a series of discrete species 
subject to such nonspecific interactions, was derived in 
the context of the grand canonical ensemble (Kirkwood 
and Goldberg 1950; Blanco et al. 2011 and references 
therein). In the limit of isotropic scatterers, as well as the 
limit 0^0, the excess Rayleigh ratio is given by Eq. 7. 



[A]"" = E^'[AyB.]; [B]“' = E ^[AyB.] ( 6 ) 

J k 

The most common protocol for analyzing hetero¬ 
association of macromolecules via CG-MALS 
employs a composition gradient beginning with 
100% A, then proceeding through a series of steps 
where the amount of A is decreased and B is increased, 
until the final step consisting of an injection of 100% B. 
In this manner, many different stoichiometries are 
sampled in order to determine the association model 
as well as quantify binding. 


of Eq. 7 is the mn sub-determinant of i/^, is 
the thermodynamic activity coefficient of species m 
(which depends on the concentrations of all species 
in solution), and (5„,„ is the Kronecker delta. 

In the limit of a single, non-self-associating macro- 
molecular species, Eq. 7 is often simplified to Eq. 8 or 
approximations thereof, using the virial expansion, 
where the coefficients A 2 , A 3 , etc. (also denoted B 22 , 
8222 ^ etc.) are called the virial coefficients. Calcula¬ 
tions of virial coefficients, especially A 2 , from W 
(R,0,^) are provided in standard textbooks on statis¬ 
tical mechanics and chemical thermodynamics. Virial 
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coefficients are measured via static light scattering by 
means of a series of concentration steps. 


R Me Me 

K ^ 1 , ^ 1 -f 2A2MC + 3A3MC2 + ... 

dc 


For two interacting, non-associating species A and 
B, one common way of approximating Eq. 7 with 
a virial expansion is provided in Eq. 9. The term Ajj 
(a.k.a. B 23 ), which captures first order, nonspecific 
hetero-interaefions, may be determined by measuring 
light scattering from a series of compositions of A and 
B, if the self-virial coefficients A 2 ^ and A 2 ,b have been 
predetermined. Equation Eq. 7 may be expanded to 
include higher-order virial coefficients and/or to other 
functional forms. 


R 

K 


_MaCa_ 

1 + 2A2aMaCa + ... 

^ Mgeg 

1 + 2 A 23 MgCB + ... 


AAnMAMgeAeg (9) 


At very high concentrations, e.g., 100 mg/mL, 
macromolecules may experience quasi-specific asso¬ 
ciation, wherein nonspecific attraction due to, e.g., 
dipole-dipole interactions leads to weak complex for¬ 
mation. Such systems may be described by models that 
assume all attractive effects are included in the equi¬ 
librium association constants of complexes with 
defined stoichiometries, while repulsive effects are 
accounted for by thermodynamic non-ideality (Minton 
2007; Eernandez and Minton 2009). A useful approx¬ 
imation for self-associating macromolecules describes 
the repulsion in terms of an effective specific volume 
Vp^and a hard-sphere inferaction wherein all oligomers 
retain a constant v^ff. Antibodies at these concentra¬ 
tions are well-described by such hybrid models, often 
presenting the apparent formation of weakly bound 
dimers and higher oligomers (Scherer et al. 2010). 


CG-MALS Apparatus 


CG-MALS system comprising the necessary hardware 
and control and analysis software is the Calypso 
(Wyatt Technology Corp., Santa Barbara, CA). 

Knowledge of sample concentrations is essential to 
CG-MALS data analysis. In the simplest implementa¬ 
tion, the concentrations at each composition are calcu¬ 
lated from the pre-measured initial stock solution 
concentrations and the programmed dilution ratio. 
However, it is often desirable to measure concentrations 
in real time by means of an in-line concentration detec¬ 
tor, e.g., a differential refractometer or UV/Vis absorp¬ 
tion spectrophotometer such as is found in HPLC 
systems. The choice will be influenced by the particular 
samples. UV/Vis spectrophotometers are generally 
a good option for proteins at concentrations up to ^ 1 
g/L. A differential refractometer is particularly sensitive 
to incomplete dialysis and stop-flow pressure fluctua¬ 
tions; however, not all macromolecules exhibit absorp¬ 
tion while a differential refractometer is almost 
universally applicable to all types of macromolecules. 

The concentration detector may be plumbed in par¬ 
allel to the MALS detector (Attri and Minton 2005a, b) 
or in series (Some et al. 2008). The parallel detector 
configuration requires careful calibration and mainte¬ 
nance of the split-flow ratio between detectors, but can 
be arranged to use less sample than the serial detector 
configuration. Conversely, the serial configuration is 
more robust and simpler to implement but requires 
more sample. 

Even if the initial solutions are carefully purified, fhere 
are several sources of particles in a CG-MALS system, 
such as dust in the sample reservoirs and the mechanical 
motion of syringes or valves. These particles must be 
filtered out of the solution prior to reaching the MALS 
detector in order to obtain low-noise data. At least two 
different designs for in-line filter placement have been 
implemented, each with certain advantages and disad¬ 
vantages. Figure la depicts the original Attri and Minton 
parallel detector/single filter configuration, while Fig. lb 
depicts the Calypso detector and filter configuration. 
More considerations regarding CG-MALS hardware 
configuration may be found in Some and Kenrick (2012). 


The core of an automated CG-MALS system includes 
a MALS photometer incorporating a flow cell, a laser, 
and sensitive detectors plus a syringe-pump system 
programmable to create and deliver composition gra¬ 
dients as a series of stopped-flow injections. As of the 
time of writing, the only commercially available 


Experimental Protocol, Method Design and 
Optimization 

CG-MALS is very sensitive to the presence of molec¬ 
ular aggregates and foreign particulates. Sample 
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Composition-Gradient 
Multiangle Light 
Scattering, Fig. 1 (a) CG- 

MALS hardware configuration 
as per Attri and Minton 
(2005a): single in-line filter, 
post-mixing point, serves as 
both mixer and filter; mixed 
solution is split between 
parallel detectors, (b) 
Configuration as per Some 
et al. (2008), each solution 
filtered separately prior to 
mixing; serial detectors 
(Figure reproduced and 
modified with permission 
(Some and Kenrick 2012)) 



solutions should be purified and filtered to the smallest 
practical pore size (typically 20 nm for most proteins) 
in order to remove particulates. Solvents should also be 
filtered, typically to 100 nm and even smaller for 
the most sensitive measurements. All containers 
must be sterile and particle-free. Prior to beginning 
a measurement, the instruments should be flushed 
extensively with the designated solvent and low scat¬ 
tering noise levels verified. In-line hlter membranes 
typically make use of 100-nm pore sizes. 

A standard method will include at least four 
segments, shown in Fig. 2. The gradient of Fig. 2 
includes both single-species concentration gradients 
and a dual-species crossover gradient, making it 
generally applicable to characterizing simultaneous 
self- and hetero-association, or self- and cross-virial 
coefficients in the case of nonspecific interactions. 
Variations on this theme will adapt it to any particular 
system, e.g., concentration series Ilia for a single¬ 
species system exhibiting self-interaction, or just the 
crossover gradient Illb for characterizing the hetero¬ 
association of species known not to self-associate in 
the pertinent concentration range. 

The actual concentrations employed in an optimal 
measurement are dictated by the equilibrium associa¬ 
tion constants and molar masses of the molecules. As 


a general rule of thumb, the initial stock solution con¬ 
centrations should be on the order of 3-10 x K^. 
Disparity in the molar masses of hetero-associating 
molecules by more than a factor of 2-3 will shift the 
optimal conditions toward a higher concentration of 
the lower molar mass species. A very large disparity 
may call for a titration-like gradient with a fixed con¬ 
centration of the high-molecular-weight species. Sim¬ 
ulation of CG-MALS signals, assuming different 
plausible association schemes and range of binding 
affinities, is a useful tool to further optimize the exper¬ 
imental composition gradient (Fig. 3). 

Experimental MATS and concentration signals are 
analyzed and fit to association models, defined by 
Eqs. 3 and 4 plus the relevant ideal or non-ideal 
light-scattering equation, using standard nonlinear 
least-squares methods. The information inherent to 
a well-designed CG-MALS measurement, which 
includes stoichiometric ratio (determined from the 
peak position in the crossover gradient) and weight- 
averaged molar mass (determined from the ratio of 
light-scattering intensity to concentration), will usu¬ 
ally permit proper fitting of only a small number of 
models to a given data set. Extraneous interaction 
terms will often “fall out of the fit,” i.e., their contri¬ 
butions will go to zero. Hence, arbitrarily increasing 
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Composition-Gradient Multiangle Light Scattering, 
Fig. 2 A CG-MALS method includes four essential segments: 
solvent injections for baseline signals (I), pre-load of sample (II), 
the actual composition gradient (III), and a final solvent flush to 
re-measure baseline (IV). The composition gradient shown 


includes three segments, designed to measure self-interaction 
of molecule A (Ilia), hetero-interaction of A and B (Illb), and 
self-interaction of B (IIIc) (Figure reproduced and modified with 
permission (Some and Kenrick 2012)) 



Concentration of Species B <- 


Concentration of Species B 


Protein Concentration 



Composition-Gradient Multiangle Light Scattering, 
Fig. 3 CG-MALS simulations calculated by the Calypso soft¬ 
ware in order to optimize method design, (a) Self-association 
assuming monomer-dimer equilibrium (different K^s) or 


monomer-trimer equilibrium; (b) hetero-association assuming 
1:1 binding, different K^s; and (c) hetero-association assuming 
different stoichiometries, fixed (Reproduced with permission 
(Some and Kenrick 2012) 


the number of interaction terms will not necessarily 
lead to an arbitrary improvement in goodness of fit. 
The simulation tool can assist in the design of a follow- 
on measurement to refine the analysis by means of 
global fitting of multiple data sets. 

Kinetics 

Technological limitations on the minimum dead time 
of current CG-MALS instrumentation are imposed by 
the necessary in-line filtration as well as low scattering 


signals at concentrations typical of 0.1-100-nM 
biomacromolecular interactions. Still, binding, 
aggregation, and dissociation kinetics on the order of 
seconds and longer may be measured. Quantification 
of molar mass and size means CG-MALS 
kinetics measurements are useful for a large class of 
interacting systems, especially those involving 
competing interactions (Some and Hanlon 2010) or 
formation of metacomplexes by self-association 
of lower-order reversible complexes (unpublished 
data, several multivalent systems). The equations 
describing binding kinetics, presented elsewhere 
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(see ► Kinetics: Overview), are combined with Eq. 1 
in order to analyze CG-MALS data. 

Summary 

The emergence of automated CG-MALS signals an 
important addition to the biophysicist’s toolkit for 
studying a hroad range of macromolecular interactions, 
without labeling or immohilization. Recent studies 
(many not yet published) have characterized systems 
ranging from ^ 1-kDa peptides up to submicron parti¬ 
cles and binding affinities from sub-pM up to > 1 mM. 
Nonspecific interactions, both attractive and repulsive, 
may also be quantified. Particular strengths of 
CG-MALS include relatively short experiment times; 
determination of the true stoichiometry (rather than just 
stoichiometric ratio) and binding constants even for 
difficult systems such as those exhibiting simultaneous 
self- and hetero-association or those forming 
metacomplexes such as (AB) -i- (AB )2 -i- (AB )3 -i- ...; 
determination of particle size evolution in the diameter 
range 25-1,000 nm; quantitative analysis at high con¬ 
centrations representative of macromolecular crowding 
in the cytosome or therapeutic antibody formulations; 
and cooperativity and bi-specific molecules. Kinetics of 
association, dissociation, and aggregation may also be 
observed and analyzed for many relevant systems. As 
such, it complements other interaction techniques based 
on fluorescence, surface plasmon resonance, and 

► isothermal titration calorimetry. 

Cross-References 

► Kinetics: Overview 

► Law of Mass Action 

► Linear Motifs in Protein-Protein Interactions 

► Macromolecular Crowding: Effects on Association 
Equilibria and Kinetics 

► Multiangle Light Scattering from Separated 
Samples (MALS with SEC or EEE) 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► Sedimentation Equilibrium Analytical 
Ultracentrifugation 

► Stopped-Flow Techniques 

► Thermodynamics and Thermodynamic Nonideality 

► Thermodynamics of Biomolecular Interactions 
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Computational Enzymology 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Definition 

Computational enzymology is the scientific 
subdiscipline that applies computational molecular 
simulation and modeling to enzymes, in particular to 
simulate enzyme-catalyzed reactions. 

Basic Characteristics 

Computational enzymology is a rapidly developing 
and maturing scientific area, which is increasingly 
confribufing fo understanding mechanisms of enzyme 
catalysis, as well as for practical applications of 
enzymes (Van der Kamp and Mulholland 2008; 
Lonsdale et al. 2010). Computational modeling and 
simulation have the unique potential to offer detailed, 
atomic-resolution insight into the dynamics and reac¬ 
tions of biomolecules. Quantum mechanical (QM) 
electronic structure calculations can be useful for 
modeling reactions in models of enzyme active sites. 
Larger models can be treated with combined quantum 
mechanics/molecular mechanics (QM/MM methods), 
or empirical valence bond methods, with which molec¬ 
ular dynamics simulations can be performed. Applying 
these methods to enzymes can help resolve open ques¬ 
tions by interpreting, complementing, and expanding 
results obtained from experiment. Theories of catalysis 
and alternative proposed mechanisms can be tested, 
and novel catalytic mechanisms can also be investi¬ 
gated. Increasingly, computer simulation is contribut¬ 
ing directly to experimental studies of enzymes. 
Potential practical applications of computational enzy¬ 
mology include interpretation of experimental data, 
catalyst design, and drug development. 


Cross-References 

► Acetylcholinesterase: Computational Studies 

► Beta-Lactamases: Computational Studies 

► Biliverdin IX-Beta Reductase: Computational 
Studies 

► Carboxypeptidase A - Computational Studies 

► Chorismate Mutase - Computational Studies 

► Citrate Synthase - Computational Studies 

► Cyclooxygenase - Computational Studies 

► Cysteine Dioxygenase - Computational Studies 

► Cytochrome c Peroxidase - Computational Studies 

► Cytochrome P450 - Computational Studies 

► Dihydrofolate Reductase - Computational Studies 

► Enzymatic Chlorination/Halogenation - 
Computational Studies 

► Enzymatic Eluorination - Computational Studies 

► Earnesyltransferase - Computational Studies 

► Eatty Acid Amide Hydrolase - Computational 
Studies 

► Elavoenzyme Mechanisms: p-Hydroxybenzoate 
Hydroxylase and Phenol Hydroxylase - 
Computational Studies 

► Glutathione S-Transferase - Computational Studies 

► Glycoside Hydrolases - Computational Studies 

► Heme Dioxygenases - Computational Studies 

► HIV Protease - Computational Studies 

► HIV-1 Reverse Transcriptase - Computational 
Studies on the Polymerase Active Site 

► Influenza Neuraminidase - Computational Studies 

► Insulin-Degrading Enzyme - Computational 
Studies 

► Lactate Dehydrogenase 

► Lysozyme - Computational Studies 

► Manganese-Containing Arginase - Computational 
Studies 

► Methylamine Dehydrogenase (MADH): Studies 
with Variational Transition State Theory (VTST) 

► Nitric Oxide Synthase - Computational Studies 

► Phosphatases - Computational Studies 

► Phosphotriesterase - Computational Studies 

► Protein Lysine Methyltransferase - Computational 
Studies 

► Reaction Paths and Rates 

► Ribonucleotide Reductase 

► Ribosomal Peptide Synthesis 

► Semiempirical Quantum Mechanical Methods 

► Taurine/Alpha-Ketoglutarate Dioxygenase - 
Computational Studies 


c 
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► Triosephosphate Isomerase - Computational 
Studies 

► Zinc-Dependent Metalloenzymes - Computational 
Studies 
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Synonyms 

Neural networks 


Introduction 

Human brain contains about ten billion neurons, each of 
which has about 10 ^ 10,000 nerve endings from which 
neurotransmitters are released in response to incoming 
spikes, and the released neurotransmitters then bind to 
receptors located in the postsynaptic neurons. Individ¬ 
ually, neurons are noisy, but groups of neurons that are 
arranged in specialized modules can collectively per¬ 
form complex information processing tasks robustly 
and reliably. How groups of neurons perform behavior- 
ally related tasks crucially relies on a coherent organi¬ 
zation of dynamics from membrane ionic kinetics to 
synaptic coupling of the network and of rhythmic oscil¬ 
lations that are tightly linked to behavioral state. 

To capture essential features of the biological sys¬ 
tem at multiple spatial-temporal scales, it is important 
to construct a suitable computational model that is 


closely or solely based on experimental data. 
Depending on what one wants to understand, these 
models can either be very functional and biologically 
realistic descriptions with thousands of coupled differ¬ 
ential equations (Hodgkin-Huxley type) or greatly 
simplified caricatures (integrate-and-fire type), which 
are useful for studying large interconnected networks. 

Single Spiking Neuron 

The single neuron is the fundamental building block of 
every nervous system. Despite varying more or less in 
size and shape, almost all neurons have three basic parts: 
a cell body or soma, dendrites, and an axon. Dendrites 
play the role of “input receivers” that collect signals from 
other neurons and transmit to soma. The soma is the 
“central information processer,” while the axon functions 
as an “information sender.” The cell membrane of the 
axon and soma contain voltage-gated ion channels. These 
channels are selective and permeable only to one partic¬ 
ular ionic species, and the opening and closing of a single 
channel depends on the membrane voltage, ligand, or 
second messengers, and is intrinsically stochastic. Here, 
a single neuron of Hodgkin-Huxley (HH) type, with 
a finite number of channels, is considered to show how 
various ion channels contribute to spike generation. 

Neurons communicate information via synapses, 
junctions that allow a neuron to pass electrical or 
chemical signals to one or another. Classically, neu¬ 
rons receive synaptic inputs from other neurons via 
their dendrites, and encode these inputs as very brief 
electric events (action potentials, also called spikes). 
These spikes are propagated along axons to nerve 
endings, where the signals are transmitted to other 
neurons via release of neurotransmitters at synapses. 
Most neurons in the central nervous system (CNS) use 
either the excitatory transmitter glutamate (acting 
through AMPA, a-amino-3-hydroxy-5-methyl-4- 
isoxazolepropionic acid, or NMDA, A-Methyl-o- 
aspartic acid, receptors) or the inhibitory transmitter 
GABA (gamma-aminobutyric acid), but some use 
a relatively small number of alternative neurotransmit¬ 
ters, the best known of which are acetylcholine, hista¬ 
mine, noradrenaline, serotonin, dopamine, etc. 

The Hodgkin-Huxley (HH) Model 

Instead of considering the deterministic HH model 
(Hodgkin and Huxley 1952), here a stochastic version 
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of the HH model is presented to account for the 
stochastic effect of a hnite number of ion channels 
(Neher and Sakmann 1976). Consider a certain patch 
of membrane with an area, on which there are 
Nk = Pk ^ Area potassium channels and 
Nno = Pno ^ Area sodium channels, where is 

the channel density. The corresponding membrane 
potential V(t) is described by 


— = -[gL{V - Vl) + gK{V, t)-{V- Vk) 

+ gNa{V,t)-{V-VNa)]+Iapp- (1) 

where Cm is the membrane capacity, Vp is the reversal 
potential for leakage currents, Vk and Vno are the 
reversal potentials for potassium and sodium, respec¬ 
tively. lapp is the external input; gKiV, t) and gNa{V, t) 
are the voltage- and time-dependent potassium and 
sodium conductance given by 


potential. Similarly, the kinetics of the potassium 
channel is characterized by 

4a„ 3a„ 2a„ a„ 

«0 < ^ «1 < ^ «2 < ^ ^ «4 (4) 

A, 2A, 3I3„ 4/J„ 


c 


Corresponding to the kinetics of a single sodium 
channel, the matrices of transition rates for m gate and 
h gate are 


Qm — 

—3a„, 3a„, 0 

_ Am Am 2(Xm 

0 2Am -2Am - Km 

0 0 3Am 


0 

0 


GC/jj 

-3A, 


(5) 


here ij = 0(mQ),l(mi),2(m2),3(mj,); and 


gK{V, t) = gK 


OK{V,t) 

Nk 


gNa{V,t) — gMa 


ONa{V,t) 

Nno 


( 2 ) 


In (2), gK and g^a are the potassium conductance 
per channel and the sodium conductance per channel, 
respectively; OK{V,t) and ONa{V,t) are the number 
of open potassium and sodium channels; both are 
voltage- and time-dependent. In the Hodgkin-Huxley 
model (Hodgkin and Huxley 1952), each sodium chan¬ 
nel has three identical m activation gates and one h 
inactivation gate, and each potassium channel has four 
identical n activation gates. 

Let {rrij, hj) be the state of a sodium channel with i m 
gates and j h gates open, then the state evolution of the 
sodium channel can be considered as a Markov pro¬ 
cess, with kinetics being given by 


3o:,„ 0^, 

mo/iQ < ^ ^ 

Am 2/i,„ 

a/4TAA “UtA* echi'lPh a/4TA* (3) 

3o:,„ 2o^, ctm 

mo/Jl < ^ ^ <- > mj/i, 

Am Wm 3A„ 

where all a,p are transition rate constants from one 
state to another, usually depending on the membrane 


Qh = {Qh{iJ)) 


CHh GCfj 

Ph —Ph 


( 6 ) 


with iJ = 0(/!o),l(/2i), respectively (see (3) and (4)). 
A similar transition matrix Q„ can be dehned for 
a single potassium channel. 

For a given membrane patch, the open number 
0Na{V,t) of the sodium channels corresponds to the 
number of channels in the state {nij, hi), and the open 
number Ok{V, t) of the potassium channels is the num¬ 
ber of potassium channels in the state « 4 . It is therefore 
necessary to consider a system whose state at time fo 
can be completely characterized by a vector with 
13 elements 


{Nmaho — 4^15 ‘ ‘ ‘ ^Nm^hi — ^H^N^q — ^9: ‘ ' ‘ ^Nfi^ — ^^13) 

(7) 


Let 

lij = +Qh{j,i)),i = 0,1,2,3; y = 0,1: it 

represents the escape rate of a sodium channel from 
a state (m,, Ay). Similarly, U = -{Qn{k, k)), 
k = 1,2,3,4 represents the escape rate of 

a potassium channel from a state np. 

1 3 4 

7-0 /-O it-0 
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is the transition rate from a given state of Eqs. (3-4), 
for example, (Nm„h„ =xo,N„,^hB = Xi, = xs, 

Nnb = xg,NN^ = Xi3). So the sojourn time this above 
state, obeys an exponential distribution, that is, 

P{t > t) = t > to (9) 

In simulations, the sojourn time of the system at this 
state can be determined by picking up a random vari¬ 
able u € U{0, 1] (uniform distribution), and then using 
the relationship 


tr = -ln{u)/L (10) 

Then, at time to + f,, the state of the system has to 
be updated. 

Next, one has to determine which state the system 
will occupy at time to + t,-. From the Markov diagram 
of the sodium and potassium channels, one can see that 
there are totally 28 conformational transitions with 
transition rate rij: 

{mo, ho) —{mu ho), 

{my,ho) — 


(»i2,/ri) — {m,hi), 

(ffi3,/ri) — {m2,hi). 


There are 20 pairs in total for the sodium channel (see 
(3)) and similarly there are eight pairs of conforma¬ 
tional transitions in total for the potassium channel 
(see (4)). The probability of escaping from state j 
during time interval [fo + t^, to + t,. + dt\ is given by 


rjjdt 


27 


E Vjdt 
7=0 


1 ■ 


( 11 ) 


E 

Let Po = 0, Pk = ^ = 0, • • •, 27. Numeri¬ 

cally, the state transition at time to + t,- is realized by 
picking up a uniformly distributed random number 
r e 1/(0,1], and finding k such that 


transition {m\,ho) —> {mo, ho) occurs, is then 

replaced by -f 1 and is replaced by 

^m,hQ !■ 

Once a state transition occurs, the values of 
and Nn^ are recorded, and the membrane potential V (?) 
is updated, accordingly. This numerical method is 
called the Gillespie algorithm (Gillespie 1976). One 
can instead use a Monte Carlo method by tracking the 
states of each individual channel. But the latter is much 
more time consuming if the number of channels is 
large. 

Although the importance of channel noise on infor¬ 
mation transmission between neurons is not fully 
understood, it has been shown that the stochastic flick¬ 
ering of ion channels is crucial for spontaneous firing 
(Chow and White 1996) and subthreshold oscillations 
(Dorval and White 2005). Figure 1 shows the sponta¬ 
neous firing without an external input. It is seen that 
even without any external injected current, a single 
neuron can spontaneously fire due to the intrinsic 
noise aroused by stochastic opening and closing of 
ion channels with finite number. It is also seen that 
with an increasing number of channels, the rate of 
spontaneous firing decreases. Actually, if there are 
sufficient channels, a single neuron without any exter¬ 
nal input will become silent, and the stochastic model 
becomes identical to the classic HH model (Hodgkin 
and Huxley 1952). For the detailed link between sto¬ 
chastic and deterministic HH models, see Fox and Lu 
(1994). 

Integrate-and-Fire (IF) Model 

In 1907, long before the mechanism of spike genera¬ 
tion was known, Lapicque developed a very simple 
integrate-and-fire model to describe basic neuron oper¬ 
ation (Brunei and Van Rossum 2007). In this model, an 
action potential Is generated when the integrated mem¬ 
brane potential V{t) crosses a prefixed threshold V,/,. 
The dynamics of the membrane potential is given by 

f CmdV = —{V — Vrest)dt + dlsy„{t) + Pppdt, V < V,h 
\ V{t) = Vrese,, F > V,h 

(13) 


Pk<r< Pk+y (12) 

The determined k is used to update the state of the 
system. For example, when k — the conformational 


The synaptic input is 

p ■? 

^syn{t) — ^ ^ ( Pi h ^ ( Ij. 

1=1 7=1 


( 14 ) 









Computational Modeling of Neuronal Networks 


347 


c 



Computational Modeling of Neuronal Networks, for different patch sizes: (a) is for Area = 0.5 (b) is for 

Fig. 1 The time evolutions of the membrane voltage of the Area = 5 jirr?, and (c) is for Area = 100 /tm^. Other parameters 
stochastic HH model in the absence of an external input current for the model are from Chow and White (1996) 


where £, = {£,(/),? > 0},/^ = {lj{t),t > 0} are inho¬ 
mogeneous Poisson processes with rates Iej and 
respectively (Shadlen and Newsome 1994). a> 0,h > 
0 are the magnitudes of each EPSP and IPSP, and p and 
q are the total number of active excitatory and inhibi¬ 
tory synapses. In the case where there are numerous 
presynaptic inputs and both a and h are small, one can 
use a diffusion approximation to approximate the syn¬ 
aptic inputs (Tuckwell 1988; Feng 2004). In the case 
a = h, p = q, the LIE model is simplified as 


CmdV = —{V — Vrest)dt + pdt + (jdB{t), V < Vth 

(15) 


with 


ji = a/l(f)(l — r) 

= a^l{t){ \ + r) 


where X{t) = X) rX{t) = the ratio 

/ / 

between the inhibitory input and the excitatory input 
and B(t) is the standard Brownian motion. 

The interspike interval (ISI) of efferent spikes is 


Despite its simplicity, to find an analytical expression 
of T has been an endeavor for many years. It was 
recently found out that the probability density of T, 
denoted as px,r{t), has the following form (here we 
assume that Vrest =0): 


Plr{t) = exp 


+ 2p,yV,h 

V 2yCT 


X Eo=v„. 



1 

2y^a 



(vi - V,i, + pyf ds I 
(17) 


The expression looks a bit complicated, in which 
y = I/Cm and 


P\t) 


Vth 

\/2%aP’ 


exp 



(18) 


Besides, there is a stochastic process, called 3- 
dimensional Bessel bridge being included. 

Mathematically, it satisfies the following stochastic 
differential equation 


t/v. = 


Vth — Vo (T \ ^ 

H- ]ds + y/adBs, Q < s < t, 


t — s 


Vo = 0, Vf = V,h. 


(19) 


T = inf{r: V{t) > V,h}. 
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Computational Modeling 
of Neuronal Networks, 

Fig. 2 MLE in a network 
without interactions. 

(a) Schematic plot of reading 
dynamic inputs from an 
ensemble of neurons. For 
a fixed time window (indicated 
by the spikes are 

collected and the input is 
decoded by means of 
maximum likelihood 
estimator, (b) Raster plot of 
spikes in five different 
decoding windows. The 
information is read out from 
the spikes in each window. 

(c) An example of reading out 
the dynamic input rates from 
an ensemble of neurons is 
shown. The original signal, A, 
is plotted in the continuous 
line, while dots are estimated 
values of A (Adapted from 
Zhang et al. (2009)) 



In (19), Eo^Vn represents the expectation with 
respect to the stochastic process with 

a starting pointO and an ending pointV,/,. 

One of the most puzzling issues in neuroscience is 
how the brain encodes and then decodes information. In 
the setting here, the input information is the input rate 
1(f), and the observed information is firing patterns of an 
ensemble of neurons (see Fig. 2). Having an exact expres¬ 
sion of the distribution PA,r(0 of the interspike interval, 
the best strategy to decode the input information is by 
applying the maximum likelihood estimate. The detailed 
strategy of this method was given in Zhang et al. (2009). 
Here spikes from an ensemble of IF neurons are collected 
and, based on the spike patterns, an external dynamic 
input can be accurately and reliably read out. Assuming 
that each neuron receives a common Poisson synaptic 
input (for simplicity, only excitatory input is taken into 
account), with a rate given by 1(0 = 
2 + 4(sin^(27ir) + sin^(|7tf)), for example. Fig. 2c 
depicts the decoded rate versus the input frequency for 
a decoding time window of Tw = 25 ms. Although Tw is 
very short, one can see that the estimate is accurate 
(except that it is slightly downward biased). It is clearly 
shown that within a very short time window (^25 ms), 
the spikes generated from an array of neurons contain 
enough information to decode the input information. 


Differences between IF and HH Models 

As two most commonly used neuronal models, the HH 
model and the IF model could behave quite differently 
in response to an external input. For example, these 
two models respond to correlated input in totally oppo¬ 
site ways. It has been summarized in Feng (2001) and 
Feng and Zhang (2001) that increasing the input cor¬ 
relation to an IF neuron increases the output variability 
while, in contrast, the correlation of input decreases 
rather than increases the output variability of a HH 
neuron. This says that increasing the input correlation 
will increase the signal-to-noise ratio of the efferent 
spike train in the HH model, whereas for the IF neuron, 
the signal-to-noise is decreased with the increase of the 
correlated input. It was therefore hypothesized that the 
IF models may work better in an environment of asyn¬ 
chronous inputs, while the HH model is more suitable 
working in an environment of synchronous (corre¬ 
lated) input. The difference between these two models 
can be clearly seen from Fig. 3. 

Network Model 

Neurons are usually grouped in specialized modules in 
order to perform cognitive tasks, such as visual 
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Computational Modeling of Neuronal Networks, 
Fig. 3 The coefficient of variability of efferent spike trains 
versus the input correlation and the number of inhibitory 


synapses, (a) is the IF model, (b) is for the HH model (Adapted 
from Feng and Zhang (2001)) 


recognition, perception, working memory, decision 
making, spatial navigation, etc. All these neuronal 
activities are marked by oscillatory rhythms whose 
signal properties are characterized by frequency, 
amplitude, and phase. Using EEG recording, most of 
the rhythmic activities in the brain are classified into 
different frequency bands. The best known are: delta 
(1^ Hz), theta (4-8 Hz), alpha (8-12 Hz), beta (12- 
30 Hz), and gamma (30-70 Hz) frequency bands. 
Neuronal network models have been extensively 
investigated to address the functional meaning and 
the corresponding mechanisms of brain oscillations. 
Although rhythms with different frequencies and 
amplitudes operate in different spatiotemporal scales, 
in many cases, slow brain oscillations, such as theta 
waves, coexist with fast brain oscillations such high- 
gamma waves, and there are cross-frequency modula¬ 
tions between slow and fast oscillations. This has been 
reported in both cortex and hippocampus (Buzsaki 
2006; Canolty et al. 2006; Tort et al. 2009). Among 
different types of cross-frequency interactions, nesting 
was proposed to be the most likely form of 
interactions. 


To see how neuronal oscillations with different 
rhythms work together to generate complex brain func¬ 
tions, one can consider a spiking neuronal network 
which could generate coupled theta/gamma oscilla¬ 
tions by utilizing a combination of fast- and slow- 
type GABA receptor interneurons. Each neuron 
obeys an integrate-and-fire equation (13); it receives 
AMPA and NMDA receptor-mediated currents from 
excitatory (EX) cells, GABA receptor-mediated cur¬ 
rents from fast inhibitory neurons, as well as slow 
inhibitory neurons (see Eig. 4). The gating variable s 
for AMPA and NMDA receptors is described by two 
first-order kinetics: 


J 

^ = asx{ \ - s) - s/xs, 


( 20 ) 


where tj is the presynaptic spike time. The parameters 
are set as follows: =1.0 (dimensionless), 

Xx = 0.05ms, 0 £j = l.Oms^*, Tj = 2.0ms for AMPA 
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Computational Modeling of Neuronal Networks, 
Fig. 4 Neural network model showing theta/gamma dual oscil¬ 
lations and the response of such a network to an external input, 
(a) Schematic showing three populations of neurons which are 
connected recurrently by different synaptic receptors as the basic 
functional units in the neural network model. Here, EX, INf, and 
INs represents excitatory, fast inhibitory, and slow inhibitory 
neurons. The connections, for example, NMes represents the 


connection from Ex to INs neurons mediated by NMDA 
receptor, (b) Responses to a step input stimulus of 
a randomly chosen neuron from three populations of neuron, 
(c) Top: The power contribution of different frequencies 
across the theta/gamma range. Bottom', comparison of the 
mean theta and gamma powers. Here, the time-frequency anal¬ 
ysis is applied to the averaged membrane potentials of all EX 
neurons 


receptors; and = 1.0 (dimensionless), = 2.0 ms, 
0 £s = 1.0ms^\ Ts = 80ms for NMDA receptors. The 
gating variable Sgaba for GABA receptors obeys 
a simple first-order kinetics: 

—^ ~ ~ Sgaba) — sgaba^i, 

j 

( 21 ) 

where fr indicates the time immediately before the 
spike at time tj. Here, T/ = 9.0ms,a/ = l.Oms^' for 
the fast GABA channels, and T/ = 50ms, 


a/ = 0.2ms^* for the slow GABA channels. The 
AMPA and NMDA receptor-mediated currents are 
given by: I amp a = gAMPASAMPA{V - Ve) and 

Inmda = gNMDASNMDAB{V){V - Ve), respectively, with 

B{V) = [1 -f exp(-0.062V)/3.57]^' and Ve is the reversal 
potential of excitatory inputs. The GABA receptor- 
mediated current is given by Igaba = Sgaba 
X sgaba{V — Vi) and Vi is the reversal potential of 
excitatory inputs. Here, Ve = OmV, Vi = —70 mV. 

A diagram of the spiking neural network, which 
incorporates two kinetic classes of GABA current, is 
depicted in Fig. 4a. The firing pattern of the network in 
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Computational Modeling of Neuronal Networks, 
Fig. 5 (a) Diagram of the two-pool network model, (b) Distri¬ 
bution of the reaction times for the three different models. 
Circles, two-pool model; triangles, fast model; squares, slow 
model, (c) Comparison for the firing behaviors of the one-pool 
fast model (left) and the two-pool model (right). Top, firing 
rates for all the pools for sample trials of the one-pool fast 


model. Arrows mark the time of input cues applied, and the 
vertical dashed line marks the moment where the decision was 
taken. Thick gray cun’e and thick black curve are the rates for 
specific pool A and specific pool B, respectively. Bottom, 
Rastergrams of the spiking activity for ten randomly chosen 
neurons from each population (Adapted from Smerieri et al. 
( 2010 )) 


response to a square-shaped stimulus is shown in 
Fig. 4b where excitatory neurons, which receive syn¬ 
aptic currents from slow and fast inhibitory neurons, 
are shown to exhibit the theta-nested gamma rhythm, 
and the stimulus-evoked response is clearly demon¬ 
strated. By using a Morlet wavelet in the range of 
frequency between 1 and 100 Flz, it was shown that 
the corresponding spectral component to onset of stim¬ 
ulus is highly concentrated in a low frequency band 


(around 4-8 Hz, which is referred as theta band; see the 
second trace of Fig. 4c). Besides strong powers in 
the theta band, there are weak powers in the 
gamma band (30-80 Hz). Furthermore, it was 
seen that when the stimulus is on, both theta and 
gamma powers are increased, but the magnitude of 
theta wave in the presence of stimulus is much higher 
than that of gamma wave (see the bottom trace of 
Fig. 4c). 































352 


Computational Modeling of Neuronal Networks 


Some studies reported that the coupling between 
theta and gamma may depend on the theta phase rather 
than on its amplitude (Buzsaki 2006; Rizzuto et al. 
2003); while some revealed that the magnitudes of 
both theta and gamma oscillations are involved in 
predicting the efficacy of the subsequent memory 
recall (Sederberg et al. 2003). It was also reported 
that the theta oscillation can modulate both gamma 
amplitude (Lakatos et al. 2005) and the firing of single 
neuron (Lee et al. 2005). Recently, experimental stud¬ 
ies on the frontal and temporal lobes in humans and on 
rat hippocampus during olfactory learning, revealed 
that the modulation of theta phase to gamma amplitude 
is enhanced (Canolty et al. 2006; Tort et al. 2009). 
Simultaneous local field potential and multiunit neu¬ 
ronal activity recordings in the inferotemporal cortex 
of conscious sheep further revealed that the amplitude 
of theta (but not gamma) and coupling between theta 
phase and gamma amplitude are enhanced following 
learning (Kendrick et al. 2011). The synaptic mecha¬ 
nism of the learning-altered theta/gamma interaction 
can be explained by the computational model 
described above. The computational studies suggested 
that the strengthened modulation of theta phase to 
gamma amplitude requires co-ordinately increase in 
activation of both NMDA receptors and GABAslow 
receptors (Kendrick et al. 2011). This conclusion is 
inconsistent with another modeling study on hippocam¬ 
pal population activity during sequence learning and 
where it was shown that the conductance of GABAslow 
receptor which is responsible for the generation of the 
theta rhythm rises and falls in a rhythmic fashion with 
the theta rhythm (Wallenstein and Hasselmo 1997). 

There is another similar spiking neuronal network 
that incorporates two kinetically different GABA 
receptor-mediated currents to study the role of the 
theta rhythm in decision making. To mimic decision 
making between two choices A and B, a computational 
model is generally composed of a population of excit¬ 
atory neurons and a population of inhibitory neurons 
(Fig. 5). Intuitively, the network has two attractors 
(attractor A represents decision A and attractor 
B represents decision B); both are specified by two 
pools of excitatory neurons. 

Smerieri et al. (2010) compared three versions of 
the inhibitory population: (1) only one pool of fast 
GABA receptor interneurons with a GABA time con¬ 
stant of xcABAfast = 10 ms(one-pool fast model); (2). 
two pools of interneurons, the diagram of such 


a two-pool network is depicted in Fig. 5a, where a frac¬ 
tion S (e.g., S — 0.25) of the inhibitory neurons has 
tGABA,siow = lOOmi'; (3). only one pool of slow GABA 
receptor interneurons. It was found that although the 
average time-integrated changes of conductivity of the 
three models are the same, the reaction time of the two- 
pool network is considerably reduced in decision mak¬ 
ing, compared with the previous two networks with 
only one pool of interneurons (see Fig. 5b). To see the 
difference between the bring behaviors of one-pool 
and two-pool networks more clearly, comparison of 
the reaction times of networks with only one fast pool 
of inhibitory neurons and with two pools of fast and 
slow inhibitory neurons is shown in Fig. 5c. The 
advantage of the two-pool model in the reaction time 
of decision making is clearly seen. Having a further 
comparison of the average bring rates before the deci¬ 
sion cues are applied, the two-pool model shows 
a higher bring rate, on average, for the neurons that 
win in the competition than for the neurons in the slow 
and fast one-pool model. Moreover, the average mem¬ 
brane potentials of the two-pool model are higher than 
the other two models. 

Summary 

Two classes of the single-neuron model are most 
widely used in the literature: the IF type, abstract 
model and the HH type, biophysical model. Choosing 
which type of model in research critically depends on 
the scientibc question one wants to address and the 
experimental data to hand. Ideally, one should have all 
voltage clamped data of each channel class for the cells 
in the neuronal network. It is then natural to use the 
HH-type of the model by carefully matching the model 
output with the experimental recordings. Unfortu¬ 
nately, it is very rare to have all the data available 
and one has to resort to using the IF-type of model. 

Cross-References 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 
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Synonyms 

Protein dynamics 


Introduction 

Numerous experimental and computational studies 
have demonstrated that biomolecules undergo motions 
that span a broad range of time and spatial scales; we 
adopt a rather broad notion of “motion” here, which 
includes both equilibrium fluctuations in a single state 
and structural transitions between two (or more) dis¬ 
tinct kinetic states of a system. A natural question of 
interest is what motions are essential to the biological 
function. In the case of enzymes, since the catalytic 
cycle involves substrate(s) binding, chemical catalysis, 
and product(s) release, it is logical to expect that dif¬ 
ferent motions are important at different stages. For 
substrate(s) binding and product(s) release, which can 
be slower than the chemical step and rate-limiting in 
the catalytic cycle, the required protein motion can be 
inferred by comparing crystal or Nuclear Magnetic 
Resonance (NMR) structures for the apo and inhibitor 
bound conformations. In favorable cases, the timescale 
of the structural transition can be determined using 
various experiments, whereas the kinetic bottleneck 
of the transition is more difficult to determine in gen¬ 
eral. For the chemical step, it is conceivable, especially 
in large enzyme systems or enzymes catalyzing 
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multiple-step reactions, that specific “gating motions” 
are required to bring the catalytic motifs to the prox¬ 
imity of reactive sites in the substrate, which corre¬ 
spond to transition between well-defined kinetic states 
of the enzyme-substrate complex; the timescale of 
these “gating motions” is longer than the timescale of 
the chemical step (typically > ms) and the nature of 
such “gating motions” is not always easy to character¬ 
ize experimentally at an atomic level. Finally, there are 
motions that occur at the timescale faster than that of 
the chemical step, and thermal average of these 
motions may contribute to both the activation 
free energy of the chemical step and the transmission 
coefficient for crossing the barrier of the chemical 
step. In the following, using specific examples, 
we illustrate how computational studies can comple¬ 
ment experimental studies in dissecting the nature of 
all three types of enzyme motions, which we loosely 
refer to as binding motion, gating motion, and 
fast motion in the following discussions; these ana¬ 
lyses offer new insights into factors that dictate 
the catalytic functions of enzymes. Challenges that 
need to be met with further developments of computa¬ 
tional approaches are also highlighted wherever 
appropriate. 

Summary of Fundamental Methodologies 

In all computational studies of biomolecules, two fun¬ 
damental components are the potential function and 
the sampling algorithm, which differ depending on the 
problem at hand. For studying the barrier-crossing 
event that corresponds to the chemical step, 
a potential function that allows bond formation and 
breaking is required, which can be based on either the 
combined Quantum Mechanical/Molecular Mechani¬ 
cal (QM/MM) approach (Senn and Thiel 2009) or the 
Empirical Valence Bond (EVB) formulation (Warshel 
1991); QM/MM methods are more general, although 
practical studies have to carefully balance computa¬ 
tional speed and accuracy. As for sampling, since the 
barrier crossing events are fast, appropriate techniques 
are activated dynamics (Brooks et al. 1988) and tran¬ 
sition path sampling (TPS) (Bolhuis et al. 2002). The 
former starts molecular dynamics (MD) trajectories 
from transition states (TS) and therefore requires iden¬ 
tification of the (approximate) TS first by, for example, 
umbrella sampling calculations along an assumed 


reaction coordinate. The TPS technique, by contrast, 
searches barrier crossing trajectories directly without 
any presumed reaction coordinate and transition state, 
although the computational cost is substantially higher 
than activated dynamics simulations. Both methods 
make possible the analysis of the nature and timescale 
of protein motions that actively participate in the bar¬ 
rier-crossing process for the chemical step. Alterna¬ 
tively, by comparing the average structures between 
approximate transition state and ground state ensem¬ 
bles from equilibrium simulations (e.g., umbrella sam¬ 
pling), one can also identify fast motions (i.e., 
thermally averaged over the course of the chemical 
step) that contribute to the free energy barrier (thus 
rate) of the chemical step; the precise timescale and 
causal effect of such motions, however, are not readily 
available (see Section on “Past Coupled Motions and 
Enzyme Catalysis”). 

Por studying the slower gating motions that control 
the chemical step, free energy (potential of mean force) 
simulations along relevant conformational coordinates 
are useful for dissecting the energetics and kinetic bot¬ 
tleneck of the relevant motions. Por highly complicated 
motions that cannot be adequately described by a simple 
reaction coordinate, more approximate methods such as 
biased/targeted molecular dynamics simulations have to 
be used (see Section on “Slow Gating Motions that 
Regulate Enzyme Catalysis”). These motions are 
not directly coupled to the barrier crossing of the chem¬ 
ical step, rather they dictate the activation free energy of 
the chemical step (thus gating the chemical step), thus it 
is sufficient to use a MM (i.e., nonreactive) potential 
function in the simulation. To confirm a specific motion 
is involved in gating the chemical step, however, it 
is useful to carry out QM/MM simulations with differ¬ 
ent conformations of the enzyme. 

Pinally, for the very large-scale “binding motions” 
that correspond to opening/closing of an active site or 
allosteric regulation, direct atomistic simulations 
remain challenging although effective novel methods 
are emerging (Escobedo et al. 2009; Weinan and 
Vanden-Eijden 2010). Due to the collective nature of 
such motions, useful insights can often be gleaned 
through very coarse-grained models (see Section on 
Slow “Binding Motions” of Enzymes from Coarse- 
Grained Models”), such as Go models (Clementi 
et al. 2000), which use a single bead to represent each 
amino acid and employ native-centric potential 
functions. 
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Computational Studies of Enzyme Motions, Fig. 1 An 

example of enzyme motions faster than the chemical step. The 
arrows indicate motions of residues that are coupled to the 
hydride transfer between NADPH and DHF in dihydrofolate 
reductase. Such motions contribute to the free energy barrier of 

Application Examples 

In this section, we use specific examples, some from 
our own studies, to better illustrate how diverse com¬ 
putational techniques can be used to identify and ana¬ 
lyze motions at different scales that are relevant to the 
catalytic steps of enzymes. 

Fast Coupled Motions and Enzyme Catalysis 

Much of the discussions concerning the subject of 
enzyme motion (dynamics) and catalysis were stimu¬ 
lated by experiments on enzymes that catalyze proton 
or hydride transfers; two excellent examples have 
emerged as “dynamic paradigms,” which are 
dihydrofolate reductase (DHFR) (Boehr et al. 2006; 
Bruice and Benkovic 2000) and alcohol dehydroge¬ 
nase (ADH) (Kohen et al. 1999). Among extensive 
experimental results, two very interesting observations 
are as follows: (1) In DHFR, mutations that occur in 
several loop regions remote from the active site (>20 A) 
have a significant (^10^) impact on the hydride transfer 
step. (2) In both ADH and DHFR, there are intriguing 
trends in the temperature-dependence of kinetic isotope 
effects and hydride transfer rates in different homologs 
(mesophile vs. thermophile). Both sets of data imply that 
fairly delocalized protein motions are directly coupled to 
the seemingly localized hydride transfer steps in these 
enzymes, although it was not possible to establish the 
nature of such motions in the experimental studies. 

Several research groups have carried out computa¬ 
tional analysis of DHFR and ADH and offered 
new insights. Hammes-Schiffer and coworkers 


the hydride transfer because they modulate key properties of the 
active site, such as the distance of the hydride donor (Cd) and 
acceptor (Ca) and nearby electrostatics. The figure is reproduced 
with permission from Agarwal et al. (2002) in Hammes-Schiffer 
and Benkovic (2006) 

(Hammes-Schiffer and Benkovic 2006) carried out 
EVB based simulations for both the equilibrium free 
energy profile for the hydride transfers and (hybrid 
quantum/classical) activated dynamics analysis of the 
barrier-crossing event. In DHFR, these analyses have 
revealed a network of coupled motions (see Fig. 1) that 
connect the active site with remote regions, including 
the loop that contained Glyl21, a residue that has been 
subjected to numerous mutation studies and shown to 
influence the hydride transfer step. As clarified in the 
original work, these coupled motions are in thermal 
equilibrium with the chemical step, and they lead to 
configurations that facilitate the hydride transfer 
through shorter donor-acceptor distances and favor¬ 
able electrostatic environment for the transfer of 
charge. The delocalized nature of such coupled motion, 
which has also been analyzed by Brooks and coworkers 
(Rod et al. 2003) by comparing structures from MD 
simulations of different mutants, was further demon¬ 
strated by the finding that constraining the distance of 
two far-apart (^28 A) distal residues during simula¬ 
tions perturbed the active site conformations and raised 
the hydride transfer barrier by as much as 3 kcal/mol. 

Gao and coworkers (Pu et al. 2006) used a combined 
QM/MM potential to compare hydride transfer rates in 
the hyperthermophilic DHFR (TmDHFR) and the 
mesophilic (Escherichia coli) DHFR; since TmDHFR 
is dimeric while E. coli DHFR is monomeric, calcula¬ 
tions were also carried out for a hypothetical mono¬ 
meric TmDHFR. Interestingly, the hydride transfer rate 
was found to decrease by ^2 orders of magnitude in the 
monomeric TmDHFR as compared to the dimer. 
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By analyzing active site structural features and the 
covariance matrix in different DHFR systems, the 
authors concluded that the increase in the hydride trans¬ 
fer barrier in the monomeric TmDHFR was due to 
(1) displacement of a key charged residue (Arg 28) 
near the cofactor and (2) the lack of intersubunit corre¬ 
lated motions as well as the decrease in the intrasubunit 
correlated motions relative to E. coli. 

There have been few applications of TPS to barrier 
crossings in enzymes. Schwartz and coworkers 
(Quaytman and Schwartz 2007) analyzed the transition 
state ensemble collected from TPS of lactate dehydro¬ 
genase. They observed that a series of residues behind 
the active site were involved in a fast-scale compression 
motion that brought the hydride transfer donor/acceptor 
together. This type of motion has also been observed in 
TPS analyses of two other enzymes, human purine 
nucleoside phosphorylase and chorismate mutase. 
Taken together, these TPS works and other computa¬ 
tional studies emphasized the fact that enzymes have 
compact structures and therefore even a relatively local¬ 
ized chemical step such as proton or hydride transfer is 
likely coupled to motions of distant residues, and pertur¬ 
bations in such coupled motions (e.g., by mutation) can 
lead to structural changes in the active site that ulti¬ 
mately result in variation in the rate of the chemical step. 

Slow Gating Motions That Regulate Enzyme 
Catalysis 

While the coupled motions discussed in Section on 
“Fast Coupled Motions and Enzyme Catalysis” are 
fast and therefore in equilibrium to the chemical step, 
many enzyme-substrate complexes undergo distinct 
structural transitions that are slow compared to the 
chemical step; these structural transitions can be thought 
as gating motions that only turn on chemistry at the 
appropriate stage of the functional cycle. In this context, 
the most remarkable example is perhaps molecular 
motors (Schliwa 2002), which are efficient 
nanomachines that convert energy from chemical (e.g.. 
Adenosine triphosphate (ATP) hydrolysis) to mechani¬ 
cal forms. To avoid futile cycles, it is important that 
ATP hydrolysis is controlled tightly by the conforma¬ 
tional state of the motor, making molecular motors ideal 
systems for probing how enzyme catalysis is regulated. 

The specific motor we have analyzed is myosin II, 
which is involved in muscle contraction. The two 
kinetic/conformational states of relevance here are 
the “post-rigor” and “pre-powerstroke” states of the 


motor domain, which differ in both the nucleotide 
binding domain and the converter domain (Fig. 2a) 
(Geeves and Holmes 2005); the two are separated by 
more than 40 A and rotation of the converter is prop¬ 
agated into the striking displacement of the lever arm. 
The conversion between the two states occurs on the 
^ 10 ms scale, and the hydrolysis of ATP is believed to 
occur only in the pre-powerstroke conformation. In the 
context of analyzing enzyme motion and catalysis, two 
key questions of interest are: (1) What is the transition 
pathway between these two states and what constitutes 
the kinetic bottleneck? (2) Among the motions impli¬ 
cated during the transition, which are most important 
to the activation of the ATPase activity? 

For the first question, several computational studies 
have been carried out. Fischer and coworkers (Fischer 
et al. 2005) have calculated a minimum energy path 
(MEP) that connects the two conformational states, and 
the results pointed to a two-phase transition mechanism 
that initiates from a hydrogen-bond formation near the 
active site (between Gly457 and the y-phosphate of ATP) 
and propagates sequentially through the relay helix to the 
converter domain. A set of hydrophobic interactions that 
form during the transition were proposed to stabilize the 
local unwinding of the relay helix, which ultimately leads 
to the translation/rotation of the C-terminal helix and 
converter. A recent set of milestoning calculations 
(Elber and West 2010) based largely on the MEP path¬ 
way led to an estimated timescale for the transition that is 
consistent with the experimental transition rate. Using 
a rather different targeted molecular dynamics (TMD) 
approach, Yu and coworkers (2007a, b) found that most 
rotation of the converter domain occurs in the first stage 
of the transition, while structural changes in the relay 
helix and complete closure of the active site occur at 
a later stage to stabilize the converter conformation via 
a series of hydrogen bonding interactions as well as 
hydrophobic contacts (Pig. 2b). Since there are major 
approximations in all reported computational studies so 
far, it remains unclear which mechanism is closer to 
reality. The encouraging aspect is that both TMD and 
MEP studies point to the importance of a consistent set of 
hydrophobic interactions (e.g., between Phe482, Phe487, 
Phe503, Phe506, and Phe652) and hydrogen bonding/salt 
bridge interactions (e.g., Glu497-Lys743) between the 
relay helix, active site, and the converter domain. These 
discussions highlight the need for development of novel 
techniques that allow a quantitative computational anal¬ 
ysis of slow motions in biomolecules. 
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To provide insights into the issue of ATPase acti¬ 
vation, we have carried out QM/MM simulations for 
ATP hydrolysis in not only the two x-ray structures but 
also a hybrid conformation (Yang et al. 2008) in which 
the active site in the post-rigor x-ray structure was 
closed in silico by displacing the commonly discussed 
Switch II (SwII) loop. This was meaningful to do 
because classical potential of mean force calculations 
in our group have shown that Swll closure has a rather 
flat free energy profile in the post-rigor state (Yu et al. 
2007a, b). Remarkably, QM/MM calculations found 
that ATP hydrolysis tends to have very high barriers 
even though the key residues directly in contact with 
the y phosphate have the same average configuration as 
in the pre-powerstroke state. Further analysis of the 
result found that this is because displacing SwII alone 
to close the active site in the post-rigor state leaves 
many crucial interactions unformed or even breaks 
existing interactions (Fig. 2c). As a result, residues in 
the “second coordination shell” of the y phosphate may 
adopt configurations that hamper effective ATP hydro¬ 
lysis. Therefore, the emerging picture is that the tran¬ 
sition from the post-rigor state to a structurally stable 
closed active site (which apparently is critical to effi¬ 
cient ATP hydrolysis) relies on not only the displace¬ 
ment of SwII but also more extensive structural 
rearrangements in the nearby region. In other words, 
structural transitions remote from the active site can 
play an active role in regulating the hydrolysis of ATP, 
rather than passively responding to structural changes 
in the active site; this is likely a general feature shared 
among biomolecules whose function relies on mecha- 
nochemical coupling between distant sites. 

Slow "Binding Motions" of Enzymes from Coarse- 
Grained Models 

Large-scale structural transitions coupled to the bind¬ 
ing (dissociation) of substrate (product) can be 
rate-limiting in many enzyme systems. Therefore, to 
understand motions that are essential to enzyme catal¬ 
ysis, it is equally important to elucidate the mechanism 
of “binding motions” that do not directly influence 
chemistry. Along this line, adenylate kinase (AK) has 
emerged as a prototypical system and its chemically 
rate-limiting open/closed (0/C) transition has been 
subjected to numerous experimental (Henzler- 
Wildman et al. 2007) and computational analyses 
(Whitford et al. 2007); still, factors that dictate the 
0/C transition rate remain elusive. 


Since the timescale of the transition is millisecond 
and the transition process is complex, atomistic simu¬ 
lations (Arora and Brooks 2007) have been limited to 
the elucidation of the most salient features of the tran¬ 
sition (e.g., intrinsic flexibility of the LID domain, 
local flexibility of hinges, and the role of ligands in 
stabilizing the closed conformation) and coarse¬ 
grained models have been found valuable for probing 
the transition mechanism. For example, we have car¬ 
ried out detailed analysis of the TS ensemble for the 
0/C transition using a double-well Go models with and 
without pseudo-contacts added to the closed potential 
to simulate ligand binding (Daily et al. 2010). By 
simultaneously characterizing the contributions of 
rigid-body (Cartesian), backbone dihedral, and contact 
breaking/formation motions to the TS structure and 
energetics, we were able to predict specific residues 
and contacts that influence the 0/C transition rate. For 
example, we found that backbone fluctuations are 
reduced in the O/C transition in parts of all three 
domains. Among these “quenching” residues, most in 
the CORE domain, especially residues 11-13, are rigid- 
ified in the TS of the ligated simulation and thus slow 
the O/C transition by entropically raising the free energy 
of the TS relative to the native states, while residues 
42^14 in the NMP domain are flexible in the TS and 
thus facilitate the O/C transition (Fig. 3a). In contact 
space, in both unligated and ligated simulations, one 
nucleus of closed-state contacts includes parts of the 
NMP and CORE domains. These results allowed us to 
predict mutations that will perturb the opening and/or 
closing transition rates by changing the entropy of dihe¬ 
drals and/or the enthalpy of contacts. Moreover, we 
observe that LID closure precedes NMP closure in the 
ligated simulation, consistent with other coarse-grained 
models of the AK transition. However, NMP-first clo¬ 
sure is preferred in the unligated simulation (Fig. 3b), 
which highlights that ligand binding cannot only stabi¬ 
lize the closed conformation but also alter the kinetic 
pathway of closure. In this case, the driving factor for 
the pathway switch is likely enthalpic, since ligand 
binding adds many more contacts to the CORE-LID 
interface than to the CORE-NMP interface in the TS. 

Although these Coarse-grained (CG) simulations 
are informative and can stimulate new experiments, it 
remains a fundamental challenge to validate a CG 
model for complex biomolecules. For example, param¬ 
eters in our model were calibrated to make Root Mean 
Square Deviation (RMSD) from the CG simulations of 
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Computational Studies of Enzyme Motions, Fig. 3 Analy¬ 
sis of the large-scale “binding” motion in adenylate kinase, (a) 
Fractional progress of folding in the TS ensemble relative to the 
O and C ensembles. For residues that differ substantially in 
flexibility (characterized by in Daily et al. (2010)), the 
flexibility in the TS ensemble is contoured on a rainbow scale 
from red (0) to purple (1); the results highlight that different 


regions contribute differently to the entropic barrier of the tran¬ 
sition. Labels F11-H8 indicate the respective central residues of 
the eight hinges identified in Henzler-Wildman et al. (2007). (b) 
Potentials of mean force that contrast the free energy landscape 
for NMP versus LID motions in ligated and apo simulations. The 
circles indicate the location of representative TS ensembles 
collected during the simulations 


individual O and C states fit atomistic MD results. It is 
possible, however, the atomistic MD results were 
biased by the short (^50 ns) timescale and therefore 
the system is inherently more flexible that what the 
current model predicts, as hinted by hydrogen/deute¬ 
rium exchange (H/D exchange) data from a recent 
NMR study (Rundqvist et al. 2009). Establishing 
proper benchmarks for CG models and improving 
their robustness remains an active and fruitful area 
for theoretical research. 


Summary 


vary although one may argue that ultimately 
a complete atomistic model with predictive power is 
the “holy grail.” At the present stage, the most signif¬ 
icant value of computational models, especially those 
implicating slow (^ms) and large-scale motions, is 
perhaps to stimulate clearly defined experimental stud¬ 
ies to test new hypothesis. 

Cross-References 

► Dihydrofolate Reductase - Computational Studies 

► Molecular Dynamics Simulations of Lipids 

► Myosin: Fundamental Properties and Structure 


Enzyme catalysis is a multistep process that involves 
complex interplay of chemistry and conformational 
motions that span many temporal and spatial scales. 
To clearly identify motions that dictate the catalytic 
function of enzymes, therefore, it is imperative to 
develop models that go beyond atomistic models for 
the enzyme active site. Depending on the specific step 
of interest, the appropriate resolution of the model may 
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Beer-Lambert Law 
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Definition 

The Beer-Lambert law for the absorption. A, of light by 
a sample of concentration C is 

A = sC£ (1) 

where i is the length of the sample through which the 
light passes and e is known as the (wavelength depen¬ 
dent) extinction coefficient. If C is measured 
in mol dm~^ and I is measured in cm, then e has 
units of moU* dm^ cm~*. If e and t are known, 
the absorbance at a given wavelength can then be 
used to determine concentration. Ideally, one uses 
a point of maximum absorbance as the absorbance 
signal will be less affected by small errors in wave¬ 
length calibration. However, sometimes this is not 
possible. 

For DNA, one usually uses the absorbance signal at 
or near 260 nm (wherever the maximum is for a partic¬ 
ular DNA). For DNA, the linear relationship between 
concentration and absorbance breaks down when the 
absorbance of a 1-cm path length solution exceeds 
^1.5-2 absorbance units, presumably due to DNA- 
DNA interactions. Some DNA extinction coefficients 
are given in Table 1. 

Protein absorbances in the near-UV region (above 
250 nm) will usually be dominated by tryptophan 
residues if present and will have a maximum at 
280 nm. Tyrosines also absorb at 280 nm. Absorbance 
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Concentration Determination Using Beer-Lambert Law, 
Table 1 Long wavelength absorbance maxima and extinction 
coefficients for some DNAs. Calf thymus DNA is ~60% A-T. A 
= adenine, T = thymine, G = guanine, and C = cytosine 


DNA 

Wavelength of 
Amax / nm 

Emax / mol ‘ 

dm^ cm 

calf thymus 

260 

6,600 

poly[d(A-T) 2 ] 

262 

6,600 

poly(dA).poly(dT) 

260 

6,000 

poly[d(G-C) 2 ] 

254 

8,400 

poly(dG).poly(dC) 

253 

7,400 


at 280 nm may therefore be used to give an estimate of 
protein concentrations. At 280 nm a 1 mg/cm^ protein 
solution in a 1 -cm path length cell often has an absor¬ 
bance of ^1 absorbance unit. This is because many 
proteins have a similar percentage of tryptophans and 
tyrosines. However, the A 280 (1 mg/cm^) can vary from 
0 to 1.8. For example, the A 280 (1 mg/cm^) for pure 
bovine serum albumin is usually ^0.66. In cases 
where the protein amino acid content and molecular 
weight are known, a reasonably accurate estimate of 
£ for a given protein may be calculated using the above 
£ values for the residues and then the Beer-Lambert 
law applied. Programs written to do this use 
values such as £279 nm "^5,700 mol”' dm^ cm”* 
for tryptophan; £274 nm "^1,400 mol”* dm^ cm”* for 
tyrosine; and a value of either £ ^0 or 100 - 
300 mol”* dm^ cm”* for half the cysteines present 
(i.e., minimum or maximum number of disulfide 
bonds). 

Protein absorbances in the far UV can also be used 
to determine protein concentration despite not being 
maximum signals and being very prone to error from 
buffer contributions and dirt on cuvettes. According to 
Kelly et al. 2005, the absorbance of a 1 mg/mL protein 
in a 1 -cm path length cell is ^1 at 280 nm (depending 
on the aromatic chromophores present), 5 at 230 nm, 
15 at 215 nm, 45 at 200 nm, and 68 at 193 nm. 

The Beer-Lambert method for concentration deter¬ 
mination of nucleic acids and proteins is based on the 
assumption that the samples are pure. Nucleic acids, if 
present, will interfere with the protein concentration 
determination because they also absorb at 280 nm and 
conversely. In these cases, the following formula per¬ 
mits a rough estimate of protein concentration in the 
presence of nucleic acids (for a 1 -cm path length 
cuvette): 


1.55 X A 280 — 0.76 X A 260 = mg protein/cm^ (2) 

It is also common practice to report the A 280 M 260 
ratio as an indication of purity for a given sample 
since nucleic acid absorbances are maximum 
near 260 nm and protein aromatic absorbances near 
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Synonyms 

Confocal microscopy 

Definition 

Confocal laser scanning fluorescence microscopy is 
a technique, able to operate with other contrast 
mechanisms such as reflection and transmission, that 
utilizes pinholes at the illumination and detection 
planes to optically select information toward imaging. 
The detection pinhole selects the sample-emitted fluo¬ 
rescence so that only in-focus emission reaches the 
detector, rejecting the majority of the out-of-focus 
light. 

Basic Characteristics 

The held of view of the wide-field optical microscope 
is influenced by the geometric optics and by the ability 
to make aberration-free optics over a large field 
of view. If a scanning arrangement is set up, the objec¬ 
tive can be used over a continuous series of small fields 
covering a comparatively large region. The concept 
has been capitalized upon in the confocal scanning 
microscope coupled with other important advantages. 
Confocal microscopy’s main feature is that only what 
is in the geometrical focus of the lens is detected, and 
anything out of focus is theoretically discarded during 
the image formation process. This is achieved by 
focusing the light source, usually a laser, to a point 
and detecting the signal through an aperture, i.e., pin¬ 
hole, properly placed along the optical pathway. Since 


the pinhole allows only light from the focused point to 
travel to the detector, it is particularly useful for three- 
dimensional selection of the signal. This endows the 
confocal laser scarming fluorescence microscope with 
three-dimensional optical sectioning ability, i.e., the 
possibility of the elucidation of fine and three- 
dimensional biological structures within biological 
specimens at improved resolution. Practically, in 
a confocal laser scanning microscope (CLSM), the 
focal point of a laser generated by a lens is scanned 
across a specimen to build up a two-dimensional (2D) 
optical section. Three-dimensional (3D) images can be 
reconstmcted by capturing a series of 2D images at 
different focal depths in the specimen. Usually 2D 
scanning is known as x-y scanning while the third 
dimension is referred to the z-axis, i.e., the beam prop¬ 
agation axis is also named as a z-series. Argon and 
krypton/argon lasers are commonly used, and when 
using two-photon excitation for priming fluorescence 
titanium-sapphire laser sources are employed. A range 
of spectrally separated excitation beam lines and emis¬ 
sion wavelengths can thus be utilized to achieve 
multicolor or multichannel imaging. Scanning rates 
can be comparatively high and in some modem sys¬ 
tems an image frame is scanned up to 50-100 frames 
per second at the expense of the number of photons 
collected. This means that the extension to the fourth 
dimension, i.e., time, is hampered by the scanning rate. 
However, since the pixel resident or dwell time is in the 
microsecond range, a sequence of images showing 
how a specimen changes over time in three dimensions 
can be assembled. Such a method is suitable for live 
cell imaging. 

Minsky, in 1957, invented a confocal microscope 
identical with the concept later developed extensively 
by Egger and Davidovits at Yale, by Sheppard and 
Wilson at Oxford, and by Brakenhoff and colleagues 
in Amsterdam. Petran of the Charles University in 
Prague, Czechoslovakia, also having a collaboration 
with Egger and Galambos at Yale University, built 
a pocket-size pinhole scanning microscope and 
reported unstained images of brain and ganglion cells 
using reflected light. Many intermediate steps, theoret¬ 
ical and experimental, were carried out from the 1960s 
to the 1980s. However, two key applications of 
confocal microscopy to cell biology came from 
White, Amos, and Fordham, working at the Medical 
Research Council laboratory in Cambridge, UK, who 
compared images of various cells and tissues - from 
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the endoplasmic reticulum of plasmacytoma cells to 
the mitotic spindles of worm embryos, and from van 
Meer, Stelzer, Wijnaendts-van Resandt working in Kai 
Simon’s laboratory at the European Molecular Biology 
Laboratory in Heidelberg, Germany, who studied the 
transport of newly synthesized sphingolipids in epithe¬ 
lial cells. In this way, the advantages of confocal 
microscopy for analyzing subcellular processes 
became clear. The advent of affordable computers 
and laser systems and the development of digital 
image processing software pushed the CLSM to 
become available in an increasing number of laborato¬ 
ries and applied to biological and material specimens. 
Nowadays confocal laser scanning microscopes with 
incredibly high performances are available. 

Now, when thinking of conventional wide-held 
microscopes, one hgures out some specimen entirely 
bathed in the radiation from the light source, viewed 
directly through any capture device (eyes or charge- 
coupled device (CCD) camera, for example). This 
means that specimens are impinged by light continu¬ 
ously transmitting information from in- and out-of- 
focus light regions that, due to overlapping, are wors¬ 
ening axial resolution, and producing a typical blurring 
in the recorded images that, coupled with the light 
diffraction consequences, limits the ability of the 
instrument to distinguish hne details within the sam¬ 
ple. On the contrary to this, the image formation in any 
confocal setup is intrinsically “clean” plane by plane 
while one is navigating with the point-like shaped 
beam into the full three-dimensional specimen. 

Summarizing, such a confocal approach is achieved 
via two principal mechanisms that are at the basis of 
a point-to-point scanning of the selected plane of focus 
within the sample; 

1. Incident light is focused to a spot (much smaller 
than the usual field of view) within the specimen 
through a very small aperture (pinhole). The 
benefits of such a focusing technique lie on the 
possibility of limiting the excitation of fluorescent 
dyes above and below the plane of focus. 

2. Fluorescence emitted from regions above and 
below the current focal plane is physically rejected 
by means of a second pinhole (or the same one, 
depending on the architecture of the system) having 
a diameter/size that depends on the position along 
the optical pathway. This determines the Airy disks 
which are rejected; usually it is built to allow only 
the first Airy disk to reach the detection unit. 


Moreover, since the confocal optical situation is 
comparable to having two lenses working on the very 
same focus, the optical resolution is improved by 
a factor V2 along the spatial directions x-y-z under 
optimal alignment. Sometimes, when the fluorescent 
signal is inherently poor, the pinhole can be enlarged in 
order to get more emitted photons point by point. The 
spatial selection of the region responsible for the point- 
by-point signature does not mean that fluorescent mol¬ 
ecules are excited only in a small volume. Unfortu¬ 
nately all the molecules in a double cone geometry 
aligning the focus position are excited and only 
a portion of the emission is collected. In terms of 
photophysics of the process this means that, in contrast 
with the two-photon excitation case, overall 
photobleaching is not reduced. The simplified optical 
setup is reported in Fig. 1 and the excitation/emission 
figure is sketched in Fig. 2. 

To acquire an image, the excitation light has to be 
fully delivered to each point or to selected points of the 
specimen and the emission is generally detected by 
a photomultiplier tube (PMT) or/and avalanche 
photodiode (APD) and displayed on a screen. 

A different possible approach to forming confocal 
images, instead of using raster scanning, consists of 
employing a multi-pinhole Nipkow spinning disk, 
which is a disk containing multiple sets of spirally 
arranged pinholes placed in the image plane of the 
objective lens. A large parallel beam of light is then 
shone on a particular region of the “holed” disk, and the 
lights passing through the illuminated pinholes are 
focused by the objective lens straight onto the 
specimen. When rotating the disk at a fast rate, 
fluorescence can be primed several hundred times per 
second. So, emitted light is collected and imaged typ¬ 
ically by a high-resolution and high quantum efficiency 
CCD camera. The main advantage of the Nipkow 
spinning disk approach with respect to the previous 
one is an improvement of time resolution without 
compromising resolution at the expense of some addi¬ 
tional aberrations increasing their influence as function 
of thickness and scattering properties of the specimen. 

Now, the degree of confocality is a function of the 
pinhole selectivity in the discrimination of focused 
light rays from stray ones at the cost of a lower light 
throughput, which makes things difficult when dealing 
with particularly dim samples. Axial resolution and 
optical sectioning thickness depend on several factors 
such as the numerical aperture (NA) of the objective 
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Confocal Laser Scanning Fluorescence Microscopy, 
Fig. 1 Confocal optical pathways. An illumination and 
a detection pinhole are placed in the optical pathway. The detec¬ 
tion pinhole - the mask - is placed in front of the detector at 
a plane that is conjugate to the in-focus or “j” plane, such that the 
illumination spot and the pinhole aperture are simultaneously 
focused at the same specimen volume. This coincidence of the 
illumination and detected volume is responsible for confocality. 
The illumination pinhole allows to perform point-like scanning 
(Taken from open access http://www.biomedical-engineering- 
online. com/content/5/1/36) 

lens, the wavelength of the excitation/emission light, 
the pinhole size, the refractive index of components 
along the light path, and last but not least the overall 
alignment of the instrument. Theoretical models for 
describing the properties of an optical system like the 
optical microscope in its different configurations 
mainly based on a focusing lens need some assump¬ 
tions to be made to simplify computation. It is reason¬ 
able, at a first glance, to use a linear space invariant 
(LSI) model to get important insights and develop 
suitable mathematical tools for the analysis of most 
concrete situations including the possibility of compu¬ 
tationally reconstructing the objects producing the 
observed images. This process, mostly used for con¬ 
ventional microscopy, is known as the image 



Confocal Laser Scanning Fluorescence Microscopy, 
Fig. 2 Confocal fluorescence excitation/emission distribution. 
The emission process, in green, under blue IPE excitation is 
broadened in the whole double cone excitation shape within the 
analyzed cell (Taken from open access http://www.biomedical- 
engineering-online.com/content/5/1/36) 

deconvolution process. In the confocal and the 
two-photon excitation approaches it is facilitated 
since contributions of undesired signals are removed 
by means of the image formation process used 
(pinhole rejection or spatial confinement of nonlinear 
excitation). Now, within a linear spatial invariant 
hypothesis, the image formation system can be 
completely described in terms of an impulse response. 
This can be simulated utilizing architectural informa¬ 
tion or can be measured finding a suitable object 
capable of being experimentally equivalent to a Dirac 
function. Such an object is usually made by 
a fluorescent sphere having subresolved dimensions, 
i.e., <100 nm in diameter for high NA objective-based 
systems. The resulting signature is known as the point 
spread function (PSF) of the microscope. It can be 
demonstrated that its full width half maximum 
(FWHM) extent is an excellent measure of the 
resolution performance of an optical microscope. 
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Confocal Laser Scanning Fluorescence Microscopy, 
Fig. 3 Point-like emitter optical response. From left to right: 
calculated x-y (above) and r-z (below) intensity distributions, 
in logarithmic scale, for a point-like source imaged by means of 
wide-held, 2PE, and confocal microscopy. Both 2PE and confo¬ 
cal shapes exhibit a better signal to noise ratio than 
wide-held case. 2PE distribution is larger due to the fact that 
a wavelength twice than in the wide-held and confocal case is 

Figure 3 shows a comparison of the conventional, 
confocal and two-photon excitation (2PE) point spread 
functions. 

Now, there is a physical dependence between the 
pinhole size and the PSF: The more the pinhole size is 
increased, the more the confocal microscope’s 
response tends to fit the conventional one. 

As noted before, this means that in the case of dim 
or highly photosensitive specimens, some compromise 
should be found between the resolution and the amount 
of the collected photons, according to the kind of 
analysis one is going to perform. Moreover, the PSF 
is obviously dependent on all other physical parame¬ 
ters such as the sample immersion medium refractive 
index and turbidity, the degree of homogeneity of the 
sample, and the photochemical properties of the 


responsible for the intensity distribution. Such intensity distri¬ 
butions are also known as point spread functions of the related 
microscopes. Optical conditions: excitation wavelengths are 
488 nm and 900 nm for IPE and 2PE, respectively; emission 
wavelength is 520 nm; numerical aperture is 1.3 for an oil 
immersion objective with oil refractive index value set at 1.515 
(Taken from open access http://www.biomedical-engineering- 
online. com/content/5 /1 /36) 

fluorescent labels. This is the reason why the develop¬ 
ment of extremely rigorous (optically speaking) and 
complicated computations often leads to poorly appli¬ 
cable results in practice because of the practical con¬ 
ditions. Web-tools are currently available at http:// 
www.svi.nl/ and http://www.powermicroscope.org. 

In fact, one of the most meaningful PSF-dependences 
is the one related to the refractive index mismatch between 
the objective immersion medium and the specimen. 

The system resolution is still worse along the opti¬ 
cal axis and is strictly dependent on the degree of 
confocality (pinhole size). Again, pinhole size is 
uniquely architecture dependent. 

Variation of the lateral FWHM, with respect to the 
focusing depth, swings from 6% (oil-immersed PSF) to 
48% (air-immersed PSF), whereas the axial FWHM 
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varies up to 130% (air-immersed PSF). This is in 
contrast with the utilization of a LSI approach that, 
despite this, is an excellent starting point for any quan¬ 
titative analysis. 

In terms of applications, the 3-D ability of the 
confocal scheme coupled to the improvement in reso¬ 
lution and the natural properties of fluorescence optical 
microscopy to allow imaging of living samples moves 
the imaging scenario in a four-dimensional world 
(x-y-z-t), where other dimensions can be added and 
other mechanisms of fluorescence excitation can be 
considered, such as two-photon excitation. 

It is difficult to report exhaustively the applications 
of confocal microscopy in Biophysics. Most common 
applications are related to producing optical slices of 
transparent fluorescent specimens for 3-D reconstruc¬ 
tion, stereo image production and four-dimensional 
imaging; tracing specific molecules, cells, or structures 
through organs and tissues; determining the cellular 
localization of ions, RNA, DNA, proteins, cytoskeletal 
components, and organelles. Confocal microscopy can 
be efficiently coupled to approaches like FRAP (fluo¬ 
rescence recovery after photobleaching), FRET 
(Fortser fluorescence resonance energy transfer), 
FLIM (fluorescence life-time imaging), and FCS (fluo¬ 
rescence correlation spectroscopy). 
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Synonyms 

Rapid kinetic methods 


Definition 

A method for studying rapid reactions on the sub¬ 
millisecond timescale under flow conditions. In con¬ 
tinuous flow, the progress of the reaction is measured 
from the distance travelled by the reaction mixture 
along an observation flow channel. The method is 
widely applicable to processes in solution that can be 
initiated through rapid mixing of reagents. 

Historical Perspective of Continuous Flow 

The very beginnings of continuous flow can be traced 
back to the early twentieth century when a simple 
apparatus was utilized to follow the mixing of sodium 
hydroxide and hydrochloric acid (Hartridge and 
Roughton 1923). In this instrument, solutions deliv¬ 
ered by six narrow jet inlets came together at a high 
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velocity within a restricted space and passed down 
a wider observation tube. Though primitive, the instru¬ 
ment allowed the first reliable observation to be made 
in one-hundredth of a second after the reaction com¬ 
menced, achieving a dead time comparable to many 
modern ► stopped-flow instruments. The only disad¬ 
vantage of the instrument was its size and, consequently, 
sample consumption. With observation channel more 
than 6 mm wide and 300 cm long, and a minimum flow 
rate required to generate turbulent flow exceeding 4 m/s, 
the sample volume for a single measurement was 
a staggering 7.5 L. Not surprisingly, applications were 
limited to basic chemistry with cheap, abundant reagents. 

In 1930s and 1940s, similar instruments were 
constructed, utilizing the same multiple-inlet principle. 
Considerably smaller, those instruments provided bet¬ 
ter sample economy, which combined with advances 
in the fields of optics and electrics could be used to 
monitor processes such as oxygen binding and release 
in hemoglobin (DuBois 1941). Still, their detection 
capabilities were very limited, relying on measuring 
changes in sample absorbance by photocells coupled 
with galvanometers and recording on photosensitive 
paper. Those instruments were limited to reactions that 
could be monitored through absorbance changes, and 
this coupled with rather high inaccuracy of detection 
narrowed their already limited applications. 

The next phase in continuous-flow development is 
an era of free-jet mixers, where solutions mixed 
through high-velocity flows were sprayed out of the 
mixing tip. These small, very robust devices were 
much more suitable for chemistry and biological reac¬ 
tions, yet had limited detection capabilities. Typical 
examples of such instruments are those described 
by Kletenik (1963) and Moskovitz (Moskowitz and 
Bowman 1966). Moskovitz’s jet mixer was made by 
fusing a bundle of thin-wall glass tubules into a larger 
outer tube. The bundle was then pulled to a tip few 
millimeters in diameter where the two fluids were 
mixed (Fig. la). One of the solutions reached the tip 
by flowing inside the tubules and the other through the 
interstices. Mixed solutions were then sprayed from 
the tip in a narrow stream which could be monitored by 
a number of spectroscopic techniques, allowing the 
progress of the reaction to be followed. By altering 
the number of capillaries, their dimensions and sizes, 
solutions with different physical properties could be 
mixed and parameters such as pressure and flow rate 
modified. The most effective mixer described by 


Moskovitz allowed for flow rates up to 7.2 m/s and 
dead times as short as 28 ps. Due to its small size, the 
mixer consumed relatively small amounts of sample, 
allowing for multiple repeat experiments. 

Over the next two decades, several attempts were 
made to further improve the mixing efficiency of cap¬ 
illary mixers, the most successful being the ball mixer 
(Regenfuss et al. 1985). Its basic concept is very sim¬ 
ilar to multi-capillary mixer, with the addition of 
a small glass or metal sphere at the base of the mixing 
tip. Solutions exiting the capillaries were forced to flow 
around the sphere, resulting in much higher turbulent 
flow and consequently improved mixing. This reduced 
the dead time to only 10 ps, by far the shortest dead 
time for any continuous-flow instrument reported to 
date. However, that was achieved with an inlet velocity 
of 10 m/s, resulting in increased sample consumption. 

Obstructing the flow of solutions with a sphere had 
its drawbacks, the most serious being the tendency of 
the mixer to clog, despite extensive filtering of the 
solutions. Coupled with the fact that the removal of 
the sphere for cleaning was impossible, the very mod¬ 
ification that improved the efficiency reduced the life 
span of the instrument significantly, particularly if 
viscous solutions were used. 

A simplified design was then made, in which the 
multiple inner capillaries were replaced by a single 
inner tube positioned centrally within the outer tube 
(Fig. lb). Solutions arriving at the tip through inner and 
outer capillaries were mixed around a platinum sphere. 
This simplified ball mixer only achieved a dead time of 
40 ps but its simpler design made its legacy into later 
generations of continuous flow. 

Though mechanically robust, jet mixers were far 
from perfect. Mixed solutions travelling at high linear 
velocities were sprayed from the tip, making it difficult 
to follow the reaction due to the inherent instability of 
a jet, its poor definition, and multiple scattering arti¬ 
facts. This issue was solved almost three decades later, 
in the late 1990s, when a channel (flow cell) was 
introduced to constrain the flow, allowing for the reac¬ 
tion to be captured through imaging with newly intro¬ 
duced charge coupled devices (CCDs) (Shastry et al. 
1998). The addition of a flow cell and CCD allowed for 
more reliable data to be collected, making continuous 
flow a desirable method for the investigation of very 
rapid reactions, such as early events of protein folding 
(Apetri et al. 2006; Shastry et al. 1998; Kuwata et al. 
2001) and enzyme kinetics (Pinheiro et al. 2001). 
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Continuous Flow, 

Fig. 1 Simplified diagrams of 
three main types of mixer 
designs employed in 
continuous flow, (a) multi¬ 
capillary jet mixer; (b) ball 
mixer coupled to flow cell; and 
(c) T-mixer with flow cell 
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Despite the simplicity of ball mixers, their 
manufacturing and assembly relied on ingenious 
handcrafting and suffered from low reproducibility. 
These limitations coupled with still unsolved tendency 
to clog led to research into alternative designs that 
could be mass produced, robust, and offering high 
degree of reproducibility. By far the easiest way of 
manufacturing large quantities of simple, 
nonmechanical components (such as ink jets, laser 
diodes, or semiconductors) is microfabrication, which 
was incorporated into the next generation of continu¬ 
ous-flow mixers. Microfabricated mixers produced 
from chemically inert, durable materials are an attrac¬ 
tive alternative to handmade devices. However, 
microfabrication is essentially limited to two- 
dimensional objects, making both ball mixers and 
multi-capillary mixers difficult to manufacture. 
Thus, a simplified mixer design which could be 
microfabricated in a single plane was produced (Bilsel 
et al. 2005). It followed the T-mixer geometry widely 
used in stopped-flow instruments (Fig. Ic), where two 
delivery channels and an observation channel 
machined in a thin sheet of chemically inert polymer 
are sandwiched between two layers of glass. Although 
dead times achieved by this mixer were comparable to 
those of ball mixers, its unique planar geometry proved 
challenging for detection, requiring a laser source for 
observable spectroscopic changes. The cost associated 
with a dedicated laser proved to be a limiting factor and 
such continuous-flow designs were only limited to 
a few laboratories (Bilsel et al. 2005; Matsumoto 
et al. 2007). Alternative designs using more conven¬ 
tional light sources couple the T-mixer geometry for 
the delivery and mixing of reagents with a quartz flow 


cell for the observation in a conventional three- 
dimensional arrangement (Masca et al. 2006). 

A Typical Continuous-Flow Instrument 

A continuous-flow instrument requires a delivery sys¬ 
tem for reagents, a mixer coupled to an observation 
channel, as well as an appropriate light source and 
a detection system (Fig. 2). Solutions are delivered to 
the mixer from at least two separate reservoirs (usually 
syringes) driven preferably by a stepper-motor device 
(this offers more stable flow velocities than those gen¬ 
erated by pressure-driven devices). Upon reaching 
the mixer, solutions are mixed as rapidly and as effi¬ 
ciently as possible (starting the reaction) and enter 
the observation channel where reaction progresses 
as it travels down the channel. The reaction is then 
measured through an appropriate spectroscopic 
method. The most common observation methods 
widely used in biological and chemical applications 
rely on absorbance or ► fluorescence changes during 
the reaction. 

A typical layout of a continuous-flow instrument 
assembled for reactions detected through fluorescence 
changes is shown in Fig. 2. In this configuration, a light 
source, typically a strong mercury or mercury-xenon 
arc lamp, is used and specific wavelengths selected 
with a monochromator. 

Fluorescence from the sample is collected through 
a set of lenses and optical filters on the sensor of a CCD 
camera. The output of a continuous-flow instrument is 
an image of the reaction progressing along the obser¬ 
vation channel (Fig. 3a). 
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Continuous Flow, Fig. 2 Schematic diagram of a continuous- 
flow instrument which uses fluorescenee measurements to follow 
the reaction. A stepper motor drive (a) delivers reagents from 
solution reservoirs (b) into the mixer (c) where reaction is initi¬ 
ated. A light source (d) provides illumination selected by 


a monochromator (e). Focusing lenses (f) focus the light exiting 
the monochromator into a narrow band aligned with the observa¬ 
tion channel inside the flow cell (g). As the reaction progresses, 
fluorescence emitted from the flow cell is gathered by focusing 
lenses and filters (h) before reaching the CCD camera (i) 


Continuous Flow, 

Fig. 3 Typical raw and 
processed data from 
a continuous-flow 
measurement, (a) Image of 
a reaction showing decrease in 
fluorescence as reaction 
progresses; (b) fluorescence 
intensity profile of the 
reaction; and (c) a kinetic trace 
of the reaction calculated 
using Eq. 1 
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From Pixels to Kinetics 

The raw data produced by the CCD camera in the form 
of images are of little value and require processing in 
order to yield useful information, such as kinetic 
parameters (rates and amplitudes of the reaction). 
A typical CCD image for a reaction where fluorescence 
decreases as the reaction progresses is shown is Fig. 3a. 
This image is processed by averaging the fluorescence 
intensity of each column of pixels between the edges of 
the flow channel to produce a typical intensity profile 
for the reaction (Fig. 3b). Finally, a kinetic trace (reac¬ 
tion on a timescale) is derived from the fluorescence 
intensity profile using Eq. 1 : 


where A is the area of the cross section of the flow 
channel (mm^), L is the length of the channel captured 
by the CCD (mm), T is the flow rate in mLs~*, and At is 
the time interval (s) over which the reaction was 
observed. 

Applications 

Continuous flow is widely applicable to reactions in 
solution that occur in a sub-millisecond timescale and 
complements applications which are not fully resolved 
by stopped flow. Although not commercially available, 
continuous flow has proved to be an effective method 
for the study of early events in >- protein folding 
(Apetri et al. 2006; Chan et al. 1997; Kuwata et al. 
2001) and ► enzyme kinetics (Pinheiro et al. 2001). 

Summary 

The concept of continuous flow has changed little since 
its invention almost a century ago. With the basic 
principles remaining the same, the major changes in 
continuous flow revolve around progressive mixer 
designs and improved detection methods. Efficient 
multi-capillary free-jet mixers were already available 
in the 1960s, achieving dead times of 30 ps. Eree-jet 
ball mixers in the 1980s achieved the best dead time of 
10 ps and more recent assemblies coupling a flow cell 
to ball mixers or microfabricated T-mixers have dead 


times below 45 ps. By far the biggest progress between 
early and current continuous-flow instruments lies in 
optical components and detection methods used, from 
earliest sensors which measured absorbance with gal¬ 
vanometer-coupled photocells and photosensitive 
paper to highly sensitive CCD cameras available 
today capable of detecting all wavelengths of light, 
from deep UV to IR. 

Although mixer designs evolved and materials used 
for manufacturing have changed, all mixers rely on 
generating turbulent flow for rapid mixing of the 
reagents. Mixing based on turbulent flow limits 
continuous flow to dead times of 10-20 ps, as demon¬ 
strated by the various mixers over the last 50 years. Until 
a radical new method of mixing that does not involve 
turbulent flow is invented, the efficiency of mixers will 
be limited to dead times achieved throughout history. 

Cross-References 

► Electro-optics and Macromolecular Hydrodynamics 

► Fluorescence 

► Infrared Spectroscopy of Protein Dynamics: 

Ultrafast Kinetics 

► Kinetics: Overview 

► Stopped-Flow Techniques 

► Transient-State Kinetic Methods 
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ies, Fig. 1 A schematic representation of the spliceosome 
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Introduction 

In eukaryotes, DNA is first transcribed into pre- 
mRNAs which often contain introns. The removal of 
introns (pre-mRNA splicing) is an essential step in the 
gene-expression pathway of all eukaryotes. Alterna¬ 
tive splicing greatly enhances the gene coding capacity 
and is an important approach for eukaryotic gene reg¬ 
ulation. A typical intron has a conserved 5' splice site 
(5' ss), a Branch Point Sequence (BPS) followed 
by a Polypyrimidine Tract (PYT), and a 3' splice site 


(3' ss) (reviewed in Burge et al. (1999)). Introns are 
removed through two transesterification reactions. In 
the first reaction, the 2' hydroxyl group of the critical 
adenosine residue in the BPS attacks the phosphate 
group at the 5' ss, generating a lariat intermediate. In 
the second reaction, the newly freed 3' hydroxyl group 
from the 5' exon attacks the phosphate group at the 
3' ss, releasing the lariat and ligating the two exons. 

Pre-mRNA splicing is catalyzed by the 
spliceosome, a huge RNA/protein complex. The 
spliceosome contains 5 small nuclear RNAs (Ul, U2, 
U4, U5, and U6 snRNAs), which form 5 snRNPs with 
their associated proteins, and over 100 protein splicing 
factors. Spliceosomes typically assemble on each pre- 
mRNA in a stepwise manner (Fig. 1). The formation of 
the E complex involves the recognition of the 5' ss by 
Ul snRNP, the BPS by SFl, the PYT by U2AF^^ and 
the 3' ss by U2AF^^. Subsequently, the U2 snRNP joins 
the spliceosome to form the A complex, followed by 
the joining of the U4/U6. U5 tri-snRNP to form the 
B complex. Extensive structural rearrangements occur 
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at this stage to form the catalytically active B* com¬ 
plex that is ready for the first step reaction. During this 
activation process, the base pairing between the 5' ss 
and U1 snRNA as well as the interaction between U4 
and U6 are disrupted, and U1 and U4 leave the 
spliceosome. The invariant ACAGAGA box of U6 
interacts with the 5' ss, and the 5' end region of U2 
interacts with the BPS. The same regions of U2 and U6 
also base pair with each other, bringing the branch 
point adenosine and the 5' ss close together. U5 inter¬ 
acts with both exons, likely important for aligning the 
two exons during the second step reaction. After the 
first step reaction, the spliceosome repositions the sub¬ 
strate for the second catalytic reaction and forms the 
C complex. The second reaction is followed by post- 
catalytic rearrangements to liberate the mature mRNA 
for export, release the lariat intron to be degraded, and 
snRNPs to be recycled. 

Although the large size and highly dynamic nature 
(both in composition and conformation) of the 
spliceosome have presented challenges to structural 
studies, there has been significant progress in under¬ 
standing the structural organization and dynamics of 
the core splicing machinery using a variety of struc¬ 
tural and biophysical approaches. Highlighted below 
are a number of EM structures which generated 
the overall (but less detailed) view of the shape 
and structural organization of the spliceosome. In 
addition, crystallographic and NMR studies provided 
high-resolution structures of the individual compo¬ 
nents, fragments, or small sub-complexes, and single 
molecule fluorescence offered a dynamic view of the 
spliceosome on a single molecule level. 

Electron Microscopy (EM) Studies of SnRNPs 
and Spliceosome 

Determining high-resolution crystallographic struc¬ 
tures of large spliceosomes or snRNPs is challenging 
due to difficulty in obtaining large quantity, high 
purity, and homogeneous samples. Single particle 
cryo-EM is often the method of choice to determine 
the overall structures (albeit in low resolutions) of 
spliceosome and snRNPs, which requires 100-1,000- 
fold less samples than what is typically needed 
for crystallization trials. Spliceosome or snRNPs used 
for EM studies are typically purified from HeLa 
nuclear extract or yeast whole cell extract through 


affinity chromatography using antibody targeting 
spliceosomal components, biotinylated pre-mRNA, 
or pre-mRNA containing MS2-binding aptamers. 
Sometimes double affinity purification or additional 
gradient centrifugation is used to achieve maximum 
homogeneity, although heterogeneity in sample still 
remains the major challenge in improving resolutions 
of EM structures. Cryo-EM samples are either pre¬ 
pared by directly flash-freezing or by combining neg¬ 
ative stain with flash-freezing. The latter is often used 
for EM structural determination of spliceosome and 
snRNP to increase sample contrast, since spliceosome 
and snRNP are often purified in the presence of glyc¬ 
erol which stabilizes the complex but significantly 
reduces sample contrast. EM images are then recorded 
which are 2D projections of the 3D density of 
spliceosome or snRNP. Many 2D projections 
(obtained through random conical tilt or angular recon¬ 
stitution) are combined to reconstruct the 3D structure 
of spliceosome or snRNP. Since resolution of most EM 
structures of spliceosome or snRNP is too low to 
directly identify secondary structures and individual 
components, labeling technique is often used to deci¬ 
pher structural organizations. The component of inter¬ 
est can be detected by antibody (or gold-attached 
antibody) against a spliceosomal component or anti¬ 
body against a genetic tag (such as TAP, HA, or Myc) 
attached to spliceosomal components. Genetic tags 
alone (often large fusion proteins) can also be used to 
identify components in the EM structures. 

The EM structures of human spliceosomal 
A complex (40-50 A resolution), B AU1 complex (the 
B complex that lacks U1 snRNP, 40 A resolution), 
B complex (2D averaged image only), and 
C complex (30 A resolution) have been determined 
(reviewed in (Jurica 2008; Luhrmann and Stark 
2009)) (Fig. 2). The A complex contains a globular 
body with several smaller protruding elements. Both 
B complexes consist of a flexible head domain 
attached to a rigid triangular body domain that resem¬ 
bles the tri-snRNP in size and shape. The head domain 
can accommodate the A complex structure fairly well. 
Labeling studies show that the 5' exon, 3' exon, intron, 
and SF3bl55 (a protein in close proximity to the BPS 
and 3' ss) are all located in the head domain at several 
distinct areas. The C complex is ^270 x 220 x 240 A 
which is similar in size and shape to the triangular 
domain in the B complex and may represent a stable 
core C complex. The core C complex contains 



Core Splicing Machinery: Structural and Biophysical Studies 


373 


c 




C 


Core Splicing Machinery: Structural and Biophysical Studies, Fig. 2 Representative EM structures of the spliceosome and 
sub-complexes (Luhrmann and Stark 2009) 


a cylindrical top domain and a larger bottom domain 
with an extended arm domain. These three domains are 
arranged in a relatively open conformation and U5 
snRNP, U2 snRNP, and Prpl9 potentially fit into 
these different domains. Labeling studies show that 
the two exons in pre-mRNA are close to each other 
and located in the general area between the top and 
bottom domain (Alcid and Jurica 2008). 

Human U1 snRNP is the smallest and the most 
abundant snRNP in the cell, making U1 snRNP the 
best studied snRNP biochemically and structurally. 
The cryo-EM structure of U1 snRNP is determined at 
10-14 A resolution (reviewed in Luhrmann and Stark 
(2009)). The EM structure is highly consistent with the 
5.5 A crystallographic structure of U1 snRNP deter¬ 
mined recently, and the structural details will be 
discussed later in the crystallographic section. Human 
U2 snRNP contains U2 snRNA, 7 Sm proteins, and 15 
U2-specific proteins, most of which are components of 
the SF3a and SE3b sub-complexes. The 3D structure of 
the entire U2 snRNP is unknown but the EM structure 
of SE3b has been determined to be less than 10 A 
resolution (reviewed in Luhrmann and Stark (2009)). 
SF3b plays an important role in BPS recognition and 
its pl4 protein component directly cross-links to the 
branch point adenosine. The intermediate resolution of 
the EM structure allowed the assignment of several 
protein components with known structural folds, 
including SF3b49 and pl4 (both containing RRM 
domain) as well as Sf3bl55 (containing 22 tandem 
HEAT repeats). Interestingly, pl4 seems to be located 
in the interior of the closed SF3b structure, implying 
that conformational changes of SF3b may be needed 
for pl4 to access the BPS. Tri-snRNP is the largest 
snRNP in the spliceosome. The EM structures of the 


human tri-snRNP and its U5 and U4/U6 snRNP com¬ 
ponents (reviewed in Luhrmann and Stark (2009)), as 
well as the yeast tri-snRNP (Hacker et al. 2008) and 
U6 snRNP (Karaduman et al. 2008) all have been 
determined. Tri-snRNP has a triangular shape with 
a head, body, and arm domain connected at the center. 
Labeling studies show that U4/U6 snRNP is located in 
the arm domain and the head domain contains 
U5 protein Brr2, while Prp8 and Snull4 are located 
around the center of tri-snRNP where the head and 
body domains meet. The head and arm domains are 
flexible, which may have functional implications dur¬ 
ing spliceosomal activation. 

Crystallographic and NMR Studies of 
Spliceosomal Components 

Crystallography and NMR are powerful approaches to 
determine high-resolution structures of macromole¬ 
cules. Both methods need large quantity, high concen¬ 
tration, and high purity samples, in comparison with 
EM studies. The major limitation for crystallography is 
the need for crystallization which is often 
unpredictable. The size of the complex it can tackle 
is unlimited, although it becomes challenging to obtain 
large quantity and highly homogeneous sample of 
large complexes. The NMR method studies macromol¬ 
ecules in solution and bypasses the need for crystalli¬ 
zation. However, the size of the macromolecule it can 
determine the structure of is limited. Below are exam¬ 
ples of spliceosomal sub-complexes or components 
structures that have been determined by crystallogra¬ 
phy or NMR, which provided important insight in the 
function of these macromolecules in splicing. 
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Core Splicing Machinery: Structural and Biophysical Stud¬ 
ies, Fig. 3 Crystal structure of the U1 snRNP (Pomeranz 
Krummel et al. 2009) 

U1 SnRNP 

U1 snRNP is the largest spliceosomal sub-complex 
that has a crystallographic structure (Fig. 3) (Pomeranz 
Krummel et al. 2009). Human U1 snRNP contains 
U1 snRNA, seven Sm proteins common to all snRNPs, 
and three U1-specific proteins (Ul-A, U1-70K, and 
Ul-C). The base pairing between the 5' end of 
U1 snRNA and the 5' ss is critical for 5' ss recognition. 
Previous biochemical studies and computer modeling 
suggest that U1 snRNA forms four stem loops (SLl-4). 
U1-70K and Ul-A bind to SLl and 2, while Ul-C does 
not bind to U1 snRNA by itself. Crystal structures of 
several Sm proteins led to a model of seven Sm pro¬ 
teins arranged in a ring-like structure. The crystal 
structures of the RED (RNA-binding domain, also 
known as RNA recognition motif or RRM) of Ul-A 
in complex with SL2 and the NMR structure of the Zn- 
finger domain of Ul-C were also determined. Building 
on these success, Nagai and colleagues reconstituted 
the entire human U1 snRNP using recombinant pro¬ 
teins (omitting the non-essential UlA) and in vitro 
transcribed U1 snRNA and determined the crystal 
structure of U1 snRNP at 5.5 A resolution. The Sm 
proteins, UlC Zn-finger, and a homology model of 
U1-70K RED were fitted into the electron density 
map using anomalous peaks from zinc and Se-Met 
substituted proteins as landmarks. Path of the long 


N-terminal arm of U1-70K was established utilizing 
seven Met mutations engineered in the N-terminus. 

U1 snRNA upstream of the Sm binding site forms 
a four-way junction where SLl and 2 as well as SL3 
and helix H coaxially stack. The seven Sm proteins 
form a heptameric ring on the Sm binding site, which 
interacts and stabilizes the four-way junction. The 
RED of U1-70K interacts with SLl. The N-terminal 
arm of U1-70K extends over 180 A from its RED, 
wraps around the core Sm domain, and finally contacts 
Ul-C, which explains why the N-terminus of U1-70K 
is necessary and sufficient for Ul-C binding to the U1 
core domain. In the crystal structure, the 5' end of U1 
snRNA base pairs with its counterpart from 
a neighboring molecule and mimics the interaction 
between U1 snRNA and the 5' ss. Helix A of Ul-C 
binds in the minor groove of this duplex. The loop 
between the two His residues in the Zn-finger is in 
close proximity to the proposed pre-mRNA strand. 
Arg21/Lys22 on Helix A and Arg28/Lys29 in the 
loop can potentially interact with the phosphate back¬ 
bone of RNA. These structural features are consistent 
with the essential role of Ul-C and R28/K29 in medi¬ 
ating the binding between U1 snRNA and the 5' ss to 
form the E complex. 

RRM Proteins 

RNA Recognition Motif (RRM) is the most common 
RNA-binding motif in eukaryotes (reviewed in Clery 
et al. (2008)). The RRM is approximately 90 amino 
acids long and contains an RNPl motif ([RK]-G-[FY]- 
[GA]-[FY]-[ILV]-X-[FY]) and an RNP2 motif([ILV]- 
[FY]-[ILV]-X-N-L). The first 3D structure of the RRM 
domain determined was UlA RRM domain (see refer¬ 
ence in (Clery et al. 2008)). The structure is made of 
two a-helices packed against four anti-parallel 
P-strands with a topology of plaip2p3a2p4. RNPl 
is located on P3 and RNP2 on pi. Typically, the three 
aromatic rings in RNPl and 2 interact with RNA 
through base stacking or interaction with sugar rings. 

The crystal structure of the U2AF^^/U2AF®^ 
heterodimer and the NMR structure of U2AF®^/SF1 
reveal that U2AF^^ and U2AF®^ both contain an RRM- 
like structure that interacts with protein instead of 
RNA (reviewed in Kielkopf et al. (2004)). These 
RRM-like domains are named U2AF®^ homology 
motifs (UHM). UHMs often contain aliphatic instead 
of basic or aromatic residues in RNPl and RNP2. 
A Trp residue in the protein ligand inserts into 
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a hydrophobic pocket formed between the a-helix and 
the RNPl and RNP2-like motifs in UHM. A conserved 
R-X-F motif on the a2/p4 loop contributes to the Trp- 
binding pocket. In addition, a series of acidic residues 
in al of UHM interact with basic residues at the N- 
terminus of the protein ligand. More proteins were 
subsequently found to contain UHM that interact 
with proteins. 

Proteins of the U5 SnRNP 

Several U5 proteins play critical roles in spliceosomal 
activation and potentially the catalytic reaction. Prp8 is 
a U5 protein and is also present in the tri-snRNP and 
the spliceosome (reviewed in Grainger and Beggs 
(2005)). Prp8 is one of the largest (>2,000 amino 
acids in length) and most conserved proteins in the 
nucleus. Prp8 has remarkably low sequence similari¬ 
ties with other proteins, making it difficult to deduce 
function from sequence analyses. Prp8 is the only 
spliceosomal protein that extensively cross-links with 
the 5' ss, 3' ss, BPS, U5, and U6, leading to the hypoth¬ 
esis that Prp8 may help forming/stabilizing the cata¬ 
lytic core or even contributing functional groups to the 
splicing reaction. Genetic analyses identified numer¬ 
ous Prp8 mutants that either suppress or exacerbate 
other splicing mutations, many of which are on splic¬ 
ing factors important in spliceosomal activation, lead¬ 
ing to the hypothesis that Prp8 may be a master 
regulator of spliceosomal activation. 

The crystal structure of the C-terminal domain 
(CTD) of Prp8 reveals a central MPN domain with 
N- and C-terminal extensions (Pena et al. 2007; 
Zhang et al. 2007). Although the general function 
of most MPN domains is unclear, a subset of the 
MPN domains contains a JAMM motif that was 
thought to coordinate a Zn"^"^ ion and function as 
metalloproteases. Prp8 CTD only contains a partial 
JAMM motif which does not coordinate a Zn"^"^ ion 
and is unlikely to be a Zn-dependent metalloprotease. 
On the other hand, GST pull down and yeast two- 
hybrid experiments demonstrate that the CTD of Prp8 
interacts with Brr2 and Snull4, which may have 
implications in the role of Prp8 as a regulator of 
spliceosomal activation. Prp8 contains a number of 
mutations implicated in human genetic disorder Reti¬ 
nitis Pigmentosa Type 13 (RP13). All RP13 mutations 
are located on the C-terminal extension of the Prp8 
CTD structure. Deletion of the C-terminal extension 
reduces the binding between Prp8 CTD and Brr2 or 
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Core Splicing Machinery: Structural and Biophysical Stud¬ 
ies, Fig. 4 Crystal structure of Prp8 (l-finger domain {cyan) 
superimposed with an RNase H domain structure (yellow) 
(Yang et al. 2008) 

Snul 14, suggesting a possible mechanism for the dis¬ 
ease phenotype of these mutations. 

The structure of another domain immediately 
upstream of the CTD was determined by three groups 
(Pena et al. 2008; Ritchie et al. 2008; Yang et al. 2008). 
This domain contains an RNase H core and 
a prominent P-hairpin finger protruding out of the 
middle of the protein (Fig. 4). RNase H domains con¬ 
tain a conserved DDE active site that coordinates 
a Mg"^"^ ion, critical for catalysis. The RNase 
H domain in Prp8 does not contain a complete DDE 
triad and does not coordinate Mg"^"^ ion. However, the 
RNase H domain in Prp8 seems to have maintained the 
RNA-binding capacity of canonical RNase H domains. 
This domain binds a U2/U6 mimic and U4/U6 much 
better than arbitrary RNAs. Modeling RNA on the 
RNase H domain of Prp8 places the 5' ss close to the 
previously observed 5' ss cross-linking site on Prp8 in 
a cleft close to the P-hnger. The protruding P-finger is 
strongly reminiscent of many ribosomal proteins with 
extensions (in the form of extended loops, a-helix, or 
P-hairpin fingers) protruding from globular protein 
bodies. These protrusions in ribosomal proteins insert 
into folded 16S or 23S rRNAs and stabilize the RNA 
structure. Mutations at the P-finger of Prp8 affect the 
conformational equilibrium between the first and sec¬ 
ond catalytic step and suppress U4-csl cold sensitivity. 
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Core Splicing Machinery: Structural and Biophysical Stud¬ 
ies, Fig. 5 A model depicting the overall structure of Brr2 
consisting the NTD, Hel308-I, and Hel308-II (Zhang et al. 


2009). The six domains in each Hel308-like module are 
represented in different colors. The two tandem sets of helicase 
and Sec63 domains are indicated in ovals on the structural model 


suggesting that Prp8 inserts its p-finger into the 
spliceosomal complex or tri-snRNP and stabilizes 
these complexes. Mutations on the P-finger alter 
these interactions, resulting in the observed pheno¬ 
types. These observations present an intriguing anal¬ 
ogy between a spliceosomal protein and ribosomal 
proteins. 

Brr2 is a U5 protein and DExD/H-box helicase that 
is responsible for U4/U6 unwinding during 
spliceosomal activation and U2/U6 unwinding during 
spliceosomal disassembly (see references within 
Zhang et al. (2009)). Prp8 and Snull4 both have 
been implicated in regulating the activity of Brr2, 
although the mechanism of regulation is unclear. 
Brr2 contains an N-terminal domain and two tandem 
sets of a helicase-like domain followed by a Sec63 
domain with unknown structure and function. Motifs 
in the first but not the second helicase-like domain are 
critical for ATPase activity, U4AJ6 unwinding, and 
cell viability. The crystal structure of the second 
Sec63 domain of Saccharomyces cerevisiae Brr2 was 
determined, which unexpectedly resembles domains 
4 and 5 of DNA helicase Hel308 (Pena et al. 2009; 
Zhang et al. 2009). The helicase-like domain upstream 
of Sec63 has clear sequence similarity with domains 
1-3 of Hel308. The full-length Brr2 is therefore likely 
composed of an N-terminal domain and two consecu¬ 
tive Hel308-like modules (Hel308-I and II) (Fig. 5). 


Structural model and mutagenesis data suggest that 
Hel308-I of Brr2 unwinds dsRNA using a helicase 
mechanism similar to Hel308, different from many 
well-studied DEAD-box proteins. Hel308-II interacts 
with Prp8 and Snul 14 in vitro and in vivo, potentially 
serving as a mediator for the regulation of Brr2’s 
activity by Prp8. The C-terminal region of Prp8 
(Prp8-CTR) facilitates the binding of the Brr2/Prp8- 
CTR complex to U4/U6, suggesting a potential role of 
Prp8-CTR as an auxiliary substrate binding and spec¬ 
ificity domain for Brr2. 

SnRNAs 

The chemistry of the splicing reaction is identical to 
the group II self-splicing introns and both reactions 
require Mg"^"^ ions. This led to the hypothesis that the 
spliceosome is a ribozyme and snRNAs play important 
roles in the splicing reaction. In particular, phyloge¬ 
netic, genetic, and biochemical analyses point to U6 as 
the snRNA most likely to play a catalytic role in the 
splicing reaction. The NMR structure of the highly 
conserved U6 Intra-molecular Stem Loop (ISL) from 
yeast was determined (reviewed in Butcher and Brow 
(2005)). The Watson-Crick paired region of the stem 
has standard A-form helical geometry. U80 in the 
internal loop of ISL coordinates a Mg"^"^ ion which 
may be the Mg"^"^ ion required for catalysis. The 
GCAUA pentaloop in ISL makes a GNRA-type fold 
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Core Splicing Machinery: Structural and Biophysical Stud¬ 
ies, Fig. 6 Secondary structure of U2/U6 determined by NMR 
and secondary structure of domain 5 of group II intron (reviewed 
in Butcher and Brow 2005). The invariant AGC triad is in red 
and the nucleotide that coordinates a Mg"^ ion is labeled by a * in 
both structures 


that often mediates tertiary interaction with RNA and 
was hypothesized to bring the metal ion in ISL close to 
the 5' ss. U6 ISL seems to be a structural and functional 
equivalent of domain V of the group II intron. The two 
stem loops are similar in length and secondary struc¬ 
tures. Each stem loop has two helices separated by an 
internal Mg'^'^-binding bulge and both stem loops have 
a GNRA-type loop. 

Butcher and colleagues analyzed the structures of 
various yeast U2/U6 constructs (including the ISL 
region) using NMR (reviewed in Butcher and Brow 
(2005)) (Fig. 6). U2/U6 forms a four-way helical junc¬ 
tion and this fold may be important in bringing the 
metal binding site in U6 close to the 5' ss for the first 
step of catalysis. The invariant AGC triad in U6 base 
pairs with nucleotides in U6 and extends the length of 
the U6 ISL, which is in contrast to the previous model 
derived from genetic analyses where AGC base pairs 
with nucleotides in U2 to form helix Ib. These models 
could reflect two U2/U6 conformations required for 
different stages of the splicing reaction. It is interesting 
to note that the base pairs formed between AGC and 
nucleotides in U6 at the base of ISL are identical to the 


base pairs in the lower stem of domain 5 of group II 
self-splicing intron which are critical for catalysis. 


Single Molecule Fluorescence (SMF) Studies 
of the Spliceosome 

SMF can observe the presence of a fluorophore 
attached to the molecule of interest or the distance 
between two fluorophores using fluorescence reso¬ 
nance energy transfer (FRET) on a single molecule 
level. SMF can possibly detect the structural or kinetic 
heterogeneity in the pre-mRNA splicing system that is 
difficult to achieve in bulk analyses. SMF has the 
potential to observe discrete steps in the spliceosomal 
assembly and catalysis in real time. The small sample 
size and low quantity requirement are also highly suit¬ 
able for the pre-mRNA splicing system. 

Crawford et al. covalently attached a Cy3 
fluorophore to the 5' exon of yeast RP51A pre- 
mRNA and an Alexa647 fluorophore to the intron 
(Crawford et al. 2008). The pre-mRNA was tethered 
on a glass surface through its 3' end. A Total Internal 
Reflection Fluorescence (TIRF) microscope is used to 
observe splicing of single pre-mRNA molecules in 
S. cerevisiae whole cell extract by monitoring the loss 
of intron-specific fluorescence. At the start of the exper¬ 
iment, many molecules have the two fluorescent colors 
co-localized on the glass surface. About 25% of pre- 
mRNA loses the Alexa647 fluorescence associated with 
intron in 60 min, representing the release of introns. The 
majority of the splicing events occurs between 20 and 
45 min and is essentially all completed by 60 min. 

Abelson et al. incorporated a Cy3 donor fluorophore 
in the 5' exon close to the 5' splice site and a Cy5 
acceptor fluorophore in the 3' exon adjacent to the 
3' splice site of modified yeast Ubc4 pre-mRNA, and 
observed the conformational dynamics of individual 
pre-mRNA using single molecule FRET (Abelson 
et al. 2010). The WT as well as BP and 3' ss mutated 
pre-mRNA all fluctuate between high and low FRET 
states, likely reflecting the transient formation and 
disruption of secondary structures in pre-mRNA. 
Upon addition of splicing extract in the presence of 
ATP, WT and 3' ss mutated pre-mRNA demonstrated 
a shift toward mid- and high-FRET states (indicating 
close proximity of the two exons) within 15 min of 
incubation, while BP mutated pre-mRNA shows 
no signihcant change. After 60 min incubation, WT 
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pre-mRNA transit into an even higher FRET state that 
is also found in the mature mRNA control. Most con¬ 
formation changes are reversible, suggesting that 
spliceosome assembly occurs close to thermal 
equilibrium. 

Summary 

Over the past several decades, we have made signifi¬ 
cant progress in understanding the structure and 
dynamics of the spliceosome. However, high- 
resolution structures of large spliceosome or sub¬ 
complexes are still lacking. In the future, improving 
sample homogeneity will be key to improve resolution 
of the EM structure and enable crystallographic studies 
of large spliceosome or sub-complexes. Purifying the 
spliceosome or snRNPs under more stringent condi¬ 
tions or from different species, using small molecule 
inhibitors or temperature-sensitive mutants to trap the 
spliceosome, or reconstitute spliceosomal sub¬ 
complexes from recombinant proteins are all 
approaches that can potentially improve sample homo¬ 
geneity. Advances in EM sample preparation, data 
collection, and processing, and new methods in sepa¬ 
rate images into different subpopulations will also help 
improve the resolution of EM structures. Single mole¬ 
cule fluorescence studies of the spliceosome have just 
begun and will continue providing valuable insights in 
the dynamics of spliceosome assembly and catalysis. 
The combination of all these information as well as 
biochemical and genetic data is likely the best 
approach in the foreseeable future to obtain a detailed 
and comprehensive view of this complicated and 
dynamic molecular machinery. 

Cross-References 

► Alternative Splicing 

► Electron Microscopy 

► Helicases 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Nucleic Acid NMR - Introduction 

► Protein NMR - Introduction 

► Single-Molecule Methods 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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Synonyms 

Electron transfer; Kinetically coupled electron transfer 

Definition 

Coupled electron transfer is a process in which 
a relatively slow electron transfer reaction is preceded 
by a rapid but thermodynamically unfavorable non¬ 
electron transfer reaction that is required to activate 
the system for electron transfer. 

Basic Characteristics 

Coupled electron transfer occurs when a relatively 
slow nonadiabatic (true electron transfer) reaction is 
preceded by a rapid but thermodynamically unfavor¬ 
able, adiabatic (non-electron transfer) reaction that is 
required to activate the system for electron transfer. 
Since the activated and unreactive states exist in an 
equilibrium that favors the unreactive state, the 
observed rate constant (k^hs) for the redox reaction 
under saturating conditions is an apparent value equal 
to the product of the true electron transfer rate constant 
(^et) and the equilibrium constant for the preceding 
reaction step (Eq. 1). 

^obs — ^ET * ^eq (1) 

It is known that the observed rates of electron trans¬ 
fer reactions may be influenced by protein 


conformational changes (Hoffman and Ratner 1987; 
Brunschwig and Sutin 1989). The term “coupled elec¬ 
tron transfer” was coined by Harris et al (1994) to 
explain the ionic strength dependence of the rate of 
an interprotein electron transfer reaction. This is an 
example of a bimolecular electron transfer reaction in 
which some rearrangement of reactants must occur 
after initial binding to optimize the system for electron 
transfer (Eq. 2). Coupled electron transfer may also 
occur during an intramolecular electron transfer if two 
or more states of the system exist with only one opti¬ 
mized for electron transfer (Eq. 3). In these schemes, 
A and B are the reactants in the oxidized (ox) and 
reduced (red) states. 

k kx f ^’£T 

■Aqx Bred ^'^ox/Bred’'— [^ox/^red] -'^red/J^ox (2) 

k-x k-ET 

kx f kET 

Aox + Bred ^ [Aox/Bred] ^ A^ed/Box (3) 

k-x k^Ej 

In this model, the prerequisite adiabatic reaction 
step has a forward rate constant and an equilibrium 
constant Xx The reaction is kinetically 

coupled electron transfer if and Xx < < 1. 

In this case, ^obs for the conversion of Aox/Bred to Ared/ 
Box will be the product of ^et and Xx (Eq. 1). The 
distinction between coupled and true electron transfer 
reactions is difficult. Since ^obs is proportional to 
the reaction rate of the coupled reaction will still vary 
with AG° (i.e., the ► redox potential difference, AEn,, 
of the redox centers). Coupled electron transfer reac¬ 
tions are more easily distinguished from ► gated elec¬ 
tron transfer reactions in which ky_ < /tet. and for which 
the reaction rate does not depend on AG° because ^et 
is not rate-limiting in that case (Davidson 1996). 

True electron transfer reactions are described by 
► electron transfer theory (Eq. 4) in which AG° is the 
driving force determined from the redox potential dif¬ 
ference for the electron transfer reaction, 1 is the reor¬ 
ganization energy, and //ab is the electron coupling 
matrix element (Marcus and Sutin 1985). To experi¬ 
mentally determine the values of X and //abj one typ¬ 
ically measures the rate of reaction with a fixed AG° 
over a range of temperature, or at a fixed temperature 
over a range of AG°. If one attempts to analyze 
a coupled electron transfer reaction in this manner, 
then one would obtain misleading values of A and 
//ab which do not describe /tet- As such, it is important 




380 


Coupled Electron Transfer 


to understand the consequences of analyzing 
a kinetically coupled electron transfer reaction by elec¬ 
tron transfer theory. 


^ET — 


47i2//2 0/h(47tlRT) 


0.5 


exp 


/AaKI 

(4) 


The effects of kinetic coupling on the experimen¬ 
tally determined values of a and //ab will depend upon 
whether one is analyzing the AG°-dependence or the 
temperature-dependence of the reaction. When 
the AG°-dependence of the limiting first-order 
rate constant of a coupled electron transfer reaction is 
analyzed, an accurate value of X will be obtained, 
but the experimentally determined //ab will be less 
than the //ab for the true electron transfer reaction 
(i.e., that described by /:et)- When the temperature- 
dependence of the limiting first-order rate constant of 
a coupled electron transfer reaction is analyzed, the 
experimentally determined values of X and //ab will 
both be affected. How the apparent values of these 
parameters will be affected will depend upon the 
values of A//° and AS° for the preceding adiabatic 
reaction. 

To determine precisely and predictably how the 
pre-equilibrium in a coupled electron transfer 
reaction will influence the experimentally determined 
electron transfer parameters for that reaction, simu¬ 
lated data sets for the variation in with temperature 
were analyzed by electron transfer theory (Davidson 
2000). These simulated data sets were generated by 
arbitrarily assigning values of A//° and A5'° for theo¬ 
retical prerequisite adiabatic reaction steps. From these 
simulated data sets, a value for A'x was determined at 
a particular temperature according to (Eq. 5) and the 
kobs was calculated as k^T:*Kx. The following general 
trends were reported. 

In = -AH/RT 4- ^S/R (5) 

The magnitude of the apparent X determined from 
the temperature-dependence of the rate of a coupled 
electron transfer will depend on the magnitude of A//° 
for the unfavorable reaction step that precedes electron 
transfer. Since this prerequisite adiabatic reaction is by 
definition very unfavorable, the A//° for this reaction 
will most likely be positive. As a consequence of this 
positive A//°, the apparent value of X that is obtained 


from the analysis of temperature-dependence data will 
be greater than the true value of X for ^et- The magni¬ 
tude of this A//° has no significant effect on the exper¬ 
imentally determined value of //ab- Conversely, the 
magnitude of AS° for the unfavorable reaction step that 
precedes electron transfer has no significant effect on 
the experimentally determined value of X, but primar¬ 
ily influences the experimentally determined value of 
//ab- The value of //ab determined from the tempera¬ 
ture-dependence of the rate of a coupled electron trans¬ 
fer will be accurate only if AS° for the preceding 
reaction is zero. If AS° is positive, the apparent value 
of //ab will be greater than that of the //ab for the true 
electron transfer reaction. If A5° is negative, the appar¬ 
ent value of //ab will be less than the //ab for the true 
electron transfer reaction. 

Cross-References 

► Dynamic Docking 

► Electron Transfer Theory 

► Electron Transfer Through Proteins 

► Gated Electron Transfer 

► Proton-Coupled Electron Transfer 
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Synonyms 

Protein fluorescent dye labeling 


Folded-Unfolded Continuum 


Proteins are observed in many different states of 
“foldedness,” from a fully random ensemble of 
completely different solution structures to a rigidly struc¬ 
tured ensemble where all of the members have closely- 
related, if not identical, three-dimensional stmctures. The 
concept of “folding coupled to binding” can therefore be 
expanded to embrace a continuum of interactions, where 
one or both partners may show varying degrees of con¬ 
formational differences between the free and bound state. 
This is illustrated in Fig. 1 (Dyson and Wright 2005). 
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Introduction 

The recent understanding that functional proteins do 
not necessarily have stable three-dimensional struc¬ 
tures leads inevitably to the question as to how such 
“intrinsically disordered proteins” (IDPs) perform 
their various functions, and what the advantages 
might be for biological systems to include elements 
of protein structural disorder. A number of possible 
functional advantages that unstructured polypeptide 
elements might have over stable structured forms in 
particular situations have been identified. One of these 
situations is the case where intermolecular interactions 
must occur (for example, as part of a signaling cas¬ 
cade) with high specificity and sufficiently high affin¬ 
ity, but which must subsequently be reversed when the 
signal is turned off. As well, responses to signaling 
may arise due more to competition between various 
signaling molecules for the downstream components 
of the transcriptional machinery such as transcriptional 
adaptors and activators. In both of these scenarios, the 
participation of intrinsically disordered protein seg¬ 
ments provides a vital advantage. These interactions 
generally occur through a mechanism of coupled fold¬ 
ing and binding, which provides both for the extremely 
high specificity, at the level in some cases of individual 
amino acid side chains, and the relatively modest affin¬ 
ity that allows for the dissociation of the complex after 
passage of the signal. 

This article will discuss examples of processes that 
involve the coupling of folding of intrinsically disor¬ 
dered domains with binding to a target molecule, which 
may itself be more or less disordered. Intrinsically 
disordered proteins (IDP) are associated with a broad 
repertoire of biological functions and processes, includ¬ 
ing cellular control mechanisms and signaling. 


Coupled Folding and Binding of IDPs 

Significant tracts (>50 amino acid residues) that are 
predicted to be unstructured are surprisingly common 
in protein sequences inferred from published genomes, 
especially those of eukaryotes (Dunker et al. 2008). 
A few proteins are completely disordered in the 
absence of their physiological partner and fold into 
globular structures only upon binding. However, 
most coupled folding and binding interactions so far 
observed involve relatively short segments either 
within longer disordered sequences or between fully 
folded domains (Dyson and Wright 2002; 2005). These 
folding motifs (which have been termed molecular 
recognition elements or MoRFs) (Mohan et al. 2006) 
may form helix, /1-strand, or irregular structure on 
binding to a target protein, mainly through intimate fit 
with their binding partners to maximize intermolecular 
hydrophobic interactions. This behavior contrasts with 
the general tendency of folded domains to interact 
primarily through surface charge effects. 

One of the advantageous features of disordered 
recognition elements is that they have the potential to 
fold into different structures on binding to different 
target proteins. This increases the versatility of the 
proteome, as well as providing for different signals 
under different cellular conditions, where concentra¬ 
tions of the various partners may differ. For example, 
the intrinsically disordered regulatory region near the 
C-terminus of p53 folds into helical, /1-strand, and 
extended irregular structures on binding to different 
protein partners (Oldfield et al. 2008), and the 
C-terminal activation domain of the hypoxia inducible 
factor HIF-la adopts helical and extended structures, 
respectively, in its complexes with CBP/p300 and the 
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increasing content of stable three-dimensional structure 



unstructured 

conformational ensemble 

e.g. ACTR (no NCBD) 



linked folded domains 
(beads on a string) 



,■ „ ^ finger 1 

f"^9er3 

e.g. zinc fingers (no DNA) 


mostly folded, 
local disorder 



elF4E (N-ferminus unfoldea) 



Coupled Folding and Binding, Fig. 1 The continuum of 
protein structure. The upper panels show schematic diagrams 
of examples on the continuum of protein structure: an unstruc¬ 
tured conformational ensemble, the interaction domain of 
activator for thyroid hormone, and retinoid receptors (ACTR) 
(left panel)', a molten globule-like domain such as the 
nuclear-coactivator-binding domain (NCBD) of cyclic-AMP- 
response-element-binding protein (CREB)-binding protein 

asparagine hydroxylase FIH (Dyson and Wright 2005). 
Structural plasticity in the final bound state is 
displayed by many intrinsically disordered proteins. 

Regulation by Competition for Nonidentical 
Binding Sites 

A number of the interaction domains of central 
transcriptional activators such as CBP/p300, while 
well-structured themselves, interact with a variety of 
partners, all of which appear to be intrinsically unstruc¬ 
tured. Intriguingly, these partners may bind in different 
sites on the interaction domains, allowing for allostery 
(see following section) as well as providing insights 


(CBP) (second panel)', and linked folded domains such as 
a construct that contains the first three zinc fingers of transcrip¬ 
tion factor-IlA (TFIIIA) (third panel) and free eukaryotic initi¬ 
ation factor-4E (elF4E). The lower panels show the structures of 
the domains in the upper panels folded onto their biological 
target domains or sequences. Reproduced with permission 
from Dyson and Wright 2005 

into a likely mechanism for the regulation of signaling 
interactions by competition. This principle is illus¬ 
trated by the interactions of the TAZl domain of 
CBP. The TAZl domain is a folded domain (residues 
345^36) of the transcriptional activator CBP. Its 
structure is stabilized by the presence of three zinc 
ions, ligated by the side chains of cysteine and histidine 
residues of the protein (Fig. 2), and the structure 
remains the same in the free state as in several com¬ 
plexes. The C-terminal activation domain of the hyp¬ 
oxia inducible factor HIF-la forms a complex with the 
TAZl domain where helical segments bind in the 
grooves formed by the TAZl helices. aA of HIF-la 
binds between al and a2 of TAZl, aB between al and 
a3, and aC between a4 and al. The CITED2 
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Coupled Folding and 
Binding, Fig. 2 Upper row. 
lowest energy structures of the 
complexes of the TAZl 
domain of CBP {blue) with 
(left) the interaction domain of 
HIF-lot (Dames et al. 2002), 
(middle) the interaction 
domain of CITED2 (De 
Guzman et al. 2004), and 
(right) the interaction domain 
of STAT2 (Wojciak et al. 
2009). Each interaction 
domain is colored green at the 
N-terminus, red at the 
C-terminus, and orange in the 
middle. Lower row: schematic 
representation of the topology 
of each interaction, showing 
the differences in the binding 
patterns for the three 
interaction domains 


TAZ1/HIF-1a 



N 


C 




interaction domain winds around TAZl in the same 
sense, but the N-terminal aA binds in the groove 
between al and a4 of TAZl, and the remainder of 
CITED2 binds across the “front” face of TAZl where 
al of HIF-la interacts. The C-terminus of CITED 
occupies roughly the same area of TAZl as a2 of 
HIF-la, and there is no C-terminal a3. Surprisingly, 



ensemble 




TAZ1/STAT2 


the transactivation domain of STAT2 binds to TAZ 1 
in the opposite sense from HIF-la and CITED2 
(Fig. 2). It is probably significant for function that 
each of the binding partners interacts with roughly 
the same area of TAZl, but the mode of binding and 
the secondary structures formed upon binding are dif¬ 
ferent. A situation can be envisaged where an alterna¬ 
tive binding partner, say HIF-la, could remove 
a bound activation domain such as CITED2 by 
interacting with TAZl in the area between al and a2 
that is not covered in the CITED2 complex. Focal mass 
action effects could then facilitate the removal of the 
CITED ligand and its replacement with the HIF ligand. 


Coupled Folding and Binding, Fig. 3 Schematic diagram 
showing two extreme mechanistic possibilities, folding upon 
binding or conformational selection (Reproduced with permis¬ 
sion from Wright and Dyson 2009) 

to form a ternary complex, promoting allosteric cou¬ 
pling between binding sites and offering the potential 
for synergy and cross talk between signaling pathways, 
mediated by interactions of disordered domains with 
a common binding partner. For example, the activation 
domains from the c-Myb and mixed lineage leukemia 
(MFL) transcription factors bind cooperatively to the 
KIX domain of CBP to form a ternary complex 
(De Guzman et al. 2006). 
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Allostery and Cooperative Binding 

Disordered proteins frequently bind to their targets 
through relatively short amphipathic motifs. This 
means that even quite small target proteins can be 
capable of binding more than one IDP simultaneously 


Mechanism of Coupled Folding and Binding 

How does coupled folding and binding occur? Does 
folding occur before binding or does binding occur 
before folding? These two extremes have been catego¬ 
rized as conformational selection and induced folding. 
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An example of the induced folding of an IDP upon 
binding to its partner is the association of the phosphor- 
ylated kinase-inducible domain of the transcription 
factor CREB (pKID) with the KIX domain of CBP. 
For this system, relaxation dispersion NMR spectros¬ 
copy was used to show that pKJD hrst associates with 
KIX in an “encounter complex” containing very little 
secondary structure. The complex rapidly rearranges to 
an intermediate structure where one of the two helices 
present in pKID in the complex is formed, while the 
other remains associated with KIX in a non-helical 
state. Finally the fully folded complex, containing two 
helices in pKID, is formed, on a relatively slow time 
scale (Sugase et al. 2007) (Fig. 4). 

Kinetic studies of ligand-induced folding of a series 
of mutants of staphylococcal nuclease showed evi¬ 
dence for both folding before binding and binding 
before folding (Onitsuka et al. 2008), but it has so far 
generally been found that increasing the stability of 
pre-organized secondary structure not only fails to 
enhance the binding affinity but may even slow the 
kinetics of complex formation (Bienkiewicz et al. 
2002; Zor et al. 2002). This implies that disorder in 
the free state may actually be advantageous for the 
binding process. Some IDPs remain disordered even 
after binding to their targets. 


Coupled Folding and Binding, Fig. 4 Schematic representa¬ 
tion of the induced folding mechanism of pKID binding to KIX 
(Sugase et al. 2007), derived from NMR relaxation dispersion 
measurements. The conformational ensemble of the free protein 
contains little propensity for helical structure; the initial encoun¬ 
ter complex is heterogeneous and mostly involves the C-terminal 
end of the pKID peptide. The intermediate structure shows con¬ 
tact between KIX and the entire length of the peptide, but helical 
structure is not present in either (Xa or otg. Finally the fully folded 
complex shows stable helical structure in both regions of pKID 
(Reproduced with permission from Wright and Dyson 2009) 


respectively (Wright and Dyson 2009) (Fig. 3). In the 
conformational selection mechanism, the target pro¬ 
tein “selects” a conformation approximating that of the 
bound form from the ensemble of conformations pop¬ 
ulated by the free IDP. In induced folding, the protein 
associates with its binding partner in a fully disordered 
state and subsequently folds in association with the 
target protein. Examples have been described where 
one of these mechanisms appears to predominate, but it 
is to be expected that most systems will show evidence 
of the participation of both mechanisms. 


Does Disorder Increase Association Rate? 

Since an unstructured protein samples many different 
structures in solution, some of which may be highly 
extended, it should have a larger capture radius than 
a well-folded protein of comparable size. The sugges¬ 
tion that the rate of macromolecular association is 
enhanced by the presence of disorder has been termed 
the “fly-casting” mechanism (Shoemaker et al. 2000). 
Theoretical simulations suggest that fly-casting plays 
an important role in binding and folding of disordered 
proteins, but experimental support for this model is 
only beginning to be explored. 

Are These Domains Disordered In Vivo? 

There is a growing body of evidence that important 
transcriptional and signaling domains are disordered 
in vitro, but the question arises as to whether they are 
also disordered in vivo. Would the molecular crowding 
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of the intracellular space induce structure even in dis¬ 
ordered domains? This question is difficult to evaluate 
experimentally, mainly because the intracellular con¬ 
centrations of the molecules that contain the disor¬ 
dered domains are in general extremely low. Such 
experiments as have been attempted give mixed 
answers to the question - some systems appear to 
become slightly more stmctured upon increasing 
local molecular crowding, while others do not. Also, 
since the disordered interaction domains frequently 
form part of a larger molecule that may contain more 
structured domains, there is always the possibility that 
the “disordered” segments are associated with, and 
perhaps folded upon, these structured portions. Never¬ 
theless, it is clear from a consideration of the topology 
of complexes such as those shown in Fig. 3 that no 
matter what the state of the interaction sequences in the 
free state in the cell, the sequences must be in an 
unfolded state in order to form the complexes. Disor¬ 
dered proteins are frequently highly susceptible to 
proteolytic degradation in vitro, and it has been 
suggested that sensitivity to proteolysis may be impor¬ 
tant to the signaling function of many IDPs (Dyson and 
Wright 2005). The correlation between protein disor¬ 
der and short half-life is rather weak (Tompa et al. 
2008), suggesting that many disordered proteins are 
protected from degradation, possibly by complexation 
with other proteins or nucleic acids. 

Cross-References 

► NMR 

► Relaxation Dispersion 

► Structurally Disordered Proteins 
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Synonyms 

Carr-Purcell-Meiboom-Gill spin echo 

Definition 

The CPMG (Carr-Purcell-Meiboom-Gill) pulse train 
is a fundamental component of pulse sequences 
used for the measurement of dynamic processes by 
NMR spectroscopy. As reported in his seminal paper 
of 1950, Hahn discovered that when two successive 
radio-frequency pulses, separated by a time interval i, 
are applied to an ensemble of spins, a refocused “spin 
echo” appears at time 2 t. This allowed relaxation 
studies of liquid samples with linewidths dominated 
by the severe inhomogeneity of the magnets available 
in the early days of NMR spectroscopy. However, 
Hahn also recognized that within an inhomogeneous 
field, the effect of random translational diffusion dur¬ 
ing T is not refocused by the second pulse and thus still 
contributes to the decay of the spin echo. In 1954, Carr 
and Purcell demonstrated that a 90° pulse followed 
by a (t— 180°—T)n pulse train will suppress the effect 
of diffusion, provided that the x delay is sufficiently 
short. Furthermore, a complete T 2 relaxation data set 
could be obtained in a single experiment by measuring 
each resulting echo appearing between successive 
180° pulses. Meiboom and Gill (1958) improved 
this approach by using better electronics to generate 
coherent pulses with the initial 90° pulse shifted 
in phase by 90° relative to the subsequent 180° 
pulses. This simple modification compensated for 
imperfections in the 180° pulses that would otherwise 
have accumulated over the course of the pulse train. In 
recognition of the pioneering contributions made by 
these researchers, a (x — 180° — x)|j sequence applied 
to transverse magnetization, such as that generated 
by a 90° pulse, is now commonly referred to as the 
CPMG pulse train. In 1963, Luz and Meiboom 


demonstrated that the effects of msec-psec timescale 
exchange of a spin between chemical or conforma¬ 
tional states with distinct chemical shifts could 
also be attenuated by using sufficiently short CPMG 
delays. 

Currently, the CPMG sequence is implemented in 
two basic ways for dynamic NMR studies. In one 
version, data are recorded using a 
90° — (x — 180° — x)|j-observe sequence with increas¬ 
ing numbers, n, of 180° pulses. This yields a series of 
spectra from which the transverse T 2 relaxation life¬ 
times of spins within a sample can be obtained by 
fitting their decaying signal intensities versus the total 
relaxation delay, 2nx. Alternatively, translational dif¬ 
fusion within an inhomogeneous magnetic field (i.e., 
created with a pulsed field gradient) or chemical/con¬ 
formational exchange can be investigated by recording 
data as a function of the CPMG delay x. In the case of 
exchange occurring on the appropriate timescale, 
Allerhand and Gutowsky (1964) showed that the inter¬ 
conversion rate of a spin between two states, as well as 
its T 2 lifetime and chemical shift in each state, could be 
obtained by fitting such x-dependent data. This has 
been exploited most recently through sophisticated 
relaxation dispersion methods, developed in large 
part by the groups of Palmer and Kay. In addition to 
its use in dynamic studies, the CPMG pulse train has 
also been incorporated in other pulse sequences, for 
example to reduce detrimental effects of chemical/ 
conformational exchange occurring during magnetiza¬ 
tion transfer steps. 

Cross-References 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR Methods for Kinetic Analysis 

► Protein Conformational Dynamics by Relaxation 
Dispersion 

► Protein Folding 

► Relaxation Dispersion 


CRC Mitochondrial Carnitine- 
Acylcarnitine Carrier 

► Mitochondrial Transport Protein Family 
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Critical Fluctuations in Lipid Mixtures 

Pietro Cicuta 
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Cambridge, UK 

Synonyms 

Behavior near critical points; Composition fluctua¬ 
tions; Lipid phase equilibria; Lipids 

Definition 

Critical fluctuations occur in lipid mixtures near 
conditions where phase separation can take place. 
Lipid mixtures with compositions that mimic biologi¬ 
cal cell membranes have an upper critical consolute 
temperature, and phase separate on cooling typically 
below 30°C. Near this transition the susceptibility to 
composition fluctuations diverges, enabling regions of 
different compositions to emerge spontaneously. The 
length scale and lifetime of these transient domains 
increase, the closer the system is to phase separation. 

Ternary Phases Can Phase Separate in Liquid- 
Liquid Coexistence 

The net interaction between lipids in a bilayer is the 
result of many contributions, including short range van 
der Waals forces, electrostatics, and “hydrophobic” 
forces (i.e., the complex result of accounting for 
interactions and configurations of water molecules 
near the surface). In multicomponent mixtures, these 
contributions can result in a preference for demixing, 
particularly between saturated and unsaturated species. 
Balancing against entropic contributions accounting for 
the number of independent configurations, this leads to 
the common (“regular solution”) behavior, with a single 
mixed phase at high temperature and phase separation 
below a critical temperature Tc. As well as composition 
and temperature, pressure is an important thermody¬ 
namical parameter in these systems. 

In binary mixtures, phase separation can take place 
between a saturated and an unsaturated phospholipid 
species, leading to phase coexistence between a gel 
and a liquid phase. Likewise, binary mixtures of 
a saturated phospholipid and a sterol also show phase 


separation. In all these cases, the phase rich in satu¬ 
rated lipid has a “solid” character. In ternary mixtures 
in which a significant molar fraction of a sterol, such as 
cholesterol, is added into a saturated/unsaturated 
mixture, the system can phase separate into two 
separate, but both fluid, states. This case is important 
because of its biological relevance. Of these two liquid 
phases, one is referred to as liquid ordered Lq, and the 
other as liquid disordered Lj^. The Lq phase is more 
ordered in the sense that the components are packed 
more tightly, with more extended lipid acyl chains, and 
this bilayer is thicker than the one in the disordered 
phase. Most lipid dyes and other impurities will tend 
to partition into the disordered La^ phase (however, 
specific lipid dyes have also been developed to 
label the Lq phase). Critical points are special points 
in the phase diagram (composition, pressure, and 
temperature) where tie-lines merge into single points 
and where the compositions of coexisting phases 
become identical. 

Phase separation in ternary saturated/unsaturated -i- 
cholesterol mixtures has been characterized for various 
systems and linked with possible signaling and regu¬ 
latory roles in the cell membrane (Veatch and Keller 
2003; de Almeida et al. 2003). The phase diagrams 
depend on the concentration of all three species and on 
temperature (see Fig. la). For example, a 1:1 concen¬ 
tration of DPPCiDOPC, and 30% molar cholesterol 
mixture is uniform at temperatures above 30°C, and 
phase separates on cooling through this temperature. 
At a fixed temperature, phase separation can also be 
induced by a composition change, for example, in the 
balance of saturated:unsaturated phospholipids, or in 
the concentration of sterol. The general features of 
these diagrams are well understood. 

Biological cell membranes are mixtures of typically 
thousands of lipids, but rather than displaying a vast 
number of phases and phase transitions (as would be 
allowed by the Gibbs phase rule) it seems that these 
complex mixtures have the same thermodynamic 
behavior as a ternary system (Honerkamp-Smith 
et al. 2009). This is consistent with the view that 
different saturated (and unsaturated) lipids tend to 
remain mixed. 

Observations of Critical Fluctuations in Vesicles 
and Cell Blebs 

Reconstructed bilayers (typically in the form of 
supported films or giant unilamellar vesicles) have 
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Critical Fluctuations in Lipid Mixtures, Fig. 1 Critical 
fluctuations are observed near phase separation conditions. The 
ternary phase diagram showing the composition region 
where liquid-liquid coexistence is possible, color coded for 
different temperatures, is shown in (a). This is typical of many 
mixtures of saturated, unsaturated phospholipids and a sterol 
(Figure reprinted with permission, from Veatch and Keller 


2003). Composition fluctuations (transient domains) are visible 
in fluorescence microscopy, and their size grows as the temper¬ 
ature approaches T^ = 23.3°C. The characteristic size ^ is 
measured in (b), and shows excellent agreement with the two- 
dimensional Ising critical behavior (Reprinted with permission 
from Veatch et al. 2008, © ACS 2008) 


been very useful to understand the phase behavior in 
multicomponent mixtures. Universal thermodynamic 
behavior characterizes systems close to a critical point, 
including large “critical” fluctuations. There is 
growing evidence that in living cells the lipid compo¬ 
sition is regulated to maintain a certain distance to the 
critical point (Jin et al. 1999; Baumgart et al. 2007), 
with likely effects on a range of processes that occur in 
biological membranes (Honerkamp-Smith et al. 2008). 

When a critical point is approached from low 
temperature (heating from within the two-phase 
coexistence region) the increasing osmotic compress¬ 
ibility acts to reduce the line tension of the linear 
interface between domains (the interfacial energy 
between phases); thermal energy becomes sufficient 
to cause large roughening of the domain boundaries. 
When a critical point is approached from high temper¬ 
ature (cooling in the one-phase region) the increasing 
osmotic compressibility allows for the formation of 
longer-ranged and longer-lived composition fluctua¬ 
tions. The video analysis of the fluctuations of domain 
shape in model membranes (and multicomponent ves¬ 
icles obtained from cell blebbing) can be used below 
Tc to measure the line tension, and above Tc to deter¬ 
mine the typical length scale of concentration fluctua¬ 
tions. This characterizes the critical behavior (see 
Fig. lb and Veatch et al. (2008)). 


Physics of 2D Critical Points 

The behavior of systems near critical points depends only 
on very general properties of the system, such as the 
symmetry of the order parameter and the spatial dimen¬ 
sion; it does not depend on the details of the interaction. 
Systems that share the same critical behavior are classi¬ 
fied in terms of their “universality class” (Stanley 1971). 

Ternary phospholipid mixtures are in the same 
universality class as the two-dimensional Ising 
model, a two-state model with nearest neighbor inter¬ 
actions that has been solved exactly and has been used 
to successfully describe the behavior of a wide range of 
experimental systems near critical points, including 
spin magnets and liquid crystals. 

In the 2D Ising model, the line tension, X, is a linear 
function of temperature, T, and intersects zero at the 
critical point, such that X = }io(Tc-T)/Tc, where Tc is the 
(absolute) critical temperature. This trend is confirmed 
experimentally for 25:20:55 mol% of diPhyPC:DPPC: 
cholesterol, giving Xq = 30pN (Honerkamp-Smith et al. 
2008). The same physics was seen in the plasma mem¬ 
brane blebs obtained in Veatch et al. 2008, where the 
correlation length above Tc goes as ^ = ^o(Tc/(T-Tc)), 
and experiments show = 1 pm at 0.3°C above the 
critical temperature of 23.4°C, giving =1 nm- The 
parameters measured near the critical point allow the 
behavior to be extrapolated outside the measurement 
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range (i.e., where domains are too small to be resolved 
optically): Fluctuation sizes decay away from Tc, but 
not sharply and, at a physiological temperature (setting 
T-Tc = 13°C), ^ = 23 nm which is roughly a patch of 
25-by-25 phospholipids. 

Fluctuation Dynamics and the Role of Cytoskeletal 
Pinning 

Dynamics near critical points slows down: This is 
clearly seen in the diffusion coefficient D obtained 
from the autocorrelation function of density fluctua¬ 
tions; D falls near the critical temperature, in a manner 
consistent with the prediction of the 2D Ising model: 
D ^ with z = 2 (Honerkamp-Smith et al. 2008). 
It is also clear that protein components, in particular 
the cortical cytoskeleton, can couple to the lipid 
composition fluctuations, affecting domain formation 
in vivo (Machta et al. 2011). 

Consequences of Criticality on Cell Membranes: 
Coupling with Curvature and Lipid-Protein 
Interactions 

Lipid sorting within the plane is only very weakly 
coupled to curvature, and significant sorting is only 
observed for either very tight radii of curvature, or 
close to a critical point for demixing, where the free 
energy for sorting becomes negligible (Sorre et al. 

2009) . The relevance of the membrane composition 
being in the vicinity of the critical point is being 
noted in the biological literature (Honerkamp-Smith 
et al. 2009; Simons and Gerl 2010), and it is being 
related to the concept of lipid rafts. Lipid rafts repre¬ 
sent the well-known fact that biological membranes 
present domains enriched with particular lipids, and 
that this couples to partitioning or adhesion of specific 
proteins to those regions. The current definition of lipid 
rafts is small aggregates of a particular lipid or group of 
lipids on a cell membrane surface (Simons and Gerl 

2010) . They are thought to affect cell signaling and 
transport by inducing the co-localization of specific 
membrane proteins. Evidence for lipid rafts has been 
seen in eukaryotic (Friedrichson and Kurzchalia 1998), 
bacterial (Lopez and Kolter 2010) cells, and in 
virus assembly (Polozov et al. 2008). A general role 
of rafts is unclear, but specific examples have been 
demonstrated, for example, lipid domains are known 
to cluster membrane receptors, affecting signaling. 
Elucidating the biological consequences of this prox¬ 
imity to the critical point and looking for general 


principles of membrane protein regulation by 
lipid composition remain active areas of research. 
Lipid-protein interactions are poorly understood in gen¬ 
eral, but it is commonly thought that they are likely to 
regulate protein function. At the very least, the bilayer 
viscosity determines the motility of membrane-bound 
species, and the bilayer deformation induced by protein 
inclusions will lead to interactions between these inclu¬ 
sions, the details of which depend on the bilayer 
mechanics, which is a function of composition. More 
sophisticated regulation processes, linked to the 
multicomponent lipid nature of the bilayer, are also 
plausible: Composition fluctuations can couple to local 
curvature, and composition fluctuations could mediate 
protein interactions. In these conditions, temperature 
and composition are important control parameters. 

Cross-References 

► Atomic Force Microscopy 

► Atomic Force Microscopy (AFM) for 
Topography and Recognition Imaging at Single 
Molecule Level 

► Chemical Diversity of Lipids 

► Fluorescence and FRET in Membranes 

► Functional Roles of Lipids in Membranes 

► Hierarchically Structured Lipid Systems 

► Lipid Bilayer Asymmetry 

► Lipid Bilayer Lateral Pressure Profile 

► Lipid Domains 

► Lipid Lateral Diffusion 

► Lipid Signaling and Phosphatidylinositols 

► Lipid Trafficking in Cells 

► Lipidomics 

► Membrane Fluidity 

► Phase Transitions and Phase Behavior of Lipids 

► Pressure Effects on Lipid Membranes 

► Thermodynamics of Biomolecular Interactions 

► Thermodynamics of Lipid Interactions 
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Cross saturation (CS) and transferred cross saturation 
(TCS) are NMR methods for the identification of pro¬ 
tein residues in close contact with binding proteins or 
other large biomolecules. In the CS measurements of 
protein-protein interactions, protein I, with residues to 
be identified as the contact residues, is uniformly 
labeled with and and the non-labeled target 
proteins (protein II) are dissolved in a mixture of 10% 
H20/90% D 2 O. Under these conditions, the proton 


density in protein I is very low, because the protons 
in the ligand proteins are all exchangeable, and each 
exchangeable hydrogen site is occupied by *H at 
a probability of 10%. By irradiating the sample, using 
the radio frequency (RF) field with the resonant fre¬ 
quency of the aliphatic protons, the protons in protein 
II become saturated. The saturation is rapidly trans¬ 
ferred to the protons that are not directly irradiated 
with the RF field in protein II, such as the aromatic 
and amide protons. This phenomenon is well known as 
the spin diffusion effect. The saturation of the protons 
in protein II is also transferred to the protons in 
protein I, through the binding interface. The spin dif¬ 
fusion within protein I is inefficient, due to the low 
proton density, and thus the saturation transferred to 
protein I is limited to the protons close to the contact 
residues. The saturation of the protons in protein I at 
the binding interface is observed as the reduction of 
their NMR signal intensities. In practice, the 
HSQC spectra are recorded with and without irradia¬ 
tion of the aliphatic protons just before the HSQC 
pulse sequences, and their signal intensity ratios are 
calculated. Highly sensitive methyl groups can also be 
utilized as a probe for the cross-saturation experi¬ 
ments, by using proteins with extensive deuteration, 
except for the leucine and isoleucine methyl groups. 

The CS method makes it possible to determine the 
interfaces of protein complexes more precisely than the 
methods used previously, such as chemical 
shift perturbation and H/D exchange experiments. In 
addition, the combination of the CS and chemical shift 
perturbation methods provides a useful means of sepa¬ 
rating the direct interaction effects from the secondary 
effects, such as ligand-induced conformational changes. 

Under conditions with an excess amount of protein 
I relative to protein II and a fast exchange rate between 
the free and bound states of protein I, the CS effect that 
occurs in the bound state of protein I can be efficiently 
observed in the free state of protein I. This extended 
version of the CS measurement is termed transferred 
cross saturation (TCS). The TCS experiment is appli¬ 
cable to huge complexes, because it utilizes the NMR 
resonances originating from a free protein. In addition, 
only substoichiometric amounts of receptors relative to 
ligands are required. Therefore, TCS methods have 
been utilized for interaction analyses of membrane 
proteins, liposomes, and insoluble biomolecules that 
are biologically important, but difficult to analyze by 
conventional structural biology methods. 
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Cross-Correlated Relaxation 

Christian Griesinger 
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Definition 

Cross-correlated relaxation (CCR) as opposed to auto- 
correlated relaxation in nuclear magnetic resonance 
(NMR) describes a process caused by two different 
anisotropic nuclear Interactions that are modulated by 
the tumbling of the molecule with respect to the exter¬ 
nal static magnetic Bq field with the correlation 
time(s). The modulation of the two different interac¬ 
tions has to occur in a correlated fashion otherwise, the 
cross-correlated relaxation rate is small. The amount of 
correlation of the two interactions is expressed in the 
spectral densityThe most important anisotropic 
interactions are chemical shift anisotropy (CSA) and 
dipolar coupling (DC): 

2 

P = -rp = -hth, ^ J{u}^)[Al, [AJ‘‘,p\] (1) 

2 

with the initial rate solution for small A t: 

p{At) = —Atrp{0) 

2 

= -Athhi ^ [Al [AJ\ p(0)]] (2) 

q =-2 

As in auto-correlated relaxation, there is longitudi¬ 
nal and transverse cross-correlated relaxation. Due to 
the commutation properties of longitudinal order 
and A‘j need to contain Hermitian adjunct transverse 
operators for the same spin, since otherwise longitudi¬ 
nal order is transferred to partially transverse 
coherence. This means that spectral densities at 
<X)q = 0 are not sampled. Also longitudinal 


cross-correlated cross-relaxation can occur only with 
spectral densities different from 0, such as lllj llll 
which especially for macromolecules leads to much 
less efficient transfer than the conventional NOE; yet it 
has been used, for example, in triple quantum filtered 
NOESY (Nuclear Overhauser Spectroscopy) and 
ROESY (Rotating Frame NOESY). 

For coherences, however, cross-correlated relaxation 
produces similar rates to auto-correlated relaxation and 
has, therefore, found widespread application both as 
a parameter that reports on structure and dynamics as 
well as a tool that allows to optimize NMR spectra. 

Cross-Correlated Relaxation as a Parameter 

If A^ and A‘j are dipolar couplings (DC), the CCR rate 
depends on the intervector angle, the two fi-factors, and 
the correlation time. This allows for structural 
interpretation, for example, for the measurement of (j) 
(N-H^ cross-correlated with H'^-Ho() and \|/ (Nj-H'^ 
cross-correlated with C^.i-i-Hot) in proteins or the 
ribose pucker (Cj-H cross-correlated with Ci±i-II) in 
RNA. Dynamic averaging of CCR rates occurs similar 
to auto-correlated rates and can be used for the 
characterization of correlated motion even of distant 
vectors. 

If either of Aj. or A^ is a chemical shift anisotropy 
(CSA), the CSA needs to be known fairly well to 
extract structural or dynamic parameters. Applications 
are the measurement of correlation times in proteins 
(N-CSA cross-correlated with N-H”^) as well as the 
angles a and ^ in RNA. Further, if one knows the 
tumbling properties of the molecule under investiga¬ 
tion, CSAs can be determined from or cross-validated 
against CSA/DC CCR. If one of the “dipolar 
couplings” exists between an electron and a nucleus 
and the electron relaxes fast like for many metals, the 
Curie-DC-CCR leads to a similar relaxation as 
CSA-DC-CCR. 

Like any relaxation dominated by /(O), also 
transverse cross-correlated relaxation can be trans¬ 
ferred, that is, reports on the conformation of a light 
molecule when bound to a heavy molecule. 

Cross-Correlated Relaxation as a Transfer Tool 

Due to the formal identity of evolution of a density 
matrix under cross-correlated relaxation for time At 
Eq. 2 and evolution of a density operator under two 
different coherent interactions for 2 At Eq. 3, 
a CSA cross-correlated with a dipolar coupling is 
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comparable to J-evolution except for the phase as used 
in cross-relaxation-induced polarization transfer 
(CRIPT). 

p{2At)=-^-At^CkC,{[Ct,[C;\pm] 

+ [C,,[C,,p(0)]]} 

Two cross-correlated dipolar interactions are 
similar to the evolution of two J-couplings used in 
relaxation-allowed coherence transfer (RACT). 

Cross-Correlated Relaxation as a "TROSY"-Tool 

Transverse relaxation affects differently alternative 
coherences that can be used in pulse sequences. Major 
applications of this effect is the TROSY sequence that 
uses the CSA('^N/'^H)-DC('^N-^^H)-CCR in amides of 
proteins and iminos of RNA, the methyl-TROSY (i.e., 
the well-known heteronuclear multiple quantum corre¬ 
lation (HMQC) experiment) that uses the DC-DC-CCR 
between different C-H vectors as well as different H-H 
vectors in methyl-groups and also the SPITZE-HSQC 
which applies a similar technique to methylene groups. 

Cross-Correlated Relaxation Through Isotropic 
Parameters 

Mechanistically completely different but spectroscop¬ 
ically identical is the correlated modulation of isotro¬ 
pic parameters such as combinations of chemical 
shifts, scalar couplings, or even residual CSAs or resid¬ 
ual dipolar couplings. Here, the modulation does not 
occur through tumbling but through conformational 
exchange leading to cross-correlated relaxation of the 
second kind. Chemical shift-correlated modulation is 
observed in the different relaxation behavior of double 
and zero quantum coherences, and chemical shift and 
RDC correlations lead to the characterization of vector 
orientations in excited states. 
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Definition 

Cryogenic NMR probes allow a drastic enhancement 
of the sensitivity in NMR experiments. This allows 
either an essential reduction of experiment times or 
a reduction of the required sample amount. 


Introduction 

NMR spectroscopy has established itself as one of 
the most important experimental techniques in chem¬ 
istry, physics, and structural biology to obtain infor¬ 
mation on molecular level. The signals in an NMR 
spectrum that appear at the resonant frequencies of 
the different atomic nuclei can be analyzed with 
respect to their chemical shift, splitting (hyperfine 
interaction), and line width. The spectral analysis 
provides detailed information both on topology, 
dynamics, and three-dimensional structure of mole¬ 
cules. This makes NMR an extremely powerful and 
versatile analytical technique. The applications 
cover a broad range, including the study of small 
molecules such as natural products and chemical 
compounds, as well as three-dimensional structure 
determination of biomolecules such as proteins 
and nucleic acids, and investigations of 
intermolecular interactions, for example protein- 
ligand interactions. 

However, historically the detriment to NMR spec¬ 
troscopy has been its low sensitivity. This can he 
alleviated through signal averaging which, however, 
results in experiment times from minutes to several 
days. The sensitivity of the NMR-spectrometer is pri¬ 
marily limited by thermal noise in the signal detection 
pathway. By lowering the temperature of the 
radiofrequency coil which detects the NMR-signals 
of the atomic nuclei, a reduction of the thermal noise 
in the receiver circuitry is achieved. Together with 
a cryogenically cooled preamplifier electronics, this 
leads to a sensitivity enhancement hy a factor of up to 
four or higher, and consequently to shorter experiment 
times and a lower detection limit. Although the 
radiofrequency coil of a cryogenically cooled prohe 
is kept at a temperature of about 20 K, the sample of 
investigation stays at a user-defined temperature, typ¬ 
ically in the range of 0°C to -i-80°C. Cryogenically 
cooled NMR probes for solution state NMR are com¬ 
mercially available since more than 10 years, and they 


have become indispensable for biomolecular NMR 
applications where the concentrations are typically 
low (Kovacs et al. 2005). 

While the enhancement of the sensitivity and con¬ 
sequently a reduction of the experiment time or sample 
amount is the primary advantage of cryogenically 
cooled NMR probes, care has to be taken not to com¬ 
promise the probe performance in any other respect. 
Particular experimental challenges for NMR in biomo¬ 
lecular applications are excellent solvent suppression, 
resilient power handling of radiofrequency pulses, and 
good sensitivity for samples in solutions with high 
ionic strength. 

One example of the challenges in power handling is 
the *^N relaxation dispersion NMR experiment 
(Hansen et al. 2008). This experiment is a powerful 
tool to obtain high-resolution structural information 
about dynamic processes such as conformational 
exchange events on the millisecond timescale. 
A constant mixing time is realized by a CPMG spin 
echo during which the number of *^N pulses is varied 
in order to generate effective spin rotations defined by 
yBi; these are typically in the order of 0-2 kHz. While 
the spin rotation frequency of the yBi field is modu¬ 
lated by altering the number of *^N pulses during the 
experiment, the temperature of the rf coils has to stay 
constant in order to avoid changes of the pulse flip 
angle and, thus, unwanted signal modulations. Further 
experiments which require excellent power handling 
are experiments with a very short repetition rate, such 
as the so-fast HMQC (Schanda and Brutscher 2005) or 
BEST triple resonance experiments (Lescop et al. 
2007). Figure 1 shows a 3D triple resonance BEST- 
HNCO spectrum, which was recorded using 
nonuniform sampling. The total experiment time for 
recording the complete 3D spectrum was less than 
10 min only. 


c 


Cryogenic Probe Options for 
Biomolecular NMR 

In order to cover the needs of different kinds of 
biomolecular applications, a variety of cryogenically 
cooled probes has been introduced over the years. 
With the focus on biomolecular applications, 
details of the applications for different types of cryo¬ 
genically cooled probes will be described in the 
following. 
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Cryogenic NMR Probes, Fig. 1 A 3D BEST-HNCO triple 
resonance experiment recorded with a 200uM *^C,'^N-labeled 
ubiquitin sample at 850 MHz with a triple resonance CryoProbe. 
The spectrum was recorded with nonuniform sampling with 25% 
sparse sampling points. The BEST-approach allows a reduction 
of the repetition time and thus, a further reduction of the exper¬ 
iment time. With a 100 ms relaxation delay the total experiment 
time was only 9 min 

Probe Options: Triple and Quadruple 
Resonance Inverse Probes 

The triple resonance TCI CryoProbe is the classical 
cryogenically cooled probe for the studies of proteins, 
as it enables all triple-resonance experiments, and 
allows in addition ^H-decoupling. It is a so-called 
“inverse” probe as the inner coil (observe coil) is 
tuned to ’H. The TCI CryoProbe is also equipped 
with a cryogenically cooled '^C-preamplifier to maxi¬ 
mally enhance the *^C-sensitivity, which is mandatory 
for *^C-detected bio-NMR experiments. The scope of 
research in many laboratories might, however, not be 
limited to the study of proteins but may also 
comprise, for instance, investigations of nucleic acids 
(DNA/RNA) or protein-ligand interactions by means 
of ^^F-detection. With the introduction of the QCIP 
and QCIF probes the versatility of the available cryo¬ 
genically cooled probes has been significantly 
increased. 


The option QCIP CryoProbe has an additional 
^*P-channel and, thus, enables not only triple reso¬ 
nance protein applications but also experiments on, 
for instance, labeled RNA samples where simulta¬ 
neous excitation of ^'P and *^C is required. An exam¬ 
ple for the usage of the QCIP CryoProbe for RNA 
(Richter et al. 2010) is shown in Fig. 2. For protein- 
ligand binding studies the resonance assignment of 
a protein is a prerequisite for the mapping of the 
binding position of a ligand. This requires the classical 
techniques combined with triple resonance cryogeni¬ 
cally cooled probes. However, investigations of pro¬ 
tein-ligand interactions and also screening to localize 
potential ligands for a given protein rely increasingly 
on '^F-detection, as a large fraction of the compounds 
in ligand libraries contain fluorine. With the option 
QCIF CryoProbe equipped with an additional *®F 
channel, the resonance assignment and the binding 
studies relying on ^^F-detection can now be performed 
with a single probe. Such cryogenically cooled probes 
allow *^F-based screening with femtomolar levels of 
target enzyme only, that is, concentrations similar to 
those required for traditional high-throughput screen¬ 
ing methods (Dalvit et al. 2005). 

Probe Options: Cryogenic Probes for Small 
Sample Volumes 

The mass sensitivity of a cryogenically cooled probe can 
be further enhanced by reducing the diameter of the rf 
coil. Cryogenic probes using sample tubes with a 1.7 mm 
sample diameter and 30ul volume are commercially 
available. With such a probe the sensitivity gain is typ¬ 
ically 16-fold compared with a 5 mm standard probe 
operating at room temperature. Those probes are attrac¬ 
tive for mass limited samples and are often used for 
natural products analysis. However, they can also be 
very attractive for a number of biomolecular applica¬ 
tions, such as NMR studies of proteins that are difficult 
to express, or for high-throughput protein-ligand screen¬ 
ing in order to minimize the amount of protein to be used. 

Probe Options: Cryogenic Broadband 
Observe Probes 

Triple resonance probes with proton, carbon, and nitro¬ 
gen channels are the most commonly used probes for 
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Cryogenic NMR Probes, Fig. 2 Overlay of two 2D 
correlation spectra recorded on a 14-mer *^C,'^N-labeled RNA 
using a QCIP CryoProbe operating at 600 MHz. The spectra 
allow sequential assignment of the nucleic acids in the 14-mer 
RNA hairpin. The HCPC spectrum (depicted in black color) 
shows the connectivities from a phosphor-31 to the two flanking 
ribose C’4 positions (over three-bond *^C-^’P coupling of 
11 Hz) and to the nearest ribose 5' and ribose 3' positions (over 
two-bond *^C-^’P couplings of 6 Hz and 4 Hz, respectively). 


The HCPC experiment {black) relies on INEPT type of transfer 
between phosphor-31 and carbon-13 whereas the HCPCC exper¬ 
iment (gray) relies on a heteronuclear spinlock for *^C-^'P 
transfer followed by a homonuclear '^C-’^C spinlock. The 
i3C-3ip spinlock was of strength = 1,300 Hz and 

length = 100 ms, and the '^C-*^C spinlock was of 

strength = 8 kHz and length = 5 ms. The ID ^*P NMR spectrum 
displayed on the vertical axes also illustrates the naiTow disper¬ 
sion (4 ppm) of the ^*P NMR signals in RNA 


biomolecular NMR spectroscopy. However, the detec¬ 
tion of less-common nuclei can also reveal insights 
into biomolecular structure, for instance, in the char¬ 
acterization of metal-binding sites in metalloproteins. 
Cryogenically cooled broadband probes are useful for 
such applications and allow a signihcant reduction of 
experiment times. In these cryogenic probes, the inner 
coil with highest sensitivity (observe coil) is tuned to 
the selected heteronucleus whereas the outer 
coil (decoupler coil) is tuned to *H. The cryogenic 
broadband probes are available in two different 
versions: a helium-cooled option operating at about 
20 K and a nitrogen-cooled version (BBO Prodigy), 
where the radiofrequency coil is kept at a temperature 
of about 80 K. The latter probe has about half of 
the sensitivity of the former, but, nevertheless, 
at least double the sensitivity compared to a 
corresponding conventional probe operating at 
ambient temperature. Figure 3 shows an example of 
the identification of metal-coordinating cysteines in a 
seven cadmium coordinating metalloprotein by means 
of *H, “^Cd chemical shift correlation experiment 
(HMQC). In the vertical project axis of the two- 
dimensional spectrum a one-dimensional direct- 
detected ^*^Cd-spectrum is shown (Digilio et al. 2009). 


Probe Options: Triple Resonance Probes 
Optimized for Carbon Detection 

The set of NMR experiments commonly used for 
structure determination of proteins is largely based on 
direct *H detection. However, ^^C-detected experi¬ 
ments provide valuable alternatives. Publications of 
'^C-detected experiments have traditionally focused 
on the advantages of ^^C-detection in case of paramag¬ 
netic proteins where ^H-signals are often broadened 
beyond detection. The '^C-signal broadening is 
less severe than that of the protons because of the 
smaller gyromagnetic ratio of *^C. An additional 
and often underestimated advantage of *^C-detection 
is the enhanced resolution due to higher 
chemical shift dispersion of The narrow line 
width and high signal dispersion together with 
the high carbon sensitivity of cryogenically cooled 
probes make *^C-detected experiments for biomole¬ 
cules today a competitive alternative to proton 
detection. 

For the fast acquisition of multidimensional exper¬ 
iments a number of different new techniques have been 
developed during the past years. The experiments 
optimized for longitudinal relaxation in combination 
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Cryogenic NMR Probes, 

Fig. 3 A2D'H-'‘^Cd 
HMQC spectrum of the metal¬ 
binding protein Cdv-MTIO 
metallothionein was recorded 
with a broadband observe 
CryoProbe operating at 
500 MHz. The 2D ‘H-'''’Cd 
HMQC shows the 
connectivities of the seven 
’ *^Cd ions to cysteine Ha and 
HP proton resonances and 
thus, allows the 
characterization of the metal¬ 
binding of Cdv-MTIO 
metallothionein. The ID *'^Cd 
NMR spectrum displayed on 
the vertical axes also 
illustrates the wide dispersion 
(210 ppm) of the seven '*^Cd 
NMR signals (Courtesy of 
Dr. Giuseppe Digilio) 



with nonuniform sampling offer an additional reduc¬ 
tion of experiment times for *^C-detected experiments 

(Bermel et al. 2009). 
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Synonyms 

Global-fit analysis of overall conformation of IgG; 
Hydrodynamics of macromolecules; Immunoglobulin G 


Definition 

Scheme, and constructed tools, for the determination 
of the overall structures of complex Immunoglobulin 
G structures from the atomic-level information of each 
of the domains. 


Introduction 

The determination of the overall conformation of 
Immunoglobulin G, composed by domains or subunits, 
using easily available solution properties is possible. In 
a multi-scale approach, atomic-level structures are 
used to provide simple shapes for the subunits, which 
are put together in a coarse grained model, with a few 
parameters that determine the overall shape of the 
antibody. 

Since the pioneering work of Bloomheld et al. 
(1967), bead models have overcome the limitations 
of the simplest models, becoming the usual choice for 
the prediction of solution properties or structural deter¬ 
mination of rigid macromolecules of arbitrary shape 
(Byron 2000; Garcfa de la Torre et al. 2000a; Garcia de 
la Torre and Carrasco 2002; Garcia de la Torre et al. 
2000b). Bead modeling nowadays has reached the 
atomic scale, which is remarkably helpful for studying 
the influence of structural details in the solution prop¬ 
erties of IgG. However, there are instances where 


atomic-level calculations are not possible because of 
lack of crystallographic or ► NMR determinations 
(Harris et al. 1998a, b) or, simply, atomic details may 
be relatively unimportant for their behavior in solution. 
In the case of antibodies, the overall size and shape of 
the subunits and, essentially their spatial arrangements, 
are the aspects that determine hydrodynamic or solu¬ 
tion scattering properties. 

Crystallohydrodynamics appears as the combina¬ 
tion of hydrodynamics and crystallography, 
a procedure in which simple ellipsoidal or cylindrical 
models (spheres, ellipsoids, and cylinders) are used to 
represent the subunits, thus putting emphasis on their 
spatial arrangements for building the whole structure 
of the antibody. An early form of this approach was 
reported in 1990 with the first demonstration that the 
antibody IgE was cusp-shaped in solution (Davis et al. 
1990). This is a “meso-scale” structural modeling and 
in a first step the solution properties of the isolated 
subunits are used to determine the dimensions of 
the subunits (axes of revolution ellipsoids, or length 
and diameter of cylinders). When the high-resolution, 
atomic-level structure of the individual subunits 
is available, such information is employed to deter¬ 
mine the dimensions through a fit of a whole set 
of solution properties predicted from the atomic struc¬ 
ture (Garcfa de la Torre et al. 2000a) and thus the 
resulting ellipsoidal or cylindrical subunit can 
be safely regarded as the hydrodynamically equivalent 
particle. This could be done from experimental data of 
solution properties of the isolated subunits 
(Carrasco et al. 1999), but it may happen that data 
are insufficient, imprecise, or simply unavailable. 
Once the dimensions of the individual subunits 
are known, they are joined to construct the coarse¬ 
grained model with a similar fitting procedure as the 
one employed for the determination of their 
dimensions. 

The step in which the geometric parameters are deter¬ 
mined, either for individual subunits or for the whole 
particle, is crucial and could be done following classical 
dimensionless quantities resulting from the combination 
of two solution properties (Tanford 1961). But more 
recently a different approach has been devised 
(Ortega and Garcfa de la Torre 2007), based on equiva¬ 
lent radii and ratios of radii for the global fit of 
a collection of properties to ellipsoidal, cylindrical, or 
worm-like models. Properties are calculated using 
programs included in the ► HYDRO suite of algorithms 
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and the global fitting procedure is devised via the 
computational tools ► HYDFIT and ► HYDROFIT, 
which perform the stmctural optimization. 

The arrangement of the subunits in the case of IgG 
antibodies is an essential determination, apart from 
their antigen-binding capabilities. Crystallographic 
structures of fhe antibody, particularly those with 
large hinges are difficult to obtain, as is well known, 
and this problem was previously addressed using, for 
example, ► HYDRO (Garcia de la Torre et al. 1994), 
and the first versions of ► HYDROSUB (Garcia de la 
Torre and Carrasco 2002). Nevertheless, modern 
implementation of this approach makes the structural 
determination of the antibody global structure feasible, 
in a simple and systematic way, making an adequate 
treatment of experimental values of solution 
properties. 


Methodology 
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Any pair of radii can be combined in a ratio of radii, 
XY = Oxlay. For a spherical particle, all the equivalent 
radii are obviously equal to the radius of the sphere, 
and all the ratios are equal to unity. For any other 
structure, the various radii are not the same, but do 
not differ too much, and the ratios depart slightly from 
unity, being dependent on the overall conformation but 
independent of the absolute size. 

These concepts have been implemented in a scheme 
that measures the agreement between an assumed 
structure and the observed properties of 
a macromolecule. Such agreement can be quantified 
in terms of the equivalent radius by means of quantity: 






ajc{cal) — a^-(exp)1^ 
«x(exp) 


( 1 ) 


The prediction of properties for the individual subunits 
from atomic coordinates (PDB files) is easily accom¬ 
plished using ► HYDROPRO (Garcia de la Torre et al. 
2000a). The determination of the dimensions of the 
ellipsoids of revolution or cylinders from the solution 
properties of the subunits is done using ► Single- 
HYDFIT (Ortega and Garcia de la Torre 2007) based 
on the global fit concept described in the next section; 

► HYDROSUB is then available for the shell-model 
calculation of properties of the whole particle com¬ 
posed by simple symmetrical subunits and finally 

► HYDROFIT is the tool developed for the determi¬ 
nation of the spatial arrangements of these subunits via 
the minimization function. 

The joint analysis of a set of various (conforma¬ 
tional and hydrodynamic) observable properties in 
terms of structural features of the particles is 
performed through the form of an equivalent radius, 
ax, for each property. It is dehned as the radius of 
a sphere that would have the same value for that 
property. These radii depend on size and shape in 
a more uniform manner than the properties themselves. 
For the solution hydrodynamic or light scattering prop¬ 
erties, namely, the radius of gyration, Rq, the transla¬ 
tional diffusion, D,, or equivalently, the sedimentation 
coefficient, s, and for other properties, the equivalent 
radii can be defined with straightforward equations. 
Some examples are shown below: 


Note that is the mean square relative deviation 
between the calculated and experimental equivalent 
radii. If there is a special reason to do so, the average 
can include different weights for each property; 
otherwise they can be the same for all the properties 
or, in other words, neglected. Note also that lOOv^A^ is 
the percent deviation of the calculated equivalent radii 
from the experimental value. 

The goodness of fit of the assumed structure can 
also be determined from the ratios of radii, as measured 
by the quantity: 

V^=(Y.Wxy] Y. WxY[XY{cal) - Yy(exp)]2 
\XY J XY 

( 2 ) 

which is the mean square difference between the cal¬ 
culated and experimental ratios of radii. Again, the 
weights WxY are optional. Now, lOO'/X is the percent 
deviation of the calculated ratios of radii from the 
experimental values. A and V are the target functions 
to be minimized in the search of a structure that hts best 
a set of solution properties. 

In the present multi-scale strategy, ► HYDROFIT 
is employed to evaluate the various conformations 
considered for the whole particle, comparing the 
► HYDROSUB results with the experimental one in 
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Crystallohydrodynamics of 
IgG, Fig. 1 Scheme of the 
procedure to determine the 
overall conformation of multi¬ 
subunit structures. The names 
of computational tools 
involved in each step are in 
italics 



order to find the optimum structure, while the task of 
processing, in a single run, a large number of different 
structures can be performed with the programs 
MULTIHYDRO and MULTISUB. A scheme of the 
multi-scale approach is displayed in Fig. 1 . 

Crystallohydrodynamics: Determination of 
the Conformation of Antibodies 

An example of the determination of the conformation 
of an antibody IgG is presented here. In the first stage. 


the dimensions of the hydrodynamic ellipsoidal of 
revolution models that will represent the Fab and Fc 
subunits in the coarse-grained model of the whole 
molecule are searched. Then, the arrangement of the 
subunits in the whole molecule will be determined. 

As experimental data for the Fab and Fc fragments 
are scarce, only limited information is available for the 
determination of the two parameters of the ellipsoids 
of revolution, namely, the semi-axes a and b. 
This information can be extracted instead from the 
properties calculated from the detailed atomic model 
(crystallographic structure), evaluated with the 
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Crystallohydrodynamics of IgG, Fig. 2 Contour plot of the target function A versus p and a. Note that the scale for p is 
logarithmic, in order to display well both the prolate and the oblate regions, (a) Fab; (b) Fc 


Crystallohydrodynamics of IgG, Table 1 Results for the ellipsoidal models of Fab and Fc, and comparison of the calculated 
properties with the experimental values 



Fab 

Fc 

PDB accession numbers 

IBBJ 

IFCl 

HYDROPRO: a, h (A); p/(A) 

43.7,24.9; 1.80/0.043 

14.5, 43.0; 0.32/0.027 

Experimental: M (g/mol), v (cm^/g) 

47500, 0.725 

51000, 0.732 

s (S): ellipsoid/experimental 

3.58/3.6, 3.92 

3.63/3.85 

R(} (A): ellipsoid/experimental 

25.1/26.9, 28.2 

26.9/29.3 


Obtained from Byron (2000), Carrasco et al. (1999, 2001), and Lu et al. (2007) 


program ► HYDROPRO, so that the ellipsoid will be 
hydrodynamically equivalent to the crystallographic 
model. 

In addition to 5 and Rq, several other properties can 
be selected: the diffusion coefficient D,, the intrinsic 
viscosity [ 77 ], and the longest relaxation time Xj, the 
covolume u and the longest chord, L^ax, calculated by 
► HYDROPRO from the PDB files extracted from the 
Protein Data Bank, and used as the input for ► Single- 
HYDFIT, which is first run in the mode p = alb as the 
only parameter, in an optimization of the size-indepen- 
dent, shape (/))-dependent ratios of radii, based on the 
minimization of the V target function (Eq. 2). Single- 
HYDFIT reports the absolute minimum, but it may 
also provide the full variation of V with p. 

Next, Single-HYDFIT is run in the two-parameter 
mode, that minimizes A (Eq. 1) as a function of p and 


a, giving their best values, and that oih = dp. Figure 2 
depicts the contour plots of A versus p and a, showing the 
absolute minimum and the secondary one. The resulting 
dimensions of the ellipsoids listed in Table 1 are the ones 
to be used in the coarse-grained model. Finally, the 
available values of experimental properties are used. 
As mentioned above, such values have not been used in 
the determination of the ellipsoid dimensions, but it 
should at least be checked a posteriori whether the 
deduced dimensions are compatible with them. So, 
Table 1 gives the values of Rq and 5 for the ellipsoids 
that result from the analysis, along with the experimental 
one. The typical difference is about 5%, similar to the 
expected experimental uncertainties of these properties. 

Once the dimensions of the Fab and Fc fragments 
are obtained, the whole IgG can be modeled. The full 
structure consists of two prolate ellipsoids that 
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Crystallohydrodynamics of IgG, Fig. 3 Geometry of the 
coarse-grained model for whole IgG. Dots are subunit centers, 
and dotted lines correspond to the main (revolution) axis of the 
subunits 

represent Fab’s (a^h = 43.7 A and hah = 24.9 A), one 
oblate ellipsoid that represents the Fc fragment 
(flc = 14.5 A and h^ = 43.0 A), and a rod representing 
the hinge with a length L/, (see Fig. 3), which is going 
to be determined in the following procedure. The 
diameter of this hinge is fixed arbitrarily at 7.5 A, 
taking hydration of the intertwined polypeptide chains 
into account. 

The properties for this array of four subunits will be 
calculated using ► HYDROSUB, for which the struc¬ 
tural information is a set of values, for each ellipsoidal 
or cylindrical subunit, of coordinates (x,, _y„ z,) from 
their centers, and the polar and azimuthal angles, 6, and 
(jtj that specify the orientation of the main (revolution) 
axis of the ellipsoid or cylinder, all referred to an 
arbitrary, conveniently chosen system of axes. 

Figure 3 shows how the different subunits are placed 
in the system of axes chosen for this case. We restrict the 
search of the Fab orientations in this example to 
a symmetric arrangement of the Fab subunits, with 
the aim of reducing the number of parameters; thus we 
assume the angle between the long axes of the two Fabs, 
fi, bisected by axis z. The coarse grained model for the 
overall stracture of antibody contains up to three 


Crystallohydrodynamics of IgG, Table 2 Results for the 
IgG3wt and IgGml5 best fitting models, experimental data 
from Byron (2000); Cairasco et al. (1999, 2001), and Lu et al. 
(2007) and calculated properties are shown 



IgG3wt 

IgG3ml5 

M, g/mol 

1,58,000 

1,50,000 

V, cmVg 

0.730 

0.730 

Ao.w, S (exptal) 

6.11 

6.77 

Ro, A (exptal) 

77.7 

51.5 

[rj], cmVg (exptal) 

9.9 

5.7 

Dmiix, A (exptal) 

195 

165 

L;„ A (exptal) 

92 

0 

P, degrees 

212 

40 

Ao.w, S/%dif 

5.94/2.8 

6.73/0.6 

Rg, A/%dif 

73.8/5.0 

50.8/1.3 

[rj], cm^/g/% dif 

10.8/-9.0 

6.13/-7.5 

A/% dif 

201/-3.1 

175/-6.1 


parameters; the hinge length, L*, the angle /i between 
the Fab fragments and (j), which represents the angle 
defined between the equatorial plane of Fc, and the 
plane defined by the long axes of the Fab fragments. 

The methodology for the determination of the overall 
stracture of two distinct IgG3 molecules, the wild type 
antibody IgG3wt, and mutant IgGml5 is presented here. 
Their structures have already been solved by previous, 
more complex approaches. Intensive experimental work 
on these and other antibodies has provided data for 
various solution properties: sedimentation coefficient, s, 
intrinsic viscosity, [rj], and X-ray scattering results for 
the radius of gyration, Rq, and the longest distance, D„,ax. 
The experimental data are listed in Table 2. 

According to the multi-scale calculation and 
global-fit structural search the best-fitting structure is 
determined by a ► HYDROFIT analysis of the exper¬ 
imental properties along with ► HYDROSUB for 
a number of structures, with varying parameters, 
whose data files are generated with MULTISUB. It is 
quite straightforward to implement the simple geomet¬ 
rical conditions that give the centers and orientations 
of the subunits within the MULTISUB code, whose 
output files are the input files for a multi-case 
► HYDROSUB calculation. 

Application 

Using MONTESUB and ► HYDROSUB, the solution 
properties of the IgG model were evaluated as a function 






















402 


Crystallohydrodynamics of IgG 



Crystallohydrodynamics of IgG, Fig. 4 Contour plot of the target function A versus the model parameters for the two IgG3 
antibodies. Inserts are views of the best fitting structures, (a) IgG3 wild type; (b) IgG3 ml5 mutant 


of L/, and /f. The set of results is compared to the exper¬ 
imental properties of, first, IgG3wt and next, IgGml5, 
using ► HYDROFIT to determine the best-fitting 
parameters for each antibody. ► HYDROFIT reports 
also a list of values of the target function (Eq. 1), from 
which it draws the contour plots presented in Fig. 4. 

For the wild type IgG3 antibody. Fig. 4a shows the 
existence of a unique structure that hts the experimen¬ 
tal data, whose main features are long hinge with 
Lf, « 80-100 A and the P angle, close to « 200°, 
which gives a typical “T”-shaped conformation. 
For this structure, A = 0.035, indicating that the typical 
difference between experimental and calculated equiv¬ 
alent radii is only 3.5%. Table 2 lists the solution 
properties calculated for this structure, which differ 
from the experimental values by only a few percent. 

For the hingeless, ml5 mutant, the outcome from 
► HYDROFIT has to be regarded carefully. As hinted 
above, in addition to the main results file that reports 
the lowest A structure, one should also take a look at 
the list of A values as a function of the parameters, 
which produces the contour plot in Fig. 4b. Inspection 
of such results reveals that in this case there are several 
minima with nearly the same depth. One is the 
expected structure with L « 0, in which the disposition 
of the Fab subunits gives a “Y”-shaped conformation. 
Again, the agreement of lgGml5’s calculated proper¬ 
ties with the experimental values is as good as for 


IgG3wt, as shown in Table 2. But the ► HYDROFIT 
analysis produces also other unrealistic but “numeri¬ 
cally” acceptable structures, with a nonexistent hinge 
and the Fab fragments strangely folded toward Fc. So 
this example provides also a good example of the 
limitations and cautions necessary for - not just this 
specific methodology - but for any global fitting 
structure. 

Summary 

Crystallohydrodynamics is the generic name for 
a scheme, and constructed tools, developed for the 
determination of the overall structures of complex 
(typically multi-domain) structures. It is a multi-scale 
procedure; in the spirit of what has been named 
“crystallohydrodynamic” approach, atomic-level 
information is employed to assign values to some of 
the parameters of the coarse-grained model. The most 
important parameters that determine the overall con¬ 
formation are obtained by fitting of experimental data. 
Evolving from this line of study ► HYDROFIT has 
now been developed, which provides a simple and 
useful tool for structural optimization. 

Antibody molecules are considered excellent cases 
for the application of these procedures. For the wild 
type and mutant IgG antibodies treated here. 
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essentially the same structures as obtained in previous 
studies (Lu et al. 2007) have been obtained with this 
latest and more simple form of the “crystallohydro- 
dynamics” approach introduced earlier. 

The present procedure does not require an estimate of 
protein hydration § (g/g), which is included in the data 
generated with ► HYDROPRO from the atomic struc¬ 
ture of the subunits. Also, the classical but somewhat 
cumbersome Perrin, Mittelbach, Scheraga/MandeUcem, 
etc., shape functions are not employed; the SOLPRO 
program is not needed, and instead the optimizations 
are done directly with the experimental properties. 
Thus, the subunit dimensions (including hydration) are 
obtained directly from experimental properties by 
► HYDFIT global analysis, and the calculations and 
global fitting are carried out in an automated and efficient 
way combining multi-case calculations using, for 
instance, ► HYDRO and ► MULTIHYDRO, or 
HYDROSUB and MULTISUB with the newly devel¬ 
oped ► HYDROFIT for stmctural optimizations. 


Computer Programs 

The program ► HYDROFIT and all the other pro¬ 
grams mentioned in this work are of public domain, 
and can be downloaded from http://leonardo.fcu.um. 
es/macromol/. 


Accession Numbers 

Atomic coordinates have been deposited in the Protein 
Data Bank under ID codes IBBJ (Human IgG3 Fab 
fragment) and IFCI (Human IgG3 Fc fragment). 


Cross-References 

► Dynamic Light Scattering 

► Electron Microscopy 

► HYDFIT and Related Packages for Linear Molecules 

► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules 

► Multiangle Light Scattering from Separated 
Samples (MALS with SEC or EEE) 

► Sedimentation Velocity Analytical 
Ultracentrifugation 

► X-Ray Scattering of Lipid Membranes 
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Cupredoxins 
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Synonyms 

Blue copper proteins; Type 1 copper proteins 

Definition 

Cupredoxins are single domain P-barrel proteins that 
bind a mononuclear type 1 copper redox site and are 
involved in inter-molecular electron transfer reactions. 

Basic Characteristics 

Whenever nature uses copper to catalyze an electron 
transfer (ET) reaction it is nearly always encased in 
what has become known as a cupredoxin fold (Gray 
et al. 2000; Dennison 2005). Cupredoxins are blue 
copper proteins; so called as they have an intense 
blue color due to the presence of a mononuclear type 
1 (Tl) copper site. The term “cupredoxin” was intro¬ 
duced, by comparison to ferredoxins used for ► iron- 
sulfur cluster containing ► electron transfer proteins, 
to refer to single domain Tl copper-containing ET 
proteins. Type 1 copper sites are also present in 
a number of multi-copper redox enzymes including 
laccases and nitrite reductases. More than ten distinct 
families of cupredoxins are now known and well- 
studied examples include the plastocyanins and 
stellacyanins from plants and the bacterial azurins 
and amicyanins. In most cases cupredoxins are thought 
to act as ET shuttles between redox enzymes and pro¬ 
teins. Oxidized Tl copper sites have two unusual 


spectroscopic properties compared to Cu(II) com¬ 
plexes: intense blue color and a small hyperfine cou¬ 
pling constant in their electron paramagnetic 
resonance (► EPR) spectra (Solomon et al. 2004). 
These properties, along with reduction potentials 
(► redox potentials) higher than that of the Cu(II)/Cu 
(I) aqua couple (usually ^300 mV but values range 
from ^200 to 700 mV) have resulted in cupredoxins 
fascinating biological chemists for many years. 

Significant progress in understanding the properties 
of a Tl copper site was provided by the first crystal 
structure of a cupredoxin (plastocyanin) in 1978 
(Colman et al. 1978). A large number of cupredoxin 
structures have subsequently been solved, making 
them one of the best structurally characterized families 
of metalloproteins (Eig. 1). A cupredoxin domain con¬ 
sists of a P-barrel fold with the copper bound at one 
extremity. A Tl copper site always involves strong 
coordination from a Cys and two His residues in an 
approximately trigonal planar arrangement, with 
a weaker axial Met usually present. The strong Cu-S 
(Cys) bond has been identified as being responsible for 
the key properties of a Tl copper site. The intense blue 
color arises from a band at approximately 600 nm due 
to a S(Cys)7t ^ Cu(II)dx 2 -y 2 ligand-to-metal charge 
transfer (LMCT) transition, while delocalization 
of spin density onto the thiolate results in the small 
hyperfine coupling constant (Solomon et al. 2004). 
Variations in Cu(II)-S(Cys) bonding, concomitant 
with changes in the Cu(II)-S(Met) interaction, have 
been correlated with spectroscopic alterations, and 
particularly the intensity of a second LMCT band 
involving the coordinating Cys [S(Cys)CT ^ Cu(II) 
dx 2 -y 2 ] (Solomon et al. 2004). In the stellacyanins an 
axial Gin coordinates via its side-chain amide oxygen 
(Fig. 1). In some cases a Val, Leu, or Phe is found in 
this position resulting in a three-coordinate copper site 
that is particularly prevalent in the fungal laccases. The 
azurins have a second weak axial interaction provided 
by a backbone carbonyl oxygen atom. The ligands and 
geometry at Tl copper sites are a compromise of those 
preferred by Cu(II) and Cu(I), which along with the 
rigid nature of a (I-barrel results in a constrained site 
that changes very little upon redox inter-conversion, 
thus minimizing reorganization energy and facilitating 
rapid ET (Gray et al. 2000). 

The development of Molecular Biology approaches 
in the 1980s facilitated more detailed investigations of 
the relationship between the structure and function of 
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Cupredoxins, Fig. 1 The structures of (a) plastocyanin 
(IPLC), (b) azurin (IJZF), and (c) stellacyanin (IJER). The 
side-chains of the coordinating residues are shown as stick 
models, the ligand-containing loops as C” traces, and the copper 


ions as red spheres. The backbone carbonyl oxygen of Gly45, 
which provides a second weak axial interaction at the active site 
of azurin, is omitted in (b) 


cupredoxins. Site-directed mutagenesis was initially 
used to investigate the role of the ligands and surface 
features. The Cys was found to be the only ligand 
essential for a site with T1 properties. Many mutagen¬ 
esis studies concentrated on the nonconserved axial 
Met ligand. Naturally occurring mutations at this posi¬ 
tion have been found to tune the reduction potential by 
^350 mV while only influencing ET reactivity by ^1 
order of magnitude (Yanagisawa and Dennison 2005). 
A hydrophobic patch adjacent to the active site, 
through which one of the His ligands protrudes, was 
found to be the key surface feature for association with 
ET partners. Certain cupredoxins possess additional 
surface patches, and their importance for interactions 
has also been extensively investigated by site-directed 
mutagenesis. 

The T1 copper site structures found in most 
cupredoxins are very similar, yet reduction potentials 
can vary widely (from ^200 to 700 mV). The second- 
coordination sphere of the metal has to play 
a signihcant role in tuning the reduction potential and 
more recent mutagenesis studies have focused on this 
aspect of the active site. One approach that has been 
particularly useful in this regard is loop-directed muta¬ 
genesis. Three of the four canonical ligands at a T1 
copper site are found on a single loop whose length, 
sequence, and structure varies in the different families 
(Fig. 1). Using naturally occurring loop sequences 
it has been possible to recreate a number of 


second-coordination sphere features, and thus mimic 
key properties including the reduction potential 
(Dennison 2008). For example, the introduction of 
the amicyanin loop into azurin lowers the reduction 
potential from 295 to 261 mV, which is very similar to 
the value for amicyanin. The equivalent experiment 
with the plastocyanin loop results in an increase in the 
reduction potential to 362 mV, which is in the range 
observed for plastocyanins (Li et al. 2007). 

The hydrogen bonds to the Cys ligand are thought to 
be a particularly important aspect of the second- 
coordination sphere of a T1 copper site. Azurin has 
two hydrogen bonds to the coordinating thiolate pro¬ 
vided by the backbone amides of Asn47 (an interaction 
conserved in cupredoxins) and Phel 14. The removal of 
one of these hydrogen bonds, brought about by making 
the Phel 14Pro mutation, results in a number of active 
site changes (Fig. 2, Yanagisawa et al. 2006). The 
reduction potential decreases to 212 mV mainly due 
to increased electron density on the Cys ligand as 
a consequence of hydrogen bond removal, which 
favors Cu(II). The reduction potential of the T1 copper 
site of azurin can be rationally tuned beyond the natu¬ 
rally occurring range by mutating both Phel 14 and 
Asn47 to further alter the second-coordination sphere 
hydrogen-bonding pattern, along with changes to the 
axial Met (Marshall et al. 2009). The replacement of 
the Cys ligand with an Asp in azurin, which abolishes 
T1 site properties and efficient ET, coupled with 
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Cupredoxins, Fig. 2 An overlay of the active site environ¬ 
ments in Cu(II) Phell4Pro (2GHZ, gray) and wild-type (IJZF, 
cyan) azurin. The altered hydrogen-bonding interactions (doited 
red lines) of the Cysll2 ligand and other changes including 
movement of Gly45, Cysll2, and Hisll? are evident (Metl21 
has been omitted for clarity) 

mutations to the axial Met results in the creation of 
type zero copper (Lancaster et al. 2009). The reduction 
potential (^310 mV) and ET reactivity of this site is 
typical of a T1 copper center. The type zero copper site 
is constrained by a new hydrogen-bonding network 
involving Asn47 and Phell4, further highlighting the 
importance of this aspect of the second-coordination 
sphere. 

As the cupredoxins are now so well understood, and 
approaches are established for engineering these mol¬ 
ecules in an increasingly rational manner, attempts are 
underway to utilize cupredoxins and T1 copper- 
containing redox enzymes in electronic devices. 
Immobilization on electrodes in a way that allows 
efficient ET is a developing area (Chi et al. 2006) 
which will pave the way to using these proteins in 
bioelectronic applications. A final area worthy of men¬ 
tion in this short review is the discovery that 
cupredoxins, and in particular azurin, can enter cancer 
cells and cause apoptosis (Yamada et al. 2004). 
It is possible that one day redox proteins, such as 
cupredoxins, will not only be used as components in 
a range of bioelectronic devices but may also be added 
to an oncologist’s armory. 

Cross-References 
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Introduction 

The cut-open oocyte Vaseline gap (COVG) voltage- 
clamp technique, a relatively recent addition to the 
electrophysiologist’s armamentarium, was specifically 
developed by Drs. Stefani and Bezanilla (Bezanilla 
et al. 1991) to achieve low-noise recordings of the 
membrane of Xenopus laevis oocytes with fast clamp 
speed and, thus, optimize the most popular transient 
expression system to reveal the activity voltage- 
dependent proteins previously difficult to resolve by 
alternative methods. The high degree of specialization 
of this technique is complemented by its flexibility: in 
addition to oocyte perfusion, COVG can been 
combined with optical measurements (voltage- 
clamp fluorometry and spectroscopy) and ► flash pho¬ 
tolysis for the instantaneous release of intracellular- 
caged compounds, expanding its use beyond 
electrophysiology. 

Historical Points 

During the cloning era, the oocytes of the African 
clawed frog {Xenopus laevis) emerged as a powerful 
biological tool, as their large size («! mm diameter) 
allowed the microinjection of cloned RNA into their 
cytosol, which they readily translated into functional 
proteins. A class of proteins that particularly benefitted 
from this expression system are ion channels and trans¬ 
porters, which can be abundantly expressed in the 
oocyte membrane, particularly with the use of special¬ 
ized vectors optimized for the Xenopus translation 
machinery (Cha and Bezanilla 1997; Shih et al. 1998). 

The large size of the oocyte and robust membrane 
integrity were critical factors in the development of one 
of the most widely used electrophysiological tech¬ 
niques: the two-electrode voltage clamp (TEVC). 
Briefly, TEVC involves impaling the oocyte with two 
intracellular electrodes; one which measures intracellu¬ 
lar potential and another that injects current, so that the 
measured membrane potential matches the command 
potential. However, the large size of Xenopus oocytes is 
a double-edged sword; their physical robustness can be 
countered by their large membrane capacitance, which 
significantly decreases the voltage-clamp speed. 
Eurthermore, uniform space clamp (isopotentiality) is 
difficult to achieve. These drawbacks can be rectified by 
clamping a much smaller portion of the oocyte 


membrane (a few pm^), as achieved by attached- or 
excised-patch voltage-clamp techniques. However, 
these techniques can be subject to various drawbacks 
as well, including low patch stability and the progres¬ 
sive decrease of ion channel activity (“rundown”), usu¬ 
ally attributed to the loss of intracellular factors by 
dialysis or excision, etc. 

The drawbacks of TEVC and patch-clamp tech¬ 
niques made it difficult to observe small or transient 
voltage-dependent charge movements. Specific chal¬ 
lenges were Na"^ channel ionic currents, which inacti¬ 
vate within a few milliseconds, and the “gating 
currents” caused by the transmembrane movement of 
charged residues in voltage-dependent proteins during 
activation: relatively to ionic currents, they are small 
and short lived. The cut-open oocyte Vaseline gap 
(COVG) voltage-clamp technique, whose origins can 
be traced to previous Vaseline gap techniques for volt- 
age-clamping cut-open muscle fibers, was specifically 
developed to maximize the benefits of the Xenopus 
oocyte expression system by improving on clamp 
speed, signal-to-noise ratio, and ability to effectively 
perfuse the oocyte (Taglialatela et al. 1992; Stefani and 
Bezanilla 1998). In this way, it was possible to com¬ 
bine the most popular transient expression system, and 
the associated benefits of molecular cloning and site- 
directed mutagenesis, with the superior voltage-clamp 
properties of cut-open cell techniques. 

Basic Implementation 

Xenopus oocytes are isolated from the ovary by 
a combination of enzymatic digestion and mechanical 
agitation. Their follicular layer (a layer of cells that 
surround and nourish the developing oocyte) is also 
removed, either mechanically or enzymatically, as its 
presence prevents good electrical isolation at the Vas¬ 
eline gaps of the COVG apparatus (see below). The 
oocytes are then microinjected with the clone RNA of 
the gene desired, typically 1-50 ng. After incubation 
for 2-7 days at 18°C, the oocytes express the protein of 
interest and are ready for voltage clamp. 

Briefly, the most basic implementation of COVG 
involves mounting the oocyte in an apparatus compris¬ 
ing of three electrically isolated chambers, or pools of 
solutions, separated by Vaseline gaps. The voltage- 
clamp circuit is then assembled around the mounted 
oocyte by the placement of six salt bridges and one 
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Cut-Open Oocyte Voltage-Clamp Technique, 
Fig. 1 Schematic representation of the COVG setup and prac¬ 
tical implementation, (a) Top view of the COVG chambers, (b) 
Side view (cross section) of the COVG chambers, showing the 
relative position of the oocyte, the Vaseline gaps {inyellow) and 
a simplified diagram of the circuitry. The oocyte membrane 
immersed in the lower chamber {dashed line) has been 
permeabilized with 0.1% saponin, to electrically connect the 
ooplasm with the current-injecting bridge I. (c) Photograph of 


b SIDE VIEW & V-CLAMP CIRCUITS 

I membrane 




the COVG chambers. WB well block, LC lower chamber, GS 
guard shield chamber, RC recording chamber; VI intracellular 
electrode; Gi, G2, P1,P2, CC salt bridges as shown in panel (b). 
(d) The same COVG chambers in a setup optimized for voltage- 
clamp fluorometry (see Fig. 3) using a Zeiss Axiovert FS 
epifluorescence microscope. A 40x objective is immersed in 
the RC external solution to transmit excitation light and collect 
fluorescence emission 


intracellular electrode, imposing three simultaneous 
voltage clamps. This is further described below and 
a schematic representation of the COVG setup is 
shown in Fig. 1. 

The three chambers or the COVG are referred to, 
respectively, from top to bottom, as recording, guard 
shield, and lower chamber. The recording and guard 
chambers possess 0.6 mm holes to accommodate the 
oocyte. The holes are lined with Vaseline® (or a viscous 
mixture of mineral oil and molten Parahlm®) so that 
when the oocyte is mounted the chambers are electri¬ 
cally isolated; thus the Vaseline gaps are formed. 
A commercially available electrophysiological ampli¬ 
fier for COVG is the Dagan CA-IB (Dagan Corpora¬ 
tion, MN, and USA). The electronic circuitry of the 
amplifier terminates in Ag/AgCl electrode coils lining 


wells filled with 1 M NaCl. The wells are connected to 
the chambers via 1-2 cm agar (3%) bridges typically 
containing 1 M Na-Methanesulfonate, threaded up to k, 
1 mm from their ends with Pt wire to improve 
admittance. 

The oocyte rests on the hole between the lower and 
guard chambers. The oocyte’s lower part is exposed to 
the lower chamber, and can be cut or permeabilized by 
brief (up to 1 min) exposure to 0.1% saponin, so that 
the ooplasm becomes electrically continuous with the 
internal solution pool in the lower chamber. In con¬ 
trast, the guard and recording chambers contain exter¬ 
nal solution. The top chamber is mounted over the 
oocyte, so that its upper dome (Ri20% of the total 
oocyte surface) is electrically isolated from the guard 
chamber below and exposed to the external solution in 










































Cut-Open Oocyte Voltage-Clamp Technique 


409 


c 


the recording chamber. Two distinct voltage clamps 
are used to clamp the guard and recording chamber 
solutions to the inverse of the command potential, each 
requiring the placement of two salt bridges. A third 
voltage clamp injects current intracellularly through 
the lower chamber to actively ground the ooplasm 
beneath the domus membrane, as measured by 
a sharp electrode impaling the oocyte domus in the 
recording chamber. Thus, the membrane is effectively 
clamped to the command potential. Optimal space 
clamp is achieved when the voltage electrode has 
impaled the oocyte apex (Perozo et al. 1992). During 
placement, the electrode is easier to see against a dark 
background achieved by mounting the oocyte with the 
animal pole facing up. As in other voltage-clamp sys¬ 
tems, circuitry allows for series resistance compensa¬ 
tion, leak subtraction, and capacitative transient 
compensation - the latter through an additional cur- 
rent-injecting bridge in the recording chamber. 

COVG Advantages and Scientific 
Breakthroughs 

Even at its most basic implementation described 
above, COVG is a more complex technique than 
TEVC or patch clamp. However, COVG is capable of 
fast voltage steps, with a clamp speed in the order of 
20^0 ps, and relatively low noise («1 nA r.m.s. at 
5 kHz). These advantages can be attributed to the 
following specific features: 

1. Low access resistance for intracellular current 
injection due to the physical ablation or chemical 
permeabilization of a large portion of the oocyte 
membrane at the lower chamber. 

2. The electrical isolation of the voltage-clamped 
guard and recording chambers helps to achieve 
isopotentiality at the recording chamber. 

In this respect, COVG possesses better voltage- 
clamp properties than TEVC, applied with more sta¬ 
bility, less rundown and over a much larger area of the 
oocyte than patch techniques. In addition, COVG 
allows extensive control of the perfusing solutions, 
further discussed below. 

These benefits resulted in the resolution and char¬ 
acterization of extremely small and transient voltage- 
dependent transmembrane charge movements that 
allowed the investigation of voltage-dependent 
► channel gating, in particular the gating currents 


from Shaker (Bezanilla et al. 1991) and DRKl 
► potassium channels (Taglialatela and Stefani 
1993); ► Voltage-gated sodium and calcium channels 
(Neely et al. 1993; Cha et al. 1999); EAG (Tang et al. 
2000) and HERG ► potassium channels (Piper et al. 
2003); large-conductance voltage- and Ca^'^-activated 

(BKca or MaxiK) channels (Savalli et al. 2006) and 
KyV channels (Miceli et al. 2009). 

In addition, COVG was used to reveal the “gating 
currents” of an ascidian phosphoinositide phosphatase 
(C/-VSP) (Murata et al. 2005) as well as muscarinic 
ACh receptors (Ben Chaim et al. 2006) directly dem¬ 
onstrating the intrinsic voltage dependence of these 
proteins. Representative gating current traces from 
BKca and Shaker channels are shown in Pig. 2. 

Solution Manipulation in COVG 

In some experiments, the ability to manipulate the 
perfusing solutions can be as important as fast voltage 
control. Since only a defined portion of the oocyte 
membrane is clamped in COVG (the domus exposed 
in the recording chamber), the external solution is easier 
to manipulate than in the TEVC technique, where per¬ 
fusion can be confounded by the large surface area and 
spherical geometry of the oocyte. The recording cham¬ 
ber solution is easily accessible and can be aspirated 
almost entirely before exchange during the experiment. 
Alternatively, concentrated stocks of drugs or modify¬ 
ing agents can be manually added into the existing 
external solution. More controlled solution exchange 
can be implemented with a Picospritzer. 

The internal solution is harder to control in the basic 
COVG implementation as described above due to dif¬ 
fusion barriers in the ooplasm. It is possible to micro- 
inject substances prior to the experiment; indeed, 
injecting on the portion of the oocyte to be 
permeabilized at the lower chamber ensures that the 
injection will not contribute to clamped membrane 
leak. During the experiment, a perfusion cannula or 
pipette can be introduced intracellularly through the 
lower chamber (Taglialatela et al. 1992; Stefani and 
Bezanilla 1998). One use of this is the fast and com¬ 
plete depletion of intracellular conducting ions, as in 
the investigation of gating currents from unblocked, 
wild-type Shaker channels (Olcese et al. 1997). 
Despite these maneuvers, rundown remains minimal 
(Stefani and Bezanilla 1998). Eurthermore, molecules 
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Cut-Open Oocyte Voltage- 
Clamp Technique, 

Fig. 2 Gating Currents from 
BKca and Shaker K* 
Channels, (a) A family of 
gating currents recorded using 
COVG from an oocyte 
expressing BKca (or MaxiK) 
channels (/jSto(M4)-C14S- 
R20A-C141S-S202C-W203V- 
R207Q-C277S). Zero current 
is indicated by the dashed line. 
The oocyte was voltage- 
clamped using the protocol 
shown above the traces. The 
test potential (in mV) is shown 
on the left of each trace, (b) As 
in panel (a), for Shaker 
channels {Shaker H4-(A6-46)- 
M356C-W434F). Note that 
Shaker channels have slower 
gating kinetics 
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chelated by UV-sensitive molecular cages (Ca^"^, pro¬ 
tons, etc.) can be microinjected prior to the experiment. 
The caged compounds can then be instantaneously 
released by ► flash photolysis with UV light directed 
at the oocyte domus using a light guide. 

Temperature jumps can be executed by perfusing 
with external solution of the desired temperature. Sta¬ 
ble temperature control can be achieved by including 
thermoelectric (Peltier) elements in the setup, while 
a thermistor probe in a negative feedback circuit with 
the Peltier element can provide automated temperature 
control (Stefani and Bezanilla 1998). 

Combination of the Cut-Open Oocyte 
Voltage Clamp with Optical Techniques 

Voltage-clamp fluorometry (VCF) is a new, innovative 
approach for optically tracking protein conformational 
rearrangements. It is discussed in depth elsewhere in 
this encyclopedia. In this entry, VCF will be briefly 
introduced in the context of a COVG setup. The central 
principle of VCF is that many fluorophores are sensi¬ 
tive to their local environment (Lakowicz 2006). By 
engineering such fluorophores with a sulfhydryl- 
reactive moiety, such as maleimide, it is possible to 
conjugate them to a specific protein site where 
a unique, accessible cysteine has been introduced. 


If this area of the protein then undergoes a motion 
that results in a change of the fluorophore environment, 
a deflection in the fluorescence intensity will be 
observed. VCF was pioneered in Shaker channels 
expressed in oocytes clamped by TEVC (Mannuzzu 
et al. 1996) and was shortly after implemented in 
COVG (Cha and Bezanilla 1997). Since then, COVG 
and VCF have been combined (COVG-F) to resolve 
voltage-dependent conformational rearrangements in 
many other channels and voltage-sensitive proteins 
(Cha et al. 1999; Smith and Yellen 2002; Bannister 
et al. 2005; Savalli et al. 2006; Villalba-Galea et al. 
2008). 

The practical implementation of COVG-F is exten¬ 
sively described in (Gandhi and Olcese 2008). The 
setup is illustrated in Fig. 3. Briefly, oocytes 
expressing ion channels with a single extracellular 
cysteine are incubated with the maleimide fluorophore 
to allow conjugation. The optimal conditions for the 
► fluorescence labeling of proteins vary according to 
fluorophore reactivity and the accessibility of the cys¬ 
teine. A ► labeling technique for BKca channels with 
tetramethylrhodamine-5'-maleimide (TMRM) is to 
incubate the oocytes in depolarizing solution 
(120 mM K-Methanesulfonate, 10 mM HEPES, 
pH = 7.0) with 10 pM TMRM, at room temperature, 
for 30^5' in the dark (Savalli et al. 2006, 2007; 
Pantazis et al. 2010a, b). After thorough washing in 
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Cut-Open Oocyte Voltage- 
Clamp Technique, 

Fig. 3 COVG technique 
combined with fluorometry. 
(a) The COVG chambers 
(Fig. 1) in an epifluorescence 
microscope to simultaneously 
record membrane current and 
fluorescence from 
environment-sensitive 
fluorophores site-specifically 
labeling the expressed 
channels, (b) A BKca channel 
in the oocyte membrane 
(putative structure), which 
have been conjugated 
with Tetramethylrhodamine- 
5'-maleimide (TMRM) 
fluorophores 




dye-free solution, the oocyte is mounted on a COVG 
setup which has been modified for epifluorescence 
measurement as shown in Fig. 3, while Fig. 4 shows 
simultaneously recorded ionic currents and fluores¬ 
cence signals from TMRM-labeled BKca and Shaker 

channels. 

An upright microscope with a 40 x water- 
immersion objective lens (N.A. = 0.80) is used, so 
that the entire oocyte domus is in the field of view. 
A halogen bulb (12 V, 100 W) can be used as a light 
source, connected to a stable power supply to prevent 


light fluctuations. The light passes through the excita¬ 
tion filter and is reflected by the dichroic, through the 
objective lens, onto the oocyte domus, at a wavelength 
that excites the fluorophore. Emitted light passes 
through the objective, the dichroic and the emission 
filter, to reach the light detection circuit. Figure 4 
shows characteristic ionic currents and fluorescence 
deflections from TMRM-labeled BKca and Shaker 
channels, simultaneously resolved using the 
methods described above. Although a photodiode is 
used as an example to record fluorescence in Fig. 3, 
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Cut-Open Oocyte Voltage-Clamp Technique, Fig. 4 Simul¬ 
taneously resolved ionic currents and fluorescence with COVG. 
(a) A family of currents {black traces) and simultaneously 
resolved fluorescence deflections (red traces) from BKca chan¬ 
nels labeled with TMRM outside their SO transmembrane seg¬ 
ment (/tS/o(M4)-C14S-G18C-C141S-R207Q-C277S). The 
SO-bound TMRM is dynamically quenched by a Tryptophan 
residue outside the S4 segment (W203). The fluorescence deflec¬ 
tions correspond to the relative voltage-dependent motion of the 

a photomultiplier tube or a charge-coupled camera can 
be used instead. The emitted light can also be sampled 
by a spectrometer, as the spectral properties of the 
emitted light can yield information on the environmen¬ 
tal changes experienced by the conjugated fluorophore 
(Cha and Bezanilla 1997). 

To ensure that only the fluorescent light emitted 
from ion channels embedded in the voltage-clamped 
portion of the oocyte is detected, an opaque recording 
chamber should be used. The ability to detect fluores¬ 
cence from the entire voltage-clamped area of the 
oocyte while excluding photons from undamped 
regions is an important advantage of combining VCF 
with COVG over TEVC, in addition to the voltage 
clamp and solution exchange benefits discussed above. 

Summary 

The cut-open oocyte Vaseline gap (COVG) voltage- 
clamp technique was developed during the height of 


two transmembrane helices (Pantazis et al. 2010b). (b) As in 
panel (a) for non-inactivating Shaker channels labeled with 
TMRM outside their S4 segment {Shaker H4-(A6-46)-M356C). 
The fluorescence deflections are related to motions that the 
channel undergoes during gating (Mannuzzu et al. 1996; Cha 
and Bezanilla 1997). Therefore, even though the observed volt- 
age-dependent AF exhibit opposite sign and different amplitude 
and kinetics in the two channels, they likely report functionally 
related structural changes 

the cloning era, to optimize voltage clamp on 
the Xenopus oocyte transient expression system. It 
offered the following advantages to the 
electrophysiologist: 

1. Very fast (20-40 ps), uniform voltage clamp of a 
large (20%), defined area of the oocyte. Low noise 
(1 nA r.m.s. at 5 kHz) compared with TEVC. There¬ 
fore, a great number of expressed proteins can be 
investigated under optimal voltage clamp conditions. 

2. Ability to exchange both internal and external solu¬ 
tions during the experiment, yet exhibit good sta¬ 
bility and minimal rundown for several hours. 

3. Versatility for a wide spectrum of applications, 
including fast intracellular release of caged com¬ 
pounds and voltage-clamp fluorometry. 

Cross-References 

► Elash Photolysis 

► Eluorescence 
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► Fluorescence Techniques for Studying Ion Channel 
Gating; VCF, FRET, and LRET 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Protein Eluorescent Dye Labeling 

► Voltage-Gated Potassium Channels, Kv Channels 

► Voltage-Gated Sodium and Calcium channels 
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Synonyms 

Combined assignment and dynamics algorithm for 
NMR applications 

Definition 

CYANA (and its predecessors DYANA and DIANA) 
are software packages for the structure calculation 
of proteins and nucleic acids on the basis of 
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Cyclic Nucleotides 


multidimensional NMR data. Three-dimensional 
structures are calculated by simulated annealing 
using a fast algorithm for torsion angle dynamics, 
that is, molecular dynamics simulation with the torsion 
angles as the only degrees of freedom. The efficiency 
of the approach results from the elimination of high- 
frequency bond-stretching and bond-bending motions 
that allows for much longer integration time steps than 
conventional Cartesian space molecular dynamics 
simulation, and from the use of a highly simplified 
“geometric” force field that retains only short-distance 
interactions. The principal experimental input data are 
conformational restraints on interatomic distances 
derived from nuclear Overhauser effect spectroscopy 
(NOESY). In addition, restraints on torsion angles 
derived from chemical shifts of scalar coupling con¬ 
stants, hydrogen-bond restraints, residual dipolar cou¬ 
plings, and paramagnetic pseudocontact shifts can be 
used as input data for structure calculations. CYANA 
provides an algorithm for combined automated assign¬ 
ment of NOESY cross peaks and protein structure 
calculation, which has largely replaced the cumber¬ 
some and error-prone manual NOESY assignment 
approach. Recently, CYANA has been extended with 
the ELYA algorithm for fully automated NMR protein 
structure determination that can substitute all manual 
spectra analysis, including peak picking, resonance 
assignment, and NOESY assignment, and thus over¬ 
comes a major efficiency limitation of the NMR 
method for protein structure determination (Giintert 
2009). Eurther information on CYANA can be found 
at http://www.cyana.org. 

Cross-References 

► Molecular Dynamics Simulations of Lipids 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR-based Structural Proteomics 

► Nuclear Overhauser Effect 

► Protein NMR Resonance Assignment 

► Residual Dipolar Coupling 
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► Gating of Potassium Channels by Cyclic Nucleotide 
Binding 

► Ion Channels: New Tools to Track Cyclic Nucleotide 
Changes in Living Cells 


Cyclooxygenase - Computational 
Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

COX; Prostaglandin endoperoxide synthetase; Prosta¬ 
glandin synthase 

Definition 

An enzyme that catalyzes formation of prostanoids and 
is inhibited by nonsteroidal anti-inflammatory drugs, 
which thereby help to relieve pain and inflammation. 

Basic Characteristics 

Cyclooxygenase (COX) is an important pharmacolog¬ 
ical target. Two important isoenzymes in humans are 
COX-1 and COX-2. Nonsteroidal anti-inflammatory 
drugs (NSAIDs), such as aspirin and ibuprofen, are 
believed to exert their pain and inflammation reducing 
effects through inhibiting these isoenzymes. Selectiv¬ 
ity of NSAIDs for COX-2 is desirable, because this 
isoenzyme is primarily present at sites of inflammation 
rather than the constitutively present COX-1. COX-2- 
specific NSAIDs therefore avoid many of the common 
side effects of nonspecific NSAIDs. The physiologi¬ 
cally relevant catalytic mechanism of COX has been 
studied using QM electronic structure methods, using 
only the catalytic tyrosine and a reduced model of the 
substrate arachidonic acid (Silva et al. 2003). Minima 
and transition state geometries for all reaction steps 
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were optimized at the B3LYP/6-31G(d) density func¬ 
tional theory level, with energies calculated at the 
B3LYP/6-311-i-G(3df,2p) level. Free energies were 
estimated based on vibrational frequency calculations 
for these structures. Two reaction steps, the initial 
hydrogen abstraction by the tyrosine radical and the 
formation of the peroxo-bridge, were found to have 
barriers close to the experimentally determined reac¬ 
tion rate of the enzyme. Smaller barriers were found 
for the subsequent steps. The study also ruled out 
a combined radical/cation mechanism. 

The classic NSAID aspirin (acetylsalicylic acid) is 
unique in the sense that it covalently modifies both 
isoenzymes and thereby causes irreversible inhibition. 
The mechanism of the covalent acylation of the active 
site serine (Ser530) by aspirin in both COX-1 and 
COX-2 was studied using ► QM/MM methods 
(Tosco and Lazzarato 2009). The semiempirical QM 
method SCC-DFTB was used in combination with 
CHARMM22 MM parameters to obtain two- 
dimensional potential energy surfaces (for the forming 
and breaking of covalent bonds) for the reaction in both 
isoenzymes, which were then corrected using B3LYP/ 
6-31G(d) energy calculations. The results indicated that 
Ser530 is acetylated through intramolecular general 
base catalysis provided by the vicinal carboxylate 
group of aspirin. A QM/MM approach has also recently 
been developed to incorporate polarization effects in 
protein-ligand binding free energy calculation, and 
applied to prediction of relative binding free energies 
of two COX-2 inhibitors (Beierlein et al. 201 1). 

Cross-References 

► QM/MM Methods 

► Semiempirical Quantum Mechanical Methods 
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Cysteine Dioxygenase - Computational 
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Science, University of Manchester, 

Manchester, UK 

Definition 

Enzymes that transfer two oxygen atoms of molecular 
oxygen to a cysteine molecule to give cysteine sulfinic 
acid products. 

Basic Characteristics 

Cysteine dioxygenase (CDO) is a nonheme iron- 
containing enzyme involved in the metabolism of cyste¬ 
ine in the body (Straganz and Nidetzky 2006; Stipanuk 
2004; Joseph and Maroney 2007). Cysteine is 
biosynthesized in the body from the amino acid methio¬ 
nine; however, in order to regulate the cysteine concen¬ 
tration in the body, a series of enzymes metabolize 
excess cysteine, whereby CDO catalyzes the first step 
in this process, namely, dioxygenation of cysteine to 
form cysteine sulfinic acid. This is an important reaction 
for human health and a decline in CDO activity, for 
instance, has been correlated with neurological disor¬ 
ders, including diseases such as Alzheimer’s and 
Parkinson’s. CDO utilizes one molecule of molecular 
oxygen on a nonheme iron center and catalyzes the first 
step in the biodegradation of cysteine leading to cysteine 
sulfinic acid products. Eigure 1 displays the active site of 
substrate-bound CDO as taken from the 2IC1 protein 
databank (pdb) file (Ye et al. 2007). Many studies have 
been undertaken to understand the catalytic mechanism 
of CDO enzymes, which takes place on a unique 
nonheme iron active site where the metal is bound to 
three histidine groups of the peptide backbone. Fig. 1. 
This is a unique active center in the group of nonheme 
iron-containing enzymes as most of these enzymes con¬ 
tain an iron bound to a facial 2His/l-carboxylate ligand 
system. Computational studies showed that 
the 3His ligand system in CDO enzymes is 
necessary for optimal dioxygenation of cysteine whereas 
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Cysteine Dioxygenase - 
Computational Studies, 

Fig. 1 Active site structure of 
substrate-bound CDO with 
labels taken from the 2IC1 pdb 
file 



ey/ "V 
0^ -.. O, 

^ ' -N,„ I „.His 

His 


t: 


0 . 0 . 5.7 


proximal 
attack 
14 . 1 . 14.7 



distal 

attack 

55 




o 

s 


,.His 

^His 


< 6 . 6 . 80 




His 


i2 52 
12 . 4 . 2.9 


O' 


o 

His 


- 4.3 


His 

’His 


V 


=3 

-<76 




48 


Cysteine Dioxygenase - Computational Studies, Fig. 2 Mechanism of substrate activation by CDO enzymes. Relative energies 
are given in kcal moP' 


an electron-donating carboxylate ligand would weaken 
the substrate binding and dioxygenation processes 
considerably (Aluri and de Visser 2007; de Visser and 
Straganz 2009; de Visser 2009). 

Conversion of cysteine to cysteine sulfinic acid prod¬ 
ucts by CDO enzymes uses one molecule of cysteine, one 
O 2 molecule, and no protons and/or external electrons. 
Experimental studies failed to unequivocally assign 
a catalytic cycle for CDO enzymes, due to the short 
lifetime of most intermediates after dioxygen binding, 
although evidence of a persulfenate structure was 
obtained (Simmons et al. 2008). It was suggested that 
this persulfenate structure might be a catalytic cycle 
intermediate but experimental studies were inconclusive. 


Since experimental studies failed to find a consensus 
reaction mechanism for cysteine activation by CDO 
enzymes, several computational studies were performed 
(Aluri and de Visser 2007; de Visser and Straganz 2009; 
Kumar et al. 2011). Initial studies used density functional 
theory methods on a large active site model that included 
the first and second coordination spheres of the iron, but 
later also full quantum mechanics/molecular mechanics 
(QM/MM) studies were done. Both DFT and QM/MM 
calculations predicted the same catalytic mechanism as 
displayed in Fig. 2. The studied mechanism starts from 
an iron(in)-peroxo complex (1), whereby the terminal 
oxygen atom attacks the sulfur atom of the substrate to 
form a ring structure (2). This process is accomplished by 
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a spin state crossing from an open shell singlet state to 
a quintet spin state. In this ring structure, the dioxygen 
bond is weakened from a double to a single bond and 
homolytically breaks to form an iron(IV)-oxo species 
bound to a sulfoxide complex (3). An internal rotation 
of the sulfoxide group makes the sulfur atom accessible 
for the second oxygen atom transfer. Subsequent oxygen 
atom transfer then gives cysteine sulfinic acid products 
(4). Also the alternative mechanism whereby formation 
of a persulfenate structure is formed (5) was investigated, 
but this process was found to be high in energy. There¬ 
fore, computational studies rule out the persulfenate 
structure as a viable catalytic cycle intermediate. 

In summary, computational studies predicted 
a catalytic cycle of cysteine activation by CDO 
enzymes. The rate-determining step in the oxygen 
activation process is found to be the conversion of the 
iron(IlI)-peroxo intermediate into the ring structure as 
it involves a spin state crossing. 


Cross-References 

► Computational Enzymology 

► QM/MM Calculations of Vibrational Spectra 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Taurine/Alpha-Ketoglutarate Dioxygenase - 
Computational Studies 
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Synonyms 

/tg/’complex 


Definition 

The cytochrome /tg/complex is one of three hetero- 
oligomeric integral membrane protein complexes in 
the electron transport chain of oxygenic photosynthe¬ 
sis that contribute to the trans-membrane proton elec¬ 
trochemical potential gradient, Aptj+. 
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Basic Characteristics 
Function 

Discharge of the Afip[+ through the ► ATP synthase is 
responsible for adenosine triphosphate (ATP) produc¬ 
tion. The other two protein complexes in the photosyn¬ 
thetic electron transport chain are the ► photosystem II 
(PSII) and ► photosystem I (PSI) reaction centers. 
The h(,f complex functions electrochemically in the 
center of the linear electron transport chain from H 2 O 
to NADP"^ in thylakoid or microbial membranes that 
support oxygenic photosynthesis, hg/accepts electrons 
and protons from PSII via lipophilic plastoquinol, and 
donates electrons to PSI through the soluble plastocy- 
anin or cytochrome Cg, while translocating protons 
across the complex, thereby generating a ApH+ 
(Fig. 1) (Kallas 1994; Hauska et al. 1996; Blankenship 
2002; Allen 2004; Cramer et al. 2006; Baniulis et al. 
2008). 

Crystal Structures and Implications for Function 

The moderately thermophilic cyanobacteria, M. 
laminosus (Kurisu et al. 2003; Yan et al. 2006; 
Yamashita et al. 2007) and Nostoc sp. PC 7120 
[Baniulis et al. 2009], and the green alga, C. reinhardtii 
(Stroebel et al. 2003) have been used as sources for 
crystallization and crystallography studies of the cyto¬ 
chrome fegf complex. 

The 217,070 and 214,298 molecular weight (with¬ 
out prosthetic groups) complex from M. laminosus and 
Nostoc is a symmetric dimer (Fig. 2). The monomer 
contains eight a-helical subunits, four “large” 
(► cytochromes / and hg, Rieske iron-sulfur [2Fe-2S] 
protein, and subunit IV, with molecular weights 
between 18 and 32 kDa), and four “small” subunits 
(petG, L, M, and N), 3^ kDa, “hydrophobic sticks,” 
each punctuated by 1-2 positive residues on the n-side, 
spanning the membrane with one transmembrane a- 
helix. These small subunits form a “picket fence” 
structure on opposite sides of the dimer. The molecular 
weights of the subunits, and the midpoint oxidation- 
reduction potential (► redox potential or Emy) value of 
the prosthetic groups, determined in situ in mem¬ 
branes, are summarized in Table 1. 

The 24-kDa cytochrome h subunit forms the core, 
with seven tightly bound prosthetic groups, of which 
five are bound redox groups: four hemes (cyt /, two 
non-covalently bound h-hemes and one covalently 
bound c heme, the unique heme Cn). The [2Fe-2S] 


cluster is the /t-side oxidant of PQH 2 . There are two 
additional non-redox prosthetic groups, one chloro¬ 
phyll a and one (I-carotene whose presence is unique 
to the hg/compared to cytochrome hci complexes. The 
P-carotene quenches the excited triplet state of the chlo¬ 
rophyll even though the two pigment molecules are sep¬ 
arated by 14 A, and can thereby protect the protein 
network against damage from singlet oxygen. The dis¬ 
tances between the prosthetic groups in the M. laminosus 
and Nostoc structures are summarized in Table 2. 

The quinone-exchange cavity is the space between 
the two monomers which is approximately 30 A high, 
15 A deep, and 25 A wide. The walls of each of the two 
cavities, which are on opposite sides of the dimer, are 
formed by trans-membrane helices (TMH) from each 
monomer: the “C, D” TMH of the cytochrome h 
subunit, which contains TMH “A-D.” The cavity 
walls are also lined by the “F” TMH from the suIV 
subunit from one monomer, and the A, E, and ISP 
TMH from the other. The cavity is probably filled 
with disordered lipid acyl chains. The structure 
obtained with a co-crystallized n-side quinone analog 
inhibitor (pdb: 2E75, 2E76; [Yamashita et al. 2007]) 
shows that the unique heme Cn (Eig. 3), together with 
heme to which it is electronically coupled, serves as 
the n-side quinone reductase. Heme Cn is positioned on 
the edge of the intermonomer quinone exchange cav¬ 
ity, with an OH“ bridging a propionate of heme and 
the Ee atom of heme Cn, which are separated by 4 A 
(Table 2). The heme is covalently linked via a single 
thioether group to Cys35 on the n-side of the “A” TMH 
of the cyt h subunit. Together with heme to which 
it is electronically coupled, it serves as the n-side 
quinone reductase [Yamashita et al. 2007] (Eig. 4). 
The p-side soluble domains of cyt / and the ISP are 
predominantly /(-strand, as is the p-side plastocyanin. 
The P-structure for cytochrome/is virtually unique for 
a c-type cytochrome (Soriano et al. 2001). In this 
respect, and also in their primary amino acid 
sequences, cytochrome / and cytochrome Ci of the 
cytochrome hci complex are completely different pro¬ 
teins, the only common feature being the characteristic 
five residue Cys-X-Y-Cys-His heme-binding 
sequence diagnostic of c-type cytochromes. 

Electron and Proton Transfer Pathways; the 
Q Cycle 

Oxidation-reduction reactions of plastoquinol/ 
quinone on the p- and n-sides of the complex and the 
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Heme f 


[2Fe-2S] 
cluster 
Heme bp- 

Chl-a — 
Heme 
/3-Car— 
Heme 
Sulfollpid 


Cytochrome b^f Complex, Fig. 2 (a) Ribbon diagram of the 
symmetric dimeric cytochrome 65 /complex, with 13 membrane 
spanning a-helices per monomer, from the cyanobacterium Nos- 
toc sp. PCC 7120, determined from the 3.0 A structure (pdb, 
2ZT9). Color code: p-side of complex: cyt/(perA), red', Rieske 
ISP (petA), -yellow, cyt h (petB), blue', subunit IV (petD), purple', 
petG, L, M, N, green. The bound Chi a and (I-carotene are 
shown in stick format, in dark green and orange, (b) Stick diagram 


of b(f structure showing position of the seven prosthetic 
groups (4 hemes, 1 FeS cluster, 1 chlorophyll a, 1 (I-carotene) 
per monomer in the dimeric structure. There are also 
seven potential lipid binding sites, one natural site occupied by 
a sulfolipid, two sites occupied by dioleoyl-phosphatidylcholine 
(DOPC) lipid that was used for crystallization and presumably 
displaced natural lipid, and four sites occupied by UDM deter¬ 
gent molecules 


Cytochrome b^f Complex, Table 1 Molecular weights and 
midpoint oxidation-reduction potentials (En,?), values associ¬ 
ated with the subunits of the M. laminosus cyanobacterial 
cytochrome fog/ complex. Standard deviations of En ,7 values 
are ~ ± 20 mV 


Subunit 

kDa 

Em? (mV) 

Cyt/(heme) [PetA] 

32.273 

+350-380 

Cyt bg (2 hemes) 
[PetB] 

24.712 

(-50 (bn); -50 to 
- 100 * (bp )),+100 (heme Cn) 

ISP (2Fe-2S)[PetC] 

19.295 

+295 (pH 6.5) - +320 

suIV [PetD] 

17.528 

- 

PetG 

4.058 

- 

PetM 

3.841 

- 

PetL 

3.530 

- 

PetN 

3.304 

- 


*measured in situ (in the membrane). 


membrane drive proton translocation and generate the 
Ajljj+ (Fig. 4). On the p-side: plastoquinol initially 
transfers one proton and one electron to the imidazole 
ligand and [2Fe-2S] cluster of the iron-sulfur protein 
(ISP), the first redox component in the high potential 
chain, forming the plasto-semiquinone anion, PQ-, 
which is coupled to the reduction of heme hp 
(“oxidant-induced reduction”). Because of the 


Cytochrome bgf Complex, Table 2 Distances between pros¬ 
thetic groups of the bg/complex from M. laminosus and Nostoc 
sp. PCC 7120. Distances are similar in the complex from 
C. reinhardtii (Stroebel et al. 2003) 


Distance (A) 

Prosthetic group Fe - Fe 

Edge - Edge 

[2Fe-2S]-Cyt/ 29 

25,26 

[2Fe-2S]-heme bp 28 

24 

Heme bp-heme b„ 20.6, 20.7 

7.4, 8.2 

Heme bp-heme bp 22.0,22.1 

12.9, 12.7 

Heme bn-heme b„ 35 

29 

Heme bn-heme Cn 9.6, 9.7 

4 (heme c„ Fe - b„ propionate) 

Chi a-heme bp - 

12 

Chi a-heme h„ - 

5.5, 6.1 

Chi a-fi-carotene - 

14 


proximity of the two />-hemes (Table 2), the electron 
can be transferred across the membrane to the heme 
hjcn pair. A second p-side turnover of PQH 2 can 
transfer a second electron to PQ or the plastose- 
miquinone, PQ-, on the n-side of the complex, 
regenerating PQH 2 . Thus, the h^f complex functions 
as a plastoquinol: plastocyanin/cytochrome c oxidore- 
ductase on the electrochemically positive (p) side of the 
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His86 
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HislOO 


Heme b 


Heme 


Cys35 


Cytochrome b^f Complex, Fig. 3 Structure and environment 
of heme c„; arrangement of hemes hp, h^, and c„ [8]. The two bis- 
histidine coordinated 6-hemes are separated by 7-8 A edge-edge 
(pdb: 2E74, 2ZT9) along the direction normal to the membrane 
plane 


membrane and as a plastoquinone reductase on the 
negative (n) side. This PQ/PQH 2 cycle in the be/com¬ 
plex differs in several respects from the “Q-cycle” 
model initially proposed for regeneration of ubiquinol 
through function of the cytochrome hc^ complex in the 
respiratory chain and photosynthetic bacteria: (1) the 
presence of heme Cp, which is inferred from crystal 
structure data to be the PQ reductase; (2) the electronic 
coupling between hemes bn and Cn, which implies that 
quinol reduction need not proceed through 
a semiquinone intermediate, but perhaps cooperatively 
through hemes and Cn, providing protection against 
the formation of superoxide ion and ► reactive oxygen 
species; (3) the presence of ferredoxin: NADP"^ reduc¬ 
tase (FNR) bound to purified plant (spinach) bg/com¬ 
plex implies that electrons needed for the reduction of 
PQ to PQH 2 can also be supplied by the “cyclic electron 
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Cytochrome bsf Complex, Fig. 4 “Q cycle” model for “bifur¬ 
cated” pathway of electron transfer into a high (ISP - > cyt/- > 
PC - > PSI) and low potential chain, resulting in the transfer 
from the n- to the p-side of 2 Pf^ per electron transferred to the 
high potential chain 


transport chain,” which can transfer electrons back to 
the bg^complex and the central electron transport chain 
via ferredoxin from the reducing side of photosystem I. 
Coupling of electron and proton transfer generates 
a maximum HGe” ratio = 2 for proton translocation 
across the complex from the n- to the p-side aqueous 
phase. Certain quinol analogs can act as inhibitors of 
electron transfer in the complex: (1)2, 5-dimethyl, 6- 
isopropyl benzoquinone (DBMIB), (2) stigmatellin or 
its tridecyl derivative on the p-side of the complex, at 
the site of quinol oxidation by the ISP; (3) 2-n-nonyl-4- 
hydroxyquinoline-A-oxide (NQNO) binds on the «-side 
at the side of heme Cn that faces the quinone-exchange 
cavity (Yamashita et al. 2007). 

Evolution 

Three problems of interest in the evolution of cyto¬ 
chrome be complexes in photosynthesis and respira¬ 
tion are: (1) The cytochrome b core, common to bg/ 
and bci complexes (Widger et al. 1984; Schultz et al. 
2000). The conservation of heme-binding sites and the 
distribution of hydrophobicity in the cytochrome bg 
subunit and the N-terminal heme-binding half of the 
cytochrome b polypeptide of the bci complex imply 
a common evolutionary origin of the hydrophobic core 
of be complexes. As evolution led to the assimilation 
of different subunits, the bci and bg/ structures 
diverged in directions parallel and perpendicular to 
the plane of the membrane. The similarity of function, 
and almost complete dissimilarity of amino acid 
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sequence of cytochromes/and C\, implies convergent 
evolution of these subunits. (2) With regard to the 
comparative evolution of h(f and hc^ complexes, the 
appearance of the heme Cn, unique to hg/complexes is 
of interest (Nitschke et al. 2010). (3) There is a high 
degree of similarity between the structures of the h^f 
complex in cyanobacteria and the green alga, C. 
reinhardtii. The crystal structure of the hg/complex 
at 3.0 A resolution is essentially identical in the two 
cyanobacteria that have been analyzed and the green 
alga, C. reinhardtii, whose appearance on the Earth 
was approximately 10^ years later than that of the 
cyanobacteria (Cramer et al. 2004). In both structures, 
there are eight polypeptide subunits, four (petA, B, C, 
and D) “large” (MW = 18-31 kDa) and four (petG, L, 
M, and N) “small” (MW = 3.3^. 1 kDa) subunits 
(Table 1). The root mean square deviation (RMSD) 
in atom positions between the 3.0 A structures from M. 
laminosus and Nostoc structures, relative to that from 
the C. reinhardtii, are (a) for the Rieske [2Fe-2S] 
protein, M. laminosus native versus C. reinhardtii 
(the latter with the inhibitor, TDS), 1.25 A; M. 
laminosus TDS structure versus C. reinhardtii TDS, 
1.06 A; (b) without the Rieske iron-sulfur protein, 
M. laminosus native structure versus C. reinhardtii 
TDS, 0.98 A, and M. laminosus-'l'DS structure versus 
the C. reinhardtii-TDS: 0.99 A. 
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Cytochrome c Oxidase (Complex IV) 

Shelagh Ferguson-Miller 

Department of Biochemistry & Molecular Biology, 
Michigan State University, East Lansing, MI, USA 

Synonyms 

Cytochrome aa 3 ; Mitochondrial complex IV 

Definition 

Cytochrome c oxidase is the terminal enzyme in the 
electron transfer chain that provides an electron sink 
by reducing oxygen to water. 

Basic Characteristics 
Background 

Most aerobic organisms have evolved the ability to 
obtain energy from their environment by extracting 
electrons from foodstuffs and converting the electrical 
potential energy into a useful chemical form. When 
oxygen is reduced to water as the ultimate electron 
sink, a large redox potential drop is provided, maxi¬ 
mizing the free energy available. The enzyme that 
catalyzes this important reaction in most aerobic 
organisms is cytochrome c oxidase (CcO). The respi¬ 
ratory chain of eukaryotes, for which CcO serves as the 
final electron acceptor, is contained in the mitochon¬ 
drial inner membrane (Fig. 1). The potential energy 
released by this electron transfer process is captured as 
an electrochemical gradient across the hydrophobic 
membrane barrier, which can then be used to support 
ATP synthesis and other transport activities necessary 
for cellular function. Thus CcO is a critical component 
in the process of energy capture; therefore, control 
of its rate and efficiency are important targets of 
physiological regulation. 


Cytochrome c Oxidase (Complex IV), Fig. 1 Diagram of 
cytochrome c oxidase in mitochondrial inner membrane. 
Cytochrome c, red', hemes, black', magnesium, dark blue', zinc, 
pink', membrane bilayer, light blue 

Structural and Functional Overview 

Thirteen different peptides make up the CcO complex. 
They are encoded in both the mitochondrial genome 
(the three largest) and the nuclear genome (the ten 
smaller suhunits); hence, mutations in either genome 
can affect the structure and activity of the enzyme and 
can cause severe disease states. 

The process of reducing oxygen to water can lead to 
the generation of damaging reactive oxygen species, if 
it is carried out in an uncontrolled manner by single 
electron-addition steps (e.g., O 2 -i- le“ ^ superoxide). 
To prevent oxygen radical formation, the CcO cata¬ 
lyzed reaction is designed to perform a concerted four- 
electron reduction of oxygen to water, avoiding radical 
intermediates. This is accomplished by utilizing the 
unique characteristics of the two hemes and two copper 
centers in the enzyme (Fig. 1) to store and transfer 
electrons together as needed. The enzyme also protects 
the active site where the oxygen chemistry occurs by 
burying it in the transmemhrane region of the protein 
and surrounding the three catalytic subunits with an 
array of ten additional peptides in the eukaryotic 
enzyme (Fig. 2). Many specifically bound lipids 










424 


Cytochrome c Oxidase (Complex IV) 



Cytochrome c Oxidase (Complex IV), Fig. 2 Bovine cyto¬ 
chrome c oxidase monomer with specifically bound lipids. 
The 13-subunit monomer structure (PDB 2EIL chains N-Z) is 
shown, with associated lipids in dark red, buried hemes in white, 
and membrane region delineated by black lines. Subunits: I, dark 
blue', II, sienna’. III, blue’, IV, navy, \a., purple’, Vb, light green’. 
Via, green, VIb, olive, Vic, forest green’, Vila, yellow’, 
Vllb, gold’, VIIc, orange’, VIII, khaki (Drawing produced in 
Chimera (UCSF)) 

are also resolved in the crystal structures, which may 
contribute to the insulating layer surrounding the 
chemically reactive heart of the enzyme. 

Mechanism of Oxygen Reduction 

The mechanism of oxygen reduction by CcO has been 
studied by a variety of spectral techniques that are 
capable of monitoring, in micro and millisecond time- 
scales, the fast changes in redox state of the metal 
centers and the oxygen intermediates of the reaction. 
A scheme that describes current thinking on the steps 
involved in the overall process is shown in Fig. 3. 
Starting with a fully oxidized enzyme (O) with the 
active site structure [Fea 3 ^^- OH", Cub^^-OH“] the 
coupled transfer of two electron and two protons 
gives rise to the reduced (R) state with the active site 
structure [Fea 3 ^'^, Cub*^] and formation of 2 FI 2 O. In 
this state, O 2 can bind and is rapidly reduced by four 
electrons, breaking the 0-0 bond and resulting in the 
P state [Fea3"*'^=0^“, Cub*'^-OH, tyrosine‘-0] (called 
P because it was previously thought to be a peroxy 
intermediate). Input of one electron and proton gives 


the F (ferryl) state where the tyrosine radical is elimi¬ 
nated. One more electron/proton pair returns the 
enzyme to the O state. Thus the four electrons required 
for complete reduction of oxygen come from heme 
a 3 ( 2 ) Cub( 1) and formation of a neutral tyrosine radical 
[tyrosine‘-0] (1). A nearby tyrosine covalently linked 
to a histidine ligand of Cub is thought to be the source 
of this electron. Four protons are consumed in these 
reactions, to make 2 H 2 O, and the energy released by 
this exothermic reaction is conserved by pumping four 
protons across the membrane. 

Mechanism of Proton Pumping 

The sequence of reactions involved in oxygen reduc¬ 
tion is supported by considerable evidence; however, 
the mechanism of coupling of electron transfer and the 
oxygen chemistry to proton pumping is still controver¬ 
sial. One general scheme posits that each electron that 
enters the protein drives the uptake of two protons, the 
hrst via the D path, to be pumped, and the second to 
supply the proton for the oxygen chemistry. 
This scheme requires the use of several internal 
proton-loading sites, two of which are depicted in 
Fig. 3. These allow the protons to move in discrete 
steps in response to the sequential reduction of the 
metal centers, the movement being driven by charge 
compensation and directed by available facile routes of 
transfer. 

Proton transfer itself appears to be best described by 
a so-called Grotthuss mechanism, where protons move 
rapidly through hydrogen-bonded pathways created in 
part by protein architecture and in part by the position¬ 
ing of water molecules. Conformational change as 
a factor in controlling proton flow, and backflow, is 
generally considered a likely component, involving 
individual side-chain rearrangements or more substan¬ 
tial movements of the backbone. A remarkably well- 
conserved, organized water chain of nearly 25 A in 
length is seen in one of the proton pathways (D path) 
in structures of both bacterial and mammalian CcO. 
Mutations in bacterial CcO that alter the entrance, 
the end point, or residues in between, cause significant 
disruption of oxidase activity, either strongly 
inhibiting or completely uncoupling, suggesting the 
importance of the pathway structure and water organi¬ 
zation in the kinetics of proton flow. The path of 
pumped protons for exiting from the enzyme is not as 
evident as those for entry (D and K paths), from either 
structural or mutational studies, although several 
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Cytochrome c Oxidase (Complex IV), Fig. 3 Diagram of the 
oxygen reduction reaction. Diagram showing the flow of 
electrons {red arrows) and protons {blue arrows) through 
cytochrome c oxidase and the associated oxygen reduction inter¬ 
mediates (black boxes). Proton loading sites during transit 
through the protein: purple circles. Reaction intermediates: 

possibilities have been proposed. The question of 
whether reversibility of the exit path could play a role 
in physiological control of efficiency is an important 
question that is yet to he fully explored. 

Metabolic Role of CcO 

The critical physiological role of CcO in cell metabo¬ 
lism is clear from the drastic effects of loss of activity 
on cell and organismal viability. Although the factors 
that normally regulate its activity are not well under¬ 
stood, it is established that there are a number of 
phosphorylation sites on the mammalian CcO and at 
least two on its substrate cytochrome c. There is also 
evidence of regulation by molecules such as nucleo¬ 
tides and bile salts, binding sites for the latter being 
seen in crystal structures of both bacterial and bovine 
CcO. It is yet to be determined whether sites that 
are occupied by steroids in the artihcial conditions 
of crystallography may represent sites for other 
physiological regulators, such as nucleotides or 
porphyrins. 

As more high-resolution structures of membrane 
proteins have become available, the phenomenon of 
well-resolved lipid molecules associated with these 
proteins is commonly observed, but the case of bovine 
CcO is particularly striking, with 13 lipids resolved. 


O oxidized, R reduced, P peroxy, F ferryl. Each electron that 
moves from cytochrome c into the active site, via Cua and heme 
a drives one proton across the membrane and one proton 
to support the oxygen reduction process. Input pathways 
for protons are labeled D and K as defined in most bacterial 
oxidases 

including two cardiolipins and several triglycerides 
along with the more common membrane lipids 
(Fig. 2). Even the common phospholipids appear to 
be specifically bound, and further, the binding sites are 
conserved even in the bacterial CcOs. This raises new 
questions about the specihc structural and functional 
roles of conserved lipid binding sites in CcO. These 
questions and many others relating to control of CcO 
activity and efficiency are challenging subjects for 
future investigation, with important implications for 
such serious health problems as obesity, neurological 
disease, and aging. 


Cross-References 

► Electron Transfer Cofactors 

► Electron Transfer Proteins; Overview 

► Electron Transport Chains in Bacteria 

► Hemes 

► Mitochondrial Electron Transport 

► Proton-Coupled Electron Transfer 

► Reaction-Induced Infrared Difference Spectroscopy 

► Reactive Oxygen Species 

► Redox Potential 
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Chemistry, University of Bristol, Bristol, UK 

Synonyms 

CcP; Cyt c Peroxidase 


Definition 

A heme-containing enzyme that catalyzes the reduc¬ 
tion of hydrogen peroxide to water. 


Basic Characteristics 

Cytochrome c peroxidase belongs to the large enzyme 
family of the heme peroxidases present in most life 
forms. It can reduce hydrogen peroxide (H 2 O 2 ) to water 
by taking reducing equivalents from cytochrome c (or cyt 
c), a small heme-containing protein that is a component 
of the electron transport chain in mitochondria. When 
hydrogen peroxide enters the enzyme, proton transfer 
from one oxygen to the other and subsequent loss of 
water follow. These leave the heme as an active interme¬ 
diate, called Compound I, in which the iron is formally in 
the +IV oxidation state, and three unpaired electrons. 
Two of these are located on the FeO ferryl moiety. In 
contrast to other heme peroxidases, the third unpaired 
electron in cytochrome c peroxidase is located on the side 
chain of a tryptophan rather than on the heme group. 

QM/MM calculations have been used to investigate 
the electronic structures of Compound I intermediates in 
cytochrome c peroxidase and ascorbate peroxidase 
(Harvey et al. 2006), which features a tryptophan in a 
similar position that does not pick up an unpaired elec¬ 
tron. Even though the active sites of the two enzymes 
are almost identical, calculations using B3LYP/6-31G 
(and a double-zeta LACVP basis for the iron) for the 
QM region and CHARMM22 MM parameters for the 
enzyme environment correctly reproduce the different 
experimental electronic structures of Compound I. 

A combination of experimental methods and QM/ 
MM modeling has also been used by other workers to 
investigate the possible formation of a covalent bond 
between the tryptophan and the heme (Pipirou et al. 
2009). Although never observed directly by experi¬ 
ment, the combination of experimental data and QM/ 
MM modeling indicated that cytochrome c peroxidase, 
like ascorbate peroxidase and other class I heme per¬ 
oxidases, can allow formation of a covalent trypto- 
phan-heme link. 

Using a special implementation of the QM/MM 
approach, which involves selectively “turning on and 
off’ different residues in the quantum region (i.e., 
progressively changing which residues are included 
in the QM region), the pathway for long-range 
protein-protein electron transfer between cytochrome 
c peroxidase and cyt c has been studied (Guallar and 
Wallrapp 2008). A detailed electron transfer 
pathway was identified in this way, pinpointing the 
importance of specific residues on both proteins 
(► QM/MM Methods). 
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Systematic application of molecular dynamics sim¬ 
ulation and QM/MM minimization has been proposed 
to improve results of molecular “docking” (Burger 
et al. 2011). Docking is widely used in inhibitor design 
to identify rapidly possible ligands and their modes of 
binding to protein targets. The approach was applied to 
the docking of a set of binders and decoys in cyto¬ 
chrome c peroxidase. This led to a significantly 
improved discrimination between binders and decoys 
compared to a traditional docking approach, indicating 
the importance of including electronic polarization 
effects in docking refinement. 

Cross-References 

► QM/MM Methods 
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Definition 

Cytochrome P450 enzymes make up a superfamily of 
heme-containing enzymes typically involved in 


catalyzing hydroxylation, oxidation, and methylation 
of organic substances. 

Basic Characteristics 

Cytochrome P450s are a diverse and widespread group 
of heme-containing enzymes. They are potentially use¬ 
ful in biocatalysis as oxygenases. In humans, cyto¬ 
chrome P450s are primarily membrane-associated 
proteins that metabolize endogenous and exogenous 
compounds. Their involvement in drug metabolism is 
particularly important: They often catalyze the first 
reaction (Phase I) necessary for clearance of a very 
wide range of pharmaceutical compounds. In addition 
to affecting drug bioavailability, cytochrome P450s can 
be the cause of unwanted drug-drug interactions, and of 
variation in drug metabolism between individuals. 
Understanding the reactivity and selectivity of P450 
enzymes is therefore important, and quantum mechan¬ 
ical (QM) electronic structure and combined QM/MM 
methods have been widely employed for this purpose 
(Shaik et al. 2010). In QM/MM calculations on these 
enzymes, the complex electronic structure of the heme 
group can be treated using reliable QM methods (e.g., 
B3LYP density functional theory), while the enzyme 
environment is treated MM. This allows large models 
of the proteins to be studied, which can be important to 
investigate potential differences between isoenzymes 
(Lonsdale et al. 2011). QM/MM methods are also 
proving useful in analyzing the determinants of drug 
metabolism, for example, in explaining why dextrome¬ 
thorphan reacts via O-demethylation, and not aromatic 
carbon oxidation, in human cytochrome P450 2D6 
(Olah et al. 2011). Applications of reaction modeling 
of drug-metabolizing P450s are described in the entry 
on “► Drug Metabolism - Computational Studies of 
Mechanisms.” A recent tutorial review on the applica¬ 
tion of ► QM/MM methods to enzyme reactions used 
cytochrome P450s as a specific example, and provides 
a practical guide to the modeling of reactions in these 
enzymes (Lonsdale et al. 2012). 

Cross-References 

► Drug Metabolism - Computational Studies of 
Mechanisms 

► QM/MM Methods 
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Synonyms 

Hemoproteins 


Definition 

Cytochromes are proteins containing one or more 
heme groups that are involved in electron transfer. 

Introduction 

The term cytochrome was first coined by David Keilin 
(1925) to describe a set of intracellular heme-binding 
proteins that display redox activity with the formation 
of intense absorption bands between 510 and 615 nm 
upon reduction. It has subsequently come to be used as 
a general term for proteins which contain one or more 
► hemes. However, many hemoproteins possess cata¬ 
lytic activity, utilizing either the heme group or 
another protein-associated cofactor to effect an enzy¬ 
matic reaction. This may be exemplified by the ► cyto¬ 
chrome P450 family of enzymes, which use the heme 
group to activate dioxygen and carry out mono¬ 
oxygenation of substrates. Accordingly, the usage of 


“cytochrome” should be reserved for referring to 

► electron-transfer proteins involved in processes 
such as respiration and photosynthesis, accomplished 
by the reversible change in oxidation state of the 

► electron-transfer cofactor. Eormally, this change in 
heme oxidation state is the reversible equilibrium of 
the iron atom between Ee(II) and Ee(III). 

Classification and Nomenclature 

Cytochromes are classified in terms of their heme type, 
the mode of heme-iron axial ligation, and the amino 
acid sequence similarity between members of the 
family. This leads to a set of nomenclature as 
recommended by the Nomenclature Committee of the 
International Union of Biochemistry (NCIUB 1991). 

Heme Types and Ligation 

The heme group is a tetrapyrrolic chelate of iron, with 
its structure and mode of attachment to the protein 
varying among members of the cytochrome family. 
These differences lead to the designation of cyto¬ 
chromes as members of groups a, h, c, or d. Cyto¬ 
chromes a bind heme a (iron cytoporphyrin IX) while 
cytochromes h and cytochromes c both contain iron 
protoporphyrin IX (lUPAC-IUB Joint Commission on 
Biochemical Nomenclature 1986). The heme is non- 
covalently attached to the protein in the case of the 
f)-type cytochromes, whereas in cytochromes c it is 
covalently attached via thioether linkages between 
one or two cysteine residues of the protein and heme 
vinyl groups. Cytochromes d possess more variation 
among their heme group structures, but generally contain 
less conjugation in the tetrapyrrole moiety. These include 
dihydroporphyrin (heme d) and tetrahydroporphyrin. 
The stmctures of hemes a-d are shown in Pig. 1 . 

Where a heme iron is axially ligated by two strong- 
held ligands (such as certain amino-acid side chains or 
exogenous ligands), a low-spin iron-porphyrin com¬ 
plex results, this is known as a hemochrome and is 
designated as such using a lower-case italicized letter 
(e.g., cytochrome c). If this hemochrome ligation does 
not exist in both oxidized and reduced forms of the 
protein, then this is indicated by a primed letter (e.g., 
cytochrome c'). Por certain well-characterized cyto¬ 
chromes, consecutive subscript numbering is used to 
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Cytochromes, Fig. 1 The structures of hemes a, h, c, and d (panels a-d respectively) 
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differentiate between them (e.g., cytochrome Ci, cyto¬ 
chrome C 2 etc.). This system becomes more compli¬ 
cated in the case of cytochromes with more than one 
heme group. 

The amino acid side chains used as axial ligands in 
cytochromes have a profound effect on the ► redox 
potential, and hence reactivity, of the heme iron. The 
most common ligands are histidine and methionine, 
but (especially in the case of hemes which also act as 
catalytic centers) other residues are utilized. These 
include tyrosine (the di-heme of cytochrome cdi nitrite 
reductase), lysine (the active site of bacterial 
cytochrome c nitrite reductases), and cysteine 
(heme-thiolate proteins, e.g., cytochrome P450, 
► nitric oxide synthase). In these cases the heme is 
typically five-coordinate to allow substrate binding. 


Classification by Visible Absorption Spectra 

The environment of the heme group in cytochromes 
determines the spectral characteristics of the protein, 
thus introducing a further method by which they may 
be classihed. In the case where a protein is not simply 
named after the type of heme bound (and additionally by 
sequential subscript numbering as mentioned above), 
the convention is to use the position of the a-band in 
the visible absorption spectrum, e.g., cytochrome C554. 


Cytochromes a 

The best-known example of an a-type cytochrome is 
cytochrome aa^, also known as ► cytochrome c oxi¬ 
dase. This enzyme complex (Complex IV in the 
► mitochondrial electron transport chain) varies in pro¬ 
tein composition, ranging from as little as three subunits 
in some bacteria to 13 subunits (per monomer; the 
enzyme is dimeric) in mammals. Although cytochrome 
aaj, catalyzes the four-electron reduction of O 2 to H 2 O 
with mitochondrial cytochrome c as the electron donor, 
its fundamental role in aerobic respiration renders it 
distinct from other heme-dependent enzymes. Further¬ 
more, the enzyme couples oxygen reduction to trans¬ 
membrane proton pumping (Muramoto et al. 2010). 

Cytochrome c oxidase contains several redox cen¬ 
ters; in addition to the low-spin a-type hemochrome 
there is a binuclear copper center (the Cua site) where 
the enzyme is reduced by cytochrome c, and 
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Cytochromes, Fig. 2 The catalytic core of cytochrome c oxi¬ 
dase with heme on the left and heme a on the right 


a binuclear heme a 3 -CuB center (Fig. 2). Electron 
transfer through the protein proceeds via heme a to 
the site of oxygen binding and reduction, the heme 
a 3 -CuB center. This center represents an unusual 
arrangement and is comprised of the high-spin histi¬ 
dine-ligated five-coordinate heme and a tris-histidinyl- 
ligated copper atom, with dioxygen binding as 
a bridging ligand between the two metal ions. The 
catalytic cycle is believed to involve a high-valent 
Fe”^"^ oxyferryl species at the a 3 -heme (Compound F). 

In addition to cytochrome c oxidase, there are few 
known examples of a-type cytochromes, but these 
include the cytochrome ai from Nitrohacter which is 
involved in nitrite oxidation (Bock and Koops 1992). 


Cytochromes b 

Cytochrome h is the central redox subunit in both the 
► cytochrome hci complex (ubiquinolxytochrome c 
oxidoreductase, mitochondrial respiratory Complex III) 
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and the ► cytochrome hg/complex (plastoquinohplas- 
tocyanin reductase of the photosynthetic apparatus). 
The enzyme acts as an energy transducer, coupling 
electron transfer from a hydroquinone derivative to an 
acceptor protein with proton translocation across the 
membrane. The resulting transmembrane potential is 
used to drive production of ATP via ATP synthase. 
The composition of the cytochrome bci complex varies 
depending upon the source organism, with bacterial 
complexes typically dimers of the cytochrome b, cyto¬ 
chrome Cl, and Rieske iron-sulfur proteins, while 
eukaryotic enzymes may have an additional 6-8 sub¬ 
units per monomer. The cytochrome b subunit contains 
two bis-histidinyl-ligated hemes that have quite differ¬ 
ent reduction potentials. Consequently these centers are 
referred to as the b^ (high potential, also known as 
cytochrome bs^o or 6562 ) and b]^ (low potential; cyto¬ 
chrome hggg or hggg) hemes. In the case of both respira¬ 
tory and photosynthetic roles, the mechanism is 
complex but similar. For discussion of this, see e.g., 
Cramer et al. (2006) and Hunte et al. (2008). 

There are several other examples of widely studied, 
well-understood h-type cytochromes, including cyto¬ 
chrome P450 and nitric oxide synthase. However, due 
to their primarily catalytic function these will not be 
discussed here. Perhaps the best-known example of an 
electron-transfer h-type cytochrome is cytochrome hg, 
a bis-histidinyl-ligated monoheme protein found in 
animals, plants and fungi. In the case of microsomal 
and mitochondrial cytochromes hg the protein is mem¬ 
brane-bound, while a soluble variant exists in other 
animal tissues and red blood cells. The protein has its 
own electron donor in cytochrome hg reductase, an 
► NADH-dependent flavoprotein. The cytochrome hg 
protein fold (Argos and Mathews 1975) is also com¬ 
monly found as a hemoprotein domain in larger 
enzymes such as flavocytochrome ^2 (yeast L-lactate 
dehydrogenase). The widespread nature of cytochrome 
hg is reflected in the variety of reactions it is involved 
in, including fatty acid desaturation and the mainte¬ 
nance of hemoglobin in its ferrous state. However, one 
of its major roles appears to be as an electron donor to 
cytochrome P450, thus effecting hydroxylation and 
detoxification reactions (Finn et al. 2008). 

Another member of the family of h-type cyto¬ 
chromes is cytochrome hg 62 , a soluble monoheme pro¬ 
tein in which the heme iron is axially ligated by one 
histidine and one methionine residue (Fig. 3; Arnesano 
et al. 1999). This ligation results in a higher reduction 



Cytochromes, Fig. 3 The structure of cytochrome 


potential than those typically displayed by bis-histidinyl- 
ligated heme groups, thus thermodynamically allowing 
access to different donor/acceptor redox couples. 

Finally, worth mentioning due to its unusual and 
distinctive structure is bacterioferritin or cytochrome 
bi. This oligomeric protein binds heme via two axial 
methionine residues and is thought to be involved in 
iron uptake and storage (Keren et al. 2004), among 
other functions. 

Cytochromes c 

The c-type cytochromes are distinct from others due to 
the presence of a covalent thioether linkage between the 
porphyrin vinyl groups and cysteine residues from the 
protein (Fig. 1). These cysteines are typically part of 
a CXXCH motif in the protein sequence (where X = 
any amino acid), but it has been shown that this may 
vary such that as many as four amino acids may sepa¬ 
rate the cysteines. Heme attachment is accomplished 
via one of a number of different sets of apparatus, and 
this is species-dependent (Allen and Ferguson 2006). 
The prototypic c-type cytochrome is cytochrome c, 
a small low-spin His/Met-ligated monoheme protein 
that serves as the reductant for respiratory terminal 
oxidases as well as an electron-transfer partner for 
various redox enzymes (Fig. 4; Bertini et al. 2006). 
Cytochrome c is a member of the Class I (Ambler 
1991) cytochromes c, with Class II including the high- 
spin cytochrome c' and proteins such as cytochrome 
Cggg that are structurally more similar to the His/Met 
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Cytochromes, Fig. 4 The structure of mitochondrial cyto¬ 
chrome c 

ligated cytochrome than they are to the Class 
I proteins. Class III encompasses the low-potential 
multiheme cytochromes c, and includes the tetraheme 
cytochromes C3 and triheme cytochromes Cy. These 
proteins are typified by bis-histidine ligation and 
a relatively low ratio of proteinrheme content (^30-40 
residues per heme). Class IV cytochromes c were orig¬ 
inally defined by Ambler as those proteins which contain 
additional cofactors, but are now commonly referred to as 
tetraheme proteins, with one bis-histidine- and three his- 
tidine/methionine-ligated hemes, that are structurally 
related to the protein from the bacterial photosynthetic 
reaction center (Moore and Pettigrew 1990). 

In addition there are many c-type cytochromes 
which are considered to belong to classes of their 
own. One example of this is the octaheme hydroxyl- 
amine oxidoreductase, one of a growing number of 
characterized multiheme enzymes that also includes 
the bacterial pentaheme nitrite reductase (CcNiR) and 
the Shewanella octaheme tetrathionate reductase 
(OTR). While such enzymes contain one five- 
coordinate heme group which forms the substrate bind¬ 
ing site, there are numerous c-type hemes which serve 
as electron reservoirs (or sinks, depending upon the 
reaction catalyzed). Interestingly, in these three pro¬ 
teins the active site heme is unique in terms of its 
protein attachment (a cross-linked P460 in HAO, 
lysine-ligated in CcNiR and OTR). Despite no clear 
sequence or global structural homology between these 
three proteins, their hemes are essentially 


Cytochromes, Fig. 5 The structure of dimeric cytochrome cdi 
nitrite reductase. The tyrosine-ligated dj heme is indicated 

superimposable, supporting the idea that there are con¬ 
served structural modules for heme arrangement. 
These are typically close-packed near-parallel and 
near-perpendicular pairs, and it has been proposed 
that electron transfer through the enzyme(s) is facili¬ 
tated by such arrangements. Given that these enzymes 
carry out multi-electron redox reactions, such as the 
six-electron reduction of nitrite to ammonia, efficient 
transfer of electrons to or from the active site heme 
helps prevent the release of intermediates during catal¬ 
ysis (Mowat and Chapman 2005). 

Also worthy of mention is cytochrome /, a c-type 
cytochrome which forms part of the photosynthetic 
cytochrome h^f complex (see cytochromes h above). 
Cytochrome / is closely related to the cytochrome C\ 
subunit of the respiratory cytochrome hci complex. In 
both cases the fi-type subunit is reduced by quinols and 
passes reducing equivalents to the other subunit (ci or/), 
which relays the electrons to the acceptor (cytochrome c 
in respiration, plastocyanin in ► photosynthetic electron 
transport). Both cytochrome C\ and cytochrome / 
are structurally similar to Class I cytochromes c, with 
the distinction that in cytochrome / the heme iron is 
axially ligated by one histidine residue and the 
N-terminal tyrosine of the protein, rather than the typical 
methionine. 

Cytochromes d 

Examples of cytochromes that incorporate a cZ-type 
heme include the cytochrome hd complexes. 
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► bacterial respiratory oxidases produced under hyp¬ 
oxic conditions by certain species, e.g., Escherichia 
coli (Mason et al. 2008). Apart from these there are few 
examples of ri-type cytochromes, with perhaps the best 
known being the cytochrome cdi nitrite reductase from 
organisms such as Thiosphaera pantotropha and Pseu¬ 
domonas aeruginosa that catalyzes the reduction of 
nitrite to nitric oxide. The -heme-binding domain 
of the enzyme has a P-propeller structure, with the 
heme ligation being dynamic (Williams and Fulop 
1997). In the oxidized T. pantotropha structure the 
r/j-heme is ligated by one histidine residue and, unusu¬ 
ally, a tyrosine side chain from the c-domain of the 
enzyme. In the reduced enzyme the tyrosine ligand is 
lost, providing a substrate-binding site, and this 
rearrangement is accompanied by a change of ligation 
at the c-heme from bis-histidine to His/Met. In the 
structure of the oxidized Pseudomonas enzyme, the 
c-heme is His/Met-ligated, while the fifi-heme is 
observed ligated by one histidine and a hydroxyl ion 
which is hydrogen-bonded to the distal tyrosine resi¬ 
due. It has been proposed that ligand switching phe¬ 
nomenon is important for regulation of activity. 

Summary 

This entry describes the cytochromes that are crucial 
for biological electron transport. The similarities and 
differences between these heme-containing proteins 
are clearly illustrated. Examples are used to show 
how the protein environment around the heme group 
serves to tune the properties of the protein, thus deter¬ 
mining the functional diversity. 

Cross-References 

► Bacterial Respiratory Chains 

► Coupled Electron Transfer 

► Cytochrome /je/Complex 

► Cytochrome c Oxidase (Complex IV) 

► Cytochrome P450 - Computational Studies 

► Electron Transfer Cofactors 

► Electron Transfer Proteins: Overview 

► Electron Transport Chains in Bacteria 

► Hemes 

► Mitochondrial Electron Transport 

► Nitric Oxide Synthase - Computational Studies 


► Photosynthetic Electron Transport 

► Redox Potential 
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Synonyms 

Double electron-electron resonance (DEER); 
PELDOR and DEER; Pulsed electron double reso¬ 
nance (PELDOR) 

Definition 

DEER spectroscopy allows the dipolar interaction 
between two immobilized paramagnetic centers to be 
measured, from which their relative orientation and 
distance (^15-80A) can be calculated. 


Introduction 

Distance and orientation information between two 
paramagnetic centers can be obtained from the dipolar 
spectrum measured with double electron-electron 
resonance (DEER) spectroscopy, also known as pulse 
electron double resonance (PELDOR) (Berliner 
et al. 2000; Schweiger and Jeschke 2001; Schiemann 
and Prisner 2007; Smith et al. 2010). The technique is 
applicable to the study of long-range order in powder 
and frozen solution samples when the two paramag¬ 
netic centers have a separation in the range ^15-80A. 
A single-crystal sample is not required. Molecules, 
or more correctly the dipolar interaction, should be 
static on the timescale of the experiment. 

The technique requires two (or more) paramagnetic 
centers and in this entry we consider metalloproteins with 
at least one intrinsic paramagnetic center, either an 
organic radical, metal center or metal cluster. If the 
protein under study is diamagnetic, then spin labels can 
be added, typically by site-directed mutagenesis (Bohme 
et al. 2010). The discussion of double-quantum coher¬ 
ence electron paramagnetic resonance (DQCEPR), 
which can access distances down to ^lOA (Saxena and 
Freed 1997), or techniques for transient radical pairs 
generated, for example, by a laser flash, are outside the 
scope of this chapter (Moebius and Savitsky 2009). 

DEER Sequence 

The DEER pulse sequence is designed to separate 
the dipolar interaction from other interactions of a 
paramagnetic center with its environment, such as the 
electron Zeeman and hyperhne interactions. Two pulse 
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sequences are routinely used (Fig. 1): a three-pulse 
version established first by (Milov et al. 1981) and 
later a four-pulse version was introduced with the 
advantage of zero dead-time (Pannier et al. 2000). 
Both sequences employ two microwave (mw) frequen¬ 
cies to measure the dipolar coupling. In the three-pulse 
sequence, pulses at frequency v^et applied to the detec¬ 
tion-spins produce a primary echo at time 2 t. The 
7 t-pulse at frequency Vp^mp inverts the pump-spins 
changing the local magnetic field at the detection- 
spins. A DEER trace is recorded by monitoring the 
detection-spin echo height with a fixed t value while 
the pump pulse is moved within the first t evolution 
period. Because the pump-spin 7 t-pulse changes the field 
at the detection-spins, the primary echo is not fully 
refocused and is consequently modulated by dipolar 
coupling frequencies. Typically, the first ^40 ns of the 
time-trace is obscured due to overlap of the detection- 
and pump-spin pulses at time t — 0 (to). This shortcom¬ 
ing is overcome with four-pulse DEER by an additional 
refocusing pulse for the detection-spin subsequence; the 
sequence is however longer and less sensitive. 


Theoretical Basis 


At the heart of the DEER technique is the electron- 
electron dipolar interaction between the magnetic 
moments of two spins 1 and 2 separated by distance ri 2 . 
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with the Hamiltonian 


Hi2 = SiDi2 S 2 

= ^gig2{A+B + C + D + E + F) 
2h 

A = SiJ2z{l - 3cos20) 

B = - l(Si+S2- + 5 i_S 2+)(1 - 3cos20) 
C = — |(Si+S 2 z -I- Siz52+) sin 6cos 9e^“^ 
D = — |(5 i_52z + SizS 2 -) sin 6cos 9e^"^ 
E = - lSi+S2+sin^9e-^‘‘^ 

E = - lSi-S2-sm^9e+^‘'f 


Jl/2 71 



^pump 
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b 

71/2 71 71 



DEER of Metalloproteins, Fig. 1 (a) Three-pulse DEER. The 
subsequence at v^et is applied to the detection-spins, and 
a 7i-pulse at frequency Vpun,p is applied to the pump-spins 
and moved within the first r evolution period, (b) Four-pulse 
DEER. The refocused echo from the detection spins at time 
2 ti + 2 t is observed as a function of time (t), where tg = 2 ti is 
outside the observation dead-time 


where gi/g 2 are the g-matrices, gi/g 2 are the g-values 
along the spectrometer field direction Bq, ni 2 is the unit 
vector parallel to the 1 — 2 direction, 9 is the angle 
between ni 2 and Bq, S = [5^: Sy 5z] are the spin 
vectors and S+/S- are the raising and lowering oper¬ 
ators. T represents a transpose. The Heisenberg 
exchange coupling, //exchange = 81 / 1282 , can usually 
be ignored as most systems of interest involve 
localized spins separated by >15A. Terms C, D, E, 
and F are not important for the dipolar energy splitting. 
Term-A describes the secular part of the dipolar 
coupling. Term-// describes the pseudosecular contri¬ 
bution which can be neglected provided the dipolar 
coupling is small compared to the Zeeman interactions 
of the spins (as at X-band with spins separated by 
>15A) and the g-anisotropy is small. Paramagnetic 
centers with transition metals (e.g., Cu/Ni/Co) or 
metal clusters usually have considerable g-anisotropy 
and thus term B can become significant depending 
upon where the detection and pump pulses are 
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positioned and the relative g-matrix orientations of the 
two paramagnets. For example, consider two cen¬ 
ters with axial g-matrices (gn = 2, gj^ = 2.2) separated 
by r = 25A. Using pump/detection pulses exciting gn/ 
g±, respectively, the error in going from the full 
expression to the secular approximation is anticipated 
to be Ar ^ lA or less at X-band. If only the A-term is 
retained a simple expression results. 


Mo/5 

^12 = ^8182 


3cos20'" - 1 


471/7 


( 2 ) 


The index m denotes a single orientation of Bo™ 
relative to ni 2 (Fig. 2), g-values gi™ and g 2 ™ are for 
detection-spin and pump-spins at orientation Bq™, 
respectively. 

A DEER time-domain trace y{t) oscillates as: 



DEER of Metalloproteins, Fig. 2 Geometric parameters for 
Eq. 1 showing detection-spin, pump-spin, and angle 9"' relative 
to Bo™ and ni 2 


yit) = - Cod(l - cos(27tv™ 0)) (3) 

m 


The term describes the number of detection- 
spins excited at orientation B™, and is the oscil¬ 
lation depth which quantifies the extent to which the 
pump 7t-pulse is resonant with the pump-spin (assum¬ 
ing only interacting spin pairs). To account for the 
random distribution of molecules in frozen-solution 
samples, a summation must be performed over all 
orientations of the molecules with resonant detec¬ 
tion-spins. DEER measurements at X-band made 
between two nitroxide radicals often excite all possi¬ 
ble orientations (albeit not all spins) and a complete 
powder-averaged dipolar spectrum (Pake pattern) 
results (Fig. 3b). The distance can then be extracted 
from the Vj^ feature (where 0™ = 7t/2 in Eq. 2) in the 
dipolar spectrum, and 


v± = 


52063 


MHz 


(4) 


when gj 2 — 2.0036 and r is in A (Fig. 4). Several 
sophisticated analysis methods exist for this case 
(Jeschke et al. 2006). 

For accurate calculations, the spin density distribu¬ 
tion of the center needs to be considered. 


Ms(AUz) = Pa{x,y,2) - Pij(x,y,z). (5) 


where and pp are the sums of the electron densities 
at each point in space with spin-up and spin-down, 
respectively. In paramagnetic centers where the spin 
ground-state results from spin-coupling, as for exam¬ 
ple in FeS-clusters, the at each iron can be modeled 
with spin projection factors (Bencini and Gatteschi 
1990). For a delocalized spin distribution, as in a Cu 
(II) complex coordinated by nitrogens, a ps value can 
be attributed to each nucleus. Dipolar frequencies can 
then be calculated considering a sum of point-dipole 
contributions for each paramagnetic center 


''12 




3cos^ 


- 1 


( 6 ) 


where kn and k 2 j model the spin density at nucleus i(j) 
of paramagnetic center 1(2), or are spin projection 
factors as depicted in Fig. 5. 

If a limited number of orientations of the detection- 
spin and/or pump-spin system contribute because the 
mw pulses only partially excite the corresponding EPR 
spectra, then a limited angular range of dipolar 
frequencies is measured and the analysis must consider 
explicitly this reduced orientation set. Provided indi¬ 
vidual DEER traces can be recorded such that suffi¬ 
cient orientations are sampled then a full analysis will 
deliver not only distance information but also the 
orientation of the spin-spin vector relative to the 
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DEER of Metalloproteins, Fig. 3 (a) Model system for two 
spins, (b) Simulated DEER time-domain trace and its Eourier 
transform without any orientation selection. All time-domain 
data are scaled to have amplitude 1 at r = 0, 0.2 on the y-axis 
scale is indicated by the vertical double-headed arrow. C.1-C.3: 
Simulated DEER data with orientation selection. All orienta¬ 
tions of the detection-spins with isotropic g-matrix are excited 
but not uniformly because of the A(*"^N) hyperiine coupling. The 


pump mw frequency is varied to select different spin-spin 
orientations (9"' values). The position and excitation profile of 
the pump (red-line) and detection (blue-line) pulses are shown 
(left) together with their corresponding EPR spectrum. Simula¬ 
tion C.l shows excitation around gx where 9"' = 7i/2°. The time- 
domain trace and its Fourier transform reflect the strong 
feature in the dipolar spectrum, simulation C.3 selects orienta¬ 
tion around gn and thus the Vn feature at ff" = 0° 


g-matrix reference frames of the detection-spin and 
pump-spin (Fig. 3c). This orientation selection 
scenario is the typical situation with metal clusters or 
transition metal centers at X-band, or with organic 
radicals at approximately W-band and above. 


Tm and T■^ Relaxation Requirements 

The electron phase memory 7 m and spin-lattice Tie 
relaxation times are critical parameters. Tm can be mea¬ 
sured from a primary echo sequence (7t/2-T-7t-T-echo), 
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DEER of Metalloproteins, Fig. 4 Dipolar frequency (blue 
solid) and the corresponding time-trace period T {red dashed) 
versus the electron-electron distance. To record one oscillation 
requires t = T so Tm >2T (need to record an echo at 2 t, Fig. 1, 
Ty[ is defined in the “Relaxation Requirements” section) 



DEER of Metalloproteins, Fig. 5 Example of a spin-coupled 
[2Fe-2S]-cluster with S“‘“‘ = 1/2, k 2 j{Fe^*) = 7/3, 

k 22 {Fe.^*) = —4/3, and an axial g-matrix. The NO' spin is 
represented hy an isotropic g-matrix at a point, kn = \ 

which monitors the decay of the transverse magnetiza¬ 
tion. Tie can he measured, for example, using an 
inversion-recovery sequence (Schweiger and 
Jeschke 2001). The Tm time of the detection-spins 
needs to be sufficiently long to observe at least one or 
preferably several dipolar oscillation periods so that 
a reliable distance measurement results (Fig. 4). 
Ultimately, Tm dictates the maximum distance measur¬ 
able. The important parameter for the pump-spins is T le 


since the 7i-pulse at frequency Vpu„p inverts this spin’s 
magnetic field, which then should be constant during the 
pulse sequence so that the magnetic field af the detection- 
spin remains constant. 

A good reference for typical Tie and Tm times has 
been compiled by the (Berliner et al. 2000). EPR of 
paramagnetic metal clusters and centers typically 
requires temperatures in the range ^5-20 K, much 
lower than the ^50-80 K appropriate for organic rad¬ 
icals. Measurements of organic radicals in the presence 
of nearby paramagnetic centers, like high-spin Fe^"^, 
are achievable since their relaxation times are typically 
too fast (sub ns above ^50 K) to dramatically affect 
organic radical relaxation. The fluctuating fields 
caused by a nearby metal center, for example, Fe^"^ 
(5 = 5/2), have a spectral density (frequencies) 
governed by its Ti, if these frequencies are much 
above the resonant frequency of the organic radical 
(^9.5 GHz at X-band) then the mechanism is not an 
efficient Ty (or T 2 ) relaxation process. A modest short¬ 
ening of the organic radical’s Ti time often results, this 
may be advantageous since the experimental repetition 
time can be increased. 

Spectrometer Frequency and Bandwidths 

The most important measurement considerations are 
the EPR spectral widths of the two paramagnets com¬ 
pared to the maximum frequency difference needed 
between the pump and detection pulses Iv^et — VpumpI, 
while still maintaining an acceptable mw excitation 
power. Successful measurements typically require 
mw TT-pulses less that ^40 ns given the requirement 
for the pulse bandwidth (f.w.h.h. "^0.6/fpuise [Hz]) to be 
broader than the dipolar coupling. At X-band, com¬ 
mercial 3 mm split-ring resonators can be over¬ 
coupled to allow Ivjet — Vpurnpl Separations of up to 
^400 MHz while still achieving n-pulse times of fpuise 
^ 20 ns. Presently, as the mw frequency is increased, 
the available mw power drops dramatically, but high 
power systems at W-band are being developed (Smith 
et al. 2010). Bandwidth limitations imposed by the use 
of resonators can be overcome by jumping the field 
rather than the mw frequency, however this is techni¬ 
cally very challenging (Schweiger and Jeschke 2001). 

As metal centers usually have broad EPR spectra 
already at X-band, high-frequency spectrometers are 
not required and measurements would suffer from poor 
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sensitivity as the EPR spectrum spreads further. This 
would particularly affect Cmod (Eq. 3). Many para¬ 
magnetic centers involving metals are too broad and 
unsuitable, an important example being high-spin 
Fe^"^ with effective g-values of g = 2 to g = 4-5 
equating to an X-band width of ^5 GHz; added to 
this difficulty are the very short relaxation times. 
Low-spin metal centers (e.g., Co(II), Cu(I), Ni(I/III)) 
and spin-coupled metal clusters exhibit EPR 
spectra which are often only slightly too broad at 
X-band (^1 GHz) to allow pump and detection pulses 
to be separated by the total EPR spectral width. 
However, within certain constraints, reliable distance 
estimates in these systems can be achieved (see 
later). High-field spectrometers possessing high mw 
power are ideally suited for organic radical measure¬ 
ments with examples up to 180 GHz (Denysenkov 
et al. 2006). 

Examples 

For S — 1/2 organic radicals, the g-matrix anisotropy is 
small and X-band spectrometers are able to uniformly 
excite both pump- and detection-spins. At approxi¬ 
mately W-band and above, the EPR spectrum gener¬ 
ally becomes too broad to be uniformly excited by 
a single mw pulse, but good orientation-selective 
measurements are possible (see Denysenkov 
et al. (2006) for a study at 180 GHz). Several examples 
of DEER experiments employing metal to organic 
cofactor or spin label have been reported. Examples 
include the oxygen-evolving manganese cluster (OEC) 
of PSII using the tyrosin D cation radical (YD"^') with 
the S2 state (S — 1/2) or SO state of the manganese 
cluster (Hara et al. 1996) and the protein complex 
formed between palustrisredoxin reductase (PuR) and 
palustrisredoxin (Pux-B) studied by measuring 
distance and orientation between the reduced 
[2Fe-2S] cluster of Pux-B and a MTS spin-label 
attached to PuR (Lovett et al. 2009). Calculations 
involving FeS clusters require inclusion of spin pro¬ 
jection factors to describe the magnetic coupling. 
The first dipolar measurement between two metal 
sites determined a Ni (S = 1/2) to [SFe^S]"^ cluster 
(S = 1/2) distance of 18A for the [NiFej-hydrogenase 
from Desulfovihrio vulgaris Miyazaki F (Elsasser 
et al. 2002). In Mitochondrial complex 1 measurements 
on the paramagnetic [4Fe-4S] and [2Fe-2S] clusters 


were used to assign the redox properties of the FeS 
clusters to their positions in order to obtain a profile of 
the FeS cluster electron transfer chain (Roessler et al. 
2010). If the g-matrix anisotropy and corresponding 
EPR spectrum are broad as compared to the maximum 
pulse separation, then it is impossible to probe all 
possible orientations. However, reliable distance mea¬ 
surements can be recorded in certain cases, such as 
between two Cu(II) centers with approximately axial 
EPR spectra by pumping and detecting around the 
region to record the feature (van Amsterdam et al. 
2003). 

High-spin systems like Fe^"^ or Mn^"^ (5 = 5/2) are 
very challenging due the zero-field interaction and thus 
the dependence of the dipole frequencies upon the EPR 
transitions excited, and the short Tie and Em times. 
A strategy which can be successful focuses on the 
ffJs = l/2«-l/2 transition, which is relatively sharp 
provided the zero-field interaction is small, and yields 
dipole frequencies as for the S = 1/2 case (i.e., Eq. 1 
can be used). Nanometer-scale distance measurements 
in proteins using Gd^"^ spin labeling and the narrow 
ffJs = l/2»-‘-l/2 transition have been demonstrated 
(Potapov et al. 2010). 

Summary 

Distance measurements between paramagnetic metal 
centers or metal cluster centers typically have to nego¬ 
tiate broad EPR spectra and fast electron relaxation 
times. Successful measurements are possible provided: 
(1) the Tm and Tie times are long enough to record the 
dipolar oscillations, (2) the mw excitation power is 
sufficient to provide pulse bandwidths larger than the 
dipolar coupling over a frequency range comparable to 
the EPR spectrum to allow sufficient orientations to be 
probed from a set of DEER traces, and (3) the spin 
density distribution is known. Orientation selection 
can readily be included in the analysis to provide 
distance and orientation information. 

Cross-References 

► Chemistry of Spin Labeling 

► Double Electron-Electron Resonance (DEER) 

► EPR Spectroscopy: General Principles 

► Interspin Distance Determination by EPR 
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► Iron-Sulfur Clusters 

► Spin-Labeling EPR of Proteins: Dynamics and 
Water Accessibility of Spin-Label Side Chains 

► Spin Labeling of DNA and RNA 
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Introduction 

Over the last century, synthetic polymers have revolu¬ 
tionized our lives and a recent arrival to this chemistry 
class is the dendrimer. Dendrimers (Fig. 1) are highly 
branched synthetic polymers with a narrow size distri¬ 
bution range, which were first synthesized by Voegtle’s 
group at the end of the 1970s (Buhleier et al. 1978). 
Other workers furthered their development in the 1980s 
(Tomalia et al. 1985), christening these new chemistries 
dendrimers or arborols. Their unique properties, con¬ 
trolled molecular weight and size, an abundance of 
reactive functional groups, and the ability to participate 
in host-guest associations, have led to a boom in 
dendrimer research in helds as diverse as drug and 
gene delivery, tissue engineering, synthetic processing, 
and diagnostics. This entry focuses on the use of 
dendrimers in drug and gene delivery. 

Dendrimers: What Makes Dendrimers 
Different? 

In contrast to linear polymers, the synthesis methods 
used to produce dendrimers allow the formation of 
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Dendrimers for Drug Delivery, Fig. 1 Dendrimer structure. 
This hypothetical dendrimer is formed by three covalent root 
attachment points but other common cores have di- or tetra- 
attachment points. The number of core attachment points 
determines the number of branches and the symmetry of the 
molecule. In this case, each new generation will be formed 
by the covalent binding of two monomers to each branching 
point. Generation count is not always consistent: normally 
generation 0 refers to the core while sometimes is used 
to define the dendrimer after the first reaction cycle. 

molecules with a narrow polydispersity (Mw/Mn 
<1.05), a well-defined globular shape, and a high den¬ 
sity of peripheral functional groups (Tomalia 2004) 
(Fig. 1). 

Dendrimers have been successfully synthesized 
from several materials, such as poly(amidoamine) 
(PAMAM), poly(propylenimine) (PPI), poly(L-lysine), 
polyaryl ether, polyesters, and polyglycerol (Gillies 
and Frechet 2005). Two synthesis strategies are used 
to obtain dendritic structures (Fig. 2): (1) a divergent 
method, in which the structure is created from the core 
in a stepwise fashion with the monomeric units being 
added layer by layer and radiating from a central core 
(each of these grafting cycles leads to a different 
dendrimer generation) (Nanjwade et al. 2009); 
(2) a convergent method, in which the structure is 
created from the shell to the core (Tomalia et al. 
1985). The divergent method is less efficient, with 
incomplete and side reactions leading to the accumu¬ 
lation of structural defects. In contrast, the convergent 
method limits the number of side reactions, enables 
easier purification of intermediate products, and hence 
minimizes structural defects. However, it is important 
to point out that despite the potential defects in 
their structure, dendrimers still present with low 


The architecture of the monomers (binding points, bond 
angles, and molecular weight) determines to what extent 
each generation increases molecular volume versus surface 
area. Due to steric interactions, at high generations, 
dendrimers reach a point in which it is not possible to add 
new generations (starburst effect). Under favorable conditions 
high-generation dendrimers usually present a spherical shape 
with a high molecular density outer shell and a lower density in 
the dendrimer interior. The dendrimer interior is capable of 
accommodating guest molecules 

polydispersities (^1.0) (Fig. 3). On intravenous 
administration, the biodistribution of dendrimers is 
controlled by their structure, size, and charge; hence 
it is an important advantage to be able to finely control 
the chemistry of dendrimer molecules. 

Once synthesized, dendrimers usually present as 
spherical structures with a high surface charge 
(Figs. 1, 3). In solution, monomer chemistry, dendrimer 
generation, and the bulk conditions all contribute to 
the dendrimer’s conformation. For example, while 
low-generation dendrimers tend to adopt a planar or 
elliptical shape, the higher-generation dendrimers pre¬ 
sent as spheres (Tomalia et al. 1985). Additionally, 
a decrease in solvent solubility may lead to a change 
from an extended to a folded conformation (Chai et al. 
2001) and the dendrimer ionizable groups will react to 
the solvent pH and charge screening from dissolved 
electrolytes to either become extended when ionized 
or less extended when electrostatic repulsion is reduced 
by charge screening (Jones et al. 2003) (Fig. 4). 

The number of surface groups increases exponen¬ 
tially with dendrimer generation (Esfand and 
Tomalia 2001). Together with the aforementioned 
characteristics, a high density of derivatizable func¬ 
tional groups makes dendrimers quite versatile 
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Dendrimers for Drug Delivery, Fig. 2 Methods of synthesis. 
Dendrimers are synthesized by two opposite strategies, the 
divergent and the convergent approaches. Divergent method'. 
In this case, the dendrimer is synthesized starting from the 
multifunctional core and built up by the layer by layer addition 
of monomers. Each new layer of monomers represents a new 
generation. In the divergent method, monomers present one 
reactive group and two or more protected groups. The reactive 
part of the monomer will react with the core leading to the first 
generation of the dendrimer. The next step is the activation of 
the peripheral groups of the dendrimer and the incorporation of 
the next layer (generation) of monomers. This sequence may 

structures for drug delivery, as they may be 
derivatized with several moieties that either: (a) pro¬ 
mote targeting of the formulation toward the desired 
tissue or (b) modulate the interaction of the 


be repeated until steric interactions hamper the addition of new 
generations (starburst effect). Convergent method'. Although 
this method is also based in the building up of the dendrimer 
layer by layer (or generation by generation), the starting 
point is the peripheral groups, finishing the synthesis with 
the addition of the branches to the core. In the convergent 
method, two or more peripheral groups are joined by a single 
monomer. The next reaction will be the binding of two or more 
preformed branches through a new monomer. When the 
branches have reached the desired size they are attached to 
the core molecule. This method generally leads to highly 
symmetrical dendrimers 

dendrimer with biological membranes (Choi et al. 
2004; Han et al. 2010). 

With respect to drug delivery, drugs must be asso¬ 
ciated with the dendrimer in order to create a drug 
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Dendrimers for Drug Delivery, Fig. 3 Synthesis of commer¬ 
cially available dendrimers. Polyfamido amine) (PAMAM) 
dendrimer {top panel). The PAMAM dendrimers are normally 
based on an ethylenediamine or ammonia core with four and 
three branching points, respectively. Using a divergent 
approach, the molecule is built up iteratively from the core 
through the addition of methylacrylate followed by amidation 
of the resulting ester with ethylenediamine. Each complete reac¬ 
tion sequence results in a new “full” dendrimer generation 
(e.g., G3, G4 ...) with terminal amine functionality, whereas 
the intermediate “half” generations (e.g., G2.5, G3.5 ...) 


terminate in anionic carboxylate groups. Poly(propylenimine) 
(PPI) dendrimer {bottom panel). The other commercially avail¬ 
able dendrimer with relevance to drug and gene delivery is 
based on PPI units with a diaminobutane (DAB) core. 
The repetitive reaction sequence involves Michael addition of 
acrylonitrile to a primary amino group followed by hydrogena¬ 
tion of nitrile groups to primary amino groups. These 
dendrimers are frequently referred to as DAB- x, or DAB-Am- 
X, with X giving the number of surface amines; depending on the 
source, DAB-Am4 is sometimes referred to as GO or more 
usually G1 
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Dendrimers for Drug 
Delivery, Fig. 4 Dendrimer 
conformation. Dendrimer 
chemistry as well as solvent 
properties (polarity, pH, 
electrolytes, etc.) affect the 
molecular density of the 
molecule. Bad solvents favor 
the back-folding of the 
peripheral groups and the 
formation of a dense core 
structure, while good solvents 
promote the extended 
conformation of the dendrimer 
leading to a dense shell 


Solution structure 


Good solvent Bad solvent 


extended conformation 


back-folding 



delivery system. Drugs may be encapsulated within 
single dendrimer molecules, bound to the surface of 
single dendrimer molecules, encapsulated within and 
bound to the surface of the same dendrimer or encap¬ 
sulated within dendrimer self-assemblies (Tanis and 
Karatasos 2009; Chooi et al. 2010). 

Tanis and others have demonstrated that the distinc¬ 
tion between drug encapsulation and drug binding to 
the surface is not always clear-cut. Using PAMAM 
dendrimers and ibuprofen, an ionizahle nonsteroidal 
anti-inflammatory drug (NSAID), these authors showed, 
through simulations, that bulk conditions will determine 
the nature of the drug-dendrimer interactions. In acidic 
conditions, ibuprofen forms clusters that minimize its 
interaction with the open structure of the dendrimer. At 
neutral pH, there is a clear insertion of the ibuprofen 
molecules in the dendrimer, driven hy the hydrophobic 
interactions and the hydrogen bonding between the inte¬ 
rior PAMAM amide hydrogen atoms and the drug car¬ 
bonyl oxygen atoms. However, in alkaline medium, 
ionization of ibuprofen leads to a clear displacement of 
the dmg toward the surface of the dendrimer as there is 
a reduction in both the hydrophobic interactions and the 
hydrogen bonding, and an increase in the electrostatic 
interactions between the dendrimer surface and the drug 
(Tanis and Karatasos 2009). 

Dendrimers have been functionalized to form 
amphiphiles by the attachment of hydrophobic moie¬ 
ties to the surface of the dendrimer (Chooi et al. 2010). 
The level of derivatization, and hence molecular archi¬ 
tecture, has a direct influence on the size of the hydro- 
phobic domains formed within the self assembly and 
hence the hydrophobic drug-carrying capacity (Fig. 5); 


while derivatization with 1 or 5 hydrophobic units per 
dendrimer molecule leads to the formation of adequate 
hydrophobic domains which are able to encapsulate 1 
mole of cyclosporine A per 10 moles of dendrimer 
amphiphile, derivatization with 3 hydrophobic units 
per dendrimer molecule resulted in a dendrimer 
amphiphile which was sterically hindered from 
forming hydrophobic domains and drug encapsulation 
dropped to 0.1 moles of drug per 10 moles of 
dendrimer amphiphile (Chooi et al. 2010) (Fig. 5). 
Introduction of a flexible poly(ethylene glycol) 
spacer between the hydrophobic groups and the 
dendrimer allowed the hydrophobic groups to form 
hydrophobic domains more efficiently and increased 
the drug encapsulation ratio to 7 moles of drug 
per 10 moles of dendrimer amphiphile (Chooi 
et al. 2010) (Fig. 5). 

Dendrimers are ideal candidates for the delivery of 
genes and other nucleic acid-based drugs, as they form 
an electrostatic complex of 100-500 nm in diameter 
with DNA (Zinselmeyer et al. 2002). This complex is 
termed a dendriplex and is formed by the interaction of 
negatively charged DNA with a positively charged 
dendrimer. Their DNA condensation and gene trans¬ 
fection efficiency means that dendrimers compare 
favorably to other viral and more toxic linear polymer 
gene vectors (Dufes et al. 2005a; Chisholm et al. 
2009). The transfection efficiency of dendrimers 
depends on their architecture, generation, and nature 
of the surface groups and transfection efficiency is 
usually determined by their capacity to interact with 
biological membranes and escape from the acidic 
endosome (Zinselmeyer et al. 2002; Cho et al. 2003). 
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Dendrimers for Drug 
Delivery, Fig. 5 The self- 
assembly and drug loading of 
PPIgs dendrimer amphiphiles. 
Self-assembly and drug¬ 
loading capacity of the self- 
assemblies is highly 
dependent on the molecular 
architecture of the dendrimer; 
the level of cyclosporine 
A encapsulation per 10 moles 
of dendrimer amphiphile is 1, 
0.1, 1, and 7 for 
monosubstituted, 
trisubstituted, 
pentasubstituted, and PEG 
spacer substituted dendrimers, 
respectively. Introduction of 
the flexible PEG spacer unit 
increases drug loading 
substantially, as it allows the 
efficient formation of 
hydrophobic domains 



DPD5 - PPIG3 dendrimer amphiphile 
with PEG spacer 


Host-Guest Nanostructures 

As stated above, dendrimers may be considered to be 
nanocontainers with a compact shell that isolates 
the inner cavity from the surrounding media. 
The dendrimer interior is ideal for the accommoda¬ 
tion of poorly soluble low molecular weight 
drugs, through hydrophobic interactions and hydro¬ 
gen bonding (D’Emanuele and Attwood 2005). 
Dendrimer-drug host-guest systems have been tested 
preclinically as a means of reducing drug side 
effects. For example, the utilization of PAMAMg 3 5 
dendrimers encapsulating cisplatin, an anticancer 


drug that inhibits DNA replication (Yellepeddi 
et al. 2011 ) led to a reduction in the drug’s cellular 
toxicity. Furthermore, drug encapsulation within 
dendrimers has also been useful in reducing the hep- 
atotoxicity associated with the use of the cancer 
drugs, methotrexate (MTX) and 6 -mercaptopurine 
( 6 -M) (Neerman et al. 2004). Intraperitoneal injec¬ 
tion of MTX or 6 -M encapsulated in melamineos- 
based dendrimers to C3H male mice resulted in 
a signihcant reduction in blood alanine transaminase 
(ALT) levels, when compared to the free drug; ele¬ 
vated blood ALT levels are indicators of hepatotox- 
icity (Neerman et al. 2004). 
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In an effort to improve the biocompatibility of 
PAMAM dendrimers, Bhadra and coworkers have 
employed poly(ethylene glycol) - PAMAMg 4 
dendrimer conjugates (Bhadra et al. 2003), 
using ^5,000 Da poly (ethylene glycol) (PEG). 
These conjugates were less hemolytic than the 
PAMAMg 4 dendrimers alone and, when encapsulat¬ 
ing 5-fluorouracil (5-FU), facilitated sustained 
plasma exposure to the drug. Furthermore, the 
administration of 5-FU loaded PEG-PAMAMg 4 
dendrimers (PEG MW = 4,000 Da), modified with 
folic acid (as a targeting agent), to BAFB/c mouse 
xenograft models (human epidermoid carcinoma KB 
cell line) resulted in a clear improvement in both the 
pharmacokinetics and pharmacodynamics of 5-FU 
(Singh et al. 2008). 

Other drugs have also been shown to benefit 
from encapsulation within dendrimers; the NSAIDs 
ketoprofen and diflunisal have both been encapsu¬ 
lated within PAMAMqs dendrimers for transdermal 
delivery; increasing systemic exposure threefold, 
when compared to the free drugs (Yiyun et al. 
2007). Folate-PEG 335 o-PAMAMG 3.5 dendrimers 
have also been used to target other NSAIDs such as 
indomethacin to activated macrophages in a rat arthri¬ 
tis model (Turk et al. 2002). On intraperitoneal 
administration of the dendrimer formulation, accu¬ 
mulation of indomethacin in the stomach, the site of 
indomethacin-mediated gastric hemorrhage, was 
reduced and accumulation in the inflamed paw 
increased (Chandrasekar et al. 2007). Unfortunately, 
encapsulation of indomethacin did not reduce the 
accumulation of the drug in the kidneys and thus 
was not effective in reducing the potential risk of 
indomethacin-induced renal dysfunction. 

Bioactive Dendrimers 

Dendrimers may also be bioactive per se. Our 
group was the first to report on the bioactivity of 
dendrimers (Dufes et al. 2005b), reporting that 
PPIg 3 dendrimers are antiproliferative and are 
tumoristatic against A431, FS1747, and C33a tumors 
in GDI mouse models. Fikewise, PAMAMg 4 - 4.5 
dendrimers with surface amino or surface hydroxyl 
groups have been reported to have anti-inflammatory 
activity in carrageenan inflamed paw edema and 
rat arthritis models (Chauhan et al. 2009). These 


bioactive dendrimers may serve as useful leads for 
a new class of macromolecular drugs. 

Dendrimers as Synthetic Vectors for Nucleic 
Acid-Based Therapies 

The highly functionalized dendrimer surface may be 
used, not only for the association of small molecules 
but also to efficiently condense nucleic acid therapeu¬ 
tics into nanoparticles known as dendriplexes 
(Zinselmeyer et al. 2002; Dufes et al. 2005b). Due to 
the anionic nature of nucleic acids, the use of 
dendrimers in gene therapy is usually restricted to 
positively charged dendrimers. 

Early demonstrations of dendrimer gene vector 
activity were based on local injections. For example, 
intratumoral administration of angiostatin and tissue 
inhibitor of metalloproteinase-2 genes co-condensed 
with a 36-mer oligonucleotide (ODN) by the 
PAMAM SuperFect® dendrimer successfully halted 
the growth of MDA-MB-435 tumors in a murine 
model (Vincent et al. 2003), while when Superfect® 
was used to deliver the TNF-alpha gene via the tail 
vein in a murine A431 xenograft model, there was 
virtually no therapeutic effect from the expression of 
the gene (Dufes et al. 2005a). However, there was 
a persistent suppression of tumor growth by the 
PAMAM Superfect® dendrimer itself. 

Delivery of drugs to the brain is probably the most 
challenging task in drug delivery, due to the highly 
efficient blood-brain barrier (BBB), which protects the 
brain from any noxious substances present in the sys¬ 
temic circulation. Dendrimer gene therapy experi¬ 
ments have thus involved stereotactic injections in an 
effort to circumvent the barrier. Folic acid-modified 
PAMAMqs dendriplexes condensing an antisense 
ODN against epidermal growth factor receptor 
(EGFR-ASODN) have been intracranially injected 
for the treatment of C6 glioma tumors (Kang et al. 
2010). The inclusion of folic acid increased the 
in vitro knockdown of EGFR expression in C6 glioma 
cells. Moreover, after two stereotactic administrations 
of the ASODN, a significant increment in the mean 
survival time of treated animals was observed. An intra¬ 
cranial injection was also used to knockdown the cyto¬ 
kine mediator high mobility group box 1 (HMGBl) 
gene, using siRNA condensed with PAMAMg 4 
dendrimers modified with arginine (Kim et al. 2010). 
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Reverse transcription-polymerase chain reaction anal¬ 
ysis showed that 18-24 h after intracranial administra¬ 
tion of the siRNA complex, there was a clear reduction 
of the HMGB1 mRNA levels in the hrain. The results 
of these intracranial studies are impressive; however, 
administering local injections into the brain is 
plagued with certain problems, foremost of which 
are: the difficulty in achieving homogeneous dis¬ 
tribution of the formulation away from the site of 
injection and the need for highly trained personnel to 
administer such therapeutics. Furthermore, the local 
administration of drugs to tumors does not address 
the problem central to neoplastic disease metastasis. 
For metastasized cancers, local administration will 
not be sufficient and systemic administration will be 
required. 

Systemic administration of genetic therapies is 
hampered by their interaction with biological fluids 
(which leads to particle aggregation) and the various 
membrane and enzymatic barriers to gene transfer. The 
consequences of polyplex and high molecular weight 
dendriplex (PAMAMog) aggregation within the blood 
is a tendency for such polyplexes and dendriplexes to 
be deposited in the lung on intravenous administration 
(Kukowska-Latallo et al. 2000; Schatzlein et al. 2005), 
where they are physically entrapped within the fine 
capillaries. 

Our work has shown that the low molecular 
weight PPIgs dendrimers (MW = 1,687 Da) avoid 
the lung on intravenous administration (the lung avoid¬ 
ance hypothesis): low molecular weight polymer/ 
dendrimer-gene complexes avoid the lung due pre¬ 
sumably to a lower tendency to aggregate and instead 
accumulate within the next capillary bed encountered - 
the liver; the lower level of particle aggregation is 
hypothesized to stem from the reduced molecular 
weight of the cationic species in the complex 
(Schatzlein et al. 2005). We have exploited this finding 
to produce a gene therapeutic that is active in all 
preclinical tumor models tested (Zinselmeyer et al. 
2002; Dufes et al. 2005a; Bell et al. 2007; Chisholm 
et al. 2009). We have shown that the systemic admin¬ 
istration of PPIcs-TNF-alpha dendriplexes to murine 
xenografts tumor leads to tumor regression just 24 h 
after administration, and after 17 weeks 80% of tumors 
disappeared completely (Dufes et al. 2005b). Interest¬ 
ingly, these systems also show a high level of tumor 
selective gene expression (Chisholm et al. 2009). 
This level of specificity is surprisingly high for 


a nanoparticle vector system without active targeting 
and it seems likely that the interaction of these 
dendriplexes with specific blood components may 
also play a role in tumor cell uptake. 

Similar to low molecular weight drugs, active 
targeting of genes has been undertaken by conjugating 
receptor-specific ligands to the dendrimers. For exam¬ 
ple, transferrin, a glycoprotein involved in iron metab¬ 
olism is overexpressed in some types of tumors, and 
when conjugated to PAMAMqs dendrimers via 
a PEG 3400 spacer (Huang et al. 2007) increases the 
uptake of reporter genes in the brain, lungs, and kidney 
after intravenous injection. Koppu and coworkers 
recently reported that transferrin-modified PPIg 3 
dendrimers were able to reduce the off-target expres¬ 
sion in noncancer tissue when used to administer 
a TNF-alpha plasmid to a murine xenograft model 
(Koppu et al. 2010). Targeting peptides, such as 
RVG29 (against acetylcholine receptors of neuronal 
and brain capillary endothelial cells) and a leptin- 
derived peptide, have also been used to promote the 
accumulation of PAMAMgs vectors in the brain (Liu 
et al. 2009, 2010). A final example of receptor ligand- 
mediated targeting is the use of the LHRH peptide 
bound to PPIG 5 -PEG 5000 dendrimers, which has been 
used to target dendriplexes carrying siRNA to A549 
tumor tissue after systemic administration (Taratula 
et al. 2009). This formulation resulted in gene silenc¬ 
ing and the incorporation of LHRH into the dendriplex 
shifted the siRNA away from the liver and kidneys and 
to the tumor cell cytoplasm. 

Dendrimer-Drug Conjugate (Prodrugs) 

The final aspect of dendrimer drug delivery to be 
considered is the use of dendrimer prodrugs (Patri 
et al. 2005). These are similar in concept to the poly¬ 
mer-drug conjugates developed by Kopecek and 
Duncan’s groups (Duncan 2003), with the dendrimer 
usually linked to the drug molecule via a hydrolyzable 
linker. An example of such a system is the attachment 
of methotrexate to PAMAMgs dendrimers (Patri et al. 
2005), which resulted in an increase in methotrexate’s 
aqueous solubility. The conjugation of folate to the 
prodrug to give methotrexate-Eolate-PAMAMGs 
prodrugs reduced drug toxicity in vivo and achieved 
tumor targeting in CB-17 severe combined immuno¬ 
deficiency mice bearing UM-SCC-17B tumor xeno¬ 
grafts (Ward et al. 2011). UM-SCC-17B tumors 
overexpress folate receptors. This same approach has 
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also been successfully used to target methotrexate to 
macrophages in inflammatory diseases (Thomas et al. 
2011). Further examples of prodrugs include asymmet¬ 
ric polyester dendrimers conjugated to doxorubicin via 
hydrolyzable hydrazone bonds, which on intravenous 
injection to a murine C26 colon carcinoma model lead 
to tumor targeting, tumor regression, and 100% sur¬ 
vival over a 60-day period (Lee et al. 2006). The 
hydrolyzable hydrazone bond was key to the conju¬ 
gate’s activity. A final dendrimer prodrug example 
demonstrates the way in which dendrimer prodrugs 
are able to circumvent various physiological processes. 
In vitro P-gp efflux of propanolol, a P-glycoprotein 
(P-gp) substrate and beta blocker used to treat hyper¬ 
tension and anxiety, was inhibited when the drug was 
covalently bound to lauryl-PAMAMos dendrimers 
with the lauryl group specifically shown to promote 
monolayer permeability (D’Emanuele et al. 2004). 
Dendrimer conjugation made the drug unrecognizable 
by the efflux pump. 

Other Biomedical Applications 

For completeness, other biomedical exploitations of 
dendrimers are included. Dendrimers have also been 
used as components in tissue engineering scaffolds and 
to construct diagnostic imaging platforms. Cross- 
linking collagen gels, to be used as corneal tissue scaf¬ 
folds, with PPIg 2 dendrimers and incorporating cell 
adhesion peptides into the gels, results in a material 
with improved mechanical, adhesion, optical transpar¬ 
ency, and glucose permeability properties when 
compared to the use of other cross-linking agents 
such as glutaraldehyde and l-Ethyl-3-(3-dimethylami- 
nopropyljcarbodiimide (Duan et al. 2007). Moreover, 
in vitro studies confirmed the lack of toxicity from the 
dendrimer. 

Eor magnetic resonance imaging (MRl) contrast, 
dendrimers have been covalently bound to chelating 
agents such as diethylenetriamine pentaacetic acid 
(DTPA), the latter of which is a gadolinium chelate 
(Wiener et al. 1994). The DTPA-gadolinium chelate is 
the actual MRI contrast agent. These gadolinium- 
DTPA-dendrimer conjugates allow the administration 
of higher doses of gadolinium and the conjugate has 
a significantly longer half-life (Wiener et al. 1994). 
Specifically, these dendrimer imaging agents are 
useful in imaging the kidneys and PAMAMqi and 
PPIg 2 -g 3 dendrimers are efficient contrast agents to 
detect early renal tubular damage (Kobayashi et al. 


2004). Higher generation PAMAM (PAMAMgs) and 
PPI (PPIgs) dendrimers have been used to image other 
tissues such as the lymph nodes (Kobayashi et al. 2003) 
and liver (Kobayashi and Brechbiel 2004), respectively. 
The concept of the organ-specific contrast agent has been 
taken a step further with the use of specific MRl contrast 
agents to image cells expressing certain receptors; folate- 
PAMAMg 4.6 dendrimer conjugates have been used to 
visualize ovarian tumor cells overexpressing folate 
receptors and these were superior to non-folate-bearing 
imaging agents (Konda et al. 2001). 

Summary 

Over the last 30 years, dendrimers have shown that 
they have excellent properties for the formation of 
drug delivery systems, demonstrating an ability to 
solubilize hydrophobic drugs, target drugs to tumors, 
and thus reduce side effects and target drugs to macro¬ 
phages. Additionally, recent evidence of their intrinsic 
bioactivity has emerged and their use as MRI imaging 
agents is a growing field. Their exploitation as drug 
delivery systems is based on their well-defined molec¬ 
ular size, abundance of reactive functional groups, and 
their little understood biodistribution characteristics, 
which enable them to target genes to tumor cells, for 
example. It is likely that over the coming decades, the 
potential of these prodrugs, diagnostic platforms, and 
drug delivery formulations will begin to be tested in 
clinical studies. 

Cross-References 

► Dynamic Light Scattering 

► Electron Microscopy of Membrane Lipids 

► Lipid Organization, Aggregation, and 
Self-assembly 

► Multiangle Light Scattering from Separated 
Samples (MALS with SEC or FFE) 

► Nanoparticles for Drug and Gene Delivery 
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Introduction 

An accurate determination of the physical forces 
governing the structure and dynamics of biological 
systems is essential for a full understanding of their 
organization and behavior. In this regard, calorimetry 
can provide a wealth of thermodynamic information 
about such forces and about the thermotropic, 
barotropic, and lyotropic behavior of biomolecules 
and systems which are not generally or easily avail¬ 
able through other biophysical methods. In this short 
review, we provide a brief description of three of the 
most important calorimetric techniques which have 
been applied specifically to lipid bilayer model and 
biological membranes systems, namely, DSC, PPC, 
and ITC, as well as the related technique of water 
sorption calorimetry (WSC). We first briefly describe 
each calorimetric technique and the types of informa¬ 
tion which can be derived from it, providing appro¬ 
priate references primarily to review articles, book 
chapters, and monographs, where more detailed 
information on the underlying physical principles, 
instrument design, operation, and data analysis can 
be found. We then briefly discuss the general appli¬ 
cation of these techniques to lipid bilayer and biolog¬ 
ical membranes, including some of the newer specihc 
applications of these techniques, alone or in combi¬ 
nation, which have provided new insights into 
a variety of processes involving these systems. 
Finally, we close with a short discussion of potential 
future applications of calorimetry to lipids and 
membranes. 


Differential Scanning Thermotropic 
Calorimetry 

DSC has become the primary technique for studies of 
the ► thermodynamics of lipid interactions and lipid 
phase transitions in both model and biological mem¬ 
branes. The principles underlying the application of 
this non-perturbing thermodynamic technique to the 
study of lipid systems are fairly straightforward. 
A sample containing a known quantity of lipid, nor¬ 
mally dispersed in a given volume of water or buffer, 
and a reference containing the same volume of water or 
buffer alone are simultaneously heated or cooled at 
a predetermined constant rate, dT/dt. The DSC instru¬ 
ment then measures the differential heat flow between 
the sample and reference, dC/dt, over a predetermined 
temperature range. The temperatures of the sample and 
reference may be actively varied by independent 
heating or cooling elements, as in power compensation 
instruments, or the temperatures of the sample and 
reference cells can be varied through their contact 
with a common heat source having a thermal mass 
greatly in excess of that of the sample and reference, 
as in heat conduction calorimetry. If the typically 
rather dilute dispersion of lipid used in modern high- 
sensitivity DSC units does not undergo a phase transi¬ 
tion, then the temperatures of the sample and reference 
cells should remain matched over the entire tempera¬ 
ture range scanned so that the temperature differential 
between them remains near zero. In this case, the DSC 
instrument should not record a differential heat flow 
and, under ideal conditions, a flat, horizontal baseline 
should result. However, if the lipid dispersion 
undergoes a phase transition over a particular range 
of temperature, a temperature difference will develop 
between the sample and the reference due to the gain or 
loss of heat. The DSC instrument then records the 
amount of additional power which must be actively 
added to or subtracted from the sample in order to 
maintain the temperature differential between it and 
the reference cell at zero (power compensation calo¬ 
rimetry) or passively records the temperature differen¬ 
tial between the two cells until the temperature 
differential between them disappears (heat conduction 
calorimetry). In either case, the DSC instrument 
records the differential change in the flow of heat 
associated with the lipid phase transition and an exo¬ 
thermic (upward by convention) or an exothermic peak 
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(downward by convention) is recorded. At the comple¬ 
tion of the lipid phase transition, the original baseline is 
reestablished if no measurable change in the specific heat 
of the sample occurs, or a new baseline is established 
if a measurable change in specific heat occurs. Thus, 
the DSC instrument records a plot of differential heat 
flow as a function of the temperature of the sample. 

A schematic DSC heating thermogram illustrating 
the variation of dC/dt as a function of temperature for a 
simple, two-state, first-order endothermic lipid phase 
transition, such as a gel/liquid-crystalline (chain-melt¬ 
ing) phase transition of a single molecular species 
of a glycerophospholipid, is shown in Fig. 1 . A number 
of important thermodynamic parameters of such a 
phase transition can be directly determined from this 
thermogram. The phase transition temperature, usually 
denoted T^, is defined as the temperature at which the 
dC/dT reaches a maximum. For a symmetrical peak, 
the Tm is also the temperature at which the phase 
transition is half complete (i.e., equal quantities of 
the gel and liquid-crystalline phases coexist). How¬ 
ever, in the case of more complex, multicomponent 
phase transitions observed for certain individual 
molecular species of glycerophospholipids and 
sphingolipids or mixtures of lipids such as present in 
biological membranes, asymmetric DSC thermograms 
are observed, in which case the Tn, does not correspond 
to the midpoint of the phase transition. After normal¬ 
ization with respect to the heating scan rate employed, 
the area under the peak corresponds to the enthalpy of 
the phase transition, sometimes denoted AHcai to dif¬ 
ferentiate it from the van’t Hoff enthalpy (see below). 
Since at the T^ (or midpoint temperature), the change 
in free energy, AG, of the system is zero, the entropy 
change of the phase transition, AS, can be calculated 
from the equation: 

AS = AHcai/T„ 

Moreover, the cooperativity of the lipid phase 
transition can be determined directly from the DSC 
trace since it is inversely related to the temperature 
range over which the phase transition occurs, AT, 
which is simply the difference between the onset tem¬ 
perature, denoted Ts, and the completion temperature, 
denoted Ti, for a solid-like (gel) to liquid-like (liquid- 
crystalline) phase transition. More specifically, the 
temperature width at half-height, ATi,^, which ranges 



Temperature,°C 


Differential Scanning Calorimetry (DSC), Pressure Pertur¬ 
bation Calorimetry (PPC), and Isothermal Titration Calo¬ 
rimetry (ITC) of Lipid Bilayers, Fig. 1 Idealized heating 
thermogram exhibited by a two-state thermotropic process pro¬ 
ceeding at equilibrium, such as the gel/liquid-crystalline phase 
transition of certain phospholipid vesicles. Tn, denotes the phase 
transition midpoint temperature, AT^^, the transition width at 
half-height, and Tg and T| the starting and ending temperatures 
of the phase transition, respectively. The area under the peak is 
a measure of the transition enthalpy, AHe^i 

from <0.1°C for certain pure synthetic lipids to 
10-15°C for the complex mixture of lipids present in 
biological membranes, can be used to calculate a “coop¬ 
erative unit size” (CUS) for the phase transition, by 
determining the van’t Hoff enthalpy of the phase tran¬ 
sition, AHvh, from the following relationship: 

AH,h = 6.9T^/ATi/2 

The CUS is then simply defined as the ratio of 
AHvn/AHcai and is taken as a measure of the degree of 
cooperation between the molecules in a lipid assembly, 
such as a lipid bilayer vesicle. In principle, for 
a completely cooperative phase transition in which all 
the individual lipid molecules in a vesicle undergo the 
phase transition simultaneously, the CUS should 
approach infinity, while for a completely noncoopera¬ 
tive “phase transition,” the CUS values should approach 
unity. However, as the exact physical meaning of the 
CUS value is not entirely clear, and as it depends crit¬ 
ically on the purity of the lipid sample and may be 
limited by the DSC instrumental response times and 
thermal lag, this parameter should be used cautiously. 
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One should note that with DSC, and indeed with all 
calorimetric and other thermodynamic techniques, the 
values of the thermodynamic parameters being deter¬ 
mined will only be valid if the lipid system being 
studied is at thermodynamic equilibrium both before 
the DSC analysis is begun and during the DSC run 
itself. For lipid systems in general, it is advisable to 
maintain the system at low temperatures for various 
periods of time prior to beginning a DSC heating run, 
in order to insure that thermodynamic equilibrium is 
attained. The existence of an exothermic peak upon 
heating indicates that thermodynamic equilibrium has 
not been attained, but the absence of such an exother¬ 
mic peak does not assure, in and of itself, that it has 
been. Similarly, a high-sensitivity DSC instrument 
should be utilized, the size of the lipid sample should 
be as small as practical, and the heating scan rate 
employed should be slow relative to both the thermal 
time constant of the instrument and the kinetics of the 
lipid phase transition under investigation. Although the 
kinetics of the lamellar gel/llquid-crystalline phase 
transitions of pure synthetic lipids are fast relative to 
the DSC timescale, particularly on heating, one should 
note that phase transitions between different lamellar 
gel phases, and especially between lamellar crystalline 
and lamellar gel phases, are typically not. Moreover, the 
same is often true of lipid phase transitions between 
lamellar liquid-crystalline and reversed cubic or hexag¬ 
onal phases. Under these circumstances, DSC can still 
provide useful information if carefully used, particu¬ 
larly if a series of runs are made at different scan rates 
and extrapolation to zero scan rate appears justihed. 

One should note that although the basic principles 
of DSC are relatively straightforward and well 
established, the successful practical application of 
this technique to lipid systems requires care in order 
to obtain valid results. In particular, the selection of the 
appropriate type of calorimeter, starting and ending 
temperatures, and scan rates and instrumental settings 
is not trivial and requires a thorough knowledge of the 
thermodynamic principles involved, particularly for 
lipid phase transitions which may not be at equilibrium 
before or during the DSC run. As well, care must be 
taken in the selection of the size and method of prep¬ 
aration of the lipid sample and to ensure that its thermal 
history is appropriate for the task at hand. The reader is 
referred to a recent review by the authors for a much 
fuller treatment of these and other issues (Lewis et al. 
2007). As well, reviews in which DSC data on the 


dependence of lipid phase transitions on the structures 
of the lipid polar headgroups, glycerol backbone 
regions, and hydrocarbon chains, as well as on the 
pH, ionic strength, etc., of the aqueous medium, are 
available (Lewis and McElhaney 2004; Lewis et al. 
2007; Marsh 2011; McElhaney 1982), as is a compre¬ 
hensive review focusing on lamellar/nonlamellar lipid 
phase transitions (Lewis et al. 1997). 

DSC can also be used to study lipid-protein inter¬ 
actions in reconstituted bilayer model membranes 
composed of one or more synthetic lipids, into which 
a specific purified integral membrane protein has been 
incorporated or onto which a specific purified periph¬ 
eral membrane protein has been bound, by determining 
the thermotropic phase behavior of the resulting 
proteoliposome. Moreover, this approach has been 
extended to actual biological membranes, which con¬ 
tain a complex mixture of lipids and proteins. The 
effect of the incorporation of various quantities of 
a particular membrane protein on the thermotropic 
phase behavior of the host lipid bilayer is most com¬ 
monly studied, and valuable information about the 
nature of the interactions of that protein with various 
lipid molecules can be obtained, as well as an estimate 
of the number of lipid molecules interacting with each 
protein molecule. Moreover, the effect of varying the 
composition and physical properties of the host lipid 
bilayer on the thermal stability (denaturation) on the 
guest protein can also be determined, especially the 
effects of ► membrane fluidity and phase state on 
membrane protein function. In addition, the effect of 
the incorporation of various additives, ranging from 
cholesterol and other sterols to detergents, drugs, and 
toxins to antimicrobial and transmembrane peptides, 
on lipid and protein thermotropic behavior can be 
investigated in a similar fashion, providing important 
information about their interactions. When such DSC 
studies are combined with ITC studies (see below), 
they can be even more informative. The reader is 
referred to several reviews for a detailed summary of 
previously published DSC studies of lipid-protein 
interactions (McElhaney 1982, 1986; Shnyrov et al. 
1997; Lewis and McElhaney 2004). 

Pressure Perturbation Calorimetry 

PPC is a recently developed calorimetric technique 
which can be used to study the coefficient of thermal 



DSC, PPC and ITC 


455 


D 


expansion of various solutes, particularly proteins or 
colloidal particles in aqueous solution. However, this 
technique can also be applied to model lipid bilayer 
membranes and also potentially to biological mem¬ 
branes, to good effect (Heerklotz and Seelig 2002). 
The PPC instrument measures the heat gained or lost 
from a sample undergoing a small increase or decrease 
in pressure at a constant temperature, dQ/dp. At tem¬ 
peratures where a lipid phase transition does not occur 
in response to these small pressure changes, the thermal 
volume expansion at constant pressure, dV/dT, can be 
calculated using the Maxwell relation to the isothermal, 
pressure-dependant charge in entropy, dS/dp: 

dS _ dV 

dp“^ 

where dS for a fully reversible process is dS = dQ/T. 

After calculation of the change in the partial molar 
volume of the solute. Vs, and the molar volume, V, the 
coefficient of thermal expansion of the solute (ots) is 
given by: 

as = dVs/VdT 

PPC is thus related to other techniques developed 
earlier for measuring the heat accompanying an iso¬ 
thermal pressure change, which were variously called 
piezothermal analysis, scanning transitometry, or pres¬ 
sure jump calorimetry. However, PPC utilizes only 
small, reversible changes in pressure, which in turn 
cause only small differential changes in physical 
state. PPC is available commercially as an attachment 
for a high-sensitivity, power compensation DSC 
instrument (MicroCal Corporation, Northampton, 
MA). This instrument allows one to study very small 
amounts of proteins or lipid vesicles using sudden 
small pressure jumps of about 5 bars. Although each 
pressure-induced heat change measurement is 
performed isothermally, individual measurements are 
made sequentially at small temperature intervals over 
a range of temperatures to produce a PPC “peak” (see 
Figs. 2, 3). 

The measurement of the coefficient of thermal 
expansion of particular lipid bilayer membranes, 
whose polar headgroups and fatty acyl chains are var¬ 
ied in structure, can yield important information about 
the molecular organization of such membranes and the 
volume effects of hydration of the polar headgroups 



Differential Scanning Calorimetry (DSC), Pressure Pertur¬ 
bation Calorimetry (PPC), and Isothermal Titration Calo¬ 
rimetry (ITC) of Lipid Bilayers, Fig. 2 Detail of a PPC 
experiment: Pressure (p) jumps between ambient pressure and 
5 bar (above ambient) are applied to sample and reference cells, 
and the temperature, T, is kept constant by a feedback heater. The 
power of the heater, dq/dt, is recorded (bottom panel), and integra¬ 
tion of the peaks yields the isothermal heat response, dQ/dp. The 
heat response is measured twice (after a down- and an up-jump in 
p), and then, the insh'ument is automatically equilibrated at the next 
desired temperature (3 out of typically 50 scheduled temperature 
points are shown) (Reproduced with permission from Heerklotz 
and Seelig (2002). © 2002 Biophysical Society) 

and polar/nonpolar interfacial regions of these bila¬ 
yers. Moreover, if the temperature range selected for 
the analysis encompasses a thermotropic phase transi¬ 
tion of the lipid dispersion, the volume change accom¬ 
panying, for example, a gel liquid-crystalline phase 
transition of the bilayer can be calculated from the 
PPC peak (see Fig. 3). Moreover, the pressure depen¬ 
dence of the phase transition temperature can be cal¬ 
culated if the heat accompanying the phase transition is 
known from DSC measurements, for example. In prin¬ 
ciple, other types of lipid phase transitions could also 
be studied in a similar way, provided that such phase 
transitions are reasonably rapid and reversible and that 
the lipid phases involved do not aggregate or float or 
sediment in the sample cell. 

PPC is also valuable as an independent way to 
confirm the existence and the peak shape of 
a thermotropic phase transition detected by another 
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Differential Scanning 
Calorimetry (DSC), Pressure 
Perturbation Calorimetry 
(PPC), and Isothermal 
Titration Calorimetry (ITC) 
of Lipid Bilayers, 

Fig. 3 Comparison of tfie 
DSC (line, left-liand axis) and 
PPC (O, riglit-hand axis) 
cun’es at tire cliain-melting 
transition of egg 
spliingomyelin. Integration of 
tlie pealts yields 
AH = 7.31<;calmor‘ (DSC) 
and AV = 21 ml moD' (PPC) 
(Reproduced with permission 
from Heerklotz (2002). 

© 2002 Biophysical Society) 



technique, particularly DSC. When one is dealing with 
weak, broad transitions in complex lipid mixtures, the 
shape and even the existence of a phase transition may 
be ambiguous due, for example, to problems in deter¬ 
mining the correct baseline over the temperature range 
of interest. Also, the shape of the PPC peak should be 
related to the cooperativity of the lipid phase transition, 
determined as discussed earlier by DSC. Finally, the 
kinetics of the pressure jump-induced transition can be 
quantitated under favorable circumstances. For an 
excellent review of all aspects of PPC, the reader is 
referred to a recent review (Fleerklotz 2007). 

Isothermal Titration Calorimetry 

ITC is a convenient and nonperturbing method for 
studying the thermodynamics of the binding of ligands 
to biological macromolecules such as proteins and 
nucleic acids. Since a long entry on the principles and 
application of ► isothermal titration calorimetry gener¬ 
ally is available in the ► Thermodynamics section of this 
encyclopedia, the discussion of the technique itself will 
be brief in this entry and will concentrate only on the 
application of ITC to lipid bilayer model membranes. 

In lipid ITC, generally, a solution containing an 
appropriate, known concentration of unilammellar 
lipid vesicles is placed in an automatically controlled 
syringe, and the instrument performs a titration by 
sequentially injecting a predetermined volume of 


lipid vesicle solution at selected time intervals into 
the calorimeter cell containing a known volume of 
a ligand. Each injection results in a gain or loss 
of heat, which is accurately recorded, with the sum of 
such heats being a measure of the total binding 
enthalpy, AH. As the titration proceeds, the binding 
of the ligand to the lipid bilayer may go to completion 
(saturation of binding), in which case no further heat is 
lost or gained (after corrections for dilution and other 
effects). In such cases, the variation in the amount of 
heat lost or gained in sequential injections allow the 
determination of the total binding enthalpy, AH; the 
apparent equilibrium binding constant, K; and the total 
amount of ligand bound per vesicle or lipid molecule. 
However, to accomplish this analysis, the experimen¬ 
tal binding curve must be fit with a model equation 
appropriate for the process under study. Although this 
is rarely a problem for the specific, high-affinity bind¬ 
ing of ligands to one or more discrete binding sites on 
a protein, for instance, this type of binding is rare for 
lipid bilayers. Instead, a partitioning model is usually 
more appropriate for the rather less specific, 
nonsaturable interactions of most compounds with 
lipid model membranes. However, in such cases, the 
physical meaning of K and AH are not clear, and more 
specific and detailed models for the particular system 
being studied, which account for nonideal interactions 
and electrostatic effects, are required. Moreover, if the 
additive in question cannot permeate across the lipid 
bilayer during the titration, it will have access only to 
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the outer monolayer of the lipid vesicle, so the effec¬ 
tive lipid concentration will only be half of the total 
lipid concentration added. A general method for deter¬ 
mining whether an additive can permeate the lipid 
bilayer membrane and therefore can access both mono- 
layers, based on an ITC uptake and release experiment, 
can be used to address this issue (Heerklotz et al. 1999; 
Tsamaloukas et al. 2007). The reader is referred to an 
excellent review for a much fuller discussion of the 
principles and practice of ITC as applied to lipid model 
membranes (Heerklotz 2004). 

There have been a number of notable applications of 
ITC to lipid bilayers, beyond the more conventional 
ligand binding or additive partitioning studies, which 
have provided important thermodynamic insights into 
various physical properties and biologically relevant pro¬ 
cesses involving lipid membranes. For example, the heat 
accompanying a change in the area of the lipid bilayer 
(lipid bilayer area elasticity) can be studied by ITC 
experiments in which large unilammellar vesicles, pre¬ 
pared in the presence of membrane impermeable solutes, 
are injected into hypo- or hypertonic solutions of these 
same solutes (Nebel et al. 1997). As the lipid vesicles 
swell or shrink due to the osmotic movement of water 
into or out of the vesicles, the heat that is lost or gained, 
respectively, can be measured as the lipid membrane 
stretches or is compressed laterally in order to accommo¬ 
date the osmotically induced changes in the volume of 
the vesicles. Similarly, one can study the curvature stress 
(► lipid bilayer curvature elasticity) of lipid bilayer ves¬ 
icles, a property related to the tendency of each mono- 
layer of the bilayer to curve in a concave or convex 
fashion in the absence of external forces, due to the 
intrinsic frustration resulting from the different packing 
requirements of various parts of the lipid molecule, 
which ► produces a bilayer lateral pressure profile. 
More specifically, by titrating in various additives of 
particular “effective molecular shapes” and monitoring 
the heat absorbed or liberated upon their incorporation 
into the vesicle, it has been found that the incorporation 
of additives which increase bilayer curvature stress tends 
to produce more strongly endothermic effects than 
predicted, while those which decrease bilayer curvature 
stress may even produce an exothermic effect upon titra¬ 
tion. These results suggest that the excess enthalpy of the 
lipid bilayer is determined in large part by its curvature 
strain. In this regard, DSC can also be used in 
a complementary way to study the effective shape of 
various additives, by determining the effect of the 


incorporation of these additives on the temperature of 
the liquid-crystalline lamellar/reversed hexagonal phase 
transition characteristic of certain unsaturated phoshati- 
dylethanolamines, which, due to an increasing tendency 
for concave monolayer curvature at higher temperatures, 
prefers a curved, reversed hexagonal (water-in-oil) phase 
at higher temperatures (Epand 1985; Lee et al. 1993). 
The incorporation of additives which have an effective 
shape (conical) which reduces bilayer curvature stress in 
this system will induce a stabilization of the lamellar 
phase, where curvature stress is intrinsically lower com¬ 
pared to the reversed hexagonal phase and vice versa. 
Thus, the direction and magnitude of the temperature 
shift in the lamellar/hexagonal phase transition can be 
correlated with effective shape (conical, cylindrical, or 
inverted conical) of the additive through its effect on the 
lipid bilayer curvature strain. Moreover, this approach 
has been used to quantitate the effective shape and phase 
preferences of the major lipids of a bacterial membrane 
(Foht et al. 1995). As well, ITC can be used to study the 
permeabilization, solubilization, and fusion of lipid 
bilayer membranes. Moreover, the existence and proper¬ 
ties of membrane ► lipid domains in binary or ternary 
lipid mixtures, such as those purported to form the basis 
of “lipid rafts” in eukaryotic plasma membranes, can be 
studied by this technique, in combination with DSC and 
PPC (Heerklotz 2002). The information thus obtained 
can be used to study ► critical fluctuations in lipid mix¬ 
tures. Finally, the energetics of the folding of certain 
membrane-associated peptides, which exist as random 
coils in aqueous solution but as alpha-helices upon mem¬ 
brane binding and insertion, can be studied by ITC 
(Wieprecht et al. 1999, 2002). For a much more detailed 
summary of these and other ITC studies of lipid bilayer 
systems, see the excellent review by Heerklotz (2004). 

Water Sorption Calorimetry 

Various lipid phases in water, including the biologi¬ 
cally relevant liquid-crystalline lamellar phase, are 
stabilized primarily by the hydrophobic force, acting 
on the nonpolar hydrocarbon chain core of the lipid 
bilayer, and by favorable electrostatic forces between 
water and the polar surface of the bilayer. Thus, 
a thermodynamic analysis of the interactions of the 
polar headgroups, the glycerol backbone, and the 
hydrocarbon chains of the lipid molecule in the bilayer 
is important for an understanding of the forces 
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determining the structure and organization of lipid 
model and biological membranes. Although useful 
information in this regard can be gathered by studying 
the lyotropic phase behavior of lipid bilayers by utiliz¬ 
ing DSC to study their thermotropic phase behavior as 
a function of hydration, including the maximum num¬ 
ber of “unfreezable” (bound) water molecules in fully 
hydrated systems, several other calorimetric tech¬ 
niques are available that permit one to characterize 
thermodynamically the interaction of sequentially 
bound water molecules with the lipid membrane, as 
well as to independently determine the total number of 
water molecules bound. In particular, these techniques, 
grouped together under the heading of WSC, allow one 
to determine the enthalpy and entropy associated with 
the sequential binding of water at a particular constant 
temperature. Unfortunately, space does not permit 
a detailed discussion of these techniques nor the results 
obtained from their applications to various lipid sys¬ 
tems here. However, WSC studies have indicated that 
only a small number of water molecules are bound 
specifically and tightly to the polar headgroups of the 
lipid molecules by enthalpic forces, whereas a larger 
number of water molecules are bound less specifically 
and less tightly by entropic forces. The results of such 
studies have also provided key insights into the origin 
of the “hydration force,” which, along with bilayer 
undulations, contributes to the strong, short-range 
repulsive forces of the surfaces of hydrated lipid bila¬ 
yers which oppose membrane fission. For a fine review 
of the principles, instruments, and more detailed 
results of the application of WSC to lipid bilayers, 
the reader should consult a recent review (Heerklotz 
2004). 

Summary 

The four calorimetric techniques discussed above have 
already provided much valuable thermodynamic infor¬ 
mation about the ► physical chemistry of the organi¬ 
zation, aggregation, and lipid self-assembly. A major 
future challenge is to apply these techniques to more 
complex lipid mixtures, containing specific membrane 
proteins, to obtain information more relevant to actual 
biological membranes. As well, future applications of 
these techniques to study a larger number of biologi¬ 
cally relevant lipid bilayer physical properties and 
processes are a priority. 


Cross-References 

► Critical Fluctuations in Lipid Mixtures 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 

► Glycerolipids; Chemistry 

► Lipid Bilayer Lateral Pressure Profile 

► Lipid Domains 

► Lipid Organization, Aggregation, and Self- 
assembly 

► Lipid Shape and Curvature Stress 

► Membrane Fluidity 

► Phase Transitions and Phase Behavior of Lipids 

► Sphingolipids and Gangliosides 

► Thermodynamics of Lipid Interactions 
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Differential Transition State 
Stabilization - Nonempirical Analysis 

Ewa Chudyk 

Centre for Computational Chemistry, School of 
Chemistry, University of Bristol, Bristol, UK 

Synonyms 

DTSS method 


Definition 

Ah initio quantum chemical energy analysis, used to 
analyze the origins and fundamental nature of contri¬ 
butions to transition state stabilization relative to the 
reactant state in a given catalytic (e.g., enzyme active 
site) environment, with respect to the gas phase. 


Basic Characteristics 
Introduction 

The DTSS method consists in a nonempirical analysis 
of the interaction energy between the reacting atoms 
(which can be termed the solute) and their catalytic 
environment (which can be termed the solvent or 
environment) (Sokalski 1985; Sokalski et al. 1988). It 
calculates the transition state stabilization by the envi¬ 
ronment with respect to the reactant state. Such analy¬ 
sis is performed for structures (transition state or 
reactant structure) generated by modeling of the reac¬ 
tion (e.g., using ► QM/MM methods) The DTSS is 
calculated according to the relation: 

Adtss = AEts-r — AEs-r 

where AEts-r and AEs-r stand for the interactions 
with the environment for the transition state structure 
and reactant structure, respectively. The more negative 
the DTSS value (Aqtss), the greater the stabilization of 
the transition state relative to the reactant, and the 
larger the catalytic effect (comparing to the reaction 
in the gas phase). To describe the physical nature of the 
interactions giving rise to DTSS in a chemical reaction, 
a variational-perturbational interaction energy decom¬ 
position scheme can be applied, partitioning the Adtss 
(typically calculated at the MP2 level of ah initio 
quantum mechanical electronic structure theory, with 
subsidiary less sophisticated energy calculations) into 
contributions from electrostatics, quantum mechanical 
exchange, delocalization, and correlation effects. It has 
been shown for some biological systems (enzyme- 
catalyzed reactions) that (relatively simply) additive 
electrostatics is the most important contribution to 
DTSS, with other effects tending to cancel each other 
out. DTSS analysis can then be useful not only to 
estimate the overall catalytic effect of the molecular 
environment, but also to analyze the separate contri¬ 
bution of its various elements (e.g., the contribution to 
barrier lowering, by electrostatic interactions, of amino 
acid sidechains in the enzyme active site). This kind of 
quantitative analysis can be useful when analyzing 
enzyme active sites. This could find potential applica¬ 
tions in predicting the effects of site-directed mutagen¬ 
esis effects on reaction barriers, and also, for example, 
in the design or engineering of enzyme or biomimetic 
catalysts. It should be stressed, however, that differen¬ 
tial transition state stabilization is only one potential 
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contributor to catalysis, and other effects such as bind¬ 
ing and solvation should also be considered for a full 
analysis. The choice and number of structures 
representing the transition state and reactant state 
should also be considered carefully. 

Applications 

DTSS analysis has been performed on several 
enzyme-catalyzed reactions, including chorismate 
mutase (Szefczyk et al. 2004) and cAMP-dependent 
protein kinase (Szarek et al. 2008). For those 
systems, electrostatic interactions were found to be 
the dominant effect in differential transition state 
stabilization. Pairwise calculations of electrostatic 
interactions between reaction centre and surrounding 
it residues (and their differences between the reactant 
and transition state) were used to distinguish the most 
catalytically important amino acids. DTSS analysis 
has also been carried out for ► fatty acid amide hydro¬ 
lase, for which it was used to identify residues impor¬ 
tant for interactions with three covalent inhibitors 
(Lodola et al. 2011). 


Cross-References 

► Chorismate Mutase - Computational Studies 

► Fatty Acid Amide Hydrolase - Computational 
Studies 

► Methylamine Dehydrogenase (MADH): Studies 
with Variational Transition State Theory (VTST) 

► QM/MM Methods 
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Transmembrane Pore Formation and 
Membrane Fusion 

Huey W. Huang 
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Synonyms 

Lipid bilayer; Neutron diffraction; X-ray diffraction 

Introduction 

X-ray and neutron scattering are uniquely suited for the 
studies of structural problems related to biological 
membranes. This is due to the penetrating nature of 
X-rays and neutrons, coupled with the natural ten¬ 
dency of lipid bilayers to form smectic liquid crystals. 
In comparison, electron microscopy is limited by the 
requirement of a vacuum environment for its sample. 
Atomic force spectroscopy is limited by its perturbing 
nature to soft matter. 

Antimicrobial peptides and membrane fusion are 
each a very active held of research. The two 
helds are not related except that both involve structural 
transformations in membranes. Antimicrobial peptides 
are host-defense molecules that kill invading microbes 
by permeabilizing the microbial cell membranes. 
Membrane fusion involves merging of two membranes 
into one. Investigators have applied varieties of methods 
to study the structural changes taking place in lipid 
bilayers, either caused by antimicrobial peptides or 
during membrane fusion. So far only diffraction 
methods have succeeded in obtaining the nanometer- 
resolved structures of antimicrobial-peptide-induced 
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transmembrane pores and membrane fusion interme¬ 
diate states. These diffraction methods will be 
reviewed here. 

Peptide-Induced Transmembrane Pores 

Extensive evidence suggests that the target of antimi¬ 
crobial peptides is the lipid matrix of cell membranes 
(Boman et al. 1994). Consequently the action of the 
peptides has been studied with lipid bilayers as 
substitutes for cell membranes. To observe the effect 
of antimicrobial peptides on a lipid bilayer, a typical 
experiment is done by introducing peptide molecules 
into a suspension of lipid vesicles containing a solution 
dye. The content dye was observed to leak out while the 
vesicles remained intact. Clearly antimicrobial peptides 
form pores in the membranes. Thus a fundamental 
question about the antimicrobial peptides has been: 
What are the structures of these peptide-induced 
pores? To detect the pores in membranes by scatter¬ 
ing/diffraction methods, the lipid vesicles with bound 
peptides were converted to parallel multiple-bilayers on 
a substrate, by removing the excess water. The same 
results can be obtained by first mixing the peptide and 
lipid in organic solvent and then replacing the solvent by 
hydration (Ludtke et al. 1995). Thus a sample is 
specified by the peptide and lipid and their molar ratio 
P/L. An important parameter that controls the physical 
condition of a multilayer sample is the level of 
hydration. This parameter is usually specified by the 
relative humidity (RH) of the water vapor with which 
the sample is in equilibrium. A fully hydrated sample 
means that it is equilibrated with a water (or sometimes 
D 2 O) vapor of 100% RH. It has been shown that at full 
hydration the multilayer preparation does not affect the 
peptide behavior in individual lipid bilayers (see below). 

The grazing-angle scattering intensity from a mul¬ 
tilayer sample is cast in the 2D Q-space of and 
(the out-of-plane and in-plane components of the scat¬ 
tering momentum) (Yang et al. 1998). At the fully 
hydrated condition, the bilayers are in the fluid state. 
If there are no transmembrane pores, the multilayers 
are uniform in the xy plane; there should be no in-plane 
scattering in the range of < 1-5 A“* (the latter is the 
approximate peak position of fluid paraffin chains). 
However, if there are pores, there should be a trans¬ 
membrane water channel through each pore. When the 
water is exchanged to D 2 O, the strong D-H contrast 


allows neutrons to detect the D 2 O channels, in addition 
to the lamellar diffraction from multiple bilayers. An 
example is shown in Fig. 1 for the peptide melittin with 
the lipid DTPC (dil3:0PC) at the peptide to lipid molar 
ratio P/L = 1/30 (Yang et al. 2001). The scattering 
pattern of the sample changed with the hydration con¬ 
ditions and temperature. We divide these patterns into 
two regions: the pre-crystalline region (Fig. la-c) and 
the crystalline phase (Fig. Id). 

Pre-crystalline Region 

In the pre-crystalline region, the lamellar diffraction by 
parallel bilayers is confined to the axis. The lamellar 
diffraction can be converted into an electron density/ 
neutron scattering length density profile as a function 
of z; this represents the lipid bilayer structure averaged 
over the xy plane. The scattering by transmembrane 
pores is off the axis, in the region 0 < Q, < 1.5 A“*. 
Scattering patterns similar to Fig. la-c have been 
obtained from other peptides including gramicidin 
(by X-ray (He et al. 1993; Yang et al. 1999)), 
alamethicin (He et al. 1996; Yang et al. 1999), 
magainin (Ludtke et al. 1996; Yang et al. 1999), and 
protegrin (Yang et al. 2000). The gramicidin example 
is important because the structure of gramicidin chan¬ 
nels in lipid bilayers is well established (Olah et al. 
1991). Gramicidin forms small dimeric protein chan¬ 
nels in bilayers (He et al. 1993), thus the scattering 
from gramicidin is representative of scattering from 
membrane pores in fluid lipid bilayers. Note the simi¬ 
larity of Fig. la-c obtained from melittin to Fig. 2 
obtained from gramicidin, other than the difference in 
the Q, position of the scattering peak. The position for 
the gramicidin peak is ^2.5 higher than for the melittin 
peak, which indicates the pore size difference. The 
similarity of the scattering from antimicrobial peptides 
to the scattering from gramicidin, particularly the nar¬ 
row widths of the peaks, suggests that in each sample 
the pores are of uniform size, just like the gramicidin 
channels in a bilayer are all of the same size. Thus, the 
intensity is expressed as (Warren 1990): 

/(Gz, Qr) = fV|F(Gz, G,)|'5(Gz, Qr) (1) 

where N is the number of pores, F{Q^, Q, ) the contrast 
form factor of a pore, and 5(Gz, Qr) the structure factor 
of the pores. The contrast form factor is the form factor 
of the pore minus the form factor of a patch of lipid 
bilayer filling the space of the pore (Yang et al. 1999), 
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Diffraction Methods for Studying Transmembrane Pore 
Formation and Membrane Fusion, Fig. 1 Neutron diffrac¬ 
tion patterns of melittin in DTPC at P/L = 1/30 measured at (a) 
27°C, 99% RH; (b) 20“C, 95% RH; (c) 17°C, 98% RH; (d) 10°C, 
94% RH. The degrees of correlations between pores resided in 


different bilayers varied with temperature and hydration, from 
no correlations between bilayers in (a) to a long range correla¬ 
tion in (d). All changes were reversible and reproducible 
(Reproduced from Yang et al. (2001)) 


Diffraction Methods for 
Studying Transmembrane 
Pore Formation and 
Membrane Fusion, 

Fig. 2 X-ray diffraction 
patterns of gramicidin in 
DLPC at P/L = 1/10. (Left) at 
100% RH; (Middle) 90% RH; 
(Right) 80% RH, all at 25° C 
(Reproduced from Yang et al. 
(1999)) 
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F{Qz, Qr) = Fp{Qz, Qr) - Fii,{Qz, Qr). For neutron 
scattering it is dominated by a D 20 -filled cylindrical 
channel with height h: 


F{Qz,Qr) 


sin(Qzb/2) 

Qzh/2 


Fr(Qr) 


( 2 ) 


The structure factor S{Qz,Qr) for multiple bilayers 
(numerated by the index m — 0, ±1, ±2 ...) can be 
written as (Yang et al. 1999): 


S{Qz,Q,) = 1 + '^^ex-p[iQz(zm - zo)] I no„,{r)exp{iQr.r)dr 

= Soo(er) + 2cOs(2,d)Soi(Or) + 2008(202^)502(6/-) 

+ .... 

(3) 

where r is the position vector of the centers of the pores 
in the xy plane. The correlation function nQ„,{r) is 


defined as the average number of pores in the wth 
bilayer at position r relative to an arbitrarily chosen 
pore in the 0th bilayer. Sqo is the in-plane structure 
factor, due to the correlations of the pores in the same 
bilayer, Soi is due to the correlations between the pores 
residing in nearest neighboring bilayers, and so on. 

An important general experimental result is that at 
full hydration, the Q^-dependence of I{Qz,Qr) agrees 
with the Qz-dependence of \F{Qz, Qr)f’ (Fig. 3). In the 
case of gramicidin, the Q^-dependence of the X-ray 
contrast form factor \F{Qz,Qr)'^ was computed based 
on the known structure of gramicidin channel (Yang 
et al. 1999). For alamethicin and magainin (Yang et al. 
1999) (and melittin (Fig. 4) below), the Q^-dependence 
of the neutron contrast form factor |F(Gz,2/-)P was 
calculated by Eq. 2 with a h consistent with the bilayer 
thickness. This means that 5(2-, Q, ) (see Eq. 1) has no 

significant2z-dependence,orthat5(2r,2/) ~ 5oo(2/-)- 

All 5om = 0 for m 0. The pores are positionally 
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Diffraction Methods for Studying Transmembrane Pore 
Formation and Membrane Fusion, Fig. 3 (a) 2D plots of 
X-ray scattering intensity from gramicidin in DLPC at P/L = 1/10, 
alamethicin in DLPC at P/L = 1/30, and magainin in DLPC at 
P/L = 1/30; (b) the Q^-dependence of the scattering intensity (data 
shown in small circles) obtained from (a) compared with the 
calculated Q^-dependence of the contrast form factor, (bl) 


The dotted line is the foim factor of gramicidin channel 
|Fg(Qz)|^ calculated from the NMR structure of the gramicidin 
channel. The solid line is the Q^-dependence of the contrast form 
factor \F^{Q,) ^ = \Fg{Qz) - Fih{Qz)\^-F is the contribu¬ 
tion of the lipid background. (b2) and (b3) are calculated from 
Eq. 2 with = 30 A, the thickness of DLPC bilayer (Reproduced 
from Yang et al. (1999)) 


uncorrelated between different bilayers. In other words, 
the scattering was the sum of the scatterings from indi¬ 
vidual bilayers, not affected by the multilayer prepara¬ 
tion. Under this condition, the Q,.-dependence of 
l{Qz,Qi) can be used to analyze the pore size in 
a fluid bilayer (He et al. 1996; Ludtke et al. 1996), by 
setting Eq. 1 at Qz = 0 and normalizing to = 1 (War¬ 
ren 1990): 7(0, e,) = |7^(O,0,)p5(O,G,)- ^0,2^) is 
determined by the diameter of the D 20 -filled water 
channel, which represents the internal diameter of the 
pore. 5(0, Q, ) can be simulated by freely diffusing hard 
disks with an adjustable diameter which represents the 
pore-pore collision distance or the outside diameter of 
the pore. An example is shown in Fig. 4 where the data 
fitting yielded an inside diameter 44 A and an outside 
diameter 76 A for a melittin pore. It is important to note 
that the scattering curve was fit by a liquid state of 


identical hard disks (see Fig. 4). If the pore size in the 
sample had significant variations, the scattering curve 
would be broader than the theoretical fitting curve (He 
et al. 1996). The uniformity of the pore size in a sample 
implies that the pore size is dictated by the free energy 
minimum of the collective state of peptides and lipid, 
rather than, for example, by random aggregations of 
monomers. 

There are so far no other methods that could provide 
the pore size measurement to this accuracy. This 
method has been applied to many peptides and the 
results showed that there are two types of peptide- 
induced pores (Yang et al. 2001). The analysis by 
neutron scattering found that the alamethicin pore has 
an inside diameter of 18 A and an outside diameter of 
40 A. These dimensions are in agreement with a barrel- 
stave model consisting of 8 monomers (He et al. 1996). 
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Diffraction Methods for Studying Transmembrane Pore 
Formation and Membrane Fusion, Fig. 4 (Left) Neutron 
diffraction pattern of melittin in POPC at P/L = 1/15 measured 
at 25°C, full hydration. (Middle) The circles are the intensity 
along at Q,. — 0.066 A“'. The solid line is the 22-dependence 
of the form factor |F(Qz,2r)P given by Eq. 2 with h — 33 A. 
(Right) The circles are the intensity along 2,- 3t 2z = 0. The 
dotted line is |F(0, Qr)l^ of a cylinder representing a D 20 -filled 
channel with an adjustable inside diameter of the pore. The thin 


solid line is Soo(2r) computer-simulated by freely diffusing hard 
disks, which is determined by the disk diameter and disk con¬ 
centration. The disk diameter represents the collision distance 
between pores, or the outside diameter of the pore. The thick 
solid line is the fit to the data by |F(0,2r)| ‘Soo(2r) which 
yielded two fitting parameters: inside diameter ~44 A, outside 
diameter ~76 A. For the details of data analysis see He et al. 
(1996) (Reproduced from Yang et al. (2001)) 


A barrel-stave model has been commonly presumed 
for peptide-induced pores. For alamethicin, the barrel- 
stave model is also consistent with the ion conductivity 
induced by alamethicin (Mak and Webb 1995). How¬ 
ever, all other antimicrobial peptides that were mea¬ 
sured by neutron scattering formed larger pores with 
inside diameter in the range of 30-50 A and outside 
diameter 70-84 A, which vary with peptide-lipid ratio 
and with lipid composition. More importantly, the 
numbers of peptide monomers in each pore were 
found to be less than those in the alamethicin pore 
(this was determined by the method of oriented circu¬ 
lar dichroism (Ludtke et al. 1996; Yang et al. 2001)). 
Therefore, the pores could not be of the barrel-stave 
type, even though they were often so presumed. These 
results led to a proposal for a new type of transmem¬ 
brane pores named the toroidal (or wormhole) model 
for the great majority of antimicrobial peptides which 
are multi-cationic (in contrast, alamethicin is essen¬ 
tially uncharged.) The toroidal model is formed by 
merging of the two monolayers of the bilayer at the 
edge of the pore (He et al. 1996). 

The most unexpected result of the neutron scatter¬ 
ing experiments was that the pores in the multilayer 
samples could order into a crystalline lattice that 
diffracted neutrons. First, it was noted that when the 
hydration level of multilayers was less than 100%, 
the Qz’dspendence of I{Qz,Qr) deviated from the 


Qz-dependence of \F{Qz, (see Figs. 1 and 2). By 
Eq. 1, this implies that there is a signihcant 
Qz-dependence in S(Q:,Q,), and hence there are 
contributions from one or more terms of Som,m ^ 0 
in Eq. 3. The reduced water layer between bilayers 
caused stronger interactions between bilayers which 
in turn caused positional correlations between pores 
in different bilayers. In a condition slightly deviated 
from full hydration, the correlations were sufficiently 
weak that the scattering curve could be analyzed with 
the first two terms of Eq. 3. Figure 5 shows an example 
that demonstrated the validity of the theory. More 
complete analyses were available in Yang et al. 
(1999, 2001). The pore correlations between bilayers 
could be strengthened by lowering the hydration level 
or temperature until the correlations became long- 
ranged. Then the pores fell into a crystalline lattice of 
rhombohedral (R) symmetry (Fig. Id). 

Crystalline Phase 

A crystalline phase of the membrane pores makes it 
possible to reconstruct the electron density distribu¬ 
tions of the pores by X-ray diffraction. However, the 
phase problem here presents a challenge quite different 
from those faced by protein crystallographers. The unit 
cell of soft matter crystals is a continuous distribution 
of atoms rather than a set of atoms fixed at discrete 
positions as in a molecular crystalline. For this and 
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Diffraction Methods for Studying Transmembrane Pore 
Formation and Membrane Fusion, Fig. 5 (Left) Neutron 
diffraction pattern of melittin in DLPC at P/L = 1/30 measured 
at 25°C, at 97% RH. (Right) The circles are the intensity 
along Qz at 2, = 0.068 A~*, the position of maximum in 2,. 


The solid line is the 2z“dependeitt;e of |7^(2zi2r)|^ 
(fli + a 2 Cos{Q 2 d)) (see Eqs. 2 and 3) with h — 30 A, d = 45 
A, and fitting parameters Uj and 02 - This proves the validity of 
the theory for the pre-crystalline region (Reproduced from 
Yang et al. (2001)) 


Other technical reasons, the phasing methods of protein 
crystallography are largely inapplicable to diffraction 
from soft-matter crystalline materials. 

A method particularly suited for resolving mem¬ 
brane structures is multiwavelength anomalous dif¬ 
fraction. Firstly, it should be recognized that any 
structure formed by lipid molecules with water has 
a morphology composed of continuous monolayers. 
Therefore, a label on the lipid chain, say a bromine 
atom, will form a continuous layer parallel to the 
monolayer. Secondly, multiwavelength anomalous 
diffraction will produce not only the diffraction ampli¬ 
tudes of the whole system but also the diffraction 
amplitudes of the label atoms (e.g., Br) alone. Having 
these two sets of diffraction amplitudes greatly facili¬ 
tate the solution of the diffraction phases. 

The brominated lipid l,2-distearoyl(9-10dibromo)- 
sn-glycero-3-phosphocholine (abbreviated as 
di 18:0(9,1 OBr)PC) is particularly useful because its 
property is similar to DOPC (dil8:lPC) in its phase 
behavior (Pan et al. 2006; Wang et al. 2007; Qian et al. 
2008a, b), and in its interactions with peptides (Qian 
et al. 2008a, b). For the barrel-stave pores, a sample of 
alamethicin in dil8:0(9,10Br)PC at P/L = 1/20 was 
used. In the range of hydration 40% < RH < 55% 
(25°C), the alamethicin-induced pores form 
a crystalline phase of rhombohedral symmetry (R 
phase). For toroidal pores, a sample of the peptide 
Bax a-5 (a peptide derived from pore-forming protein 
Bax) in dil8:0(9,10Br)PC at P/L - 1/30 was used. 


At RH;S78% (25°C), the Bax a-5-induced pores also 
form an R phase. The diffraction pattern of a multilayer 
sample is best observed by grazing-angle diffraction 
(Fig. 6). In this diffraction geometry, the detector 
image is very close to the pattern on the — Q,- 
plane. The crystalline symmetry can be easily deduced 
from the diffraction pattern on the detector (more in the 
next section). However, the complete set of analyzable 
diffraction data need to be obtained by a series of 
diffraction geometries described fully in Yang and 
Huang (2003) (and in Ding et al. (2004) for neutron 
diffraction). For multiwavelength anomalous diffrac¬ 
tion (MAD), the diffraction pattern of each R phase 
was measured at 10 chosen X-ray energies below the 
K-edge of Br, such that the values of/j at successive 
chosen energies differ by A/J = 0.5 in single electron 
units (see Table 1 of Qian et al. (2008)). The method of 
data reduction from the diffraction intensity to the 
relative magnitudes of diffraction amplitudes is 
described in Yang and Huang (2003) (see also Ding 
et al. (2004)). 

Let F) {Q) be the Fourier transform of the electron 
density of the unit cell, Fq the normal diffraction ampli¬ 
tudes of the whole system (including lipids, peptides, 
and water), F 2 the normal diffraction amplitudes of the 
Br atoms alone, and / =/” +f[ + if'I the scattering 
factor of Br atom. The real and imaginary parts of 
the anomalous scattering factor can be measured 
by a standard procedure (Qian et al. 2008). Then 
the measured diffraction amplitude Fx is given by 
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Fig. 6 X-ray diffraction 
patterns of the crystalline 

phase of alamethicin pores in • 

(a) dil8:0(9,10Br)PC and (b) 

Bax a-5 pores in 
dil8:0(9,10Br)PC. (Intensity 
attenuators were used on the 

strong peaks in order to show . , 

the weak peaks.) (Reproduced 

from Qian et al. (2008a, b) ♦ 

respectively) • j 


— Fq + [(f'j -F if")/f'’]F 2 . The lipidic structures 
developed from a symmetric bilayer are most likely 
centrosymmetric - this assumption will be directly 
tested and validated (or invalidated) by the data. Under 
the assumption of centrosymmetry, the amplitudes Fq 
and F 2 are real quantities that lead to the equation 

\F>\^ = [fo + {f’,jnF 2 f + {f'UrfFl. On the right 
hand side of this equation, the second term is about 1 % 
of the hrst term, due to the fact that at energies below the 
K edge the values of f" are about 10% of \f- \ for Br 
atoms (Qian et al. 2008). Therefore, an approximate 
relation is obtained: 

\F2\-±(^Fo-^F2'^ (4) 

Both the results of alamethicin and Bax a-5 systems 
showed a high degree of linearity in the data versus 
[/)'!//" (Figs. 7 and 8). Therefore, from Eq. 4, the 
magnitudes of Fq, F 2 , and their relative phase can be 
obtained by a linear fit. From the straight-line htting to 
the data |E;J versus [/f|//", the intercept of the fitted line 
gives |Eo I; the magnitude of the slope gives 1 ^ 2 11 and the 
sign of the slope gives the sign of — F 0 /F 2 . The strong 
correlations of the linear fits indicate excellent quality of 
the data and, most importantly, justify the assumption of 
centrosymmetry in the unit cell, since the linear relation 
Eq. 4 would not be valid without this assumption. 

Thus, this method produces a set of diffraction 
amplitudes Fq, for constructing the electron density 
distribution of the whole system, and a set of diffrac¬ 
tion amplitudes F 2 , for constructing the density distri¬ 
bution of Br atoms, and also the phase relations 
between Fq and F 2 - There are many different ways of 


b 


making use of these two sets of diffraction amplitudes 
to solve the phase problem (e.g.. Pan et al. (2006); 
Wang et al. (2007)). 

It is important to keep in mind that the goal of soft 
matter diffraction is not a high-resolution structure. 
Soft matter diffraction is inherently of low resolution 
because the unit cell structure is defined by 
a continuous atomic distribution. It is simply impossi¬ 
ble to reconstruct the density to atomic resolution as in 
molecular crystals. The distribution of Br atoms alone 
is often sufficient to reveal the lipid structures. 

The phase problem for both the alamethicin pore 
and the Bax a-5 pore (Fig. 9) was solved by the swell¬ 
ing methods (Qian et al. 2008a, b). The results verified 
both the barrel-stave model for alamethicin (Qian et al. 
2008a) and the toroidal model for cationic antimicro¬ 
bial peptides (Qian et al. 2008b). 

The Force Model of Membrane Fusion 

Membrane fusion research has been hampered by the 
difficulty of trapping and determining the structures of 
the merging membranes at various stages of fusion. 
Clearly one needs to know the process of lipid trans¬ 
formation in order to perceive the actions performed 
by proteins. In SNAREs-mediated membrane fusion, 
the formation of protein complexes is thought to trans¬ 
duce force to cause the merger of lipid bilayers 
(Jahn and Scheller 2006). Thus, it is of interest to 
examine the physical configurations of compressed 
lipid bilayers. 

Consider two lipid bilayers parallel to each other 
with water in between. Imagine a cylindrical volume 
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Diffraction Methods for Studying Transmembrane Pore Formation and Membrane Fusion, Fig. 7 Data \Fx \ versus \f[ \ //" 
from the MAD measurement of alamethicin in dil8:0(9,10Br)PC (Reproduced from Qian et al. (2008a)) 


with the two bilayers as the top and the bottom. 
A pressure applied to the cylindrical volume would 
not merge the lipid bilayers unless the water is 
allowed to leave the volume. In other words, the wall 
of the cylindrical volume must be regarded as 
a semipermeable membrane allowing the passage of 


water. Such a mechanical operation is exactly equiva¬ 
lent to applying an osmotic pressure to the same 
volume. Thus, the physical configurations of com¬ 
pressed lipid bilayers can be examined by X-ray dif¬ 
fraction on an equivalent system of bilayers under 
osmotic pressure. 
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Diffraction Methods for Studying Transmembrane Pore Formation and Membrane Fusion, Fig. 8 Data |Fi| versus 
from the MAD measurement of Bax a-5 in dil8:0(9,10Br)PC (Reproduced from Qian et al. (2008b)) 


A multilayer sample of dil8:0(9,10Br)PC was 
housed in a temperature-humidity chamber while 
being examined by grazing-angle X-ray diffraction at 
25°C. At full hydration (zero osmotic pressure) the 
lipid was in a fluid lamellar (Lq,) phase (Fig. 10a). 
The lamellar diffraction showed parallel bilayers 
separated by the phosphate group-to-phosphate group 
(P-P) distance ^2 nm (Wang et al. 2006). 
This P-P distance steadily decreased as the osmotic 
pressure increased (decreasing RH). At ~64% RH, 
the lipid-water system underwent a phase transition. 
The new phase appeared in a periodic lattice of tetrag¬ 
onal symmetry (space group /4i, Fig. 10b). As the 
humidity level further decreased to ^60% RFl, the 


tetragonal (T) phase transformed to a rhombohedral 
(R) phase of lattice symmetry R3 (Fig. 10c). Below 
60% RH, the system remained in the R phase and no 
further phase transition was observed. 

The primitive unit cell of the T phase (Fig. 10b) has 
dimensions a x a x 4c with a = 7.99 nm, c = 
5.06 nm. The unit cell consists of four square bilayer 
units. If the center of the first bilayer unit is at the origin 
{x = 0, y — 0, z = 0), the centers of the second, third, 
and fourth square bilayer units are, respectively, at 
(a/2, 0, c), (a/2, a/2, 2c), (0, a/2, 3c) for right- 
hand screw symmetry, or at (0, a/2, c), (a/2, a/2, 
2c), (a/2, 0, 3c) for left-hand screw symmetry. Both 
would give the same diffraction patterns. The R phase 
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Diffraction Methods for Studying Transmembrane Pore 
Formation and Membrane Fusion, Fig. 9 Two models of 
peptide-induced pores in membranes and the experimental proof 
for each. Schematics of the (a) barrel-stave model and (b) the 
toroidal model are shown in a 3D view cut through the pores, 
showing only the layers representing the lipid headgroups 
(silver) and the Br atoms (red). Blue-gray cylinders represent 
peptide segments; the actual distribution and orientation of 


peptides are expected to vary with peptide and lipid. Schematic 
lipid molecules (dil8:0(9,10Br)PC) are shown in (a), where 
the red dots represent the Br atoms on the chains, (c) and (d) 
are the normalized electron density distributions of Br atoms in 
a unit cell of the R phase containing alamethicin and Bax-otS, 
respectively. Br atoms are distributed in the high-density 
(yellow-red-black) region (Reproduced from Qian et al. 
(2008a, 2008b)) 


a 

Diffraction Methods for 
Studying Transmembrane 
Pore Formation and 
Membrane Fusion, 

Fig. 10 Grazing-angle X-ray 
diffraction patterns of 
dil8:0(9,10Br)PC as 
a function of osmotic pressure 
(expressed by relative 
humidity RH) at 25°C: (a) in 
the L phase at humidity levels 
above 64% RH; (b) in the 
T phase between 64% and 
60% RH; (c) in the R phase 
below 60% RH (Reproduced 
from Qian and Huang (2012)) 


I 


b 


c 






lattice (Fig. 10c) has crystal axes (a, 0,0), 
(—a/2, •\/3a/2,0), {a/2,al2'</3,c) with lattice con¬ 
stants a = 7.58 nm and c = 5.00nm. 

The structure of the R phase was solved by the 
swelling method (Yang and Fluang 2003). The electron 


density distribution of its unit cell (Fig. 11) showed 
that the adjacent monolayers of the two apposed 
bilayers merged to form an hourglass-like connection, 
called a stalk. This discovery confirmed the long 
speculated stalk intermediate state. The experimental 
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Diffraction Methods for Studying Transmembrane Pore 
Formation and Membrane Fusion, Fig. 11 The R phase 
structure, (a) Electron density distribution in the unit cell of the 
rhombohedral lattice shows a stalk structure. (High density is 


white, low density is black.) (b) A 2D cut through the unit cell, 
(c) The schematic of lipid configurations in a stalk where two 
lipid monolayers merged to form an hourglass structure 
(Reproduced from Yang and Huang (2003)) 


confirmation of this intermediate structure has helped 
to make the stalk hemifusion pathway a key concept 
for interpreting the data of membrane fusion in 
recent years. 

The unit cell structure of the T phase was solved by 
using the Br-MAD method described above (Qian and 
Huang 2012). The electron density distribution in 
a bilayer unit showed that two adjacent lipid bilayers 
curved toward each other to form point contact 
(Fig. 12). The ridgeline of the electron density distri¬ 
bution representing the phosphate layer marks the 
positions of phosphorus (P) atoms. The shortest P-P 
distance between the two bilayers is 0.90 nm. Perhaps 
the two headgroups (the phosphorylcholines) are in 
contact, but there is no direct proof. For the conve¬ 
nience of discussion, the configuration of the two lipid 
bilayers in T phase is considered to be in point contact. 
Outside of this region of point contact, the P-P distance 
gradually increased to 1.61 nm at the edge of the unit 
cell. Lipid bilayers abruptly transformed to this con¬ 
figuration of point contact from a parallel configuration 
in the lamellar phase where the P-P distance was 
a uniform value of 1.20 nm, right before the phase 
transition. Starting from the fully hydrated Lot phase 
where the P-P distance was ^2 nm (Wang et al. 2006), 
osmotic pressure applied to the multilamellae mono- 
tonically decreased the P-P distance without changing 
the parallel conhguration until the P-P distance 
reached this critical value of 1.20 nm. At this point 
a slight increase in osmotic pressure transformed the 
parallel bilayers to a configuration of multiple contact 
points, one contact point for every (7.99)^nm^ area. 


Most surprisingly this point-contact configuration 
between lipid bilayers is sufficiently stable to form 
a periodic lattice. As the osmotic pressure further 
increased, the lipid conhguration at each contact 
point abruptly transformed to a stalk (Yang and 
Huang 2003), and the stalks rearranged into 
a rhombohedral lattice. 

Most signihcantly these results show that there is 
a dehnitive merging pathway for a pair of lipid bilayers 
under a compression force against the hydration bar¬ 
rier. This result supports the force model of Jahn and 
Scheller (Jahn and Scheller 2006), that is, SNARE 
assemblies exert a compressive force on the juxtaposed 
membranes, perhaps by the zippering of SNARE 
complexes toward the linker regions. The T and 
R phases allow an estimate of the energy required for 
a stalk formation. The osmotic pressure is given 
by n =—{ksT/v„)ln{RH), which is equivalent to 
a mechanical pressure P applied to the lipid bilayers 
while allowing water access to an external reservoir 
(kg is the Boltzmann constant, T the absolute temper¬ 
ature, v^^, the molecular volume of water). At 64% RH 
(25° C) when the point contacts developed, the equiv¬ 
alent pressure on the lipid bilayers is F = 6.1 x 10^ 
dyne/cm^. The repulsive pressure by the hydration 
force between lipid bilayers is empirically expressed 
asP — where Pg and 2 are fitting parameters 

and is the water layer thickness (Rand and Parsegian 
1989). By the empirical expression, the work done by 
the compressive force is Vk = A Pge^~l^dz = AXP, 
where A is the area to which the pressure is applied. 
Thus the work done to overcome the hydration 
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Diffraction Methods for Studying Transmembrane Pore 
Formation and Membrane Fusion, Fig. 12 The T phase 
structure. Electron density distributions in one bilayer unit 
consisting of the two contacting leaflets of merging bilayers, 
(a) Density distribution of the whole lipid obtained from Fg- 
(al) is the schematic of lipid distribution in (a); the bilayer unit is 
enclosed by the dashed tine; the red dots represent Br atoms, (b) 
Density distribution of Br atoms alone obtained from F 2 , 


corresponding to (a); both shown on the y = 0 cross section. 
The multiwavelength anomalous dispersion method yields the 
two densities separately, but the phases must be chosen so that 
the results are consistent with each other (note the consistency 
between (a) and (b)). (c) 1/4 of one bilayer unit of whole lipid is 
shown in 3D with the boundary of the bilayer unit in solid lines 
7.99 X 7.99 X 5.06nm^ (Reproduced from Qian and Huang 
( 2012 )) 


barrier for each contact region (64 nm^) is 118 kcal/ 
mol (using A = 2.1A) (Rand and Parsegian 1989). 
This is essentially the work done for producing 
a stalk, because very little work was done in the 
transition from the T phase (the point-contact phase) 
to the stalk phase. This energy depends on the lipid 
composition. For DOPC/DOPE (1:1), the L to 
R transition occurs near 80%RF1; the work done for 
overcoming the hydration barrier to produce each 
stalk is ~60 kcal/mol. Addition of cholesterol is 
known to raise the RH for the L to R transition. 
Thus it is reasonable to estimate that in general the 
energy required for producing a stalk is in the range of 
50-100 kcal/mol (a precise value can be measured for 
each lipid composition). This energy range is com¬ 
patible with the energy ^20 kcal/mol estimated to be 
released by each SNARE complex formation, since 
apparently multiple SNARE complexes are involved 


in each fusion reaction. This study also shows for the 
first time that the fusion zone between two lipid bila¬ 
yers is about 64 nm^. 

Summary 

A local structural feature in a lipid bilayer, such as 
a peptide-induced pore, or a feature between two lipid 
bilayers, such as a stalk, can be detected and studied by 
in-plane scattering. A local deformation in a lipid 
bilayer necessarily induces a long-range elastic stress 
held, by the intrinsic property of a lipid hilayer. There¬ 
fore, multiple local structural deformations in a bilayer 
are often positionally correlated. In multiple bilayers, 
correlations between bilayers can be strengthened by 
dehydration. As a result, the local structures in multi¬ 
layers often order into a crystalline lattice in low 
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hydration. This review shows some X-ray and neutron 
diffraction techniques that can be used to resolve the 
electron density/neutron scattering length density dis¬ 
tributions of local structural features that are important 
to biological functions of cellular membranes. This is 
a field yet to be explored extensively by the soft matter 
community. 

Cross-References 

► Neutron Diffraction 

► Neutron Scattering of Membranes 

► X-Ray Scattering of Lipid Membranes 
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Dihydrofolate Reductase - 
Computational Studies 

Kara E. Ranaghan 

Centre for Computational Chemistry, School of 
Chemistry, University of Bristol, Bristol, UK 

Definition 

Dihydrofolate reductase (DHFR) is an enzyme that 
catalyzes the reduction of dihydrofolic acid to 
tetrahydrofolic acid using nicotinamide adenine 
dinucleotide phosphate (NADPH) as a cofactor. 

Basic Characteristics 

DHFR is important in the biosynthesis of purines, 
thymidylate, and several amino acids. Consequently, 
this enzyme has been the focus of much research 
experimentally and computationally. The reduction 
of 7,8-dihydrofolate (DHF) to 5,6,7,8-tetrahydrofolate 
(THF) involves a hydride transfer from NADPH. 
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Several different computational methods have been 
used to investigate quantum effects in the reaction 
catalyzed by DHFR. Hammes-Schiffer and coworkers 
used a hybrid approach to examine the importance of 
dynamical effects in the reaction. The free energy 
profile was generated as a function of a collective 
reaction coordinate using ► empirical valence bond 
(EVB) techniques. The transmission coefficient was 
calculated by propagating trajectories initiated at the 
TS backward and forward in time to account for 
dynamical barrier recrossings. Nuclear quantum 
effects such as zero-point energy and tunneling were 
incorporated into the free energy profiles by 
representing the transferring hydrogen as a three- 
dimensional wave function. Truhlar and coworkers 
have used a variational transition state theory (VTST) 
approach with ► QM/MM methods to study DHFR 
from E. coli and also the hyperthermophile 
Thermotoga maritima (TmDHFR)(Pu et al. 2006). 
Classical potentials of mean force (PMFs) and 
multidimensional quantum tunneling and vibrational 
corrections were calculated over a range of tempera¬ 
tures to match the temperature range studied experi¬ 
mentally. It is encouraging that despite using very 
different methodologies to calculate the quantum 
effects in E. coli DHFR, Hammes-Schiffer and 
coworkers (Hammes-Schiffer 2004) and Truhlar and 
coworkers (Pu et al. 2006) obtained results in reason¬ 
ably good agreement with each other. 

There has been much debate over suggestions 
that protein dynamics are important in catalysis 
in this enzyme (Kamerlin and Warshel 2010a, b; 
McGeagh et al. 2011). Hammes-Schiffer and 
coworkers found that the thermally averaged motions 
occur on the millisecond timescale, whereas the 
dynamical motions near the transition state of the 
hydride transfer reaction occur on the femtosecond to 
picosecond timescale (Hammes-Schiffer 2004). The 
results indicated that residues within and far from 
the active site affect the free energy of reaction and 
the degree of barrier recrossing. Furthermore, they 
suggest that this type of network may have evolved to 
optimize the hydride transfer step so that it is not the 
rate-limiting step in the overall enzyme reaction. 
Allemann and coworkers (Pang et al. 2006) also 
found evidence of several correlated motions in their 
study of TmDHFR. The importance of dynamics in 
DHFR was also proposed by Allemann and coworkers 
from their study of TmDHFR. The same correlated 


motions were observed across the temperature range 
but their intensity was reduced. The authors suggested 
that the dynamics of this enzyme are coupled to reac¬ 
tion in DHFR and that DHFRs from different species 
have evolved with their quaternary structures in a way 
that maximizes the coupling of their motion to the 
reaction coordinate (Pang et al. 2006). Warshel and 
coworkers have also performed several studies of 
DHFR using EVB techniques (Kamerlin and Warshel 
2010a, b) and argue strongly against the importance 
of dynamics in DHFR catalysis. In their latest study 
(Adamczyk et al. 2011), they have investigated several 
mutant DHFRs proposed by Bhabha et al. to be dynam¬ 
ically linked to catalysis (Bhabha et al. 2011). Their 
simulations show that the mutations change the chem¬ 
ical step in catalysis by changing the activation free 
energy rather than through any dynamical effect. 
Changes in the reaction potential surface modify the 
reorganization free energy, resulting in a change in 
motion rather than the motion itself driving the change. 

Cross-References 

► Computational Studies of Enzyme Motions 

► Empirical Valence Bond Methods 

► Protein Dynamics in Catalysis - Computational 
Studies 

► Quantum Effects/Tunneling 
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DNA Curtains 

Brett Alcott* and Eric C. Greene'’^ 

'Department of Biochemistry and Molecular 
Biophysics, Columbia University, New York, 

NY, USA 

^Howard Hughes Medical Institute, New York, 
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Definition 

DNA Curtains refers to a methodology used to align 
and organize DNA molecules on the surface of 
a microfluidic sample chamber so that single DNA 
and bound protein molecules can be observed by 
fluorescence microscopy. 

Introduction 

Protein-nucleic acid interactions are critical for many 
basic cellular functions, including genome replication, 
repair and maintenance, gene expression, and protein 
synthesis. These interactions must be precisely regu¬ 
lated and defects that alter or destroy proteins involved 
in these processes can lead to a variety of disease 
phenotypes. Single-molecule imaging has rapidly 
emerged as a powerful tool that is radically changing 
the way biologically important macromolecules can be 
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studied, and these approaches have proven particularly 
amenable for studying protein-nucleic acid interac¬ 
tions. ► Single-molecule methods are unique in that 
they allow researchers to probe individual biochemical 
reactions in real time and have opened a window into 
the behavior of proteins as individual entities, reveal¬ 
ing mechanistic details that would be impossible to 
discern through bulk experiments. 

However, single molecule techniques can be tech¬ 
nically challenging and low-throughput, making col¬ 
lection of statistically relevant data sets difficult. Here 
a technique is described called “DNA curtains” that 
uses DNA molecules tethered to an inert lipid bilayer 
and organized via diffusion barriers to enable the 
simultaneous visualization of hundreds of individual 
molecules by ► total internal reflection fluorescence 
microscopy (TIRFM). 

DNA Curtains: A Tool for High-Throughput 
Single-Moiecule Imaging 

DNA curtain technology relies upon microfluidic flow 
cells wherein the surface of the cell is coated with 
a “biologically friendly” lipid bilayer (Greene et al. 
2010). The bilayer is disrupted at defined locations 
with manually etched or nanofabricated barriers to 
► lipid lateral diffusion. These barriers are then used 
to organize lipid-tethered DNA molecules into defined 
patterns via the application of hydrodynamic flow. 
This experimental platform can be adapted for numer¬ 
ous biochemical and biophysical questions involving 
protein-DNA interactions. 

TIRFM 

DNA curtain technology uses TIRFM for visualizing 
fluorescently labeled DNA and proteins (Fig. la). In 
TIRFM, a laser beam is directed through a microscope 
slide and reflected off the interface between the slide 
and an aqueous buffer to generate an evanescent wave 
at the slide surface. This wave penetrates at most 
^200-300 nanometers (nm) into the aqueous sample 
and yields a very small ► fluorescence excitation 
volume, resulting in low background signal relative 
to other wide-field illumination techniques (Axelrod 
1989). 

One option is to use a simple through-prism TIRFM 
configuration with a diode-pumped solid-state laser, 
which is focused through the fused silica prism 


onto a microfluidic flow cell to generate the evanescent 
wave within the sample chamber (Fig. la). Images are 
collected with an objective lens, passed through 
a notch filter to eliminate scattered laser light, 
and detected using a back-illuminated EM-CCD 
(electron-multiplying charge-coupled device) camera. 
For multicolor operation, the images are passed 
through an image-splitter containing a dichroic mirror 
that separates the optical paths. Such TIRFM experi¬ 
ments can utilize microfluidic flow cells machined 
and assembled as described (Greene et al. 2010). 

Preparing DNA Curtains 

To assemble DNA curtains, a lipid bilayer is first 
deposited onto the surface of the sample chamber 
(Fig. lb). The bilayer renders the slide surface inert 
and creates mobile attachment points for the DNA 
molecules. Most straightforward is the use of the linear 
48.5 kilobase (kb) genome of bacteriophage X as 
a substrate. This DNA is convenient because it has 
12-nuclotide (nt) ssDNA overhangs at either end that 
can be readily modified with biotin. A fraction of the 
lipids within the bilayer is biotinylated, and the DNA is 
coupled to the bilayer through a biotin-streptavidin 
linkage. When buffer is passed through the sample 
chamber it pushes the DNA through the chamber 
with the tethered DNA ends still attached to the bilayer 
(Fig. Ib-c). Diffusion barriers disrupt the continuity of 
the bilayer, which in turn halts the movement of the 
DNA molecules, yielding a pattern of molecules that 
resembles a curtain (Fig Ib-c). 

Nanofabricated DNA Curtains 

DNA curtain technology is based on the work of Boxer 
and colleagues, who demonstrated that lipids are 
unable to traverse manually etched microscale barriers 
(i.e., scratches) on a glass surface (Cremer et al. 1999). 
The first generation of DNA curtains used manually 
etched barriers, but these suffered from a number 
of drawbacks, including poor reproducibility and 
placement, deleterious effects of light scattering by 
barriers etched too widely or deeply, and poor-quality 
barriers could cause nonspecific adsorption of proteins 
(Greene et al. 2010). To overcome these difficulties, 
one can nanofabricate the diffusion barriers (Fig Ib-c, 
Fig 2a; Fazio et al. 2008). 

Nanofabricated barriers can be made using 
electron-beam (ebeam) lithography, which yields 
highly uniform patterns with nanometer precision 
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DNA Curtains, Fig. 1 TIRFM and DNA Curtains Schematic 
diagram of lipid tethered DNA molecules aligned at a diffusion 
barrier, (a) TIRFM used to image single molecules of DNA. The 
DNA molecules must be extended parallel to the surface of the 
sample chamber in order to remain confined within the evanes¬ 
cent field, (b) and (c) Depict the bilayer on the surface of a fused 
silica slide along with a barrier and the response of tethered DNA 


direction of hydrodynamic force 



evanescent field 
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molecules to the application of a hydrodynamic force. The upper 
and lower panels depict views from the side and above, respec¬ 
tively. In the absence of flow (b and c, left) the DNA molecules 
are not confined within the evanescent field, nor are they aligned 
at the barrier, (b and c, right) When flow is applied, the DNA 
molecules align at the diffusion barrier. Drawings are not to 
scale (Reprinted with permission from Fazio et al. (2008)) 


(Fazio et al. 2008; Greene et al. 2010). For ebeam 
lithography, the microscope slide is first coated with 
a thin polymer, and an electron beam is used to “burn” 
a desired pattern into the polymer, exposing the under¬ 
lying slide surface. Metal (typical is chromium) is then 
vaporized and deposited over the entire surface, and 
the remaining polymer is then peeled away leaving 
behind the metal barriers (Fig Ib-c, Fig 2a). These 
nanoscale barriers eliminate the aforementioned 


problems with manually etched harriers, and are highly 
effective for making DNA curtains (Greene et al. 
2010 ). 

Ebeam lithography allows for arbitrary control over 
the dimensions and geometry of the harrier patterns, 
and relatively simple variations in pattern geometry 
can enable controlled manipulation of DNA mole¬ 
cules. For example, nanopattemed barriers shaped 
like a saw tooth pattern can be used to control the 
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DNA Curtains, Fig. 2 Imaging nucleosomes with DNA Cur¬ 
tains (a) Schematic for visualizing fluorescently tagged recom¬ 
binant nucleosomes on DNA curtains, (b) DNA curtains stained 
with the intercalating dye YOYOl (green) bound by nucleo¬ 
somes (magenta) tagged with anti-Flag quantum dots. The teth¬ 
ered end of each curtain is indicated as T1-T4, and arrows 
indicate the direction of flow, (c) Kymogram illustrating five 
nucleosomes on one DNA molecule. The nucleosomes disappear 


when flow is temporarily interrupted (blue arrowhead), and 
reappear when flow is resumed (green arrowhead), verifying 
that they are bound to the DNA and do not interact with the lipid 
bilayer, (d) The theoretical distribution of nucleosomes based on 
DNA sequence composition (upper panel) compared to the 
experimentally measured distribution from DNA curtains 
(lower panel) (Reprinted with permission from Visnapuu and 
Greene (2009)) 
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DNA Curtains, Fig. 3 Double-tethered DNA curtains and one- 
dimensiona! dijfusion of Mlhl-Pmsl (a) Diagram of the 
nanofabricated device used for making the double-tethered 
DNA curtains. The rack consists of linear barriers to lipid diffu¬ 
sion, which align the lipid-tethered DNA molecules, followed by 
an array of antibody-coated pentagons that provide immobile 
anchor points for the second end of the DNA. (b) Top and 
bottom, YOYOl-stained DNA curtains {green) with and 


without QD-tagged Mlhl-Pmsl (magenta), respectively, (c) 
Kymograms illustrating the motion of Mlhl-Pmsl. The lower 
panel shows photocleavage (arrowhead) of a DNA during 
data collection. The proteins disappear from view when the 
DNA breaks, showing that the proteins and DNA are not 
adsorbed to the bilayer (Reprinted with permission from Gorman 
et al. (2010b)) 
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DNA Curtains, 

Fig. 4 Protein-protein 
collisions visualized with DNA 
Curtains (a) Histogram of 
EcoRI^*' and Lad bound to 
L-DNA. The locations of the 
five EcoRI sites found in X- 
DNA are indicated, along with 
examples of QD-EcoRI^"**^ 
bound to YOYO 1-stained 
DNA (inset, upper panel) and 
examples of QD-LacI bound 
to the DNA (inset, lower 
panel), (b) Kymograms 
showing RecBCD colliding 
with single molecules of 
EcoRl‘^““5andLacI 
(magenta), as indicated, (c) 
Distribution of event types for 
RecBCD collisions with 
EcoRl‘^““5andLacI. (d) 
Tracking data for individual 
collisions (Reprinted with 
permission from Finkelstein 
etal. (2010)) 





lateral separation of individual molecules within the 
curtain (Visnapuu et al. 2008). More complex barrier 
patterns comprised of two distinct pattern elements 
have also been devised; linear barriers for aligning 
the lipid-tethered DNA molecules; and large anti- 
body-coated pentagons that capture the free ends 
of the DNA, allowing the DNA molecules to be 
visualized along their full contour lengths even in 
the absence of continual buffer flow (Fig 3a; 
Gorman et al. 2010a). The ability to fabricate new 
nanofabricated patterns designed to suit a large array 


of conceivable experiments gives the researcher 
a powerful tool for investigating DNA-protein interac¬ 
tions with TIRFM. 

Visualizing Protein-DNA Interactions 

The main motivation for developing DNA curtains is 
for use in single-molecule studies of protein-DNA inter¬ 
actions. These studies rely upon the use of fluorescently 
tagged proteins, and quantum dots (QDs) are particu¬ 
larly favorable as labeling reagents (Greene et al. 2010). 
QDs are extremely bright and photostable, allowing 
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proteins to be viewed over long periods of time without 
photobleaching, and the positions and movements of the 
proteins can be quantihed by ► single particle tracking 
(Greene et al. 2010). Moreover, it is easy to tag recom¬ 
binant proteins that are expressed as fusions with either 
an epitope tag (FLAG, HA, etc.) or a short amino acid 
sequence that is biotinylated in vivo. These tags are 
used as handles for labeling the protein of interest with 
QDs conjugated to antibodies or streptavidin (Greene 
et al. 2010; Finkelstein et al. 2010). 

DNA curtains have been used to assess the distri¬ 
butions and lifetimes of both sequence-nonspecihc 
DNA binding proteins such as nucleosomes (Fig. 2; 
Visnapuu and Greene 2009), as well as sequence- 
specihc DNA-binding protein such as lac repressor, 
EcoRl, and RNA polymerase (Fig. 4; Finkelstein 
et al. 2010). These binding distribution measurements 
are greatly simplihed due to the fact that the DNA 
molecules are all aligned in the same orientation with 
respect to one another, such that a hypothetical line 
drawn across the curtain perpendicular to the long axis 
of the DNA would cross the same sequence on each 
individual molecule. Similarly, site-specihc DNA- 
binding proteins are revealed as fluorescent lines 
spanning the width of the curtain, the position of 
which corresponds to the preferred binding sites 
(Fazio et al. 2008; Finkelstein et al. 2010). 

In addition to protein binding distributions, DNA 
curtains have also been used to study the movement of 
proteins along DNA, including one-dimensional diffu¬ 
sion of DNA repair proteins such as Msh2-Msh6 and 
Mlhl-Pmsl, as they travel along DNA (Fig. 3; Gorman 
et al. 2010b), as well as the motion of DNA motor 
proteins such as the helicase RecBCD and E. coli rep¬ 
lication forks (Fig. 4; Finkelstein et al. 2010; Yao et al. 
2009). Protein movement can be quantihed by using 
single particle tracking to follow the QD fluorescent 
signal, or in the case of RecBCD and replication forks, 
by monitoring the length of the DNA substrates. The 
power of DNA curtains for studying these dynamic 
processes is that they allow collection of statistically 
relevant single-molecule data sets, comprised of spatial 
and temporal information, much more quickly than if 
the reactions could only be visualized one at a time. 

In summary, DNA curtains enable the alignment of 
thousands of DNA molecules, which can then be visu¬ 
alized in real time by TIRFM. The main advantages of 
this experimental platform are that one can directly 
visualize proteins bound to DNA, can see where the 


proteins are bound, as well as how they move along 
DNA, and how they interact with and influence one 
another, and all of this can be done in real time at the 
single molecule level. 

Cross-References 

► Fluorescence 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Lipid Lateral Diffusion 

► Protein Fluorescent Dye Labeling 

► Single-Molecule Methods 

► Single Particle Tracking 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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DNA Repair and Recombination 

Matthew J. Rossi and Alexander V. Mazin 
Department of Biochemistry and Molecular Biology, 
Drexel University College of Medicine, Philadelphia, 
PA, USA 

Synonyms 

DNA; Recombinational DNA repair 

Definition 

DNA repair is the term used to describe the numerous 
processes of the cell that are responsible for repairing 
damaged DNA and maintaining genome stability. 

Homologous Recombination is the transfer of 
genetic material between similar DNA sequences 
that contributes to an increase in genetic diversity. 
The enzymes of the homologous recombination 
pathway promote faithful chromosome segregation 
during meiosis and the repair of double-strand DNA 
breaks. 

Introduction 

Environmental factors and metabolic toxins such as 
ionizing radiation and reactive oxygen species 
constantly cause damage to DNA resulting in the loss 
of genetic information. In order to restore this genetic 
information and maintain genomic stability, the cell 
has committed an immense number of genes to con¬ 
stantly monitor and maintain DNA sequences. These 
genes constitute a variety of pathways that are collec¬ 
tively referred to as DNA repair (Friedberg et al. 2006). 
The most common types of DNA damage involve 
chemical modification, loss, or incorporation of incor¬ 
rect nucleotide bases. These aberrant nucleotides are 
recognized by the base excision, nucleotide excision. 


and mismatch repair pathways that remove the dam¬ 
aged bases, and allow ► DNA polymerase to restore 
the lost information by using the complementary 
strand of DNA as a template (Fig. la). 

A far more harmful type of DNA damage occurs 
when the phosphodiester bonds in both DNA strands 
are broken. Cells have evolved two different pathways 
to repair these double-strand breaks. The first pathway, 
non-homologous end-joining, is used to ligate free 
DNA ends together. This pathway, however, is 
error-prone, i.e., often resulting in the loss of one 
or more nucleotides at the break site (Fig. lb). 
Non-homologous end-joining is used during the G1 
phase of the cell cycle before DNA replication occurs. 
The second, a more accurate pathway known as 
homologous recombination, uses intact homologous 
DNA sequences to direct the repair of DNA double¬ 
strand breaks (Fig. Ic). In somatic cells, homologous 
recombination operates during or shortly after DNA 
replication in the S and G2 phases of the cell cycle and 
uses sister chromatids as homologous DNA templates. 
Besides repairing double-strand DNA breaks, recom¬ 
bination is also needed in the accurate segregation of 
homologous chromosomes during meiosis (Friedberg 
et al. 2006). Homologous chromosomes become 
linked through crossovers that ensure each pair of 
chromosomes is physically connected before division 
begins. The crossing over events result in a new 
arrangement of the genetic material that increases 
genetic diversity among gametes. Thus, recombination 
plays the dual role of maintaining the genome on 
an individual basis, and promoting genetic diversity 
on a generational basis. Malfunction of the non- 
homologous end-joining and homologous recombina¬ 
tion pathways causes genomic instability that leads to 
cancer development and various chromosomal 
abnormalities. 

Recombinase Proteins Form Filaments 
on DNA 

The salient feature of homologous recombination is the 
use of homologous DNA sequences as a template to 
achieve high fidelity repair of DNA double-strand 
breaks. The pairing of homologous DNA sequences 
is carried out by a highly conserved protein called the 
recombinase through a reaction termed “DNA strand 
exchange.” Recombinase proteins have been identified 
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DNA Repair and Recombination, Fig. 1 Scheme of nucleotide excision repair (a), non-homologous end-joining (b), and 
homologous recombination (c) 


in every domain of life and include the prokaryotic 
RecA, archeal RadA, and eukaryotic Rad51. Both the 
RecA and Rad51 proteins have been extensively 
studied through ► single molecule methods (Forget 
and Kowalczykowski 2010). The active form of the 
recombinase is a helical nucleoprotein filament that 
initiates assembly on single-stranded DNA and then 
polymerizes along single- or double-stranded DNA 
with a molecule of ATP bound between each mono¬ 
mer. The prokaryotic RecA filament contains ^6.2 
monomers per turn with each monomer binding three 
nucleotides. The DNA held within the filament adopts 
an extended, unwound conformation that has ^18 
bases per turn with an average distance between 
bases of 5.2 A, making it 50% longer than standard 


B-form DNA (Chen et al. 2008) (Fig. 2). RecA 
polymerizes in a 5' to 3' direction, meaning 
RecA monomers predominately associate on the 3' 
end of the filament and dissociate from the 5' end. 
Polymerization is regulated at the step of nucleation 
and requires 5-6 monomers of RecA to form a stable 
protofilament. Nucleation is several orders of magni¬ 
tude slower than the subsequent growth step that takes 
place at an addition rate between 2 and 20 protein 
monomer per second as measured by ► single¬ 
molecule fluorescence resonance energy transfer (Joo 
et al. 2006). Dissociation of RecA monomers from the 
filament is linked to ATP hydrolysis. However, the rate 
of dissociation (^120 min”') is several times faster 
than the rate of ATP hydrolysis (20-30 min”'). 
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DNA Repair and Recombination, Fig. 2 Model of the struc¬ 
ture of the RecA filament bound to double-stranded DNA (PDB 
ID: 3CMX). Fifteen RecA monomers (alternating red and 
magenta) are shown bound to double-stranded DNA (green) in 
the extended conformation. The model was generated by placing 
the three models of pentameric RecA from end to end and rotated 
for a contiguous filament 


RecA’s eukaryotic counterpart RadSl shares 30% 
identity and has the same structural fold as RecA. 
While the ATPase activity and dissociation rates 
of RadSl are over 100-fold slower than that of 
RecA, ► electron microscopy showed that the 
RadSl nucleoprotein filament is nearly identical to 
RecA and forces DNA to adopt the same extended 
conformation. 


Translocation of Helicase Family Proteins 
on DNA 

Following DNA strand exchange, recombination inter¬ 
mediates are processed by numerous helicase family 
proteins including RadS4 and its meiotic homolog 
Tidl. RadS4 is a robust ► ATPase (Petukhova et al. 
1998) that translocates along double-strand DNA in an 
ATP hydrolysis-dependent manner. The translocation 
activity of RadS4 and Tidl proteins fused to a ► glu¬ 
tathione S transferase affinity tag were studied using 
fluorescently labeled anti-glutathione S transferase 
antibodies. Rad54 was shown to be a rapid (^300 
base pairs/s) and processive (^11,000 base pairs/ 
binding event) translocase (Amitani et al. 2006). In 
comparison, Tidl is a slower translocase (^80 base 
pairs/s) but possesses equivalent processivity 
(^10,000 base pairs/binding event) (Nimonkar et al. 
2007). With this translocase activity, Rad54 performs 
numerous functions during homologous recombina¬ 
tion including the branch migration of Holliday 
junctions (Bugreev et al. 2006), stimulation of the 


DNA strand exchange activity of recombinases, 
dissociation recombination intermediates, chromatin 
remodeling, and disassembly of protein-dsDNA 
complexes (Mazin et al. 2010). The branch migration 
activity of Rad54 is stimulated by Rad51 recombinase 
(Rossi and Mazin 2008). 


Cross-References 

► ATPase: Overview 

► DNA Polymerase 

► Electron Microscopy 

► Fluorescence Labeling of Nucleic Acids 

► Glutathione S-Transferase - Computational Studies 

► Helicases 

► Microfluidics for Single Molecule Detection 

► Protein Fluorescent Dye Labeling 

► Single-Molecule Methods 
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DNA-Ligand Circular Dichroism 

Alison Rodger 
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Synonyms 

DNA; Induced circular dichroism; Ligand-ligand 
interactions 

Definition 

Deoxyribonucleic acid, DNA, is composed of nucleo¬ 
tides which are planar aromatic bases linked to a sugar 
and a phosphate. Ligands that bind to DNA are usually 
cationic with their binding further stabilized by 
adopting a binding site that ensures complementarity 
of DNA and ligand shapes. For example, planar aro¬ 
matic molecules often intercalate between base pairs, 
and bulky proteins adopt the major groove. As long as 
the binding mode has an element of specihcity about it, 
then one expects to see a change in the CD signal of the 
component DNA and ligand signals. This is particu¬ 
larly obvious when the ligand is achiral so has no 
intrinsic CD before binding to the DNA. The CD 
signals induced upon ligand binding to DNA (or any 
other chiral molecule) are characteristic of the interac¬ 
tion and may be interpreted to give data such as bind¬ 
ing constants. 

Basic Characteristics 
DNA-Ligand Interactions 

The CD induced upon a ligand binding to DNA may be 
used on a number of levels. The simplest application is 
to note that the induced CD, ICD, is nonzero so the 
molecule must be binding to the DNA. However, more 
information can usually be extracted as discussed in 
Garbett et al. (2007). Some illustrative examples are 
given below. It should be noted that if the ligand 
transitions of interest occur in the DNA region of 
the spectrum we usually take the ICD to be the total 
CD of the system minus that of the unperturbed DNA 
at the same concentration. This means any DNA 


conformational changes that alter the DNA CD are 
included in the ICD. 

It is often useful to measure a series of spectra 
where some variable such as the ionic strength, or 
mixing ratio (drug:DNA ratio), or temperature is 
changed. If the ICD intensity of the ligand above 
300 nm (i.e., where there is no DNA CD) changes but 
the shape of the spectrum remains the same during 
such an experiment, then it may be concluded that the 
ligand is binding in the same mode (or in a number of 
sites whose relative proportions are independent of 
DNAidrug ratio) over the whole concentration range, 
though the amount of ligand actually bound may have 
changed. In such a case, we may write 

Lt, = a X ICD (1) 

where L/, is the concentration of bound ligand, and a is 
a proportionality constant. For very high DNA concen¬ 
trations, the concentration of ligand that is bound to the 
DNA approximately equals the total ligand concentra¬ 
tion. It is seldom possible to determine what the CD is 
for the case of excess ligand since DNA-binding 
ligands typically adopt a second binding site (with 
a different ICD) when the first one is occupied. 
A typical titration series of a DNA-binding ligand is 
shown in Fig. 1. Figure la shows the low-loading data 
where only a single binding site (intercalation in this 
case) is occupied with a binding constant of (2.2 ±1.1) 
X 10^ M~ * in a mode that spans six bases (Rodger et al. 
1994). The data in Fig. lb show what happens at higher 
loading where another site is occupied. 

If an experiment is conducted at constant ligand 
concentration and varying DNA concentration, then, 
unless the ligand-binding mode is changing, there 
should be no change in the observed ligand ICD out¬ 
side the DNA absorbance region, that is, above 
300 nm. A particularly dramatic change is observed, 
however, if as the DNA concentration is decreased 
(so one would expect fewer ligands to bind and so 
a decrease in ICD), the ligands begin to stack together 
and there are significant ligand-ligand interactions 
occurring on the DNA support. Under these circum¬ 
stances, we generally observe a large excitonic CD as 
illustrated in Fig. 2. 

The Origin of Ligand ICD 

The CD induced into ligand transitions upon bind¬ 
ing to DNA almost always arises as a result of 
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DNA-Ligand Circular Dichroism, Fig. 1 Induced CD (1 cm 
path length) of DNA with an intercalated anthracene chromo- 
phore that is coupled to a cationic spermine tail. ICD signals 
are the total CD minus the free DNA CD. The smallest 
magnitude ICD signal corresponds to the lowest ligand concen¬ 
tration. (a) Poly[d(G-C )]2 (160 pM base). Ligand:DNA ratios 
are 0.008:1; 0.015:1; 0.022:1; 0.029:1; 0.037:1; 0.045:1; 


0.056:1; 0.067:1; 0.083:1; 0.091:1; 0.11:1; 0.12:1; 0.14:1; 
0.15:1; 0.16:1; and 0.17:1. The middle graph shows the CD at 
262 nm as a function of ligand:DNA ratio (Rodger et al.l994). 
(b) Poly[d(A-T )]2 DNA (130 pM base). Ligand:DNA ratios are 
0.05:1; 0.13:1; 0.19:1; 0.23:1; 0.30:1; 0.40:1; and 0.56:1 
(Rodger et al. 1995) 


non-degenerate exciton coupling between the electric 
dipole transition moment of the ligand and those of the 
DNA bases. Based on coupled-oscillator theory 
(Schipper and Rodger 1983; Lyng et al. 1992a, b; 
Norden et al. 2010) relations have been derived for 
the CD of DNA-bound ligands (Hiort et al. 1990) 
which have enabled structural characterization of 
both intercalation as well as groove-bound com¬ 
plexes. When combined with LD data, which provide 
orientation angles of the ligands relative to the 
DNA helix axis, the CD adds an extra dimension by 
probing the “azimuthal” angles, that is, directions 
referring to projections on the plane perpendicular to 
the helix axis. 

The ICD of intercalated chromophores is generally 
rather weak, with a sign that depends on how the 
transition moment is directed within the intercalation 
pocket. A simple expression applies for the case of an 
electric-dipole allowed transition in an intercalated 
chromophore (Schipper et al. 1980; Schipper and 
Rodger 1983). A positive ICD is expected for transi¬ 
tions polarized poking out into the groove and negative 
for those polarized along the longest dimension of the 
intercalation pocket (Norden and Tjemeld 1982; 
Palumbo et al. 1984; Norden et al. 1986; Ismail et al. 
1998). The 275-nm phenanthroline long axis ICD of 
a platinum metal complex binding to DNA given in 



300 350 400 

Wavelength (nm) 


DNA-Ligand Circular Dichroism, Fig. 2 CD of constant con¬ 
centration of 9-hydroxyellipticene (50 pM) in the presenee of 
calf thymus DNA (concentrations indicated in figure, 5 mM 
NaCl, 1 mM phosphate buffer). Note ICD signals increase with 
decreasing DNA concentration (Ismail et al. 1998) 

Fig. 3 illustrates this. For non-intercalated, groove- 
bound ligands a strong positive ICD for transitions 
polarized parallel to the grooves is predicted (Kubista 
et al. 1988). 
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DNA-Ligand Flow Linear Dichroism 


DNA-Ligand Circular 
Dichroism, Fig. 3 Molecular 
structure of [Pt(R,S-dach) 
(4-Me-phen)]^’^, denoted 
RR4M and its ICD (0 pM to 
125 pM) with calf thymus 
DNA (150 pM) after 
subtraction of DNA CD 
(McDonnell et al. 2008) 





Wavelen 9 m (nm) 


Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Circular Dichroism Spectroscopy: Units 

► DNA-Ligand Flow Linear Dichroism 

► Far UV Protein Circular Dichroism 

► Near UV Protein CD 

► Oriented Circular Dichroism Spectroscopy 

► Protein Circular Dichroism Analysis 


References 

Garbett NC, Ragazzon PA, Chaires JB. Circular dichroism to 
determine binding mode and affinity of ligand-DNA interac¬ 
tions. Nat. Protoc.. 2007;2:3166-72. 

Hiort C, Norden B, Rodger A. Enantioselective DNA binding of 
[Ru(l,10- phenanthroline)3]2-t studied with linear dichro¬ 
ism. J. Am. Chem. Soc.. 1990;112:1971-82. 

Ismail MA, Sanders KJ, Fennel GC, Latham HC, Wormell P, 
Rodger A. Spectroscopic studies of 9-hydroxyellipticine 
binding to DNA. Biopolymers. 1998;46:127^3. 

Kubista M, Akerman B, Norden B. Induced circular dichroism in 
nonintercalative DNA-drug complexes. Sector rules for 
structural applications. J. Phys. Chem.. 1988;92:2352-6. 

Lyng R, Rodger A, Norden B. The circular dichroism of drug- 
DNA systems. 1. Poly(dG-dC )2 B-DNA. Biopolymers. 
1992a;31:1709-19. 

Lyng R, Rodger A, Norden B. The circular dichroism of drug- 
DNA systems. 2. Poly(dA-dT )2 B-DNA. Biopolymers. 
1992b;32:1201-14. 

McDonnell U, Hicks MR, Hannon Ml, Rodger A. DNA binding 
and bending by dinuclear complexes comprising ruthenium 
polypyridyl centres linked by a bis(pyridylimine) ligand. 
1. Inorg. Biochem.. 2008;102:2052-9. 

Norden B, Tjemeld F. Structure of methylene blue DNA com¬ 
plexes studied by linear and circular dichroism spectroscopy. 
Biopolymers. 1982;21:1713-34. 


Norden B, Wirth M, Ygge B, Buchardt O, Nielsen P. Interac¬ 
tions between DNA and psoraleneamines studied with 
dichroism techniques. Photochem. PhotobioL. 1986;44: 
587-94. 

Norden B, Rodger A, Daffom TR. Linear dichroism and 
circular dichroism: a textbook on polarized spectroscopy. 
Cambridge, UK: Royal Society of Chemistry; 2010. 

Palumbo M, Capasso L, Palii G, Marciani Magno S. DNA- 
binding of water soluble furocoumarins: a thermodynamic 
and conformational approach to understanding different bio¬ 
logical effects. Nucleic. Acids Res.. 1984;12:8567-78. 

Rodger A, Blagbrough IS, Adlam GA, Carpenter ML. DNA 
binding of a spermine derivative: spectroscopic study of 
anthracene-9-carbonyl-N 1 -spermine with poly(dG-dC )2 and 
poly(dA-dT) 2 . Biopolymers. 1994;34:1583-93. 

Rodger A, Taylor S, Adlam G, Blagbrough IS, Haworth IS. 
Multiple DNA binding modes of anthracene-9-carbonyl- 
Nl-spermine. Bioorg. Med. Chem.. 1995;3:861-72. 

Schipper PE, Rodger A. Symmetry rules for the determination of 
the intercalation geometry of host/guest systems using circu¬ 
lar dichroism: a symmetry adapted coupled-oscillator model. 
J. Am. Chem. Soc.. 1983;105:4541-50. 

Schipper PE, Norden B, Tjemeld F. Determination of binding 
geometry of DNA-adduct systems through induced circular 
dichroism. Chem. Phys. Lett.. 1980;70:17-2L 


DNA-Ligand Flow Linear Dichroism 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

DNA; Induced linear dichroism - ligand-ligand 
interactions 














DNA-Ligand Flow Linear Dichroism 


487 


D 



DNA-Ligand Flow Linear Dichroism, Fig. 1 DNA-binding ligands: (a) 9-hydroxyellipticine and (b) netropsin and their geom¬ 
etries upon DNA binding (Norden et al. 2010) 


Definition 

If we know how the transition moments are oriented 
within a molecular coordinate framework, we can, 
from an observed linear dichroism (LD) spectrum, 
deduce something about how the molecules are aligned 
within the sample. Since small molecules cannot be 
flow-oriented, a simple and sometimes very useful 
application of LD is for detecting interactions between 
a small molecule and a biological macromolecule such 
as DNA that can be flow-oriented. Complete absence 
of a flow LD at the ligand transition wavelengths is 
generally a safe indication that the dye does not inter¬ 
act with DNA, while the appearance of a ligand LD 
signal indicates some interaction. The sign and relative 
amplitude of the measured LD further tells us some¬ 
thing about the binding geometry in terms of angles 
between the absorbing dye transition moments and the 
DNA helix axis (Rodger and Norden 1997). 

Basic Characteristics 

Although LD cannot tell us where a ligand binds to 
DNA, it can be used to determine the orientation of the 
ligand (if the ligand’s transition moments have been 
assigned) relative to the DNA helix axis. This often 
provides significant clues as to the binding mode. 
A ligand may bind to DNA in a number of different 
types of site, as well as with sequence and orientation 



Wavelength (nm) 

DNA-Ligand Flow Linear Dichroism, Fig. 2 Absorbance and 
LD'^ spectra of ethidium bromide in 5 mM phosphate buffer, 
pH = 6.9, bound to calf thymus DNA (200 pM), DNA base to 
ligand ratio is 20:1 


preferences. Broadly speaking, a ligand may bind in 
the different ways outlined below. 

• The ligand is externally bound to the phosphate 
backbone; this is usually orientationally fairly 
nonspecific and the resulting ligand LD is corre¬ 
spondingly small or nonexistent. 

• The ligand is intercalated between DNA bases: 
This mode requires planar aromatic molecules 
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DNA-Ligand Flow Linear Dichroism, Fig. 3 Ru 

(tpyanth) 2 Cl 2 , with z indicating the approximate position of the 
metal complex long axis; a second anthrylterpyridine ligand is 
oriented perpendicular to the one illustrated making an 


approximately octahedral coordination geometry and the 
resulting LD spectra from a titration series of [Ru(tpyanth) 2 ]^* 
with ct-DNA (100 pM). Metal complex concentrations are indi¬ 
cated in the figure (Patel et al. 2002) 


(or parts of molecules) which insert between DNA 
base pairs causing the DNA to unwind and open up 
a slot between adjacent base pairs so the ligand may 
be sandwiched between them as illustrated in Fig. 1. 
Molecules that intercalate between DNA bases push 
the base pairs apart by ^3.4 A and unwind the helix 
by up to 36°. Thus, intercalators both lengthen and 
stiffen DNA. The ligand LD of such a system 
belongs to in-plane polarized n ^ n* transitions 
which are polarized parallel to the DNA bases when 
a planar aromatic molecule is intercalated, thus giv¬ 
ing a negative LD. The intercalator’s reduced LD 
(=LD/Absorbance) often has slightly larger magni¬ 
tude that that of the average DNA base, since the 
DNA around the intercalator is somewhat stiffer than 
in other parts. This is illustrated by the reduced LD of 
the intercalator ethidium bromide in Fig. 2. 

• The ligand is in the minor groove: This mode is 
frequently adopted by aromatic molecules 
containing internuclear bonds with some rotational 
freedom, for example, long crescent-shaped mole¬ 
cules such as netropsin (Fig. 1). The long axis of 
such molecules is oriented at approximately 45° to 
the helix axis making the LD of long axis polarized 
transitions positive. The short-axis polarized transi¬ 
tions of minor groove binding molecules are 


perpendicular to the helix axis and so approxi¬ 
mately parallel to the base transitions, resulting in 
negative LD signals. Minor groove binding is 
favored by AT-rich DNAs. 

• The ligand is in the major groove: This type of 
binding is found for several regulatory proteins 
and there is enough space to accommodate most 
smaller ligands including fairly bulky metal 
complexes. 

Figure 3 shows the LD of a metal complex that 

binds to DNA with both groove binding and interca¬ 
lating chromophores. The result is a complicated spec¬ 
trum of positive and negative signals. 

Cross-References 

► Absorption Spectroscopy and Transition 
Polarizations: Pictorial Description 
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► Linear Dichroism 
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Dichroism 
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Synonyms 

Protein Structure 


Introduction 

The first protein structures were solved in the early 
1970s, and as more were deposited in the Protein 
Databank (PDB) it became clear that many homolo¬ 
gous proteins have structures that are conserved 
throughout evolution, providing a powerful signal to 
trace evolutionary ancestry and to recognize families 
of proteins. For example, while the amino acid 
sequences of hemoglobins from human and sperm 
whale share less than 20% identical residues, their 
tertiary structures are highly similar. In their 
pioneering studies of evolution in protein structure 
families, Chothia and Lesk demonstrated the relation¬ 
ship between sequence and structure and showed that, 
especially in the core of the protein domain, the struc¬ 
ture is much more highly conserved than sequence and 


can be used to recognize even very distant evolution¬ 
ary relationships (Chothia and Lesk 1986). 

A number of structure-based classifications of pro¬ 
teins emerged to capture this structural information 
and to identify evolutionary families and fold similar¬ 
ities (SCOP, CATH, 3DEE, DDD, COMPASS) all 
focusing on the domain as an important unit in evolu¬ 
tion. Studies of completed genomes have endorsed the 
choice of the domain as the primary unit as they 
revealed the extent to which domains combine with 
other partner domains to give different proteins. This 
means that while there appear to be relatively few 
domain superfamilies (<3,000 in SCOP and CATH), 
more than a million different multi-domain combina¬ 
tions have been predicted in genome sequences. 
Sequence-based classifications do exist (PANTHER, 
Phylofacts), that consider the whole protein hut all 
structural classifications focus mainly on the domain 
level. These domain-hased classifications provide an 
opportunity to understand how these primary units 
evolve and diverge to adopt diverse functions and to 
operate in multiple contexts. 

The most comprehensive domain structure classifi¬ 
cations currently available (SCOP (Murzin et al. 
1995), CATH (Orengo et al. 1997)) were both 
established in the early 1990s. While their philoso¬ 
phies differed somewhat in the manner in which rela¬ 
tionships were recognized, both used a hierarchical 
classification of structures, described in more detail 
below. The development of sensitive structure com¬ 
parison algorithms that could handle the insertion and 
deletions of residues, occurring during evolution, and 
the shifts in secondary structures that accompanied 
mutations in residues, allowed structural similarities 
to be detected even between very remote relatives 
(e.g., in the midnight zone, <10% sequence identity). 

Finding Structural Similarities Between 
Domain Structures 

While domain structure is generally well conserved 
between homologues, this is usually in the core of the 
domain and there can he substantial changes outside this 
core. Both CATH and SCOP have exploited sensitive 
structure comparison algorithms, which use sophisti¬ 
cated optimization strategies for handling the residue 
insertion/deletions (indels) (e.g., the earliest methods 
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include DALI, SSAP, COMPARER, STAMP, see 
► Protein Structure Comparison Methods). The SSAP 
method developed by Taylor and Orengo in 1989 (Tay¬ 
lor and Orengo 1989) was used to recognize the struc¬ 
tural similarities and homologous relationships reported 
in CATH. SCOP was initially established using a more 
manual approach to recognize homologues, though dis¬ 
tant relationships were often checked using the DALI 
algorithm, developed in 1993 by Holm and Sander 
(Holm and Sander 1993). 

Various strategies are used by these algorithms to 
handle the indels. Several (COMPARER, SSAP, 
STRUCTAL) exploit dynamic programming approaches 
and use a range of features to score the similarity between 
proteins, for example, distances between superposed res¬ 
idues in the domains being compared or similarities in 
internal vectors. DALI generates contact maps for 
hexapeptide fragments in the domains and uses simu¬ 
lated annealing to determine the optimal concatenation 
of equivalent fragments between domains. 

Evolutionary Classifications Versus 
Structural Neighborhoods 

Classihcation resources like SCOP and CATH clearly 
distinguish between evolutionary relationships (i.e., 
homologous domains) and structural similarities, for 
which there is no additional evidence supporting an 
evolutionary relationship (i.e., analogous domains). In 
this latter case, structural similarity could simply be the 
result of physical constraints on folding and packing of 
secondary structures in 3D, that is, a result of conver¬ 
gent evolution rather than divergent evolution. 

Some resources (e.g., CDD, PDB, CATH) generate 
information on structural neighborhoods for protein 
domains, that is, they simply list domains having struc¬ 
tural similarity to a given query domain, without spec¬ 
ifying whether this is a homologous or analogous 
domain. Various algorithms are used to reveal these 
structural similarities and generate information on 
structural “neighborhoods” for a given domain (e.g., 
VAST, CE, SSM, FATCAT, CATHEDRAL). 

Structural Classifications 

Both SCOP and CATH employ a hierarchical classih¬ 
cation of domain structures (see Fig. 1). At the top of 


the hierarchy is the protein class, that is, the amino acid 
composition of the domain whether it is primarily 
a-helix, p-sheet, or a mixture of both a—p. There is 
also a class of few secondary structures, and informa¬ 
tion on structural similarity between the multi-domain 
protein families is included. In CATH, structures are 
grouped within each protein class according to the 
architecture which describes the orientation of second¬ 
ary structures in 3D but does not take account of con¬ 
nectivity between the secondary structures. Figure 2 
shows the most highly populated architectures currently 
recognized in CATH and accounting for nearly 50% of 
all known or predicted domain structures. 

For both SCOP and CATH, the fold group or topol¬ 
ogy level considers both orientation of secondary 
structures and the connectivity between them. Within 
each fold group, structures are further classihed into 
evolutionary superfamilies where there is sufficient 
evidence, beyond structural similarity, to conhrm this 
relationship. The term “superfamily” is used to convey 
a broad set of relatives, including even the most distant 
relatives. Closely related homologues are then further 
classified into families in SCOP. These often have 
functional similarities. 

In CATH, close relatives are further grouped based 
on their sequence similarity. Sequence families are 
identified at several levels (i.e., 35%, 60%, 95%, 
100% sequence identity) chosen because they imply 
increasing confidence in structural similarity (>35% 
seq. id.) or functional similarity (>60% seq. id.). 
Domain relatives are also subclassified into “func¬ 
tional families” identified using powerful sequence- 
based approaches described further below. 

For both SCOP and CATH, information on 
predicted domain relatives is now provided through 
sister resources (SUPERFAMILY for SCOP and 
Gene3D for CATH). Predictions are made by scanning 
sequences against HMMs for each structural superfam¬ 
ily, seeded by structural relatives from each SCOP/ 
CATH superfamily. The sequence data expands these 
resources more than 15-fold from ^110,000 domain 
entries to nearly 16 million domain entries. 

Structural Hierarchy Versus Structural 
Continuum 

Protein structures are highly repetitive in their compo¬ 
sition, that is, comprising recurrent supersecondary 
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Domain Structure 
Classifications, Fig. 1 The 

CATH hierarchy 
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motifs such as the ubiquitous pap motif in aPa- 
sandwich structures. Resources providing structural 
searches (e.g., CDD, PDB, CATH) are useful for illu¬ 
minating the structural links between domains caused 
by these common motifs and provide an alternative 
view to hierarchical classihcations like SCOP and 


CATH. They shed light on the structural continuum 
that links some areas of “fold space” whereby domains 
in a SCOP or CATH superfamily have structural sim¬ 
ilarity with domains in other fold groups (see below 
and Reeves et al. (2006), Cuff et al. (2009) for 
a discussion of this effect). 
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Domain Structure 
Classifications, Fig. 2 Most 
highly populated architectures 
in CATH. The percentages 
correspond to the proportion 
of structures predicted to share 
this architecture in the 
genomes 



3-layer (apa) sandwich 
(23.61%) 



up-down bundle (8.21%) 



/(-sandwich (5.92%) 


Methods Used to Generate the 
Classifications 

The PDB has grown exponentially since the early 
1970s and there are now more than 79,000 protein 
structures. SCOP and CATH currently classify more 
than 110,000 domain structures. Both SCOP (http:// 
scop.mrc-lmb.cam.ac.uk/scop/) and CATH (http:// 
www.cathdb.info/) provide information on their clas¬ 
sifications via well-established web sites. 


Domain Identification 

The first step in the classification is to identify all the 
constituent domains in a newly solved multi-domain 
structure. While most multi-domain proteins comprise 
only two domains making visual assignment of domain 
boundaries easy to perform, especially where domains 
are clearly separated by identifiable linker regions, 
domain boundary assignment in larger and more com¬ 
plex multi-domain structures can be very challenging. 
Many approaches have been developed for automatic 
assignment of domain boundaries. These exploit dif¬ 
ferent concepts ranging from the detection of distinct 


hydrophobic clusters corresponding to compact 
domain regions, through to methods which seek to 
cut the chain in places where there are fewer inter¬ 
residue contacts and including sophisticated 
approaches that analyze the spring constants between 
different parts of the protein chain. General rules are 
usually applied, that is, domain boundaries rarely cut 
secondary structures and the cores of domains are 
highly hydrophobic with high residue contact density. 
Bourne and co-workers at the PDB compared several 
approaches and also developed their own consensus 
approach which performs well (Alden et al. 2010). 

Automatic domain identification in CATH is also 
performed using a consensus approach which com¬ 
bines the assignment from three algorithms - the 
PUU method of Holm and Sander (Holm and Sander 
1994) which approximates inter-domain dynamics 
using a harmonic model, the Detective algorithm of 
Swindells and Thornton (Swindells 1995) which iden¬ 
tifies clusters of hydrophobic residues which could 
form the core of a structural domain, and the 
DOMAK algorithm developed in the Barton group 
(Siddiqui and Barton 1995) which seeks to split the 
polypeptide chain at positions that maximize the ratio 
of internal contacts within domains to external 
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contacts between them. These methods are referred to 
as ah initio as they rely on biophysical concepts of 
domains. 

The notion of domain recurrence can also be 
exploited to identify constituent domains in a multi- 
domain protein. Since many relatives from domain 
families recur in different multi-domain contexts, 
algorithms have been developed which seek matches 
between domain regions in a new query protein struc¬ 
ture and domains already classified in SCOP or 
CATH. This can be done using purely structural infor¬ 
mation. For example, the CATHEDRAL algorithm 
(Redfern et al. 2007) looks for structure matches to 
classified CATH domains. However, since there is 
tenfold or more sequence data for each domain family 
it is often also possible to find a close relative by 
scanning the predicted sequence relatives in SCOP 
or CATH. 

On average, less than 2% of newly determined 
domain structures are found to have a novel fold, 
and therefore, the majority of new PDB entries will 
have a structural match to a domain already classified 
in SCOP or CATH. Structure-based searches are fre¬ 
quently complemented by sequence-based searches 
because in many highly populated superfamilies, 
<10% of the sequence relatives have close structural 
representatives (i.e., with more than 35% sequence 
identity), and in these cases, the sequence data allows 
more comprehensive sampling of the superfamily. 

Comparison of different domain boundary recogni¬ 
tion algorithms by Bourne and co-workers (Holland 
et al. 2006) showed that for novel folds, ab initio 
methods agreed only 53% of the time. Where a close 
homologue has already been classified in SCOP or 
CATH, better results are usually achieved by methods 
exploiting domain recurrence. While approaches can 
be combined to increase conhdence in an assignment, 
domain boundary data is highly manually curated in 
both the SCOP and CATH resources. Errors at this 
stage can propagate in a serious manner when stmc- 
tural relatives are used to seed HMM prohles for 
predicting sequence relatives, hence the need for 
caution. 


Fold Identification 

Once constituent domains have been identified they are 
searched against representative domains from SCOP 


or CATH using a sensitive structure comparison 
method. As mentioned already, for classihcation in 
SCOP, putative fold similarities and homologous 
domains are usually identihed or validated using 
DALI. CATH employs the CATHEDRAL algorithm, 
a fast version of the original SSAP method. CATHE¬ 
DRAL first performs a rapid scan using a graph the¬ 
ory-based approach which compares secondary 
structure features between two domains (orientations 
and distances between secondary structures). Putative 
fold matches are then recompared using the much 
slower and more accurate double dynamic program¬ 
ming algorithm employed by SSAP. 

Equivalent residues between related domains are 
superposed using PROEIT (McLachlan 1982) and the 
structural similarity between the domains returned as 
the Root Mean Square Deviation (RMSD): 


RMSD = 




where <5 is the distance between N pairs of equivalent 
atoms in the superimposed structures. This value 
should clearly be low to identify related structures 
and it is also important to consider the proportion of 
residues superimposed to determine whether the simi¬ 
larity is local or global. Eor example, a normalized 
RMSD (nRMSD) can be calculated by multiplying 
the RMSD by the number of residues in the larger 
domain, and then dividing by the number of super¬ 
posed residues. 

Related structures usually return an RMSD < 4A 
over at least 60% of the residues, depending on the 
size of the domain. Eor large domains (>300 residues) 
these criteria may need to be relaxed. Several 
groups have regenerated the now iconic plot of 
Chothia and Lesk, revisiting the relationship between 
sequence similarity and structural similarity for 
a more comprehensive set of domain superfamilies 
and confirming that even very remote homologues 
(<20 sequence identity) can exhibit signihcant struc¬ 
tural similarity (usually 40% or more of the residues 
are in very similar structural environments) (see 
the ► Proteins: Relationship Among Divergence of 
Sequence, Structure, and Function). These highly 
conserved residues are usually in the core of the 
domain structure. 
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Homologue Recognition 

While structural similarity is used to determine 
whether two domains have the same fold, this infor¬ 
mation is not sufficient to determine whether two 
domains are homologues as the similarity could simply 
have arisen through physical constraints on the 
domains and reflect the fact that there are limited 
folding arrangements in nature. Therefore, additional 
evidence is sought to confirm homology. This can he 
based on sequence analysis, that is, the detection of 
rare sequence motifs which may comprise highly con¬ 
served binding or catalytic residues essential for func¬ 
tion, interface residues, or residues associated with 
folding or stability of the protein, for example, con¬ 
served cysteines. 

Curators of both SCOP and CATH use sensitive 
sequence profile methods and HMM-based approaches 
to validate putative homologues, for example, HMMer 
(Finn et al. 2011) or the HHblits algorithm (Remmert 
et al. 2011) which compares HMMs seeded on the 
domains being compared, where several methods 
agree the confidence in a remote homology is 
increased. Interestingly, as these sequence-based 
approaches became more powerful they revealed inter¬ 
esting examples where extreme structural divergence 
between homologous domains had occurred, usually 
outside the structural core (see Figs. 4, 5). 

In fact, Grishin (Krishna and Grishin 2005) has 
documented many fascinating examples of homolo¬ 
gous domains which have diverged structurally to 
such an extent that they could no longer be considered 
to adopt the same fold. These extremely divergent 
relatives are sometimes missed by the structure com¬ 
parison algorithms. 

Homologous domains are usually confirmed, there¬ 
fore, if two out of three possible lines of evidence are 
satisfied, that is, significant structural, sequence, or 
functional similarity. Functional similarity may reflect 
conserved functional motifs or information from the 
literature or GO, for example, on common reaction 
intermediates or general molecular function. Machine 
learning approaches have been used to combine infor¬ 
mation on structural searches and sequence searches, 
and these tend to recognize a higher proportion of 
relatives (95% or more for most superfamilies), but 
both SCOP and CATH classifications employ inten¬ 
sive manual curation to validate distant relationships. 


Subclassifying Functional Families Within 
Superfamiiies 

In SCOP, classification of more closely related homo¬ 
logues into families - likely to have very similar struc¬ 
tures and functions - is performed by detailed manual 
inspection of the structures and assessment of the lit¬ 
erature. In CATH, further sequence-based approaches 
are used based on profile-profile or HMM-HMM 
comparisons. These clusters predicted sequences in 
a superfamily if they share similarities in their 
HMMs and possess similar experimental function 
annotations in GO. In CATH version 3.5, 2,600 super- 
families were subclassified into ^47,000 functional 
families using this iterated clustering approach (Lee 
et al. 2010). 

Automatic identification of functional homologues 
is still very challenging and thresholds must be set very 
conservatively. Recent assessment of function predic¬ 
tion algorithms in an international experiment (CAFA) 
showed that function prediction remains a consider¬ 
able challenge, and therefore, it is best to set these 
thresholds for high specificity at the cost of sensitivity. 

It is also possible to use structural information to 
identify functionally related homologues. Because the 
common structural core of a domain superfamily is so 
well conserved this region can be easily recognized, 
allowing the subsequent identification of structurally 
conserved motifs or structural decorations to this core 
that are uniquely conserved in a specific functional 
group. CATH uses the FLORA algorithm (Redfem 
et al. 2009) to build 3D-templates of such conserved 
functional motifs in order to recognize new relatives 
belonging to the same functional group. While these 
approaches have been shown to be quite powerful, at 
least three highly non-redundant relatives (having less 
than 35% sequence identity) are needed to identify 
functionally specific motifs, and there is currently 
only sufficient structural data in the PDB to build 
templates for less than half of the enzyme superfam¬ 
iiies in CATH. 

Multiple Structure Alignments 

Both SCOP and CATH provide information on align¬ 
ments of domain relatives. For CATH multiple 
alignments are generated for all domain superfamiiies 
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using a modified version of the SSAP algorithm 
(CORA). In SCOP, selected domain alignments are 
displayed in the Sisyphus web resource. These 
are alignments of homologous domains or structures 
sharing significant local or global features that 
are very difficult to align and, therefore, requiring 
considerable manual intervention, but which reveal 
very interesting and/or rare structural features 
either conserved throughout evolution or occurring 
between domains having distinct folds. Another very 
valuable resource, HOMSTRAD, provides curated 
multiple structure alignments for SCOP domain 
superfamilies. 

Recognition of Sequence Homologues for 
SCOP and CATH Superfamilies in Completed 
Genomes 

The international genome initiatives have provided the 
sequences of more than 2,000 completed genomes 
from organisms across all kingdoms of life. One of 
the criticisms of early studies exploiting CATH and 
SCOP to determine the population of different folds in 
nature was the objection that these resources were very 
biased and captured only those structures amenable to 
structure determination. The development of highly 
sensitive sequence-based methods for homologue 
recognition (e.g., HHblits) and even more sensi¬ 
tive threading-based algorithms (e.g., FFAS, 
GenTHREADER, PHYRE) meant that it was possible 
to answer the question of whether most structural 
superfamilies existing in nature had been sampled in 
the PDB. 

Currently, on average, 50-60% of domains 
(up to 70% or more if the most sensitive threading 
algorithms are used) in completed genomes can 
be assigned to a SCOP or CATH superfamily 
suggesting that these classihcations do capture 
a significant majority of all folds in nature. The 
remaining structurally uncharacterized regions of 
domain sequence are predicted to have high pro¬ 
portions of membrane-associated regions and/or dis¬ 
ordered regions. It is clear that the PDB is still 
biased in its underrepresentation of transmembrane 
structures. However, these are not expected to show 
as great a diversity of structural types as the globu¬ 
lar domains. 


Population of the Hierarchy 

Table 1 shows the number of entries at each level of the 
SCOP and CATH classifications. Analyses of different 
levels of the classification have revealed scale-free phe¬ 
nomena. Eor example, early studies of the population of 
folds groups showed that the majority of folds were 
small in population, while a few folds were very highly 
populated. Analyses of both SCOP and CATH revealed 
the existence of these highly populated, so-called 
superfolds which accounted for nearly one third of the 
domain structures. Current analyses of the classifica¬ 
tions show that this phenomenon still holds and that the 
ten most highly populated folds still account for 36% of 
the structures (see Fig. 3b showing illustrations of the 
five most highly populated folds in CATH version 3.5). 

Similar power law trends are also observed if the 
population of domain superfamilies is examined. 
Figure 3a shows that the majority of superfamilies 
are small, while the largest 100 superfamilies (<5% 
of the domain superfamilies) account for nearly 50% of 
all structures in CATH and 58% of all domain 
sequences in CATH if we include sequences whose 
structures have been predicted to be in CATH super¬ 
families using HMMer. 

What Information Do the Structural 
Classifications Provide on Evolution and the 
Structural Mechanisms by Which New 
Functions Evolve? 

Because structure tends to be much more highly con¬ 
served during evolution than sequence, the structural 
classifications SCOP and CATH have provided many 
insights into evolutionary trends and mechanisms. The 
biased population of domain superfamilies has already 
been discussed above, and several analyses have 
attempted to model these populations using various 
evolutionary frameworks. These have been based on 
different concepts such as the emergence of domains 
with favored folding arrangements; domains with spe¬ 
cific sfructural features, for example, suitable functional 
sites; and domains with greater structural plasticity and 
tolerance to residue mutations or the emergence of 
domains supporting key functions early in evolution. 

The considerable extent to which domain duplica¬ 
tion occurs during evolution has been validated by 
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Domain Structure Classifications, Fig. 3 Pie charts giving 
information on the population of the CATH database (a) dis¬ 
plays the number of non-redundant structures (<35% sequence 
identity) in the most highly populated superfamilies in CATH. 
Each segment corresponds to one of each of these superfam¬ 
ilies. It can clearly be seen that these 100 superfamilies 


correspond to almost 50% of non-redundant domain structures 
in CATH (b) displays the number of structural representatives 
in the 10 most highly populated folds in CATH. These folds 
contain almost 40% of the non-redundant domain structures in 
CATH. The five most highly populated folds are illustrated in 
Molscript 


Domain Structure 
Classifications, 

Fig. 4 Figure showing 
examples of structural 
embellishments in the UP 
superfamily. The regions 
colored pink highlight 
structurally conserved regions. 
The blue regions show 
structural variations some of 
which are mediating changes 
in function (Dessailly et al. 
2010) 



Nucleotidyl transferases, Pantoate Ligases, Aminoacyl tRNA synthetases, Electron 
transfer flavoproteins, N-type ATP-PPases, DNA photolyases. Universal stress 

proteins 


studies using predicted domains in SUPERFAMILY 
and Gene3D, the previously mentioned sister resources 
to SCOP and CATH. Various studies have also shown 
that domain insertions tend to occur at the N and 
C termini of existing single or multi-domain proteins 
and rarely within the polypeptide chain. Comparative 
genome analyses have identified <200 ancestral 
domain superfamilies present in all kingdoms of life 


and likely to have been present in the Last Universal 
Common Ancestor (LUCA). Compared to analyses 
based on purely sequenced data, which identified uni¬ 
versal superfamilies largely involved in protein bio¬ 
synthesis, the structure-based analyses revealed some 
additional superfamilies responsible for other impor¬ 
tant functions essential for any living organism, that is, 
metabolism and regulation. 
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Domain Structure 
Classifications, 

Fig. 5 Figure showing how 
structural embellishments can 
alter the binding sites (and, 
therefore, substrate access) for 
domains from the same 
superfamily. The schematic 
below the Molscript images is 
a 2Dsec plot (Reeves et al. 
2006) displaying the 
differences in secondary 
structure content of the two 
domains above; the circles 
representing beta-sheet 
structures and the triangles 
alpha-helical, the sizes of both 
corresponding to the size of 
the secondary structure 
element. Common helices and 
strands are in pink and yellow 
respectively, while inserted 
helices and strands are in 
purple and brown (Dessailly 
et al. 2010) 



Most domain superfamilies in SCOP and CATH 
contain only a few domains having very similar 
functions. By contrast <5% of superfamilies contain 
very large numbers of domains and these often 
have a high degree of structural and functional 
diversity. 

While changes in function in enzyme superfam¬ 
ilies can be brought about by mutations of catalytic 
residues in the active site (e.g., changing an enzyme to 
a nonenzyme), many analyses have revealed other, 
more structural, mechanisms modifying functions 
(e.g., Dessailly et al. 2010). Insertions of residues 
can give rise to additional secondary structure fea¬ 
tures outside the domain core. These structural embel¬ 
lishments can have a variety of different functional 
effects, for example, modifying the geometry 
of the active site or affecting protein surfaces 
and, therefore, protein-protein interactions. Structur¬ 
ally diverse superfamilies tend to be more func¬ 
tionally diverse, and in some superfamilies, relatives 
can diverge in size by threefold or more up to fivefold 
in some cases (Reeves et al. 2006; Cuff et al. 2009). 
Figures 4 and 5 show examples of structurally diverse 
relatives from the HUP domain superfamily, illustrat¬ 
ing the types of structural modifications that have 
been observed. 


Domain Structure Classifications, Table 1 Statistics from 
the CATH (v3.4) and SCOP (vl.75) classifications 


CATH 

hierarchy 

Number of 
CATH domain 
representatives 

Number of 
SCOP domain 
representatives 

SCOP 

hierarchy 

Class 

4 

7 

Class 

Architecture 

40 

- 

- 

Topology or 
fold 

1,282 

1,195 

Fold 

Homologous 

superfamily 

2,549 

1,962 

Superfamily 

Functional 
families (all 
predicted 
sequences) 

47,039 

3,902 

Family 

(structures 

only) 

Total number of 
domains 

152,920 

110,800 

Total 
number of 
domains 


Conclusion 

The advent of sensitive structure comparison algo¬ 
rithms in the late 1980s enabled the detection of very 
remote homologues and led to the development of 
several structural classifications of which SCOP and 
CATH are still widely used and remain the most 
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comprehensive. Structural classifications are valuable 
for understanding the evolutionary mechanisms by 
which new functional repertoires emerge. For exam¬ 
ple, for identifying changes in domain combinations in 
multi-domain proteins associated with changes in 
function and revealing structural changes in the 
domain that modify the features of the protein’s active 
site, binding sites, and interfaces. 


Cross-References 

► Flomology Modeling of Protein Structures 

► Protein Structure Comparison Methods 

► Proteins: Relationship Among Divergence of 
Sequence, Structure, and Function 
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Synonyms 

Protein Structure 


Introduction 

Protein multimers are favored during protein evolution 
since this endows them with structural and functional 
advantages, such as improved stability and control 
over the accessibility and specificity of active sites. 
Special cases of multimers are “3D domain-swapped” 
oligomers (Liu and Eisenberg 2002). A “domain- 
swapped” structure contains two or more polypeptide 
chains that exchange identical units. The exchanged 
portion may consist of a single secondary structure 
element such as an a-helix or P-strand, or an entire 
globular domain. 

The term “three-dimensional (3D) domain swap¬ 
ping,” or simply domain swapping, was originally 
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coined by Eisenberg and colleagues in 1994 for 
describing the X-ray structure of a diphtheria toxin 
(DT) dimer (Bennett et al. 1994). However, already 
in 1962 a report was published describing the exchange 
of an N-terminal fragment for bovine pancreatic ribo- 
nuclease A (RNase A) upon dimerization (Crestheld 
et al. 1962), foreshadowing its final structural 
verification. 


Terms and General Features of Domain 
Swapping 

True domain-swapped structures require that both 
monomeric and oligomeric states must be observed 
for the same protein (Liu and Eisenberg 2002). How¬ 
ever, this stringent designation has been relaxed in 
the literature and structures are called domain- 
swapped, even if no structure of the closed monomer 
has ever been observed or where only a homolog 
exhibits a closed monomer. Strictly speaking, in the 
first case the protein is a “candidate” for domain 
swapping, while in the second it is “quasi-domain- 
swapped.” 

In domain-swapped structures, no differences in phi 
and psi backbone angles exist between the monomer 
and oligomer, except for the so-called hinge-loop 
region that connects the exchanging domains. The 
term “domain” encompasses small secondary structure 
units or large, independently folded domains. The 
“closed” or primary interface in the oligomer possesses 
identical intramolecular counterparts in the monomer, 
while the “open” or secondary interface is only present 
in the oligomer. A schematic representation of differ¬ 
ent domain-swapping scenarios as well as delineation 
of the different structural interfaces is provided in 
Fig. 1. 

At the present time (Jan. 2010), more than 100 
domain-swapped structures are deposited in the PDB, 
19 of which have identical amino acid sequences for 
the monomer and oligomer, thus they represent exam¬ 
ples of true domain swapping. Proteins found in 
domain-swapped structures display the same diversity 
as any protein in the PDB and no special properties 
appear to be associated with their sequences, regarding 
chain lengths, structural classification, or amino acid 
composition. Although, the exchanging unit can be 
located anywhere in the sequence, it is often found at 
one of the two termini. 



P 


Domain Swapping, Fig. 1 Schematic representation of 
domain-swapped structures and their pertinent features. 
M, monomer; D, dimer; T, trimer; P, daisy chain-type multimer. 
Closed and open interfaces are enclosed in stippled purple and 
black boxes, respectively 

The Hinge-Loop 

The hinge-loop is the only part of the protein that 
adopts a different conformation in monomeric and 
domain-swapped structures and therefore has been 
the focus in the search for local signals that could 
cause or influence domain swapping. For a polypeptide 
stretch to function as a hinge-loop, it needs to be 
pliable enough to adopt different conformations. Alter¬ 
ing the length of the hinge-loop can switch the domain¬ 
swapping propensity of a protein and loop residue 
deletion and insertion has been used to manipulate 
swapping. Likewise, sequence alteration will influence 
the partitioning into monomers and dimers, with pro¬ 
lines and glycines often playing pivotal roles. 

However, no clear, predictive rules have emerged yet. 

In some cases, mutations in other parts of the pro¬ 
tein, not the hinge-loop, induce domain swapping. 
General destabilization of the monomer conformation 
that can be compensated for by dimerization may play 
a role in these cases. Thus, residue changes distant 
from the hinge region can shift the relative stabilities 
of monomer and domain-swapped dimer and thereby 
modulate domain-swapping properties. 

Stability and Folding of the Monomer 

Despite substantial efforts, no compelling proposal for 
a generally applicable and unified molecular mecha¬ 
nism of domain swapping has emerged to date (Liu and 
Eisenberg 2002; Rousseau et al. 2010; Newcomer 
2002; Gronenborn 2009). Eisenberg and colleagues 
suggested a scenario based on their studies on DT, in 
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which the “open monomer” conformation retains the 
native fold of other parts of the “closed monomer,” and 
only interactions at the closed interface are disrupted 
during unfolding of the monomer. Such partial 
unfolding scheme may be at play in multi-domain 
proteins in which separate, independently folding 
domains are exchanged. However, the existence of a 
stable “open monomer” is unlikely for most domain- 
swapped proteins in which only a few secondary struc¬ 
tural elements are exchanged. These isolated structural 
elements will be unstable and therefore complete un/ 
refolding is more likely to be at play in these cases. 

Computational Explorations 

A number of computational approaches for uncovering 
the basic events in domain swapping have been carried 
out and it has been suggested that the overall mono¬ 
meric topology, rather than local signals in the hinge 
region, determines where in the polypeptide chain 
domain swapping will occur (Yang et al. 2004). 
Although plausible, this unverified hypothesis appears 
to be at odds with some experimental results. Coarse¬ 
grained MD simulations implied that starting from 
monomeric conformations domain-swapped dimers 
can form spontaneously (Kundu and Jemigan 2004). 
A quite different proposal involves a progressive and 
reversible transformation between monomer and 
dimer (Malevanets et al. 2008), starting from either 
end of the polypeptide chain and trading intramolecu¬ 
lar contacts for equivalent intermolecular ones, with 
the total number of native contacts remaining essen¬ 
tially constant. In this manner more and more of the 
monomer chains are substituted for each other, until 
a stable state is reached. However, so far, no experi¬ 
mental evidence exists for such a mechanism and it 
remains highly speculative. 

Biological Implications 

Is domain swapping an in vitro curiosity or does it 
serve a biological function? Although domain swap¬ 
ping may be involved in ► misfolding, aggregation, 
and ► amyloid formation of many proteins, this may 
not be its only function. A number of results suggest 
that this type of oligomerization could be positively 
exploited in biology. One possible role for domain 
swapping may be to regulate protein function by mod¬ 
ulating the populations of active molecules or the 


availability of functional sites, thereby playing a role 
in ► allosteric regulation. Also, domain swapping is an 
efficient means for supramolecular structural organi¬ 
zation of ► oligomers, such as seen in viral capsid 
structures. 

Two Pertinent Examples 

RNase A 

RNase A is the classic example for a domain-swapped 
protein and with its different oligomeric states beauti¬ 
fully illustrates the remarkable options of domain¬ 
swapping modes. Dimerization via N-terminal 
exchange was proposed already in 1962 by Crestfield, 
Stein, and Moore, prior to any structural information, 
to explain the protein’s behavior at acidic pH 
(Crestfield et al. 1962). The first X-ray structure of 
a domain-swapped RNase A dimer was solved in the 
late 1990s by the Eisenberg laboratory (Bennett et al. 
1994), followed by more domain-swapped dimers, tri- 
mers, and multimers (Fig. 2) (Liu and Eisenberg 2002). 

Different conditions result in different types of 
RNase A oligomers. Dimers form at pH 6.5 and 
37 °C, with a dissociation constant for the dimer of 
~2 mM and oligomers exhibiting higher enzyme activ¬ 
ity on double-strand RNA than the monomer. This is 
explained by the active site encompassing amino acids 
from different subunits. Indeed, catalytic histidines are 
contributed by the N-terminal a-helix and the 
C-terminal (3-strand, respectively. 

In one of the trimeric forms of RNase A, both 
N- and C-terminal units are exchanged, resulting in 
a linear arrangement. In the other trimer only the 
C-terminal strands are swapped and a cyclic structure 
ensues. 

B1 Domain 

GB1 is a small, 56 residue, stable, single domain pro¬ 
tein, composed of a four-stranded (3-sheet with a single 
a-helix packed on top of it (Gronenbom et al. 1991). 
A number of surprising structural variants were 
obtained in a large mutagenesis study involving 
a library of randomized hydrophobic core residues. 
Among the alternative structures was also a domain- 
swapped dimer that exhibited (3-hairpin exchange 
between the subunits, resulting in a structure with an 
eight-stranded P-sheet and two antiparallel a-helices 
(Byeon et al. 2003) (Fig. 3). This dimer dissociates at 
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Domain Swapping, 

Fig. 2 Structures of RNase A. 
In the monomer, the two 
secondary structure elements 
involved in exchange are 
colored blue and orange. In the 
dimers and trimers, the 
individual polypeptide chains 
are colored green, blue, and 
orange, respectively. Hinge 
residues are shown with their 
side chains in stick 
representation and colored in 
magenta 



double-terminal swapped trimer 


cyclic C-terminal swapped trimer 


low micromolar protein concentration into partially 
folded, monomeric species, whose overall architecture 
resembles that of monomeric wild type GBl. There¬ 
fore, for this variant, dimerization via domain swap¬ 
ping stabilizes the molten, monomeric hydrophobic 
core (Byeon et al. 2004). 

An additional amino acid change in the domain- 
swapped dimer core caused a further dramatic change 
in structure: a symmetric tetramer ensued with 
intermolecular strand-exchange involving all four 
units (Franck et al. 2002). Three P-strands and the 
a-helix were retained in the tetramer, although their 
intra- and intermolecular interactions were radically 
different, with strand p2 of the first hairpin missing. 
The P3-P4 hairpin was changed to a side by side 
arrangement of strands P3 and p4 from one subunit, 
running antiparallel to P3 and p4 of another one. This 
topological change was accompanied by a shift in 
register. In addition to strand-exchange of the 
domain-swapping kind, a new interface between sur¬ 
face elements of the individual chains was formed. 

The B1 domain of protein L exhibits the same fold 
as the GBl monomer; however, a quite different 
domain-swapped structure was found for its mutants 
(O’Neill et al. 2001) (Fig. 3). Substitution of a glycine 
by alanine in the turn of the second p-hairpin caused 
exchange of the C-terminal p-strand between the sub¬ 
units, with the wild type hairpin straightening and 
creating the intermolecular p-sheet interface. These 
long P-strands are kinked, causing both B1 units to 
be rotated around the hinge region. Exchange of valine 


to alanine in the hydrophobic core also resulted in this 
type of domain-swapped structure. Interestingly, in the 
X-ray structure, the asymmetric unit contains two wild 
type-like monomers and a domain-swapped dimer. 
Novel intermolecular hydrophobic contacts as well as 
intermolecular hydrogen bonds between the 
exchanged P-strands contribute to the stability of the 
domain swap (O’Neill et al. 2001). 

The different oligomeric B1 structures are striking 
examples for illustrating possible structural evolution¬ 
ary paths from monomers to multimers. 

Summary 

Over the last decades, more and more domain- 
swapped protein structures have become available, 
and, at least for some cases, there is evidence in sup¬ 
port of the dimer or multimer constituting biologically 
important species. Indeed, irrespective of whether 
domain swapping is a specific mechanism for regulat¬ 
ing function in vivo, it is becoming clear that it is not 
solely an in vitro artifact. However, no unifying molec¬ 
ular mechanism of domain swapping has emerged yet: 
each protein seemingly behaves in a distinctive and 
individual fashion, and a general explanation for how 
proteins exchange domains still remains elusive. What 
seems to emerge is that domain swapping is closely 
associated with the unfolding/folding process of pro¬ 
teins. For some proteins, distinct intermediates, in 
which some hydrophobic part of the monomeric 
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Fig. 3 Structures of B1 
domains. In the monomers, 
exchanged elements are 
colored in blue. In the dimers, 
individual polypeptide chains 
are colored in green and blue, 
respectively. Hinge residues 
are shown with their side 
chains in stick representation 
and colored in magenta 



GB1 monomer 



GB1 swapped dimer 



LB1 monomer 


LB1 swapped dimer 


protein becomes exposed and, thereby, is available for 
interaction with a “like” molecule, may play a role, 
while for others, complete unfolding may occur. The 
fact that high protein concentration and additives 
(always present during crystallization) promote 
domain swapping implicates an involvement of unfa¬ 
vorable solute/solvent interaction. As a result, an olig¬ 
omeric structure can be trapped either in a crystal or an 
aggregate. Such behavior could also occur in vivo 
under conditions where “monomer” promoting (co-) 
factors are missing or where high local protein con¬ 
centrations are promoted through compartmentaliza- 
tion or the action of protein-protein interaction 
modules. 

Cross-References 

► 3D Domain swapping 
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Synonyms 

RNA 


Definitions 

RNA-hinding proteins (RBPs) are proteins with the 
ability to hind to RNA, which are widely spread in all 
living organisms. 

Double-stranded RNA-hinding proteins are proteins 
with the ability to bind to double-stranded, base paired 
RNA helices. 

Double-stranded RNA-hinding domain (dsRBD) is 
a protein domain with mixed aP topology that binds 
to double-stranded RNA (dsRNA). 

End-recognition domains recognize features like the 
presence or absence of phosphates, and the number of 
phosphates present on the 3'- and 5'-ends of double- 
stranded RNA helices. 


Introduction 

The A-form double helix is the most prominent struc¬ 
tural element of all RNAs, whose structure is com¬ 
posed of helical segments separated by loops 
and bulges and joined together by helical junctions 
where three or more double helices converge. 
Long double-stranded RNA (dsRNA) helices with 
uninterrupted run of base pairs are rare in cellular 
RNAs, but they are more commonly found in viral 
RNAs. Short but well-defined dsRNA helices are 
found in small interfering RNAs where they mediate 
sequence-specific gene silencing. Evolution has pro¬ 
vided specialized protein domains that recognize the 
structural features of A-form double helical RNA, the 
presence or absence of the 2'-hydroxyl group, as well 
as the chemical features found where helices termi¬ 
nate, such as the presence or absence of mono- and 
triphosphates, or the presence and length of single- 
stranded overhangs. Specialized protein domains that 
recognize these features are the subject of this entry. 

The Double-Stranded RNA-Binding Domain 
(dsRBD) 

The double-stranded RNA-binding domain (dsRBD) is 
a small ap domain found in many different proteins 
with diverse cellular functions. The dsRBD was first 
identified in the early 1990s, and is present in >100 
distinct proteins from different organisms ranging 
from Escherichia coli to humans (St Johnston et al. 
1992). Due to the presence of significant amount of 
double-stranded RNA (dsRNA) in the cell and among 
infectious agents, the dsRNA-binding domain is one 
of the most common RNA-binding protein domains. 
Proteins containing the dsRBD have functions in RNA 
localization, RNA editing, translational repression, 
and posttranscriptional gene silencing (Chang and 
Ramos 2005). 

The dsRBDs are ^70 amino acid long segments. 
They form a conserved cxPPPa structure (Bycroft et al. 
1995) and bind to dsRNA but not to single-stranded 
RNA or DNA (ssRNA/ssDNA) (Fig. la). Two struc¬ 
tures of complexes of dsRBDs from different proteins 
provided a rational for this activity, showing that the 
domain recognizes both the shape of the RNA A-form 
double helix and the presence of 2'-hydroxyls in the 
riboses (Ramos et al. 2000). The dsRBD interacts with 
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Double Stranded RNA and 
End-Recognition Domains, 
Fig. 1 Structure and RNA 
recognition by double-stranded 
RNA-binding domains 
(dsRBDs); (a) ribbon diagram 
of dsRBD3 from staufen (PDB 
ID ISTU); (b) the two dsRBDs 
of PKR; the two domains are 
linked by a 22 amino acid 
linker (PDB ID 1QU6); 

(c) stracture of ADAR2 
dsRBM12 bound to the GluR-2 
R/G editing site; the linker 
between the two dsRBDs is 
92 amino acids (PDB ID 
2L3J); (d) RNA recognition by 
staufen dsRBM3 (PDB ID 
lEKZ); (e) stmcture of Rntl 
dsRBD bound to a stem-loop 
RNA (1T4L); (f) Aquifex 
aeolicus RNase III complexed 
with the product of a processed 
double-stranded RNA 
processing element (2EZ6). 

All hgures were generated 
using pymol (www.pymol.org) 




RNA with two protein loops but also uses the first helix 
of the domain to make contacts with the double- 
stranded helix. While the dsRBD does not bind 
dsRNA in a sequence-specific manner, additional ele¬ 
ments emanating from the domain can recognize hair¬ 
pin loops that cap dsRNA sequences, though the extent 
of selectivity that can be achieved is limited compared 
to sequence-specific RNA-binding domains (Ramos 
et al. 2000; Stefl et al. 2010; Wu et al. 2004). 

dsRBDs can be divided into two broad categories: 
type-A dsRBDs have conserved residues along the 
entire length of their sequence, whereas in type-B 
dsRBDs, the conservation occurs only near the car¬ 
boxyl terminus. Proteins containing the first group of 
domains also contain a catalytic domain in addition to 
the presence of one or two dsRBDs. Classical exam¬ 
ples include protein kinase R (PKR) (Fig. lb) (Nanduri 
et al. 1998), which has a dsRBD at its N-terminus and 
a ser/thr kinase domain at the C-terminus that is 
activated when the dsRBDs bind to long dsRNAs. 
Additional enzymes containing dsRBDs include the 
adenosine deaminases ADARl and 2 (Stefl et al. 
2010) which converts adenosine to inosine, the RNA 
helicase A (RHA) which contains a DEXH helicase 
domain (Zhang and Grosse 1997), and the classical 


Rnaselll enzymes, including the Dicer/Drosha 
enzymes involved in microRNA maturation and 
siRNA-based gene silencing which contain an 
RNAse 111 domain for dsRNA cleavage and that there¬ 
fore participate in RNA processing (Lee et al. 2003). 
The second class of dsRBDs is found in proteins which 
do not have any catalytic domain. Representative 
examples include the staufen RNA localization factor 
where the domain was first identified (St Johnston et al. 
1991), the TAR RNA-binding protein (TRBP) that 
participates in posttranscriptional silencing, the vac¬ 
cinia virus E3L protein, Disco interacting protein 1 
(DlPl), and RDE4/R2D2 protein components of the 
RNAi/miRNA pathways (Chang and Ramos 2005). 

Interaction of dsRBDs with dsRNA and 
Non-dsRNA Species 

dsRBDs have the ability to recognize dsRNA, but not 
all dsRBDs do so and even those that do, have different 
affinity. In the case of PKR, dsRBDl has higher bind¬ 
ing affinity toward RNA compared to dsRBD2, 
whereas the reverse trend is observed in DIPT In 
staufen, dsRBD 1, 3, and 4 bind to dsRNA, whereas 
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Double Stranded RNA and End-Recognition Domains, Table 1 Biological function and properties of representative dsRNA- 
binding domains 


Protein 

Structural information 

Protein-RNA or protein-protein interaction 

Biological functions 

PKR 

Two dsRBDs plus 
a kinase domain 

dsRBD 1 has higher affinity toward dsRNA, 
whereas dsRBD2 interacts with the kinase 
domain. Dimerization of dsRBD 1 and dsRBD2 
leads to PKR inactivation, whereas RNA- 
mediated dimerization is required for 
autophosphorylation and activation during the 
antiviral response 

Antiviral-response, apoptosis, cell- 
cycle regulation, transcriptional 
regulation 

Staufen 

Five dsRBDs 

dsRBD2 and dsRBD5 act as protein-protein 
interaction motif that mediates recognition with 
Miranda, whereas dsRBMl,3 and 4 have different 
affinity toward dsRNA 

RNA localization in early 
development 

XlrbpA 

Three dsRBDs 

dsRBDs act as RNA chaperones 

RNA localization and RNA strand 
annealing 

AD AR1/2/3 

Three dsRBDs and 
adenosine deaminating 
domain 

Different dsRBDs bind to different sections of the 
target RNA 

Gene silencing, selective RNA 
editing 

Rnaselll 

Endonuclease domain 
and dsRBD“ 

dsRBD binds to dsRNA 

RNA processing, RNA interference, 
and posttranscriptional gene 
regulation 

RHA 

Two dsRBDs, helicase 
domain, and RGG box 

dsRBDs bind to dsRNA cooperatively 

Unwinding of dsRNA in the 3-5' 
direction, transcription activator, 
splicing and translational control 

TRBP 

Two dsRBDs 

RNA binding 

PKR inhibitor, posttranscriptional 
gene silencing 


“Rntl has an N-terminal domain of ~200 amino acids, whereas Drosha has a large N-terminus of ~900 residues followed by two 
endonuclease domains and one dsRBD. Dicer is similar to Drosha but with a ~ 1,500 residues N-terminal extension which includes an 
RNA helicase domain and Piwi/Argonaute/Zwille (PAZ) domain 


domains 2 and 5 are likely to have different specificity 
(Table 1). Presumably, distinct arrangements of these 
domains and modulation in their affinity play a role in 
organizing protein complexes on dsRNA-containing 
RNAs. The sequence of the dsRNA is usually not 
important for its recognition, but the length of the 
RNA and the presence or absence of structural inserts 
(bulge and internal loops) affect affinity. However, in 
the structure of the ADAR2 dsRBDs bound to a stem- 
loop encoding the R/G editing site of the GluR-2 
mRNA (Stefl et al. 2010) (Fig. Ic), the sequence of 
the minor groove of the dsRNA is recognized by the 
dsRBDs, providing a way to recognize the R/G editing 
site specihcally (Stefl et al. 2010). Several structures of 
dsRBDs elucidate the principle of RNA recognition 
(Fig. Id). In the structures, helix al, loop 2 (between 
pi and P2), and loop 4 (between p3 and a2) interact 
with RNA through the minor, major, and minor 
grooves, respectively. In some structures, the 
N-terminal part of helix a2 also interacts with the RNA 
while loop 4 makes contacts with the phosphodiester 


backbone across the major groove from it. Interest¬ 
ingly, in some structures helix al interacts with sin¬ 
gle-stranded hairpin loops which cap dsRNA helices, 
suggesting a mechanism to provide sequence selectiv¬ 
ity, if not strong specificity, to at least some dsRBDs 
(Ramos et al. 2000) (Stefl et al. 2010) (Fig. Id). This is 
clearly observed in the Saccharomyces cerevisiae 
RNAse III enzyme Rntl and its interaction with 
AGNN RNA hairpins, where an additional helix mod¬ 
ulates the positioning of helix al and may facilitate 
sequence-specific recognition (Fig. le) (Wu et al. 2004). 

A variety of biophysical and biochemical tech¬ 
niques have been used to characterize protein- 
dsRNA-binding affinities and dynamics, as presented 
in Table 2. When the activity of dsRBS has been 
systematically characterized, it has been found that 
dsRBDs may also mediate interactions with other 
dsRBDs, other proteins, DNA, and may also facilitate 
RNA annealing. Indeed each of multiple dsRBD 
domains in a protein can have distinct functions 
(Table 1). dsRBD-dsRBD interactions are observed 
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Double Stranded RNA and End-Recognition Domains, Table 2 Techniques used to study protein-dsRNA-binding affinities 


and dynamics 

System studied 

Experimental method 

Kd 

Result (reference) 

Staufen dsRBD3 

NMR, gel shift assay 

pM range 

This is one of the first protein-dsRNA structures solved (Ramos 
et al. 2000) 

dsRBD-PKR 

NMR 

ND“ 

dsRBD 1 has higher affinity toward dsRNA, whereas dsRBD2 
interacts with the kinase domain (Nanduri et al. 1998) 

ADAR2-dsRBDs 

NMR, fluorescence 
anisotropy, light 
scattering 

Sub-nM range 

Solution structure of ADAR2 bound to GluR-2 R/G site was 
reported. (Stefl et al. 2010) 

Rntl-dsRBD 

NMR, X-ray, gel shift 
assay 

Sub-pM range 

Rntlp-14 bp RNA hairpin structure shows the specificity of 
a single dsRBD toward particular RNA targets. (Wu et al. 2004) 

RNase III 

X-ray 

ND“ 

Complex structure throws light on the mechanism of RNA 
hydrolysis. (Gan et al. 2006) 

Toll-like receptor 3 
(TLR3) 

X-ray 

ND“ 

Crystal structure of TLR3 with 46 nt dsRNA was determined to 
3.4 A resolution (Liu et al. 2008) 

PIWI protein 

X-ray, double filter- 
binding assay 

0.19 pM 

A 2.5 A resolution PIWI structure bound to a 5'-phosphate- 
containing 21 nt RNA provides structural insights into RNA 
interference. (Ma et al. 2005) 

La autoantigen 

X-ray, NMR, gel shift 
assay 

nM range 

Crystal structure of La protein with UUU-OH 3'-end overhands 
was reported. (Teplova et al. 2006) 

RIG-I 

x-ray, ITC“ 

nM range 

Crystal structure of RIG-I CTD bound to both blunt ends of 
a self-complementary 5'-ppp dsRNA 12-mer (Wang et al. 2010) 

Ebola viral protein 
35 (Ebola VP35) 

X-ray, NMR, ITC” 

Sub-nM to nM 
range 

Structure of Ebola VP35 with 8 bp dsRNA provides 
a mechanism by which Ebola VP35 antagonizes host dsRNA 
sensors and immune responses (Leung et al. 2010) 

Tombusvirus pl9 

X-ray, gel shift assay 

nM range 

These structures provide an illustration of siRNA sequestering 
by a viral protein. (Vargason et al. 2003; Ye et al. 2003) 

PIWI argonaute 

X-ray, NMR 

ND“ 

Structure of PIWI domain-siRNA guide complex provides 
insight into RNA recognition and cleavage. (Lingel et al. 2004; 
Parker et al. 2005) 

Argonaute 2 PAZ 

X-ray, surface 
plasmon resonance 

nM range 

This structure shows a siRNA-end-binding module for siRNA 
transfer in the RNA silencing pathway. (Ma et al. 2004) 

“Not determined, ITC Isothermal titration calorimetry 


in the proteins PACK and TRPB, while the dsRBD2 of 
PKR interacts with its own kinase domain, just like the 
dsRBD of Rnaselll interacts with the catalytic 
RNAselll domain (Fig. If) (Gan et al. 2006). Finally 
and most unusually, the two dsRBDs of RHA protein 
have dsDNA-binding activity. Thus, the dsRBD is 
a versatile platform that mediates recognition of 
different binding targets, though it is primarily tuned 
to bind to dsRNA helices (Chang and Ramos 2005). 

Recognition of Long Double-Stranded RNAs 
by Toll-like Receptors 

A different structural solution for recognition of long 
double-stranded RNAs is provided by toll-like 


receptor 3 (TLR3) (Liu et al. 2008). This receptor 
activates the inflammatory response during the 
antiviral response by recognizing stretches of double- 
stranded RNA of at least 40-50 base pairs, as are found 
in dsRNA viruses but not commonly in cellular RNAs. 
Signal transduction is activated by dimerization of the 
receptor ligand-binding domain in response to dsRNA 
binding. TLR3 has a solenoid-like structure with 
a horseshoe shape comprised of 23 leucine-rich repeats 
(LRRs) that form a concave P-sheet surface, a common 
protein scaffold utilized for protein-protein and 
protein-RNA interactions in a variety of contexts 
(Fig. 2a). Each monomer binds to dsRNA at opposite 
ends of the horseshoe, with both the RNA presenting 
a typically undistorted A-form helix and the receptor 
undergoing only small changes in structure compared 
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Double Stranded RNA and End-Recognition Domains, 
Fig. 2 Structures and RNA recognition of end-recognition 
domains; (a) Toll-like receptor 3 (TLR3) with dsRNA (PDB 
ID 3CIY); (b) fulgidus PIWI protein (PDB ID lYTU); (c) 
recognition of UUU-OH 3'-termini of nascent RNA pol III 
transcripts by La autoantigen (PDB ID IZH5); (d) recognition 
of 5'-ppp RNA by the innate immune receptor RIG-1 (PDB ID 
3NCU); (e) structure of the Zaire Ebola viral protein 35 (Ebola 
VP35) interferon inhibitory domain bound to 8 base-pair dsRNA 


(PDB ID 3L25); the conserved Phe235 are denoted with green 
spheres; (f) the tombusvirus pl9 protein bound to a 21 nucleotide 
siRNA (PDB ID IRPU); Trp residues stacking with RNA are 
shown with green spheres; (g) structure of PIWI argonaute 
protein from Archaeoglohus Fulgidus complexed with a 16 
nucleotide siRNA duplex (PDB ID 2BGG); (h) 3'-end recogni¬ 
tion of nucleic acids by the Drosophila Argonaute 2 PAZ 
domain (PDB ID 1T2R). All images were generated using 
pymol (www.pymol.org) 


to the free receptor. The canonical A-form position 
of the grooves and of the phosphates are recognized 
by the receptor, which does not make any direct con¬ 
tacts to the bases and therefore binds dsRNA regard¬ 
less of its sequence. Interactions between dsRNA and 
TLR3 occur at both the C- and N-termini, suggesting 
that the length of the receptor, as determined by the 
number of leucine-rich repeats, is a major determinant 
of binding. Dimeric binding leads to receptor activa¬ 
tion but also to cooperativity, which overcomes the 
relatively weak interactions formed by each receptor 
with dsRNA. 


dsRNA-End-Recognition Domains 

Several conserved protein domains recognize the 
unique chemical features found at the 3'- or 5'-ends 


of dsRNA helices, and can be named end-recognition 
domains. These features include the presence or 
absence of phosphates, and the number of phosphate 
present on the 5'-end. Several of these domains have 
been structurally characterized and play important 
roles in antiviral defense and immune response. 
Among them are the PAZ (PIWI/Argonaute/Zwille) 
domains that recognize two-nucleotides overhang at 
the 3'-end (Ma et al. 2004), Piwi proteins that recog¬ 
nize the phosphorylated 5'-end (Fig. 2b) (Ma et al. 
2005); La protein that recognizes single-stranded 
UUU 3'-overhangs (Fig. 2c) (Teplova et al. 2006); 
the innate immune receptor RIG-I which recognizes 
5'-triphoshpates found at the 5'-end of viral RNAs 
(Fig. 2d) (Wang et al. 2010); the RIG-I-like receptor 
LGP2 (Li et al. 2009) and the Ebola virus VP35 
(Fig. 2e) (Leung et al. 2010) proteins that recognize 
free 5'-OH hydroxyl ends (Liu et al. 2008). 
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Recognition of Foreign RNA and Self/ 
Non-Self-Discrimination by Immune 
System Proteins 

RNAs of foreign origin contain characteristic terminal 
structures which are distinct from cellular mRNAs and 
are recognized by protein components of the immune 
system, for example mono- and tri-phoshphorylated 
5'-ends. The recognition of these non-self RNA spe¬ 
cies allows discrimination against self-RNAs because 
either cellular RNAs are capped by 7-methyl guano- 
sine or phosphates are removed during RNA matura¬ 
tion. In addition to the intrinsic importance of these 
domains to understand viral infectivity and the 
immune defense, understanding their properties and 
how they recognize defined RNA features would be 
of importance to define immunostimulatory nucleic 
acid species for treating cancer and infections, since 
immunostimulation by RNA species can have antican¬ 
cer activity. 

Viral RNAs carrying 5'-triphosphates at the 5'-end 
are specifically recognized by RIG-1 innate immune 
receptors when presented at the end of blunt-ended 
double-stranded RNAs. This interaction leads to the 
activation of interferon production and apoptosis of 
infected cells. Although the primary recognition ele¬ 
ment is the 5'-ppp character at the 5'-end of the RNA, 
RIG-I receptors contain a domain at their C-terminus 
that requires a base paired structure for optimal recog¬ 
nition of the triphosphates. Interestingly, the domain 
senses the presence of 2'-OH groups at certain posi¬ 
tions within the helix, so that incorporation of 2'-0 
methyl group, as is done to stabilize immunosti¬ 
mulatory RNA against cellular nucleases, is expected 
to lead to decreased activity. Thus, the complete 
recognition element is 5'-ppp capping a blunt-ended 
double-stranded RNA helix. An important element of 
recognition found in this and other domains that bind 
the exposed ends of dsRNA helices, as will be 
discussed several times below, is the stacking of 
the aromatic ring of, in this case, a phenylalanine on 
the blunt end of the duplex (Leung et al. 2010). 

Sequestering Away Viral RNA to Evade the 
Antiviral Response 

An almost opposite example is provided by viruses 
which seek to evade the host defenses by sequestering 


dsRNA molecules (siRNAs) that mediate antiviral 
defense. Through these interactions, viral protein 
domains interfere with the recognition of the unique 
features by RIG-I and toll-like receptors. These 
structures provide an example of the attack and 
counterattack between invading pathogens and the 
host immune system. 

Examples of this viral mechanism to evade the 
host immune response are found in both plant and 
animal viruses. Many plant viruses encode proteins 
that bind to and sequester the 21-base pairs RNA 
species that mediate siRNAs, a primary antiviral 
plant defense mechanism. Structures of tombusvirus 
pl9 proteins (Vargason et al. 2003; Ye et al. 2003) 
show that the base paired RNA of 19 base pairs is 
cradled by the concave face of the protein formed 
across a homodimeric interface that recognizes 
both the 2'-hydroxyl groups (thus discriminating 
RNA from DNA) and the phosphodiester backbone. 
The most important feature of the protein is two 
a-helices that project from opposite sides and pre¬ 
sent a pair of tryptophan residues that stack at the 
ends of the duplex (Fig. 2f), in essence measuring 
the length of the siRNA; such an arrangement can 
only accommodate 19-21 base pairs, the length of 
siRNAs. This residue is universally conserved 
among homologous proteins. A prominent example 
derived from animal viruses is the Ebola virus VP35 
protein that antagonizes antiviral pathways activated 
by RIG-I-like receptors. This protein recognizes 
the dsRNA backbone and the blunt ends of the 
helix to antagonize host dsRNA sensors and the 
activation of the host immune response. Here, rec¬ 
ognition of the dsRNA ends is achieved by a cluster 
of conserved hydrophobic residues that provides an 
end cap to the dsRNA structure (Leung et al. 2010) 
(Fig. 2e). 

The RNAI/MIcroRNA Pathways 

Protein components of the RNAi pathway must be 
able to recognize RNA effectors and mediators in a 
sequence-independent manner, yet to only associate 
with pathway components. Small interfering RNAs 
contain a two-nucleotide RNA overhang at the 3'-end 
and a phosphorylated 5'-end, separated by 19-23 base 
pairs of double-stranded RNA. One class of proteins 
that participate in RNAi recognition is the Piwi 
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(and argonaute) proteins that provide a binding pocket 
for recognition of the guide strand 5'-end, which con¬ 
tains a singly phosphorylated 5'-end. Structures show 
that the phosphate is buried in a highly conserved basic 
pocket in the protein (Fig. 2g) (Ma et al. 2005; Parker 
et al. 2005). Interestingly, the first nucleotide of the 
RNA is not stacked on the remainder of the helix, but 
is instead stacked on a conserved tyrosine residue, 
providing yet another example of the recognition of 
the RNA ends by conserved aromatic residues that 
stack on exposed RNA bases. 

The 3'-ends of siRNAs are recognized instead by 
PAZ (Piwi/Argonaute/Zwille) domains, which have 
been studied structurally when bound to both single- 
stranded RNAs (Fig. 2h) (Lingel et al. 2004) and 
siRNA-like duplexes (Ma et al. 2004). Interestingly, 
argonaute proteins contain both Piwi and PAZ 
domains and therefore would suffice to recognize 
siRNA within the complex (called RISC) that exe¬ 
cutes siRNA maturation. PAZ domains bind to 
dsRNA in a sequence-independent manner but anchor 
the two-nucleotide overhang found at the 3'-end 
of siRNA-like duplexes in a highly conserved hydro- 
phobic binding pocket, while at the same time 
recognizing the dsRNA through interactions with 
the phosphodiester backbone of the RNA strand 
containing the overhang. None of the interactions 
with the overhang are sequence specific, suggesting 
that any nucleotide can be accommodated, nor is there 
any recognition of the 2'-hydroxyls in the helical 
region, consistent with the observation that RNAi is 
not affected by modihcation of the 2'-hydroxyl 
groups. However, the two strands are recognized in 
a highly asymmetric manner, suggesting a mechanism 
to discriminate the sense and nonsense strand to elicit 
posttranscriptional gene silencing. 

Recognition of Single-Stranded Overhangs 
by La Protein 

Another example of recognition of single-stranded 
overhangs is provided by La protein (Teplova et al. 
2006) (Fig. 2c), which recognizes a trinucleotide 
UUU repeat at the end of polymerase III transcripts 
and protects them from exonuclease degradation. 
This phosphorylated abundant cellular protein was 
first identified in a patient with autoimmune diseases, 
and participates in multiple aspects of RNA 


metabolism. La protein interacts with the UUU over¬ 
hang using the La domain at its N-terminus, the sig¬ 
nature of this protein, and a common RNA-binding 
domain, the RRM. The La domain is a winged-helix 
structure homologous to several DNA-binding pro¬ 
teins, and these two domains synergistically recog¬ 
nize the overhang, although most of the contacts to 
the RNA are from the La domain. The RRM is used in 
an unusual manner in this protein, with targeting of 
the RNA not to the surface of the P-sheet but to its 
edges. The UUU overhang is splayed out in 
a conformation completely distinct from what is 
seen in the PAZ structure, against a pocket lined 
with highly conserved or invariant aromatic and acid 
residues, unusually for an RNA-binding protein. The 
bases of the overhang are exposed and available to 
form intermolecular hydrogen bonds and stacking 
interactions, while the remainder of the RNA forms 
a self-complementary partial double helix in the crys¬ 
tal. In this case, however, the RNA helix is not part of 
what is recognized by La protein (Teplova et al. 
2006). 

Summary 

Many RNA-binding proteins (RBPs) recognize 
double-stranded RNA (dsRNA) and discriminate the 
length of the double helix and terminal features found 
at either of the exposed 5'- or 3'-ends. Many proteins 
that recognize dsRNA with a variety of functional 
roles contain the double-stranded RNA-binding 
domain (dsRBD). Proteins involved in RNA silencing 
and in the antiviral response often recognize unique 
features found at the end of the RNA itself, such as 
single-stranded overhangs or the presence of phos¬ 
phate groups. In this entry, we have reviewed the 
properties of dsRNA-binding domains and of 
domains that bind to various RNA helix terminal 
structures. 


Cross-References 

► Alternative Splicing 

► Domain Structure Classifications 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► RNA-Binding Proteins - Catalytic Domains 
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Double-Mutant Cycle Analysis 

Amnon Horovitz 

Department of Structural Biology, Weizmann Institute 
of Science, Rehovot, Israel 

Synonyms 

Mutation; Thermodynamic mutant cycle analysis 


Definition 

In double-mutant cycle analysis (Horovitz 1996), two 
residues in a protein, i and j, are replaced separately 
and together by another given amino acid (designated 
by 0), thereby giving rise to a cycle (Fig. 1) that 
comprises the wild-type protein (Py), two single 
mutants (Pjo and Pqj), and the corresponding double 
mutant (Pqo)- Double-mutant cycles are referred to as 
such since they were originally developed for analyz¬ 
ing interactions in proteins by mutagenesis (Carter 
et al. 1984). However, the same conceptual framework 
can be used for probing non-covalent interactions in 
general, for example, in DNA and RNA (Siegfried and 
Bevilacqua 2009) or in synthetic systems (Cockroft 
and Hunter 2007), by replacing functional groups 
using chemical synthesis. The change in free energy 
upon the mutation i ^ 0, when leaving residue 
j unchanged, associated with a structural or functional 
property of the protein, may be expressed relative to 
the wild-type protein as AG(ij ^ Oj). Likewise, the 
change in free energy upon the mutation i ^ 0, when 
residue j has already been mutated to 0, may be 
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PiO 


AG(iO^OO) 

^ Pqo 


AG(ij^iO) 


AG(Oj^OO) 


AG(ij^Oi) 

Double-Mutant Cycle Analysis, Fig. 1 Scheme for a double¬ 
mutant cycle. P stands for the protein and i and j for the two 
residues that are being mutated to some other residue (e.g., 
alanine) designated by “0.” The cycle comprises the wild-type 
protein, Py; two single mutants, Pjo and Pqj; and the double 
mutant, Poq. AG(ij ^ Oj), AG(iO ^ 00), AG(ij ^ iO), and 
AG(0j ^ 00) are the free energies corresponding to the appro¬ 
priate mutations in the cycle 


expressed as AG(i0 ^ 00). The free energy changes 
AG(ij ^ iO) and AG(0j ^ 00) are defined similarly. 
The coupling energy, A^Gin,(ij), between residues i and 
j is given by: 


A2Gi„,(ij) = AG(ij ^ Oj) - AG(i0 ^ 00) 

= AG(ij ^ iO) - AG(0j ^ 00) (1) 

When the effects of the mutations i ^ 0 and j ^ 0 are 
independent of each other, i.e., the effect of the double 
mutation is equal to the sum of effects of the single 
mutations (AG(ij ^ 00) = AG(ij ^ Oj) -i- AG(ij ^ iO)), 
then A^Gint(ij) = 0. If, however, the effects of the muta¬ 
tions are not independent of each other, i.e., the effect of 
mutating one residue depends on whether the other 
residue has been mutated or not (AG(ij ^ Oj) ^ 
AG(i0 ^ 00) and AG(ij ^ iO) ^ AG(0j ^ 00)), then 
A"Gi„,(ij)y^0. 

The double-mutant cycle method has been extended 
to higher dimensions to study various levels of 
cooperativity between interactions (Horovitz and 
Fersht 1990). For example, analysis of the effect of 
one residue on the interaction between two other resi¬ 
dues requires constructing a triple-mutant box that 
consists of the wild-type protein, three single mutants, 
the three double mutants and the triple mutant (Fig. 2). 
Each face of such a box corresponds to a double¬ 
mutant cycle. Double-mutant cycles on “opposite” 
faces of the triple-mutant box correspond to the same 
pairwise interaction but with a different residue at the 


^iOO ^ P 000 



Double-Mutant Cycle Analysis, Fig. 2 Scheme for a triple¬ 
mutant box. P stands for the protein and i, j, and k for the three 
residues that are being mutated to some other residue (e.g., 
alanine) designated by “0.” The cycle comprises the wild-type 
protein, Py^; three single mutants (Pojk, Piok. and Pyo); three 
double mutants (Pojo> Pioo. and Pook): and the triple mutant, Pqoo 


third position. The difference between coupling ener¬ 
gies of cycles on opposite faces of the box, A^Gjnt, 
provides a measure of the effect of one residue on the 
pairwise interaction between two other residues. Like¬ 
wise, by constructing a four-dimensional mutant con¬ 
struct, it is possible to determine A^Gin, that is 
a measure of the effect of one pairwise interaction on 
another pairwise interaction. In general, the analysis of 
N interacting residues requires construction of the 
appropriate N-dimensional mutant construct (Horovitz 
and Fersht 1990) but is limited by the accuracy of 
measurements owing to the accumulation of experi¬ 
mental error. 


Basic Characteristics 

The free energy changes upon mutation, AG(ij ^ Oj), 
AG(i0 ^ 00), AG(ij ^ iO) and AG(0j ^ 00), cannot be 
measured directly by experiment. Instead, they are 
determined by assuming that each free energy change 
upon mutation is equal to the difference between the free 
energies before and after the mutation that are associated 
with some process such as folding or binding that the 
proteins undergo (e.g., AG(ij ^ iO) = AG(ij) - AG(iO)). 
It follows that coupling energies determined by experi¬ 
ment are always for one state relative to another. 
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For example, coupling energies, A^Gint(ij), determined 
by measuring the folding energies of the four proteins 
in the cycle provide a measure of the interaction 
energy between residues i and j in the folded state 
relative to the unfolded state. Similarly, A^Gint(ij) 
determined by measuring the ligand-binding energies 
of the four proteins in the cycle provides a measure of 
the interaction energy between residues i and j in the 
ligand-bound state relative to the unbound state 
(Horovitz 1987). 

Under favorable circumstances, the coupling 
energy, A^Gint(ij), provides a good estimate of the 
“true” energy interaction between residues i and 
j (i.e., the change in energy upon “turning off’ the 
interaction without causing any other changes in the 
protein[s]). Both intramolecular interactions and 
intermolecular interactions (when residue i is in one 
protein and residue j in another protein (Schreiber et al. 
2009)) can be characterized in this way. In order 
for favorable circumstances to exist, the mutations 
i ^ 0 and j ^ 0 must not introduce new interactions 
or cause large conformational changes in the protein 
(Serrano et al. 1990). Given these requirements, it is 
not surprising that mutations to glycine, for example, 
are avoided as they may cause large conformational 
changes and that alanine is usually chosen for “0” 
since mutation to this residue minimizes the likeli¬ 
hood that new interactions will be formed. It should 
be noted, however, that A^Gint(ij) can never be equal 
to zero exactly, even in the case when residues i and 
j do not interact, owing to the nonadditivity of 
entropies. 

In addition to their value in determining interaction 
energies of residues known to be in contact from struc¬ 
tural data, double-mutant cycles can also be used to 
detect interactions when structural information is not 
available or cannot be obtained using techniques such 
as X-ray crystallography and NMR. In principle, such 
information can then be used as constraints in structure 
determination. For example, constraints obtained from 
double-mutant cycle analysis of intermolecular 
pairwise interactions can be used in determining the 
structure of a protein complex when the structures of 
the individual proteins are known (Hidalgo and 
MacKinnon 1995), i.e., in docking. Similarly, con¬ 
straints obtained from double-mutant cycle analysis 
using transient kinetic data can be used to determine 
structures of intermediate and transition states in var¬ 
ious reactions such as protein folding. Double-mutant 


cycles can also be used to identify pathways of 
allosteric communication in proteins (Yifrach et al. 
2009) often in conjunction with computational tech¬ 
niques such as normal mode or correlated mutation 
analyses. 

Cross-References 

► Mutagenic Scanning to Define Binding Hotspots 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► Protein Complexes 

► Thermodynamics of Biomolecular Interactions 
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Centre for Computational Chemistry, School 
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Synonyms 

Xenobiotic metabolism 



Drug Metabolism - Computational Studies of Mecha¬ 
nisms, Fig. 1 Active site of Cytochrome P450 101 (P450cam). 
containing Compound I (Cpd 1) and substrate camphor (PDB 
code: 1DZ9) 


Definition 

The chemical steps leading to deactivation of drug 
molecules and preparation for their excretion. 

Basic Characteristics 

There are numerous mechanisms for the deactivation 
of drug molecules (also termed “xenobiotic” metabo¬ 
lism, along with the metabolism of other chemicals 
that are “foreign” to the body, such as those found in 
foodstuffs). These mechanisms, in general, increase 
the solubility and molecular weight of xenobiotic 
compounds. The most commonly performed chemical 
transformations are oxidation, hydrolysis, methyla- 
tion, sulfation, acetylation, and glucuronidation. In 
many cases, a drug molecule may undergo two or 
more successive chemical transformations during 
metabolism. For example, a molecule of benzene 
may undergo oxidation to form phenol, which is cata¬ 
lyzed by a cytochrome P450, and then undergo 
sulfation, catalyzed by a sulfotransferase. 

There may be more than one metabolic pathway 
open to a drug molecule, and the route that is 
taken will depend on the availability of the appropriate 
metabolizing enzymes. An example of this is 
the metabolic pathway of paracetamol in humans. 
At low doses, the major routes of metabolism are 
via glucuronidation and sulfation, which yields 


metabolites that are safely removed from the body. 
However, at high doses, a signihcant proportion of 
paracetamol will undergo A-hydroxylation by 
cytochrome P450 2E1 and 1A2, the product of which 
can rearrange to produce a highly toxic quinone, 
A-acetyl-p-benzo-quinone imine (NAPQI), which 
causes damage to the liver. 

Some drugs undergo conversion, in some drug 
metabolizing enzymes, to more potent analogues. 
For example, the commonly prescribed opiate drug, 
codeine, is converted into the more potent drug, 
morphine, by cytochrome P450. 

The enzymes that catalyze the metabolism of drugs 
include (but are not restricted to) the following: 


Oxidation 

Cytochrome P450 

Alcohol dehydrogenase 
Aldehyde dehydrogenase 
Xanthine oxidase 

Amine oxidases 

Hydrolysis 

Esterases 

Epoxide hydrolase 

Methylation 

Methyltransferases 

Sulfation 

Sulfotransferases 

Acetylation 

CoA-S-acetyltransferase 

Glutathione conjugation 

Glutathione S-transferase 

Glucuronidation 

UDP-glucuronosyltransferase 


Understanding the mechanisms of drug metabolism 
is of high importance to the development of new drugs. 
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Drug Metabolism - Computational Studies of Mechanisms, Fig. 2 Mechanism for hydroxylation of alkanes by cytochrome 
P450 enzymes 
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Drug Metabolism - 
Computational Studies of 
Mechanisms, 

Fig. 3 Hydroxylation of 
benzene by Cpd I of 
cytochrome P450 to form 
epoxide, ketone, and phenol 
products 

Computational enzymology has provided some vital 
insight into how theses enzymes function. Several 
computational mechanistic studies of these enzymes 
are highlighted below. 

Cytochrome P450 Enzymes 

Cytochrome P450 enzymes (CYPs) metabolize the 
majority of drugs that are currently prescribed, and 



have therefore received extensive theoretical attention. 
The mechanism of substrate oxidation by the 
high-valent iron-oxo intermediate, Compound I 
(Cpd I - Fig. 1), has received the most attention because 
of the difficulty in isolating it experimentally. Cpd I 
performs a variety of chemical transformations includ¬ 
ing aliphatic hydroxylation, aromatic hydroxylation, 
epoxidation, N/S/O-dealkylation, and N/S-oxidation. 
The most common of these transformations are aliphatic 
and aromatic hydroxylation, discussed below. 








Drug Metabolism - Computational Studies of Mechanisms 


515 


D 




Drug Metabolism - Computational Studies of Mechanisms, Fig. 4 Hydrolysis of epoxide RCHOCR’ by epoxide hydrolase 


Aliphatic Hydroxylation 

Hydroxylation of alkanes by Cpd I proceeds via hydro¬ 
gen abstraction from the substrate by the Cpd I ferryl 
oxygen, resulting in a radical intermediate. The radical 
intermediate combines with the Fe-bound OH moiety in 
a “rebound” step to form the hydroxylated product 
(Fig. 2). The electronic structure of Cpd I exists as 
a mixture of two closely lying doublet and quartet spin 
states. The hydroxylation reaction is commonly believed 


to occur via both of these spin states (Shaik et al. 2005). 
Hydrogen abstraction is the rate-limiting step with an 
activation barrier of 26 kcal moF* reported for methane 
in vacuo at the B3LYP/LACVP level of theory (Shaik 
et al. 2005). The rebound step does not have a barrier on 
the doublet spin surface for some substrates, however, 
a small barrier is observed on the quartet spin surface. 
This observation has been used to explain the formation 
of products that undergo rearrangement. 
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Aromatic Hydroxylation 

Hydroxylation of aromatic compounds by CYPs has 
a different mechanism to that of aliphatic hydroxyl¬ 
ation. The hrst step is believed to be the bond 
formation between the ferryl oxygen of Cpd I and 
substrate aromatic carbon, to yield a a-adduct 
intermediate (Fig. 3). This process has been modeled 
in DFT and QM/MM studies of benzene hydroxylation 
(Bathelt et al. 2004, 2008). The barrier to formation of 
the cj-adduct intermediate for benzene was calculated 
to be around 18 kcal moF^ at the B3LYP-LACVP/ 
CHARMM level of theory. The cj-adduct may 
rearrange via several possible pathways to form 
phenol, epoxide, and ketone products. In calculations 
of benzene oxidation, the energy barriers for all three 
rearrangement pathways were found to be similar 
(Bathelt et al. 2008). Arene oxides are highly toxic, 
and hence predicting their formation during the early 
stages of drug development may help to avoid adverse 
effects. 

Epoxide Hydrolase 

Epoxide hydrolases catalyze the conversion of 
epoxides to their corresponding diols with a high 
degree of regio- and stereoselectivity (Fig. 4). Epoxide 
hydrolase contains a catalytic triad consisting of two 
aspartate residues and a histidine. The epoxide 
undergoes nucleophilic attack by Asp333 
(corresponding to residue numbering in murine soluble 
epoxide hydrolase (sEH)), resulting in epoxide ring 
opening and formation of an ester intermediate. This 
intermediate undergoes hydrolysis to form the diol 
product. Two tyrosine residues have been identified 
as important in the catalytic cycle of soluble epoxide 
hydrolase, as they stabilize the ester intermediate 
(Eig. 5). The high degree of regio- and stereoselectivity 
observed in sEH products is due to the ability of the 
enzyme to distinguish between the two epoxide carbon 
atoms in a substrate, with a preference for nucleophilic 
addition by Asp333 at a particular carbon atom. The 
mechanism of human soluble epoxide hydrolase (sEH) 
has been studied in a gas phase small model study by 
Hopmann et al. using the B3LYP density functional 
(Hopmann and Himo 2006), where a lower barrier (by 
4 kcal mol”*) to nucleophilic attack was observed at 
the more hindered (but more electronically stabilized) 
carbon atom of trani-methylstyrene oxide. QM/MM 



Drug Metabolism - Computational Studies of Mecha¬ 
nisms, Fig. 5 Active site of murine soluble epoxide hydrolase, 
containing fra/ti-diphenylpropeneoxide. The two active site 
tyrosine residues, responsible for stabilization of the alkyl- 
enzyme intermediate, are forming hydrogen bonds to the epox¬ 
ide oxygen (Lonsdale et al. 2012) 

free energy calculations have been performed for the 
formation of the ester intermediate during the hydro¬ 
lysis of frani-styrene oxide and frany-diphenylpropene 
oxide (t-DPPO). (Lonsdale et al. 2012). The free 
energy barriers (AMl/CHARMM), calculated using 
umbrella sampling, were found to be in agreement 
with the experimentally observed preference for nucle¬ 
ophilic attack at the phenyl epoxide carbon atom of 
t-DPPO. The protonation state of His523 during the 
nucleophilic attack has been a matter of previous 
debate; however, the calculations support a model in 
which His523 is protonated at both imidazole nitrogen 
groups, as shown in Pig. 4. Additionally, hydrogen 
bonds between the Tyr465 and Tyr381 hydroxyl 
groups and the epoxide oxygen atom are found to 
lower the barrier to nucleophilic attack significantly. 
Higher-level (SCS-MP2//B3LYP/CHARMM) calcu¬ 
lations yielded activation free energy barriers that are 
in excellent agreement with barriers derived from 
experimental studies. 

Glutathione S-Transferase 

Glutathione S-transferases (GST) catalyze the conju¬ 
gation of glutathione (a tripeptide - y-Glu-Cys-Gly) to 
a wide range of electrophiles. Like the epoxide 
hydrolases, GSTs react with epoxides (along with 
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Computational Studies of 
Mechanisms, 

Fig. 6 Mechanism for 
conjugation of glutathione to 
phenanthrene 9,10-oxide to 
form the diastereomeric 
(9S,10R) and (9R,10S) 

9- (S-glutathionyl)- 

10- hydroxy-9, 

10-dihydrophenanthrene 
products 
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Drug Metabolism - Computational Studies of Mechanisms, Fig. 7 Mechanism for conjugation of glutathione to l-chloro-2,4- 
dinitrobenzene in Ml-1 GST via a Meisenheimer intermediate 


Other substrates), which are often formed during cyto¬ 
chrome P450-mediated oxidation reactions. The 
conjugation of glutathione to phenanthrene 
9,10-oxide has been modeled in Ml-1 GST using 
QM/MM (Ridder et al. 2002). In a similar mechanism 
to epoxide hydrolase, the thiolate sulfur of glutathione 
performs a nucleophilic attack at one of the epoxide 
carbon atoms, resulting in the collapse of the 
epoxide ring (Fig. 6). AMl/CHARMM free energy 
profiles (using reaction-specific AMI parameters 
for sulfur) were obtained for nucleophilic attack at 
both epoxide carbon atoms. Similar barriers (of 
20 kcal mol”*) were calculated for both carbon 
atoms, in good agreement with experiment. 

The effect of several active site mutants on the 
energy barriers and corresponding selectivity was 


also investigated, highlighting the importance of 
TyrllS and Asn8 in the reactivity of Ml-1 GST with 
epoxides. 

In a similar study, the conjugation of glutathione to 
l-chloro-2,4-dinitrobenzene in Ml-1 GST was 
modeled (Bowman et al. 2007). The reaction takes 
place via aromatic nucleophilic substitution at the 
chloro carbon (Fig. 7). This reaction is often used as 
a standard activity assay for GSTs. Calculations 
support the existence of the proposed Meisenheimer 
intermediate as a short-lived but true intermediate. 
Small barriers (^1 kcal mol”*) were observed for 
both the formation of, and elimination of chlorine 
from, the Meisenheimer complex. In the nonenzymatic 
reaction in solution, a barrier of 9.5 kcal mol”* to 
formation of the Meisenheimer complex was 







D 


518 


DTSS Method 


calculated. Several active site mutations were 
performed, with the largest decreased in activity 
observed with the TyrbPhe mutation. This difference 
in activity is due to the loss of the effect Tyr6 has in 
lowering the pKa of the thiol of GST. 


Cross-References 

► Cytochrome P450 - Computational Studies 

► Glutathione S-Transferase - Computational Studies 

► Glycosylation in Metabolism 

► QM/MM Calculations of Vibrational Spectra 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 
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Dual-Beam Fluorescence Cross- 
Correlation Spectroscopy 

Alan Van Orden 

Department of Chemistry, Colorado State University, 
Fort Collins, CO, USA 

Definition 

Cross-Correlation'. A mathematic function that com¬ 
pares the fluorescence intensity measured at one point 
in time and space to that measured at a later point in 
time and space to reveal the time-dependent processes 
of a molecular sample that cause fluctuations in the 
fluorescence intensity between the different points. 

Basic Characteristics 

Dual-beam fluorescence cross-correlation spectros¬ 
copy (FCCS), also referred to as dual focus, two-beam, 
or spatial FCCS, is a type of fluorescence correlation 
spectroscopy (FCS) in which fluorescence fluctuations 
from two spatially offset optical microscope detection 
volumes are detected on two separate detectors and 
analyzed to determine the intensity cross-correlation 
function of the fluorescence signals (Brinkmeier 

2001) . To understand the basis for dual-beam FCCS, 
it is useful to compare this technique to the more 
conventional single-beam FCS (Widengren and Mets 

2002) . In single-beam FCS, a high numerical aperture 
microscope objective is used to focus an excitation 
laser beam within an open sub-volume of a dilute 
solution of fluorescent molecules. This open sub¬ 
volume is defined by the focal region of the laser 
beam, and when focused to the diffraction limit, is 
typically on the order of ^1 femtoliter. Fluorescence 
signals from molecules diffusing through this region 
are detected using a confocal microscope setup. Fluc¬ 
tuations in the number of molecules occupying the 
focal volume give rise to fluorescence intensity fluctu¬ 
ations that are subjected to intensity autocorrelation 
analysis. This results in an autocorrelation function 
that depends on the diffusion rate of the analyte mol¬ 
ecules, as well as other molecular processes that cause 
fluorescence fluctuations or changes in diffusion 
rate. Such processes can include photoisomerization. 
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Fig. 1 (a) Schematic diagram 
of a dual-beam FCCS 
experiment used to analyze 
molecules flowing through 
a capillary electrophoresis 
system, (b) Coordinate axis 
system for the confocal 
excitation volumes in 
dual-beam FCCS and capillary 
electrophoresis. For the data in 
Fig. 2, the following 
parameter values were used: 
ct>o ~ 250 nm, Zq ~ 1.5 pm, 
fi Ri 5 pm, £ fsi -22 kV 
(Copyright: American 
Chemical Society, Used with 
permission) 
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intersystem crossing, biomolecule folding, or ligand- 
receptor binding, making single-focus FCS a powerful 
technique for studying a wide range of molecular pro¬ 
cesses with high sensitivity and low sampling volumes. 

Single-beam FCS is typically used to probe molec¬ 
ular processes taking place in static solutions. There 
are many circumstances under which the molecules of 
interest flow or are transported in a uniform direction. 
Such systems may include microfluidics or electropho¬ 
resis channels or even individual cells in which mole¬ 
cules are transported across membranes or through 
subcellular compartments. Single-beam FCS can 
detect changes in the transit time of the molecules 
through the detection volume caused by this uniform 
translation, but is not sensitive to the direction of 
flow. Also, single-beam FCS has difficulty resolving 
relative contributions of flow and diffusion to the 


autocorrelation function, as well as the presence of 
different components of a mixture on the basis of 
flow rate or direction. In dual-beam FCCS, the excita¬ 
tion laser beam is split into two beams of equal inten¬ 
sity, such that two closely spaced probe volumes, 
^2-5 pm apart, are created within the solution 
(Fig. 1). This adds an additional dimension to FCS 
because cross-correlating the fluorescence fluctuations 
observed in the two detection volumes gives rise to 
a peak in the correlation function with position, ampli¬ 
tude, and width that depend on the flow and diffusion 
rates and the direction of flow. If different molecules are 
flowing at different rates, due to, for example, differ¬ 
ences in electrophoretic mobility, multiple peaks are 
observed in the cross-correlation function, giving dual¬ 
beam FCCS an advantage in multicomponent analysis. 
These characteristics have given rise to a number of 
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applications of dual-beam FCCS, including flow profil¬ 
ing in microfluidic channels (Brinkmeier 2001), detec¬ 
tion in continuous flow capillary electrophoresis 
(Fogarty and Van Orden 2009), DNA-protein binding 
studies (LeCaptain and Van Orden 2002), and single 
cell analysis (Schiro et al. 2007). Recently, it has been 
demonstrated that dual-beam FCCS, in combination 
with other types of FCS, is a useful technique for inves¬ 
tigating the kinetics of biomolecule folding reactions 
(Van Orden and Jung 2008), whereby changes in the 
cross-correlation function due to folding and unfolding 
of dye-quencher labeled DNA or RNA hairpins can he 
analyzed to determine the folding and unfolding reac¬ 
tion rates and mechanisms. 

Figure la displays a dual-beam FCCS apparatus 
that was used to measure the flow rate and direction 
of positive and negative ions flowing under the influ¬ 
ence of an applied electric field in a capillary electro¬ 
phoresis system (Fogarty and Van Orden 2003). 
Figure lb shows a close-up of the two focused laser 
beams from which fluorescence was detected as the 
molecules flowed sequentially from one beam to the 
other. The key elements of the optical setup are an 
excitation laser split into two spatially offset beams 
by means of 50/50 beamsplitters, a dichroic mirror, 
which reflects the laser beams and transmits the fluo¬ 
rescence, a microscope objective, which focuses the 
laser beams into the capillary to form the two spatially 
offset probe regions and also collects the fluorescence, 
two confocal pinholes that selectively transmit the 
fluorescence from each focal region, and two single 
photon counting detectors equipped with appropriate 
optical filters. A digital correlator is used to carry out 
the auto- and cross-correlation analyses of the fluores¬ 
cence fluctuations detected from each prohe region. 

Dual-beam FCCS measures two cross-correlation 
functions simultaneously by cross-correlating the fluo¬ 
rescence detected from beam 2 relative to beam 1 
(Fig. lb), or by cross-correlating the fluorescence 
detected from beam 1 relative to beam 2. These are 
referred to as the forward, Gpii), and reverse, Gr(x), 
cross-correlation functions, respectively. 


Gf(t) 


{Fi{t)F2{t+T)) 

F1F2 


GR{r) 


(F2(0Fi(f+T)} 

F 1 F 2 


Here, t is the lag time, and F„{t) refers to the 
fluorescence intensity detected by each detector at 
time t. Using an optical setup such as the one shown 


in Fig. la, the cross-correlation functions for samples 
containing a single component can he modeled using 
the equation; 


ml{l + T/zd) 
( 2 ) 

where N is the average number of molecules occupying 
the probe volumes, is the average diffusion time, kq 
is the ratio coo/zq (Fig. lb), and the remaining parame¬ 
ters are defined in Fig. lb. Notice that the cross¬ 
correlation function is roughly approximated as 
a Gaussian-shaped peak centered near the lag time 
Zp « R/Vjt, and a width characteristic of the diffusion 
of the molecules in the absence of flow. The parameter 
Zp is the average transit time of molecules between the 
two focal volumes. Thus, dual-beam FCS enables the 
simultaneous measurement of both the flow and diffu¬ 
sion properties of the molecules independently, 
a feature not available in single-beam FCS. Equation 2 
also illustrates the additional advantage that molecules 
flowing in opposite directions can be distinguished by 
the occurrence of the cross-correlation peak in either 
forward or reverse cross-correlation function. 

Figure 2 illustrates the ability of dual-beam FCCS 
to resolve multiple components of a solution 
containing analyte molecules flowing at different 
rates and different directions. This figure shows the 
simultaneously recorded forward (black) and reverse 
(red) cross-correlation functions observed for 
nanomolar solutions containing pure tetramethylr- 
hodamine (TAMRA, Fig. 2b), pure TAMRA labeled 
poly thymine DNA, pure rhodamine 6G (R6G), and 
a mixture of all three components. The data were 
obtained from solutions flowing through an electro¬ 
phoresis capillary under the influence of an applied 
electric field. Notice the negative ions (TAMRA and 
polythymine) were observed in the forward cross¬ 
correlation channel, and positive ion (R6G) was 
observed in the reverse correlation channel due to the 
orientation of the electric field relative to the flow axis 
(Fig. lb). Figure 2e shows that each component of the 
mixture could be resolved based on differences in flow 
speed and direction. 

Other recent applications of dual-beam FCCS illus¬ 
trate the advantages of this technique in resolving the 
flow and diffusion properties of single molecules. 


GF.«(r)=- 


1 


1 
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Spectroscopy, 

Fig. 2 Forward (black) and 
reverse (red) dual-beam 
fluorescence cross-correlation 
functions for a blank buffer 
solution (a), and nanomolar 
solutions of (b) tetramethyl 
rhodamine (TAMRA), (c) 
TAMRA labeled polythymine 
DNA, (d) rhodamine 6G 
(R6G), and (e) a mixture of all 
three species (Copyright: 
American Chemical Society. 
Used with permission) 




T (ms) 


In one example, dye-quencher labeled DNA hairpins 
were studied to determine their folding and unfolding 
kinetics (Jung and Van Orden 2006). DNA hairpin 
folding reactions can take place on a timescale of tens 
to hundreds of microseconds, which is similar to the 
molecular transit time in single-beam FCS. For this 
reason, single-beam FCS has difficulty resolving the 
reaction and diffusion times. Dual-beam FCS, used in 
combination with single-beam FCS, provides an inde¬ 
pendent measure of the diffusion properties, due to the 


dependence of the cross-correlation width on diffusion. 
Simultaneous single-beam FCS can then be used to 
measure the reaction times. These studies enabled 
a more careful analysis of the hairpin folding kinetics 
and revealed an unexpected three-state mechanism for 
the reaction. Further developments have demonstrated 
the ability of dual-beam FCCS to extend the time range 
for DNA and RNA hairpin folding kinetics into the 
milliseconds range, compared to the much shorter 
time range of single-beam FCS (Jung et al. 2008). 
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Another example demonstrated the ability of dual¬ 
beam Fees to measure the effective charge of DNA 
molecules interacting with various counterions in con¬ 
tinuous flow capillary electrophoresis (Fogarty et al. 
2009). The effective charge, of a molecule is 
related to the electrophoretic mobility, and diffu¬ 
sion constant, D, according to the equation: 


Hence, both the electrophoretic mobility and diffu¬ 
sion constant must be known to accurately measure the 
effective charge. Dual-beam FCCS was used to accu¬ 
rately measure these parameters simultaneously based 
on its ability to resolve the flow and diffusion proper¬ 
ties of the molecules. Thus, the interactions of DNA 
with various counterions can be studied. 

In summary, dual-beam FCCS serves an important 
role within the family of FCS-related techniques due to 
its ability to analyze molecular processes taking place 
in liquid solutions undergoing unidirectional flow. 
Such techniques will continue to play a role in charac¬ 
terizing such biophysical processes as DNA, RNA, and 
protein binding, biomolecule folding, and the 
electrostatic processes of biomolecules and their 
associated ions. 

Cross-References 

► Fluorescence Correlation Spectroscopy 

► Fluorescence Cross-Correlation Spectroscopy 

► Fluorescence Fluctuation Spectroscopy 

► Microfluidics for Single Molecule Detection 

► Single Fluorophore Blinking 

► Single Fluorophores Photobleaching 
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Synonyms 

Molecular motors; Optical trap; Optical tweezers; 
Single-molecule methods 

Definition 

An optical trap is a device that uses a focused laser 
beam to create a potential well, in which a small object 
(100 nm to a few micrometers) whose refractive index 
is greater than that of the surrounding fluid can be 
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trapped. The object experiences a restoring force that is 
directed toward the center of the potential well, which 
is directly proportional to the distance of the object 
from the center of the potential well. An optical trap 
can be used to apply piconewton forces to biological 
marcomolecules such as proteins and DNA. A dual¬ 
beam optical tweezer consists of two focused laser 
beams that are used to trap two different objects and 
move them independently in solution. A common bio¬ 
logical application of the dual-beam optical tweezer is 
the measurement of the biophysical properties of 
molecular motors. 

Basic Concept of an Optical Trap 

Light has momentum and a force is generated when the 
momentum is changed. When light passes through an 
interface between two different media, reflection and 
refraction occur at the interface. Consider the example 
of a laser beam that is focused at the center of a 
spherical object that has a higher refractive index (n) 
than that of the surrounding aqueous medium. Reflec¬ 
tion of light at the surface of the object generates 
a force (radiation force) on the object, in the direction 
of light propagation, with a magnitude proportional to 
the beam power. Refraction of light, on the other hand, 
produces a force (gradient force) that attracts the object 
to the focal point of the laser beam, with a magnitude 
proportional to the gradient of beam power. The object 
is “trapped” at a point where the vector sum of the 
gradient and radiation forces is zero (Ashkin 1992). 
This point is referred to as the center of the trap and is 
located near the focal point of a high numerical aper¬ 
ture (NA) microscope objective lens (NA > 1.2, and 
most often 1.4). 

For biological applications, where the optical trap is 
used to apply a force on protein/DNA molecules, the 
trapped object is usually a polystyrene (n = 1.57) 
sphere ranging in size from several hundred nanome¬ 
ters to a few micrometers. The force experienced by 
the trapped object increases linearly with distance 
from the trap center. The strength of the trap is 
expressed in terms of the “trap stiffness” and varies 
for most applications from 0.005 to 1 pN/nm. The force 
experienced by the trapped object is therefore the 
product of the trap stiffness and distance from 
the trap center. A more detailed theoretical description 
of how the laser traps such particles can be found in 


Svoboda and Block (Svoboda and Block 1994). 
The specific layout of the optical trap elements is 
dependent on the biological question. In this entry, 
the details of a dual-beam optical trap are discussed, 
based on the three-bead assay to study myosin 
molecular motors and originally developed by Finer 
et al. (1994); for advancements see Sung et al. (2010). 

Myosin Function Studied Using a Dual-Beam 
Optical Trap 

The broad goal of this instrument is to investigate the 
coupling between protein structure and function. In 
this entry, the setup of a dual-beam optical tweezer 
as it relates to investigating the mechanical stroke and 
the chemical ATPase cycle of the myosin motor is 
discussed. A detailed description of the setup, align¬ 
ment, and calibration of different optical components 
can be found in Sung et al. (2010). With minor modi¬ 
fications, the dual-beam optical trap setup can be 
applied to a variety of biological applications. The 
dual-beam optical trap has been used to study the 
workings of myosin at the single molecule level, with 
high spatial (^nm) and temporal (^ms) resolution 
(Finer et al. 1994). Figure 1 shows the geometric 
configuration of different elements to investigate the 
interaction between a single actin filament and a single 
myosin molecule. A single biotinylated actin filament 
is attached at each end to a 1 pm polystyrene bead, 
through a neutravidin-biotin linkage. The two polysty¬ 
rene beads are held in two independent optical traps. 
The optical traps are positioned such that the actin 
filament is stretched taut. Myosin molecules are 
attached to 1.5-pm platform beads using either an 
antibody linkage or nonspecific adsorption. The plat¬ 
form bead is moved using the microscope stage such 
that the actin filament is positioned close to the myosin 
molecule. Myosin binding to actin results in a stroke of 
its lever arm. Since the myosin is anchored to the 
surface, the myosin stroke displaces the actin filament 
along its length. This in turn causes a displacement of 
the optically trapped beads at the two ends of the 
actin filament. Bead displacement is monitored by 
a quadrant photo diode or a position-sensitive detector 
and is a measure of the stroke size of the myosin motor. 
Following each stroke, the myosin exchanges ADP for 
ATP and detaches from the actin filament. For 
processive myosins, each of the two heads alternately 
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Dual-Beam Optical Tweezers, Fig. 1 Configuration of acto- 
myosin in a dual-beam optical trap. An actin filament is attached 
at each end to a 1-pm polystyrene bead. The polystyrene beads 
are held in two separate optical traps. The actin filament is 
stretched taut and positioned close to a platform bead (1.5 pm). 


A myosin molecule, attached to the platform bead through an 
antibody linkage, interacts with the actin filament. A myosin 
stroke results in the displacement of the actin filament along its 
length. Position-sensitive detectors are calibrated to read out 
the displacement of the trapped beads 


binds and releases from the actin filament, resulting in 
a processive movement of the actin filament relative to 
the myosin, which translates into a stepwise movement 
of the optically trapped heads. 

Layout of a Dual-Beam Optical Trap 

A dual-beam optical trap can be divided into five mod¬ 
ules: (a) Trapping laser with beam steering; 
(b) microscope with objective and condenser; (c) detec¬ 
tion laser with bead position detector; (d) bright held 
illumination and detection; and (e) fluorescence illumi¬ 
nation and detection (Fig. 2). A 1,064-nm laser beam 
with high power (100 mW-5 W) is used for optical 
trapping. The laser beam Is guided by a set of mirrors 
(M) through an isolator (ISO), followed by telescopic 
lenses (L) that expand the beam. The position of the 
optical trap in the sample plane can be varied by 
a combination of piezo mirrors (PM) and acousto¬ 
optic deflectors (AOD). PMs provide a large working 
distance in the sample plane (>10 pm) with lower 
response frequency (^ 100 Hz). AODs provide a smaller 
working distance (^ 1 pm) with a higher response fre¬ 
quency (> 10 kHz). A polarizing beam splitter (PBS) is 
used to split the trapping beam into two components 
(“p” and “s”). The position of the two beams in the 
sample plane is controlled separately by two different 
PMs. Another PBS is used to combine the two trapping 
beams to generate a dual-beam optical trap. An 850-nm 


laser is used for detection of the optically trapped beads. 
Similar to the trapping laser, the detection laser is 
steered using PMs and is split and recombined using 
PBSs. The trapping and detection lasers are steered into 
the back focal plane of the objective (OBJ). Each of the 
two trapping beams is focused by the OBJ to form 
independent optical traps in the sample plane. An inter¬ 
ference pattern is formed in the back focal plane of the 
condenser (CON) as a result of the interaction of the 
detection laser with the trapped object. The shape of the 
interference pattern changes in response to a change in 
the position of the trapped object relative to the trap 
center (Fig. 3). A position-sensitive detector (PSD) 
responds to this change in interference pattern and its 
output can be calibrated to directly read out the move¬ 
ment of the trapped object relative to the trap center. 
Brightfield illumination (740-nm LED) and detection 
(CCD camera) are used to visualize the sample plane 
while steering either the coverslip or trapping laser to 
facilitate trapping of objects in solution. Eor a dual¬ 
beam optical trap, a separate object is trapped in 
each of the two trapping beams. A series of dichroic 
mirrors (DM) is used to sort the different light paths 
to their respective detectors. Fluorescent macromole¬ 
cules including DNA or proteins can be visualized using 
a separate fluorescence illumination and detection path. 
Figure 2 shows the layout of a 532-nm laser that is used 
for epifluorescent illumination of the sample plane. 
Using appropriate DMs, the corresponding emission is 
captured by a CCD camera. 
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Layout of a dual-beam optical trap 


(Not to scale) 



L - lens 


p, s -polarization 


Gray background: side view 


Dual-Beam Optical Tweezers, Fig. 2 Layout of components 
in a dual-beam optical trap. The optical trap can be partitioned 
into five modules as follows: (a) Trapping optics - This includes 
the trapping laser, isolator (ISO) and steering components 
(piezo-mirrors (PM) and acousto-optic deflector (AOD)). The 
two beams in the dual-beam trap are derived from a single laser 
beam by using polarizing beam splitters (PBS), (b) Microscope - 
The microscope objective (OB.f) focuses the laser beam to form 
an optical trap in the sample plane. The condenser (CON) is used 
for detection of the trapped beads and also brightfield 


illumination of the sample plane, (c) Detection optics - This 
includes the detection laser and piezo mirrors (PM) for steering 
the detection beams. Polarizing beam splitters (PBS) are used to 
derive two separate beams for detecting the two optical traps, 
(d) Brightfield optics - This includes the brightfield illumination 
through the condenser and the CCD camera to display the 
sample plane, (e) Fluorescence optics - This includes a laser of 
appropriate wavelength for epifluorescent illumination of the 
sample plane and a CCD camera to capture the emission of 
fluorophores in the sample plane 


Insights into Macromolecular Function Using 
a Dual-Beam Optical Trap 

The dual-beam optical trap configuration has been 
used to dissect the workings of the molecular motor 
myosin. Finer et al. (1994) first developed the dual¬ 
beam optical trap and used it to measure the displace¬ 
ment of the actin filament relative to the tail of myosin, 


also known as the stroke size, during a single ATPase 
cycle of this molecular motor. For processive myosins, 
including myosin V and myosin VI, several research 
groups have measured myosin step sizes using a 
dual-beam optical trap (Trybus 2008; Spudich and 
Sivaramakrishnan 2010). The optical trap has been 
used to apply piconewton forces on myosin heads 
following a mechanical stroke or a processive step. 
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Dual-Color Fluorescence Correlation Spectroscopy 



Dual-Beam Optical Tweezers, Fig. 3 Images of the sample 
plane and back focal plane, (a) Snapshot of the sample plane 
captured on a CCD camera. Arrow points to a trapped bead. 
Platform beads appear as unfocused dots with dark centers, 
(b) Snapshot of the interference pattern in the back focal plane 
of the condenser acquired using a CCD camera. Center - bead 
aligned with the center of the trap, left — bead displaced to the left 
of the trap center, right — bead displaced to the right of the trap 
center 

The force can either be in the direction of myosin 
movement (forward) or opposed to myosin movement 
(backward). Changing the power of the trapping beam 
in the sample plane alters the magnitude of the trapping 
force. The force applied can be increased until the 
myosin can no longer stroke or step and is a direct 
measure of the maximum force that can be generated 
by the myosin (stall force). In addition to the size of the 
stroke or step, the optical trap also measures the length 
of time that the myosin remains strongly bound to the 
actin filament. The mean value of the strongly bound 
time is a measure of a rate-limiting step in the ATPase 
cycle of the myosin. The dual-beam optical trap can 
therefore directly measure the rates of events in the 
ATPase cycle, including the ADP release rate and the 
ATP re-binding rate. The dual-beam optical trap is the 
only known method to measure these rate constants in 
the presence of either a forward or backward load on 
the myosin, which in turn provides insight into the 
physiological function of myosin. For instance, in 
myosin VI a backward load on the processive motor 
results in a significant increase in the ADP release rate 
and transforms the motor from a transporter to an 
anchor (Chuan et al. 2011). In the case of myosin V, 
forward and backward loads on a myosin head have 
different effects on the kinetic cycles and induce 
asymmetry in a myosin V dimer, which facilitates 
long-range processive movement (Trybus 2008). Last 
but not least, the dual-beam optical trap configuration 
with acto-myosin allows application of piconewton 
forces on protein elements that are fused to the myosin 
tail. For example, the mechanical properties of 
a single, unusual, ER/K a-helix can be directly 
measured by fusing different lengths of this protein 
structural element to the tail of myosin VI and 


subjecting it to varying bending forces in a dual-beam 
optical trap (Sivaramakrishnan et al. 2009). Beyond 
acto-myosins, the dual-beam optical tweezer configu¬ 
ration has been used to directly detect base-pair 
stepping by RNA polymerase (Abbondanzieri et al. 
2005) and nonequilibrium thermodynamics of the 
unfolding of RNA hairpins (Liphardt et al. 2002). 


Cross-References 

► ATPase: Overview 

► Molecular Motors 

► Myosin: Fundamental Properties and Structure 

► Optical Tweezers 
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Synonyms 

Conductance injection; Hybrid circuits; Membrane 
potential; Real-time computational modeling 

Definition 

Dynamic clamp refers to the use of a computer to 
stimulate neurons or other excitable cells with 
a signal (usually an electrical current) which is calcu¬ 
lated in real-time using feedback of the membrane 
potential. Most often, this is done to inject conductance 
which mimics the electrical behavior of membrane ion 
channels, for example, synaptic or voltage-gated 
channels. 

Introduction 

The earliest use of an artificial membrane conductance 
as an experimental tool for studying excitable cells is 
probably the simulation of a gap-junctional connection 
between cardiac muscle cells by an analog resistor, 
described by Tan & Joyner in 1990, an idea which 
also appears in the Ph.D. thesis by Scott in 1979 (see 
Goaillard and Marder 2006). Dynamic clamp, in its 
usual sense of using a freely programmable digital 
computer, in a feedback loop comprising analog- 
to-digital conversion, a real-time computation, 
followed by digital-to-analog conversion, to mimic 
a membrane-potential sensitive ionic channel current, 
was developed independently by (Robinson 1991; 
Robinson and Kawai 1993) who used the term “con¬ 
ductance injection,” and (Sharp et al. 1993), who intro¬ 
duced the term “dynamic-clamp.” 

The basic idea of dynamic-clamp or conductance 
injection is shown in Fig. 1 . It uses a standard current- 



Dynamic Clamp: Synthetic Conductances and Their Influ¬ 
ence on Membrane Potential, Fig. 1 A typical dynamic- 
clamp setup. While recording from a cell with a current-clamp 
amplifier, the current command signal (Icomm) is constantly 
updated by a real-time computation based on the membrane 
potential signal (Vn,), effectively creating a current source (linj) 
which is “instantaneously” sensitive to the membrane 
potential, and which follows any computational model of the 
experimenter’s choosing. The key point is the feedback - 
computation - stimulus cycle, and that this should be sufficiently 
fast as to be effectively instantaneous - i.e., such that any further 
speedup produces no change in the outcome. Abbreviations: 
10 input-output, OS operating system 

clamp amplifier, which records the membrane poten¬ 
tial of a cell, while simultaneously allowing injection 
of current according to a command signal. The mem¬ 
brane potential signal is sampled by a computer, and 
used as input to a real-time computation of a command 
signal determining the injected current which is fed 
back into the cell. Provided this is done sufficiently 
quickly (typically within 20-50 ps), it can very accu¬ 
rately reproduce the electrical effect of an ionic 
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conductance, whose current may be voltage-, time-, 
and model-dependent. 

Injection of this kind of synthetic conductance is 
a valuable tool in many applications, including electri¬ 
cally imitating synaptic input to a neuron to enable 
direct measurement of the synaptic integration 
or transfer function of neurons; “knocking-in” or 
“knocking-out” voltage-activated conductances to 
investigate their function in excitability; creation of 
virtual synaptic connections among neurons; and 
even “hybrid networks” linking simulated and biolog¬ 
ical neurons, to test hypotheses about neural circuit 
dynamics. These kinds of applications are explained, 
and some future directions of the technique are 
discussed here. Dynamic clamp is the subject of a 
number of more detailed reviews, including (Prinz 
et al. 2004; Goaillard and Marder 2006) as well as 
a recent book (Destexhe and Bal 2009). 

The Nature of a Conductance Input 

A simple example demonstrates the fundamental dif¬ 
ference between injecting a predetermined pattern of 
current on the one hand, and injecting a dynamic or 
reactive current source - a synthetic conductance - on 
the other. In a simple passive cell membrane, the 
response - the membrane voltage - scales up linearly 
with the amplitude of an applied current step, as seen in 
Fig. 2a (left), and rising and decaying transients have 
the same exponential time constant of relaxation. If 
instead, as in Fig. 2a (right), steps of conductance are 
injected, the responses do not grow in proportion to the 
conductance, but instead saturate as the voltage 
approaches the equilibrium potential of the conduc¬ 
tance - in other words, the input-output relationship 
from conductance to voltage change is actually 
nonlinear, even in this passive membrane. The reason 
for this is seen in the current traces - because the 
driving force (V — E,-gy) is reduced as V approaches 
Erev, the injected current / = g(V — Erev) is progres¬ 
sively choked off. Notice the asymmetric time course 
of these responses: the rising transient has a faster time 
constant than the decaying transient, because during 
the application of the conductance step, the membrane 
has an increased total conductance and hence a faster 
time constant. Starting from different resting potentials 
(Fig. 2b), responses to the predetermined current step 
remain constant (left), while the size and even the sign 


of the responses to the conductance step changes as the 
membrane potential varies around the reversal poten¬ 
tial of the conductance. This behavior captures the 
essential shunting nature of conductances due to ion 
channels in cell membranes, a phenomenon which is 
particularly important for understanding synaptic inhi¬ 
bition. An ionic channel conductance should be 
thought of as an attractive force on the membrane 
potential toward its characteristic reversal potential, 
rather than a fixed amount of current. 

This experimental paradigm - stimulating a cell 
membrane with a conductance or dynamically reacting 
current - is quite different from that of voltage-clamp, 
in which the membrane potential is controlled while 
the current needed to do so, which is equal and oppo¬ 
site to the membrane current, is measured. It uses 
information about the dynamics of ion channel con¬ 
ductances, obtained from voltage-clamp studies, to 
stimulate a cell membrane in a physiological, but con¬ 
trolled and precisely specified way - as if by ion 
channel opening. Hence, it is a much more realistic 
way to study the physiological responses of excitable 
cells than is traditional current injection. 

Implementation and Experimental Issues 

A large number of custom, open-source, and commer¬ 
cial implementations of dynamic clamp are in use, in 
many laboratories, and based on various operating 
systems and hardware platforms, for example (Dorval 
et al. 2001), (Raikov et al. 2004) (Nowotny et al. 2006), 
(Robinson 2008), to name just a few. Any implemen¬ 
tation faces quite demanding requirements: 

• Low and reliable latency of both analog-to-digital 
and digital-to-analog conversion. 

• High computational speed in order to be able to 
handle updating a potentially complex dynamical 
model within the remaining period of each feedback 
cycle. 

A deterministic and low latency of response, i.e., 
little or no jitter in timing of the control loop, can be 
difficult or impossible to achieve in many sophisti¬ 
cated, multitasking operating systems such as Win¬ 
dows, and really requires a real-time operating 
system. For injecting simpler conductances, such as 
synaptic conductances, analog computation including 
operations of multiplication, exponentiation, and divi¬ 
sion, has also been used very successfully - the settling 
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Current injection 



Dynamic Clamp: Synthetic Conductances and Their Influ¬ 
ence on Membrane Potential, Fig. 2 Comparison of current 
injection and conductance injection in a passive membrane, 
(a) Applying a sequence of step current injections {left panels) 
to a passive membrane, gives a set of linearly increasing voltage 
responses which rise and decay with the same time constant. 
Injecting an evenly spaced sequence of conductances (right 
panels) on the other hand, gives a sublinear set of responses, in 


Conductance injection 




G 



which the level of injected cuiTent is choked off as the potential 
approaches the reversal potential. The time constant during the 
rising phase, during which the conductance is turned on, is reduced 
in a reciprocal fashion as the amplitude of conductance increases, 
(b) Injecting a repeated step current from different starting mem¬ 
brane potentials always gives the same shape of response (left), 
while the magnitude and sign of the response to a step conductance 
(right) is highly dependent on the membrane potential 


times of such circuits are measured in nanoseconds 
rather than microseconds (Robinson 1998). When 
injecting highly voltage-dependent and fast-gating 
conductances such as sodium conductances, stability 
and error of the method of integration (usually Euler or 
exponential Euler) of the underlying differential equa¬ 
tions are important issues (Butera and McCarthy 2004; 
Bettencourt et al. 2008). As for any current-injection 
experiment, it is important to either monitor con¬ 
tinuously and compensate for the series resistance, 
if injecting current through the same electrode as used 
for voltage measurement, or preferably use separate 


electrodes for current injection and voltage measure¬ 
ment. This issue is particularly important for dendritic 
whole-cell patch recording and sharp intracellular elec¬ 
trodes, where series resistance is high and changeable. 

Driving Neurons with Synaptic-Like 
Conductance 

A major area of application of dynamic clamp conduc¬ 
tance injection has been to mimic the action of 
a synaptic input to a neuron, in order to study the 
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Dynamic Clamp: Synthetic 
Conductances and Their 
Influence on 
Membrane Potential, 

Fig. 3 Cortical pyramidal cell 
firing driven by complex 
synaptic-like conductance. 
Membrane potential (top) 
response to three different 
fractions of conductance 
input, simulating synaptic 
AMPA, NMD A, and GABAa 
receptor activation {red, 
green, and blue waveforms). 
Each conductance signal is 
formed by summing unitary 
transients with the amplitude 
and time course of elementary 
synaptic events, whose timing 
is determined by stochastic 
processes. AMPA/NMDA 
events and GABA events are 
each generated by a Poisson 
process whose rate varies 
periodically at a gamma 
frequency (40 Hz), but with 
different added random 
delays. The NMDA 
conductance is further blocked 
in a voltage-dependent way, as 
indicated at the top right. For 
further details, see Morita 
et al. (2008) 


NMDA unblock 





NMDA 


GABA 





1+1 
0 nA 


100 ms 


process of synaptic integration in a controlled fashion. 
See (Destexhe and Bal 2009) for a review of such 
studies. A typical example of this is shown in Fig. 3, 
taken from a study on the network mechanisms of 
gamma oscillations during UP states in the neocortex 
(Morita et al. 2008). Three channels of synaptic con¬ 
ductance are injected into a pyramidal neuron simul¬ 
taneously, representing the different major 
components of input that they receive, due to AMPA 
receptor and NMDA receptor activation at excitatory 
synapses and GABAa receptor activation at inhibitory 
synapses. The conductance signals are assembled by 
summing unitary synaptic conductance transients pro¬ 
duced by stochastic point processes representing the 
spiking of excitatory and inhibitory cell classes in the 
network, including the population of pyramidal cells 
themselves. Each conductance effectively attracts the 
membrane potential toward its reversal potential, as 
indicated, with a strength which depends on its relative 
amplitude, and in the case of NMDA receptor input, on 


the membrane potential itself, in a nonlinear fashion 
(inset at right). The injected current shown at the 
bottom is then a complex function of all these conduc¬ 
tances and of the response of the cell to them. 

Compound AMPA/NMDA events and GABA 
events are each generated by a Poisson process whose 
rate varies periodically at a gamma frequency (40 Hz), 
in a way derived from measurements of spiking of 
pyramidal and fast-spiking inhibitory cells during 
gamma oscillations (Hasenstaub et al. 2005). Using 
this approach, the authors showed that the distribution 
of the neuron’s phase of firing, during the gamma 
cycle, obtained from the measured pyramidal cell 
spike output could only be self-consistent - i.e., 
matching the timing assumed to generate the recurrent 
excitatory synaptic input - if long randomized delays 
were introduced into the recurrent feedback but not 
into the inhibitory input. Thus, using knowledge of 
the surrounding network firing pattern to measure the 
integrative function of individual neurons under 
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Dynamic Clamp: Synthetic 
Conductances and Their 
Influence on 
Membrane Potential, 

Fig. 4 (a) An example of 
modifying the active behavior 
of a neuron membrane - 
injection of Kv3.2 type 
conductance in a cortical 
intemeuron. At left, a step 
current of 110 pA is applied to 
elicit spiking (Control). At 
right, both a step current of 
150 pA and a synthetic Kv3 
type potassium current 
Ik = gKin^iEK - V), where 
dm jdt = (m^ — m)jx and 
m^a and t are voltage- 
dependent as shown in (b), and 
gK = 60 nS, and 
Ek = — 90 mV, are applied 


a Control +Kv3 




different conductance input scenarios allows consider¬ 
able insight into how the network oscillation works 
dynamically. Such an approach has been used to 
study integration of stochastic background firing in 
the brain (Destexhe et al. 2001), dendritic transfer 
functions (Williams 2004), phase-resetting behavior 
of neurons and synchronization (Gouwens et al. 
2010), and many other aspects of synaptic integration. 

Active Conductances and Spike Generation 

Another important application of dynamic clamp 
conductance injection is to investigate the role of indi¬ 
vidual voltage-dependent ionic currents in action 
potential (AP) generation. Figure 4a shows the injec¬ 
tion of a dynamic conductance modeled on that of 
Kv3.1/Kv3.2 voltage-dependent potassium channels, 
in a mouse cortical inhibitory interneuron. The differ¬ 
ential equation for the activation variable m is integrated 
in real time, according to the voltage-dependence of the 


activation and time constant (Fig. 4b), which are mea¬ 
sured from voltage-clamp data. During the injection of 
this conductance, the voltage threshold of the AP is 
raised, the peak of the AP is reduced, and its afterhyper¬ 
polarization is deeper and faster. Such controlled, pre¬ 
cisely specified changes in the voltage-dependent 
dynamics of the membrane can cause bifurcations in 
the firing pattern, threshold behavior or phase-response 
behavior of neurons. This has become a useful tool for 
establishing the biophysical function of voltage- 
dependent channels in many cell types. For some exam¬ 
ples of this approach, see (Lien and Jonas 2003), 
(Berecki et al. 2005), (Tateno and Robinson 2011). 
Both positive (“knock-in”) and negative (“knock-out”) 
conductances can be injected. 

Hybrid Circuits 

Real-time modeling of individual voltage-dependent 
conductances leads naturally to the idea of simulating 
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Dynamic Clamp: Synthetic 
Conductances and Their 
Influence on 
Membrane Potential, 

Fig. 5 Adapted from (Sohal 
et al. 2009), showing an 
example of dynamic network 
feedback, (a) A pyramidal cell 
in the cortex is stimulated with 
a stochastically fluctuating 
conductance signal through an 
electrode. Each time it spikes, 
a blue flash of light is triggered 
which stimulates spikes in the 
surrounding network of fast- 
spiking inhibitory cells, which 
in turn feedback inhibition to 
the pyramidal cell, (b) Turning 
on this optical feedback 
inhibition shifts the power of 
spiking in the cell from low 
(beta) frequencies to high 
(gamma) frequencies 



b 



whole neurons, and allowing the virtual neuron to 
interact in real time by injecting synaptic-like conduc¬ 
tance into a biological neuron or neurons, while receiv¬ 
ing input from them. The aim of such “hybrid circuits” 
is to quantify the function of particular kinds of neu¬ 
rons within a neural circuit. For example (Le Masson 
et al. 2002), used this approach to investigate 
thalamocortical signaling in the visual system. 
A hybrid-circuit approach is also useful for studying 
synchronization. In one such study (Netoff et al. 2005) 
used a simple virtual neuron consisting of a periodic 
spike generator whose phase is adjusted by the firing of 
its coupled biological partner which it in turn stimu¬ 
lates with synaptic-like conductance, to elucidate the 
way in which phase-resetting behavior determines syn¬ 
chronization. Sophisticated hybrid-circuit approaches 
have also been used in the cardiac muscle field, notably 
by Wilders and collaborators. 

Dynamic Network Feedback 


recently been applied together with conventional con¬ 
ductance injection in a study by Sohal et al., to produce 
a kind of dynamic network feedback, with conduc¬ 
tance-driven spikes in a pyramidal neuron triggering 
flashes of blue light which excite the local population 
of fast-spiking inhibitory interneurons (Fig. 5a). 
Switching on the optical feedback produces gamma- 
frequency firing in the pyramidal cell (Fig. 5b), 
strongly supporting the idea that the fast-spiking cell 
network is directly involved in the generation of 
gamma oscillations. Here, a dynamically controlled 
conductance is injected by optical activation of chan¬ 
nel rhodopsin, rather than through an electrode. In the 
future, combining this with multisite detection of 
activity, either optically or through electrode arrays, 
will allow “network dynamic clamping,” i.e., 
a generalized controlled feedback between the activity 
of the network and its stimulation. This is likely to be 
a very powerful way of investigating network circuit 
dynamics, and is achievable with current technology. 


The advent of optogenetic techniques allows stimula¬ 
tion of electrical activity in a specific genetically 
defined type of neuron, or in a population of neurons 
connected to a specific area of the brain. This has 


Summary 

Dynamic clamping allows the injection of synthetic, 
controlled conductances into cells, whose dynamics 
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are determined programmatically by the experimenter. 
It is essentially a fusion of real-time computational 
modeling and experimental electrophysiological mea¬ 
surement and stimulation. It allows biophysically real¬ 
istic imitation of synaptic inputs to neurons, insertion 
or deletion of ion channel populations with specific 
voltage-dependent gating kinetics, and simulation 
of the firing of particular cell types in a neural circuit. 
It has been applied widely, in neuroscience, in 
cardiac electrophysiology, and other fields, as a very 
flexible and insightful way of probing the dynamical 
responses of cells and networks to physiological elec¬ 
trical signals, and for testing the functional conse¬ 
quences of different models of membrane electrical 
mechanisms. 


Cross-References 

► Analysis of Macroscopic Currents 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 

► Computational Modeling of Neuronal Networks 

► HCN Channels: Biophysics and Functional 
Relevance 

► Kinetics: Simulation and Analysis 

► Modeling the Heart 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Voltage-Gated Sodium and Calcium Channels 
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Dynamic Docking, Fig. 1 Energy landscapes for simple 
docking (SD) and DD complexes 


Dynamic docking (DD) denotes a complex formed 
between reactive (electron transfer (ET)-active) pro¬ 
tein partners in which a large ensemble of weakly- 
bound configurations of the complex contribute to 
binding, but only a few are reactive. Typically, the 
ET reaction occurs within the “thermodynamic” or 
“fast exchange” limit, in which case binding is effec¬ 
tively decoupled from reactivity, and it is possible to 
modulate reactivity without changing affinity. 

Basic Characteristics 
The DD Energy Landscape 

Heterogeneity in the structure and dynamics of 
a macromolecule, and likewise of a complex between 
two macromolecules, is commonly visualized with an 
idealized “energy landscape,” an energy surface that 
represents the accessible conhgurations of the system 
on a hierarchy of energy/length scales (Wales 2004; 
Ubbink 2009). It has become increasingly clear that 
electron transfer protein partners commonly form 
complexes with dynamic structures, and are appropri¬ 
ately described by an energy landscape with multiple 
interconverting bound configurations, not by a “sim¬ 
ple-docking” landscape with a single energy mini¬ 
mum, Eig. la. When the interactions between the 
partners are weak and of a transient nature, the 
corresponding landscape has many shallow minima 
that contribute to the net thermodynamic binding, 
Eig. lb. However, the overwhelming preponderance 
of these are non-reactive (NR), and ET is dominated 
by a small subset of reactive (R) configurations for 
which the ET matrix element is large. Studies under 


NIH support of the weakly binding myoglobin/cyto¬ 
chrome hs (Mb/foj) ET complex (Liang et al. 2000) led 
to the identification of the DD protein-protein reaction 
paradigm: a large ensemble of interconverting conhg¬ 
urations contribute to binding, while the reactive con¬ 
hgurations are relatively rare, and not necessarily 
among the most stable. 

Simulation of DD with Molecular Dynamics 

A variety of molecular dynamics approaches and soft¬ 
ware packages can be used to visualize protein-protein 
complex formation on a DD landscape (Tuncbag et al. 
2009). Eor illustrative purposes the Brownian Dynam¬ 
ics (BD) approach is used here (MacroDox program; 
(Northrup et al. 1988)). In this approach, one protein is 
allowed to diffuse {in silico) toward its stationary part¬ 
ner, and a “hit” is recorded when a predetermined 
criterion of contact is achieved during its trajectory. 
Figure 2a displays a computation for the weakly bound 
Mb//75 complex, where the center of mass (COM) of h; 
is plotted as a point on the surface of Mb whenever the 
hs trajectory brings the COM of the two proteins within 
roughly the sum of their radii. Such a criterion gener¬ 
ates a “hit” prohle that reflects the binding interactions 
of the two proteins. This hgure illustrates the DD 
binding pattern: hs binds weakly at numerous locations 
on the Mb surface, as displayed by hits distributed over 
the entire Mb surface. When the binding of the nega¬ 
tively charged hs around the Mb heme edge is encour¬ 
aged through neutralization of the propionates of the 
heme by reconstitution of Mb with the di-methyl ester 
heme, the “hits” begin to localize and coalesce on the 
Mb front face (Fig 2b), even though the complex 
remains in the DD regime. Further enhancement of 
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Dynamic Docking, 

Fig. 2 DD Brownian 
Dynamics “hit” profiles for the 
docking of onto 

Fe^’^Mb variants using the 
COM criteria, (a) Mb(wt), 
with deprotonated heme 
propionates; (b) Mb with the 
heme propionates neutralized. 
For further DD examples, see 
(Wheeler et al. 2007) 



binding through charge-reversal mutations on the Mb 
front face can, however, cause a strong shift toward SD 
(Nocek et al. 2010; Xiong et al. 2010). 

Kinetic Formulation of Binding and ET on a DD 
Landscape 

An intracomplex ET reaction within a weakly-bound 
donor (D)-acceptor (A) pair can be described on 
a macroscopic level as a two-step process with revers¬ 
ible binding (formation constant Ka) preceding the ET 
step (rate constant, ket), which is taken to be irrevers¬ 
ible for the high driving force of photoinitiated flash 
photolysis reactions. 

k. 

D-FA ;-^ DA D+A^ (1) 

This formulation also Is appropriate on a micro¬ 
scopic level for a complex with a single configuration 
(Fig. la). However, for a complex that operates on 
a landscape with multiple contributing configurations 
(i = 1.. .n; Fig. lb), each conformation has its own ET 
rate constant, k'^,, microscopic binding-free energy 
(AGj), and binding constant, Kj = exp[-AGjlkT] 
(which can be expressed in terms of on- and off-rates, 
^ onr ^ ojf^ where, — k Q^^lk 

k‘ 

D-FA ;-> (DA)i ^ (D+A-)i (2) 


Binding on a DD Landscape 

The overall thermodynamic binding constant, can 
be measured in isothermal titration ► calorimetry, 


nuclear magnetic resonance (► NMR), or kinetic titra¬ 
tions. It is the sum of the individual binding constants 
{KJ) for all binding configurations (Eq. 3). 

= ( 3 ) 

/ 

For a complex that operates on a DD landscape, 
only a small subset of the many bound configurations 
are ET-active; in Fig. 2, this would be a fraction of the 
small number of hits near the Mb heme edge. As 
a result, reactive configurations do not contribute sig¬ 
nificantly to binding, and the thermodynamic binding 
constant summation in Eq. 3 is dominated by the won- 
reactive (NR) majority conformations (Eq. 4). 

(4) 

NR 

ET on a DD Landscape 

For the fast-exchange reaction-limit (koff >> ket) of 
the macroscopic binding model (Eq. 1) that applies to 
most weakly bound DD-type complexes, ET is second 
order with rate constant k 2 as in Eq. 5. 

h = ket Ka (5) 

The observed rate constant on a DD landscape, ^ 2 , 
takes a form analogous to that of the binding constant 
(Eq. 3) in being the sum of the fast-exchange ET rate 
constants for all individual conformation, each 

weighted by the binding constant for that conformation 
(Eq. 6): 

= (6) 

/ 
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Whereas both reactive (R) and non-reactive (NR) 
configurations contribute comparably to the binding, 
only the R configurations contribute to ET reactivity 
because of the relatively short range of ET compared to 
the overall protein dimensions. Thus, one can limit the 
sum in Eq. 6 to the R configurations as in Eq. 7. 

= ( 7 ) 

R 


Decoupling of Binding and Reactivity on the DD 
Landscape/in a DD Complex 

The consequences of DD for binding and reactivity 
(Eqs 3 and 6) are best understood by considering an 
idealization of the model in which the few members 
(nR) of the small sub-ensemble of reactive conforma¬ 
tions all have the same large average ET rate constant, 
and the same average binding constant kJ^, while 
the large majority of non-reactive sites (Nnr; Ar/Nimr 
< < 1) have zero reactivity, = 0, and an average, 
most likely larger, binding constant, KJ^^. The result is 
a pair of formulas for the binding constant and second- 
order rate constant that transparently show that binding 
and reactivity are decoupled as in Eq. 8. 

Ka « NnrK^'^ k2 « ( 8 ) 

This decoupling is illustrated in Fig. 3, which plots 
both the total affinity, K^, and the reactivity, presented 
as k 2 lkct'^ (Eq. 8) as a function of the affinity of the 
R conformations, KJ^, under the condition that the 
affinity of the NR conformations remains constant. 
Such behavior in fact occurs when heme propionate 
neutralization enhances reactive binding. Fig. 2b 
(Liang et al. 2002; Liang et al. 2004). One sees that 
^2 increases in proportion to KJ^, while changes 
minimally so long as n^Kj^ ^ Consider the 

situation where the minority “reactive” conformations 
of a complex contribute ^1% to the binding {ngKj^/ 
NnrKJ^'^ = 10~^). If modification of the interface 
increases kJ^ to the point that ngKj^ « 
which equalizes the contribution of the R and NR 
sites to Ka, then ^2 would be increased by 100-fold 
relative to the value in the native complex, but Ka 
would increase by a mere two-fold. This clearly illus¬ 
trates how reactivity and binding are “decoupled” in 
DD systems: up to a point, changes in binding affinity 
for the reactive sites, achieved by placing appropriate 



Dynamic Docking, Fig. 3 Plot of the predicted thermody¬ 
namic binding constant (K^) and reactivity (k 2 /ke,’^) versus the 
binding affinity for the reactive conformations (n’^Ka’^), 
according to Eq. 6, under the assumption that the affinity for 
the non-reactive conformations remains constant at, 

(NNRk^NR) ^ J QQQ jyj-l 

charge(s) in the interface near a reactive site can 
strongly enhance reactive binding (Fig. 2B) and 
cause orders-of-magnitude changes in ^ 2 , yet not 
cause significant accompanying changes in the overall 
binding constant (Liang et al. 2002; Liang et al. 2004). 

Examples 

The concept of DD originated to explain the observa¬ 
tion that surface modifications expected to increase 
affinity in the Mb/h 5 complex enhanced the reactivity 
without affecting the apparent affinity (Liang et al. 
2000; Liang et al. 2002; Liang et al. 2004). Such 
“decoupling” of binding and reactivity shows that 
effective bimolecular ET does not require that 
a protein-protein complex is optimized for strong bind¬ 
ing. This may be of physiological relevance for the 
reduction of non-functional met-Mb in muscle cells 
and of met-Hb in a red blood cells, where tight binding 
of cyt hs to the high concentration of ferrous-Mb/Hb 
would prevent cyt by from finding and reducing the 
met-Mb. 

Dynamic docking has proven to be relevant for 
other protein-protein ET complexes as well. Examples 
include ET reactions that occur across the protein- 
protein interfaces of non-covalent complexes of a- 
hemoglobin/cytochrome hs (Wheeler et al. 2007) and 
cytochrome hs, cytochrome c (Ren et al. 2004), and in 
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ET reactions between the tethered domains of sulfite 
oxidase (Johnson-Winters et al. 2010). It likely applies 
to other reactions involving transient complexes whose 
primary function is ET such as the photosynthetic 
reaction center/cytochrome C 2 complex (Abresch 
et al. 2009) and the plastocyanin/cytochrome c com¬ 
plex (Ivkovic-Jensen et al. 1998). 
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Definition 

Dynamics of bone morphogenetic protein (BMP) recep¬ 
tor signaling pertain to the movement and interactions 
of BMP receptors and associated proteins on the mem¬ 
brane surface, particularly control of downstream path¬ 
ways through sequestering, shuffling, recycling, and 
degradation of receptors. 

Basic Characteristics 

BMP Receptors Are Housed in Detergent-Resistant 
Membranes (DRMs) and Clathrin-Coated Pits 
(CCPs) 

BMP receptors are localized on the plasma membrane 
in distinct microdomains. Two such domains are CCPs 
(enriched in AP2, EPS 15, and clathrin) and detergent- 
resistant membranes (DRMs). DRMs are further 
subgrouped into caveolar (composed of caveolae of 
both alpha and beta isoform or beta isoform alone) 
and noncaveolar fractions (Bragdon et al. 2011). 

The role of these domains in BMP signaling is 
extensively studied; however, it is still controversial. 
BMP receptors type 1 and type II clearly localize to 
these regions. Studies in A431 and C2C12 cells 
showed that BMPRla is localized to caveolae and 
shuttling between caveolae composed of the 
caveolin-lp isoform and caveolae containing both 
isoforms enhances smad signaling. Overexpression of 
caveolin-lp in these cells had an inhibitory effect on 
smad signaling, while overexpression of caveolin-1 
alpha had little to no inhibitory effects. Additionally, 
studies on smooth aortic muscle cells show that 
BMPR2 interacts with caveolin-1 and this interaction 
is necessary for smad signaling (Wertz and Bauer 
2008). Internalization and dephosphorylation of 
BMPR2 by Dullard results in degradation of BMPR2 
in caveolae, pointing to the importance of these 
domains in smad signaling (Satow et al. 2006). Another 
domain discovered recently is the NANDOR (No Acti¬ 
vating Non-Downregulating) Box that is important for 
type I receptor downregulation. Situated near the 
C-terminus end of the type I receptor, this domain 
plays an important role in receptor internalization and 
endocytosis. It is a highly conserved sequence among 
all the type 1 receptors of the TGF-p family, but this 
motif does not fall into the most well-categorized 
groups of internalization signals (Chen 2009). 


At the same time, BMP receptors are found to 
localize in CCPs in A431 and C2C12 cells. Disruption 
of the pits enhanced smad signaling in C2C12 cells. 
Supporting these data is that the disruption of CCPs 
decreases the time it takes for Smadl to translocate to 
the nucleus (Bragdon et al. 2011). On the other hand, 
this effect could not be detected in SHED cells and 
disruptions of CCPs with nonspecific agents were not 
able to reproduce this result showing that cells respond 
differently to CCP disruption. Recent data also 
indicate that smad signaling is initiated in CCPs 
themselves in response to BMP2 (Bragdon 2010). 
In addition, ectopic localization of mutant BMP 
receptors in caveolae has been reported in Pulmonary 
Arterial Hypertension. This leads to decreased smad 
phosphorylation and decreased smad signaling 
(Jiang et al. 2011). 

In conclusion these data demonstrate that both 
membrane domains/pits as well as outside regions 
on the plasma membrane are able to signal by smad 
activation. Using Image Cross Correlation Spectros¬ 
copy, it is shown that only the type I receptor without 
BMPRIl is present in the absence of BMP2 on 
the plasma membrane, raising the question how this 
receptor can activate smad signaling (Bragdon 2010). 
Additionally, using FGFM it was shown that BMP 
binds with high affinity to BMP receptors in caveolae 
(Bonor 2012). 

BMPRla Receptor Dynamics in Primary Bone 
Marrow Stromal Cells 

Recently, the importance of BMP receptor dynamics 
for smad signaling and mineralization was studied in 
bone marrow stromal cells (BMSCs) derived from 
three different mice phenotypes. The B6.C3H-6T 
(6T) is a congenic mouse with decreased bone mineral 
density (BMD) and with increased marrow adipocytes 
and decreased osteoprogenitor proliferation, while the 
B6.C3H-1-12 (1-12) congenic is a mouse model with 
increased BMD and osteoblast mineralization. In the 
experiments C57BL/6J (B6) mice are used as controls 
since only a segment of Chromosome 6 or only 4 Mb 
of Chromosome 1 from the C3H/HeJ mouse was 
backcrossed to a C57BL/6J background. B6 mice are 
a mouse strain that already exhibit low bone mineral 
density. 

In BMSCs isolated from the B6 control mouse, 
siRNA against caveolin-1 leads to a significant 
reduction in BMP2-dependent smad signaling and 
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Dynamics of BMP Receptor 
Signaling, Fig. 1 Summary 
ofBMPRIa receptor dynamics 
on the cell suiface ofBMSCs 
isolated from B6, 6T, and 1-12 
mice. Difference between low 
and high peak bone density 
mice is the number of caveolae 
of the beta isoform only and 
the localization of BMPRla in 
these domains prior to BMP2 
signaling. As shown hy FGFM 
BMP2 binds predominantly to 
receptors localized in caveolae 


6T 


B6 


1-12 



i smad signaling 


T smad signaling 


Caveolae enriched in Caveolin-1 alpha and Caveolin-1 beta 
I Caveolae enriched in Caveolin-1 beta BMPRla BMP2 


mineralization. These results point to the importance 
of caveolae in these processes. In the absence of BMP2 
downregulation of caveolin-1 in BMSCs isolated from 
6T mice resulted in activation of the smad signaling 
pathway, while in BMSCs from 1-12 mice, 
downregulation of caveolin-1 leads only to smad sig¬ 
naling in the presence of BMP2. 

Using the Family of Image Correlation Spectros¬ 
copy, BMPRla aggregation on the plasma membrane 
can be correlated to the extent of BMP signaling and 
mineralization. Therefore, aggregation of BMPRla 
could be used as an indicator for osteogenic potential. 
Studies showed that BMPRla is aggregated in BMSCs 
isolated from 1-12 mice, while it failed to aggregate in 
response to BMP2 stimulation in BMSCs isolated from 
6T mice. Aggregation in response to BMP2 is observed 
in BMSCs isolated from B6 mice (Fig. 1). 

Further data obtained indicate that the movement of 
BMP receptors out of caveolae of the P isoform into 
caveolae enriched in the alpha and beta isoforms is 
crucial for enhanced smad signaling and mineraliza¬ 
tion. BMSCs from low peak bone mass mice, B6 and 
6T, fail to express BMPRla in caveolae enriched in the 
beta isoform, while the high peak bone mice 
have BMPRla localized in caveolae composed of this 
isoform. Upon BMP2 stimulation, BMPRla shuttles 


between caveolae of different isoforms for enhanced 
signaling in BMSCs from 1-12 mice. 

Interestingly, the release of BMPRla from caveolae 
of the alpha beta isoform in BMSCs isolated from 
6T mice leads to increased smad signaling and miner¬ 
alization, while in BMSCs from B6 or 1-12 this effect 
is not observed. However, disruption of caveolae 
clearly enhances BMP2 response in BMSCs isolated 
from 1-12. 

Taken together the results obtained in primary cells 
conhrm the conclusions from C2C12 and A431 
cells that: (1) BMPRla shuttling between caveolae is 
important for smad signaling (A431 cells). (2) Disrup¬ 
tion of caveolae can lead to increased smad signaling 
and mineralization (C2C12 cells). (3) Trapping of 
BMP receptors in caveolae and inhibition of shuttling 
leads to decreased smad signaling (A431, C2C12 cells) 
(Fig. 1). It also shows that signaling in C2C12 cells 
may reflect BMSCs isolated from mice with very low 
BMD, while results from A431 cells mimic the 
response of cells with a high peak bone phenotype. 
However, these data seem to indicate that once 
a certain threshold of BMP2 signaling is achieved 
this signal cannot be potentiated, suggesting that 
negative feedback mechanisms activate at this time 
(Bragdon 2012a, b). 
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Dynamics of BMP Receptor Signaling, Fig. 2 Current dogma of BMP signaling pathway highlighting the associated proteins and 
the downstream signaling 


Activation of the BMP Receptor Independent of 
the Ligand 

Although normal initiation of the BMP signaling path¬ 
way occurs only after ligand (BMP) binding of the 
receptors, recent studies have shown evidence that 
there are cases when the receptor activation takes place 
in the absence of the ligand (Fig. 2). Several mecha¬ 
nisms inherent to the BMP signaling pathway exist, that 
prevent this activation leading to ectopic effects. 

A single point mutation in the GS box in AcvRl, 
a BMP type 1 receptor, results in constitutive 
stimulation of the receptor even in the absence of the 
ligand. This leads to disastrous effects such as abnor¬ 
mal osteogenesis, chondrogenesis, and also develop¬ 
ment of fused joints. This particular degenerative 
genetic condition is called FOP or Fihrodysplasia 
Ossificans Progressiva (Shore et al. 2006). Addition¬ 
ally, a nine amino acid sequence in the kinase domain 
of the type 1 receptors called the L45 loop imparts 


further specificity to the signaling pathway. This 
domain in the type 1 receptors can dictate the phos¬ 
phorylation of smads as well as control subsequent 
genetic responses. FKBP12, a negative regulator of 
the BMP signaling pathway, can bind to the BMPRla 
receptor and prevent leaky signaling, thus further 
assuring specificity and tight control (Bragdon et al. 
2011) (Fig. 3). 

Another mechanism of BMPRla receptor activation 
in the absence of the ligand is the release of CK2 from 
the receptor. CK2 binds at three distinct binding sites to 
BMPRla (Fig. 3). Release of CK2 from BMPRla leads 
to activation of the smad signaling pathway. In other 
studies CK2 association to ALKl, a TGF beta type 
I receptor, was shown to be necessary for smad signal¬ 
ing (Lee et al. 2009). Additionally, BMP? signaling is 
enhanced by CK2. These data demonstrate multiple 
facets of CK2 in BMP signaling dependent on the 
receptor and the ligand (Bragdon et al. 2011). 
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Dynamics of BMP Receptor 
Signaling, Fig. 3 A 3D 
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Introduction 

The dynamics of colloids and macromolecules is 
a very broad topic. It deals with time-dependent 
phenomena of all kinds in colloidal suspensions and 
macromolecular solutions. Due to space limitations, 
here, we concentrate on a single aspect: Brownian 
motion in dilute dispersions. 

The colloidal and macromolecular world is domi¬ 
nated by thermal agitation. This leads not only to inter¬ 
nal motion in flexible macromolecules but also to 
rotational and translational diffusion. A study of the 
translational diffusion of colloids by Jean Perrin 
(1923) first convinced the scientific world of the gran¬ 
ularity of matter: The Brownian motion of a colloid 
seen in a microscope is the random motion of the 
molecules in the surrounding liquid made visible. 
Today, probing the Brownian motion of colloids and 
macromolecules forms the foundation of one of the 
most useful techniques for particle sizing: dynamic 
light scattering. The first half of this entry explains this 
technique, which has been a more or less routine tool for 
particle sizing since the 1970s (Berne and Pecora 1976; 
Chu 1991; Brown 1993; Egelhaaf 2006). We will also 
explain a much newer technique, differential dynamic 
microscopy (Cerbino and Trappe 2008). Both methods 
give access to the same “intermediate scattering func¬ 
tion,” which contains information on the dynamics of 
dispersed colloids and macromolecules. Specifically, 
this function characterizes the dynamics of the sponta¬ 
neous density fluctuations in these systems due to the 
motions of the colloids and macromolecules: diffusive 
motions of single particles, collective motions of many 
particles together, and motions internal to the particles. 

We focus on the use of dynamic light scattering 
and differential dynamic microscopy to probe the 
Brownian diffusion of single particles in dilute disper¬ 
sions, which allows us to measure their diffusion coeffi¬ 
cient. After explaining the principles of both methods, we 
give two examples which illustrate how measuring the 
diffusion coefficient via the intermediate scattering func¬ 
tion can provide information on the size and shape of 
colloids and macromolecules. 

Dynamic Light Scattering 

From our everyday experience, we know that pure 
water appears transparent. In contrast, we can observe 


soil particles suspended in river water or small dust 
particles as they scatter light in a sunbeam. These 
particles have refractive indices different from the 
medium in which they are dispersed and thus scatter 
light. In light-scattering experiments, we exploit this 
effect to investigate the structure, size, arrangement, 
and dynamics of soft and biological matter. This in 
particular includes the motion of biomolecules as well 
as parts and assemblies of biomolecules. 

In a light-scattering experiment, the sample is illu¬ 
minated by (laser) light and the light scattered by the 
sample detected. In static light scattering (SLS), the 
angular dependence of the average intensity of 
the scattered light is measured as a function of scatter¬ 
ing angle, 9. The angular dependence provides infor¬ 
mation on the shape, structure, size, and molar mass of 
the particles as well as their spatial arrangement. In 
a dynamic light-scattering experiment (DLS; or photon 
correlation spectroscopy, PCS), one records the time 
dependence of the scattered intensity (at a given 9), 
which reflects the dynamics of the objects on a certain 
length scale. This might be their center of mass motion 
and/or, in the case of flexible objects, local motions 
such as shape fluctuations or conformational changes. 

In both static and dynamic light scattering, a key 
quantity is the scattering vector Q. To define Q, we first 
consider the wave vector, k, which is defined to be 
parallel to the direction of propagation of a light 
beam and has magnitude inversely proportional to the 
light’s wavelength in vacuo, specifically, k = Innl'k, 
where n is the refractive index of the medium. In both 
SLS and DLS, we are interested in incident light with 
wave vector ki and scattered light with wave vector k^ 
(Fig. 1). Since we are only concerned with processes in 
which the scattered light has about the same wave¬ 
length (or, equivalently, energy) as the incident light, 
the magnitudes of these two vectors are equal, 
l^il = l^sl = ^- The difference of these two vectors 
defines the scattering vector, Q — K — h, the length 
of which is related to the scattering angle, 9 (Fig. 1, 
inset). Simple geometry gives: 

Q=\Q\ = 2ksin^^^ sm(^. (1) 

A crucial idea in interpreting both SLS and DLS is 
that data taken at scattering vector Q give informa¬ 
tion on particle size, arrangement, and dynamics 
on a length scale around C — 2nlQ. Typical 
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Dynamics of Colloids and Macromolecules, Fig. 1 Scatter¬ 
ing geometry and (inset) definition of the scattering vector Q 


light-scattering equipment covers an angular range 25° 
< 9 < 160° and thus lengths in the range 

100 nm < £ < 1 qm. 

Average Scattered Intensity 

A small volume element at position r, where r is 
measured relative to an arbitrary origin (Fig. 1), is 
illuminated by a plane wave with wavelength X and 
wave vector The small volume acts as an oscillating 
dipole which reradiates or scatters light in all direc¬ 
tions. Compared to light scattered at the origin 

(by another volume element), the light travels an 
extra distance SL = 8 Li + 8 L 2 (where 5Li and 5 L 2 
are projections of r onto ki and k^, respectively, as 
indicated by the dotted lines in Fig. 1), which results 
in a phase difference; 

^ , 2 nn ^ ^ ^ „ 

8 <f> = -—SL = ki ■ r — ks ■ r = —Q ■ r. (2) 

A 

This phase difference, which is angle dependent 
through the 6 dependence of Q (Eq. 1), determines 
the (constructive or destructive) interference 
between waves scattered by different volume ele¬ 
ments and thus the angular dependence of the 
scattered light. 

An extended particle consists of many volume 
elements. In the simplest (Rayleigh-Gans-Debye) 
approximation, the scattering of an extended particle 
is obtained by adding the contributions of all volume 
elements. This requires that the incident and scattered 
waves are not altered when passing through a particle. 
This so-called first Born approximation holds for 


particles small compared to the wavelength and/or 
with a low refractive index difference to the solvent. 
Finally, the contributions of the individual particles in 
the scattering volume are summed to give the 
scattered field of the whole ensemble of particles. 
Note that in an experiment, we measure not the aver¬ 
age scattered field but the average scattered intensity, 
which is the square of the field. This scattered inten¬ 
sity depends on the shape and size of the particles 
as well as their arrangement. The effect of the 
particle properties and the particle arrangement 
can be decoupled, if the particle properties and posi¬ 
tions are not correlated. (This is in particular the 
case for identical particles.) In this case, the time- 
averaged scattered intensity, which represents the 
typical result of a static light-scattering experiment, 
can be written as: 

(^I{Q))=KcMP{Q)S{Q) (3) 

with a prefactor K, the particle concentration c and the 
molar mass M of the particles. Note that (• • •), denotes 
averaging over time. The form factor P{Q) depends on 
intraparticle interferences and thus on the size, shape, 
and structure of the individual particles (Kerker 1969; 
van de Hulst 1981; Pedersen 1997). The structure 
factor S(Q) is determined by interparticle interferences 
and hence by the spatial arrangement or relative posi¬ 
tions of the particles (Pedersen 1997; Lindner and 
Zemb 2002). 

Time-Dependence of the Scattered Intensity 

As mentioned above, the interference between waves 
scattered by different volume elements depends on 
their relative positions (Eq. 2). In general, particles 
and also parts of flexible particles move due to thermal 
motion, and thus, their relative positions change with 
time. The phase differences and hence the interference 
of the scattered waves continuously change, and the 
scattered intensity fluctuates accordingly (Fig. 2a). 
The characteristic time scales of these fluctuations 
reflect the particles’ motion or, in the case of flexible 
objects, the “internal modes.” In static light scattering, 
the average intensity over a (macroscopic) time inter¬ 
val is measured (Eq. 3, section “Average Scattered 
Intensity”), while in dynamic light scattering, these 
intensity fluctuations are exploited to obtain informa¬ 
tion on the particles’ dynamics. The time scale of the 
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Dynamics of Colloids and Macromolecules, Fig. 2 (a) 

Schematic representation of a typical time dependence of the 
scattered intensity I{t). Two different delay times (ti, dashed 
blue line', T 2 , dotted green line with T 2 < Tj) are indicated as 
examples, (b, c) Schematic representations of a typical 


normalized time correlation function g^{i) (b) and intermediate 
scattering function/fr) (c) as a function of delay time t. Note that 
the horizontal axes in (b) and (c) are stretched by a factor of 
about 5 compared to the horizontal axis in (a) 


fluctuations in the intensity /(r) is characterized by the 
normalized intensity autocorrelation function: 

. (/(a 

s^^HO. t ) =- 

/(G, t) 

This function quantifies the average degree of sim¬ 
ilarity, or correlation, between light scattered at time t 
and at a delay time t later. With increasing delay time 
T, the scattered intensities at time t + i become statis¬ 
tically less similar to those at time t. The correlation 
function decays on some characteristic time scale, the 
correlation time, At t ^ the intensities become 

uncorrelated so that t+x'j'^. 

Since is normalized by the averaged squared scat¬ 
tering intensity, it decays to 1 at long delay times. This 
normalization also removes instrument-dependent 
contributions (laser intensity, detector distance, detec¬ 
tor efficiency, etc.) so that only sample-specific infor¬ 
mation remains. Furthermore, many processes are 
stationary and hence the observations are independent 
of the start time t. Then the correlation function only 
depends on the delay time x, and the statistics can be 
improved by averaging over the start time (denoted by 
the subscript “t” in Eq. 4). 

The reason for measuring g^^^ is that it gives access 
to a key quantity characterizing the dynamics of 
a sample, the intermediate scattering function (ISF), 
/(G, "t). The ISF quantifies how quickly a sinusoidal 
density fluctuation of wavelength 2n/Q decays away 
due to motions of all kinds. (Note that from here on, we 
drop the vectorial nature of the scattering vector since 



we will be considering samples that are spatially 
isotropic.) Formally, the ISF is defined to be the nor¬ 
malized correlation function of density fluctuations: 


/(G,t) 


{Ap{Q,0)Ap{Q,x)) 

([Ap(G)]") 


(5) 


where Ap(Q, t) is a fluctuation about the average den¬ 
sity with wavelength 2nlQ at time t. For noninteracting 
(independent), monodisperse spherical particles 
undergoing Brownian motion, it can be shown that 
the ISF decays exponentially (Egelhaaf 2006): 

m,x)=e-^0^\ ( 6 ) 

This function decays with a characteristic time (the 
correlation time): 

1 

^^~ 

DQ^ £)(47in sin(0/2))^ D 

It quantifies the typical time of the fluctuations. It is 
the average time taken by a particle to explore 
a distance IjQ, which is about the wavelength X 
(Eq. 1). The correlation time Xc is usually in the range 
of milliseconds. 

Provided that certain assumptions are satisfied, 
g^^^iQ, x) (what we can measure in DLS, Fig. 2b) and 
f(Q, t) (what we can interpret in terms of dynamics. 
Fig. 2c) are related by: 

= 1 +/^(G,'t)- 


( 8 ) 
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To measure g^^\Q, t), a sample is illuminated by 
a coherent light source, a laser, which is focused onto 
the sample to produce a plane incident wave. The 
scattered light is detected by one or several detectors 
(Egelhaaf and Schurtenberger 1996). Two apertures or 
a single-mode optical fiber (Gisler et al. 1995) in front 
of each detector ensures that the fluctuating intensity of 
only a few or, ideally, a single coherence area (speckle) 
is monitored, which reduces the averaging of the signal 
and hence maintains strong fluctuations in the mea¬ 
sured intensity. The detector records the scattering 
intensity 1 (f) from which the normalized time correla¬ 
tion function g^^\Q,r) and the ISF f(Q,x) are calculated 
(Eqs. 4, 8), typically using a correlator card. 

If the suspension contains noninteracting, monodis- 
perse Brownian particles, a log-linear plot of the mea¬ 
sured ISF/(Q, t) = exp {-DQ^x} (Eq. 6) versus the 
delay time t results in a straight line with slope — DQ^. 
This can be used for particle sizing since the diffusion 
coefficient D is linked to the hydrodynamic radius 
by the Stokes-Einstein relation; 


with Boltzmann’s constant ke, temperature T, and vis¬ 
cosity r]. The hydrodynamic radius is the same as the 
“size” measured in centrifugation experiments. For 
monodisperse, spherical particles, 7?^ is identical to the 
particle radius, = R. In the case of nonspherical 
particles, the dependence of Ry, on the particle parameters 
is more complicated but known for a number of different 
shapes. For an example, see section “Solution Structure 
of ocr; Shape Determination Using DLS” and references 
(Berne and Pecora 1976; Brown 1993; Lindner and 
Zemb 2002; Schurtenberger and Newman 1993). 

For systems with shape or size polydispersity and/or 
flexible parts which undergo internal motions, a mean 
ISF {f{Q,x)) is measured. It represents an average 
over the individual ISFs, which are weighted by the 
scattering intensity of the corresponding (part of the) 
particle. With the intensity-weighted distribution of 
diffusion coefficients or correlation times, piiQ, Tc), 
one can calculate the mean ISF; 

= J Pi{Q,t:c) dtc 

= y pKGjTc) die. (10) 


This indicates that the mean ISF {f{Q,x)) repre¬ 
sents a Laplace transform of piiQ, r^). An inverse 
Laplace transform of the measured {f{Q,x)) could 
thus be expected to yield pi(Q, Xc). However, this is 
a so-called ill-conditioned problem; even a small sta¬ 
tistical or systematic error in the measurement trans¬ 
lates into a large uncertainty in pi(Q, Xg). Nevertheless, 
methods have been developed to extract the maximum 
possible information on piiQ, Xc) from (/(Q,x)), for 
example, CONTIN (Provencher 1982). Furthermore, 
for particles with a relatively narrow size distribution, 
the method of cumulants (Koppel 1972) provides an 
average diffusion coefficient {D) and an indication of 
the width of the distribution. However, the interpreta¬ 
tion of the data remains ambiguous. It requires the 
assumption of a model, which might exploit a priori 
information, and a comparison of this model with the 
experimental data. Even if the model agrees with the 
data, it is possible that other models fit the data equally 
well. This approach is prone to controversies because 
in principle, all possible models have to be tested, 
including, for example, the possibility of size or 
shape polydispersities and inhomogeneities of the par¬ 
ticles as well as interparticle interactions. Further com¬ 
plications arise if the sample is turbid and multiple 
scattering occurs or if it is nonergodic and the average 
over all starting times (Eq. 4) does not sample a repre¬ 
sentative set of configurations in time and hence does 
not correspond to the ensemble average. 

Differential Dynamic Microscopy 

Using DLS for sizing particles and macromolecules 
via the measurement of a sample’s ISF relies on 
specialized apparatus; a laser to provide coherent 
illumination and a correlator card are essential, 
while ideally, one should also use a goniometer to 
permit measurements at multiple scattering angles 
(and therefore Q). Recently, a new method, “differ¬ 
ential dynamic microscopy” (DDM) (Cerbino and 
Trappe 2008), has been invented which allows the 
measurement of the ISF and therefore the diffusion 
coefficient D and hydrodynamic radius Rj, of parti¬ 
cles, without the need for specialist items, but only 
standard laboratory equipment; a microscope and a 
CCD camera. Due to these relatively modest require¬ 
ments, DDM has the potential of being implemented 
much more widely than DLS. 
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Because DDM is a newer development and less 
background is available elsewhere, we give a more 
detailed discussion than in the case of DLS. The raw 
data in DDM are low-magnihcation time-lapsed 
images of objects, described by the intensity I{r, t) in 
the image plane (r). (Note that r denotes position in the 
image plane in this section, and not a position in the 
sample as in our discussion of DLS.) From these, we 
calculate difference images at various delay times, 
T, D(r, t) = /(r, f + t) — /(r, r) = A/(r, t) — A/ (r, 0), 
where A/(r, t) = /(r, t) — (/) denotes intensity fluctu¬ 
ations and we assume a stationary process, i.e., inde¬ 
pendence of the start time t. Fourier transforming 
D{r, t) gives: 

J^n(Q,r) = J D{r,r)e‘^-^dr. ( 11 ) 

For stationary, isotropic processes, we average 
over the start time t in the difference images and 
azimuthally in Q space to calculate the basic output 
of DDM, what we may call the “differential intensity 
correlation function” (DICF), ■ 

We now show that the DICF is related to the ISF if 
we assume that intensity fluctuations in the image are 
proportional to the fluctuations in the number density 
of objects around the average density (p): 

AI{r,t) = KAp{r,t). (12) 

Here, the constant k depends on the contrast mecha¬ 
nism and Ap{r, t) = p{f^ t) — (p). Equations 11 and 12 
give: 


The prefactor A(Q) depends on the imaging system 
through K and on the sample’s structure through 

[Ap(Q)]^y Recognizing that the T-dependent term 

on the right-hand side of Eq. 15 is the ISE, we arrive 
at this key result: 

{\FD{Q.rf) = A{Q)[l -/(e,T)] +B(e), (17) 

where we have included a term B{Q) to account for 
camera noise. Thus, the power spectrum of intensity 
fluctuations of the images, yields the 

ISE. In a monodisperse sample of Brownian spheres, 
f{Q, t) = (Eq. 6), which, when substituted into 

Eq. 17, can be used to fit the measured DICE and obtain 
D and the hydrodynamic radius Rj, (Eq. 9). In general, 
to analyze the ISE, the same procedures can be 
followed as described above (Section “Time- 
Dependence of the Scattered Intensity”), for example, 
for polydisperse samples (Eq. 10). 

DDM relies on a number of assumptions. We have 
already mentioned the proportionality of imaging 
intensity and sample density (Eq. 12). Our derivation 
also assumes that the decorrelation of the light caused 
by motion perpendicular to the imaging plane is slow 
enough to be ignored. Indeed, one of the reasons for 
using a low numerical aperture objective in DDM is to 
ensure a large enough depth of focus that this assump¬ 
tion holds. These two assumptions need to be checked 
for their validity in individual cases in the application 
of this new technique. 


Fd{Q, t) = k Ap(Q, t) - Ap(g, 0) 


(13) Examples 


where Ap{Q,T) = J Ap{r,t)e'^''dr. 

Thus, the DICE can be expressed as: 

{\Fo{Q,r)\^)=A{Q) 

, {Ap{Q,0)Ap*{Q,x)) 

([Ap(e)f) 

where A{Q) = 2K^(^[Ap{Qfy (16) 


We present two examples. The first, briefer, example 
compares data obtained from DLS and DDM on a 
model suspension of quasi-monodisperse hard spheres 
suspended in water (Section “Sizing Colloids by DLS 
and DDM”) (Wilson et al. 2011). The second, more 
extensive, example illustrates the use of DLS to deter¬ 
mine the shape of a nonspherical protein (Section 
“Solution Structure of ocr: Shape Determination 
Using DLS”) (Blackstock et al. 2001). 

Sizing Colloids by DLS and DDM 

The ISE for dilute, monodisperse spheres is exponen- 
tial,/(2, t) = (Eq. 6). This reflects the fact that 
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diffusion is the only motion that relaxes density 
fluctuations in such a suspension. We have measured 
/ (Q, t) of a dilute suspension of 620-nm-radius 
polystyrene spheres dispersed in water. The DLS was 
performed using the red line of a He-Ne laser 
(1 = 633 nm in air), while the DDM was performed 
using an inverted microscope and a computer- 
controlled high-speed camera. The raw data are qual¬ 
itatively identical to those shown in the next section 
(Fig. 4a) for the protein ocr: In each case, we fitted 
f{Q, t) = to the data. Since the particles are 

purely diffusive, we expect the correlation time to 
obey TjT* = DQ^. Figure 3 shows the correlation time 
plotted against Q from DDM (two frame rates) and 
DLS. The data sets overlap along a single straight line 
with slope —2. 

Note from Fig. 3 that DDM typically operates in a 2 
range that is an order of magnitude lower than that 
accessed by conventional DLS. This is because, as 
already noted, DDM uses a low numerical aperture 
objective, which collects light that has only been 
scattered through small angles by the sample. The 
ability to reach such low Q can be exploited for the 
high-throughput characterization of bacterial motility 
(Wilson etal. 2011). 

Solution Structure of ocr; Shape Determination 
Using DLS 

Upon infection of an Escherichia coli host cell by 
bacteriophage T7, ocr (overcome classical restriction) 
is the first protein to be produced. Once present, ocr 
inhibits the type I restriction/modification (R/M) 
enzymes of the host cell. R/M enzymes can detect 
and destroy foreign DNA. Their inhibition is thus 
crucial to prevent the destruction of the invading 
DNA. This allows the remaining bacteriophage DNA 
to be transcribed and thus ensures a successful infec¬ 
tion of the host by the phage. Recently, the structure of 
ocr was solved by X-ray crystallography to a resolution 
of 0.18 nm. This confirmed a previous structural 
model, which had a lower resolution and was based 
on light-scattering experiments (Blackstock et al. 
2001 ). 

Static light-scattering experiments indicate entities 
with a molar mass of about 28 kDa (Eq. 3), which 
implies that ocr (molar mass M « 14 kDa) exists in 
solution as dimers. This result together with dynamic 
light-scattering experiments can provide information 
on the shape of the dimer, although its size is much 



Dynamics of Colloids and Macromolecules, Fig. 3 The cor¬ 
relation time Zc of a dilute suspension of polystyrene particles 
suspended in water plotted against the magnitude of the scatter¬ 
ing vector Q. The correlation times Zc were determined by fitting 
f{Q, z) — to ISFs obtained by DLS (•) and DDM with 
different camera frame rates (50 fps: -I-; 150 fps: x). In a purely 
diffusive system, = DQ^. the data from the two methods 
follow a single straight line with slope —2 

smaller than the wavelength of light (and thus C) 
(Blackstock et al. 2001). This is due to the fact that 
static light scattering and dynamic light scattering 
probe different properties, the (scattering) mass distri¬ 
bution, and mobility, respectively. The ISFs / (Q, t) 
determined at different scattering vectors Q fall on top 
of each other and follow a linear dependence as func¬ 
tion of (Fig. 4a). This is consistent with diffusion 
(Eq. 6) and the presence of a single, rigid species. From 
the slope (Fig. 4a), the diffusion constant D can be 
calculated (Eq. 6), D = 79.1 prn^s”*, and using the 
Stokes-Einstein relation (Eq. 9) also the hydrodynamic 
radius = 2.6 nm. This value is larger than expected 
for a spherical protein with M = 28 kDa and specific 
volume V = 0.721 ml/g (resulting in Rsph = 2.0 nm), 
even including a reasonable hydration layer 

^sph"^ = 2.2 nm^. This suggests a nonspherical 

shape of the ocr dimer, which would explain a larger 
hydrodynamic radius Rh = ^sph"^/®- For oblate or pro¬ 
late ellipsoids of revolution with axes a, h, h, and axial 
ratio p = a/h, the ratio a{p) depends only on the axial 
ratio p and is independent of protein-specific factors 
(Fig. 4b). The experimental value of the hydrodynamic 
radius = 2.6 nm suggests an oblate shape with its 
long and short axes h — 6.7 nm and a — \A nm, 
respectively, or a prolate shape with the long and 
short axes a — 10.4 nm and h = 2.6 nm, respectively. 
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a 


b 



Q^slum^) Mp=bla p = a/b 


Dynamics of Colloids and Macromolecules, Fig. 4 (a) 

Intermediate scattering function / {Q, t) obtained by dynamic 
light scattering as a function of Q^t for different scattering 


vectors Q (O, 13.8 nm“'; □, 15.0 |am“'; A, 16.3 nm^'; x, 
17.5 |am^'; O, 18.7 |am^*). (b) The function a{p) as a function 
of 1/p for oblate {left) and p for prolate {right) objects 


A maximum thickness of only 1.4 nm is not compatible 
with typical elements of secondary structures and 
implies a very flexible molecule, which makes it diffi¬ 
cult to maintain a stable tertiary structure and thus an 
active conformation. A prolate shape is thus more 
likely. The size of the prolate protein is strikingly 
similar to the size of the DNA to which type I R/M 
enzymes bind. This suggests that ocr mimics the struc¬ 
ture of this piece of DNA, which is also supported by 
the fact that, like DNA, ocr is strongly acidic with 28 
negative charges per monomer at neutral pH. This is 
consistent with the observation that ocr competes for 
the DNA binding site of type I R/M systems. 

Conclusion 

Both DLS and DDM give access to the same dynam¬ 
ical quantity, the intermediate scattering function f{Q, 
t). For a dilute dispersion of monodisperse spheres, the 
interpretation of this function is unambiguous, 
f{Q, t) = where D = k^T/6TiriR. Knowing the 

temperature T and the viscosity -q of the solvent, we can 
therefore determine the radius R of the spheres. As the 
scattering objects deviate from dilute homogeneous 
single-sized spheres, models become increasingly 
important in the interpretation of / (Q, t). We have 
discussed briefly the issues of size polydispersity 
and nonsphericity. Further complexities arise, for 
example, due to internal motions within each scatter¬ 
ing object or interactions between these objects. In real 


experimental situations, therefore, it is important that 
results from DLS or DDM are corroborated by other 
measurements. Nevertheless, DLS and DDM are con¬ 
venient, laboratory-based methods for probing the 
dynamics of a broad range of dispersions of biophys¬ 
ical interest. 
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Synonyms 

7-Transmemhrane domain receptors; Amphipathic 
helix; GPCR; Protein Dynamics; Seven-Transmem- 
hrane Domain Receptors; Signal transduction at the 
intracellular surface of the memhrane-integral 
G-protein-coupled receptors 

Definition 

The intracellular surface of G-protein-coupled recep¬ 
tors (GPCR) is a key element in signal transduction, 
enabling molecular recognition by signaling molecules 
such as G-proteins, kinases, and arrestins. Helix 8 is 
an amphipathic helical segment that is found predom¬ 
inantly in the rhodopsin-like class A of GPCRs. Helix 


8 lies parallel to the cytoplasmic membrane surface 
and connects the membrane domain of the receptor 
with its long C-terminal tail. 

Basic Characteristics 

G-protein-coupled receptors (GPCRs) are the largest 
class of human membrane receptors. They transmit an 
enormous variety of different external stimuli, includ¬ 
ing hormones, neurotransmitters, odorants, taste, and 
photons, to intracellular signaling cascades. GPCR 
signaling is mediated by the activation of a GTP- 
binding protein (G-Protein) through an agonist- 
activated receptor. GPCRs are often called seven trans¬ 
membrane helical (7TM) receptors, as the linear chain 
of amino acids folds into a structural motif of seven 
a-helices, which traverse the membrane seven times. 

Conformational changes of the receptor induced by 
ligand binding from the extracellular side are transmit¬ 
ted to the intracellular surface of the receptor and 
manifest themselves in an “active” cytoplasmic recep¬ 
tor domain that is recognized by the G-protein and 
enables protein-protein interaction. In parallel to 
G-protein signaling, the active receptor is phosphory- 
lated by kinases and recognized by arrestins, initiating 
receptor desensitization. 

In particular, the amphipathic helix 8 plays an 
important role in protein-protein interactions involved 
in GPCR and G-protein activation, GPCR phosphory¬ 
lation and deactivation. Helix 8 has been identified as 
a highly conserved structural motif among class 
A GPCRs, lying parallel to the cytoplasmic membrane 
surface and connecting transmembrane helix (TM) 

7 (TM7) with the long C-terminal tail (Fig. 1) (for 
a review of GPCR structures, see Deupi and Standfuss 
201 1). Focated at the start of the C-terminal tail, helix 

8 might serve as a transmitter of signaling states or is 
itself involved in regulating C-terminal structures. 
Involvement of helix 8 in G-protein signaling, receptor 
phosphorylation, deactivation, internalization, as well 
as expression and sorting is documented (for reviews 
see Huyhn et al. 2009; Okuno et al. 2005). 

Coupling of helix 8 to the ligand binding pocket is 
mediated by the conserved NPxxY motif in TM7 
preceding helix 8. In the GPCR rhodopsin (Fig. 1) 
this coupling is realized by an interhelical network 
that connects the conserved amino acid D83 (TM2) 
close to the ligand binding pocket with the NPxxY 
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Dynamics of Helix 8 in GPCR Function, Fig. 1 Close-up of 
the molecular model of rhodopsin. (a) Side view of the cytoplas¬ 
mic part of rhodopsin based on the X-ray stracture of the inactive 
dark state (pbd-hle 1U19). The amino acids belonging to the 
NPxxY {blue) motif and interacting residues as well as the 
hydrophobic residues of H8 (red) are indicated, (b) Top view 


of the cytoplasmic surface. Possible phosphorylation sites 
(serine 334, serine 338, serine 343) in the C-terminal tail are 
shown as blue spheres, (c) Comparison of the inactive (blue) 
and active (red, pdb-Hle 3PXO) crystal structures of rhodopsin. 
(d) Helical wheel presentation of amphipathic helix 8 of 
rhodopsin 


motif. In the inactive receptor state, the aromatic 
amino acid residue F313 at the start of helix 8 is in 
contact with residue Y306 of the NPxxY motif (Fig. 1), 
coupling TM7 with helix 8 (for a review see Hofmann 
et al. 2009). Sequence alignments of helix 8 from 180 
GPCRs revealed that the NPxxY motif is separated by 
six amino acids from the N-terminal phenylalanine of 
helix 8 in the majority of the receptors (Okuno et al. 
2005), indicating a conserved mechanism that 
constrains helix 8 with TM7 as well as TM7 with 
TMl and TM2 via the NPxxY(x)F motif. 

Helix 8 is amphipathic by nature (Fig. 1). This 
property plays a crucial role in helix-helix interaction 
and in the interaction with membranes. The 
distribution of the hydrophobic residues in rhodopsin’s 


amphipathic helix 8 follows the rule that every fourth 
residue is hydrophobic. Sequence alignments of class 
A GPCR showed that the majority of the receptors 
contain a phenylalanine in the first hydrophobic 
position of helix 8, followed by phenylalanine or leu¬ 
cine in the n -F 4, and leucine in the n -F 8 hydrophobic 
position (Okuno et al. 2005). 

Because of its amphipathic nature helix 8 displays 
an intrinsic high conformational plasticity; it is disor¬ 
dered in an aqueous environment but adopts predom¬ 
inantly a helical conformation in close contact to 
a membrane. It was anticipated that helix 8 may act 
as a membrane-dependent conformational switch as 
well as an ionic/hydrophobic zipper facilitating forma¬ 
tion of receptor dimers (for reviews see Kim et al. 
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Dynamics of Helix 8 in GPCR Function, Table 1 Dynamics 

of helix 8 and its function in inactive and active receptor states 

Receptor state 

Helix 8 dynamics 

Functional role 

Biophysical technique 

Inactive State (dark 
rhodopsin, inverse 
agonist-bound state) 

Medium mobile below 
neutral pH, highly mobile at 
alkaline pH {“alkaline 
unlock” with loss of 
constraints) 

“Alkaline unlock” of helix 8 
correlates with pH-dependent 
equilibrium between active and 
inactive receptor state 

Fluorescence 

anisotropy” 

Active state 

(agonist-activated 

metarhodopsin-II) 

More immobile and slower 
motion {gain of constraints) 

Direct correlation of helix 8 
conformational changes with 
H-bonding to the conserved D83 in 
TM2, which couples the ligand 
binding pocket via the NPxxY 
motif to helix 8 

EPR” 



Additional contacts by the 

C-terminal tail keeping helix 8 in 
a position in which its environment 
is strongly coupled to the ligand 
binding site 

Fluorescence 

anisotropy” 



These correlations are maintained 
upon G-protein binding 

FTIR/fluorescence 

cross-correlation” 

P hosphorylated 
active state in 
complex with 
arrestin 

Prebinding of aiTestin: high 
mobility, slow motion 
{loss of constraints) 

High-affinity binding state 
of arrestin: More immobile 

A loss of interactions that restrain 
helix 8 is required to fit helix 8 into 
a flexible conformation, facilitating 
multistep arrestin binding 

Fluorescence 
anisotropy, 
fluorescence pump- 
probe spectroscopy 
(Kirchberg et al. 

2011) 


“For references see review Kim et al. (2012) 


2012; Parker et al. 2011). The conformational flexibil¬ 
ity of helix 8 is an important property for the GPCR 
activation process that actively involves the mem¬ 
brane. Table 1 summarizes the functional role of 
highly mobile and immobile (constrained) conforma¬ 
tions of helix 8 in inactive and active receptor states 
and during receptor deactivation. 

Since GPCRs are involved in fundamental and 
physiological diverse signal transduction pathways, 
they are a major target for therapeutic agents. Different 
ligands as well as allosteric modulators can stabilize 
selective receptor conformations that are linked to 
specihc functional outcomes (for a review see 
Rajagopal et al. 2010). Helix 8 was identihed as an 
allosteric interaction site for a small molecule ligand 
stabilizing an alternative receptor conformation in the 
GPCR PARI that was selective with regard to 
G-protein coupling and confers an antithrombotic 
effect (Dowal et al. 2011). This ligand selectivity was 
correlated with receptor interactions that constrain 
helix 8 to TM7 and the membrane. Knowledge about 
coupled and uncoupled conformational states in recep¬ 
tor signaling thus may provide leads for the rational 
design of efficient drugs. 


Cross-References 

► EPR Spectroscopy: General Principles 

► Fluorescence 

► Membrane Proteins; Structure and Organization 

► Protein Dynamics and Allostery 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 

► Protein-Ligand Dynamics 

► Rhodopsin 
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Synonyms 

Molecular motors 


Introduction 

All known cytoskeletal molecular motors belong to 
one of three families: the ► dyneins, ► myosins, and 
► kinesins. Each motor includes at least one heavy 
chain (HC), which contains a motor domain that con¬ 
verts the chemical energy of ATP into mechanical 
work as well as binding sites for various accessory 


chains. Despite these similarities in overall structure 
and function, the dyneins are distinct due to the size of 
their HCs. Dynein HCs (>500 kDa) are much larger 
than those of kinesin (^70-170 kDa) and myosin 
(^120-250 kDa). This disparity in size reflects the 
difference in evolutionary origin between dynein and 
the other two motor families. The highly conserved 
nucleotide binding pockets of kinesin and myosin are 
homologous to the active site of G proteins, while 
dynein’s hydrolytic activity relies on AAA-f domains. 
All three families are composed of several related, yet 
distinct members, each adapted to its unique cellular 
function. The focus of this entry is to provide an 
overview of the dynein family and to draw contrasts 
and comparisons between its various members. 

The dynein family is first divided into axonemal, 
cytoplasmic, and intraflagellar transport (IFT) dyneins. 
Axonemal dyneins reside within cilia and flagella and 
are responsible for the rhythmic and coordinated bend¬ 
ing of these organelles. Cytoplasmic dynein is found 
across the entire microtubule (MT) network and are 
responsible for retrograde transport of various cargos 
toward the minus ends of MTs, while IFT dynein 
predominantly transports cargos along MTs in the axo- 
neme. The axonemal dyneins consist of one known 
outer dynein arm (ODA) and at least seven distinct 
inner dynein arms (IDAs), with the exact number vary¬ 
ing between species. Also depending on the species, 
the ODA can contain either two or three heavy chains, 
which are further defined as the a, P, and y HCs. Each 
IDA contains a single HC except for one known as 
inner arm II or f (hereafter referred to as Il/f), which is 
a heterodimer. Cytoplasmic dynein and IFT dynein are 
related and are sometimes referred to as cytoplasmic 
dynein-1 and cytoplasmic dynein-2, respectively. Both 
IFT and cytoplasmic dynein are homodimers of their 
respective HCs (Fig. 1). 

Structurally, the ► dyneins differ not only in the 
number and sequences of their HCs, but also in the 
composition and organization of the smaller subunits. 
These differences are described below, but there are 
also key characteristics that all dyneins share. Every 
dynein complex contains at least one HC. The C- 
terminal two-thirds of the HC is known as the motor 
domain. It consists of six AAA-f domains that form 
a ring. The stalk, a 15 nm antiparallel ► coiled coil, 
projects out of the fourth AAA-f domain and bears the 
globular microtubule-binding domain (MTBD) at its 
tip (Fig. 2). A shorter coiled coil, known as the buttress 
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Dynein Family 
Classification, 

Fig. 1 Classification of 
dynein. A phylogenetic tree of 
~60 distinct dynein heavy 
chains. Sequences cover all 
known types of dynein heavy 
chains from several species, 
including A. gossypii, C. 
elegans, C. reinhardtii, D. 
rerio, D. melanogaster, G. 
gallus, H. sapiens, K. lactis, 

N. crassa, S. cerevisiae, and T. 
thermophila. Encircling the 
phylogenetic tree is 
a summary of the heavy chains 
that make up each type of 
dynein motor 
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or the strut, extends from the hfth AAA+ domain to the 
middle of the stalk. N-terminal to the hexameric ring is 
an element known as the linker, which plays a pivotal 
role in translating the ATPase action of the motor 
domain into mechanical motion (Burgess et al. 2003). 
The N-terminal one-third of the HC is known as the 
stem or, more recently, the tail domain in order to 
differentiate it from the stalk. The tail serves as 
a major site of protein-protein interactions. It contains 
the site of HC oligomerization (homo and heterodi¬ 
merization, as well as trimerization) and binds acces¬ 
sory proteins, such as intermediate chains (ICs) and 
light chains (LCs). The resulting tail subcomplex is 
responsible for attachment of cytoplasmic and IFT 
dynein to cargos and anchors the motor to doublet 
MTs in axonemal dyneins. 

Axonemal Dyneins 

The axoneme is a highly conserved structure found 
in cilia and eukaryotic flagella. In motile cilia and 


flagella, each axoneme is made up of a bundle of nine 
peripheral doublet MTs and two central singlet MTs 
that run longitudinally throughout the organelle. Each 
doublet MT has several arms projecting toward neigh¬ 
boring doublet MTs as well as radial spokes that span 
the gap between the peripheral and central MTs 
(Fig. 3a). In 1963, Ian Gibbons began chemically 
probing axonemes in order to discover the protein 
responsible for their bending. This involved dissolving 
the membrane of Tetrahymena cilia and treating the 
exposed axonemes with various solutions. He found 
that after treatment with EDTA, the axonemes lost 
their ATPase activity, while the supernatant gained it. 
Using an electron microscope. Gibbons noticed that 
the arms that extend between adjacent peripheral MTs 
were missing from these inactive axonemes and named 
the large protein purified from the supernatant 
“dynein” after the dyne, a unit of force (King 201 1). 

The unicellular green alga Chlamydomonas has 
served as an important model organism for the study 
of axonemal dyneins. Several dynein deficient mutants 
that exhibit reduced flagellar motility or are 
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Dynein Family Classification, Fig. 2 Overview of dynein 
heavy chain structure. Starting at its N-terminus, the heavy 
chain includes a cargo-binding tail domain {red), a mechanical 
element known as the linker (purple), and a motor domain (blue), 
which consists of a hexameric ring of AAA+ domains. A 15 nm 
coiled coil known as the stalk (yellow) projects out of the ring 
from the fourth AAA-t domain and bears the globular microtu¬ 
bule-binding domain (green) at its tip. A shorter coiled coil 
known as the buttress or strut (pink) similarly extends out from 
the fifth AAA-r domain and interacts with the middle of the 
stalk, possibly playing a role in the mechanochemical cycle 


completely paralyzed have been described. By com¬ 
paring their physiology, subcellular anatomy and 
genotypes, it was demonstrated that there are several 
types of inner dynein arms (IDAs), which are structur¬ 
ally and functionally distinct from the single outer 
dynein arm (ODA). The IDAs can be further resolved 
into seven distinct dyneins by using high-pressure 
liquid chromatography to fractionate high salt extracts 
of Chlamydomonas axonemes. The seven resulting 
peaks have been named a-g (Hirose and Amos 2012). 
Full genome sequencing predicted at least three more 
gene products that resemble axonemal dynein HCs, 
which have only recently been purified due to a lower 
level of expression. These minor IDAs were shown to 
localize specifically to either the proximal or distal 
region of the axoneme, demonstrating yet another 
level of complexity in the coordination between the 
various IDAs (Yagi et al. 2009). 


The Outer Dynein Arm 

The ODA can be found on the side of the doublet MT 
farthest from the central pair of MTs. Each doublet MT 
is made of one complete MT, known as the A-tubule, 
and one incomplete MT, known as the B-tubule. The 
ODA is situated such that its tail subcomplex associ¬ 
ates with the A-tubule of one doublet MT, and its stalk 
stretches across to the B-tubule of the neighboring 
doublet MT. The link between the ODA and A-tubule 
remains intact throughout the mechanochemical cycle, 
while the MTBD at the tip of the stalk dissociates and 
reassociates with the B-tubule each step. Looking 
down the length of the axoneme, there is one ODA 
every 24 nm (Bui et al. 2010) (Fig. 3b). 

Depending on the species, each ODA can contain 
either two or three HCs. ► Electron tomography of 
Chlamydomonas axonemes shows the planes of the 
AAAh- rings of three HCs (a, (3, and y) stacked parallel 
to the A-tubule they are attached to (Takazaki et al. 
2010). Phylogenetically, the a and (3 chains are very 
similar and probably result from gene duplication, 
while the y chain is more distantly related. Animal 
ODAs contain only two HCs, which are most similar 
to P and y chains of the Chlamydomonas (Fig. 1). 

Each ODA also contains two ICs and eight LCs, 
which associate with the tail portion of the HC to form 
the tail subcomplex. Several of the LCs are homo- 
logues of well-known proteins and these relationships 
have been used to suggest potential mechanisms for 
regulation of the ODA by these LCs. For example, LC3 
and LCS are closely related to ► thioredoxin, and may 
regulate via redox potential (Kamiya 2002). 

Furthermore, there are associated complexes that 
are not considered part of the ODA, but are found 
adjacent to it and are thought to play a role in its 
regulation. One subcomplex of three proteins, known 
as the ODA docking complex (ODA-DC) is made 
primarily of coiled coils that are roughly 24 nm in 
length. The ODA-DCs lay end-to-end along the 
A-tubule, which suggests that its primary function is 
to measure out the appropriate distance between adja¬ 
cent ODAs. There are also two recently discovered 
linkers, which are thought to contribute to the tight 
inter-dynein regulation necessary for axonemal beat¬ 
ing. The first, known as the outer-outer-dynein linker 
(OOD) connects adjacent (down the length of the axo¬ 
neme) ODA HCs, and is thought to help propagate 
ODA activity necessary for the production of the 
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Dynein Family Classification, Fig. 3 Position and orienta¬ 
tion of the various dyneins in the axoneme. (a) Cross-sectional 
view of an axoneme showing the 9-1-2 arrangement of microtu¬ 
bules and the location of various dyneins. The outer dynein arms 
are heterotrimers of three distinct heavy chains (two in some 
species). The inner dynein arms are monomers aside from the 


heterodimeric Il/f dynein. Colors correspond to Fig. lb. The red 
box indicates the doublet microtubule that is viewed in Fig. 3b. (b) 
Longitudinal view of a single doublet microtubule demonstrating 
the spacing of outer and inner dynein arms along the axoneme. 
There is one outer dynein arm every 24 nm, while a complex 
pattern of several inner dynein arms repeats every 96 nm 


observed waveform. The second is the outer-inner- 
dynein linker (OID), which connects two out of every 
4 ODAs to the ID As and is thought to help coordinate 
activity between the ODAs and IDAs (Nicastro et al. 
2006). In summary, although there is only one type of 
ODA, it is a highly complex molecular machine made 
up of several individual polypeptides and forming 
direct interactions with a dozen others. 


The Inner Dynein Arm 

In contrast to the ODA each IDA is much simpler in 
structure, but it is the interplay of the different types 
that gives the IDAs a complexity of their own. The 
IDAs are found on the side of the doublet MT closest to 
the central pair. They are oriented in the same way as 
the ODAs, with their tail attached to the A-tubule of 
one doublet MT and the stalk attached to the B-tubule 
of its neighbor. Rather than repeating every 24 nm like 
the ODA, a complex pattern consisting of several IDAs 
repeats every 96 nm (Fig. 3b). 

IDA Il/f is phylogenetically, structurally and func¬ 
tionally distinct from the rest of the IDAs. It is 
a heterodimer of two HCs, also referred to as a and (3 
(not to be confused with the a and P HCs of the ODA), 


which form distinct clusters in Fig. 1. Il/f contains 
three ICs (140, 138, and 97 kDa). The 140-kDa 
IC anchors IDA Il/f to the doublet MT, while the 
138-kDa either promotes or inhibits the activity 
of this dynein depending on its state of phosphoryla¬ 
tion. The monomeric IDAs do appear to form some 
phylogenetic groups, but these clusters do not corre¬ 
late well with groups that are based on structural, 
biochemical, and physiological properties (Horn 
et al. 2011). Structurally, each monomeric IDA has 
a single HC, the tail of which interacts with an actin 
monomer. The actin subunit in turn interacts with 
a centrin LC in IDAs b, e, and g and a p28 LC in 
IDAs a, c, and d (Kamiya 2002). Actin appears to be 
filling the role of the IC in these dyneins since it is 
thought to provide a scaffold for the corresponding 
LC to bind to the HC tail. 

Each 96 nm repeat of IDAs also contains a crescent¬ 
shaped complex called the dynein regulatory complex 
(DRC). This complex is situated directly on the dou¬ 
blet MT and interacts with the radial spokes, the OID 
mentioned above and nexin, an elastic protein that ties 
together the bundle of nine peripheral doublet MTs. 
Because of its central location and interaction with 
both the radial spokes and the OID, the DRC plays 
a key role in coordinating the activity of the axoneme 
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(Barber et al. 2011). The OlDs themselves promote 
communication between the ODAs and ID As. Since 
there are two OIDs for every 96 nm unit in the IDA 
pattern, the two OIDs must interact with different 
parts of the inner arm. One OID forms a connection 
from an ODA to the DRC, while the other links an 
ODA to IDA Il/f (Nicastro et al. 2006). The discovery 
of these OID connections underscored the critical roles 
that the DRC and IDA Il/f have in regulating axone- 
mal activity. 

Functional Characteristics of Axonemal 
Dyneins 

Functional studies of various Chlamydomonas mutants 
revealed that not all axonemal dyneins are necessary 
for motility and helped to define a minimum combi¬ 
nation of dyneins that must be present for axonemal 
beating to take place. The first, somewhat unexpected 
finding was that mutants lacking ODA altogether still 
swim, but at less than half the speed of the wild 
type. Mutants lacking certain combinations of IDAs 
were also able to swim at a reduced velocity, but other 
combinations were paralyzed. Specifically, if 
a mutant lacked two of the following three items, it 
was paralyzed completely: (1) The entire ODA, 
(2) IDA Il/f, (3) IDAs a, c, and d. Analysis of these 
same mutants also helped characterize the types of 
motion each type of dynein is responsible for. For 
example, ODAs produce a high beat frequency, 
while IDAs promote the large amplitude of each beat. 

In vitro motility assays performed by isolated 
dyneins provided further insights into the velocity 
and force produced by each specific dynein. For exam¬ 
ple, IDA Il/f, which appears to be very important to the 
function of the axoneme in vivo, moves MTs 95-99% 
slower than the other isolated dyneins. Additionally, 
IDAs a, c, d, e, and g rotate MTs in addition to 
translocating them, which suggests that one of the key 
roles of these dyneins is to produce torque. Furthermore, 
IDA c has been reported to be a processive motor, which 
is surprising both because it is a monomer, and because 
there is no immediately obvious benefit for such a motor 
in the axoneme (Kamiya 2002). 

The current model for reconciling all of these struc¬ 
tural and functional observations describes two inde¬ 
pendent systems working in parallel within the 
axoneme. The relationship between the two remains 


unclear, but it appears that the IDA system is respon¬ 
sible for initiating axonemal beating, while the ODA 
amplifies the power. This would account for the fact 
that mutants completely lacking ODAs can still swim, 
while those lacking all IDAs cannot. It is also 
supported by the layout of the regulatory network, 
which seems to suggest that the signal originates in 
the central pair MTs and radial spokes, and moves 
outward through the IDAs, before arriving at the 
ODAs through the DRC and OIDs (Movassagh et al. 
2010). Another perplexing observation is the large 
discrepancy in velocity between individual dyneins. 
One possibility is that some may only be involved 
in beat initiation while others are required for propa¬ 
gation, similar to how one shifts gears on a bicycle 
when accelerating from a complete stop. It has also 
been shown that different dyneins are active on each 
side of the axoneme. This combination of temporal 
and spatial coordination of the various axonemal 
dyneins is likely to play a critical role in producing 
the final beafing pattern of the motile cilium or 
flagellum. 

Cytoplasmic Dynein 

While fhere are multiple axonemal dyneins, there is 
only one cytoplasmic dynein that is responsible for 
a wide variety of cellular functions. In contrast to the 
axonemal dyneins, which collectively promote cellular 
locomotion, cytoplasmic dynein’s many functions 
involve transport within the cell. Like certain ► myo¬ 
sins and ► kinesins, cytoplasmic dynein transports 
vesicles, organelles (e.g., mitochondria and nuclei), 
RNAs, and proteins to specific locations in the cell. It 
is also responsible for the collection of misfolded and 
aggregated proteins. These functions lead to cytoplas¬ 
mic dynein playing a key role in processes as diverse as 
fast axonal transport, golgi dynamics, protein 
recycling, and several distinct stages of mitosis includ¬ 
ing spindle orientation, nuclear migration, spindle 
assembly, and chromosomal segregation during ana¬ 
phase (Karki and Holzbaur 1999). 

Cytoplasmic dynein forms a homodimer made up of 
HCs, ICs, LCs, and light intermediate chains (LICs). 
Each motor contains a dimer of HCs, which binds 
directly to a dimer of one of two homologues of LIC 
(either LICl or LIC2). The HCs also bind a dimer of 
ICs, which provide a scaffold for dimers of one to three 
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different LCs depending on the species. There are two 
competing hypotheses on the role of the LCs in cyto¬ 
plasmic dynein. The first states that the LCs form 
a direct link between the IC and cargo because each 
LC is known to bind to many other proteins within the 
cell. However this model has been complicated by the 
discovery that the LCs use the same binding site for 
both cargo and the IC, meaning it can only bind one of 
the two at any point in time. A second model has 
emerged, which states that the LCs provide structural 
stability to a disordered region of the IC, which facil¬ 
itates its dimerization (Rapali et al. 2011). In this 
model, the specificity for various cargos would have 
to come from a different adaptor protein. 

Because cytoplasmic dynein is integral to so many 
diverse processes within the cell, it requires the 
attachment of adaptor proteins to create functionally 
distinct motors. Dynactin, a 1 MDa complex that 
targets dynein to the sites of its action and increases 
the motor’s ► processivity, is known to bind the 
N-terminus of the dynein IC. Dynactin also binds sev¬ 
eral other adaptors, such as Bicaudal D and the Rod- 
ZWlO-Zwilch (RZZ) complex, which are thought to 
help recruit dynein to cargos including mRNA and the 
kinetochore (Kardon and Vale 2009). Another complex 
made of LISl and NUDE or LISl and NUDEL binds 
dynein in at least two places and most likely has multi¬ 
ple regulatory effects on the motor. It forms a direct link 
between cytoplasmic dynein’s catalytic AAA-f domain 
and the MT, which suggests some effect on ATP catal¬ 
ysis. However, LIS 1/NUDE was also recently shown to 
bind the IC’s dynactin binding site in a mutually exclu¬ 
sive way, suggesting reciprocal regulation between 
dynactin and LISl/NUDE (Me Kenney et al. 2011). 
Many questions remain about this complex regulatory 
network, but it seems certain that these interactions are 
critical for cytoplasmic dynein’s versatility in carrying 
out its numerous functions. 

IFT Dynein 

Even though lET is most closely related to cytoplasmic 
dynein in phylogeny, general function, and structure, it 
is primarily found within or near cilia and flagella where 
it carries out its primary function of axoneme mainte¬ 
nance. Eigure 1 shows that IFT and cytoplasmic dyneins 
cluster together and are more closely related than either 
is to the axonemal dyneins. Functionally, IFT dynein is 


similar to cytoplasmic dynein in that it participates in 
intracellular transport, but it is located primarily along 
axonemes. IFT dynein’s main function is to bring com¬ 
ponents of IFT particles back to the cell body where they 
can be recycled. The mechanism is thought to be similar 
to that used by cytoplasmic dynein to carry cargos 
across the cell, except in this case the cargo is a form 
of proteinaceous raft and the track is the axoneme 
(Mikami et al. 2002). When IFT dynein is defective, 
flagella are extremely short and IFT particles accumu¬ 
late in their tips. IFT dynein also plays a role in signal 
transduction by relaying signals from the tip of 
a nonmotile cilium back toward the cell body. For 
example, mouse embryos lacking IFT dynein are not 
responsive to the Hedgehog signaling pathway and are 
therefore unable to specify ventral neural cell types in 
the neural tube (Scholey and Anderson 2006). 

Structurally, IFT dynein is similar to cytoplasmic 
dynein in that it also contains a dimer of HCs and 
a dimer of LICs; however, it is only known to bind 
one type of LIC called LIC3 (Tan et al. 2011). Until 
recently, IFT dynein was not known to have any ICs or 
LCs, but new evidence suggests that an IC homologue 
called FAP133 does indeed bind to IFT dynein’s HC as 
well as LCS - the same LC found in both cytoplasmic 
and axonemal dyneins (Rompolas et al. 2007). Much 
less is known about the regulation of IFT dynein than 
that of cytoplasmic dynein, but some likely adaptors 
that are necessary for proper function include IFT 
complex A (IFT139), IFT complex B (IFT81), and 
a subunit of kinesin-2 (which is required for antero¬ 
grade IFT) called FLA 10. 

Summary 

The dynein family is made up of axonemal, cytoplas¬ 
mic, and IFT dyneins. Each dynein consists of between 
one and three HCs containing the motor activity and 
various accessory chains. The axonemal dyneins are 
further divided into the ODA and ID As. The ODA 
contains either two or three different HCs. The IDAs 
include multiple monomeric dyneins and the Il/f, 
which consists of a heterodimer of HCs. Cytoplasmic 
dyneins are based on a single HC gene, but gain diver¬ 
sity through various combinations of adaptor proteins 
and splice isoforms. IFT appears to be a highly spe¬ 
cialized cytoplasmic dynein that diverged very early in 
the evolutionary history. 
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Dynein Motility: Mechanism 
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Advanced ICT Research Institute, National Institute of 
Information and Communications Technology, Kobe, 
Hyogo, Japan 

Synonyms 

Allosteric functions of AAA+ ATPase; Hand-over- 
hand mechanism; Molecular Motors; Processivity 

Definition 

The mechanism of dynein motility is that of bidirec¬ 
tional allosteric communications between the ATPase 
site in the AAA-i- ring and the microtubule-binding 
domain segregated far from the ATPase site, the con¬ 
certed actions of the mechanical components with the 
AAA-i- ring, and the communication between multiple 
motor domains through mechanical strain. 

Introduction 

Dynein is a minus end-directed microtubule-based 
motor, which was initially discovered as a high- 
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molecular-weight ATPase extracted from Tetrahy- 
mena cilia. It was named “dynein” after the cgs unit 
of force, the dyne (Gibbons 2011). Dynein is now 
known to belong to the AAA-i- protein family in the 
ATPase associated with various cellular activities 
(AAA-i- ATPase) superfamily, members of which 
mostly function as hexameric rings. Dynein itself con¬ 
sists of a functionally diverse family. The members of 
the family function in a wide variety of essential cel¬ 
lular processes in various eukaryotic cells including 
the trafficking of numerous cargoes, the positioning of 
the nucleus, the maintenance of Golgi apparatus, and 
movement of cilia and flagella (see also ► Dynein 
Family Classification). 

The heavy chain of dynein contains essential com¬ 
ponents for motility, the motor domain, which are 
localized in the C-terminal two thirds of the heavy 
chain (ca. 350 kDa fragment) (Fig. la). The mechani¬ 
cal components are as follows: 

• The linker, which works as a mechanical element, 
undergoes nucleotide-dependent power stroke, and 
drives the movement of microtubules 

• The head ring, which contains six AAA-i- modules 
termed AAA1-AAA6 

• The a-helical coiled-coil stalk, which extends from 
AAA4 and the microtubule-binding domain 
(MTBD) located at the tip of the stalk 

• The buttress/strut, which extends from AAA5 and 
interacts with the stalk 

The N-terminal third of the heavy chain forms the 
flexible tall. The tail constitutes an extended structural 
domain containing ca. 26% a-helix and is involved in 
dimerization/trimerization of dynein heavy chains and 
acts as a scaffold for the assembly of intermediate 
chains (ICs) and light chains (LCs) to form the dynein 
complex. 

During the mechanochemical cycle of dynein, bind¬ 
ing of ATP to the primary ATPase site of AAAI causes 
dissociation of the MTBD from the microtubule, and 
re-binding of the MTBD to the microtubule accelerates 
the dissociation of hydrolysis products from the AAAI 
ATPase site (Kon et al. 2005; Imamula et al. 2007). 
Because the sites of microtubule binding and primary 
ATP hydrolysis are spatially segregated (^25 nm), 
elucidation of the bidirectional communication path¬ 
way between them is an important issue in understand¬ 
ing dynein function. 

Biophysical characterization of the native dynein 
isolated from axonemes of several organisms, brain. 


yeast, and slime mold, and the heavy chains of cyto¬ 
plasmic dynein expressed from Saccharomyces 
cerevisiae (Reck-Peterson et al. 2006) and Dictyostelium 
discoideum (Shima et al. 2006) established the 
processivity of dyneins, the property that single dynein 
molecule can move continuously for several microme¬ 
ters on a microtubule without dissociation. For 
understanding the mechanism of this processivity, coor¬ 
dination between two or three motor domains connected 
by a presumably flexible tail is a key issue to be inves¬ 
tigated. Since the motor domains are intimately associ¬ 
ated with each other, force generation and kinetics of 
each motor domain could be modulated by intramolec¬ 
ular strain in multimeric dynein molecule. 

Many textbooks have been published on dyneins. 
For more details, see textbooks such as those by King 
(2011) or Hirose and Amos (2012). Details of the 
recent achievements on dynein motility have also 
been published (Sakakibara and Oiwa 2011; King 
and Pazour 2009). 

Molecular Organizations of Dynein 

Despite their distinct roles in cells, cytoplasmic and 
axonemal dyneins, forming large protein complexes, 
are constructed along the similar basic plan: The com¬ 
plexes contain heavy chains (HCs), several intermedi¬ 
ate chains (ICs) with WD repeats involved in cargo 
attachment, and at least three distinct classes of light 
chains (LCs), namely, the highly conserved LCS 
and members of the roadblock/LC7 and Tctexl 
(T-complex testis specific protein 1) protein families 
(Hirose and Amos 2012). 

Each dynein HC has a molecular mass of typically 
500-540 kDa, consisting of approximately 4,500 
amino acid residues. It contains a fundamental motor 
domain in the C-terminal ca. 320-380 kDa fragment, 
incorporating sites for both ATP hydrolysis and micro¬ 
tubule binding. The motor domain is composed of six 
AAA-I- modules each of 35^0 kDa, the microtubule¬ 
binding stalk, buttress/strut, and the linker, which 
connects the head ring and the tail and is normally 
docked onto the head ring (Fig. lb). A tail is formed 
of the N-terminal third of the heavy chain and consti¬ 
tutes an extended structural domain which mediates 
dimerization of the HCs and also provides a scaffold 
for ICs and light intermediate chains and a fixed attach¬ 
ment of dynein to the A tubule of an axonemal doublet. 
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Dynein Motility: 
Mechanism, 

Fig. 1 Overview of the 
molecular organization of 
dynein. (a) Linear map of the 
heavy chain of 
Chlamydomonas axonemal 
inner-arm dynein c 
(BAE19786, Chlamydomonas 
reinhardtii), showing the 
domain structure: tail {blue), 
linker (pink), AAA-t modules, 
stalk, and the microtubule¬ 
binding domain (MTBD). 

(b) Schematic drawings of the 
dynein heavy chain in apo 
state and electron microscopy 
class averages of dynein c in 
apo state. The six AAA-t 
modules are arranged in a ring, 
which are indicated in red, 
orange, yellow, cyan, green, 
and magenta for AAAI 
through AAA6. Each module 
is composed of the N-terminal 
large domain (the larger 
circles) and the C-terminal 
small domain (the smaller 
circles). Dynein has two 
distinct faces. The face on 
which the linker docked is the 
linker face corresponding to 
the face seen in the left view 
and the face on which 
C-terminal localized is the 
C-terminal face corresponding 
to that seen in the right view in 
the EM class averages (These 
figures are reproduced and 
modified from Burgess et al. 
(2003) and Sakakibara and 
Oiwa (2011)) 


a C. reinhardtii axonemal dynein c 

Head ring 

I-1 

Motor domain 


tail linker #1 #2 #3 #4 stalk #5 #6 C-domain 



I-1-1-1-1-1 

1 1000 2000 3000 4000 5000 



The Left View (Linker Face) The Right View (C-terminal Face) 


Mechanical Components of Dynein 

AAA Ring 

Sequence of the AAA Ring 

Dynein is a member of the AAA-i- ATPases, members 
of which mostly function as hexameric rings. How¬ 
ever, dynein is so unusual that six AAA-i- modules, 
each of 35^0 kDa, are linked in tandem in a single 
polypeptide and are not identical. These AAA-i- mod¬ 
ules fold into a ring-shaped structure approximately 


13 nm in diameter, similar to other AAA-i- ATPase 
(Figs, lb and 2a) (Hirose and Amos 2012). 

The first four AAA-i- modules (AAA1-AAA4) con¬ 
tain a highly conserved Walker A motif (GXXXXGKT, 
a so-called P-loop) and a Walker B motif (DEXX), and 
are thought to bind nucleotides. In contrast, the 
sequences of the two C-terminal most AAA-i- modules 
(AAA5 and AAA6) have highly degraded Walker 
motifs. A principal site of ATP hydrolysis was mapped 
to the Walker A and B motifs of AAAI. 
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Dynein Motility: Mechanism, Fig. 2 (a) A Crystal structure 
of the Dictyostelium cytoplasmic dynein (PDB ID: 3AY1). The 
six individual AAA+ modules are highlighted in color same as in 
Fig. 1. The figure was prepared using PyMOL provided by 
DeLano Scientific LLC (http://www.pymol.org). The left and 
right panels show the linker face and the C-terminal face respec¬ 
tively. The linker (in pink) is mainly composed of a series of 
helices. In the side view of the AAA+ ring, the linker is observed 
arching over the linker face of the ring and having contacts only 
at the AAAI and AAA4 at its terminal portion, (b) The crystal 
structure of AAAI and AAA2 of yeast cytoplasmic dynein 


(PDB ID: 3QMZ). The large domain of the AAA module is 
composed of five stranded beta sheet (S1-S5) flanked by five 
helices (H0-H4). The large domain is connected with the 
small domain composed of five helices (H5-H9). The interface 
between the large domain of AAAI and the large domain of AAA2 
is the primaiy ATPase site of dynein. The residues of GKT in 
the P-loop are indicated as spheres, (c) Asymmehic structure of the 
yeast cytoplasmic dynein viewed from the linker face. The large 
domains of AAAI, 2, 3 and AAA6 and AAA4 with the stalk and 
AAA5 with the buttress/stmt are shown. The arrows indicate large 
gaps between AAA1-AAA2 and AAA5-AAA6 
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It seems likely that the additional Walker motifs (in 
AAA2-AAA4) act in a regulatory manner by binding 
either ADP or ATP. In cytoplasmic dyneins, the 
ATPase site in AAA3 plays important roles in the 
motility since ATP binding and hydrolysis mutations 
in AAA3 produce severe impairment in dynein motil¬ 
ity (Kon et al. 2004). Comparable mutations of the 
ATPase sites in AAA2 and AAA4 have more subtle 
effects on motility. In some axonemal dyneins, the 
presence of ADP is known to be essential for motility 
in vitro, and in others, ADP increases the velocity of 
microtubule driven by dyneins (Yagi 2000), indicating 
that ADP binds to at least one of these AAA-i- modules. 
Crystal structures of the dynein motor domain show 
that the nucleotide-binding Walker A motifs lie close 
to the interface between adjacent modules (Fig. 2b) 
(Carter et al. 2011; Kon et al. 2011). Interactions 
between adjacent AAA-i- modules through their nucle¬ 
otide pockets support the idea that they may act in 
concert to produce a functional motor. 

Crystal Structure of the AAA Ring 
Two crystal structures of the motor domain of cyto¬ 
plasmic dyneins of Saccliaromyces cerevisiae (Carter 
et al. 2011, internally truncated motor domain in apo 
state at 6 A resolution, the PDB accession number, 
3QMZ) and Dictyostelium discoideum (Kon et al. 
2011, a full-length motor domain in ADP state at 
4.5 A resolution, the PDB accession number, 3AY1) 
have been reported (Fig. 2). The crystal structures 
provide valuable new insights into dynein organization 
and communication mechanism between the AAAI 
ATPase site and MTBD and they also validate many 
of insights obtained from previous electron micros¬ 
copy studies. 

The overall appearance of the motor domain is 
similar between in yeast and in Dictyostelium, and is 
similar to negatively stained electron microscopic 
images. The AAA-i- modules are arranged asymmetri¬ 
cally in the motor domain, which are oriented at dif¬ 
ferent angles and have different packing between 
adjacent AAA-i- modules. The linker is composed of 
a series of a-helices arching over the center of 
the AAA-I- ring and its two ends contact the ring 
while the central portion bows well above the central 
hole (Fig. 2a). This side of the face is termed the 
“linker face.” The C-terminal domains of yeast and 
Dictyostelium dynein differ in size but they lie on the 
side of the ring (C-terminal face) opposite from the 


linker face. Therefore, the ring is not simple planar ring 
but it has two distinct faces. 

An individual AAA-i- module is composed of an 
N-terminal large domain with a/p Rossmann fold 
and a C-terminal a-helical domain (small domain) 
(Fig. 2b). The a/p large domain contains a five 
stranded P-sheet (S1-S5) flanked by five helices 
(H0-H4) and the nucleotide-binding Walker 
A and B motifs. The a-helical bundle small domain 
(H5-H9) extends from the C-terminus of the large 
domain and is connected with the large domain by 
a flexible link. Owing to the flexible link, there is 
a variation in angle between the large and small 
domains. However, all of the small domains pack 
against the neighboring large domains in a similar 
manner and create six rigid units in the dynein motor 
domain. The order of the AAA-i- modules is clockwise 
around the ring when the motor domain is viewed from 
the linker face. The small domains are located clock¬ 
wise with respect to each large domain when viewed 
from the linker face (Carter et al. 2011; Kon et al. 
2011 ). 

When the AAA-i- ring of the yeast dynein is viewed 
from the linker face, the AAA-i- modules display 
uneven spacing. Large gaps are seen between the 
large domains of AAAI and AAA2, and also between 
those of AAA5 and AAA6 (Fig. 2b, c). In contrast, in 
the AAA ring of Dictyostelium structure, the gap 
between AAA5 and AAA6 is not seen, and the gap 
between AAAI and AAA2 is partially filled by the 
linker (Cho and Vale 2012). Additionally, the 
Dictyostelium dynein motor domain has more symmet¬ 
rical and planar ring than the motor domain of the 
yeast. Although there are many possibilities to explain 
these differences, one fascinating idea is that the dif¬ 
ferences are the result of the different nucleotide states 
of the two molecules (Cho and Vale 2012). It is noted 
that the head ring of axonemal dynein in ADP- 
vanadate state observed in negative staining EM 
images is much more symmetric than that in apo 
state. This observation is not coincidental with the 
differences found in crystal structure between yeast 
and Dictyostelium. 

Stalk/MTBD 

The stalk was originally modeled as a structure 
extending from the ring at AAA4 and returning 
to the ring at AAA5. The stalk had been seen bifur¬ 
cated near its base in electron microscopy samples 
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(Burgess et al. 2003). The crystal structures now reveal 
that the stalk Is the variation of H7 and H8 of the small 
domain of AAA4 and the H7 helix (CCl) extending 
out from and the H8 helix (CC2) returning back to the 
small domain of AAA4 form the coiled-coil stalk. 
A microtubule-binding domain (MTBD) is localized 
at the tip of the stalk and forms a small globular domain 
(Fig. 3) (Carter et al. 2008). Although poor sequence 
conservation exists in this globular domain at the 
stalk tip, mutagenesis of conserved residues clearly 
interferes with microtubule binding (Koonce and 
Tikhonenko 2000). Microtubule binding of the stalk 
tip was also examined with a recombinant stalk-tip 
peptide. The stalk-tip peptide was observed to bind to 
a microtubule with a periodicity of 8 nm and to share 
the binding region on the microtubule with kinesin 
(Mizuno et al. 2004). 

During the mechanochemical cycle of dynein, bind¬ 
ing of ATP to the primary ATPase site of AAAI causes 
dissociation of dynein from the microtubule, and bind¬ 
ing of the MTBD to the microtubule accelerates the 
dissociation of hydrolysis products from the ATPase 
site. Because the sites of microtubule binding and 
primary ATP hydrolysis are spatially segregated 
(^25 nm), elucidation of the communication pathway 
between them is an important issue in understanding 
dynein function. To investigate the communication 
mechanism of the stalk, a series of fusion constructs 
was designed, in which the microtubule-binding 
domain, along with a portion of its predicted coiled- 
coil stalk, was fused onto a stable antiparallel coiled- 
coil base found in the native structure of seryl-tRNA 
synthetase (Gibbons et al. 2005). Using the fusion 
peptide, the optimal alignment between the hydropho¬ 
bic heptad repeats in the two strands of the stalk coiled- 
coil was examined. Alterations in the phase of the 
heptad repeats in the CCl changed the affinity of the 
MTBD toward the microtubules. Finally, identified 
was the pattern of two alternate registries (a and P) 
having high and low microtubule-binding affinity, 
respectively (Gibbons et al. 2005). On the basis of 
these results, it was hypothesized that during the mech¬ 
anochemical cycle, the two strands of its coiled-coil 
stalk would undergo a small amount of sliding dis¬ 
placement as means of communicating between the 
AAA-i- core of the motor and the MTBD. More con¬ 
clusive support for this hypothesis was provided by the 
usage of expressed dynein motor domain, in which the 
coiled-coil of the stalk was trapped at three specific 


MT binding interface 

I-1 


Dynein Motility: Mechanism, Fig. 3 The atomic structure of 
the distal part of the stalk and the MTBD of mouse cytoplasmic 
dynein in the weakly binding state (PDB ID: 3ERR). Two 
a-helices (CC1 in gray and CC2 in magenta) form coiled-coil. 
They connect to the six-helix bundle (HI to H6) which forms the 
MTBD. The interface to the microtubule is thought to he HI and 
H3. The proline residues at 3409 and 3285 are indicated as 
spheres. This staggered pair of conserved prolines is associated 
with a kink in the stalk (The figure was prepared using PyMOL 
provided by DeLano Scientific LLC (http://www.pymol.org)) 

registries using oxidation to disulfides of paired cyste¬ 
ine residues introduced into the two helices (Kon et al. 
2009). It was shown that coupling between ATPase 
activity and the binding activity to microtubules 
depends upon the registry of the coiled-coil. These 
works suggested that the CCl and CC2 might slide 
relative to one another, undergoing a half-heptad shift 
in different nucleotide states and influence affinity 
of MTBD to microtubules. The model proposed 
suggested that CCl might be pulled from its base 
relative to the stationary CC2. However, the crystal 
structure of the motor domain challenges this model 
(see below). 

Structure of the MTBD 

The research on MTBD had been proceeded and the 
2.3 A resolution coordinates of MBTD in a weakly 
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binding conformation (P registry) and the distal por¬ 
tion of the coiled-coil stalk of mouse cytoplasmic 
dynein were reported (Fig. 3) (Carter et al. 2008). It 
is confirmed, as expected, that the stalk is a coiled-coil. 
The MTBD consists of a bundle of six a helices 
(HImtbd to H6mtbd) and the interface against micro¬ 
tubules is made up of three helices, H1 mtbd> H3mtbd> 
and H6mtbd- These three helices form the electropos¬ 
itive surface composed of high density of conserved 
residues. The helices HImtbd and H3mtbd are placed 
in a groove at the interface between the a- and 
P-tubulin subunits, which is the same location that 
kinesin motors use for binding to microtubules, as 
shown in the previous study (Mizuno et al. 2004). 

The coiled-coil of the stalk is not straight but bent 
near MTBD by a pair of staggered highly conserved 
proline residues (Fig. 3), with the regular packing of 
hydrophobic residues in the coiled-coil core being 
disrupted in the region between the prolines. When 
the heptad registry resumes after the prolines, the reg¬ 
istry of CCl has slipped by one half-heptad relative 
to that of CC2. The distal portion of CC2 makes exten¬ 
sive hydrophobic interactions with H2mtbd, H4mtbd, 
HSmtbd, and H6mtbd, whereas CCl makes only a few 
contacts with H4mtbd before joining directly into 
HImtbd- It was again suggested that communication 
along the coiled-coil of the stalk is effected by inter¬ 
strand sliding and this asymmetry at the interface 
between the stalk and MTBD plays an important role 
in the dynein mechanochemical cycle. The crystal 
structures of the motor domain show, however, that 
CCl and CC2 merge into the well-packed helices of 
the AAA4 small domain, it is not likely that either 
helix could move at its base and that the sliding could 
occur along the entire helix. The discovery of buttress/ 
strut sheds light on the communication mechanism 
(see Buttress/Strut) and prompts a new model. 

Buttress/Strut 

The new and unexpected structure found in crystals of 
the dynein motor domain is an additional antiparallel 
coiled-coil near the base of the stalk, emerging from 
the AAA5 small domain (Carter et al. 2011; Kon et al. 
2011). It is an extension of H5 and H6 helices of the 
AAA5 small domain. It bends sharply at its middle and 
extends toward and makes contact with the stalk at 
its tip forming a Y-shaped structure together with 
the stalk. Since this Y-shaped structure had been pre¬ 
viously seen in electron microscopic images of 


axonemal dynein, the buttress/strut may be 
a common architecture in all types of dynein. The 
structure is called buttress (Carter et al. 201 1) or strut 
(Kon et al. 201 1). 

The interaction between the stalk and the buttress/ 
strut provides insight for the regulation mechanism of 
MTBD by the AAA-i- modules in the head ring. As 
mentioned above, the base of the stalk is composed of 
the well-packed helices of the AAA4 small domain, it 
is not likely that relative sliding of CCl and CC2 can 
occur at the stalk base. The discovery of the buttress/ 
strut suggests that through the interaction of the 
buttress/strut and the stalk, relative sliding between 
CCl and CC2 is propagated at the stalk-buttress/strut 
contact site. Thus, the buttress/strut might relay rigid 
body motion between AAA-i- modules into shear 
motions between the helices of the stalk coiled-coil. 

Linker 

The linker is a structure located in the portion of the tail 
proximal to AAAI, which serves as a connection 
between AAAI and the main part of the tail (Fig. lb). 
The existence of the linker was hrst indicated in 
images of negative-stained monomeric axonemal 
dynein (Burgess et al. 2003). Though the linker is 
normally docked onto the head ring, it is revealed as 
a relatively large structure about 2-nm wide and 10-nm 
long when the linker is disrupted and undocked from 
the head ring. 

It has been suggested that the linker is involved in 
generation of force through its interaction with the 
head ring. Two-dimensional analysis on negative- 
stained axonemal dynein described the conformations 
of the dynein molecule in two different nucleotide 
states which mimic the post- and pre-power stroke 
conformations of the motor (Fig. 4). In the absence of 
nucleotide (post-power stroke conformation, state I), 
the tail emerges near the base of the stalk. In the 
presence of ATP and vanadate, which forms 
a dynein-ADP-Vi complex that mimics the dynein- 
ADP-Pi conformation (pre-power-stroke conforma¬ 
tion, state II), the tail emerges further away from the 
stalk base. These observations were interpreted that the 
linker swings relative to the head ring (Burgess et al. 
2003). The existence and the movement of linker have 
subsequently been confirmed in cytoplasmic dynein, 
identified as the N-terminal region of the motor 
domain using a green fluorescent protein (GFP)- and 
a blue fluorescent protein (BFP)-tagged constructs by 
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Fig. 4 Proposed mechanisms 
of dynein’s power stroke. In 
the right panels'. Electron 
microscopy on negatively 
stained samples followed by 
single particle analysis 
succeeded in capturing two 
distinct conformations of 
inner-arm dynein molecules 
isolated from Chlamydomonas 
flagella in the ADP-V; (state 
II) and apo (state I) state. On 
ATP binding to the ATPase 
site in AAAI, the origin of the 
tail emerging moves far from 
the stalk (state II, right top 
panel). Upon product release, 
the origin of the tail emerging 
from the ring is closer to the 
stalk (state I, right bottom 
panel), suggesting movement 
of the linker domain. The 
movement of the linker leads 
to the contraction of the whole 
dynein molecule. The 
microtubule is thus dragged by 
this contraction 



negative stain electron microscopy and the Forster 
Resonance Energy Transfer (FRET) (Kon et al. 2005). 
In the absence of nucleotide or in the presence of ADP, 
GFP inserted at the linker’s N-terminus lies close to 
AAA4, at the base of the stalk, in the so-called unprimed 
position, whereas in the presence of ATP and vanadate, 
the GPP lies close to AAA2, in the primed position. 

Crystal Structure of the Linker 

The crystal structure of dynein showed that the linker 
is composed of helical bundles and does not sit flat on 
the head ring but rather arches over it (Fig. 2a). The 
linker is composed of four predominantly helical 
subdomains (from N-terminus, subdomain 1, 2, 3, 
and 4). The C-terminus subdomain 4 interacts with 
AAAI and the part of AAA6 and is connected into 
AAAI. The N-terminal subdomain 1 contacts AAA5 
in the yeast motor domain (apo state), or AAA4 in the 
Dictyostelium head ring (ADP state) and this contact 
looks tenuous and may break and dissociate from 
AAA4 or 5 during the ATPase cycle. Although the 
significance of the difference in the contact point 


between two crystal structures remains unclear, it 
could represent conformational changes upon ADP 
release during dynein’s ATPase cycle. 

Tail 

The amino-terminal tail is involved in dimerization/ 
trimerization of dynein heavy chains and acts as 
a scaffold for the assembly of different ICs and LICs 
to form the dynein complex. Since an expressed cyto¬ 
plasmic dynein without a native tail but with 
a substituted tail shows intact processive movement 
in vitro, these subunits are not essential for dynein 
motility in vitro. However, they may regulate dynein 
motility in vivo and recent data indicate that the non- 
catalytic subunits link dynein to cargos and to several 
adaptor proteins that regulate dynein function. For 
example, missense mutations in the tail domain of 
cytoplasmic dynein in mouse cause neurodegenerative 
disease. The best-characterized model of dynein dys¬ 
function is the Legs at odd angles (Loa) mouse 
(Hafezparast et al. 2003). This mutation is thought to 
affect homo-dimerization of the dynein heavy chains 
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and/or association of the heavy and intermediate 
chains. Single-molecule nanometry on the mutant 
dynein showed that dynein purified from the mutant 
mice has lower processivity and more frequently 
shows bidirectional motility along a microtubule and 
sidesteps to adjacent protofilaments than the wild type 
dynein does (Ori-McKenney et al. 2010). The results 
suggest that the mutation in the tail domain of dynein 
results in increased flexibility of the dynein molecule 
and diminished gating between the motor domains. 

While plus end-directed transports in cells are car¬ 
ried out by many kinesin families with wide range of 
tail domains, despite dynein being involved in large 
repertories of cellular functions, all minus end- 
directed transports within the cytoplasm are carried 
out by a single cytoplasmic dynein. The tail domain 
is thus important to mediate interaction with various 
types of cargoes by recruiting specific and proper 
adaptor proteins. 

Force-Generating Mechanism 

The observations of negatively stained EM samples, 
combined with the structural information obtained 
from crystal structures of the motor domain, can pose 
the speculative model of dynein force generation 
although the confirmation of the model needs a high- 
resolution structure in distinct nucleotide states (Cho 
and Vale 2012) (Fig. 5): In the apo state, dynein tightly 
binds to a microtubule through the MTBD and the 
AAA1-AAA2 gap is widely open. The N-terminal 
end of the linker is docked at the stable position near 
AAA4/5. Upon binding of Mg-ATP to AAAI ATPase 
site, the gap is then closed and the large rearrangement 
among AAA-i- modules occurs, which is referred to as 
“domino effect” (Carter et al. 2011). The information 
of ATP binding propagates along the ring. The infor¬ 
mation pathway from AAAI to the MTBD would be 
two, one is from AAAI to AAA4 through AAA2 and 3, 
and the other is from AAAI to AAA5 through 
C-terminus and AAA6 (Kon et al. 2011). The 
rearrangement of the modules in the AAA-i- ring 
changes the interaction between buttress/strut and the 
stalk. This change elicits the relative shear of CC1 and 
CC2 of the stalk leading to the conformational change 
of the MTBD and to dissociation of the MTBD from 
the microtubule. The linker detaches from the AAA4/5 
docking point and changes its orientation on the head 


ring causing the tail to emerge far from the stalk (state 
II). When ATP is hydrolyzed at the AAAI, the gap 
between AAAI and AAA2 may change. This change 
propagates along the ring and modifies the buttress/ 
strut-stalk interaction leading to the increase in the 
affinity of MTBD to the microtubule. The MTBD 
then finds the docking site on the microtubule and 
binds. This binding leads to open the AAA 1-2 gap 
enhancing the product release. Upon release of prod¬ 
ucts, the linker orientation on the ring changes, bring 
the tail emergence point closer to the stalk (state I). The 
linker changes its orientation by switching between 
two differently docked positions on the head ring, 
thus producing a rotation of the head ring that causes 
the stalk to swing. 

Additional nucleotide binding to AAA2, 3, and 4 
may work as modulator for signal transduction through 
the ring. The high resolution of crystal structures with 
various nucleotide states will be required for a detailed 
understanding of the force-generating mechanism. 

The stalk is not rigid since stiffness of a 15-nm 
length of coiled-coil peptide clamped at one end can 
be estimated to be about 0.4 pN nm“*. The range of 
conformations for an axonemal dynein molecule 
observed in negatively stained samples provides also 
an estimation of the stiffness of the stalk, which is 0.5 
pN nm“* in apo molecules and 0.14 pN nrn”^ in ATP- 
vanadate molecules. These estimates imply that stalk is 
too flexible to work as a rigid lever (Oiwa and 
Sakakibara 2005). 

Cryo-electron microscope images of whole dynein 
molecules interacting with microtubules have revealed 
that, even though the tail and linker shift relative to the 
ring and stalk as in isolated dynein molecules, the stalk 
orientation on the microtubule remains fixed (Ueno 
et al. 2008). The observation provides strong evidence 
to the concept of dynein as an ATP-dependent winch. 
Furthermore, winch-type motion was also observed in 
an axoneme. Cryo-electron tomograms of axonemes in 
the apo state were compared to those obtained in the 
presence of ADP-vanadate. Global changes of dynein 
arm complexes were shown and several key changes in 
dynein structures were found. Although the stalks are 
not clearly visible in the tomograms, close examina¬ 
tions showed that the stalks typically tilt toward the 
proximal end of the axoneme (the base of the axo¬ 
neme) in both nucleotide conditions. The dynein head 
rings were observed to move 8 nm toward the distal 
end of the axoneme upon the release of the nucleotide 
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Dynein Motility: Mechanism, Fig. 5 A highly speculative 
model how the primary ATPase site bidirectionally communi¬ 
cates with the MTBD. (a) Upon ATP binding to the primary 
ATPase site at AAAI, the wide gap between AAAI and AAA2 
closes, triggering a global conformational change in the AAA-l- 
ring. The gap between AAA5 and AAA6 might close and this 
closure shifts the large domain of AAA6 toward the small 
domain of AAA5. This shift changes the interaction between 
buttress/strut and the stalk, (b) The buttress/strut could kink the 
stalk and induce the shear between helices in the coiled-coil 
stalk. The shear would elicit the change in the conformation of 
the MTBD and then the MTBD releases from the microtubule, 
(c) The hydrolysis of ATP occurs at the AAAI site and as Carter 
et al. (2011) proposed, a “domino effect” which is triggered by 

(Movassagh et al. 2010). Since the MTBD is attached 
to the adjacent microtubule, the movement results in 
dragging the adjacent microtubule distally and produc¬ 
ing the shear. 

The winch model can explain the result of Carter 
et al. (2008), in which cytoplasmic dyneins with its 
stalk coiled-coil either lengthened or shortened by 
seven heptads move toward the minus end of 
a microtubule, irrespective of the length of the stalk. 
This result is remarkable since these changes in the 
stalk length would be predicted to rotate the head ring 
by 180° and reverse the direction of dynein movement 
according to the lever arm model. To explain the 
directionality of dynein, it is proposed that the head 


the AAA-r ring contraction after ATP binding at AAAI causes 
the distal part of the linker to detach from the ring, (d) Through 
Brownian search, the MTBD rebinds to the microtubules and 
this induces the relative shift in the stalk and modulates the 
interaction between the buttress/strut and stalk. Increasing 
the gap width at the interfaces of AAA5-6 and AAAl-2 enhances 
the release rate of the hydrolysis products, (e) Coupled with this 
product release, the linker docks onto the unprimed position 
(power stroke). This movement of the linker makes the contrac¬ 
tion of the whole molecule. Although the scheme described here 
is partly based on the experimental data, details are highly 
speculative. The further and precise discussion should await 
the crystal structures in different nucleotide states with the 
higher resolutions 

ring does not elicit a lever-like rotation of the linker 
domain perpendicular to the stalk, rather contraction 
where the force vector of the linker domain’s confor¬ 
mational change is directed parallel to an angled stalk. 

Mechanical Properties of Dynein 

Sliding Disintegration of Axonemes 

Axonemes isolated from organisms (sea urchin sperm, 
Tetrahymena, and Chlamydomonas are the major 
resources of the axonemes) are treated with detergents 
to remove cell membrane and then gently digested 
with a protease, trypsin, or elastase. The protease 
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treatment causes disruption of the axonemal substruc¬ 
tures connecting the doublet microtubules. Addition of 
Mg-ATP to these axonemes elicits sliding disintegra¬ 
tion of these axonemes, where doublet microtubules 
extrude from the axoneme. This method has been 
useful in elucidating the basic properties of axonemal 
dynein motility (Shingyoji 2009 and the references 
therein). Together with using the axonemes from 
dynein mutants, sliding velocities of extruding doublet 
microtubules provide mechanical properties of subspe¬ 
cies of axonemal dyneins. In addition, attachment of 
fine glass needles to these extruding microtubules pro¬ 
vided the information of the force generation of dynein 
and characterization of the force-velocity relation of 
dynein-microtubule sliding (Oiwa and Kojima 2009) 
and the references therein). 

In Vitro Motility Assays 

One crucial advance in the study of motor proteins (not 
only for dyneins) has been development of in vitro 
motility assays. In these assays, motor proteins can 
be adsorbed to inert surfaces, yet retain the ability to 
generate force and translocate or move along hlament 
tracks, actin filaments, or microtubules. These assays 
offer many advantages including use of small amounts 
of purihed proteins, rapid and precise assessment of 
motility parameters, and convenient changes in the 
experimental conditions. 

In vitro motility assays have two types of geome¬ 
tries: surface-gliding assay and bead assays. In surface¬ 
gliding assays, motor proteins are attached onto a glass 
surface using specific or nonspecific interactions. 
Upon addition of actin filaments/microtubules and 
ATP, motor action may be observed as their gliding 
over the surface. In the bead assay, hlament tracks are 
hxed on a glass surface through antibodies, hlament- 
associated proteins, silane couplers, or nonspecihc 
interactions. Glass or polystyrene beads with a size 
on the order of a micrometer are coated with protein 
motors using either specihc or nonspecihc interactions. 
When these beads are introduced into a how cell in the 
presence of ATP, they bind to hlament tracks hxed on 
the surface and start moving along these tracks. Their 
movement can be observed using a bright-held or dark- 
held microscope. When huorescent beads or huores- 
cently labeled beads are used, the movement of the 
beads can be observed with a huorescence microscope. 

These two motility assays are often used to charac¬ 
terize ensemble properties of motor proteins. Some of 


the salient features which have been determined about 
mechanical properties of dynein are listed below 
(Kojima et al. 2009 and the references therein): 

• When being examined in the standard in vitro motil¬ 
ity assays, dyneins studied to date move unidirec- 
tionally toward the minus end of a microtubule. 

• Monomeric dynein or constructs containing the 
motor domain can move in ensemble and thus, 
the minimal components of dynein motility are 
the 380 kDa domain which contains a linker, six 
AAA-i- modules, a microtubule-binding stalk, and a 
buttress/strut. 

• Each of the heavy chains of axonemal dyneins so far 
studied has distinct motility. 

• Some inner-arm dyneins can generate torques and 
rotate a microtubule about its long axis while it is 
moved on the dynein-coated surface. 

• Some inner-arm dyneins require a trace amount of 
ADP for their microtubule motility. 

• Cytoplasmic dynein 1 is a highly processive motor. 

• Some inner-arm dyneins, even some single-headed 
or heterodimeric motors, show certain degree of 
processivity in vitro. 

Single Molecule Assays 

In vitro motility assays have paved the way to single¬ 
molecule studies on dynein. In recent decades, the 
development of a number of technologies, such as 
atomic-force microscopy, optical-trap nanometry, 
and fluorescence imaging with nanometer accuracy 
(FIONA) (Yildiz et al. 2003), has provided tools for 
studying the dynamics of single molecules in situ over 
time scales from milliseconds to seconds. The single¬ 
molecule sensitivities of these methods permit studies 
to be made on conformational changes and functions of 
dyneins that are masked in ensemble-averaged exper¬ 
iments. Processivity, step size, and dwell time distri¬ 
butions are among properties that can be directly 
measured by single-molecule techniques. Our under¬ 
standing of the functions of dyneins has benefited 
considerably from the application of single-molecule 
techniques (Oiwa and Sakakibara (2005) and refer¬ 
ences therein). 

However, single-molecule measurements on force 
generation of dyneins have raised some issues to be 
resolved: The stall force generated by single dynein 
molecules varies from measurements to measure¬ 
ments: for axonemal dyneins, a value of 1 ^ 2 pN in 
single-headed inner-arm dynein, ^6 pN in an inner 
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dynein arm in an axoneme, and 4.7 pN in outer-arm 
dynein; for cytoplasmic dyneins, a value of ^ 1 pN in 
bovine cytoplasmic dynein, 3^ pN and 7-8 pN in 
porcine cytoplasmic dynein, and 8 pN in yeast cyto¬ 
plasmic dynein. The variation in force may depend 
upon the type of dynein used and distinct roles that 
dynein plays in vivo. It is also suggested that the 
geometry of the force measurements may influence 
the force and the mode of motility. Nucleotide concen¬ 
trations may also effect on the force generation and 
modes of movements. The precise measurements and 
direct comparison of the force generated by cytoplas¬ 
mic dyneins are now required since they will provide 
important information to reveal the mechanism of 
cargo transport hy a number of or several types of 
motors mechanically coupled to each other. The step 
size of dynein is still controversial as described in the 
next session. 

Mechanism of Processivity and Modes of 
Movement 

Processivity places severe constraints on the mecha¬ 
nism for motion generation of motor proteins. It 
requires that at least one portion of the motor protein 
must always be tightly bound to the track so that this 
attachment can bear the load. Other portion can then be 
weakly bound and able to advance to the next binding 
site on the track. For two-headed protein motors, this 
requirement is fulfilled if one of the two heads remains 
bound during the mechanochemical cycle. 

Cytoplasmic dynein 1 shows a high processivity 
and is the best-studied dynein at the single-molecule 
level (Reck-Peterson et al. (2012) and references 
therein). It can take micrometer-scale movements along 
a microtubule without dissociating from the microtubule. 
Through the creation of a cytoplasmic dynein constract 
that can be converted between monomeric and dimeric 
states by a small molecule, rapamycin, it is demonstrated 
that processive motion requires the dimerization of two 
motor domains, although the endogenous dimerization 
domain (tail) is not essential for the processivity (Reck- 
Peterson et al. 2006). 

The step sizes and modes of movement of cytoplas¬ 
mic dyneins are under debate. Cytoplasmic dynein 
purified from bovine brain primarily takes large steps 
(24-32 nm) at low loads, but with decreased step size 
from 32 to 8 nm with increasing load to its stall force 
(Mallik et al. 2004). In addition, when multiple dynein 
molecules interact with a microtubule and contribute 


to movement, the dynein molecules move predomi¬ 
nantly in 8-nm steps. In contrast, movement of single 
cytoplasmic dynein molecules purified from porcine 
brain (Toba et al. 2006) and a functional recombinant 
dimeric dynein of the budding yeast 5. cerevisiae 
(Reck-Peterson et al. 2006) were analyzed with 
a high spatial precision tracking technique and were 
stepwise with a regular 8-nm step size, irrespective to 
the load. The recent two independent studies have, 
however, challenged the tight coordination between 
two heads and the regular step size (Qiu et al. 2012; 
DeWitt et al. 2012). 

Communication between two heads enhances the 
efficiency of the processivity. Several models have 
been proposed for synchronizing the chemical states 
of the two heads with some phase difference. All these 
models use intramolecular strain to gate chemical 
events over large distances. The hand-over-hand 
mechanism originally proposed in kinesin studies can 
explain cytoplasmic dynein processivity, which allows 
detachment of one head while it steps along the fila¬ 
ment and the other head remains attached, thereby 
preventing diffusing away. However, on the basis of 
experimental findings on cytoplasmic dyneins, 
a molecular model (which they called the “alternating 
shuffling model”) was proposed to explain how 
processive motion is achieved by cytoplasmic dynein 
(Reck-Peterson et al. 2006). The model is conceptually 
similar to the hand-over-hand model: Two dynein heads 
alternate taking 16-nm steps, while the centroid position 
of the molecule moves by 8 nm for each step. The large 
dimensions of the head ring do not allow the head to 
alternately “swing” forward at each step, but overlap 
part of the head ring during the stepping motion. 

The model requires coordination between two motor 
domains. Optical-trap studies of dynein force produc¬ 
tion suggest that this coordination is carried out by strain 
transmitted through the linkage between the two head. 
Besides the intramolecular strain, physical contacts 
between two head ring during the movement along 
a protofilament could mediate head-head coordination. 

However, given apparently large size, flexibility of 
a dynein molecule and the magnitude of the suggested 
power stroke in relation to the tubulin lattice, the 8-nm 
step size displayed by dimeric dyneins is surprisingly 
small. This is in contrast to other linear motors since 
the step size of a motor, except myosin VI, can be 
predicted to be proportional to the length of its power 
stroke. One possible explanation is that when a dimeric 
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dynein moves on a microtubule taking 8-nm steps, the 
molecule could be compact and stiff with its two head 
rings in close, intimate association as is seen in axone- 
mal dyneins in situ, and in electron microscopic obser¬ 
vations of the phi (<l))-shaped structure of cytoplasmic 
dynein, in which the tails of the heavy chains are 
close to each other and the head rings are partially 
overlapped (Fan and Amos 2001). 

Dynein apparently does not have to walk along 
a single protohlament. Precise measurements showed 
that fluorescently labeled dynein also displayed lateral 
stepwise movements, which usually occurred simulta¬ 
neously with forward stepping. This shows that dynein 
has the reach or flexibility to occasionally land on an 
adjacent protofilament. Furthermore, the Loa mutant 
cytoplasmic dynein showed increased frequency of 
sideways steps and backward steps and decreased 
processivity. Modulation of the dynein tail domain 
could affect coordination between two motor domains. 
Although further studies will be necessary to clarify 
the mechanism of the coordination, modulating the tail 
domain could be one of mechanisms of regulation of 
dynein motility. 

In addition, dynein shows large variation in the 
mode of movements but the mode-switching mecha¬ 
nism is not well understood (Sakakibara and Oiwa 
(2011) and references therein). Several research 
groups have shown that dynein can display a diffusive 
mode of bidirectional motion along the microtubule as 
well as processive stepwise motion. Regulation mech¬ 
anisms may exist that switch from the stepping mode 
to the diffusive mode and vice versa. Variation of the 
modes may reflect physiological roles of dyneins 
in vivo. Further work will be required to resolve 
these reported differences in dynein behavior, with 
particular attention paid to possible species variation, 
protein preparation, and assay conditions. 

Summary 

To understand the mechanism of dynein motility, we 
need to take into consideration three layers of coordi¬ 
nation: intra-domain coordination within a dynein 
molecule, intramolecular coordination among multiple 
motor domains within each dynein molecule, and 
intermolecular coordination among multiple dynein 
complexes such as dynein arms in axonemes. In this 
entry, we focus on the first two. 


The large size and the difficulty of expressing and 
purifying mutants of dynein had hampered progress in 
understanding of these coordination mechanisms. 
Recently, however. X-ray crystallography, electron 
microscopy, and single-molecule nanometry have 
shed light on key unsolved questions. 

The crystal structures of the motor domains of cyto¬ 
plasmic dynein and the microtubule-binding domain 
show the presence of buttress/strut, an unusual asym¬ 
metric arrangement of ATPase domains in the ring- 
shaped motor domain, the structure of the microtubule¬ 
binding interface, and its relation with the stalk 
coiled-coil. The arrangement of mechanical components 
on the AAA-i- ring now provides clues on how ATP- 
driven conformational changes might be transmitted 
across the motor domain and how an unexpected inter¬ 
action works between the stalk and the buttress/strut. 

For processive movement of a dimeric motor, such 
as kinesin and myosin V, two motor domains would 
show the gating, in which the leading motor domain is 
not detached from the track while the trailing motor 
domain is dissociated from the track. The similar 
mechanism has been proposed in dynein motility. 
However, recent studies on cytoplasmic dyneins have 
challenged this strict gating system: Dynein can move 
processively without strict communications between 
the motor domains. 

Nevertheless, detailed mechanisms of coordination 
in and between motor domains remain poorly under¬ 
stood compared to other cytoskeletal motors such as 
kinesin and myosin. It still awaits crystals of the motor 
domain in different nucleotide states with higher reso¬ 
lutions and further analysis using single-molecule 
techniques such as optical trap combined with geneti¬ 
cally engineered dynein molecules. 

Notes 

The crystal structure of the 380-kDa motor domain of 
Dictyostelium discoideum cytoplasmic dynein at 2.8 A 
resolution was reported. Clearly visualized features at 
this resolution are that the coordination of ADP in each 
of four distinct nucleotide-binding sites in the AAA-i- 
modules, a newly identified interaction interface 
between the ring and mechanical linker, and junctional 
structures between the ring and the stalk, all of which 
should be critical for the mechanism of dynein motility. 
A long-range allosteric communication pathway was 
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identified between the primary ATPase site and the 
MTBD and provides a framework for understanding 
the mechanism of dynein-based motility (Kon et al. 
2012 ). 

A 3.3-A crystal structure of the motor domain of 
Saccharomyces cerevisiae cytoplasmic dynein was 
reported, which was crystallized in the absence of 
nucleotides. The interaction between the linker and 
the ring is investigated and they show the linker is 
docked to a conserved site on AAA5. They soak crys¬ 
tals in ATP, ADP and AMP-PNP and show the differ¬ 
ences between the abilities of nucleotides to bind 
individual AAA-i- domains (Schmidt et al. 2012). 

Cross-References 

► Atomic Force Microscopy 

► ATPase: Overview 

► ATP-Driven Mechanical Work Performed by 
Molecular Motors 

► Coiled Coils 

► Dynein Family Classification 

► Electron Cryo-Microscopy of Molecular Machines 

► Electron Microscopy of Motor Structure and 
Possible Mechanisms 

► Eluorescence Imaging with One Nanometer 
Accuracy 

► Kinesin: Fundamental Properties and Structure 

► Mechanism of Myosin Work and Motility 

► Molecular Motors 

► Optical Tweezers 

► Processivity 

► Protein Dynamics and Allostery 

► Single Particle Tracking 

► Single Protein Production (SPP) System in Bacteria 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Site-Specific Protein Labeling for Single-Molecule 
Eluorescence Studies 

► Total Internal Reflection Eluorescence Microscopy 
for Single-Molecule Studies 
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Synonyms 

Electron Cryo-microscopy; Electron microscopy; 
Molecular motors; Single particle electron microscopy 

Definition 

Molecular machine: A large complex made up of mul¬ 
tiple copies of proteins and/or nucleic acids with 
a molecular size ranging from hundreds of kDa to 
tens of MDa. Examples of molecular machines are 
virus, ribosome, chaperonin, etc. 
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Electron Cryo-microscopy of Molecular Machines 


Electron cryo-microscopy (Cryo-EM): A structural 
method uses electron microscope to record images of 
frozen, hydrated specimens and uses computational 
procedures to reconstruct its three-dimensional 
density. 

Single particle cryo-EM: A special application of 
cryo-EM where the specimen is a noncrystalline type 
embedded in a thin layer of vitreous ice as individual 
particles in random orientations. 

Introduction 

Single-particle electron cryo-microscopy (cryo-EM) is 
an emerging structural biophysics technique for imag¬ 
ing individual molecular machines. Advantages of the 
cryo-EM imaging technique are that it can be used to 
determine the macromolecular 3-D structure without 
invoking crystallography and in physiological or solu¬ 
tion conditions (Glaeser et al. 2007). The highest res¬ 
olution cryo-EM structures have been determined up to 
^4 A (e.g., Zhang et al. 2008, 2010; Liu et al. 2010). 

Specimen Preparation and Data Collection 

The single particles are preserved in vitreous ice by 
rapid plunging into a cryogen, such as liquid ethane. 
Images of the frozen, hydrated single particles are 
collected in a 100-300 kV electron microscope with 
the specimen kept at or below liquid nitrogen temper¬ 
ature. The images are generally recorded on a large 
format CCD camera. The dose used in the data collec¬ 
tion is low enough to minimize radiation damage to the 
specimen. Therefore, the contrast in each particle 
image is relatively low. Subsequent data processing 
is required to enhance its feature visibility. 

Data Processing 

The quality of the recorded digital image is assessed by 
the power spectrum of the boxed-out particle images in 
each CCD frame, which has the characteristic pattern of 
the contrast transfer function (CTF) of the image modu¬ 
lated by a combined specimen and instrumental damping 
function as a function of frequency. Each particle image 
has to be deconvoluted for the CTF effects (Ludtke et al. 
1999). Tens of thousands of single particles (the 2-D 


projections of the molecular machines) are boxed-out 
from hundreds to thousands of CCD frames and compu¬ 
tationally classihed and aligned to the correct orientation 
by following iterative procedures. 

The orientation of each particle image can be 
determined by reference to a set of projection images 
of a template model which can be featureless initially 
(e.g., an ellipsoidal shape model for GroEL (Ludtke 
et al. 1999)). The particle images in the same class (i.e., 
orientation) are averaged. These class averages will 
then be used to generate the next map. This is an 
iterative process that gradually improves the orienta¬ 
tion determination and alignment accuracy (Fig. 1). In 
each iterative cycle, the latest map will be used as 
a template to generate next set of projection images, 
and the number of projection images will gradually 
increase for finer sampling at increasing resolutions. 
The iteration would stop when there is no more 
substantial improvement in the maps in terms of 
resolutions and features in successive iterations. 
Generally, the initial map is noisy and requires filtering 
(Zhang et al. 2010; Rosenthal and Henderson 2003). 
The “optimally filtered” map can then be low-pass 
filtered to the point at which visible noise is not 
apparent. 

The number of particle images needed for the 
reconstruction depends on the preservation of struc¬ 
tural integrity of the particles during the freezing step; 
the randomness of the particle orientation of the fro¬ 
zen, hydrated sample; the quality of the images (i.e., 
being free from drift and charging); the inherent sym¬ 
metry of the particle; and the software used for data 
processing. The 4.3 A map of an Archaeal chaperonin 
(Mm-cpn), generated with D8 symmetry, included 
^30,000 particles from more than 600 4kx4k pixels 
CCD frames with an angular sampling of ^ 1.4° (Zhang 
et al. 2010). 

Map Resolution Assessment 

The resolution criteria for single-particle analysis are 
usually based on calculation of the correlation between 
two reconstructions from two half dataset in the Fou¬ 
rier space (the Fourier Shell Correlation) at a threshold 
between 0.14 and 0.5 (Rosenthal and Henderson 
2003). An important quality judgment of the map can 
be based on the structural features revealed at different 
resolutions (Fig. 2). For instance, secondary structure 
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Fig. 1 (a) CCD image of a closed-state chaperonin Mm-cpn 
frozen in vitreous ice at 112,000X detector magnification; (b) 
Selected reference projections {left), corresponding class aver¬ 
ages (middle), and a selection of corresponding un-aligned noisy 
raw particles (right). One can see, by aligning and averaging the 


A simplified flow-diagram outlining the reconstruction process 
in EMAN (Ludtke et al. 1999; Baker et al. 2010); (d) Density 
map of Mm-cpn in the closed state; (e) Ca model of Mm-cpn in 
the closed state 


elements like long alpha helices and large beta sheets 
would be recognizable visually or computationally in 
density map determined at subnanometer resolution 
(6-9 A) (Baker et al. 2010). If a crystal structure of 
a molecular component is known, one can match the 
density with the crystal structure (Zhang et al. 2008). 

Model Building 

Usually, a large molecular machine is made up of 
multiple protein subunit types with perhaps 


a different number of copies for each protein type. 
The initial step of the map analysis is the segmentation 
of the molecular boundary for each protein subunit 
followed by the estimate of its secondary structure 
elements. If any protein subunit has a crystal structure 
of itself or a homologue, an atomic model can be 
rigidly or flexibly fitted to the density by a variety of 
software. If there is no a priori crystal stmcture of the 
protein subunit, a program such as Gorgon can be used 
to build a de novo model of the Ca polypeptide back¬ 
bone (Baker et al. 2010) and to refine with standard 
modeling tools (e.g., Schroder et al. 2007). 
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Electron Cryo-microscopy of Molecular Machines, 
Fig. 2 (a) Simulated map by low-pass filtering the atomic 
structure of Mm-cpn single subunit to 30 A, 15 A, 8 A, and 4 
A resolution. At 30 A resolution, only the envelope of the 
molecule is discemable. At 15 A resolution, the domain 


architecture can be revealed. At 8 A resolution, the a-helices 
can be clearly seen. At 4 A resolution, P-strands in the map can 
be separated and some side-chain densities are visualized, (b) 
Examples of side-chain densities in the actual experimental Mm- 
cpn cryo-EM map with the fitted model 


Cryo-EM Structures of Molecular Machines 

The above technique has been applied to determine 
molecular machines such as virus and protein fold¬ 
ing machine at near-atomic resolution. This 
approach has been extended to study the structures 
not only in one but multiple functional states. For 
instance, nearly rocking motion in the single sub¬ 
units of the 16-subunit Archaeal chaperonin (Mm- 
Cpn) with two back-to-back rings is revealed from 
the cryo-EM structures in response to the binding 
and hydrolysis of ATP resulting in a positive coop¬ 
erative phenomenon of subunits in one ring (Zhang 
et al. 2011). A very complex tertiary structural 
change in multiple parts of the protein subunits of 
the coat protein of P22 bacteriophage is clearly 
delineated in the procapsid and the mature phage 
structures that lead to expansion of the capsid parti¬ 
cle, packaging of the viral genome, and release of the 
scaffolding proteins (Chen et al. 201 1). 


Summary 

Single-particle electron cryo-microscopy (cryo-EM) 
has emerged as a powerful methodology in structural 
biology. Through advances in instrumentation and 
computational methods, single-particle cryo-EM 
study can now yield near-atomic resolution structures 
of large molecular machines under near-physiological 
condition. This resolution is sufficient to resolve 
Ca backbones and some side-chain densities of the 
protein components. Such cryo-EM structures have 
yielded insightful mechanisms of the organizational 
principles of the multi-subunits machines and the 
structural dynamics in their biological functions. 

Cross-References 

► Electron Microscopy 

► Protein Folding 
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► Protein Structure Comparison Methods 

► Protein Structure Prediction 

► Structural Genomics 

► Statistical Analysis and Prediction of Protein- 
Protein Interactions and Binding Sites 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 

► X-Ray Scattering of Lipid Membranes 
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Electron Crystallography of 2D Crystals 
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Synonyms 

Aquaporin (AQP); Electron microscopy; Two- 
dimensional crystals (2D crystals) 

Definitions 

Electron crystallography of 2D crystals: A procedure 
where electron diffraction data and electron micro¬ 
scope images taken from 2D crystals are used to deter¬ 
mine the structure of biological macromolecules. 

Introduction 

Electron crystallography of 2D crystals is a method to 
determine protein structure at high resolution using the 
electron microscope. This technique was originally 
developed by Henderson and Unwin in their structural 
studies of purple membrane (Henderson and Unwin 
1975). They obtained the density map of bacteriorho- 
dopsin (bR) at 7-A resolution, showing the first struc¬ 
ture of a membrane protein which consists of seven 
membrane-spanning a-helices. Henderson and co¬ 
workers improved their specimen preparation and 
data processing methods, and finally presented the 
first atomic model of bR by electron crystallography 
(Henderson et al. 1990). Initially, this method was 
applied for studying naturally formed 2D crystals, but 
it became a more general methodology when the 
atomic structure of plant light-harvesting complex II 
was solved using reconstituted 2D crystals 
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Electron Crystallography of 2D Crystals 
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Electron Crystallography of 2D Crystals, Fig. 1 Workflow 
of electron crystallography of 2D crystals 


(Kuhlbrandt et al. 1994). Since then, five atomic struc¬ 
tures of proteins have been solved using electron crys¬ 
tallography of 2D crystals, including one soluble 
protein. Currently, electron crystallography of 2D 
crystals is mainly used for studying the structure of 
membrane proteins because 2D crystals are suitable for 
maintaining membrane proteins in lipid sheets. 
Electron crystallography of 2D crystals consists of 
many steps to go from purified membrane proteins to 
a three-dimensional structure (Fig. 1). Individual steps 
are described below allowing with the advantages and 
difficulties of this method. 


2D Crystallization 

Except for the case of the naturally formed 2D crystal 
of purple membrane, it is necessary to produce 
a sufficiently large 2D crystal to determine the protein 
structure at high resolution using electron crystallog¬ 
raphy. 2D crystallization for purified membrane 


proteins is carried out as follows: Detergent- 
solubilized membrane proteins are mixed with deter¬ 
gent-solubilized lipids, and the mixture is dialyzed 
against the detergent-free buffer solution (Kuhlbrandt 
1992). Gradually decreasing the concentration of 
detergent, membrane proteins start to fuse with the 
lipid micelles when it reaches to the critical micelle 
concentration (cmc). Continuing the dialysis, the flu¬ 
idity of the lipid is reduced causing membrane proteins 
to form a crystal array in their proteoliposomes. Tem¬ 
perature of dialysis or concentration of detergent in the 
dialysis buffer can control the speed of dialysis. In 
addition to micro-dialysis, bio-beads are also used to 
remove detergents. 

Important factors in forming 2D crystals are the 
selections of lipid and detergent, the lipid-protein 
ratio (LPR), and the content and pH of dialysis buffer 
solution. Difficulties to defining the factors to make 2D 
crystals come from the low reproducibility of 2D crys¬ 
tallization, which can derive from variability of protein 
preparation, a difference of batches in lipid and deter¬ 
gent used, and a small difference of LPR caused by 
manual pipetting. One difference from X-ray crystal¬ 
lography is that a device capable of automatically 
screening the crystals has not been developed for elec¬ 
tron crystallography. Some groups are trying to 
develop a robot to screen 2D crystals, but it has yet to 
become general. 

2D crystal takes several shapes: vesicle, tube, and 
sheet. Currently, the parameters determining the crys¬ 
tal types are unknown. Vesicle crystals are used for 
functional assays, but are not ideal for structural stud¬ 
ies because they have a tendency to be folded on the 
EM grid when they are absorbed on the carbon film, 
causing a reduction in useful areas for data collection 
and image analysis. In addition, electron diffraction 
patterns and Fourier spots from top and bottom layers 
in the vesicle are overlapped, causing difficulties in 
data analysis. Tubular crystals include all the projec¬ 
tions of a membrane protein from different orienta¬ 
tions, and thus it is not necessary to tilt the specimen. 
However, it is practically impossible to collect high- 
resolution electron diffraction patterns from these 
tubular crystals, as the projection area is generally 
too small. 

Crystal sheets are suitable for high-resolution elec¬ 
tron crystallography because well-ordered large crys¬ 
tal areas are necessary to collect high-resolution 
images and electron diffraction patterns. In the case 
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of bR, fusion of natural small crystals produces large 
crystal sheets (>3 pm in diameter). Stacking is 
a common problem with crystal sheets. If the crystal 
is stacked in the proper orientation between the layers 
like the double-layered crystal of AQPO, it is possible 
to directly reconstruct the structure of the protein 
(Gonen et al. 2005). However, it makes it more diffi¬ 
cult to determine the structure if each layer is not 
ordered with respect to each other. In the case of the 
double-layered AQP4 crystals, in which the two layers 
were not ordered with respect to each other, electron 
diffraction patterns and images were taken at the same 
area to overcome this problem (Hiroaki et al. 2006). 

Specimen Preparation 

There are two ways to prepare the 2D crystal specimen 
to collect high-resolution data in the electron micro¬ 
scope: vitrification and sugar embedding. In vitrifica¬ 
tion, an aliquot of 2D crystals applied on carbon film 
are plunged into liquid ethane, which has been 
precooled by liquid nitrogen. One advantage of vitrifi¬ 
cation is that the 2D crystal is softly attached on the 
carbon film, and large domains protruding from the 
lipid bilayer are preserved. However, 2D crystals 
loosely adsorbed on carbon film often cause 
a wrinkling of the 2D crystals. On the other hand, the 
sugar embedding method developed by Unwin and 
Henderson (1975) keeps the 2D crystal more flat by 
firmly sticking the crystals onto carbon film. This 
specimen preparation method also makes it easier to 
collect data from high tilted specimens. However, this 
method can also lead to extremely dried specimens and 
damage the 2D crystals. Furthermore, large domains 
protruding from the lipid bilayer are also deformed. 
Currently, glucose and trehalose are used for sugar 
embedding, and combined method with vitrification 
and sugar embedding is employed. In the 2D crystal 
of plant light-harvesting complex II, tannic acid was 
also used as an embedding material. 

Carbon films supporting 2D crystals must be atom¬ 
ically flat to collect high-resolution data from tilted 
specimen. If the crystals are not completely sustained 
as flat, it causes diffraction spots perpendicular to the 
tilt axis to be diffused by increasing the resolution. 
Although several methods to make flat carbon films 
for electron crystallography have been introduced, 
methods to make atomically flat carbon films are still 


under development. The EM grid is also involved in 
keeping 2D crystals flat. Standard EM grids are made 
of copper, which has a different thermal expansion 
coefficient from that of the carbon film. When the 
grid is cooled, the carbon film forms a wrinkle because 
copper shrinks more than the carbon film, and this is 
known as cryo-crinkling. To overcome this problem, 
molybdenum grids are used because molybdenum has 
a similar thermal expansion coefficient to carbon 
which prevents it from folding. 

Data Collection 

In electron crystallography of 2D crystals, two kinds of 
data, high-resolution images and electron diffraction 
patterns, are separately collected. Although atomic 
structures can be obtained only from images 
(Miyazawa et al. 2003; Zhou 2008), electron diffrac¬ 
tion patterns give more accurate amplitude informa¬ 
tion, and are easy to collect because it is not as 
sensitive to specimen-induced movements caused by 
specimen charging. Amplitude is thus determined from 
electron diffraction patterns, while phase is extracted 
from high-resolution images. 

Beam damage is a critical problem when collecting 
data from biological specimens. A low-dose system can 
minimize the specimen damage. This system consists of 
three optical modes: search, focus, and photo. 2D crys¬ 
tals are found by using search mode with a weak elec¬ 
tron beam. Then, the image is focused near the area of 
the 2D crystal. The specimen is finally illuminated using 
a minimum dose (^10 electron/A^) for recording 
images and electron diffraction patterns. In addition, 
specimens are cooled to reduce beam damage. Although 
low temperature does not directly prevent beam damage 
from electron irradiation, it may allow atoms to stay in 
the same position even though chemical bonds have 
been broken. Generally, specimens are cooled at liquid 
nitrogen temperature or liquid helium temperature. 

The electron microscope only gives a projection of 
the specimen. To acquire information in the vertical 
direction of the specimen, we must collect images and 
electron diffraction patterns while tilting the specimen. 
It is difficult to collect images from tilted specimens 
because of mechanical and thermal instability of the 
side-entry specimen holder, in addition to beam- 
induced movement. The stability of the specimen 
stage is somewhat improved using a top-entry 
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specimen cold-stage (Fujiyoshi 1998). Employing spot 
scan imaging can also reduce specimen charging that 
causes beam-induced movement. This method 
involves in imaging the specimen with an array of 
small size spots. Another way to reduce beam-induced 
movement in the tilted specimen is to use the carbon 
sandwich method, where the specimen is preserved 
with symmetrical carbon support films. 

Data Processing 

Data from electron crystallography has historically been 
processed using the MRC software suite since it was 
developed for analyzing purple membrane, bR 2D crys¬ 
tals. The images recorded on photographic films are 
digitized, indexed by crystal lattice, unbent, and 
corrected for the contrast transfer function (CTF). 
Phase on each diffraction spot is extracted from Fourier 
transform of the unbent and CTF-corrected image. On 
the other hand, electron diffraction patterns can directly 
be recorded on a CCD, and the background is subtracted 
after indexing by crystal lattice and masking out the 
beam stopper. Amplitude is determined on each diffrac¬ 
tion spot as an integral of the spot area after removing 
the local background noise. Finally, phases and ampli¬ 
tudes are collected from tilted and untilted data and 
combined to reconstitute the reciprocal lattice lines, in 
which the thickness of the 2D crystal is estimated. The 
density map is calculated by inverse Fourier transform 
of the 3D data set of phases and amplitudes. 
A molecular model is built in the density map and 
refined using X-ray crystallography software. 

The algorithm in the MRC programs is so powerful 
that it has released many atomic structures. However, 
the data processing is tedious and each program is not 
so user-friendly. The situation was changed by the 
introduction of GUI software, 2dx. 2dx provides 
a user-friendly interface for the MRC programs and 
allows for the automation of image processing (Gipson 
et al. 2007). The current trend is to apply the methods 
of single particle EM to electron crystallography. Max¬ 
imum-likelihood approach improves the process in 
less-ordered 2D crystals. To quickly determine struc¬ 
tures by electron crystallography the indirect phasing 
method of X-ray crystallography has also been intro¬ 
duced. Molecular replacement method was used to 
successfully determine the structures of AQPO and 
APQ4 (Gonen et al. 2005; Hiroaki et al. 2006). 


Outlook 

Electron crystallography was the only method to study 
protein structures at near-atomic resolution using the 
electron microscope until recently when this was 
achieved using single particle images (Zhou 2008). 
However, electron crystallography is still a powerful 
tool to study the structure of membrane proteins, which 
are stable in lipid bilayers and are generally of large 
molecular weight; thus, it is difficult to study by X-ray 
crystallography and nuclear magnetic resonance 
(NMR) spectroscopy. Electron crystallography of 2D 
crystals is a patient work because there are so many 
steps as mentioned above. However, recent technical 
advantages reduce these hurdles. Protein expression 
systems using insect cells make it possible to produce 
enough of rare membrane proteins. Crystallization 
robots and automatic screening systems will speed 
2D crystallization up and make it more reproducible. 
Improvements in electron microscopy: spot scan imag¬ 
ing and carbon sandwich method reduce a beam- 
induced movement at tilted specimens. In addition to 
an introduction of the 2dx GUI software, data 
processing techniques in single particle EM and 
X-ray crystallography contribute to analyze smaller 
and less-ordered 2D crystals and extend the map reso¬ 
lution. This entry is a brief summary of recent reviews 
(Raunser and Walz 2009; Schenk et al. 2010; Hite et al. 
2007). More detailed information will be provided in 
these papers. 
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Synonyms 

Electron cryo-microscopy; Electron crystallography 


Definition 

Electron diffraction refers to the coherent scattering of 
high-energy electrons by proteins to generate interfer¬ 
ence patterns that can then be focused to form an 
image. 

Introduction 

In principle, electron diffraction should provide atomic 
resolution information on the three-dimensional struc¬ 
ture of proteins. Indeed the 100 keV electrons used in 


a standard electron microscope have a wavelength less 
than 0.004 nm. Electrons interact much more strongly 
with matter than do X-rays, with the scattering cross 
section for electrons by biological materials being 
some 10,000 times greater. This means that very 
small and thin specimens can be studied. With elec¬ 
trons the investigator has the added advantage that 
electron beams can be focused with magnetic lenses 
to form a high-resolution image. Image data can be 
used to retrieve the phases from ordered specimens by 
computer processing methods (Misell 1971; Glaeser 
et al. 2007). These phase data are generally more 
accurate than those obtained from X-ray diffraction 
(Unwin and Henderson 1975). In principle, electron 
diffraction can be used to study any thin array of 
proteins but in practice its most powerful application 
has been in the structure determination of membrane 
proteins which can often be induced to form crystals 
just one molecule in thickness - two dimensional (2D) 
crystals. 

Early History 

Perrier and Murray were possibly the first workers to 
successfully record small-angle electron diffraction 
patterns from biological molecules (Perrier and 
Murray 1966). Subsequent work made it evident that 
there was a limit to the number of electrons that these 
types of molecules could be exposed to before 
high-resolution order was lost as a consequence of 
radiation damage (Glaeser 1971). Biological mole¬ 
cules are also prone to dehydration and disorder by 
the high vacuum that has to be maintained in the 
electron microscope. Despite this, Matricardi et al. 
(1972) were able to obtain large-angle electron diffrac¬ 
tion patterns from hydrated crystals of catalase. 
However, this method of specimen preservation 
has not proved popular, due to the many practical 
difficulties involved. 

At about the same time that the earliest electron 
diffraction experiments were being performed, 
the possibilities of phase determination from images 
were being investigated. Hoppe et al. (1968) promoted 
the idea of the electron microscope as a phase¬ 
determining diffractometer. Theory, founded on the 
language of the X-ray crystallographer was developed 
with the anticipation that such an approach could be 
used to obtain three-dimensional (3D) structural 
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information from tilted specimens. The first practical 
use of the electron microscope for 3D structure deter¬ 
mination of a biological macromolecule was described 
by DeRosier and King (1968). The theory of Fourier 
decomposition of an image (originally performed by 
diffraction of laser light (King and Berger 1964)) was 
utilized in the computation of a 3D reconstruction of 
the helical “tail” bacteriophage. 

A major advance in electron crystallography 
of biological molecules was made by Unwin and 
Henderson (Unwin and Henderson 1975; Henderson 
and Unwin 1975) who investigated the crystalline pur¬ 
ple membrane from Halohacterium salinarum. These 
authors used low doses of electrons (to minimize the 
effects of radiation damage) to produce images and 
electron diffraction patterns. Crystalline order was 
maintained by replacing water molecules with the 
less volatile glucose. Electron diffraction showed that 
the crystalline order was preserved to 0.3 nm resolu¬ 
tion. Adequate contrast was provided in images from 
the differing densities of protein and membrane lipids 
in the crystals. Assuming the crystals to be weak phase 
objects (see below), spatial averaging techniques were 
used to compute crystallographic phases from images. 
The amplitudes were obtained more accurately from 
electron diffraction patterns as these were unaffected 
by the various microscope modifying functions. By 
merging data from different specimen tilt angles, 
a 3D structure was determined to 0.7 nm resolution in 
the membrane plane and to 1.4 nm perpendicular to the 
plane. This approach involving image averaging and 
sometimes electron diffraction forms the essential 
basis of all current protein electron crystallography. 

Theory of High-Resolution Electron 
Microscopy 

Figure 1 shows a highly simplified and highly sche¬ 
matic view of the relation between sample, objective 
lens, diffraction plane, and image plane in an electron 
microscope. Most electrons fired at thin protein crys¬ 
tals pass straight through but a small number are 
perturbed. Elastic and inelastic scattering are the 
most important perturbations of electrons in the con¬ 
text of biological imaging. Any scattering that results 
in loss of electrons in the final image results in 
amplitude contrast; this is primarily a low to medium 
resolution effect (Spence 2008). For the purpose of 



Electron Diffraction, Fig. 1 Schematic cross section through 
an electron microscope showing how electrons are scattered 
elastically by the objective lens to form a diffraction pattern in 
the back focal plane (diffraction plane). Intensities in this plane 
can be recorded directly in the microscope (see Fig. 2a). 
Scattered electrons interfere constructively and destructively to 
generate an image in the image plane (see Fig. 2b). This image 
can also be recorded directly in the microscope 

higher resolution structure determination from 
periodic specimens, the most important effect is on 
electrons scattered coherently and elastically by 
atomic nuclei; these obey the Bragg relationship and 
give a diffraction pattern in the back-focal plane of the 
objective lens (Fig. 1). All elastically scattered elec¬ 
trons emerge with their wavelength unchanged but 
suffer an approximate ti/2 phase shift (Amos et al. 
1982). Electrons interacting with orbital electrons 
undergo inelastic collision and these electrons are 
poorly focused in the image giving rise to an incoher¬ 
ent background. In the diffraction pattern these 
electrons give rise to a background which is concen¬ 
trated in the forward direction (Eig. 2a). Eor atomic 
number Z, the ratio of total elastic (ae) to inelastic (ct,) 
scattering cross section behaves roughly as (Amos 
et al. 1982): 

ae/cji = Z/19 

Thus inelastic scattering predominates with biolog¬ 
ical molecules, causing extensive specimen damage. 

The Weak Phase Object Approximation 

Elastically scattered electrons passing through the 
objective aperture will generally be out of phase with 
the unscattered electrons. This will give rise to rein¬ 
forcement and subtraction of wave amplitudes leading 
to phase contrast. Elastic phase contrast is the domi¬ 
nant contrast mechanism at medium to high resolution. 
The simplest theory describing elastic phase contrast is 
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Electron Diffraction, Fig. 2 (a) Electron diffraction pattern 
recorded from a 2D crystal of the orthorhombic form of purple 
membrane. Individual diffractions spots can be seen out to about 
0.35 nm resolution and only contain intensity information. The 
dark region in the center corresponds to the inelastically 

the weak phase object approximation (WPO) (Spence 
2008). This is only applicable to very thin specimens 
with light atoms in which electrons are scattered on 
average only once; fortunately most 2D protein crys¬ 
tals approximate to weak phase objects. For 
a wavefield incident on the specimen having unit 
amplitude, the exit face complex wave amplitude is 
given by the WPO approximation: 

where (Pp(x,y) is the complex electric potential varia¬ 
tion across the specimen. By Fourier transformation 
the complex amplitude in the back focal plane of the 
objective lens then becomes: 

v|/d(M,v) = 8(0,0) -icjF[(Pp(jc,y)] 

We can observe the diffraction intensities in this 
plane (Figs. 1, 2a) which are given by: 

The wave emerging from the diffraction plane is 
modified by an aberration function and objective 
aperture function. The aberration function arises 
because the lenses and electron beam are not perfect 
and are affected by spherical aberration, astigmatism, 
chromatic aberration, incoherence, and electrical insta¬ 
bilities. Spherical aberration (CJ and defocusing (Af) 



scattered electrons, (b) Fourier transform computed from 
a similar 2D crystal to that in (a). Diffraction is only visible to 
about 0.7 nm resolution but the diffraction spots contain both 
phase and amplitude information 


cause a phase shift which is a function of the scattering 
angle, 0, where 



and X is wavelength. For a 2D crystal with unit cell 
lengths a and h, the transform is only sampled at the 
reciprocal lattice points {h,k) where 

/a^ + /h^ = + 

The amount of phase shift in radians can be 
expressed as: 

X(0) = 2n/X{MQ^/2 + Cs0V4) 

Considering only phase contrast at higher than 
about 2 nm resolution it can then be shown (Spence 
2008) that the intensity in the image plane (ignoring 
magnification) is given by: 

I(jr,y) « 1 -F 2cj(p,(-jr, -y) * F[P(m, v)sinx(M, v)] 

where P(m,v) = 1 across the diameter of the objective 
aperture and P(m,v) = 0 elsewhere and where the 
complex potential: 

CPp = (Pr + i(Pi 
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is made up of a real component and an imaginary 
component which is used to represent depletion 
of the elastic wavefield by inelastic scattering and 
elastic scattering to the periphery of the objective 
lens, sinx(lul) is called the phase contrast transfer func¬ 
tion (CTF) (Amos et al. 1982). The effect is such that 
the image intensity is a linear representation of the 
electrical potential of the sample but whose contrast 
is modified by a sinusoidally varying resolution and 
defocus-dependent factor that must be corrected for 
during image processing. From the image, after 
correcting for the CTF, we can then recover cp^ to 
reconstruct the structure of the sample. 

Determination of Projection Structures from 
2D Crystals 

In practice the signal-to-noise ratio of images of 
unstained protein samples is very low indeed and 
variations in arising from the protein crystal are 
usually invisible to the naked eye. This arises from 
the need to use very low doses of electrons to image 
the sample while avoiding structural damage. The 
result is that the “shot noise” (Spence 2008) from the 
collected electrons can be of the order of ten times 
greater than the signal of interest (Henderson and 
Unwin 1975). However, the underlying crystal “sig¬ 
nal” can be conveniently separated from the back¬ 
ground “noise” by a Fourier transformation of I(x,j). 
The Fourier components of the crystal lattice are them¬ 
selves arranged on a repeating lattice known as the 
reciprocal lattice. This Fourier transform is analogous 
to the diffraction pattern formed in the back focal plane 
(Fig. 1) and effectively we have computed the complex 
term v|/d which can be represented as discrete crystal¬ 
lographic Fourier components F(/7,^)e'“^*’** in 
a manner familiar to X-ray crystallographers. Thus 
from the image we can recover both the crystallo¬ 
graphic amplitudes F but also the crystallographic 
phases, a. Figure 2b shows a computed Fourier trans¬ 
form from a low dose image of a 2D protein crystal. 
Amplitude and phase can be extracted from the dis¬ 
crete “diffraction spots” whereas the background 
around the spots, arising from the image noise, can be 
discarded. A Fourier synthesis using the F(/i,^)e'“^*’*' 
terms determined from the diffraction spots results in 
a relatively noise-free representation of the 2D projec¬ 
tion of the crystal structure (potential) (Glaeser et al. 
2007). In practice, the F terms must be corrected for the 
effects of the CTF and they are also degraded by the 


effects of radiation damage, specimen movement, and 
charging (Bullough 1990; Glaeser et al. 2007). 

Determination of Three-Dimensional Structures 
from Tilted 2D Crystals 

For practical purposes a 2D crystal can be considered 
infinite in extent in the crystal plane, thus leading to 
discrete sampling of the Fourier transform at recipro¬ 
cal lattice points and “spots” in the diffraction pattern 
(Fig. 2a). However, in the third dimension perpendic¬ 
ular to the 2D crystal plane the crystal is finite in 
extent leading to a continuous Fourier transform and 
“lattice lines” with an {h,k) index but continuously 
varying amplitude and phase along z* (Fig. 3). The 
values of amplitude and phase along these lattice lines 
can be sampled experimentally by tilting the crystal with 
respect to the electron beam. Values between sample 
points can be estimated by interpolation and thus used 
to compute a Fourier synthesis in 3D (3D map). In 
practice, tilting beyond 60° is difficult and Fourier ampli¬ 
tudes degraded at high z* values such that a cone of data 
is missing from the 3D transform. This tends to blur 
features in the 3D map in the z direction (Henderson 
et al. 1990). Furthermore, in practice this technique is 
restricted to true 2D crystals, since very thin 3D crystals 
often with varying thickness give rise to lattice lines for 
which it is very difficult to interpolate the amplitude and 
phase values. 

Contemporary Electron Diffraction 

Since the determination of the structure of bacteriorho- 
dopsin at 0.7 nm resolution by Henderson and Unwin 
in the mid-1970s (Henderson and Unwin 1975; Unwin 
and Henderson 1975), there have been a number of 
spectacular technical developments such that in 
a small number of cases, resolutions have been achieved 
comparable to those familiar to X-ray crystallographers 
(Hite et al. 2007). Perhaps the three most important 
advances have been specimen freezing, use of field 
emission electron sources, and computational correction 
of crystal lattice distortions in images (Glaeser et al. 
2007). With the development of cryomicroscopy, the 
sample is maintained at liquid nitrogen or liquid helium 
temperatures either by embedding in vitreous ice or in 
another stain-free preservative such as glucose. Freez¬ 
ing in this way confers at least a fivefold increase in 
specimen lifetime meaning that much higher doses of 
electrons can be used and so enhancing the signal- 
to-noise ratio. Field emission gun (Spence 2008) 
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Electron Diffraction, Fig. 3 Cross section through reciprocal 
space for a thin 2D crystal. In the horizontal direction (crystal 
plane) the diffraction pattern is made up of discrete spots but 
these extend as continuous lines in the perpendicular z* direction 
(vertical blue lines). Amplitudes and phases can be sampled 

electron microscopes provide much more coherent illu¬ 
mination than conventional microscopes with heated 
tungsten sources such that the envelope of the CTF 
falls off less steeply at high resolution. Finally, the 
development of computation procedures for correcting 
crystal lattice distortions and tilt of the illumination 
beam has led to a dramatic increase in the level of detail 
that can be extracted from low dose images (Flenderson 
et al. 1990; Glaeser et al. 2007). All these technical 
developments were brought together by Henderson 
and coworkers who published the first 3D atomic 
model of a protein determined by electron crystallogra¬ 
phy (Henderson et al. 1990) (see also Fig. 4). 

Bacteriorhodopsin can be isolated in the form of 
natural 2D crystals which are ideally suited for elec¬ 
tron crystallography. However, most studies of pro¬ 
teins by electron crystallography have involved the 
growth of artificial 2D crystals from purified proteins; 
typically for membrane proteins this has required the 
protein to be extracted from the membrane with deter¬ 
gent and then reconstitution with lipid into a 2D array 
by removal of the detergent by dialysis (Glaeser et al. 
2007). The first example of this approach leading to an 
atomic resolution model was of plant light harvesting 
complex II by Kiihlbrandt et al. (1994). Spectacular 
success has been achieved in using this approach to 
elucidate the fine details of structure within the 
aquaporin family (Raunser and Walz 2009) with the 
structure of AQPO being determined to 0.19 nm 
resolution using electron diffraction and molecular 
replacement, sufficient to reveal even lipid and water 


along these lines when the crystal is untilted relative to the 
electron beam (green sample points) or tilted relative to the 
beam (blue sample points). Phases and amplitudes at discrete 
and regular z* intervals (horizontal blue lines) can be calculated 
by interpolation 



Electron Diffraction, Fig. 4 Representation of the polypeptide 
backbone of bacteriorhodopsin as determined by electron dif¬ 
fraction (Subramaniam and Henderson 2000). The figure was 
calculated using the UCSF Chimera package from the Resource 
for Biocomputing, Visualization, and Informatics at the Univer¬ 
sity of California, San Francisco (supported by NIH P41 
RR001081) (Pettersen et al. 2004) 

molecules within the crystals. A very powerful 
application of electron crystallography has been in 
time-resolved studies of protein conformational changes, 
for example, by rapid freeze-trapping of molecules in 
liquid ethane (reviewed by Hite et al. 2007). 
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Cross-References 

► Macromolecular Crystallography: Overview 

► Macromolecular Neutron Diffraction 
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Synonyms 

Electron transfer; Multi-step electron transfer 

Definition 

Electron hopping in biomolecules is a process where 
electrons migrate through molecules in a multi-step 
reaction. This enables long-distance electron transfer 
in biomolecules. 

Basic Characteristics 

Electrons can migrate in biopolymers like DNA and 
proteins over long distances (up to 30 nm). According 
to the ► electron transfer theory (Marcus and Sutin 
1985) a single-step electron transfer (ET) over these 
distances is too slow to occur because the rate 
decreases exponentially with the distance (Ar) between 
electron donor and electron acceptor (Eq. 1), where 
P is the decay factor that depends upon the bridging 
medium connecting donor and acceptor. 

^ET oc exp(-/lAr) (1) 

Long-distance ET becomes possible if biopolymers 
contain groups that can be oxidized or reduced during 
electron transfer. These groups act as relay stations 
(stepping stones) where the charge has a definite 
life time. The electron hops through the molecule in 
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a multi-step reaction; the rate of each hopping step 
follows the Marcus equation. The overall process can 
be mathematically described as a diffusion of the 
charge through the biomolecule (Bixon et al. 1999). 
In this hopping process one long and therefore slow ET 
step (Eq. 1) is replaced by a cascade of shorter and 
therefore faster steps. In the special case that all of 
these steps have similar rates and are reversible the 
overall rate decreases with where N is the number 
of the ET steps (Bixon et al. 1999). In a more general 
case the slowest hopping step becomes the rate deter¬ 
mining step. 

Electron Hopping in DNA 

In DNA, the purine bases guanine (G) and adenine 
(A) are the stepping stones for a positive charge. 
Guanine is the preferred relay station as it can be 
faster oxidized than adenine. Thus, the ET occurs 
between the guanines, a process that is called G- 
hopping. If the distance between the guanines, gener¬ 
ated by the intervening adenines, is longer than 1 nm 
also the intervening adenines can become oxidized 
during the ET reaction because the endothermic oxi¬ 
dation of the neighboring adenine by the guanine 
radical cation is faster than the thermo-neutral oxida¬ 
tion of the distant guanine (Giese 2004). In these 
DNA sequences also the adenines act as relay stations 
(A-hopping). Under oxidative conditions 
(► oxidative stress) a guanine base of DNA can 
become oxidized to a guanine radical cation, and ET 
through DNA starts. Driving forces for the ET process 
are the differences in ► redox potential of different 
DNA base sequences. Chemical reactions of these 
radical cations with water and/or oxygen lead to 
chemical modifications of the DNA bases causing 
mutations during replication (Giese 2002). 
A consequence of the electron hopping under oxida¬ 
tive stress is that DNA mutations can occur far away 
from the first site of oxidative attack. In contrast to 
oxidative conditions where preferentially the purine 
base guanine is oxidized and carries the (positive) 
charge, the reduction of DNA occurs at the pyrimi¬ 
dine bases, thymine (T) and cytosine (C). As the 
pyrimidine bases T and C have similar reduction 
potentials both bases are relay stations for the 


(negative) charge during the ET process. As 
a consequence the base sequence plays only a minor 
role in the long-distance ET through DNA (Behrens 
et al. 2004; Tainaka et al. 2010). 

Electron Hopping in Proteins 

Long-range ► electron transfer through proteins and 
enzymes is a fundamental reaction for all living 
organisms. In energy conversion processes like pho¬ 
tosynthesis and respiration (Cordes and Giese 2009), 
multiprotein assemblies control ET through the cell 
membrane over more than 4 nm. An example of long¬ 
distance ET in enzymes is ► ribonucleotide reductase 
(Stubbe et al. 2003) where a 3.4 nm electron transfer 
occurs by a stepwise, hopping mechanism. Relay 
stations for these processes are either >- electron 
transfer cofactors (Moser et al. 2006) or certain 
amino acids (Giese et al. 2009; Gray and Winkler 

2009) whose side chains have appropriate redox 
potentials for a hopping process. This condition is 
fulfilled for tyrosine, tryptophan, and histidine, 
which carry aromatic side chains. But also the sul¬ 
fur-containing aliphatic amino acids, cysteine and 
methionine, might act as relay amino acids (Giese 
et al. 2009). To mention some biological examples: 
In the enzyme, DNA photolyase (Brettel and Byrdin 

2010) tryptophan is the relay amino acid, and in 
ribonucleotide reductase (Stubbe et al. 2003), both 
tyrosine and tryptophan are the stepping stones of 
the long distance ET. If the oxidized side chains 
carry acidic protons, the ET is coupled with a proton 
transfer process (Dempsey et al. 2010; Kumar and 
Sevilla 2010). 

Cross-References 

► Electron Transfer Cofactors 

► Electron Transfer Theory 

► Electron Transfer Through Proteins 

► Gated Electron Transfer 

► Oxidative Stress 

► Proton-Coupled Electron Transfer 

► Redox Potential 

► Ribonucleotide Reductase 
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Electron Microscopy of Membrane Lipids 

Toyoshi Eujimoto and Yugo Fukazawa 
Department of Anatomy and Molecular Cell Biology, 
Nagoya University Graduate School of Medicine, 
Nagoya, Aichi, Japan 

Synonyms 

Lipids; Membrane Lipids 

Definition 

Electron microscopy is a method for examining mate¬ 
rials using an electron microscope. Electron micro¬ 
scopes use electrons instead of light, which requires 
that the objects under study be placed in a high 
vacuum. 

Membrane lipids are lipid molecules that are major 
structural components of biological membranes. They 
include glycerophospholipids, sphingolipids, and cho¬ 
lesterol (or ergosterol in plants). 

Basic Characteristics 

A lipid bilayer forms the basic structure of most 
biological membranes. The lipids in the membrane 
generally consist of phospholipids, sphingolipids, and 
cholesterol (or ergosterol in plants). The membrane 
exhibits a two-dimensional fluidity at ambient temper¬ 
ature, meaning that molecules diffuse laterally in the 
membrane plane. The distance that a molecule can 
diffuse in a certain time t can be calculated by the 
equation, — 4kt (where d is the distance and k is 
a diffusion coefficient). The diffusion coefficient for 
phospholipids is generally more than 1 pm7s, 
indicating that their two-dimensional distribution 
changes quickly. 

The quick diffusion of lipids in a membrane is 
a major obstacle when trying to determine their distri¬ 
bution in situ by electron microscopy. In conventional 
electron microscopic methods, biological samples are 
first treated with chemical fixatives to stabilize the 
structure, but most lipids do not react with the 
aldehydes commonly used as fixatives. In fact, lipids 
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in a cell membrane have been shown to remain mobile 
even after treatment with formaldehyde and/or glutar- 
aldehyde (Tanaka et al. 2011). Another fixative, 
osmium tetroxide, reacts with the double bonds of 
fatty acids, but molecules thus treated are not amenable 
to subsequent procedures such as labeling with probes. 
However, even if chemical reagents that can fix lipids 
without compromising detection are invented, the time 
needed for fixation will likely prevent capturing the 
instantaneous distribution of lipids in a membrane. 

In addition to their two-dimensional fluidity, asym¬ 
metry is another important feature of membrane lipids, 
since the composition of lipids differs between the two 
leaflets of a membrane. Methods that enable unambig¬ 
uous analysis of membrane asymmetry are thus 
preferable. 

Lipids that remain bound to crystallized mem¬ 
brane proteins are often observed using electron 
crystallography. Except for these cases, however, 
direct electron microscopic observation of mem¬ 
brane lipids’ structure is difficult. The distribution 
of membrane lipids can be observed microscopically 
using various probes and reagents that show specific 
binding. 

Basic Methodology 

Electron microscopic methods for observing mem¬ 
brane lipid distributions can be classified into the fol¬ 
lowing four categories (Eig. 1): 

(A) Use of reagents that bind specific lipids and pro¬ 
duce characteristic structures (Elias et al. 1979). 
The best-known example is filipin that binds to 
cholesterol. The resultant filipin-cholesterol 
complex is not observed by itself, but character¬ 
istic membrane deformation caused by the com¬ 
plex is observed by electron microscopy 
(Fig. lA). 

(B) Expression of protein probes and their detection 
by immunoelectron microscopy (van Rheenen 
et al. 2005). This method utilizes protein domains 
that have an affinity for specific membrane lipids 
such as phosphoinositides. These domains are 
expressed in live cells as a GFP-tagged protein 
and enable visualization of lipid distributions by 
fluorescence microscopy. Eor electron micros¬ 
copy, the distribution of the protein probes is 
examined using conventional techniques, so the 


distribution of target lipids is detected indirectly 
(Fig. IB). For this method as well as methods 
C and D, colloidal gold particles are used as 
markers for visualization. 

(C) Direct application of conventional 
immunoelectron microscopic methods (Downes 
et al. 2005). Cells after chemical fixation are 
frozen or embedded in resins prior to the prepara¬ 
tion of ultrathin sections that are 50-100 nm in 
thickness. Such sections are labeled by probes that 
bind to specific lipids (Fig. 1C). 

(D) Labeling of membranes that are stabilized by 
physical means (Fujita et al. 2010; Fairn 
et al. 201 1). The mobility of membrane molecules 
is stopped by quick freezing, and the molecular 
disposition at that instant is preserved by embed¬ 
ding the specimen in resins or by the formation of 
a thin scaffolding layer by vacuum evaporation 
(Fig. ID). 

Methods A through C are often used, but their 
limitations must be kept in mind when interpreting 
obtained results. In methods A and C, for example, it 
should be noted that lipids are not immobilized by 
chemical fixation, so they are prone to artificial distri¬ 
butional changes during subsequent sample prepara¬ 
tion procedures. In addition to two-dimensional 
redistribution in the plane of a membrane, lipids may 
also move from one leaflet to another (i.e., flip-flop). In 
method B, protein probes expressed in living cells may 
cause artifacts; that is, the distribution of the target 
lipid may be changed by the binding of the probe. 
Considering these problems, method D, which will be 
described in more detail in the next section, is the most 
promising for defining lipid distributions by electron 
microscopy. 

For all methods listed above, it should be noted that 
probes available for phospholipids and sphingolipids 
recognize the hydrophilic head group. The hydropho¬ 
bic tail region, especially the fatty acid composition, 
will vary depending on the cellular origin and 
conditions and thus is a very important factor for 
determining membrane properties. At present, 
however, there is no good method for analyzing the 
tail region at the electron microscopic level. 

Physical Fixation Methods 

Cellular samples are frozen quickly by slamming to 
a metal block cooled with liquid helium (metal 
contact method) (Heuser et al. 1979) or by spraying 
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Electron Microscopy of 
Membrane Lipids, 

Fig. 1 Electron microscopic 
methods for observing 
membrane lipid distributions. 
In methods A through C, 
specimens are treated with 
chemical fixatives, but the 
fixatives do not react with 
membrane lipids. In method 
D, membrane lipids are 
immobilized by quick freezing 
and their distribution is 
preserved either by embedding 
in resins or deposition of a thin 
carbon/platinum layer 
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Electron Microscopy of 
Membrane Lipids, 

Fig. 2 Quick freezing 
methods. In the metal contact 
method, a specimen is 
slammed onto a mirror surface 
of a cooled copper block. The 
freezing rate is > 10,000 K/s 
and the specimen becomes 
frozen within 0.1-1 ms. In the 
high-pressure freezing 
method, a specimen is frozen 
by liquid nitrogen sprayed 
simultaneously with an 
application of high pressure. 
The freezing rate is about 
200 K/s and the freezing takes 
about 200 ms 


Metal contact freezing 
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Frozen by slamming onto 
a pure copper block 
cooled by liquid helium 


High-pressure freezing 



Frozen by liquid Nz 
under 2,100 bar 


liquid nitrogen under a high pressure (high-pressure 
freezing method) (Moor 1987) (Fig. 2). Both methods 
allow samples to be frozen without ice crystal forma¬ 
tion, but the time needed for freezing is < 1 ms for the 
metal contact method and ^200 ms for the high- 
pressure freezing method. 

Frozen samples are processed using two different 
labeling methods: 


1. In freeze substitution, samples are treated with 
a uranyl acetate solution in methanol 
(e.g., at —90°C) and embedded in a resin at a low 
temperature (e.g., —45°C). Once the resin becomes 
solidihed, lipids are retained physically and 
ultrathin sections can be labeled by various probes 
(Fairn et al. 2011) (Fig. lD-1). An uncertainty in 
this methodology is that it is not known whether 
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lipids remain immobile during the uranyl acetate 
treatment in cold methanol. 

2. In freeze fracture, physical force applied to the fro¬ 
zen sample splits a membrane into two leaflets 
(Severs and Shotton 1995). The hydrophobic 
interface thus exposed is coated with a thin carbon/ 
platinum layer using vacuum evaporation at a low 
temperature (e.g., < — 100°C). The carbon/platinum 
layer is in direct contact with the hydrophobic tails of 
membrane lipids and thereby immobilizes them. The 
hydrophilic head portion of lipids can be accessed 
with various probes for labeling (Fujita et al. 2010) 
(Fig. lD-2). A problem with this method is that the 
labeling of lipids with small head groups is ineffi¬ 
cient, probably as a consequence of steric hindrance 
due to surrounding membrane molecules. 

Cross-References 

► Chemical Diversity of Lipids 

► Electron Microscopy: Classical Sample Preparation 

► Functional Roles of Lipids in Membranes 

► Glycerolipids: Chemistry 

► Glycosphingolipids 

► Lipid Domains 

► Lipid Lateral Diffusion 

► Membrane Fluidity 

► Sphingolipids and Gangliosides 
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Electron Microscopy of Motor Structure 
and Possible Mechanisms 

Tohru Minamino, Takayuki Kato, Tomoko Miyata 
and Keiichi Namba 

Graduate School of Frontier Biosciences, 

Osaka University, Osaka, Japan 

Synonyms 

Bacterial flagellar motor structure; Electron micros¬ 
copy; Molecular motors; Rotary motor 

Definition 

Structural analyses of substructures of the bacterial 
flagellum by the complementary use of electron 
cryomicroscopy and X-ray crystallography provide 
structural insights into the mechanisms of how they 
can exert their functions. 

Introduction 

Bacteria such as Escherichia coli and Salmonella 
enterica can swim in liquid environments by rotating 
flagella, which are powered by the proton motive force 
across the cytoplasmic membrane. The bacterial fla¬ 
gellum is a supramolecular complex made by about 30 
different proteins with copy numbers ranging from 
a few to a few tens of thousands. The flagellum consists 
of at least three parts: the basal body, the hook, and the 
hlament. The basal body, which works as 
a bidirectional rotary motor, is imbedded within the 
cell envelope, while the hook and filament, which 
function as a universal joint and a propeller 
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Electron Microscopy of Motor Structure and Possible 
Mechanisms, Fig. 1 Schematic diagrams of the bacterial 
flagellum. The flagellum consists of at least three parts: the 
basal body (reversible rotary motor), the hook (universal joint), 
and the filament (helical screw). OM outer membrane, PG 
peptidoglycan layer, CM cytoplasmic membrane 


respectively, extend outward to the cell exterior. Fla¬ 
gellar assembly begins with the basal body, followed 
by the hook and finally the filament. Most of the 
flagellar component proteins are translocated into the 
central channel of the growing flagellar structure by 
the flagellar type III protein export apparatus and 
assemble at the distal end of the structure with the 
help of a cap structure (Fig. 1) (Minamino and 
Namba 2004). Here, we describe our recent discover¬ 
ies on the structure and function of Salmonella 
flagellum. 

Basal Body 

The basal body consists of the MS-C ring (rotor), the 
rod (drive shaft), and the LP ring (molecular bushing) 
(Fig. 2). The MS ring is formed within the cytoplasmic 
membrane by self-assembly of 26 copies of a single 
protein, FliF. FliG, FliM, and FliN form the C ring on 
the cytoplasmic face of the MS ring. FliG directly 
binds to the MS ring and forms the FliG ring, which 
acts as a track on the rotor, along which the stator 
complexes align to generate torque. The FliG ring has 



Electron Microscopy of Motor Structure and Possible 
Mechanisms, Fig. 2 Three-dimensional (3D) structure of the 
basal body. Left panel', electron cryomicroscopic image of 
frozen-hydrated hook-basal body in the side view. Right panel'. 
3D density map of the basal body reconstructed from 
frozen-hydrated particle images 


26-fold rotational symmetry. FliM and FliN form 
a stable complex consisting of one copy of FliM and 
four copies of FliN, which form the majority of the 
C ring. The FliM-FliN 4 ring has 32- to 36-fold rota¬ 
tional symmetry (Minamino et al. 2008a). It has been 
proposed that a symmetry mismatch between the FliG 
and FliM-FliN 4 rings may be the basis of torque gen¬ 
eration, although how the symmetry mismatch is over¬ 
come remains unknown (Thomas et al. 1999). The rod 
consists of three proximal rod proteins, FlgB, FlgC, 
FlgF, and a distal rod protein, FlgG, and traverses the 
entire periplasmic space. Each of the proximal rod 
proteins is present in about 6 copies, and the distal 
rod protein in about 26 copies (Jones et al. 1990). 
The rod has a helical symmetry. Since there is also 
a symmetry mismatch between the rod and the MS 
ring, it has been proposed that FliE forms a MS ring/ 
rod junction zone to overcome the symmetry mismatch 
(Minamino et al. 2000). The P and L rings, which exist 
within the peptidoglycan layer and the outer mem¬ 
brane, respectively, consist of 26 copies of 
a periplasmic P ring protein, Flgl, and 26 copies of an 
outer membrane L ring protein, ElgH, respectively. 

Filament 

The filament grows as long as 15 pm by self-assembly 
of as many as 30,000 subunits of flagellin (EliC or 
EljB). The flagellin subunits are arranged on a tubular 
helical lattice. The basic helical line passes through all 
the lattice points and has approximately 11 subunits 
per two turns of the one-start helix. As a result, the 
filament is a tubular structure made of 11 
protofilaments, which are axially staggered by about 
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L-type R-type 



Electron Microscopy of Motor Structure and Possible 
Mechanisms, Fig. 3 Atomic models of the L-type (left 
panel) and R-type (right pane) straight filaments solved by 
electron cryomicroscopy and helical image analysis. Upper 
panel: End-on view. Lower panel: Side view from outside. The 
Ca backbone is color-coded from blue to red, going through the 
rainbow colors from the N- to the C-terminus 

a half subunit in their lateral interactions. A remarkable 
structural feature of the filament is that the chemically 
identical flagellin subunits produce supercoiled tubular 
structures that act as a helical propeller with 
a switchable helical handedness between left and 
right. The polymorphic supercoiling of the filament 
can be explained by a bistable protofilament model, 
in which the supercoiled forms can be produced by 
combinations of two distinct conformations and pack¬ 
ing interactions of the L- and R-type protofilaments 
(Macnab 1996). Complete atomic models of the L- and 
R-type straight filaments have been built by electron 
cryomicroscopy and helical image analysis (Fig. 3) 
(Yonekura et al. 2003; Maki-Yonekura et al. 2010). 
The flagellin subunit of the filament is divided into four 
domains: DO, Dl, D2, and D3. Domains DO and D1 
form the inner and outer tubes of the concentric dou¬ 
ble-tubular structure, respectively. The hydrophobic 
interactions between the DO domains make the fila¬ 
ment structure mechanically very stable in aqueous 
environments. The hydrophilic inner surface of the 
central channel of the filament is important for rapid 


and efficient transport of flagellin in a largely unfolded 
conformation with many hydrophobic residues 
exposed. Domains D2 and D3 form the outer part of 
flagellin in the filament, making relatively minor con¬ 
tributions to the filament structure and stability 
(Yonekura et al. 2003). Comparison of the L-type 
straight filament with the R-type revealed that domains 
D2 and D3 behave as rigid bodies, presumably due to 
their loose packing interactions in the filament. There¬ 
fore, it is likely that the structural changes between the 
L- and R-type filaments do not require much local 
deformation within these two domains. In contrast, 
the DO and Dl domains are tightly packed in the 
filament structure. It has been observed that the con¬ 
formation of the lower part of domain DO is distinct 
between the L- and R-type flagellin subunits, resulting 
in a hinge motion of the remaining three domains 
relative to domain DO, with a hinge at the spoke region 
between the DO and Dl domains. The hinge motion 
seems to be responsible for switching from the L-type 
to the R-type filament. Since the changes in the con¬ 
formations of the D1 domains and their packing inter¬ 
actions occur against the relatively stable 
conformation of the inner tube structure made of the 
Dl domains, the inner tube of the filament as a rigid 
core structure is important for polymorphic switching 
(Maki-Yonekura et al. 2010). 

Hook 

The hook is a highly curved tubular structure 
connecting the filament to the basal body. The hook 
has a bending flexibility, enabling it to work as 
a universal joint to smoothly transmit torque produced 
by the motor to the filament. The hook is composed of 
about 120 subunits of the hook protein FlgE, which are 
assembled with a helical symmetry. FlgE is composed 
of three domains, DO, Dl, and D2, that are arranged 
from the inner to the outer part of the tubular structure 
of the hook. Axial packing of the subunits in the outer 
part of the tube made of domains Dl and D2 is rela¬ 
tively loose, presumably resulting in the bending flex¬ 
ibility of the hook structure (Eig. 4) (Samatey et al. 
2004; Fuji! et al. 2009). The N- and C-terminal 
a-helices form a coiled coil in the inner core domain 
DO, which structurally and mechanically stabilizes the 
hook structure (Fujii et al. 2009). The a-helical coiled 
coil motif of the terminal regions is shared by all the 
flagellar axial proteins (Namba 2001). In contrast to 
the DO domains of flagellin, which are tightly packed 
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Electron Microscopy of Motor Structure and Possible 
Mechanisms, Fig. 4 Partial atomic model of the hook by the 
complementary use of X-ray crystallography and electron 
cryomicroscopy and comparison of the protofllament structures 
of the hook {right panel) and the filament (right panel). Only side 
view of a longitudinal section through the axis is shown. The Ca 
backbone is color-coded from blue to red, going through the 
rainbow colors from the N- to the C-terminus 

in all directions, however, the orientations and axial 
packing interactions of the terminal two a-helices of 
FlgE provide enough space for axial compression and 
extension for bending flexibility without impairing the 
mechanical stability of the hook (Fig. 4) (Fujii et al. 
2009). 

The length of the hook is controlled at about 55 nm 
by the export specificity switching machinery, which 
consists of a molecular ruler, FliK, and an export 
switch, FlhB. When the hook has reached its mature 
length of 55 nm, the temporal association of the 
N-terminal ruler domain of FliK with the inner surface 
of the hook during FliK secretion allows the 
C-terminal domain of FliK to interact with FlhB, 
thereby terminating the export of proteins required 
for assembly of the rod and hook and initiating the 
export of proteins responsible for filament formation 
(Minamino et al. 2009). 

Capping Structure 

Most of the component proteins of the flagellar axial 
structure require capping proteins to assemble at the 
distal end of the growing structure. Three capping 
proteins - FlgJ, FlgD, and FliD - are responsible for 
the assembly of the rod, the hook and the filament, 
respectively. Electron cryomicroscopy and image 
analysis revealed that the EliD cap is made of 
a pentagonal plate domain as a lid and five axially 
extended leg-like domains that bind to the filament at 
the distal end, forming a cavity under the pentagonal 



Electron Microscopy of Motor Structure and Possible 
Mechanisms, Fig. 5 Structures of the filament cap and the 
cap-filament structure (lower pane). The cap complex, which is 
composed of five subunits of the filament capping protein FliD, 
has a plate domain and axially extended domains (upper panel). 
Five axially extended leg-like domains bind to the indentations 
formed by domains D1 of flagellin subunits at the distal end, 
forming a cavity under the cap plate as a folding chamber for 
newly exported flagellin molecules 

plate as a folding chamber for newly exported flagellin 
molecules. There is a symmetry mismatch between the 
helical subunit array of the hlament with 11 
protohlaments forming the tube and the pentameric 
annular structure of the EliD cap, and this symmetry 
mismatch is the basis of the assembly promotion mech¬ 
anism by the EliD cap (Eig. 5) (Yonekura et al. 2000). 

ElgD has two domains: a flexible N-terminal 
domain (FlgD^j) and a compactly folded C-terminal 
domain (FlgDc). Since ElgDj^ is capable of partially 
fulfilling the hook-capping function, it is plausible that 
FlgDiM may have flexible leg-like domains that bind to 
the hook tip in a similar way to the EliD cap. Recent 
genetic analysis of Salmonella ElgD has shown that 
FlgDc contributes to efficient hook polymerization, 
suggesting that FlgDc blocks the exit of newly 
exported FlgE monomers into the culture media, 
allowing ElgE to have more time to refold and assem¬ 
ble into the hook (Moriya et al. 2011). Interestingly, 
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when the second-site flgD mutations, which suppress 
their parental mutations, are mapped on the crystal 
structure of FlgDc from Xanthomonas campestris, 
which contains residues 84-221 (Kuo et al. 2008), 
they are localized on one side of the molecular surface 
of FlgDc, suggesting that FlgDc interacts with newly 
exported FlgE subunit through this surface and that the 
interaction between FlgDc and FlgE may contribute to 
the efficient and proper incorporation of the ElgE sub¬ 
units into the growing hook (Moriya et al. 201 1). 

Type III Protein Export Apparatus 

Eor the construction of the bacterial flagellum, most of 
the flagellar component proteins are recognized, 
unfolded, and translocated into the central channel 
and to the distal end of the growing flagellar structure 
by the flagellar type III protein export apparatus, which 
consists of three soluble proteins (EliH, Flil, FliJ) and 
six integral membrane proteins (FlhA, FlhB, FliO, 
Flip, FliQ, FliR). The export apparatus is assumed to 
be located within the central pore of the MS ring with 
a domain projecting into the cytoplasmic space within 
the C ring. The six membrane proteins form the export 
gate complex that converts proton motive force across 
the cytoplasmic membrane into the mechanical work 
required for protein transport. The three soluble pro¬ 
teins form the pilot complex to deliver export sub¬ 
strates and chaperone-substrate complexes to the 
export gate and help the initial entry of substrate into 
the narrow pore of the gate (Minamino et al. 2008b). 

Flil ATPase self-assembles into homo-hexamer to 
fully exert its ATPase activity, thereby facilitating 
flagellar type III protein export (Fig. 6a, left panel). 
Electron microscopic analyses of the ring formation by 
Elil and its catalytic mutant variants have revealed that 
ATP binding induces Elil hexamerization and that the 
release of ADP and Pi destabilizes the ring structure, 
suggesting that Elil couples ATP binding and hydroly¬ 
sis to its assembly-disassembly cycle to efficiently 
drive the flagellar protein export cycle (Kazetani 
et al. 2009). Recently, it has been shown that EliJ 
facilitates Elil ring formation by binding to the center 
of the ring (Eig. 6a, right panel) (Ibuki et al. 2011). 
Interestingly, the atomic structures of Elil and EliJ 
show extensive structural similarities to the a/(3 and y 
subunit, respectively, of FoEi-ATPsynthase, which 
utilizes PMF and ATP for ATP synthesis and proton 
pumping, respectively (Imada et al. 2007; Ibuki et al. 
2011). Electron cryomicroscopy and image analysis of 


a Flilg ring Flilg FliJ ring 



Electron Microscopy of Motor Structure and Possible 
Mechanisms, Fig. 6 FlilgFliJ ring model (a) Averaged 
end-on view images by electron cryomicroscopy of frozen- 
hydrated Flil {left} and FliJ-Flil {right) ring particles, (b) Side 
view of two opposite Flil subunits and FliJ in the Flil-FliJ ring 
model. FliJ penetrates into the central hole of the Flil hexamer 
ring 


the FlilgFliJ ring structure have shown that EliJ pene¬ 
trates into the central hole of the Elil ring in a way 
similar to an antiparallel a-helical coiled coil of the 
amino and carboxyl termini of the y subunit penetrat¬ 
ing into the central cavity of the a 3 p 3 ring of EqFi- 
ATPsynthase (Eig. 6b) (Ibuki et al. 2011). These 
results suggest a similar mechanism and an evolution¬ 
ary relationship between the flagellar type III export 
system and EoEi-ATPsynthase. 

Summary 

The bacterial flagellum is a biological nanomachine 
consisting of a reversible rotary motor, a universal 
joint and a helical screw. The structure and function of 
the flagellum including the type III protein export appa¬ 
ratus have been revealed by the complementary use of 
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X-ray crystallography and electron cryomicroscopy 
combined with biochemical, physicochemical, and 
genetic analyses. 

Cross-References 

► Bacterial Flagellar Motor Structure 

► Bacterial Flagellar Motor: Overview 

► Electron Microscopy 

► Fj-ATPase: Fundamental Properties and Structure 

► Sodium-Driven Flagellar Motor: Structure and 
Mechanisms 
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Electron Microscopy: Cryo-Preparation 

Bruno M. Humbel 

Electron Microscopy Facility, University of Lausanne, 
Lausanne, Switzerland 

Synonyms 

CEMOVIS; Cryo-electron Microscopy; Cryo-electron 
Microscopy of Vitreous Sections; Cryo-fixation; Cryo- 
immobilization; Cryo-substitution; Electron Cryo¬ 
microscopy; Freeze-Substitution 

Definitions 

Cryo-Fixation: Immediately arresting all cellular 
processes in a biological sample by fast freezing 
without inflicting ultrastructural changes. 
Freeze-Fracture: Fracturing a cryo-fixed sample and 
making a platinum replica by metal shadowing. 
Freeze-Substitution: Replacing the water (ice) of 
a cryo-fixed sample at sub-zero temperatures by an 
organic solvent that may contain chemical fixatives. 
Cryo-Electron microscopy: Imaging of frozen- 
hydrated cryo-sections with an electron microscope. 
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Introduction 

Biological electron microscopy has since ever the 
reputation of producing artifacts. To my opinion 
the main reason that exclusively electron microscopy 
is associated with artifacts and none of the other tech¬ 
niques used in Biology, not even light microscopy, is 
because an electron micrograph shows uncountable 
details and it has the resolution to make artifacts visible. 
But we want this to be a true image of reality. This 
desire let an uncountable number of electron microsco- 
pists to search the Holy Grail, the image of true nature 
until the finest detail. Of course any handling, even the 
sampling, changes the original living state and the true 
image can only be attained by approximation. The most 
important first step in the direction of artifact-free 
images is cryo-fixation. Already in 1960, Humberto 
Femandez-Moran showed us the way to go and with 
new methods and instrumentation the goal is being 
reached asymptotically. 

Cryo-Fixation 

Cryo-fixation is the first and decisive step. The biological 
specimen must be immobilized by freezing that the ice is 
vitrified in such a way that no ice crystals can form and 
redistribution of any cellular component is prevented. 
There are several methods and apparatuses for cryo- 
fixation: Propane-jet, metal mirror freezer, spray-freez¬ 
ing, plunging into a cryogen, and high-pressure freezing. 
Today, the most widely used techniques are plunge¬ 
freezing and high-pressure freezing. By plunge-freezing 
macromolecules are embedded into a thin layer of amor¬ 
phous ice for high-resolution stracture analysis, or thin 
cells for cryo-electron tomography. Larger samples, up 
to about 200 pm of thickness, can be vitrified by high- 
pressure freezing. Whereas thin samples can directly be 
imaged by cryo-electron microscopy, thicker samples 
must further be treated for analysis. 

Freeze-Fracture 

Kazushi Fujimoto established an interesting hybrid 
method, combination of cryo- and non-cryo-methods, to 
investigate the distribution of membrane proteins. Cryo- 
fixed samples are freeze-fractured and replicated with 
platinum and carbon. The replica is washed with sodium 


dodecyl sulfate (SDS). Thereafter proteins, trapped in the 
replica, can be labeled by immunocytochemistry. This 
method is preferentially used to study membrane protein 
localization or membrane-membrane interactions, as the 
fracture plane generally follows membranes. It can also 
be useful to localize lipid droplet-associated proteins that 
are very difficult to preserve otherwise. 

Freeze-Substitution 

Another hybrid method is freeze-substitution. After 
cryo-fixation the frozen samples are dehydrated at 
low temperatures, at around —90°C (183 K). The 
dehydration agents are acetone, methanol, or ethanol. 
Several other organic solvents like tetrahydrofuran, 
propane, and diethylether were used. It has been 
shown, that methanol is far more efficient in removing 
water than acetone, but Studer et al. (1995) suggested 
that the slower dehydration of acetone better preserves 
the ultrastructure of cartilage; therefore acetone again 
became the undisputed, favorite solvent. 

At —90°C the ice is dissolved and replaced by the 
organic solvent. If desired, chemical fixatives such as 
osmium tetroxide, glutaraldehyde, and uranyl acetate 
can be added to the solvent. The chemical fixatives 
cannot react at —90°C but only at higher temperatures, 
osmium tetroxide at —70°C, and glutaraldehyde at 
about —40°C. Chemical fixation is not essential for 
the preservation of the ultrastructure, but, if omitted, 
proteins may be lost during wet sectioning. 

After freeze-substitution and chemical fixation 
the samples can be embedded in methacrylate at low 
temperature for immunolabeling or at ambient tempera¬ 
ture in epoxy resin for morphology. Recently, cryo-fixed 
samples were chemically fixed during freeze-substitution 
and then prepared for Tokuyasu cryo-sectioning that is 
known for high-efficiency immunolabeling. 

Freeze-substitution is the method of choice for 
localization of macromolecules by affinity labeling 
with high-fidelity preservation of the ultrastructure or 
for cellular electron tomography of thick sections. For 
high-resolution electron microscopy at molecular 
level, however, native cryo-sectioning (CEMOVIS) 
and cryo-electron microscopy is primordial (see 
below). There are two main arguments against 
freeze-substitution: (a) Dehydration may cause col¬ 
lapse of fine structure (e.g.. Fig. lb), though the dehy¬ 
dration process at cryogenic temperatures is far more 




598 


Electron Microscopy: Cryo-Preparation 



Electron Microscopy: Cryo-Preparation, Fig. 1 Comparison 
of Freeze-substitution with CEMOVIS. Lynghya majuscula cells 
were cryo-fixed and either (a) directly cryo-sectioned 
(CEMOVIS) or (b) freeze-substituted in 0.25% glutaraldehyde 
and 0.5% osmium tetroxide in acetone or (c) in 2% osmium 
tetroxide in acetone and then embedded in Epon. In the cryo- 
section, the regular, equidistant pattern of the S-layer is well 


preserved (a, arrow), whereas it became very irregular after 
freeze-substitution (b). Further the fine structure of the fibrils 
in the capsule has collapsed (b, arrow) after freeze-substitution. 
On the other hand the pore complex (c, arrow) is only clearly 
visible after freeze-substitution and heavy metal staining 
(K. MacLellan, B.M. Humbel, J. Dubochet, 2007, unpublished). 
Bar represents 500 nm 


gentle; and (b) that we cannot see the macromolecules 
directly but only by their binding affinity to heavy 
metals. Some preliminary experiments (Fig. 2) give 
hope that also in resin sections macromolecules may 
directly be imaged with phase contrast electron 
microscopy, herewith eliminating the second argument 
against freeze-substitution. 

Cryo-Sectioning and Cryo-Electron 
Microscopy 

Today the ultimate biological resolution is achieved by 
cryo-fixation and imaging in the frozen-hydrated state 


by cryo-electron microscopy. Suspensions of proteins 
are frozen as a thin layer in holy carbon grids by 
plunge-freezing and analyzed by imaging at high res¬ 
olution in a cryo-electron microscope. Thus, the 3D 
structure of large biomolecules and biomolecular 
assemblies can be determined at close to atomic reso¬ 
lution. Tissue, however, has to be sectioned before 
analysis by cryo-electron microscopy. First attempts 
were already done in the 1980s but only in the last 
decade the method was rehned to perfection: Cryo- 
£lectron Microscopy Of VTtrified Sections 
(CEMOVIS). The samples are vitrihed by high- 
pressure freezing, sectioned at temperatures below 
the devitrification point (—140°C), and the frozen 
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Electron Microscopy: Cryo-Preparation, Fig. 2 Phase con¬ 
trast on resin-emhedded unstained liver cells. Liver was 
extracted from perfusion-fixed mice, and small pieces were 
dehydrated in methanol and embedded in LR white without 
addition of any (heavy) metal ions. Unstained sections were 
imaged in (a) a Jeol JEM3100FFC transmission electron 


microscope at —30 pm defocus or (b) in focus with a Zemike 
phase plate. Note the increase of contrast in resin sections 
induced by the phase plate, intracellular membranes, and ribo- 
somal endoplasmic reticulum (b, arrow) can clearly be identified 
(Y. Fukuda, N. Kamasawa, K. Nagayama, B.M. Flumbel, 2009, 
preliminary data). Bar represents 500 nm 
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Electron Microscopy: Cryo-Preparation, Fig. 3 Summary of the different pathways after cryo-fixation 


sections are imaged by cryo-electron microscopy. 
CEMOVIS is best suited to study the cellular architec¬ 
ture in its native environment hy cryoTEM and by 
cellular cryo-tomography, which, under optimal con¬ 
ditions, allows for high-resolution analysis of the struc¬ 
ture of macromolecules, for example, cadherins, in 
situ. With “template matching,” finding a given struc¬ 
ture of a macromolecule in the tomogram, the cellular 
distributions of these macromolecules can be 
cartographed in situ. 


CEMOVIS, unfortunately, is also not without draw¬ 
backs, the sections suffer from knife marks, compres¬ 
sion, and crevasses; therefore, the thickness of sections 
is limited to about 100 nm. To overcome this limit 
recently different groups started to experiment with 
cryo-sectioning using a focused ion beam (EIB) in 
a scanning electron microscope (EIB-SEM) replacing 
the diamond knife. 

The hrst approach was to thin cells frozen on an 
electron microscope grid to a thickness of 500 nm or 
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less in the ion beam to render them thin enough to be 
analyzed by cryo-electron tomography. It is best suited 
for single cells and the study of cell-substrate interac¬ 
tions. To investigate cells in tissue in a three- 
dimensional network, however, another approach is 
needed. 

A second approach is the attempt using FIB-cryo- 
sectioning of larger samples, which has been done by 
Hayles et al. (2010). Cells were fixed in copper tubes 
by high-pressure freezing. In a cryo-ultramicro- 
tome, a part of the copper tube was cut off leaving 
a wedge with the ice-embedded cells at the tip. In 
the FIB, a lamella, 300 - 500 nm thick, was cut at 
the tip. Then the sample was transferred to 
a modihed cryo-holder, and the lamella could be 
imaged in a cryo-TEM. More work has to be done 
to pinpoint the cell of interest in the tissue and to 
prepare this cell for investigation by cryo-electron 
tomography. 

Summary 

To achieve reliable preservation of biological struc¬ 
tures, cryo-fixation is a prerequisite. The type of ques¬ 
tion asked decides the follow-up procedure (3): 

• Fracture labeling to localize membrane- and fatty 
acid-associated proteins 

• Freeze-substitution and methacrylate embedding 
for localization of intracellular proteins by 
immunolabeling 

• Freeze-substitution and methacrylate or epoxy 
embedding for analysis of cellular architecture and 
the 3D organization of cellular organelles 

• CEMOVIS for high-biological-resolution imag¬ 
ing in cellular cryo-electron tomography and 
for determination of the ultrastructure of 
macromolecules 

Cross-References 

► CEMOVIS 

► Cryo-electron Microscopy 

► Cryo-electron Tomography 

► Electron Microscopy: Classical Sample Preparation 

► Ereeze-Substitution 

► Single Particle Tracking 
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Electron Transfer Cofactors 

Andrew W. Munro and Kirsty J. McLean 
Manchester Interdisciplinary Biocentre, University of 
Manchester, Manchester, UK 

Synonyms 

Redox cofactors 


Definition 

Electron transfer cofactors are non-protein mole¬ 
cules that bind to proteins and enzymes and act as 
conduits for the passage of electrons in redox 
reactions. 


Introduction 

Life in all organisms relies on electron transport to 
facilitate molecular transformations and for fundamen¬ 
tal processes such as respiration and photosynthesis. 
These reactions are described by ► electron transfer 
theory. Biological electron transfer reactions are 
performed predominantly by proteins and generally 
rely on the presence of cofactors: non-proteinaceous 
molecules hound to proteins and enzymes that enable 
their biological functions. The cofactors may be 
organic molecules such as ► flavin adenine dinucleo¬ 
tide (FAD) and flavin mononucleotide (FMN), ► nic¬ 
otinamide adenine dinucleotide (NADH), and 

► quinone cofactors. The cofactors may be inorganic, 
such as ► iron-sulfur clusters, or organometallic such 
as ► hemes. These cofactors are components of respi¬ 
ratory and photosynthetic electron transport chains as 
well as metabolic enzymes. For example, iron-sulfur 
clusters are found in ubiquinol-cytochrome c reduc¬ 
tase (► Ubiquinol-Cytochrome c Oxidoreductase 
(Complex III)) and copper centers are found in ► cyto¬ 
chrome c oxidase (complex IV) in the mammalian 

► mitochondrial respiratory chain (Yoshikawa et al. 
2006). FMN is present in the respiratory ► NADH- 
uhiquinone oxidoreductase (complex I). Generation 
of nitric oxide and L-citrulline from L-arginine 
requires successive NADPH-driven oxygen reduction 
steps at the heme center of nitric oxide synthase, an 
enzyme that uses both FAD and FMN as electron 
carriers from NADPH to the heme iron (Stuehr et al. 
2004). The most common electron transfer cofactors 
are described in detail in other entries in this encyclo¬ 
pedia. However, nature has evolved a wide range of 
different electron transferring cofactors. To illustrate 
the diversity of these cofactors, the structure, mecha¬ 
nism, and biochemical roles of representative exam¬ 
ples of some of the less common electron transfer 
cofactors are described. 

Mononuclear Iron Centers 

There are several examples of iron-sulfur clusters 
involved in electron transfer reactions, including 
those incorporated in ferredoxin proteins that partici¬ 
pate in important biological functions such as photo¬ 
synthesis and reduction of cytochrome P450 enzymes 
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(Munro et al. 2007). Typically, iron ligation is by both 
cysteinate and inorganic sulfide (although Rieske-type 
2Fe-2S clusters have one of the irons coordinated by 
two histidines). However, the simplest iron-sulfur cen¬ 
ter is the mononuclear iron center found in rubredoxins 
from bacteria and archaea, and involves a single iron 
atom tetrahedrally coordinated by four cysteinates. 
Rubredoxins are single electron transfer proteins, 
with the iron atom shuttling between Fe^"^ and Fe^"^ 
states and with standard midpoint reduction potential 
(► redox potential) for the iron typically being around 
0 mV. Roles include electron transfer to the non-heme 
iron center of microbial alkane hydroxylases to initiate 
pathways for alkane degradation and utilization, with 
electrons being supplied to the rubredoxin by 
a flavoprotein rubredoxin reductase, and ultimately 
derived from NAD(P)H (Van Bielen and Funhoff 
2007). Rubredoxins also support electron transfer to 
microbial superoxide reductases, resulting in conver¬ 
sion of superoxide to hydrogen peroxide (Pinto et al. 
2010). There are many other examples of mononuclear 
iron centers in nature, several of which do not have 
cysteinate ligands to the iron. Other ligands may be 
nitrogen atoms from histidine, and oxygen atoms from 
the side chains of aspartate, glutamate, asparagine, 
glutamine or tyrosine, or from water. An unusual 
example of oxygen ligation in a mononuclear iron 
protein is seen in lipoxygenases, where the C-terminal 
carboxylate from an isoleucine provides one of the iron 
ligands (Ivanov et al. 2010). An example of an iron 
ligand set for a mononuclear iron enzyme is shown in 
Fig. 1 for the iron superoxide dismutase (Fe-SOD) 
from Escherichia coli and other bacteria. The five 
ligands are provided by nitrogens from three histidines 
and oxygens from an aspartate side chain and from 
a water molecule. 

Mononuclear iron proteins generally have penta- or 
hexacoordinate iron, and usually have one position 
vacant (or accessible by ligand displacement) for 
the binding of dioxygen (or superoxide) - with 
the majority of the mononuclear iron proteins catalyz¬ 
ing reductive oxygen activation. Examples include 
monooxygenases (e.g., aromatic amino acid 
hydroxylases, such as phenylalanine hydroxylase) 
and dioxygenases (e.g., extradiol aromatic ring 
cleavage dioxygenases, such as protocatechuate 
4,5-dioxygenase). However, nitrile hydratase activity 
in bacteria is associated with either a mononuclear iron 
enzyme (with the low-spin ferric iron coordinated by 



Electron Transfer Cofactors, Fig. 1 Ligation of the iron atom 
in the mononuclear iron protein Fe-superoxide dismutase. 
Ligands are provided by three histidine nitrogens, one aspartate 
carboxylate oxygen, and a water molecule (or hydroxide ion) 

three cysteine thiolates, two main chain amide nitro¬ 
gens and a displaceable oxygen from water or hydrox¬ 
ide) or a non-corrinoid cofactor cobalt ion center 
(Prasad and Bhalla 2010). In addition, the mononu¬ 
clear iron center in the photosynthetic reaction center 
of purple bacteria also has a non-oxygen activation 
role, and is sited between (and spin-coupled to the 
semiquinone forms of) the electron transferring 
quinone cofactors (Qa and Qb) (Cogdell et al. 2003). 

Corrinoid Electron Transfer Cofactors 

Corrinoids are heterocyclic ring structures (based on 
the structure of corrin) that possess four pyrrole rings 
and show structural relationships to the porphyrin 
cofactors, such as heme h found in hemoglobin and 
the cytochromes P450. Cobalamin cofactors are the 
best known of the corrinoids, all possessing a cobalt 
atom at the center of the macrocycle, coordinated 
equatorially by four corrin ring nitrogens (Fig. 2). 
The compound known as vitamin B 12 is cyanocobala- 
min, which has dimethylbenzimidazole and cyanide 
groups as axial ligands. However, this is a non- 
physiological form of the cofactor, and the prevalent 
natural forms have cyanide replaced by a 5 ’ -deoxy- 5 ’ - 
adenosyl (coenzyme B 12 ) or a methyl group 
(methylcobalamin), respectively (Krautler 2005). B 12 
derivatives can exist in three different oxidation 
states - Co(III), Co(II), and Co(I). The number of 
axial ligands decreases with the increasing reduction 
of the metal (2, 1, and 0, respectively). Illustrations of 
the importance of the redox chemistry of these cofac¬ 
tors in physiology can be seen first for the 
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Electron Transfer 
Cofactors, Fig. 2 Structures 
of cobalamin cofactors. The 
structure of the cobalamin 
corrinoid ring (with central 
cobalt ion) is shown. One of 
the axial ligands is provided by 
dimethylbenzimidazole. The 
other axial ligand (denoted by 
R) is cyanide (in vitamin B 12 ), 
methyl (in methylcobalamin), 
or a 5’-deoxy- 5’-adenosyl 
group (coenzyme B 12 ) 



methylcobalamin form in the enzyme methionine 
synthase (MS). MS catalyses methyl group transfer to 
homocysteine following heterolytic cleavage of the 
C-Co bond, with concomitant change in formal oxida¬ 
tion state of the Co from Co(III) to Co(I). The second 
enzymatic step is the transfer of a methyl group from 
methyltetrahydrofolate to regenerate the starting 
enzyme state, and producing tetrahydrofolate (which 
is used in, e.g., nucleic acid and amino acid metabo¬ 
lism). The propensity of the Co(I) form of MS to 
be oxidized to Co(II) is addressed by regeneration 
of the enzyme through reductive reactions with 
reduced flavodoxin (in bacteria) or the diflavin 
enzyme methionine synthase reductase (MSR) 
(Koutmos et al. 2009). MS is one of only two cobala- 
min-dependent human enzymatic reactions. The other 
is methylmalonyl CoA mutase (MMCM), which binds 
coenzyme B 12 and undergoes heterolytic cleavage of 
the C-Co to generate a 5’-deoxy-5’-adenosyl radical 
and a 5-coordinate Co(II) cofactor. Radical-induced 


reactions with MMCM result in the rearrangement of 
the substrate and formation of succinyl CoA (a citric 
acid cycle intermediate) prior to rebinding of 
the 5’-deoxy-5’-adenosyl radical to the cofactor to 
restore the starting form (Banerjee 2003). Several 
other important enzymatic redox reactions rely on 
cobalamin cofactors, including bacterial glycerol 
dehydratase (in metabolism of glycerolipids) and 
ethanolamine ammonia lyase (which catalyses deami¬ 
nation of ethanolamine to produce ammonia and 
acetaldehyde). 

Copper Centers 

Several pivotal biological processes rely on electron 
transfer through copper centers. As with the mononu¬ 
clear iron centers, copper atoms are ligated to protein 
scaffolds using various N-, S-, and 0-ligands 
derived from amino acids and water/hydroxide. 
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Electron Transfer Cofactors, Fig. 3 Structures of copper 
centers in proteins. Panel A shows the coordination of the 
type I copper center from the bacterial protein amicyanin. 
Panel B shows copper coordination in a CuZn superoxide 


dismutase (e.g., that in human cell cytoplasm). Panel C show 
a type III center in the case of the mollusc oxygen carrier 
hemocyanin 


The mitochondrial inner membrane cytochrome c oxi¬ 
dase (complex IV), responsible for the 4-electron 
reduction of dioxygen to water and concomitant proton 
translocation that participates in development of 
a proton gradient across the membrane, contains two 
copper centers - Cu^ and Cub- The CuA is a binuclear 
copper center that receives electrons from cytochrome 
c and transfers them to a heme a center, which in turn 
passes electrons to a binuclear Cue-heme 03 center. 
Cui of Cua is ligated by histidine nitrogen and methi¬ 
onine sulfur, with glutamate oxygen and histidine 
nitrogen providing ligands to the Cu 2 . Bonds from 
two cysteine residues bridge both copper atoms. 
The CuA copper atoms cycle between EPR active 
Cu(I)/Cu(II) and EPR silent Cu(II)/Cu(II) states during 
electron transfer (Yoshikawa et al. 2006). The 
Cue/heme <23 center is the site of oxygen reduction, 
with the Cub coordinated by three histidine nitrogens 
and a fourth hydroxide ion ligand shared with the heme 
<23 in the resting state of the enzyme. 

The more widespread types of redox active biolog¬ 
ical copper centers are named types I, II, and III, as 
illustrated in Pig. 3. Type I centers (often referred to as 
blue copper proteins or ► cupredoxins) typically have 
two histidine nitrogens and one cysteine sulfur ligands, 
with a fourth oxygen or sulfur ligand. They are found 
in proteins involved purely in electron transfer (e.g., 
bacterial azurins and photosynthetic plastocyanins) 
and in enzymes with oxidase functions (e.g., ascorbate 
oxidase) (Sakurai and Kataoka 2007). Type II centers 
are found in several enzymes (mainly reductases, 
monooxygenases, and dioxygenases) and are 4- or 


5-coordinated, with 1^ histidine nitrogen ligands, 
and other(s) provided by oxygen- or sulfur donors. 
Examples are galactose oxidase and Cu-Zn 
► superoxidase dismutase, with one of the histidines 
ligated to both Cu and Zn in the latter case (Fig. 3) 
(MacPherson and Murphy 2007). Type III copper pro¬ 
teins have histidine-ligated metals, with examples 
including the related phenol oxidases tyrosinase and 
catechol oxidase, and the oxygen carrier hemocyanin 
found in molluscs and certain arthropods (Eicken 
et al. 1999) (Fig. 3). 

Factor F430 and Methanogenesis 

Factor F430 is a nickel-containing tetrapyrrole cofac¬ 
tor, in which the tetrapyrrole is more highly reduced 
than are other related structures (e.g., hemes and chlo¬ 
rophylls) and referred to as a corphin (Fig. 4). Its name 
comes from the optical absorption maximum due to 
71 - 71 * transitions from the unsaturated parts of the 
molecule. It is found in the enzyme methyl coenzyme 
M reductase from methanogenic archaea (Rouviere 
and Wolfe 1988), and may be involved directly in 
electron transfer to methylated coenzyme 
M (MeCoM) through oxidation of Ni(I) to Ni(II). 
The reductase binds to its substrates MeCoM 
(2-[methylthio]-ethanesulfonate) and coenzyme B 
(CoB, 7-mercaptoheptanoylthreonine phosphate), 
using CoB as an electron donor for reduction of 
MeCoM, leading to liberation of methane in the final 
step of methanogenesis (Ellermann et al. 1988). 
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Electron Transfer Cofactors, Fig. 4 The structure of the 
nickel-porphyrin Factor F430. Factor F430 is found in the 
enzyme methyl coenzyme M reductase from methanogenic 
archaea, where it plays a central role in the release of methane 
from methylated coenzyme M. The cofactor is yellow in color 
and is the only natural porphyrin-based cofactor known to bind 
nickel 

The redox activities of CoB and CoM are also used for 
fumarate reduction to succinate in the cytoplasm of the 
archaeon Methanohacterium thermoautotrophicum 
(Heim et al. 1988). 

Other Electron Transfer Cofactors 

Redox reactions in nature are achieved by many dif¬ 
ferent cofactors, some of which (like factor F430) 
appear relatively infrequently, or which are restricted 
to certain types of organism or domains of life. 
Nonetheless, these cofactors play key roles in various 
physiological processes. A notable example is molyb¬ 
denum, which is found in several bacterial nitrogenase 
enzymes (sometimes replaced by iron or vanadium). 
Nitrogenase catalyses the energetically expensive “fix¬ 
ing” of nitrogen to ammonia, an 8 -electron reduction 
process that also requires energy input in the form of 
hydrolysis of several molecules of ATP to enable 
breakage of the stable N 2 triple bond (Seefeldt et al. 
2009). A complex molybdenum- and iron-containing 
(MoFe) cofactor is required for the reaction, and the 
heterotetrameric MoFe protein is sequentially reduced 
by an ATP-bound homodimeric Fe-protein partner in 
order to achieve bond breakage in the dinitrogen 


substrate. The precise MoFe nitrogenase mechanism 
is still to be defined. Molybdenum is frequently found 
bound to a complex pyranopterin cofactor structure 
called molybdopterin (or MOCO, standing for molyb¬ 
denum cofactor). Important eukaryotic reactions 
include mitochondrial sulfite reductase (in sulfur 
metabolism) and plant nitrate reductase (in nitrogen 
assimilation). Molybdenum enzymes frequently trans¬ 
fer oxygen atoms (usually to, or from, water) and the 
metal ion usually shuttles between Mo(VI) and Mo 
(IV) oxidation states. Another important mammalian 
example is liver xanthine oxidase, a key enzyme in 
purine breakdown (Schwarz et al. 2009). Other impor¬ 
tant examples of redox active cofactors are 
( 1 ) lipoamide, which is, for example, successively 
oxidized and then reduced in the pyruvate dehydroge¬ 
nase catalytic cycle (Berg and de Kok 1997); (2) the 
antioxidants ascorbic acid (vitamin C) and glutathione 
(GSH); (3) ubiquinone (coenzyme Q) and 
menaquinone (vitamin K 2 ) in eukaryotic mitochon¬ 
drial/prokaryotic respiratory electron transfer; and 
(4) tetrahydrobiopterin (H 4 B) as a single electron 
donor/acceptor in the generation of nitric oxide by 
NOS enzymes, and as a two electron donor in the 
oxygenase reactions catalyzed by enzymes such as 
the aromatic amino acid monooxygenases phenylala¬ 
nine hydroxylase and tryptophan hydroxylase 
(Wei et al. 2003). 

Summary 

Electron transfer reactions are critical in biology for 
energy generation and exploitation, and nature has 
evolved an array of cofactor molecules that enable elec¬ 
tron transfer reactions to occur within enzyme scaffolds. 

Cross-References 
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► Cupredoxins 

► Electron Transfer Proteins; Overview 
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Electron Transfer Flavoproteins 
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Synonyms 

Electron transferring flavoprotein 


Definition 

Electron transfer flavoproteins are dimeric proteins 
that function as mobile electron carriers. These pro¬ 
teins are grouped in three classes: (a) flavoproteins 
serving as electron acceptors for flavoprotein dehydro¬ 
genases in mammalian mitochondria and in several 
bacteria; (b) those heterodimeric proteins that function 
in nitrogen fixation and are structurally related to 
mammalian electron transfer flavoprotein; (c) putative 
proteins related to mammalian electron transfer flavo¬ 
protein, encoded by genes in the fix region of the 
Escherichia coli genome. 

Electron Transfer Flavoproteins: Structure 
and the Reductive Half-Reaction 

Electron transfer flavoprotein (ETF) and its electron 
acceptor, ETF-ubiquinone oxidoreductase (ETF-QO) 
play essential roles in the P-oxidation of fatty acids, 
branched-chain amino acids, lysine, tryptophan, and 
N-methyl amino acids (N-methyl-glycine and N, 
N-dimethylglycine) in mammalian mitochondria. 
Mechanistic and structural aspects of the primary 
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flavoprotein dehydrogenases that function in the mito¬ 
chondrial oxidation of these compounds and use ETF 
as the electron acceptor are well known. ETF and 
ETF-QO can be considered a branch of the ► mito¬ 
chondrial electron transport chain with 11 sites for 
input of electrons from flavoprotein dehydrogenases 
and a single site for electron output to complex III 
through ubiquinone (UQ). The mitochondrial ETF 
and ETF-QO are the best characterized among all 
families of these evolutionarily conserved >• electron 
transfer proteins. Human ETF is a heterodimer (aP) 
with subunits of about 32 kD (a) and 28 kD (P). ETFs 
contain three structural domains. The a subunit has 
two globular domains, al and all, which are connected 
by a seven-amino acid flexible linker. The third 
domain is the P subunit. The protein contains a single 
equivalent of ► flavin adenine dinucleotide (FAD); the 
only redox center in the protein. FAD is noncovalently 
bound to the all domain and located in a cleft between 
the all domain and the P subunit. A single equivalent 
of 5’-adenosine monophosphate (AMP) is buried in the 
P subunit. AMP has been identified in most ETF pro¬ 
teins (Fig. 1). AMP plays a structural role in the mat¬ 
uration of ETF proteins. Human, porcine, and 
Paracoccus denitrificans ETF can be renatured only 
in the presence of both FAD and AMP after denatur- 
ation with 6 M guanidine hydrochloride (Griffin et al. 
1997). The hydroxyl of a266Thr (numbering is based 
on the mature human sequence) is within hydrogen 
bonding distance of the flavin N5 and modulates the 
► redox potential of the flavin. Foss of the hydrogen 
bond in the aThr266Met mutant of human ETF is 
a pathogenic mutation in multiple acyl-CoA dehydro¬ 
genation deflciency, an inherited metabolic disease. 
The mutation results in a 0.06 V decrease in the 
redox potential of the ETF flavin (Salazar et al. 
1997). The 4'-hydroxyl of the ribityl side chain of 
FAD and a286His are within hydrogen bonding dis¬ 
tance of the N1 and the C2 carbonyl oxygen of the 
flavin (Roberts et al. 1996). These hydrogen bonds also 
stabilize the flavin semiquinone and modulate the 
flavin potential. The potential of the semiquinone/ 
oxidized couple of human ETF substituted with the 
analog, 4'deoxy-FAD, is 0.116 V less than that of 
native ETF. The potential of the semiquinone/ 
dihydroquinone couple has not been explicitly deter¬ 
mined; however, the two-electron reduced state is gen¬ 
erated only with great difficulty (Dwyer et al. 1999). 
The hydrogen bond between N1 of the ETF flavin and 



Electron Transfer Flavoproteins, Fig. 1 Ribbon diagram of 
human ETF. The a. subunit is shown in blue and the P subunit is 
shown in violet. The FAD (yellow) is bound in the crevice 
between domains I (a subunit) and III (P subunit); AMP 
(green) is bound to the P subunit (Roberts et al. 1996,) (Copy¬ 
right 1996 National Academy of Sciences, U.S.A) 

the 4'hydroxyl of the ribityl side chain of the FAD has 
been implicated in the oxidative half-reaction of ETF. 
The C4(0) and N3 of the isoalloxazine ring form 
hydrogen bonds with atoms in the a subunit backbone 
and are expected to contribute to modulation of the 
redox potential of the flavin. 

The all domain of ETF which bears the FAD pros¬ 
thetic group is highly mobile and rotates 30-50° 
toward the al domain in the rigid x-ray structure. 
Conformational mobility was predicted from the crys¬ 
tal structure which showed that the isoalloxazine rings 
of the ETF flavin and medium-chain acyl-CoA dehy¬ 
drogenase flavin were 19-25 A apart in models with 
the rigid structures of the proteins (Roberts et al. 1996). 
This great a distance is very unfavorable for ► electron 
transfer through proteins (14 A). In the crystallized 
ETF-medium chain acyl-CoA dehydrogenase com¬ 
plex, ETF is in an open conformation. The minimum 
distance between the isoalloxazine rings of the two 
flavins is 9.7 A which would be expected to increase 
the rate of for electron transfer by orders of magnitude 
relative to the other conformation. The mobility of the 
all domain ETF also permits the this flavin-containing 
domain to adopt a number of open conformations 
to allow rapid electron transfer in the reductive 
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half-reactions with multiple flavoprotein dehydroge¬ 
nases when there is no conserved sequence in the 
dehydrogenases to dock with the all domain. 
A conserved recognition loop on the (3 suhunit serves 
as an anchor to dock ETF on the dehydrogenases. The 
docking site on the dehydrogenases is hydrophobic 
with a conserved three-dimensional structure but few 
residues are conserved in this hydrophobic patch. The 
overall hydrophobicity of this patch is retained to 
accommodate the PLeul95 side chain in the recogni¬ 
tion loop of ETF. This leucine residue is conserved in 
the ETF family. A similar interaction of the P subunit 
may stabilize the electron transfer complex with ETF- 
QO based on chemical cross-linking of the P subunit 
with ETF-QO (Toogood et al. 2007). 

ETF flavin is reduced by the flavoprotein dehydro¬ 
genases to the anionic semiquinone, which is kineti- 
cally stabilized. The oxidative half-reaction of reduced 
ETF catalyzed by ETF-QO is complicated by the ETF- 
QO catalyzed, rapid disproportionation of ETF 
semiquinone (1) in which ETF semiquinone acts as 
an electron donor and electron acceptor. 

2ETFie- ^ ETFo, + ETFze- (1) 

The reaction appears kinetically competent to 
participate in the overall reduction of UQ by ETF- 
QO and may be physiologically significant. 

The Oxidative Half-Reaction of ETF and 
ETF-QO 

ETF-QO is bound to the inner mitochondrial mem¬ 
brane and contains two redox centers, a [4Fe4S]’^^’’^* 
► iron-sulfur cluster and FAD. Both are in the oxi¬ 
dized form when the protein isolated. The crystal 
structure of the porcine protein, which has 92% 
sequence identity with the human protein, has three 
functional regions, a FAD domain, an iron-sulfur 
cluster domain, and a ubiquinone domain. There are 
no discrete structural domains, and the functional 
domains are closely packed and share structural ele¬ 
ments. The structure suggests that ETF-QO is 
a monotopic membrane protein with an a helix 
(Gly427-Trp451) and a P hairpin (Phel 14-Leul31) 
inserted in one side of the inner membrane bilayer. 
These two elements form the entry to a channel for 
UQ to enter from the membrane phase. The cluster 



Electron Transfer Flavoproteins, Fig. 2 Ribbon diagram of 
porcine ETF-QO. The structural domains are blue (flavin 
domain), red (iron-sulfur cluster domain), green (ubiquinone 
binding domain), cyan (putative membrane associated regions), 
and the mitochondrial membrane is represented by the blue 
shaded area. The FAD is shown in gold, the iron-sulfur cluster 
in red, and the ubiquinone in dark violet (Zhang et al. 2006) 
(Copyright 2006 National Academy of Sciences, USA) 

lies about 8 A from the surface of the protein, while 
the flavin lies greater than 14 A from the surface. 
Distances among the three redox centers are: 4Fe4S 
to UQ, 19 A, 4Fe4S to flavin 11.5 A and flavin to UQ, 
8.5 A (Fig. 2). The distance of the redox prosthetic 
groups from the protein surface and the distances 
among the three redox elements are all consistent with 
an electron transfer pathway from ETF involving, in 
sequence, the iron-sulfur cluster ^ FAD —> UQ (Zhang 
et al. 2006). The potential of the iron-sulfur cluster in 
ETF-QO is -1-0.047 V, the potential of the semiquinone/ 
oxidized couple of ETF is -1-0.004 V, and the potential 
of the dihydroquinone/semiquinone couple of ETF is 
—0.050 V. The disproportionation of ETFie. (1) is 
sufficiently rapid to generate ETF 2 e- with the potential 
to drive electrons to UQ (Em? = 0.06 V). The potentials 
of the FAD in ETF-QO are -1-0.028 for the first electron 
and —0.006 for the second electron. Although a three- 
electron reduced form of the protein can be generated 
by chemical reduction with a low potential reductant, 
enzymatic reduction by an acyl-CoA substrate and an 
appropriate acyl-CoA dehydrogenase in the presence 
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of ETF yields the two-electron reduced ETF-QO. In 
the one-electron reduced protein, the electron rapidly 
equilibrates between the flavin and cluster (Swanson 
et al. 2008). 

The high redox potential of the iron-sulfur cluster, 
-1-0.047 V, is due to hydrogen bonding of the Sy of 
three of the four cysteine residues that ligate the iron 
atoms. N1 and C2(0) of the isoalloxazine ring of the 
FAD are within hydrogen bonding distance of 
Thr376, and N3 and C4(0) are hydrogen bonded to 
oxygen and nitrogen atoms in the protein backbone. 
These interactions and the positive dipole of 
a nearby helix stabilize the anionic flavin 
semiquinone in the reduced protein. 04 of UQ is 
hydrogen bonded to the nitrogen of Gly273 and 
oxygen of Gly272. The UQ binding pocket is lined 
with hydrophobic residues and is not similar to the 
binding pockets of other UQ-dependent proteins 
(Zhang et al. 2006). ETF-QO from Rhodohacter 
sphaeroides has been employed in some biophysical 
investigations taking advantage of its 67% sequence 
identity with human ETF-QO and the ability to pro¬ 
duce large amounts of the recombinant protein. The 
overall structure of R. sphaeroides ETF-QO and 
redox potentials of the flavin and cluster are very 
similar to the mammalian ETF-QO. Site-directed 
mutagenesis of the hydrogen bond donors, TyrSOl 
and Thr525, to the Sy of the cysteine residues that 
ligate the cluster resulted in a decrease in potential of 
0.097 V. TyrSOl and Thr525 are structurally identi¬ 
cal to Tyr533 and Thr558 in porcine ETF-QO. The 
potential of the double mutant, Tyr501/Thr525, 
decreased to 0.165 V such that in the two-electron 
reduced protein, both electrons reside on the flavin 
(Swanson et al. 2008). The flavin in R. sphaeroides 
ETF-QO is hydrogen bonded at N1 and C(2)0 by an 
asparagine residue rather than the threonine residue 
in the porcine and human proteins. This hydrogen 
bond stabilizes the anionic semiquinone. Elimina¬ 
tion of hydrogen bonding by substitution of the 
asparagine residue with an alanine residue decreases 
the potential of the semiquinone/oxidized couple by 
0.085 V and the dihydroquinone/semiquione couple 
by 0.019 V (9). These mutations have no detectable 
effect on the structure of the protein as judged by the 
absorption spectra, the ► EPR g-values and mutation 
at one center has no effect on the potential of the 
other redox center. When the potential of the cluster 
was decreased by this strategy, the rate of the 


disproportionation reaction catalyzed by the double 
mutant was reduced 90% as was the overall reaction 
assayed by reduction of UQ. The disproportion reac¬ 
tion catalyzed by the Asn338Ala mutant protein with 
the decreased flavin potentials was unaltered; how¬ 
ever, the rate of UQ reduction decreased 90%. These 
results are consistent with the structural data that 
indicate that the cluster is the immediate electron 
acceptor from reduced ETF and that UQ reduction 
occurs at the flavin center (Swanson et al. 2008). The 
hydrogen bond between the 4'hydroxyl of the FAD 
in ETF and N1 of the ETF flavin have been impli¬ 
cated in electron transfer from ETF to ETF-QO 
because ETF substituted with 4'-deoxyFAD neither 
accepts from nor donates electrons to ETF-QO 
(Dwyer et al. 1999). 

Reactive Oxygen Generated in the ETF/ 
ETF-QO Branch of the Electron Transport 
Chain 

The ETF and ETF-QO branch of the mammalian mito¬ 
chondrial respiratory chain produce ► reactive oxygen 
species (ROS) in muscle, heart, and liver mitochondria 
and release the ROS on the matrix side of the inner 
mitochondrial membrane. In the case of fatty acid 
oxidation, matrix concentrations of long-chain fatty 
acid catabolic intermediates play a role in determining 
ROS concentrations (Seifert et al. 2010). ROS have 
been implicated in oxidative damage to membrane 
lipids, proteins, and DNA and play a role in aging 
and pathology resulting from ischemia-reperfusion 
and many neurodegenerative diseases. More recently, 
ROS have been recognized to be a component of some 
cellular signaling mechanisms involving some hor¬ 
mones, coagulation factors, and changes in oxygen 
tension resulting in hypoxia. 

Mutations in either ETF or ETF-QO are the cause 
of multiple acyl-CoA dehydrogenation deficiency, an 
inherited metabolic disease which is lethal in its neo¬ 
natal forms. In its less severe form, multiple acyl- 
CoA dehydrogenation deficiency presents later in 
life as a neuromuscular disease, occasionally accom¬ 
panied by encephalopathy. ROS production is ampli¬ 
fied in these patients with ETF-QO defects and in 
some of these patients is reversed by pharmaceutical 
doses of riboflavin, the precursor of the FAD pros¬ 
thetic group. 
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Evolutionary Conservation of ETF and 
ETF-QO 

In all biological kingdoms, ETF proteins that have 
been identified are soluble heterodimeric proteins as 
the mammalian ETF and contain FAD and AMP. An 
exception to the other eukaryotic and bacterial ETF 
proteins is the ETF from the anaerobe, Megasphera 
elsdenii. This ETF does not contain AMP, but rather 
contains a second molecule of FAD which functions in 
an NADH dehydrogenase activity for the reduction of 
crotonyl-CoA by a butyryl-CoA dehydrogenase. All 
ETF proteins function as mobile electron carriers. 
The ETF proteins from P. denitrificans and 
Methylophilus methylotrophus are the best character¬ 
ized in bacterial systems and their functions and over¬ 
all structures are analogous to the mammalian ETF. 
These bacterial ETF proteins also exhibit the segmen¬ 
tal motion of the all, flavin-containing domain. Pro¬ 
teins very similar to the eukaryotic and bacteria ETF 
proteins are found in nitrogen-fixing and diazotrophic 
bacteria. In these microorganisms, the FixA protein is 
analogous to the a subunit of ETF and the FixB protein 
is equivalent to the (3 subunit of ETF. With the products 
of the FixC and FixX genes, the FixA and FixB proteins 
are essential for nitrogen fixation. The FixC and FixX 
gene products are similar to the amino terminal and 
carboxyl terminal regions of the mammalian ETF-QO 
protein, in which the amino terminal sequence contrib¬ 
utes to the FAD domain and the carboxyl terminal 
contains the characteristic sequence for binding an 
iron-sulfur cluster. These proteins do not transfer elec¬ 
trons into a respiratory electron transport chain, but 
function in a system that reduces N 2 via a ferredoxin 
and a nitrogenase reductase/nitrogenase system 
(Toogood et al. 2007). 

ETF-QO is highly conserved among mammals, 
bacteria, yeast, plants, and invertebrates (Zhang et al. 
2006). Among the invertebrates, the parasitic nema¬ 
tode, Ascaris suum, carries out an anaerobic mitochon¬ 
drial energy metabolism and utilizes a low-potential 
quinone, rhodoquinone (Eq = —0.064 V) to reduce an 
electron transfer flavoprotein-rhodoquinone oxidore- 
ductase. The potential of the cluster, -1-0.025 V, is 
slightly less than the clusters in the mammalian ETF- 
QO. The quinone is reduced by NADH via a rotenone- 
sensitive, energy-transducing complex I-type activity. 
The terminal electron acceptors in this pathway are 
2-methylcrotonyl-CoA and 2-methyl-2-pentenoyl- 


CoA. Reduction of these enoyl-CoAs requires an 
ETF and a 2-methyl-branched chain enoyl-CoA reduc¬ 
tase (Ma et al. 1993). In the plant kingdom, 
Arahadopsis thaliana is dependent on an ETF/ETF- 
QO system to oxidize leucine and phytol in dark- 
induced sucrose starvation and certain mutants 
(Ishizaki et al. 2005). 

Cross-References 

► Electron Transfer Through Proteins 

► Flavins 

► Iron-Sulfur Clusters 

► NADH-Ubiquinone Oxidoreductase (Complex I) 

► Quinone Cofactors 

► Reactive Oxygen Species 

► Redox Potential 

► Ubiquinol-Cytochrome c Oxidoreductase 
(Complex III) 
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Definition 

Catalases (EC 1.11.1.6) are enzymes that catalyze the 
disproportionation of hydrogen peroxide into water 
and molecular oxygen by means of a heme iron or 
a dinuclear manganese active site. They have an essen¬ 
tial role in the regulation of the cellular concentration 
of hydrogen peroxide. 

Catalase-peroxidases, also named KatGs in rela¬ 
tion to the encoding katG gene, are heme-containing 
oxidoreductases that are capable of disproportion¬ 
ation of hydrogen peroxide as monofunctional cata¬ 
lases, as well as of oxidation of substrates via high- 
valence iron-oxo intermediates as monofunctional 
peroxidases. KatGs can also perform INH lyase, 
NADH oxidase, and isonicotinoyl-NADH synthase 
reactions. 

Basic Characteristics 

Catalases are protective enzymes that convert the 
potentially damaging H 2 O 2 into H 2 O and O 2 . 
Catalases fall into three distinct phylogenetic 
groups: the heme-containing monofunctional cata¬ 
lases, the heme-containing catalase-peroxidases and 
the manganese-containing catalases. Evidence for 


electron transfer exists only in the two heme- 
containing groups. Conceptually, the purpose of elec¬ 
tron transfer in heme enzymes is to move electrons 
from an electron rich substrate, usually on the surface 
of the protein, to an electron deficient, usually oxidized 
heme. However, both monofunctional catalases and 
catalase-peroxidases react with hydrogen peroxide 
directly at the heme iron site in the ferric oxidation 
state, serving as oxidant and reductant in a two-step 
process (Reactions 1 and 2) and involving an 
oxoferryl-porphyrin (For) radical intermediate called 
Compound I (Cpd I) (Nicholls et al. 2001). This article 
describes the reactions catalyzed by these enzymes and 
addresses the question of why internal electron transfer 
should be needed and supported in catalases. 

[Fe“‘ For] -F H 2 O 2 ^ [Fe’'' = O For*+] -F HjO (1) 

[Fe^'' = O Por*+] -F H 2 O 2 ^ [Fe'“ For] -F H 2 O -F O 2 

( 2 ) 

Monofunctional Catalases 

Two different lines of evidence suggest a role for 
electron transfer in monofunctional catalases. One 
involves the movement of electrons from ► nicotin¬ 
amide adenine dinucleotide phosphate (NADFH) to 
the heme cavity in a subset of catalases. NADFH was 
first discovered in bovine liver catalase but has since 
been identified in other catalases from bacteria 
to humans (Kirkman and Gaetani 2006). Electron 
transfer is inferred from the fact that NADFH, bound 
15 A from the heme, enhances enzymatic activity by 
preventing the formation of the Compound II 
(Fe*'*' = O or Fe*'^-OH For) species due to the sponta¬ 
neous one-electron reduction of Cpd I. Specifically, 
when the two-electron reduction of Cpd I is slow 
(low levels of H 2 O 2 ), Cpd I can undergo one-electron 
reduction to Cpd II, a species that is unreactive with 
H 2 O 2 , unless electron transfer from NADFH is avail¬ 
able to return Cpd I directly to resting ferric state (Fe^^^ 
For). The electron transfer process is postulated to 
occur through a multistep and through-bond event 
between NADFH and the heme edge (Olson and 
Bruice 1995; Sicking et al. 2008). While experimental 
verification of the pathway is required, the evidence for 
electron movement makes this process reminiscent of 
a peroxidatic reaction, albeit one involving a two- 
electron reduction (Reaction 3). 
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Electron Transfer in Catalases and Catalase-Peroxidases, 
Fig. 1 Structure of the N-terminal domain of B. pseudomallei 
KatG with isoniazid (INH) bound (PDB reference 3N3N). The 
three radical sites (Trp330, Trpl39, and Tipl53) for the 
Fe"^ = O Trp* intermediates identified by multifrequency EPR 
spectroscopy are shown in green. The tryptophan residues in 
blue and the structural waters in red are required for electron 
transfer between Tipl53 or Trpl39 and the heme, with the paths 
indicated by blue and white dashed lines. The long distance 


(24.6 A) between the INH substrate and the heme precludes 
a single-step tunneling phenomenon to occur at biologically 
relevant rates. Hence, multistep tunneling or electron hopping 
is likely to occur, with distances of 8.1 A from INH to Trp202, 
4.5 A from Trp202 to Trp 139, and 9.7 A from Trp 139 and 
Trplll. Interestingly, the distance from INH to the site of the 
reactive radical intermediate Tipl39 is 13.3 A, well within the 
predicted limit of 14 A (Page et al. 1999), but Tip202 may play 
the role of electron relay even in this case 


[Fe’'' = O Por+*l 4- NADPH -f H+ 

(3) 

^ [Fe'“ Por] + NADP+ + HjO 

The second line of evidence for electron transfer in 
catalases is inferred from the identification of a tyrosyl 
radical (Tyr*) intermediate in bovine liver catalase 
(Ivancich et al. 1999). In a peroxidase-like reaction, 
intramolecular electron transfer from a Tyr residue to 
the Por converts the Cpd I intermediate of catalase into 
an oxoferryl heme and the Tyr* species (Reaction 4). 

[Fe^'" = O Por*+] ^ [Fe^'' = O Tyr*]+ H+ (4) 

Only 2 tyrosines (out of 19) in bovine catalase agree 
well with the physicochemical properties revealed by 
multifrequency Electron Paramagnetic Resonance 
(► EPR) spectroscopy studies. Tyr369 is close to the 
heme proximal side and Tyr259 is at the enzyme 
surface. The formation of an oxoferryl-tyrosyl radical 


intermediate implies an inherent peroxidase capability 
in catalases for which either the full pathway has been 
lost in evolution or for which the natural substrate has not 
been identified yet. Whether this putative peroxidase-like 
reaction might involve intramolecular one-electron trans- 
fer(s), as implied by the formation of tyrosyl radical(s), or 
is limited to the two-electron transfer from NADPH 
remains to be determined. Quenching of other short¬ 
lived Tyr* species may be the route by which inactive 
Cpd II is formed leading to the need for NADPH. 

Catalase-Peroxidases 

Catalase-peroxidases, commonly called KatGs, con¬ 
tain an unusual adduct of three cross-linked residues 
Met-Tyr-Trp (see Fig. 1) required for the catalatic 
reaction (Reactions 1 and 2). The adduct formation 
involves electron transfer initiated by oxidation of the 
nearby heme active site (Ghiladi et al. 2005), and 
radical formation on the adduct has been proposed as 
part of the catalatic process (Zhao et al. 2010). 
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However, there is a controversy on the assignment of 
the EPR spectra of the putative adduct radical and this 
leaves its role in the catalatic process undefined. 

On the other hand, multifrequency EPR studies have 
unequivocally identified tryptophan radicals (Trp*) in 
KatGs from Synechocystis (Ivancich et al. 2003), Myco¬ 
bacterium tuberculosis (Singh et al. 2007), and 
Burkholderia pseudomallei (Colin et al. 2009) formed 
as stable and catalytically competitive oxoferryl-Trp* 
intermediates in the peroxidase cycle. B. pseudomallei 
KatG proved to he the more general case among bacte¬ 
rial KatGs, in which three unique Trp* intermediates are 
formed (Pig. 1). One is on Trp330 very close to the 
heme; another one is on Trp 153 close to the enzyme 
surface and requiring a long-range electron transfer 
pathway to the heme (assisted by structural waters and 
two Trp acting as electron relays); and a third one is on 
Trp 139 half way between the heme and the surface and 
requiring the Trp of the adduct as electron relay (Pig. 1). 
Trp 139 is close to a funnel-shaped channel, where iso- 
niazid has been shown to bind (Wiseman et al. 2010). 
The multiplicity of electron pathways implies 
a multiplicity of substrate-binding sites from which elec¬ 
trons can flow to the heme cavity for the two subsequent 
one-electron reduction steps. Tyrosyl radical species are 
also formed in KatGs but appear to have a less specific 
role as intermediates in the peroxidase reaction. 

Catalase-peroxidases also catalyze the reduction of 
molecular oxygen to superoxide using electron donors 
such as NADH (Singh et al. 2004) and isonicotinyl 
hydrazide (INH) (Wengenack et al. 1999) 
(Reactions 5 and 6). The route for electron transfer 
from NADH to the heme is not known, but the pathway 
leading through the adduct is used for transfer from 
INH (Wiseman et al. 2010). 

IN - NHNH 2 ^ IN* -f HN = NH -f H+ -I- e^ (5) 
[Pe‘“ For] - O 2 -f ^ [Pe™ For] -|- 02*^ (6) 

The role of superoxide formation in normal metab¬ 
olism is not clear, but the process of pro-drug INH 
activation in M. tuberculosis KatG has adapted super¬ 
oxide to the formation of IN-NAD. In conclusion, 
superoxide formation and the peroxidatic process are 
quite different processes, but they use the same under¬ 
lying electron transport pathways (see Fig. 1) to bring 
electrons to the heme cavity from a surface-bound 
electron donor. 


Cross-References 

► Cytochrome c Peroxidase - Computational Studies 

► Electron Transfer Theory 

► Electron Transfer Through Proteins 

► EPR 

► Nicotinamide Adenine Dinucleotide (NAD) 

► Peroxidases 
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Definition 

Proteins involved in electron transfer reactions 


Introduction 

Electron transfer proteins are essential for life 
because of the importance of electron transfer in 
bioenergetics and other processes. They contain 
redox-active prosthetic groups or “redox sites” 
where oxidation/reduction occurs. The most com¬ 
mon redox sites contain metals such as hemes, 
iron-sulfur clusters, and copper centers but also 
include ► flavins, reducible disulhdes, and quinones. 
The simplest electron transfer proteins are single 
domain one-electron carriers. Since they are small 
(6-15 kDa in molecular mass) and generally water 
soluble, many have been extensively characterized by 
experimental including very high-resolution crystallo¬ 
graphic structures. Often, the redox site is located close 
to one surface of the protein, which facilitates electron 
transfer to other redox centers. Examples include the 
► cytochromes, the ferredoxins, and the ► cupredoxins. 
More complex proteins may have multiple cofactors of 
the same or different types that guide electrons from an 
entry point at one surface through the protein to an exit 
point at another surface. They often have multiple 
domains or subunits that may have additional functions 
such as oxidation or reduction of organic substrates; 


these building blocks may be homologues of the smaller 
water-soluble proteins. Examples include the respira¬ 
tory complexes and photosystems, which are integral 
membrane proteins that move electrons across the mem¬ 
brane in electron transport chains. Since experimental 
studies of these large membrane proteins can be diffi¬ 
cult, studies of homologous small, water-soluble pro¬ 
teins can be important for understanding more complex 
functions. 

Because most electron transfer proteins are 
metalloproteins, inorganic chemistry plays an impor¬ 
tant role in their functions (Holm and Solomon 2004). 
Besides traditional biochemical and biophysical tech¬ 
niques for proteins including site-directed mutagene¬ 
sis, X-ray crystallography and nuclear magnetic 
resonance, other methods are used that probe the 
metal site such as electron paramagnetic resonance 
(► EPR), X-ray absorption spectroscopy (XAS), elec¬ 
tron-nuclear double resonance (ENDOR), Mdssbauer, 
magnetic circular dichroism, photoelectron spectros¬ 
copy (PES), and electrochemistry. Computational 
methods include molecular mechanics/molecular 
dynamics simulations (Ichiye 2001) and continuum 
electrostatic calculations (Gunner et al. 1996) for the 
protein, and density functional theory (DFT) methods 
especially for the metal sites (Lovell et al. 2003). 

Electron Transfer Theory 

Marcus ► electron transfer theory (Marcus and Sutin 
1985) describes the transfer of an electron from a donor 
(D) species to an acceptor (A) species in an environ¬ 
ment so that the reactant (R) state is D“A and the 
product (P) state is DA“. Eor example, in the classic 
case of ferric-ferrous in aqueous solution, the Pe(II) 
and Pe(III) ions are the donor and acceptor, respec¬ 
tively, and the surrounding water is the environment. 
For more complicated solutes such as metal com¬ 
plexes, the metal ligands are considered to be part of 
the donor or acceptor rather than the environment. 
Moreover, although water beyond the first few shells 
may be considered as a homogeneous dielectric con¬ 
tinuum, the first few shells may have a complex, 
dynamic structure highly dependent on the structure 
of the solute. For electron transfer between 
metalloproteins, the reactive species are generally 
defined as only the prosthetic groups while the envi¬ 
ronment consists of the polypeptide chain plus its 
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surroundings including water and membrane. More¬ 
over, although the environment is inhomogeneous, 
the first “solvation” shell consists of the protein and 
thus has a well-defined average structure stabilized by 
covalent and non-covalent interactions. 

According to Marcus theory, electron transfer 
proteins can influence the rate of electron transfer 
by the overall driving force AG, the reorganization 
energy 1, and the electronic coupling //da; these 
factors are defined by both the prosthetic group and 
the protein environment. Since AG is the difference 
in free energy between the R and P states and thus 
equal to the difference in the reduction potentials of 
the donor and acceptor, the reduction potential E° = 
AG/nFE° of an electron transfer protein is an 
important experimental property. E° is composed 
of the “inner” contribution AGin due to the intrinsic 
energy required to add an electron to the prosthetic 
group (i.e., the negative of the adiabatic ionization 
energy), an “outer” contribution AGom due to the 
change in the interaction energy of the prosthetic 
group with the environment when the electron is 
added (i.e., the change in the “solvation” energy of 
the prosthetic group where the solvation is by the 
protein and surrounding solvent), and the free 
energy of an electron in the standard hydrogen 
electrode AGshe (1); 

— nFE° = AGin + AGout + AGshe (1) 

The other two properties affect the rate of the reac¬ 
tion but not the equilibrium. For instance, A is the 
energy required to move the nuclear coordinates from 
the R to P state while the donor and acceptor remain in 
the R state. It has an inner contribution due to the 
changes in the geometry of the prosthetic groups 
including changes in metal-liegand bond lengths and 
coordination, and an outer contribution lout due to 
changes of the environment; changes in the prosthetic 
groups may in turn perturb the rest of the protein(s). 
Finally, //da depends roughly on //°da, the coupling 
matrix when D and A are in contact, and the distance 
between D and A. The prosthetic groups are the most 
important determinants of the wave functions for the 
electron on D and A. Also, since ► electron transfer 
through proteins is dependent on the separation 
between D and A and the nature of the intervening 
protein, different pathways may affect //da (Beratan 
and Onuchic 1996). 


Architecture of Redox Proteins 

Since the electron carrier function resides on the pros¬ 
thetic group, the role of the protein itself is in part to 
serve as a scaffold for the prosthetic groups, making it 
soluble in water, providing a recognition site for redox 
partners, and holding it in an optimal position for 
electron transfer. However, since proteins with the 
same redox site can have very different reduction 
potentials, understanding how proteins can control 
their redox properties is of great interest. Syntheses 
of redox site analogs make studies of the donor and 
acceptor independent of the protein environment pos¬ 
sible (Holm and Solomon 2004); however, their elec¬ 
trochemical reduction potentials vary considerably 
with the ligands and solvent and obtaining a set of 
reduction potentials for different redox sites with 
a consistent set of ligands and solvent can be difficult. 
Thus, combined DFT and PES studies of oxidation/ 
reduction energies of various cluster analogs in the gas 
phase (for instance Wang et al. 2003), which are inde¬ 
pendent of the solvent, allow systematic comparisons 
between analogs and proteins with different types of 
clusters. In addition, site-specific mutagenesis allows 
very controlled alterations of the environment created 
by the protein matrix. Computational studies of the 
protein environment have led to the proposals that the 
backbone fold is the major contributing factor to 
the reduction potential (Stephens et al. 1996) while 
the specific sequence tunes it by factors including 
side chain polarity and subtle backbone shifts due to 
side chain size (Ichiye 1999). Recently, quantitative 
predictions of absolute reduction potentials using 
a combination of DFT and continuum electrostatic 
calculations (Perrin and Ichiye 2010) have allowed 
a more complete picture of the architecture of the 
different contributions (Fig. 1). 

The most important contribution to E° is the energy 
required to add an electron to the redox site, although it 
can be modified by other factors due to the protein. 
Thus, the redox site determines the inner sphere ener¬ 
getics. The choice of the atoms defining the redox site 
is important in calculations; for instance, a good choice 
is the atoms in analogs that are necessary to reproduce 
the electronic and bonding structure and have signifi¬ 
cant changes in electron density upon oxidation or 
reduction. Thus, side chains that ligate metals are gen¬ 
erally considered to be part of the redox site. The 
reduction energy is highly dependent on the redox 




E 


616 


Electron Transfer Proteins: Overview 



Electron Transfer Proteins: Overview, Fig. 1 Schematic of 
the architecture of an electron transfer protein (Rhodocyclus 
tenuis [4Fe-4S] HiPIP, PDB ID IISU). The inner sphere consists 
of the redox site {white), which consists of a [4Fe-4S] cluster and 
the ligating cysteinyl side chains {hall-and-stick representation) 
for the HiPIPs. The outer sphere consists of the rest of the protein 
{red), which has a fold contribution from the backbone {blue) 
and a sequence contribution from the side chains {yellow wire), 
and the surrounding solvent {blue) 

site, easily spanning a range of several volts. More¬ 
over, since AGin is the adiabatic free energy difference 
between the oxidized and reduced states, it is com¬ 
posed of the vertical oxidation/reduction energy of 
the prosthetic group and Of course, the AGin of 
an analog may differ from that of a prosthetic group in 
the protein; for instance, the protein environment may 
provide entatic restraints. 

The protein and surrounding solvent also contribute 
to the E°, but indirectly by the change in the electro¬ 
static interaction energy with the redox site between 
the oxidized and reduced states; thus, they mainly 
determine the outer sphere energetics. The protein 
fold largely determines AGout so that homologous pro¬ 
teins have similar redox properties while specific 
sequences of homologous proteins can tune AGout 
(Perrin and Ichiye 2010). In part, the protein fold 
determines the degree to which the redox site is buried 
inside the low dielectric protein cavity within a high 
dielectric aqueous solvent, which defines a solvation- 
free energy. In addition, the fold determines the 
arrangement of the backbone polar groups (the amide 
and carbonyl groups) and thus most of the protein 


“electret,” which is the electric potential at the redox 
site due to the charges of the polypeptide in its average 
configuration. On the other hand, the protein sequence 
can alter the electret by differences in the electrostatic 
properties of the side chains or slight variations in the 
backbone positions due to side chain size. Moreover, 
based on the assumption that the fold contribution is 
the major component, one can look for sequence deter¬ 
minants of differences in reduction potential between 
homologous proteins, which can be used to predict 
site-specific mutations that alter the reduction potential 
(Ichiye 2001). 

Classes of Electron Transfer Proteins 

Electron transfer proteins are generally classified by 
their redox site, although some may contain more than 
one type of site (Palmer 1992). Each fold typically 
binds one type of cluster with a characteristic binding 
motif while each type of cluster may be found in more 
than one fold. The major classes are the iron-sulfur 
proteins, the ► cytochromes, the copper proteins, the 
flavoproteins, and the ► thioredoxins. Other classes 
include the molybdenum, nickel, and vanadium pro¬ 
teins, and quinoproteins. Since detailed discussions of 
the major classes are given elsewhere, here the focus is 
on comparisons of the factors contributing to the redox 
properties of small, water-soluble electron carriers 
from the three major metalloprotein classes. 

Iron-Sulfur Proteins 

Iron-sulfur proteins are ubiquitous and are often 
involved in electron transfer although a variety of 
other functions have been found (Beinert 2000). They 
are largely low-potential electron carriers with E° rang¬ 
ing from —700 to 400 mV (Meyer 2008). The redox site 
typically contains of a core of one to four irons and 
inorganic sulfurs, which is denoted by [mFe-nS]; the 
entire redox site consisting of the core plus cysteinyl 
side chains that bind the core to the protein is denoted by 
[Fe„,S„(SCys) 4 ]‘^ (Fig. 2). The ligand coordination of the 
iron is relatively rigid in geometry and preferentially 
tetrahedral for both the Fe(II) and Fe(III) states and the 
natural protein ligands are almost exclusively cysteine 
so that the properties of the redox sites are almost 
independent of the protein environment. This has 
aided the synthesis of a wide variety of structural and 
functional analogs in biologically relevant oxidation 
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Electron Transfer Proteins: 
Overview, Fig. 2 Examples 
of iron-sulfur redox sites: (a) 
[IFe] cluster bound by 
cysteinyl ligands, (b) [2Fe-2S] 
cluster bound by cysteinyl 
ligands, and [4F-4S] cluster 
bound by cysteinyl ligands 


a b 

Cys \ / Cys 
Fe 

Cys ^ y V ^\\Cys 
S S' 


Cvs Cys 




Cys. 4 
S 



Electron Transfer Proteins: 
Overview, Fig. 3 Simple 
iron-sulfur proteins with a- 
helices in red, /i-strands in 
blue, and redox site iron in 
green and sulfurs in yellow, (a) 
Clostridium pasteurianum 
rubredoxin (PDB ID lIRO), 

(b) Synechocystis sp. [2Fe-2S] 
ferredoxin (PDB ID lOFF), 

(c) Rhodocyclus tenuis [4Fe- 
4S] HiPIP (PDB ID IISU), (d) 
Clostridium acidurici [4Fe- 
4S] ferredoxin (PDB ID 
2FDN) 



states (Rao and Holm 2004). Moreover, the delocalized 
charge on the S-ligands and the independence of the 
ligand coordination geometry on the oxidation state 
helps to reduce (Sigfridsson et al. 2001). 


Since simple iron-sulfur proteins may have been 
the primordial electron transfer proteins (Beinert 
2000), there are numerous examples of small, 
water-soluble electron carriers: the [IFe] 
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Electron Transfer Proteins: Overview, Fig. 4 Cytochrome c 
redox site, a heme c bound to the protein by cysteinyl ligands 

rubredoxins, the [2Fe-2S] plant-type ferredoxins, the 
[2Fe-2S] Rieske-type ferredoxins (which are unusual 
in having two Flis ligands), the [4Fe-4S] high poten¬ 
tial iron-sulfur proteins (HiPIPs), and the [4Fe-4S] 
bacterial-type ferredoxins (Fig. 3), each of which has 
characteristic binding motifs (Meyer 2008). Homo- 
logues of these proteins are also found as components 
of larger electron transfer complexes such as the 
[2Fe-2S] Rieske ferredoxins in the respiratory cyto¬ 
chrome hc\ complex and the photosynthetic cyto¬ 
chrome h(,f complex and the [4Fe-4S] bacterial- 
type ferredoxins in respiratory complex I and photo¬ 
system 1. In comparing proteins (Meyer 2008; Munck 
and Bominaar 2008) to analogs with R = phenol in 
acetonitrile (Rao and Holm 2004), the E° for [Fe 
(SR) 4 ]^'^^' are —0.050 to 0.125 V in rubredoxin ver¬ 
sus —0.28 V in the analog, for [Fe 2 S 2 (SR) 4 ]^'^^' are 
—0.12 to —0.45 V in [2Fe-2S] ferredoxins versus 
— 1.17 V in the analog, for [Fe 4 S 4 (SR) 4 ]*'^^" are 
0.1-0.4 V in the HiPIPs versus ~0.24 V (irreversible) 
in the analog, for [Fe 4 S 4 (SR) 4 ]^'^^' are —0.65 to 0 V 
in bacterial ferredoxins versus —0.76 V in the analog, 
and for [Fe 4 S 4 (SR) 4 ]^'^'*' are —0.79 V in the nitroge- 
nase iron-protein versus —1.48 V in the analog. The 
range in E° for the different analogs illustrates how 
the different redox sites affect AGin while the range 
in E° for a given protein fold illustrates how 
differences in sequence affect AGout- Moreover, the 
[4Fe-4S] cluster apparently exhibits three different 
couples because the different folds of the HiPIPs, 
bacterial ferredoxins, and iron-proteins have much 
different AGout- 



Electron Transfer Proteins: Overview, Fig. 5 Thunnus 
alalunga (tuna) cytochrome c (PDB ID 3CYT) with a-helices 
in red, /J-strands in blue, and redox site iron in green and sulfurs 
in yellow 

Cytochromes 

The cytochromes are heme proteins whose normal 
function involves a change in the redox state of the 
heme. E° are between —300 and -1-470 mV (Moore 
1996) so they serve as mid-potential carriers. A heme 
consists of an iron bound to a porphyrin ring, which 
differs in the substituents attached to the ring; the most 
common in cytochromes are heme A, B, C (Fig. 4). In 
cytochromes, the iron is also bound to the protein by 
two axial ligands, typically histidine and methionine, 
creating an octahedral coordination sphere. Cyto¬ 
chromes are classified based on the heme type(s) 
(denoted by a lower case letter corresponding to the 
upper case letter denoting the heme type), the axial 
ligands, and fold. The heme is typically found in 
a hydrophobic pocket in the protein, and analogs typ¬ 
ically are stable in nonpolar environments. The inner 
sphere reorganization energy is apparently quite low 
due to a combination of factors including delocalized 
charge on the N- and S-ligands, preference of Fe(II) 
and Fe(III) for the same coordination number and 
geometry, low spin iron rather than the high spin 
irons in the iron-sulfur proteins, and preservation of 
the Fe-Npor bond length by ring strain in the porphyrin 
(Sigfridsson et al. 2001). 

The mono-heme cytochrome c (Bertini et al. 2006) 
containing a heme C is a small protein found in eukary¬ 
otes, prokaryotes, and archaea. The archetypical 
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Electron Transfer Proteins: 
Overview, Fig. 6 Blue- 
copper redox sites from (a) 
plastocyanin with methionyl 
terminal axial ligand and (b) 
fungal laccase, in which 
a leucine replaces the 
methionine so that there is no 
terminal axial ligand 


Met 



HC-CH3 

CH3 



mitochondrial cytochrome c has a water-soluble single 
domain of about 100-110 residues and transfers elec¬ 
trons between Complex III (cytochrome bcx complex) 
and IV (cytochrome oxidase complex) of the respira¬ 
tory electron transport chain. It is characterized by the 
cytochrome c fold (Fig. 5), the CXXCH heme binding 
motif, which axially coordinates the heme iron with the 
histidine and the ring by thioether bonds to the two 
cysteines, and a loop containing the second axial 
ligand, which is generally a methionine. Although the 
iron is relatively buried, one edge of the heme is at the 
surface. E° of [Fe(N 4 porphyrin)(NHis)(SMet)]'^° in 
cytochrome c vary from 0.1 to 0.5 V (Moore 1996) 
versus 0.43 V for a heme with phenyl substituents and 
thioether and imidizole axial ligands in tetrahydrofu- 
ran (Marchon et al. 1982). 

Copper Proteins 

Copper proteins are found throughout nature, generally 
involving electron transfer in bioenergetics or sub¬ 
strate oxidation (Farver 1996). E° range from 180 to 
800 mV in the protein (Farver 1996) so that they 
generally serve as high potential carriers. The copper 
sites typically consist of one or more coppers coordi¬ 
nated by a variety of ligands including histidine and 
cysteine, with a varying coordination number and 
geometry. The common sites are classified according 
to their spectroscopic properties: type I contain a single 
Cu(II) coordinated by two histidines, a cysteine, and 
a variable axial ligand in a distorted trigonal-pyramid 
or tetrahedron (Fig. 6); type II contain a single Cu(II) 


with a square planar coordination by N or N/0 ligands; 
and type III contain two antiferromagnetically coupled 
Cu(II), each coordinated by three histidines. The cop¬ 
per sites are relatively more plastic than the iron-sulfur 
sites, in part because Cu(I) and Cu(II) have different 
preferred coordination number and geometry, and thus 
depend more heavily on the protein environment. This 
has made the synthesis of analogs that are both struc¬ 
turally and functionally analogous and more difficult 
and only recently have type I analogs been synthesized 
(Solomon et al. 2004): a trigonally distorted tetrahedral 
structure similar to plastocyanin by Kitajima and 
coworkers and a trigonal planar structure similar to 
the fungal laccase by Holland and Tolman. Moreover, 
while the charge is delocalized over the N- and 
S-ligands, the difference in the preferred coordination 
number and geometry for the two oxidation states of 
the cluster give rise to relatively large than in the 
iron-sulfur proteins (Sigfridsson et al. 2001). 

The cupredoxins, which are the most common elec¬ 
tron transfer proteins, use the type I or “blue copper” 
sites (Gray et al. 2000). They are found in both pro¬ 
karyotes and eukaryotes, generally as water-soluble 
electron carriers. For instance, plastocyanin is an elec¬ 
tron carrier from cytochrome / of the cytochrome h^f 
complex to P700-I- of photosystem I. There are ^100 
residues with the same fold (Fig. 7) and binding motif 
HXqoCXXHXXXXM where the methionine axial 
ligand sometimes is substituted by a glutamine as in 
stellacyanin, is lacking as in the fungal laccases, or may 
be joined by a second axial ligand such as a backbone 
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Electron Transfer Proteins: 
Overview, Fig. 7 Simple 
blue-copper proteins with 
a-helices in red, ^-strands in 
blue, and redox site iron in 
green and sulfurs myellow. (a) 
Populus nigra (poplar) 
plastocyanin (PDB ID IPLC) 
and (b) Cerrena maxima 
fungal laccase (PDB ID 
2H5U) 




carbonyl in azurin. ET of wild-type cupredoxins range 
from 0.26 to 0.80 V, where the axial ligands play a large 
role in tuning AGm (Gray et al. 2000). For instance, 
E° of [Cu(NHis) 2 (SCys)(SMet)]''° in plastocyanins 
range from 0.32 to 0.39 V while the Kitajima analog 
is -0.32 V in CH 2 CI 2 while £° of [Cu(NHis) 2 (SCys)]'^° 
in the fungal laccase are from 0.39 to 0.55 V (near pH 7) 
while the Tolman analog is —0.18 V in tetrahydrofuran. 
The analog results indicate that the lack of the thioether 
axial ligand in laccase may lower AGin and thus raise E° 
although the range of E° in both the plastocyanins and 
the laccases indicate the rest of the sequence also tunes 
the potential. 

Summary 

The electron transfer properties of electron transfer pro¬ 
teins are a function of both the redox-active prosthetic 
group and the protein environment. In particular, the 
redox site controls the inner sphere contribution and the 
protein fold controls the outer sphere control while the 
protein sequence tunes the outer sphere contribution. 

Cross-References 

► Cupredoxins 

► Cytochromes 

► Electron Transfer Cofactors 

► Electron Transfer Elavoproteins 

► Electron Transfer Theory 

► Electron Transfer Through Proteins 


► EPR 

► Flavins 

► Hemes 

► Iron-Sulfur Clusters 

► Redox Potential 

► Thioredoxins 
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Introduction 

Electron transfer Is a fundamental process which plays 
a central role in physics, chemistry, and biology. 
Biological electron transfer reactions are required for 
respiration, photosynthesis, and redox reactions of 
intermediary metabolism. Unlike typical chemical 
reactions, electron transfer reactions do not involve 
the making or breaking of bonds. Electron 
transfer theory describes the physical parameters that 
define the rates at which electron transfer reactions 
occur. 

Electron Transfer Theory 

Chemical reactions are described by the Arrhenius 
equation (Eq. 1) which defines to what extent the 
rate constant (k) of the reaction will depend upon the 
temperature T and activation energy (£„). Electron 
transfer theory is a modified form of transition-state 
theory (Eq. 2), which is developed from the traditional 
Arrhenius equation. Marcus theory (Marcus and Sutin 
1985; Marcus 1997), in which the nuclear coordinates 
of the reactant and product states are described using 
classical harmonic oscillators, is a prevalent version of 
electron transfer theory. In the Marcus theory, the 
contribution of the nuclear coordinates (including the 
activation-free energy) is comprised of two terms, the 
reorganization energy (1) and the Gibbs free energy 
(AG°). The pre-exponential Arrhenius factor is defined 
in terms of the electronic coupling matrix element 
(Hab)- The other parameters in Eq. 2 are Planck’s 
constant (h), temperature in kelvin (T), and the gas 
constant (R). Alternatively the Boltzmann constant 
(^b) may be used instead of R, with change of the 
activation-free energy units from joule/mol to joule/ 
molecule. 


Synonyms 

Marcus theory 

Definition 


k = A exp(—iifl/RT) 


( 1 ) 


kpr — 


4n^HAB^ /H^TiARTf^ 
exp[-(AG° -F/1)V41RT 


( 2 ) 


Electron transfer theory describes the parameters 
which control the rate at which an electron is trans¬ 
ferred from one atom or molecule to another. 


The influence of 1 and H^b on electron transfer rates 
is schematically represented in Fig. 1 . In the classical 
model, electron transfer occurs at the intersection of 
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Electron Transfer Theory, 
Fig. 1 Potential energy 
diagrams for electron transfer 
reactions. In each of the three 
examples, the parabolas on the 
left and right represent the 
reactant and product states, 
respectively. In these 
examples AG°= 0 



the potential energy surfaces for the reactant and prod¬ 
uct states. For simplicity, these multi-dimensional 
energy surfaces are typically presented as parabolas 
described by the free energy (ordinate) and reaction 
coordinate (abscissa). The electron transfer event only 
occurs when the reactant and product states have the 
same energy, thus satisfying the principle of conserva¬ 
tion of energy, and same nuclear configuration, thus 
obeying the Franck-Condon Principle. 

The parabola describes the relationship one would 
get if the nuclei were connected by springs to each 
other and because nuclear vibrations are harmonic 
oscillators they obey Hooke’s Law. In this case, the 
energy is dependent on the square of the distance that 
the springs are stretched or compressed from their 
equilibrium positions. The reactant and product states 
have different equilibrium nuclear positions because 
during electron transfer the charge on the participating 
atoms or molecules changes and the resulting attrac¬ 
tion or repulsion causes the spring to either compress 
or expand. The Frank-Condon principle requires that 
electron transfer is a vertical transition in this diagram 
as it occurs within a vibrational frequency, so rapidly 
that no change in nuclear configuration can occur. 
Conservation of energy also requires that electron 
transfer is a horizontal line on the diagram. The only 
position where these conditions are satisfied is where 
the parabolas intersect. This crossing point represents 
the energy level to which the reactant state must be 
raised before progressing to the product state. This 
activation energy for the electron transfer is equal to 



Electron Transfer Theory, Fig. 2 The Marcus inverted 
region. The predicted dependence of the electron transfer rate 
constant on free energy driving force from Eq. 2 

((AG° -I- Xf'lAX). When AG° is equal to zero the acti¬ 
vation energy reduces to 1/4 (Fig. 1). It also follows 
that when 1 = — AG° the activation energy is zero and 
the electron transfer rate will be maximal. An impor¬ 
tant implication of this postulate is that the electron 
transfer rate constant will decrease as — AG° increases 
beyond the value of 1. The conditions under which this 
occurs is referred to as the Marcus inverted region 
(Fig. 2). 

Evidence for quantum mechanical tunneling was 
gleaned from the observation that the rate of electron 
transfer in the bacterial photosynthetic reaction center 
is temperature-independent at temperature below 
100 K (DeVault and Chance 1969; DeVault 1980). 
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It is assumed in Eq. 2 that the electron transfer is 
coupled only to low frequency vibrations, which is 
valid as long as the nuclear vihrational energy quantum 
(/tv) is much less than the thermal energy {k-^T) 
(Hopfield 1974; Jortner 1976). Given that biological 
electron transfer reactions are typically studied at or 
near ambient temperature, this assumption will be valid. 

Electron Transfer Activation Energy 

As seen in Fig. 1, A is the energy difference between 
the reactant and product states at the equilibrium 
nuclear configuration of the reactant, and it is the 
energy required to bring the reactant and product states 
to this common intermediate in which electron transfer 
occurs. It is comprised of inner sphere and outer sphere 
components as in Eq. 3. 

A = Ain + Aout (3) 

The Ain reflects changes in bond lengths and angles 
that accompany the redox reaction. The Aout reflects 
the reorientation of solvent molecules that accom¬ 
pany the redox reaction. For reactions in solution 
between small molecules these distinctions are rea¬ 
sonably well defined; however, for biological elec¬ 
tron transfer reactions involving protein-bound redox 
centers, which may be organic molecules or metals 
that are surrounded by the protein matrix as well as 
solvent, the distinction between Ai^ and Aout is less 
well defined. 

The other determinant of the activation energy for 
electron transfer is AG°. For true electron transfer 
reactions AG° depends on A£„, for the donor and 
acceptor redox centers as given in Eq. 4, where n is 
the number of electrons transferred and F is the Fara¬ 
day constant. 

AG° = -nFAE„, (4) 

Several factors may influence the oxidation- 
reduction midpoint potential (£„) value (also called 
the ► redox potential) of redox centers within proteins 
including the identity of ligands for metal cofactors, 
protein-imposed constraints on organic cofactor 
conformation or metal ligation geometry, hydrogen¬ 
bonding pattern around the center, presence of water, 
hydrophobicity, and electrostatic effects. 


Electronic Coupling Matrix Element 

describes the extent to which the wave functions 
of the reactant and product states overlap. (Alternative 
designations for Hab are often used which include 
Hda, Hrp, and Tda-) The magnitude of FIab depends 
upon the distance and the nature of the intervening 
medium between the redox centers (Newton 1991). 
While electron transfer between small molecules in 
solution typically involves direct contact, biological 
► electron transfer through proteins and between pro¬ 
tein-bound redox centers occurs over relatively long 
distance. As such, H^b is a significant determinant of 
electron transfer rates in biological systems. As seen in 
Fig. 1, the splitting at the intersection point is equal to 
2Hab- When FIab is zero there is no chance of the 
reaction occurring regardless of the energy state. At 
the other extreme is the adiabatic system. The term 
“adiabatic” here should not be confused with its defi¬ 
nition in the context of thermodynamics where an 
adiabatic process is one in which no heat is gained or 
lost by the system. In the context of electron transfer, 
an adiabatic process is one in which the value of Hab is 
so large that the probability of crossover when the 
activation energy is achieved is unity. In this case, 
the reaction coordinate will effectively proceed along 
a single surface. These adiabatic reactions are 
described by transition-state theory rather than elec¬ 
tron transfer theory. In a non-adiabatic electron trans¬ 
fer reaction, the electronic overlap between the 
reactant and product states is relatively weak (<<1 
cm”*), and the gap represented as 2Hab is relatively 
small. Because of this weak coupling the activation 
energy may have to be achieved several times before 
the crossover from reactant to product states occurs. 
Thus, Hab describes the degree of non-adiabicity (i.e., 
probability of the reaction occurring when the activa¬ 
tion energy has been achieved). 

In simple systems Hab and consequently k^T will 
decrease exponentially with distance. This is reflected 
in Eq. 5, where ka (approximately 10*^ s”* at room 
temperature) is the characteristic frequency of the 
nuclei. This is the upper limit for when donor 
and acceptor are at van der Waals’ contact and 
A = — AG°. The donor to acceptor distance is r, and 
I'a is the close contact distance (the sum of the van der 
Waals’ radii of donor and acceptor which is com¬ 
monly taken to be 3-4 A). The attenuation factor 
parameter // is used to quantitate the nature of the 
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intervening medium with respect to its efficiency to 
mediate electron transfer. 

ksT = koexp[-P{r - rg)] exp [ - (AG° + Xf /A)JiT] 

(5) 

Application of Electron Transfer Theory to 
Biological Systems 

An important question in Biophysics is whether or not 
electron transfer theory is generally applicable to biolog¬ 
ical electron transfer reactions which typically occur 
through and between proteins (Bendall 1996; Bolton et 
al. 1991; Johnson et al. 1990). Evidence for turmeling 
was first discovered by DeVault and Chance (1969) in 
the photosynthetic reaction center protein. With this pro¬ 
tein system it has also been possible to examine the 
temperature and AG° dependence of electron transfer to 
correlate with electron transfer theory (Gutmer and 
Dutton 1989; Lin et al. 1994). However, for the reactions 
of many ► electron transfer proteins it is not possible to 
experimentally determine the dependence of the kET on 
AG°. While it may be possible to alter the E„, value of 
a protein-bound redox center by site-directed mutagene¬ 
sis of nearby amino acid residues, one cannot rule out the 
possibility that other structural changes have occurred as 
a consequence of the mutation which may also affect a or 
//as- As such, studies of protein electron transfer reac¬ 
tions more typically examine the temperature depen¬ 
dence of the electron transfer rate. With this approach 
there is the limitation that proteins are stable only over 
a narrow temperature range. Furthermore, given the com¬ 
plexity of the biological system the possibility exists that 
some non-electron transfer process (e.g., a protein con¬ 
formation change or protein-protein interactions) is 
influencing the observed rate. This may give rise to an 
apparent electron transfer rate that does not really 
describe the true electron transfer event. This is seen in 
► gated electron transfer reactions, ► coupled electron 
transfer reactions, and those controlled via ► dynamic 
docking between redox protein partners. In such cases 
one could extract apparent values of a and H^b from the 
dependence of the observed rate on temperature but since 
these are not trae electron transfer reactions, these exper¬ 
imentally derived parameters would not describe the 
actual electron transfer event. Nevertheless, it has been 
possible to directly apply Eq. 2 to analyze the 


temperature dependence of protein electron transfer reac¬ 
tions and obtain reasonable values of I and //ab, and 
predict electron transfer distances using Eq. 4 (Davidson 
2008). Even in those cases where the observed rate of the 
redox reaction is that of a non-electron transfer event, the 
magnitudes of the apparent values of i and Hab niay 
provide clues as to the overall mechanism and regulation 
of the biological electron transfer process. 

Summary 

A precise and complete description of the physical param¬ 
eters that dictate the rates of biological electron transfer 
reactions is critical to our understanding of metabolism, 
respiration, and photosynthesis at the molecular level. 
This is important since defective biological electron trans¬ 
fer leads to production of ► reactive oxygen species and 

► oxidative stress which is deleterious to all living organ¬ 
isms. It is also of practical value as electron transfer theory 
provides a logical basis for design of applications of 
bioanalytical tools and devices, such as enzyme-based 
amperometric biosensors and fuel cells. 

Cross-References 

► Coupled Electron Transfer 

► Dynamic Docking 

► Electron Hopping in Biomolecules 

► Electron Transfer Cofactors 

► Electron Transfer Proteins; Overview 

► Electron Transfer Through Proteins 

► Electron Transport Chains in Bacteria 

► Gated Electron Transfer 

► Mitochondrial Electron Transport 

► Oxidative Stress 

► Photosynthetic Electron Transport 

► Proton-Coupled Electron Transfer 

► Reactive Oxygen Species 

► Redox Potential 
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Synonyms 

Protein-assisted electron transfer 


Definition 

Proteins mediate electron transfer by lowering the 
barrier to electron tunneling that an electron would 
experience with empty space intervening between the 
electron donor and acceptor and by constraining the 
positions of these groups in space. 


Introduction 

Electron transfer (ET) rates depend on two key phys¬ 
ical quantities: the activation-free energy and the 
donor-acceptor electronic coupling interaction. As 
described in ► electron transfer theory, the activa¬ 
tion-free energy depends on the reaction thermody¬ 
namics and the “reorganization energy” (1). The 
reaction thermodynamics is determined by the 
► redox potential values of the donor and acceptor 
cofactors, and the reorganization energy describes the 
response of the protein and solvent to the mobile elec¬ 
tron (Bendall 1996). The donor-acceptor electronic 
interaction, Hqa, arises from the protein-mediated 
coupling between the donor (D) and acceptor (A) 
localized electronic states. Many biological ► electron 
transfer proteins contain multiple ► electron-transfer 
cofactors. However, in most circumstances, protein 
ET kinetics can be described as a sequence of D to 
A reactions between pairs of redox cofactors (vide 
infra). That is, the electron is localized either on a donor 
or on an acceptor cofactor and the protein medium 
between them acts as an electron tunneling barrier. 
As such, the mechanism for charge transfer is quantum 
mechanical tunneling, facilitated by the electronic 
mixing of D and A cofactors via the intervening protein. 
How does the protein direct the electron tunneling? 

Early Models for DA Tunneling in Proteins 

The rate of unimolecular non-adiabatic ET in the high- 
temperature regime is (Bendall 1996) 

kET = [47t2HDAVh(4triRT)'/2] 

X exp[-(AG° -I- iflAXKT] (1) 

This rate formulation arises from a so-called two- 
state approximation that requires the electron be 
localized dominantly on a D or A cofactor. Further, 
this framework assumes that protein motion does not 
cause significant fluctuations in the electronic cou¬ 
pling. The derivation of this rate expression assumes 
a tunneling mechanism that does not permit the 
electron to hop among multiple sites between 
D and A, which would create true intermediate 
states. Moreover, this formulation assumes that the 
transfer kinetics is not gated by slow conformational 
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motion (Davidson 2000). Extensions of this descrip¬ 
tion to incorporate protein motion are established 
(Beratan et al. 2009; Skourtis et al. 2010). 

The electron tunneling interaction, Hoa> is expected 
to depend on several characteristics of the protein 
system: (1) the donor-acceptor structure, distance, 
and orientation, (2) the interaction of D and A with 
the immediately neighboring protein and solvent 
atoms, (3) the fold of the protein between the D and 
A, (4) the protein dynamics, and (5) the structure of 
intervening water and counter-ions. 

What are the most essential physical features that 
control Hoa through the five factors above? The elec¬ 
tronic dynamics of Eq 1 is purely two-state dynamics, 
so estimates of Hda require computing the protein- 
mediated D-A tunneling probability. 

The probability of tunneling through a square bar¬ 
rier protein model of height (E) and width (x) in one 
dimension is 

P(x) oc exp[—(2mE)*^^//i]x (2) 

where m is the electron mass (Hopfield 1974). In pro¬ 
teins, this exponentially small probability, and the 
Boltzmann probability of populating bridge states, 
exp[—AEbridge/RT], compete in determining the mech¬ 
anism (Beratan et al. 2009). Indeed, for a sufficiently 
large distance (x), or low tunneling barrier (E), the 
probability of thermally populating the bridge states 
can produce a transition in ET mechanism invalidating 
the two-state framework. In addition to the possibility 
of thermally injecting an electron or hole into the 
bridge (i.e., ► electron hopping), Marcus theory pre¬ 
dicts a fluctuation in the donor energy due to medium 
polarization fluctuations in order to reach the activated 
complex. These fluctuations in the donor electronic 
energies have a standard deviation a = (2)^RT)*^^, on 
the scale of tenths of electron Volts (eV) (Hopfield 
1974; Hatcher et al. 2008). As the tunneling barrier 
shrinks to fractions of eV, the tunneling and carrier 
delocalization or hopping mechanisms can become 
scrambled, and the simple rate relation of Eq. 1 will 
no longer be valid. In most cases of protein ET studied 
so far, the tunneling mechanism dominates (Gray and 
Winkler 2003, 2010). This is because, in many circum¬ 
stances, tunneling barriers in proteins are high com¬ 
pared to both RT and to the thermally induced donor 
electronic energy fluctuations. This is especially true 
for transition metal-containing donor and acceptor 


biological cofactors and D-A distances that are not 
exceedingly large. Indeed, constructing systems that 
utilize hopping intermediates of the protein are of great 
current interest (e.g., Shih et al. 2008). 

When transfer involves little population of the 
bridge, the rate depends on Hqa , as noted in Eq. 1, 
and the transport mechanism requires quantum 
mechanical electron tunneling. What is the magnitude 
and distance dependence of this tunneling interaction 
Hoa? In 1974, Hopfield estimated an effective one¬ 
dimensional (ID) barrier to tunneling in proteins of 
2 eV. Since the exponential decay of the tunneling 
interaction (Eq. 2) scales with the square root of this 
barrier height, one finds Hda cx exp[—1.4 r] (via 
Eq. 2), with r measured in Angstroms (Hopfield 
1974). In the 1980s, Beratan and Hopfield showed 
that the decay exponent associated with charge tunnel¬ 
ing is sensitive to the structure of the intervening 
bridge (Beratan and Hopfield 1984). Indeed, multiple 
pathways may interfere constructively or destruc¬ 
tively, and the decay exponent is sensitive to the 
energy of the tunneling electron as well (Skourtis and 
Beratan 1999). 

Tunneling Pathway Theory of Protein ET 

The ID square barrier model of tunneling neglects 
many effects on ET kinetics arising from molecular 
structure. In the 1990s, Beratan and Onuchic showed 
that simple tunneling barrier models could be 
expanded in a manner that distinguishes between 
the dramatically different length scales for through- 
bond and through-space tunneling (Beratan et al. 
1991). The electronic binding energy of redox cofac¬ 
tors in biology is about 5 eV. Thus, the tunneling 
decay exponent for through-space tunneling propa¬ 
gation is about twice as large as for through-bond 
tunneling propagation. Indeed, the softer decay 
of tunneling through-bond compared to through- 
space arises because the through-bond barriers 
are ^1-2 eV. As such, protein-mediated ET can be 
understood by mapping out the through-bond 
and through-space content of tunneling routes 
from D to A. This framework, which distinguishes 
between through-bond and through-space decays, led 
to the tunneling pathway model for protein ET 
(Beratan et al. 1991). In the pathway model, the 
D-A coupling is described by Eq. 3. 
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Electron Transfer Through Proteins, Fig. 1 Schematic rep¬ 
resentation of the sU'ongest tunneling pathway along bonds 1-10 
between donor and acceptor. Note the combination of covalent 
(solid line) and non-covalent (dashed line connections) (Beratan 
and Onuchic 1989) 


Hda oc MaxjriiEi} (3) 



Electron Transfer Through Proteins, Fig. 2 The strongest 
tunneling pathways in cytochromes c that link the native heme 
cofactor with surface-labeled Ru-tagged sites are shown. Note 
that some longer distance derivatives have strong paths, while 
other shorter distance derivatives have weaker paths (by virtue of 
weak through-space links) (Wuttke et al. 1992) 


The Ei values in Eq. 3 are the tunneling penalties for 
traversing a particular through-bond or through-space 
contact in a protein. “Max” denotes the fact that one 
searches the protein structure for the maximum 
strength combination of through-bond and through- 
space decay factors that maximizes the value of this 
product of decay factors. Hqa is assumed proportional 
to the strength of the pathway that maximizes the 
product of these decay factors leading from D to A. 
Simple rules are used to compute the decay factors for 
each step. Using data from chemical model systems for 
calibration, the through-bond decay factor was set to 
0.6 for all covalent bonds; the decay factor for all 
through-space contacts was derived from Eq. 2 in 
such a way that it would return the value 0.6 when 
a through-space contact is “squeezed down” to 
a covalent length (Skourtis and Beratan 1999). This 
approach distinguishes the dramatic difference 
between through-bond and through-space tunneling, 
without trying to capture all of the orbital symmetry 
and energy effects that may influence bridge-mediated 
tunneling interactions (Figs. 1 and 2). 

Importantly, pathway analysis predicts that the 
average distance decay of Hda^ is smaller, and rates 
are consequently faster (for fixed activation free 


energies), for tunneling along P strands than for tunnel¬ 
ing through a-helices (Beratan et al. 1991). Tunneling 
through covalent bonds is much more favorable than 
tunneling through space. The effects of protein-motif 
on ET rates predicted by the “pathway model” were 
validated in seminal studies of Gray, Winkler, and 
coworkers (Gray and Winkler 2003) on Ru-modifled 
proteins. The proteins analyzed in those studies 
included cytochrome c, myoglobin, cytochrome 
iron-sulfur proteins, and the ► cupredoxins, azurin, 
and plastocyanin. 

In addition to describing the mechanism of tunnel¬ 
ing as dominantly through-bond, the tunneling path¬ 
way model for protein ET emphasizes mediation via 
the virtual oxidized states of the protein (hole states) 
rather than by the virtual reduced states (electron 
states) of the protein. 

More Fully Quantum Treatments of Protein- 
Mediated Tunneling 

More detailed quantum mechanical approaches have 
also been developed for protein ET. These methods 
include mediation by multiple pathways (Kuki and 









E 


628 


Electron Transfer Through Proteins 



Electron Transfer Through Proteins, Fig. 3 Diagram show¬ 
ing how DA coupling fluctuations could wash out the effects of 
protein structural differences with increasing DA distance, Rda- 
Each diagram shows possible Hqa probability densities for two 
pairs of distinct ET species, each pair having the same average 
values of Rda- and <T 2 are the root mean squared coupling 
fluctuations ffHojOf each species in the pair, r^rit is the DA 


distance beyond which cthda > (Hda)- For Rda < rcut, Hda 
fluctuations are small and do not wash out structural differences 
in (a) For Rda > rcut the increase in coupling fluctua¬ 
tions could wash out structural differences in leading to 

a generic average baiTier limit (b). Surprisingly, simulations do 
not observe this second regime, even though coupling fluctua¬ 
tions are significant (Balabin et al. 2009; Skourtis et al. 2010) 


Wolynes 1987; Regan et al. 1993; Stuchebrukhov and 
Marcus 1995). Related analysis assesses the impor¬ 
tance of specific atoms and groups to ET mediation 
(Regan et al. 1994; Skourtis and Beratan 1999). Com¬ 
putations that assess the influence of thermal fluctua¬ 
tions on ET have also been pursued, and they indicate 
that Hda values may fluctuate a great deal with struc¬ 
tural changes (Wolfgang et al. 1997; Balabin and 
Onuchic 2000, Prytkova et al. 2007; Balabin et al. 
2009). Indeed, Hqa may change in sign, indicating 
that coupling pathways can interfere destructively. 
This multi-path analysis indicates that D-A coupling 
interactions need to be carried out in the framework of 
a thermal ensemble average, <Hda^>, as any given 
protein snapshot may have anomalously weak cou¬ 
pling due to destructive interference effects (Balabin 
and Onuchic 2000). 

Simplifications of the pathway estimation of Hda in 
the context of simple protein packing densities produce 
largely the same predictions as those arising in the 
tunneling pathway model (Page et al. 1999; Jones 
et al. 2002). Nonetheless, there are some differences 
in the predictions of pathway and density-based 
methods, since the connectivity characteristics of the 
strongest coupling pathway are not always mirrored in 
the protein packing density between redox cofactors. 
Recent studies showed that proteins with multiple 
pathways tend to display D-A couplings that are typ¬ 
ical of their secondary structural motif (Prytkova et al. 


2007; Balabin et al. 2009). On the other hand, proteins 
with a narrower bundle of coupling pathways are more 
apt to have anomalous coupling strengths typical of 
their specific dominant paths, and these paths can be 
stronger or weaker than the average for the motif 
(Prytkova et al. 2007). 

Writing the ET rate in the form of Eq. 1 assumes 
a Franck-Condon approximation, or a separation of 
electronic and nuclear terms. That is, the D-A interac¬ 
tion is assumed to be independent of nuclear confor¬ 
mational fluctuations that occur on the time scale of 
reaction coordinate motion through the “crossing” 
between D and A potential energy surfaces. Indeed, 
we have shown that this assumption is valid for the 
blue-copper protein azurin (Skourtis et al. 2005). 

Tunneling Across Protein-Protein Interfaces 

Tunneling pathway effects between proteins are par¬ 
ticularly intriguing, as issues of conformational ensem¬ 
bles take center stage (Bendall 1996; Davidson 2000). 
In the case of ► dynamic docking, a minority of the 
protein-protein ensemble can dominate the ET mech¬ 
anism (Liang et al. 2004). The reason is that small 
through-space distance differences are magnified expo¬ 
nentially in the computation of Hda- A particularly 
interesting recent finding is that thin water layers bridg¬ 
ing between ET proteins may cause anomalously large 
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D-A couplings for the ET distance (Lin et al. 2005). 
Fhotein-protein distances that admit a small number of 
organized waters between D and A seem to establish 
a plateau in Hda values, compared to the distance 
decay of couplings at larger and smaller distances. 
Indeed, signatures are being found in experimental bio¬ 
logical systems that confirm the theory that structured 
waters enhance ET rates. 

Protein Fluctuations, HoAr and the 
Persistence of Protein Structural Effects 
on ET 

The “pathway” model is based on robust characteris¬ 
tics of through-bond and through-space tunneling 
(Beratan and Onuchic 1989; Beratan et al. 1991). 
Multi-pathway models indicate that the size and strength 
of quantum interferences among multiple pathways fluc¬ 
tuate strongly as protein geometries change. Is it possible 
that thermal fluctuations on the time scale of ET kinetics 
may entirely erase the protein medium effects on the 
rates predicted by the pathway model? Recent theoretical 
simulations computed thermally fluctuating Hda values 
along classical molecular dynamics trajectories (Balabin 
et al. 2009; Beratan et al. 2009; Skourtis et al. 2010). 
These studies indicate that, at long D-A distances, struc¬ 
tural fluctuations of Hda dominate the value of Hda for 
the minimum energy structure. In this limit, the value of 
Hda for the minimum energy structure of the protein is 
not relevant to the ET rate, because thermal fluctuations 
drive the protein to non-equilibrium conformations with 
larger Hda values that enhance the ET rate. The relevant 
quantity in this regime is the thermally averaged coupling 
<Hda^>- Recent simulations indicate that, although 
stmctural fluctuations are critical for the ET electronic 
coupling at large D-A distances, they do not wash out the 
signatures of secondary and tertiary protein structure 
effects on the ET rates. The quantity <Hda^> (and 
thus the ET rate) does indeed depend on protein stmcture, 
as predicted by the pathway model (Balabin et al. 2009) 
(Fig. 3). 

Summary 

Tunneling through proteins is understood to involve 
a combination of through-bond and through-space 
electronic propagation, with the strength of the 


dominant paths determined by the protein fold. 
In motifs dominated by through-bond propagation 
with direct coupling routes (P-sheets), the distance 
dependence of the rates is softer than in motifs with 
more circuitous tunneling routes that include essen¬ 
tial through-space steps, as are found in a-helices 
and random coils. The presence of thermal fluctua¬ 
tions makes the calculation of ensemble averaged 
D-A couplings essential when explicit multi¬ 
pathway Schrodinger equation methods are used. 
Yet, even in the presence of thermal fluctuations, 
the underlying protein fold is a critical determinant 
of D-A coupling. Interestingly, emerging multi-path 
perspectives suggest that some protein motifs sup¬ 
port D-A couplings with relatively few coupling 
pathways, while other motifs more commonly sup¬ 
port multiple coupling routes with strong interfer¬ 
ences among the pathways (Prytkova et al. 2007). 
Anomalously slow ET rates seem to be observed in 
cases with few and weak coupling paths. Impor¬ 
tantly, theory and experiment show that a protein’s 
structural motif can cause orders of magnitude dif¬ 
ferences in ET rates at the very same distance as 
a consequence of tunneling pathway effects (Beratan 
et al. 1991; Gray and Winkler 2003). 

Cross-References 

► Coupled Electron Transfer 

► Dynamic Docking 

► Electron Hopping in Biomolecules 

► Electron Transfer Proteins; Overview 

► Electron Transfer Theory 

► Gated Electron Transfer 

► Redox Potential 
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Synonyms 

Anaerobic respiration; Electron transport chains in 
bacteria 


Definition 

Electron transport in facultative anaerobes is 
a prerequisite for ATP synthesis by anaerobic respi¬ 
ration. A diverse range of oxygen-independent elec¬ 
tron transport chains is synthesized in mostly 
bacterial and archaeal species when the cells switch 
their energy metabolism from aerobic to anaerobic 
respiration. 
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Enzymes oxidizing Enzymes reducing 

eiectron donors eiectron acceptors 



Electron Transport in Facultative Anaerobes, Fig. 1 Exam¬ 
ples of electron transport chains in E. coli. The quinone/quinol 
pool (either ubiquinone or menaquinone) mediates electron 
transfer from an electron donor (oxidized by a dehydrogenase) 
to an electron acceptor (reduced by a reductase). The free energy 


change of a particular electron transport chain depends on the 
standard redox potential difference of the respective electron 
donor and acceptor redox pairs. DMSO dimethyl sulfoxide, 
TMAO trimethylamine W-oxide 


Basic Characteristics 

Facultative anaerobes are organisms able to grow in 
the presence or absence of molecular oxygen, that is, 
under both aerobic and anaerobic conditions 
(Richardson 2000). When the oxygen concentration 
in their environment diminishes, many bacterial and 
archaeal (but also some eukaryotic) microorganisms 
switch from adenosine triphosphate (ATP) production 
by aerobic respiration to either anaerobic respiration or 
fermentation. In anaerobic respiration a variety of 
alternative terminal electron acceptors can be used by 
facultative anaerobes. The most prominent examples 
are nitrate, nitrite, fumarate, elemental sulfur, or metal 
ions like iron or manganese. Figure 1 depicts the 
corresponding electron transport network in the facul¬ 
tatively anaerobic bacterium Escherichia coli. Further 
exemplary model organisms for anaerobic respiration 
belong to the bacterial genera Campylobacter, 
Geohacter, Paracoccus, Shewanella, and Wolinella 
and usually possess high respiratory flexibility. Sub¬ 
strate conversions that are due to anaerobic respiration 
contribute immensely to the biogeochemical cycles of 
nitrogen, sulfur, and iron which are the foundations for 
life on Earth (Bothe et al. 2007; Dahl and Friedrich 
2007; Moir 2011). 


Bioenergetic Principles 

The reduction of an electron acceptor substrate by 
the appropriate enzyme forms the final step of an 
electron transport chain that serves in the generation 
of a proton motive force (pmf) across the membrane. 
The pmf is subsequently coupled to ATP synthesis 
according to Peter Mitchell’s chemiosmotic hypoth¬ 
esis (Nicholls and Ferguson 2002). Electrons 
migrate through a range of ► electron transfer 
cofactors, such as > flavins, ► iron-sulfur clusters, 
heme, and copper centers that are bound to integral 
membrane or membrane-associated protein com¬ 
plexes. The free energy released during this process 
is used to drive the translocation of protons across 
the coupling membrane at a certain proton per elec¬ 
tron ratio (HVe~). In facultatively anaerobic bacte¬ 
ria, ► electron transport proteins often form 
structurally conserved electron transport modules 
that may participate in different electron transport 
chains. In principal, the pmf can be built up by 
different mechanisms like proton pumping, qui¬ 
none/quinol turnover, or by a redox loop (discussed 
below). A few representative examples of electron 
transport chains in facultatively anaerobic organ¬ 
isms are presented, emphasizing typical principles 
of pmf generation. 
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Fdh —- Nar 


HCOOH CO 2 H+/e- = 2 



Fdh —►Nap 
H+/e- = 1 


NO3- NOg- + H2O 



Electron Transport in Facultative Anaerobes, Fig. 2 Sche¬ 
matic presentation of the E. coli electron transport chains from 
formate dehydrogenase (electrogenic FdnGHI complex; orange) 
to either membrane-bound nitrate reductase (electrogenic 
NarGHI complex; blue) or the periplasmic nitrate reductase 
(NapA; green). The catalytic subunits FdnG, NarG, and NapA 
are molybdoproteins while FdnFI and NarFI are electron- 
transfeiTing iron-sulfur proteins. The membrane integral sub¬ 
units Fdnl and Narl are quinone/quinol-reactive diheme 


cytochromes h. The tetraheme cytochrome c NapC and the 
iron-sulfur protein complex NapGFl represent alternative quinol 
dehydrogenase modules of the Nap system whereas NapB is 
a pentaheme cytochrome c that delivers electrons to NapA. 
Fligh-resolution structural models for many of these proteins 
have been determined. Note that the different FI'*'/e^ ratios are 
caused by the localization of the nitrate reduction site relative to 
the membrane (-r and - denote periplasmic and cytoplasmic 
membrane sides) 


Nitrate Respiration 

Many facultative anaerobes use nitrate as an energeti¬ 
cally favorable electron acceptor in the absence of 
oxygen. Nitrate is reduced to nitrite by either 
a membrane-bound or a periplasmic nitrate reductase 
(Nar or Nap enzyme). Depending on the nitrate reduc¬ 
tase used, a short electron transport chain from formate 
via the quinone/quinol pool to nitrate is operated at 
a ratio of either 2 or 1 (Fig. 2). Electron transport 
from formate dehydrogenase to the Nar enzyme repre¬ 
sents the prototype example of a two-enzyme proton 
motive redox loop. 

The redox loop mechanism couples electron trans¬ 
port to a net proton transfer across the membrane with¬ 
out proton pumping (Richardson and Sawers 2002; 
Simon et al. 2008). Instead, charge separation is 
achieved by quinone/quinol-reactive enzymes whose 
active sites for substrates and quinones/quinols are 
located on different sides of the coupling membrane. 

Nitrite molecules formed in the course of nitrate 
respiration are converted either to N 2 (via NO and 
N 2 O) in denitrihcation or to ammonium in a process 


called respiratory nitrite ammonification (see Simon 
(2011) for a comprehensive description of the archi¬ 
tecture of the corresponding electron transport chains 
in various facultative anaerobes). 

Fumarate Respiration 

In bacterial fumarate respiration, quinol oxidation is 
coupled to fumarate reduction (yielding succinate) 
which is catalyzed by a membrane-bound fumarate 
reductase complex consisting of three or four subunits 
(FrdA, B, C, and sometimes FrdD). FrdA is the fumarate- 
reducing subunit and located at the cytoplasmic side of 
the membrane. Fumarate reductase is related to complex 
II (► Succinate Dehydrogenase (Complex II)) of the 
aerobic electron transport chain. Fumarate respiration is 
widespread in facultative anaerobes and was investigated 
in detail in E. coli and Wolinella succinogenes (Kroger et 
al. 2002). These organisms are able to couple quinone- 
dependent formate (or hydrogen) oxidation to pmf gen¬ 
eration as described in Fig. 2. On the other hand, quinol 
oxidation by fumarate is an electroneutral reaction that 
does not contribute to pmf formation. Hence, fumarate 
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can be regarded as an electron sink that is predominantly 
needed to replenish the quinone pool. 

Sulfur Respiration 

Examples of facultatively sulfur-respiring bacteria are 
Desulfuromonas acetoxidans and W. succinogenes. The 
latter bacterium reduces elemental sulfur in the form of 
polysulfide which is generated from Sg rings in the pres¬ 
ence of sulfide. The terminal enzyme of the 
corresponding respiratory electron transport chain is 
membrane-bound polysulfide reductase (PsrABC com¬ 
plex). PsrA is a poly sulfide-reducing molybdoprotein, 
PsrB is an iron-sulfur protein, and PsrC is a membrane 
anchor subunit that binds a menaquinone molecule. PsrC 
belongs to the PsrC/NrfD family of polytopic membrane 
proteins whose members represent electron-transferring 
modules in different electron transport chains of faculta¬ 
tive anaerobes (Simon and Kem 2008). For example, 
NrfD is involved in electron transport from quinol to 
nitrite in enteric bacteria such as E. coli. 

Electron Transport to Oxidized Metal Ions 

Oxidized metal ions such as Fe^"^ and Mn^"^ are energet¬ 
ically favorable electron acceptors for facultative anaer¬ 
obes like Geohacter and Shewanella species. However, 
metal oxides are almost insoluble and hence unsuitable 
energy substrates unless the metal-respiring organism is 
able to transfer electrons via the outer surface of the cell 
to the metal oxide. This might be accomplished either 
by direct contact, by recently described conductive pili 
or by excreted redox mediators. Various multiheme 
c-type ► cytochromes are known to be involved in 
electron transport from the quinol pool across the peri- 
plasmic space to the outer membrane in iron-respiring 
bacteria. The usual product of Fe^"^ respiration is 
magnetite (Fe 304 ), a ferromagnetic mineral easily 
detectable with a magnet. 
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Electron Transport in Strict Anaerobes 


Definition 

Transfer of electrons from a donor to an acceptor in 
strictly anaerobic bacteria and archaea. 

Introduction 

The diversity of metabolism in strict anaerobes is truly 
incredible. They may derive energy from light 
(photolithoautotroph), redox reactions using organic 
substrates (chemoorganoheterotroph), or inorganic 
electron donors (chemolithoautotroph). Anaerobic 
chemotrophs are classically divided into the fermenting 
and respiring organisms. Fermentation is an anaerobic 
redox process, in which the oxidation of the substrate is 
coupled to the reduction of another substrate or an 
intermediate derived from the oxidation. The difference 
in > redox potential of the substrate and the end product 
provides energy for adenosine triphosphate (ATP) syn¬ 
thesis. In most fermentations, the same substrate is used 
as both reductant and oxidant, whereas in some amino 
acid fermenting organisms, one amino acid is oxidized 
and another is reduced (Stickland reaction). The oxida¬ 
tion reaction is coupled to substrate level phosphoryla¬ 
tion whereas the reduction reaction is usually not. 
Fermentations are classically considered as pathways 
in which energy is conserved solely by substrate level 
phopshorylation, but in recent years evidence accumu¬ 
lated that strictly fermenting microbes have ion-motive 
electron transport chains as well (Muller 2008). Strictly 
anaerobic microorganisms that can grow by anaerobic 
respiration use a variety of electron acceptors such as 
carbon dioxide, nitrate, dimethylsulfoxide, sulfur, fuma- 
rate, or even phenylacrylates. In contrast to facultative 
anaerobes, such as Escherichia coli, that grow by anaer¬ 
obic respiration, strict anaerobes have special pathways 
for substrate oxidation such as the Wood-Ljungdahl 
pathway in acetogenic bacteria. Furthermore, novel 
electron transport systems have been identified in 
strictly anaerobic microorganisms. Examples of these 
are described below. 

Electron Transport from Reduced Ferredoxin 
to NAD'^: The Rnf Complex 

Ferredoxins are used as electron carriers by several 
anaerobes and reduced by the action of hydrogenases. 


pyruvate:ferredoxin oxidoreductases, carbon monox¬ 
ide dehydrogenases, some pyruvate formate lyases, 
and electron bifurcation mechanisms. Reduced ferre¬ 
doxin in anaerobes has a low redox potential of —500 
to —420 mV and thus is a very potent reductant for 
electron transport chains. Genes potentially encoding 
ion-motive, ferredoximNAD"^ oxidoreductases are 
encoded by the genomes of various anaerobes. Since 
they were discovered as components of the nitrogen- 
reducing machinery in Rhodobacter, these enzymes 
were termed Rnf (Rhodobacter nitrogen /ixation). 
Most Rnf complexes are predicted to have six subunits 
(Biegel et al. 2011). The three subunits RnfA, RnfD, 
and RnfE have a number of transmembrane helices 
indicating that these subunits are membrane-bound. 
The subunits RnfB and RnfG are mainly hydrophilic 
but each has a stretch of hydrophobic amino acids at 
the N-terminus that might serve as a membrane- 
anchor. RnfC is soluble. The suhunits RnfG and 
RnfD contain a covalently bound FMN. The amino 
acid sequence of RnfC contains binding motifs for 
a ► flavin and two 4Fe4S ► iron-sulfur cluster. RnfB 
is predicted to contain six 4Fe4S cluster. 

The difference in redox potential between 
ferredoxin (E°' = -500 to -420 mV) and NADH 
(E°'= -320 mV) is rather small (AE°' = -180 
to —100 mV) and equivalent to AG*^' = —20 
to —35 kJ/mol (AG°'= —n*E*AE‘^'; n: number of elec¬ 
trons transferred, E: Earaday constant). Assuming an 
electrochemical ion potential of —200 mV across the 
cytoplasmic membrane, the free energy change of this 
reaction would allow for the translocation of one to 
two ions (AG°' = —n * F * ApNa"^; n: number of ions 
that can be pumped). A lower electrochemical ion 
potential would increase the number of ions that can 
be pumped. If one assumes the energetically most 
unfavorable value of one ion exported per mol of 
ferredoxin oxidized, 3 mol of reduced ferredoxin 
must be oxidized to give the three ions required by 
the ATP synthase to synthesize 1 mol of ATP. 

The Rnf complex was indeed shown to be an ion 
(Na"^) pump (Biegel and Muller 2010). In the 
acetogenic bacterium Acetobacterium woodii it is the 
key enzyme in caffeate and carbon dioxide respiration 
and shown to catalyze electrogenic Na"^ transport, 
which is in accord with the knowledge that this organ¬ 
ism possesses a Na'^-FiFo ATP synthase (Muller et al. 
2008) (Fig. 1). In other organisms may be the 
coupling ion. 
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Electron Transport in Strict 
Anaerobes, Fig. 1 Model of 
caffeate/carbonate respiration 
in A. woodii. Electrons 
deriving from molecular 
hydrogen or pyruvate: 
ferredoxin oxidoreductase via 
ferredoxin are channeled 
through the Rnf complex, 
generating NADH and thereby 
establishing a Na"^ gradient. 
NADH can be used to reduce 
caffeate (via Etf) (left) OR to 
reduce carbonate to acetate 
(right). Please note that 
carbonate reduction to acetate 
is by way of the Wood- 
Ljungdahl pathway and 
requires eight electrons some 
of which derive from NADH, 
others from ferrdoxin. The 
generated Na"^ gradient can be 
used by a Na’^-dependent ATP 
synthase to generate ATP 



ferredoxin 


ferredoxin.. 


electron donor 
(fructose, H 2 ) 


NADH + H+ 


NAD+ 


caffeyl-CoA hydrocaffeyl-CoA 


ATP ADP + P; 


caffeate respiration 


Wood-Ljungdahl-pathway 


The Rnf complex is the most widespread electron 
transport chain in anaerobes (Biegel et al. 2011). It is 
found in classical fermenters such as Clostridia as well 
as anaerobic respirers such as methanogenic archaea, 
acetogenic bacteria, and sulfate-reducing bacteria. 
Fermenting Clostridia reduce ferredoxin in the pyru- 
vate:ferredoxin oxidoreductase reaction but have an 
additional, novel way to reduce ferredoxin. The reduc¬ 
tion of crotonyl-CoA (an intermediate in the butyrate 
fermentation pathway) with NADH is exergonic and the 
energy liberated is used to catalyze the thermodynami¬ 
cally uphill transport of electrons from NADH to reduce 
ferredoxin (Herrmann et al. 2008). This mechanism 
termed “electron bifurcation” is similar to electron 
bifurcation by the ► cytochrome -complex and 
widespread in anaerobes. The driving force for ferre¬ 
doxin reduction with NADH as reductant is crotonyl- 
CoA reduction in Clostridia. It may be caffeyl-CoA or 
methylene-tetrahydrofolate reduction in acetogens, 
reduction of heterodisulfide in methanogens, or reduc¬ 
tion of yet unknown compounds in organisms that have 
genes similar to heterodisulhde reductase. Often, as in 
Clostridium kluyveri or the acetogen A. woodii, the Rnf 
complex is the only system to couple substrate oxidation 
to the generation of an electrochemical ion gradient. 


It should be mentioned that Rnf is also present in 
many aerobes and facultative aerobes in which it may 
be involved in providing energy for the “uphill” trans¬ 
port of electrons from NADH to acceptors with more 
negative redox potential, like to nitrogenase in 
Rhodobacter. 


Electron Transport from Reduced Ferredoxin 
to Protons: Ech and Eha/B Complexes 

A novel multisubunit Ni-Fe hydrogenase (Ech) was 
discovered in acetate-grown cells of Methanosarcina 
harkeri (Meuer et al. 1999) that is homologous to 
hydrogenase 3 and 4 from E. coli and to the CO- 
induced hydrogenase from Rhodospirillum ruhrum. 
The enzyme catalyzed the H 2 -dependent reduction of 
a two 4Fe4S ferredoxin and is also able to perform the 
reverse reaction, namely H 2 formation from reduced 
ferredoxin. Ech hydrogenase in M. harkeri is encoded 
by the echABCDEF operon. The subunits EchA und 
EchB are predicted to be membrane-integral, with the 
other four subunits anchored to the membrane. The 
amino acid sequence predicts 4Ee4S clusters in EchE 
and in EchC. EchE also exhibits a binding motif for 
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H+? 


Electron Transport in Strict Anaerobes, Fig. 2 Model of 
Ech and Eha/Ehb hydrogenases. Both hydrogenases might use 
the potential difference between ferredoxin and hydrogen to 
pump ions across the membrane. It is suggested for Ech of 


M. barkeri that protons are translocated, experimental data is 
available (a); the coupling ion of Eha/Ehb has not been deter¬ 
mined (b). Ech composition has been taken from Hedderich 
(2004) 


a Ni-Fe site. Because of the homology of the ech 
genes to subunits of ► complex I it was proposed 
that Ech functions as a proton pump (Fig. 2a). It was 
recently shown that ATP synthesis linked to Ech 
activity in membrane vesicles of Methanosarcina 
mazei was inhibited by protonophores, but not Na"^ 
ionophores, indicating an involvement of Ech in 
energy conservation by transport (Welte et al. 
2010). Under standard conditions, CO-dependent 
hydrogen formation is coupled to a free energy 
change of —19.3 kJ/mol, calculating with a redox 
difference of 0.1 V (CO: E°' = -525 mV, Hj: 
E°' = —420 mV). Assuming an electrochemical 
potential of 0.15 V in methanogens, Ech would 
allow the translocation of one proton per hydrogen 
molecule formed (for calculation see above). 

Ech provides reducing equivalents (reduced 
ferredoxin) for the first step of methanogenesis from 
H 2 + CO 2 , the formation of formylmethanofuran 
(E°'= —500 mV) with electrons deriving from molec¬ 
ular hydrogen (E°' = —414 mV). It is assumed that the 
proton gradient across the membrane is used for the 
energetically unfavorable reduction of ferredoxin by 
H 2 . Furthermore, the reduced ferredoxin generated by 
Ech can serve as electron donor of oxidoreductases in 
biosynthetic pathways. Ech is proposed to function in 
the reverse direction, establishing a proton gradient 
under conditions of acetoclastic methanogenesis, pro¬ 
ducing H 2 from reduced ferredoxin, generated by the 
oxidation of the carbonyl group of acetate to CO and 
further to CO 2 (Hedderich 2004). 


Another proton-reducing hydrogenase fueled by 
reduced ferredoxin is the Eha/Ehb hydrogenase found 
in methanogens and Pyrococcus fwiosus. In 
Methanothermohacter thermautotrophicus, the 
operons eha and ehh each encode a Ni-Ee hydrogenase 
large subunit, a Ni-Ee hydrogenase small subunit, and 
two conserved integral membrane proteins, with high 
similarity to Ech of M. barkeri. The eha operon also 
encodes 4Fe4S polyferredoxins and 14 more subunits 
(10 are membrane-integral). Ehh encodes additionally 
ferredoxin, polyferredoxin, and 11 additional proteins 
(9 are membrane-integral) (Hedderich 2004) (Fig. 2b). 

The strictly fermenting organism P. furiosus reduces 
ferredoxin in the course of oxidative metabolism by 
ferredoxin-linked oxidoreductases like glyceraldehyde 
3-phosphate (GAP):ferredoxin oxidoreductase and pyru- 
vate:ferredoxin oxidoreductase and employs the Eha/ 
Ehb hydrogenase to reduce protons to hydrogen with 
electrons derived from reduced ferredoxin. Again, the 
energy difference is used to generate an electrochemical 
proton gradient across the membrane (Sapra et al. 2003). 

Electron Transport from Various Donors to 
the Heterodisulfide in Methanogenic 
Archaea 

In the pathway of methanogenesis, the methyl group 
(derived from CO 2 or a methyl-group containing sub¬ 
strate or acetate) is bound to coenzyme M. Methyl 
coenzyme M undergoes a nucleophilic attack by the 
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Electron Transport in Strict Anaerobes, Fig. 3 (continued) 
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Electron Transport in Strict Anaerobes, Fig. 3 Model of 
electron transport from various donors to the heterodisulfide in 
methanogenic archaea. Electron transport involving F 420 - 
nonreducing hydrogenase (a), F420H2 dehydrogenase (b), 


Rnf (c) or Ech (d). Rnf and Ech are used alternatively in different 
organisms (Figure has been modified from Deppenmeier and 
Muller (2008)) 
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thiolate anion of coenzyme B (7-mercaptoheptanoyl 
threonine phosphate) giving rise to methane and 
a mixed disulfide of CoM and CoB, the heterodisulfide. 
In the last step of methanogenesis, the heterodisulfide 
is reduced to the corresponding thiols. In some 
methanogens, the heterodisulfide reductase is a solu¬ 
ble, flavin-containing enzyme assumed to couple the 
exergonic reduction of the heterodisulfide to the end- 
ergonic reduction of ferredoxin (Thauer et al. 2008). 
The latter may be used in the Ech or Eha/Ehb reaction 
(see above) for additional energy conservation or func¬ 
tions as an electron donor for CO 2 fixation. 

In Methanosarcina species, the heterodisulfide 
reductase is membrane-bound and functions as a termi¬ 
nal respiratory reductase. It is composed of two sub¬ 
units, a membrane anchoring fi-type ► cytochrome 
(HdrE) and a hydrophilic iron-sulfur protein (HdrD). 
Methanophenazine (the first phenazine isolated from 
archaea) is the mobile electron carrier in the membrane 
that connects the electron input module with the 
heterodisulfide reductase. The source of reducing equiv¬ 
alents necessary for the reduction of the heterodisulfide 
depends on the growth substrate and three different 
electron input devices are known (Fig. 3): 

1. During growth on H 2 - 1 - CO 2 a cytochrome h 
containing, membrane-bound hydrogenase (F 420 - 
nonreducing hydrogenase) channels electrons to 
the heterodisulfide reductase (Fig. 3a). 

2. When Methanosarcina strains grow on methylated 
Cl compounds, part of the methyl groups are 
oxidized to CO 2 and electrons are transferred to 
coenzyme F 420 . The membrane-bound F 420 H 2 
dehydrogenase catalyzes the oxidation of F 420 H 2 
and the reduction of the heterodisulfide under this 
growth condition. The protein from M. mazei 
is encoded by the fpo cluster that comprises 
12 genes. The subunits are highly homologous to 
proton translocating NADH dehydrogenases of 
respiratory chains from bacteria (NDH-1) and 
eukarya (complex I). In this context, it is important 
to note that the F 420 H 2 dehydrogenase and bacterial 
NADH dehydrogenases have some interesting fea¬ 
tures in common. Both enzymes have a complex 
subunit composition and contain flavin and iron- 
sulfur centers. The electron donors F 420 H 2 and 
NADH are similar in that both cofactors are revers¬ 
ible hydride donors with comparable midpoint 
potentials. The enzymes use small hydrophobic 
electron acceptors, namely quinones in the case of 


NADH dehydrogenases, and methanophenazine in 
the case of F 420 H 2 dehydrogenases. Moreover, both 
enzymes show redox-driven proton translocating 
activity (Deppenmeier and Muller 2008) (Fig. 3b). 

3. During growth on acetate the heterodisulfide reduc¬ 
tase receives electrons from reduced ferredoxin. Two 
scenarios are discussed for the composition of this 
third electron transport system. (1) The Ech hydroge¬ 
nase and the heterodisulfide oxidoreductase are 
involved in this process. H 2 (formed by the Ech 
hydrogenase) would be an intrinsic intermediate at 
the expense of reduced ferredoxin. H 2 would be 
reoxidized by the F 420 -nonreducing hydrogenase and 
the electrons channeled via methanophenazine to the 
heterodisulfide reductase. Studies using M. harkeri 
mutants lacking the Ech hydrogenase confirmed 
these results. (2) Since the genome of M. acetivorans 
does not contain genes encoding a functional Ech 
hydrogenase, alternative electron transport compo¬ 
nents might be involved in the transfer of electrons 
to CoM-S-S-CoB in this methanogen. Sequence and 
physiological analyses suggest that M. acetivorans 
replaces the H 2 -evolving Ech hydrogenase complex 
with the Rnf complex (Fig. 3d). In M. acetivorans the 
Rnf complex is presumably composed of 
a membrane-bound subcomplex containing 
a cytochrome c subunit that functions as an ion chan¬ 
nel for the translocation of either protons or sodium 
ions and catalyzes the electron transfer to 
methanophenazine. It is interesting to note that the 
genomes of the freshwater organisms M. mazei and 
M. harkeri do not contain genes homologous to the 
Rnf complex. This indicates that there may be differ¬ 
ences in the pathway for acetate conversion to meth¬ 
ane especially in the ferredoxin-dependent electron 
transport chains in Methanosarcina species. 
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Synonyms 

Electric birefringence; Electric dichroism; Rotational 
diffusion 


Definition 

Electro-optics refers to the optical properties of 
macromolecules under the influence of an applied 
external electric field. 

Introduction 

Electro-optical procedures are particularly useful for 
characterizing structures in solution of macromolecules 
and colloidal suspensions (Eredericq and Houssier 1973; 
O’Konski 1976, 1978; Stoylov 1991; Porschke and 
Antosiewicz 2007). The main source of information is 
rotational diffusion, which is a very sensitive measure 
for size, shape, and flexibility of particles. Rotational 
diffusion can be determined by electro-optical proce¬ 
dures at a high accuracy and can be interpreted quanti¬ 
tatively based on reliable hydrodynamic models. In 
addition information can be obtained about the electrical 
and optical parameters, which are also useful for assign¬ 
ment of structures. The quantitative interpretation of all 
electro-optical data and a detailed comparison of struc¬ 
tures in solution with structures in crystals or simulated 
molecular structures is enabled by “quantitative molec¬ 
ular electro-optics” (Porschke and Antosiewicz 2007). 
This analysis not only provides information about the 
global structure but also on elements of local structure. 

The experiments are very simple in principle; elec¬ 
tric held pulses are applied to solutions or suspensions 
and induce partial orientation of dipoles into the direc¬ 
tion of the field vector (Fig. 1). The orientation can be 
registered by various optical techniques: birefringence, 
dichroism, and light scattering are commonly used. 

In the past, electro-optical techniques have been 
used almost exclusively at low salt concentrations 
because the effects are particularly large under these 
conditions. However, in recent years experimental 
techniques have been extended and developed to 
a higher sensitivity - now electro-optical studies can 
also be conducted at physiological salt concentrations 
and at low field strengths (Porschke 2007,2010,201 1). 

Experimental Procedures 

Electric Dichroism 

When molecules are aligned by external electric fields, 
the absorption of light is turned anisotropic. A scheme 
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Electro-optics and Macromolecular Hydrodynamics, 
Fig. 1 Scheme of electro-optical experiments, (a) In the 
absence of an external electric field the molecules are distributed 
randomly in space, (b) Application of an external electric field 
pulse induces partial orientation of the molecular dipoles into the 
direction of the field vector. 


of an experimental setup for measurements is shown in 
Fig. 2. The anisotropy is recorded at optimal sensitiv¬ 
ity, when polarized light is used with the polarization 
plane parallel with the field vector. The change of the 
light intensity A/|| provides the ahsorhance change AA|| 


AA|| = -log[(/«-f A/||)//^ 


( 1 ) 


where /Jj* is the light intensity measured in the absence 
of an external electric field and the index 11 indicates the 
fact that the measurements are conducted with light 
polarized parallel to the field vector. The absorbance 
change AA^ is obtained by the corresponding proce¬ 
dure using light polarized perpendicular to the field 
vector. These absorbance changes are related by 


reactions: effects appearing under these conditions 
indicate the existence of reactions - for example, con¬ 
formation changes - induced by the electric field 
pulses. 

Electric Birefringence 

The optical anisotropy induced by electric field pulses 
can be recorded by measurements of the electric bire¬ 
fringence at a particularly high sensitivity. Usually the 
birefringence is measured at a wavelength out of the 
range of absorbance bands. A considerable technical 
advance was made possible by the development of 
lasers. The narrow light beam allows the construction 
of cells with a small electrode separation and the high 
light intensity facilitates measurements at a high 
signal-to-noise ratio. 

The stationary change of the light intensity Alg 
induced by an electric field pulse, using the optical 
setup shown in Fig. 3, provides the phase shift or 
optical retardation 

6 = 2 X jarcsin [{Als/Ioi) x sin^(a) + sin^(a)] — a| 

(4) 

where a is the angle between the polarization planes of 
the polarizer and the analyzer and /q, is the light inten¬ 
sity measured at angle a at zero field strength. The 
birefringence, that is, the difference in the refractive 
index of the solution parallel and perpendicular to the 
field vector, is given by 

An = bX/ilTil) (5) 


AA|| = -2 X AA_l (2) 

The reduced electric dichroism is given by 

AAii — AAx 1.5 X AAii 

.c _1i 1 


where A is the absorbance measured in the absence of 
an external electric field. 

The theory predicts, in agreement with experiment, 
that the dichroism is zero for light polarized at the 
“magic angle” of 55.7° with respect to the field vector. 
This is very useful, because experiments under magic 
angle conditions provide a test for field-induced 


where X is the wavelength of the light and I is the 
optical path length. 

The main advantage of birefringence is the fact 
that it can be measured at particularly high accuracy 
(Porschke 2011). However, there are also some 
problems: at low concentrations of analyte the bire¬ 
fringence of the solvent can be significant and may 
lead to difficulties in the analysis. The birefringence 
signal of the solvent may be measured separately 
and be subtracted, but this requires careful 
matching. Another drawback of the birefringence 
technique is the fact that it does not provide 
a simple, separate, and selective analysis of signals 
resulting from field-induced reactions (cf. Electric 
Dichroism). 
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Electro-optics and Macromolecular Hydrodynamics, 
Fig. 2 Scheme of a setup for measurements of the efectric 
dichroism. Pofarized light at a wavelength in the absorbance 
range of the molecules under investigation is generated by 
a xenon or mercury-xenon arc lamp, a monochromator and 
a polarizer; changes of the absorbance of this light induced in 


the sample cell by application of an electric field pulse are 
measured by a detector and recorded by a digitizer. The PC is 
used to induce repetition of measurements at given time intervals 
with inversion of the field vector after each pulse and signal 
averaging; the shutter is opened only during recordings to avoid 
photo-reactions 


Diode 

Shutter Polarizer X/4 Analyzer Detector Digitizer 



Electro-optics and Macromolecular Hydrodynamics, 
Fig. 3 Scheme of a setup for measurements of the electric 
birefringence. A laser light heam, polarized at an angle of 45° 
(in clockwise direction viewed from the light source) with respect 
to the field vector, passes the cell, a quarter wave plate, an 


Interpretation of Experimental Data 
Amplitudes 

The orientation of molecules in the presence of an 
external electric field is determined by the interac¬ 
tion of dipoles with the electric field. In the case 
of an induced dipole with a preferential polariz¬ 
ability a along one axis the energy of interaction is 
given by 

t/,-= — ^a£^cos^(0) (6) 


analyzer and hits the detector. The slow axis of the 2/4 plate is 
parallel to the polarization plane of the polarizer, and the analyzer 
is rotated from the cross-position by a small angle a toward the 
polarization plane of the polarizer. The telescope is used to reduce 
the heam diameter. Automatic sampling as described in Fig. 2 


where E is the electric field strength and 0 is the angle 
between the induced dipole and the field vector. 
For a permanent dipole with a moment /tp the 
corresponding energy is 

Up = —jipE cos 0 (7) 

The distribution of molecular orientations is deter¬ 
mined by the interaction energy with respect to the 
thermal energy kT (k = Boltzmann constant and 
T = absolute temperature) in the form of the following 
Boltzmann function 
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exp(—(7 jkt) 
fg exp(—(7 /kT)2Ti sin 


( 8 ) 


where U = Uj + Up. The dependence of the dichroism 
on the electric field strength is described by the 
following Boltzmann function also denoted as the 
orientation function 

fg [exp(—[//^r)](3cos^0 — l)(7isin0r/0) 

^ [exp(—17 /kT)]2n sin 070 

(9) 

which provides the experimental dichroism value 

= ^<^00 (10) 


where is the limiting value of the dichroism at 
infinite field strength. 

For induced dipoles the integrated form of the 
orientation function is given by 


4 ) = 






( 11 ) 


with y = {aE^)/{2kT). For permanent dipoles the 
integrated form of the orientation function is 


4 ) = 



3(cothi3- 1 /^) 

P 


( 12 ) 


with fi = jipE/kT. 

Experimental values of the stationary dichroism 
measured at different electric field strength may be 
fitted to orientation functions. An example is given in 
Fig. 4, showing that such data can be used to distin¬ 
guish induced from permanent dipoles and to deter¬ 
mine the magnitude of the dipole moment. Finally, the 
fit also provides the limiting value of the dichroism 4oo> 
which provides direct information on the orientation of 
the chromophore with respect to the dipole vector 
according to the following relation 


^o, = (3/2)(3cos2((p)-1) (13) 



Electro-optics and Macromolecular Hydrodynamics, 
Fig. 4 Electric dichroism ^ of lac repressor measured as 
a function of the electric field strength E. The continuous line 
shows a fit according to a permanent dipole mechanism with 
a dipole moment of 1,320 Debye units and ^ 00 = —0.12. The 
dashed line shows a least squares fit according to an induced 
dipole mechanism (Reproduced from Porschke 1987) 


analyzed by orientation functions in the same line. 
However, limiting values of the birefringence usually 
cannot be interpreted as directly as in the case of 
dichroism values, because in most cases there are 
many contributions to the birefringence. 


Decay Time Constants 

The decay of the dichroism or of the birefringence 
observed upon pulse termination reflects rotational 
diffusion, which is very sensitive to the size, shape, 
and flexibility of the particles under investigation. 
Thus, decay time constants are a very useful source 
of information about structures in solution. 

Analytical solutions are available for rigid particles 
of simple shape. For spherical particles the decay time 
constant is given by 


where cp is the angle of the transition dipole moment of 
the chromophore with respect to the dipole vector. 

Stationary values of the electric birefringence 
measured at different electric field strengths may be 


T, = Vhr\lkT (14) 

where Vh is the effective hydrated volume and t) the 
viscosity of the solvent. 
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Electro-optics and 

Macromolecular 

Hydrodynamics, 

Fig. 5 Model of the complex 
between cyclic-AMP-receptor 
and promoter DNA with 
a DNA bending angle of 92 
(Reprinted with permission 
from Porschke 2010; 
Copyright 2010 American 
Chemical Society) 



The rotational diffusion of rigid rods is mainly 
determined by their length I, whereas the radius r has 
a marginal effect. A useful approximation for the 
decay time constant of rigid rods (Tirado et al. 1984) 
is given by 

^ 18/tr[ln(^) - 0.662 + 0.917/<7 - O.OSO/t/^] 

(15) 

where q = Ijlr. 

There are other useful equations for the decay time 
constants of ellipsoids or thin circular discs. 

For particles of complex shape the electro-optical 
decay curves may be calculated from their structures. 
Simulation programs have been assembled to calculate 
all electro-optical data from crystal or model structures 
of macromolecules (Porschke and Antosiewicz 2007; see 
section below). Because of the progress in computation, 
these calculations are relatively fast and quite reliable. 

The flexibility of structures is also indicated by the 
decay time constants. This has been used, for example, 
in the determination of persistence lengths of DNA 
(Porschke 1991) and of other biopolymers. 

Instrumentation 

One of the problems in the field is the fact that there is 
no supplier of ready-for-use electro-optical instru¬ 
ments. Thus, instruments have to be assembled from 
individual components. The required optical compo¬ 
nents can be obtained from many different suppliers. 
Detectors and transient digitizers are also available 
from various sources. However, currently it may be 
a problem to get a suitable high-voltage pulse 


generator. Finally, cells for measurements with elec¬ 
trodes for pulse application cannot be easily pur¬ 
chased, but have to be constructed in some workshop 
for precision engineering. These problems have lim¬ 
ited advanced applications of electro-optics to special¬ 
ized laboratories, in particular applications requiring 
high sensitivity and high time resolution. 

Simulation of Electro-optical Transients from 
Crystal or Model Structures 

Most of the structures of current interest cannot be 
represented by simple geometric bodies like spheres or 
ellipsoids. More accurate representations are required, 
for example, for the analysis of protein-nucleic acid 
complexes. Progress in modeling of hydrodynamic 
parameters both on the theoretical level and in computer 
implementations allows surprisingly accurate descrip¬ 
tions of translational and rotational diffusion of stractures 
of any shape (see, e.g., Garcia de la Torre [2007] and 
Garcia de la Torre and Ortega ► Linear dichroism this 
volume). Using the theory of electro-optical transients 
(Wegener et al. 1979), simulated diffusion tensors can be 
combined with optical and electrical parameters for the 
calculation of electro-optical transients for any structure 
(Porschke and Antosiewicz 2007). The following exam¬ 
ple demonstrates the high potential of this approach. 

We start from a model constructed in atomic detail 
for a complex between the cAMP receptor and pro¬ 
moter DNA (cf. Fig. 5). For calculation of the diffusion 
tensor the structure is converted into a bead model. 
Based on procedures described in the literature the 
diffusion tensor is obtained at a high reliability. The 
next step is the calculation of the absorbance tensor 
based on the optical parameters and the positions of the 
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Electro-optics and 

Macromolecular 

Hydrodynamics, 

Fig. 6 Time constants Xj (o, 
left ordinate) calculated for 
models of the complex 
between cyclic-AMP-receptor 
and promoter DNA with 
different DNA bending angles 
(cf. Fig. 5). For comparison 
the translational diffusion 
coefficient D (x, right 
ordinate) is also shown for 
these models (3°C, model 
construction and calculations 
described in Porschke 2010) 



residues contributing to the absorbance. Finally the 
electrical parameters are required. In the case of the 
complex between cAMP receptor and promoter DNA, 
the electric response is dominated by a special quasi¬ 
permanent dipole resulting from the non-symmetric 
distribution of phosphate charges (Antosiewicz and 
Porschke 1989). This quasi-permanent dipole moment 
must be referred to the center of diffusion, which is 
obtained from the bead model simulation. Finally the 
tensors of diffusion and of absorbance together with 
the dipole vector are combined using the theory devel¬ 
oped by Wegener et al. (1979) for calculation of the 
electro-optical transient. This procedure can be applied 
easily for different model structures as shown in the 
case of the cAMP receptor-promoter DNA complex 
for different DNA bending angles (Porschke 2010). 

The calculated electro-optical parameters are 
strongly dependent on the structure of the protein- 
DNA complex. In particular, the decay time constant 
is most sensitive to the bending angle and thus, the 
bending angle can be determined at a high accuracy 
by comparison with experimental data. The bead model 
simulations provide both the translational and rota¬ 
tional diffusion parameters. A comparison of the depen¬ 
dence on the DNA bending angle obtained for the 
translational diffusion with that for the electro-optical 
decay time constants (Fig. 6) illustrates the unique 
sensitivity of the electro-optical approach: a variation 
of the bending angle from 0 to 177° results in a change 
of the translational diffusion by 24.7%, whereas the 
decay time constant changes by 258%. Thus, electro¬ 
optics is much more sensitive than methods based on 


translational diffusion. Another advantage is the fact 
that the optical techniques available for electro-optical 
analysis can be applied down to very low concentra¬ 
tions. In the case of the promoter DNA-CAP complex, 
the measurements were conducted at concentrations of 
the complex of ^0.16 pM (Porschke 2010). 

Kinetics 

Electro-optical measurements provide information on 
global structure and also on some aspects of local 
structure by relatively simple measurements within 
a short time interval. Information on structures 
obtained within short time intervals is required also 
in kinetics. Thus, the application of electro-optics for 
analysis of macromolecular kinetics is very promising. 
The most popular technique for studies of biomolecu- 
lar kinetics is stopped flow. Electro-optical detection 
for stopped flow measurements requires construction 
of an instrument combining stopped flow and electric 
field jump techniques. The technical problem has been 
solved by a relatively simple construction (Porschke 
1998). Initial applications demonstrate the excellent 
capacity of the technique (Porschke 2012). 

Summary 

Electro-optical procedures prove to be very useful for 
analysis of macromolecular structures in solution. The 
combination of rotational time constants with optical 
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and electrical parameters provides particularly sensi¬ 
tive information on global structures and also on local 
structure elements. Electro-optical transients can be 
calculated in detail from molecular structures and, 
thus, experimental data obtained in solution can be 
compared quantitatively with data obtained, for exam¬ 
ple, from crystals or from simulations. Instruments for 
electro-optical measurements at physiological salt 
have been developed. The combination of electro¬ 
optics with stopped flow provides detailed information 
on macromolecular structures during reactions. 

Cross-References 

► HYDFIT and Related Packages for Linear 
Molecules 

► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules 

► Kinetics: Overview 

► Kinetics: Relaxation Methods 

► Linear Dichroism 

► Linear Dichroism Spectra - Measurement 

► Nucleic Acid Linear Dichroism 

► Stopped-Flow Techniques 
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Electron Microscopy: Classical Sample 
Preparation 

Bruno M. Humbel 
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Lausanne, Switzerland 

Synonyms 

FIB-SEM; Focused ion beam-SEM; Scanning electron 
microscopy; SEM; Transmission electron microscope; 
TEM 


Definitions 

Chemical fixation: Fixation of a biological sample 
with chemicals that cross-link proteins and other 
cellular macromolecules 

Resin: A solution of chemicals that polymerize into 
a large polymer network thus embedding the biological 
sample 
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Introduction 

An electron microscope is a hostile environment for 
living organism; the water evaporates in the vacuum, 
needed to increase the mean free path of electrons, 
and the sample is exposed to high energy electrons. 
Furthermore, for transmission electron microscopy 
(TEM) the sample must he thin, 50-500 nm, to be 
transparent for electrons. To fulfill these require¬ 
ments, the biological object has to undergo 
a preparation scheme: chemical fixation, dehydration, 
and resin embedding. Further, ultrathin sections need 
to be cut and stained with heavy metal salts before 
they can be investigated in the electron microscope. 
Though more modern, less damaging preparation 
methods based on cryo-fixation are available, chem¬ 
ical fixation is over the world still the most widely 
used preparation method to analyze cells and tissue 
by electron microscopy. In particular, it is a valuable 
technique when large, in the millimeters, or hazard¬ 
ous samples need to be prepared. 

Chemical Fixation 

The biological samples are chemically fixed with alde¬ 
hydes, osmium tetroxides, and uranyl acetate. 

The most important aldehydes are formaldehyde 
and glutardialdehyde. They bind to terminal -NH 2 
groups through the formation of Schiff’s bases and 
will therefore cross-link proteins. 

Formaldehyde (FA) is a small molecule, HCOH 
(32 Da) and is prepared by depolymerization of para¬ 
formaldehyde. Note that the fixing molecule is form¬ 
aldehyde and not the white powder paraformaldeyde. 
It is a gas dissolved in buffer, therefore it has to be 
stored and fixation has to take place in a closed con¬ 
tainer. It has been reported that formaldehyde can react 
with proteins and to a limited extent with nucleic acids 
and lipids. The reaction is reversible upon removal. 
Formaldehyde is the favorite fixative for immunohis- 
tochemistry most likely because of its poor fixation 
capacity. 

Glutardialdehyde (GA) is a molecule 
CIIOC 3 H 5 CHO (104 Da) that has two aldehyde groups 
at each end. At physiological pH, however, it forms 
pentamers through aldol condensation that contain a, (3 
unsaturated aldehyde groups. The mesomery of the 
aldehyde groups is responsible that glutardialdehyde 


binding is irreversible. Glutardialdehyde reacts with 
proteins but not with lipids and nucleic acids. The 
better fixation, compared to formaldehyde, is most 
likely the main reason for the reduction of efficiency 
of immunolabeling. 

Osmium tetroxide (OSO 4 ) is a highly oxidizing agent 
that is used to fix proteins and unsaturated fatty acid 
chains, mainly the lipids in membranes, it also adds to 
the contrast. OSO 4 is the most controversial fixatives. It 
was shown that at temperatures above zero centigrade 
OSO 4 has no longer the fixing structure depicted in the 
text books (oxidations state 6 ) but forms a mixture of 
molecules in the oxidation states 4 and 3 and it functions 
rather like a protease than like a fixative. Also its contri¬ 
bution to contrast seems to be minimal without the 
addition of heavy metal stains, like uranyl acetate and 
lead citrate. Still for high contrast images in TEM or 
FIB-SEM, imaging back scattered electrons, chemical 
fixation with osmium tetroxide with or without potas¬ 
sium ferricyanide is needed. 

Interestingly enough osmium tetroxide used during 
freeze-substitution does not get black below —30°C. It 
has been described that osmium tetroxide can react 
already at about —70°C, to the oxidation state of 
6 that better coincides with the assumed formula of 
the text books. Preliminary results suggest that fixation 
with OSO 4 during freeze-substitution in combination 
with low-temperature embedding affects the antigenic¬ 
ity far less and the label efficiency is comparable to that 
when only aldehydes were used. 

Uranylacetate (UA) can stabilize phospholipids. 

Chemical fixation leads to a stabilization of the 
cellular ultrastructure to endure the following dehydra¬ 
tion and resin embedding, but it can induce large 
changes of the cellular fine structure: membrane 
rearrangement, aggregation of nucleic acids, loss of 
proteins, and changes of the conformation of proteins. 

Dehydration and Resin Embedding 

Dehydration is done in graded series of ethanol or 
acetone. During dehydration the sample undergoes 
large changes in volume, at the beginning the vol¬ 
ume increases up to 300% then it goes back to 100% 
at about 70% of dehydration it collapse to 40-50% at 
100% dehydration. It has to be assumed that not all 
the subcellular structures react identical in this 
volume changes. Dehydration is necessary for 
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embedding into the mostly hydrophobic resins. 
There is one important exception, the preparation 
scheme for the Tokuyasu cryo-sectioning technique, 
here, the preferentially formaldehyde with or with¬ 
out low concentrations of glutardialdehyde fixed 
samples are embedded in an aqueous 2.3 M sucrose 
solution that is frozen for sectioning in a cryo- 
ultramicrotome. 

The presently used resins are either epoxides: Epon, 
Durcopan, Spurr’s resin for morphological studies or 
methacrylates, Lowicryls: LR white for immuno his¬ 
tochemistry. The epoxy resins can covalently bind to 
the biological structures and form copolymers, 
whereas the methacrylates, polymerizing by a radical 
reaction, enmesh the biology in the network of the 
polymer formed. In general the methacrylates are pre¬ 
ferred for immunolabeling; probably the less well 
embedding leaves more spaces for the antibody to 
access the antigen. The covalent binding with the 
epoxy resin, on the other hand, can react as a fixative 
and prevent the loss very small antigens, e.g., 
neuro transmitters. 

Summary 

Conventional preparation with aldehyde and osmium 
tetroxide fixation and epoxy embedding is still the 
most widely used preparation scheme for transmission 
electron microscopy. Recently, this method gained an 
importance in serial block face tomography done by 
scanning electron microscopy either by in situ serial 
sectioning, array tomography, or FIB-SEM tomogra¬ 
phy. The method is valid at low biological resolution, 
i.e., studying cell-cell interactions, coarse distribution 
of cellular organelles, investigation of large or biohaz¬ 
ardous samples. For immunohistochemistry, the 
Tokuyasu cryo-sectioning technique, or methacrylate 
embeddings are preferred. For higher resolution stud¬ 
ies implying membrane-membrane interactions, e.g., 
Golgi organization, methods based on cryo-fixation, 
freeze-substitution, freeze-fracture, or CEMOVIS are 
the only reliable tool. 

Cross-References 

► Electron Microscopy: Cryo-Preparation 

► Freeze-Substitution 


References 

Behrman EJ. The chemistry of osmium tetroxide fixation. In: Revel 
JP, Barnard T, Haggis GH, editors. The science of biological 
specimen preparation 1983. AMF O’Hare: SEM; 1984. p. 1-5. 

Boyde A, Maconnachie E. Morphological correlations with 
dimensional change during SEM specimen preparation. 
Scanning Electron Microsc. 1981;4:27-34. 

De Winter DAM, Schneijdenberg CTWM, Lebbink MN, Lich B, 
Verkleij AJ, Drury MR, Humbel BM. Tomography of 
insulating biological and geological materials using focused 
ion beam (FIB) sectioning and low kV BSE imaging. 
J Microsc. 2009;233:372-83. 

Denk W, Horstmann H. Serial block-face scanning electron 
microscopy to reconstruct three-dimensional tissue 
nanostructure. PLoS Biol. 2004;2:1900-9. 

Glauert AM, Lewis PR. Biological specimen preparation for trans¬ 
mission electron microscopy. London: Portland Press; 1998. 

Griffiths G. Fine structure immunocytochemistry. 
Berlin/Heidelberg; Springer; 1993. 

Hayat MA. Principles and techniques of electron microscopy. 
Biological applications. 4th ed. Cambridge: Cambridge 
University Press; 2000. 

Hopwood D. Cell and tissue fixation, 1972-1982. Histochem J. 
1985;17;389^42. 

Hopwood D. Fixatives and fixation: a review. Histochem J. 
1969;1:323-60. 

Hopwood D. Theoretical and practical aspects of glutaraldehyde 
fixation. Histochem J. 1972;4:267-303. 

Micheva KD, Smith SJ. Array tomography: a new tool for 
imaging the molecular architecture and ultrastructure of 
neural circuits. Neuron. 2007;55:25-36. 

Tokuyasu KT. A technique for ultracryotomy of cell suspensions 
and tissues. J Cell Biol. 1973;57:551-65. 

Wang L, Humbel BM, Roubos EW. High-pressure freezing 
followed by cryosubstitution as a tool for preserving 
high-quality ultrastructure and immunoreactivity in the 
Xenopus laevis pituitary gland. Brain Res Brain Res 
Protoc. 2005;15:155-63. 

White DL, Andrews SB, Faller JW, Barmett RJ. The chemical 
nature of osmium tetroxide fixation and staining of 
membranes by x-ray photoelectron spectroscopy. Biochim 
Biophys Acta. 1976;436:577-92. 


Electrophoretic Light Scattering 

Fraser McNeil-Watson 

Malvern Instruments Fimited, Malvern, UK 


Synonyms 

Dynamic light scattering; Faser Doppler electrophore¬ 
sis (LDE); Fight scattering; Phase analysis light scat¬ 
tering (PALS) 
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Electrophoretic Light 
Scattering, Fig. 1 Optical 
arrangement of a typical ELS 
instrument 


Optical Configuration 



Introduction 

Electrophoretic light scattering, commonly abbrevi¬ 
ated to ELS, is an analytical method that measures 
the electrophoretic mobility of a suspension of 
particles or macromolecules. As in the related DLS 
(► dynamic light scattering) method, the suspension 
is illuminated by a light source, generally a laser of 
a visible wavelength (^400-800 nm). The scattered 
light from the suspension is sampled by an optical 
system connected to a photodetector and signal 
processing system that allows the measurement 
of particle motion utilizing the Doppler effect. ELS 
differs from DLS in that the random Brownian motion 
of the suspended entities is superimposed on a directed 
motion caused by the application of an electric field. 
This motion is dependent on the electric charges 
carried by the suspended particles modified by the 
presence of the suspending liquid which is usually 
an electrolyte solution. Many commercially available 
ELS instruments are also capable of performing the 
DLS measurement. 

Optical Arrangements 

The main components of a typical modern instrument 
are shown in Eig. 1 . Light from a laser source is split 
into two portions. One containing >99% of the flux 
is directed toward the cell through subsidiary compo¬ 
nents to provide beam shaping and alignment finally 
focusing the light into a small lozenge-shaped 
“probe volume” perhaps 50 pm across located in the 
electrophoresis cell or chamber. This “scattering 


beam” is passed through a variable attenuator to adjust 
the level of illumination so as to adapt to a wide range 
of sample concentrations while keeping the detected 
light in a suitable range for the photodetector. 
Scattered light from the sample is collected at a partic¬ 
ular angle (“the scattering angle”). This scattered light 
is mixed with the second portion of the source light 
(“the reference beam”) which forms an interferogram 
whose fluctuations record the frequency spectrum 
of the scattered light and reference combination. 
The reference beam is reflected from a moving 
mirror driven by a piezo motor to provide a constant 
frequency shift equivalent to a steady motion of the 
scattering entities in the sample cell. This resultant 
signal contains both this steady (“modulator shift”) 
and the directed motion (electrophoretic velocity) of 
the sample. To this is added the random “Brownian 
motion” of the sample. Without the modulated refer¬ 
ence beam, it would not be possible to deduce the 
direction of movement or measure very low velocities. 
The modulator defines a reference frequency (typically 
around 250 Hz) which corresponds to stationary scat¬ 
tering: Particles moving across the cell in one direction 
produce an increase in frequency, the opposite direc¬ 
tion a decrease. Depending on the sign of the applied 
electric field, the direction will correspond to either 
positive or negatively charged scattering entities. 
The scattering angle used in the apparatus described 
is around 12° - this is known as forward scatter and is 
used to minimize the “broadening” in the frequency 
spectrum due to Brownian motion. The scattering 
angle determines the scattering vector Q as shown in 
Figure 3. The frequency shift due to electrophoretic 
motion is proportional to Q, while that due to 
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Brownian motion to so the ‘shift to width ratio’ - 
the resolution of the system - is increased hy reducing 
Q. There are of course trade-offs to be considered - 
low Q means measuring low frequencies which 
intrinsically means longer measurement times and the 
chosen angle is eventually a working compromise. 
Very small scattering angles also lead to the possibility 
of detecting “flare” from the surfaces of the various 
components in the optical path - including the cell 
itself - which will manifest themselves as apparent 
uncharged and hence stationary sample components, 
and hence necessitate extra care in the manufacture 
and care of the apparatus. The mixed signal is passed to 
a suitable detector which is chosen for its sensitivity, 
speed, and linearity of response. If the instrument is 
also to be used for dynamic light scattering, the 
requirements are usually more demanding in all these 
areas and a single photon counting device with 
a detection limit around 50 ns and therefore ability to 
work up to 20 MHz count rate is usually a suitable 
choice. Practical detection limits are lower due to the 
random arrival times of photons. In the instrument 
considered here, the sample scattering is usually 
adjusted to around 200^00 thousand counts per 
second (Kcps) while the reference beam is set at 
2,500 Kcps, though these levels are not critical. The 
scattering from clean water is usually around 30 Kcps 
and samples suitable for detection need to exceed this 
by around 10 Kcps for practical measurements. 
The implications of this will be discussed further 
in the section on applications. 

Cell Configurations 

There are two basic forms of the electrophoresis cell 
that have been used in modern ELS measurements; the 
capillary and the parallel plate electrode. 

The capillary cell consists of a tube joining larger 
chambers containing the electrodes. The tube is 
transparent, usually of glass or quartz (pure silica) or 
a suitable polymer. This form of cell has the advantage 
that the electrodes can be large in area so reducing the 
current density at the surface and ensuring that any gas 
bubbles produced are remote from the measurement of 
the sample which is performed in the central part of the 
capillary. The current flowing is limited by the cross- 
sectional area of the capillary, rather than the size of 
the electrodes so limiting the Joule heating from the 


passage of the electric current. The tube format also 
makes the thermal regulation of the cell fairly easy as 
the distance from the center to the wall is low and the 
wall can be surrounded by a thermostat arrangement 
such as a water bath or other heat sink. The same 
geometry means that comparatively high voltages 
(50-200 V typically) must be applied to produce the 
required electric fields (2,000 V/m) in a capillary cell 
a few centimeters in length. A major advantage of the 
capillary form is that the electric field is established 
parallel to the axis of the tube due to the geometry and 
remoteness of the electrodes, the form of which is 
immaterial to the field direction. When first used in 
microelectrophoresis, the use of the capillary form 
required compensation for the effect of electroosmo¬ 
sis: The wall of the capillary is charged in solution and 
gives rise to a distribution of charge in the solution 
with an excess of counter ions (opposite charge to the 
wall) adjacent to the wall. When the field is applied, 
these ions move toward the opposite charge electrode 
and if the capillary tube has closed ends the flow is 
reflected forming a steady-state parabolic distribution 
of flow. At the center of the capillary, the net 
flow observed is the sum of the electrophoretic velocity 
of the particles or molecules and the electroosmotic 
(EP -I- EO). To measure the amount of the EO flow, it 
was normal to perform measurements at several points 
across the capillary and fit a parabolic profile to 
the results allowing the EP velocity to be extracted. 
Alternatively, the measurement could be made at the 
so-called stationary layer or level (SL) where the wall 
and bulk flow in opposite senses canceled. In a circular 
capillary, this is located fairly close to the wall 
(0.14 and 0.86) of the diameter and required careful 
alignment and a small probe volume to select only 
substantially zero EO flow. Even so, practical mea¬ 
surements were “broadened” by the finite geometry 
of the measurement sampled across the infinitesimal 
SL. The method of scanning several places across the 
tube also requires careful alignment and measurements 
near to the wall for accuracy - and is subject to any 
drift or other systematic uncertainty in the set of mea¬ 
surements arising from Joule heating or sample degra¬ 
dation or evolution. These difficulties with the 
capillary format have been overcome in some modem 
instruments by exploiting the finite time taken for 
electroosmosis to be established. After the field is 
applied, rapid measurements of the apparent EP 
velocity show a steady increase as increasing EO 
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Fig. 2 Electroosmotic profile 
across a capillary 



flow is established. In a cell of 2 mm cross section, it 
takes some 100 ms for EO flow to become significant 
and around 1,000 ms for it to be fully established. So if 
the instrument can make rapid velocity determinations 
and the electric field switched on and off rapidly, 
the effects of EO can be avoided. Eigure 2 shows the 
effects of measuring at different points across 
a capillary with flelds of different switching frequen¬ 
cies and demonstrates that using this technique 
measurement at the center of the capillary is perfectly 
possible as the remaining small component of EO flow 
can be estimated directly from the multiple frequency 
data and removed to a high order of accuracy. This 
measurement approach is referred to as EER-SER, fast 
field reversal followed by slow field reversal: During 
the EER stage, a mean value for the mobility is 
obtained in the absence of EO effects, during the SER 
stage a complete spectrum of mobility is obtained with 
EO included, combining the two gives the required 
spectrum with the EO contribution identified 
and removed. The capillary need not be a straight 
tube: It can be “folded” into a U tube and packed into 
the same form factor as a standard cuvette. 

Another difficulty of the traditional capillary cell - 
that of effectively cleaning it - has been removed 
by the supply of a disposable type manufactured 
from polycarbonate polymer. 

The second type of cell uses parallel electrodes in 
the form of plates. These are usually a few millimeters 
square and spaced a few millimeters apart. It is impor¬ 
tant that the plates really are parallel to a high order 
of accuracy to ensure the field is uniform and the EP 
velocity is really normal to the electrodes. One 


advantage of this form of cell is that the voltages 
used are quite low - typically less than 5 V to establish 
a field of 2,000 V/m. A disadvantage is that the evolu¬ 
tion of gas bubbles is close to the measurement zone 
and can distort the field. This is usually addressed by 
frequent reversing of the field. The electrodes are usu¬ 
ally suspended in the center of a cuvette well away 
from the walls of the cell so electroosmosis can usually 
be ignored. Joule heating from the passage of current is 
likely to give rise to turbulent flow so necessitating 
lowering the voltage and achieving poorer resolution 
and accuracy. This type of cell is probably best 
suited to lower ionic strengths (<0.05 M) so may not 
generally be suitable to measurements in physiological 
conditions. One advantage of this format is that very 
high fields can be obtained with moderate applied 
voltages and hence are often used with nonaqueous 
suspension liquids of low dielectric constant and low 
conductivity where EP mobilities are typically very 
low. However, these applications are rare in the field 
of biophysics. A particular advantage of a capillary cell 
is that a small aliquot of sample can be introduced as 
a plug of disperse material suspended in a continuous 
phase of similar electrolyte so keeping the sample 
itself away from the electrodes for long enough to 
carry out the measurement. 

Eor either type of cell, the electrodes are usually 
formed from noble metal to avoid reactions with 
the salt or buffer solutions typically used as sample 
solutions. Gold, platinum - often platinized, and pal¬ 
ladium have all been used in modern instrumentation. 

The electrodes are driven by a field generator circuit 
capable of supplying up to hundreds of milliamperes of 
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Electrophoretic Light 
Scattering, 

Fig. 3 Frequency and phase 
analysis of the ELS signal 
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Signals collected during the 
application of the electric field 
are processed by Fourier 
Analysis and Phase Analysis to 
calculate the frequency shift 
and derive the mobility. 


As the field switches between +ve, zero, and -ve values, the scattered light 
frequency spectrum shifts symmetrically about the modulator value. The phase 
spectrum tracks the frequency shift. The mobility is calculate (from that shift) 
Phase 

At = UeQE 



Q = AimlX sin{0/2) (scattering vector) 
E (applied electric field) 

'Je (electrophoretic mobility) 


current at voltages from 1 to 200 depending on the type 
of cell used and the sample conductivity. The held is 
usually switched in polarity at regular intervals to 
preserve overall neutrality. The frequency of switching 
used ranges from kilohertz to a fraction of a hertz 
depending on the requirements of the measurement. 
In a typical modem instrument, the held will be 
switched at around 2 kHz to measure the conductivity 
of the sample - avoiding polarization effects. It is 
switched at around 20 Hz during the FFR period to 
measure mean mobility, and around 1 Hz to measure 
the mobility spectrum. The ultimate resolution of the 
spectrum is determined by the longest held application 
time (lowest frequency). In some cases, the held gen¬ 
erator can be set to drive the cell with a constant 
current rather than a constant voltage. This approach 
allows the current to be automatically adjusted to 
nullify the effects of electrode polarization. When 
a constant voltage is applied and the current sensed 
we see that the current falls away due to a buildup 
of counter ions near the electrode. Constant current 
application overcomes this by automatically increas¬ 
ing the voltage supplied to overcome this extra 
resistance. The effect is most noticeable in conductive 
solutions (>0.05 M) and in parallel plate cells where 
the electrodes are close together - as the polarization 
zone is then a sensible proportion of the gap between 
the electrodes. It is much less noticeable in capillary 


format cells. If constant current control is not used, 
then the changing current can be monitored to allow 
the electric field at any time to be measured accurately. 

Signal Processing 

The signal from the photodetector may be either a digital 
waveform - consisting of a sequence of photon detec¬ 
tions as electronic pulses spaced in time, or a fluctuating 
analogue signal. The former will typically be integrated 
by a counting circuit so that a series of data samples 
representing the amplitude of the signal is formed; In the 
case of the analogue waveform, the same result is 
achieved using an ADC (analogue-digital converter) at 
regular sampling intervals resulting in a multi-bit digital 
signal. In most modem instmments, this signal is 
converted to a frequency spectmm by selecting 
a number of samples corresponding to a particular appli¬ 
cation of the electric field and performing a Fourier 
transform. This is a standard mathematical technique 
for converting frequency from a time series. An alternate 
approach if only the mean value of the frequency is 
required is to perform phase analysis (the acronym 
PALS - phase analysis light scattering is often applied 
to this). This is effectively a single frequency transform 
performed at the modulator frequency in which the 
phase difference of the transform measures the 
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frequency shift from the modulator frequency. This 
technique is particularly suitable for small values of 
frequency shift when, for example, the electrophoretic 
mobility itself is low, or the applied field is low - to 
reduce Joule heating in a conductive sample. As PALS 
tracks changes in frequency as a function of time, it can 
be applied to the entire signal rather than discrete sam¬ 
ples synchronized with the electric field. The switching 
of the electric field gives rise to phase changes synchro¬ 
nized with the switching of direction of the electric field. 
This PALS approach can be applied in the FER-SFR 
approach discussed above. PALS is used to measure the 
mean mobility during the FFR stage; Fourier analysis is 
used during the SFR stage to provide the full mobility 
spectrum. The steps described above are illustrated 
in Fig. 3. 

An alternate approach that has been used is to process 
the signal using a digital correlator. This is an electronic 
device which takes individual photo detections as a time 
series and forms the correlation function - effectively 
multiplying the signal by a delayed copy of itself 
over a series of different delay times. The correlation 
function can be converted to a frequency spectrum 
by taking a simplified form of the Fourier transform - 
the discrete cosine transform - which is one of the 
motivations for this approach. The use of a correlator 
was convenient when DLS instruments were introduced 
with ELS as an auxiliary measurement. Correlation 
remains the best approach to recovering the detailed 
spectrum of Brownian motion and hence particle 
size: Recently more convenient and sensitive techniques 
such as PALS have replaced it as the method of 
choice for the mobility measurement (Cummins and 
Pike 1973, 1976). 

Data Analysis 

Having derived the frequency spectrum, the next step 
is calculate electrophoretic mobility and then derive 
zeta potential and perhaps the underlying electric 
charges present on the sample moieties. The conver¬ 
sion process is illustrated in Fig. 3. 

The relationship between frequency shift and mobil¬ 
ity depends on the Q vector - set by the optical arrange¬ 
ment and the electric field. The latter can either be 
measured by taking the applied voltage across the gap 
between the electrodes, or better by measuring the cur¬ 
rent and using the conductivity of the sample to 


Electrical double layer 


Stern Diffuse 

layer layer 



Distance from particle surface 


Electrophoretic Light Scattering, Fig. 4 Ions and potentials 
around a charged particle in solution 

calculate the field - a simple application of Ohm’s 
law. If the field is changing during the measurement 
due to polarization effects, then this must also be 
taken into account. Knowing the field and Q enables 
frequency to be converted to EPM. Deriving 
zeta potential or surface charge then depends on 
dealing with the effects of the ion cloud surrounding 
the sample particles. The main parameters used here 
are the particle radius a and the Debye length IIK. 
This quantity is a measure of the thickness of 
the double ionic layer around the particle and is 
dependent on ionic concentration and valency, which 
is typically ^1 nm in 0.05 M NaCl. The product Ka 
is a dimensionless number representative of the 
double layer being either “thin” (with respect to 
particle radius) when Ka ^ 100 or “thick” Ka ^ 1. 
These are referred to as the Smoluchowsky or Huckel 
limits, respectively. There is a formula due to Henry 
which deals with intermediate cases but only applies for 
zeta potential values lower in magnitude than around 
25 mV. More complex approaches, which take into 
account distortion of the double layer around the parti¬ 
cle due to the electric field, have been formulated 
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numerically by O’Brien and White and more recently 
White and Mangelsdorf. A formula due to Loeb has also 
been used to take double-layer effects into account in 
deriving a value for the interfacial charge (Hunter 
(1981) and Hiemenz and Rajagopalan (1997)). The 
distribution of ions around a charged particle and the 
potentials arising are illustrated in Fig. 4. 

Applications 

The first application of ELS in biophysics was by Ware 
and Flygare in 1971-1972 (Ware and Flygare 1971, 
1972) where they reported studies on BSA and fibrin¬ 
ogen. BSA has continued to be a useful model system, 
and recently, Jachimska et al. (2008) reported DLS and 
ELS studies on it and HSA, leading to the derivation of 
protein valency from the EPM data at different salt 
concentrations and pH values. While there is always 
interest in absolute measurements of mobility to 
compare with theory and derive charge estimates, 
many fruitful applications lie in comparative studies 
where changes in EPM arise from binding of multiple 
components or other interfacial changes. Eor example, 
Akagi et al. (2006) studied the binding of a range of 
proteins to PGA nanoparticles correlating size and zeta 
potential in the interactions. Okuda et al. (2006) 
showed that changes in EPM and zeta potential tracked 
with molecular constitution of amino acid dendrimers 
of poly L-lysine and poly L-ornithine. 

Recommendations for the measurement of biomol¬ 
ecules such as proteins in relatively highly conductive 
media are presented by Corbett and Jack (2010). 

Summary 

The arrangement of a typical ELS instrument is 
described and the rationale behind this presented in 
terms of the optical scheme, measurement cells, signal 
processing, and data analysis. The measurement 
process is discussed and typical applications in the 
biophysics area are outlined. 

Cross-References 

► Dynamic Light Scattering 

► Fluorescence Correlation Spectroscopy 


► Nanoparticles for Drug and Gene Delivery 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 


References 

Akagi T, et al. Multifunctional conjugation of proteins on/into 
bio-nanoparticles prepared by amphiphilic polyly-glutamic 
acid). J Biomater Sci Polymer Edn. 2006;17(8):875-92. 

Corbett CW, Jack RO. Measuring protein mobility using modem 
microelectrophoresis. Colloids Serv A. 2010;376(1-3):31-41. 

Cummins HZ, Pike ER, editors. Photon correlation and light 
beating spectroscopy. New York: Plenum; 1973. p. 314. 
ISBN 0-306-35703-8. 

Cummins HZ, Pike ER, editors. Photon correlation spectroscopy 
and velocimetry. New York: Plenum; 1976. p. 260-373. 
ISBN 0-306-35723-2. 

Hiemenz PC, Rajagopalan R. Principles of colloid and surface 
chemistry. New York: CRC Press; 1997. ISBN 0-8247-9397-8. 

Hunter RJ. Zeta potential in colloid science. London: Academic; 
1981. ISBN 0-12-361960-2. 

Jachimska B, Wasilewska M, Adamczyk Z. Characterization of 
globular protein solutions by dynamic light scattering, elec¬ 
trophoretic mobility, and viscosity measurements. Langmuir. 
2008;24(13):6866-72. 

Muller RH. Zetapotential und partikelladung in der laborpraxis. 
Stuttgart: Wissenschaftliche Verlagsgesellschaft mbH; 1995. 

Okuda T, et al. Biodistribution characteristics of amino acid 
dendrimers and their PEGylated derivatives after intravenous 
administration. J Control Release. 2006;114:69-77. 

Ware BR, Flygare WH. The simultaneous measurement of the 
electrophoretic mobility and diffusion coefficient in bovine 
serum albumin solutions by light scattering. Chem Phys Lett. 
1971;12(1):8L 

Ware BR, Flygare WH. Light scattering in mixtures of BSA, 
BSA dimers, and fibrinogen under the influence of 
electricfields. J Colloid Interface Sci. 1972;39:3. 


Ellips and Covol 

Stephen E. Harding 

School of Biosciences, NCMH Laboratory, University 
of Nottingham, Sutton Bonington, Leicestershire, UK 

Introduction 

ELLIPS (Harding et al. 1987, 2005) and COVOL 
(Harding et al. 1997; Jones et al. 1999) are 
pre-compiled sets of FORTRAN algorithms for the rep¬ 
resentation of low-resolution molecular shape in solution 
from hydrodynamic measurements and are available 
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from a Microsoft WINDOWS platform. ELLIPS 1,2,3, 
and 4 are simple to use algorithms for the representation 
of the overall hydrodynamic shape of proteins in solution 
in terms of triaxial ellipsoids(three perpendicular semi¬ 
axes a>b>c and shape characterized by two axial ratios 
a/b, b/c) and biaxial ellipsoids or “ellipsoids of revolu¬ 
tion” where two of the semi-axes are approximated as 
equal: a prolate ellipsoid, has semi-axes a,b,b an oblate 
ellipsoid of semi-axes a,a,b, with a>b in both cases and 
the asymmetry defined by a the axial ratio a/b. COVOL 
is a related algorithm for evaluating the thermodynamic 
nonideality parameter B on the basis of the size, shape, 
and polyelectrolyte properties of a protein in solution. 
Both ELLIPS and COVOL are downloadable from 
the NCMH software link on www.nottingham.ac.uk/ 
ncmh. 

There are two approaches to representing the 
conformation of quasi-rigid proteins in solution. One 
is head modeling (► HYDRO Suite of Computer Pro¬ 
grams for Solution Properties of Rigid Macromole¬ 
cules) whereby a macromolecule or macromolecular 
assembly is approximated as an array of spherical 
beads. Although the hydrodynamic calculations are 
approximate, the models are useful for representing 
low-resolution conformations of macromolecules 
which cannot be reasonably represented as a uniform 
symmetrical shape. Computer programs currently 
available (based on how these spheres interact) are 
HYDRO, SOLPRO, HYDROPRO, HYDROSUB, 
and SOMO (► HYDRO Suite of Computer Programs 
for Solution Properties of Rigid Macromolecules; 
► US-SOMO: Methods for Construction and Hydra¬ 
tion of Macromolecular Hydrodynamic Models). 
There are uniqueness or degeneracy problems: for 
example, one can predict a hydrodynamic parameter 
for a particular complicated model, but there will be 
many other equally complicated models which give the 
same sedimentation coefficient. This type of modeling 
is therefore best for choosing between plausible models 
for a structure, or for refining a close starting estimate 
for a structure from, say, x-ray crystallography and has 
formed the basis of the crystallohydrodynamics proce¬ 
dure for the representation of domain orientation of 
antibodies (Lu et al. 2007). A complementary approach 
to bead modeling is to make no assumptions concerning 
starting estimates and to calculate the shape directly 
from hydrodynamic measurements. This is called the 
“ellipsoid” or “whole body” approach (Harding 1989, 
1995; Serdyuk et al. 2007) so called because the 


investigator instead of approximating the macromole¬ 
cule as an array of spheres approximates the macromol¬ 
ecule instead as a smooth whole regular structure - an 
ellipsoid, or “three dimensional ellipse” characterized 
by three perpendicular semi-axes a> h > c. Of course, 
only simple representations are possible but by 
combining shape parameters together there are no 
hydration or uniqueness problems. The hydrodynamic 
calculations for biaxial and triaxial ellipsoids are 
also exact. 

The simplest of these (which has been used in one 
form or another for well over half a century (Harding 
1995)) is the ellipsoid of revolution in which two 
of the three semi-axes are equal (c = h). Ellipsoids of 
revolution are so called because they are the shapes 
formed by rotating an ellipse of semi-axes, a, h either 
about the major (a) axis to give a prolate ellipsoid (semi¬ 
axes a, h, h) or about the minor (h) axis to give an oblate 
ellipsoid {a, a, h), both defined by the axial ratio (a/h) 
(where a > h). One hydrodynamic measurement can 
uniquely define (alh), after assuming a value for the 
hydration of the molecule; two hydrodynamic measure¬ 
ments are normally sufficient to define (alh) without 
assumptions concerning hydration; a third is occasion¬ 
ally necessary to distinguish whether an oblate ellipsoid 
or prolate ellipsoid is the more appropriate (usually the 
latter for proteins). In the extremes a»h the prolate ^ 
rod and the oblate ^ disc and the case of a = /> is of 
course a sphere. 


E 


ELLIPSPRIME: Universal Shape Functions 

In common with the bead modeling program SOLPRO 
(Harding and Cdlfen 1995) the ELLIPS algorithms all 
use universal shape functions (USFs) (Harding et al. 
1987, 2005; Garcia de la Torre et al. 1997) and these 
can be evaluated from experimental data by a simple 
routine known as ELLIPSPRIME. A Universal shape 
function is a shape parameter which is a function of 
shape alone (and not volume); it makes no odds 
what the size is: a universal shape function will 
have the same value, it will only depend on the 
shape. To measure these USLs experimentally, many 
require knowledge of the time-averaged hydration 5 or 
hydrated volume V (ml) of the particle, others do not 
(Harding et al. 1987, 2005; Serdyuk et al. 2007; 
Garcia de la Torre et al. 1997; Rallison and Harding 
1985; Harding and Colfen 1995) 
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V = v,-M/Na ( 1 ) 

where M is the molecular weight or molar mass 
(g/mol) and is Avogadro’s number (6.02205 x 
10 ^^ mol"^), and Vs is the specific volume (ml/g) of 
the hydrated macromolecule (volume occupied by 
the hydrated macromolecule per unit mass of dry 
macromolecule) or 

V={v + d/p,)-M/NA ( 2 ) 

where v is the partial specific volume (ml/g). Universal 
shape functions requiring experimental knowledge of 
V or 5 include: 

1. The Einstein-Simha viscosity increment, v 

V = [^]MI{NaV) (3) 

V = 2.5 for a sphere independent of its size 

2. The Perrin function, P 

^=(///o)/{l+^/(vPo)}^/' (4) 

where (flfo), the frictional ratio (Harding 1995) is 
related to the sedimentation coefficient ^° 20 ,w by 

(///o) =M{1 - vpJ/(A ,4 • 67tri„Ao,w)- 
(471^4/3vM)'/^ 

Tjo is the viscosity of water at 293.15 K (0.010 P), 
Po is the density of water at 293.15 K 
(0.99823 g/ml), and ks is Boltzmann’s constant 
(1.3807 X 10“'® erg K“'). P = 1 for a sphere. 
An analogous relation exists linking the transla¬ 
tional diffusion coefficient D with {flfo). 

3. The reduced excluded volume, 

«red = u/V = {2Bhf - Z^/2 I)I{NaV) (6) 

where u is the excluded volume (ml), calculated 
using Rallison-Harding theory (Rallison and 
Harding 1985), B is the second thermodynamic 
(or “osmotic pressure”) virial coefficient, from 
osmotic pressure, light scattering or sedimentation 
equilibrium measurements, Z is the valency of the 
macromolecule, measurable by titration (Harding 
et al. 1997; Jones et al. 1999) and I is the ionic 


strength of electrolyte in the solvent (mol/ml), 
“red = 8 for a sphere. 

4. The harmonic mean rotational relaxation time 
ratio: 

■Chho = {ksT/r\oV} ■ (7) 

where i/, (s) is the harmonic mean rotational relax¬ 
ation time, traditionally measured using steady-state 
fluorescence depolarization methods, and Tq the 
corresponding value for a spherical particle of 
the same volume: 

To = ^oV|kBT ( 8 ) 

In earlier representations, a factor of three was 
introduced because the rotational relaxation time 
was referred to on a dielectric dispersion 
basis (compensated for in the equations for 
steady-state anisotropy depolarization) although 
this is no longer necessary (Harding et al. 1987; 
Garcia de la Torre et al. 1997). T/,/To = 1 for 
a sphere. 

5. The reduced electro-optic decay constants 

= {^oy/kaT) ■ 0/ (9) 

where 0 , are the electric birefringence or electric 
dichroism decay constants. For ellipsoids of revo¬ 
lution that are homogeneous, that is, where the 
geometric axis of symmetry coincides with the 
electrical axis, ; = 1. For general ellipsoids that 
are homogeneous, that is, where the geometric 
axes coincide with the electrical axes, i — 2 , 
termed “H-” and for general particles i = 1-5. 
For a sphere, 0;:"‘' = 0.66667. 

Harding and coworkers et al. (Harding et al. 
1987, 2005; Harding 1989, 1995) also give a 
complete list of those USFs NOT requiring knowl¬ 
edge of 5 or y for their experimental measurement. 
Again, we give here only the most popular or 
useful ones: 

6 . The Scheraga-Mandelkern parameter 

P = {N‘/'-no}/{M"/'(i - vpj.ioo'/^} 

= |NA'/V(162007t2)''^^|.{v'/VP} ( 10 ) 
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The p parameter is unfortunately very insensitive 
to shape, and Eq. 10 is used more as an equation of 
consistency, or for measuring M from sedimenta¬ 
tion velocity and viscosity measurements, 
p = 2.1115 X 10® (Harding 1989) for a sphere. 

7. The PI function 

n = {2BM/[r\]} - {zV2/M[ti]} = u,^/v (11) 

with the second term (a good approximation 
of the charge contribution for polyelectrolytes) 
—> 0 at sufficient values of /, and of course = 0 
for uncharged macromolecules or proteins at the 
isoelectric point (Z = 0). H = 3.2 for a sphere 

8 . The Wales-van Holde parameter 

R^kJ[^]=2{l+P^)/v (12) 

where (ml/g) is the concentration dependence 
parameter of the sedimentation coefficient in the 
limiting relation 

‘^20,w — ^ 20,w(l ksC^ (13) 

or 

1A20,w = {1A°20,w}( 1 +^iC). 

Although the theory behind Eq. 13 is less rigorous 
than that for IT (because of the greater complexity 
of “hydrodynamic” as opposed to “thermody¬ 
namic equilibrium” based nonideality), it does 
have a strong experimental basis. To apply in 
this way it is important that charge contributions to 
are absent or if the macromolecule 
is a polyelectrolyte, charge contributions are 
suppressed by working in a solvent of sufficient 
ionic strength. R = 1.6 for a sphere. 

9. The reduced radius of gyration function G 

G=R/- |47tiV,4/(3vM)^/^} (14) 

where Rg is the radius of gyration (cm). Equation 14 
applies if neutron scattering is used to measure Rg. 
If x-ray scattering then the partial specific volume v 
shoud be replaced by Vj 


10. The Lambda function 

A = (rio • [ti] • M)/{Na ■ keTth) 

= v/(t,,/To) 

For spheres, A = 2.5. 

11. Electro-optic delta functions 

§,■ = {6r\jNA ■ keT) [r\]M ■ 0,- = 60,(16) 

(for homogeneous ellipsoids of revolution i = 1 
and for homogeneous triaxial ellipsoids, i = “H-” 
and “-”). For spheres 5, = 2.5. 

ELLIPS1 

Aim: Prediction of axial ratio (alb) (equivalent prolate 
or oblate ellipsoid of revolution) from a user-specified 
value for a shape function. 

Description. ELLIPS 1 is based on simple ellipsoid 
of revolution models (where two of the three axes of 
the ellipsoid are fixed equal to each other); if the 
user types in a value for a shape function from 
sedimentation or other types of hydrodynamic 
measurement, it will return a value for the axial 
ratio of the ellipsoid. The question an experimenter 
wishes to address usually is not “what is the shape 
function for a specified value of the axial ratio a/hl” 
but rather “what is the axial ratio alh for my macro¬ 
molecule specified by my (universal) shape function 
which I have experimentally measured?” Although 
there are exact analytical formulae linking each 
shape function with alh, the reverse is not true: inver¬ 
sion is analytically impossible. The algorithm 
ELLIPS 1 uses the polynomial-based inversion proce¬ 
dure of Harding and Cdlfen (1995) to give alh versus 
the various universal shape functions to an acceptable 
degree of accuracy (i.e., better than the precision of 
the measurement, which is normally no better than 
a few percent). 

Examples. Figure la gives the entry screen and 
Fig. lb some examples for different wheat gliadin 
proteins evaluated from the sedimentation coefficient 
and the P function (Ang et al. 2010). 
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Fig. 1 Representation of the 
shapes of wheat protein 
gliadins using ELLIPS 1 
(Ang et al. 2010) 


a ELLIPSOID INUERSION FORMULAE 
by H. Coelfen, 5/1995. 

Fortran version by S. Harding 6 Z. Aziz, 3/2006. 

Based on Harding, S.E. & Coelfen, H. Inversion formulae 
for ellipsoid of revolution macromolecular shape 
functions. Analytical Biochem. 228 (1995) 131-142. 

For which universal shape function do you seek an (a/b)? 
For P-function press P 
For nu-function press N 
For lambda-function press L 
For pi-function press I 
For R-function press R 
For beta-function press B 
P 

Enter value for P (must be >-1) 

1.2 

(a/b) for Prolate ellipsoid = 4.25 

(a/b) for Oblate ellipsoid = 4.58 

Do you want to plot your ellipsoid? y/n 



ELLIPS2 

Aim. Evaluate the values of all the universal hydrody¬ 
namic shape functions from user-specified axial 
dimensions (a, h, c) or axial ratios {alh, h/c) for the 
macromolecule as modeled hy a general triaxial 
ellipsoid. 

Description. ELLIPS2 is essentially analogous to 
SOLPRO (Garcia de la Torre et al. 1997) in that from 
a given structure (as represented by an array of 
beads in SOLPRO or as a general triaxial ellipsoid 
in ELLIPS2) the complete set of universal shape 
functions is returned. Most of the universal 


shape functions involve one or more of ten differ¬ 
ent elliptic integrals (called alpha 1 ... alpha 10). 
These are solved by quadrature using the Numeri¬ 
cal Algorithms Group (Oxford, UK) routine 
DOIAME (Harding et al. 1987) for a one¬ 
dimensional integration with an infinite upper 
limit and routine DO 1 DAP for a two-dimensional 
integration with finite limits. 

Example. Pigure 2a gives the output data for an 
{alh, hlc) = (1.10, 1.85), based on the crystallographic 
axial dimensions of for Pc’ of a chimeric antibody 
B72.3. Figure 2b shows the corresponding ellipsoid 
shape drawn using the routine ELLIPSDRAW. 
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ELLIPS2 output Data 


(a/b, b/c) = 1.2300 1.5200 

viscosity increment, nu = 2.7290 

Perrin function, p = 1.0269 

corresponding hydration independent functions: 
wales-van Holde, R = 1.5265 

scheraga-Mandelkern, 10*'*'-6xbeta = 2.1173 

Functions based on the 2nd virial coefficient: 
Reduced excluded volume, u_red 8.6713 

corresponding hydration independent function: 

Pi function = 3.1775 

G function (from radius of gyration) = 0.6784 

Reduced electro-optic decay constants: 

Theta+ = 0.1622 

Theta- = 0.1336 

Corresponding hydration independent functions: 

Delta+ = 2.6554 

Delta- = 2.1883 

Gamma+ = 1.3921 

Gamma- = 1.1473 

Harmonic mean rotational relaxational time ratio: 
tau_h/tau_0 = 1.1268 

corresponding hydration-independent functions: 
psi (squire-Himmel) function = 0.9869 

LAMBDA = 2.4219 

Fluorescence anisotropy relaxation time ratios: 

tau_l/tau_0 = 1.0316 

tau_2/tau_0 = 1.1523 

tau_3/tau_0 = 1.2119 

tau_4/tau_0 = 1.2471 

tau_5/tau_0 = 1.0277 

corresponding hydration independent functions: 

lambdaJL = 2.6455 

lambda_2 = 2.3684 

lambda_3 = 2.2517 

larabda_4 = 2.1883 

lambda_5 = 2.6554 

psij. = 1.0163 

psi_2 = 0.9795 

psi_3 = 0.9632 

psi_4 = 0.9541 



a/b = 1.10 b/c = 1.85 


Ellips and Covol, Fig. 2 Values for the Hydrodynamic 
Universal shape functions from ELLIPS2 for an ellipsoidal 
shape protein of axial ratios similar to that of an antibody 
Fab’ fragment with (a/b, b/c) = (1.10, 1.85) (a) output 
from ELLIPS2 (b) 3-D ellipsoidal representation using 
ELLIPSDRAW 


These were then used as the basis for the con¬ 
struction of a surface shell bead model. This pro¬ 
cedure was repeated for Fab and a model for the 
intact immunologically active antibody was then 
constructed (Lu et al. 2007). 

ELLIPS3 

Aim. Perform the reverse of ELLIPS2 by evaluating 
the triaxial shape of a macromolecule {a/h, h/c) using 
two possible combinations of universal shape 
functions. 

Description. Whereas an {alh, hic) specifies unique 
values for all the hydrodynamic shape functions, the 
reverse is unfortunately not true: measurement of P, v, 
R, A, etc., does not uniquely fix {alh, hIc) but rather 
gives a line solution of possible values. A graphical 
combination of the line solutions for two of these shape 
functions will in principle provide a unique solution for 
{alh, hic). The main criteria for selection are (1) their 
ease of measurement, (2) their sensitivity to shape and 
insensitivity to experimental error, (3) the two give an 
intersection as orthogonal as possible (4) the lack of 
requirement of an estimate for the hydration for their 
experimental measurement. ELLIPS3 currently offers 
two such combinations: the A function combined with 
the R function and the 11 function combined with the 
G function. The former combination satisfies all four 
criteria: A requires the experimental measurement 
of the harmonic mean rotational relaxation time 
(from, e.g., steady-state fluorescence measurements, 
no complicated resolution of exponentials is required), 
with the intrinsic viscosity. With regard to the 
latter parameter, the traditional U-tube viscometers 
require relatively large quantities of material - the 
new generation pressure imbalance methods now 
make this attractive for those materials available 
in low quantities (► Intrinsic Viscosity). The Wales- 
van Holde parameter R is measured from the ratio of 
the concentration dependence sedimentation term to 
the intrinsic viscosity. The latter combination also 
involves a hydration-independent function, namely, 
n (from measurement of the intrinsic viscosity and 
the second thermodynamic virial coefficient B). 
G can also be measured without experimental mea¬ 
surement of hydration provided that the radius of gyra¬ 
tion Rg is measured using neutron scattering 
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R - Lambda Intersection Plot for Neurophysin_Dimers 



a/b 

Ellips and Covol, Fig. 3 Evaluation of the triaxial ellipsoid 
axial ratios (a/b, b/c) of dimers of neurophysin using ELLIPS3. 
Graphical intersection of the Universal shape functions A = 2.53 
and R = 1.07 (allowing for experimental error), resulting in 
an (a/b, b/c) of ~ (2.5, 3). A comparable evaluation for the 
monomers alone gives (a/b, b/c) ~ (4, 1) meaning the dimeriza¬ 
tion is not end-to-end but side-by-side (Harding et al. 2005; 
Harding and Rowe 1982) 

(► Neutron Scattering of Membranes). ELLIPS3 uses 
as its basis the function calculation routine of ELLIPS2 
except that a whole array of such values are evaluated 
in the {aJh, hfc) plane (a matrix of 40 x 40 values). 
A Contour plotting routine (RGCNTS from the 
Simpleplot Library) interpolates between these matrix 
points and can plot the IT, G, A, and R functions (or 
any other of the universal shape functions if the 
programmer so decides) in the {alh, h/c) plane. 

Example. Eigure 3 shows an example of the deter¬ 
mination of the triaxial shape of neurophysin dimers in 
solution (Harding and Rowe 1982). 

ELLIPS4 

Aim. Evaluate the triaxial shape of a macromolecule 
{alh, h/c) from electro-optic-decay-based universal 
shape functions combined with other hydrodynamic 
data. 

Description. Rotational hydrodynamic shape 
functions, based around rotational diffusion measure¬ 
ments, are attractive for determining the shapes 
of macromolecules in solution since they are generally 
more sensitive functions of shape compared to other 
shape functions. This sensitivity comes however 
at a price because they are generally more difficult 


to measure. A lot of the difficulty centers around 
resolution of multi-exponential decay functions. 
Electro-optic measurements (► Electro-optics and 
Macromolecular Hydrodynamics) are more attractive 
than time-resolved fluorescence depolarization anisot¬ 
ropy measurements in the sense that for homogeneous 
triaxial ellipsoids at least, there are only two exponen¬ 
tial terms to resolve (the decay constants or reciprocal 
relaxation times 0+ and 0_) as opposed to five (ti - 15): 

A« = exp(—6 0+f) -f A'_ exp(—6 0_t) (17) 

where A« is the birefringence or dichroism (often 
expressed as “optical retardation” in degrees) at time 
t after the aligning electric held has been switched off. 
A practical problem with electro-optic decay methods 
is the potential local heating effects from the high 
electric helds used, especially if the experiments 
are conducted in solutions of high ionic strength: the 
investigator is advised to consult an article by Porschke 
and Obst (Porschke and Obst 1991) describing how 
these effects can he minimized. After eliminating 
hydration (via, e.g., combination with [p]) to give the 
universal hydration independent shape functions 5+ 
and 5_ (eq. 11) the triaxial shape {a/h, h/c) is found 
from their graphical combination. Resolution however 
of even two exponential terms is not easy, particularly 
for globular macromolecules where 0+ and 0_ are 
similar, no matter what form of mathematical 
deconvolution is applied, whether it be nonlinear 
least squares or more refined types of analysis. In our 
hands, a more reliable method of extraction is to use 
another hydrodynamic function as a constraining 
parameter in the analysis of the electro-optic decay 
data; in this way the problem is reduced to 
one of four variables (A'+, 0+, A'_, 0_) to one of three 
(A'+, A'_, alh) since aJh will specify, by the 
constraining function a unique value for h/c (and 
hence 0+, 0_). Eor an example of the application 
of this routine the reader is referred to Harding et al. 
(2005) and Harding et al. (1987). 

COVOL 

Aim. Evaluate the second “osmotic pressure” thermo¬ 
dynamic virial coefficient B (ml.mol.g”^) - a measure 
of the nonideality of a solution at finite concentration of 
a protein (Harding et al. 1997; Jones et al. 1999) - from 
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Ellips and Covol, 

Fig. 4 Evaluation of the 
second virial coefficient B for 
ovalbumin in dilute solution 
using COVOL. (a) Input, 

(b) Output. (Based on data 
from reference Jones et al. 
1999) 


a Name of molecule 
ovalbumin 

you need to enter the data as either (a, b, c) or (a/b, b/c) 

Choose 1 (a, b, c) or 2 (a/b, b/c) 

2 

Enter ellipsoidal axial ratios a/b and b/c 
1.87 1.08 

Reduced excluded volume, ured == a.89961E*01 

Vou will need to enter either (1) the swollen specific volume, vs (nl/g); or 
(2) the hydration (g/g), solvent density and partial specific volume, vbar 
Which method (1 or 2)? 

2 

Enter hydration delta (g/g), solvent density rho (g/ml) and vbar (ml/g) 

0.45 1 0.748 

Enter molar mass, M (g/mol) 

4S000 

Molar covolume, U = O.agaggE+Obml/mol 

Do you want the exclusion volume contribution to the virial 
coefficient, B(ex)? 

y 

Bex = 0.11975E-03ml.mol/g»»2 

Is there any contribution from the polyelectrolyte behaviour? 

y 

Enter valency on macromolecule, Z 
-14 

Enter ionic strength of solvent, I (mol/1) 

0.1 


COUOL Output Data 


Name of molecule: oualbumin 
(a/b, b/c) = 1.8700 1.0800 

Reduced excluded uolume (sphere = 8), ured = 8.9961 

Excluded uolume, u = 0.80532E-18 ml 

us = 1.1980ml/g 

Molar mass, M = B.45000E«05 g/mol 

Molar couolume, U = 0.48498E'r06 ml/mol 

B(ex) = 8.11975E-83 ml.mol/g««2 

B(z) = 0.1088BE- 03 ml.mol/g«*2 

B = B(ex) + B(z) = 0.2285E-B3 ml.mol/g»»2 

BM = e.1B285E*B2 ml/g 

BM«2 = B.46281E«03 litre/mol 

= B.46281E-03 uM»»-1 


user-supplied values of the shape of the molecule (a/h, 
h/c) - for example, from the crystallographic dimen¬ 
sions, the molar mass, hydration, valency of the protein, 
and ionic strength of the supporting electrolyte. It is 
particularly useful for the correction for non-ideality in 
the analysis of self-associating or other interacting sys¬ 
tems which are concentration dependent. 

Description. It is a modified form of ELLIPS2 focus¬ 
ing on the reduced excluded volume (evaluated using 
the Rallison-Harding equation (Ang et al. 2010)), based 
on user-entered values for the dimensions of the mole¬ 
cule (from, e.g., x-ray crystallography) and using this 
together with the molar mass M and volume 
V information to determine the exclusion volume 
contribution to the second thermodynamic virial 
coefficient. From further user-inputted values for the 
charge or valency, Z of the protein, and a value for the 
ionic strength I of the solvent, the charge contribution 
to the second virial coefficient, B, with B = 5ex + ^z- The 


method is valid only for dilute solutions of proteins 
where more than two particle interactions are not 
significant, and is not valid for highly irregularly 
shaped proteins such as immunoglobulins: for the latter 
an approximate evaluation based on bead models is 
recommended (Garcia de la Torre et al. 1999). 

Example. Sample output is given for the calculation 
of B for an ovalbumin solution (Fig. 4). The ellipsoidal 
dimensions had been evaluated using the algorithm 
FFLIPSF (Taylor et al. 1983) which evaluates the 
surface shape for a crystallographically determined 
protein structure. 

Summary 

A set of easy to download and easy to use algorithms 
can be downloaded from the NCMH web page for 
“whole body” or ellipsoidal representations of protein 
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shape in solution, based on Universal shape functions 
or USFs which are so called since they are functions 
only of the shapes of the proteins and not on their 
sizes or volumes. The USFs can be evaluated from 
hydrodynamic parameters such as the sedimentation 
coefficient, intrinsic viscosity, rotational relaxation 
times using the EXCEL routine ELLIPSPRIME. 
The representations can be in terms of prolate or oblate 
ellipsoids of revolution (ELLIPS 1), or general triaxial 
ellipsoids (ELLIPS2-4), and the ellipsoid shapes can 
be plotted in three dimension using ELLIPSDRAW. 
Although the hydrodynamic calculations for ellipsoids 
are exact, the shape representations are not applicable 
to highly asymmetric structures such as immunoglob¬ 
ulins where bead models are more appropriate 
(► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules; ► US-SOMO: 
Methods for Construction and Hydration of Macromo- 
lecular Hydrodynamic Models). And they are not 
applicable to linear flexible molecules such as poly¬ 
saccharides and large glycoconjugates where “Confor¬ 
mation Zoning” (Pavlov et al. 1997) 
(► Hydrodynamics of Macromolecules: Conformation 
Zoning for General Macromolecules; ► Polysaccha¬ 
rides: Biophysical Properties) and persistence length 
evaluation routines such as HYDEIT (► HYDRO Suite 
of Computer Programs for Solution Properties of Rigid 
Macromolecules; ► Polysaccharides: Biophysical 
Properties) are appropriate. An adapted 
form of ELLIPS2 known as COVOL - based on 
Rallison-Harding exclusion volume theory - can be 
used to evaluate the second thermodynamic virial 
coefficient B based on shape, size, and molecular 
charge information. 

Cross-References 

► Dynamic Light Scattering 

► Electro-optics and Macromolecular Hydrodynamics 

► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules 

► Intrinsic Viscosity 

► Neutron Scattering of Membranes 

► Polysaccharides: Biophysical Properties 

► US-SOMO: Methods for Construction and 
Hydration of Macromolecular Hydrodynamic 
Models 

► X-Ray Scattering of Lipid Membranes 


References 

Ang S, Kogulanathan J, Morris GA, Kok MS, Shewry PR, 
Tatham AS, Adams GG, Rowe AJ, Harding SE. Structure 
and heterogeneity of gliadin: a hydrodynamic evaluation. 
Eur Biophys J. 2010;39:255-61. 

Garcia de la Torre J, Carrasco B, Harding SE. SOLPRO: 
theory and computer program for the prediction of SOLution 
PROperties of rigid macromolecules and bioparticles. Eur 
Biophys J. 1997;25:361-72. 

Garcia de la Torre J, Harding SE, Carrasco B. Calculation of 
NMR relaxation, covolume, and scattering-related properties 
of bead models using the SOLPRO computer program. Eur 
Biophys J. 1999;28:119-32. 

Harding SE. Modelling the gross conformation of assemblies 
using hydrodynamics. In: Harding SE, Rowe AJ, editors. 
Dynamic Properties of Biomolecular Assemblies. Cam¬ 
bridge: Royal Society of Chemistry; 1989. p. 32-56. 

Harding SE. On the hydrodynamic analysis of macromolecular 
conformation. Biophys Chem. 1995;55:69-93. 

Harding SE, Colfen H. Inversion formulae for ellipsoid of 
revolution macromolecular shape functions. Anal Biochem. 
1995;228:131M2. 

Harding SE, Rowe AJ. Modelling biological macromolecules 
in solution III: the A - R intersection method for triaxial 
ellipsoids. Int J Biol Macromol. 1982;4:357-61. 

Harding SE, Colfen H, Horton JC. The ELLIPS suite of 
macromolecular conformation algorithms. Eur Biophys J. 
1987;25:347-60. 

Harding SE, Horton JC, Winzor DJ. COVOL: an answer to your 
thermodynamic non-ideality problems? Biochem Soc Trans. 
1997;26:737M0. 

Harding SE, Colfen H, Aziz Z. The ELLIPS suite of whole-body 
protein conformation algorithms for Microsoft Windows. In: 
Scott DJ, Harding SE, Rowe AJ, editors. Analytical Ultra¬ 
centrifugation. Techniques and Methods. Cambridge: Royal 
Society of Chemistry; 2005. p. 460-83. 

Jones S, Thornton JM, Winzor DJ, Harding SE, Horton JC. 
COVOL: an interactive program for evaluating second virial 
coefficients from the triaxial shape or dimensions of rigid 
macromolecules. Biophys J. 1999;76:2432-8. 

Lu Y, Harding SE, Michaelsen TE, Longman E, Davis KG, Ortega 
A, Grossmann JG, Sandlie I, Garcia de la Torre J. Solution 
conformation of wild type and mutant IgG3 and IgG4 immu¬ 
noglobulins using Crystallohydrodynamics: possible implica¬ 
tions for complement activation. Biophys J. 2007;93:3733^t4. 

Pavlov GM, Rowe AJ, Harding SE. Conformation zoning of 
large molecules using the analytical ultracentrifuge. Trends 
Anal Chem. 1997;16:401M. 

Porschke D, Obst A. An electric field jump apparatus with ns time 
resolution forelectrooptical measurements at physiological 
salt concentrations. Rev Sci Insh'um. 1991;62:818-20. 

Rallison JM, Harding SE. Excluded volumes for pairs of triaxial 
ellipsoids at dominant Brownian motion. J Colloid Interface 
Sci. 1985;103:284-9. 

Serdyuk IN, Zaccai N, Zaccai G. Methods in Molecular Biophys¬ 
ics: Stmcture, Dynamics, Function. Cambridge: Cambridge 
University Press; 2007. 

Taylor WR, Thornton JM, Tumell WG. An ellipsoidal approx¬ 
imation of protein shape. J Mol Graph. 1983;1:30-8. 



Empirical Valence Bond Methods 


663 


E 


Elongation 

► RNA Polymerases and Transcription 


► Redox Potential 


Empirical Valence Bond Methods 

Ewa Chudyk 

Centre for Computational Chemistry, School of 
Chemistry, University of Bristol, Bristol, UK 

Synonyms 

EVB methods 

Definition 

Empirical force field-hased methods for simulations of 
chemical reactions, used particularly for simulations of 
enzyme-catalyzed reactions. 

Basic Characteristics 
Introduction 

Warshel and Weiss first formulated the empirical 
valence bond method in the early 1980s to examine 
chemical reactivity in solutions and condensed phases 
(Warshel and Weiss 1980; Aqvist and Warshel 1993; 
Warshel 1991), building on the valence bond model of 
chemical bonding. The EVB method makes it possible 
to calculate free energy profiles of chemical reactions 
with respectively low computational cost even for large 
systems. A mixture of resonance structures, each 
representing an important state (e.g., reactant, products, 
or intermediates), represents the reacting system. This 
is mathematically described by the secular equation: 

t^EVB^g — 


The Hamiltonian matrix elements (Hevb) consist of 
potential energies of the diabatic states (Has) and 
mixing terms (//y = Hji): 

^'i = ’^'cas + Q) + 2) <?) 

+ ^Solventi''^ l) 

Ha describes the internal energy of resonance form 
;, dependent on the atomic coordinates and charges 
of solute {R and Q) and solvent (/■ and q). The 
represents the gas-phase energy of the f-th resonance 
form, where all the fragments are taken to be at infinity. 
^intrai^’ Q) st^^ds for the intermolecular potential of 
the solute, U‘i^,^i.{R,Q,r,q) is the energy of interac¬ 
tions between solute and solvent, and 
represents the potential energy of the solvent system. 
The Hij mixing term is assumed to be the same in the 
gas phase, but more importantly in solution and the 
protein. Different forms could be used for the mixing 
term, e.g., a constant value, exponential, distributed 
Gaussian function, or others. The more sophisticated 
Hij (exponential or distributed Gaussian) formulas can 
make it possible to reproduce the shape of an energy 
more accurately, if that is required. The calibration of 
the EVB surface can be based on experimental or ah 
initio calculation data, and is usually done only once 
per study. The ground state energy is obtained by 
diagonalizing the EVB Hamiltonian {Hevb)- The acti¬ 
vation free energies are obtained by changing the sys¬ 
tem adiabatically from one diabatic state to another, 
e.g., in free energy perturbation molecular dynamics 
simulations using an energy gap reaction coordinate 
(Kamerlin and Warshel 2010). 

Applications 

EVB and related methods have been used to predict 
properties of many different systems, from reactions of 
a few atom reactions in the gas phase (Schlegel and 
Sonnenberg 2006), to reactions in solution (Glowacki 
et al. 2011; Warshel 1991) to large biological systems 
(Aqvist and Warshel 1993). 

In enzymes, the method has been used to analyze 
the origins of catalysis, as well as, e.g., predicting the 
effects of site-directed mutagenesis. An example of 
an EVB application is a study on an important 
model enzyme in ► computational enzymology, 
► chorismate mutase. In this study, various possibly 
catalytic effects were examined, including 
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electrostatic transition state stabilization, steric 
strain, and conformational effects (Strajbl et al. 
2003). It was demonstrated that the enzyme functions 
by transition state stabilization and the electrostatic 
contribution is the major contribution to the reduction 
of the activation energy. In a further EVB study on 
this enzyme (Roca et al. 2009), the catalytic contribu¬ 
tions of the most important active site residues were 
evaluated for the monomeric, dimeric, and trimeric 
forms of the enzyme. The EVB method turned out to 
be able to accurately predict experimentally observed 
mutagenesis effects, showing its value as a tool for 
quantitative ranking in the final stages of computer- 
aided enzyme design. 

Software 

EVB and similar methods are available in a range of 
programs, e.g., using different force fields. The first 
implementation of the method for biomolecular simu¬ 
lations was in the Molaris program package with the 
ENZYMIX force field; and the energy gap is used as 
the reaction coordinate for modeling enzymatic reac¬ 
tions. EVB methods are also available with AMBER 
and CHARMM simulation packages, and other 
programs. 

Cross-References 

► Carboxypeptidase A - Computational Studies 

► Chorismate Mutase - Computational Studies 

► ONIOM 

► QM/MM Methods 
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ENAC, Degenerins, ASICs, and Related 
Channels 

Eric Lingueglia 

Institute for Molecular and Cellular Pharmacology, 
University of Nice-Sophia Antipolis UMR7275, 
Sophia Antipolis, Valbonne, France 

Synonyms 

Amiloride-sensitive cation channels; Non-voltage- 
gated cation channels; Sodium channels 

Definition 

Amiloride-sensitive, non-voltage-gated cation channels 
form a family of ion channels present from flies and 
worms to mammals, which participate in diverse biolog¬ 
ical functions most of them related to sensory perception. 

Introduction 

The epithelial Na"^ channel/degenerin/acid-sensing ion 
channel (ENaC/DEG/ASIC) gene family forms a large 
family of cation channels from flies and worms to 
mammals with very diverse modes of activation and 
physiological functions often related to sensory per¬ 
ception (Kellenberger and Schild 2002) (Fig. 1 and 
Table 1). So far no member has been identified in 
prokaryotes. The members that have been functionally 
characterized form voltage-independent ion channels 
that are mainly Na"^ selective and sensitive to the 
diuretic amiloride. 

All members of the ENaC/DEG/ASIC family share 
the same overall structure with two hydrophobic trans¬ 
membrane domains flanking a large disulphide-rich 
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ENAC, Degenerins, ASICs, and Related Channels, Fig. 1 Dendrogram of the ENaC/DEG/ASIC family of cation channels 


extracellular loop representing more than 50% of the 
protein, and short intracellular N- and C-termini 
(Fig. 2). The tridimensional structure (► X-Ray Diffrac¬ 
tion and Crystallography of Oligosaccharides and Poly¬ 
saccharides) of Acid-Sensing Ion Channel 1 (ASICl), 
a proton-activated channel of the ENaC/DEG/ASIC fam¬ 
ily, shows the association of three subunits to form 
a trimer (Jasti et al. 2007). This organization is probably 
shared by other members of the family. 

Epithelial Na^ Channel (ENaC) 

The epithelial Na"^ channel (ENaC) is involved in Na"^ 
transport across epithelia and plays a critical role in 
Na"^ homeostasis (Schild 2010). ENaC is present in the 
apical membranes of sodium-absorbing epithelia like 
distal nephron and colon, lung, sweat, and salivary 


ducts. It is the rate-limiting step for sodium absorption 
in these epithelia and plays therefore a critical role in 
the maintenance of body sodium balance. ENaC is 
made of three homologous subunits (a, (3, and y), 
which assemble together to form a constitutively 
open, highly Na'^-selective (PNa+/PK+ > 100), 
amiloride-sensitive (IC50 ^0.1 pM) channel 
(Fig. 3). Amiloride acts as a charmel pore blocker. An 
additional subunit, §ENaC, has been detected in 
a range of human tissues (highest expression levels in 
testis, ovary, pancreas, and brain) but seems to be 
absent in rodents, and behaves like a aENaC-like 
subunit, which can associate with pENaC and yENaC 
to generate a large amiloride-sensitive constitutively 
active channel. The aENaC (or 5ENaC) subunit is 
mandatory for channel activity (Canessa et al. 1993; 
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ENAC, Degenerins, ASICs, and Related Channels, 
Fig. 2 Typical structure of ENaC/DEG/ASIC family members, 
based on the 3D stmcture of chicken ASIC la. (a) Thi'ee subunits 
assemble together to form a functional channel that predominantly 
conducts Na"^ ions. Each subunit has two hydrophobic 


transmembrane regions flanking a large extracellular loop, (b) At 
the atomic level, a subunit looks like a clenched hand, which can 
be further divided into finger, thumb, palm, knuckle, [l-ball, and 
wrist in the extracellular domain, and forearm constituted by the 
two pore-forming transmembrane domains TMl and TM2 


Lingueglia et al. 1993), whereas the P and y are 
necessary for maximal channel expression and activity 
at the cell surface. ENaC expression has also been 
reported in a number of other tissues including skin, 
endothelial cells, vascular smooth muscle cells, and 
neurons where its physiological role remains poorly 
characterized. 

ENaC in the distal part of the kidney plays a key role 
in the fine adaptation of urinary Na"^ excretion in order 
to precisely balance the daily intake of salt. The control 
of this Na"^ balance is under the control of aldosterone 
and vasopressin and is essential for the maintenance of 
the extracellular fluid volume and blood pressure. In the 
lung, ENaC controls the composition and quantity of 
airway surface liquid, and defective ENaC regulation 
may contribute to lung disease in cystic fibrosis. 

Mechanisms that control ENaC expression at the 
apical membrane of epithelia regulate in large part the 
epithelial Na"^ absorption. The number of active ENaC 
channels at the cell surface is tightly controlled by aldo¬ 
sterone and vasopressin. Aldosterone increases both the 
expression and the apical membrane localization of the 
channel. Vasopressin also increases the density of ENaC 
channel at the cell surface. The Nedd4-2 protein- 
ubiquitin ligase is a critical convergence point for the 
regulation of ENaC at the cell surface. Nedd4-2 binds to 
PY motif, a proline-rich domain found at the C-termini 
of PENaC and yENaC and promotes ubiquitylation 


(► Ubiquitinylation) of ENaC subunits. The Ser/Thr 
kinase SGK-1 (serum- and glucocorticoid-regulated 
kinase 1) is upregulated by aldosterone and phosphory- 
lates Nedd4-2, which decreases interaction and 
ubiquitylation of ENaC subunits and increases ENaC 
level and activity at the cell surface. However, the sig¬ 
naling pathway involving SGKl and Nedd4-2 does not 
seem to be absolutely required for aldosterone-mediated 
control of ENaC in vivo. ENaC was also found to be 
stimulated in heterologous expression systems by solu¬ 
ble proteases such as trypsin, chymotrypsin, elastase, 
thrombin, or kallikrein, as well as membrane-bound 
proteases like CAP-1 (channel-activating protease-1), 
CAP-2, CAP-3, and furin. 

Deletions in the cytosolic C-terminus of PENaC and 
yENaC have been associated with Liddle’s syndrome, 
a rare autosomal dominant form of salt-sensitive hyper¬ 
tension. The mutations lead to both an increased number 
of active channels at the cell surface and a higher prob¬ 
ability for the channel to remain open (gain-of-function). 
Patients with a systemic form of Pseudohypoaldos- 
teronism type 1 (PHA-1), an autosomal recessive salt 
wasting disorder, have mutations in a, P, and yENaC 
subunits leading to channel loss-of-function. 

Acid-Sensing Ion Channels (ASICs) 

Acid-sensing ion channels (ASICs) form a family of 
voltage-insensitive depolarizing cation channels that 
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ENAC, Degenerins, ASICs, and Related Channels, 
Fig. 3 Typical currents associated with ENaC/DEG/ASIC 
channels. Epithelial Na"^ channel (a) and mouse BLINaC (b) 
give rise to constitutive currents blocked by amiloride. Acid¬ 
sensing ion channels (c) are activated by extracellular protons 
with a transient inward current that desensitizes within seconds 
(left panel), except for ASIC3 that has an additional sustained 
component {rightpanel). FaNaC and HyNaC (d) form RFamide 
peptide-gated channels in Molluscs and Cnidarians, respec¬ 
tively, and peptide application elicits a fast and partially 
desensitizing response. Gain-of-function mutants (GOF) of the 

are activated by extracellular acidification (Waldmann 
et al. 1997). They form effective proton sensors in both 
central neurons and peripheral sensory neurons 
(Lingueglia and Lazdunski 2009). 


MEC-4 and MEC-10 degenerins expressed together (e, left 
panel) and Drosophila RPK/dGNaCl (f) generate constitu- 
tively activated, amiloride-sensitive Na"^ currents. Native 
MEC-4-rMEC-lO currents (e, right panel) have been recorded 
in vivo from C. elegans touch receptor neurons upon force 
application. Both the application and withdrawal of mechanical 
stimuli evoked rapidly adapting inward currents. All traces were 
recorded after heterologous expression in Xenopus laevis 
oocytes except for ASICs (heterologous expression in mamma¬ 
lian cell line) and for native MEC-4 -r MEC-10 (e, right panel) 
recorded from touch neurons 

Four different genes (ASIC 1-4) encoding at least 
six isoforms have been described to date in mammals. 
They are called ASICl (ASIC la and ASIC lb variants) 
(► Alternative Splicing), ASIC2 (ASIC2a and ASIC2b 
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variants) (► Alternative Splicing), ASIC3, and ASIC4. 
ASICs have also been characterized from ascidian, 
toadhsh, lamprey, shark, and zebra hsh. 

The atomic-scale structure of chicken ASICl shows 
a homotrimer with a chalice-shaped structure and 
a bound chloride ion (Fig. 2). 

ASICs are activated by extracellular protons, except 
ASIC2b and ASIC4 that can associate with other ASICs 
to modulate their properties and/or their expression. 
ASICs form either homo- or hetero-trimeric channels, 
giving rise to a wide range of currents with different 
sensitivity to external H"^, kinetics, and pharmacology, 
which accounts for the large diversity of native ASIC- 
like currents observed in neurons. The pHO.5 for acti¬ 
vation ranges from 4.0 to 6.8 with activation thresholds 
close to pH 7.0 for ASICl and ASIC3. They show steep 
pH dependence, making them extremely sensitive 
sensors of extracellular protons. A rapid drop of extra¬ 
cellular pH triggers a transient inward current 
that desensitizes rapidly (within seconds). 
ASIC3-containing charmels have, in addition to the 
transient peak current, a slowly activating, sustained 
component that does not inactivate as long as the pH 
remains acidic, producing a persistent depolarization. 

ASICs predominantly conduct Na"^ (gNa ^ 10-15 
pS). ASIC la is also permeable to Ca^"^, but direct 
permeation of Ca^"^ through ASIC la seems to contribute 
only modestly to the intracellular Ca^"^ rise in neurons 
upon activation of ASIC la, and much of the increase in 
free Ca^"^ could be mediated by indirect secondary mech¬ 
anisms. The transient inward current is always Na"^ selec¬ 
tive, but the sustained current associated with ASIC3- 
containing channel can be less selective for Na"^ over K"^. 

ASICs are nonselectively blocked by amiloride in 
the micromolar concentration range (pore blocker). 
The amidine A-317567 is a more specihc yet 
nondiscriminant inhibitor of ASICs (IC50 between 2 
and 30 pM on native ASIC currents in DRG neurons). 
ASIC la and ASIC3 are also directly inhibited by ther¬ 
apeutic concentrations of nonsteroidal anti¬ 
inflammatory drugs (NSAIDs) (IC50 ^ 92-350 pM). 
ASIC channels are inhibited by a variety of heavy 
metal ions (Gd^’^, Pb^"^, Ni^"^, Cd^"^, Cu^"^) and bivalent 
cations (Ca^"^, Zn^"^). ASICs are regulated by extracel¬ 
lular Ca^"^ with a dual inhibitory and stimulatory effect 
and Ca^"^ binds to the outer mouth of the ion pore. 
Displacement during acidification of the Ca^"^ ions 
bound to the channel in the closed state seems to be 
an important feature of ASIC gating. 


The spider toxin Psalmotoxin 1 (PcTxl) 
blocks ASIC la homomeric channel with an IC50 
of ^ 0.9 nM. The sea anemone toxin APETx2 
blocks homomeric ASIC3 with an IC50 of ^ 63 nM 
as well as ASIC3-containing heteromeric channels 
(IC50 ^ 0.1-2 pM). PcTxl is a gating modiher that 
inhibits the channel by increasing the affinity for 
protons and shifting most channels into the inactivated 
state at pH 7.4. 

FMRFamide and structurally related peptides 
(e.g., FRRFamide and neuropeptide FF) potentiate 
H’^-gated currents through direct binding to the chan¬ 
nel (increase of the peak current amplitude and/or 
slowdown of the inactivation rate) of heterologously 
expressed ASICl and ASIC3 (EC50 ~ 10-50 pM; 
threshold ~ 1 pM; lower potency on ASIC la versus 
ASIC3), but not ASIC2a. The modulation of ASICs 
by REamide-related neuropeptides requires 
EMREamide addition at pH 7.4, i.e., when the chan¬ 
nel is closed. Dynorphin A and big dynorphin, two 
endogenous opioid neuropeptides, also directly inter¬ 
act with ASIC la to enhance channel activity by 
preventing its steady-state desensitization. Einally, 
GMQ (2-guanidine-4-methylquinazoline) forms 
a nonproton ligand of ASIC3 that potentiates the 
sustained component of low pH-gated current and 
allows activation of the channel at pH 7.4. 

A combination of genetic and pharmacologic 
approaches has revealed the implication of ASIC 
channels in an increasing number of physiological 
and pathophysiological processes, most of them asso¬ 
ciated with extracellular pH fluctuations. However, 
the possibility of the existence of other, unknown, 
stimuli besides H"^ cannot be ruled out. In the central 
nervous system, ASIC la has been involved in synap¬ 
tic plasticity, learning, memory, fear, depression, 
visual transduction, seizure termination, neuronal 
degeneration (Sluka et al. 2009), as well as pain 
modulation through either central sensitization in 
the spinal cord or activation of the opiate system. In 
the peripheral nervous system, ASIC3 has been 
involved in nociception with an important role in 
inflammatory pain (Deval et al. 2010), and ASIC2 
and ASIC3 have been involved in mechanosensation 
and mechanonociception, notably in the gastrointes¬ 
tinal tract. ASICs therefore emerge as potential inter¬ 
esting therapeutic targets in the management of 
psychiatric disorders, stroke, neurodegenerative dis¬ 
eases, and pain. 
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Mammalian BLINaC 

BLINaC (or INaC in human) forms a sodium channel 
predominantly expressed in mammalian non-neuronal 
tissues including liver (and hepatocytes), intestine, 
kidney, and lung (in mice), and to a lower extent 
brain (Sakai et al. 1999). Rat BLINaC is inhibited by 
micromolar concentrations of extracellular Ca^"^, 
which lead to very low activity in physiological con¬ 
ditions. Mouse BLINaC on the contrary is much less 
sensitive to extracellular Ca^"^ and is constitutively 
active in physiological conditions. Interestingly, 
a single amino acid (alanine 387) determines the dif¬ 
ferent apparent affinities for Ca^"^ and is amiloride of 
rat and mouse BLINaC. The rat BLINaC current acti¬ 
vated by decreasing the extracellular Ca^"^ concentra¬ 
tion is Na"^ selective but poorly amiloride sensitive 
(IC50 ^ 5mM). The constitutive mouse BLINaC cur¬ 
rent is Na"^ selective and amiloride sensitive 
(IC50 - 7 pM). 

BLINaC can also be activated by gain-of-function 
mutations just before the second tansmembrane 
domain (the so-called DEG mutation, see below) 
that lead to the expression of large amiloride-sensitive 
(IC50 - L31pM) Na-'-selective (PNa+/PK+ 10) 
currents. 

Phylogenetic analysis associates BLINaC/INaC with 
ASICs and FaNaC (Fig. 1) suggesting that rat BLINaC/ 
human INaC could be gated by a yet unknown extracel¬ 
lular ligand, but its mode of activation remains largely 
unknown. Inversely, mouse BLINaC forms 
a constitutively open channel similar to ENaC in phys¬ 
iological conditions. Thus, profound species differences 
exist for BLINaC regarding the sensitivity to Ca^"^ inhi¬ 
bition and as a consequence the constitutive activity of 
the channel, questioning the respective physiological 
roles of this channel in rat and human on one side, and 
mouse on the other side. 

Caenorhabditis elegans Degenerins 

The C. elegans genome contains at least 28 members 
of the ENaC/DEG/ASIC family, and only 10 have 
been genetically and molecularly characterized 
(MEC-4, MEC-10, UNC-105, ACD-1, DEG-1, 
DEGT-1, UNC-8, DEL-1, asic-1, FLR-1) (Arnadottir 
and Chalfie 2010). 

MEC-4 and MEC-10 form together a 
mechanotransducing sodium channel 

(► Mechanosensitivity of Ion Channels) blocked by 
amiloride and expressed in the touch receptor neurons, 


which can be mutated to produce animals insensitive to 
gentle touch (MEC-4 and MEC-10 have been indeed 
identified by genetic screens to identify mutants that are 
defective in mechanosensation). MEC-10 does not pro¬ 
duce functional channels without MEC-4. MEC-4, 
MEC-10, and related proteins in C. elegans are named 
“degenerins” after the neuronal degeneration phenotype 
caused by gain-of-function mutations (DEG mutations 
just before the second tansmembrane domain). The 
MEC-4 and MEC-10 proteins are not the only compo¬ 
nents of the mechano-transduction complex, where at 
least nine proteins have been involved. MEC-4 and 
MEC-10 form the core ion channel, together with the 
paraoxonase-like protein MEC-6. Other accessory pro¬ 
teins are present on the intra- and extracellular part of 
the membrane that tether the ion channel and have been 
proposed to be important for opening of the channel 
upon membrane movement, like the prohibitin-domain 
protein MEC-2. MEC-6 and MEC-2 indeed increase 
channel activity by 30-45 fold when coexpressed in 
Xenopus laevis oocytes with a mutated version (carry¬ 
ing the DEG mutation) of MEC-4 possessing constitu¬ 
tive activity. MEC-2 binds to cholesterol and the lipid 
environment seems to be important for the function of 
the C. elegans mechano-transduction complex. 

UNC-105 is a muscular degenerin that can be also 
activated by gain-of-function mutations to form an 
amiloride-sensitive sodium channel. Dominant, gain- 
of-function mutations in UNC-105 cause severe mus¬ 
cle hypercontraction. A functional interaction between 
LET-2 collagen and UNC-105 has been described. 

Acid-sensitive channel, degenerin-like 1 (ACD-1) 
forms on its own a constitutively open, sodium-selective 
ion charmel sensitive to amiloride. ACD-1 is inhibited 
by extracellular and intracellular acidification. It is 
expressed in sheath glia-like cells rather than neurons 
to influence sensory function. It functions with another 
degenerin, DEG-1, but not in the same cells (DEG-1 
being expressed in chemosensory neurons among other 
neurons) to mediate acid avoidance behavior and che- 
motaxis to the amino acid lysine. DEG-1 is also 
expressed in a group of neurons that includes the PVC 
intemeurons of the posterior touch sensory circuit, and 
mutations in the deg-1 gene are touch abnormal. 

DEGT-1 is expressed together with MEC-10 (and 
probably associated in the same multimeric complex) 
in PVD multidendritic neurons, which form polymodal 
nociceptors, and DEGT-1 and MEC-10 are required 
for responses to harsh touch but not to cold. 
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The UNC-8 degenerin is expressed in several neurons 
including the VA and VB motor neurons that may act as 
stretch receptors. UNC-8 may interact with another 
degenerin, DEL-1, to form a mechanosensitive ion 
channel. Asic-1 (the name is misleading since the protein 
has nothing to do with acid-sensing ion channel 1 previ¬ 
ously described) is localized at presynaptic terminals of 
dopaminergic neurons and is required for associative 
learning in C. elegans by amplifying normal dopaminer¬ 
gic signaling. Finally, FLR-1 is only expressed in the 
intestine and controls the defecation rhythm. 

Drosophila melanogaster Related Proteins 

The fly genome contains at least 30 ENaC-/DEG-/ 
ASIC-related genes, and only 12 have been genetically 
and/or molecularly characterized (dGNaCl/RPK, 
dmdNaCl/PPK, PPK 4, -7, -10, -11, -12, -13, 
-14, -19, -25/LLZ, and -28). 

The Drosophila gonad Na"^ channel 1 (dGNaCl)/ 
Ripped Pocket (RPK) has been suggested to participate 
in gametogenesis and/or early embryonic develop¬ 
ment (Adams et al. 1998; Darboux et al. 1998). 
It is expressed very early in the development (stage 0-3 h), 
i.e., before the beginning of the zygotic transcription and 
thus corresponds to a maternally encoded transcript. The 
transcript is distributed uniformly within the embryo 
cytoplasm. The mRNA is also detected in adult where 
it is expressed in gonads. dGNaCl/RPK is associated 
with a non-voltage-dependent, amiloride-sensitive 
(IC50 ^ 24 pM) Na’^-selective channel when expressed 
in Xenopus oocytes. This activity is of low amplitude and 
is restricted to Xenopus oocytes, but can be significantly 
increased by the DEG gain-of-function mutations. 

The Drosophila multidendritic neuron Na"^ Channel 
1 (dmdNaCl (/Pickpocket (PPK) has been involved in 
proprioception and the control of rhythmic locomo¬ 
tion. dmdNaCl/PPK is expressed in the late embryo- 
genesis (stage 14-17 h) in a particular subset of 
sensory neurons, the multiple dendritic neurons (md- 
da neurons). It has no activity when heterologously 
expressed in Xenopus oocytes. Expression of 
dmdNaCl in the late embryogenesis supports a role 
in the function of the peripheral nervous system 
rather than in its differentiation. dmdNaCl/PPK is 
required for mechanical nociception but not thermal 
nociception in md-da sensory neurons. 

Pickpocket 4, -7, -10, -11, -12, -13, -14, -19, 
and —28 are expressed in the tracheal system where they 
may be involved in liquid clearance from the airways. 


Pickpocket 11 (PPKll) and 19 (PPK19) are also 
expressed in the larval taste-sensing terminal organ and 
in adults on the taste bristles of the labelum, the legs, and 
the wing margins, and are important for Na"^ and salt 
taste. Pickpocket 28 (PPK28) is expressed in gustatory 
water-sensing neurons (water is a potent chemosensory 
stimulus for insects) where it probably forms an 
osmosensitive ion channel activated by hypo-osmotic 
solutions (Cameron et al. 2010; Chen et al. 2010). 

Pickpocket 25 (PPK25)Aounge lizard (LLZ) is 
expressed at high levels in male appendages (legs, 
wings, and antennae) and is detected in the larvae in 
a single lateral multidendritic (mechanosensitive) neu¬ 
ron and in two external sensory neurons that project to 
the chemosensitive terminal organ, as well as in non¬ 
neuronal support cells enwrapping sensory neurons. 
PPK25/LLZ contributes to sensory functions during 
male courtship and is necessary for response to female 
pheromones by Drosophila males. PPK25/LLZ does not 
generate ion channel current by itself in heterologous 
expression systems. 

Invertebrate RFamide Peptide-Gated Channels 

The ENaC/DEG/ASIC gene family contains peptide¬ 
gated Na"^ channels (Lingueglia et al. 2006). These chan¬ 
nels are unique because peptides directly bind to the 
channels to gate them without the involvement of G- 
protein-coupled receptors, forming so far the only 
known peptide ionotropic receptors (Cottrell 1997). 
These channels have been identified in invertebrate ner¬ 
vous system (Molluscs and Cnidarians). Members have 
been cloned from snails (HaFaNaC from Helix aspersa 
and HtFaNaC from Helisoma trivolvis), great pond snail 
(LsFaNaC from Lymnaea stagnalis), Aplysia (AkFaNaC 
from Aplysia kurodai), and freshwater polyp Hydra 
(HyNaCs from Hydra magnipapillata). They are all 
gated by RFamide-related neuropeptides. Snail and 
Aplysia channels are forming homomeric channels 
gated by FMRFamide while HyNaC2, HyNaC3, and 
HyNaC5 co-assemble to form a heteromeric ion charmel 
gated by Hydra-RFamides I and II (Durmagel et al. 2010). 

FaNaCs are able to generate a large FMRFamide- 
induced inward current when expressed in Xenopus 
oocytes and mammalian cells. External application of 
FMRFamide induces a fast and partially desensitizing 
current (EC50 - 2, 4, and 70 pM for HaFaNaC, 
AkFaNaC, and HtFaNaC, respectively). FaNaC is highly 
selective for Na"^ versus and Ca^"^ (Enb^/Pk^ > 10)j 
and has relatively low conductance (gNa ^ 6-13 pS). 
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The HyNaC2/3/5 channel is gated by Hydra-RFamides 
I and II (EC50 ^ 5 and 0.34 |j,M, respectively) and has 
a low Na"^ selectivity. HaFaNaC and HyNaC2/3/5 are 
blocked by amiloride (IC50 ^ 0.6 and 122 pM, respec¬ 
tively). Amiloride also has a potentiating effect, which 
may result from binding to a site located in the extracel¬ 
lular loop different from the binding site localized in the 
ion pore and involved in the blocking effect. 

These peptide-activated channels may participate in 
fast synaptic transmission at some junctions made by 
RFamide-containing neurons. The HyNaC2/3/5 chan¬ 
nel could also be involved in the coordination of the 
feeding reaction of Hydra. Finally, a technique 
(FMRFamide-tagging) that relies on the addition of 
FMRFamide as an electrophysiological tag (► Ion 
Channel Regulation by G-Protein-Coupled Receptors: 
Recent Advances with Optical Biosensors) to 
a prohormone encoding a neuropeptide of interest, and 
detection by coexpression with HaFaNaC that binds the 
tag, has been developed to measure the release of neu¬ 
ropeptides on a millisecond timescale (Whim 2002). 

Summary 

Members of the epithelial Na"^ channel/degenerin/ 
acid-sensing ion charmel (ENaC/DEG/ASIC) gene 
family form non-voltage-gated ion channels that share 
a Na'^-selective pore and sensitivity to the diuretic 
amiloride, but are very diverse in their functions and 
modes of activation. The family comprises the epithelial 
Na"^ channel (ENaC), a constitutively open channel 
involved in taste perception and Na"^ homeostasis in 
mammals, neuronal acid-sensing ion channels (ASICs) 
gated by extracellular protons, which form effective 
proton sensors in both central and peripheral sensory 
neurons and have been involved in numerous physiolog¬ 
ical functions including pain, the degenerins of the 
nematode Caenorhahditis elegans forming neuronal 
and muscular mechanosensitive channels involved in 
mechanoperception as well as constitutively open ion 
charmels inhibited by acidihcation in glia-like cells, and 
peptide ionotropic receptors identified from invertebrate 
nervous system (the RFamide peptide-gated channels 
EaNaCs and HyNaCs). The ENaC/DEG/ASIC family 
also comprises a mammalian sodium channel of yet 
unknown physiological function, which is highly sensi¬ 
tive to extracellular Ca^"^ and expressed in brain, liver, 
and intestine, and which can form a constitutively active 


channel in physiological conditions in mice, but not in 
rat (and probably in human) where its mode of activation 
remains unknown. Finally, several members are present 
in Drosophila melanogaster where they fulhll diverse 
functions ranging from liquid clearance in epithelia to 
mechanosensation, chemosensation, and osmosensation 
in neurons. 

Cross-References 

► Alignment of Protein Sequences 

► Alternative Splicing 

► Ion Channel Regulation by G-Protein-Coupled 
Receptors: Recent Advances with Optical 
Biosensors 

► Mechanosensitivity of Ion Channels 

► Membrane Proteins: Structure and Organization 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Protein Phosphorylation 

► Ubiquitinylation 

► Voltage-Gated Sodium and Calcium Channels 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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Enzymatic Chlorination/Halogenation - 
Computational Studies 

Hans Martin Senn 

WestCHEM and School of Chemistry 
University of Glasgow, Glasgow, UK 

Definition 

Halogenases are enzymes catalyzing the formation of 
C-X bonds (X = F, Cl, Br, I) to make halometaholites. 


which are halogen-containing organic compounds 
{organohalides) produced by living organisms. 

Prevalence of Halometabolites and 
Halogenases 

Some 4,000-5,000 halogenated natural products have 
been cataloged, biosynthesized by organisms from across 
the phylogenetic tree, including bacteria, fungi, terrestrial 
plants, various marine organisms, higher animals, and 
humans (Gribble 1998, 2003, 2004). The most abundant 
halogen in these halometabolites is chlorine, followed by 
bromine, but there is a smaller number of iodine- and 
even fluorine-containing compounds as well. The 
halometabolites are not only highly diverse in their bio¬ 
genic origin and their halogen content, but also in their 
structures, which range from very simple molecules like 
CH3CI to exceedingly structurally complex natural prod¬ 
ucts. Most of them are biologically active, for example, 
antimicrobial, antifungal, or antibiotic, and their activity 
is crucially dependent on the halogen(s) being present. 

Until quite recently, little was known about the 
enzymes involved in the biosynthesis of 
organohalogens. The first halogenating enzymes to be 
discovered were heme-dependent haloperoxidases in the 
1960s; vanadate-dependent haloperoxidases followed in 
the 1980s. Since about 2005, however, several new 
families of halogenases have been discovered and char¬ 
acterized biochemically and structurally; the field has 
been reviewed from various angles (Vaillancourt et al. 
2006; Blasiak and Drennan 2008; Neumann et al. 2008; 
Butler and Sandy 2009; Wagner et al. 2009). 

Halogenases may be divided into three classes 
according to the chemical nature of the active haloge¬ 
nating agent (Table 1). Solvated halide anions are the 
primary halogen source in all cases, but these are 
activated in different ways for reaction with the 
organic substrate. The principal mechanistic features 
and computational studies for each class are summa¬ 
rized in separate sections below. 

Electrophilic Halogenases 

Electrophilic halogenases oxidize the halide anion to 
form an electrophilic species “X"^,” most likely 
hypohalous acid HOX (or hypohalite XO~, respec¬ 
tively), which then reacts with the organic substrate. 
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Enzymatic Chlorination/Halogenation - Computational Studies, Table 1 Classes of halogenases according to the chemical 
nature of the halogenating agent. (LG = leaving group, SAM = S-adenosylmethionine) 

Class 

Electrophilic 

Nucleophilic 

Radical 


C-X bond 
forming reaction 


LG 

• 




X- - LG 


"" X 

Ml" “9+ 

Halogenating 

agent 

Hypohalite (XO“) 

Halide (X“) 

Halogen (X‘) 


Reaction type 

Electrophilic aromatic 
substitution (SnAr) 

Nucleophilic substitution (Sn2) 

Radical rebound 


Enzyme families” 

1. Heme-dependent 
haloperoxidases (1995) 

1. SAM fluorinase (2004)/ 
chlorinase (2008) 

Nonheme iron/2-oxoglutarate-dependent 
halogenases (2006) 


2. Vanadate-dependent 
haloperoxidases (1996) 

3. Elavin-dependent 
halogenases (2005) 

2. SAM-dependent halide methyl 
transferase (2010) 




“The year indicates the first structural characterization of an enzyme of the respective family 


The halogen is installed at relatively electron-rich car¬ 
bon centers, usually in alkenes or aromatic rings, in an 
electrophilic aromatic substitution reaction. Three 
families of electrophilic halogenases are known, 
which use different oxidants to effect two-electron 
oxidation of X“ to heme-dependent and 

vanadate-dependent haloperoxidases as well as fla¬ 
vin-dependent halogenases. 

Heme-Dependent Haloperoxidases 

The heme-dependent haloperoxidases (Fe-HPOs) fea¬ 
ture a redox-active heme cofactor, akin to cytochrome 
P450 and related heme enzymes (Scheme 1). Exoge¬ 
nous H 2 O 2 oxidizes the heme-Fe center to generate an 
Fe^'^-oxo (ferryl) species B (also known as Compound 
I), which in turn oxidizes X“ (X = Cl, Br, I), affording 
HOX. The general lack of substrate specificity of the Fe- 
FIPOs suggests that HOX is released from the active 
site. However, there are indications that, at least in some 
cases, HOX does not escape from the enzyme 
completely, implying that the substrate is somehow 
associated with the enzyme. Depending on conditions 
and substrates present, Fe-HPOs are able to catalyze 
a range of oxidation reactions other than halogenations. 

Computational Studies 

The key steps of the oxidation of X“ to XO~ in heme- 
dependent haloperoxidases resemble in many ways the 


chemistry of other heme enzymes, which has been thor¬ 
oughly studied computationally. Some of the same 
researchers have also turned their attention to Fe- 
HPOs, notably Shaik and coworkers. They used QM 
(DFT)/MM methods to study the formation of Com¬ 
pound I in chloroperoxidase (Chen et al. 2008), starting 
from the hydrogen peroxide adduct [Fe*^*]-(HOOH). 
The preferred pathway proceeds on the doublet 
{S = 1/2) surface via deprotonation of Op 

to Compound 0 (A in Scheme 1), followed by rate¬ 
determining heterolytic 0-0 cleavage with 
concomitant protonation of Od and loss of water to 
form Compound I (B). This proton-catalyzed sequence, 
involving Glul83 as the general acid/base, is preferred 
to direct 0-0 cleavage in [Fe*^]-(HOOH). Also, reac¬ 
tion in the S = 1/2 (doublet) electronic state is favored 
over S — 3/2 (quartet) or S = 5/2 (sextet) reactivity. The 
reaction from Compound 0 to Compound I is an inter¬ 
esting case of a proton-coupled electron transfer 
(PCET). The early stage of the 0-0 cleavage is homo- 
lytic, but the proton transfer to Od triggers the transfer of 
an electron from the [Fe]-Op unit to the incipient water 
molecule, which makes the overall process heterolytic. 

A follow-up study (Fai et al. 2009a) reported further 
details about the effects of the substrate, protonation 
state of active-site residues, and type of ligand trans to 
the oxygen ligands on the formation of Compound I in 
chloroperoxidase. 
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Enzymatic Chlorination/Halogenation - Computational 
Studies, Scheme 1 Key steps in the mechanism of heme- 
dependent haloperoxidases. The Fe^* hydroperoxide complex A 


(Compound 0) is oxidized to the Fe^-oxo species B (Compound I), 
which in turn oxidizes to XO^. Op and Od in A designate the 
proximal and distal oxygen atoms 


OH 

0-,.],v 


O* 


His 


1- 


-V''—OH 

I 


NH 


A1 


OH2 

0.,.wv 


O* 


-V''—OH 

I 

N-His 

A2 


HpO, 


■2H20 HO*^| 


O 

0-.,vV_^Q 


N-His 
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H+, X- 


HO* 


OX 

I 

OH 

I 

N-His 


Enzymatic Chlorination/Halogenation - Computational in different protonation states, only one of which is shown; the 

Studies, Scheme 2 Key steps of the mechanism of exact order of steps from A1 to B is uncertain 

vanadate-dependent haloperoxidases. Each complex may exist 


Another contribution from the Shaik group (Lai 
et al. 2009b) used DFT/MM to compute Mdssbauer 
parameters of different isomers and protonation states 
of Compound II, {[Fe^'^]=0} of chloroperoxidase. 
(Compound II derives from Compound I by one- 
electron reduction.) The calculations allowed the 
assignment of three Compound II species in the exper¬ 
imental spectrum. 

Vanadate-Dependent Haloperoxidases 

Like the Fe-HPOs, the vanadate-dependent 
haloperoxidases (V-HPOs) also use exogenous H 2 O 2 , 
but they activate it as a vanadium(V) peroxido com¬ 
plex, which oxidizes X“ (X = Cl, Br, I) to HOX 
(Scheme 2). The vanadate cofactor acts as a Lewis 
base, the metal center remaining V(V) throughout the 
catalytic cycle, unlike the redox-active Fe center in the 
Fe-HPOs. Similarly to the Fe-HPOs, it is not entirely 
clear for the V-HPOs if they release free HOX or if the 
halogenation reaction takes place in/at the enzyme. 
They are generally more selective and less promiscu¬ 
ous than the Fe-HPOs and have been shown to be 
involved in the biosynthesis of halometabolites in 
marine organisms. 

Computational Studies 

Most computational studies of vanadate-dependent 
haloperoxidases have focused on the question “where 


are the protons?”, that is, how many and which of the 
vanadate oxygen atoms are protonated in the different 
forms of the cofactor. Because the vanadate is 
surrounded by several titratable, hydrogen-bonding 
residues, studies of the isolated cofactor, or even 
small models including a few residues, are of limited 
value. We will therefore only discuss results from 
DFT/MM studies that consider the entire protein, or 
at least large parts of it. It should be noted, however, 
that the assignment of protonation states remains 
a difficult problem even at this rather involved level 
of modeling. Slight differences in the computational 
method (e.g., different exchange-correlation func¬ 
tionals in DFT) or in the model setup (especially the 
assignment of protonation states in the MM part of the 
model) can lead to different answers. It is often impos¬ 
sible to resolve such problems without recourse to 
experimental data. 

Much work has concentrated on the resting state (A) 
of vanadium chloroperoxidase (V-CPO), which is well 
characterized by X-ray crystallography, UV/Vis, and 
solid-state NMR spectroscopy. The identity of A could 
only be determined with reasonable certainty through 
the combination of computational and experimental 
results. Waller et al. (2007, 2008); Geethalakshmi 
et al. (2009) highlighted that V NMR isotropic chem¬ 
ical shifts, even in combination with structural data, 
were not sufficient to assign protonation states; only 
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Enzymatic Chlorination/ 
Halogenation - 
Computational Studies, 
Scheme 3 Key steps of the 
mechanism of a flavin- 
dependent tryptophan 
7-halogenases. (a) Formation 
of FADHOOFI, which 
oxidizes to FIOX. 

(b) Proposed formation of 
a lysine A-haloamine 
intermediate, (c) Electrophilic 
aromatic substitution of the 
substrate tryptophan 
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the incorporation of further NMR parameters (chemi¬ 
cal shift anisotropy and nuclear quadrupole coupling) 
provided enough information. The consensus from 
several studies (Kravitz et al. 2005; Raugei and Carloni 
2006; Waller et al. 2007; Zhang and Gascdn 2008) is 
that A in the resting state of V-CPO is most likely 
a doubly protonated monoanion (Al). 

The reaction proceeds by protonation (A2), attack 
of H 2 O 2 , and loss of two water molecule to yield the 
r| -peroxido complex B; however, the order of these 
steps is not entirely clear (Kravitz et al. 2005; Raugei 
and Carloni 2006). Neither is the protonation state of B 
(Geethalakshmi et al. 2009), whose structure is known 
as well. The neutral, singly protonated form shown in 
Scheme 2 appears to be most likely. Only Raugei and 
Carloni (2006) have modeled the complete catalytic 
cycle of V-CPO, even though with bromide as the 
halide. 

For vanadium bromoperoxidase, the protonation 
states of the resting state (Waller et al. 2008) and the 
peroxido vanadate (Geethalakshmi et al. 2009) have 
been studied. Pacios and Galvez (2010) built 
a homology model of vanadium iodoperoxidase and 
investigated the selectivity of this enzyme for iodide 
over bromide and chloride by means of electrostatic 
potentials calculated on molecular surfaces. They also 
studied the isolated cofactor by DFT calculations. 


Flavin-Dependent Halogenases 

Similarly to FAD (flavin adenosine dinucleotide)- 
dependent monooxygenases, the flavin-dependent 
halogenases reductively activate O 2 to form 
a hydroperoxyflavin (FADHOOH) intermediate, 
which is a source of “OH"^” (Scheme 3a). It oxidizes 
X“ (X = Cl, Br), bound in close proximity to the 
cofactor, affording HOX. The free amino acid trypto¬ 
phan is the natural substrate of the flavin-dependent 
halogenases that have been structurally characterized 
to date. Its indole unit is halogenated regioselectively 
in the 5-, 6-, or 7-position, depending on the enzyme. 
The substrate is bound in a second active site, some 
10 A away from the flavin/X“ binding site. Presum¬ 
ably, HOX reacts first with the side-chain amino group 
of a lysine residue lining the channel connecting 
the two binding sites (Scheme 3b). The resulting 
V-haloamine (-NH 2 X'^) intermediate would then be 
the active electrophilic halogenation agent, which 
transfers “X"^” onto the aromatic substrate 
(Scheme 3c). 

Sequence data suggest that there exists a second 
subfamily of flavin-dependent halogenases that pro¬ 
cesses electron-poor pyrrole- or phenol-type sub¬ 
strates; however, no such enzyme has been 
structurally characterized yet, and the details of the 
mechanism remain unclear. 
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Computational Studies 

We are not aware of any published computational 
studies on flavin-dependent halogenases. Possible 
questions amenable to computational investigations 
include: (1) Identity of the halogenating agent: 
Whaloamine intermediate or HOX hydrogen-bonded 
to the lysine side-chain amine? (2) Role of general 
acid/base residues: Is the lysine amine neutral or pro- 
tonated in the resting state? How is the OH“ released 
from HOX protonated? What happens to the proton 
released from the substrate? (3) Regioselectivity: 
What factors control whether the substrate tryptophan 
is 5-, 6 -, or 7-halogenated? 

Nucleophilic Halogenases 

Although halide anions are not normally considered 
viable nucleophiles, especially in aqueous solution, 
there is a small number of enzymes where (partially) 
desolvated X“ anions (X = F, Cl, Br, I) act as nucle¬ 
ophiles in substitution reactions at carbon. In all 
cases, the substrate is S-adenosyl-L-methionine 
(SAM), whose carbon centers in a-position to the 
sulfonium sulfur are strongly electrophilic by virtue 
of the positive charge and the favorable leaving group 
characteristics of the thioether (Scheme 4). Two fam¬ 
ilies of nucleophilic halogenases have been 
characterized. 

SAM Fluorinase and SAM Chlorinase 

The fluorinase FIA, the only enzyme known to be able 
to fluorinate, only accepts F~ and Cl~ as halide sub¬ 
strates. The structurally very similar chlorinase SalL 
accepts Cl~, Br“, and I“, but not F“. The products of 
both enzymes are the corresponding halogenated aden¬ 
osine derivatives and L-methionine. See the separate 
entry ► Enzymatic Fluorination - Computational 
Studies for details. 

SAM-Dependent Halide Methyl Transferases 

Harnessing the power of SAM as a versatile biological 
methylating agent, these enzymes produce 
halomethanes, CH 3 X (X = Cl, Br, I). Although bio¬ 
chemically characterized in the 1990s and quite possi¬ 
bly the most prevalent kind of halogenase, the first 
structure of a member of this family only appeared in 
2010 (Schmidberger et al. 2010). 


Halide methyl transferases 
X- 



Enzymatic Chlorination/Halogenation - Computational 
Studies, Scheme 4 Nucleophilic halogenases catalyze the 
attack of halide anions as nucleophiles onto SAM. The fluorinase 
FIA and the chlorinase SalL produce halogenated adenosines, 
whereas halide methyl transferases produce halomethanes 

Computational Studies 

The only member of this class studied computationally 
to date is the fluorinase FIA; see the separate entry 
► Enzymatic Eluorination - Computational Studies. 

Radical Halogenases 

The enzymes in this class are nonheme iron/2- 
oxoglutarate (2-OG, oc-ketoglutarate) dependent and 
use a mono-iron complex to install a halogen (Cl or 
Br) at nonactivated carbon centers. The proposed 
mechanism (Scheme 5) borrows its essential steps 
from the better known nonheme iron/2-OG-dependent 
oxygenases, whose active site mainly differs by having 
a side-chain carboxylate ligand instead of the chloride. 
Starting from the resting state A with an Fe(II) center, 
O 2 is reductively activated, with the help of the 2-OG 
co-substrate. Via several intermediates and under loss 
of CO 2 , the high-spin (S — 2) Ee(IV)-oxo (ferryl) 
species D is generated. D is powerful enough as an 
oxidant to abstract an H atom from an aliphatic carbon 
center, R-H. The substrate radical R‘ then combines 
with the iron-bound halogen in a rebound mechanism 
to form the halogenated product R-X. 

The native substrates of these enzymes are a-amino 
acids, which are halogenated at aliphatic side-chain 
positions; examples are shown in Scheme 6 . However, 
free amino acids are not recognized as substrates: They 
need to be linked as thioesters to a phosphopantetheine 
tether (Fig. 1) and presented to the enzyme by a sepa¬ 
rate carrier protein. The halogenases are part of 
nonribosomal peptide synthase (NRPS) assembly 
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Halogenation - 
Computational Studies, 
Scheme 5 Proposed 
catalytic cycle of nonheme 
iron/2-oxoglutarate (2-OG)- 
dependent halogenases. The 
central active intermediate is 
the ferryl species D. R is an 
aliphatic carbon atom of an 
amino acid side chain; 

R' = (CH2)2C00“ 
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Enzymatic Chlorination/Halogenation - Computational 
Studies, Scheme 6 Native conversions catalyzed by some 
nonheme iron/2-oxoglutarate-dependent halogenases 
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Enzymatic Chlorination/Halogenation - Computational 
Studies, Fig. 1 The amino acid substrate (shown in red) of 
nonheme iron/2-oxoglutarate-dependent halogenases is linked 
as a thioester to a phosphopantetheine tether (in blue) and 
presented to the enzyme by a carrier protein 


lines, which include the set of enzymes necessary to 
install the free substrate to the tether. 

A key question arising from the mechanism shown 
in Scheme 5 is why the substrate radical combines 
with the chlorido, rather than the hydroxido, ligand (E 
^ F). It is known experimentally that certain 
nonnatural substrates are indeed hydroxylated, 
which shows that the halogenases are in principle 
competent to catalyze either reaction. Another exper¬ 
imental finding, from Mossbauer spectroscopy, is that 
(at least) two isomers of the ferryl species D exist in 
equilibrium. 
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Enzymatic Chlorination/Halogenation - Computational 
Studies, Scheme 7 The ferryl species of nonheme iron 
halogenases exists as an equilibrium of two positional isomers 


Computational Studies 

Because no substrate-bound structure is available exper¬ 
imentally, the substrate must be positioned into the 
active site by some modeling procedure. In a compre¬ 
hensive study of the nonheme iron halogenase SyrB2, 
Borowski et al. (2010) used molecular docking to place 
the native substrate L-threonine linked to a shortened 
tether into the active site of the experimental structure of 
the holoenzyme. They then built a “cluster model” 
including the Fe complex, substrate, and a few selected 
residues. Other computational studies on halogenation 
by SyrB2, using smaller models, are succinctly summa¬ 
rized in Borowski et al. (2010). 

Focusing on the crucial steps D —> E —> F, they 
formulated a mechanism based on stmctures, energetics, 
and Mossbauer parameters obtained primarily from DFT 
calculations, but validated by CASPT2 multi-reference 
single-point energies. They identified two isomeric ferryl 
complexes (Scheme 7) of almost the same energy, which 
can interconvert fairly easily (over a barrier of 
^55 kJ mol”^). The simulated Mossbauer spectra of 
such a mixture agree with experiment. Calculating the 
energetics of hydrogen abstraction and chlorine vs. 
hydroxyl rebound for both ferryl isomers, they concluded 
that the chemoselectivity for halogenation is determined 
by the slightly lower H-abstraction barrier from D2 than 
from D1 (73 vs. 77 kJ mol“*). Once the respective 
hydroxido complex E is formed, the ligand trans to 
His245 (i.e.. Cl for D2 and OH for Dl) preferentially 
and easily combines with the substrate radical. As the H- 
abstraction barrier critically depends on the position of 
the substrate relative to the oxo ligand, it is the position¬ 
ing of the substrate in the active site that ultimately 
controls chemoselectivity. 

According to Borowski et al. (2010), the 
H-activation step D ^ E takes place entirely in the 
S = 2 spin state. The electronic structure of other 
nonheme Fe*'^=0 complexes, their H-abstraction reac¬ 
tivities, and spectroscopic properties have been 


investigated in detail at various computational levels, 
including coupled-cluster methods, also by other 
groups (Neese 2006; Chen et al. 2010; Geng et al. 
2010; Janardanan et al. 2010). 


Summary 

Three classes of halogenases where at least one mem¬ 
ber has been structurally characterized are now known; 
electrophilic (heme, vanadate, and flavin dependent), 
nucleophilic (SAM dependent), and radical (nonheme 
iron/2-oxoglutarate dependent). They are involved in 
the biosynthesis of the relatively small, but highly 
diverse group of natural organohalogens. Computa¬ 
tional studies have already, and will continue to, elu¬ 
cidate the details of the mechanisms of these enzymes, 
often in synergistic combination with experiment. 


Cross-References 

► Computational Enzymology 

► Computational Studies of Enzyme Motions 

► Cytochrome P450 - Computational Studies 

► Enzymatic Eluorination - Computational Studies 

► QM/MM Calculations of Vibrational Spectra 

► QM/MM Methods 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Taurine/Alpha-Ketoglutarate Dioxygenase - 
Computational Studies 
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Definition 

Fluorinases are enzymes catalyzing the formation 
of C-F bonds to make fluorometaholites, which 
are fluorine-containing organic compounds 
{organofluorines) produced by living organisms. 

Enzymatic Fluorination 

Among the more than 4,000 halogen-containing natu¬ 
ral products described to date, there are only a handful 
of fluorometaholites. Notably, a wide variety of plants 
produce fluoroacetate, which is highly toxic for mam¬ 
mals and many insects. Other organofluorines were 
identifled from a sponge and soil bacteria. There are 
several reasons why organofluorines are scarce com¬ 
pared to other halometabolites, despite fluorine being 
the most abundant halogen in the Earth’s crust; 
(1) Most of the fluorine is bound in sparingly soluble 
fluoride minerals (e.g., CaF 2 ). (2) Once in solution, 
the fluoride anion is very well solvated; the very favor¬ 
able hydration energy (AhyuG* « —437 kJ mol”*) 
must be offset to remove fluoride from the aqueous 
environment. (3) Fluorine’s high electronegativity 
(equivalently, high ionization energy and high reduc¬ 
tion potential) precludes its participation in oxidative/ 
electrophilic or radical halogenation pathways, which 
are common in other halogenases; “F+” or “F‘” 
equivalents are not accessible under physiological 
conditions. However, when (partially) desolvated, F“ 
is a strong nucleophile. 

Fluorinase 

Introduction 

The fluorinase FIA (EC 2.5.1.63; other names; adenosyl- 
fluoride synthase, S-adenosyl-L-methionine;fluoride 
adenosyltransferase, 5'-fluoro-5'-deoxyadenosine 
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Enzymatic Fluorination - 
Computational Studies, 
Scheme 1 Enzymatic 
fluorination catalyzed by the 
fluorinase from 5. cattleya 



OH OH 
SAM 



O 



OH O 



synthase, 5'-FDAS) from the soil bacterium Strepto- 
myces cattleya is the only biosynthetic enzyme 
promoting C-F bond formation characterized to 
date (Dong et al. 2004). The enzyme catalyzes the 
production of 5'-fluoro-5'-deoxyadenosine (FDA) 
and L-methionine from S-adenosyl-L-methionine 
(SAM) and fluoride (Scheme 1); downstream metab¬ 
olism converts FDA, via fluoroacetaldehyde, into 
fluoroacetate and 4-fluoro-L-threonine, the latter 
being a powerful antibiotic. Notably, SAM is a sub¬ 
strate in this reaction, rather than a co-factor, unlike in 
SAM-dependent methyl halide transferases. 

The protein is a homotrimer in solution; the three 
299-residue monomers are arranged around a threefold 
axis. Three equivalent active sites are located at the 
interfaces between the chains. Zhu et al. (2007) 
reported detailed experimental studies of the enzyme’s 
molecular mechanism; a brief review is available in 
Deng and O’Hagan (2008). 

Computational Studies 

The C-F bond-forming step in the fluorinase was 
investigated in detail by combined quantum mechan¬ 
ics/molecular mechanics (QM/MM), treating the 
reacting substrates by density-functional theory 
(DFT) and the surrounding protein and water with the 
CHARMM force field (Senn et al. 2005). The setup 
included the full trimer, with one of the active sites 
solvated by a water sphere of 25 A radius (finite 
boundary conditions). Only atoms well inside the 
solvated region were allowed to move during classical 
molecular dynamics (MD) runs and subsequent 
QM/MM structure optimizations, while the remainder 
of the system was kept fixed. For comparison, the 


intrinsic reactivity of SAM + F“ was investigated by 
DFT calculations with a polarizable continuum model 
of water. The main results of the study are summarized 
in the following. 

Fluoride Binding Site 

No experimental structure of the enzyme with bound 
fluoride is available. Placing fluoride at different 
starting positions inside the proposed binding pocket, 
followed by classical MD and QM/MM optimizations, 
yielded structural details of the reactant complex 
(Fig. 1). The anion binding pocket is created by the 
loop Serl58-Tyrl57-Phel56, ThrSO, and the bound 
SAM. In the fluoride-free (but SAM-bound) structure, 
which is known experimentally, the empty pocket is 
stabilized by hydrogen bonds from Tyrl57;(N)H to 
Thr80;O(H) and from Serl58:(0)H to 3'-0(H) of the 
SAM ribose. In the fluoride-bound structure, both the 
ThrSO and the Serl58 OH groups have rotated to form 
hydrogen bonds to F“ (with lengths of 1.82 and 1.79 A, 
respectively), while the Tyrl57;(N)H-Thr80;O(H) 
interaction remains intact. Another hydrogen bond to 
F“ is formed by Serl58;(N)H (1.82 A), resulting in 
overall three hydrogen bonds stabilizing the fluoride. 
Moreover, the positive charge of SAM’s sulfonium 
cation further contributes to stabilizing the bound anion. 

C-F Bond Formation 

In the reactant complex, fluoride is bound at a distance 
of 2.76 A from the attacked ribose carbon C5'. A tran¬ 
sition state for the attack of F“ at C5' could be located 
(Fig. 2), in which F“ has approached C5' to 1.93 A, 
while the C5'-S bond is effectively broken. The fluo¬ 
ride’s negative charge is partially delocalized over, and 
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Fig. 1 QM/MM-optimized structure of the reactant complex 
(SAM • • enzyme). SAM is shown in hall-and-stick repre¬ 

sentation; the green hall is the fluoride anion. Selected residues 
involved in substrate binding are shown as sticks; hydrogen 
bonds are highlighted in orange. The two protein chains forming 
the active site are shown in gray (chain A) and yellow (chain B) 
cartoon representation 

compensated by, the cationic SAM, which is reflected 
in loosened hydrogen bonds to fluoride. The structure 
of the transition state is indicative of an Sn2 nucleo¬ 
philic attack at C5', with a trigonal-planar carbon cen¬ 
ter and partially formed/broken bonds to the attacking 
fluoride nucleophile and the methionine thioether leav¬ 
ing group, respectively. The optimized product mini¬ 
mum connected to the transition state has a fully 
formed C-F bond (1.43 A), while methionine is now 
completely detached [c/(C5'-S) = 3.44 A]. The only 
hydrogen bond to F retained in the product is Ser:(N) 
H- -F with 2.00 A. The QM/MM-optimized structure 
agrees very closely with the experimental structure of 
the product complex. 

The activation energy for C-F bond formation in the 
enzyme was calculated to be 59 kJ mol“* at the BP86/ 
def2-TZVP+/CHARMM level (Senn et al. 2009); the 
corresponding reaction energy is —22 kJ mol”'. The 
harrier in (polarizable continuum) water is 40- 
50 kJ mol” * higher, which corresponds to a rate accel¬ 
eration in the enzyme of 6-8 orders of magnitude. 
Thanks to enzyme catalysis, the actual C-F bond for¬ 
mation is thus a facile step, which is not rate-limiting for 
the overall fluorination process. Senn et al. (2009) also 



Enzymatic Fluorination - Computational Studies, 
Fig. 2 QM/MM-optimized structure of the transition state for 
nucleophilic attack of F” at C5' of the SAM ribose. Labeled 
distances are in A 

computed the free-energy harrier using QM/MM free- 
energy perturbation (FEP) methods and found it to be 
only a few kJ mol”* lower than the potential-energy 
barrier. However, there will clearly be substantial entro- 
pic and finite-temperature effects on the steps preced¬ 
ing, and following, the bond-forming event; in 
particular binding and desolvation of fluoride and prod¬ 
uct release. As these processes are likely to be coupled 
to large-scale conformational changes (like tertiary- 
structure rearrangements), they are difficult to model 
computationally. 

Alternative reaction channels that could yield 
the observed products were also explored hy DFT cal¬ 
culations in continuum water (Senn et al. 2005), includ¬ 
ing an elimination-addition mechanism in which 
fluoride acts as a base, rather than a nucleophile. How¬ 
ever, none of the alternatives was found to be energet¬ 
ically viable compared to the Sn 2 pathway. In summary, 
the combined evidence from computational, structural, 
and chemical studies have established enzymatic C-F 
bond formation in the fluorinase via an Sn2 mechanism. 

Substrate Specificity 

An interesting and important aspect of enzymatic reac¬ 
tivity is substrate specificity. The fluorinase was shown 
experimentally to be able to act as a chlorinase as well 
(Deng et al. 2006); however, the equilibrium of the 
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chlorination is shifted to the reactant side. This 
is consistent with the calculated (Senn et al. 2009) 
positive reaction energy (16 kJ mol~') and a higher 
forward activation energy (69 kJ mol”*) for the 
chlorination. Computationally, carbon-halogen bond 
formation/cleavage should also be feasible with bro¬ 
mide and iodide, with the equilibrium shifted even 
further (Senn et al. 2009); however, no reactivity was 
observed experimentally with either of the heavier 
halides. A possible explanation is that halide selectiv¬ 
ity may not be determined in the C-X bond-forming 
step, which was the focus of the computational studies, 
but in a preceding step, for instance, halide binding. 

The second substrate variations investigated are 
deoxy derivatives of SAM. Experimental results are 
available only for the dehalogenation leading to 2'- 
deoxy-SAM + fluoride or chloride (Cobb et al. 2006), 
but computationally, 2'-deoxy-SAM, 3'-deoxy-SAM, 
and 4'-deoxy-SAM (a carbasugar derivative) were stud¬ 
ied with both fluoride and chloride (Senn, unpublished). 
The removal of either of the ribose hydroxyl groups, 
which help holding the substrate in place by hydrogen 
bonding to Asp 16, increases the forward barrier by up to 
8 kJ mol“* and has no major effect on the reaction 
energies compared to native SAM + fluoride or chlo¬ 
ride. For the carbaribose derivative 4'-deoxy-SAM, 
however, a significantly more exothermic reaction 
energy (—49 kJ mol”* with F”) and slightly lowered 
barrier (51 kJ mol”*) were calculated. 

Related Enzymes 

A small number of sequence and structural homologues 
of the fluorinase are known. The chlorinase SalF from the 
marine bacterium Salinispora tropica produces 5'- 
chloro-5'-deoxyadenosine from SAM and chloride, 
using an active-site setup highly similar to the fluorinase ’ s 
(Eustaquio et al. 2008b). However, SalF does not accept 
fluoride as a substrate, but is able to bromlnate and iodin- 
ate. Genome sequencing has revealed that the fluorinase 
as well as SalF belong to the DUF62 superfamily of 
proteins, whose about 200 members are almost exclu¬ 
sively found in extremophiles and archaea. Five other 
DUF62 proteins have been structurally characterized 
and/or assayed for their function (Deng et al. 2008; 
Deng and O’Hagan 2008; Eustaquio et al. 2008a). Despite 
the very close structural similarity to FIA and SalF, these 
enzymes show no halogenating activity, but catalyze the 


hydrolysis of SAM into adenosine and methionine. 
Except for the fluorinase, none of the DUF62 enzymes 
has so far been the subject of computational mechanistic 
studies, which could reveal the origin of the different 
substrate and chemoselectivities. 

Chemically related to the fluorinase is the 
fluoroacetate dehalogenase, which catalyzes the hydro¬ 
lytic defluorination of fluoroacetate into hydroxyacetate 
(glycolate) and fluoride. Its mechanism has been inves¬ 
tigated in detail both experimentally (Chan et al. 2011) 
and by QM/MM calculations (Kamachi et al. 2009). 
The reaction sequence follows the paradigm of the 
related haloalkane dehalogenases (none of which is 
able to process fluorinated substrates, however): The 
fluoride is displaced by a side-chain carboxylate in an 
Sn2 reaction, yielding an ester intermediate, which is 
subsequently hydrolysed via a tetrahedral intermediate. 
The latter step was found to be rate-determining, while 
the halide displacement is facile. 

Cross-References 

► Computational Enzymology 

► Enzymatic Chlorination/Halogenation - 
Computational Studies 

► QM/MM Calculations of Vibrational Spectra 

► QM/MM Methods 
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Abbreviations 

EMR Electron magnetic resonance spectroscopy 

ENDOR Electron-nuclear double resonance 

EPR Electron paramagnetic resonance 
spectroscopy 

ESEEM Electron spin-echo envelope modulation 

ESR Electron spin resonance spectroscopy 

Synonyms 

Electron paramagnetic resonance spectroscopy; Elec¬ 
tron spin resonance spectroscopy 


Definitions 

Axial spectrum: a spectrum of a paramagnetic center 
with axial symmetry in a parameter such as g-factor, 
8x = gy gz- Such spectra may arise from a fibrous 
sample, or a heme, in which the x and y axes are in the 
heme plane, and z is normal to the plane. 

g-Eactor: a dimensionless scalar quantity that 
defines the condition of resonance for an electron 
spin as a function of applied magnetic field and micro- 
wave frequency. 

Rhombic spectrum: a spectrum of a paramagnetic 
center with no symmetry, gxy^ gyy^ gz- 

Introduction 

Paramagnetic centers have a central role in many bio¬ 
logical electron-transfer processes, including photo¬ 
synthesis, oxidation of foodstuffs, and nitrogen 
fixation. They include the trace metals iron, copper, 
manganese, cobalt, nickel, and molybdenum, which 
have oxidation states with an odd number of electron 
spins. Anion and cation radicals are formed from 
enzyme cofactors and amino-acid residues by one- 
electron addition or removal, respectively. Although 
the amounts of these paramagnets present in living 
cells are small, they are often present at centers of 
catalytic activity and thus of considerable biochemical 
interest. The sensitivity of EPR spectroscopy is such 
that some of them are readily observed in intact cell 
preparations; more detailed studies are performed on 
purified systems. 

Recent developments in EPR are described in 
a review series (Hanson and Berliner 2010). Biomed¬ 
ical applications of EPR of radicals are outside the 
scope of this review, but are reviewed by Rhodes 
(Rhodes 2011). 

EPR and ENDOR of Organic Radicals 

Radicals may be produced through oxidation-reduc¬ 
tion reactions with transition metal ions, by photolysis 
with visible or ultraviolet light, or by ionizing radia¬ 
tion. Photo-induced radicals may be generated by 
illumination in the CW-EPR cavity (Section ► EPR 
Spectroscopy: General Principles). Primary radicals 
are often short-lived, undergoing chain reactions 
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forming other radicals. These transient radicals may be 
detected by spin-trapping, as described by Haywood 
(Section ► Spin-Trapping: Theory and Applications). 
EPR spectra of long-lived radicals are observable at 
room temperature. For organic free radicals that are 
tumbling rapidly in solution, the anisotropy of g and A 
is averaged out and the spectra have narrow linewidths. 
The spectra may be complex, owing to hyperhne inter¬ 
actions A of the electron spin with nuclear spins, espe¬ 
cially *H (nuclear spin I = V 2 ) and (nuclear spin 
7=1). These spectra are better resolved by electron- 
nuclear double resonance (ENDOR) spectroscopy. 
Radicals of molecular mass greater than 1,000 are 
usually in the slow-motion or solid-state regime, so 
the anisotropy of the hyperhne interactions is not aver¬ 
aged out and the spectra are considerably broadened. 
Hyperhne couplings can still be resolved by ENDOR. 
They have in some cases been assigned to individual 
nuclei of the radical by systematically labeling the 
atoms with nuclei such as ^H, *^C, or ^^N. This dehnes 
the distribution of unpaired electron density, which can 
then be interpreted by quantum calculations of the 
electronic structure of the radical (Jeschke 2005). 

Flavin Radicals 

Havins are an example of redox cofactors, found in many 
electron-transfer proteins. Unlike the radicals of free 
havins which are unstable in neutral solution and prone 
to disproportionation, the havin radicals in havodoxins, 
and some havoenzymes, are stabilized, and give strong 
EPR signals (Fig. lb). The EPR spectra of havoprotein 
radicals consist of an unresolved single line of width 
1.4—2.0 mT, depending on the state of protonation of 
the havin ring system (Fig. la). The hyperhne couplings 
to multiple *H and ^'^N nuclei are unresolved under this 
envelope, but can be resolved by ENDOR. The coupling 
constants were assigned to the atoms of the havin ring by 
studies of havin semiquinones, specihcally substituted 
with isotopes of carbon, hydrogen, and nitrogen. These 
showed the regions of unpaired electron population 
(Fig. la), and that the protonation is on N — 5 (Fig. 2a). 
Similar electron distributions are seen in the anionic and 
neutral havin radicals (Medina and Cammack 2007). The 
electron distribution will depend on the surrounding pro¬ 
tein. Regions of high electron density are potential sites 
for electron transfer. The Davies pulsed ENDOR (see 
section ► Pulsed ENDOR and ESEEM: Principles and 
Examples of Applications to Heme Proteins) spectrum of 
a havin radical in a havoprotein is shown in Fig. Ic. 


Phenolic Compounds and Quinones 

Some biopolymers contain trapped radicals which are 
detectable by EPR. Melanin is a complex polymer 
formed by oxidation of quinone compounds and 
containing stable trapped cation radicals. Its EPR spec¬ 
trum is a single line of width approximately 5 mT, 
readily detected in room-temperature EPR spectra of 
hair or skin, and also fungal spores (Fig. 2c, lower 
spectrum). 

Protein-Derived Radicals 

Radical cations from the aromatic amino acids tyro¬ 
sine, tryptophan, and histidine are generated by oxida¬ 
tion reactions in certain enzymes and electron-transfer 
proteins, in which they form part of the catalytic site. 
Well-known examples are the tyrosyl radicals in ribo¬ 
nucleotide reductase and in Photosystem II (signal 2). 
Some of these radicals are essential components 
of enzymes such as ribonucleotide reductase, or 
electron-transfer systems such as Photosystem II. 
High-held measurements at W-band or higher frequen¬ 
cies are able to distinguish spectra of radicals on the 
basis of the anisotropy of g, which is lost in the 
linewidth at X-band. The amino acid residues involved 
have been identihed by site-directed mutagenesis. The 
hyperhne splittings due to the ortho- and meta-protons 
were assigned by measurements of proteins from bac¬ 
teria grown on amino acids isotopically substituted 
with ^H. The reaction cycle in peroxidases involves 
the formation of compound I, which incorporates 
a cation radical on the heme, which is detected by an 
EPR signal. 

Radicals Produced by Radiation and Photolysis 

For spectra of radicals that are conhned to a very 
narrow range of Bq held, it is possible to detect 
a free-induction decay (FID) and derive a spectrum 
by Fourier transformation, as in NMR. Normally, how¬ 
ever, the spectrum is too broad and the FID decays too 
rapidly. Instead, it is possible to detect an electron spin 
echo after a two-pulse (7t/2—7t echo) sequence. The 
spectrum of a transient species may be observed after 
a laser hash synchronized with the microwave pulses 
(Prisner et al. 2001). The EPR spectrum of the hash- 
induced species is derived by repeating the measure¬ 
ment over a range of magnetic held. The lifetime of the 
excited species is measured by repeating the electron- 
spin echo sequence while the interval between the hash 
and the pulse sequence is incremented. Alternatively, 
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EPR and ENDOR of Radicals and Metalloproteins in 
Biological Systems, Fig.1 (a) Upper structure, the isoalloxa- 
zine ring system of FAD in the anionic radical form {the red 
semiquinone radical)', circles indicate regions of greatest elec¬ 
tron spin density in a typical radical. Lower structure, protonated 
{blue radical) form; (b) CW-EPR spectra of a typical flavin 
semiquinone in 4-hydroxybutyryl-CoA dehydratase, in IH 2 O 
(broader spectrum) and 2 H 2 O (narrower spectrum), showing 
coupling to an exchangeable proton that is absent in IH 2 O 


(Cinkaya et al. 1997); and (c) Davies ENDOR spectrum of the 
flavin cofactor of DNA photolyase, showing assignments of 
hyperfine couplings to ring protons. Figure Ic is adapted, with 
permission from Schleicher E et al. (2007). Electron Nuclear 
Double Resonance Differentiates Complementary Roles for 
Active Site Histidines in (6-4) Photolyase. Journal of Biological 
Chemistry. Copyright © 2011, by the American Society for 
Biochemistry and Molecular Biology 


CW-EPR may be applied in a special instrument with¬ 
out magnetic-field modulation (direct detection) 
(van der Est 2009). 

EPR and ENDOR of Transition Metals 

For a readable introduction to the practicalities and 
theory of biological EPR, especially of transition 


metals, see Hagen (2009). The EPR of transition ions 
has been reviewed by Pilbrow (1990) and Weil and 
Bolton (2007). 

The oxidation states of the transition metals that are 
found in biological systems are summarized in Table 1. 
From Kramer’s theorem, oxidation states that have an 
odd number of electrons are doubly degenerate, and 
will have at least one pair of electron energy levels 
(called a Kramer’s doublet) that can only be split by the 
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g-factor 


(2Fe-2s) 



EPR and ENDOR of Radicals and Metalloproteins in 
Biological Systems, Fig. 2 EPR spectra of S = ‘/2 paramag- 
nets: (a) reduced adrenal [2Fe-2S] ferredoxin, recorded at tem¬ 
perature 77 K; (b) Resting II signal of molybdenum(V) in milk 
xanthine oxidase, recorded at 150 K; (c) melanin from Penicil- 
lium, recorded at room temperature 

magnetic field, and therefore be EPR-detectable. States 
with an even number of electrons, such as high spin 
Fe^\ S = 2 may not be detectable; the EPR transitions 
may be forbidden, in which case they may be observed 
with a resonator in parallel mode (Bi || Bq); or the 
separation between energy levels is greater than the 
microwave quantum, in which case they may be 
detectable in a high-frequency spectrometer. 

A primary purpose use of EPR of transition ions in 
biological systems Is to identify the metal center, its 
redox and spin state, and in favorable cases its state of 
coordination. Investigations of the mechanism of 
metalloenzymes have stimulated the most advanced 
developments in instrumentation and computational 
methods. Apart from a few exceptions such as Mn^^ 
which may be observed in solution at physiological 
temperatures, most transition metals have to be cooled 
to cryogenic temperatures for the EPR signal to be 
observed. Further details of the metal site and its coor¬ 
dination are obtained by ESEEM, ENDOR, and other 
double-resonance techniques (Section ► Pulsed 
ENDOR and ESEEM: Principles and Examples of 
Applications to Heme Proteins). This approach can 


be extended to systems containing diamagnetic 
ions if they can be functionally substituted 
by a paramagnetic metal ion, such as magnesium by 
manganese, or zinc by cobalt (Bennett 2010). 

EPR can provide various types of information about 
a transition metal ion. Changes in oxidation-reduction 
state can be studied kinetically, often using rapid- 
freeze techniques (Lowe 1995), or thermodynami¬ 
cally, using redox equilibria to measure redox 
potentials. Where the spectra have narrow line widths 
and are detectable at room temperature (V^'^, Mn^\ 
Co*^), the spectrum may be broadened by slow motion, 
and may offer similar information to a nitroxide spin 
label. In others, such as some iron proteins, which 
require cryogenic temperatures for detection, the 
anisotropic spectra of the solid-state spectra are very 
sensitive to coordination geometry. 

Copper 

Copper in the oxidation state Cu^^ is an excellent sub¬ 
ject for EPR, except where it is strongly spin-coupled 
to another S = center, as in the EPR-silent Cu*^-Cu^^ 
dimeric (Type III) centers. Cu^* spectra often have 
axial-type spectra, with a clearly visible hyperfine cou¬ 
pling on gj^ (Fig. 3a), as seen in the Type II copper 
center Cu/Zn superoxide dismutase (Fig. 3d). A com¬ 
mon motif is the “blue” or Type I copper center, as in 
plastocyanin (Fig. 3b) which has a cysteine ligand, and 
is characterized by a small hyperfine coupling to the 
copper nucleus. A common motif in many copper 
oxidases, such as laccase (Fig. 3c), is a blue copper 
center at some distance from a “trinuclear center” 
comprising an EPR-detectable Type II and an 
EPR-silent Type III center (Fig. 3) (Solomon and 
Hadt2011). 

Iron and Cobalt 

Iron is the most abundant trace element in the bio¬ 
sphere, reflecting its abundance on the planet, and has 
many biological roles. Fortunately, it provides the 
most diverse range of EPR spectra, reflecting differ¬ 
ences in spin state and coordination (Fig. 4). Cobalt is 
less widely distributed, mainly in proteins containing 
cobalamin (vitamin B 12 ). EPR spectra are particularly 
sensitive to spin states of the metals, and are an excel¬ 
lent method of studying them; an example is the spin- 
state changes that take place on substrate binding to 
cytochrome P-450. 












EPR and ENDOR of Radicals and Metalloproteins in Biological Systems 


689 


E 


EPR and ENDOR of Radicals and Metalloproteins in Biological Systems, Table 1 

biological systems 

Properties of transition 

ions relevant to 

Metal ion 

Paramagnetic 
oxidation states 

Spin 

states S = 

Typical 

g-factor 

Typical temperature 
of measurement, K 

Isotopes 
with spins 

Nuclear 

spin 

% Natural 
abundance of 
isotope 

Manganese 

Mn" 

1/2 

2.0 

R.T. 

^^Mn 

5/2 

100 

Iron 

Fe“ 

h.s. 2 

ND 


=’Fe 

1/2 

2.19 


Fe'“ 

l.s. 1/2 

2^ 

40 K 






h.s. 5/2 

2-10 

77 K 




Cobalt 

Co“ 




=’Co 

7/2 

100 



l.s. 1/2 

2 

77 K 






h.s. 5/2 

2-10 

10 K 





Co™ 







Nickel 

Ni' 

l.s. 1/2 

2.15 

77 K 

“^‘Ni 

3/2 

1.134 


Ni" 



77 K 





Ni'“ 

1/2 

2.15 





Copper 

Cu“ 




“Cu 

3/2 

69.09 



1/2 

2.15 

77 K 

“Cu 

3/2 

30.91 

Molybdenum 

Mo'' 




”Mo 

5/2 

15.72 



1/2 

1.96 

150 K 

’’Mo 

5/2 

9.46 


Other oxidation states and isotopes exist for these elements 
is. low spin, /t..v.high spin, R.T. room temperature 


The EPR-detectable oxidation states are Fe^^* and 
Co*^; they are low spin (5 =1/2) in octahedral coordi¬ 
nation geometry, such as in heme proteins with strong- 
held axial ligands, and in some cobalamins. g-factors do 
not normally exceed 4, and spectra can be recorded at or 
below liquid nitrogen temperature, 77 K. Most other 
forms are high spin {S = 5/2). EPR studies of ferriheme 
are described in the section ► Pulsed EPR: Principles 
and Examples of Applications to Elemeproteins. 

For nonheme iron proteins, the shape of the spec¬ 
trum is dehned by the axial and rhombic ligand held 
parameters D and E (Hagen 2009). For high-spin 
Fe^^\ there are three sets of Kramer’s doublets with 
S — ±1/3, ±3/2, and ±5/2. For rubredoxin, transferrin, 
and Fe-superoxide dismutase, the E/D ratio is near to 
1/3, and the spectrum comprises a strong signal at 
g = 4.27 from the S — 3/2 doublet, and low-amplitude 
signals around g = 9 from the other two. Relaxation 
rates are very rapid, so that liquid-helium temperatures 
are needed, particularly for pulsed measurements. Fell 
is normally EPR-silent, but when nitric oxide, NO, is 
a ligand it imparts an extra unpaired electron (S = 16). 
Thus, for example nitrosyl high-spin Fe*^ has a net spin 
S = 2 — 16 = 3/2. The lower spectrum in Fig. 4 is of 


a high-spin nitrosyl complex with the catalytic 
nonheme iron site in a dioxygenase. 

Molybdenum 

With the exception of nitrogenase, most molybdenum 
enzymes contain the molybdenum cofactor, which 
comprises Mo bound to one or two pyranopterin 
groups (Hille et al. 2011). The Mo is EPR-detectable 
in the intermediate oxidation state Mo'^. The freeze- 
quench technique was devised to study the reaction of 
xanthine oxidase, one of the earliest enzymes to be 
studied by EPR (Bray et al. 1964). The spectrum 
shown in Fig. 2b is of a stable Mo'’^ species observed 
in xanthine oxidase and related enzymes, but transient 
Mo'^ species have been observed during enzyme turn¬ 
over. In addition to Mo, this enzyme contains flavin 
and iron-sulfur clusters, and was a model for biologi¬ 
cal electron-transfer systems. 

Metal Cluster Complexes 

If two or more paramagnets, such as transition metal 
ions, form a covalently bonded complex or cluster, 
exchange interactions are usually very strong and the 
EPR spectrum will be very different from the spectra 
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EPR and ENDOR of Radicals and Metalloproteins in 
Biological Systems, Fig. 3 Upper panel: EPR spectra of 
some CuII-containing proteins: (a) the nonphysiological Cull 
complex, copper-substituted transferrin; (b) a blue copper pro¬ 
tein, plastocyanin; (c) a multi-copper oxidase, mushroom 
laccase; and (d) a non-blue copper protein, Cu/Zn superoxide 
dismutase. Spectra recorded at temperature 40 K, microwave 
frequency 9.38 GHz, power 10 mW (author’s own observations). 
Lower panel: structure of the copper centers in a multi-copper 
oxidase (Reproduced with permission from Solomon et al. 2008) 

of the paramagnets alone. The spin of the system 
depends on the way in which the spins are coupled. 
The transition-ion dimers in oxidized ferredoxin and 
methemerythrin comprise two Fe*^* spins, strongly 
antiferromagnetic ally coupled, to give a “ladder” of 
energy levels, with ground state S — 0, hence no EPR 
signal at liquid helium temperature. At higher temper¬ 
atures, excited states with S =1, 2,.. .10 become pop¬ 
ulated, leading to rapid electron-spin relaxation, and 
contact shifts in the room-temperature NMR spectrum, 
but no detectable EPR signal. Flowever in the mixed- 
valence state, Pe“-Fe“‘, the Fe“ (5 = 2) and Fe“' (S = ^/ 2 ) 


couple to give a ground state 5 = Vi, which is EPR- 
detectable at low temperatures (Fig. 2a). Once again at 
higher temperatures excited states S =3/2.. .9/2 become 
populated. The existence of such dimers is revealed if 
the metal ions have a nuclear spin, in which case the 
resulting EPR spectrum shows hyperfine interaction 
with both nuclei. For example, the Cu*-Cu^^ dimeric 
Cua centers of nitrous oxide reductase and cytochrome 
oxidase show complex splittings due to two copper 
nuclei, as revealed by multifrequency EPR (Kroneck 
et al. 1990). The “multiline” spectrum of the oxygen- 
evolving complex of Photosystem II results from 
a MrqCaCl cluster (Britt et al. 2004). 

Weaker spin-spin coupling occurs where two or 
more paramagnetic centers are distributed in 
a macromolecule. Under such conditions, the spectra 
may be perturbed and show enhanced electron-spin 
relaxation. They provide information about distances 
between the centers and relative orientations. Electron 
transfer may be observed at distances up to 0.14 nm, 
but longer distances may be observed by DEER/ 
PELDOR (Section ► DEER of Metalloproteins). 

Iron-Sulfur Proteins 

These are a widespread group of metalloproteins, dis¬ 
covered in the 1970s by EPR spectroscopy (Beinert 
et al. 1997). They contain clusters of iron and sulfide, 
bound usually to cysteine sulfurs from the protein. 
There is strong exchange coupling between the spins 
on the iron atoms, and they give EPR spectra in oxida¬ 
tion levels where the total of d electrons on the iron 
atoms is an odd number. The [2Pe-2S] (Fig. 2a) and 
[4Fe-4S] clusters give EPR spectra in the reduced (-1-1) 
state, with characteristic g-factors less than 2; [3Pe-4S] 
clusters also give signals at g = 2.01 in the oxidized 
state. The clusters are major components of 
the electron-transport chains of mitochondria, chloro- 
plasts, and bacteria (Cammack and MacMillan 2010). 

Photosynthetic Reaction Centers 

EPR spectroscopy and related techniques lend them¬ 
selves well to the primary reactions of photosynthesis, 
in which single electrons are ejected from 
a photoexcited donor molecule, producing a donor cat¬ 
ion radical and a reduced acceptor. The photochemical 
reaction in photosynthetic reaction centers may be 
observed in the EPR cavity, even at cryogenic temper¬ 
atures. Long before the biochemical structures were 
determined, there was evidence that the primary donor 
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EPR and ENDOR of Radicals and Metalloproteins in 
Biological Systems, Fig. 4 Spectra of S = 5/2 high-spin 
nonherae iron proteins, recorded at 12 K: (a) human serum 
transferrin (sample provided by Prof R.W. Evans); 
(b) Pyrococcus furiosus rubredoxin (sample provided by 
Prof M.W.W. Adams); (c) Fe-superoxide dismutase from 


Methanococcus thermolithotrophicus (sample provided by Dr 
E.C. Hatchikian); (e) S = 3/2 nitrosyl Fell in benzene 
dioxygenase (sample provided by Dr C. Bagneris); and 
(d) a ferriheme protein, horse metmyoglobin;. Vertical scales 
are arbitrary 


comprises a pair of chlorophyll molecules. The 
linewidth of the EPR spectrum of the donor chlorophyll 
radical is 1/V2 of those of monomeric chlorophylls, and 
the hyperhne coupling constants as measured by 
ENDOR are half those of isolated chlorophylls; both 
effects are expected if the unpaired electron of the free 
radical is delocalized equally over two chlorophylls. The 
reaction centers are in fact more complex than this, and 
there is a chain of electron carriers from the primary 
acceptor to a final acceptor: a quinone in Photosystem II 
and bacterial reaction centers (Chatterjee et al. 2010), or 
an iron-sulfur cluster in Photosystem I (Golbeck 2006). 
This serves to prevent the transfer back of the electron 
from the acceptor to the donor; the overall efficiency of 
electron transfer is near to 100%. These systems can be 
explored by CW and pulsed EPR, without the need for 
mixing with reductants and freeze-quench of samples. 
Transient radicals in photosynthetic reaction centers 
may be generated by a laser flash and observed by pulsed 
EPR. For primary reactions, the radical recombines rap¬ 
idly and reversibly, and an EPR spectrum may be accu¬ 
mulated from multiple flashes, while sweeping the 
magnetic field (Lendzian et al. 2003). The primary 


paramagnetic species are often spin-polarized triplet 
states or radical pairs (van der Est 2009), which give 
strong EPR signals because the spin-up and spin-down 
populations and A/p. (see Eq 2 in Section ► EPR 
Spectroscopy: General Principles) are far from thermal 
equilibrium. 

Nickel-Iron Hydrogenase 

Hydrogenases are metalloenzymes, usually containing 
iron-sulfur clusters. They catalyze the production and 
consumption of H 2 ,reactions of great importance in 
microbial metabolism and potentially in bio-energy pro¬ 
duction. There are two major types of hydrogenases, the 
[NiFe]-hydrogenases and the [FeFe]-hydrogenases, 
each containing a unique catalytic center. The occur¬ 
rence of nickel in the [NiFe]-hydrogenases was first 
determined by growing the organism on °*Ni (I = 3/2), 
leading to a splitting of the EPR spectmm into four 
lines (Fig. 5a). The stracture of the Fe-Ni dinuclear 
center (Fig. 5e) was revealed by x-ray crystallography. 
EMR techniques remain the method of choice to study 
the metal centers and their interactions with protons, 
which are not directly detectable in x-ray crystal 
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EPR and ENDOR of Radicals and Metalloproteins in 
Biological Systems, Fig. 5 EPR spectra of NiFe hydrogenases 
(a) spectra in the resting (Ni-A) state of hydrogenase from 
Desulfovihrio gigas, and from cells grown on ‘’’Ni isotope (sam¬ 
ples provided by Dr E.C. Hatchikian); (b) D. gigas hydrogenase 
in the active oxidized (Ni-B) state, hydrogen-reduced (Ni-C 
state), photolysed at temperature 20 K (Ni-L state) (Cammack 
et al. 1987. (c) EPR crystal rotation diagram for the three prin¬ 
cipal axes of hydrogenase from D. vulgaris (Miyazaki) in the 
Ni-C state; (d) HYSCORE of D. vulgaris (Miyazaki) in the Ni-C 
state; (Reprinted with permission from Foerster S et al. (2005). 


An orientation-selected ENDOR and FIYSCORE study of the 
Ni-C active state of Desulfovihrio vulgaris Miyazaki 
F hydrogenase. Journal of Biological Inorganic Chemistry. 
Copyright © 2011, Springer Berlin/FIeidelberg); (e) and (f) 
proposed structures for the NiFe center in the Ni-C and Ni-L 
states (Reprinted with permission, from Foerster S et al. (2003). 
Single Crystal EPR Studies of the Reduced Active Site of [NiFe] 
Flydrogenase from Desulfovihrio vulgaris Miyazaki F. Journal 
of the American Chemical Society. Copyright © 2011, Ameri¬ 
can Chemical Society) 
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structures. The iron atom is probably low-spin Fe^\ and 
^^Fe ENDOR spectroscopy showed a weak electron 
density on the iron atom (Huyett et al. 1997). This 
unusual chemistry was attributed to the carbonyl and 
cyanide ligands, which were discovered by FTIR and 
are a unique feature of hydrogenases. EPR spectra of 
samples frozen during reduction showed different coor¬ 
dination and oxidation states, the resting state Ni-A 
(Eig. 5a), the oxidized active state Ni-B, and the inter¬ 
mediate state Ni-C (Eig. 5a), which are relevant to the 
catalytic cycle of the enzyme. To hnd the location of the 
hydrogen atoms of the substrate, the enzyme was trans¬ 
ferred from *H20 to ^H20, when the Ni-C EPR spec¬ 
trum from the active enzyme became narrower. This is 
consistent with the smaller magnetic moment of ^F1 than 
of 'F1, and shows that an exchangeable proton is bound 
in the vicinity of the nickel. Another form of the center 
that is relevant to the mechanism, Ni-L, is produced by 
photolysis of Ni-C in the frozen state, which displaces 
the proton from the center. More details of the mecha¬ 
nism of proton binding emerged from detailed single- 
crystal ENDOR studies of the hydrogenase, which 
established the coordinates of the g-matrix and the 
hyperfine coupling (Eig. 5c). Pulsed ENDOR and 
HYSCORE studies (Fig. 5d), supported by density func¬ 
tional calculations, were consistent with the present 
view that the proton is as a bridging ligand between 
the Ni and Fe atoms during the reaction cycle. 

Nitrogenase 

Nitrogenase catalyzes the difficult six-electron reduc¬ 
tion of dinitrogen to ammonia, simultaneously releasing 
some H 2 . Hoffman et al. (2009) have described it as the 
“Everest of enzymes,” in recognition of its importance 
and the complexity of the reaction it catalyzes. It com¬ 
prises two proteins: the Ee protein, which contains 
a [4Ee-4S] cluster and ATP hydrolysis sites, and the 
MoFe protein, which contains two complex iron-sulfur 
clusters, the P-cluster and the MoEe cluster; the latter is 
the site where N 2 is reduced (Eig. 6). The steps of the 
reaction cycle were elucidated by a combination of 
stopped-flow kinetics and freeze-quench EPR 
(Thomeley and Lowe 1984). During these steps, the 
clusters give rise to a number of different EPR signals, 
examples of which are shown in Pig. 6b. However, the 
chemical states involved in the cycle, and the location of 
the reduction site in the cluster, remained obscure. 
Recently promising results have been obtained from 
Azotohacter vinelandii MoPe protein containing the 



EPR and ENDOR of Radicals and Metalloproteins in 
Biological Systems, Fig. 6 (a) Structure of the MoFe complex 
in nitrogenase. (b) EPR spectra of nitrogenase MoFe protein 
from Azotohacter vinelandii, in the resting state, without sub¬ 
strate, and frozen during turnover with N2 (Reprinted with 
permission from Fioffman BM et al. (2009). Climbing Nitroge¬ 
nase: Toward a Mechanism of Enzymatic Nitrogen Fixation. 
Accounts of Chemical Research. Copyright © 2011, American 
Chemical Society; Barney BM et al. (2009). Trapping an Inter¬ 
mediate of Dinitrogen (N2) Reduction on Nitrogenase). Bio¬ 
chemistry. Copyright © 2011, American Chemical Society 

amino acid substitutions R195Gln, which inhibits pro¬ 
ton delivery to substrates and R70Ala, which modulates 
substrate accessibility to the reduction site (Hoffman 
et al. 2009). The MoFe protein in the resting state 
shows a signal indicative of an 5 = 3/2 state (Fig. 6b). 
A combination of spectroscopic and genetic approaches 
is revealing the intermediate stages of the reaction. 
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Summary 

Paramagnetic centers are essential to the function 
of many enzymes and biological electron-transfer 
systems. They include protein-based radicals, 
semiquinone forms of cofactors such as flavin, 
and transition-metal centers containing iron, cop¬ 
per, manganese, cobalt, nickel, and molybdenum. 
EPR is established as the method of choice to 
examine electronic structure and molecular mech¬ 
anisms of such centers. Iron proteins are particu¬ 
larly fruitful subjects for study, with a rich variety 
of different types of spectra. The environment of 
the metal centers may be explored by hyperfine 
interactions with ligands and substrates. EPR offers 
insights into the primary reactions of photosynthe¬ 
sis, which involve electron-transfer processes. 
Increasingly advanced pulsed techniques such as 
HYSCORE and double resonance are proving 
their value in establishing electronic structures 
and molecular mechanisms. 
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► Peroxidases 

► Photosynthetic Electron Transport 
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Abbreviations 


CW 

EMR 

ENDOR 

EPR 

ESEEM 

ESR 

PELDOR 

TRIPLE 


Continuous-wave 
Electron magnetic resonance 
Electron-nuclear double resonance 
Electron paramagnetic resonance 
Electron spin-echo envelope modulation 
Electron spin resonance 
Pulsed electron-electron double resonance 
Electron-nuclear triple resonance 


Synonyms 

Electron paramagnetic resonance spectroscopy; Elec¬ 
tron spin resonance spectroscopy; EPR 


Definition 

Electron paramagnetic resonance spectroscopy is 
a magnetic resonance technique that uses microwaves 
to observe the structure and dynamics of paramagnetic 
centers, such as radicals and transition-metal ions. 


Introduction 

Electron paramagnetic resonance (EPR) spectroscopy 
was developed during the exploration of fundamental 
questions of quantum physics and chemistry. Like all 
other forms of spectroscopy, continuous-wave EPR 
detects changes in the energy state of a species upon 
the absorption of incident electromagnetic radiation. In 
this case the excitable species is an unpaired electron. An 
introduction to the EPR phenomenon is provided by 
Weil and Bolton (2007), Hagen (2009), and Brustolon 
and Giamello (2009). An historical introduction to EPR, 
with papers by some of the original discoverers, is edited 
by Eaton et al. (1997), and applications to biomedicine 
have been reviewed by Eaton et al. (2005). EPR and its 
advanced developments have found many applications 
in biophysics and biochemistry, as a means of exploring 
paramagnetic molecules. EPR is a selective technique; 
since most materials are diamagnetic, their electrons are 
paired in atomic or molecular orbitals. The molecules of 
biological systems that can be observed by EPR are 
organic free radicals, produced by chemical oxidation- 
reduction reactions or by irradiation, and transition- 
metal ions, such as Mn, Ee, Co, Cu, and Mo in their 
paramagnetic oxidation states. Thus it provides informa¬ 
tion about oxidation states. Paramagnetic centers occur 
widely in nature, and are important components of the 
active sites of enzymes and other metalloproteins. Often 
they are at the heart of biological processes such as 
photosynthesis, respiration, and enzyme catalysis. EPR 
experiments beneht from parallel studies using other 
spectroscopic methods including NMR, Mossbauer, 
magnetic circular dichroism, and FTIR spectroscopy. 
When molecular structures are known from x-ray 
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crystallography, further information can be obtained 
from interactions between electron spins and protons. 

The general, the principle of EPR is similar to NMR 
spectroscopy, but it differs in the range of its practical 
applications because electron spins are much rarer than 
nuclear spins in biological materials, because the elec¬ 
tron spin has a much greater magnetic moment and 
because it relaxes more rapidly than nuclear spins. As 
a result the typical EPR experiment differs from NMR. 
An important restriction for most transition metal com¬ 
plexes is that the sample must be studied at cryogenic 
temperatures, hence in the frozen state. This entry out¬ 
lines the principles and practice of continuous-wave 
EPR and ENDOR spectroscopy. More advanced time- 
domain methods have been developed that are analo¬ 
gous to those in pulsed NMR (Schweiger and Jeschke 
2001); in all, over 50 distinct types of measurement may 
have been described, which may be encompassed into a 
wider term as “electron magnetic resonance,” or EMR. 
Further details about pulsed methods are described in 
the entry “► Pulsed EPR; Principles and Examples of 
Applications to Hemeproteins” by I. Garcia-Rubio. The 
finest details of electronic structure have been obtained 
from examination of purified biochemical species by 
these advanced EPR techniques. 

EPR in Biology 

The many applications of EPR spectroscopy include 
identifying new biochemical entities, determining 
structural features of paramagnetic centers and the 
proteins that contain them, and elucidating molecular 
mechanisms. EPR spectra of radicals in solution are 
sensitive to motion, which is the basis of the spin-label 
technique. EPR can be applied to observe naturally 
occurring paramagnets in living tissue or other 
complex mixtures, though more detailed information 
can be derived from isolated proteins or other com¬ 
pounds. By freezing of transient intermediates or 
photolytically induced states, EPR spectroscopy 
can provide structural information about enzyme 
intermediates that are inaccessible to crystallography. 

The simplest applications of EPR spectroscopy are 
analogous to UV/visible spectrophotometry, to measure 
the amount of the paramagnetic material in a sample. 
Reaction kinetics can be measured by mixing reagents 
and freezing after an elapsed time (Belevich et al. 2009). 
Midpoint redox potentials can be determined by quanti¬ 
fying EPR signals from samples frozen at measured 
oxidation-reduction potentials (Cammack 1995). 


The power of the technique is greatly enhanced by the 
ability to observe magnetic interactions. The EPR spec¬ 
trum of a paramagnetic center is sensitive to spin-spin 
interactions between unpaired electrons and nuclear 
spins (the hyperfine interaction) and between unpaired 
electrons and other paramagnetic centers. These interac¬ 
tions are stronger than those between protons in NMR, 
since the magnetic moment of the electron is much 
greater than those of nuclear spins. The hyperfine inter¬ 
action is used for the identification of species detected by 
EPR; if a spectrum is split or broadened by substitution 
with a nucleus having a particular spin, for example, 
^^Fe, ®^Ni, this is proof positive that the element is 
present and interacting with the paramagnet. 

Principles of EPR 

Continuous-Wave EPR 

EPR may be introduced in terms of a semiclassical 
description, similar to that used in textbooks of 
NMR. The electrons are considered as behaving like 
spinning electric charges, interacting with the static 
and oscillating magnetic fields, as described by the 
Bloch equations. This theory introduced the funda¬ 
mental concepts of magnetic resonance spectroscopy, 
such as the relaxation times Ti and T 2 . 

Figure 1 represents a typical biological sample 
containing a free radical (5 = 1/2). If the unpaired 
electrons are separated from each other by other 
diamagnetic molecules, they can be assumed not to 
interact with each other, and their spins are randomly 
orientated. An applied magnetic field Bq imparts 
a direction, known as the axis of quantization, relative 
to which the magnetic moments of the electrons are 
aligned. The electron spin will process about this axis 
at the Larmor frequency. The electron may only be 
observed in one of two possible energy states, a and fS, 
corresponding to alignment of the electron spins with 
(parallel to) or against (antiparallel to) the Bq field. 

Because of the Zeeman interaction, these two states 
have different energy levels separated by; 

( 1 ) 

The basic CW-EPR spectrum may be considered as 
a method of scanning for signals from spins with dif¬ 
ferent values of g. The g-factor or g-value is considered 
to be a characteristic parameter of the spin system. At 
temperature T, a thermal equilibrium will be set up 
with a Boltzmann distribution of spin populations; 
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EPR Spectroscopy: General Principles, Fig. 1 Boltzmann 
distribution, stimulated absorption and emission, and T i relaxa¬ 
tion process 


A^a/A^/i = exp(g^BBo/^7’) (2) 

where is the Bohr magneton. When the microwave 
frequency matches the Larmor frequency (the condi¬ 
tion of resonance), the rotating component of the 
Bi microwave field drives spin-flips that tend to equal¬ 
ize the populations and Np. This causes resonant 
absorption of microwave energy, which the 
spectrometer is set up to detect (Fig. 2). Continuous 
absorption is made possible by transfer of thermal 
energy of the spin system to its surroundings (the 
lattice), at a rate that is characterized by the 
spin-lattice relaxation time, Ty (Fig. 3). 

The fraction NJNp, and hence the EPR signal, is 
enhanced and improved at higher magnetic fields and 
at low sample temperatures. At low temperature, Ti 
becomes very long, so the relaxation rate cannot main¬ 
tain the NJNp ratio and the signal decreases. This 
effect is more pronounced with increasing microwave 
power, a situation known as microwave power satura¬ 
tion. The saturation characteristics of paramagnets 
have been used to estimate Ti and the spin-spin 
relaxation time, T 2 . 


The Continuous-Wave EPR Spectrometer 

There are many variations on the EPR experiment, but 
the CW EPR instrument operating at X-band fre¬ 
quency (Czoch and Francik 1989; Poole and Farach 
1999), is the workhorse of EPR studies in Chemistry and 
Biology. This contrasts with NMR, where pulsed 
methods have superseded CW almost completely. This 
is partly because electron-spin relaxation times are much 
faster than for nuclei, leading to enhanced relaxation 
broadening, so that most types of transition-metal ion 
are only detectable at cryogenic temperature. The situa¬ 
tion is analogous to solid-state NMR; the anisotropy of 
parameters such as g and A is not averaged by motion as 
in solution NMR or spin-label EPR. The larger magnetic 
moment of the electron means that EPR signals may 
often be detected at lower concentrations than for NMR. 

The basic CW instrument (Pig. 2) is designed to 
optimize the weak magnetic resonance signal. Micro- 
wave radiation sets up a standing wave in the cavity, 
concentrating the microwave held at the sample 
(Fig. 3), by the quality factor Q, which enhances the 
power at the sample typically 6,000-fold. 

Q = 27r(Energy stored)/(Energy dissipated per cycle) 

(3) 

Because the cavity restricts the microwave fre¬ 
quency V to its resonant frequency, in order to obtain 
a spectrum over a range of values of g, the variable 
that is swept is the Bq field. The instrument uses 
modulation of the Bq held at a frequency of 
100 kHz, with phase-sensitive signal detection. 
A lock-in amplifier selects only the part of the output 
signal that is oscillating in phase with the field mod¬ 
ulation, which greatly improves the signal to noise 
ratio. As a result, the EPR spectrum is output as the 
first derivative (first harmonic) of the EPR absorp¬ 
tion with respect to the Bq field. This presenta¬ 
tion emphasizes details of the line shape, although 
it tends to exaggerate narrow-line features at 
the expense of broad signals, which may be equally 
important but easily overlooked. The conventional 
absorption spectrum can be derived from this 
by numerical integration, though the first- 
derivative presentation is widely used. Thus the 
first-derivative spectrum and the field-swept mode 
in the CW experiment arise from features which 
enhance instrumental sensitivity, but which intro¬ 
duce distortions compared with a conventional 
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EPR Spectroscopy: General 
Principles, Fig. 2 Principal 
features of an EPR 
spectrometer. It incorporates 
a resonant microwave circuit, 
described as a microwave 
bridge. The microwave source 
is a Gunn diode or reflex 
klystron. Perturbation of the 
microwave radiation in 
resonance with paramagnetic 
material in the cavity is 
measured by a diode detector 


Microwave 

bridge 



frequency-swept spectrum, and which have to be 
corrected when spectra are simulated or integrated 
for spin quantitation (Pilbrow 1990). 

The majority of EPR spectrometers use microwave 
radiation at X-band frequency, about 9 GHz (see 
Table 1). The cavity is located between the poles of 
an electromagnet which supplies the homogeneous 
static magnetic field, Bq (Fig. 2). Additional coils 
around the cavity provide the oscillating magnetic 
field that is used for phase detection. For temperature 
control, nitrogen or helium gas flows vertically around 
the sample. For aqueous samples, to minimize dielec¬ 
tric losses, the sample is positioned in the cavity for 
a minimum £-field (the electric vector of the micro- 
wave field) and maximum (the magnetic vector). In 
the rectangular cavity this is a plane in the center of 
the cavity, and a thin flat quartz sample holder cell 
is used for aqueous samples (Fig. 3a). An alternative 
for such lossy samples is the loop-gap resonator 
(Fig. 3b). 

Pulsed EPR methods are increasingly being devel¬ 
oped for specialist applications, as described below. 
Unlike NMR, high-resolution spectra cannot be 
acquired by Fourier transformation of a free-induction 
decay (FID), because of the much greater spectral 
bandwidth and shorter phase memory times in EPR. 
A field-swept echo experiment, in which the electron 
spin-echo is measured while the Bq field is swept, 
provides the equivalent spectrum to the CW-EPR 
experiment, but in EPR absorption rather than the 
derivative spectrum. Higher magnetic helds have spe¬ 
cialist applications in EPR; although higher fields do 
not give the same general improvements in sensitivity 


as in NMR. The enhanced Boltzmann factor is offset 
by lower efficiency of resonators. In addition, some 
of the principal causes of line-broadening, such as 
g-strain, scale with the Bq field, so resolution does not 
necessarily improve (Table 2). 

Information from EPR Spectra 

Spin Concentration 

The EPR spectrum is a quantitative measure of the 
concentration of electron spins. The intensity may be 
measured by double integration of the first-harmonic 
spectrum and comparison with a standard sample of 
known concentration, such as a nitroxide or copper 
solution. Certain conditions must be met, for example, 
measurement of the spectra at a non-saturating 
microwave power level. 

Hyperfine Coupling 

The interaction between the magnetic moment of the 
electron and nuclear spins is a valuable feature of EPR 
spectra. It introduces splittings of the electron energy 
levels (Fig. 4). Each nuclear spin I induces a splitting 
into [2/ -I- 1] levels. The magnitude of the splitting is 
defined by the hyperfine coupling constant. A, which is 
expressed in energy units (usually MHz). Where there 
are several nuclei, for example, protons in a radical, 
each level is further split by each nuclear hyperfine 
interaction. The hyperfine coupling can be observed as 
splitting of the EPR spectrum, unless the coupling is 
less than the linewidth. The magnitude of the splitting 
is given the symbol a, expressed in magnetic field 
unifs. Weak hyperfine inferacfions may be absorbed 
by ENDOR. 
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EPR Spectroscopy: General Principles, Fig. 3 EPR resona¬ 
tors (a) rectangular microwave cavity. For photochemically 
generated species, the sample may be irradiated with visible or 
UV radiation through a grid, which retains the microwave field. 
For CW ENDOR, another coil around the sample provides the 
radiofrequency field B 2 which is scanned to bring nuclei into 
resonance, (b) Cylindrical resonator. The design shown is 
a loop-gap, or split-ring resonator. Flere, is maximum along 
the cylindrical axis, and the £-field is confined to a gap (Froncisz 
and Flyde 1982), so that aqueous samples can be measured. 
Other designs use a cylinder of a dielectric such as sapphire as 
the resonant element. These may be cooled by immersion in 
liquid helium 


EPR Spectroscopy: General Principles, Table 1 Microwave 
frequencies commonly used in EPR spectroscopy (values 
approximate) 


Band 

Frequency, 

GHz 

Wavelength, 

mm 

Resonant field for 
g = 2, mT 

S-band 

3 

100 

110 

X-band 

9.5 

32 

340 

Q-band 

35 

8.5 

1,250 

W-band 

96 

3.1 

3,220 


EPR Spectroscopy: General Principles, Table 2 Energy 
terms in the spin Flamiltonian for a transition-metal ion 


Interaction 

Hamiltonian term 

Scales with 
applied field? 

Zeeman 

interaction 


Yes 

Nuclear 

Zeeman 

interaction 

= Zj Sn I 

Yes 

Hyperfine 

coupling 

3t=I.jIA.S 

No 

Quadrupole 

coupling 

3l='ZjLP.I 

No 

Axial zero- 
field splitting 

.« = D[5/- 1/3 S(5-t 1)1 

No 

Rhombic zero- 
field splitting 

31 = BID - S/) 

No 

Electron spin- 
spin coupling 

3€ — S\Dy2^1 

No 


In general, for a nucleus with spin /, the allowed 
transitions give rise to (2/ -i-1) absorption lines of equal 
intensity. The resonance lines are given by 

AE — hv = gfigBo ± Aniijl (4) 

Interactions of a free electron with several nuclear 
spins will split the resonant lines further. With more 
nuclear spins in the radical, the spectrum can soon 
become very complicated. It can be simplified by the 
use of ENDOR, in which the splittings from multiple 
nuclear spins are additive, rather than multiplicative. 

From Eq. 4 it can be seen that the effects of g and A 
can be distinguished by measurements at different 
microwave frequencies such as X-band and Q-band. 
Shifts in line positions due to g-factor differences are 
proportional to frequency, whereas the line separations 
due to hyperfine interactions are, to first order, 
independent of the spectrometer microwave frequency. 

The hyperfine interaction can be considered as the 
sum of two processes. The first is the dipolar interac¬ 
tion, which is equivalent to the interaction through 
space between two magnetic dipoles. This is an aniso¬ 
tropic process and can provide details of molecular 
structure in spectra of solid samples. The strength of 
the interaction varies as (1—3cos^6)/;-^, where 9 is the 
angle between the inter-spin vector and the Bq field, 
and r is the distance between the electron and nuclear 
magnetic moments. For radicals that are tumbling 
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Applied field Bq 


EPR Spectroscopy: General Principles, Fig. 4 A. Energy 
levels and spin transitions for a radical containing an / = 1 
nucleus, such as a nitroxide, in a magnetic field Bq. At low fields, 
where the hyperflne and Zeeman interactions are of eomparable 
magnitude, there is a noticeable deviation from linear behavior. 
Note the convention that the value of the hyperfine coupling A is 
expressed in energy units, but the hyperfine splitting a in the 
spectrum is in magnetic field units (mT) 

rapidly in solution, the anisotropic hyperfine coupling 
averages to zero. The second contribution is the Fermi 
contact interaction, which is isotropic and takes place 
through electron orbitals. Only electrons in the s 
orbitals have a finite electron density at the nucleus; 
other electrons, for example, in ti orbitals, are coupled 
through exchange interactions with s electrons, 
a process known as spin polarization. In rapid motion 
it does not average to zero; hence, for example, spectra 
of nitroxide spin labels always show a splitting 
into three lines. The contact interaction can be used to 
estimate the unpaired electron density distribution over 
a radical or metal complex. 

For a transition-metal center in an enzyme, the 
hyperfine interaction gives rise to characteristic hyper¬ 
fine splittings, which can be used to determine the 
following: 

• The type of metal present, for example, copper 
(/ = 3/2, splitting into four lines) and manganese 
(/ = 5/2, six lines); isotopic substitution can be 
used to identify specific metals, for example, ^^Fe 
(/ = 1/2) for natural iron. 

• The ligands: hyperfine couplings to ligands, some¬ 
times called superhyperfine splittings, allow 


determination of the electron density distribution 
in transition-metal complexes. Nuclei with I > 
1/2, such as I — have a nuclear electric 
quadrupole moment and this further shifts the 
energy levels. 

• Other substrates in the vicinity. 

• Exchangeable protons in the active site, by 
exchange of ^H 20 for *H 20 . 

Interactions Between Unpaired Electrons 

Dipolar and exchange spin-spin interactions between 
electrons provide another dimension to EPR. When 
two or more electron spins are present in a paramag¬ 
netic center, such as a transition ion with spin 5 > 1/2, 
or an organic triplet state (5 = 1), electrostatic inter¬ 
action between the electrons gives rise to fine struc¬ 
ture splittings. These are known as zero-field splittings, 
since they occur in the absence of a Bq field, and they 
strongly influence the apparent g-factor of the EPR 
signal. If S is an integral value, the splitting between 
the energy levels may be larger than the X-band micro- 
wave quantum, in which case no EPR signal is 
detected. However, for an odd number of electron 
spins there are always energy levels that can only be 
split by the magnetic field. As a consequence, para- 
magnets with half-integral spin, such as high-spin Ee*^^ 
and Co^\ are EPR-detectable. Eor these ions, with 
S — 5/2, effective g-factors may extend from near 0 to 
10. In the solid state, the spectrum is broadened by 
anisotropy of the zero-field splittings. 

Electron spin-spin interactions occur between 
paramagnetic centers, in solution and in the solid 
state, and provide information about distances. As 
with hyperfine interactions, they have dipolar and 
exchange terms, and they cause splitting or broaden¬ 
ing of the EPR signal. Over short distances, for solid 
paramagnetic materials, such as the ferrihydrite cores 
in ferritin, the broadening may be so great as to render 
the X-band EPR spectrum undetectable. Therefore, 
for EPR spectra in the solid state, the paramagnets 
must be magnetically dilute, for example, doped into 
a diamagnetic host material; transition ions in pro¬ 
teins usually fulfill this criterion. Eor longer dis¬ 
tances, spin-spin interactions between paramagnets 
in proteins give rise to characteristic changes in line 
shape, and enhancement of the electron-spin relaxa¬ 
tion rate, from which it is possible to estimate dis¬ 
tances between the paramagnetic centers (Eaton et al. 
2001). Dipolar electron spin-spin interactions can 
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be detected over distances up to 8 nm, using 
PELDOR (see entry “► DEER of metalloproteins,” 
by J. Harmer). 

Lineshapes 

An assembly of electron spins that all resonate at the 
same microwave frequency, known as a spin packet, 
will have an inherent EPR linewidth, due to dynamic 
processes such as uncertainty broadening, and the 
effects of fluctuating local fields. This is known as 
homogeneous broadening', it gives an inherent 
Lorentzian lineshape with a width that is inversely 
proportional to Ty. As Ty increases with temperature, 
the EPR linewidth may become so great that the 
absorption lines are undetectable; this often happens 
with ions such as high-spin Ee**^ at room temperature. 

Different spin packets have slightly different reso¬ 
nant frequencies, due to local variations in the static 
fleld and hyperflne couplings. These random effects 
give rise to a Gaussian broadening of the line, known 
as inhomogeneous broadening. An important effect, in 
the solid state, is the anisotropy of spectroscopic 
parameters such as g. Since the electronic structure 
around paramagnetic centers in biological materials 
is typically of low symmetry, the observed g-factor 
for a paramagnet depends on the orientation of the 
molecule relative to the Bq field. The g-matrix has 
principal axes g^, gy, and g^ (Eig. 5). The spectrum of 
a frozen sample of randomly oriented molecules, 
termed a powder spectrum, gives rise to a characteris¬ 
tic rhombic lineshape. This is principally the result 
of the geometry of the situation and is commonly 
seen in EPR spectra. Other spectroscopic parameters 
such as the hyperflne coupling A and quadrupole cou¬ 
pling r are also anisotropic. Each will have its own 
principal axis system, which in general is not collinear 
with the principal axes of g. Einally, in biological 
samples such as frozen solutions of proteins, the values 
of g. A, and other spectroscopic parameters are subject 
to local distortions, known as strain broadening 
(Hagen 1981). 

The resulting lineshapes can be fitted by spectral 
simulation, as described in the entry “► Simulation of 
transition-metal EPR spectra,” by Stoll, to estimate the 
spectroscopic parameters of the spin system. These 
parameters reflect the electronic structure of the 
paramagnetic center, and if they can be estimated by 
simulation, they can be used in molecular orbital 
calculations. 


The Spin Hamiltonian 

The spin Hamiltonian provides a quantum-mechanical 
formalism, in which the relevant anisotropic energy 
terms can be combined. The transitions between the 
energy levels (eigenstates) are calculated by matrix 
methods. For example, a Hamiltonian which is appli¬ 
cable to a mononuclear iron centers comprises the sum 
of a number of terms including the electronic Zeeman 
interaction, the nuclear Zeeman term gp^B^S, the 
hyperflne coupling tensor A, and the axial and rhombic 
zero-field tensors, D and E. The spin operator S used in 
the spin Hamiltonian is an “effective” spin, reflecting 
the number of energy levels involved in the EPR 
spectrum and does not necessarily refer to the true 
electron spin quantum number as is the case, for exam¬ 
ple, for the Cu^"^ ion. The interpretation of spectra by 
simulation is further discussed in the entry by Stoll. 

EMR Spectroscopy of Oriented Samples and Single 
Crystals 

The principal components of the g and A- tensors may 
be observed directly in a single crystal, where all the 
molecules are aligned. By rotating the crystal in the 
magnetic field with a goniometer, it is possible to 
determine the components of g relative to the crystal 
axes. When the Bq field is aligned along the x-axis 
(cf. Fig. 5a), a narrow line is observed at a field 
corresponding to gx> and so on. Such crystals have 
been obtained from free radicals and proteins that 
contain transition metal ions. Partially oriented sam¬ 
ples of membrane proteins may be obtained by drying 
down multilayer vesicles on a surface. Molecules 
oriented at different angles to the magnetic field will 
resonate at different values of Bq. An example, show¬ 
ing the EPR spectra of the Ni-B and Ni-C states of 
hydrogenase, is shown in the entry “►EPR and 
ENDOR of Radicals and Metalloproteins in Biological 
Systems” by Cammack. For a spectrum with rhombic g 
and narrow linewidth, the extreme g-factors can be 
read off; the precise values of gx> gy, and gx and the 
ambiguities of sign, can be estimated by simulation. 

Multi-frequency and High-Field EPR 
EPR spectroscopic parameters may be divided into 
those that scale with microwave frequency, such as 
the Zeeman and nuclear Zeeman interactions, and 
those that do not, such as hyperflne splittings and 
electron spin-spin interactions (Table 2). Overlapping 
spectra recorded at one frequency may be resolved at 
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EPR Spectroscopy: General 
Principles, Fig. 5 (a) The 

powder spectrum represents 
the prohability distribution of 
the magnetic field vector over 
the surface of a sphere; 

(b) absorption spectrum; 

(c) first derivative 



Other microwave frequencies (Table 1). Each signih- 
cant change in frequency requires the use of a different 
microwave bridge and cavity. Higher magnetic helds 
enhance the Boltzmann distribution leading to stronger 
signals, as in NMR, although smaller cavity volumes 
lead to fewer spins in the sample. Higher magnetic 
fields are essential to observe spectra of transition- 
metal centers with larger zero-field splittings, when 
the X-band microwave quantum is insufficient to 
span the transitions between energy levels. The most 
important developments have emerged from the use of 
high-field ENDOR and related techniques, which 
greatly enhance the resolution of spectra and the obser¬ 
vation of spin-spin interactions (Mdbius et al. 2005) 

Continuous-Wave Electron-Nuclear Double Resonance 
(ENDOR) 

Electron-nuclear double resonance is a technique 
which measures hyperfine couplings to nuclear spins 
adjacent to the paramagnetic centre that are too small 
to be discerned under the linewidth of CW EPR spec¬ 
tra. It involves partially saturating the sample with 
microwave power at the resonant frequency, while 
scanning with radiofrequency at the nuclear frequen¬ 
cies of the coupled spins. The energy levels and tran¬ 
sitions relevant to the ENDOR measurement for an 
S = 1/2 state coupled to a nuclear spin I = 1/2 are 
illustrated in Eig. 6. The saturating microwave field 
tends to equalize the populations of the nis = =E 1/2 
levels, decreasing the magnitude of the EPR signal. 


The radiofrequency is swept, and when it is in reso¬ 
nance with the frequency Vn of an interacting nuclear 
spin, it provides further pathways for electron spin 
relaxation and changes the populations of energy 
levels; as a result, the EPR signal is enhanced. 

The resulting spectrum consists of a pair of lines. 
The positions of the lines depend on the relative mag¬ 
nitudes of the hyperfine coupling. A, and the nuclear 
Zeeman frequency, Vn, (the NMR frequency of the 
nucleus) (Pig. 6). Protons have a large nuclear 
moment, and when Vn > A/2, the lines will be centered 
at Vn and separated by A. Eor couplings to other nuclei, 
usually A/2 > Vn, and the ENDOR spectrum comprises 
two lines centered at A/2 and separated by 2 vn. Hyper¬ 
fine couplings of a few megahertz can be resolved and 
provide information about nuclei in the second coor¬ 
dination sphere, at distances up to 0.5 nm. 

Triple resonance, alternatively known by the 
descriptive term Double ENDOR, involves measure¬ 
ment of the rf-swept ENDOR spectrum while 
pumping with rf at a static nuclear frequency. When 
the pump frequency coincides with that of a nuclear 
frequency in the spectrum, lines of other nuclei that 
are coupled to it will change in intensity, becoming 
either larger or smaller, depending on the sign of A. 
In this way pairs of lines associated with a particular 
hyperfine interaction, may be identified in a complex 
spectrum. There are pulsed variants of the experi¬ 
ment (Schweiger and Jeschke 2001). Triple reso¬ 
nance also makes it possible to determine the sign 
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EPR Spectroscopy: General Principles, Fig. 6 


of A (+ve or —ve) which is of use in calculations of solution results from the statistical distribution of 
radical molecular orbitals (Rigby et al. 1994). anisotropic molecules with different orientations. 

For example, those molecules with gx aligned close 
Orientation Selection to Bq will resonate at one extreme of the envelope. 

From Fig. 5, it can be seen that the shape of an Therefore ENDOR or HYSCORE measured at this 
anisotropic lineshape in a spectrum of a frozen value of Bq will select the population of molecules 
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with that orientation. Spectra recorded at each posi¬ 
tion in the EPR envelope correspond to predictable 
populations of orientations, and in principle the whole 
three-dimensional dataset can be reconstructed by 
calculation. In practice it is difficult to achieve this 
unambiguously without some other data to fix some 
orientations, but in principle, spectra can be obtained 
from disordered samples that are equivalent to those 
from an oriented single crystal. 

The other interactions that affect the energy of the 
EPR transition, such as the hyperfine interaction and 
zero-field splitting may also be anisotropic. In the most 
general case the principal axes of these interactions will 
not coincide with the principal axes of g. Eor any given 
set of principal values of g,., gy, and gz; Ax, Ay and A^, 
etc., the powder spectrum may be simulated by a 
computer (see the entry “► Simulation of transition- 
metal EPR spectra,” by Stoll). The parameters can be 
adjusted to give the best fit to an experimental spectrum. 
This approach gives the maximum amount of informa¬ 
tion about the unpaired electron and its environment. 

Pulsed EPR 

An alternative to continuous-wave methods is to 
manipulate the electron energy levels by pulsed tech¬ 
niques, in an analogous way to NMR spectroscopy. 
Many different types of pulsed EPR experiment are 
possible, as described in the comprehensive treatise by 
Schweiger and Jeschke (2001). Some of these are 
analogous to those in NMR, others are unique to 
EPR. They use sequences of intense microwave pulses, 
followed by the detection of a spin echo or free- 
induction decay. Pulse sequences are shorter than in 
NMR, because of the short electron phase memory 
time, a parameter which is related to 7’2- Pulsed 
measurements on transition-metal ions are usually 
performed at liquid-helium temperatures. 

The layout of the pulsed microwave bridge is sim¬ 
ilar to that of the CW instrument (Pig. 2), except that 
a pulse former unit controls the microwave supply 
through diode switches. The microwave pulses, of 
duration 5-50 ns, are amplified by a travelling-wave 
tube amplifier (TWT), to a power up to 1 kW. The 
cavity has a lower Q-factor than for CW-EPR, in order 
to allow the microwave field to build up and decay 
more quickly. The ring-down time of the microwave 
circuit means that there is a “dead time” in which the 
echo cannot be detected, but this can be circumvented 
by suitable pulse sequences. 


Pulse sequences can estimate the spin-spin 
relaxation times, which can provide information 
about various phenomena including molecular motion 
and spin-spin interactions between paramagnets. T i is 
measured with an inversion-recovery experiment as in 
NMR, while is observed by a two-pulse echo 
sequence. The amplitude of the echo decreases with 
increasing delay time t, owing to loss of “phase mem¬ 
ory.” Pulsed EPR techniques (described in the entry by 
Garcia-Rubio) have been developed to detect weak 
hyperfine and quadrupolar couplings in the solid 
state, and are increasingly being applied to biological 
systems. Pulsed ENDOR uses a combination of micro- 
wave and radio frequencies to excite transitions of the 
electrons and coupled nuclei (Telser 2007). A plot of 
echo amplitude against rf yields a spectrum analogous 
to that of a continuous-wave ENDOR spectrometer, 
except that the spectra are in absorption mode rather 
than the first-derivative. Electron-spin envelope 
modulation (ESEEM) and the two-dimensional tech¬ 
nique hyperfine sublevel correlation spectroscopy 
(HYSCORE) (Hdfer et al. 1986) provide similar infor¬ 
mation, without the need for a radiofrequency source. 
They are complementary to ENDOR in that they give 
the best results in the intermediate regime of hyperfine 
couplings where A « Vn; also they are particularly 
sensitive to couplings with quadrupolar nuclei such 
as and ^'‘N. 

Definitions 

Continuous-wave EPR spectrometer: an instrument in 
which the sample is irradiated continuously with 
microwaves at a fixed frequency, while an applied 
magnetic field is progressively varied. The condition 
of resonance is observed as a perturbation or absorp¬ 
tion of the microwave field. 

EMR, or electron magnetic resonance: an umbrella 
term to describe all forms of spectroscopy that involve 
EPR detection. 

ENDOR, or electron-nuclear double resonance: 
a double resonance technique, essentially an EPR 
spectroscopy with NMR detection; it measures the 
hyperfine coupling between unpaired electrons and 
nuclear spins in a sample. 

Powder spectrum: the EPR spectrum of a sample 
comprising molecules that are randomly distributed, 
such as a frozen solution or a solid that is ground up 
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into a powder. The name derives from early studies 
that were carried out on paramagnets in crystals. 

Spin-lattice, or longitudinal relaxation time, Ti\ 
a time constant characterizing the decay of magnetiza¬ 
tion of a system of spins by thermal equilibration with 
the surroundings (lattice). 

Spin-spin, or transverse relaxation time, T 2 : a time 
constant characterizing the decay of spin coherence, or 
the component of magnetization perpendicular to Bq. 

Zeeman interaction: the splitting of a spectral line 
into several components in the presence of a static 
magnetic field. 

Summary 

Electron paramagnetic resonance (EPR) or Electron 
Spin Resonance (ESR) is an incisive method to 
observe paramagnetic centers in biology, including 
free radicals and complexes of transition metals such 
as iron and copper. It can be used to identify such 
species; it also provides structural information about 
proteins, through the dipolar interactions between the 
unpaired electron and nuclear and electron spins. The 
conventional spectrometer acquires a spectrum by 
means of a swept applied magnetic field and continu¬ 
ous microwave radiation. Cryogenic temperatures are 
necessary to observe most transition-metal centers. 
More advanced instruments use multiple frequencies 
to probe hyperfine and electron-electron interactions, 
and pulsed microwave and double-resonance tech¬ 
niques provide more selective control of magnetiza¬ 
tion. EPR of paramagnetic centers in proteins is 
a powerful adjunct to structural data from x-ray crys¬ 
tallography, since it can observe transient states that 
cannot be crystallized and observe interactions with 
electrons and protons, which are invisible in crystallo¬ 
graphic electron density maps. The ability to manipu¬ 
late protein structures in detail by mutagenesis has 
made it possible to dissect the structure and 
mechanisms of protein active sites. 

Cross-References 

► EPR and ENDOR of Radicals and Metalloproteins 
in Biological Systems 

► EPR Spectroscopy: General Principles 

► Paramagnetic Metalloproteins 


► Protein Dynamics and Conformational Change from 
Site-Directed Spin Labeling and Electron 
Paramagnetic Resonance Spectroscopy 

► Pulsed EPR: Principles and Examples of 
Applications to Hemeproteins 

► Simulation of Spin-Label EPR Spectra 

► Simulation of Transition-Metal EPR Spectra 

► Spin-Labeling EPR of Lipid Membranes 

► Spin Labeling of DNA and RNA 

► Spin-Trapping: Theory and Applications 
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Introduction 

Essential fattyacids (EEAs) are important constituents 
of all cell membranes and alter membrane fluidity, 
thereby determining and influencing the behavior of 
membrane-bound enzymes and receptors. EEAs are 
essential for survival of humans, and as they are not 
synthesized in the body, they have to be obtained in our 
diet (Das 2006). EEAs are of two types: the 03-6 series 
derived from c«-linoleic acid (LA, 18:2) and the co-3 
series derived from a-linolenic acid (ALA, 18:3). 
There is another sequence of fatty acids derived from 
oleic acid (OA, 18:1 03-9) that is not an EFA. The 03-9, 
03-6, and 03-3 series of fatty acids are metabolized by 
the same set of enzymes to their respective long-chain 
metabolites. The present discussion is centered on the 
EEAs: 03-6 LA and oi-3 ALA and their metabolites. In 
general, a majority of the actions of EEAs has been 
attributed to their conversion to eicosanoids, but in 
some instances the fatty acids themselves are active. 

Metabolism of EFAs 

EEAs are polyunsaturated fatty acids (PUFAs) since 
they contain two or more double bonds. There are at 


least four independent families of PUFAs. They 
include: 

The “03 -3” series derived from a-linolenic acid 

(ALA, 18:3, co -3). 

The “co -6” series derived from cA-linoleic acid 

(LA, 18:2, CO -6). 

The “co -9” series derived from oleic acid (OA, 18:1, 

CO -9). 

The “co -7’’ series derived from palmitoleic acid 

(PA, 16:1, 03-7). 

LA is converted to y-linolenic acid (GLA, 18:3, 
03-6) by the enzyme A° desaturase (d-6-d) and GLA 
is elongated to form dihomo-GLA (DGLA, 20:3, 03- 
6), the precursor of the 1 series of prostaglandins 
(PGs). DGLA can also be converted to arachidonic 
acid (AA, 20:4, 03-6) by the enzyme desaturase 
(d-5-d). AA forms the precursor of 2 series of PGs, 
thromboxanes (TXs) and the 4 series of leukotrienes 
(LTs) and anti-inflammatory compounds: lipoxins. 
ALA is converted to eicosapentaenoic acid 
(EPA, 20:5, 03-3) by d-6-d and d-5-d. EPA forms 
the precursor of the 3 series of PGs, TXs and the 5 
series of LTs and anti-inflammatory compounds: 
resolvins. EPA can be elongated to form 
docosahexaenoic acid (DHA, 22:6 03-3) the precur¬ 
sor of anti-inflammatory compounds: protectins. LA, 
GLA, DGLA, AA, ALA,EPA, and DHA are all 
PUFAs, but only LA and ALA are EFAs (see Fig. 1 
for metabolism of EFAs). AA and EPA also are 
converted to their respective LTs. PGs, TXs, and 
LTs are biologically active and, in general, have 
pro-inflammatory actions and play a significant role 
in atherosclerosis, bronchial asthma, inflammatory 
bowel disease, and other inflammatory conditions. In 
the present discussion, the term “PUFAs” is used to 
refer to all unsaturated fatty acids: LA, GLA, DGLA, 
AA, ALA, EPA, and DHA; and the term EFAs refers 
to LA and ALA. Although the terms EFAs and 
PUFAs are used interchangeably, it should be under¬ 
stood that all EFAs are PUFAs but all PUFAs are not 
EFAs. Many actions of EFAs are also brought about 
by PUFAs and so EFA-deflciency can be corrected 
by PUFAs, and hence, PUFAs are termed as “func¬ 
tional EFAs.” 

EFAs/PUFAs and their products PGs, LTs, and TXs 
play a significant role in collagen vascular 
diseases, hypertension, diabetes mellitus, metabolic 
syndrome, psoriasis, eczema, atopic dermatitis, coro¬ 
nary heart disease, atherosclerosis, and cancer 
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Essential Fatty Acids, 

Fig. 1 Scheme showing the 
metabolism of essential fatty 
acids and cofactors that either 
enhance or decrease the 
activity of and 
desaturases and elongases and 
the formation of various 
eicosanoids. Ethanol blocks 
both A^ and A^ desaturases. 
(+) Indicates enhancement of 
the activity of the enzyme or 
increase in the formation of 
the product; (—) Indicates 
inhibition of the activity of the 
enzyme or decrease in the 
formation of the product 
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(Das 2002, 2010, 2011a). Mechanism(s) by which 
various stimuli preferentially induce the release of 
AA, EPA, and/or DHA and convert them to their 
respective products is not clear. Since, AA, EPA, and 
DHA give rise to anti-inflammatory molecules 
lipoxins (LXs), resolvins, protectins, maresins, and 
nitrolipids and pro-inflammatory eicosanoids, it is 
likely that the balance between these mutually antag¬ 
onistic compounds could determine the final outcome 
of the disease process. Biologically active compounds 
formed due to the nitration of unsaturated fatty acids, 
that is, nitrolipids such as nitrolinoleate (formed due to 
the nitration of linoleate by nitric oxide), have been 
shown to stimulate smooth muscle relaxation, inhibit 


platelet aggregation, and suppress human neutrophil 
pro-inflammatory functions. These studies suggest that 
PUEAs and various biologically active compounds 
formed from them play a significant role in several 
diseases. The average daily intake of EE As, in general, 
is around 7-15 g/day in Europe and the USA. 

Dietary Sources of EFAs 

The main dietary sources of EE As are: 

LA: Cereals, eggs, poultry, most vegetable oils, 
whole-grain breads, baked goods, and margarine. 
Sunflower, saffola, and com oils are rich in LA. 
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Essential Fatty Acids, Fig. 2 (continued) 
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ALA: Canola oil, flaxseed oil, linseed, and rapeseed 
oils, walnuts, and leafy green vegetables. Human 
milk is rich in EFAs and GLA, DGLA, AA, EPA, 
and DHA. Olive oil is rich in OA, whereas palm and 
coconut oils contain virtually none. 

GLA: Human milk contains 0.3-1.0% of its fat as 
GLA. GLA is present in EPO at concentrations of 
7-14% of total fatty acids; in borage seed oil it is 
20-27%; and in black currant seed oil at 15-20%. 
GLA is also found in some fungal sources. 

DGLA: Moderate amounts are found in human milk, 
liver, testes, adrenals, and kidneys. 

AA: Human milk contains modest amounts and cow’s 
milk small amounts of AA. Meat, egg yolks, 
some seaweeds, and some shrimps contain substan¬ 
tial amounts. Average daily intake of AA is esti¬ 
mated to be in the region of 100-200 mg/day that 
is enough to account for the total daily production 
of various PGs, which is estimated to he about 
1 mg/day. 

Adrenic acid (22:44 (u- 6 ): The main sources of adrenic 
acid are adrenals, kidneys, testes, and hrain. 

EPA and DHA: The major source of these two fatty 
acids is marine fish. These fatty acids may he 
denatured and converted into trans-fats during 
processing and cooking that are harmful. Substan¬ 
tial fall in the intake of EPA and DHA could be one 
reason that could have contributed to the increasing 
incidence of atherosclerosis, CHD, hypertension, 
metaholic syndrome, obesity, collagen vascular 
diseases, and cancer. 


Modulators of Metabolism of EFAs 

LA, ALA, and OA are metabolized by the same set of A® 
and desaturases and elongases (see Figs. 1 and 2 for 
metabolism of EFAs). As a result, these 3 series compete 
with one another for the same set of enzymes, though the 
enzymes seem to prefer a)-3 to (B-6 and co -6 over (B-9 (cd- 
3 > ( 0-6 > ( 0 - 9 ). Thus, the presence of significant 
amounts of 20:3 co-9 suggests that there is deficiency of 
( 0-3 and ( 0-6 fatty acids and is used as an indicator of 
EFA deficiency. The activities of A® and desaturases 
are slow in humans (A^ > A°). As a result, the conver¬ 
sion of LA and ALA to their respective metabolites may 
be inadequate under certain conditions such as athero¬ 
sclerosis. The western diet is rich in ( 0-6 fatty acids 
compared to ( 0-3 fatty acids (( 0-6 to ( 0-3 ratio is 10:1), 
whereas the recommended ratio is ^1:1 (Das 2006). 

Saturated fats, cholesterol, trans-fatty acids, 
alcohol, adrenaline, and glucocorticoids inhibit A® 
and A^ desaturases (Das 2002, 2006, 2010, 2011a). 
Pyridoxine, zinc, nicotinic acid, and magnesium are 
cofactors for normal A® desaturase activity. Insulin 
activates A® desaturase whereas diabetics have 
reduced A® desaturase activity. The activity of A® 
desaturase falls with age. Oncogenic viruses and radi¬ 
ation inhibit A® desaturase activity. Total fasting, 
protein deficiency, and a glucose-rich diet reduce, 
whereas a fat-free diet and partial caloric restriction 
enhance A® desaturase activity. Activity of A® and A® 
desaturases are regulated by sterol regulatory element¬ 
binding protein-1 (SREBP-1) and peroxisome 


< - 

Essential Fatty Acids, Fig. 2 (a) Scheme showing the forma¬ 
tion of resolvin E (RvE) derived from EPA. In the endothelial 
cells, the COX-2 enzyme that has been acetylated introduces an 
18/? hydroperoxy-group into the EPA molecule (c.f. the role 
of aspirin in the biosynthesis of the epi-lipoxins). This is 
reduced to the corresponding hydroxy compound before 
a 5S-hydroperoxy group is introduced into the molecule by 
the action of 5-lipoxygenase as in the biosynthesis of leukotri- 
enes. A further reduction step produces 15S,18R-dihydroxy- 
EPE or resolvin E2. Alternatively, the 55-hydroperoxy, 
18/?-hydroxy-EPE intermediates are converted to 5,6-epoxy 
fatty acids in polymorphonuclear leukocytes in humans and 
eventually to 5S,12/?,18/?-trihydroxy-6Z,8/i,10/;,14Z,16/i-eico- 
sapentaenoic acid or resolvin El by a process similar to the 
formation of leukotrienes in leukocytes, (b) Structures of 
Resolvin D1 and D2. DHA is converted to 17R-resolvins by 
a aspirin-triggered mechanism similar to the scheme shown in 
Fig. 2a. In the absence of aspirin, COX-2 of endothelial cells 


converts DHA to 13S-hydroxy-DHA. In the presence of aspirin, 
the initial product is 17R-hydroxy-DHA, which is converted to 
7S-hydroperoxy, 17R-hydroxy-DHA by the action of 
a lipoxygenase, and thence via an epoxy intermediate to epi- 
meric resolvins D1 and D2. An alternative lipoxygenase¬ 
generated intermediate, 4S-hydroperoxy, 17R-hydroxy-DHA, 
is transformed via an epoxide to epimeric resolvins D3 and 
D4. 17S Resolvins of the D series are produced in cells in the 
absence of aspirin by a reaction catalyzed in the first step by 
a lipoxygenase, (c) Scheme showing the synthesis of 
neuroprotectin Dl. Resolvins are generated in brain tissue in 
response to aspirin treatment, and in addition docosatrienes 
termed neuroprotectins are also produced. The lipoxygenase 
product 175-hydroperoxy-DHA is converted first to a 16(17)- 
epoxide and then to the 10, 17-dihydroxy docosatriene denoted 
as 10, 175-DT or NPDl. As with the leukotrienes, there are 
three double bonds in conjugation, hence the term “triene,” 
though there are six double bonds in total 
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proliferator-activated receptor-a (PPAR-a), two recip¬ 
rocal transcription factors for fatty acid metabolism, 
and some of their (SREBP-1 and PPAR- a) lipogenic 
functions are brought about by their action on PUFAs 
(Matsuzaka et al. 2002). 

Activities of A® and desaturases are decreased in 
diabetes mellitus, hypertension, hyperlipidemia, and 
the metabolic syndrome. Trans-fats interfere with the 
metabolism of EFAs and promote inflammation, 
atherosclerosis, and coronary heart disease 
(Das 2006; Lopez-Garcia et al. 2005). Several 
PUFAs, especially EPA and DHA inhibit the produc¬ 
tion of pro-inflammatory cytokines: interleukin -6 
(IL- 6 ), tumor necrosis factor-a (TNF-a), IF-1, and 
lL-2 (Das 2006). Saturated fatty acids and cholesterol 
interfere with the metabolism of EFAs and thus, 
promote the production of pro-inflammatory cyto¬ 
kines, which explains their ability to cause atheroscle¬ 
rosis and coronary heart disease (CHD). Saturated fats, 
cholesterol, and fran^-fats reduce the formation of 
GLA, DGLA, AA, EPA, and DHA that in turn lead to 
reduced formation of prostacyclin (PGI 2 ), PGI 3 , 
lipoxins, resolvins, protectins, maresins, and 
nitrolipids, whose deficiency may initiate and acceler¬ 
ate the progression of atherosclerosis, persistence of 
low-grade systemic inflammation, CHD, and failure of 
the healing process. 

Actions of EFAs Relevant to Their Clinical 
Applications 

Cell Membrane Fluidity 

Cell membrane fluidity is determined by its lipid com¬ 
position. For example, increasing the cell membrane 
content of saturated fatty acids and cholesterol renders 
it more rigid, whereas enhancing its content of unsat¬ 
urated fatty acids makes it more fluid. A rigid cell 
membrane shows reduced number of insulin receptors 
and their affinity to insulin that, in turn, causes insulin 
resistance, while an increase in cell membrane fluidity 
increases the number of insulin receptors on the mem¬ 
brane and their affinity to insulin, thus decreasing 
insulin resistance (Das 2002, 2003, 2006, 2010). 

Availability of appropriate amounts of «)-3 and ai-6 
fatty acids and various growth factors is essential for 
the growth of the brain during the perinatal period and 
adolescence (Das 2002, 2003; Calderon and Kim 
2004). Deficiency of ®-3 EPA and DHA and ffl -6 AA 


during this critical period impairs brain growth and 
development and may reduce the number of synaptic 
connections that in turn could lead to the development 
of dementia, depression, schizophrenia, Alzheimer’s 
disease, and neurodegenerative diseases: Huntington’s 
disease, Parkinson’s disease, spinocerebellar degener¬ 
ation, etc., at a later date and may impair memory 
formation and consolidation, ro-3 PUFA dietary insuf¬ 
ficiency ablates long-term synaptic depression 
mediated by endocannabinoids in the prelimbic 
prefrontal cortex and accumbens. In ra3-deficient 
mice, presynaptic cannabinoid CB7 receptors 
(CB7Rs) normally responding to endocannabinoids 
were uncoupled from their effector Gi/o proteins. The 
dietary-induced reduction of CB7R functions in mood¬ 
controlling structures was associated with impaired 
emotional behavior (Lafourcade et al. 2011). 

EFAs/PUFAs Have Second Messenger Actions 

Several growth factors and hormones activate phos¬ 
pholipase A 2 (PLA 2 ), induce the release of DGLA, 
AA, EPA, and DHA from the cell membrane lipid 
pool that, in turn, are used for the formation of eicos- 
anoids that bring about some of their (growth factor 
and hormone) actions. For example, the tumoricidal 
action of TNF-a is dependent on its ability to induce 
PLA 2 , and inhibitors of PLA 2 inhibit its action. 
TNF-a-resistant tumor cells can be rendered sensitive 
to TNF-a by the addition of GLA, AA, EPA, and DHA. 
These fatty acids also enhance the activity of protein 
kinase C (PKC), a second messenger; activate macro¬ 
phages and polymorphonuclear leukocytes (PMNs); 
modulate ThI and Th 2 balance; and modulate free 
radical generation by these cells (reviewed in 
Das 2010, 2011a). 

EFAs/PUFAs Have Antibiotic-like Actions 

Several studies showed that LA, ALA AA, EPA, and 
DHA have antibacterial, antiviral and antiparasitic 
actions (Kodicek 1949; Kohn et al. 1980; Krugloak 
et al. 1995). Prostaglandin Ei (PGEi) and PGA derived 
from DGLA, AA, and EPA, inhibited viral replication 
(Giron 1982; Santoro et al. 1980). Based on these 
studies, it is evident that EFAs/PUFAs and some eicos- 
anoids could function as endogenous antiviral mole¬ 
cules that led to the suggestion that they could be of 
benefit in AIDS (acquired immunodeficiency 
syndrome) (Das 1985a, b). Furthermore, cell mem¬ 
branes of lymphocytes and macrophages contain 
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significant amounts of PUFAs that are released on 
exposure to bacteria, viruses, and tumor cells, which 
in turn stimulate NADPH-dependent superoxide pro¬ 
duction (Bromberg and Pick 1984) to kill the invading 
microorganisms and tumor cells (Schlager and Meltzer 
1983). It was reported that PUFAs, when given 
together with conventional antibiotics, enhance their 
antibacterial activities in experimental sepsis caused 
by multidrug-resistant Pseudomonas aeroginosa. 

PUFAs Regulate ACE and HMG-CoA Reductase 
Activities and NO Generation 

PUFAs inhibit leukocyte ACE (Kumar and Das 1997) 
and HMG-CoA reductase activities (El-Sohemy and 
Archer 1999), and enhance nitric oxide generation 
(Okuda et al. 1997). Thus, PUEAs may function as 
endogenous regulators of Ang-II (angiotensin-II) 
formation, cholesterol synthesis, and endothelial nitric 
oxide generation. In this context, it is important to note 
that plasma concentrations of PUEA and eNO are low 
in hypertension, type 1 and type 2 diabetes mellitus, 
renal diseases, rheumatoid arthritis, lupus, psoriasis, 
eczema, atopic and non-atopic dermatitis, atheroscle¬ 
rosis, insulin resistance, obesity, dementia, 
schizophrenia, bipolar disorders, Fluntington’s dis¬ 
ease, Alzheimer’s disease, peptic ulcer disease, and 
cancer (Das 2002, 2006, 2010, 201 la). 

Recent studies showed that aspirin brings about 
some of its actions by augmenting the formation of 
eNO through the generation of epi-lipoxins that are 
known to enhance NO generation (Gilroy 2005). 
Epi-lipoxins have potent anti-inflammatory actions 
and prevent the interaction between leukocytes and 
the vascular endothelium. Eurthermore, NO stimulates 
PGI 2 formation from AA (Wang and Diamond 1997), 
which is also a potent vasodilator and platelet 
anti-aggregator. Thus, a close interaction exists 
among PUEAs, lipoxins, NO, and PGI 2 . 

PUFAs Possess Anti-inflammatory Action 

PUFAs and their products such as PGEi, PGI 2 , 
lipoxins, and resolvins suppress the production of 
pro-inflammatory cytokines IL-1, IL-2, IL-6, macro¬ 
phage migration inhibitory factor (MIF), HMGBl 
(high mobility group box 1), and TNF-a (Das 2006, 
2011a; Arita et al. 2005; Dooper et al. 2003; Chen 
et al. 2005), and thus function as endogenous 
anti-inflammatory molecules. The suppressive action 
of DFIA on IE-1 (3 and TNE-a production by stimulated 


human retinal vascular endothelial cells suggests its 
role (especially via protectins) in the prevention of 
age-related macular degeneration (AMD) and diabetic 
retinopathy. PUEAs and lipoxins, resolvins, and 
protectins suppress the production of pro- 
inflammatory cytokines by increasing PPAR-y 
mRNA and protein. 

PUEAs bind to G-protein-coupled receptor, 
GPR120, which is abundantly expressed in the intes¬ 
tine. The stimulation of GPR120 by PUEAs promoted 
the secretion of GLP-1 (glucagon-like peptide-1) 
in vitro and in vivo, and increased circulating insulin 
(Hirasawa et al. 2005). In addition, GPR40, expressed 
in the pancreas, functions as a receptor for PUEAs and 
amplifies glucose-stimulated insulin secretion from 
pancreatic (3 cells by activating GPR40 (Itoh 
et al. 2003). The ability of EPA/DHA to bind to 
GPR120 is responsible for their anti-inflammatory 
effects. CO-3 feeding inhibited inflammation and 
enhanced systemic insulin sensitivity in WT 
(wild-type) mice, but was without effect in GPR120 
knockout mice, suggesting that GPR120 is a functional 
CO-3 FA receptor/sensor that mediates insulin sensitiz¬ 
ing and antidiabetic effects in vivo by repressing mac¬ 
rophage-induced tissue inflammation (Oh et al. 2010). 

IL-1, IL-6, MIL (macrophage migration inhibitory 
factor), and TNE-tx induce insulin resistance, have 
cytotoxic and neurotoxic actions, and produce 
cachexia in tuberculosis, cancer, and AIDS. EPA and 
possibly other PUEAs decrease cachexia induced by 
TNE-a in animal tumor models. PUEAs, lipoxins, 
resolvins, and protectins are likely to be of benefit to 
insulin resistance by inhibiting the production of 
TNE-a and enhancing adiponectin levels. 


E 


PUFAs and NO Interact to Form Nitrolipids 

NO interacts with PUEAs to form their respective 
nitroalkene derivatives that can be detected and esti¬ 
mated in the plasma (see Eig. 3) that are termed as 
nitrolipids, which produce vascular relaxation, inhibit 
neutrophil degranulation, superoxide formation, and 
platelet activation (Baker et al. 2005; Coles et al. 
2002 ) and thus, possess anti-inflammatory action. 
Nitrolipids show PPAR-y ligand agonist activity and 
release NO. These results suggest that nitrolipids have 
a role in low-grade systemic inflammatory conditions 
such as type 2 diabetes, hypertension, hyperlipidemias. 
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Two regioisomers of OA-NO 2 : 9- and 10-nitro-9-cis-octadecenoic acids 

Essential Fatty Acids, Fig. 3 Nitrolipids and their detection. 
Two regioisomers of OA-NO 2 : 9- and lO-nitro-9-c/.s- 
octadecenoic acids 


insulin resistance, and the metabolic syndrome. 
Nitrolipids may have a role in depression schizophre¬ 
nia, and Alzheimer’s disease, since low-grade 
systemic inflammation occurs in these conditions. 

Clinical Significance of EFAs/PUFAs and 
Their Products 

It is evident from the preceding discussion that EFA/ 
PUFAs and their products PGs, TXs, LTs, LXs, 
resolvins, protectins, maresins, and nitrolipids have 
many biological actions and thus have a potential role 
in the pathobiology of several diseases processes 
(Fig. 4). 

For example, PUFAs may participate in 
atherosclerosis by modulating the expression of 
uncoupling protein-1 (UCP-1) in the vascular tissue. 
Uncoupling proteins (UCPs) (inner mitochondrial 
membrane anion transporters) allow protons to leak 
back into the mitochondrial matrix, thereby 
decreasing the energy available for ROS generation. 
UCP-1 expression in aortic smooth muscle cells 
produced hypertension, increased high fat diet-induced 
atherosclerosis without affecting cholesterol levels, 
enhanced superoxide anion production, and decreased 
the availability of nitric oxide, suggesting that mild 
respiratory uncoupling due to mitochondrial 
dysfunction causes atherosclerosis (reviewed in Das 
2010, 2011a). FFA deficiency promotes respiratory 
uncoupling and atherosclerosis (Bernal-Mizrachi 
et al. 2005). Atherosclerosis-free aortae have abundant 
amounts of FFA linoleate, whereas fatty streaks of 
early atherosclerosis are deficient in FFAs (Klein and 
Johnson 1954; Hayashida and Portman 1960). 
Pro-inflammatory eicosanoids stimulate the formation 


and the activity of adhesion molecules (integrins) and 
thus promote atherosclerosis and inflammation, 
whereas GLA, DGLA, FPA, and DHA, and LXs, 
resolvins, and protectins suppress the expression of 
adhesion molecules, and thus could inhibit 
atherosclerosis. 

LTB 4 , a 5-lipoxygenase (5-LO) product derived 
from AA, is a potent leukocyte chemoattractant, 
a pro-inflammatory molecule, and participates in 
atherosclerosis through its receptors BLT-1 and 
BLT-2. LTB 4 promotes atherosclerosis by chemo- 
attracting monocytes, and enhancing the formation of 
foam cells. The expression of 5-LO is elevated in 
symptomatic compared with asymptomatic atheroma¬ 
tous plaques, suggesting that LTB 4 has an important 
role in atherosclerosis. These results indicate that the 
balance between pro-inflammatory and anti¬ 
inflammatory bioactive lipids (such as LXs vs LTs) at 
the site of atherosclerosis-prone area of the vasculature 
ultimately determines the initiation, progression, and 
degree of atherosclerosis. This balance between anti¬ 
inflammatory LXs, resolvins, protectins, maresins, and 
nitrolipids versus pro-inflammatory PGs, TXs, and 
LTs may also determine the production of both pro- 
and anti-inflammatory cytokines depending on the 
degree of inflammation, and its resolution at the sites 
of injury. Plasma cholesterol, glucose, trans-fats, 
ethanol, homocysteine, vitamins, and minerals are 
some of the dietary and endogenous factors that may 
have a regulatory role in the synthesis of PGs, LTs, 
TXs, LXs, resolvins, protectins, maresins, and 
nitrolipids, and thus in the pathophysiology of diseases 
associated with them. 

For instance, previously it was shown that AA, 
FPA, and DHA prevent chemical-induced pancreatic 
P cell damage and thus abrogate development of dia¬ 
betes mellitus. Both CO and LO inhibitors did not 
block this beneficial action of these PUFAs 
(Suresh and Das 2003a, b). This suggests that, possi¬ 
bly, enhanced formation of lipoxins, resolvins, 
protectins, maresins, and nitrolipids from AA, FPA, 
and DHA may have been responsible for the P-cell 
protective action seen with these fatty acids. Thus, 
lipoxin, resolvins, protectins, maresins, and nitrolipids 
may function as endogenous antidiabetic molecules. 
Similar beneficial action of these bioactive lipids may 
also prevent other low-grade systemic inflammatory 
conditions such as obesity, hypertension, metabolic 
syndrome, Alzheimer’s disease, asthma, atopic 
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Fig. 4 Scheme showing the 
relationship among various 
mediators of tissue damage/ 
resolution and clinical 
conditions and the role of 
PUFAs and their metabolites 
in these processes 



eczema, coronary heart disease, lupus, rheumatoid 
arthritis, stroke, psoriasis, cancer, Huntington’s dis¬ 
ease, depression, schizophrenia, polycystic ovarian 
syndrome, non-alcoholic fatty liver disease, and fetal- 
alcohol syndrome (see Fig. 4). In this context, it is 
important to note that PUFAs, PGs, LTs, TXs, LXs, 
resolvins, and protectins influence stem cell survival, 
proliferation and differentiation (reviewed in Das 
1982, 2011a), and also bind to DNA, regulate gene 
expression and micro-RNAs (Recchiuti et al. 2011), 
a potential mechanism(s) for their involvement in 
various cellular processes and diseases. 

Since several biologically active molecules are 
formed from PUFAs, it is important to know the 
molecular triggers that facilitate the specific path¬ 
ways in the formation of various bioactive lipids. If 
methods to selectively enhance the synthesis of 
desirable molecules such as LXs, resolvins. 


protectins, maresins, and nitrolipids are developed, 
then it may be possible to suppress inappropriate 
inflammation. 
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EVB Methods 

EXSY (Exchange Spectroscopy) 

► Empirical Valence Bond Methods 

► ZZ-Exchange 

Excitatory Amino Acid Transporter 

Extein 

► Glutamate Transporter Eamily 

► Protein Splicing - Computational Studies 


Excretion 


► Glycosylation in Metabolism 

Extended X-Ray Absorption Fine 
Structure (EXAFS) 

Exo- and Endo-anomeric Effects 

► X-Ray Absorption Spectroscopy of Metals in 
Biology 


► Anomeric Effect in Sugars 


Exoglycan 

External Protein 
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► Bacterial Polysaccharide Structure and Biosynthesis 


Exopolysaccharide 

Extracellular Matrix Components 

► Bacterial Polysaccharide Structure and Biosynthesis 

► Proteoglycans 

Expressed Protein Ligation 

Extracellular Polysaccharide 


► Protein Segmental Labeling 


► Bacterial Polysaccharide Structure and Biosynthesis 
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Introduction 

ATP synthase, termed as EqEi-ATP synthase, exists 
in thylakoid membrane, mitochondrial inner mem¬ 
brane, and bacterial membrane. It consists of 
a combination of a water soluble Fj part with 
a membrane embedded Fq part (Fig. 1). This enzyme 
catalyzes adenosine triphosphate (ATP) synthesis 
reaction from adenosine diphosphate (ADP) and inor¬ 
ganic phosphate (Pi) using the proton flow across the 
membrane. Both Fj and Fq are rotary motors, but 
driving forces are different: rotation of Fi is driven 
by the free energy of ATP hydrolysis (Fig. 1, lower 
right), and that of Fq is driven by the electrochemical 
potential of proton (the proton motive force) across the 
membrane (Fig. 1, upper right). In bacteria, Fj is com¬ 
posed of a3p3Y5E subunits. The minimum complex 
that functions as a motor is a 3 P 3 y subcomplex, a and 


P subunits have non-catalytic and catalytic 
ATP-binding sites near at the a/p interface, respec¬ 
tively. They form a hexamer ring in alternating 
arrangement as a stator and y subunits rotates in the 
a 3 P 3 hexamer ring upon ATP hydrolysis in 
a counterclockwise direction from the membrane side 
of view (Fig. 1, lower right) (Yoshida et al. 2001; 
Junge et al. 2009). Fq part is composed of ab 2 Cio.^i 5 
subunits. The c subunits align in a circle to form a ring 
and a proton channel is formed at the interface between 
the c-ring and a subunit. This c-ring rotates against the 
stator ab 2 subunits as protons flow downhill through 
the proton channel (Fig. 1, upper right) (Duser et al. 
2009). In FqFi complex, Fq and Fj are connected 
through central (y, e and c subunits) and peripheral 
(§ and b subunits) stalks, and the torques generated by 
Fq and Fj motors are transmitted through central stalk 
(Fig. 1, left). When the electrochemical potential of 
proton is enough to overcome the free energy of the 
ATP hydrolysis, as in the physiological condition, Fq 
forcibly rotates Fj in the opposite direction and this 
results in ATP synthesis. In contrast, when the electro¬ 
chemical potential Is low, Fi forcibly rotates Fq by 
ATP hydrolysis and Fq pumps protons against electro¬ 
chemical potential. 

Structure of Fi-ATPase 

As X-ray crystallographic studies have revealed the 
atomic level structures of a 3 P 3 y subcomplex (hereafter 
referred to as Fi), they have gave insights into its 
rotary mechanism. For example, in the first crystal 
structure of Fi from bovine heart mitochondria, each 
catalytic P subunit showed different conformational 
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F|-ATPase: Fundamental Properties and Structure, 
Fig. 1 Fq and Fi motors of ATP synthase. Schematic images 
of FqFi-ATP synthase. The rotor and stator parts are shown in 
red and blue, respectively. The subunit composition of bacterial 
Fo is ab 2 C|o_i 5 (the number of the c subunits varies from 10 to 15 
in different species). Fq is embedded in the cell membranes and 
rotates the c-ring against ab 2 stator driven by passive proton 
translocation along the proton electrochemical potential 


comprising the proton concentration (ApFl) and membrane volt¬ 
age (AT*) across the membrane. Bacterial Fi is composed of the 
“sPsYSe is an ATP-driven rotary motor in which the y rotates 
against the a 3 P 3 cylinder. The s subunit binds to the protruding 
part of the y subunit. The 5 binds to the bottom of the a 3 P 3 ring 
(Note that the rotational direction of Fq is opposite to that of F,). 
In the whole complex of FqF,, Fq reverses the rotation of Fi, 
leading to ATP synthesis from ADP and P; 


State depending on the bound substrate: one bound 
adenosine-5'-(P,y-imino)-triphosphate (AMP-PNP), 
which is an ATP analogue, (Ptp), second bound ADP 
(Pop), and third binds no substrate (Pempty) (Fig- 2) 
(Leslie and Walker 2000). This conformational differ¬ 
ence of each P subunit is consistent with the rotational 
catalysis which was proposed by P. Boyer (1993), i.e., 
each catalytic site is in a different reaction state and 
alternates their reaction states cooperatively upon the 
rotation of the y subunit. The catalytic model has been 
featured in many text books. Furthermore, the crystal 
structure suggested that the rotor y subunit rotates 
counterclockwise, as viewed from the membrane 
side, upon ATP hydrolysis; when each P converts the 
conformational state from Pempty to Ptp and then to 
Pdp, the conformational state of Fi rotates in the coun¬ 
terclockwise direction accompanying the rotation of 
the y subunit. This prediction was confirmed later by 
the single-molecule rotation experiment (Noji et al. 
1997). 


Proposed Torque Generation Mechanism 
from the Crystal Structure 

The crystal structure also provides insights into the 
torque generation mechanism of Fj. The conforma¬ 
tions of p subunits are different depending on the 
presence of nucleotide. In Pempty, C-terminal domain 
rotates away from the binding site to allow the cleft of 
the catalytic site to open (Fig. 3, left-top). On the other 
hand, in Pxp and Pop, catalytic sites are closed to warp 
the bound nucleotide tightly (Fig. 3, center and right- 
top). Therefore, it is anticipated that large conforma¬ 
tional change upon the ATP binding and ADP release 
contributes the torque generation. When the conforma¬ 
tions of Ptp and Pop are compared, there is no much 
difference in the global conformations, except at the 
interface that forms the catalytic site with the adjoining 
a subunit. Although there is also no obvious difference 
in the global conformations of the a subunits, the 
position of the tx subunit which forms the catalytic 
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F|-ATPase: Fundamental Properties and Structure, 
Fig. 2 Crystal structure of the sub-complex of Fj The 
crystal structure of F, from bovine mitochondria (pdb code; 
IBMF). a, P and y subunits are shown in yellow, green, and 
red, respectively. The left figure is viewed from the top (Fq side), 
the right is from the side. The catalytic sites are located at the ot-P 


Side 



interface indicated by red circles, mainly on the P subunit. Each 
carries AMP-PNP, ADP, or none, and is designated as Ptp, Pop, 
or Pumpty The other interfaces are non-catalytic sites (blue 
circles), all of which bind with AMP-PNP. Each a forming 
catalytic site with the P is designated as otxp, otop. and “Empty, 
respectively 



F|-ATPase: Fundamental Properties and Structure, 
Fig. 3 Conformational states of the P subunit and the 
catalytic a-f interface. Three a-P pairs with the y subunit 
are shown in yellow and green with the central y 
subunit (red). The a and P subunits are composed of the 
N-terminal domain, nucleotide-binding domain, and the 
C-terminal domain (from bottom to top). Pumpty has an open 


conformation in which the ot-helical C-terminal domain 
rotates upward to open the cleft of the nucleotide-binding 
pocket. Both Pxp and Pup have a closed conformation 
entrapping the nucleotide within the closed pocket. Lower 
row shows the catalytic a-P interfaces. The a-P pairs com¬ 
prising the catalytic sites occupied with none, an AMP-PNP, 
or ADP are shown with the y subunit 
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F|-ATPase: Fundamental Properties and Structure, 
Fig. 4 Single-molecule rotation assay of Fi. (a) A schematic 
image of the experimental setup. The 03 P 3 ring is fixed on the 
glass surface to suppress the translational and rotational Brownian 


motions of the Fi molecule. A large probe (40-nm gold bead here) 
is attached onto the y subunit to visualize the rotary motion under 
optical microscope, (b) Time-courses of the rotation with probes 
of several sizes under ATP-saturating condition (2 mM ATP) 


site with Pop shifts toward pop, as compared with that 
of Ptp, to occlude the bound nucleotide more tightly 
(Fig. 3, bottom). This shift of the a subunit causes the 
Arg-finger, a well-conserved arginine residue of the a, 
to move toward the y-phosphate of the bound ATP by 
a few As. Biochemical studies and theoretical calcula¬ 
tions suggested that the Arg-hnger enhances the ATP 
hydrolysis step by stabilizing the transition state 
(Senior et al. 2000; Dittrich et al. 2003). 

Single-Molecule Rotation Assay of Fi -ATPase 

While crystal structures have revealed the snapshot of 
the atomic detail of Fi, single-molecule studies have 
revealed the kinetics and dynamics of the active mol¬ 
ecule from the rotation. In single-molecule rotation 
assay, Fi is immobilized on the glass to suppress 
Brownian motion in water and a large probe is attached 
onto the y subunit to visualize the rotation. Fluores- 
cently labeled actin hlaments, polystyrene beads, gold 
colloidal beads, gold nanorods, or magnetic beads are 
used as probes in order to magnify the small turning 
radius of the y subunit (approximately 1 nm). Figure 4 
shows the examples of rotations when using several 
sizes of probe. Due to the hydrodynamic friction 
against the probe, the rotational velocity depends on 
the probe size, i.e., Fj rotates slowly (rapidly) with 
large (small) probe (lino and Noji 2009). Irrespective 
of the probe size, the torque generated by Fi is calcu¬ 
lated to be constant and approximately 40 pN nm from 
the angular velocity and drag coefficient. In the case of 


Fi from thermophilic Bacillus PS3 that is the best 
characterized Fj. Larger torque, ^60 pNnm was 
reported for Fi from Eschericia coli (Junge et al. 
2009). 

Stepping Rotation 

The mechanical rotation of Fi is coupled with the 
elementary reaction steps of ATP hydrolysis, such as 
ATP binding, ATP hydrolysis, and product release. 
In order to clarify how the rotation is coupled with 
the elementary catalytic steps, many attempts have 
been made to resolve rotary motion into discrete 
steps. The ATP hydrolysis (or rotation) rate of Fi 
obeys the Michaelis-Menten equation. At low ATP 
concentration ([ATP]), the rate-limiting step is the 
ATP binding and the rate becomes proportional to 
[ATP]. On the other hand, at high [ATP], the rate is 
independent on [ATP] and ATP hydrolysis or product 
release steps become rate limiting. Under the ATP 
limiting condition (<1 pM), Fi shows three pauses 
that separated with 120° in a single turn due to slow 
ATP binding (Fig. 5a). The dwell time of the pause is 
inversely proportional to the [ATP], and statistical 
analysis of the dwell time indicates that the each step 
is triggered by binding of single ATP (lino and Noji 
2009). Furthermore, a mutant Fi (pE190D mutant of Fi 
derived from thermophilic Bacillus PS3) that hydro¬ 
lyzes ATP very slowly also shows three pauses that are 
separated by 120° even at high [ATP] (Fig. 5b). When 
[ATP] is decreased, the mutant shows six pauses that 
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F|-ATPase: Fundamental Properties and Structure, 
Fig. 5 Pauses and steps in the rotation of Fj. (a) Rotation of 
Fi-ATPase under ATP binding rate-limiting condition, at 60 nM 
ATP. Inset shows the trajectory of centroid of the probe, 
(b) Rotation of mutant F|-ATPase, f5(E190D), at 2 mM ATP. 
Hydrolysis rate of this mutant is so slow that three pauses to 
complete hydrolysis reaction are visible. Trajectory of rotation is 


shown in inset, (c) Rotation of mutant F|-ATPase, P(E190D), at 
2 pM ATP. Under this condition, 120° step is divided into 80° 
and 40° sub-steps. The pauses before 80° and 40° sub-steps 
correspond to ATP binding and ATP catalytic dwelling confor¬ 
mational states, respectively. Arrows indicate the positions 
where 40° sub-step is observed 


separated with 80° and 40° (Fig. 5c). The 80° and 40° 
sub-steps is also resolved in wild-type Fi by high¬ 
speed imaging (Kinosita et al. 2004). Kinetic analyses 
of the dwell time of the pauses show that the 80° and 
40° sub-steps are triggered by ATP binding and two 
consecutive reactions each of which has a time con¬ 
stant of 1.3 ms and 0.1 ms, respectively. One of latter 
reactions has been identified as the ATP hydrolysis 
reaction (lino and Noji 2009). Furthermore, the posi¬ 
tions of product release were investigated by adding an 
excess amount of ADP or Pi in the solution (Adachi 
et al. 2007; Watanabe et al. 2008). When adding ADP, 
the rotation of Fj is slowed due to lengthening the 
dwell time at ATP-binding angle. In contrast, in the 
presence of Pi, Fi remains in the catalytic position and 
that leads to a long pause at the catalytic angle. These 
results indicate that ADP and Pi releases occur at ATP 
binding and catalytic angles, respectively. Fi also 
shows temperature sensitive rate-limiting step at ATP 
binding angle below ^4°C, which is thought to be 
a conformational rearrangement of (3 before or after 
ATP binding and not directly involved in chemical 
step (Watanabe et al. 2008). 

Mechanochemical Coupling Scheme 

More detailed scheme of the mechanochemical cou¬ 
pling between mechanical rotation and the elementary 
steps of the ATP hydrolysis reaction has been revealed 


(Fig. 6). The study using a hybrid Fi which has a single 
copy of pl90D with a slow hydrolysis rate identified 
the angular position of ATP hydrolysis is 200° from the 
ATP binding at 0° (Ariga 2008). Using fluorescently 
labeled ATP as a substrate, it has been shown that the 
P subunit keeps ADP bound after hydrolysis until 240° 
(Nishizaka et al. 2004; Adachi et al. 2007). Very 
recently, the timing of Pi release has been determined 
to be at 320° (Watanabe et al. 2010). Each P subunit is 
reset by a single turnover of ATP hydrolysis through 
one revolution of the y rotation; in summary, it begins 
from ATP binding at 0°, hydrolyzes the ATP at 200°, 
releases the produced ADP at 240°, and, then, releases 
Pi at 320°. A correlation between the conformational 
state of Fi in crystal structure and in the single¬ 
molecule experiment has been also established 
(Okuno et al. 2010). The crystal structures resolved 
so far correspond basically to the conformational state 
at the catalytic angle, and pjp corresponds to the 80° 
state: Pop, to the 200° state, and Pempty> to the 320° 
state (Fig. 6, dark green circle). 

High Reversibility of Mechanochemical 
Coupling 

Although the essential properties and basic mechano¬ 
chemical coupling scheme of Fj as ATP-driven motor 
have been established as described above, Fi physio¬ 
logically functions as a catalyst for ATP synthesis 
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F|-ATPase: Fundamental 
Properties and Structure, 
Fig. 6 Mechanochemical 
coupling scheme ofFj. Each 
circle indicates the chemical 
states of the catalytic sites. 
One catalytic site is 
highlighted in green. The 
arrow (red) represents the 
angular position of the y 
subunit. Each catalytic site 
retains the bound nucleotide as 
ATP until the y rotates 200° 
from the binding angle of the 
highlighted catalytic site (0°). 
After 200° rotation, the 
catalytic site executes ATP 
hydrolysis into ADP and P;, 
each of which is released at 
240° and 320°, respectively 


0 ° 



reaction. However, the mechanism of ATP synthesis 
has not been sufficiently studied at the single-molecule 
level. If ATP synthesis is a simple reverse version of 
hydrolysis, ATP should be efficiently synthesized by 
forcible rotation of Fi in the reverse direction. To 
verify this, two single-molecule experiments have 
been conducted. In the first experiment (Kinosita 
et al. 2004), many Fj molecules enclosed in an obser¬ 
vation chamber were forced to rotate in the reverse 
direction using the combination of magnetic bead and 
magnetic tweezers. The synthesized ATP was detected 
as bioluminescence of the luciferin-luciferase system. 
Although ATP synthesis upon reverse rotation was 
clearly proved, quantitative estimation of the mecha¬ 
nochemical coupling ratio was not allowed because of 
the uncertainty of the number of active Fi molecules in 
the chamber. In the following experiment, a single 
active Fi molecule was focused on to achieve the 
quantitative estimation (lino and Noji 2009). To mea¬ 
sure small amount of ATP which is synthesized by 
a single Fj molecule, a single Fj molecule was 
enclosed in the micron-sized chamber in order to accu¬ 
mulate a small amount of synthesized ATP to 
a detectable concentration (Fig. 7). In this experiment. 


although a 3 P 3 y subcomplexes synthesized very few 
ATPs by reverse rotation (Fig. 7c, left), the efficiency 
of ATP synthesis was improved up to ^80% (2.4 ATP 
molecules per turn) when a 3 P 3 yE subcomplex was 
used (Fig. 7c, right). It is likely that the e subunit 
stabilizes the conformation of the protruding portion 
of the y subunit, and then the torque applied on the 
protruding part of the y is efficiently transmitted to 
the catalytic P subunit. This result implies that in 
the whole FqFi complex, the efficiency of the mecha¬ 
nochemical coupling in ATP synthesis is also high. 
High reversibility of mechanochemical coupling is 
a remarkable feature of the ATP synthase because 
other motor proteins such as kinesin and myosin can¬ 
not synthesize ATP when the movements are reversed 
by external force. 

Regulation of ATP Hydrolysis Activity 

ATP synthase synthesizes ATP in the cell, but it can 
also hydrolyze ATP. However, ATP consumption by 
ATP synthase in the cell is physiologically unfavor¬ 
able. To prevent ATP consumption, Fi has the several 
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F|-ATPase: Fundamental 
Properties and Structure, 
Fig. 7 Single-molecule 
measurement of ATP synthesis 
hy F], (a) Schematic drawing 
of experimental setup. 

(b) Experimental procedure of 
ATP synthesis. Active single 
Fi is enclosed in a femtoliter 
chamber (left). A magnetic 
bead attached to y is forcibly 
rotated by magnetic tweezers 
(center). Newly synthesized 
ATP is accumulated in the 
chamber. The number of 
synthesized ATP molecules is 
estimated from increments in 
the ATP-driven rotational 
speed (right), (c) ATP 
synthesis by reversing a 3 P 3 y 
(left) and a 3 P 3 ys (right). Each 
trace is derived from 
individual Fi molecules. 
Dotted lines indicate slopes of 
the coupling ratio of 0% 

(0 ATP/tum), 50% (1.5 ATP/ 
turn), and 100% (3 ATP/turn), 
respectively 
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Coverglass 



Low [ATP] 
Mag. field OFF 


Low [ATP] 
Mag. field ON 


High [ATP] 
Mag. field OFF 



Forced rotations Forced rotations 


mechanisms to suppress the ATPase activity, as fol¬ 
lows (Fig. 8). One of the most common mechanisms is 
ADP inhibition. During turnover of ATP hydrolysis, 
Fi often lapses into ADP-inhibited state in which ADP 
is tightly bound to the catalytic site and results in 
cessation of the catalytic reaction cycle (Fig. 8a). The 
ADP-inhibited form is not observed in the ATP syn¬ 
thesis condition. Another regulation mechanism that is 
conserved in many species is inhibition by the s 
subunit (Feniouk et al. 2006). The e subunit is one of 
the rotor subunit of bacterial and chloroplast Fi and is 
well known as an endogenous inhibitor of Fi motor. 


In mammalian ATP synthase, homologue of this 
subunit is termed as 5, and it is unclear whether this 
subunit suppresses the ATPase activity of Fi (Feniouk 
et al. 2006). The s has two distinct domains, the 
N-terminal p-barrel domain and the C-terminal 
helical domain, and it employs “hairpin-folded” or 
“extended” conformations in which the two a-helices 
of the C-terminal domain are bundled or extended. 
While the s subunit does not perturb the ATPase activ¬ 
ity in the former state, in the latter form, the helices 
extend toward the rotor-stator interfaces and 
inhibit the y rotation by the steric hindrance (Fig. 8b). 
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F|-ATPase: Fundamental Properties and Structure, 
Fig. 8 Regulation of ATP hydrolysis activity ofFj. In all figure, 
the ot, p, and y subunit are shown in yellow, green, and red. 
(a) ADP inhibition. ADP is accidentally not released and 
Fi cannot rotate by deviation from the catalysis cycle, (b) s 
inhibition. The extended (active) and haiipin-folded (inactive) 
conformational states of s (blue) is shown, (c) Redox regulation. 


In chloroplast Fi, ATP hydrolysis activity is regulated hy the 
redox state of two cysteine residues in the y; while it is suppressed 
upon oxidation, it becomes active upon reduction (left), (d) Inhi¬ 
bition by IF]. Left and right figures show Fj-IF] complex from 
side (left) and the membrane side of view (right), a helix of IFi 
(cyan) binds to the C-terminal domains of the ttop and Pop inter¬ 
face of the catalytic site and interacts with the y subunit 


Because the hairpin-folded form is stabilized upon 
ATP binding, the e subunit is considered to be the 
ATP-sensing regulator of Fi. Chloroplast Fj employs 
a specific regulation mechanism based on the redox 
potential (Flisabori et al. 2002). For photosynthesizing 
cells, it is necessary to completely block the ATP con¬ 
sumption by Fi when photosynthetic electron transport 
system is inactive during the night. To attain the full 
suppression of the ATPase of Fi, chloroplast Fj has the 
specihc sequence inserted into the y subunit, which has 
two cysteine residues. When photosynthesis is inactive 
and the redox potential is low, the two cysteines are 
oxidized to form disulfide bond. This cross-linkage 
causes some conformational change in the y subunit 
and inactivates the ATPase activity of Fi (Fig. 8c). 
In mitochondrial ATP synthase, a natural small protein. 


IFi, acts as an inhibitor. The crystal structure of Fi with 
IFi reveals that the a helix of IFi binds to the C-terminal 
domains of the a^p and Pop interface of the catalytic site 
and penetrates into the core through the groove made by 
Qtop and Pop and interacts with the y subunit (Fig. 8d) 
(Gledhill and Walker 2006). Thus, IFi sterically pre¬ 
vents the conformational change associated with the 
rotary catalysis. 

Summary 

Fi rotates the rotor y subunit upon ATP hydrolysis. 
The rotation rate is determined by the rate-limiting 
steps of the ATP hydrolysis, i.e., Fi is a stepping 
motor waiting for the completion the elementary 
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reaction steps. The cooperative conformational 
changes of three P subunits convert the elementary 
reaction steps into the mechanical rotation. The equi¬ 
librium of the ATP hydrolysis/synthesis reaction is 
modulated by the rotational direction of the y, and 
Fi catalyzes highly efficient ATP synthesis upon forc¬ 
ible reverse rotation. A great deal of information about 
rotation mechanism of Fi has been elucidated, but 
there is still much work to be done to fully elucidate 
its secrets, such as the mechanism of highly reversible 
mechanochemical coupling. 

Cross-References 

► ATPase: Overview 

► Magnetic Tweezers 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Vacuolar-Type ATPases in Animal and Plant Cells 
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Far UV Protein Circular Dichroism 
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Synonyms 

Peptide Backbone - Amide Backbone; Protein back¬ 
bone circular dichroism 

Definition 

Protein circular dichroism (CD) spectroscopy is 
usually divided into (1) far UV or backbone (meaning 
the amide transitions. Fig. 1) with data collected from 
^ 190 to 250 nm and (2) near UV or aromatic with data 
collected from 250 to 300 nm. The practical reason for 
the division is that the absorbance magnitudes of the 
two regions for a typical protein differ by ^2 orders 
of magnitude. In addition, the far UV CD spectrum of 
a protein contains information about the asymmetric 
features of the backbone of proteins whereas the near 
UV depends on the orientations and environments of 
the side chains. The challenge is to extract the struc¬ 
tural information. The most common reason for 
collecting protein CD data is to assign secondary struc¬ 
ture content by expressing the spectra as a combination 
of standard spectra which are then deconvoluted to 
give the percentages of a limited number of well- 
defined backbone geometries. This entry outlines the 
CD spectra expected for different secondary structures. 

Basic Characteristics 

Far UV CD Data Collection for Proteins 

One routinely collects protein backbone CD for 
a 0.1 mg/mL protein solution in a 1-mm path length 
cuvette. If the concentrations of the sample of interest 
differ from these, then the path length can be adjusted 
to compensate according to the Beer-Lambert Law. 
The acquired spectrum of a peptide or protein that is 
composed only of a single secondary structure motif 
might be expected to have the shapes illustrated 
in Fig. 2. In contrast to the situation for DNA, the 


spectrum of a protein that contains a mixture of 
such structures has been found to be to a reasonable 
approximation the sum of the component parts, 
thus enabling the extensive applications of protein 
structure htting. 

Protein Backbone CD 

The remainder of this entry describes in more detail 
what CD signatures are expected for the different 
secondary structures. 

a-Helix CD 

The Qc-helix is a dominant secondary structure in many 
proteins and on average accounts for about one third of 
the residues in globular proteins. It is a well-defined 
motif and is also adopted by many polypeptides, partic¬ 
ularly ones inserted into membrane. The a-helix CD is 
larger in magnitude than that due to other motifs and has 
a distinctive spectral form in the backbone region of the 
spectrum; it is apparent upon the most casual inspection 
of a spectrum. The CD spectra of a-helices have two 
negative peaks of similar magnitude at 222 nm (the n —> 
71* transition) and 208 nm (part of the hrst n ^ n* 
transition) and a larger positive peak at about 192 nm 
(the other part of the first n ^ n* transition). Some 
examples are given in Fig. 3. The relative magnitudes 
of the 222 nm and 208 nm peaks may vary, particularly 
as a function of the hydrophobicity of the environment 
which is increased in a membrane or by proteins 
dimerizing or folding and protecting their surfaces 
from the solvent (resulting in a relative increase of the 
222 nm band). A 222 nm:208 nm intensity ratio of about 
0.8 has been proposed for a single-stranded a-helix and 
a ratio of about 1.0 for a two-stranded coiled-coil of a- 
helices (Cooper and Woody 1990). The actual 
ratio depends on H-bonding and the environment 
of the helix. 

a-helical content can be estimated approximately 
by assuming that a 100% helical protein has a mean 
residue signal of As ^ 12 moU* cm“* dm (Johnson 
1999; Anderluh et al. 2004). The CD of the aurein 2.5 
frog peptide in Fig. 3b therefore shows that the 
peptide is about 64% folded into a helix, whereas the 
myoglobin CD spectrum of Fig. 3a correctly suggests 
that myoglobin is approximately 70% a-helical. 
The various empirical CD htting programs that 
are available are fairly successful in determining 
the percentage of a-helical content of a protein - 
particularly if the sample concentration is known 
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Far UV Protein Circular 
Dichroism, 

Fig. 1 The peptide 
chromophore and the n and rr 
orbitals with predominant 
movement of electron 
density for the (a) n ^ n* and 

(b) 71 —► 71* transitions 
indicated by the arrows. The 
net transition moments in 

(c) an a-helix, (d) P-sheet, and 
(e) polyproline II helix are 
illustrated (Norden et al. 2010 
reproduced with permission) 
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Far UV Protein Circular Dichroism, Fig. 2 Approximate 
shapes of CD spectra expected for standard protein motifs 
(e.g., Berova et al. (in press); Rodger and Norden 1997; 
Norden et al. 2010; Wallace and Janes 2009) 


accurately. It should, however, he noted that the mag¬ 
nitude of the CD signal does vary with variations in the 
helix structure and has been concluded to depend on 
helix length. Thus, the estimates for peptides are to be 
viewed with caution. The length dependence of the 


a-helix CD is unsurprising: the signal behaves as if 
it were for a helix approximately four residues shorter 
than it in fact is; this corresponds to the number of 
unanchored hydrogen-bonding groups. 

3io-Helix CD 

The 3 10 helix has a CD spectrum broadly similar to that 
of an a-helix, but the 222-nm band is much smaller 
than the 208-nm band and the positive 192-nm band is 
also smaller than for the a-helix. 

P-Sheet CD 

The spectroscopic characterization of P-sheets in pro¬ 
teins has proved more difficult than that of a-helices 
due to the practical reason that they are less soluble in 
solvents with a good UV transmission and due to the 
intrinsic reason that they are generally structurally 
(and hence spectroscopically) less well-defined: 
P-sheets may be parallel or antiparallel and of varying 
lengths and widths. Furthermore, an extended P-sheet 
is usually found to show a marked twist, rather than to 
be planar. Such tertiary structures also influence the 
overall CD spectrum. 

The general characteristics of P-sheet CD may be 
taken to be a negative band between 215 and 219 nm 
and a larger positive band between 195 and 202 nm. 
The peptide and antibody spectra of Fig. 4 are typical 
examples. This level of characterization of the spec¬ 
trum might be deemed to be not much worse than that 
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Far UV Protein Circular Dichroism, Fig. 3 CD of various 
predominantly a-helical proteins or peptides, (a) CD of the 73% 
helical protein myoglobin in water. As is per residue, (b) CD of 
a-helical aurein 2.5 inserted into a lipid bilayer composed 


of 50:50 DPPC (dipalmitoylphosphatidylcholine) and DPPG 
(dipalmitoylphosphatidylglycerol). (c) CD of the 29 residue 
helical peptide melittin from bee venom in aqueous DPPC 
(Norden et al. 2010) 



Far UV Protein Circular Dichroism, Fig. 4 P-sheet CD spectra of (a) a GLRILLLKV in a solution containing soya bean lipid 
liposomes and (b) a monoclonal antibody 
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Far UV Protein Circular Dichroism, Fig. 5 CD spectra of 
some peptide bonds illustrating the variation in positions of 
the ~200-nm negative band of the polyproline II conformation 
(Gokce et al. 2005) 


of a-helices were it not for the fact that the “random 
coil” spectral features (see below) have their maxima 
at similar wavelengths and are of opposite sign from 
those of the P-sheet. This means that an empirical 
fitting program may incorrectly weight these spectra 
(and the P-turn components) in an attempt to better 
account for the wavelength and magnitude variations 
that do occur. Further, some P-sheet structures 
(referred to as Pn) have spectra that closely resemble 
that of a random coil (see below). 

P-Turn CD 

The label p-turn is usually used to include all possible 
turns that occur, not simply the ones that enable 
a single p-strand to become an antiparallel P-sheet. 
About one quarter of the residues in globular proteins 
then fall into this structural group. Despite this range 
of structures, a “typical” P-turn CD spectrum has 
been identihed which has a weak red-shifted negative 
« —> 71* band near 225 nm, a positive n* transition 
between 200 and 205 nm, and a negative band between 
180 and 190 nm. 

Random Coil CD and Polyproline II Helix 

Truly disordered or unfolded proteins have a strong 

negative CD signal at ^200 nm, may have a small 



Wavelength (nm) 

Far UV Protein Circular Dichroism, Fig. 6 CD of a 27 residue 
peptide in different solvents (Miguel et al. 2003) 


positive band at about 218 nm in many systems, 
and perhaps a very weak negative band at 235 nm. 
Unfortunately for the use of CD as a means of 
identifying unfolded proteins, the very well-dehned 
left-handed polyproline II helix of collagen is indis¬ 
tinguishable from the general description of a random 
coil spectrum as is the Pu conformation. This similarity 
suggests that the local arrangement of transition 
moments is similar in the two geometries (Anderluh 
et al. 2004) and that the simplest peptide bonds have a 
preferred conformation that is essentially that of 
the polyproline II helix. So in the absence of other 
well-dehned secondary structures such as the a-helix, 
polyproline II conformations dominate the local 
structure and hence the CD spectra of peptide 
chains (Fig. 5). 

In the context of protein secondary structure htting, 
proteins typically have 30-40% of their residues not 
htting into the well-dehned structural categories. 
These have traditionally been referred to as random 
coil, despite the fact that there may be a range of 
ordered structures included in this “residual”. 

Protein CD as a Function of Structure and 
Environment 

Many proteins change their structure as a function of 
environment and this is rehected in their CD spectrum. 
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Far-Field Optical Nanoscopy 


Solvent effects on short peptides can be particularly 
dramatic as shown in Fig. 6. In this example, a change 
from approximately 50% a-helix (in TFE) to random 
coil (in water) is observed. 


Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Circular Dichroism Spectroscopy: Units 

► DNA-Ligand Circular Dichroism 

► Near UV Protein CD 

► Nucleic Acid Circular Dichroism 

► Oriented Circular Dichroism Spectroscopy 

► Protein Circular Dichroism Analysis 
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Farnesyltransferase - Computational 
Studies 

Sergio Eilipe Sousa, Pedro A. Eemandes and 
Maria Joao Ramos 

Department of Chemistry and Biochemistry, The 
University of Porto, Porto, Portugal 

Synonyms 

CAAX farnesyltransferase 

Introduction 

Eamesyltransferase (FTase) is a Zinc enzyme that 
consists of a 48 kDa a-subunit and a 46 kDa P-subunit 
and has been the subject of particular interest in 
anticancer research. FTase promotes the addition of 
farnesyl from farnesyldiphosphate (FPP), to a cysteine 
residue of a protein substrate containing a typical - 
CAAX motif at the carboxyl terminus, where C repre¬ 
sents the cysteine residue that is farnesylated, A is an 
aliphatic amino acid, and X represents the terminal 
amino acid residue. The list of known substrates of 
FTase comprises the H-, N-, and K-Ras proteins, 
nuclear lamins A and B, the y subunit of heterotrimeric 
G-proteins, and several proteins involved in visual 
signal transduction. 

Motivation for Computational Studies 

The initial studies on the activity of this enzyme in the 
early 1990s, showing that famesylation was absolutely 
required for the oncogenic forms of the Ras proteins 
to transform cells (Jackson et al. 1990), gave rise to 
a large number of experimental studies which helped 
to understand and characterize FTase. Kinetic and 
spectroscopic studies helped to unveil many details 
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FTase Resting state Binary Complex Ternary Complex Product Complex 

(FTase-FPP) (FTase-FPP-CAAX) (FTase-Product) 



t_ 

Product Release 


Farnesyltransferase - Computational Studies, Fig. 1 Schematic representation of the main steps in the catalytic mechanism of 
the enzyme FTase with the indication of the main intermediates formed 


on this enzyme, but it was in 1997 (Park et al. 1997) 
with the publication of the hrst X-ray structure of an 
FTase enzyme (resting state, PDB: IFTl) that a major 
breakthrough was accomplished. 

In the subsequent years, other X-ray structures 
followed, including that of the binary complex 
FTase-FPP. Finally, in 2002 with the publication of 
the first X-ray structures for the FTase ternary complex 
(FTase-FPP-Peptide) and product complex (Long 
et al. 2002), a complete series of structures 
representing the main intermediates along the catalytic 
pathway of the mechanism of this enzyme (see Fig. 1) 
was finally made available. 

In spite of the X-ray structures available, and of the 
valuable kinetic, mutagenesis, and spectroscopic 
results, a number of mechanistic dilemmas arising 
from the existence of apparently contradicting experi¬ 
mental evidence limited an atomic level understanding 
of the farnesylation process. These experimental con¬ 
tradictions and the corresponding implications have 
been reviewed in the literature (Sousa et al. 2005b). 

Central to many of these mechanistic doubts is 
a catalytically relevant Zn(II) cation. Flowever, most 
spectroscopic techniques normally used in the study 
of biological metal complexes that explore the UV- 
vis and microwave spectral regions cannot be success¬ 
fully applied to treat biological Zinc systems 
(Penner-Hahn 2005). Alternative methodologies play, 
therefore, a key role in obtaining insight into the struc¬ 
ture of biological Zinc complexes. Synchrotron 
spectroscopy. X-ray absorption spectroscopy (XAS) 
and, of course, crystallography are among the few 
experimental methodologies that can be used with 
some success. For many problems however, the 


application of theoretical and computational methods 
is essential to fully understand a given biological Zinc 
system, both at the structural and mechanistic level. 
FTase is one of such cases. 

Defining the Metal Coordination Spheres 
Formed During Catalysis: The 
Carboxylate-Shift Mechanism 

The first challenge in the computational modeling of 
FTase activity was the identification of the several 
Zinc coordination spheres formed along the catalytic 
pathway of this enzyme. In fact, from the X-ray 
structures and EXAFS data available several doubts 
on the type of coordination and on the identity of the 
ligands present at the Zinc coordination sphere 
persisted (Sousa et al. 2005b). 

Quantum calculations have been used on two 
differently sized active-site models built from the 
closest X-ray structures and include the following; 

(1) the Zinc coordination sphere (ca. 25-37 atoms); 

(2) the Zinc coordination sphere plus all residues within 
a 5 A radius (ca. 121-140 atoms). Such models were 
optimized in vacuum with B3LYP/SDD (type 1 models) 
or with ONIOM (B3LYP/SDD:PM3) (type 2 models), 
and the energetics of the different coordination pro¬ 
posals were evaluated in single point energy calcula¬ 
tions with higher basis sets and the lEF-PCM method. 
A dielectric constant (e) of 4 was taken as an approxi¬ 
mation to represent the average effect of both the protein 
and buried water molecules, a standard approach in 
quantum chemical computational enzymology (Ramos 
and Eemandes 2008). In some cases, the potential 
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Farnesyltransferase - Computational Studies, Fig. 2 The Zinc coordination sphere at the main intermediate states in the 
catalytic mechanism of FTase 


energy surface for the interconversion between different 
minima was also assessed. 

In particular, this approach has been applied to the 
analysis of the Zinc coordination sphere in the resting 
state (Sousa et al. 2005a), upon binding of the CAAX 
substrate (FTase ternary complex) (Sousa et al. 2005c) 
and for the FTase-Product complex (Sousa 
et al. 2007b). These results, together with the previous 
X-ray and EXAFS data, and with kinetic results on 
mutant species (particularly with a Asn297P mutant 
(Fu et al. 1998)), led to the acceptance of a new 


paradigm in FTase activity regarding the nature of 
the Zinc coordination sphere (see Fig. 2). In particular, 
this new model for FTase activity assumes that the 
Zinc coordination sphere exists at the resting state as 
an equilibrium between two states with a change from 
a monodentate coordination mode to bidentate by 
Asp297P (carboxylate-shift Rardin et al. 1991; Sousa 
et al. 2007a) and that such process is critical in lowering 
the activation barriers for ligand entrance and product 
exit from the metal coordination sphere. The calcula¬ 
tions performed also helped to show: (1) that the CAAX 



Farnesyltransferase - Computational Studies 


733 


F 


Farnesyltransferase - 
Computational Studies, 
Fig. 3 Schematic 
representation of the large 
distance between the two 
reactive atoms S and Cl (in 
red) observed in all the X-ray 
structures for the ternary 
complex (FTase-FPP-peptide 
substrate), illustrating the 
“distances paradox” 



Peptide 

(CAAX) 


Cys299|« 
I Asp297p 


[His362p 


substrate cysteine residue initially binds Zinc as a thiol 
and only subsequently loses the proton to yield a tightly 
hound Zinc-thiolate; (2) that upon product formation 
the thioether formed retains Zinc coordination. 

Together, these studies helped to identify the 
several metal coordination sphere formed along 
the catalytic pathway of this enzyme, preparing ground 
for more global mechanistic studies on the activity of 
this enzyme. 

Bridging the Gaps 

All the available FTase crystallographic structures of 
the ternary complex (FTase-FPP-peptide substrate) 
show a distance of more than 7.5 A between the two 
reactive atoms (the C1 of FPP and the peptide thiolate 
sulfur atom S) (Fig. 3). This distance must be reduced 
for the reaction to occur. Two distinct alternatives had 
been suggested in the literature. One of the hypothesis 
suggested that the peptide thiolate dissociates from 
Zinc with simultaneous translocation nearer to FPP. 
The other alternative proposed a movement of the 
FPP carhon Cl toward the Zinc-hound peptide 
thiolate, through a rotation involving the two first 
isoprene subunits. Several computational studies have 
tried to solve this problem, which became known as the 
“distances paradox” (Sousa et al. 2005b). 


Quantum mechanics (B3LYP) and the ONIOM 
method (B3LYP:PM3) have been used to evaluate, 
from an energetic point of view, thiolate dissociation 
from the Zinc coordination (Sousa et al. 2007c). 
The results obtained indicated that thiolate dissociation 
from Zinc would implicate just by itself a reaction 
barrier higher than 22 kcal/mol, a value which would 
be significantly higher than the 16 kcal/mol suggested 
from the experimental kinetic measurements for the 
global process involving both distance decrease and 
the chemical step leading to product formation (Huang 
et al. 2000), hence disproving the first hypothesis. 

Quantum calculations based on a molecular 
mechanical scan involving the full enzyme in a box 
of waters to assess the viability of a movement of the 
FPP Cl toward the Zinc-bound peptide thiolate, 
through a rotation of the first two isoprenoid subunits, 
was also attempted. The results indicated that such 
hypothesis could effectively approach the two reacting 
elements to a distance of 5 A, with a very moderate 
energetic cost (10 kcal/mol) consistent with the 
experimental data available. 

Cui et al. have also analyzed this conformational 
change using potential of mean force calculations with 
the full enzyme (Cui and Merz 2007), and following 
initial MD studies on substrate binding (Cui 
et al. 2005), and found that by changing the peptide 
target from the peptide substrate CVIM to the 
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Farnesyltransferase - 
Computational Studies, 

Fig. 4 Schematic 
representation of the transition 
state structure calculated for 
the chemical step in the 
reaction catalyzed by FTase, 
indicating the more important 
distances (A) and the atomic 
charge of the directly 
participating atoms. The figure 
also illustrates the 
computational model 
employed in the calculations 
described 
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non-substrate CVLM led to an increase in the transition 
free energy barrier height that qualitatively agreed with 
the experimental results available. Together, these com¬ 
putational studies helped to consolidate the idea that it is 
the FPP substrate that suffers a conformational arrange¬ 
ment prior to the chemical step. 

The Chemical Step 

The nature of the chemical step of the reaction 
catalyzed by FTase was subject of major controversy 
during the last decade, with several evidences seeming 
to point to two very distinct alternative mechanisms. 
The two likely possibilities were, at that time, either an 
electrophilic mechanism (SnI or dissociative) with 
formation of a farnesyl carbocation on C1 of FPP, or 
a nucleophilic mechanism (Sn 2 or associative) with 
a direct displacement of the pyrophosphate (PPi) 
from FPP by the thiolate at the cysteine residue of the 
CAAX motif. 

With time and amidst several doubts, a more inter¬ 
mediate vision became dominant, with the mechanism 
seen as associative with dissociative character, 
proceeding through a highly polar “exploded” transi¬ 
tion state, with considerable Cl-0 bond cleavage and 
modest Cl-S bond formation, and with a partial 


positive charge on Cl, and partial negative charges 
on the Zinc-coordinated sulfur, and on the diphosphate 
leaving group (Sousa et al. 2005b). A computational 
analysis could provide the atomic level detail needed 
for the full understanding of this process. However, its 
application proved to be particularly challenging. 

A first computational attempt to model this elusive 
reaction step by Lenvevitch et al. (Lenevich 
et al. 2007) employed gas-phase structures of ethane 
thiolate and of a 10-carbon geranyldiphosphate to 
model the transfer reaction (MPWlN/6-31 + G(d) 
level of theory), yielding some clues into the possible 
structure of the transition state of this reaction. How¬ 
ever, in this study the effect of the Zinc coordination 
sphere in the transition state structure was not accounted 
for, a critical aspect for an accurate modeling of this 
reaction, particularly taking into consideration that it is 
a Zinc-bound thiolate that acts as a nucleophile in this 
reaction, and that both the reactant and the product in 
this step retain Zn-S coordination. 

A 72-atom quantum mechanical active-site model 
(see Fig. 4) treated at the B3LYP/SDD//B3LYP/6-311 
++G(3df, 3pd) level of theory, comprising the Zinc 
coordination sphere, and the directly participating 
atoms and surrounding amino acid residues was also 
used to analyze this step (Sousa et al. 2009). Following 
a linear transit scan performed along the (FPP)Cl-S 
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(thiolate) (starting from a structure resulting from 
the FPP conformational rearrangement outlined in the 
previous section), the maximum and minimum obtained 
were used as starting guesses to find the transition state 
and the product. All stationary states were confirmed 
taking into consideration the number of imaginary fre¬ 
quencies, and thermal and entropic corrections were 
added to the single point energies calculated in an 
lEF-PCM continuum model with a dielectric constant 
of 4. Figure 4 illustrates the structure of the transition 
state model obtained for the chemical step. 

A 10.9 kcal/mol activation barrier for the chemical 
step was obtained, a value that together with the 10 kcal/ 
mol envisioned for the FPP conformation step pointed 
toward global activation barrier of 21 kcal/mol, a value 
that is in excellent agreement with the one suggested 
from kinetic studies for this step in the absence of Mg^"^ 
(20 kcal/mol) (Furfine et al. 1995; Huang et al. 2000). 
The chemical step has also been evaluated in a more 
recent computational study using classical molecular 
dynamics (MD) and ab initio Car — Parrinello (CP) 
QM/MM calculations considering the full enzyme and 
which estimated the energy barrier for the process to be 
10.8 kcal/mol (Ho et al. 2009). 

The presence of an elusive Mg^"^ ion, which is 
known to decrease the barrier height of this process 
to 16 kcal/mol,was also tested at several positions in 
our study (Sousa et al. 2009). The computational 
results obtained indicated all that such inclusion signif¬ 
icantly decreased the 21 kcal/mol barrier obtained in the 
absence of Mg greatly lowering the energy barrier for 
the chemical step and leaving the FPP conformational 
change as the main energetic determinant for this 
process. 

Globally these computational studies (Sousa 
et al. 2009; Ho et al. 2009) confirmed the existence of 
an associative mechanism with a partially dissociative 
character postulated from experiment, with 
a significant Cl-0 bond cleavage, and very modest 
Cl-S bond formation. The computational results 
obtained so far also confirmed the carbocation 
character of the Cl atom at the transition state, and 
the development of additional negative charge at the 
Zinc-bound sulfur atom and at the diphosphate leaving 
group. The analysis performed (Sousa et al. 2009) also 
illustrated the importance and effectiveness of the 
Mg^"^ ion in lowering the activation barrier for 
the farnesylation reaction, although the exact position 
of this ion is still subjected to debate. 


Cross-References 

► Carboxypeptidase A - Computational Studies 

► Computational Fnzymology 

► Computational Studies of Fnzyme Motions 

► Molecular Dynamics Simulations of Lipids 

► ONIOM 

► Superoxide Dismutases 

► Zinc-Dependent Metalloenzymes - Computational 
Studies 
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Fatty Acid Amide Hydrolase - 
Computational Studies 

Alessio Lodola and Marco Mor 
Dipartimento Farmaceutico, Universita degli 
Studi di Parma, Parma, Italy 

Synonyms 

Anandamide amidohydrolase; FAAH; FAAH-1 


Definition 

Fatty acid amide hydrolase (FAAH, EC 3.5.1.4) is an 
enzyme specialized in the cleavage of the amide bond of 
fatty acid amides, including the endocannabinoid anan¬ 
damide and other terms with biological significance. 

Anandamide (AEA, arachidonoylethanolamide, 
N-arachidonylethanolamine) is an endogenous ligand 


for cannabinoid receptors, subtypes 1 and 2, and for 
other targets, such as the transient receptor potential 
vanilloid type-1 channel (TRPVl). 

Basic Characteristics 

Eatty acid amide hydrolase (EAAH) is a membrane 
enzyme widely distributed in mammalian tissues 
which is responsible for the inactivation of biologically 
active fatty acid amides (McKinney and Cravatt 2005). 
In particular, EAAH terminates the signal brought by 
the endocannabinoid anandamide (AEA), catalyzing 
its hydrolysis to arachidonic acid and ethanolamine 
(Eig. 1). FAAH has wide substrate specificity, as it 
also catalyzes the hydrolysis of other amides. These 
are: (1) other ethanolamides, including oleoyletha- 
nolamide (OEA) and palmitoylethanolamide (PEA) 
which are agonists of the peroxisome proliferator- 
activated receptor (PPAR) subtype alpha; (2) oleamide 
(OA, oleoylamide) identified in the brain of primates 
as a sleep-inducing factor; (3) N-acyltaurines, which 
have been recently reported to be activators of the 
transient receptor potential vanilloid type-1 (TRPVl) 
channel. FAAH has been shown to cleave ester sub¬ 
strates in vitro, in particular the endocannabinoid 
agonist 2-arachidonoyl glycerol (2-AG). However, 
this cleavage does not seem to have a biological sig¬ 
nificance, as other enzymes (e.g., monoglyceride 
lipase) control the levels of 2-AG in vivo. 

FAAH is a member of the amidase signature (AS) 
family, featured by (1) a conserved serine- and gly- 
cine-rich stretch of about 130 amino acids (including a 
GXSXG motif); (2) a unique catalytic triad consisting 
of two serine residues and one lysine residue, rather 
than the common serine-histidine-aspartate triad found 
in classical serine hydrolases (McKinney and Cravatt 
2005). FAAH is also unusual among AS enzymes in 
that it is an integral membrane enzyme. Furthermore, 
sequence analysis identifies in FAAH an N-terminal 
transmembrane domain that is not present in other AS 
enzymes. 

Structure-Function Relationship and 
Mechanism of Inhibition 

The catalytic mechanism of FAAH has been 
widely investigated with both experimental and 
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Fatty Acid Amide 
Hydrolase - Computational 
Studies, Fig. 1 Reaction 
catalyzed by FAAH 




Fatty Acid Amide 
Hydrolase - Computational 
Studies, Fig. 2 Catalytic 
mechanism for FAAH 
acylation in presence of fatty 
acid ethanolamides. 

R represents the lipophilic 
chain of the substrate 
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computational methods (McKinney and Cravatt 2005; 
Lodola et al. 2005). These studies indicate that Lysl42 
acts as a key base and acid in distinct steps of the 
catalytic process (Fig. 2). In the early phase of the 
catalysis, neutral Lysl42 activates Ser241 nucleophile 
for attack on the substrate carbonyl, an event which 
leads to the formation of a tetrahedral intermediate 
(TI). In the final step of the reaction, the positively 
charged Lysl42 protonates the (substrate) leaving 
group of the TI adduct, leading to its expulsion and to 
the formation of an acylenzyme. The impact of Lysl42 
on Ser241 nucleophile strength and on leaving group 
protonation occurs indirectly, via the bridging Ser217 
of the triad which acts as a “proton shuttle” (Lodola 
et al. 2005). The catalytic cycle of FAAH terminates 
with the hydrolysis of the acylenzyme, which restores 
a functional FAAH catalytic core. Typically, serine 
hydrolases cleave ester substrates at higher rates com¬ 
pared with structurally similar amides, reflecting the 
relative intrinsic reactivity of these compounds. FAAH 
represents a remarkable exception to this principle, as 
it hydrolyzes amides faster than esters. It has been 
proposed that the efficiency of FAAH in catalyzing 
amide cleavage might have an evolutive meaning. In 
vivo, FAAH must catalyze the hydrolysis of fatty acid 
amides in an environment which is rich in endogenous 
fatty acid esters. If FAAH would act as a classical 
serine hydrolase, it might have been saturated by 
lipid esters and failed to function as an amidase 
in vivo (McKinney and Cravatt 2005). 

Rat FAAH has been crystallized in complex with 
the covalent inhibitor methyl-arachidonyl- 
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Fatty Acid Amide Hydrolase - Computational Studies, 
Fig. 3 Substrate binding pocket for FAAH, modeled with anan- 
damide (AEA, green carbon atoms) 


fluorophosphonate (Bracey et al. 2002). Visual inspec¬ 
tion of the structure reveals that FAAH has a series of 
channels and cavities that are crucial for its biological 
function (Fig. 3). These include the membrane access 
channel (MAC), connecting the active site to an open¬ 
ing located at the membrane anchoring face of the 
enzyme, the cytosolic access (CA) channel, that allows 
exit of hydrophilic products from the active site, and 
the acyl chain-binding pocket (ABP) that 
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Fatty Acid Amide 
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Studies, Fig. 4 Chemical 
structure of relevant FAAH 
inhibitors 
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accommodates the arachidonyl chain of co¬ 
crystallized inhibitor, and possibly interacts with the 
substrate acyl chain during the catalytic reaction. 

FAAH enzyme activity is blocked by a variety of 
classical serine hydrolase inhibitors (Otrubova et al. 
201 1). Among them, A-alkylcarbamic acid aryl esters, 
exemplified by URB597 (Fig. 4), emerged as the first 
drug-like class of compounds able to inhibit FAAH 
in vivo (Kathuria et al. 2003). URB597 has been 
shown to inhibit FAAH by carbamoylating the active- 
site nucleophile Ser241 (Otrubova et al. 2011). Docking 
simulations of URB597 within FAAH suggested the 
presence of alternative binding orientations for this 
inhibitor (Mor et al. 2004). Quantum mechanical/ 
molecular mechanical (QM/MM) calculations on the 
formation of the covalent adduct between FAAH and 
URB524 (Fig. 4) showed that carbamoylation of Ser241 
was only possible in one specihc binding orientation 
(Lodola et al. 2008). This prediction has been recently 
confirmed by the crystallographic structure of the 
FAAH-URB597 carbamoylated adduct (Mileni et al. 
2010), showing the reliability of QM/MM approaches 
for inhibitor design. Furthermore, QM/MM modeling of 
FAAH inactivation by A-alkylcarbamic acid aryl esters 
identifies the deprotonation of Ser241 as the key reac¬ 
tion step, explaining why FAAH is insensitive to the 
electron donor effect of conjugated substituents, 
a finding that may help the design of inhibitors with 
improved selectivity and in vivo potency (Vacondio 
et al. 2009; Lodola et al. 201 1). 

Pharmacology 

FAAH is abundantly expressed in mammalian tissues. 
Genetic disruption of ths faah gene results in a marked 
increase of AEA, PEA, and OEA levels. Pharmacolog¬ 
ical investigations employing the reference inhibitor 
URB597 have shown that a persistent inactivation of 
EAAH enzyme also leads to a considerable increase of 
the level of these neuromodulators, with potentially 


important therapeutic applications. Administration of 
URB597 leads to analgesic, anti-inflammatory, anxio¬ 
lytic, and anti-depressant effects in animal models, 
without producing the undesirable side effects 
observed with cannabinoid receptor agonists (Kathuria 
et al. 2003). Therefore, EAAH is emerging as an attrac¬ 
tive therapeutic target for the treatment of several 
nervous system disorders. 
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Synonyms 

Alkanols: long chain fatty alcohols; Diacylglycerols: 
long chain diglycerides 

Definition 

Protonated fatty acids, alkanols, and diacylglycerols 
are weakly amphiphilic long chain lipids which 
strongly perturb membrane structure and stability. 

Introduction 

The main types of lipids found in eukaryotic membranes 
include the glycerophospholipids, the sphingolipids, 
and cholesterol (CHOL). Other types of lipids that are 
present in membranes are derivatives of these, such as 
free fatty acids (FAs), lysophospholipids (LPCs), and 
diacylglycerols (DAGs). Even though a single phospho¬ 
lipid can form a fluid semipermeable membrane, 
eukaryotic membranes can consist of more than 1,000 
different lipid species (Ohvo-Rekila et al. 2002). 

Although FAs and DAGs are low in abundance in 
cell membranes, they are an important class of lipid 
molecules that are intermediates in the metaholism of 
many lipid species, can serve as a fuel source for cells, 
can act as second messengers and trigger translocation 



Fatty Acids, Alkanols, and Diacylglycerols, Fig. 1 Chemi¬ 
cal structures of amphiphilic lipids which are too weakly 
polar to form any lyotropic phases on their own in water: 
(a) cholesterol, (b) diacylglycerols (dioleoyl glycerol), 
(c) alkanols (octadecanol), (d) protonated fatty acids (stearic 
acid) 


and subsequently activation of enzymes to the mem¬ 
brane, and can modulate the activity of many proteins 
(Gomez-Fernandez and Corbalan-Garcia 2007; Goni 
and Alonso 1999 and references therein). Apart from 
the de novo synthesis of FAs and DAGs, these can also 
be generated in the membrane by the action of 
enzymes such as phospholipases. 

1-alkanols (or fatty alcohols, FAlc) are not naturally 
occurring components of mammalian cells. However, 
they show many similarities with FAs and DAGs in 
their ability to perturb membrane structure and prop¬ 
erties. Moreover, they are of great interest as they can 
function as anesthetics, and as penetration enhancers 
for transdermal delivery of drugs and other bioactive 
molecules (Williams and Barry 2004). 

Many lipids, such as phospholipids, are strongly 
amphiphilic and will spontaneously self-assemhle in 
water to form a variety of lyotropic liquid crystalline 
phases. Some important classes of lipids, such as 
CHOL, DAGs, alkanols, and protonated FAs (Fig. 1), 
are too weakly polar to form any lyotropic liquid 
crystal phases on their own in water (although DAGs 
can form metastable gel phases). However, they are 
able to dissolve into membranes of more strongly 
amphiphilic lipids up to high mole fractions, strongly 
perturbing the structure and phase stability. 
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Fatty Acids, Alkanols, and Diacylglycerols, Fig. 2 Sketches 
of some of the lyotropic liquid crystalline phases adopted by 
lipids in water, in order of increasing negative interfacial curva¬ 
ture. From left to right, fluid lamellar (L^) phase, inverse 


bicontinuous cubic phases (of space groups Pn3m, Im3m, 
Ia3d), inverse hexagonal (Hn) phase, inverse micellar cubic 
(Fd3m) and inverse micellar (L 2 ) solution (Taken from 
Shearman et al. (2006)) 


As a general trend, increasing the amount of 
protonated FA, FAlc, and DAG in phospholipid 
mixtures tends to promote phases with increasingly 
negative interfacial mean curvature (Fig. 2). How¬ 
ever, their ability to do so depends on a number of 
factors such as temperature, pressure, hydration, pH, 
chain length, and degree of unsaturation in the 
chains. They are able to hydrogen-bond to phospho¬ 
lipids and partially eliminate water, reducing the 
hydration shell and effective headgroup size of 
the phospholipids, whilst increasing the splay in the 
chain region, consequently promoting formation of 
non-bilayer structures and phases. 

From a biological point of view, FAs, FAlcs, and 
DAGs can rapidly diffuse and cross membranes, which 
could result in transient clusters of high concentrations 
of these molecules, destabilizing membrane structure, 
at least on a local scale. This destabilization by favor¬ 
ing formation of “non-bilayer” regions is highly 
relevant to processes such as cell fusion and fission, 
where the flat membrane distorts to form highly curved 
structures locally. FAs, FAlcs, and DAGs do indeed 
increase the fusogenicity of membranes and have been 
implicated in processes such as membrane fusion and 
fission. 

CHOL is thought to preferentially interact with 
saturated phospholipids and sphingomyelin, forming 
fluid regions of increased acyl chain order and 
thickness (the Lq phase) in the membrane, termed 
“lipid rafts,” with unsaturated lipids tending to be 
largely excluded from these regions (Nielsen 
et al. 1999). Although cholesterol is usually associated 
with promoting the formation of lamellar phases, it 


was recently shown that CHOL induced inverse phases 
in mixtures with unsaturated PCs and diacylglycerols 
(Tenchov et al. 2006; Armstrong et al. 2002) and was 
also found to be incorporated in a highly curved 3-D 
hexagonal inverse micellar phase of space group 
Pha/mmc (Shearman et al. 2009). Such findings could 
possibly explain how cell membranes can form rigid 
Lo domains while still being able to simultaneously 
undergo fusion events by forming highly curved 
phases in the presence of CHOL. 

Fatty Acids and Alkanols 

Fatty acids consist of a carboxyl group which behaves 
as an acid attached to a long aliphatic chain which can 
either be saturated or unsaturated. The carboxyl group 
can be ionized in solution rendering it more 
hydrophilic. Carboxy acids with hydrocarbon chains 
as short as four carbons long are considered to be fatty 
acids, while fatty acids derived from natural fats and 
oils typically have chain lengths greater than eight 
carbons long. Fatty acids found in cells differ only in 
the length and degree of unsaturation of their 
hydrocarbon chains and typically contain an even 
number of carbon atoms since their biosynthesis 
involves acetate. Moreover, they can be produced by 
the action of phospholipase A 2 on phospholipids or by 
the hydrolysis of tri- and diglycerides during the 
process of digestion of fat (Salentinig et al. 2011). 

As mentioned earlier, although present at low levels 
(a few percent), fatty acids are important constituents 
of biomembranes. Apart from their role as protein and 
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enzyme activators and ion channel regulators, they are 
also involved in membrane fusion, and certain polyun¬ 
saturated FAs, such as arachidonic acid and 
docosahexaenoic acid (DHA), act as second messen¬ 
gers in cell signaling (Wassail and Stillwell 2009, and 
references therein). As they are also intermediates in 
lipid metabolism and can be used as an energy fuel for 
cells, they are consequently in constant flux, which 
implies that they have to be able to enter and 
leave cells swiftly (on the timescale of cellular metab¬ 
olism which is ^ minutes). In solution and at physio¬ 
logical pH values, long chain FAs form a 
protonated/deprotonated “acid-soap” bilayer as their 
equilibrium structure (Hamilton 1998 and references 
therein). A free carboxyl group has a pKa close to 4.6. 
However, in a phospholipid bilayer, FA ionization is 
suppressed, raising the pKa to above 7.5, and thus a 
high proportion is in the unionized form at physiolog¬ 
ical pH value. Consequently, they can diffuse across 
(flip-flop) the bilayer rapidly, with a half-life in the 
order of seconds, but they can also be accumulated on 
one side of the membrane by a pH gradient. Fast flip- 
flop rates, rapid and strong FA binding to phospholipid 
bilayers due to their low solubility in water and high 
hydrophobicity, and the observation that fast FA 
desorption occurs in the presence of albumin, reaching 
an equilibrium distribution of FAs between the mem¬ 
brane and albumin, has led to the hypothesis that pro¬ 
teins are not essential for the transport of FAs through 
membranes, although most scientists in the field 
believe that proteins also play an important role in 
FA transport. 

The phase behavior of FAs mixed with various 
lipids shows marked polymorphism and complex 
formation. For example, the action of phospholipase 
A 2 (PLA 2 ) on bilayer phospholipids causes the accu¬ 
mulation of reaction products, lysophospholipid and 
fatty acid. It has been suggested that a free fatty acid 
and a lysophospholipid can act together and structur¬ 
ally mimic an average double chain phospholipid 
(Okonogi et al. 2002). As LPC and FA do not form 
stable fluid lamellar phases when mixed alone with 
water, it is assumed that the bilayer structure formed 
in the pseudo-binary system of LPC:FA in water can be 
accredited to the acyl chains of LPC and FA associat¬ 
ing to form a complex, resembling to some extent the 
structure of the corresponding diacylphospholipid. 
However, this association is not exceptionally strong 
and is markedly less restricted, both in conformation 


and orientation, than the diacylphospholipid. The pres¬ 
ence of equimolar amounts of LPC and PA could 
explain why particular cell types and organelles 
enriched in LPC and PA remain viable, as on their 
own they increase the permeability of membranes by 
acting as “detergents.” 

Although FAs are low in abundance in most mem¬ 
branes, they are major components of the upper layer 
of the epidermis, the stratum corneum, along with 
cholesterol and ceramides (McIntosh 2003). Lafleur 
et al. reported the formation of the Lq phase in 
a palmitic acid (PA);CHOL system a decade ago 
(Pare and Lafleur 2001). This was the first time the 
formation of the Lq phase was reported for a system not 
containing a polar amphiphile. They proposed that “the 
melted palmitic acid provides a non-polar environment 
for the solubilization of cholesterol and the latter 
induces a straightening of the fatty acid chains, pro¬ 
moting a molecular order compatible with bilayer 
formation.” The Lq phase has been linked to lipid 
rafts so the phase behavior observed is of biological 
interest. The formation of ordered lipid domains has 
also been seen in the stratum corneum and a domain 
mosaic model of stratum corneum lipid organization 
has been proposed by (Forslind 1994), showing liquid- 
liquid immiscibility. In an analogous fashion to lipid 
rafts, it has been suggested that the different lipid 
domains observed contribute to regulating protein 
activity in the stratum corneum (Moore et al. 2006 
and references therein). 

FAs are known to promote the transformation of the 
lamellar phase of phospholipids to more highly curved 
liquid crystalline phases such as the inverse 
bicontinuous cubic, inverted hexagonal (Hu), and 
inverse micellar cubic (Fd3m) phases and 
consequently phosphatidylcholine (PC)/FA mixtures 
have been studied by a number of groups. As the 
protonated FAs are only weakly hydrophilic, they are 
essentially insoluble in water but are solubilized in 
a hydrated liquid crystalline phase by the phospho¬ 
lipid. FAs modify the hydrophilic/hydrophobic 
balance and tend to reduce the headgroup area/volume 
ratio, due to the small headgroup of the FA occupying 
a much smaller area at the interface compared to the 
strongly hydrated PC headgroup and by also reducing 
the effective size of the PC headgroup due to hydrogen 
bonding between the PC and FA, resulting in a smaller 
hydration shell. Consequently, liquid crystalline 
phases with lower interfacial area per molecule values 
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than that of a fluid bilayer will be stabilized, meaning 
that the transition temperature (Tn,) from a lamellar gel 
(Lp) to a fluid lamellar (L^;) phase will be elevated, and 
the Tjn of the Lot-Hn transition will be depressed. 

Saturated PC/FA mixtures with >14 carbon atoms 
in excess water show peritectic phase behavior, and 
commonly exhibit maximum azeotropic points at 
molar ratios of 1:2, respectively, transforming directly 
from an Lp phase to an Hn phase. The azeotropic point 
is due to a combination of compound formation in the 
gel phase and nearly ideal mixing in the fluid phase. 
The 1:2 composition corresponds to the amount of 
lipid involved in the compound formation being at 
a maximum; at any other PC:FA mole ratio one of 
the components will be in excess and will be excluded 
from the compound formation (Seddon et al. 1997; 
Koynova et al. 1997). Due to non-bilayer phases 
being implicated in processes such as membrane 
fusion and fission, PC:FA 1;2 mixtures are used as 
model systems to study the kinetics and mechanism 
of lamellar-non-lamellar phase transitions in lipids, 
both by simulation and experiment (Squires 
et al. 2002; Koynova and Tenchov 2001; Knecht 
et al. 2006 and references therein). Moreover, due to 
the fact that they show temperature- and pH-dependent 
fusion, such mixtures have been proposed to be possi¬ 
ble candidates as nanocarriers for drug delivery. 

The chain melting transition of saturated PC;FA 1 ;2 
mixtures having chain lengths between 12 and 20 
carbons long as a function of water content is indepen¬ 
dent of hydration. For all chain lengths, the tilted Lp- 
and rippled Pp- gel phases of the pure PC are not 
observed and are instead replaced by an untilted gel 
phase (Lp), which swells upon hydration. Hydrated 
PC;FA mixtures with chain lengths between 12 and 
14 carbons long display miscibility gaps in both the gel 
and fluid phase regions of the phase diagrams; as an 
example, a PC-rich lamellar phase and a FA-rich Hn 
phase are found in coexistence. In PC:FA 1:2 12 
carbon long mixtures (DLPC:LA) an Lot phase is 
observed within a narrow temperature range above 
the melting transition of the system. However, for 
longer chain mixtures, the L^t phase is not observed 
and the Lp phase melts directly into either an inverse 
bicontinuous cubic phase for 14 carbon chain long 
mixtures (DMPC:MA) or an Hn phase for mixtures 
with >16 carbon atoms. Interestingly, DLPC:LA 
water mixtures display a re-entrant Hn phase at low 


and high hydrations between 30°C and 50°C, with 
and cubic phases appearing at intermediate hydrations. 
The inverse phases adopted by DLPC:LA 1:2 mixtures 
appear in the order of Hn - Ia3d - Pn3m - lm3m 
(see Fig. 2) with increasing hydration. Increasing 
temperature produces the same phase sequence of the 
bicontinuous cubic phases but with the Hn phase being 
adopted at higher temperatures. Increasing the chain 
length in PC:FA 1:2 mixtures results in the cubic 
phases being disfavored, with the DMPC:MA 1:2 sys¬ 
tem adopting the Pn3m and Im3m phases and the cubic 
phases being abolished in mixtures with a chain length 
>14 carbons and being replaced by an Hn phase 
instead. By progressively increasing the amount of 
LA in DLPC:LA mixtures, the following phase 
sequence is observed: lamellar, Ia3d, Pn3m, Im3m, 
Hn, and finally the highly curved inverse micellar 
Fd3m cubic phase is observed at high LA mole 
fractions. Conversely, in phosphatidylserine (PS):FA 
1:2 mixtures at neutral pH, the L^ phase is stable over 
a large temperature range; this stability has been attrib¬ 
uted to the negative charge of the PS (Seddon 
et al. 1997; Koynova and Tenchov 2001, and 
references therein). 

Mixtures of PC with long chain fatty alcohols show 
remarkably similar behavior to those of PC;FA 
mixtures, with evidence for 1;2 complex formation 
and as with the 1:2 PC:FA mixtures, highly fusogenic 
temperature-dependent liposomes made of PC:FAlc 
1:2 mixtures can be prepared. As with the FAs, the 
interfacial mean curvature of a lipid system can be 
tuned by the addition of FAlcs resulting in PC:FAlc 
mixtures forming phases of increasingly negative 
interfacial curvature as the FAlc content is increased. 

Due to their ability to tune curvature and interact 
nonspecifically with the lipid membrane, alkanols are 
known to act as anesthetics and penetration enhancers 
for transdermal delivery of drugs and other bioactive 
molecules (Williams and Barry 2004). Short chain 
1-alkanols act as anesthetics, although the anesthetic 
potency is cut off in alkanols with a chain length >12 
carbons long (Griepernau et al. 2007 and references 
therein). It is known that the anesthetic effect can be 
reversed at high hydrostatic pressures. In a similar 
fashion to FAs, it has been suggested that addition of 
FAlcs to a PC membrane will alter the lateral stress 
profile of the lipid bilayer so as to favor the formation 
of curved phases, due to an increased repulsion in the 
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chain region and decreased repulsion (as well as 
a decrease in the area/volume ratio) in the headgroup 
region (Seddon et al. 1997). Increasing the amount or 
chain length of FAlcs in the membrane results in the 
formation of phases with increasing negative curva¬ 
ture, with the highly curved Fd3m phase being adopted 
for fully hydrated PCiFAlc 1:3 mixtures with a chain 
length >14 carbons long (Fluang et al. 1996). 

Short chain alkanols are known to depress the Tm 
of the Lp-Lo: phase transition whereas long alkanols 
elevate it; however, this effect on the phase transition 
is not a monotonic function of alkanol concentration 
but is biphasic, where the Tm is lowered at low 
alkanol concentrations and elevated at high concen¬ 
trations, for both short and long chain alkanols 
(Kamaya et al. 1986). Alkanols are thought to be 
intercalated with the phospholipid chains and 
anchored near the interface. Moreover, MD simula¬ 
tions have shown that short chain alkanols stabilize 
the Lo( phase by a reduction in the order parameter of 
the chains and a decrease in packing density, whereas 
for long chain alkanols, the Lp phase is stabilized 
instead due to an increase in chain order and bending 
moduli, in agreement with experiments (Griepemau 
et al. 2007). 

Tuning the interfacial curvature of a lipid mem¬ 
brane can be achieved by the addition of FAs or 
FAlcs and as a function of temperature. Additionally, 
due to the ionization properties of FAs, chemically and 
electrostatically sensitive PC:FA fusogenic liposomes 
can be prepared, where a change in pH or ion concen¬ 
tration will act to tune the preferred mean curvature of 
the system. 

Diacylglycerols 

DAGs (Gomez-Fernandez and Corbalan-Garcia 2007 ; 
Goni and Alonso 1999, and references therein) are 
lipid molecules consisting of two fatty acid chains 
covalently bonded to a glycerol moiety through ester 
bonds. The physiologically pertinent isomer is 
1,2-diacyl-sn-glycerol; however, the fatty acyl 
moieties attached to the glycerol backbone can vary 
significantly (chain length, degree of unsaturation). 
Naturally occurring DAGs possess a saturated fatty 
acid chain in the in-1 position and an unsaturated or 
poly-unsaturated sn-2 chain. DAG has a weakly polar 


headgroup rendering DAG hydrophobic and conse¬ 
quently it remains relatively dehydrated in the mem¬ 
brane. As a result, the combination of a negligible 
amount of bound water and the unsaturated, mobile 
sn-2 chain results in rapid transbilayer movement 
with a small energy cost. Attempts to measure the 
rate of DAG flipping, using a variety of techniques 
have shown that DAG undergoes rapid transbilayer 
movement (with a half-life in the order of a few milli¬ 
seconds), although the precise rate remains uncertain. 

Apart from the de novo synthesis of DAGs, they can 
also be generated by the action of enzymes such as 
phospholipase C (PLC). Although the concentration of 
DAG is low in cell membranes, they are an important 
class of molecules that act as second messengers in 
cell signaling, are key intermediates in the metabolism 
of many lipid species, trigger translocation and subse¬ 
quently activation of enzymes to the membrane, and 
modulate the activity of many membrane proteins. 
Moreover, it has been shown that DAGs are involved 
in membrane fusion and fission, and are required for 
the secretion of molecules in the trans-Go\gi network. 

Although DAGs are known to form metastable gel 
phases on their own in water, they do not form any 
lyotropic liquid crystalline phases. However, they can 
be dissolved in lyotropic liquid crystal phases of more 
strongly amphiphilic lipids up to high mole fractions, 
strongly perturbing the structure and phase stability. 
Work by Hamilton (Hamilton et al. 1991) and 
coworkers found that DAGs are oriented along the 
bilayer normal in PC membranes and adopt 
a conformation similar to that in the crystalline struc¬ 
ture of phospholipids and the phase of DLPE and 
some PCs. 

Incorporating DAGs in phospholipid membranes 
gives rise to a marked lipid polymorphism. Hydrogen 
bonding between DAGs and phospholipids results in 
the partial elimination of bound water from the 
phospholipid headgroup, causing surface dehydration. 
Numerous studies have investigated the phase behav¬ 
ior of fully hydrated PCs and DAGs with saturated 
chains and, as with FAs and FAlcs, stoichiometric 
compound formation is observed. Immiscibility 
between DAG and phospholipids is well known and 
this effect is even more pronounced in mixtures of 
unsaturated lipids upon addition of cations; for exam¬ 
ple, immiscibility between DAG:PC compounds 
formed and pure PC can begin at less than 1 mol% 
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DAG. Increasing the amount of DAG in phospholipid 
mixtures tends to cause highly curved inverse phases 
to he adopted. As DAG is a type II lipid, it will 
inherently like to promote inverse phases and as 
a result, its ability to induce a lamellar to Hn transition 
has been extensively documented. It has been shown 
that 1 mol% of various DAGs can lower the temper¬ 
ature of the lamellar to Hu transition in PC and PE 
bilayers by as much as 2-10°C. The length and degree 
of unsaturation of DAG significantly influences the 
ability of the system to undergo a lamellar to Hn 
transition. 

Two cubic phases have been discovered in DAG- 
phospholipid mixtures, namely, the inverse micellar 
cubic Fd3m phase (Luzzati et al. 1992) and the 
bicontinuous cubic Pn3m phase (Nieva et al. 1995) 
and it was recently shown that DOPC;DOG;CHOL 
1:2:1 mixtures can form a highly curved 3-D hexago¬ 
nal inverse micellar phase of space group Pha/mmc 
(Shearman et al. 2009). Moreover, Armstrong et al. 
showed that a mixture of PC, sphingomyelin, CHOL, 
and DOG exhibited inverse hexagonal and unidentified 
cubic phases (Armstrong et al. 2002). As these are 
typical “raft” compositions, it is interesting to see 
that regions with increased CHOL and DAG content 
within rafts can adopt such inverse structures. 

Summary 

Protonated fatty acids, alkanols, and diacylglycerols 
are lipids that are too weakly amphiphilic to form any 
lyotropic liquid-crystalline phases on their own in 
aqueous solution. However, they can be soluble up to 
high mole fractions in phospholipid membranes, 
tuning the interfacial mean curvature toward inverse 
curved phases or structures. This is presumably also 
related to their ability to promote membrane fusion and 
fission. Another effect is that they hydrogen-bond to 
phospholipid headgroups, partially dehydrating them. 
Their small, weakly polar headgroups allow them to 
undergo transmembrane flip-flop more rapidly than 
phospholipids. 
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Synonyms 

Electron transfer; ESRS; Ultrafast raman spectroscopy 

Definition 

Eemtosecond stimulated Raman spectroscopy (ESRS) 
is a time-resolved vibrational technique capable of 
probing molecular structure with a 40 fs temporal and 
10 cm”' spectral resolution. 

Introduction 

All chemical reactions can conceptually be thought 
of as a series of elementary bond making or break¬ 
ing steps. Formally these bond alterations can be 
classified into three distinct types: redox reactions 
involving electron transfer, acid-base reactions, and 
rearrangement reactions. Most complex biochemical 
reactions operating in nature are comprised of these 
fundamental chemical steps functioning with high 
spatiotemporal control to optimize overall efficiency 
(Berg et al. 2002). In order to describe such 
reactions a detailed structural and energetic descrip¬ 
tion of the reactants, products, and involved transi¬ 
tion states is imperative. Therefore capturing the 
whole reaction trajectory from the inception of the 
chemical trigger to product formation by monitoring 
the nuclear coordinates represents the grand chal¬ 
lenge in chemical reaction dynamics (Zewail 2000; 
Grim 2008). 
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Classically reaction mechanisms are formulated on 
the basis of a reaction coordinate. A reaction coordi¬ 
nate (RC) is defined as a set of curvilinear coordinates 
along which the “configuration of the reactants” 
smoothly converts to the “conhguration of the prod¬ 
ucts,” and is usually given by the path that leads to 
reaction completion (lUPAC definition). Unfortu¬ 
nately this abstract definition of the RC overlooks 
the interplay of anharmonic coupling between the 
“critical” normal modes, and thus its inherent rela¬ 
tionship to the structure of reactants (Levine and Mar¬ 
tinez 2007). The reactivity profile is usually encoded 
in the anharmonicities and potential energy surface 
crossings of the participating vibrational modes. 
Thus it is completely dehned by the molecular struc¬ 
ture and initial bonding geometry of the reactants. As 
the reaction proceeds, changes in the frequency of 
these discrete modes will directly report on the 
dynamics of the bond breaking or making steps. Typ¬ 
ically any vibrational technique can provide 
a dynamic structural resolution at the limit of ^0.01 
A for bond length changes which is superior to the 
static X-ray images of crystallized chemical interme¬ 
diates. Therefore, a time-resolved vibrational tech¬ 
nique has to be employed to probe the molecular 
system across its entire gamut of vibrations from 
the low-frequency torsions (10-500 cm”*) to the 
high-frequency bond stretches (1,000-3,600 cm”*) 
to identify the key reactive modes (► Nonlinear IR 
Spectroscopy of Proteins, ► Infrared Spectroscopy of 
DNA, ► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics). 

Femtosecond stimulated Raman spectroscopy 
(FSRS) is a high resolution time-resolved broadband 
vibrational technique which can theoretically follow 
reactive dynamics on the timescale relevant to 
atomic motions from tens of femtoseconds to few 
nanoseconds (Weigel and Ernsting 2010; 
Laimgruber et al. 2006; Kukura et al. 2007; Frontiera 
and Mathies 2011). FSRS is a three-pulse experiment 
as depicted in Fig. 1. FSRS uses a femtosecond trig¬ 
gering pulse (A, = 400-800 nm) to initiate a desired 
photochemical event. The broadband nature of this 
excitation pulse (^500-800 cm”*) along with its 
ultrashort duration (^25-50 fs) creates a coherent 
superposition of molecular vibrational states (or 
a wavepacket) on the excited state. These 
wavepackets evolve on the excited state potential 
while accessing various regions of the phase space 


FSRS Detection 



Femtosecond Stimulated Raman Spectroscopy, 
Fig. 1 Potential energy surface representation of a 
phototriggered chemical reaction that is temporally monitored 
by femtosecond-stimulated Raman spectroscopy (FSRS). Vibra¬ 
tional information on the evolving wavepacket with ~25 fs 
precision allows one to capture a structural glimpse of the entire 
multidimensional reaction coordinate 


defined by the shape of the multidimensional surface. 
During its evolution period, a combination of 
narrow bandwidth picosecond Raman pulse (A = 
490-720 nm or 800 nm) and a broadband femtosec¬ 
ond probe pulse (^2,000 cm”* bandwidth) interro¬ 
gates the molecular structure of the reactive species 
by collecting stimulated Raman spectra with 
^5-10 cm”* spectral resolution (McCamant et al. 
2004). The femtosecond nature of the probe pulse 
ensures that the Raman process is initiated with high 
temporal precision and thus provides exquisite time 
resolution to follow ultrafast reaction dynamics. There¬ 
fore by simply changing the temporal delay between 
the actinic trigger and the probe pulse, one can record a 
series of vibrational snapshots of an evolving mole¬ 
cule on the multidimensional reaction coordinate 
with unprecedented time-resolution and stmctural 
sensitivity. 
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The initial portion of this chapter will briefly 
describe the working principles of FSRS, and subse¬ 
quently illustrative examples of ultrafast isomerization, 
electron and proton transfer reactions will be presented 
where FSRS has provided a unique energetic and struc¬ 
tural glimpse. Finally, the chapter concludes with 
a glimpse of what lies ahead for FSRS as a technique 
in its quest for decoding the underlying principles of 
chemical reactivity in condensed phase systems. 

FSRS Methodology 

Fundamentally resonance Raman spectroscopy 
(► Resonance Raman Spectroscopy of Protein-Cofac¬ 
tor Complexes, ► QM/MM Calculations of Vibra¬ 
tional Spectra) possesses an intrinsic vibrational 
selectivity for a specific molecular framework within 
a complex biomolecule or in a sea of other molecules 
(Long 2001). When the Raman excitation wavelength 
is tuned to the electronic resonance of the molecule, 
scattering cross-section enhancements of up to 10“ can 
be achieved (Lombardi and Birke 2009). Additionally, 
since most of the biophysical measurements are done 
in 55 M water, the vibrational contrast achieved 
through resonance Raman scattering enables the 
suppression of fundamental and overtone bands of 
FI 2 O. The resonance enhancement factor together 
with stimulated detection, which allows for maximal 
collection of scattered photons, makes Raman 
a powerful structural technique for monitoring bio¬ 
physical reactions in aqueous media. 

Traditionally time-resolved Raman spectroscopy 
had been limited to the picosecond domain due to the 
time-energy bandwidth product. From this relationship 
it follows that using a pair of 100 fs Gaussian pulses for 
a time-resolved measurement will generate poorly 
resolved Raman spectra (^150 cm”' resolution) lead¬ 
ing to complete loss of structural information for large 
macromolecules. In addition, many fluorescent mole¬ 
cules are very hard to study with conventional Raman 
due to intense emission background that obscures 
Raman features. Initial efforts to record Raman data 
in the sub-picosecond domain were carried out by 
Yoshizawa using stimulated gain spectroscopy 
(Yoshizawa and Kurosawa 2000). Later Mathies and 
coworkers independently developed the technique 
called femtosecond stimulated Raman spectroscopy 
(FSRS), and showed for the first time that the nominal 


uncertainty limitations (AxAv ^ 5,300 fs cm”*) can 
be circumvented (McCamant et al. 2003, 2004). 
The clever choice of utilizing a narrowband picosec¬ 
ond Raman excitation pulse in combination with stim¬ 
ulating broadband femtosecond probe pulse forms the 
conceptual backbone of this technique as described 
below in detail. 

The detection of femtosecond Raman snapshots for 
a reactive molecule was made possible by incorporat¬ 
ing a stimulated broadband probe pulse within the 
traditional picosecond Raman experimental scheme. 
The breakthrough comes from the disentanglement of 
the time-energy bandwidth product via simultaneous 
multi-wavelength dispersed detection of the probe 
pulse spectrum thereby making it possible to record 
structural information comparable to or faster than the 
vibrational time periods of the probed motions 
(Kukura et al. 2007). The FSRS process with Stokes- 
side Raman detection is diagrammatically shown in 
Fig. 2a and b. The pulse timing and the corresponding 
evolution of the time-dependent density matrix of the 
major FSRS pathway is shown in Fig. 2c (Frontiera 
et al. 2008; Sun et al. 2008). A broadband actinic 
excitation creates a population on the excited state 
surface that initiates the photochemistry. After 
a delay time AT, simultaneous arrival and interaction 
of a broadband probe pulse along with a narrowband 
Raman pump pulse creates a coherent superposition of 
vibrational states v and v -F 1 denoted as Iv -F 1 > <vl on 
the excited potential energy surface with a high time 
precision. A second interaction of the time-evolving 
density matrix with the relatively long (^1-3 ps) 
Raman pump pulse completes the transition to 
the v -F 1 state by creating a population denoted by 
Iv -F 1 > <v -F II, and leads to the emission of a Raman 
photon. Therefore, the Raman transition involves the 
destruction of a pump photon while creating a probe 
photon with the energy deposited into the Raman 
active modes. Since the second interaction can take 
place anytime within the vibrational de-coherence 
time, the dispersed probe detection step is not time- 
resolved (Pollard and Mathies 1992; Lee et al. 2004; 
Yoon et al. 2005). This method of interrogating the 
temporal changes in molecular vibrational degrees of 
freedom critically depends on the intrinsic lifetime of 
the induced vibrational coherence which determines 
the observed line width. 

The advantages of FSRS over other traditional 
vibrational spectroscopies are many. Due to the 
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Femtosecond Stimulated Raman Spectroscopy, Fig. 2 (a) 

Experimental setup for the FSRS detection. Two phase-locked 
choppers in the actinic pump line and the Raman line allow for 
Raman spectrum detection in the presence and absence actinic 
excitation. Due to the single-shot detection of the dispersed 
probe spectrum by the CCD, very fast data acquisition can be 
achieved, (b) The energy profile of the Raman pump and probe 
FSRS pulses is shown. The normalized ratio of Raman-on/ 
Raman-off is used to extract the background free Raman spec¬ 
trum. (c) Timing diagram for a three-pulse FSRS experiment. 


The actinic pump initiates the photochemistry by creating 
a population on the excited state. After a time delay AT between 
the Raman pump and probe pulses creates a vibrational coher¬ 
ence on the excited state. Subsequently another interaction with 
the Raman pulse completes the Raman transition and thereby 
emitting a stimulated Raman photon in the probe pulse direction. 
Portions of images shown above has been adapted from images 
in Annual Review in Physical Chemistry with permission from 
Kukura et al. (2007) 


coherent nature of the spectroscopy and its transmis¬ 
sion data collection geometry, FSRS can be used to 
structurally address intensely florescent molecules 
(Shim et al. 2008). Additionally FSRS does not suffer 
from a non-resonant background which is a common 
menace in coherent anti-Stokes Raman spectroscopy 
(CARS) (Cheng and Xie 2004). This simplifies the 
analysis of the spectra and also makes the technique 
easier to implement for microscopy applications. But 
the major advantage that FSRS presents is the usage of 
tunable broadband visible pulses for addressing struc¬ 
tural questions through shot-to-shot data acquisition in 
a ^2,000 cm“' wide spectral window at a kilohertz 
repetition rate. 


FSRS Setup 

Detailed descriptions of the FSRS setup can be found 
in other relevant papers and reviews (McCamant et al. 
2003, 2004; Kukura et al. 2007; Shim and Mathies 
2008). The next section will involve a brief description 
of the basic elements of the experiment to make the 
reader aware of the nonlinear optical phenomena that 
are routinely used to accomplish the detection of stim¬ 
ulated Raman spectrum. 

As with many other femtosecond time-resolved 
technique the source of optical pulses for the FSRS 
setup is derived from the amplified Ti:sapphire laser 
system. The fundamental pulse coming out of 
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a regenerative amplifier is divided up into three por¬ 
tions to produce pulses which are finally focused on the 
sample to implement the FSRS scheme. The weakest 
of the three portions is used to generate the broadband 
continuum pulse from a sapphire plate. Using 
a requisite filter and a prism compressor, a tunable 
^20 fs probe pulse can be used for Stokes or anti- 
Stokes measurements. The actinic pump pulse is gen¬ 
erated using a conventional noncollinear optical para¬ 
metric amplifier (NOPA) via mixing of a 400 nm pump 
pulse with the sapphire-generated continuum pulse in 
a nonlinear beta barium borate (BBO) type-II crystal 
(Cerullo and De Silvestri 2003). The mixing condi¬ 
tions can be varied via both the temporal overlap 
between the beams and the crystal angle to tune the 
color from 470 to 730 nm with varying bandwidths. 
After the NOPA generation, the actinic pulse is com¬ 
pressed down to 25-35 fs using a prism compressor. 
The time resolution of the instrument is measured 
using the optical Kerr-effect (OKE) (Castner and 
Maroncelli 1998; Steinmeyer et al. 1999) based 
cross-correlation measurement in glass and is usually 
~50 fs in the NIR region. 

The tunable Raman pump pulse is created in a two- 
step process: First the output of a second NOPA is 
spectrally filtered and compressed to provide a weak, 
picosecond pulse which then acts as a seed for 
a narrowband OPA. This seed pulse is amplified via 
a stretched 400 nm pump pulse in a long BBO type-I 
crystal in order to provide a Raman pulse with 
200-1,000 nJ energy. Using this methodology, the 
wavelength of the Raman pump can be tuned between 
470 and 650 nm which allows for the greatest flexibil¬ 
ity in addressed chemical samples (Shim and Mathies 
2006, 2008; Co et al. 2010). It will be evident through 
our illustrative examples how the ability to resonantly 
enhance the vibrational signatures ultimately leads to 
FSRS detection of molecules with small Raman cross 
sections. Traditionally Stokes-side FSRS can be col¬ 
lected with a 800 nm Raman excitation pulse generated 
through a grating filter which selects a narrow band¬ 
width of the broad spectrum and is compressed to 
picoseconds subsequently. Although it is an inefficient 
process of generating the Raman pulse, the enormous 
energy of the fundamental (800 nm) comes to the 
rescue. However the observation of the low-frequency 
modes is limited under such conditions due to strong 
overlap of the fundamental line within the spectral 
profile of the sapphire-generated continuum probe. 


With a visible probe and a noncollinear pump-probe 
geometry, Raman lines at frequency as low as 
100 cm“^ can be detected without Rayleigh 
interference. 

The most critical issue in any form of Raman spec¬ 
troscopy is the development of an efficient data collec¬ 
tion scheme. Typically most of the experiments 
reported to date have used dual photodiode arrays 
(PDA) which are ideally suited for the collection of 
large photon fluxes (10^-10** counts) due to large well 
depth (McCamant et al. 2004). However due to the 
poor signal-to-noise ratio of PDAs, multiple averages 
have to be carried out, which reduces the duty cycle of 
detection process. To partially solve this issue, Zinth 
and coworkers devised a single-shot detection scheme 
with phase-locked choppers (Fig. 2a) with fast PDA 
and were successful in collecting FSRS data at shot-to- 
shot rate (Faimgruber et al. 2006). More recently due 
to the development of fast and superior CCD technol¬ 
ogy, this methodology now can easily be implemented 
to allow detection of low-noise unsaturated spectra at 
kHz repetition rates. A single FSRS spectrum which is 
obtained by normalizing the Raman pulse-on with the 
Raman pulse-off data can now be recorded within 2 ms 
while keeping sample exposure limited to only a few 
picoseconds. As an example, a Raman spectrum of 
cyclohexane with S/N ^10^ can be obtained within 
a second under optimized beam geometry conditions. 
Therefore when interrogating excited state structure 
limited by small excitation population, the above- 
mentioned detection methodology reduces the data 
collection time at least by a factor of 5. 

Understanding Chemical Reactivity 

FSRS is a powerful vibrational structural technique 
that can be used to probe the ground state geometry 
of fluorescent molecules both in far-field spectroscopy 
and microscopy applications (Ploetz et al. 2007; Shim 
et al. 2008; Spillane et al. 2009; Wilson et al. 2009; 
Lockard et al. 2010). More recently FSRS has been 
used routinely to elucidate the excited state structure 
and chemical reaction dynamics in ultrafast photo¬ 
chemical reactions (Kukura et al. 2005; Dasgupta 
et al. 2009; Fang et al. 2009; Frontiera et al. 2009; 
Shim et al. 2009; Fockard et al. 2010). As discussed 
earlier, the resonance Raman selectivity gives FSRS 
a huge advantage as a tool over other available 



F 


750 


Femtosecond Stimulated Raman Spectroscopy 


time-resolved structural techniques to look at complex 
biological macromolecules. Specifically in colored 
proteins, the structural evolution of bound chromo- 
phores can be temporally mapped avoiding the inter¬ 
fering vibrational signatures of the peptide backbone 
or its side chains. FSRS studies on three distinct 
photoactive molecular systems which have lead to the 
advancement in our understanding of the underlying 
reaction mechanisms are described next. 

Bilin Photoisomerization in Phytochrome 

Rearrangement reactions are typically unimolecular 
chemical transformations that lead to switching 
between two isonuclear ground electronic states. 
Photoisomerization of conjugated organic chromo- 
phores within proteins present a remarkable case of 
such rearrangement reactions which lead to light- 
induced cellular responses in living organisms 
(Mathies et al. 1988; Rudiger and Thummler 1991; 
Schoenlein et al. 1991; Kandori et al. 1992; Rudiger 
1992; Wang et al. 1994). One such photoactive protein 
that uses isomerization as a working paradigm for light 
sensing is phytochrome (Rockwell et al. 2006). The 
bilin chromophore in phytochrome responds to an 
incident red photon via a Z-to-E transition around the 
Ci 5 =Ci 6 methine bridge. From a chemical reaction 
standpoint, the contributing reaction coordinate(s) in 
such a bond twisting process can be assumed to be 
mainly the Ci 5 =Ci 6 torsion. However since the large 
bilin chromophore has multiple conjugated C=C 
bonds along with other functionalities, even the most 
compelling ethylenic bond isomerization can have 
additional vibrational coordinates involved in the 
reactive mechanism. These “unknown” reactive coor¬ 
dinates can influence the timescale of product forma¬ 
tion, and also its reaction efficiency which is typically 
scored by the measured quantum yield. Knowledge of 
such nuclear motions is critical for establishing robust 
design principles of efficient artihcial biomimetic sys¬ 
tems (Hampp 2000). 

In plants and bacteria, the photochemical inter¬ 
conversion between the red (Pr) and far-red (Pfr) 
absorbing forms of phytochrome initiate the intracel¬ 
lular signal transduction pathways. The Pf-to-Pf, pho¬ 
toreaction leads to a macroscopic protein conformation 
change which is photodriven by an ultrafast Z-to-E 
bond isomerization at the Ci 5 =Ci 6 methine bridge 
within the bilin chromophore (see Fig. 3a) (Wagner 
et al. 2005; Essen et al. 2008). It has been widely 


debated whether the localization of the Z-to-E isomer¬ 
ization is the Ci 5 =Ci 6 double bond, or other portions 
of the bilin molecule are responsible for the reactive 
trigger. Additionally the molecular origin of the sur¬ 
prisingly poor in vitro quantum yield of 10-15% 
(depending on organism) for the overall reaction has 
not been properly addressed. 

FSRS is ideally suited to address such an isomeri¬ 
zation process as it can provide a structural view of the 
reaction coordinate. Recently a series of vibrational 
snapshots of the evolving bilin structure during the 
Pf-to-Pfr transformation were reported (Dasgupta 
et al. 2009). Stokes-side FSRS spectra (shown in 
Fig. 3) were obtained using a NIR (800 nm) Raman 
pump pulse after red excitation by a 25 fs actinic pulse. 
The immediate spectral blueshift of the high-frequency 
C=C stretch mode localized on the D-ring within 
600 fs from 1,630 to 1,609 cm“* indicates a consider¬ 
able amount of structural evolution in the bilin 
backbone near the Ci 5 =Ci 6 bond in the intermediate 
I* state. The low-frequency region of the bilin Raman 
spectra contains wagging and in-plane bending modes 
that sensitively report on structural distortion. One of 
the most informative modes in such a conjugated car¬ 
bon framework is the C-H hydrogen-out-of-plane 
(HOOP) mode which appears at 816 cm“* in the I* 
state. The enhanced intensity and the considerable 
blueshift of this mode (assigned as a C 15 -H HOOP) 
with respect to the ground state frequency (794 cm“*) 
is an indication of non-planarity at the Ci 5 =Ci 6 region 
as the bilin is undergoing isomerization. 

The vibrational spectra between 3 and 30 ps show 
a decrease in the peak amplitudes; however the mode 
frequencies do not evolve signihcantly after 5 ps 
(Dasgupta et al. 2009). The relative stagnancy in the 
observed spectral positions of the modes till 100 ps 
indicate that the isomerization is nearly complete by 
3 ps (data not shown here). However the decrease in 
the amplitudes of the signal signifies a population 
bifurcation on the excited state surface. Most of the 
excited population relaxes back to the ground state 
through a non-radiative channel by 3 ps while 
a minor fraction (15%) evolves to the isomerized 
Lumi-R* state. It should be mentioned that in the 
Lumi-R photoproduct the methyl-methyl clash 
between the C/D rings force the Ci 5 =Ci 6 bond to 
non-planarity. Therefore the low quantum yield is per¬ 
haps explained by an excited state barrier arising from 
methyl-methyl clash and the protein steric interactions 
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Femtosecond Stimulated Raman Spectroscopy, Fig. 3 

(Top) Three-dimensional rendering in VMD of the active 
site of phytochrome in the P, form from Synechocystis 6803 
(Essen et al. 2008). The bilin in its Pf, geometry has been 
modeled in through a bond rotation around the Ci 5 =Ci 6 bond 
for visualization of the reaction. {Bottom, left) FSRS spectra 
representative of the time-evolving bilin structure. The local 
structure around the C-D ring corresponding to the ground 
state, I* (at 0.6 ps) and Lumi-R/Pf, state (after 40 ps) is 


depicted. The spectral intensities have been normalized to 
enhance the distinct vibrational features. The pure P, and 
Pf, end-state spectra were obtained from ground state FSRS 
experiments {Bottom, right). Potential energy surface of the 
Pr-to-Pfr reaction showing the various reaction coordinates 
along which the reaction proceeds until the C-D ring flip 
happens through the Ci 5 =C |6 bond. The images in the above 
figure have been reproduced with permission from Dasgupta 
et al. PNAS 2009 


near the D-ring which tries to drive the chromophore 
back to Pf ground state. Transient electronic and steady 
state fluorescence experiments have indicated that the 
excited state lifetime of the state is ^30 ps thereby 
completing the whole bilin isomerization process 
(Sineshchekov and Akhobadze 1992; Teuchner et al. 
1995; Andel et al. 1997). This leads to the conclusion 
that most of the structural change actually takes place 
on the long-lived excited state surface. The overall 


reaction is depicted on a representative potential 
energy surface in Fig. 3. Our FSRS data show that 
the HOOP mode is a critical part of the reaction coor¬ 
dinate along with the C=C torsion which leads to the 
D-ring flip about the Ci 5 =Ci 6 bond. 

FSRS has been used to also probe similar structural 
rearrangements in rhodopsin and bacteriorhodopsin 
(Kukura et al. 2005; McCamant et al. 2005; Shim 
et al. 2009), in both cases unraveling the vibrational 
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modes responsible for the high quantum yield and fast 
timescale of the reactions. From these classes of exper¬ 
iments it appears that the common thread in selective 
C=C bond isomerization is the ground state strain in 
the photoactive double bond. Additionally the FIOOP 
modes conspire with the torsion to lead the molecule 
toward the photoproduct state. The varying efficiencies 
of the reactions are possibly a direct consequence of 
the protein architecture around the chromophores. 
More work has to be done to elucidate the role of the 
protein in the sensitive control of quantum yields. 

Ultrafast Interfacial Electron Transfer 

Oxidation-reduction reactions are an important class of 
chemical reactions that are prevalent in all intracellular 
biosynthetic and photosynthetic pathways (Beratan 
et al. 2009; Magnuson et al. 2009; Reece and Nocera 
2009). The electron transfer (ET) reaction is the most 
fundamental of the redox transformations. While ther¬ 
mally activated electron transfer reactions follow the 
conventional Marcus theory, (Marcus and Sutin 1985) 
photoinduced electron transfer can be finished on 
a timescale faster than solvent relaxation, i.e., in less 
than 10 ps. Therefore in order to understand the ET 
process at the ultrafast time regime, novel experiments 
and theoretical formulations have to be developed. 
Specifically, it is critical to monitor the contributions 
from all involved nuclear coordinates in order to deter¬ 
mine the driving forces for ultrafast electron transfer 
(Yoshihara et al. 1994). 

The ultrafast ET reaction is ubiquitous in biology, 
an important functional manifestation of which is the 
photosynthetic energy production (Gray and Winkler 
2003). Our lack of understanding of the fast and effi¬ 
cient charge separation reaction in the photosynthetic 
reaction center protein represents the tip of this chal¬ 
lenge. Inspired by this reaction, artificial solar energy 
capture and conversion devices use fast photoinduced 
interfacial ET (lET) reactions to derive usable electri¬ 
cal or chemical energy (Magnuson et al. 2009). One 
example of such a device is the dye-sensitized solar 
cell (DSSC) which comprises a dye chemisorbed 
onto a Ti02 coated Pt electrode (O’Regan and Gratzel 
1991). Electronic excitation of the dye leads to electron 
injection into the Ti 02 conduction band while simul¬ 
taneously creating a hole on the donor dye molecule. 
Usually in such systems, electron transfer occurs on 
a 5 fs to several picoseconds timescale. The exact 
molecular mechanism and disparity of injection 


timescales involved in these systems where donor 
and acceptor are in the strong coupling limit is poorly 
understood. Here it is important to stress that slower 
ET injection rates (>1 ps) can result in reduced effi¬ 
ciency of the overall processes as fast internal conver¬ 
sions or intersystem crossings can channel the 
excitation energy away to non-useful pathways 
(McCusker 2003). With such rapid transfer the sol¬ 
vated donor does not have time to thermally equilibrate 
as traditionally assumed in the Marcus picture. Thus 
specific vibrational modes of the molecule are 
expected to play a significant role in the charge transfer 
process preempting the conventional thermalized adi¬ 
abatic ET mechanism. 

The modified Marcus theory considers the effects of 
one or two vibrational degrees of freedom that can 
promote electron transfer (Jortner 1976). Although 
this assumption simplifies the dimensionality of the 
problem and has been successful in enabling many 
theoretical predictions, it is challenging to predict 
rates for ultrafast reactions. Specifically when design¬ 
ing new generations of efficient DSSCs with complex 
molecular architectures for Ti 02 attachment, it is 
extremely important to understand the nuclear motions 
that drive the wavepacket out of the excited (D*-A) 
surface (Eig. 4a). Systems which have rapid electron 
transfer and can sustain typically long-lived charge 
separation lifetimes (at least three to four orders longer 
than the initial ET timescale) lead to solar cells with 
increased efficiency (Hagfeldt and Gratzel 2000). 
Thus, unraveling the key nuclear motions which pro¬ 
mote the forward ET and retard the recombination can 
lead to insights on the design of efficient and poten¬ 
tially cheap solar cells. 

To address this problem, a time-resolved structural 
investigation on the lET reaction of Coumarin 343 
(C343) bound to anatase titania nanoparticles has 
been recently reported (Erontiera et al. 2009). Since 
C343 is fluorescent when free in solution, the FSRS 
approach presents a unique opportunity to look at both 
the ground state and the time-evolving structure of the 
excited molecule. Steady state resonance Raman spec¬ 
trum with Raman excitation at 500 nm is shown in 
Fig. 4b. It is known from Heller’s time-dependent 
formalism that the vibrational modes with large 
excited state displacements (A, displacement between 
potential energy minima of excited state and ground 
state) along the normal coordinate have high resonance 
Raman scattering cross sections. For these highly 
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Fig. 4 Electron transfer reaction at the dye-semiconductor 
interface, (a) Potential energy scheme representing ultrafast 
electron donation from photoexcited electron Donor (D*) into 
the acceptor (A) followed by creation of a charge-separated 
state, D'^A^. (b) Vibrational modes participating in the 
electron donation from the excited coumarin molecule to Ti 02 


displaced modes, the initial slope (dV/dQ at equilib¬ 
rium geometry, Qo) encountered after absorption on 
the excited potential energy surface leads the system 
out of the Franck-Condon (FC) region (Lee and Heller 
1979; Myers and Mathies 1987). As presented in 
Fig. 4b, the resonance Raman spectrum of the C343- 
Ti 02 bound system shows that the ring breathing 
modes at 632 and 785 cm“* are the ones that are 
strongly enhanced. In fact, most of the intense modes 
involve a stretching character in the carbon ti back¬ 
bone, and no predominately nitrogen stretching modes 
are active. Since the forward FT occurs on a rapid 
timescale (^100 fs), it is likely that the initial motions 
that lead the system toward the charge-separated state 
(D'^A”) involve the ring breathing modes in conjunc¬ 
tion with high-frequency carbon-stretching modes 
(delocalized ring C=C stretch). Because the enhanced 
vibrational modes observed in the resonance Raman 
spectra do not include significant motion of the nitro¬ 
gen atom, the donated electron comes from the 


b 



Raman shift, cm '' 


are captured by the ground state (G.S.) resonance Raman 
spectrum shown in blue. Subsequent to the formation of 
the charge-separated state, the radical cation has a very distinct 
electronic and geometric structure as indicated by the red-shift 
of the C=C/C-C mode to 1,549 cm^' in the Raman 
spectrum obtained through time-resolved FSRS experiment 
(red trace) 


7i-bonding backbone rather than the nitrogen center. 
Additionally coordinated motion from many carbon 
atoms is required for this ultrafast electron transfer to 
happen efficiently. Therefore resonance Raman shows 
that it is imperative to consider multiple modes rather 
than just one to two modes for explaining and 
predicting the reaction. 

In order to get information on the structure of rad¬ 
ical cation, time-resolved vibrational spectra were 
recorded with a 500 nm phototrigger and Raman exci¬ 
tation at 800 nm, as depicted in Fig. 4b. Previous 
transient absorption work on colloidal C343-Ti02 
identified a 100-200 fs component due to the charge 
injection process and a 190 ps charge recombination 
time. The poor signal-to-noise ratio in the FSRS spec¬ 
tra results from a lack of resonance enhancement but 
still shows three peaks at 1,549, 1,358, and 1,205 cm“' 
corresponding to the C-N/C=C stretch, C-H in-plane 
bend, and CH 2 rock/C-C stretch, respectively. Com¬ 
paring the frequency of these peaks to calculated 
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DFT frequencies, it was concluded that the hole 
resides on the carbon backbone rather than on the 
N-center of C343. 

Observing structural changes during lET with 
FSRS offers important molecular insights into the 
charge transfer process. The geometric changes that 
occur in the initial Franck-Condon region of the 
excited state surface are critical to our understanding 
of the modes involved in the IFT process. The above 
measurements confirmed that coordinated motion 
along many degrees of freedom of the carbon-centered 
ring leads to fast and efficient electron transfer. 
In addition, the location of the hole after electron 
transfer was structurally characterized. As back elec¬ 
tron transfer is frequently detrimental to the perfor¬ 
mance of DSSCs, this knowledge could be used for 
rational synthetic design of future dye molecules with 
similar skeletal arrangement and perhaps more spa¬ 
tially separated hole-stabilizing groups. Eventually 
FSRS may be applied to unravel the most challenging 
and intriguing electron transfer reaction that takes 
place in photosynthetic reaction centers (Renger and 
Schlodder 2010). 

Excited State Proton Transfer in Green Fluorescent 
Protein 

Acid-base reactions are one of the most fundamental 
chemical reactions. Their omnipresence in a variety of 
biological processes all the way from metabolite¬ 
generating enzyme catalysis to proton pumps for 
ATP generation has been well documented (Belevich 
et al. 2010; Bothma et al. 2010; Cleland 2010; Lee 
et al. 2010). Vast number of studies have been 
conducted to shed light on the detailed mechanism of 
acid-base reactions in aqueous phase, but the need to 
find a suitable structural technique to monitor fast 
proton transfer dynamics in real time, particularly in 
biomolecules, still remains (Mohammed et al. 2005). 
Proton transfer events are usually understood via 
a one-dimensional reaction coordinate where the rela¬ 
tive position of the with respect to the partner atoms 
is tracked via transient changes in bond distances 
(Agmon 1995). This presumes that most of the proton 
transfer reactions should occur during the ultrafast 
timescale of bond stretching (usually ^tens of femto¬ 
seconds). However most proton transfer events occur 
on a slower timescale (ps-ms) which clearly shows that 
there are other important structural parameters which 
must be considered (Coe et al. 2007). Therefore direct 


monitoring of such a process would lead to unambig¬ 
uous assignment of operative mechanism behind the 
proton transfer event. 

Green fluorescent protein (GFP) is biologists’ 
favorite gene expression marker that emits in the 
green (508 nm) upon UV-excitation (Shimomura 
et al. 1962; Chalfie et al. 1994; Tsien 1998). Despite 
its revolutionary impact in bioimaging (Remington 
2006), the fundamental understanding of the mecha¬ 
nism for its bright and efficient luminescence is still 
missing. Since the first X-ray crystallographic struc¬ 
ture of wtGFP was reported in 1996 (Ormo et al. 1996), 
the structural basis for efficient green fluorescence in 
GFP has been extensively studied using fluorescence 
techniques and time-resolved IR studies (Lossau et al. 
1996; Kennis et al 2004; Vengris et al. 2004; Usman 
et al. 2005; Stoner-Ma et al. 2006). It has been pro¬ 
posed that the protein-generated S65-Y66-G67 GFP 
chromophore undergoes an ultrafast excited state pro¬ 
ton transfer (ESPT) reaction facilitated by a well- 
defined hydrogen-bonding network consisting of 
amino acid side chains and a conserved internal water 
molecule (Wat22) as shown in Fig. 5a (Chattoraj et al. 
1996). Therefore the GFP molecular system presents 
an opportunity to unravel details of a unique proton 
transfer event via an external phototrigger while being 
an excellent candidate to monitor and isolate nuclear 
motions that drive efficient fluorescence. In order to 
reveal the choreography of the ESPT process, 
a structural technique with high enough resolution to 
access the critical 10 fs to 1 ps regime is needed. 
As one of the few techniques capable of monitoring 
complete structural information on the ultrafast time- 
scale, FSRS is ideal for studying the early structural 
events in GFP. 

The excited state FSRS spectra of the neutral chro¬ 
mophore of GFP obtained using a NIR Raman pump 
excitation with 50 fs time steps were resonantly 
enhanced due to the strong excited state absorption in 
the vicinity of 900 nm (Fang et al. 2009). The FSRS 
spectra surprisingly displayed prominent oscillatory 
features both in the frequency and intensity domain, 
which are clearly visible when plotted against the 
delay time (Fig. 5b). Fourier transforming the time 
domain quantum beats indicated that the modulations 
were germinating from an impulsively pumped 
120 cm“* vibrational mode. In addition the anti¬ 
phase relationship between the C-O stretching mode 
primarily associated with the phenolic ring and the 
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Femtosecond Stimulated Raman Spectroscopy, 
Fig. 5 Multidimensional reaction coordinate for GFP fluores¬ 
cence revealed by FSRS. (a) The unique P-barrel structure of 
GFP enclosing the 3-residue chromophore in the central region 
of the protein, (b) Raw (red solid line) and processed (blue solid 
line, broad slope removed and oscillation normalized) spectral 
oscillatory data showing the out-of-phase relationship between 
the C-0 and C=N stretching modes, associated with the oppo¬ 
site rings of the electronically excited chromophore. The intrin¬ 
sic molecular frequency change of ~32 cm^* (black dashed) 
when convoluted with the detection parameters (green solid, fits) 


matches well with experimental data. The observed frequency 
oscillation reports on the structural dynamics of the chromo¬ 
phore on the femtosecond timescale transitioning between 
a tyrosine (cyan) and the tyrosinate (orange) state, (c) Illustra¬ 
tion of the chromophore conformational dynamics inside the 
protein pocket following photoexcitation. The phenoxyl ring 
wag brings the phenolic proton closer (yellow ring) or farther 
away (red ring) from the nearhy water molecule, triggering the 
heavy atom rearrangement down the ESPT chain to transfer the 
proton. Images in the above figure has been adapted with per¬ 
mission from Fang et al. 2009 


C=N stretching mode on the imidazolinone ring indi¬ 
cates that the 120 cm“* mode is able to modulate both 
modes with a 180° phase reversal. DFT calculations on 
the model chromophore in vacuo identified a 110 cm” 
two-ring out-of-plane wagging motion in Sq that is 


insensitive either to deuteration or to electronic exci¬ 
tation (Fang et al. 2009). In addition, the aforemen¬ 
tioned conserved Wat22 is proximally connected to the 
phenolic ring via a H-bond with the -OFI group in 
the ground state. An excited wagging motion would 
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move the phenolic proton (—OH) closer or farther 
away from the water molecule, hence making and 
breaking H-bonds on a femtosecond timescale 
(^280 fs). Deconvolution of the spectral oscillation 
data from the intrinsic vibrational dephasing time of 
the C-0 mode (^500 fs) yielded an ^32 cm“* change 
in the C-0 stretching mode between the two quasi¬ 
stable states: a non-H-bonded neutral -OH (away from 
Wat22) and a partially deprotonated -OH group. 

This variation is highly consistent with the vibra¬ 
tional frequency change that is expected if the proton 
has been transferred. The overall blueshift of the 
C-0 stretching mode after the phenolic deuteron 
(—OD in D 2 O) is fully transferred on the ps timescale 
(Litvinenko and Meech 2004; Chattoraj et al. 1996) 
from the primary excited state A* of the neutral 
chromophore to the unrelaxed anionic form I* is 
^38 cm”'. This frequency shift is comparable to 
the A* spectral oscillation indicating that the 
observed early-time structural dynamics is chemi¬ 
cally relevant. Furthermore from the peak intensity 
analysis, the dwell between the reactant (A*) disap¬ 
pearance and the appearance of excited state 
deprotonated product (I*) is very small (^1 ps in 
H 2 O and ^3 ps in D 2 O), indicative of causality. This 
dwell is needed even though the photoexcited chro¬ 
mophore wags back and forth, and readjusts its H- 
bonding with Wat22 because the rest of the ESPT 
chain involving heavy atoms of S65 and E222 still 
need time to align themselves for the new proton 
configuration. As a result, it was confirmed that the 
ring wagging motion (Eig. 5c) is the principal com¬ 
ponent of the overall ESPT reaction in GPP, and it 
ingeniously sets up the stage for the latter processes 
to efficiently occur (Pang et al. 2009). This is 
undoubtedly a “good” vibration that highlights the 
chromophore-protein interaction in guiding the bio¬ 
logical functions, in this case, proton transfer that 
leads to bright fluorescence. 

The ESPT pathway in GPP clearly reveals a new 
feature that was lacking in our traditional understand¬ 
ing of proton transfer reactions (Levine and Martinez 
2007). It shows that the proton transfer reaction coor¬ 
dinate can involve low-frequency skeletal motions 
which gate the proton release step and make the step 
more efficient (Litvinenko and Meech 2004; Levine 
and Martinez 2007; Pang et al. 2009). The gating 
mechanism proposed through the GPP work should 
influence the design of new photocatalytic systems 


which heavily rely on energetically efficient removal 
of electron and proton conceitedly or in a sequential 
way (Reece and Nocera 2009). Additionally this will 
guide the targeted design of novel fluorescent proteins 
for a more versatile toolset for bioimaging, bioengi¬ 
neering, and biomedicine (Henderson et al. 2007; Ai 
et al. 2008; Andresen et al. 2008; Shaner et al. 2008). 

Future Directions for FSRS 

Mapping the dimensionality of the reaction coordinate 
through anharmonic couplings of various vibrational 
modes is the ultimate goal in chemical reaction dynam¬ 
ics. As discussed above it is clear that PSRS can 
unearth these otherwise hidden components of the 
reaction coordinate even for the most complex chem¬ 
ical reactions. The most striking example is the ESPT 
pathway in GPP where the observed vibrational 
anharmonicities in the PSRS data monitored the con¬ 
tributions of a coherent ring wagging mode to the 
proton transfer coordinate (Pang et al. 2009). 
Extending the PSRS technique to observe these types 
of interactions in a wider range of molecules and with 
higher time resolution would significantly advance our 
understanding of molecular reaction dynamics. To 
easily accomplish the detection of anharmonic cou¬ 
plings, ultrashort femtosecond pulses (^5 fs) will 
enable the creation of vibrational wavepackets in 
a large number of modes either in the ground state 
or excited state, and coherently pump vibrations up 
to 2,000 cm”* (Polli et al. 2007; Payer 2009). The 
anharmonic coupling between individual low- 
frequency and high-frequency modes will thus create 
a complex time-evolving oscillatory pattern in the 
frequency domain PSRS spectra which would give 
information on the PES shape that determines chemi¬ 
cal reactivity (Zhu et al. 1994). Therefore, a direct high 
precision temporal mapping of the multidimensional 
potential energy surface along the reaction coordinate 
should be possible using ^5 fs pulses. In future, PSRS 
is poised to tackle more intriguing photoactive biolog¬ 
ical systems to reveal the structurally encoded 
anharmonicities of vibrational modes that defines the 
multidimensional potential energy landscape leading 
to its functions. 

Another area where PSRS will be a great tool for the 
biophysics community is in structure-specific visuali¬ 
zation of soft biological tissues and in vivo cellular 
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imaging. Femtosecond stimulated Raman microscopy 
(FSRM) was first conceived by the Gilch group 
through their preliminary work on locating 30 pm 
polystyrene beads in water via known Raman signa¬ 
tures from 700 to 4,000 cm“^ (Ploetz et al. 2007). 
Spectra were accumulated for 1,000 laser shots while 
raster scanning the area, resulting in well-resolved, 
background free, broadband spectra. Recently Sunney 
Xie and colleagues took this nascent field forward by 
presenting the first report on label-free cellular images 
of lignin degradation with rapid acquisition rates and 
high-frequency phase-sensitive detection methods 
(Freudiger et al. 2008; Saar et al. 2009). This described 
method is currently limited to picosecond pulses, 
which require scanning laser frequencies to achieve 
a complete vibrational spectrum. However this initial 
step clearly suggests that exciting applications of 
FSRM in dynamically tracking intracellular events 
will be possible in near future. 

Summary 

Visualizing real-time functional activity of biological 
macromolecules has been a long-standing goal for 
chemists and biologists alike. Specifically, tracking 
transient changes in molecular structure during chem¬ 
ical reactions across a range of timescales (femtosec¬ 
onds to seconds) represents the core of this challenge. 
Femtosecond stimulated Raman spectroscopy (FSRS) 
is a newly developed broadband vibrational structural 
technique which critically addresses this question at 
the ultrafast time regime, i.e., at the limit of a bond 
vibrational period (10 fs-100 ps). FSRS uses a femto¬ 
second actinic pulse to initiate the desired photochem¬ 
istry, while a picosecond narrowband Raman pump 
pulse together with an ultrashort (20 fs) broadband 
probe pulse stimulates Raman signals spanning 
a broad vibrational range from 100 to 2,000 cm“\ 
Due to the extraordinary temporal resolution (<40 fs) 
coupled with precise bond-specific information intrin¬ 
sic to vibrational spectroscopy, FSRS is a powerful 
tool to take up challenging biophysical questions 
head on. To illustrate these capabilities, a structural 
interrogation of three fundamentally distinct classes of 
chemical reactions operative in biology, namely, isom¬ 
erization, electron transfer, and proton transfer, have 
been presented. FSRS studies yield new information 
on the working principles of the biological machinery 


in each of the cases. In conclusion, future prospects of 
FSRS and how its power can be used to completely 
expose the biological structure-function paradigm are 
discussed. 

Cross-References 

► Electron transfer 

► Infrared Spectroscopy of DNA 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Nonlinear IR Spectroscopy of Proteins 

► QM/MM Calculations of Vibrational Spectra 

► Raman Spectroscopy 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 
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Introduction 

Bacterial flagellae are helical filaments propelled 
by rotary motors, remarkable nanomachines embedded 
in the bacterial cell envelope. Powered by a proton or 
sodium gradient across the cytoplasmic membrane, the 
flagellar motor converts electrochemical energy into 
torque by interacting with the stator, a surrounding 
protein assembly embedded in the cytoplasmic mem¬ 
brane. Of the approximately 50 genes involved in the 
expression and assembly of the flagellum, about 20 
produce protein components of the flagellar motor. 
The process of flagellar assembly is well understood 
from extensive studies on assembly-defective mutants 
in two model systems Salmonella enterica and 
Escherichia coli. Structural studies have also revealed 
the molecular architecture of the rotor in great detail. 
However, the mechanism of flagellar rotation remains 
poorly understood at the molecular level, mainly 
because of the lack of structural information about 
the intact flagellar motor, including the torque¬ 
generating stator. Cryo-electron tomography 
(cryo-ET) has emerged as a 3D imaging technique 
capable of visualizing the intact flagellar motor within 


the context of living organisms at the molecular reso¬ 
lution. This entry is intended to provide an overview of 
this advanced imaging technique and its promising role 
in the understanding of the flagellar motor complex 
and bacterial motility. 

Bacterial Motility and Flagellar Motor 

Many bacteria require motility for their growth and 
survival. Although other types of motility exist (e.g., 
gliding motility), many bacteria utilize flagellae to 
achieve translational motion (Jarrell and McBride 
2008). Flagellar rotation is driven by the proton- or 
sodium-motive force across the cytoplasmic mem¬ 
brane. In most externally flagellated bacteria, the coun¬ 
terclockwise (CCW, viewed from the distal end of the 
flagellum to where it inserts in the membrane) rotation 
of the flagellum results in the bundling of the multiple 
helical flagellae and propulsion of the cell through 
liquid or viscous environments (“running”). Flagellar 
motors of most species can reverse rotational direction, 
and when rotating clockwise (CW) the flagellar fila¬ 
ments of these peritrichously flagellated species sepa¬ 
rate, resulting in random motion of the cell with little 
translational movement (“tumbling”). Bacterial motil¬ 
ity is thus a zigzag pattern of runs and tumbles, in 
which chemotactic signals favor running toward 
attractants and away from repellents (Berg 2004). 

Current knowledge of the flagellar structure and its 
rotational mechanism is based primarily on studies of 
E. coli and Salmonella (Berg 2003; Sowa and Berry 
2008). The well-conserved structure consists of the 
flagellar motor, the hook (which provides an angular 
connection to the flagellum), and the helically shaped 
flagellar filament. The flagellar motor itself can be 
divided into several morphologically distinct domains: 
the MS ring (the base of the flagellar motor), the C ring 
(the switch complex regulating motor rotation and 
switching), the export apparatus (a multiple protein 
complex located on the cytoplasmic side of the MS 
ring), the rod (which connects the MS ring and the 
hook), the L and P rings on the rod (thought to serve 
as bushings at the outer membrane and at the peptido- 
glycan layer, respectively). Furthermore, a membrane 
protein complex (MotA/MotB) functions as the stator 
by anchoring the entire complex to the peptidoglycan 
layer of the cell through a binding motif in the peri- 
plasmic domain of MotB. The flagellar motor receives 
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torque from the stator complex, and transmits it 
through the rotor to the hook and filament. 

Structural studies have provided atomic models for 
several flagellar proteins, such as flagellar filament 
protein (FliC), several C ring components (FliM, 
FliG, FliN), and the stator component (MotB) (see 
review article (Sowa and Berry 2008)). Molecular 
structures of the rotor/C ring assembly were also 
determined, by using cryo-EM to view the purified 
basal body complex (Thomas et al. 2006). The stator 
and the export apparatus have been visualized by 
using freeze-fracture preparations of cytoplasmic 
membranes, showing 10-16 stator units forming cir¬ 
cular arrays (Khan et al. 1988). Part of the export 
apparatus has been shown to localize in the central 
space of the C ring (Katayama et al. 1996). In 2006, 
the first structure of the intact flagellar motor in situ 
was determined by averaging 20 individual motor 
images obtained using cryo-ET of Treponema 
primitia cells (Murphy et al. 2006). The development 
of high-throughput cryo-ET has recently made it pos¬ 
sible to determine the structure of an intact flagellar 
motor at approximately 3.5 nm resolution using 
a thousand images of Borrrelia burgdorferi without 
imposing rotational symmetry (Liu et al. 2009). 
Asymmetric reconstruction of the flagellar motor 
reveals the curvature of the stator and flexible inter¬ 
action between the stator and the rotor, suggesting that 
inherent flexibility of the motor is required during 
flagellar rotation. 

Spirochetes and Periplasmic Flagella 

The spirochetes represent 1 of around 40 major bac¬ 
terial phyla. They are medically significant patho¬ 
gens, as these organisms are responsible for several 
major diseases in humans like Lyme disease 
{B. burgdorferi and related organisms), syphilis 
(Treponima pallidum), and leptospirosis {Leptospira 
interrogans). Lyme disease is the most common vec¬ 
tor-borne infection in the United States, and its inci¬ 
dence has been steadily increasing ever since its 
discovery in 1975. Syphilis is a common sexually 
transmitted disease in many areas of the world, 
while leptospirosis is the most common worldwide 
waterborne zoonosis. Little is known about the path¬ 
ogenesis of the spirochetes, however motility is 
likely to play an important role in the disease 


progression, since spirochetes are highly motile and 
are even able to transiently invade macrophages 
and other cells (Charon and Goldstein 2002). 

Spirochetes have an unusual and distinct morphol¬ 
ogy in which the flagella are enclosed within the outer 
membrane and are thus called periplasmic flagella 
(PE). Due to their complex morphology, the swimming 
behavior of spirochetes is more complicated than that 
of the externally flagellated bacteria. The flagellar 
motors are localized at both ends of the cell, coordi¬ 
nated to rotate in opposite directions during transla¬ 
tional motion and both CW and CCW during the 
spirochete equivalent of tumbling, called “flexing” 
(Charon and Goldstein 2002). 

The overall structure of spirochetal flagellum is 
similar to that of other bacteria, and consequently 
spirochetes are a very good model system to elucidate 
the molecular architecture of the intact flagellar motor 
in situ, as they are not only representative of highly 
motile and invasive bacterial pathogens, but their small 
cell diameter (0.1-0.3 pm) and orderly arrangement of 
flagellar motors make them ideal for in situ structural 
analysis. Einally, genetic tools are at last becoming 
available to construct mutants with specific flagellar 
genes, a necessary step to bridge functional and struc¬ 
tural studies. 

Cryo-Electron Tomography 

Cryo-ET is a 3D imaging technique, which in principle 
is comparable to computerized axial tomography 
(CAT) by which a 3D structure is reconstructed from 
its 2D projections (Erank 2006). The unique strength of 
cryo-ET lies in its potential for visualizing molecular 
assemblies and cellular components in their native 
environment. Two major technical breakthroughs: the 
preservation of biological samples in vitreous ice 
(Dubochet et al. 1988) and the automation of electron 
tomographic data collection (Koster et al. 1997) are 
responsible for this advantage (Erank 2006). Cryo-ET 
allows direct investigation of the structure-function 
relationship of molecular complexes in their cellular 
environments without fixation, dehydration, embed¬ 
ding, or sectioning artifacts. Therefore, it effectively 
bridges the information gap between the atomic reso¬ 
lution structures from X-ray crystallography/NMR and 
the low resolution images from optical microscopy 
(Baumeister et al. 1999), thereby providing valuable 
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Flagellar Motor: Molecular 
Architecture In Situ, 

Fig. 1 Cellular architecture 
of intact Borrelia burgdorferi 
at the molecular level. One 
central slice of a tomographic 
reconstruction from one 
organism is shown in the 
background, in which flagellar 
motors are readily visible. 

A 3D surface model was 
generated by manually 
segmenting the outer 
membrane (OM, light green), 
cytoplasmic membrane (CM, 
green), flagellar filaments 
(blue), peptidoglycan layer 
(PG, white) and outer 
membrane proteins (OSP, 
pink). Averaged structures of 
the flagellar motor were 
computationally mapped back 
into their cellular context 



information on the organization of cellular and 
subcellular architectures. However, there are major 
challenges for cryo-ET that still need to be addressed, 
such as the poor signal-to-noise ratio and the lack of 
effective protein labeling techniques. It is therefore 
essential to complement cryo-ET with advanced com¬ 
putational methods (e.g., subvolume analysis), label¬ 
ing techniques (e.g., antibody labeling), and molecular 
tools (e.g., single gene knockout and green fluorescent 
protein (GFP) fusion to target protein) in order to 
specifically characterize a specific macromolecular 
complex in situ. 

Cryo-ET of Intact Bacteria 

The preparation of frozen hydrated bacterial specimens 
is the key step for using cryo-ET to observe intact 
bacteria. Typical bacteria are small enough to be pre¬ 
served well in vitreous ice by using a plunge-freezing 
apparatus. However, vitreous-sectioning may be 
required to study large organisms or to minimize spec¬ 
imen thickness in order to achieve better contrast or 
resolution (Al-Amoudi et al. 2004). However, vitreous- 
sectioning currently remains challenging, therefore 


prompting a search for small or skinny bacteria, such 
as spirochetes, suitable as model systems for cryo-ET. 

Frozen-hydrated biological specimens are 
extremely sensitive to electron beam strength. Higher 
doses will induce signiflcant radiation damage, which 
will in turn reduce the resolution of the resulting tomo¬ 
graphic reconstructions. Therefore, automating data 
collection is essential not only for speeding up the 
process but also for minimizing radiation damage. 
Recently, cryo-ET data collected at high magnification 
(39 K) and optimal total electron doses (typically ^100 
e /A^) revealed unprecedented cellular details of B. 
burgdorferi. In particular, multiple flagellar motors 
are easily visible, revealing considerable detail, such 
as the characteristic bowl-shaped structure surround¬ 
ing the central rod and the turbine-like radial arrange¬ 
ment of the motor (Liu et al. 2009). A 3D surface 
representation, which is manually segmented based 
on cellular features of a tomographic reconstruction, 
reveals two flagellar complexes in the middle and two 
at the bottom of the cell tip, furthermore putting these 
features into context. Flagellar motors are embedded in 
the cytoplasmic membrane surrounded by the inner 
membrane (dark green), the outer membrane (light 
green), and the peptidoglycan layer (white) (Fig. 1). 
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Flagellar Motor: Molecular 
Architecture In Situ, 

Fig. 2 Tomography reveals 
the common features and the 
major differences in flagellar 
motor architectures in situ, 
(a-c) Center sections of 3D 
structures of the flagellar 
motor from Borrelia 
burgdorferi, Leptospira 
interrogans and Bdellovihrio, 
respectively. The 3D surface 
models are displayed from the 
side view (d-f) and from 30“ 
off X-axis (g-i), respectively 
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Molecular Architecture of Flagellar Motor 
Revealed by Cryo-Electron Tomography 

It is unrealistic to dissect the flagellar motor structure 
directly from the reconstruction, despite the detailed 
cellular architecture and the readily visible flagellar 
motors within the cell. Subvolume analysis of the 
flagellar motor is the method of choice to improve 
the signal-to-noise ratio and resolution of the flagellar 
motor structure. The recent development of a high- 
throughput cryo-ET pipeline from data acquisition 
and tilt series alignment to 3D reconstruction makes 
it feasible to generate the large number of data sets 
required for sophisticated subvolume analysis. 

A 3D reconstruction of the intact flagellar motor of 
B. burgdorferi at approximately 3.5 nm resolution was 
determined after averaging 1,280 flagellar motors that 
were computationally extracted from 322 tomograms 
(Liu et al. 2009). These tomograms were reconstructed 
from approximately 40,000 images collected with the 
electron microscope. No rotational symmetry was 
imposed during this process, as indicated by the orien¬ 
tation of the hook. This revealed for the first time, the 
asymmetric features of the flagellar motor (Fig. 2a, d, g). 
The stator is embedded in a curved cytoplasmic 


membrane, as the diameter of cell cylinder is relatively 
small (approximately 0.1-0.2 pm) and the size of intact 
motor is relatively large (approximately 80 nm), the 
intrinsic flexibility of the rotor-stator interaction is 
likely an important factor during flagellar rotation. In 
addition, transposon mutants of flgi lacked a torus¬ 
shaped structure attached to the flagellar rod, 
establishing the structural location of the spirochetal 
P ring (Liu et al. 2009). 

In order to determine the degree of conservation of 
the flagellar motor structure within spirochetes and 
other flagella-driven bacteria, the flagellae from intact 
L. interrogans (the causative agent of leptospirosis) 
(Ren et al. 2003) and Bdellovihrio hacteriovorus 
(an evolutionarily distant, predatory Gram-negative 
bacterium) (Rendulic et al. 2004) were analyzed and 
compared with B. burgdorferi. L. interrogans and 
B. burgdorferi share similar, yet not identical, flagellar 
genes (Fallen et al. 2005); One difference is that the 
L ring gene (flgH), while present in L. interrogans, is 
not present in 5. hurgdoiferi. By averaging 496 flagel¬ 
lar motors, the molecular architecture of flagellar 
motor from L. interrogans was obtained (Fig. 2b, e, 
h). The overall motor structure is more complex than 
that of B. burgdorferi, as shown in the regions of the 













F 


764 


Flagellar Motor: Molecular Architecture In Situ 


C ring and export apparatus. Furthermore, the curva¬ 
ture of the membrane and the embedded putative stator 
is striking, due to the smaller cell cylinder of 
L. interrogans (approximately 0.1 pm), and this result 
supports the model of flexible interaction as a key 
element of flagellar structure and flagellar rotation. 

Bdellovihrio is a highly motile bacterium that 
preys on other Gram-negative bacteria by using 
a single polar flagellum (Rendulic et al. 2004), 
which is very different from the periplasmic flagellae 
found in spirochetes. Bdellovihrio is a good candidate 
for study through cryo-EM of externally flagellated 
bacteria, due to its relatively small size (0.3-0.5 pm). 
A 3D reconstruction of the polar flagellar motor and 
subvolume analysis shows a difference from the 
motor of spirochetes in general (Fig. 2c, f, i). The 
C ring of Bdellovihrio is much smaller (approxi¬ 
mately 45 nm) than that of B. burgdorferi, but similar 
to that of E. coli. The structures and locations of the 
putative P ring and L ring are comparable with those 
from E. coli. However, the L ring in Bdellovihrio does 
not intersect with the outer membrane, since the rod 
penetrates the outer membrane with an additional 
membrane sheath covering itself and the hook. 
Despite the significant differences between the flagel¬ 
lar motors in different species, the MS ring (the basis 
of the flagellar motor), the C ring (switch complex), 
and the export apparatus share remarkably similar 
features. It is expected that the majority of flagella- 
driven bacteria share the fundamental machinery of 
flagellar assembly and flagellar rotation, while many 
auxiliary features observed are required for the spe¬ 
cific and enhanced flagellar functions in different 
bacteria. 


Summary 

The bacterial flagellar motor is a remarkable nanoma¬ 
chine, powered by a proton gradient across the cyto¬ 
plasmic membrane. Cryo-ET is an emerging technique 
to elucidate the molecular architecture of the flagellar 
motor in living organisms at the nanometer resolution. 
Significant advances in the cryo-ET, with combination 
of subvolume analysis and molecular tools, would 
provide a detailed structural insight on the intact fla¬ 
gellar motor in situ, thereby furthering our understand¬ 
ing of the important process of flagellar assembly and 
bacterial motility. 


Cross-References 

► 3D Electron Microscopy Based on Cryo-Electron 
Tomography 

► Bacterial Flagellar Motor: Overview 

► Electron Microscopy of Motor Structure and 
Possible Mechanisms 

► Sodium-Driven Flagellar Motor: Structure and 
Mechanisms 

► X-Ray Diffraction and Crystallography of 
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► Bacterial Flagellar Motor: Overview 


Flash Photolysis 

Igor V. Chizhov 

Institute for Biophysical Chemistry, Hannover 
Medical School, Hannover, Germany 

Synonyms 

Pulsed-laser transient spectroscopy; Time-resolved 
light-induced reactions 

Definition 

Flash photolysis is “a technique of transient spectros¬ 
copy and transient kinetic studies in which a light pulse 
is used to produce transient species. Commonly, an 
intense pulse of short duration is used to produce 
a sufficient concentration of a transient species suitable 
for spectroscopic observation” (lUPAC 1997). 

Introduction 

In 1967 Ronald Norrish, George Porter, and Manfred 
Eigen were awarded the Nobel Prize for studies of 
extremely fast chemical reaction, effected by 
disturbing the equilibrium by means of very short 
impulses of energy. The flash photolysis method is 
one those famous relaxation techniques which were 
developed by laureates and many other scientists in 
the last half of twentieth century. As it happens quite 
often in history of science, a technique developed in 
wartime has been applied in unexpected way. In 1948, 
George Porter realized that high intensity flash tubes 


designed for nighttime aerial photography during World 
War II can generate a concentration of chemical species 
(free radicals, transient triplet states, etc.) large enough 
to be observed spectroscopically. Moreover, he decided 
to use two flash tubes: one powerful 2 ms flash for 
excitation of reaction and a second weaker but shorter 
(ca 50 ps) flash of light which passed the reaction 
volume after particular delay time. This light then hits 
detector (a spectrograph with photographic plate) and 
allowed the determination of the transient concentration 
of reaction species. Thus, the technique of flash photol¬ 
ysis was bom (Porter 1950). Further developments of 
the method were mainly concentrated on increasing of 
time resolution (down to few nanoseconds using spe¬ 
cially designed coaxial flash lamps), improvement of 
detection techniques, and an extension of application to 
reactions in liquid and solid phases. 

The advent of the laser in early 1960s revolution¬ 
ized the method. Nowadays, it allows transient kinetics 
to be recorded with a time resolution of few femtosec¬ 
onds (e.g., Huber et al. 2002). The dual pulse method 
(now is known as pump-probe spectroscopy), initially 
proposed by Porter, allows this unprecedented resolu¬ 
tion because after the splitting of a pico- or femtosec¬ 
ond laser pulse to give the strong (pumping) and weak 
(probing) beams, the latter can arrive to the sample 
after precisely defined delay time from few femtosec¬ 
onds to few nanoseconds. One should bear in mind that 
the light propagates about 1 mm in air within 3 ps. 
The optical delay unit provides this variable time. 
Today, this method of ultrafast flash photolysis 
spectroscopy is one of the most active experimental 
approaches in physics, chemistry, and biology. 
In 1999, Ahmed H. Zewail received the Nobel Prize 
in Chemistry for his pioneering work in this held. 

However, this double pulse method has some dis¬ 
advantages. Firstly, for each time point one needs the 
renewal of reaction system, and secondly, the temporal 
accuracy of kinetics is greatly determined by the sta¬ 
bility of the pulses. An obvious benefit is the ability to 
record a transient spectrum at each time point. There¬ 
fore, in parallel with Porter’s first flash photolysis 
system, a setup with a single flash tube was developed 
in his lab. Here the probe light was provided by 
a continuous xenon arc or tungsten lamp, which was 
passed through a monochromator, and detected with 
a photomultiplier tube connected to oscilloscope. This 
configuration is most popular nowadays for investiga¬ 
tion of reaction kinetics in the nanosecond and slower 
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Flash Photolysis, 

Fig. 1 Flash photolysis of 
Bacteriorhodopsin. The 
transient absorption changes at 
650 nm were recorded from 
0.5 ps to 2 s after the laser 
pulses at six different 
temperatures (0-50“C, step 10). 
Experimental data (circles) and 
eight exponential fits (tines) are 
plotted 


60 



0 


10-6 10-5 10-4 io“® 10“2 10“'' 10° 

time, s 



time domain and particularly in the field of biophysics, 
where quite complicated multistep relaxation path¬ 
ways of molecules (proteins, DNA, etc.) are often 
observed. The kinetic analysis of such long transients 
needs very high accuracy of detection. Two applica¬ 
tions of such laser flash photolysis system are 
described as examples (► Protein Dynamics; Time- 
Resolved Spectroscopic Studies). 

Flash Photolysis of Bacteriorhodopsin 

Bacteriorhodopsin (BR) is a 7-alpha helical, trans¬ 
membrane protein that acts as a light-driven proton 
pump. The protein absorbs the visible light (max 
570 nm) and transfers the hydrogen ions through a bio¬ 
logical membrane. This transport occurs in a multistep 
manner and is accompanied by changes of absorption 
spectrum (so-called photocycle of BR). Therefore, 
time-resolved absorption spectroscopy of BR excited 
by laser pulse is one of the most common experimental 
methods of investigation of this protein. In Fig. 1, an 
example of such traces is depicted in which a 5 ns, 
5 mJ, 532 nm laser pulse from the Nd-YAG laser was 
used and the UV-Vis transient traces measured. Global 
kinetic analysis of these traces together with other 
wavelengths in the UV-Vis part of spectrum revealed 
the eight rate constants, their temperature dependence, 
and five distinct spectral states of the BR photocycle 
(Chizhov et al. 1996). 

Note that the ultrafast pump-probe experiments 
detect another three rate constants in the femto- and 


picosecond time domains (Dobler et al. 1988), thus 
giving 11 experimentally resolved transitions of the 
proton transportation by BR. 

Flash Photolysis of Caged ATP and 
Actomyosin Kinetics 

Two major motor proteins, actin and myosin 
are responsible for the muscle contraction. Myosin 
hydrolyses ATP molecule and converts the energy 
of hydrolysis to the mechanical force imposed on the 
actin filaments. The minimal reaction mechanism of 
the actomyosin reaction contains 15 states and 44 rate 
constants between them (Geeves et al. 1984). Most 
of the kinetic investigations of this reaction were 
performed using rapid mixing technique (► stopped- 
flow, ► quenched-flow, and ► relaxation methods). 
Flash photolysis method can substitute some of these 
measurements with better economy of sample (Weiss 
et al. 2000). The 5 ns, up to 20 mJ, 355 nm laser pulses 
have been used from essentially the same Nd-YAG 
laser as for BR flash photolysis studies. These pulses 
photolyzed the inactive precursor of ATP molecule 
(caged ATP) thus initiating the reaction of free ATP 
with actomyosin. In Fig. 2, transient light scattering 
traces indicate actomyosin dissociation, and its subse¬ 
quent reassociation is shown. Analysis of dissociation 
(exponential) and association (logistic) kinetics pro¬ 
vided important values of the apparent second-order 
rate constant of the actomyosin dissociation and the 
catalytic activity of myosin. 
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Flash Photolysis, Fig. 2 Flash photolysis experiments of 
dissociation and association kinetics of actomyosin. The light 
scattering signal decreased after the laser-induced release of free 
ATP molecules from a caged compound, indicating the dissoci¬ 
ation of actomyosin. After the long steady-state period of three 


time decades, the reformation of actomyosin complex and 
corresponding restoring of light scattering signal was observed. 
Experimental data (circles) are plotted together with theoretical 
fit (lines). Five consequent laser shots of different energy 
(2-5 mJ) have been applied to the single sample 


Concluding Remarks 

Only two examples from numerous flash photolysis 
experiments that can be found in the literature are 
described here. They underlined some basic features 
of the method related to biophysical research. Dynam¬ 
ics of active biological molecules is characterized by 
very wide spectrum of relaxation times spanning the 
range from nanoseconds to seconds and slower. Time 
constants shorter than nanoseconds are usually 
assigned to the electron dynamics of the photo-excited 
chromophores and studied by pump-probe transient 
spectroscopy. These nine orders of magnitude of time 
are equivalent to the time span of 1 s to 1 Gs (i.e., 
'^32 years). Therefore, the long life of the biological 
molecule can be recorded in the flash photolysis exper¬ 
iment. In the first example, the transient kinetics were 
recorded over the seven time decades (Fig. 1), and in 
the second one over the five (Fig. 2). The flash photol¬ 
ysis technique with the appropriate log-time data 
acquisition device (digitizing PC card or digital oscil¬ 
loscopes) allows recording of such transients. 

The actomyosin kinetics (Fig. 2) is shown starting 
from 10 ms. This is the time constant of free ATP 
release defined by photochemistry of used caged com¬ 
pound (NPE-ATP) under the experimental conditions. 
One should note however, that the flash photolysis 
method of caged compounds could overcome 


potentially the time resolution of the stopped-flow 
apparatus (^1 ms) and allow the observation of the 
kinetics of bimolecular reactions limited only by the 
time of diffusion (^ ps or shorter). 

One of the most expensive parts of the modern flash 
photolysis device is the laser. The low budget digital 
photo camera is equipped nowadays with a flash lamp 
that produces a light pulse of 10-30 ps duration and 
emits few tens of millijoules of energy. Such a flash 
lamp is adequate for feasibility studies before investing 
in more expensive equipment. 

Cross-References 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 

► Quenched-Flow Methods 

► Relaxation Dispersion 

► Stopped-Flow Techniques 

► Transient-State Kinetic Methods 
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► Flavin Mononucleotide-Binding Fluorescent Pro¬ 
teins 

► Flavins 


Flavin Mononucleotide-Binding 
Fluorescent Proteins 

Aba Losi and Cristiano Viappiani 
Department of Physics, University of Parma, 

Parma, Italy 

Synonyms 

Flavin mononucleotide; Fluorescent protein (FP); 
FMN-binding fluorescent proteins (FbFPs); Green 
fluorescent protein (GFP); Light, oxygen, or voltage 
domains (LOV); Riboflavin 5'-Phosphate (FMN) 

Definition 

Flavin mononucleotide (FMN)-binding fluorescent 
proteins (FbFPs) are novel reporters for cell micros¬ 
copy, outperforming green fluorescent proteins (GFP) 
and derivatives in anaerobic or microaerobic environ¬ 
ments (Fig. 1). Furthermore, they are better suited 


for incorporation into small-sized genomes and for 
following viral infections, protein expression, and 
protein maturation in real time (Figs. 2, 3). FbFPs are 
derived from photoreceptors of the LOV (light, 
oxygen, or voltage) superfamily and exhibit fluores¬ 
cence in the green region of the spectrum with 
quantum yield of ca. 0.2-0.3. FbFPs have been 
introduced as fluorescent reporters in 2007 and have 
been ever since exploited for a number of selected 
applications, including real-time biosensing of 
changing oxygen level within living cells. 

Basic Characteristics 

Development of FbFPs for Fluorescence Imaging 
in Anaerobic/Microaerobic Systems 

Flavin mononucleotide (FMN)-binding fluorescent 
proteins (FbFPs) are fluorescent proteins (FP) 
engineered from LOV (light, oxygen, or voltage) 
domains, photoresponsive protein units of ca. 110 
amino acids in length (► LOV proteins: 
photobiophysics; Losi and Gartner 2012). In LOV 
domains the FMN chromophore (► Flavins) is 
noncovalently bound in the dark adapted state and 
brightly fluorescent with quantum yield Op ca. 0.2-0.3 
(► Fluorescence: General Aspects; ► Fluorescence and 
FRET in Membranes). Blue-light photoexcitation leads 
to the reversible formation of a FMN-cysteine covalent 
adduct, with complete loss of fluorescence. FbFPs have 
been originally designed such that the reactive cysteine 
is mutated into alanine and the system retains perma¬ 
nent fluorescence (Drepper et al. 2007). This mutation 
can also increase Op up to 0.39. Different from green 
fluorescent protein (GFP), the FMN fluorophore 
is formed also in anaerobic and microaerobic environ¬ 
ments (Vizcaino-Caston et al. 2012). FbFPs are 
thus ideally suited as fluorescent reporters (► Optical 
Fluorescence Microscopy) of bacterial location or gene 
expression in obligate anaerobes, for example the 
opportunistic pathogens Porphyromonas gingivalis 
(Choi et al. 2011) (Fig. 1) and Bacteroides fragilis 
(Lobo et al. 2011), and the facultative anaerobe 
Rhodohacter capsulatus (Drepper et al. 2007). FbFPs 
have also been proposed as fluorescent tools to 
study fungal colonization of hypoxic niches during 
infections, based on model studies on gene expression 
in Candida albicans and Saccharomyces cerevisiae 
(Tielker et al. 2009). 
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Flavin Mononucleotide-Binding Fluorescent Proteins, 
Fig. 1 Epifluorescent visualization of intracellular PgFbFP 
{Porphyromonas gingivalis expressing a FbFP, in green) at 
24 h postinfection in human primary gingival epithelial cells 
(GECs) revealed high levels of viable bacteria. Actin cytoskel- 
eton was stained by phalloidin-TRITC {red) and nuclei stained 


FbFPs as Fluorescent Reporters during Viral 
Infections 

A small molecular weight (ca. 10 kDa) FbFP 
denominated iLOV has been designed for studying 
the dynamics of viral infections in plants and animal 
cells (Chapman et al. 2008). Tagging viruses with GFP 
derivatives often results in decreased infectivity and 
loss of FP through recombination events, due to the 
limited size of viral genomes and high recombination 
rate. The smaller size of iLOV seems to overcome this 
problem, allowing optimal packing within the genetic 
material of a virus. Furthermore the lack of need for 
a posttranslational maturation of the FMN-LOV 
domain fluorescent chromophore with respect to 
GFP, permits a better visualization of the infection 
dynamics (Fig. 2). iLOV bears several mutations 
beyond the Cys-to-Ala outlined above, aimed to 
enhance fluorescence to a <t)p = 0.44 and minimize 
irreversible photobleaching (Chapman et al. 2008). 

FbFPs as In Vivo Real-Time Reporters for 
Protein Synthesis 

The main advantage of hypoxia-tolerant FbFPs over 
oxygen-dependent FP is a better correlation between 
the production of a recombinant fused protein and the 
onset of fluorescence, without the requirement for 
oxygen during bacterial growth. An optimized 
expression vector for E. coli called evoglow is now 
commercially available (Drepper et al. 2010). Evoglow 
carries the gene for£cFbFP, codon optimized for£. coli 
expression from the LOV domain of the Bacillus 
suhtilis protein YtvA (Drepper et al. 2007). It was 


with DAPI {blue) to outline the bacterial localization in the host 
cells. The merged image displays prominent localization 
of PgFbFPs with the highly polymerized actin filaments and 
globular actin cytoskeleton around the nucleus {yellow). White 
arrows point to this dynamic interaction (Reproduced from Choi 
et al. 2011, courtesy of Yilmaz Ozlem) 



Flavin Mononucleotide-Binding Fluorescent Proteins, 
Fig. 2 Example for the utility of iLOV, a small-size FbFP, as 
a fluorescent reporter fusion for tobacco mosaic virus (TMV) 
infections. Systemic spread of TMV MP-iLOV (TMV carrying 
a gene for a viral movement protein fused with iLOV) and TMV 
MP-GFP. Upper leaves of Nicotiana tabacum at 4 days post 
inoculation. TMV MP-iLOV shows extensive systemic spread 
and unloads from all major vein classes, spreading into neigh¬ 
boring ground tissue {left). TMV MP-GFP by comparison shows 
no or limited systemic spread, unloading only patchily from the 
midrib and some secondary veins. Leaves were photographed 
simultaneously to allow direct comparison of green fluorescence 
intensity (Modified from Chapman et al. 2008, courtesy of 
John Christie) 
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Flavin Mononucleotide-Binding Fluorescent Proteins, 
Fig. 3 Comparative analysis of yellow fluorescent protein 
(YFP) and FhFP expression and in vivo fluorescence in E. coli 
cells. The development of biomass (cell density), dissolved 
oxygen tension (DOT), and FP-mediated fluorescence were 
permanently monitored in each well of a microtiter plate. The 
growth of cultures was monitored with scattered light (I-Io) at an 
excitation wavelength of 620 nm. Fluorescence emission of FPs 


was recorded at 540 nm (YFP) and 492 nm (FhFP). Data repre¬ 
sent averages from four cell cultivations. Accumulation of YFP 
and FhFP in E. coli was analyzed at the indicated time points 
(tl to t7, marked with black triangles) by immunodetection using 
specific antisera. All cell extracts were adjusted to the same cell 
density (corresponding to an OD600 of 10) (Drepper et al. 2010, 
reproduced with permission from Applied and Environmental 
Microbiology, license Nr. 2874230314600, Mar 22, 2012) 


demonstrated that iscFbFP is insensitive toward variable 
levels of dissolved oxygen during cell cultivation and 
that its fluorescence signal develops immediately after 
gene expression, thus allowing the monitoring of in vivo 
gene expression and protein production levels much 
more precisely than GFP-related FP. 

Recently, a dual-color reporter system based on 
green fluorescent iJcFbFP and red fluorescent 
protein DsRed has allowed to monitor plasmid transfer 
under both aerobic and anaerobic conditions 
(Krol et al. 2010). As a further development, genes 


for the hypoxia-tolerant FhFP evoglow and the 
yellow fluorescent protein (YFP) that is sensitive 
toward oxygen depletion have been recently com¬ 
bined. This has led to a genetically encoded Forster 
resonance energy transfer (FRET)-based biosensor 
(► Fluorescence and FRET in Membranes) able to 
detect changing molecular oxygen concentrations 
inside living cells. Speciflcally this dual-system has 
been applied to monitor in real time changes of O 2 
levels in Escherichia coli cells during batch cultiva¬ 
tion (Potzkei et al. 2012). 
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Flavins 

Giovanni Gadda 

Departments of Chemistry and Biology, Georgia State 
University, Atlanta, GA, USA 

Synonyms 

7.8- Dimethylisoalloxazine; Flavin mononucleotide; 
Riboflavin; Riboflavin 5'-Phosphate (FMN) 

Definition 

Any of a class of pale yellow, water soluble, chemical 
compounds containing a heterocyclic 7,8-dimethyl- 
isoalloxazine that includes riboflavin (Vitamin B 2 ) and 
its derivatives flavin mononucleotide (FMN) and flavin 
adenine dinucleotide (FAD). Flavins are found in all 
kingdoms of life and commonly act as electron transfer 
cofactors in flavoproteins. 

Introduction 

Flavins are any of a class of yellow, water soluble, 
chemical compounds containing a heterocyclic 

7.8- dimethylisoalloxazine that includes riboflavin 
(Vitamin B 2 ) and its derivatives flavin mononucleotide 
(FMN) and flavin adenine dinucleotide (FAD) (Fig. 1) 
(Massey 2000). Flavins are found in all kingdoms of 
life. Synthesis of riboflavin occurs in bacteria and 
plants but not in animals, which must acquire it from 
diet. The riboflavin derivatives FMN and FAD, but not 
riboflavin, are commonly associated to specific pro¬ 
teins or enzymes as cofactors. In 10-15% of the known 
flavoproteins the flavin is covalently linked to the 
protein moiety, with the remaining exploiting 
noncovalent association of the flavin and the protein. 
The first isolation of a flavin, called lactochrome, 
was in 1879 from cow’s milk by the English chemist 
A. W. Blyth (1879). The name riboflavin was coined 
when R. Kuhn using chemical synthesis elucidated the 
chemical structure of lactochrome in 1934. A year 
later, H. Theorell demonstrated that the protein moiety 
of the Old Yellow Enzyme extracted from yeast, which 
is the first flavoenzyme ever isolated, could be 
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flavin adenine dinucleotide (FAD) 
Flavins, Fig. 1 Chemical structures of common flavins 


reconstituted with either rihoflavin or FMN, but only 
with the latter the enzyme regained activity. In a sim¬ 
ilar fashion, in 1938 Warburg and Christian showed 
that the apoprotein of o-amino acid oxidase isolated 
from pig kidney was enzymatically active only when 
it was reconstituted with the extracted flavin, but not 
with riboflavin or FMN. In 1954, the structure of 
the flavin extracted from o-amino acid oxidase was 
proved by total synthesis to be FAD. These milestones 
established riboflavin as a vitamin and FMN and FAD 
as cofactors in enzyme catalysis. The covalent attach¬ 
ment of flavins to proteins was firstly established in 
mammalian succinate dehydrogenase in 1955 (Kear¬ 
ney and Singer 1955). Since then, covalent attachment 
to a protein has been demonstrated through the C(6) 
atom and the C(8) methyl group of the flavin in many 
flavoproteins. X-Ray crystallography revealed the first 
case of a flavin bicovalently linked to a protein in 2005, 
in glucooligosaccharide oxidase (Huang et al. 2005). 

Chemical Properties 

Flavins exist in three redox states: oxidized, 
semiquinone (1 e“ reduced, radical), and hydroqui- 
none (2 e“ reduced, fully reduced). Six out of the 
nine possible redox/ionic forms are physiologically 
relevant based on their pKa values (Fig. 2), with each 


form displaying a characteristic UV-visible absor¬ 
bance spectrum. In solution flavin semiquinones and 
hydroquinones react readily with molecular oxygen 
yielding oxidized flavin, which is the redox form com¬ 
monly stabilized at atmospheric oxygen conditions. An 
anaerobic mixture of oxidized and fully reduced fla¬ 
vins readily equilibrates to 5% semiquinone at pH 7.0. 
These equilibria are governed by the midpoint ► redox 
potentials of the oxidized/semiquinone (—238 mV for 
FMN) and semiquinone/hydroquinone (—172 mV 
for FMN) couples (Muller 1992). Redox equilibria 
can be significantly perturbed in enzyme-bound flavins 
due to modulating effects of the protein environment 
that surrounds the flavin, allowing the semiquinone to 
be variably stabilized between zero and 100% in dif¬ 
ferent enzymes with midpoint redox potentials ranging 
from —440 mV to -F130 mV. In vitro reconstitution of 
apoenzymes with unnatural synthetic flavins of various 
redox potential has been used to investigate the bio¬ 
chemical properties of some flavoenzymes. Flavopro- 
tein oxidases (vide infra) commonly stabilize the 
anionic semiquinone (red form) and the anionic hydro- 
quinone (Fig. 2). This is mainly due to the presence 
of a protein positive charge in proximity of the flavin 
N(l)-C(2) atoms. The modulation of the redox poten¬ 
tials and the variability of the active sites found 
in flavoenzymes contribute to the outstanding chemical 
versatility of the flavins, which are capable of both 
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Flavins, Fig. 2 Physiologically relevant redox/ionic forms of flavins 


one- and two-electron transfer processes. In prokaryotic 
and eukaryotic respiratory chains flavins couple the 
two-electron oxidation of organic compounds to the 
one-electron transfers in ► cytochromes and other 
redox centers. Moreover, flavins can act as electrophiles 
or nucleophiles, with covalent intermediates sometimes 
observed involving the flavin and the substrate of 
a flavin-dependent enzymatic reaction. Irrespective 
of whether one or two electrons are transferred, flavins 
undergo catalytic cycles that include reductive and oxi¬ 
dative half-reactions, where electron(s) are accepted on 
the flavin from an electron donor and donated from the 
flavin to an electron acceptor, respectively. 

Biochemical Functions 

It is estimated that up to 3% of prokaryotic and eukary¬ 
otic proteins contain or react with flavins. More than 
1,500 structures of flavoproteins have been deposited 
in the Protein Data Bank as of August 2010. Due to the 
chemical versatility of the flavin cofactor, flavin- 
dependent proteins are involved in a myriad of bio¬ 
chemical reactions. These include, but are not limited 


to, aerobic and anaerobic metabolism, photosynthesis, 
light-dependent DNA repair, light emission, plant pho- 
totropism, bacterial photoreception, regulation of 
biological clocks, energy production, oxygenations, 
reductions, oxidations, biodegradation, chromatin 
remodeling, apoptosis, protein folding, detoxification, 
non-redox reactions, and various biosyntheses (Muller 
1992; Massey 2000; Joosten and van Berkel 2007). 

Flavoproteins 

Flavin-dependent enzymes catalyze different types of 
reactions and are involved in a large variety of bio¬ 
chemical processes {vide ante). These enzymes have 
evolved in organisms to exploit the chemical proper¬ 
ties of the flavin cofactor by mainly suppressing 
the plethora of possible side reactions while optimizing 
the chemistry of the reaction to be catalyzed. Over the 
years several classifications of flavoproteins have been 
proposed based on the type of reaction that is cata¬ 
lyzed, the type of the organic substrate acting as reduc¬ 
ing agent, the identity of the oxidizing substrate, the 
product (if any) of the reduction of oxygen, and 
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the three-dimensional fold or the chemical mechanism 
of the reaction (Massey and Hemmerich 1980; Massey 
2000; Fraaije and Mattevi 2000; Fagan and Palfey 
2010). All of the classihcations are satisfactory since 
they depend upon whether the attention is focused on 
the biochemical, structural, or mechanistic properties 
of the enzymes that are considered. Over the years 
significant advancements have been made toward elu¬ 
cidating how the biochemical and structural properties 
of flavoproteins translate into their mechanisms and 
ultimately biological functions. Many flavoenzymes 
catalyze the oxidation of alcohols to aldehydes 
(or ketones), through a hydride transfer mechanism 
from the substrate to the flavin (Gadda 2008). Exam¬ 
ples include choline oxidase and dehydrogenase, glu¬ 
cose oxidase, lactate oxidase, flavocytochrome b 2 , and 
cholesterol oxidase. A large number of flavoenzymes 
catalyze the oxidation of amines to imines, which 
are hydrolyzed nonenzymatically in solution. This 
class includes o-amino acid oxidase, monoamine 
oxidase, sarcosine oxidase, proline dehydrogenase, 
lysine-specific demethylase 1, and polyamine oxidase. 
The reduction of carbon-carbon bonds is catalyzed, 
among others, by acyl-CoA oxidases and 
dehydrogenases, ► succinate dehydrogenase (Com¬ 
plex II), fumarate reductase, dihydroorotate dehydro¬ 
genases, and old yellow enzyme. The reduction of 
disulfides by pyridine nucleotides is catalyzed by 
flavoenzymes that utilize two cysteines in their active 
site, including ► glutathione reductase, ► thioredoxin 
reductase, lipoamide dehydrogenase, mercuric reduc¬ 
tase, and sulfhydryl oxidase. Hydroxylations of 
organic compounds by pyridine nucleotides are 
catalyzed by one- and two-component hydroxylases, 
which include p-hydroxybenzoate hydroxylase, 
kynurenine monooxygenase, styrene monooxygenase, 
MICAL, and flavin-dependent monooxygenase. Some 
flavoenzymes catalyze reactions other than oxidation/ 
reductions, among which are chorismate synthase, 
UDP-galactopyranose mutase, DNA photolyase, 
oxynitrilase, and isopentenyl diphosphate isomerase. 
Some flavoproteins are responsible for transferring 
electrons from dehydrogenases to the membrane- 
bound respiratory chain, including ► electron transfer 
flavoprotein, phthalate dioxygenase reductase, and 
► cytochrome P-450 reductases. A number of flavin- 
containing proteins are complex and also contain other 
types of ► electron transfer cofactors, such as ► iron- 
sulfur centers, ► hemes, and diiron centers, or can 


contain more than one flavin cofactor. Examples of 
complex flavoproteins are glutamate synthase, xan¬ 
thine oxidase and dehydrogenase, and »• nitric oxidase 
synthase. 

Cross-References 

► Cytochrome P450 - Computational Studies 

► Electron Transfer Cofactors 

► Electron Transfer Elavoproteins 

► Electron Transfer Proteins: Overview 

► Flavoenzyme Mechanisms: p-Hydroxybenzoate 
Hydroxylase and Phenol Hydroxylase - 
Computational Studies 

► Hemes 

► Iron-Sulfur Clusters 

► Mitochondrial Electron Transport 

► NADH-Ubiquinone Oxidoreductase (Complex I) 

► Nicotinamide Adenine Dinucleotide (NAD) 

► Nitric Oxide Synthase - Computational Studies 

► Oxidative Stress 

► Photosystem I 

► Pyranopterins 

► Redox Potential 

► Succinate Dehydrogenase (Complex II) 

► Thioredoxins 
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Definition 

Para-hydroxybenzoate hydroxylase (PHBH) is 
a flavoprotein catalyzing 3-hydroxylation of 
p-hydroxybenzoate. 

Phenol hydroxylase (PH) is a flavoprotein catalyz¬ 
ing ortho-hydroxylation of phenol. 


Introduction 

Para-hydroxybenzoate hydroxylase (PHBH) is one of 
the most extensively studied ► flavin-dependant 
monooxygenases, which catalyzes 3-hydroxylation of 
p-hydroxybenzoate (p-OHB) to protocatechuate 
(Entsch and Van Berkel 1995). Phenol hydroxylase 
(PH) is a related flavoprotein catalyzing a very similar 
reaction, i.e., ortho-hydroxylation of phenol. PH and 
PHBH are homologues in two branches of the 
microbial P-ketoadipate pathway involved in the 
degradation of aromatic compounds, including lignin 
and various xenobiotic aromatics. PHBH has become 
a model system for computational studies of enzyme- 
catalyzed reactions. Specifically, the rate-determining 
monooxygen transfer from the reactive C4a- 
hydroperoxyflavin intermediate to the substrate has 
been subject of numerous studies (Fig. 1). Early 
computational work related characteristics of the 



Flavoenzyme Mechanisms: p-Hydroxybenzoate Hydroxy¬ 
lase and Phenol Hydroxylase - Computational Studies, 
Fig. 1 Schematic drawing of the reaction catalyzed by PHBH. 
The gray box indicates the hydroxylation step, subject to com¬ 
putational studies 


frontier molecular orbitals of different substrates in 
gas phase to the observed enzymatic reaction rates 
(Vervoort et al. 1992). Subsequently, combined quan¬ 
tum mechanical/molecular mechanical (► QM/MM) 
models were applied to study the actual reaction path¬ 
way within the context of the enzyme active site 
(Fig. 2). The first QM/MM models of PHBH and PH 
used the ► semiempirical AMI method to treat the QM 
region of the model and were based on a single 
conformation of the protein-substrate complex, 
resulting in an adiabatic mapping of the approximate 
reaction coordinate (Ridder et al. 1998, 1999, 2000a). 
Subsequent studies on PHBH applied QM/MM 
► molecular dynamics (MD) simulations to obtain 
free-energy profiles (Billeter et al. 2001; Senn et al. 
2005; Kastner et al. 2006). In addition, higher level 
QM methods were employed to describe the QM 
region more accurately and improve the calculated 
energy barrier (Claeyssens et al. 2006; Mata 
et al. 2008; Ridder et al. 2003). 
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Flavoenzyme Mechanisms: p-Hydroxybenzoate Hydroxy¬ 
lase and Phenol Hydroxylase - Computational Studies, 
Fig. 2 The active site region in the transition state of the 
hydroxylation reaction in PHBH. The QM atoms are shown in 
sticks and interacting MM atoms are shown as lines. Dashed 
lines indicate hydrogen bonds 


Reaction Mechanism of PHBH 

The mechanism by which the C4a-hydroperoxyfavin 
reacts with /r-OHB has been a first key question in 
computational studies. Several reaction mechanisms 
had been proposed, including both homolytic and het- 
erolytic cleavage of the peroxide bond. Furthermore, it 
had been suggested that deprotonation of the hydroxyl 
moiety occurs in the catalytic site of PHBH. X-ray 
crystallography revealed that a hydrogen bond 
network able to form and stabilize the dianion and 
mutation studies showed the detrimental effect of 
distorting this network (Entsch and Van Berkel 
1995). A first theoretical study based on frontier orbital 
calculations suggested that nucleophilic reactivity of 
the substrate is a key factor for enzymatic activity and 
is induced by deprotonation of the p-hydroxy moiety 
(Vervoort et al. 1992). QM/MM methods were applied 
to model the actual hydroxylation reaction according 
to an electrophilic aromatic substitution mechanism 
involving heterolytic cleavage of the peroxide bond 
(Fig. 1). The results agreed with experimental obser¬ 
vations in several ways, supporting the modeled 


reaction mechanism. First of all, two independent 
QM/MM studies showed that without deprotonation, 
prohibitively high activation barriers are obtained 
(Billeter et al. 2001; Bidder et al. 1999). Furthermore, 
activation barriers calculated for a series of (fluorinated) 
p-hydroxybenzoate substrates correlated well with the 
experimentally observed rates. Subsequently, also 
the calculated effects of substituents on the flavin 
cofactor were shown to correlate with experimental 
rate constants (Bidder et al. 2000b). While the trends 
of the calculated barriers agreed with experiment, the 
actual barriers were overestimated, which was subject 
of further work described below. 

Catalytic Interactions in the Active Site 
of PHBH 

The role of the active site surroundings on the 
hydroxylation reaction has been the second focus of 
computational studies. The catalytic effect of the 
protein on the isolated hydroxylation step was 
estimated to be very small, based on the difference in 
the calculated energy barriers with and without the 
protein present (Bidder et al. 1999; Billeter 
et al. 2001). It indicated that the main role of the active 
site is to bring together the substrate and flavin cofactor 
in their reactive forms, i.e., the dianionic form of 
/7-OHB and C4a-hydroperoxyflavin. Several key 
active site residues involved in stabilizing this starting 
point for hydroxylation are identified (Fig. 2). The 
carboxylate group of the substrate is held in place by 
a salt bridge to Arg214 and hydrogen bonds to Ser212 
and Tyr222. The hydroxyl moiety of p-OHB is 
deprotonated and subsequently stabilized by 
a hydrogen bonding network including Tyr201 and 
Tyr385. A crystal water molecule (Wat717) can form 
a hydrogen bond to the hydroperoxide moiety of the 
activated flavin cofactor, and independent QM/MM 
studies indicated its role in stabilizing the negative 
charge that develops on the proximal oxygen as the 
hydroxylation reaction proceeds (Bidder et al. 1999; 
Senn et al. 2005). 

One specific interaction has been of particular 
interest as it appears to specifically stabilize the tran¬ 
sition state of the hydroxylation step. In the transition 
state, the distal peroxide oxygen of the cofactor is 
halfway being transferred to the substrate. At this 
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stage along the reaction coordinate, the OH group 
forms a hydrogen bond to the backbone amide oxygen 
of Pro293. Decomposition of the QM/MM energies 
yielded an estimated stabilization of the transition 
state by 2-3 kcal/mol due to this specific interaction 
(Ridder et al. 1999, 2003). QM/MM MD simulations 
(Senn et al. 2005) reproduced the above interactions 
positioning the substrate in the active site and showed 
the interaction with the amide oxygen of Pro293 to be 
relatively transient in nature. 

Computational Aspects 

Several aspects of the QM/MM approach to enzyme 
reaction modeling have been evaluated on the basis 
of the PHBH case. First of all, the level of QM 
treatment needed to accurately describe the reaction 
has been investigated in several studies. The first 
QM/MM studies, which used the semiempirical 
AMI method coupled to CHARMM molecular 
mechanics, recognized that AMI overestimates 
both the activation barrier and the exothermicity of 
the hydroxylation step (Ridder et al. 1998, 1999). 
Subsequent ah initio QM/MM studies showed that 
HF overestimates the barrier even more, while single 
point B3LYP and LMP2 calculations on the HF 
geometries result in an underestimation of the barrier 
(Ridder et al. 2003). Overall, AMI turned out to 
provide a good trade-off between accuracy and com¬ 
putational efficiency, which was the basis for other 
studies to investigate the importance of conforma¬ 
tional sampling along the reaction coordinate (Senn 
et al. 2005; Kastner et al. 2006). Senn et al. (2005) 
used an AMl/GROMOS implementation to perform 
thermodynamic integration along the hydroxylation 
coordinate, starting from different conformations of 
the PHBH model, to obtain a free-energy profile. 
They investigated several aspects of the QM/MM 
MD setup in order to obtain convergence in sampling 
and thermodynamic integration, and confirmed the 
overestimation of the barrier by AMI. Kastner et al. 
(2006) applied two other techniques to obtain free- 
energy barriers, free-energy perturbation and 
umbrella sampling, and showed that the results by 
all three techniques were consistent. In addition, 
further approximations to the free-energy perturba¬ 
tion approach were tested. For example, MD 


sampling was performed with the QM region 
replaced by simple atomic charges that reproduce 
its electrostatic field (ESP charges), which was 
shown to give comparable results at a fraction of 
the computational cost. 

The most accurate calculation of the activation bar¬ 
rier has been obtained by optimizing the transition 
states and reaction paths with B3LYP/GROMOS for 
different conformations of the model system, and sub¬ 
sequently performing single point QM/MM energies 
calculations based on a local approximation to coupled 
cluster theory (LCCSD(TO)). Corrections for zero- 
point vibrational energy and entropy, derived from 
AMI-based (free energy) calculations, resulted in an 
average free-energy barrier of 13.7 kcal/mol, which 
is in quantitative agreement with the barrier of 
14-15 kcal/mol derived from experiment (Claeyssens 
et al. 2006; Mata et al. 2008). 

Comparison with Phenol Hydroxylase 

Hydroxylation by phenol hydroxylase has been stud¬ 
ied using the same AMI-based QM/MM method that 
was used for the early PHBH model, allowing 
a detailed comparison between the two enzymatic 
reactions (Ridder et al. 2000a). A potential energy 
surface was calculated spanning the reaction coordi¬ 
nates for the hydroxylation step as well as the 
deprotonation of the hydroxyl moiety by Asp54. The 
result indicated that while the phenolic substrate binds 
to PH in its neutral state, the hydroxylation neverthe¬ 
less proceeds by initial proton transfer to Asp54 lead¬ 
ing to the formation of a relatively unstable phenolate 
intermediate, which is then hydroxylated. A strong 
correlation between the calculated barriers and the 
observed overall catalytic rate constants of a series 
of substituted phenols by PH indicate that, like in 
PHBH, the hydroxylation step is rate limiting under 
physiological conditions and proceeds via the 
modeled electrophilic aromatic substitution mecha¬ 
nism. Finally, a similar interaction with a backbone 
amide oxygen of a proline residue, Pro364, was found 
to specifically stabilize the transition state toward 
the phenol hydroxylation, suggesting this catalytic 
interaction could be one of the reasons for the conser¬ 
vation of this proline residue within the flavoprotein 
monooxygenases. 
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Synonyms 

BEST; Direct-Injection NMR (DI-NMR); Flow-Injec- 
tion-Analysis NMR (FIA-NMR); LC-NMR; LC- 
NMR/MS; Tubeless NMR; VAST 

Definition 

Flow NMR (Nuclear Magnetic Resonance) is an 
NMR-spectroscopy technique in which the samples 
flow through the rf coil. (The rf coil, where both exci¬ 
tation and detection occur, is located in the NMR 
probe, which is within the NMR magnet.) This is in 
contrast to conventional NMR spectroscopy, in which 
the samples are contained in precision glass sample 
tubes that are mechanically shuttled in and out of the rf 
coil. In flow NMR, the sample may - or may not - be 
flowing during that actual analysis. 

The classic example of a flow NMR technique is LC- 
NMR - and its related techniques such as LC-NMR/ 
MS, capillary-electrophoresis NMR (CE-NMR), and 
size-exclusion chromatography NMR (SEC-NMR). 
Other major forms of flow NMR include Direct- 
Injection NMR (DI-NMR) and Flow Injection-Analysis 
NMR (FIA-NMR). Potential advantages of flow NMR 
include rapid sample shuttling (in and out of the rf coil) 
and minimal shimming requirements (due to the fixed 
sample geometry of the static flow cell). Potential dis¬ 
advantages of flow NMR techniques include the risks of 
carryover from one sample to the next, contaminated 
flow cells, and plugged tubing. Flow NMR also requires 
the user to pay increased attention to the minimization 
of magnetic-susceptibility mismatches within the sam¬ 
ples (if high-quality data is desired). (This is also 
a requirement in tube-based NMR, but it is easier to 
satisfy there, and so most users are not readily conscious 
of its importance.) 

The LC-NMR-type methods perform sample sepa¬ 
ration and purification (with the HPLC step) and sam¬ 
ple analysis (with the NMR and MS steps). Because of 
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this, LC-NMR/MS in particular is considered one of the 
most powerful and sophisticated analytical methods 
available. It is typically used on those analytical prob¬ 
lems that cannot be solved with other methods. As such, 
it is not normally used for rapid analyses, but is instead 
used for the most challenging analyses, regardless of 
how long it takes to get the answer. (LC-NMR/MS 
experiments can easily run for 24 h or more.) In contrast, 
the other flow-NMR methods that do not contain 
a chromatography step, such as DI-NMR and FIA- 
NMR, are capable of being run much faster. These 
methods typically exploit their ability to rapidly shuttle 
samples in and out of the rf coil, and often run highly 
repetitive analyses in which the NMR experiments take 
only a few minutes per sample. 

The nucleus most typically observed in flow NMR 
is ^H, although and ^^Si have also been 

observed. Most flow-NMR experiments acquire only 
one-dimensional NMR spectra, although some also 
acquire multidimensional NMR spectra. Normally, the 
WET pulse sequence (which uses pulsed field gradients 
and often decoupling) is used to suppress unwanted 
signals from the solvents. Most NMR data is acquired 
when in stopped-flow mode, although some is 
acquired in continuous-flow mode. In LC-NMR and 
FIA-NMR, samples flow through the rf coil unidirection- 
ally; however, in DI-NMR the flow scheme is typically 
bidirectional (e.g., samples are removed from the rf coil 
by reversing the direction of flow, and pulling them out 
through the same path in which they entered the rf coil). 

Cross-References 

► Continuous Flow 

► Cryogenic NMR Probes 

► NMR in Drug Discovery - Introduction 

► Multidimensional NMR Spectroscopy 

► NMR Methods for Kinetic Analysis 

► NMR 

► Pulsed Field Gradient NMR 

► Stopped-Flow Techniques 


Flow of Electrical Charges in Biological 
Circuits 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 


Flow-Injection-Analysis NMR (FIA-NMR) 

► Flow NMR 


Fluctuation Fit 

► Molecular Recognition: Lock-and-Key, Induced Fit, 
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Synonyms 

Fluorescence resonant energy transfer (FRET); Lipid 
bilayer 

Definition 

Fluorescence methodologies are based on emission 
of light from a substance in an electronically excited 
singlet state. Forster Resonance Energy Transfer 
(FRET) denotes a photophysical process upon which 
the excitation energy of a molecule, the donor, is 
transferred to another, the acceptor, without emission 
of light from the former or collision between the two 
species. 








F 


780 


Fluorescence and FRET in Membranes 


Introduction 

The molecular level organization and structure/func¬ 
tion relationships in membrane systems pose challeng¬ 
ing problems that require a variety of experimental 
approaches for their study. Among these, and due to 
its sensitivity and versatility, ► fluorescence came to 
the fore. The timescale of fluorescence (sub-ns 
resolution) is adequate to follow the kinetics of both 
fast (fluorophore rotation, conformational changes) 
and relatively slow (translocation, hindered diffusion) 
processes. Its intrinsic sensitivity is almost unrivalled, 
and is fully exploited in techniques such as fluores¬ 
cence correlation spectroscopy and single-fluorophore 
imaging. Perhaps most importantly, fluorescence 
spectroscopy is highly versatile, as different 
approaches (anisotropy, quenching, Forster resonance 
energy transfer [FRET] ) allow studying such diverse 
problems of relevance in membrane biophysics as 
extent of partition to the bilayer, location, rotational 
and translational dynamics, distribution and aggrega¬ 
tion state of membrane components, and membrane- 
active molecules. Fluorescence microscopy adds 
spatial resolution to the canon, and also because of 
the nondestructive nature of fluorescence techniques, 
is widely used in the study of live cells. 

The subject of this entry is the application of 
fluorescence techniques (with an emphasis on FRET) 
in membrane biophysics, focusing on the lipid bilayer 
and its lateral organization (domain structure). Out of 
scope of this entry is the use of fluorescence in the 
context of lipid/protein or lipid/peptide interaction, 
regarding which the reader is referred to the literature 
(e.g., Mateo et al. 2006; see also the entry on “► Mem¬ 
brane Proteins: Structure and Organization”). 

Fluorescent Membrane Probes 

The basic structural unit of biological membranes is 
the phospholipid bilayer, and because the vast majority 
of naturally occurring membrane lipids are 
nonfluorescent, extrinsic membrane probes are widely 
used (see Pig. 1 for structures of common fluorescent 
membrane probes). Design fluorophores often possess 
convenient photophysical properties, such as absorp¬ 
tion and emission in the visible range and high molar 
absorption coefficient and fluorescence quantum yield, 
which is especially convenient for microscopy and 


single-molecule techniques. Some fluorophores are 
known for their simple decay kinetics, stability, and 
invariance of fluorescence parameters, while others are 
notable for their sensitivity to the local environment. 
Taking into account the purpose of a given experiment, 
one of these types of molecules will be most appropri¬ 
ate. However, it must be noted that membrane probes 
can induce perturbation in the host lipid structure and 
dynamics, which can be minimized using very low 
concentrations, or studied using methodologies such 
as molecular dynamics simulations (Loura and Prates 
Ramalho 2011). 

1,6-diphenylhexatriene (DPH) is a rod-shaped 
molecule whose fluorescence quantum yield and inten¬ 
sity decay show little sensitivity to the lipid phase, 
unlike the fluorescence anisotropy, which decreases 
threefold upon melting of the lipid acyl chains. As 
a consequence, DPH and its derivatives are commonly 
used as probes of membrane fluidity (Lentz 1989). 
Pyrene is a polycyclic aromatic hydrocarbon notable 
for its spectroscopic properties, namely, the unusually 
long fluorescence lifetime (>100 ns in a variety of 
solvents and membrane systems) and ability to form 
excimers. Due to these distinctive features, both free 
pyrene and pyrene-labeled lipids have been used in 
membrane biophysics studies (Somerharju 2002). 
6-dodecanoyl-2-dimethylamino naphthalene (laurdan) 
shows spectral properties very sensitive to the physical 
state of the surrounding phospholipids (namely, 
polarity) and has been used in numerous occasions to 
detect lipid domains (Parasassi et al. 1994) and to 
determine the physical state of lipids surrounding 
proteins. The all-trans isomer of parinaric acid 
(f-PnA) is unique for its preference for the rigid gel 
rather than the fluid disordered phase (unlike 
the cis/transitrans/cis isomer, c-PnA, which distributes 
almost equally), as well as for the sensitivity of its 
fluorescence to local density, increasing markedly in 
rigid media (Sklar et al. 1979). 

Prom the 1970s on, lipids labeled with design 
fluorophores, including 7-nitrobenz-2-oxa-l,3- 
diazol-4-yl (NBD) and 4,4-difluoro-4-bora-3a,4a- 
diaza-s-indacene (BODIPY) became commercially 
available. These fluorescent analogs include acyl 
chain-labeled phospholipids of several lipid classes 
(most commonly phosphatidylcholine, PC), head- 
labeled phosphatidylethanolamine (PE), and steroids. 
NBD derivatives are available for all major phospho¬ 
lipid classes, and have been used extensively as 
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Fluorescence and FRET in Membranes, Fig. 1 Membrane 
Biophysics Lipids. Structures of common membrane fluo¬ 
rescent probes, (a) DPH; (b) pyrene; (c) laurdan; (d) r-PnA; 


(e) NBD-labeled PC derivative; (f) Dil; (g) BODIPY-labeled 
cholesterol derivative; (h) DHE 


fluorescent analogs of native lipids in biological and 
model membranes to study a variety of processes 
(Chattopadhyay 1990). BODIPY membrane probes 
are very popular in cell biology and biophysics, due 
to their desirable and unusual photophysical properties 
(environmental insensitivity, single exponential decay, 
high fluorescence quantum yield, photostability, and 
molar absorption). Rhodamine-based fluorophores 
(e.g., Lisssamine-rhodamine B, Texas Red), with 
high-wavelength absorption and emission, are used as 
phospholipid headgroup labels. Carbocyanines such as 
Dil have very high molar absorption coefficients 


(^10^ cm“*) and are used as membrane potential 

probes and as stains in cell studies. They have also 
been extensively used as probes of bilayer dynamics 
and structure (Wolf 1988). 

Because of the importance of cholesterol as a major 
component of mammalian cell membranes and its role 
as a regulator of the physical properties of the 
membrane, as well as its implication in the formation 
of membrane microdomains such as lipid rafts and 
caveolae, cholesterol fluorescent analogs are 
a particularly important class of membrane probes 
(reviewed in Wiistner 2007). Structurally, the closest 
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fluorescent sterol to cholesterol is cholestatrienol, 
which differs only in having two additional double 
bonds in the ring system. Despite possessing an addi¬ 
tional double bond and methyl group in the side chain, 
dehydroergosterol (DHE), a natural sterol from yeast, 
behaves very similarly to cholesterol and is a useful 
reporter in model and biological membranes, lipid 
rafts/caveolae, and real-time imaging in living cells 
using multiphoton excitation. Fluorescent sterols 
which can be conveniently excited and detected in 
the visible range have extrinsic fluorophores, such as 
22- and 25-NBD-cholesterol and cholesteryl esters of 
BODIPY fatty acids. 


Partition to the Membrane 

The extent of the partition of an amphiphilic solute 
between the lipid and water phases is often the first step 
in the study of its interaction with membranes. It is 
usually quantified by the partition coefficient, K^, 
which is the ratio between the solute concentration in 
the two phases. If the species under study is fluores¬ 
cent, and if a parameter such as fluorescence quantum 
yield, anisotropy or lifetime has distinct values in the 
two media, A'p can be obtained by analyzing the 
variation of this parameter with increasing lipid 
concentration, for constant amount of solute 
(Santos et al. 2003). Similar formalisms can be 
employed to recover the partition coefficient between 
two coexisting phases within the bilayer (e.g., Silvius 
2005). 

FRET 

Forster resonance energy transfer (FRFT) is 
a photophysical phenomenon upon which electronic 
energy of a molecule, the donor, is transferred to 
another, the acceptor. This latter molecule can be 
either identical to the donor, or a different species, 
resulting in homotransfer (energy migration, 
homo-FRFT) or heterotransfer (hetero-FRFT). As 
a result of hetero-FRFT, the donor fluorescence is 
quenched, and the acceptor becomes excited (and 
may fluoresce). The rate of the FRFT interaction 
depends on the inverse sixth power of the donor- 
acceptor distance, and its range is characterized by 
the Forster radius (Rq) of the donor/acceptor pair 
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Fluorescence and FRET in Membranes, Fig. 2 Membrane 
Biophysics Lipids. Schematic representation of the effect of 
phase separation and probe partition on the FRET efficiency. 
D and A molecules are represented by green and red dots, 
respectively. SP-FRET steady-state probe-partitioning FRET 
(Reprinted figure with permission from Buboltz (2007) Phys 
Rev E 76:021903, http://pre.aps.org/abstract/PRE/v76/i2/ 
e021903. Copyright 2007 by the American Physical Society) 


under study. In practice, FRFT is operative in the 
1-10 nm range. These features constitute the basis for 
its widespread use in diverse problems in membrane 
biophysics involving lipid/lipid or lipid/protein inter¬ 
actions. Most pertinent to this entry is the application 
of FRFT in the detection and characterization of lateral 
nanoheterogeneity (reviewed in Loura et al. 2010). The 
kinetics of FRFT in a situation where each donor is 
surrounded by a distribution of acceptors is complex, 
and analytical solutions can be derived for specific 
geometries, including those most relevant in one-phase 
bilayers (distribution of probes in a plane or in two 
parallel planes). For biphasic systems, analytical 
solutions can still be derived if the coexisting domains 
are infinitely large in the FRFT scale (typically 
>10 Rq, or ^50 nm). These formalisms take into 
account the differential partition of donor and acceptor 
probes between the two phases. If these probes have 
identical preference for one of the phases, phase sepa¬ 
ration leads to an enhancement in FRFT efficiency, 
and conversely if donor and acceptor prefer distinct 
phases (Fig. 2). Of course, in the limit of very small 
clusters {<Ro), the measured FRFT signal becomes 
indistinguishable from the one-phase scenario. For 
the most interesting intermediate range, no exact 
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analytical solutions have been derived, and only 
numerical or approximate solutions have been 
presented. Several methods for estimating domain 
sizes and/or obtaining phase diagrams of two- and 
three- component mixtures using FRET have now 
been published and are critically reviewed by Loura 
et al. (2010). 

Other Fluorescence Methodologies 

Fluorescence anisotropy (r) is a measure of the degree 
of reorientation experienced by a fluorophore during 
its excited state lifetime. It can be measured both in 
steady-state and time-resolved conditions, by monitor¬ 
ing the intensity of the emission components with 
polarization parallel and perpendicular to that of the 
excitation radiation. Absence of rotational diffusion of 
fluorophores randomly oriented in the macroscopic 
scale leads to r = 0.40 (in case of coincidence of the 
transition moments for absorption and emission), 
whereas /• = 0 denotes complete reorientation. 
Fluorescent membrane probes are commonly used in 
anisotropy experiments to determine membrane phase 
transitions (e.g., typical values of steady-state anisot¬ 
ropies of DPFI are ^0.3 for gel phase, ^0.2 for liquid 
ordered phase, and ^0.1 for liquid disordered phase) 
and physical properties of membranes such as order 
parameter and “microviscosity,” as well as for 
detecting alterations of these parameters induced 
upon interaction of the membrane with molecules of 
interest. The anisotropy decay parameters contain 
information about the shape, size, flexibility, and loca¬ 
tion of the fluorophore or of the bounded macromole¬ 
cule, and/or about the rigidity of the molecular 
environment where the fluorophore resides (e.g., 
a lipid membrane). To extract this information, it is 
necessary to propose a rotation model for the 
fluorophore and compare the exponential terms 
corresponding to this model with those of the 
experimental anisotropy decay. For a detailed discus¬ 
sion on fluorescence anisotropy and its application to 
membrane systems, see Lakowicz (2006). 

Fluorescence quenching can have two origins: 
long-lived association of fluorophore and quencher, 
resulting in quenching of the emission of the former 
(so-called static quenching), and collision-dependent 
quenching. The static component of quenching can be 
analyzed using sphere-of-action or complexation 


formalisms (Lakowicz 2006) whereas the dynamic 
component is analyzed with a Stern-Volmer-type treat¬ 
ment, allowing the recovery of the fluorophore 
diffusion coefficient. The applications of quenching 
to the detection of lipid domains were reviewed by 
Silvius and Nabi (2006). A typical experimental 
setup consists of preparing mixtures of cholesterol 
and two polar lipids, one of which (a spin-labeled or 
brominated lipid) bears a quencher group, as well as 
a small amount of fluorescent probe. The sterol content 
may be fixed for the different samples, whereas the 
relative proportions of the quencher and non-quencher 
polar lipids are varied. In the absence of membrane 
heterogeneity, the probe fluorescence intensity or 
lifetime decreases in a smooth, monotonic manner 
upon varying composition by increasing quencher 
lipid content. If domains are present over a particular 
range of compositions, the quenching curve will 
deviate from that expected for homogeneous mixing 
(i.e., present kinks) in the corresponding range. 
Another application of fluorescence quenching is the 
determination of the transverse location of 
a fluorophore within the bilayer. For this purpose, 
several methods have been proposed (see London and 
Ladokhin 2002 for a presentation and critical compar¬ 
ison of the two most popular approaches). 

► Fluorescence microscopy and ► single molecule 
imaging techniques have presented a tremendous 
development over the recent years, and their applica¬ 
tions to the study of model and biological membranes 
have become widespread. Conventional fluorescence 
imaging techniques, including fluorescence lifetime 
imaging spectroscopy or FLIM (especially useful 
when coupled with FRET), although widely used 
(see Stockl and Herrmann 2010 for a review), are 
generally restricted by the diffraction-limited optical 
resolution of the light microscope. In practice, this 
prevents detection of microdomains smaller than 
about 250 nm, a value much larger than the size 
believed for lipid rafts (^20 nm). Several techniques 
developed to circumvent this limitation have come to 
the fore recently, such as near-field scanning optical 
microscopy (see Dickenson et al. 2010 for a review), 
stimulated emission depletion microscopy, and related 
“super-resolution microscopy” approaches (see Owen 
et al. 2010 for a review). Single molecule techniques 
based on diffusion such as single particle/molecule 
tracking (SPT) and fluorescence correlation spectros¬ 
copy (ECS) have been used to detect transient 
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confinement zones identified as domains. The sophis¬ 
ticated physical models employed in analysis of PCS 
and SPT data, especially when coupled with the novel 
super-resolution microscopy techniques, offer exciting 
possibilities regarding accurate characterization of 
membrane organization in live cells. 

Summary 

Due to their sensitivity and versatility, the use of 
fluorescence techniques in membrane biophysics is 
widespread, namely, on the characterization of lipid 
organization within the bilayer. Because membrane 
lipids are nonfluorescent, extrinsic membrane probes 
are widely used. The fundamentals and applications of 
the most important fluorescence methodologies (FRET, 
anisotropy, quenching, microscopy) are addressed. 


Cross-References 

► Live-Cell Single-Molecule Imaging 

► Membrane Proteins: Structure and Organization 

► Optical Fluorescence Microscopy 

► Single-Molecule Methods 
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Fluorescence Correlation 
Spectroscopy, 

Fig. 1 Schematic diagram of 
a typical confocal FCS setup 
with the different steps 
involved in autocorrelation 
analysis, (a) The confocal FCS 
instrumentation allows to 
observe fluorescence in a very 
small observation volume. 

(b) The dotted line represents 
the observation volume, the 
solid lines the diffusing path of 
a fluorescent particle causing 
fluctuations in fluorescence 
intensity, (c) The observed 
time course of fluorescence 
intensity fluctuates around the 
average due to Brownian 
motion and other processes, 
(d) The fluctuations are 
analyzed by autocorrelation of 
the data {green) and fitted with 
a theoretical autocorrelation 
function {black) 
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Dichroic mirror 
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Definition 

Fluorescence correlation spectroscopy (FCS) is a bio¬ 
physical single molecule technique, which measures 
properties such as diffusion coefficients, concentra¬ 
tions, and photophysical characteristics of fluorescent 
particles. 

Introduction 

In ► fluorescence correlation spectroscopy, fluctua¬ 
tions in fluorescence intensity in a very small volume 
are used to measure molecular properties. The basic 
principle of the method relies on the observation and 
subsequent mathematical analysis of these fluctua¬ 
tions. Fluorescence intensity fluctuations can arise 
from the local concentration variations caused by 
Brownian motion in and out of the observation 
volume. The mathematical analysis is performed by 
calculating the so-called temporal autocorrelation 
function from the recorded fluorescence signal. The 
experimentally determined autocorrelation function 
can be fitted with a suitable model function to obtain 


the sample properties, such as ► diffusion coefficients, 
flow velocities, as well as concentrations. Furthermore, 
FCS can quantify rates of intra- and intermolecular 
reactions, photophysical properties, and oligomeriza¬ 
tion states. FCS differs from other fluorescence tech¬ 
niques, where typically the bulk fluorescence is used to 
obtain information about a sample, and fluctuations 
merely represent a disturbing source of noise. 

The most widespread experimental realization of 
FCS is achieved by extension of a confocal microscope 
with a very sensitive detector and a correlator (Fig. 1). 
However, FCS has also been combined with other 
fluorescence microscopy techniques with a small 
detection volume, such as total internal reflection 
(TIR) or stimulated emission depletion microscopy 
(STED). For further reading, see, for example, Rigler 
and Elson (2001), Thompson (2002), Haustein and 
Schwille (2007), and Petrov and Schwille (2008). 

Historical Outline 

The principle of ECS was first published in 1972 
(Magde et al. 1972) as conceptual analogue of 
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► dynamic light scattering (DLS). In the early 1990s, 
significant improvements were made by implementing 
PCS on a confocal fluorescence microscopy setup in 
combination with a strongly focused laser as an exci¬ 
tation source and a highly sensitive detector (Rigler 
et al. 1993). This vastly improved the signal-to-noise 
ratio and decreased the required measurement time. 
Furthermore, several authors pointed out the huge 
potential of the method, providing access to single 
molecules. From the mid-1990s on, publications 
using FCS increased exponentially (Petrov and 
Schwille 2008). Nowadays, FCS setups are commer¬ 
cially available and the technique is widely used in 
biophysics, molecular biology, and biochemistry. 

Fluctuations of Fluorescence and 
Autocorrelation Analysis 

In thermal equilibrium, the observed fluorescence 
intensity F{t) in a small volume fluctuates around the 
temporal average {F{t)), hence: 

F{t) = {F{t))+SF{t) 

These fluorescence fluctuations 5F{t) result from 
stochastic processes such as Brownian motion (diffu¬ 
sion) in and out of the observation volume. In general, 
all dynamic processes that alter the fluorescence inten¬ 
sity cause fluorescence fluctuations. For example, like 
in the case of green fluorescent protein, intramolecular 
dynamics constantly switch the fluorophore between 
states of different spectral properties, generating fluc¬ 
tuations in fluorescence. Figure 2 shows a few impor¬ 
tant processes resulting in fluorescence intensity 
fluctuations in an observation volume. But how can 
fluctuations in fluorescence be used to obtain informa¬ 
tion about their source? In case of a highly diluted 
solution of freely diffusing molecules, most of the 
time the detector will record only background noise, 
but whenever a molecule enters the observation 
volume, the fluorescence intensity increases for 
a short time, causing a burst of fluorescence. A large 
and therefore more slowly diffusing molecule remains 
for a longer time in the observation volume compared 
to a small and faster diffusing molecule. On average, 
this causes a longer burst in the fluorescence signal 
compared to the fast diffusing molecule. To efficiently 
compile the average decay time of bursts, the temporal 


\ 



Fluorescence Correlation Spectroscopy, Fig. 2 Different 
processes causing fluorescence fluctuations in the observation 
volume. These processes include diffusion, flow, or transport 
and conversion between fluorescent and nonfluorescent states 
(e.g., singlet to triplet transition and protonation-induced 
blinking) 

autocorrelation function G(t) is calculated from the 
recorded fluorescence fluctuations SF{t) (Fig. Ic, d): 

The autocorrelation yields direct access to the typ¬ 
ical persistence time of the fluorescence fluctuations. 
Parameters related to the source of the fluctuations can 
be obtained from tilting this autocorrelation function 
with a theoretical model. 

Depending on the experimental situation, 
a multitude of theoretical autocorrelation functions 
are available. For three-dimensional diffusion of one 
species, the function is: 



In this equation, the amplitude is inversely propor¬ 
tional to the average particle number in the detection 
volume, which is obtained from the fit G 3 j( 0 ) = 

The parameter, S, is the so-called structural parameter. 
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Fluorescence Correlation Spectroscopy, Fig. 3 Examples 
of free diffusion autocorrelation curves showing the dependency 
on N and Tjj. Increase in N shows a decrease in amplitude, 
whereas an increase in Tq shifts the autocorrelation curve toward 
longer timescales 

It is related to the axial and lateral dimensions of the 
observation volume, zq and rg, respectively, by the 
following equation: 


additional multiplicative term, which contains the 
fraction of molecules in the dark state T and the char¬ 
acteristic time describing the timescale of the 

transition; 

\ — T + Te 

G^d,BUnk[V = --J- j -'G3j(t) 

A frequently used case is two-dimensional diffusion 
in membranes. With the membrane perpendicular to 
the optical axis of the microscope, the autocorrelation 
function is given by: 



Another important parameter in FCS is the molec¬ 
ular brightness p, also referred to counts per molecule. 
It is defined by the ratio of the fluorescence 
signal (F’(r)) and the particle number; 


S = 


zo 

''0 


V = ■ 


im 

(A) 


(F(t))G(0) 


The third parameter is Zq, the diffusion time, 
describing the time to laterally diffuse through 
the observation volume. In the autocorrelation func¬ 
tion, it determines the decay time, with 
~ ^Gid(P) (see Fig. 3). The diffusion time is 
related to the diffusion coefficient by the equation: 


This can be used to calibrate a FCS setup by using 
a dye of known diffusion coefficient. Finally, the effec¬ 
tive detection volume is given by Vgff = Tt^/^zorg, 
which allows one to calculate the average concentra¬ 
tion of fluorophores in the observation volume as 
(C) = {N)/V,ff. 

For many fluorophores, the fluorescence properties 
change between a fluorescent and a nonfluorescent 
state, while moving through the observation volume. 
This so-called blinking can be caused by a transition 
from a fluorescent singlet to a nonfluorescent triplet 
state. In the autocorrelation curve, this is represented 
by an additional decay at short timescales. In this 
case, the autocorrelation function is modified by an 


Since molecular brightness represents the average 
fluorescence signal per molecule, it is sensitive to 
oligomerization reactions. Furthermore, it is used 
to characterize fluorescent dyes under certain experi¬ 
mental conditions. 

Experimental Realization 

A typical confocal setup of FCS is illustrated in Fig. 1. 
A fluorescent sample, for example, a fluorescent pro¬ 
tein, is excited by a laser beam of a suitable wave¬ 
length, which is focused onto the sample through an 
objective (Fig. la). The emitted fluorescence is sepa¬ 
rated from the excitation light by a dichroic mirror, 
a wavelength-dependent mirror, which is reflective 
for a certain wavelength range and transmissive for 
another. A pinhole (a small aperture) in the image 
plane excludes all fluorescence in axial direction, 
which is not generated close to the focus of the objec¬ 
tive. Consequently, only fluorescence from a very 
small volume (dashed line in Fig. lb) reaches the 
detector, which can be considered as the product of 
the excitation and detection volume. Typical 
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dimensions of this volume are 200 nm x 200 nm x 
1 pm, which is below 0.1 fL. The exact size depends on 
the objective, emission, and excitation wavelengths 
and the refractive index of the medium used. However, 
it must be noted that the edges of the volume are not 
sharp, but run out smoothly. The fluorescence signal 
from this volume is detected by a sensitive detector, 
most often an avalanche photo diode (APD) detector. 
Figure Ic shows the initial fluorescence signal of an 
FCS experiment, where the fluorescence counts F{t) 
per second are displayed as a photon detection 
frequency versus time. This signal is then used to 
calculate its autocorrelation function shown in 
Fig. Id, either by a specialized hardware correlator or 
by a fast computer. 

Alternative FCS Schemes 

An increasing number of FCS schemes have been 
developed. Here, a few variants are outlined. 

In two-photon excitation (TPE), two lower energy 
photons are simultaneously absorbed to excite the 
fluorophore to its excited state. TPE-FCS does not 
require a pinhole for axial sectioning, since the proba¬ 
bility to absorb two photons is proportional to the 
square of the intensity, which is only given in a small 
volume in the focal spot. An advantage of TPE is the 
reduced cumulative ► photobleaching. Additionally, 
scattering and absorption of the excitation light is 
reduced due to the relatively high wavelengths used 
(typically red or infrared), and multicolor applications 
can be more easily implemented. Therefore, TPE-ECS 
is especially suitable for ECS on complex samples like 
cells and tissues. A drawback is the expensive instru¬ 
mentation, which requires pulsed lasers. 

Total internal reflection fluorescence (TIRE) is an 
important method to dramatically conflne the detection 
volume in axial direction and, therefore, can be com¬ 
bined with ECS. In TIRE, the sample is illuminated at an 
angle of incidence larger than the critical angle, so that 
the incident beam reflects back into glass and generates 
an evanescent wave at the coverslip-sample interface. 
The evanescent wave has maximum intensity at the 
surface and exponentially decays inside the sample. 
Thus, fluorescence is only generated close to the surface 
(up to ^100 nm). TIRE-ECS is a useful tool to study 
surface-binding reactions, as well as diffusion close to 
surfaces, for example, in membranes of adherent cells. 


Due to the different shape of the observation volume, the 
model functions differ from confocal ECS. Eor further 
reading, see, for example, Thompson and Pero (2005). 

Stimulated emission depletion (STED) is another 
fluorescence microscopy technique with sub¬ 
diffraction resolution. The high resolution is achieved 
by controlled de-excitation of the fluorescence in a 
toroidal region around the focal spot. As a conse¬ 
quence, the fluorescence signal is obtained from 
a spot smaller than the conventional diffraction limit. 
Recently, the combination of STED with ECS was 
demonstrated (Kastrup et al. 2005). 

Scanning FCS (SFCS) is a set of methods, where the 
detection volume is moved in the sample. This can be 
useful to study slow diffusion, to avoid photobleaching, 
or to compensate for membrane movements and 
refractive index variations distorting the optical path. 
SFCS implementations include linear, circular, and 
raster scanning across the sample. For an overview, 
see Petrasek et al. (2010). 

Cross-References 

► Dual-Beam Fluorescence Cross-Correlation 
Spectroscopy 

► Dynamic Light Scattering 

► Fluorescence 

► Lipid Lateral Diffusion 

► Single Eluorophores Photobleaching 
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diffusion has been probably always the most common 
and best established application of PCS. The spread of 
PCS is closely connected with the advances in instru¬ 
mentation, especially in confocal microscopes, which 
are the most common instruments for PCS. Therefore, 
combination of PCS with confocal imaging is straight¬ 
forward and widespread, above all in biological 
applications of PCS. Those have been following the 
general trend in current biophysics of a shift from 
in vitro to in vivo studies. 
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Synonyms 

Pluorescence fluctuation spectroscopy; Lipid bilayer 


Definition 

Pluorescence correlation spectroscopy (PCS) is 
a method using statistical analysis of fluctuations of 
fluorescence intensity to extract information on con¬ 
centration and mobility of fluorescent molecules or on 
kinetics of chemical reactions. 


Principles of FCS 

The standard PCS method is based on statistical anal¬ 
ysis of temporal fluctuations of fluorescence intensity 
originating from a very small detection volume (1 pm^ 
or smaller) within the sample. The principle of 
the method is schematically depicted in Pig. 1. In the 
most typical experimental setup, the detection volume 
is deflned by the intensity profile of a laser beam in 
the focus of a confocal or two-photon microscope. 
It means that the shape of the detection volume 
can be described by the point spread function of 
the microscope and it is usually approximated by 
a three-dimensional Gaussian profile (Enderlein et al. 
2005). The autocorrelation function G (t) defined by 
(1) is calculated from the time-trace of fluorescence 
intensity I (t) either by a hardware correlator or by 
software processing; the latter being more versatile 
and preferred nowadays (Wohland et al. 2001). 


G(t) 


m /(^+t)) 

mf 


( 1 ) 


Introduction 

Eluorescence correlation spectroscopy was developed 
in the 1970s (Elson and Magde 1974) and since then 
has become a widespread method for measuring trans¬ 
lational and rotational diffusion coefficients or flow 
velocities of fluorophores or fluorescently labeled 
molecules, kinetic rate constants of chemical reactions, 
critical micelle concentrations, and other quantities, 
which can be related to fluctuations in observed 
fluorescence intensity (Thompson 1991). Neverthe¬ 
less, characterization of molecular translational 


The shape of the autocorrelation function G (t) 
contains information on the timescale of the processes 
underlying the intensity fluctuations. It rises steeply to 
its maximum value on nanosecond timescale; the 
effect (photon antibunching) is related to the minimal 
time delay between two photons emitted by a single 
fluorophore and, thus, to the lifetime of fluorescence 
and number of independent fluorophores within the 
detection volume. Decay of G (t) on microsecond 
timescale results from fluctuations caused mainly by 
photophysical and photochemical processes (such as 
intersystem crossing to nonfluorescent triplet state) 
and rotational diffusion of molecules (in the case of 
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Fluorescence Correlation Spectroscopy of Lipids, 
Fig. 1 The basic principles of PCS. Fluorescence is collected 
from a small detection volume defined by microscope focus; the 
observed fluctuations of its intensity are typically caused by diffu¬ 
sion or flow of fluorescent particles or by chemical reactions 
accompanied by changes in brightness of the fluorophore (a). 
In the case of an aqueous solution of a fluorophore, the fluctuations 
on ms timescale result from diffusion of fluorophore molecules in 


polarized detection). Decay on longer timescales is 
related to translational diffusion of molecules in and 
out of the detection volume (which is the process most 
frequently addressed by FCS) or chemical reactions 
associated with a change in brightness (photon count- 
rate per molecule) of the fluorophore. If more 
processes happen on the same timescale, their contri¬ 
butions to G (t) are not easy to separate. Timescales 
shorter than microseconds are not accessible in most 


and out of the detection volume (b). For a short lag time t, the 
probability is high that a molecule is in the detection volume at 
time t+T if it was there at time t. That results in a high value of the 
autocorrelation G (t). For higher values of t, the fluctuations 
become random and G (t) approaches its asymptotic value G 
(oo). The characteristic diffusion time Tq a molecule spends dif¬ 
fusing through the detection volume corresponds to the value of G 
(t) equal to the average of its maximum G (0) and minimum G (cxs) 


FCS experiments and therefore only the decay of G (t) 
is analyzed (Enderlein et al. 2004). 

The information on the source of fluctuations is 
extracted by fitting the obtained G (t) with 
a theoretically derived model. Equation 2 gives the 
model function for the case of a free diffusion 
through a Gaussian detection volume of characteris¬ 
tic dimensions in the focal plane and along the opti¬ 
cal axis coo and co^, respectively (defined as the 
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distance from the center of the volume, at which the 
excitation intensity decreases to e“^ times its maxi¬ 
mal value). T£) (diffusion time) is the characteristic 
time a fluorophore spends diffusing through the 
detection volume and N (particle number) is the 
average number of independently moving fluorescent 
particles in the detection volume (molecules or 
supramolecular structures). In cases of macromole¬ 
cules larger than the detection volume, N may be 
higher than the number of molecules, because molec¬ 
ular segments rather than whole molecules are mov¬ 
ing in and out of the detection volume as independent 
particles (Adjimatera et al. 2008). Note that the max¬ 
imal value of autocorrelation function G (0) is 
inversely proportional to N. 


GW = 1+77- 


N 1 + ( t / td ) 1 + (T/Tfl)(coo/(Uz)‘ 


( 2 ) 


If the sample contains M populations of fluorescent 
species having differing diffusion times T/j, a more 
general model (3) is needed. The contribution of each 
population is weighted by its fraction on the overall 
population f, and the brightness Q, of the respective 
species (Thompson 1991). 


E (G/) 
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1 + (t/to/) Y 1 + (r/TaWcLio/fflz)^ 


Diffusion time is approximately proportional to the 
cube root of molecular mass; therefore, contributions 
of fluorescent species differing only slightly in molec¬ 
ular mass are usually not discernible in G (t). In cases 
where fluorescent species differ more significantly 
in brightness than in (such as in the cases of 
dimerization of fluorescently labeled molecules), pho¬ 
ton counting histograms (PCHs) may complement 
PCS in determination of concentrations of individual 
species from fluorescence intensity time-traces. The 
method analyzes histograms of numbers of detected 
photons within a certain integration time (Chen et al. 
1999). 


To account for intersystem crossing, an average 
fraction of fluorophores in triplet state T and 
intersystem crossing relaxation time Zj- are included 
in model (4) (Widengren et al. 1995): 


G(t) = 1 + [1-T + T exp(-T/T 7 -)] 

1 

l + (7/TD)(mo/fflz)^ 

If diffusion of the molecules of interest is restricted 
to two dimensions (such as is in the cases of biological 
membranes or molecular layers on interfaces) the 
detection volume is reduced to a detection area. The 
model function (5) is a simplified version of function 
(2) and describes free diffusion through a two- 
dimensional Gaussian intensity profile (Machan and 
Hof 2010b). More general models can be obtained by 
analogous modifications of functions (3) and (4). 





G2d(t) = 1 + ^ ■ 


N I + {zfzo) 


(5) 


Diffusion of molecules in more complexly struc¬ 
tured environments such as membranes of living cells 
where lipid rafts, membrane proteins, and cytoskeleton 
influence mobility of molecules, cannot be always 
described by simple models for free diffusion. To 
account for deviations from free diffusion described 
in terms of anomalous diffusion, an anomalous expo¬ 
nent a is included in model (6) derived from model (5) 
for diffusion in a planar system. Models for other 
situations (transition to triplet state, diffusion in three 
dimensions or more species differing in diffusion time) 
are derived analogously from (2), (3) and (4) 
(Vercammen et al. 2008). 


-ranomal 

^2D 


^^Nl + iz/zoT 


( 6 ) 


The physically relevant characteristic of diffusion, 
the diffusion coefficient D, is related to the Zo and the 
lateral dimension of the detection volume coq though 
a simple relation (7). If the size of the detection volume 
coq is calibrated by an PCS measurement in a solution 
of a reference fluorophore with known diffusion coef¬ 
ficient, diffusion coefficients of other fluorescent 
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species can be determined by FCS. A similar relation 
connects particle number N with concentration c of the 
fluorescent species; the effective detection volume 
taking into account the inhomogeneous distribution 
of excitation intensity and detection efficiency has to 
be used in that case (Chen et al. 1999). 


Accuracy and Artifacts in FCS 

The estimation of accuracy of parameters measured 
by FCS is rather complicated due to their highly 
nonlinear relation to the measured intensity fluctua¬ 
tions; several methods have been developed to deal 
with this problem. The signal-to-noise ratio in FCS is 
the highest when there is on average only one mol¬ 
ecule in the detection volume. That fact limits the 
range of concentrations of fluorescent particles suit¬ 
able for FCS to the nM range (if the usual confocal 
setup is used), that means between the concentration 
ranges applicable for ► single particle tracking and 
► fluorescence recovery after photobleaching 
(Machan and Hof 2010a). Higher concentrations are 
suited for special experimental setups with smaller 
detection volumes. The detection volume in these 
setups is reduced either by confinement of the sample 
to special nanostructures such as ► zero-mode wave¬ 
guides or by reducing the size of the focus of the 
microscope in two-photon microscopy and modern 
techniques reaching sub-diffraction resolution (Blom 
et al. 2006). 

The excitation intensity in FCS should be kept 
rather low to avoid photobleaching and optical satura¬ 
tion (resulting in loss of proportionality between exci¬ 
tation and fluorescence intensity) within the detection 
volume. In a usual FCS experiment, the excitation 
intensity should be much lower than 30 kW cm“^ 
(Enderlein et al. 2005). Investigation of slow processes 
is especially prone to photobleaching-related artefacts, 
because the fluorophores reside longer within the 
detection volume and, furthermore, the overall mea¬ 
surements are longer to reach sufficient statistical 
accuracy (10^-10^ times longer than the timescale of 
the process under investigation). Optical saturation and 
other sources of artefacts in FCS (refractive index 


mismatch, optical path misalignment) cause deviations 
of the actual detection volume shape from the assumed 
one. The influence of the deviations on the FCS param¬ 
eters has been extensively studied by experiments and 
simulations (Enderlein et al. 2004). 

The detection volume calibration is another impor¬ 
tant source of artefacts in ECS mainly due to the fact 
that the sample and the reference solution very often 
differ in refractive index (and, thus, the actual detec¬ 
tion volume). An additional positioning problem arises 
in the case of planar samples negligible in thickness 
(^ nm) when compared to the axial dimension of the 
detection volume (^ pm). A slight axial displacement 
of the planar sample from the focal plane of the micro¬ 
scope results in an increase of the actual detection area 
and, thus, of and N (see Eig. 2a). Variations in axial 
position of the sample between individual measure¬ 
ments would, therefore, increase significantly the 
error in determination of diffusion coefficient and con¬ 
centration by FCS (Milon et al. 2003). That was an 
important motivation for the development of several 
calibration-free FCS techniques. 

Calibration-Free FCS 

These techniques are avoiding the need for an external 
calibration by using a well-defined distance measure 
intrinsically present in each measurement. The accu¬ 
racy and stability of this intrinsic measure are of 
crucial importance. For example ► 2-focus FCS mea¬ 
sures fluorescence intensity in two closely spaced 
detection volumes and analyzes cross-correlation 
between the two intensity time-traces. The two detec¬ 
tion volumes can be created via doubling the micro¬ 
scope focus by a Wollaston prism or a Michelson 
interferometer and the distance between the two foci 
serves as the intrinsic distance measure (Dertinger 
et al. 2007). 

Scanning FCS measures fluorescence intensity 
while continuously moving the detection volume 
along a linear or circular path, the geometrical 
parameters of the path, and the speed of the scanning 
defining the intrinsic calibration. It is well suited for 
investigation of slowly diffusing molecules, because 
scanning reduces the exposure time of a molecule to 
the excitation beam, and thus the risk of 
photobleaching (Ries and Schwille 2008). 
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Fluorescence Correlation Spectroscopy of Lipids, 
Fig. 2 Z-scan FCS and the positioning problem of planar 
samples. Due to the divergence of the beam, positioning of 
a planar sample out of the foeal plane results in larger detection 
areas and, thus, higher Tq and N (a). In Z-scan FCS, measure¬ 
ments are performed with the sample in different axial positions 


(b) and the dependences of Tq and N on distance from focal 
plane Az are analyzed (c). Data from a measurement of lipid 
diffusion in a supported lipid bilayer are shown. The Az depen¬ 
dence of photon count-rate per molecule Q reflects the axial 
intensity profile of the beam, which is assumed to have 
a Lorentzian shape 


While 2-focus and scanning FCS can be used for 
planar and bulk sample alike, Z-scan FCS (illustrated 
in Fig. 2) was developed to address the positioning 
problem of planar samples. It is based on performing 
a series of FCS measurements with sample in differ¬ 
ent axial positions with respect to the focal plane. 
Diffusion time and particle number N depend 
quadratically on the axial position Az as described 
by (8), where 1 is the excitation wavelength, D the 
diffusion coefficient, and c the surface concentration 
of the fluorescent species. Since each Z-scan consists 
of more measurements, it is recorded for a longer 
time, imposing higher requirements on the stability 
of the sample and the microscope (Machan and Hof 
2010b). 


'tD(Az) 


AD 


Ti’^COq^J ’ 


N{Az) = Ticoo^c 





( 8 ) 


Image correlation spectroscopy (ICS) represents 
a different approach to calibration-free FCS with 
inter-pixel distance in fluorescence microscopic 
image as the intrinsic calibration. Instead of temporal 
correlation G (t) of a fluorescence intensity time-trace, 
spatial correlation G rj) is calculated from an image; 
distances ^ and r] between correlated pixels along the 
X and Y axis, respectively, play the role of lag time t. 
Spatial correlation function G rf) is, therefore. 
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Fluorescence Correlation Spectroscopy of Lipids, 
Fig. 3 Normalized spatial autocorrelation functions G rj) 
measured by RICS. Diffusion of fluorescently labeled 10 kDa 
dextran in agarose solution (a) and of fluorescently labeled lipid 
in supported lipid bilayer (b) are compared. The temporal reso¬ 
lution along the r] axis is determined by the scanning time per 
line (3.68 ms; 256 pixels) and it is not sufficient to observe 


a surface with the maximum in the origin = 0; 
7) = 0). If a static image recorded in one instant is 
correlated, the correlation function lacks temporal 
information and only information on fluorescent parti¬ 
cle concentration can be retrieved from the maximum 
of G (<^, rj). Spatial cross-correlation between images 
forming a temporal image series allows determination 
of diffusion coefficient or direction and velocity of 
oriented flow from width and position of maximum 
of the correlation function (spatiotemporal ICS - 
STICS) (Hebert et al. 2005). Another possibility is to 
analyze by temporal correlations time-traces of fluo¬ 
rescence intensity in corresponding pixels of 
a temporal image series. That is actually a direct exten¬ 
sion of 2-focus PCS to up to hundreds of foci (Kannan 
et al. 2006). The time required for image acquisition 
imposes a limitation on timescale of processes acces¬ 
sible by that approach. Images recorded by a laser 
scanning microscope inherently contain temporal 
information, which is employed in raster ICS (RICS). 



120 130 140 
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a decay of the autocorrelation function along the ry axis in the 
case of the dextran diffusion (a), which is faster than diffusion of 
the lipid (b). The temporal resolution along the ^ axis is deter¬ 
mined hy the scanning time per pixel (10 ps) and a distinct decay 
of the autocorrelation function along the { axis is visible in both 
cases. The f and ry are scaled in pixels in the plots; a pixel 
corresponds to 50 nm 


RICS (see Fig. 3) has a rather broad dynamic range, 
because the spatial correlation contains information on 
two different timescales (fast and slow scanning axes) 
(Digman et al. 2005). 

Summary 

PCS analyzes fluctuations of fluorescence intensity 
collected from a small detection volume (defined typ¬ 
ically by the focus of a confocal microscope) and 
extracts information on the timescale of the processes 
underlying the fluctuations. Usual applications contain 
measurements of concentrations and diffusion coeffi¬ 
cients of molecules or supramolecular structures mov¬ 
ing through the detection volume or investigation of 
association between differently labeled molecules via 
cross-correlation analysis. The last decade has seen the 
introduction of several new techniques in PCS such as 
scanning PCS, multi-focus PCS or image correlation 
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spectroscopy (ICS). Their development was motivated 
mainly by the pursuit of more robust techniques for 
investigation of molecular diffusion in biological sam¬ 
ples (especially biological membranes), which is cur¬ 
rently a very important application of PCS. 

Cross-References 

► Dual-Color Fluorescence Correlation Spectroscopy 

► Fluorescence and FRET in Membranes 

► Fluorescence Correlation Spectroscopy 

► Fluorescence Cross-Correlation Spectroscopy 

► Fluorescence Fluctuation Spectroscopy 

► Lipid Lateral Diffusion 

► Single Particle Tracking 

► Zero-Mode Waveguides 
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Fluorescence Cross-Correlation 
Spectroscopy 

Tbomas Weidemann and Petra Scbwille 
Biotec/Biopbysics Research Group, Tecbniscbe 
Universitat Dresden, Dresden, Germany 

Synonyms 

Dual-color fluorescence correlation spectroscopy 


Definition 

Fluorescence cross-correlation spectroscopy (FCCS) 
is a time-dependent, spectroscopic method for the 
analysis of interacting, differently colored, fluorescent 
particles randomly diffusing through a microscopic 
detection volume under steady-state conditions. 

Introduction 

With fluorescence correlation spectroscopy (FCS), 
molecular properties are determined from recording 
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Fluorescence Cross-Correlation Spectroscopy, Fig. 1 Exper¬ 
imental setup, (a) Overlapping laser beams of different wave¬ 
length focused by a high NA objective produce an 
approximately Gaussian-shaped intensity distribution. The cen¬ 
ters of the foci may exhibit displacement (d„ dy, r/J. 
The fluorescence of the differently labeled particles is detected 
simultaneously, (b) Instrumental setup of an PCS unit 


attached to a confocal laser scanning microscope. The scanning 
unit allows to position the detection volume within cells; 
solution measurements can be implemented with a fixed mirror. 
The fluorescence signal is split by two subsequent dichroic 
mirrors (DM) into the detection channels for single photon 
counting by avalanche photodiodes (APDs) 


a fluctuating signal of a pico- to micromolar concen¬ 
tration of fluorescent molecules in solution 
(► Fluorescence Fluctuation Spectroscopy, ► Fluores¬ 
cence Correlation Spectroscopy). As primary parame¬ 
ters, FCS provides time constants of processes 
modulating the fluorescence emission, as well as num¬ 
bers and transit times of the observed particles diffus¬ 
ing through a tiny illuminated open volume (Fig. la). 
A biochemical reaction or macromolecular 
rearrangement, when linked to the fluorescence read¬ 
out, can be quantified. Since FCS is now routinely 
performed in a laser-illuminated confocal spot, measur¬ 
ing in living cells is straightforward. Different cellular 
compartments like the nucleus, the cytoplasm, or the 
plasma membrane can be easily distinguished and 
addressed. In addition to biochemical reactions, the 
local environment of macromolecules, for example, ion 
concentrations, pH, or viscosity, can be probed. Thus, 
FCS is a versatile and attractive technique when striving 
for a quantitative understanding of interactions and 
dynamics of biological and, in particular, cellular sys¬ 
tems (Lakowicz 2006; Weidemann and Schwille 2009). 

However, single color FCS bears certain limita¬ 
tions. The size and shape of a confocal detection vol¬ 
ume is prone to distortions depending on local changes 
in refractive index within the sample, a major source of 
scatter for measurements based on the fluctuation 


amplitude (which relates to the number of observed 
particles and counts per molecule). The average transit 
time of the observed complexes (diffusion time, diffu¬ 
sion constant) is quite insensitive with respect to 
molecular size; for example, a simple dimerization is 
not detectable with a conventional confocal setup. 
These limitations can be overcome by observing dif¬ 
ferently labeled, spectrally distinct molecules and 
extending the system to two detection channels 
(Fig. lb) (► Protein Fluorescent Dye Labeling). 
A dual-color setup allows the acquisition of both the 
autocorrelation of the signal in each channel and the 
cross-correlation between the two channels (Fig. 2). 
The cross-correlation function is directly proportional 
to the fraction of co-diffusing particles carrying both 
dyes simultaneously through the detection volume. 
Thus, a positive cross-correlation signal is indicative 
for molecular interactions above a background of 
noninteracting particles, even if all of them are indis¬ 
tinguishable based on diffusive properties. 

Implementation 

Cross-correlating spectrally distinct fluorescence sig¬ 
nals to monitor particle interactions was first proposed 
in 1989 (Rieka and Binkert 1989), however gained 
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Fluorescence Cross-Correlation Spectroscopy, Fig. 2 Exam¬ 
ple correlation curves, (a) Section of intensity traces in the green 
and red illustrates how pairs of values separated by a lag time x 
are multiplied for computing the correlation functions (1). (b) 
Numerical correlation curves for a double-labeled standard 
(D, = 200 pm^/s, 10 nM) measured by a two color setup with 
CDo = 0.25 pm in the green and cOo = 0.325 pm in the red channel 
(S = 5 for both foci, T = 20%, x^jp = 3 and 6 ps, chromatic 


mismatch 20% of 1/e^ radii; (2)and (3)). Dashed curves show the 
purely diffusion related correlation function without triplet or 
displacement, (c) Increase of the cross-correlation amplitude 
with progressing heterotypic binding (no cross talk), (d) Com¬ 
parison between a hetero- (open squares) and a homotypic inter¬ 
action (dots). In the latter case, the ratio between cross- and 
autocorrelation is nonlinear and reduced to one third. Cross 
talk may compress the experimentally accessible range 


much broader attention when Schwille et al. demon¬ 
strated the feasibility of the method with one- and two- 
photon excitation, using a confocal alignment and a set 
of molecules that displayed stable binding/unbinding 
(Heinze et al. 2000; Schwille et al. 1997). Since then, 
the method has matured as a powerful tool for bio¬ 
chemistry and quantitative cell biology as described in 
several reviews (Bacia et al. 2006; Bacia and Schwille 
2007; Weidemann et al. 2002). The optical setup as 
used for single point confocal FCCS is not much dif¬ 
ferent from a multicolor confocal laser scanning 
microscope: Two laser beams of different wavelengths 
are coupled into a high numerical aperture objective 
and focused in solution or inside a cell. Fluorescence of 
the distinctly labeled particles is collected by the same 
objective and detected in two spectrally well-separated 
detection channels (Fig. lb). In contrast to standard 
confocal microscopes, FCS requires single photon 
counting detectors with superior time resolution and 
quantum efficiency as compared to standard 


photomultiplier tubes. The recorded signals (counts 
per second) are fed into a hardware correlator card 
for real-time correlation or stored and processed post 
measurement. 

Data Processing 

The correlator unit computes the normalized correla¬ 
tion function from the fluorescence intensities Fj{t) 
recorded at the individual detectors k and / according to: 

,,, {F,{t)F,{t+x)) , {dF,{t)dFit+x)) 

^ {Fk){F,) {F,){F,) 

( 1 ) 

Here the two indices allow the simultaneous treat¬ 
ment of autocorrelation {k = I ^ j) or cross-correlation 
(k ^ I ^ X). The fluctuation amplitude, as defined by 
the intercept of the cross-correlation function, Gj:(0), is 
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the primary readout of the experiment (Fig. 2). The 
cross-correlation amplitudes as well as the autocorre¬ 
lation amplitudes, Gf (0) and Gf (0), are obtained by 
fitting analytical model functions to the measured cor¬ 
relation curves: 


G«(t) = G[,(t)G"(t) 

G[/(t) = 1 


= 


(1-r) 

1 1 


T 


{Nob) 2D "---" 


3D 


( 2 ) 


Note that the detection volumes Veff^ki, and therefore 
the obtained fit parameters, differ between the indi¬ 
vidual detection channels (k, /, x). The diffusion 
times of the single-color channels scale with the 
square of the excitation wavelength. For example, 
using typical laser lines of 488 nm and 633 nm 
results in a 1.68-fold prolonged diffusion time in 
the red channel for particles with exactly the same 
diffusion coefficient. The detection volume of the 
cross-correlation channel, as generated by focusing 
overlapping laser beams of different wavelength, is 
defined by the degree of spatial overlap. Displace¬ 
ment between the two foci leads to prolonged diffu¬ 
sion times and reduced amplitudes (Weidemann 
et al. 2002): 


Equation (2) describes the typical “one compo¬ 
nent with triplet” model. This assumes an ensemble 
with a narrow particle size distribution such that they 
appear as a single diffusive molecular species, 
G2 (t). However, this can be easily extended for up 
to three diffusive components (► Fluorescence Cor¬ 
relation Spectroscopy). For autocorrelation, one 
often has to account for an additional correlation 
time in the lower ps regime due to population of 
nonfluorescent dark states, G^,(t). For organic dyes, 
these dark states are a result of intersystem crossing 
from the fluorescently productive “singlet” into 
a nonfluorescent “triplet” system of electronic quan¬ 
tum states (Lakowicz 2006) (►Fluorescence). In 
G^/(t), T denotes the fraction of molecules residing 
in a dark state and Xj, the triplet relaxation time. 
A finite T leads to an additional rise of the amplitude 
which must be accounted for to derive true concen¬ 
trations. Since photophysical processes of different 
dyes are statistically independent, this term does not 
appear in cross-correlation (T = 0). 

The so-called “diffusion time” Xjjjf = rn^/AD, 
denotes the average transit time through the detection 
volume and is therefore governed by the translational 
diffusion coefficient D, of the particles and the size of 
the detection volume Veff,ki = Tt^^^coo.tWo./y'zo.jtzo,/- In 
confocal alignments, the detection volume is usually 
stretched ^fivefold along the optical axis and 
described by the “structure parameter” 5 = zo/fflo, 
the ratio of axial and lateral 1/e^ radii of Gaussian 
distributions as used for volume approximation. For 
measuring in membranes, the 3D term in (2) is simply 
dropped (as if measuring with an infinitely large S). 


Ga-(t) = G"(t) exp 


4 + 4 

AD/X A- 


4At + z2^ 
(3) 


with = {col^g i^A-col^ff ,)/2. For fitting cross¬ 
correlation curves, the exponential correction is usu¬ 
ally neglected. It seems more practical to characterize 
the alignment with a biochemically well defined, dou¬ 
ble labeled standard and linearly rescale the cross¬ 
correlation amplitudes. After fitting, the fluctuation 
amplitudes G^(0), Gf (0), and Gx(0) must be corrected 
for non-correlated background: 



Equation (4) allows to derive true concentrations 
as one would measure in solutions of homogeneously 
labeled ensembles of single or double labeled parti¬ 
cles. However, even under equilibrium conditions, 
analyzing binding phenomena, the situation is more 
complicated. Depending on the reaction type, one 
encounters a mixture containing labeled monomers, 
intermediate states, and final complexes, all of which 
are conjugated to a varying number of fluorophores. 
Because the amplitude in each channel is composed 
of individual molecular contributions weighted by 
their molecular brightness, the inverse amplitude is 
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not any longer proportional to the concentrations. 
Moreover, one has to distinguish between the con¬ 
centration of diffusing entities and the initial amount 
of reactants at the start of the binding experiment. In 
the most simple case, a heterotypic dimerization 
reaction between two binding partners homoge¬ 
neously labeled in green (g ^ k) and in red (r ^ 1), 
the ratio of background corrected cross- and autocor¬ 
relation amplitudes directly reflects the fraction of 
bound ligand: 

_ ^gr 

G^(0) Cgr + C,. C,,0 

Note that the “green” channel (k) as used for nor¬ 
malization reproduces the fraction of bound ligand 
with respect to the total concentration of red monomers 
in solution, c,-o- Other scenarios have also been worked 
out (Kim et al. 2005; Weidemann et al. 2002). 


Extensions 

Altered excitation schemes can significantly improve 
the performance of FCCS. Simultaneous excitation of 
both labels diminishes displacement of the illumina¬ 
tion foci and therefore can improve the accessible 
dynamic range of cross-correlation amplitudes 
(Heinze et al. 2000; Liu et al. 2007). A serious issue 
in FCCS is cross excitation with the wrong wavelength 
or cross talk into the wrong color channel. In both 
cases, a false positive cross-correlation amplitude 
builds up. Cross talk is especially severe when working 
with variants of green fluorescent protein inside cells, 
since these fluorophores emit a trailing tail at longer 
wavelengths and are therefore difficult to separate. 
To a certain extent, cross talk can be corrected by 
measuring single-colored probes (Bacia et al. 2006; 
Bacia and Schwille 2007; Weidemann et al. 2002). 
However, with alternating or pulsed interleaved exci¬ 
tation (PIE) a powerful method was devised to fully 
circumvent the problem (Muller et al. 2005). With 
PIE, a combination of pulsed laser excitation 
and gated detection in the nanosecond regime allows 
one to completely separate both excitation channels 
in time, during individual molecular transits. 
Recently, employing alternating excitation and focus 
scanning as technical extensions, cross-correlation 


measurements to quantify morphogen-receptor inter¬ 
actions have been conducted in living embryos (Ries 

et al. 2009). 

Cross-References 

► Confocal Laser Scanning Eluorescence Microscopy 

► Eluorescence 

► Eluorescence Correlation Spectroscopy 

► Eluorescence Fluctuation Spectroscopy 

► Green Fluorescent Protein (GFP) 

► Protein Fluorescent Dye Labeling 
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Definition 

Fluorescence fluctuation spectroscopy (FFS) records 
the intensity variations created by fluorescent particles 
passing through a small measurement volume and 
characterizes the properties of the particles by statisti¬ 
cal analysis of the measured signal. 

Introduction 

Fluorescence fluctuation spectroscopy (FFS) monitors 
the fluctuations of the fluorescence intensity from 
a small volume within the sample. This observation 
volume is typically on the order of 1 femtoliter and 
is created by confocal or two-photon microscopy. 
Fluorescent particles, which in most applications are 


fluorescently labeled molecules, are carried in and out 
of the observation volume by transport processes, 
such as diffusion. Every entry or exit of a fluorescent 
particle leads to a change in the fluorescence signal 
emanating from the observation volume. Other pro¬ 
cesses, such as diffusion or chemical reactions, may 
also modulate the fluorescence intensity of the sample. 
The measured intensity encodes information from all 
the physical processes that contribute to the variations 
in the signal. Any technique that extracts information 
from the fluctuating intensities is referred to as FFS 
(van Orden et al. 2004). 

The information content in the fluctuating signal is 
best illustrated by considering very bright particles at 
a very low concentration. A transient burst of fluores¬ 
cence light is generated every time a particle passes 
through the volume (Fig. la). These bursts or fluctua¬ 
tions of the fluorescence signal contain information 
about the sample. For example, the frequency of bursts 
directly reflects the protein concentration; the average 
duration of a burst characterizes the typical time for 
a particle to cross the observation volume; and the 
average burst amplitude describes how bright the par¬ 
ticle is (Fig. lb). The concentration of particles is 
typically such that multiple particles are present in 




Fluorescence Fluctuation 
Spectroscopy, 

Fig. 1 Fluorescence 

fluctuation spectroscopy. q 

(a) Experimental setup. When 
a fluorescent particle (depicted 
as a light bulb) diffuses 
through the observation 
volume setup by the 
microscope optics, it is excited 
and emits fluorescence. 

(b) Fluorescence bursts from 
single particles passing 
through the observation 
volume, (c) Experimental data 
showing detected photon 
counts vs time 
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the observation volume. Thus, the bursts from individ¬ 
ual particles overlap temporally, thereby obscuring the 
aforementioned properties. In addition, the fluores¬ 
cence signal is usually collected by photon counting 
methods, which as part of the detection process add 
shot noise to the fluorescence signal. Experimental 
FES data (Fig. Ic) reflect these complications and 
therefore require statistical analysis methods to extract 
information from the fluctuating signal. 

The most prominent FES technique is fluorescence 
correlation spectroscopy (FCS), which is based on the 
autocorrelation function of the fluctuating intensity 
signal. The autocorrelation function provides informa¬ 
tion on the concentration of the particles and their 
dynamic properties, such as diffusion and reaction 
rates. FCS, which was introduced in the 1970s, is the 
most widely used of all FFS techniques (Elson 2004). 
As a result, most FFS analysis originally focused 
on the translational diffusion coefficient, but the emer¬ 
gence of alternative analysis and measurement 
methods has expanded the capabilities of FFS consid¬ 
erably. For example, fluorescence lifetime, anisotropy, 
spectral separation, and spatial scanning are now 
widely used in fluorescence fluctuation experiments 
(Jameson et al. 2009). Because FCS is frequently 
reviewed, this entry focuses on brightness analysis, 
a more recently introduced technique that has become 
an important FFS technique. 

Brightness 

Brightness is a measure of the average fluorescence 
intensity of a single particle. Identifying the brightness 
relies on the ability of FFS to detect the change in 
fluorescence when a single particle enters or leaves 
the observation volume, and is a parameter not acces¬ 
sible by conventional fluorescence methods. Thus, 
FFS provides the unique opportunity of separating a 
mixture of fluorescent species that differ in brightness. 

Two FFS analysis methods, the photon counting 
histogram (PCH) and fluorescence intensity distribution 
analysis (FIDA), have been introduced independently 
for measuring the brightness (Muller et al. 2001; 
Kask and Palo 2001). Although these two methods 
were formulated using different approaches, the equa¬ 
tions of PCH and FIDA are directly transformable into 
each other. Consequently, PCH and FIDA are identical 
and interchangeable methods. Both techniques 



Fluorescence Fluctuation Spectroscopy, Fig. 2 Photon 
count distribution. Example of a theoretical probability distribu¬ 
tion function of the photon counts k (solid line with symbols). 
The shot noise contribution is shown as dashed line 

constmct a histogram of the number of photons detected 
per sampling time interval. By selecting a sufficiently 
short sampling time interval, the fluorescent particle is 
approximately stationary within the observation volume 
and each measurement interval represents a snapshot of 
the emitted fluorescence intensity of the particle. The 
number of photon counts detected in each interval 
depends on the excitation intensity distribution of the 
observation volume, on the brightness and concentra¬ 
tion of the particles, and on the shot noise of the detec¬ 
tion process. Quantitative modeling of this process 
derives the probability distribution function (pdf) of 
the photon counts, which is used to fit the experimental 
histogram (Miiller et al. 2001 ; Kask and Palo 2001 ). The 
shot noise gives rise to a Poisson distribution of photon 
counts (Fig. 2). The fluctuations in the fluorescence 
intensity due to particles traversing the observation vol¬ 
ume represent a second noise source and leads to 
a broadening of the photon count distribution. This 
broadening from a Poisson to a super-Poisson distribu¬ 
tion contains all the information on the sample. Analysis 
of the photon counting histogram of a single brightness 
species determines two parameters, the brightness and 
the average number of particles in the observation vol¬ 
ume. Analysis of the photon counting histogram of 
a mixture of brightness species ideally determines the 
brightness and average number of particles for each 
species. Resolving species by brightness requires very 
good signal-to-noise ratio, which can be difficult to 
achieve, especially for intracellular measurements. 
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If the signal-to-noise ratio is insufficient, then PCH and 
FIDA recover an apparent brightness, which represents 
a brightness averaged over the mixture of species 
present in the sample. 

Brightness analysis by PCH and FIDA goes back to 
earlier work by Elson and coworkers and Thompson 
and coworkers, who analyzed the moments of the 
photon counts (Elson 2004; Thompson and Mitchell 
2001). Their work provided the conceptual framework 
for resolving species by brightness. Because the prob¬ 
ability distribution and its moments are mathemati¬ 
cally related by a transformation, both approaches are 
suitable for brightness analysis of EES data. Moment 
analysis was further improved by the introduction 
of time-integrated fluorescence cumulant analysis 
(TIECA), which added a rigorous treatment of uncer¬ 
tainties to moment analysis. 


Brightness Analysis of Protein Interactions 
Inside Cells 

Eluorescent proteins, such as green fluorescent protein 
(GEP), provide genetically encoded fluorescent 
labels. Measurement of the brightness of fluorescent 
proteins, in turn, provides a unique way to determine 
the stoichiometry of proteins in cells (Chen et al. 
2010). This concept can be illustrated with the follow¬ 
ing example. Monomeric proteins labeled with GPP 
produce fluorescence bursts when passing through 
the observation volume. The average amplitude of 
the bursts specifies their brightness X. Suppose the 
monomers associate into dimers. Each dimer carries 
two fluorescent labels, which leads to a doubling of the 
burst amplitude. Thus, dimers appear twice as bright as 
monomeric proteins (Pig. 3). This concept was suc¬ 
cessfully tested in cells expressing either EGPP or 
EGPP 2 , a dimeric EGFP construct (Chen et al. 2010). 
A brightness study of a nuclear protein labeled with 
EGPP demonstrated the potential of EPS to measure 
both stoichiometry and protein binding curves directly 
inside living cells (Chen et al. 2010). A qualitative 
graph of the data (Pig. 4) shows that the brightness 
equals that of EGPP at low concentrations, but 
increases to twice the value as a function of concentra¬ 
tion. This is indicative of a monomer-dimer transition. 
Intermediate brightness values characterize a mixture 
of monomers and dimers. This experiment is in 
essence a titration experiment, but carried out by 
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Fluorescence Fiuctuation Spectroscopy, Fig. 3 Brightness 
and stoichiometry. The fluorescent protein GFP with brightness 
X is attached to the protein of interest. The brightness of the 
monomeric protein equals that of the GFP alone. The brightness 
of the dimer doubles, because it carries two GFPs 



Fluorescence Fluctuation Spectroscopy, Fig. 4 Brightness 
titration. A monomer/diraer equilibrium leads to a brightness 
{red curx’e) that increases as a function of protein concentration. 
The brightness h is normalized with h = \ corresponding to 
monomeric protein and h = 2 representing dimeric protein. 
A normalized brightness h between 1 and 2 is indicative of 
a mixture of monomeric and dimeric protein 

measuring the brightness of a population of cells that 
vary in the expressed protein concentration. 

Brightness provides a powerful methodology for 
the quantification of cellular homo-protein 


















Fluorescence Imaging with One Nanometer Accuracy 


803 


F 


interactions. In contrast, the translational diffusion 
coefficient of proteins measured by FCS is difficult to 
predict inside the complex environment of the cell. 
Macromolecular crowding and interactions with other 
cellular proteins affect the diffusion process in an 
intricate manner. Consequently, the diffusion coeffi¬ 
cient is generally not suitable for quantifying specific 
protein interactions, but is useful for observing associ¬ 
ation with other cellular proteins that are not fluores- 
cently labeled. 

The characterization of hetero-protein interactions 
requires that each protein species is labeled by 
a specific color. A green and a red fluorescent protein 
are suitable markers, because the large color separation 
facilitates the splitting of the fluorescence signal into 
a “green” and “red” detection channel with minimal 
spectral overlap. Dual-color FFS requires excitation 
light that coexcites both fluorescent labels and detects 
hetero-interactions, because a protein complex carry¬ 
ing both labels leads to simultaneous fluorescence 
bursts in the two detection channels. Fluorescence 
cross-correlation spectroscopy characterizes interac¬ 
tions from a measurement of the cross-correlation of 
both detection channels (Kohl and Schwille 2005). 
Similarly, dual-color brightness analysis of the two 
detection channels detects hetero-protein interactions. 
However, unlike in the single-color brightness analysis 
of homo-interactions, quantification of hetero-protein 
interactions is quite difficult. The main obstacles are 
the potential of fluorescence energy fransfer befween 
fhe fluorescenl proteins, the spectral crosstalk of the 
fluorescence emission of the two labels, and the pres¬ 
ence of two protein species. A recently introduced FFS 
analysis method has overcome these challenges and 
demonstrates the feasibility of quantitative characteri¬ 
zation of hetero-protein interactions (Wu et al. 2010). 

Cross-References 

► Fluorescence 

► Fluorescence and FRET in Membranes 

► Fluorescence Correlation Spectroscopy 

► Fluorescence Correlation Spectroscopy of Lipids 

► Fluorescence Cross-Correlation Spectroscopy 

► Lipid Lateral Diffusion 

► Protein Fluorescent Dye Labeling 

► Protein Oligomers 

► Single-Molecule Spectroscopy 
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Introduction 

The microscope allows scientists to peer into small 
objects. With new organic dyes and fluorescent pro¬ 
teins, the microscope has improved our ability to 
extract biological information from selectively labeled 
proteins, DNA, and organelles. However, a physical 
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Fluorescence Imaging with 
One Nanometer Accuracy, 
Fig. 1 An EMCCD image of 
a single fluorophore {left) and 
its point-spread function 
graph. Note that the 
fluorophore looks much bigger 
than its actual size (around 
20 nm) due to diffraction. 
Scale bar represents 400 (nm), 
and 1 pixel is around 107 (nm) 



property of light, called diffraction, has made it nearly 
impossible to see objects smaller than the diffraction 
limit, which is about 250 nm. However, much infor¬ 
mation smaller than this distance can be insightful. 
This can be accomplished without violating the dif¬ 
fraction limit using a technique called FIONA (Fluo¬ 
rescence Imaging with One Nanometer Accuracy). 
With FIONA, the position of a diffraction-limited 
spot can be located to within a few nanometers, on 
a time-scale longer than a millisecond. In this chapter, 
we discuss some background principles underlying 
FIONA and its applications. 

Diffraction-Limited Spot 

Modern biology relies on visible light-based imaging 
techniques, commonly using fluorescent dyes to label 
organelles, proteins, or nucleic acids. Fluorescence, as 
opposed to simple absorption, is typically used to spe¬ 
cifically label a particular object and increase its con¬ 
trast. Sensitivity is excellent, having reached the 
single-molecule level. Readily available choices of 
labels include organic dyes, quantum dots, and fluo¬ 
rescent proteins. Although they have endless stories of 
success in biology, they all have a critical limitation 
imposed by diffraction. No matter how small the fluo¬ 
rescent object is, diffraction blurs their images to 
a diffraction-limited size, about 250 nm. However, 
one oftentimes wants to measure changes smaller 
than this. For example, in the cell, a molecular motor 
called kinesin, “walks” with 8 nm center-of-mass 
movements. But, due to diffraction, we cannot see 


this. Nevertheless, there is a light-based technique 
that can reveal such small steps. It is called Fluores¬ 
cence Imaging with One Nanometer Accuracy or 
FIONA. One simply takes the diffraction image of 
a single molecule, which looks like an isolated moun¬ 
tain, and determines its maximum (Fig. 1). Naturally, 
one can determine the center of a distribution much 
more accurately than the width, the latter being known 
as the resolution of the microscope. 

Accuracy of Finding the Center of 
Diffraction-Limited Spot 

The width of a diffraction-limited spot image is 
~ 5 ■ M ’ "'hich is around 200-300 nm, where X is 
wavelength « 550 nm (green color), and NA is numer¬ 
ical aperture of an objective, « 1.4 (Yildiz et al. 2003) 
(Fig. 1). Nevertheless, its position can be localized 
much more accurately than this. Just how well can 
the center be determined? This is what’s called the 
standard error of the mean (s.e.m.). To first order, it is 
inversely proportional to the square-root of the total 
number of photons collected. The more photons col¬ 
lected, the clearer the image is. It is also related to the 
diffraction width - the wider the mountain, the more 
difficult it is to localize the mountain top. The s.e.m. is 
therefore proportional to the width/v/N. With 
w = 250 nm and 10,000 photons, the s.e.m. is about 
2.5 nm. (More precisely, it’s closer to a nanometer - 
see below.) 

There are two other terms that come into how well 
a peak can be localized. One is the pixel size of the 
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detector. If, the pixel size is much bigger than the entire 
mountain width, then the center can be moved around 
and you can’t tell. Typically, to get good accuracy an 
effective pixel size of about 80-100 nm is required. An 
8x8 arrangement of pixels (a CCD commonly records 
a 512 X 512 array) are then recorded and fit with 
a Gaussian function. (Technically, the diffraction- 
limited spot is an Airy function, which is a first-order 
Bessel function, but a Gaussian function generally 
does a good job of fitting the function.) The third 
term accounts for background noise that can come 
from various sources such as a noisy camera or 
autofluorescence. All of fhese effecls are summarized 
nicely in a paper by fhe Webb group in 2002 
(Thompson el al. 2002) and shown originally in 
a paper by Yildiz et al. (2003) for an organic 
fluorophore. 



( 1 ) 


Slandard error of the mean (/: x or y direction) 
s,: standard deviation of the Gaussian function in 
the i direction (typically 125 nm) 

N: collected photon number, (typically 14,000 pho¬ 
tons in s.) 

a: effective detector pixel size (typically 86 nm) 
b: uncertainty (standard deviation) of the background 
noise (typically 10) 

Using the typical values yields, cr^, = 1.24 nm, in 
each direction, meaning that the position of a dye can 
be determined to this accuracy (Yildiz et al. 2003). 
Photon noise only (first term, (1)) leads to 1.02 nm; 
pixelation due to the finite detector area (second term, 
(1)) increases to 1.04 nm; and background noise 
(third term, (1)) increases cr^ to 1.24 nm. This shows 
that photon noise is the dominant contributor to cr^. 
With the FIONA technique, particularly with quantum 
dots, it is possible to gather 20,000 or more photons 
during the detector exposure time, and therefore 
achieve 1 nm accuracy. 


Applications of FIONA 

FIONA has especially been applied to cytoplasmic 
motor proteins, i.e., to myosins that walk on the 


“roadway” called actin, or to kinesins and dynein, 
which walk in the (-F) or (—) direction on microtubules, 
respectively. It has been applied in vitro, in vivo, and to 
living organism (summarized in Toprak et al. 2010). 
FIONA has achieved the time resolution of as fast as 
2 ms, which made it feasible for in vivo study (where 
the ATP concentration is very high, so motors walk 
rapidly). And due to the nature of total internal reflec¬ 
tion fluorescence (TIRF) microscopy (see below), 
FIONA is suifable for single-molecule sfudies on sur- 
face-lethered profeins/nucleic acids or membrane pro¬ 
teins in a cell. 

Figure 2 is an example, applied to kinesin with 
a quantum dot. (Many of these studies have also been 
done with regular fluorophores, which tend to be 
smaller than quantum dots.) Figure 2a shows the 
kinesin moving with a quantum dot attached to its 
neck region. As determined in the 1990s by the Block 
lab using optical trap measurements, kinesin moves 
with a center-of-mass motion of 8.3 nm (Svoboda 
et al. 1993). A histogram shows a step size of 8.4 ± 
0.7 nm (Fig. 2b) in excellent agreement with the opti¬ 
cal trap data. To determine just how the kinesin takes 
a step, it was necessary to label one of the feet (also 
called a “head” or sometimes “hand”), as shown 
in Fig. 2c. If a kinesin “walked,” i.e., moved in a 
hand-over-hand fashion, the step size would be 
2 X 8.3 =16.6 nm. If it inchwormed along, all parts 
of the kinesin would move by 8.3 nm. From the 
step sizes and histogram, the step size is 16.3 nm 
(Fig. 2d), and clearly it walks hand-over-hand. In 
fact, one can also tell that each foot proceeds at the 
same rate. That is, the quantum dot doesn’t hinder the 
foot it is attached to and it is not limping. (When 
a molecular motor is labeled with a significantly 
smaller conventional fluorophore, it also yields 
the same values.) A hand-over-hand model with only 
one head labeled should allow the 0 nm step to be 
indirectly detected. The total probability of the 
labeled foot moving, P(t), is a convolution of 
the labeled and unlabelled head stepping, and results 
in: P{t) = kik 2 (e-'^“ - where kj is 

the rate of motion forward. With the rates being equal, 
(ki = k 2 = k), thenP(t) = tk^e^*^'. ki = k 2 fits the data 
well, which can be seen in Fig. 2e. In contrast, if the 
16.3 nm steps arise from a single process, then the 
dwell-time histogram would be expected to yield an 
exponential decay (the Poisson-distributed rate). 
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Time (sec) 

Fluorescence Imaging with One Nanometer Accuracy, 
Fig. 2 (a) shows the motion of step sizes and its histogram 
and (b) of the center-of-mass motion of kinesin. Here the kinesin 
is labeled with a quantum dot near the C-terminus, near the cargo¬ 
binding domain, (c) and (d) show kinesin labeled on a foot; it 


In conclusion, one foot steps over the second foot, 
and then the second footsteps over the first, with 
equal rates. 

Getting the Maximum Number of Photons 
for FIONA 

So far, we have learned from (1) that to locate a 
fluorophore with the maximum accuracy, one needs to 
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walks with a step size of 16.3 nm. (e) is a histogram versus dwell 
time, showing that it fits very well to a form P(t) = tk^e^'^’. 
Hence, kinesin walks in a hand-over-hand fashion, (a-b are 
adapted from Yardimci et al. 2008; c-e are adapted from Toprak 
et al. 2009) 


1. Extract as many photons as possible from the fluo¬ 
rescent dye 

2. Send the photons to a detector with minimal loss 

3. And reduce the background noise 

The success of the FIONA is attributed to ways of 
achieving (1) through (3). 

Getting the maximum number of photons emitted 
before the fluorophore photobleaches is important 
since this determines the accuracy of localization. 
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Also, it is important that the fluorophores do not blink 
since when they are in the off state, one cannot locate 
them. With very good fluorophores, we can get over 
one million photons (=l/e value). For example, Cy3, 
CF633, Atto647N all emit at this level, although the 
exact value depends on what they are attached to. 
(Anecdotally, we have found that dyes when attached 
to the ends of DNA are more photostable than when 
attached to proteins.) In general, this requires adding 
deoxygenation agents (oxygen scavenger system), e.g., 
glucose oxidase/catalase (Yildiz et al. 2003) or the 
PCA/PCD system (Aitken et al. 2008), or a reducing 
and oxidizing system (ROXS) (Vogelsang et al. 2008). 
For blinking, p-mercaptoethanol (BME) (Toprak et al. 
2009), or Dithiothreitol (Yildiz et al. 2003; Toprak 
et al. 2009) or 6-hydroxy-2,5,7,8-tetramethyl- 
chromane-2-carboxylic acid (Trolox) (Rasnik et al. 
2006) works well. 

The above reagents are for in vitro FIONA work. 
For in vivo FIONA, we are left with fluorescent pro¬ 
teins (FPs). We have found the most photostable FP is 
eGFP. Unfortunately, eGFP is still not stable enough to 
see a single one; instead, you must look at a group of 
eGFP’s. Here, the bunch must act together, as a giant 
fluorophore, and should move together (Rural et al. 
2005, 2009). Another alternative is looking at absorp¬ 
tion - looking at single melanosomes, dark pigment 
granules, for example (Rural et al. 2007). 

In terms of time resolution, one needs to get out 
enough photons to enable sub-diffraction-limited 
localization. One can turn up the excitation light so 
the photons are emitted more rapidly, thereby increas¬ 
ing the time resolution in the experiment. This works 
fine. On the other hand, with a finite number of photons 
emitted, the maximum lifetime becomes less. Also, we 
find that above a certain light intensity, the 
fluorophores tend to bleach more rapidly. 

Ways to Reduce Background 

Background noise is significantly reduced by adopting 
► total internal reflection fluorescence microscopy 
(TIRFM). With this microscopy scheme, incoming 
laser light whose incident angle is larger than 
a critical angle impinges on the water interface and 
gets reflected back away from the interface. However, 
an evanescent field (which dies off exponentially as 
you move away from the interface) is generated in the 


Buffer 



Fluorescence Imaging with One Nanometer Accuracy, 
Fig. 3 Schematic of TIR. The laser impinges on the water 
(buffer)-glass interface with angle 6, which is over the critical 
angle. An evanescent held is generated, and its intensity expo¬ 
nentially decays away from the interface. Fluorescent dyes 
within penetration depth d are excited. The indexes of refraction 
are all the same for the cover glass (gray), the index-matching 
immersion oil (brown), and objective lenses (not drawn in this 
figure) 

aqueous medium upon reflection of the laser beam, 
which enters the water through a glass coverslip/ 
slide. Consequently, only fluorophores near the glass- 
water interface are excited. The 1/e depth varies with 
the incident excitation light, generally about 100 nm. 
(The index of refraction of water, Mwater; = 1.33; the 
index of refraction of glass, Wgiass = 1-518.) Other 
regions above this criterion are essentially not excited 
(see Fig. 3), thereby reducing the background as com¬ 
pared to epi-fluorescence microscopy. 

Penetration Depth d = — 

47t 

= 118 (nm) forlo = 532 (nm), 6 = 65° (2) 

The background can be further reduced by proper 
selection of dichroic mirror and emission/excitation 
filters. The dichroic mirror and emission filter selection 
is such that reflected-laser and Raman-scattered light 
should not be allowed to enter the detector, yet emis¬ 
sion signal from excited fluorophores is sent to the 
detector as much as possible. A notch filter is some¬ 
times inserted in the emission beam path to cut off any 
reflected or scattered laser light. Putting an excitation 
filter in front of the dichroic mirror also helps clean up 
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unwanted laser light wavelength and blocks the off- 
axis light, therefore reducing the background even 
further. 

In addition to the proper dichroic mirror and hlter 
sets, employing an objective with a high numerical 
aperture (NA) and a high-performance detector allows 
the maximum amount of photons to be registered for 
final FIONA data analysis. We now have high NA 
objectives (generally 1.40-1.49, even 1.65 is avail¬ 
able) and a high-performance electron multiplying 
charge-coupled device (EMCCD) camera with almost 
100% quantum efficiency and minimal-added camera 
noise. Note that the minimum NA of the objective to 
get TIR is 1.37, which corresponds to the index of 
refraction of cells. This means that a water immersion 
objective cannot be used with TIR. We strongly rec¬ 
ommend investing money to purchase objectives with 
high NA. (Caution: An objective with 1.65 NA 
requires special immersion oil and fairly expensive 
sapphire coverslips.) 

Once fluorophore image file data are gathered, the 
center of the fluorophore is found by a two- 
dimensional Gaussian fitting using custom (Matlab, 
IDL, SigmaPlot, and so on) programming code. 
Some codes are available free of charge through 
selvin@illinois.edu. 

FIONA Variants 

FIONA has several variants. One example is 
► single-molecule high-resolution imaging with 
photobleaching (SHRImP) (Gordon et al. 2004), also 
called, Nanometer-localized multiple single-molecule 
fluorescence microscopy (NALMS) (Qu et al. 2004). It 
uses photobleaching to break the diffraction limit for 
two or more dyes located very closely. For example, 
when there are two dyes with close proximity, their 
signals are not resolved. However, when one of the 
dyes is photobleached, the position of the remaining 
dye can be determined accurately by FIONA. By then, 
subtracting the image of the remaining dye from that 
of the previous un-photobleached two dyes, the 
photobleached dye image can be inferred and its posi¬ 
tion is determined by FIONA. The resolution is then 
simply the difference between the two positions. With 
SHRImP, a resolution of 8-10 nm has been achieved 
(Gordon et al. 2004; Qu et al. 2004). The second exam¬ 
ple is ► single-molecule high-resolution colocalization 


(SHREC) (Churchman et al. 2005). While SHRImP is 
a technique for two fluorophores with the same color, 
two different fluorophores, with distinct emission spec¬ 
tra, are used in SHREC. Each emission spectrum is 
separated and imaged onto the different portion of the 
same detector. EIONA can then be used to find their 
positions. 

EIONA is also exploited in two similar high- 
resolution techniques: ► photoactivated localization 
microscopy (PALM) (Betzig et al. 2006) and ► sto¬ 
chastic optical reconstruction microscopy (STORM) 
(Rust et al. 2006). In these schemes, a high density of 
fluorophores is used, but only a small percentage of the 
fluorophores - less than one per diffraction-limited 
spot - are turned on by a pulse of light. EIONA can 
then be applied to each of the “on” fluorophores, and 
a high-resolution image is built up by pulsing many 
times and applying EIONA repeatedly (reviewed in 
Huang et al. 2009). 

Summary 

EIONA is a technique that can locate the center posi¬ 
tion of a fluorescent dye with nanometer accuracy and 
temporal resolution of up to ^2 ms. The technique is 
based on the theoretical study that the position of 
a fluorophore can be determined more accurately than 
the diffraction limit as long as we collect enough 
photons. Eor this purpose, oxygen scavenger systems 
to decrease photobleaching and reagents to suppress 
blinking have to be optimized. TIRE is also used to 
reduce background level by limiting illumination 
depth of a sample. With additional optimization such 
as using an objective with high NA, proper dichroic 
mirror/filters, and a high quantum efficiency detector, 
EIONA opened a way to see what has not been seen 
with traditional optics. 

Cross-References 

► Photoactivated Localization Microscopy (PALM) 

► Single Fluorophore Blinking 

► Single Fluorophores Photobleaching 

► Single-Molecule High-Resolution Colocalization 
(SHREC) 

► Single-Molecule High-Resolution Imaging with 
Photobleaching (SHRImP) 
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► Stochastic Optical Reconstraction Microscopy 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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Synonyms 

DNA; RNA 

Fluorescence-based assays have become increasingly 
important in biophysical studies of the structure, 
function, and dynamics of nucleic acids. The advent 
of single-molecule fluorescence spectroscopy has 
offered insight into many kinetic and mechanistic 
details previously hidden by ensemble techniques. 
These studies require the site-specific and stable incor¬ 
poration of fluorescent tags into the system of interest. 
Ribonucleic acid (RNA) and deoxyribonucleic acid 
(DNA) do not contain naturally occurring fluorophores; 
therefore, to perform fluorescence assays, it is neces¬ 
sary to introduce these groups. For short nucleic acid 
constructs, a fluorophore may be conjugated to 
a nucleotide at the 5' end, 3' end or internally. For 
longer nucleic acid constructs, a fluorophore label 
may be added by ligation or hybridization of short 
fluorescently labeled oligonucleotides. 

Short Nucleic Acids 

Short nucleic acids (less than 60 nucleotides) can be 
fluorophore labeled by conjugating to the 5' end, the 3' 
end or an internal nucleotide. The label may be added 
during oligonucleotide synthesis through the incorpo¬ 
ration of fluorescently labeled nucleotide analogs or by 
post-synthetic covalent attachment of the fluorophore 
to a modified base. 
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Fluorescence Labeling of 
Nucleic Acids, Fig. 1 Post¬ 
synthetic fluorophore labeling 
via covalent reaction, (a) An 
amino group on the 
oligonucleotide reacts with a 
succinimidyl ester conjugated 
to a fluorophore; (b) A thiol 
group on the oligonucleotide 
reacts with a maleimide 
conjugated to a fluorophore 



Labeling During Synthesis 

Fluorophores may be incorporated directly into an 
oligonucleotide during solid-phase synthesis by use 
of fluorescent phosphoramidites. Drawbacks of fluo¬ 
rescent labeling during synthesis include the limited 
availability of fluorescent phosphoramidites and lim¬ 
ited stability of many fluorophores during multiple 
rounds of nucleotide addition. 

Post-synthetic Labeling 

Post-synthetic fluorescence labeling is the most com¬ 
mon method for labeling nucleic acids. To post- 
synthetically add a fluorescent label to the 5' end or 
3' end of an oligonucleotide, an amino (—NH 2 ) or 
thiol (—SH) group is added via an aliphatic carbon 
linker that is linked to the terminal phosphate group. 
Succinimidyl ester, isothiocyanate, or sulfonyl chlo¬ 
ride derivitized fluorophores may be then covalently 
linked to those nucleotides with amino functionali¬ 
ties (Fig. la). Maleimide or iodoacetamide 
derivitized fluorophores may be covalently linked 
to those nucleotides with thiol functionalities 
(Fig. lb). Labeling protocols have previously been 
published and are provided along with the dye from 
the manufacturer (Walter 2003; Rueda and Walter 
2006; Zhao and Rueda 2009). Labeling reactions can 
typically be carried out at room temperature under 
mild conditions. To post-synthetically add 
a fluorescent label to an internal nucleotide, an ali¬ 
phatic carbon linker with an amino or thiol group 
may be linked to the C5 of a modified thymine, 
uracil, or cytosine. The amino or thiol moieties may 
be reacted with the derivitized fluorophores men¬ 
tioned above. Other less common labeling strategies 


have been described, including conjugation via 
click chemistry, biotin/avidin, antibody/antigen, 
crosslinking, and caging (Walter 2003; Rueda and 
Walter 2006; Zhao and Rueda 2009). 

Labeling During Transcription 

The 5' end of an oligonucleotide may be labeled during 
transcription by introduction of fluorescently labeled 
nucleotide analogs or by functionalizing the 5' termi¬ 
nus with a thiophosphate, which is then reacted with 
a derivatized fluorescent dye. In the latter case, an 
excess of guanosine 5'-monothiosphosphate (GMPS) 
over guanosine triphosphate (GTP) is added to the 
transcription reaction. As the excess GMPS can only 
be incorporated at the 5' end, the majority of RNA 
transcripts will be primed with 5' GMPS. The thiol 
group at the 5' end can then be conjugated to 
a maleimide or iodoacetamide derivitized fluorophore. 
This method has been successfully used to, for exam¬ 
ple, site-specifically label the 5' end of precursor 
tRNA'^’^P (Rueda et al. 2005). 

Long Nucleic Acids 

Longer nucleic acids (greater that 60 nucleotides) pre¬ 
sent more of a challenge for fluorescent labeling, as 
such RNAs cannot be efficiently synthesized by solid- 
phase synthesis, and RNA polymerase cannot incorpo¬ 
rate dyes site-specifically during transcription. Two 
main strategies, hybridization of short fluorescently 
labeled oligonucleotides and ligation, have been devel¬ 
oped to introduce a fluorescent label into these longer 
nucleic acids. 






Fluorescence Labeling of Nucleic Acids 


811 


F 


Fluorescence Labeling of 
Nucleic Acids, 

Fig. 2 Fluorophore labeling 
via hybridization with 
fluorescently labeled 
oligonucleotide. 

(a) Fluorescence labeling of 
a group II intron from 
Saccharomyces cerevisiae 
(Steiner et al. 2008). (b) Site- 
specific fluorescence labeling 
of the ribosomal 30S subunit 
from Thermus thermophilus 
by introducing helix 
extensions for hybridization 
with fluorescently labeled 
oligonucleotides (Dorywalska 
et al. 2005). Red boxes 
indicate helices that were 
extended for labeling. 

(c) Fluorescence labeling of 
the catalytic domain of RNase 
P from Bacillus suhtilis (Smith 
et al. 2005). Labeled 
oligonucleotides (red and 
blue) hybridized to the 
extended loops L15 and 
LI8m. All figures were 
reproduced with permission 
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Labeling via Hybridization 

A fluorescent label may be introduced to a longer 
oligonucleotide via hybridization, in which a short 
fluorescently labeled oligonucleotide is designed to 
selectively hybridize to an appropriate single-stranded 
stretch of the RNA or DNA. A specific loop sequence 
may be engineered to afford hybridization of the 
labeled oligonucleotide. For example, a group II intron 
was fluorescently labeled via hybridization (Fig. 2a) to 
investigate the conformational changes of this large, 
catalytic RNA (Steiner et al. 2008). The ribosome was 
also site-specifically labeled (Fig. 2b) by hybridizing 
fluorescently labeled oligonucleotides to extensions in 
ribosomal RNA hairpins (Dorywalska et al. 2005). 
Finally, a long RNA derived from the catalytic domain 
of RNase P was fluorescently labeled (Fig. 2c) by 
replacing a functionally unimportant hairpin loop 


with one complementary to a short fluorescently 
labeled oligonucleotide (Smith et al. 2005). 

Labeling via Ligation 

A fluorescent label may also be introduced to a longer 
oligonucleotide via ligation of a fluorescently labeled 
oligonucleotide. In short, a fluorescently labeled oli¬ 
gonucleotide is ligated to the long unlabeled RNA 
strand via a short DNA splint oligonucleotide that is 
complementary to the ends of the two oligonucleotides 
to he connected. T4 DNA ligase is then added to 
connect the two oligonucleotides and DNase may be 
added to degrade the splint (Fig. 3). T4 DNA ligase has 
been shown to be effective at ligation as long as either 
the splint or the oligonucleotides to be ligated are 
DNA. T4 RNA ligase II has been shown to be more 
efficient at ligating two RNA oligonucleotides. 
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Fluorescence Labeling of Nucleic Acids, Fig. 3 Fluorophore 
labeling via ligation with fluorescently labeled oligonucleotide. 
Step 1: T4 DNA ligase or T4 RNA ligase 11 ligates unlabeled 
oligonucleotide with labeled oligonucleotide. Step 2: DNase 
degrades DNA splint 
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Precursor-messenger RNA was fluorescently labeled 
via ligation in order to monitor splicing by single¬ 
molecule spectroscopy (Crawford et al. 2008). 

Labeling via hybridization or ligation offer advan¬ 
tages and disadvantages. Labeling via ligation allows 
site-specific labeling anywhere in the RNA but the 
structure of the RNA competes with hybridization of 
the DNA splint, leading to a large variation in labeling 
efficiency. Labeling via hybridization of fluorescently 
labeled oligonucleotides offers a more efficient label¬ 
ing reaction as well as high specificity. However, the 
complementary oligonucleotide may dissociate, limit¬ 
ing the duration of experiments, and it may be difficult 
to find a loop in the native sequence that is sufficient 
for hybridization so that it is often necessary to intro¬ 
duce a nonnative sequence for successful hybridization 
of the labeled oligonucleotide. 

Cross-References 

► Fluorescence 

► Single-Molecule Spectroscopy 
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Definition 

Fluorescence photoactivation localization microscopy 
(FPALM) is a super-resolution imaging technique 
based on single molecule localization, which exploits 
the photophysical properties of photoactivatable or 
photoswitchable fluorescent proteins. 

Basic Characteristics 

FPALM belongs, together with photoactivation 
localization microscopy ► PALM (Betzig et al. 2006), 
stochastic optical reconstraction microscopy 
(► STORM), and ground state depletion imaging 
(GSDIM), to the wide family of individual molecule 
localization (IML) techniques. 
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Fluorescence Photoactivation Localization Microscopy, 
Fig.1 Basic concept of fluorescence photoactivation localization 
microscopy (FPALM). In normal fluorescence microscopy, the 
image is affected by the blurring effect due to the diffraction limit. 
In contrast to normal fluorescent molecules, FPALM uses 
photoactivatable or photoswitchable fluorescent probes, which 
are in an inactive state and can be switched by light in an active 
“bright” one. Molecules are photoactivated by a low intensity 


This technique allows to perform live super-resolu¬ 
tion imaging of biological structures achieving 
a resolution below 30 nm (Hess et al. 2006). In normal 
fluorescence microscopy, the image is affected by the 
blurring effect due to the diffraction limit. A classical 
FPALM experiment is based on a number of cycles of 
photoactivation, localization, and photobleaching of 
sparse subsets of photoactivatable fluorescent mole¬ 
cules. The principle behind this technique (Thompson 
et al. 2002) relies on the possibility to localize the 
positions of single molecules with a better precision 
than the optical system resolution and the final images 
are built by plotting the positions of all the localized 
molecules. A low power activation laser (usually in the 
UV-Vis range) switches the molecules to an active 
“bright” state and a readout laser allows to emit and 


activation laser, active molecules are imaged by a readout laser, 
and their positions are precisely localized (crosses). Molecules are 
then permanently turned off by photobleaching. Several cycles 
of activation, imaging, localization, and photobleaching 
are performed until enough molecules have been localized. The 
FPALM image is obtained by plotting the positions of all the 
localized molecules. The final image exhibits a resolution below 
the diffraction limit (Adapted from ST Hess, University of Maine) 


subsequently bleach the molecules, driving them into 
a “dark” state. The process of activation, imaging, local¬ 
ization, and photobleaching is repeated until enough 
molecules have been localized to reveal the structure 
of the sample and the final image can be reconstracted 

(Fig. 1). 

Within this scenario, an important role has been 
played by the development of new classes of geneti¬ 
cally encodable FPs whose spectral properties can be 
controlled and modified by light (Shaner et al. 2007). 
A step forward in this field has been represented by 
a new generation of chromophores able to switch from 
a quiescent dark state to a bright one (photoactivation) 
or convertible from a fluorescent emission spectral 
band to a different one (photoconversion). The advent 
of such a class of proteins smoothed the way to 
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super-resolution imaging techniques, such as 
photoactivation localization microscopy PALM and 
FPALM. 

The use of encodable fluorescent proteins makes 
FPALM a suitable tool for multicolor advanced 
applications (Gould et al. 2008; Gunewardene et al. 
2011; Appelhans et al. 2012) and 3D super-resolution 
(Gunewardene et al. 2011; Appelhans et al. 2012; 
Juette et al. 2008; York et al. 2011) in living samples 
(Celia Zanacchi et al. 201 1). 


Cross-References 

► Photoactivated Localization Microscopy (PALM) 

► Stochastic Optical Reconstruction Microscopy 
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Synonyms 

Fluorescence microphotolysis; FPR; FRAP 

Definition 

Fluorescence Recovery after Photobleaching is a fluo¬ 
rescence microscopy technique for measuring diffu¬ 
sion and binding of fluorescently labeled molecules 
on a microscopic scale. 

Basic Characteristics 

Fluorescence Recovery after Photobleaching (FRAP), 
a fluorescence microscopy technique developed in the 
1970s, is used to investigate the diffusion of fluores¬ 
cently labeled molecules (Peters et al. 1974). The tech¬ 
nique is suitable for diffusion coefficients in the range 
of 0.1-100 pm^/s. FRAP Is also used for studying 
binding kinetics of molecules. 

A FRAP experiment consists out of three steps. 
First, a suitable part of the sample is selected, prefer¬ 
ably exhibiting homogeneous fluorescence, and a num¬ 
ber of fluorescence images are recorded. Secondly, the 
selected area is briefly illuminated by a high power 
light source. The light destroys the fluorescence of a 
significant part of the molecules within that area, 
a process called photobleaching. Thirdly, a time series 
of fluorescence images of the recovery of the fluores¬ 
cence in the photobleached region is recorded. This 
recovery takes place because of diffusion of intact 
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Fluorescence Recovery After Photobleaching, 
Fig. 1 An illustration of a typical FRAP experiment and subse¬ 
quent analysis. On the left, four images from a recorded time 
series showing the fluorescence recovery after photobleaching of 


a rectangle in the sample. On the right, the average fluorescence 
in the photobleached rectangle in function of time. 
A mathematical model is fitted to this recovery curve {solid 
line), yielding a diffusion coefficient and immobile fraction 


fluorescently labeled molecules into the photobleached 
area, and diffusion of photobleached molecules out of 
that area. 

After performing a FRAP experiment, the recorded 
fluorescence recovery is analyzed in order to quantify 
the diffusion of the molecules (Crank 1975). Usually, a 
time course of the average fluorescence inside the 
photobleached region is used for this purpose, see 
Fig. 1. A mathematical model that describes 
the recovery is fitted to this curve, resulting in a value 
for the diffusion coefficient of the fluorescently labeled 
molecules. Usually the fraction of locally immobilized 
molecules is calculated as well from the difference in 
initial and final fluorescence intensity. 

More elaborate models for analysis of FRAP 
data exist that allow to calculate the diffusion coeffi¬ 
cient of multiple species, to look for anisotropic diffu¬ 
sion, or to determine reaction binding rates (Sprague et 
al. 2004). Some of these models require the full two- 
dimensional fluorescence recovery images rather than 
the average fluorescence in the photobleached spot. 

The experimental setup required to perform 
a FRAP experiment consists of a fluorescence micro¬ 
scope equipped with a high power light source for 
photobleaching. The light source can be a regular 
mercury arc lamp, but lasers are preferred due to 
their high power and narrow excitation spectrum. 
FRAP experiments can be conveniently performed 
on a confocal laser scanning microscope (CLSM) as 
they combine laser illumination of multiple wave¬ 
lengths with the possibility to photobleach arbitrary 
user-defined areas. 


FRAP has found ample application in cell biology 
research, more in particular diffusion measurements of 
proteins in the cell membrane. Since the introduction 
of the fluorescent proteins, the diffusion and binding of 
proteins inside the cell cytoplasm and nucleus were 
also investigated with FRAP (Reits et al. 2001). In 
combination with two-photon microscopy, which 
allows deeper penetration inside tissues, in vivo 
FRAP measurements have been performed as well. In 
pharmaceutical research, FRAP was used, e.g., to 
measure the behavior of therapeutic drugs in delivery 
systems such as hydrogels or in the cell interstitium of 
tumors (Meyvis et al. 1999). 

For more information on this topic, the reader is 
referred to the references listed below. 
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Synonyms 

Pluorescence resonant energy transfer (LRET); 
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Introduction 

With the cloning of the hrst acetylcholine receptor and 
voltage-gated Na"^ and channels in the 1980s, two 
classes of structure-function studies became possible 
for the first time: (a) site-directed mutagenesis com¬ 
bined with electrophysiological recordings to identify 
functional domains from specific perturbations of 
channel function and (b) structural studies. In the 
25 years since then, practically all known channels 
were added to the collection. Punctional and structural 
studies rapidly led to major progress in the understand¬ 
ing of the subunit and domain organization of chan¬ 
nels, into the mechanism of selective permeation and 
into some aspects of gating. However, such studies 
could not definitively resolve some fundamental ques¬ 
tions about gating because they could not deduce how 
the channels change their structure in relation to their 
functional transitions. To obtain this kind of informa¬ 
tion, it became necessary to develop measures of chan¬ 
nel conformation that could be determined on channels 
expressed in live cells and which would report confor¬ 
mational changes in real time. Pluorescence spectros¬ 
copy seemed perfectly suited to this challenge. 
Provided it is not too intense, light does not harm 
cells or alter the function of channels, nor does it 
interfere with the electronics of a voltage clamp. 


Pluorescence signals are large enough to detect from 
small numbers of molecules, and even single mole¬ 
cules in cells. In the mid-1990s, investigators began to 
capitalize on two different properties of fluorophores 
that could report on protein structure and motion: (a) 
environmentally sensitive dyes, which “sniff’ an 
extremely local nano-environment around the dye 
attachment site and provide a real-time report of struc¬ 
tural rearrangements that change this nano-environ- 
ment, and (b) fluorescence and luminescence 
resonance energy transfer (LRET and LRET), which 
provide measures of changes in distance between pairs 
of fluorescent probes. Both environment sensing and 
energy transfer sensing could be carried out on chan¬ 
nels expressed in heterologous cells at the same time as 
the functional state of the channel was controlled and 
monitored by voltage clamping. The method is known 
as voltage clamp fluorometry (VCF). 

VCF 

Fluorophores, Protein Attachment, and Sensitivity 
to Protein Conformation 

The idea behind VCF (Mannuzzu et al. 1996) and 
a patch clamp variant called patch clamp fluorescence 
(PCF) (Taraska and Zagotta 2010) is that an environ¬ 
mentally sensitive fluorophore is used to detect the 
structural changes of an ion channel. Measurements 
can be made from many channels at once (Mannuzzu 
et al. 1996) or in single molecule imaging (Sonnleitner 
et al. 2002; Blunck et al. 2008). DNA or RNA 
encoding the channel protein is delivered into a non- 
excitable cell that has few channels of its own so that 
the channel can be studied in an environment where it 
is present at much higher densities than native chan¬ 
nels. The most common cell to use is the oocyte from 
the South African clawed frog, Xenopus. 
A fluorophore is attached to a specific location in the 
channel. The gene encoding the jelly fish green fluo¬ 
rescent protein (GFP) can be genetically fused to the 
gene encoding the channel so that the channel protein 
is fluorescent as soon as it is made by the cell (Siegel 
and Isacoff 1997). Alternatively, small organic 
fluorophores, such as rhodamine, fluorescein, bodipy, 
alexa, or cyanine dyes, can be attached to specific 
amino acids of the channel after it has been made by 
the cell and shipped to its surface membrane. The 
attraction of this approach is that, unlike a large 
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Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET, Fig. 1 Voltage clamp fluorom- 
etry (VCF) provides a real-time report of the parts of the channel 
that move during different gating transitions, (a, top) Schematic 
of VCF: optical recording of protein conformation in the form of 
a fluorescence signal from fluorophores (red filled circles) that 
are attached site-speciflcally to channels or transporters (gray 
ellipses in blue membrane) is obtained simultaneously with 
voltage clamp recording of current (I) that reflects functional 
transitions of voltage sensing (gating current) and channel open¬ 
ing and closing (ionic current). The setup shown here is the two 
electrode voltage clamp which holds and jumps voltage (VO 
under the control of a computer (not shown). The red 
fluorophores are excited with green light delivered through the 
objective (below the cell) and emit red fluorescence that is 


captured by the objective and detected by a photomultiplier, 
photodiode, or CCD (not shown), (a, bottom) Depiction of how 
outward (upward) motion of positively charged S4 in response to 
membrane depolarization changes the nano-environment around 
a red fluorophore so that it becomes dimmer in activated (up) 
state, (b) A fluorophore tetramethylrhodamine attached to 
a cysteine at the outer end of S4 tracks (top) the gating charge 
(bottom). Short depolarizing steps (V) evoke gating currents, 
whose integral, the gating charge (Q), is tracked by the increase 
in fluorescence (F), indicating that S4 motion underlies voltage 
sensing, i.e., that S4 is the voltage sensor. Responses to 
five different voltage steps are superimposed, (c) Tetramethylr¬ 
hodamine attached to the pore helix (top) tracks the slow inac¬ 
tivation of the channel (bottom), indicating that a rearrangement 
of the outer pore closes the channel following its rapid opening 


fluorescent protein, the small chemical dyes are close 
in size to an amino acid so that they can be inserted at 
many locations with minimal perturbation to protein 
function (Pathak et al. 2007). The chemical dyes can be 
targeted to specific sites in the channel protein by 
mutating the channel to remove all of its externally 
exposed native cysteines and introducing a single 
cysteine residue to serve as a dye-anchoring site. 
The fluorophore selectively forms a permanent cova¬ 
lent bond with the introduced cysteine through 
cysteine-specific thiol chemistry. Once the channels 
are fluorescently labeled, the cell is voltage clamped 
on a microscope and the fluorophore is excited 


through an objective and its fluorescence emission is 
captured through the same objective and detected 
by a photomultiplier, photodiode, or CCD camera 
(Fig. la, top). 

Motions of the protein that move or, perhaps, put 
a strain on the GFP, or that move the labeled amino 
acid (and thus the attached fluorophore) in relation to 
the surrounding protein, lipid, and water change the 
fluorophore’s chemical environment and therefore its 
fluorescence (Fig. la, bottom). The environmental sen¬ 
sitivity can be to the dielectric constant around the 
fluorophore (Cohen et al. 2005), to its position in the 
membrane electric field (Asamoah et al. 2003), as well 
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as to its interaction with protein or freedom of rotation. 
The change in fluorescence intensity for the chemical 
fluorophores can be as large as 20-fold (Sonnleitner 
et al. 2002). The key to VCF is that channel fluores¬ 
cence (and thus structural state) is monitored at the 
same time as voltage jumps or agonist application is 
used to turn the channel on and off and current mea¬ 
surements are obtained to detect the functional state of 
the channel, thus making it possible to relate structural 
transitions to functional transitions (Fig. lb, c). 

The Power of Nanosecond Temporal Resolution 
and Domain-specific and Subunit-specific 
Labeling in VCF 

VCF is a time-resolved fluorescence assay. Since the 
excited state lifetimes of most fluorophores are in the 
nanosecond range (Lakowicz 2006) and the fastest ion 
channel motions occur on the microsecond scale, VCF 
reports on the protein structure at a rate much faster 
than the structural rearrangements and, thus, serves as 
a real-time readout of the protein motion. Thus, when 
the fluorophore is attached to the so-called pore helix 
in a voltage-gated channel, the fluorescence 
changes over hundreds of milliseconds, in parallel 
with the slow closure of the channel inactivation 
gate, thereby identifying this piece of the pore as 
a gating element involved in channel inactivation 
(Fig. Ic) (Gandhi et al. 2000). In contrast, when the 
fluorophore is attached to the charged S4 segment of 
the voltage-gated channel, the change in fluores¬ 
cence intensity occurs in milliseconds, and has the 
voltage dependence and kinetics of the gating current 
which precedes the opening of the pore (Fig. lb) 
(Mannuzzu et al. 1996; Cha and Bezanilla 1997; 
Gandhi et al. 2000), providing one of the key pieces 
of evidence that S4 is the channel’s voltage sensor. 
Because small dyes can be attached at many individual 
spots, it is possible to reconstruct global gating 
rearrangements of the protein from motions sensed at 
multiple individual sites (Pathak et al. 2007). More¬ 
over, dyes can be attached selectively to one subunit to 
study the influence of other, unlabeled subunits on its 
conformational changes and thereby identify coopera¬ 
tive transitions (Pathak et al. 2005). 

VCF has revealed several fundamental aspects of 
channel gating, which could not have been learned 
from static structural methods or methods that did not 
simultaneously measure structural changes and func¬ 
tional transitions. These include: (a) that S4 moves in 


a series of independent steps in the four subunits of the 
channel, with a final cooperative step corresponding to 
the opening of the gate, providing a physical embodi¬ 
ment of what was earlier shown in electrophysiological 
analysis of gating (Pathak et al. 2005), (b) that gating 
can be modulated in speed by changing the rate of S4 
motion (Schonherr et al. 2002), (c) that to open the 
FICN channel only two out of the four S4 voltage 
sensors must move (Bruening-Wright et al. 2007), 
(d) that the four nonidentical subunits of Na-F channels 
activate in a specific sequence (Chanda and Bezanilla 
2002), (e) that voltage-gated proton channels, which 
are bundled into pairs, gate cooperatively (Gonzalez 
et al. 2010; Tombola et al. 2010), and (f) that the 
voltage-sensing phosphatase is a monomer in which 
a single voltage-sensing domain, without a pore to lean 
on, undergoes as complex a series of voltage-sensing 
rearrangements as observed in voltage-gated channels 
which are made up of four voltage sensors and a pore 
(Kohout et al. 2008; Villalba-Galea et al. 2008). Envi¬ 
ronment-sensing fluorescence has also shown that 
GABAa receptor channels and p-adrenoceptor G- 
protein-coupled receptors are activated at different 
rates by agonists that have distinct actions and place 
the receptor in distinct conformations (Chang and 
Weiss 2002; Yao et al. 2006) and has provided impor¬ 
tant insights about the function of transporters 
(Larsson et al. 2004). 

FRET 

Physical Basis of FRET 

Fluorescence resonance energy transfer (FRET; some¬ 
times called Eoerster resonance energy transfer) is 
a phenomenon that can be used to measure distances 
and changes in distances between amino acid residues 
or domains in ion channels. ERET is based on the 
quantum mechanical phenomenon that the energy 
absorbed by a fluorescent molecule (the donor mole¬ 
cule) is transferred to another nearby molecule (the 
acceptor molecule) by resonance energy transfer 
(Lakowicz 2006). In a FRET experiment, the donor 
fluorophore is excited with a wavelength that mini¬ 
mally excites the acceptor fluorophore. Provided that 
the two fluorophores are properly aligned, the closer 
the acceptor fluorophore is located to the donor 
fluorophore the higher is the ERET efficiency, i.e., 
the more often the donor excitation results in energy 
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transfer to the acceptor rather than donor fluorescence 
(Fig. 2a, h). The more the donor emission 

spectrum overlaps with the acceptor absorption spec¬ 
trum, the larger is the distance over which energy 
transfer occurs between the donor and acceptor 
(Fig. 2a, b) (Lakowicz 2006). 

One can quantify the FRET efficiency by measuring 
how much of the emitted fluorescence comes from the 
donor fluorophores and how much of the emitted fluo¬ 
rescence comes from the acceptor fluorophores. The 
FRET efficiency depends on the sixth power of the 
distance between the donor and the acceptor molecules 
(Eig. 2b) (Lakowicz 2006); so by determining the 
ERET efficiency for one specific fluorophore FRET 
pair, one can infer the distance between the donor 
and the acceptor molecules and thereby determine the 
distance between the different fluorescent labeled chan¬ 
nel domains. Because the relative orientation of the 
donor and the acceptor dipoles (emission and acceptor 
transition moments) influences the ERET efficiency, 
controls for the freedom of the two fluorophores to 
rotate (anisotropy) have to be conducted to get reliable 
distance estimates (Lakowicz 2006). 

Insertion of Acceptor/Donor for FRET 

The donor and acceptor molecules needed for FRET 
can be inserted into an ion channel in many different 
ways. The cyan fluorescence protein (CEP) and yellow 
fluorescence protein (YEP), variants of GPP, have 
been inserted into different domains or subunits of 
ion channels to measure the relative movements of 
these domains. For example, movements of the cyto¬ 
solic domains in the inward rectifier K channels Kir2.1 
during channel opening and closing have been mea¬ 
sured using FRET between CEP and YEP (Sadja et al. 
2001). However, due to the relatively large size of 
these GPP variants, they have mainly been inserted in 
the N- and C-termini of ion channels to minimize 
detrimental effects on protein folding and function. 

Donor/acceptor FRET pairs of smaller chemical 
fluorophores, such as rhodamine and the Alexa 
fluorophores, can be attached at many places in an 
ion channel using commercially available cysteine- 
reactive forms of such fluorophores (see above 
section “VCP”). 

Different Ways to Measure FRET 

In most cases, one needs to estimate quantitatively the 
ERET efficiency to draw conclusions about channel 


gating from FRET measurements. One estimates the 
FRET efficiency by measuring the increase in acceptor 
fluorescence (sensitized emission) or the decrease in 
donor fluorescence (donor quenching) due to ERET 
(Lakowicz 2006). 

The three-cube method is a widely used to measure 
the sensitized emission (Taraska and Zagotta 2010). 
Because, for most ERET pairs, the donor emission 
spectrum extends well into the acceptor emission spec¬ 
trum and because the acceptor absorption spectrum to 
some degree extends into the donor absorption spec¬ 
trum (Pig. 2a), the fluorescence measured in the wave¬ 
length range of the acceptor emission will be the result 
of three processes (Pig. 2c). The three-cube method 
allows separate estimates of these three processes: 
Donor fluorescence bleed-through in the acceptor 
channel, direct excitation of the acceptor by the exci¬ 
tation light, and sensitized acceptor emission. Only 
sensitized acceptor emission is from the FRET process 
itself. The other two components of fluorescence must 
be subtracted from the fluorescence measurements for 
a quantitative evaluation of the ERET efficiency. The 
three-cube method was used, for example, to measure 
conformational changes in the cytosolic domains of 
cyclic-nucleotide-gated CNG channels (Taraska and 
Zagotta 2010). 

ERET efficiency is more directly determined by 
measuring donor quenching (or dequenching) 
(Pig. 2d). By measuring the donor emission in the 
absence and presence of the acceptor fluorophore, 
one gets a direct estimate of the ERET efficiency 
(Lakowicz 2006). In a donor dequenching experi¬ 
ment, the donor fluorescence is first measured in 
the presence of both donor and acceptor and then 
measured again after the acceptor has been 
bleached by a strong laser line that only excites 
and bleaches the acceptor (Taraska and Zagotta 
2010). In a donor quenching experiment, the 
donor fluorescence is first measured in the presence 
of only the donor and then measured again in 
the presence of both the donor and acceptor after 
the introduction of the acceptor molecule (Pig. Id) 
(Koch et al. 2008). The relative change in the 
donor fluorescence induced by the presence of the 
acceptor is the FRET efficiency (Fig. 2d). Donor 
quenching FRET experiments were, for example, 
used to determine the distance between voltage 
sensors in the two subunits of the Hv proton chan¬ 
nels (Koch et al. 2008). 
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Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET, Fig. 2 Different FRET mea¬ 
surements. (a) Absorption (abs) and emission (em) spectra for 


a FRET pair with Alexa488 as donor and tetramethylrhodamine 
(TMR) as acceptor. Alexa488 can be excited by a 488 nm laser 
{blue arrow). The absorbed energy is transferred {black arrow) 
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Further ways to measure the FRET efficiency are to 
measure how much the acceptor changes the fluores¬ 
cence lifetime of the donor fluorophore or changes 
the bleaching rate of the donor fluorophore. Due to 
the non-radiant transfer of the excitation energy 
from the donor to the acceptor during FRET, the mea¬ 
sured fluorescence lifetime of the donor will decrease 
(Eig. 2e), and the measured bleaching rate of the donor 
will decrease (Eig. 2f), with increasing ERET effi¬ 
ciency (Lakowicz 2006). These two methods were 
used by two different groups in discovering that the 
voltage sensor S4 segment in Shaker K channels 
undergoes a 180° rotation during channel activation, 
which brings some residues of the four S4 segments of 
the channel closer together and other S4 residues 
further apart (Cha et al. 1999; Glauner et al. 1999). 
It is important to note that in sensitized emission 
measurements, one only obtains a number proportional 
to the ERET efficiency, whereas in donor quenching/ 
dequenching, fluorescence lifetime, and bleaching 
experiments, absolute ERET efficiency is obtained 
(Taraska and Zagotta 2010). 

Variants and Alternatives to FRET 

Movements of ion channel domains relative to the lipid 
bilayer have been studied with ERET, using membrane- 
partitioned acceptors, such as the nonfluorescent 
dipicrylamine (DPA). In this case, GEP or a cysteine- 
linked fluorophore is attached to the channel domain of 
interest and FRET is measured between the channel- 
bound donor fluorophore and the membrane-inserted 
DPA acceptor. Using this technique, movements of 
the voltage sensor in Shaker K channels relative to the 
membrane bilayer have been determined (Chanda et al. 
2005). 

Transition metal ions, such as Cu^"^ and Ni^"^, 
receive resonance energy transfer from a fluorophore 


by transition metal ERET (Taraska and Zagotta 
2010). In this case, the donor is a small fluorophore, 
such as bimane or fluorescein, attached to a cysteine 
inserted in one part of the ion channel and the accep¬ 
tor (Cu^"^ or Zn^"^) is bound in a dihistidine motif 
inserted in another part of the ion channel (Taraska 
and Zagotta 2010). The dihistidine motif, however, 
has to be inserted in an alpha helical structure for 
reasonable affinity (pM) to the transition metal ion 
(Taraska and Zagotta 2010). Because the acceptor is 
of atomic size and the acceptor absorption is orien¬ 
tation independent, one more accurately determine 
distances between the donor and acceptor than 
with fluorophore acceptors. In addition, the range of 
accurate distance estimates is shorter in transition 
metal ERET (10-20 A) than in ERET with conven¬ 
tional fluorophores (30-80 A; Fig. 2b) (Taraska and 
Zagotta 2010). 

An alternative to FRET used to study gating in 
ion channels is tryptophan quenching of bimane 
fluorophores (Taraska and Zagotta 2010). In bimane 
quenching, one measures the effect of native trypto¬ 
phans, or tryptophans inserted in a channel domain 
of interest, on the fluorescence from a bimane 
fluorophore attached to a cysteine inserted in a nearby 
channel domain. The quenching of bimane by trypto¬ 
phan is most likely due to a short-range (5-15 A) 
photo-induced electron transfer (Taraska and Zagotta 
2010), rather than to ERET. Eor accurate distance 
estimates, the effect of the tryptophan quenching on 
the fluorescence lifetime of bimane has to be mea¬ 
sured, not just the effect on the bimane fluorescence 
amplitude (Mansoor and Earrens 2004). Tryptophan 
quenching of bimane fluorescence has been used to 
measure conformational changes in the cytosolic 
domains in CNG channels upon channel opening 
(Taraska and Zagotta 2010). 


-<- 

Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET, Fig. 2 (continued) from 
Alexa488 to TMR by FRET, (b) FRET efficiency as a function 
of distance between the donor and acceptor fluorophores for 
three different FRET pairs with different Rq (35, 50, and 
65 A). Ro depends on the spectral overlap of the specific donor 
and acceptor fluorophores. (c) Spectra showing the three differ¬ 
ent components contributing to the total emission (black) for an 
Alexa488 and TMR FRET pair: (1) Donor emission (green), 
(2) direct excitation of the acceptor by the 488 m laser (red 


dashed), and (3) sensitized emission of the acceptor due to 
FRET (red), (d) Spectra with (black) and without (green) accep¬ 
tor fluorophores in a donor quenching experiment using 
Alexa488 and TMR as the FRET pair. The ratio of the donor 
emission with (Fda) tutd without acceptor (Fd) gives directly the 
FRET efficiency (E = 1 — Fda/Fd)- (c) Donor fluorescence life 
time with (black) and without (green) the acceptor present. The 
FRET efficiency E = 1 — XoA/to- (f) Bleaching rate of the donor 
fluorescence with (black) and without (green) the acceptor pre¬ 
sent. The FRET efficiency E = 1 — Td/tda 
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a Tb-DTPA-cs124 Maleimide 



Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET, Fig. 3 LRET. (a) Molecular 
structure of a thiol-reactive lanthanide “antenna” DTPA-csl24- 
maleimide with a Th^’"^ hound, (b) LRET measured with 
time-resolved fluorescence microscopy. A short laser pulse 

LRET 

Physical Basis of LRET 

In lanthanide (or luminescence) resonance energy 
transfer (LRET), the excitation light is absorbed by 
a luminescent lanthanide, such as Tb^"^, that can trans¬ 
fer the energy to a nearby acceptor molecule by reso¬ 
nance energy transfer (Lakowicz 2006). A lanthanide 
ion is bound to the ion channel by an absorptive 
“antenna” group that is attached to the channel domain 
of interest (Fig. 3a) (Taraska and Zagotta 2010). The 
antenna group both selectively binds the lanthanide ion 
and increases the absorption of excitation light, which 
is very poor for a single lanthanide ion (Lakowicz 
2006). Acceptors in LRET are usually fluorophores 
attached to cysteines introduced in another channel 
domain. As for the FRET efficiency, the LRET effi¬ 
ciency also depends on the distance between the donor 
and acceptor to the sixth power (Fig. 2b), thereby 
allowing for distance measurements between donor 
and acceptor molecules. 

How to Measure LRET 

The lifetime of an excited lanthanide (0.5-3 ms) is very 
long compared to the lifetime of most fluorescent mol¬ 
ecules (1-10 ns) (Lakowicz 2006). The relatively long 
lifetime of lanthanide ions simplifies the measurement 


b Luminescence lifetime 



excites the lanthanide (t = 0). The background 

autofluorescence {red) decays within tens of nanosecond, 
whereas the lanthanide luminescence with (black) and without 
(green) the acceptor decays in millisecond. The LRET effi¬ 
ciency E = 1 - TdaAd 

of LRET, because the light emission by the lanthanide 
is easily separated from background fluorescence with 
time-resolved fluorescence microscopy (Fig. 3b) 
(Lakowicz 2006). The LRET efficiency is usually 
determined by lifetime measurements, because life¬ 
time shortens with LRET efficiency (Fig. 3b). LRET 
has been used to assess the rotation of the voltage 
sensor S4 during channel opening, and to determine 
the extent of S4 movement relative to the conduc¬ 
tive pore during Shaker K channel activation (Posson 
and Selvin 2008). 

FRET Versus LRET 

An advantage of LRET over FRET is that the orienta¬ 
tion independence of the lanthanide ions increases the 
accuracy of the distance estimates with LRET com¬ 
pared to FRET. Uncertainties in the orientation factor 
in FRET introduce errors in the FRET distance 
estimates (Lakowicz 2006). A disadvantage with 
LRET is the long excited lifetime of lanthanide ions, 
which allows the donor and acceptor in a LRET exper¬ 
iment (and the domains in which they are attached to) 
to explore many possible conformations before the 
absorbed energy is transferred or emitted. Because of 
the steep distance dependence of LRET, the distance 
estimates are not the average distance between donor 
and acceptor. Rather, the distance estimate is skewed 
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toward the closest approach of the donor and acceptor 
during the excited lifetime (milliseconds) (Taraska and 
Zagotta 2010). The lifetime in a FRET experiment is 
much shorter (in the ns range), and therefore, the 
acceptor and donor will not have as much time to 
sample different conformations in a FRET experiment. 

Summary 

VCE, ERET, and LRET have been able to resolve some 
of the fundamental questions about the gating mecha¬ 
nism in a number of different ion channels, because 
these techniques allow the monitoring of which parts 
of the channels move during each of its functional 
transitions. In addition, these techniques have also 
allowed the uncovering of cooperativity among 
channel subunits, as well as determining the actual 
molecular rearrangements during specihc channel 
transitions. 

Cross-References 

► Cut-Open Oocyte Voltage-Clamp Technique 

► Fluorescence 

► Ion Channel Regulation by G-Protein-Coupled 
Receptors: Recent Advances with Optical 
Biosensors 

► Molecular Dynamics Simulations of Lipids 

► Potassium Channel Selectivity and Gating at the 
Selectivity Eilter: Structural Basis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Voltage-Gated Potassium Channels, Kv Channels 

► Voltage-Gated Sodium and Calcium Channels 
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Synonyms 

3D optical sectioning microscopy 


Definition 

Fluorescence three-dimensional optical microscopy 
deals with the possibility of collecting a series of 
two-dimensional images from three-dimensional spec¬ 
imens by means of optical slicing. 

Basic Characteristics 

Optical sectioning is achieved by producing optical 
images at different positions in a three-dimensional 
specimen and can be realized using different classes 
of optical fluorescence microscopes. It is clear that the 
3-D reconstruction of an object starting from the 



Fluorescence Three-Dimensional Optical Imaging, 
Fig. 1 Optical sectioning scheme. A three-dimensional sample 
can be sketched as a series of optical slices. Let us call slice j the 
one containing the geometrical focus of the objective and refer 
to the adjacent planes as k slices. The sample contains 
a three-dimensional distribution of fluorescently labeled mole¬ 
cules whose intensity distribution is i, slice by slice. The thick¬ 
ness of each optical slice is approximately one-half of the axial 
resolution, say fsi 7-/2. Considering a description of the image 
formation process in the spatial frequency domain (using upper 
case for the related quantities) one could describe the situation as 
Oj = IjSj-FEk7^jIkSk-FN 


acquisition of two-dimensional (2-D) datasets, that is, 
2-D optical slices, is a powerful procedure for morpho¬ 
logical analysis and volume rendering. More specifi¬ 
cally, the opportunity for optical slicing a biological 
specimen like a cell, organ, or tissue allows one to get 
information from different planes of the specimen 
without being invasive, thus preserving structures and 
functionality. When a set of 2-D data sets, that is, 
images, is collected at various focus positions in 
a sample and under certain conditions, in principle, 
one can computationally reconstruct the 3-D shape of 
the object. Using an optical microscope, the observed 
image o(x, y, z), produced by a true intensity distribu¬ 
tion i(x, y, z), for example, related to the presence of 
fluorescent molecules, is a distorted representation of i 
(x, y, z) as corrupted by the characteristic transfer 
function, that is, point spread function (PSF), of the 
image formation system s(x, y, z), by some additive 
noise stemming from different sources n(x, y, z). 
Moreover, cross talk and information stemming from 
different planes rather than the actual contribute to mod¬ 
ify the representation of i(x, y, z) through o(x, y, z). 
Considering a certain plane, j, in which is placed the 
geometrical focus of the imaging lens over N within the 
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Fluorescence Three-Dimensional Optical Imaging, 
Fig. 2 Optical sectioning using 2PE autofluorescence. 2PE 
optical sectioning of Colpoda maupasi resting cysts, 21-32-pm 
average dimensions. Encystment is particularly widespread in 
species living in ephemeral fresh-water puddles and is induced 
by exhaustion of the food and drying out. The resulting images 
have been obtained at LAMBS-MicroScoBio exploiting 

3-D sample, the observed image Oj can be regarded as 
the ij distorted according to the microscope’s PSF Sj plus 
the summation of the contributions coming from the 
summation of the adjacent k planes, ij^, blurred 
according to Si^, above and below the plane j, Fig. 1. 
Noise, considered additive at a first approximation, can 
be summed to this quantity. The notation, using pedices, 
means that the problem is discretized and the solution 
lies in inverting the relationship by finding the best 
estimate, under certain criteria, for i(x, y, z). Within 
this description in the spatial domain, the mathematical 
operation involving 1 and s is a convolution. Usually, 
a description in the frequency Fourier domain is utilized. 
In such a case the relationship between the true distri¬ 
bution of emitters, I, and the point spread function of the 


autofluorescence primed using 740-nm excitation. A 
Chameleon-XR ultrafast Ti-Sapphire laser (Coherent Inc., 
USA) and a Nikon PCM2000 confocal scanning head have 
been used. Linear frame dimension is 70 pm, z steps have 
been performed every 0.5 pm, and not all the optical sections 
are imaged (Sample courtesy of Paola Ramoino, University 
of Genoa) 

system, s, is a multiplication. Several algorithms can be 
found or adapted for solving such an inverse problem. 
Web-based approaches are also available at www.svi.nl 
or www.powermicroscope.com. 

Successively, 3-D sample reconstruction is 
obtained by directly piling up 2-D images, after fur¬ 
ther scale correction is performed, accounting for 
possible axial distortion phenomena linked to the 
refractive index mismatch. The solution of the 
problem is simplified when performing confocal 
laser scanning microscopy or fwo-photon excifation 
microscopy. In fad, both methods are able to 
reject the out-of-focus information coming from 
the k adjacent planes, as demonstrated, for example, 
in Fig. 2. 
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A promising and very interesting method for 
performing fluorescence optical sectioning micros¬ 
copy is the one related to selective plane illumination 
microscopy (SPIM) on thick specimens over long tem¬ 
poral intervals and allowing 3D super-resolution fol¬ 
lowing its individual molecule localization (IML) 
implementation known as IML-SPIM. 


Cross-References 

► Confocal Laser Scanning Fluorescence Microscopy 

► Fluorescence: General Aspects 

► Optical Fluorescence Microscopy 

► Selective Plane Illumination Microscopy (SPIM) 

► Two-Photon Excitation Fluorescence Microscopy 
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Synonyms 

Fluorescence; Fluorescence resonant energy transfer 
(FRET); Photoluminescence 

Definition 

Eluorescence is the light emitted by molecules via 
spontaneous decay from an excited electronic state, 
generated by light absorption. Fluorescent molecules 
are called fluorochromes. 

Introduction 

Molecules can exist in a number of discrete energy 
levels. The lowest, stable electronic state is called 
ground state. Each electronic state is composed of 
a large number of vibrational and rotational states, 
nearer together than the electronic levels. 

Absorption of a photon of energy hv^x 
{h = Planck’s constant and v — frequency of light) 
supplied by an external source brings a fluorochrome 
from its singlet electronic ground state to an unstable 
excited singlet state. Very fast nonradiative processes 
reduce the energy of the molecule to the lowest vibra¬ 
tional level of the excited state. The fluorophore can 
relax from this level to the ground state emitting 
a photon of energy hvEm- This light is called fluores¬ 
cence (Schulman 1977; Rendell 1987; Sharma and 
Schulman 1999; Lakowicz 2006). 

Fundamentals of Fluorescence 

The molecular energetic states and the transition 
between levels can be graphically represented by 
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Fluorescence: General 
Aspects, Fig. 1 Jahlonski 
diagram. Horizontal lines 
symbolizes the vibrational 
levels in the electronic ground 
state (So), in the first (lowest) 
excited singlet state (S/) and in 
the first excited triplet state 
(Tj). Vertical arrows represent 
transitions between levels 


Excitation 

hvc. 


Internal 


conversion 


Intersystem 


crossing 


Internal 


conversion 


Fluorescence 


Internal 


conversion 


Phosphorescence 

hv„,. 


Internal 


conversion 


a Jablonski diagram (Alexander Jablonski, 1898- 
1980). The Jablonski diagram is an energy level plot 
of photophysical processes in fluorophores (Fig. 1). 

All processes shown in the diagram are described in 
the next paragraphs. 

Excitation 

The energy spacing between both electronic states and 
vibrational states is much higher than the molecular 
thermal energies at room temperature. Therefore, only 
the lowest vibrational level of the ground state (So) is 
appreciably populated. 

All orbital electrons in a molecule have a spin (s), 
either +1/2 or -1/2. The multiplicity (n) of an energetic 
level is defined as: 

n = 2s + I 

When all electrons have opposite spins {spin- 
paired), s = 0 and n ~ the state is an electronic 
singlet state. When the electrons have the same spin, 
s = 1 and n — 3: This is the triplet state. The ground 
state So for most organic molecules is an electronic 
singlet in which all electrons are spin-paired. Quantum 
mechanics rules establish that excitation by light 
absorption occurs with no change in spin-pairing; there¬ 
fore, the electronic excited state (S/) is also a singlet. 

So + hvEx S 1 

If light of the right energy is absorbed, molecules 
can be excited to one of many vibrational and 


rotational levels of the electronic state Si. Conse¬ 
quently, there is a whole range of photon energies, or 
wavelengths, allowing a transition and causing excita¬ 
tion: This range is called excitation spectrum 
(Guilbault 1990; Royer 1995; Rost 1995). 

Quantitatively light absorption is described by the 
Beer-Lambert law. This law gives the amount of light 
absorbed, for a beam passing through a path length x of 
an absorbing species in solution: 

l/lo = 

where I and Iq are the intensities of transmitted and 
incident light, respectively, C is the concentration 
of the absorbing species, and e(A) is the molar extinc¬ 
tion coefficient. The extinction coefficient of 
a fluorochrome is the amount of light of a given wave¬ 
length that is absorbed by the fluorochrome. The molar 
extinction coefficient is defined as the optical density 
of a one-molar solution of the fluorochrome through 
a 1-cm light path. 

Since e(A) is a function of wavelength, the absorp¬ 
tion process is wavelength dependent. As a 
consequence, every transition has a different probabil¬ 
ity, being more or less privileged than others. 
The excitation spectrum of a fluorochrome shows the 
probability of excitation of the fluorochrome by 
a given wavelength of incident light. The excitation 
spectrum is a plot of total emitted fluorescence versus 
excitation wavelength. 

The whole excitation process occurs in the femto¬ 
seconds (10“*^ s) timescale. 
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Internal Conversion 

Following light absorption, a chromophore is usually 
excited to a higher vibrational energy level in the hrst 
excited state S/. Conversion of vibrational energy to 
thermal motion undergoes a very fast nonradiative 
relaxation in which the excitation energy is dissipated 
as heat and the molecule relaxes to the lowest vibra¬ 
tional level of the excited state S/ This event is termed 
internal conversion or vibrational relaxation and 
occurs on the picosecond (10“'^ s) timescale. 

Fluorescence 

Fluorescence emission occurs from the lowest vibra¬ 
tional level of the hrst excited singlet state S/, when 
a photon is released from the huorochrome. This pho¬ 
ton emission brings the molecule back to one of the 
several vibrational levels of the electronic ground 
state. 

Si —> So -F hVEm + IC. 

Due to the difference among the vibrational ener¬ 
gies in the ground state, the emitted photons may have 
different energies (hvEm)- The range of the emitted 
photon energies is the fluorescence emission spectrum 
(Dewey 1991; Mason 1998; Herman 1998). This spec¬ 
trum shows the distribution of the transition probabil¬ 
ities related to the frequency of the emitted photon. 
Fluorescence emission from the lowest vibrational 
level of S/ results in a very important implication: the 
emission spectrum is independent of the excitation 
wavelength. 

The spacing of excited state vibrational levels is 
similar to that of the ground state in the majority 
of molecules, due to the strict similarity of the 
molecular geometry both in the ground and in the 
excited states. Accordingly, the emission spectrum of 
a huorochrome is a mirror image of its excitation 
spectrum, suggesting that the same transitions have 
a high probability for both excitation and huores- 
cence emission. 

Due to the loss of energy by internal conversion of 
the molecules in the excited state, emitted huorescence 
photons are less energetic than the absorbed photons. 
Accordingly they have longer wavelengths (lower fre¬ 
quencies), and the huorescence light color is red- 
shifted. This redshift is termed Stokes shift, and is the 
difference between the maxima of the excitation and 
emission spectra. 


The huorescence intensity decays exponentially 
according to: 

/(?) = he-"^ 

where I(t) is the huorescence intensity at time t, Iq is 
the huorescence intensity at time zero, and t is the time 
when the huorescence has dropped to /o/c. t is termed 
fluorescence lifetime (Dewey 1991; Lakowicz 1991; 
Mason 1998; Diaspro 2001). 

If the huorescent rate constant is kf, the huorescence 
lifetime is given by: 

T = \/kf 

In a more realistic view, the huorescence lifetime 
of a huorophore is the average time spent in the 
excited state, or the probability of the molecule 
existing in the excited state S/. However, there are 
a number of relaxation processes from S/ that do not 
result in huorescence emission. Therefore, if huores¬ 
cence is not the only way of relaxation, the lifetime 
depends on the total rate constant, kf + where k„f 
is the rate constant of all nonhuorescent relaxation 
pathways. 

Therefore: 

T = \/{kf + k„f) 

In the assumption that the huorophore relaxes to 
the ground state by huorescence alone, k„f — 0, and 
the huorescence lifetime is called natural or intrinsic 
lifetime and the symbols y or Tq are used. 

The huorescence quantum yield (QY) is dehned as 
the ratio between the number of photons emitted 
as huorescence and the number of photons absorbed 
by the molecule. 

QY = Nem/Nahs 

Quantum yield is a measure of the huorescence 
efficiency of a huorochrome. Indeed, it is the fraction 
of the excited molecules that decay by photon emis¬ 
sion, or also the fractional probability with which 
huorescence occurs, relative to other nonhuorescent 
relaxation processes. 

Accordingly: 

QY = kf/{kf + k„f) 
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If all the rate constants different from fluorescence 
of a fluorophore are small compared to kf then the 
value of QY for the molecule is Accordingly, for 
a nonfluorescent molecule QY = 0. Fluorophores nor¬ 
mally used in fluorescence microscopy have QY values 
ranging from ^0.05 to '^0.9 (Taylor et al. 1986; 
Hemmila 1991; Royer 1995; Pawley 2006). 

The higher the value of QY the greater will be the 
intensity of fluorescence. Bright fluorophores usually 
have QY > 0.1. The intensity of fluorescence I is usu¬ 
ally named brightness, and is proportional to the prod¬ 
uct of the extinction coefficient and the quantum yield: 

I (xeQY 

Relaxation Processes Competing with 
Fluorescence Emission 

Intersystem Crossing and Phosphorescence 

In the process called intersystem crossing, the nomi¬ 
nally forbidden spin-exchange converts the excited 
singlet to an excited triplet state. 

Spontaneous spin-unpairing of two electrons in the 
excited state may occur due to molecular collisions, or 
by spin/orbital interactions in the presence of heavy 
atoms such as iodide, bromine, or cesium ions. 
According to quantum mechanics, this process, called 
spin-orbit coupling, consists of the interaction of elec¬ 
tron’s spin with its motion. 

Intersystem crossing is generally due to electro¬ 
magnetic interaction between the electron’s spin 
and the magnetic field generated by the electron’s 
orbit around the nucleus. The excited electrons, 
instead of relaxing to the lowest singlet state 
through vibrational interactions, make a transition 
to the nearest excited triplet state. Since little or no 
energy must be gained or lost in the transition, the 
probability of this process occurring is more favor¬ 
able when the vibrational levels of the two excited 
states overlap. 

Similar to singlet, electrons in the triplet state rap¬ 
idly relax to the lowest vibrational level of the excited 
triple state by internal conversion. 

Fluorophores in a triplet excited state may relax to 
the ground state by photon emission. The radiative 
transition from the triplet state is termed phosphores¬ 
cence (Herman 1998; Sharma and Schulman 1999; 
Lakowicz 2006; Diaspro 2010a). As the triplet state 


generally is lower in energy than the excited singlet, 
phosphorescence takes place at longer wavelengths 
than fluorescence. 

The lifetime of the triplet state Is generally very 
long (from microseconds to several seconds); there¬ 
fore, the timescale of phosphorescence is much longer 
than the timescale of fluorescence emission. 

Quenching 

A number of conditions allow an excited fluorophore 
to relax back to the ground state through nonradiative 
processes. These processes compete with the relaxa¬ 
tion by photon emission and reduce or extinguish 
fluorescence. 

Any process which decreases the fluorescence inten¬ 
sity of a fluorophore is generally termed quenching 
(Diaspro 2001; Pawley 2006; Lakowicz 2006; Johnson 
and Spence 2010). Therefore quenching refers to any 
process that causes a reduction in the quantum yield of 
a given fluorescence relaxation. It can be either static 
(also referred to as contact quenching) or dynamic 
(also referred to as collisional quenching). 

Static quenching occurs when a fluorophore in the 
ground state binds another molecule in the ground state 
{Quencher) forming a molecular complex. They bind 
together to form a ground state complex, an intramolec¬ 
ular nonfluorescent dimer. Hydrophobic and electro¬ 
static effects cause the dye and quencher molecules to 
stack together, for instance to minimize contact with 
water. 

Multiple labeling of a molecule with bright 
fluorophores does not always lead to an increase in 
fluorescence intensity. In many cases, by increasing 
the number of the fluorescent molecules, the overall 
brightness decreases due to the phenomenon of self¬ 
quenching. Many fluorophores exhibit self-quenching 
because the presence of large concentrations of mole¬ 
cules significantly affects the quantum yield. 

Dynamic quenching refers to the loss of the 
fluorophore’s excited state energy by collision with 
a quencher. Because collisional quenching depopu¬ 
lates the excited state without allowing fluorescence 
emission, the decrease in fluorescence intensity 
matches the decrease in fluorescence lifetime. 

The dependence of the emission intensity on 
quencher concentration is given by the Stern-Volmer 
equation (Pawley 2006; Lakowicz 2006): 

lo/I = to/t = I + ky[Q]T 
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where !„ is the fluorescence intensity in the absence of 
quencher, I is the intensity in the presence of quencher 
at concentration [Q], kq is the rate constant for the 
dynamic reaction of the quencher with the fluorophore, 
T and To are the lifetime in the presence and absence of 
quencher, respectively. 

In biological specimens, molecular oxygen (O 2 ) is 
a highly effective quencher of fluorophore triplet state 
just because of its unusual triplet ground state. Metal 
ions are also strong fluorescence quenchers. 

Fluorescence Resonant Energy Transfer (FRET) 

This physical process is also called Forster resonance 
energy transfer, in honor of Theodore Forster, who 
first developed the quantum theory of singlet-singlet 
energy transfer. 

FRET is the quantum-mechanical process by which 
the energy of a fluorophore in an excited state {donor 
fluorophore) may be transferred to the ground state of 
a neighboring fluorophore or chromophore (acceptor). 
The FRET process, due to dipole-dipole interactions, is 
nonradiative and causes both quenching of the donor 
fluorescence emission and reduction of the donor fluo¬ 
rescence lifetime (Pawley 2006; Lakowicz 2006; 
Diaspro 2010a, b; Johnson and Spence 2010). 

The efficiency of this energy transfer is reasonably 
large only if three conditions occur: (1) the emission 
spectrum of the donor significantly overlaps the exci¬ 
tation spectrum of the acceptor; (2) the distance 
between donor and acceptor is within 1-10 nm of 
each other; (3) mutual orientation of donor and accep¬ 
tor is favorable, or rather donor and acceptor transition 
dipole moments are properly aligned with respect to 
one another. 

The FRET efficiency Ep can be expressed both in 
terms of distance between donor and acceptor and in 
terms of donor lifetime: 

Ep = + /•'); Ep= I- (Xda/u) 

where r is the distance between the donor and the 
acceptor; is the donor lifetime in the presence of 
the acceptor and in the absence of acceptor, 
respectively. 

The Forster radius (Rq) is the distance at which half 
the excitation energy of the donor is transferred to the 
acceptor, i.e., the ERET efficiency Ep is 0.5. For 
r = Rq the energy transfer rate equals the sum of the 
rates of the other relaxation process of the donor. 


According to Forster theory, 

Ra = Q.2n{K^n-^QDJ{X)Y'^ 

where Qp, is the quantum yield for the donor in the 
absence of acceptor, is the orientation factor 
describing the relative orientation between the donor 
and acceptor dipoles, n is the refraction index of the 
medium, and /(A) is the overlap integral that accounts 
for the degree of spectral overlap between the donor 
emission and the acceptor absorption spectra, 
can be expressed as: 

= {cosO, — cosOdCosBaf' 

where Ot is the angle between the donor emission 
dipole moment and the acceptor absorption dipole 
moment; do and 0a 0 are the angles between the vector 
joining the donor and acceptor and the emission and 
absorption transition moments, respectively. 

7(A) can be expressed as: 

/(I) = J Fd{l) ea{2)fdl 

where (A) is the fluorescence intensity of the donor 
at a given wavelength and Eq (A) the molar extinction 
coefficient of the acceptor at the same wavelength. 

As a result of the energy transfer, the excited accep¬ 
tor may relax to the ground state by fluorescence 
emission. 

Photobleaching 

Fluorescence is a cyclic process: After the photon 
emission, the fluorophore brought back to the ground 
state is ready for a new excitation by light absorption. 
However, the number of feasible cycles is very limited. 

A fluorophore in the excited state is generally more 
chemically reactive than in the ground state. Fluores¬ 
cent molecules in the excited state take part in chem¬ 
ical reactions involving photochemical modifications 
of the fluorophore and lead to the irreversible loss of its 
ability to fluoresce. This phenomenon is called 
photobleaehing (Pawley 2006; Lakowicz 2006). 

Under high-intensity illumination conditions, the 
irreversible destruction of the excited fluorophore is 
the main factor limiting fluorescence detectability. The 
degree of photobleaching is dependent on both the 
duration and intensity of exposure to excitation light. 



Fluorescence: General Aspects 


831 


F 


The very complex chemical mechanisms of the 
bleaching process are not well understood. Different 
photochemical reaction pathways seem to be respon¬ 
sible for photobleaching, including reaction between 
adjacent molecules. 

The main causes seem to involve photodynamic 
interactions between excited fluorophores and reactive 
oxygen species {ROS) dissolved in the sample 
medium, particularly molecular oxygen (O 2 ) in its 
triplet ground state. 

Due to intersystem crossing, a fluorophore in long- 
lived triplet excited state interacts with its environment 
for a very long time. Interaction between O 2 and 
fluorophore triplet states may generate singlet oxygen 
according to: 

Ti +^02 —> 5 o + *02 

Singlet oxygen has a very long lifetime, and its 
relaxation originates different types of damaging oxy¬ 
gen free radicals, leading to both bleaching of the 
fluorophore and phototoxicity in the case of living 
cells stained with the dye. Irreversible chemical reac¬ 
tions may involve other intracellular organic mole¬ 
cules, depending not only on the intracellular singlet 
oxygen concentration but also on the distance between 
the fluorophore and proteins, lipids, etc. Therefore, the 
photobleaching rates depend both on the nature of the 
fluorophore itself and, to some extent, on the mole¬ 
cule’s environment. 

Many authors investigated photochemical reac¬ 
tions responsible for the photobleaching of fluores¬ 
cein. The photobleaching quantum efficiency 
Qb, is the ratio between the number of bleached 
molecules and the number of photons absorbed dur¬ 
ing the exposure. The average number of photons 
emitted by fluorescein before bleaching is the ratio 
between the QY of the molecule and Q/,. At excita¬ 
tion intensities of lO^^-lO^"* photons x cm"^ s“', 
at 488 nm wavelength, fluorescein Qi, is close to 
3 X 10~ . Therefore, a fluorescein molecule in solu¬ 
tion emits 3 X 10^ to 4 x 10"* photons before being 
irreversibly bleached. 

Photobleaching may be minimized by reducing 
excitation intensity and by using detection strategies 
maximizing sensitivity, such as high-numerical 
aperture objectives and the widest bandpass emis¬ 
sion filters compatible with satisfactory signal 
isolation. 


Summary 

Fluorescence results from a cyclic process that takes 
place in peculiar molecules called fluorophores. 

The first step is the fluorophore excitation by 
absorption of light, the second step lies in the release 
of energy in form of heat, and the third step is the 
fluorescence emission by release of light. 

Fundamental properties of fluorophores are: (a) 
excitation and emission spectra, (b) Stokes shift, (c) 
fluorescence quantum yield, (d) fluorescence lifetime. 

Different chemical and physical processes affect or 
compete with fluorescence emission. Intersystem 
crossing, quenching, Forster resonance energy trans¬ 
fer, and photobleaching are described in this entry. 
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Synonyms 

Ion channels 

Introduction 

Membrane channels form an important class of biolog¬ 
ical machines, as they allow the efficient and selective 
permeation of both solutes and solvent across cellular 
and subcellular membranes (see Fig. 1). Ion channels, 
for example, are at the heart of signal transduction in 
nerve cells (Hille 2001). Likewise, specialized water 
channels allow the cell to rapidly exchange water with 
the surroundings upon changes in the environment 
(Preston et al. 1992). 

Inspired by the capabilities of biological channels, 
nanotechnological and biotechnological initiatives 
have started to achieve high channel efficiency and 
selectivity in designed channels and nanopores, for 
example, for the industrial scale production of 
ultrapure water. 

The combination of high efficiency and strict spec- 
ihcity presents a compromise between two competing 
factors. Selectivity can be increased by a higher 
affinity between channel and the permeant molecule. 
However, this will increase the residence time of 
the permeant molecule in the channel and thereby 
reduce the throughput. Likewise, efficiency can 
usually be enhanced by a widening of the pore, at the 
cost of a lower specificity. Substantial effort has hence 
been invested to understand the mechanisms of 
efficient, yet specihc permeation. 

Simulation of Fluxes 

For a detailed understanding of the determinants 
responsible for selective solute and solvent fluxes 
through channels and pores, a characterization of the 
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Fluxes Through Channels and Pores, Fig. 1 (a) Pathway of 
a potassium ion (magenta) through the ion channel KcsA (Doyle 
et al. 1998); (b) pathway of a water molecule (redlwhite) through 
a carbon nanotube (Hummer et al. 2001) (Reproduced with 
permission from Hummer et al. (2001)) 


underlying molecular mechanism is indispensable. 
Experimentally, it is not tractable to simultaneously 
achieve the time and spatial resolution required to 
monitor individual permeation events. Molecular 
dynamics (MD) computer simulations are, therefore, 
routinely used for this purpose. Three simulation strat¬ 
egies can be distinguished. 

Direct Flux Measurements from Nonequilibrium 
Simulations 

The net flux, measured as the number of molecules 
permeating unidirectionally through a pore as a func¬ 
tion of time, can be directly monitored from 
nonequilibrium MD simulations. Examples of driving 
forces to generate a molecular flux include osmotic 
pressure (Kalra et al. 2003), transmembrane voltage 
(Suenaga et al. 1998; Sachs et al. 2004; Kutzner et al. 
2011), and hydrostatic pressure (Zhu et al. 2002). An 
example of a simulation setup with an applied 



Fluxes Through Channels and Pores, Fig. 2 Direct simula¬ 
tion of sustained ionic fluxes through ion channels in computa¬ 
tional electrophysiology (Kutzner et al. 2011). (a) Two 
membranes are explicitly simulated, generating two compart¬ 
ments (the outer compartment is connected via periodic bound¬ 
ary conditions). Ion concentrations are chosen differently in the 


two compartments to generate a transmembrane voltage, (b) 
Simulation system of a double membrane PorB setup, (c) Trans¬ 
membrane potentials over the two membranes for chosen charge 
differences between the two compartments (Reproduced with 
permission from Kutzner et al. (2011)) 
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Fluxes Through Channels and Pores, Fig. 3 Relation 
between nonequilibrium flux and equilibrium fluctuations (de 
Groot et al. 2002; de Groot and Grubrniiller 2005). Under equi¬ 
librium, the forward and backward rate coefficients both equal 
ko. Via Kramers’ theory, ko is related to the transition barrier 
AG* as ko = tDo exp (—AG^/kgT) with cOq Kramers’ prefactor or 
attempt frequency, ks Boltzmann’s constant and T the temper¬ 
ature. Under nonequilibrium conditions {bottom) the chemical 
potential or free energy difference Ap increases the forward rate 
k’’' by a factor exp (Ap/kgT). The resulting net flux is j = k’’’ — 
k" = 2ko sinh(Ap/2kBT) molecules per second, or, in linear 
approximation j = ko Ap/kaT. This approximation holds for 
a free energy difference Ap that is small compared to the main 
permeation barrier AG^ (Reproduced with permission from de 
Groot and Grubrniiller (2005)) 

transmembrane voltage is depicted in Fig. 2. The sim¬ 
ulation of nonequilibrium fluxes allows a direct com¬ 
parison with fluxes determined experimentally, either 
on a single pore level or derived from hulk measure¬ 
ments. Hence, it is the method of choice when the 
permeation process is fast enough to allow sufficient 
sampling at MD timescales, typically limited to micro¬ 
seconds. Sampling issues may be overcome by apply¬ 
ing gradients higher than applied experimentally or 
physiologically, at the risk of introducing artifacts. 

Flux Estimations from Equilibrium Fluctuations 

It has been shown that unidirectional fluxes that arise 
because of an applied gradient are directly related to 
bidirectional equilibrium fluctuations (de Groot et al. 
2002; de Groot and Grubrniiller 2005). This relation is 


particularly useful for channels operating with 
very small gradients, such as water channels that are 
regulated by an osmotic gradient: Osmotic gradients as 
exerted by osmolytes at typically millimolar concen¬ 
trations are so small that the number of net, unidirec¬ 
tional permeation events, which contribute to the flux, 
is orders of magnitude smaller than the number of 
bidirectional equilibrium permeation events caused 
by diffusion (see Fig. 3). Therefore, for such channels, 
the linear relationship between flux and equilibrium 
fluctuations, an approximation that holds for small 
gradients, allows to make use of the orders of magni¬ 
tude better statistics of bidirectional equilibrium per¬ 
meation events rather than the small number of 
unidirectional events that generate a net flux. Techni¬ 
cally, equilibrium simulations are often more straight¬ 
forward to set up than nonequilibrium simulations as 
no external gradient is required. 

Energetically, the picture associated with this sce¬ 
nario is that the permeation process is dominated by 
a permeation barrier that is substantially higher than 
the free energy difference between the two compart¬ 
ments connected by the pore due to a concentration 
differences. 

Flux Estimations from Free Energy Profiles and 
Rate Theory 

A third, popular, method builds directly on the relation 
between flux and the underlying free energy landscape 
and assesses the free energy permeation profile rather 
than specific permeation events. The free energy 
profile is usually obtained as a potential of mean 
force (PMF) derived from either equilibrium or 
nonequilibrium simulations (Fig. 4). 

Among the equilibrium approaches, umbrella sam¬ 
pling (Torrie and Valleau 1977) is the most commonly 
used technique. In umbrella sampling, a putative per¬ 
meation pathway is required for the definition of a 
reaction coordinate. In separate simulations 
(“umbrella windows”) the permeant molecule is 
restrained at different locations in the pore, to cover 
the complete permeation pathway. The positional dis¬ 
tributions of the restrained molecule are determined 
both by the restraining potential (usually a harmonic 
potential) and the permeation profile, the PMF. From 
the deviation of the positional distributions resulting 
from the restraining potential alone, the PMF can be 
reconstructed. A common way to achieve this is the 
weighted histogram analysis method (WHAM) that 
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Fluxes Through Channels and Pores, Fig. 4 Left panel: 
Principle of umbrella sampling to derive a PMF underlying 
permeation (Torrie and Valleau 1977). In addition to the 
PMF exerted by the channel {black), an additional umbrella 
potential (green) is applied to the permeant molecule of inter¬ 
est (blue). The resulting positional distribution histogram 
(green) therefore is determined by the PMF and biased by the 
umbrella potential. WHAM can be used to unbias a series of 


histograms to reveal the underlying PMF (Kumar et al. 1992). 
Right panel: Two-dimensional PMF for the permeation 
of potassium ions through the potassium channel KcsA 
(Berneche and Roux 2001). The PMF is shown as a function 
of the position of the ion in the cavity, Z 3 , and the position of 
the center-of-mass of the two ions in the selectivity filter, Z 12 
(Right panel is reproduced with permission from Berneche and 
Roux (2001)) 


takes into account distribution histograms from multi¬ 
ple windows to estimate a PMF (Kumar et al. 1992). 
Briefly, WHAM recovers PMF that matches the 
observed probability distributions with the lowest pos¬ 
sible statistical error. 

Alternatively, nonequilibrium approaches 
have been developed to estimate PMFs (Chipot and 
Pohorille 2007). In such a nonequilibrium approach, 
solutes are typically pulled through the channel by 
attaching them to a moving harmonic spring, to 
enforce permeation events. As the pulling occurs at 
a finite speed, the work required for such an enforced 
translocation event contains a (reversible) contribution 
from the permeation free energy as well as an 
(irreversible) nonequilibrium work contribution that 
is dissipated as heat. By employing either the Jarzynski 
or Crooks nonequilibrium formalisms, the free energy 
equilibrium profile can be constructed from a 
nonequilibrium work distribution. 

Rate theory connects the PMF profile with perme¬ 
ation rates, therefore allowing an estimate of equilib¬ 
rium (bidirectional) permeation coefficients. 
Unidirectional fluxes can be estimated as well, in the 
regime of driving forces that are small relative to the 


permeation barrier, such as, e.g., in the case of osmotic 
pressure differences. The basic assumption made here 
is that the permeation rate is proportional to the prob¬ 
ability to pass the highest barrier in the permeation 
profile. This probability is given by the Boltzmann 
factor of the highest barrier. To achieve absolute 
permeation rates, a pre-exponential factor or attempt 
frequency is required, defined as the frequency with 
which the barrier is effectively attempted to be crossed 
spontaneously, successful or not. As the estimate 
of attempt frequencies is frequently nontrivial, PMF 
profiles are frequently used to estimate relative rather 
than absolute permeation rates. 

Depending on the complexity of the permeation 
pathway, either one-dimensional or multidimensional 
reaction coordinates may be appropriate. Correspond¬ 
ingly, one-dimensional or multidimensional PMFs 
will be the result of such a calculation. Examples of 
a one-dimensional PMF include solute permeation 
PMFs through aquaporins, where the pore coordinate 
is chosen as reaction coordinate. An example for 
a multidimensional PMF is potassium permeation 
through a potassium channel (Berneche and Roux 
2001, Fig. 4). Also here, the pore axis is suitable as 
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reaction coordinate, but as a second dimension the dis¬ 
tance between ions needs to be taken into account as 
ions do not move independently through the pore. 

The biasing potential in both equilibrium and 
nonequilibrium methods can be chosen arbitrarily, 
rendering the PMF approach suitable also for high 
barriers that are precluded from direct flux estimates, 
as the spontaneous permeation rate would be too low to 
gain sufficient statistics on simulation timescales. In 
addition, the driving force required in nonequilibrium 
simulations in terms of, e.g., a concentration gradient 
or an applied voltage would be unrealistically high. 

Cross-References 

► Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 
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Definition 

Force-fluorescence spectroscopy generally refers to 
the combination of single-molecule manipulation tech¬ 
nique with simultaneous single-molecule fluorescence 
measurement to characterize the mechanochemical 
properties of a macromolecule. 

Basic Characteristics 

On the one hand, single-molecule manipulation using 
scanning probe techniques, ► optical tweezers, and 
► magnetic tweezers enables mechanical perturbation 
of a system at the single-molecule level. On the other 
hand, intermediates in time-dependent reactions that 
are otherwise difficult to study in conventional ensem¬ 
ble experiments due to the averaging over many 
molecules can now be observed and studied directly 
using single-molecule fluorescence techniques. The 


combination of single-molecule manipulation tech¬ 
niques with single-molecule fluorescence measure¬ 
ments can thus find many potential applications in 
biophysical research, for example, manipulation of a 
macromolecule, while observing conformational tran¬ 
sitions of the molecule in real time. ► Optical twee¬ 
zers, ► magnetic tweezers, and ► atomic-force 
microscopy have all been combined with fluorescence 
microscopy for simultaneous manipulation and fluo¬ 
rescence measurements at the single-molecule level. 

One of the first studies of this kind was targeting 
single myosin molecules and their interactions with 
actin (Ishijima et al. 1998). A single actin filament 
with beads attached to both ends was levitated in 
solution using optical tweezers (Fig. la). The single 
actin filament was brought into contact with a single 
myosin molecule immobilized on the surface of 
a coverslip. Binding of a single myosin molecule to 
the actin filament in the presence of ATP triggers the 
displacement of the actin filament, which was 
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Force-Fluorescence Spectroscopy, Fig. 1 Schematics of various force-fluorescence experiments at the single-molecule level 
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measured by ► optical tweezers. Simultaneously, indi¬ 
vidual ATPase reactions of the myosin motor were 
monitored as the time course of binding and unbinding 
of a Cy3-labeled ATP analogue using ► total-internal- 
reflection fluorescence (TIRF) microscopy. The 
unique advantage of this experiment is that the tempo¬ 
ral relationship between force generation and ATP 
hydrolysis by the motor can be directly measured. 
The authors found that force generation does not 
always coincide with the release of bound nucleotide. 
Instead, there is a frequent delay of several hundreds of 
milliseconds after release of the bound nucleotide 
before the displacement of the actin. However, it is 
widely accepted that the force generation in myosin is 
directly coupled to the release of ADP. These findings 
contradict this view and suggest the presence of 
a memory state in single molecules of myosin. 

In a different setup combining ► optical tweezers 
with ► TIRF microscopy (Lang et al. 2004), Lang et al. 
applied mechanical force to a single DNA molecule 
immobilized on a surface using optical tweezers, 
while simultaneously measuring the dissociation of 
a DNA strand marked by fluorescence labels. In this 
experimental design (Fig. lb), optical trapping and 
single-molecule fluorescence coincide also in space. 
The challenges arising from this design are the 
photobleaching of fluorophores by the intense trapping 
laser through multiphoton processes and the selective 
detection of photons emitted by the single fluorophores 
of interest. Careful choice of fluorophores and engi¬ 
neering of the microscope, including alternating fluo¬ 
rescence excitation and optical trapping, are necessary 
to overcome these challenges (Lang et al. 2004). This 
design will be useful for certain systems where it is 
difficult to separate spatially the point of manipulation 
from the point of fluorescence excitation and detection. 

Confocal fluorescence imaging was also com¬ 
bined with optical tweezers to study the effect of 
subpiconewton forces on the conformational dynamics 
of Holiday junctions (Hohng et al. 2007). The Holiday 
junction is a crossover structure that can be formed by 
two double-stranded DNA molecules. It is an impor¬ 
tant intermediate in DNA homologous recombination. 
In this experiment (Fig. Ic), individual Holiday junc¬ 
tions were immobilized on a coverslip surface. One 
arm of the Holiday junction was attached to a long 
double-stranded DNA linker through complementary 
base pairing. The other end of the linker was bound 
to the surface of a bead held in solution by optical 


tweezers, through which force can be exerted and 
transmitted through the DNA linker to change the 
conformation of the Holiday junction. Force and 
single-molecule fluorescence were measured simulta¬ 
neously but separated in space. The conformational 
fluctuations of the Holiday junction under the influence 
of the mechanical force were exquisitely monitored 
through ► fluorescence resonance energy transfer 
measurements. In addition, these results demonstrated 
a lever-arm effect, where the response of the molecule 
to mechanical force can be amplified with an increase 
in the molecular “arm” length. 

For certain types of optical tweezers, the optical 
setup can be easily modified to incorporate epi- 
fluorescence imaging with simultaneous ► optical 
trapping (see this volume on ► Optical Tweezers). 
This principle was demonstrated by Wuite and 
coworkers (van Mameren et al. 2009), and Chemla 
and coworkers (Min et al. 2009). In one experiment, 
the disassembly of Rad51 proteins from preformed 
nucleoprotein filament held under tension was measured 
(van Mameren et al. 2009). The disassembly of Rad51 
protein from nucleoprotein filament is necessary for the 
completion of eukaryotic homologous recombination 
and was monitored using fluorescently labeled RadSl 
in vitro. Van Mameren et al. found that this process can 
slow down and even be stalled upon application of 
tension to the filament. In a different experiment (Min 
et al. 2009), epi-fluorescence was used to image 
a fluorescently labeled single bacterium trapped by 
optical tweezers. The authors measured bacterial flagel¬ 
lar rotation using back focal plane interferometry, which 
offers a temporal resolution greater than 100 Hz. 

► Atomic-force microscopy (AFM) and ► magnetic 
tweezers were also combined with TIRF to measure 
conformations of macromolecules in response to 
mechanical force. In one study (Hugel et al. 2002), 
a single azobenzene polymer was stretched between an 
AFM cantilever and a flint glass surface. The polymer 
conformation is sensitive to 365-nm light input due to 
the presence of the azobenzene group. Excitation of the 
azobenzene group by 365-nm light delivered through 
TIRF (Fig. Id) triggers the reversible cis-trans isomer¬ 
ization and, in turn, changes the extension of the mole¬ 
cule, which was monitored using AFM. This elegant 
example demonstrated for the first time a single¬ 
molecule device that is capable of optomechanical 
energy conversion. In a second example (Kufer et al. 
2008), AFM was used to pick up individual DNA 
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Force-Fluorescence 

Spectroscopy, 

Fig. 2 Schematic design of an 
optical tweezers with 
simultaneous epi-fluorescence 
imaging. TLA trapping laser; 
n and T2 telescopes; Ml, M2, 
and M3 mirrors; DM1, DM2, 
and DM3 dichroic mirrors; 0 1 
and 02 objectives; MC 
microfluidic chamber; TLl, 
TL2, and TL3 tube lens; RL 
relay lens; PD photodetector; 
LED 1 light source for bright 
held imaging; FLA 
fluorescence excitation laser; 
BP band-pass Alter; EMCCD 
electron-multiplying CCD 
camera for fluorescence 
imaging 



oligonucleotide molecules from a surface for ordered 
assembly of these molecules at a distant location. TIRF 
was used to characterize the quality and efficiency of 
this process. Also, magnetic tweezers have been com¬ 
bined with TIRF to study the DNA packaging motor of 
bacteriophage phi29 (Hugel et al. 2007). 

All these hybrid techniques detect fluorescence 
from single-photon excitation process. A majorities 
of them has not been applied to live cells. One potential 
future application of force-fluorescence spectroscopy 
is to study single-molecule activity in the context 
of a live cell, for example, manipulation of a single 
receptor on the cell surface and measurement of 
reporter gene expression to probe the mechanisms of 
mechanotransduction. Compared to single-photon 
excitation, two-photon excited (TPE) fluorescence 
has the superior advantages of low background, intrin¬ 
sic three-dimensional (3D) resolution, and reduced 
overall phototoxicity for biological samples. There¬ 
fore, TPE may be well suited for integration with 
single-molecule manipulation for simultaneous fluo¬ 
rescence detection with high sensitivity. This could 
be very useful for systems that are typically studied 
deep in solution or tissue, where evanescent field exci- 
tation/TIRF is difficult. Custom-built microscopes that 
combine angstrom resolution optical tweezers with 
TPE fluorescence are just beginning to appear (Cheng 
et al. 2010), in which a 830-nm trapping laser is used 
for simultaneous TPE excitation and optical trapping. 


The engineering aspects of force-fluorescence 
microscope are more complex than a conventional 
force-measuring technique because of the need to mea¬ 
sure both force and fluorescence simultaneously. How¬ 
ever, epi-fluorescence detection can always be 
incorporated into an existing ► optical tweezers as 
shown in Eig. 2. A potential drawback is the high 
fluorescence background that may not allow detection 
of fluorescence at a single-molecule level. Eurther 
engineering designs, for example, incorporation of 
TIRE excitation, confocal excitation, or TPE excitation 
are necessary to guarantee fluorescence detection at 
the single-molecule level. It is very likely that 
depending on the system to be studied and the exper¬ 
imental geometry required, different fluorescence 
excitation schemes will offer different advantages. 
Eor certain cases, force and fluorescence measure¬ 
ments are desired to coincide in space. The design by 
Block and coworkers will be suitable for this purpose, 
with further developments possible. In other cases 
where trapping and fluorescence detection can be sep¬ 
arated in space, alternative schemes can be designed 
and used (Hohng et al. 2007; Ishijima et al. 1998). 

With the development of ► nanometer resolution 
fluorescence detection (Yildiz et al. 2003) and 
subnanometer measurement by ► optical tweezers, 
force-fluorescence spectroscopy is anticipated to 
reach higher resolution for both manipulation and 
fluorescence detection, which offers a great tool for 
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biophysicist to understand the fundamental mechano- 
chemistry of macromolecules as well as mechanotrans- 
ductions that occur in the cell. 

Cross-References 

► Atomic Force Spectroscopy 

► Fluorescence Labeling of Nucleic Acids 

► Magnetic Tweezers 

► Optical Tweezers 

► Protein Fluorescent Dye Labeling 

► Single Fluorophores Photobleaching 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► TIRF 
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Synonyms 

Optoacoustic spectroscopy; Photoacoustic spectros¬ 
copy; Photothermal spectroscopy 

Definition 

► FTIR-PAS is a unique extension of IR spectroscopy 
which combines the utility of interferometry with the 
standard sample-gas microphone of the photothermal 
technique for depth-profile analysis of materials. 
FTIR-PAS is the most powerful spectroscopy-based 
depth-profiling tool that enables nondestructive and 
noncontact measurements with minimal sample prep¬ 
aration need. 

Although absorption spectrum is retrieved from 
FTIR-PAS experiments, the thermal behavior of the 
sample rather than the optical properties plays a major 
role in the generation of PA signal. 

Basic Characteristics 

The evolution of photoacoustic spectroscopy (PAS) 
began with the discovery of emission of sound from 
a thin diaphragm exposed to modulated (mechanically 
chopped) sunlight by Alexander Graham Bell in 1880. 
However, the concept was impractical until the advent 
of microphone in the 1930s. Similar effect could be 
observed when infrared or ultraviolet light is used. 
Unlike most spectroscopic techniques, PAS is not 
based on the measurement of electromagnetic radia¬ 
tion. Rather, PAS involves measurement of acoustic 
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wave (pressure oscillations) in a hermetically sealed 
cell fitted with a very sensitive microphone. The 
microphone signal, when plotted as a function of 
wavelength, contains a spectrum proportional to the 
absorption spectrum of the sample. 

The photoacoustic (PA) wave generation follows 
absorption of light, which is modulated at a frequency 
in the acoustic range, by the sample. Most FTIR instru¬ 
ments provide modulation frequencies between 50 and 
500 Hz in the 400^,000 cm“^ wave number span. 
(The audible range is between 20 and 20,000 Hz.) 
When IR light is utilized, a portion of the absorbed 
energy turns into modulated local heat through radia¬ 
tionless de-excitation processes (mostly thermal de¬ 
excitation). This results in the generation of periodic 
heat diffusion from the sample into the surrounding 
gas. Such formed heat flux produces an oscillatory 
motion of the adjacent gaseous atmosphere due to the 
periodic thermal expansion and contraction of the gas. 
Periodic changes in the temperature of the boundary 
gas layer generates pressure wave (sound) since at 
a constant volume, pressure of a gas is proportional 
to temperature (the ideal gas law PV = nRT). As the 
amplitude of the pressure wave scales with the inten¬ 
sity of the absorbed light modulated at an audio fre¬ 
quency, recording the PA signal as a function of the 
wavelength produces a relative absorption spectrum. 

The thickness of the boundary layer of the gas is 
determined by the thermal diffusion layer/length (fig). 
The thermal diffusion layer acts as an acoustic piston, 
producing periodic pressure waves inside the PA cell 
(Rosencwaig 1980). 



where k (cal/cm.s.°C), p (g/cm^), and C (cal/g.°C) are 
intrinsic properties denoting the thermal conductivity, 
density, and specific heat, respectively, and / is the 
modulation frequency in Hz. The thermal diffusion 
layer can also be described by the thermal diffusivity, 
D(cm/s) which is equal to kjpC, and ca (= 2nf) which 
is the angular frequency at which the light is modulated 
(rad/s). 

Rosencwaig and Gersho provided the first one¬ 
dimensional mathematical model for the heat flow in 
the PA cell due to the absorbed light energy in 1976 
(Rosencwaig and Gersho 1976). This model is referred 


to as RG theory or the “thermal-piston” model. When 
intensity or phase modulated light with a power of Iq is 
directed to a sample, certain quanta of light absorbed 
(A) by the sample act as the photothermal source and 
can be given as (McClelland 1983; Zhang and Urban 
2004), 


A = (1 - (2) 

where x refers to the depth light can diffuse, R is 
a constant for reduction in the absorption energy due 
to the sample’s surface reflection, scattering, etc., 
is the optical absorption coefficient of the sample at 
wavelength X and its unit is cm“ *. The absorbed quanta 
of light are converted, in partly or in whole, into heat 
through non-radiative relaxation of excited molecules 
back to the ground state. As a result of this conversion, 
thermal wave at different amplitude generates and 
decays as it propagates to the thermal diffusion layer, 
Pg. The thermal wave (heat vector), which forms 
a pressure wave inside the PA cell, can be derived 
from the law of energy conservation (Zhang and 
Urban 2004), 

Tn{x) = (I - R) Ioa{X)e e (3) 

Equation 3 is used to express the amplitude of thermal 
waves Tn(x) at a given depth, x, and its propagation 
where the phase lag of thermal waves is represented by 
(xlpg). (1 — R)' corresponds to a correction constant for 
both nonabsorbed IR light and total heat loses do not 
take a part in the photoacoustic effect. 

The thermal waves propagating from a layer 
beneath the surface decay at a rate indicated by the 
thermal decay coefficient which is approximately 
equal to the inverse of the thermal diffusion length Pg. 
The mathematical representation of the decay coeffi¬ 
cient is then given as. 


1 




(4) 


The thermal wave equation can also be defined as 
a function of the thermal decay coefficient, in Eq. 3. 
The probing (sampling) depth of a PA measurement is 
approximately equal to the thermal diffusion length 
(Pg), when Os is approximately equal or greater than 
the absorption coefficient, a, which dictates the optical 
penetration depth of light (McClelland et al. 2003). 
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Fourier Transform Infrared 
Photoacoustic 
Spectroscopy (FTIR-PAS), 
Fig. 1 Schematic of an 
FTIR-PAS unit with 
a photoacoustic cell purged 
with helium, integrated with 
an FTIR spectrometer 


Mid-IR source 



Photoacoustic Cell 


However, the sampling depth is largely defined by the 
optical decay of light rather than the thermal decay 
process within the sample. The decay of thermal waves 
within the sample results in time delay between the 
thermal diffusion in the sample and the subsequent 
generation of the acoustic (pressure) wave into the 
coupling gas. This introduces a phase shift between 
the impinging modulated IR radiation and the 
photoacoustic response, so signals from deeper regions 
correspond to larger phase angle (Michaelian 1989). 
Hence, the decay and phase delay of thermal waves in 
the sample play a major role in the characterization of 
materials with depth-varying composition, such as 
medical devices, biological materials, multilayer pack¬ 
aging hlms, and coating systems because they are 
directly associated with the magnitude (intensity of 
radiation absorption) and phase component 
(McClelland et al. 2003). 

FTIR-PAS system illustrated in Fig. 1 has three 
main components. The first is the IR light source and 
intensity modulator (interferometer), which is confined 
in a typical FTIR spectrometer. The second is the 
photoacoustic cell where sample is located and filled 
with an IR-transparent nonabsorbing gas with good 
thermal coupling efficiency, such as helium at high 
purity. The third component is the signal collection 
and processing unit. Typically, at least a 50 mV/Pa 
sensitive microphone detector is utilized for the detec¬ 
tion of the photoacoustic signal. In FTIR-PAS 


experiments, an absorption spectrum is obtained by 
plotting the intensity of the acoustic waves as 
a function of wavelength, as the wavelength of the 
light continually spans from 400 to 4,000 cm“\ yield¬ 
ing information similar to that of optical absorption 
spectra with identical absorption modes (peaks). The 
photoacoustic cell is conceptually a simple apparatus, 
whose setup consists of an acoustically sealed chamber 
with a window often made up of KBr, a microphone, 
and a sample holder. It is critical that the PA cell design 
meets the signal intensity maximization and back¬ 
ground signal and noise minimization criteria. For 
minimization of the background signal, PA cells are 
usually made of either polished metal or optical-grade 
glass so that light absorption by the interior surface of 
the cell can be neglected. Mechanical vibrations and 
acoustic leaks are two main noise sources in a PA cell. 
The PA signal intensity is inversely proportional to cell 
volume. The distance between the window and the 
specimen affects the signal quality. Heat generated at 
the sample transfers to the window by conduction 
mechanism when this distance is smaller than the ther¬ 
mal diffusion length (pg) of the carrying gas (He). 
Practically, separation between the window and sam¬ 
ple by a distance of 2 mm is sufficient to prevent this 
effect for modulation frequencies above 10 Hz 
(Kinney and Staley 1983). 

PAS measurements can be performed using both 
continuous (rapid) scan and step-scan (S^) 
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interferometric operations of FTIR spectrometers, uti¬ 
lizing the advantage of the frequency and/or phase 
dependence of the PA signal. In the rapid-scan mode, 
the frequency dependence of the PAS signal frequency 
is exploited. The optical layout of Michelson interfer¬ 
ometer, which is widely utilized in FTIR instrumenta¬ 
tion, produces frequency of/= 2Vcj, (/= 2 ’V/'k) (for 
Genzel interferometer/= 4Vv). Here, V is the veloc¬ 
ity of the moving mirror that scans continuously, a is 
the wave number, and k is the wavelength. The inter¬ 
ferometer mirror moves at a constant velocity but 
modulates each IR wavelength at a distinct Fourier 
frequency. The involvement of multiple (one at each 
wave number) modulation frequencies for signal 
generation requires multiplexing for the detection. 
By changing the mirror (scanning) speed, a series of 
probe depth can be assessed in rapid-scan FTIR-PAS 
experiments. However, since the thermal diffusion 
length (fig) varies inversely with the square root of 
the modulation frequency, in the rapid-scan opera¬ 
tions, the thermal diffusion length varies considerably 
across the spectrum. For example, a probe depth at the 
lower boundary (400 cm”*) is found to be v^IO times 
deeper than that of the upper limit (4,000 cm”*) 
(McClelland et al. 2003). Step-scan operation is how¬ 
ever capable of providing a constant modulation fre¬ 
quency over the entire spectral range that allows 
a lock-in amplifier (LIA) or digital signal processing 
(DSP) methods to detect and amplify the PA signal. In 
the step-scan mode of operation, the retardation is 
changed step by step holding the optical path differ¬ 
ence fixed during data collection (scanning). There are 
two different approaches to modulate IR light using S^ 
FTIR. These are amplitude modulation (AM) by 
a mechanical chopper and phase modulation (PM) by 
jittering the position of one of the interferometer mir¬ 
rors. In a typical amplitude-modulation experiment, IR 
light passes through a mechanical chopper before 
impinging on the sample to provide all wavelengths 
with the same modulation frequency. The modulation 
frequency can be adjusted by the chopping frequency 
to control the thermal diffusion length. However, this 
kind of modulation is hardly employed because a small 
changes in the IR source intensity arises a remarkably 
increased noise level. The phase-modulation technique 
is the most efficient way to modulate IR radiation for 
PAS. The interferogram obtained using phase- 
modulation is the same as the first derivative of 
rapid-scan or amplitude-modulation interferograms 


which increases the dynamic range of the detector 
(Wang et al. 1998). PM is generated by dithering the 
fixed mirror at a constant frequency while keeping the 
average position of both interferometer mirrors at each 
sampling point. The phase-modulation amplitude is 
expressed by the wavelength of the He-Ne laser, 
^He-Ne> 633 nm. The PM mode produces a constant 
sampling depth for each modulation frequency (a sin¬ 
gle discrete frequency) accessed and offers analysis of 
both amplitude and the phase of the PAS signal 
(Palmer 1993). The use of discrete phase-modulation 
frequency allows DSP or LIA to demodulate the signal 
from microphone and extract in phase (I, surface) and 
out of phase quadrature (Q, interior) interferogram 
components simultaneously. Taking the PA signal, 
C(cr), as a complex quantity with a real (in-phase 
/(cr)) part and an imaginary (quadrature Q((t)) part, it 
can be demodulated into magnitude spectrum M(cr) 
and phase angle 9(a) as, 

C((t) = I((j) + iQ((y) = M{a) exp[—;0(f7)] (5) 

For calculation of the phase angle 0(a) and the mag¬ 
nitude spectrum M (a), the below equations are used. 

6((t) = tan”* [Q{a)/l{a)] (6) 

M{a) = +I(a)'^ (7) 

The spatial resolution is significantly increased 
beyond the IR diffraction limit when PA signal phase 
(O) is employed for depth-profile analysis, by synchro¬ 
nously collecting orthogonal in-phase (I) and quadra¬ 
ture (Q) spectra in the step-scan mode (Jiang and 
Palmer 1997). 

Nondestructive noncontact depth-profile analysis is 
the most distinct advantage of the PAS technique with 
which localization of contaminants and determination 
of diffusion in materials can be studied. Taking the 
modulation frequency as the external perturbation, 
generalized 2D spectral correlation technique can be 
applied to the step-scan FTIR-PAS spectra for the 
enhancement of both spectral and depth resolutions, 
allowing the study composite food systems (Irudayaraj 
and Yang 2002), and component diffusion in packag¬ 
ing materials (Yang and Irudayaraj 2000). Along with 
the depth-profile analyzing ability, FTIR-PAS offers 
the advantage of investigating highly absorbing. 
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composite, highly scattering biological samples 
including food and biomaterials of dry or low moisture 
materials. 

Cross-References 

► Single Fluorophores Photobleaching 

► Vibrational Spectroscopy, A Short Flistory of 
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List of Abbreviations 


ADC 


Connes 

Advantage 

D* 


DLATGS 

DTGS 

Fellget 

Advantage 


FFT 

/n 


Globar 


IF 


Analog to digital converter used to 
digitize the pre-amplified (AC- 
coupled) IR detector signal typically 
at 16 bit. 

The internal wavelength calibration in 
Fourier spectrometry provided by a 
HeNe-laser interferogram. 

Specific Detectivity characterizes the 
quality (i.e., low noise) of a detector. 
D* = (Ad)'''WEP, unit; cm Hz’'"" W', 
with Ad the detector surface. 
Deuterated L-alanine-doped triglycin 
sulfate, a ferroelectric detector mate¬ 
rial superior to DTGS. 

Deuterated triglycin sulfate, a ferro¬ 
electric detector material. 

Increase of the SNR by a factor of N'^ 
in a Fourier-spectrometer as com¬ 
pared to an equivalent measurement 
of N spectral lines with a grating 
spectrometer. 

Fast Fourier transform based on an 
algorithm developed by Cooley and 
Tukey (1965). 

Nyquist frequency, the sampling fre¬ 
quency required to correctly digitize 
the highest frequency /^ax component 
within an interferogram,/ a( = 2 /max- 
A resistively heated non-encapsulated 
MIR source typically from silicon 
carbide. 

Interferogram here the IR intensity 
measured after recombination of two 
IR beams on a detector as a function 
of the OPD between the partial beams. 
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Jacquinot 

Advantage 

MIR 


NEP 


OPD 

PEM 


SNR 

Wavenumber 


Increase of the SNR in a Fourier- 
based spectrometer due to higher 
energy throughput. 

Mid infrared, the region of the electro¬ 
magnetic spectrum ranging from 2.5 to 
25 pm (corresponding to 0.5-0.05 eV). 
Noise equivalent power, a measure of 
the optical power that produces a sig¬ 
nal equal to the noise level, unit: 

Optical path difference. 

Photoelastic modulator, a hirefringent 
material (typical ZnSe) exposed to 
periodic elastic strain used to gener¬ 
ate circularly and linearly polarized 
light whose orientation is modulated 
by the fundamental and first har¬ 
monic of the elastic modulation, 
respectively. 

Signal to noise ratio. 

Number of wave lengths of mono¬ 
chromatic electromagnetic radiation 
fitting into 1 cm, the common unit of 
cm“* in IR spectroscopy. 


Synonyms 

Fourier transform infrared spectroscopy; FTIR spec¬ 
troscopy; Infrared spectrometers; Infrared spectros¬ 
copy; Rapid scan FTIR; Sampling methods in 
infrared spectroscopy; Step scan FTIR 


Definitions 

The potential of IR spectroscopy in directly obtaining 
chemistry- and structure-related information from large 
biological macromolecules has found increasing appre¬ 
ciation in the last two decades. In the form of Fourier 
transform infrared (FTIR) spectroscopy, biophysical 
questions addressing the function of rather complex 
hiomolecules such as lipids, enzymes, or membrane 
proteins have been successfully answered. The entry 
covers some salient technical aspects of FTIR spectros¬ 
copy and describes measuring techniques specifically 
adapted to biological samples, typically implying aque¬ 
ous phases. The principles of static as well as time- 
resolved measurements are introduced. 


Introduction 

The analysis of biopolymers by infrared spectros¬ 
copy requires covering the mid infrared (MIR) 
region of wavenumber (4,000-400 cm“\ 
corresponding to 2.5-25 pm), where the vibrations 
of chemical groups cause IR absorptions at frequen¬ 
cies related to structural properties. The infrared- 
spectroscopic signature of functional transitions in 
biomolecules is extremely informative but frustrat- 
ingly small (0.05-0.1% absorption changes), requir¬ 
ing the high spectral resolution and signal to noise 
ratio (SNR) provided by modern Fourier transform 
infrared (FTIR) spectrometers. Today, FTIR spec¬ 
troscopy is used as a label-free nondestructive 
method to analyze mixtures of organic compounds 
and as a dedicated research tool to identify biomo- 
lecular reaction mechanisms ranging from nanosec¬ 
onds to hours. Different types of biophysical FTIR 
spectroscopy have emerged as a result of reconcil¬ 
ing the constraints inherent to the physics of IR 
absorption with the measuring conditions dictated 
by the biological samples. The often inevitable pres¬ 
ence of water in biological samples has long been 
an adverse factor due to its strong absorption in the 
MIR and a number of sampling strategies have 
solved this problem. It is adequate to dissect an 
overview on biophysical FTIR instrumentation into 
interferometer operation, that is, the instrument’s 
crucial part determining time and spectral resolu¬ 
tion, and “interfacing” of the IR beam to a specific 
biological sample. Commercial developments have 
focused on interferometer design and signal 
processing to optimize SNR and time resolution. 
Despite the commercial availability of diverse 
sample equipment, the “interfacing” to biological 
samples is still today driven by the increasingly 
complex systems studied in an academic environ¬ 
ment. Comprehensive surveys on FTIR theory and 
instrumentation have been published (Smith 1995; 
Griffiths and de Haseth 2007), biological applica¬ 
tions specifically addressed (Stuart 1997; Siebert 
and Hildebrandt 2008), and introductory material 
on opto-electronic IR components is available 
(Diggers et al. 1999). Laser-based pump probe 
IR techniques assessing the femtoseconds to nano¬ 
seconds time regime of biomolecules by dispersive 
methods are dealt with in separate entries. 
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IR Sources 

FTIR spectrometers commonly use a water-cooled 
Globar emitting maximally in the MIR when heated 
to ^1,300 K. IR Sources are not encapsulated due to 
limited heat resistance and IR transmittance of 
corresponding materials. Non-cooled ceramic mate¬ 
rials are available as well. Although operation in air 
prevents higher temperatures, the emitted energy den¬ 
sity yields a sufficient SNR in combination with 
a DTGS detector and 16-bit ADC. Higher energy 
density may be obtained with graphite if required. 
For the FIR, high-pressure mercury lamps are used 
with a quartz envelope which is transparent below 
100 cm~* (i.e., at wavelengths >10 pm). 

IR Detectors 

FTIR spectroscopy takes advantage of spreading the 
detector noise over all spectral elements that impinge 
on the detector simultaneously, rather than in sequence 
as in dispersive instruments. This “Fellget advantage” 
is optimal, when detector noise exceeds all other 
sources of noise and is independent of the signal 
intensity. 

Thermal Detectors 

These detectors transduce a change in temperature into 
an electrical signal. Temperature-sensitive conductors 
and semiconductors had been used in the past. How¬ 
ever, they exhibit response times larger than 
a millisecond which is too slow for modem fast scan¬ 
ning FTIR spectrometers which are now commonly 
equipped with pyroelectric detectors. The latter are 
made from ferroelectric materials exhibiting thermally 
induced changes of their electric polarization which 
can be detected in the millisecond time range through 
capacitive coupling. Deuterated triglycine sulfate 
(DTGS) and deuterated L-alanine-doped triglycine sul¬ 
fate (DLATGS) detectors have become standard 
equipments in FTIR spectrometers. Yet, their voltage 
responsivity at a modulation frequency / is propor¬ 
tional to \lf, requiring slow scanning speeds and 
preventing their use in time-resolved studies. 

Quantum Detectors 

Quantum detectors are typically p-n junction semicon¬ 
ductors. The absorption of an IR photon exceeding the 


band gap energy creates an electron hole pair used in 
photoconductive (PC) detectors to generate a change in 
electrical current or a change in voltage in photovoltaic 
(PV) detectors exhibiting ^tenfold faster responses 
(down to 10 ns). Both types are available as liquid 
nitrogen-cooled mercury cadmium telluride (MCT) 
elements (typically covering the 7,000-800 cm“* 
range). Unlike thermal detectors, the detectivity of 
MCT detectors increases with modulation frequency, 
and is generally 10-50 fold higher, rendering them 
ideal for time-resolved experiments and detection of 
low intensities. Signal intensity should be in the linear 
response of the detector or non-linearity corrections 
(supplied with the instrument software) applied to 
avoid spectral artifacts. 

The Michelson Interferometer 

FTIR instruments use interferometers to split the 
amplitude of a collimated IR beam into two beams 
that are recombined on the detector after having 
acquired an optical path difference (OPD). The 
Michelson interferometer uses a beam splitter (a film 
of poly ethylene terphtalate known as Mylar) and 
a movable mirror to generate an OPD in one interfer¬ 
ometer arm (Fig. 1). One mirror sweep samples a dou¬ 
ble-sided interferogram (IF) (covering negative and 
positive OPDs relative to the position of maximal 
constructive interference, the “center burst”) at the 
detector. The OPD is monitored by coupling a HeNe- 
laser beam collinearly with the IR beam into the inter¬ 
ferometer. The zero crossings (destructive interfer¬ 
ences) of the sinoidal laser IF measured 
simultaneously at a photodiode identify the OPD in 
multiples of half the laser wavelength, providing an 
internal calibration (Connes advantage) and a signal to 
digitize the IF at equidistant OPDs. The modulated IR 
intensity is processed by AC-coupIed electronics and 
a fast Fourier transform (FFT) retrieves the measured 
IR spectrum from the IF. Tilt between the interfering 
beams is compensated in most instruments by dynamic 
alignment (laser beams sample different regions of the 
interferometer mirrors to activate a feedback loop that 
re-aligns the fixed mirror). Spectral artifacts inherent 
to this method are avoided by a “true alignment” mir¬ 
ror in the moving interferometer arm (Fig. 2). Interfer¬ 
ometer and sample compartments are purged with dry 
air or evacuated to prevent absorption by water vapor. 
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Technical Aspects, 

Fig. 1 Michelson 
interferometer. The moving 
mirror modulates the IR 
intensity at the detector by 
alternating constructive and 
destructive interferences of all 
wavelengths. The modulated 
signal is amplified and 
digitized at the zero crossings 
of the laser IF measured at the 
photodiode. A large dynamic 
range of the ADC (typically 16 
bit) is required to resolve the 
intense center burst and the 
small modulations (insert) at 
larger OPDs which provide 
spectral resolution 


0 1000 2000 3000 4000 

sampling points 

If all zero crossings of the HeNe-laser IF are used 
for digitization of the IR IF, then Vn,ax = 15,800 cm“*. 
Even if the region above 4,000 cm“* is rarely of 
interest in biophysical spectroscopy, it is advantageous 
to use such a “dense” digitization because higher fre¬ 
quency noise will be spectrally separated, improving 
the SNR in the desired region below 4,000 cm“*. In 
general, frequencies above (Vn,ax-fmin) have to be 
physically absent. 

Static FTIR Spectroscopy 

Static spectroscopy determines the absorption of 
a sample at thermodynamic equilibrium. A reference 
and sample run are required to obtain an absorption 
spectrum. The moving mirror is scanned continuously 
at constant speed V over the distance AOPD^ax- IPs are 
recorded as a time-dependent detector signal (except 
for step scan mode). Thus, v^ax translates into 
a maximal data acquisition frequency 

/max = 2V • Vmax- (3) 

For V = 0.316 cm s“* a Vn,ax of 4,000 cm“^ corre¬ 
sponds to 1.25 kHz, the limit for detection by a DTGS 
detector. Scanning speed should be low with this 


Scanning Modes 

Spectra 5(v) are calculated analytically from the inter- 
ferogram /(x) as the Fourier transform: 


5(v) = 




( 1 ) 


where x is the OPD (minus to plus infinity). How¬ 
ever, only a limited number of sampling points is 
acquired and the maximal OPD which determines 
spectral resolution as Av = (OPD^ax) * is not infinite. 
Performing the discrete FT can be a critical source of 
error. Data acquisition parameters have to be chosen 
carefully to prevent spectral artifacts. The spacing of 
IF sampling points determines the highest 
wavenumber (or spectral bandwidth) to be cor¬ 
rectly represented after the FFT. According to the 
Nyquist theorem, the digitization wavenumber Vj 
(twice the wavenumber of the HeNe-Laser) has to 
comply with the relation: 


I'd > 2vmax, and in generalized form : 

Vd > 2(Vn,ax - Vmin). 


( 2 ) 


fixed mirror 
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Fourier Transform Infrared Spectroscopy for Biophysical 
Applications: Technical Aspects, Fig. 2 Optical layout of 
a modem FTIR instrument. The interferometer deviates from 
the 90° geometry to minimize polarization effects at the beam 
splitter (BMS). A “true alignment” mirror (asterisk) continu¬ 
ously keeps the beam on the fixed optical axis. Mirrors 


select MIR and near infrared (NIR) sources and detectors 
(Dl, D2). Apertures (APT) and optical filters (OF) are set by 
“wheels.” “IN” and “OUT” ports allow coupling to external 
instruments (microscopes, IR sources, synchrotron beam 
lines, etc.). Courtesy of Dr. Zachmann, Bruker optics. 
Ettlingen: Germany 


detector but the SNR of the center burst must not 
exceed the dynamic range of the ADC, otherwise sig¬ 
nal averaging will not improve the spectrum. MCT 
detectors allow higher scanning rates but should he 
used only when their higher sensitivity is required for 
low intensity signals. 

Rapid Scan Mode 

IFs can be obtained with fast scanning instruments in 
about 10-20 ms to follow reaction kinetics in the sub¬ 
second time range. Figure 3 shows how the continu¬ 
ously moving mirror cuts out “slices” (red traces) of 
a time-dependent IF (yellow surface), approximating 
the succession of “momentary” IFs when data acqui¬ 
sition is significantly faster than the reaction kinetics. 
Processing of accordingly high modulation frequen¬ 
cies in the IR signal necessitates quantum detectors. 
Instrumental software synchronizes the onset of data 
collection with triggering of a physical perturbation of 
the sample. A single reaction activation (indicated hy 
the red asterisk) precedes the acquisition of 
a sufficient number of IFs. Time resolution can be 


enhanced by exploiting both back and forward move¬ 
ments of the mirror for collection of individual spec¬ 
tra. Reduction of spectral resolution translates into 
shortening of OPDmax, thereby, also increasing time 
resolution. Ideally, reactions are initiated by a laser 
pulse as in photosensitive biomolecules. Reactions 
can be made photosensitive by employing caged 
“compounds,” that is, chemicals that become reactive 
only after a photolytic process, for example, in IR 
studies of ATPases (see ► ATPase: Overview). Alter¬ 
natively, reactions can be initiated by rapid mixing 
using Stopped Flow Methods (e.g., George et al. 
2000 ). 

Step Scan Mode 

Reaction kinetics and mirror movement are fully 
uncoupled. At each digitization point, the mirror posi¬ 
tion is stabilized to nanometer precision before 
a reaction is initiated. Stability of the mirror position 
is crucial and step scan interferometers operate ideally 
in vacuum to prevent acoustic vibrations. The detector 
output is recorded at the desired time resolution and 
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Fourier Transform Infrared Spectroscopy for Biophysical 
Applications: Technical Aspects, Fig. 3 Concept of rapid 
scan and step scan FTIR spectroscopy. The 3D plot represents 
the time-dependent change of the IF during a (photo) chemical 
reaction. If scanning is faster than the reaction initiated at time 
r = 0 (red asterisk), quasi-stationary IFs are sampled along the 


split into DC and AC components. The DC signal is 
used to determine the real and imaginary part of the 
FFT required to compute the FFT of the AC part which 
reflects only the temporal absorption change. The blue 
traces in Fig. 3 exemplify the data collection. The 
reaction must be highly reproducible. The IF at 
a given time is constructed from the two-dimensional 
data set (blue traces). Reaction efficiencies have to be 
independently monitored to scale the IR signals at each 
mirror position to an identical amount of reacted mate¬ 
rial. Otherwise, oscillations are introduced along the 
OPD axis and “misinterpreted” as interference pat¬ 
terns. The time resolution is determined by the detector 
reaching 100 ns to 1 ps and tenths of ns with a PC and 
PV MCTs, respectively. The inherently large data sets 
can be reduced by (1) restricting the spectral band 
width Eq. 2 and (2) by employing logarithmic rather 
than linear sampling times. 

Only a few biological samples exhibit the required 
reproducible reactivity. The bacterial proton pump 
bacteriorhodopsin is the most prominent one, where 
a photoreaction can be triggered by a laser and relax¬ 
ation is fast enough (< 1 s) to allow stepping and signal 
averaging in a total experimental time of several hours 


red trace (“duty” time marked by the black arrows, green arrows 
cover the “dead” time needed for mirror turning and re-acceler- 
ation). If the reaction is too fast to acquire “instantaneous” IFs, 
the reaction has to be initiated at each sampling point in the step 
scan mode (blue asterisks). IFs are reconstructed from the IR 
signals for each time point 


to resolve the nanoseconds to microseconds time 
range. Such studies have largely motivated the devel¬ 
opment and improvement of step scan instrumentation 
in the field of IR of protein reactions as described 
(Gerwert 2001) and the technique has been applied to 
bacterial electron transfer chains (e.g., Pinakoulaki 
et al. 2002). 

Stroboscopic Mode 

Stroboscopic modes for sub millisecond time-resolved 
FTIR spectroscopy are rarely used today. Unlike step 
scanning, the interferometer scans continuously but the 
reaction is triggered multiple times during a single 
scan (in Fig. 3, the asterisks would lie in the x, y 
plane rather than being aligned along the OPD axis 
only). Using different delays between the initiation of 
a reaction and the data acquisition, a set of IFs can be 
recorded and re-sorted to generate IFs in which the 
reaction has proceeded to the identical degree at any 
given OPD. However, the accessible reaction kinetics 
must be compatible with IF sampling rates defined by 
the mirror velocity, and the impracticable normaliza¬ 
tion to equal reaction efficiency renders this method 
less attractive. 
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Fourier Transform Infrared Spectroscopy for Biophysical 
Applications: Technical Aspects, Fig. 4 A commercial IR 
flow cell (Microbiolytics, Esslingen, Germany) employing two 
~10 pm-spaced CaF 2 windows attached to a thermal jacket. 
Liquids are applied with a syringe (top) and exit through HPLC 
tubings (bottom) 


Interfacing to Biological Samples 

Transmission Spectroscopy of Thin Films of 
Biological Molecules 

KBr pellets used for IR transmission studies of inor¬ 
ganic powders are not suited for biological samples. 
Transmission studies can be done with thin 
(10-100 pm) films of biological material on 
IR-transparent supports such as CaF 2 , AgCl, ZnSe, 
BaF 2 , or diamond in sealed temperature-(hydration-) 
controlled sample holders. If liquid water is required, 
flow cells of < 10 pm optical path are available (Fig. 4). 

IR Reflection Absorption Spectroscopy (IRRAS) 

Much interest has evolved in self-assembling pro¬ 
cesses of biomolecules on planar surfaces which can 


be studied by IRRAS. On metallic surfaces, IR absorp¬ 
tion occurs virtually exclusively for p-polarized light 
(the electric field vector in the plane of incidence) at 
large angles of incidence. Spectra are recorded at graz¬ 
ing incidence over large sample areas (Fig. 5). The 
little absorbed s-polarized IR beam can be exploited 
to measure a reference spectrum without actually 
removing the sample. Fast switching between s- and 
p-polarization can be accomplished by introducing 
a photoelastic modulator (PEM). Lock-in electronics 
separate the ^70 kHz modulated IR signal from the 
slow modulation caused by the interferometer. For 
lipids and surfactants water is the preferred substrate. 
Reference spectra are measured on a pure water sur¬ 
face using motorized probe and reference troughs. 
Details of the general investigation of ultra-thin films 
by IR spectroscopy have been published (Tolstoy et al. 
2002 ). 

Attenuated Total Reflection 

The presence of liquid water requires small optical 
path lengths in transmission as H 2 O above 10 pm 
thickness causes too large IR intensity losses. Atten¬ 
uated total reflectance (ATR) allows measuring the 
absorption of biomolecules adsorbed on an IR- 
transmissive material in contact with bulk water. 
Here, the IR beam is internally reflected in the 
supporting ATR crystal (e.g., ZnSe, Si, Ge, dia¬ 
mond). Only the non-propagating evanescent field 
interacts with the sample with a penetration depth 
d of the exponential decay of IR intensity E into the 
bulk phase (at a distance z from the ATR crystal 
surface): 

E = Eq^T-I'^ with 

27i(ni^sin^6 — n2^) ^ 

where 1 is the wavelength of IR radiation and ni and 
n 2 the refractive indices of the ATR crystal and the 
bulk material, respectively. The angle of internal total 
reflection is given by 6 (measured with respect to the 
normal). ATR units have been scaled down from 
rather long trapezoidal to small circular cells 
(Fig. 6). Additionally to the independence from 
micrometer spacings, non-interfering manipulations 
in the solvent can be performed rendering ATR prob¬ 
ably the most attractive sample interface in biological 
FTIR spectroscopy. It has been coupled to 
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Fourier Transform Infrared Spectroscopy for Biophysical 
Applications: Technical Aspects, Fig. 5 Schematic of 
IRRAS at grazing incidence. Upper panel: polarized IR light 
becomes modulated by a PEM such that the transmitted light 
switches between p- and s-polarization from which sample and 


reference spectra are obtained, respectively, from a single sys¬ 
tem. Lower panel: sample and reference spectra of biomolecules 
spread on water are measured with p-polarized IR light reflected 
from different troughs 


diamond ATR crystal 



purge gas connection IR entrance 


b 

Dialysis membrane 



Fourier Transform Infrared Spectroscopy for Biophysical 
Applications: Technical Aspects, Fig. 6 A diamond ATR- 
cell (Resultec, Illerkirchberg, Germany) showing the central mea¬ 
suring area (5 mm diameter), the entrance aperture, and the port to 


purge the in-coupling optics with dry air. The scheme to the right 
shows a dialysis-coupled ATR set up used to study binding 
reactions at biomolecular surfaces (e.g., Fahmy 2002) for which 
commercial equipment is available as well 


electrochemistry (Ataka and Heberle 2007) and sur¬ 
face enhanced IR, periodic perturbation of physico¬ 
chemical solution parameters (Fringeli et al. 2001), 
and to the excitation of sample fluorescence for cross¬ 
correlation with the simultaneously recorded IR 
absorption in time-dependent experiments (Lehmann 
et al. 2007). 


Summary 

FTIR spectroscopy has become an important nonde¬ 
structive tool in gathering structural information of 
biological macromolecules at atomic resolution and 
under functional conditions. Modem instrumentation 
allows recording high resolution IR spectra of 
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biomolecules in liquids, thin films, and adsorbed 
monolayers without chemical modihcation. The time 
course of structural changes of biomolecules can be 
followed easily down to ^20 ms time resolution with 
rapidly scanning interferometers. If such reactions are 
highly reproducible, molecular mechanisms can be 
studied at a time resolution down to nanoseconds by 
step scan interferometers. Despite the advanced user- 
friendliness in operating modern FTIR spectrometers, 
the use of the adequate detector type, the correct 
adjustment of the signal to noise ratio, the setting of 
optical and electronic filters to physically restrict the 
band width and the corresponding choice of the inter¬ 
ferometer scanning speed are crucial parameters in the 
hand of the user. Their prudential use is essential for 
gaining high quality spectra by proper signal averaging 
procedures in static as well as time-resolved experi¬ 
ments and for avoiding spectral artifacts inherent to 
improper sampling of both the optical path and the 
intensity of the IR interferogram. 

Cross-References 

► ATPase: Overview 

► Electron Transport Chains in Bacteria 

► Infrared Spectroscopy of Protein Structure 

► Optical Spectroscopy: Future Fourier 

► Reaction-Induced Infrared Difference Spectroscopy 

► Stopped-Flow Techniques 
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Definition 

The development of the SAR by NMR technique in 
1996 by Stephen Fesik at Abbot Laboratories resulted 
in a tremendous expansion in NMR ligand affinity 
screens throughout the pharmaceutical industry (Shuker 
et al. 1996). One important enhancement has been the 
design of the chemical library used in these NMR 
screens. Instead of a library of random compounds, 
screening libraries were constructed to contain fragments 
of known dmgs. This was first demonstrated with the 
SHAPES library designed by Jonathan Moore at Vertex 
Pharmaceuticals (Fejzo et al. 1999). Fragment-based 
screens utilize a structurally diverse chemical library of 
low molecular-weight compounds (<200-300 Da) that 
correspond to fragments of known dmgs or have drug¬ 
like characteristics by adhering to Lipinski’s mle of five 
and exhibiting high aqueous solubility (Lipinski et al. 
2001, Lipinski 2004). A fragment library is typically 
small (<10^ compounds) making it amenable to an 
NMR high-throughput screen (HTS) (Bautin et al. 
2004). But, it also takes advantage of NMR’s sensitivity 
to identify low (pM to mM) affinity ligands. As a result, 
fragment screens have a 10 to 1,000 times higher hit rate 
relative to traditional HTS (Hajduk et al. 201 1). A variety 
of one-dimensional (ID) and two-dimensional (2D) 
NMR experiments are used to identify binders, to iden¬ 
tify the ligand binding site, and to assist in “growing” the 
fragments to improve binding affinity (Powers 2009). 
Maximizing ligand efficiency, the number of 
nonhydrogen atoms relative to free energy of binding, 
is the most important advantage of fragment screens 
(Reynolds et al. 2007). Other advantages include 
improving coverage of chemical space, higher quality 
of leads, direct observation of biologically relevant inter¬ 
actions, and a universal assay design. 
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FRAP 

► Fluorescence Recovery After Photobleaching 


Free Radical Damage 

► Oxidative Stress 


Freeze-Substitution 

► Electron Microscopy: Cryo-Preparation 


Cross-References 

► NMR Frequency Doubling Microscopy 

► NMR in Drug Discovery - Introduction 

► SAR by NMR ► Second Harmonic Generation Microscopy (SHG) 
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► Femtosecond Stimulated Raman Spectroscopy 
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FTIR Spectroscopy 

► Fourier Transform Infrared Spectroscopy for Bio¬ 
physical Applications: Technical Aspects 

► Infrared Spectroscopy of Cells, Tissues, and 
Biofluids 

► Infrared Spectroscopy of Membrane Lipids 

► Infrared Spectroscopy of Protein Structure 

► Infrared Spectroscopy: Data Analysis 

► Polarized Infrared Spectroscopy 

► Rhodopsins - Intramembrane Signaling by Hydro¬ 
gen Bonding 


F-Type ATPase 

► ATP Synthase Structure 


FucP 

► FucP, the L-Fucose-H"^ Membrane Transport Pro¬ 
tein and Related Transporters 


FucP, the L-Fucose-H^ Membrane 
Transport Protein and Related 
Transporters 

Nieng Yan 

School of Medicine, Tsinghua University, 

Beijing, China 

Synonyms 

FucP; L-fucose:proton symporter 

Definition: Based on Primary Sequence 
Similarity and Biological Function 

The L-fucose:proton symport protein, FucP (TC # 2. 
A. 1.7.1, http://www.tcdb.org/search/result.php?tc=2. 
A. 1.7.1), is the product of the /mcP gene in Escherichia 


coli. The FucP protein and its homologues are further 
classified to the Fucose:H'^ Symporter (FHS) family 
(TC # 2.A. 1.7) (http://www.tcdb.org/search/result. 
php?tc=2.A.1.7#2.A.1.7) based on primary sequence 
similarity and biological function (Saier 2000); the 
FHS family is a member of the major facilitator super¬ 
family (MFS). FucP and its homologues transport 
sugars like glucose, galactose, mannose, etc., in 
a proton or Na'^-dependent manner: 

H+/Na+(out)-l-Sugar(out)^ Sugar(in)+H+/Na+(in) 

Representative members of the FHS family are as 
follows: 

2. A. 1.7.1: FucP - L-fucose:H'^ symporter in 

Escherichia coli 

2.A. 1.7.2: Ggp - glucose/galactose porter in Brucella 
abortus 

2.A.1.7.3: GlcP - glucose/mannose:H'^ in Bacillus 
suhtilis 

2.A.1.7.4: NaGLTl - Na^dependent methyl a- 
glucoside transporter in the kidney of rats 
2.A.1.7.5: DeoP - 2-Deoxy-D-ribose porter in Salmo¬ 
nella typhimurium 

2.A. 1.7.6: ScrT - putative sucrose transporter in 
Shewanella frigidimarina 

2.A.1.7.7: HP1174 - the Na'^-dependent sugar 
transporter of Helicobacter pylori 
2.A. 1.7.8: NagP - the putative A-acetylglucosamine 
porter in Shewanella oneidensis 
2.A. 1.7.9: AgaP - the putative A-acetylgalactosamine 
porter in Shewanella amazonensis 
2.A.1.7.10: GlcP - the putative glucose porter 
in Shewanella amazonensis 
2.A.1.7.11: ManP - the putative mannose porter in 
Shewanella amazonensis 

2.A.1.7.12: TreT - the putative trehalose porter in 
Shewanella frigidimarina 

Definition: Based on Structural Similarity 

MFS transporters may contain 12, 14, or 24 transmem¬ 
brane (TM) segments. Crystal structures of five MFS 
transporters are available [LacY (Abramson et al. 2003), 
GlpT (Huang et al. 2003), EmrD (Yin et al. 2006), FucP 
(Dang et al. 2010), and PepTso (Newstead et al. 2010)]. 
They all exhibit an identical structural fold of 12 TMs 
that is arranged into two halves - namely the N and 
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C domains. The two domains, with both N- and 
C-termini located on the cytoplasmic side, share 
a similar fold with a pseudo twofold symmetry 
axis that is perpendicular to the membrane bilayer. 
In addition to crystal structure, the projection 
structure of OxlT was reported (Heymann et al. 2001). 
Except for PepTso, the MFS members whose 
structures were determined can be found in the TCDB 
as follows: 

2.A.1.2.9: EmrD - (Hydrophobic uncoupler, e.g., CCCP, 
benzalkonium, SDS): antiporter in Escherichia 

coli 

2.A.1.4.3: GlpT - Glycerol-P:Pi antiporter 

in Escherichia coli 

2.A.1.5.2: LacY - Lactose symporter in Escherichia 
coli 

2.A. 1.7.1: FucP - L-fucose:H’^ symporter in Escherichia 
coli 

2.A. 1.11.1: OxlT - the oxalate:formate antiporter 
in Oxalohacter formigenes 

Introduction 

Proton-Dependent Transport Activity of FucP 

In intact cells of E. coli, transport of L-fucose was 
associated with the coupled movement of protons 
(Bradley et al. 1987; Gunn et al. 1994). The transport 
activity of the purified FucP protein was further 
examined through a liposome-based counterflow 
assay (Dang et al. 2010), in which FucP exhibited 
a pH-dependent activity also consistent with a fucose: 
symport mechanism (Fig. 1). 

Binding Affinity with i-Fucose 

The binding affinity between FucP and L-fucose 
was measured in detergent solution by isothermal titra¬ 
tion calorimetry (ITC). The affinity was affected by 
the presence of detergents. For example, in the 
presence of 0.4% (w/v) P-NG, the binding affinity was 
not detectable. In the presence of Cymal-7, the Kj value 
was approximately 0.47 ± 0.02 mM (Fig. 2). 

Crystal Structure of FucP in an 
Outward-Open Conformation 

The crystal structure of FucP was determined at 3.15 A 
resolution (Dang et al. 2010). The N and C domains of 



FucP, the L-Fucose-H^ Membrane Transport Protein and 
Related Transporters. Fig. 1 The transport activity of FucP is 
pH-dependent in the liposome-based counterflow assay 


L-fucose -I- WT (cymal-7) 



Molar Ratio 

FucP, the L-Fucose-H^ Membrane Transport Protein and 
Related Transporters, Fig. 2 Binding affinity between 
L-fucose and FucP measured by ITC, in which FucP, purified 
in 0.02% Cymal-7, is titrated with L-fucose 


FucP can be superimposed with a root-mean-squared 
deviation (rmsd) of 2.97 A over 138 Ca atoms. The 
N domain and C domain each comprise a pair of 
internal structural repeats, namely TMs 1/2/3 and 
TMs 4/5/6 in the N domain, and TMs 7/8/9 and TMs 
10/11/12 in the C domain. The two repeats within each 
domain, related by an approximate 180° rotation 
around an axis parallel to the membrane bilayer, are 
intertwined and give rise to two 3-helix bundles. 
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FucP, the L-Fucose-H’^ 
Membrane Transport 
Protein and Related 
Transporters, Fig. 3 FucP 
exhibits an outward-open 
conformation in the crystal 
structure determined at 3.15 A 
(From Dang et al. (2010). 
With permission) 




55 A 


TM6 


Periplasm 


20 A 


30 A 


30 A 


Cytoplasm 


For example, the N domain comprises two bundles, 
TMs 1/5/6 and TMs 4/2/3, with TMl and TM4 
swapped from the aforementioned internal structural 
repeats. 

FucP in the crystal structure exhibits an outward- 
open conformation. A central cavity, which measures 
approximately 20 A in depth and 10 A in diameter at 
the periplasmic side, is surrounded by TMs 1, 2,4, 5 of 
the N domain and TMs 7, 8, 10, 11 of the C domain. 
This cavity is likely an important part of the transport 
path for substrate molecules (Fig. 3). 

Transport Path 

Despite their similar fold, the N and C domains display 
contrasting surface features, giving rise to an amphi- 
pathic cavity. The N domain has an elongated strip of 
negatively charged amino acids along the central 
cavity, whereas the C domain contains a hydrophobic 
patch in the cavity, capped by positively charged 


residues on the periplasmic as well as the cytoplasmic 
sides (Fig. 4). 

The cavity-facing side of the N domain is enriched by 
Asn and Gin residues, with 14 Asn/Gln and 4 Asp/Glu 
residues. Nearly all of these residues located in the 
middle portion of the transport path are conserved, 
including Asn45 and Asp46 of TMl, Gln67 of TM2, 
Glul35 and Asnl39 of TM4, and Asnl56, Glnl59, and 
Asn 162 of TMS. In contrast to the N domain, residues of 
the C domain that line the central cavity are mostly 
hydrophobic, including Val271, Ile282 and Val286 
of TM7, Ala294, Ala298, Leu301, Met305, Phe308 of 
TM8, Pro366 and Phe369 of TMIO, Val388 and Ile391 
of TMl 1. The hydrophobic surface patch is sandwiched 
by two prominent Arg residues, Arg283 at the 
C-terminal portion of TM7 and Arg312 of TM8. In 
particular, the highly conserved Arg312 is surrounded 
by three conserved, polar amino acids, Tyr270 and 
Gln274 of TM7 and Tyr365 of TMIO. The side chains 
of these residues point into the cavity, suggesting an 
important role in the transport activity of FucP. 
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FucP, the L-Fucose-H’^ 
Membrane Transport 
Protein and Related 
Transporters, Fig. 4 The 

cavity-facing sides of the 
N and C domains exhibit 
contrasting surface 
electrostatic potentials. The 
surface electrostatic potential 
was calculated for the N and 
C domains (From Dang et al. 
(2010). With permission) 



Notably, Asp46 of TMl and Glul35 of TM4, 
located on the corresponding TMs of two internal 
structural repeats, point into the cavity. They are the 
only two residues along the transport path that 
are capable of protonation and deprotonation within 
physiological pH values. We speculate that, given their 
ability to be protonated or deprotonated in response 
to the concentration changes of the microenviron¬ 
ment, these two amino acids may play an essential 
role in the symport activity of FucP, perhaps serving 
as sensors and carriers (Fig. 5). 

Asp46 and Glu135 Play an Essential Role in 
L-Fucose: Symport 

The essential roles of the Asp46 and Glul35 residues 
in FucP were manifested by both in vivo uptake and 
in vitro counterflow experiments. Replacement of 
Asp46 by Ala or Asn, or Glul35 by Ala, Gin, or Asp 
resulted in complete abrogation of FucP-mediated 
active transport of L-fucose (Fig. 6). 

In the liposome-based counterflow assay, which does 
not rely on a proton gradient for substrate exchange, 
D46A or D46N showed a transport activity that is two- 


to threefold higher than that of the WT FucP, whereas 
Glul35 mutants exhibited severely compromised activ¬ 
ities. These observations indicate that, although both 
Asp46 and Glul35 are essential for the active transport 
activity of FucP, they have distinct biochemical roles. In 
addition, neutralization of Asp46 by Asn or Ala rendered 
FucP insensitive to pH changes in the counterflow assay. 
The behavior of Asp46 is reminiscent of Glu325 of Lac Y 
(Franco and Brooker 1994; Guan and Kaback 2006). 
These observations suggest that Asp46 is essential in 
proton-dependent active transport, whereas Glul35 
might be involved in additional processes such as sub¬ 
strate recognition. In support of this notion, mutation of 
Glul35 led to abrogation of interaction between L-fucose 
and FucP (Fig. 7). 

A Working Model of an Alternating Access 
Mechanism for the Coupling of Proton 
Translocation and i-Fucose Transport 

Based on the structural and biochemical analyses, 
a working model was suggested on how FucP may 
mediate the symport of L-fucose and proton (Fig. 8). 
In this model, Asp46 and Glul35 play an essential role. 
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FucP, the L-Fucose-H’^ 
Membrane Transport 
Protein and Related 
Transporters, Fig. 5 The 

polar or charged residues 
lining the cavity from the 
N and C domains (From Dang 
et al. (2010). With permission) 
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FucP, the L-Fucose-H^ Membrane Transport Protein and 
Related Transporters, Fig. 6 The uptake of L-fucose by 
E. coli BL21(DE3) transformed with FucP variants. Residues 
Asp46 and Glul35 are essential for the active transport of FucP 
(From Dang et al. (2010). With permission) 

coupling proton translocation and substrate recogni¬ 
tion. L-fucose can only bind to FucP following the 
protonation of Asp46, where the neutralizes the 
negative charge of Asp46 and lowers the energetic 


Uptake of L-Fucose by Intact cell 



FucP, the L-Fucose-H^ Membrane Transport Protein and 
Related Transporters, Fig. 7 Transport activity of FucP 
variants in the counterflow assay (From Dang et al. (2010). 
With permission) 


barrier for L-fucose entry and/or transport. Given the 
locations of Asp46 and Glul35, translocation of the 
proton presumably involves a proton relay mediated by 
structural water molecules and other polar and/or 
charged amino acids in FucP. The presence of the 
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protonation of D46 
upload of substrate 



Outward open, apo 


rigid body rotation 
of N and C domains 



outward open, substrate bound 


outward open, substrate bound 


rigid body rotation 
of N and C domains 



inward open, apo 


release of substrate 


deprotonation of El 35 



inward open, substrate bound 


FucP, the L-Fucose-H^ Membrane Transport Protein and Related Transporters, Fig. 8 A working model depicting an 
alternating access mechanism of FucP-mediated L-fucoseiH"^ symport (From Dang et al. (2010). With permission) 


substrate L-fucose may also facilitate the proton relay 
by coordinating waters. Substrate binding and proton¬ 
ation of Glul35 is likely to trigger a conformational 
switch that results in the inward-open conformation of 
FucP, followed by release of the substrate and 
the deprotonation of Glul35. The open-inward 
and empty protein, which can be modeled onto the 
structure of LacY (Abramson et al. 2003; Guan 
and Kaback 2006; Dang et al. 2010), can then revert 
to the open-out empty conformation and the cycle of 
events be repeated to complete an alternating access 
mechanism of the transport process. 
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Sydney, NSW, Australia 

Synonyms 

Biologically active amyloids 

Definition 

Functional amyloid is a fibrillar protein deposit with 
the structural, morphological, and tinctorial properties 
of amyloid fibrils but whose production or application 
is beneficial to the organism and not associated with 
disease. It is now recognized that the self-assembly 
properties of amyloid fibrils, and the stability and 
resistance to proteolysis of the amyloid structure, are 
applied for biological functions including production 
of protective coatings, development of mechanical 
strength, control of cytotoxic intermediates, transmis¬ 
sion of information, and protein localization (Maury 
2009). Many microorganisms produce functional amy¬ 
loid structures and these are commonly employed in 
the coating of aerial structures or in mediating adhe¬ 
sion (Otzen and Nielsen 2008). Filamentous fungi 
produce class I hydrophobins which are small proteins 
which self-assemble into amyloid-like rodlets at 
hydrophobic-hydrophilic interfaces and reduce sur¬ 
face tension. In addition, hydrophobins form amphi- 
pathic monolayers on spores and on gas-exchange 
surfaces, rendering them resistant to wetting and 
waterlogging. The amyloid layer protects the spore 
from the host immune response and can play a role in 
infection. The filamentous bacterium Streptomyces 
secretes a family of related proteins known as chaplins, 
which are also able to form an amphipathic film com¬ 
posed of amyloid fibrils at an air-water interface and 
reduce surface tension, facilitating growth of aerial 
hyphae. Insects and fish use functional amyloid fibrils 
formed from chorion proteins to produce a protective 
coating around eggs and larvae. Bacteria such as Sal¬ 
monella and Escherichia coli express proteins known 
as curli which form extracellular amyloid structures 


that facilitate attachment and colonization. Amyloid 
may play a role in adhesion and biofilm formation in 
nonfilamentous yeast, with the recognition of fibril 
formation by Als adhesins from Candida albicans. 
Amyloid deposits also contribute mechanical strength 
to some natural adhesives produced by algae. Func¬ 
tional amyloid is central to the yeast prion systems, 
including Sup35 and URE2p in Saccharomyces 
cerevisiae and HET-s in the fungus Podospora, 
where memory is encoded in the self-replicating amy¬ 
loid structure. The formation of amyloid by these pro¬ 
teins, within the cytoplasm, gives rise to distinct 
phenotypes, dependent on the function of the soluble 
form of the protein involved (Greenwald and Riek 
2010). The prion state is controlled by chaperones 
and is inherited in an epigenetic, non-Mendelian man¬ 
ner. In mammals, amyloid fibrils formed by a fragment 
of the PMell? protein bind precursors of melanin, 
templating and accelerating the synthesis of melanin 
and trapping cytotoxic intermediates in melanosomes. 
Some human secretory peptides are also sequestered in 
an amyloid fibrillar form in granules. A functional 
prion-like form of CPEB may be important in the 
maintenance of synaptic changes associated with 
memory. Recognition of the amyloid structural motif 
has also been implicated in hemostasis, where tPA and 
factor XII bind to, and are activated by, misfolded 
proteins with the cross-P structure. The cross-P amy¬ 
loid structure is conserved and utilized by a wide range 
of organisms for a variety of functions. Eormation and 
application of functional amyloid must be well regu¬ 
lated in order to avoid adverse, disease-associated con¬ 
sequences and it appears to be controlled by a range of 
different mechanisms, including chaperone activity 
and very rapid fibrillogenesis in discrete organelles. 

Cross-References 

► Amyloid Formation 

► Amyloid Formation in Bacteria 

► Amyloid Inhibitors 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 
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Synonyms 

Artificial lipid bilayer; DIB; Droplet-on-hydrogel 
bilayer (DHB) 

Definition 

A droplet interface bilayer (DIB) is a lipid bilayer 
formed between two monolayer-coated aqueous drop¬ 
lets in an oil phase. Droplet-on-hydrogel bilayers 
(DHBs) can also be formed by a similar process. The 
bilayers can include membrane proteins or stimulus- 
sensitive lipids. DIBs can be used for single-molecule 
studies of membrane proteins, and for the construction 
of functional droplet networks. Droplet networks can 
be stabilized in bulk aqueous solution by encapsulation 
within small oil drops. 

Introduction 

An aqueous droplet in a solution of lipids in oil 
acquires a lipid monolayer coat, and two such droplets 
brought into proximity adhere to form a droplet inter¬ 
face bilayer (DIB) (Fig. la). DIBs in various configu¬ 
rations (Fig. Ib-c) are useful platforms for the study of 
membrane proteins through electrical and optical mea¬ 
surements. Further, networks of droplets joined by 
DIBs that contain membrane proteins or stimulus- 
sensitive lipids can be assembled to form functional 
devices (Fig. Id). Droplet networks can also be stabi¬ 
lized in bulk aqueous solution by encapsulation within 


small oil drops (Fig. 2), enabling prospective medical 
applications. 

Various functional DIB systems developed over the 
last five years by the Bayley group and others are 
outlined here. The systems are presented according to 
the type of aqueous phases involved, the means by 
which the monolayers are brought into contact to 
form a bilayer, the type of oil used, and the types of 
participating lipids and membrane proteins. Where 
possible, each system is illustrated with applications 
from the literature. 

Aqueous Phase 

The use of aqueous solutions and hydrogels for DIB 
formation are considered separately. 

Aqueous Solutions 

The aqueous phases that support a DIB may be open, in 
that at least one of the aqueous volumes is exposed to 
air. This configuration was used in the earliest DIB- 
like system known to the authors (Tsofina et al. 1966). 
Alternatively, the aqueous phases may be confined 
within channels (Malmstadt et al. 2006), usually in 
a microfluidic device (Funakoshi et al. 2006). More 
typically, the aqueous phases are closed, or entirely 
surrounded by the oil phase, providing greatly 
enhanced versatility (Fig. 1). Droplets joined by DIBs 
can move relative to each other, and can be detached 
entirely and rejoined (Holden et al. 2007). The revers¬ 
ibility of bilayer formation is useful in screening appli¬ 
cations (Holden et al. 2007; Syeda et al. 2008) and in 
the formation of reconfigurable networks (Holden 
et al. 2007), both of which are discussed below. 
Although it has been known for some time that bilayers 
form between droplets in certain water-in-oil emul¬ 
sions (Poulin and Bibette 1998), the functionalization 
of DIBs with membrane proteins or lipids is a recent 
development (Funakoshi et al. 2006; Holden et al. 
2007). 

A large variety of techniques exists for droplet 
production, and only the most pertinent are outlined 
here. Perhaps the simplest means of producing a great 
number of droplets is bulk emulsification of an aque¬ 
ous phase in oil (Poulin and Bibette 1998); however, 
the droplets so produced are highly polydisperse, 
whereas most applications of DIBs require droplets 
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Functional Droplet Interface Bilayers, Fig. 1 Droplet inter¬ 
face hilayers (DIBs) in hulk oil. (a) An aqueous droplet in 
a solution of lipids in oil becomes coated with a lipid mono- 
layer, and two such droplets brought into proximity form 
a droplet interface bilayer (DIB) (Holden et al. 2007). 
(b) Hypothetical chip to screen membrane proteins against 
multiple analytes in parallel using an array of DIBs. The red 
droplets on the bottom plate contain the various analytes, and 
are connected to a common electrical ground. Each of the blue 
droplets on the top plate contains the membrane protein, and is 


of a well-defined size. DIBs have been made between 
small numbers of droplets, with diameters >100 pm, 
that were produced by manual (Holden et al. 2007) or 
robotic pipetting (Poulos et al. 2009a; Stanley et al. 
2010). The production of many monodisperse droplets, 
or of droplets significantly smaller than 100 pm in 
diameter, requires the use of microfluidics. 

Microfluidic droplet generation is typically 
achieved by either T-junction breakup, wherein an 
aqueous flow is broken off into droplets by 
a perpendicular oil flow (Garstecki et al. 2006), or 
flow-focusing, wherein an aqueous flow is pinched 
off into droplets by a surrounding oil flow when forced 
through a constriction (Anna et al. 2003). These mech¬ 
anisms can produce several thousand droplets per sec¬ 
ond (Teh et al. 2008), and require the maintenance of 


individually electrically addressed. The interactions of the 
membrane protein with each analyte are monitored by electri¬ 
cal recording, (c) Droplet-on-hydrogel bilayers can be made on 
a thin, optically flat substrate, allowing the visualization 
and functional assay of membrane proteins in the bilayer by 
TIRE microscopy (Thompson et al. 2007; Heron et al. 2009). 
(d) Droplets can be joined to form DIB networks that make use 
of protein nanopores to function as batteries, light sensors, and 
simple electrical devices (Holden et al. 2007; Maglia et al. 
2009) 


constant high pressure in the microfluidic channels. 
Although droplets made by ► continuous-flow 
microfluidics (► Continuous Flow) are typically 
uncoated, or coated with surfactant (Surfactants) 
monolayers, it should be possible to create lipid-coated 
droplets in the same way (Bai et al. 2010). 

By contrast, drop-on-demand (DOD) techniques do 
not require constant high pressures, and allow the 
production of droplets in precisely controlled numbers. 
A common DOD mechanism creates a transient pres¬ 
sure pulse by vaporization or piezoelectric actuation of 
the liquid, which forces the liquid out of a reservoir 
through a fine aperture. This method is employed by 
inkjet print-heads, and can also be used for the produc¬ 
tion of droplets for network formation (Villar et al. 
2011). Droplet production can also be achieved by 
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electrical stimulation through electrowetting-on- 
dielectric (EWOD), which produces an imbalance of 
interfacial tensions (Pollack et al. 2002), or dielectro¬ 
phoresis, which can draw droplets from larger aqueous 
volumes in nonuniform electric fields (Jones et al. 
2001 ). 

Hydrogels 

Lipid bilayers have been formed between droplets of 
aqueous solution and a wide range of hydrogels, 
including agarose and polyacrylamide (Heron et al. 
2007). It is also possible, by the use of photocurable 
polymers, to trigger gelation of the aqueous volumes 
following bilayer formation (Sarles et al. 2010). While 
unconstrained aqueous droplets joined by a DIB adopt 
spherical cap geometries (Villar et al. 2011), the 
hydrogel used to form a droplet-on-hydrogel bilayer 
(DHB) can be made to have many other geometries 
(Sapra T, unpublished 2010). In the original demon¬ 
stration of a DHB system by the Wallace group, 
a bilayer was formed between a droplet and an elec¬ 
trophoresis gel that contained several types of mem¬ 
brane protein (Heron et al. 2007). Electrical recording 
(► Patch-Clamp Recording of Single Channel Activ¬ 
ity: Acquisition and Analysis) of the ionic current 
across the bilayer allowed the in situ identification of 
membrane proteins at the single-molecule level. 

Notably, the hydrogel can instead be applied as 
a thin coating on glass, which allows the imaging of 
individual fluorescently tagged membrane proteins in 
a DHB by ► total internal reflection fluorescence 
(TIRE) microscopy (► Total Internal Reflection Eluo- 
rescence Microscopy for Single-Molecule Studies) 
(Thompson et al. 2007) (Eig. Ic). If the droplet con¬ 
tains an ion-sensitive fluorophore, this technique can 
also be used to measure the ionic flux (►Eluxes 
Through Channels and Pores) through each of multiple 
membrane proteins in a DHB under an applied poten¬ 
tial (Heron et al. 2009). By contrast, electrical moni¬ 
toring of the ionic current through individual 
membrane proteins in parallel requires that each pro¬ 
tein be in a separate aqueous phase that is individually 
electrically addressed (Pig. lb). 

Manipulation 

Aqueous-oil interfaces must be coated by lipid mono- 
layers before they are brought into contact in order to 


form a bilayer. The time required for monolayers to 
self-assemble is typically a few minutes (Villar et al. 
2011) to tens of minutes (Holden et al. 2007), 
depending on the oil used, as discussed below. In 
open and constrained systems, the monolayers are 
commonly brought into contact by varying the applied 
pressure at the open ends of the aqueous phases 
(Tsofina et al. 1966; Funakoshi et al. 2006), although 
other means such as solvent extraction are possible 
(Malmstadt et al. 2006). 

In droplet DIB systems, micropipetted droplets can 
be dropped or pushed into place with the micropipette. 
Alternatively, the droplets can be impaled with elec¬ 
trodes mounted on micromanipulators, which can then 
be used to bring the droplets into contact (Holden et al. 
2007). Following bilayer formation, the interfaces can 
be pushed together or pulled apart to control the bilayer 
area, for both DIBs (Holden et al. 2007) and DHBs 
(Heron et al. 2007; Gross et al. 2011a). The inserted 
electrodes also allow monitoring of the capacitance 
between the two droplets, which serves as an indicator 
of bilayer formation and area (Holden et al. 2007; 
Gross et al. 201 lb). 

Droplets can move relative to each other while 
joined by DIBs, and can be detached entirely (Holden 
et al. 2007). Although such flexibility is useful in 
screening applications (Holden et al. 2007; Syeda 
et al. 2008) as discussed below, it renders droplet 
networks vulnerable to reconfiguration by mechanical 
perturbation. Accidental reconfiguration of a network 
can be prevented by using dimples (Holden et al. 2007) 
or wells (Funakoshi et al. 2006; Sarles and Leo 2010a) 
to contain the droplets. Droplets contained in a flexible 
substrate can be separated and rejoined, so breaking 
and reforming the DIB, by mechanical distortion of the 
substrate (Sarles and Leo 2010b). 

Droplets in microfluidic devices can be immobilized 
in appropriately designed obstacles, and thereby 
brought into contact in groups (Bai et al. 2010; Zagnoni 
and Cooper 2010; Huebner et al. 2011). For example, 
a single device can produce droplets from two different 
aqueous solutions, bring droplets into contact in heter¬ 
ologous pairs, and immobilize over 1,000 such pairs 
simultaneously (Bai et al. 2010). In this instance, the 
droplets were coated with a monolayer of surfactant 
rather than lipid, but it is likely that arrays of lipid 
DIBs could be formed in the same way. 

In addition to their use in droplet generation as 
discussed above, EWOD (Poulos et al. 2009b) and 
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dielectrophoresis (Aghdaei et al. 2008) can be used to 
join droplets and thereby form DIBs. Droplets joined 
by a DIB can also be separated by dielectrophoresis 
(Thiam et al. 2011). DIBs have also been formed by 
optical manipulation. For example, laser-induced 
heating can be used to manipulate droplets either by 
causing natural convection in the bulk oil phase, or by 
the thermocapillary effect, which requires a gradient of 
surface tension at the aqueous-oil interface (Dixit et al. 
2010 ). 

Droplets can be fused together to rapidly deliver 
molecules from one droplet to another (Bayley et al. 
2008). Two droplets will fuse if they are brought into 
contact before both of the lipid monolayers have 
assembled completely. Droplets can also be made to 
fuse following the formation of a DIB by rupturing the 
bilayer, which can be achieved by several means, such 
as the application of a high voltage between electrodes 
in the droplets (Holden MA, unpublished 2005) or in 
the oil phase (Eow et al. 2001; Huebner et al. 2011), 
laser-induced heating (Dixit et al. 2010) or cavitation 
(Li et al. 2011), and the use of pH- or temperature- 
sensitive lipids as discussed below. 

Oil Phase 

The oil phase commonly consists of a hydro¬ 
carbon, such as n-decane (Funakoshi et al. 2006), 
«-hexadecane (Holden et al. 2007), or squalene 
(Bayley et al. 2008; Gross et al. 2011b). Shorter 
alkanes partition more strongly into the bilayer than 
longer alkanes, increasing the bilayer thickness (Gross 
et al. 2011b; McIntosh et al. 1980; Needham and 
Haydon 1983). Oils other than hydrocarbons, such as 
silicone oil, also support DIB formation (Poulin and 
Bibette 1998; Villar et al. 2011). The high density of 
silicone oil can be used to match the densities of the 
aqueous and oil phases. Moreover, droplets made in 
a solution of lipids in silicone oil require less time to 
acquire a lipid monolayer (Villar et al. 2011), and are 
more strongly adhesive than in hexadecane (Poulin and 
Bibette 1998; Bibette et al. 1999; Villar et al. 2011), 
which has been attributed to the lower solubility of the 
lipid in silicone oil (Poulin and Bibette 1998; Bibette 
et al. 1999). Importantly, a small proportion of hydro¬ 
carbon such as hexadecane in the silicone oil is 
required to render DIBs electrically insulating (Villar 
et al. 2011). 


Encapsulation 

Droplets in a DIB network are typically surrounded by 
an oil phase, precluding direct interaction with the 
external environment, although indirect interaction is 
possible. For example, a DHB system under oil can be 
made to sense air flow in its surroundings, by using a 
hair-like fibre to mechanically couple the hydrogel to 
the air around the oil phase. Air movements then dis¬ 
turb the bilayer, and produce capacitive currents that 
can be monitored by electrical recording (Sarles et al. 
2011 ). 

Droplet networks can also be stabilized in bulk 
aqueous solution, by encapsulation within small oil 
drops. The encapsulated droplets in these multisomes 
can communicate directly with the aqueous environ¬ 
ment, and with each other, through protein pores. 
Further, the droplets can release their contents upon 
a change in pH or temperature through destabilization 
of the bilayer, as discussed below (Villar et al. 2011) 
(Fig. 2). Multisomes have been made by manual 
pipetting (Villar et al. 2011), but it should be possible 
to adapt the microfluidic techniques that are used for 
the production of double emulsions stabilized by sur¬ 
factants (Okushima et al. 2004; Chu et al. 2007; Seo 
et al. 2007; Bauer et al. 2010) or block copolymers 
(Polymersomes) (Shum et al. 2011) to create smaller 
functional multisomes in greater numbers. 

Lipids and Membrane Proteins 

Lipids 

Lipid (► Chemical Diversity of Lipids) can be 
included in the bulk oil bath {lipid-out. Fig. la) or in 
the aqueous droplets in the form of vesicles (iipid-in). 
With the lipid-in method, droplets in the same oil bath 
become coated with different lipid monolayers 
(► Lipid Bilayer Asymmetry, ► Lipid Flip-Flop) 
(Hwang et al. 2008). 

The lipid most commonly used in DIB systems has 
been 1,2-diphytanoyl-i'«-glycero-3-phosphocholine 
(DPhPC), which yields stable and electrically insulat¬ 
ing bilayers over a wide range of experimental condi¬ 
tions. However, other lipids can be used to provide 
a suitable environment for a particular membrane pro¬ 
tein (Bayley et al. 2008), or to study the behavior of 
a membrane protein in a bilayer with a particular lipid 
composition. For instance, the lipid-in technique was 
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Communication with external solution 



pH- and temperature-dependent release 


Functional Droplet Interface Bilayers, Fig. 2 Multisomes: 
DIBs in bulk aqueous solution, (a) A multisome. Small drops of 
oil contain networks of aqueous droplets, which communicate 
with each other through protein pores in the DIBs that join them 
(Villar et al. 2011). Further, pores in bilayers at the surfaces of 
the aqueous droplets that protrude from the oil drop enable the 
internal network to communicate with the bulk aqueous solution. 


The encapsulated droplets can release their contents into the 
aqueous environment by pH- or temperature-induced rupture 
of bilayers. This triggered release could enable the simultaneous 
delivery of binary or higher-order drug combinations in precise 
proportions, (b) The monolayers and bilayers in a multisome 
containing two aqueous droplets 


used to study the gating behavior of the membrane 
protein OmpG in asymmetric bilayers with variously 
charged lipid leaflets (Hwang et al. 2008). Lipids 
can also be chosen to provide functionality to a DIB 
network. For example, aqueous droplets in multisomes 
made with lipids that have a pH- or temperature- 
induced phase transition release their contents into 
the environment upon a change in pH or temperature 
(Villar etal. 2011). 

Membrane Proteins 

A variety of ► membrane proteins (► Membrane Pro¬ 
teins; Structure and Organization) have been inserted 
into DIBs, including the staphylococcal a-hemolysin 
(aHL) pore (Holden et al. 2007), the anthrax toxin pore 
(Fischer et al. 2011), the outer membrane protein 
OmpG (Hwang et al. 2008), the ► potassium channels 
(► Potassium Channels; Their Physiological and 
Molecular Diversity) Kcv (Syeda et al. 2008) and 
KcsA (Bayley et al. 2008), and the proton pump bac- 
teriorhodopsin (Holden et al. 2007). The proteins are 
typically expressed in Escherichia coli cells or by 
in vitro transcription/translation (IVTT) (Bayley et al. 
2008; Syeda et al. 2008). Alternatively, cell membrane 
fragments from bacteria, eukaryotic cells, and 


organelles (► Reconstitution of Ion Channels from 
Intracellular Membranes and Bacteria Not Amenable 
to Conventional Electrophysiological Techniques) can 
be incorporated into DHBs, allowing the study of 
endogenously expressed membrane proteins such as 
mammalian potassium channels (Leptihn et al. 2011). 
Smaller pore-forming peptides, such as gramicidin 
A (Bayley et al. 2008) and alamethicin (Sarles et al. 
2010; Harriss et al. 2011), as well as non-peptidic 
pore-forming molecules, such as the antimycotic drug 
nystatin (Bayley et al. 2008), have also been inserted 
into DIBs. 

DIBs have been used to screen potential blockers 
against membrane proteins by including a membrane 
protein in one droplet and each of the analytes in 
a separate droplet. The protein-containing droplet 
was sequentially brought into contact with each ana¬ 
lyte droplet, and allowed to form a DIB into which 
the protein inserted. The ionic current was measured 
under an applied potential, allowing the interaction 
of the protein with each analyte to be monitored 
at the ► single-molecule level (► Single-Molecule 
Methods). This approach has been applied to study the 
interaction between the aHL pore and various cyclo- 
dextrins using DIBs (Holden et al. 2007) and DHBs 
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(Heron et al. 2007), and between the viral potassium 
channel Kcv and potential blockers using DIBs in a chip 
format (Syeda et al. 2008). Figure lb shows a hypothet¬ 
ical parallelized version of the screening technique. 

Membrane proteins can also be used to purposefully 
functionalize DIBs in networks of droplets for the 
construction of functional devices (Fig. Id). Both nat¬ 
ural proteins like bacteriorhodopsin, which has been 
used to make a light-sensing droplet network (Holden 
et al. 2007), and engineered proteins can be used. For 
example, an anion-selective mutant of txHL has been 
used to make a simple battery that converts 
a difference in chemical potential into an electrical 
voltage (Holden et al. 2007; Xu et al. 2010); and 
another mutant of the same pore that strongly rectifies 
ionic current has been used to construct simple 
electrical devices including a full-wave rectifier 
(Maglia et al. 2009). 

The Future 

Many avenues are foreseeable for the further develop¬ 
ment of DIB systems. For example, in basic science, 
the application of DIBs to high-throughput screening 
and similarly large-scale applications will require 
reliable, parallelizable, and rapid means of droplet 
production, manipulation, and electrical recording 
(Fig. lb). Many of the microfluidic systems outlined 
here are promising, but have so far seen limited 
application to lipid-based DIB systems. Alternatively, 
some screening configurations may be better based on 
a high-capacity variant of a hydrogel-based optical 
recording platform (Heron et al. 2009). The extension 
of DIB systems to allow the study of membrane pro¬ 
teins that span two lipid bilayers, such as nuclear pores 
(Strambio-De-Castillia et al. 2010) and gap junctions 
(Nakagawa et al. 2010), would be tremendously 
valuable. This might be achieved by apposing two 
multisomes with single inner droplets in such 
a way that the two bilayers are brought into contact 
(Villaretal. 2011). 

Functional droplet network devices comprising 
smaller droplets and made in greater numbers than 
achievable by manual micropipetting would find 
broad application. The controlled, automated assem¬ 
bly of miniaturized networks is an important chal¬ 
lenge, and may be resolved by using microfluidic 
techniques for directed and templated assembly 


(Aghdaei et al. 2008; Bai et al. 2010; Dixit et al. 
2010; Sarles and Leo 2010b; Zagnoni and Cooper 
2010). Alternatively, sufficiently small droplets may 
be able to self-assemble through mechanisms that are 
ineffective for larger droplets, such as through deple¬ 
tion interactions (Sacanna et al. 2010) or adhesion 
forces (Kolmakov et al. 2010). It would be useful to 
be able to induce the reconfiguration and movement 
of droplet networks. Other interesting problems 
include the formation of bilayers with finite curva¬ 
tures, the controlled formation of three-dimensional 
networks, and the activation or inactivation of net¬ 
work function using external stimuli. 

A particularly intriguing direction for the devel¬ 
opment of droplet networks is their integration with 
biological systems. First, miniaturized multisomes 
may find use in medicine, where their functiona¬ 
lization with membrane proteins and lipids would 
allow drug combinations to be delivered in a more 
sophisticated way than is presently possible (Villar 
et al. 2011). Second, droplet networks consist of 
aqueous compartments that communicate through 
membrane proteins in a defined way, and are easily 
addressed electrically; DIB networks could therefore 
be developed as an interface between electronics and 
live tissue. This will likely require the stabilization 
of droplet networks in aqueous solution, possibly in 
the form of multisomes or by embedding networks in 
a semirigid, biocompatible substrate. Finally, in the 
longer term, extended networks of miniaturized 
droplets may be developed as “prototissues” by anal¬ 
ogy with the protocells (► Lipid Protocells) pro¬ 
duced by synthetic biologists (Woolfson and 
Bromley 2011). 

Cross-References 

► Chemical Diversity of Lipids 

► Continuous Flow 

► Fluxes Through Channels and Pores 

► Lipid Bilayer Asymmetry 

► Lipid Flip-Flop 

► Lipid Protocells 

► Membrane Proteins; Structure and Organization 

► Patch-Clamp Recording of Single Channel Activity; 
Acquisition and Analysis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 
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► Reconstitution of Ion Channels from Intracellular 
Membranes and Bacteria Not Amenable to 
Conventional Electrophysiological Techniques 

► Single-Molecule Methods 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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Functional Roles of Lipids in Membranes 

William Dowhan 

Department of Biochemistry and Molecular Biology, 
University of Texas Medical School-Houston, 
Houston, TX, USA 

Synonyms 

Membrane lipids 


Introduction 

The structure and function of an integral 
membrane protein is dependent on a complex interplay 
between the protein and its lipid environment during 
protein synthesis, hnal folding, and assembly. In addi¬ 
tion, once fully assembled, membrane protein structure 
remains dynamic and responds to changes in the lipid 
environment (Dowhan and Bogdanov 2009). Mem¬ 
brane proteins are exposed to a changing lipid envi¬ 
ronment due to local fluctuations in lipid composition 
in all membranes and during ► vesicle-mediated 
movement between organelles of varying lipid com¬ 
position in eukaryotic cells. Current rules governing 
initial membrane protein synthesis, membrane inser¬ 
tion, and folding are based on the properties of proteins 
and the amphipathic property of lipids but do not take 
into account the specific properties of individual lipids. 
The molecular species that make up the lipidome, 
which determine the collective properties of the mem¬ 
brane lipid bilayer, are as varied as the proteins that 
make up the proteome (Dowhan 1997). Therefore, 
membrane lipid composition plays a significant role 
in determining the structure and function of membrane 
proteins. 

There are two basic approaches to determining how 
lipids affect membrane protein structure and function. 
Early investigations relied solely on purification and 
reconstitution of membrane proteins into liposomes of 
different lipid composition. This approach allows 
precise control of the protein environment but is 
fraught with potential artifacts and lacks verification 
of biological significance. Although there is a wealth 
of information on the physical and chemical proper¬ 
ties of individual and simple mixtures of lipids in 
solution, how this information translates into biolog¬ 
ical function is lacking. What do viscosity, membrane 
curvature, bilayer and non-bilayer properties, surface 
charge, and specific structure features mean in bio¬ 
logical terms? How these many properties are 
integrated in a physiologically meaningful manner 
is not known. In many cases these parameters have 
not been well controlled in in vitro studies. A 
second approach has been to alter membrane lipid 
composition and properties in vivo through molecular 
genetic approaches. This approach provides the 
necessary biological significance but is complicated 
by pleiotropic effects on multiple biological pro¬ 
cesses. However, by combining genetic manipulation 
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Functional Roles of Lipids in Membranes, Fig. 1 Lipid 
Pathways. Synthesis of native and foreign lipids in Escherichia 
coli. Pathways that occur in E. coli are noted with blue arrows, 
and those due to introduction of foreign genes are noted with red 
arrows. Major lipids are color coded as zwitterionic {blue), 
neutral (green), or anionic (orange). The enzymes encoded by 
the genes associated with each biosynthetic step are: (1) CDP- 
diacylglycerol synthase; (2) PS synthase; (3) PS decarboxylase; 
(4) PG-P synthase; (J) PG-P phosphatases (Lu et al. 2011); (6) 


CL synthase; (7) PG:MDO (membrane-derived oligosaccharide) 
in-glycerol-1-P transferase; (8) diacylglycerol kinase; (9) 
GlcDAG synthase (Acholeplasma laidlawii); (10) GlcGlcDAG 
synthase (A. laidlawii)', (11) PC synthase (Legionella 
pneumophila)', (12) PI synthase (Saccharomyces cerevisiae) 
(This figure was modified from the original published in 
Bogdanov et al. (2009) © the American Society for Biochem¬ 
istry and Molecular Biology) 


of membrane lipid composition with in vitro recon¬ 
stitution studies, specificity and biological function 
can be assessed leading to an understanding of the 
role of lipids in determining membrane protein struc¬ 
ture and function (Dowhan 2009). 

This entry focuses on the recently uncovered roles 
of lipids in determining membrane protein organiza¬ 
tion by combining in vivo and in vitro approaches to 
altering membrane lipid composition with biochemical 
and biophysical analysis of protein structure and 
function. 

Manipulation of Membrane Lipid 
Composition 

Eukaryotic cell lipid metabolism has been altered 
through genetic manipulation mostly in cultured cells 


or yeast. However, the effects are more complex than 
in microorganisms, and cell viability is more difficult 
to control. Currently, most progress has been made in 
manipulating lipid metabolism in bacteria such as 
Escherichia coli. 

The biosynthetic pathways (Dowhan 2009) for the 
synthesis of E. coli glycerophosphate-based lipids 
(►phospholipids) are shown in Fig. 1. The major 
lipids are composed of zwitterionic (carries a plus 
and minus charge) phosphatidylethanolamine (PE, 
70-80%) and anionic (or acidic) phosphatidylglycerol 
(PG, 20-25%) and cardiolipin (CL, 5-10%). The pre¬ 
cursors to these major lipids are about 5% of the total 
phospholipid pool. These lipids make up the bilayer of 
the inner membrane and the inner leaflet of the outer 
membrane bilayer; the outer leaflet is composed of 
lipid A-based component of lipopolysaccharide 
(Raetz et al. 2007). Mutations in steps leading to the 
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synthesis of the common intermediate, CDP-diacyl- 
glycerol, to the zwitterionic and anionic lipids are 
lethal, but mutations in steps beyond this branch 
point are viable under set growth conditions and result 
in distinct phenotypes for which the molecular basis 
has in many cases been defined. 

Null mutants in the pssA gene eliminate all zwitter¬ 
ionic lipids (phosphatidylserine (PS) and PE) from 
cells with the remaining lipids being anionic 
(95% PG and CL with 5% phosphatidic acid [PA] 
and CDP-diacylglycerol). These cells are viable but 
require high concentrations of divalent cations 
(Ca"^^, Mg"^^, and Sr"^^ but not Ba"^^) to prevent cell 
lysis. The cells present a filamentous morphology and 
are multinucleated suggesting a defect in cell division. 
The permeability barrier of the outer membrane, where 
the inner leaflet is normally 90% PE, is compromised 
resulting in leakage of proteins from the inner mem¬ 
brane space. The cells also have defects in many of the 
secondary solute transporters of the inner membrane 
that couple the proton electrochemical potential to the 
accumulation of substrate against a concentration 
gradient (uphill transport). Energy independent equil¬ 
ibration of substrate across the membrane 
(downhill transport) is not affected (Dowhan and 
Bogdanov 2009). Mutants in the psd gene accumulate 
PS in place of PE and display phenotypes similar to 
pssA mutants. 

Strains of E. coli have been engineered to regulate 
the expression of the pssA gene and thus PE in 
a dose-dependent manner by placing the pssA gene 
under a promoter that can be regulated by the 
composition of the growth medium such as VaraB 
(promoter regulated by arabinose) or (promoter 
regulated by anhydrotetracycline). Using such strains 
steady-state PE levels can be regulated between 3% 
and 70% or synthesis can be turned on or off to study 
the effects of changes in lipid composition on cell 
processes. 

Lipid composition can be further altered by 
introducing genes from other organisms that enable 
the synthesis of lipids foreign to E. coli either in the 
presence or absence of normal lipids (Bogdanov et al. 
2009, 2010b; Xia and Dowhan 1995). Using this 
approach, lipids with similar or different properties 
can be compared in an in vivo system (see Fig. 2 and 
discussion below). Thus far, phosphatidylcholine 
(PC, 70%), monoglucosyl diacylglycerol (GlcDAG, 


30-40%), or diglucosyl diacylglycerol (GlcGlcDAG, 
30-40%) have been introduced into E. coli in place of 
PE. In addition, phosphatidylinositol (PI) has been 
expressed at 10-15% of total phospholipid. 

The major anionic phospholipids, PG and CL, can 
also be eliminated by a null mutation in the pgsA gene. 
In this case, 5% of total lipid composition still contains 
anionic lipids, mainly PA and CDP-diacylglycerol. 
Initially, it was thought that elimination of PG and 
CL is lethal so that experiments to assess the role of 
these lipids were done using a pgsA gene under regu¬ 
lation of P/ac (Dowhan et al. 2004). With this strain, 
anionic lipids can be varied between about 10% and 
25% in a dose-dependent manner, which allowed the 
investigation of anionic lipid roles in DNA replication 
and protein secretion. Mutants completely lacking PG 
and CL are viable if several additional mutations are 
introduced. Mutants in anionic lipid synthesis have 
been used to study the interaction of several peripheral 
membrane proteins that require membrane association 
for activation. 

With this set of lipid mutants, the membrane lipid 
composition and therefore membrane physical and 
chemical properties can be varied over a broad range 
to assess the role of each property in determining 
membrane protein structure and function. As summa¬ 
rized in Fig. 2, each lipid head group has a unique 
physical and chemical signature that contributes to 
the collective properties of the membrane. Head 
groups can be net neutral but either zwitterionic (PE 
and PC) or uncharged (GlcDAG and GlcGlcDAG), net 
anionic but either non-amine-containing (PG, CL, PI) 
or amine-containing (PS), or have hydrogen bonding 
capability (all but PC). The head group and the fatty 
acid composition (R groups) also determines the 
bilayer or non-bilayer properties of the lipid by impos¬ 
ing a positive or negative curvature, respectively, when 
in association with itself or with bilayer-forming 
lipids. Large head groups for lipids like in PC, PG, 
PI, and GlcGlcDAG tend to form bilayer structures in 
solution, while small head groups of lipids like in PA, 
PE, GlcDAG, or CL (in the presence of divalent cat¬ 
ions) form non-bilayer structures and introduce lateral 
stress in an ordered bilayer structure. Non-bilayer 
properties and membrane fluidity are also dependent 
on the fatty acid composition of the lipids with unsat¬ 
urated fatty acids increasing both properties (Dowhan 
1997). 
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Functional Roles of Lipids in Membranes, Fig. 2 Structure 
and Properties of Lipids. Stick diagrams represent the carbon 
backbone of each lipid. The R refers to the extended hydrocar¬ 
bon chains of fatty acids in otI and sn2 position of glycerol. 
The organization of those lipids capable of non-bilayer 


formation is dependent on fatty acid composition, solvent, and 
temperature (This figure was modified from the original 
published in Dowhan and Bogdanov (2009) © Annual Reviews 
of Biochemistry) 


Lipids As Topological Determinants 

An important structural feature of polytopic a-helical 
► membrane proteins is the orientation of transmem¬ 
brane domains (TMs) with respect to the plane of the 
lipid bilayer. Residency of an Gt-helical domain within 
the membrane bilayer is largely dependent on its 
hydrophobicity (White and von Heijne 2005) while 
initial orientation of TMs is determined by an interplay 
between topogenic signals flanking TMs, the 
translocon and the net charge of the membrane surface 
determined by the lipid head group composition 


(Dowhan and Bogdanov 2009). Final TM orientation 
occurs during late folding events and is governed by 
short-range lipid-protein interactions and long-range 
interactions within the protein (Dowhan and Bogdanov 
2009). TM orientation once established is not static but 
can change in response to changes in the lipid environ¬ 
ment. The role of the translocon in orienting TMs has 
been extensively reviewed as has the topogenic signals 
within protein sequences (White and von Heijne 
2008). It is generally accepted that TM orientation is 
largely dictated by the positive inside rule, which is 
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Functional Roles of Lipids in Membranes, Fig. 3 Protein 
Topology. Dependence of protein topology on membrane lipid 
composition. The top of each figure faces the cytoplasm. TMs 
are pictured as rectangles. Extramembrane domains are labeled 
NT, C, P, or CT for N-terminal, cytoplasmic, periplasmic, or 
C-terminal, respectively, as oriented in PE-containing cells, 
(a) Topology of LacY in PE-containing i+PE) cells. Positions 
of negatively charged {green) and positively charged (red) 
amino acids are indicated. Numbers in parentheses denote 


the net charge of the respective extramembrane domain, 
(b) Topology of LacY in PE-lacking {—PE) cells, (c) The topol¬ 
ogy of LacY after post-assembly synthesis of PE in cells lacking 
PE during initial synthesis of LacY. For PheP and GabP, only the 
topology of lipid sensitive domains (4 of 12 TMs) is shown 
for PE-containing (d) and PE-lacking (e) cells (This figure is 
modified from the originally published in Bogdanov et al. (2009) 
© the American Society for Biochemistry and Molecular 
Biology) 


based on both experimental evidence and statistical 
analysis (von Heijne 2006). According to the positive 
inside rule, 85% of the cytoplasmically oriented 
extramembrane domains flanking TMs have a net pos¬ 
itive charge and those facing the opposite side of the 
membrane have a net negative or no net charge. 

However, the role lipids play in accentuating and 
supporting the positive inside rule has only recently 
been established through the use of E. coli lipid 
mutants (Bogdanov et al. 2008; Dowhan and 


Bogdanov 2009). The topological organization of 
three polytopic secondary transporters (lactose 
(LacY), phenylalanine (PheP), and y-aminobutyrate 
(GabP) permeases) is partially inverted with respect 
to the plane of the bilayer and to the remainder of each 
protein when expressed in cells lacking PE (Fig. 3). In 
the case of LacY, this inversion is dependent on the 
hydrophobicity of TM VII. Two Asp residues in TM 
VII are stabilized in the membrane by salt bridges to 
positively charged amino acids in wild-type 


























































































Functional Roles of Lipids in Membranes 


873 


F 


PE-containing cells. However, the low the hydropho- 
bicity of TM VII allows it to be exposed to the 
aqueous environment and act as a molecular hinge 
between the two independently folding halves of 
LacY when synthesized in cells lacking PE. Substitu¬ 
tion of one Asp residue by lie raises the hydrophobic- 
ity, which prevents solvent exposure and inversion. 
The mis-organization of LacY in PE-lacking cells can 
be partially reversed by inducing PE synthesis post¬ 
assembly of LacY thereby demonstrating that TM 
orientation is not hxed after final protein folding but 
is dynamic and can respond to changes in the lipid 
environment. 

The molecular basis for the lipid requirement was 
established by varying membrane lipid composition 
and protein sequences of cytoplasmic domains that 
contain a mixture of positive and negative amino 
acids. Normally, positive residues (domain retention 
signals) are dominant over negative residues (domain 
translocation signals) as topological determinants. 
However, these rules are dependent on the normal 
balance between neutral or zwitterionic lipids and 
anionic or net negatively charged lipids. The former 
lipids dampen or attenuate the translocation potential 
of negative residues thereby strengthening the reten¬ 
tion potential of positive residues. This was demon¬ 
strated by replacement of PE in vivo by either neutral 
(GlcDAG or GlcGlcDAG) (Dowhan and Bogdanov 
2009) or zwitterionic (PC) foreign lipids (Bogdanov 
et al. 2010b), which restored wild-type topology by 
diluting the high negative charge of the membrane 
surface due to PG and CL. Inversion in PE-lacking 
cells was prevented by increasing the net positive 
charge of the normally cytoplasmic domains flanking 
TMs. Inversion in PE-containing cells was induced by 
increasing the negative charge of the same flanking 
domains. Thus, topology is determined and sensitive 
to short-range charge interactions between the charge 
density of the membrane surface and the charge nature 
of cytoplasmically exposed domains of the membrane 
protein (Charge Balance Hypothesis). Non-bilayer and 
hydrogen-bonding properties of the lipid head groups 
are not a factor. 

The topological organization of PheP and GabP is 
also dependent on the membrane lipid composition in 
a similar manner to LacY except only the N-terminal 
two-TM hairpin is inverted in the absence of PE. 
PheP misorientation of TMs is also reversed by 
post-assembly synthesis of PE. Increasing the net 


positive charge of normally cytoplasmic domains 
flanking the hairpin also prevents misorientation in 
PE-lacking cells (Vitric et al. 2011). 

Einal topological organization is also dependent on 
long-range interactions within membrane proteins as 
exemplified by the effect of TM VII hydrophobicity 
on the orientation of the N-terminal six TM bundle of 
LacY. As noted above, increasing the hydrophobicity 
of TM VII prevents misorientation of TM VII in 
PE-lacking cells. However, further increasing the neg¬ 
ative charge of the extramembrane domains 
N-terminal to TM VII of higher hydrophobicity 
results in inversion in PE-lacking cells (Vitric et al. 
2011). Therefore, final topological disposition is 
a result of long-range thermodynamic balance 
between lipid-protein interactions that determine 
translocation or retention potentials and the hydropho¬ 
bicity of TMs that act as potential molecular hinges. 
In addition, topology of N-terminal TM bundle of 
LacY is affected by position-independent single 
amino acid changes in each of domains C2, C4, or 
C6, which indicates that the bundle responds to charge 
and hydrophobic balance as a single unit after exit 
from the translocon. This observation coupled with 
the reorganization of TMs post-assembly of LacY 
and PheP indicate that final topological organizational 
decisions are made independent of the translocon and 
during late stage folding of the protein (Dowhan and 
Bogdanov 2009). 

Determination of Membrane Protein 
Topological Arrangement 

Although there are several methods to determine mem¬ 
brane protein topological organization, the Substituted 
Cysteine Accessibility Method applied to TMs 
involves the least amount of alteration in protein struc¬ 
ture (Bogdanov et al. 2005, 2010a). In this method, all 
endogenous cysteine residues are converted by molec¬ 
ular genetic manipulation to serine residues. In almost 
all cases, such derivatives retain near full activity as 
well as other native properties. After determination of 
a putative topological organization using hydropathy 
plots based on the amino acid sequence (von Heijne 
2006), single cysteines are introduced into putative 
extramembrane domains as well as into TMs. The 
protein is then expressed from a plasmid in a strain 
lacking a genomic copy of the target protein and with 
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either wild-type or altered lipid composition. Whole 
cells and cells disrupted by sonication are then treated 
with a membrane impermeable sulfhydryl reagent 
linked to a detectible tag. 3-(Wmaleimido-propionyl 
biocytin (MPB) has been extensively used and is 
a derivative of A^-ethylmaleimide linked to biotin. 
Under ideal conditions, cysteines in domains exposed 
to the periplasm are derivatized in whole and sonicated 
cells while cysteines exposed to the cytoplasm are only 
derivatized after cell disruption. Cysteines within TMs 
are not derivatized under any conditions. Cysteines in 
extramembrane domains can be unreactive due to local 
secondary structure that raises the ionization constant 
of the sulfhydryl group or due to steric hindrance. 
Carrying out derivatization at pH 9.0 instead of 7.5 
generally disrupts secondary structure sufficiently to 
allow reaction with extramembrane cysteines without 
compromising cell integrity. Cysteine within a TM that 
spans the membrane can be distinguished from 
a partially membrane inserted domain or mini-loop 
that does not cross the membrane by prior treatment 
of the sample at pH 12. Such treatment releases 
mini-loops but not true TMs from the bilayer and 
exposes cysteine residues for reaction. Therefore, reac¬ 
tion conditions can be set up to distinguish among 
cytoplasmic or periplasmic domains and TMs or 
mini-loops. There are also examples where dual topol¬ 
ogies coexist for the same domain within a protein 
resulting in partial derivatization in whole cells with 
increased derivatization in broken cells (Vitric et al. 
2011). Pre-blocking whole cells with a membrane- 
impermeable sulfhydryl reagent lacking a detectible 
tag (4-acetamido-4-maleimidylstilbene-2,2-disulfonic 
acid) followed by the normal procedures outlined 
above can generally establish a mixed topology for 
a particular domain. 

After derivatization, membranes are solubilized 
by detergent and the target protein immunopre- 
cipitated and subjected to sodium dodecyl sulfate 
polyacrylamide electrophoresis followed by West¬ 
ern blot analysis using Avidin-linked fluorescence 
detection. The fluorescence pattern and intensity are 
recorded using a fluorescence imager for presenta¬ 
tion and quantification. The absence of a signal 
requires verification that the protein derivative was 
present on the blot, which can be done after stripping 
the solid support and redoing a Western blot using 
target-specific antibody. 


Reconstitution of Membrane Proteins into 
Liposomes 

Reconstitution of purified membrane proteins into 
liposomes of defined composition is an effective 
means of verifying conclusions drawn from in vivo 
studies of lipid effects on protein structure and 
function. Reconstitution of purified LacY as well as 
several other secondary transporters into liposomes 
composed of E. coli phospholipids either with or 
without PE verified the absolute requirement for PE 
for energy-dependent uphill transport function 
(Dowhan and Bogdanov 2009); PE is not required for 
downhill transport. Wild-type topological organization 
of LacY in proteoliposomes is also dependent on PE 
(Wang et al. 2002). A similar inversion of topology as 
observed in vivo is seen when PE is left out of the 
reconstitution procedure. The topological organization 
of LacY is dependent on liposome composition and not 
the source of LacY further confirming the dependence 
of topology on temporal lipid composition and not 
initial lipid composition at the time of synthesis. 
Interestingly, substitution of PE by PC 
(commercially available containing dioleoyl fatty 
acids) supported wild-type topology of LacY but not 
uphill transport function. This observation has led to 
considerable speculation of the role of specific lipids in 
supporting function. However, this observation 
appears to be the result of using PC species lacking 
fatty acid composition normally found in E. coli. When 
PC is made in E. coli, where PC contains a mixture of 
saturated and unsaturated fatty acids, LacY is fully 
functional in uphill transport (Bogdanov et al. 2010b). 

Eurther verification of the effects of lipids on struc¬ 
ture and function have been carried out using 
a monoclonal antibody 4B1 (mAb 4B1) to probe 
structure-function relationships in vitro. This antibody 
recognizes the proper conformation of an epitope in 
domain P7 of LacY (Fig. 3), which is associated with 
display of uphill transport function. LacY (in vivo and 
in vitro) is recognized by mAb 4B1 when assembled 
in, reconstituted in, or exposed post-assembly to 
GlcDAG but not GlcGlcDAG, PC (also requires 
presence of PG or CL) or PE containing at least one 
saturated fatty acid but not PE or PC with two unsatu¬ 
rated fatty acids (Bogdanov et al. 2010b). Therefore, 
the requirements for supporting full function are 
more stringent than those for supporting topological 
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organization. Sorting out physiologically significant 
lipid-protein interactions requires a combination of 
in vivo and in vitro studies. 


Summary 

Lipids act as more than a hydrophobic solvent for 
membrane proteins. The collective properties of the 
membrane lipid bilayer are determined by the lipid 
composition, which affects net charge of the surface, 
lateral pressure, and viscosity. All these factors influ¬ 
ence membrane protein structure and function at the 
time of initial membrane insertion and folding into 
a native structure. However, membrane proteins are 
highly dynamic structures that can change in response 
to changes in the lipid environment. The study of 
membrane proteins in cells with genetically altered 
lipid composition and in liposomes of variable lipid 
composition is necessary to establish the physiologi¬ 
cally important molecular basis for lipid functions. 
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Synonyms 

7-transmemhrane domain receptors; GPCR; 
Heptahelical receptors; Seven-transmembrane domain 
receptors; Signal transduction at the intracellular 
surface of the membrane-integral G-protein-coupled 
receptors 

Definitions 

GPCR - G protein-coupled receptor 
Activation - a series of conformational changes in the 
receptor that leads to activation of intracellular 
G proteins and other effectors 
Agonist - a ligand that, upon binding, stabilizes an 
active state of the receptor, leading to activation of 
cytoplasmic effectors and subsequent cellular 
responses 

Inverse agonist - a ligand that, upon binding, stabilizes 
an inactive state of the receptor, which reduces 
basal activity 


Introduction 

G protein-coupled receptors (GPCRs) are seven trans¬ 
membrane proteins that transmit a wide variety of 
extracellular signals (photons, odorants, pheromones, 
biogenic amines, (neuro)peptides, proteases, glycopro¬ 
tein hormones, and ions) across the plasma membrane. 
When activated by light or ligands, GPCRs undergo 
conformational changes leading to coupling and 
activation of G proteins, GPCR kinases, and arrestins, 
which, in turn, trigger a variety of cellular signaling 
pathways. Over 30% of current drugs modulate the 
activity of GPCRs, and, thus, understanding the 
structural changes during ligand-induced receptor 
activation may help to design a next generation of 
pharmaceuticals with improved properties. The 
currently available X-ray structures of several GPCRs 
in the inactive (reviewed in Rosenbaum et al. 2009) 
and active (Deupi et al. 2012; Rasmussen et al. 2011) 
states provide key snapshots of the mechanisms of 
activation (reviewed in Deupi and Standfuss 2011). 
A number of biophysical evidences suggest that these 
mechanisms are shared among the large GPCR family 
(reviewed in Deupi and Kobilka 2007; Farrens 2010). 

Activation Process 

While in the absence of agonists GPCRs predomi¬ 
nantly exist in an inactive state, most receptors show 
some level of basal activity. This suggests that, even 
in the absence of an activating signal, receptors 
transiently sample different conformational states, 
including active states. Binding of agonists can stabi¬ 
lize these active states, which recruit and activate the 
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G Protein-Coupled 
Receptor Activation Based 
on X-ray Structural Studies, 
Fig. 1 Conformational states 
of the receptor. It is likely that 
most receptors sample all of 
these conformations even in 
the absence of agonist ligands 
and G proteins. Binding of 
agonists and G proteins, 
however, increases the 
probability of the receptor of 
being in the active state 
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G protein (Fig. 1). Exchanging of the bound GDT to 
GTP in the alpha subunit of the activated G protein 
leads then to its dissociation from the receptor as well 
as to its dissociation into the G^, and Gpy subunits. Each 
of these subunits can trigger specific intracellular 
signaling events. 

It is important to note that the affinity of agonists for 
the receptor in the active conformation is much higher 
(e.g., 100-fold for b2AR) than for the inactive confor¬ 
mation, thus providing a physicochemical basis for the 
stabilization of the active state by agonists. 

Structural Differences Between Inactive and 
Active States 

In order to achieve a high-affinity state for the agonist 
and the G protein, the receptor undergoes significant 
conformational changes in several domains. Firstly, the 
ligand-binding site rearranges slightly to accommodate 
the agonists, in the case of receptors activated by dif¬ 
fusible agonists, or, in the case of rhodopsin, to accom¬ 
modate the light-induced conformational change of an 
inverse agonist (11-cis retinal) to an agonist (all-trans 
retinal). Secondly, the G protein-binding site at the 
cytoplasmic part of the receptor shows a large structural 
rearrangement (Fig. 2). The inactive state is character¬ 
ized by a “closed,” tightly packed arrangement of trans¬ 
membrane bundle. In contrast, in the active state, 
helix 6 shows a large outward movement that opens 


a cavity to accommodate the C-terminal helix of the 
G protein. Concomitantly with the movement of 
the helix 6, helixes 5 and 7 show significant but smaller 
movements, to compensate for changes in packing inter¬ 
actions. Thirdly, a “transmission switch,” consisting of 
several conserved amino acids located roughly in the 
center of the transmembrane bundle, also rearranges 
upon activation (Deupi and Standfuss 2011; Rasmussen 
et al. 2011). This region acts as a “fulcrum” point, 
transmitting the conformational changes between the 
ligand-binding pocket to the G protein-binding sites 
through helices 5, 6, and 7. 

Partial Decoupling of Ligand Binding and 
Activation 

In some of the crystal structures of the Pi- and P 2 - 
adrenergic and A 2 A adenosine receptors, binding of 
agonists induced or stabilized local structural changes 
in the ligand-binding pocket without a shift to an active 
conformation in the cytosolic part of the protein. In 
order to stabilize an active state, most existing struc¬ 
tures required the presence of the G protein itself, in the 
case of P 2 -AR/GS complex, or of G protein surrogates, 
such as a nanobody or a peptide derived from the 
C-terminal part of the G protein in rhodopsin. How¬ 
ever, the structure of the active metarhodopsin-II has 
been crystallized without any stabilizing agent at the 
cytoplasmic side, supporting the notion that GPCRs 
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G Protein-Coupled Receptor Activation Based on X-ray 
Structural Studies, Fig. 2 Conformational changes during 
GPCR activation exemplified by crystal structures of the rho- 
dopsin ground state {left, pdb id:lGZM) and that of the light- 


activated metarhodopsin-II in complex with a peptide resem¬ 
bling part of the Got protein {right, pdb id: 4A4M, Deupi et al. 
2012). Arrows indicate the most prominent conformational 
changes in transmembrane helix (TM) 5, 6, and 7 


G 


can sample active conformations in the absence of 
G protein. It is interesting to note that binding of the 
G protein to the receptor in the absence of agonist 
induces high-affinity state of the receptor, which is 
associated with the active conformation. These results 
suggest that agonist binding alone may not always be 
sufficient to fully induce/stabilize the active state, 
i.e., that agonist binding and activation are, to some 
extent, decoupled. Binding of both agonist and 
G protein is, thus, generally required to stabilize 
the active conformation of the receptor. 

Filling the Gaps by Biophysical Experiments 
and Molecular Dynamics (MD) Simulations 

Conformational changes upon activation of GPCRs 
have been measured by electron paramagnetic reso¬ 
nance, NMR, and fluorescence spectroscopy. For 
instance, a number of fluorescence spectroscopy 
approaches have shown that activation proceeds 
through a sequence of conformational intermediates 
that can be specifically stabilized by agonists 
(reviewed in Kobilka 2007). 

Advances in long scale MD simulations have 
allowed investigation of the relaxation trajectory of 
the active to the inactive state of the P 2 -AR (Dror 
et al. 2011). These simulations sampled intermediate 
states where helices 5 and 7 remain in an inactive-like 
conformation while helix 6 retains an active-like con¬ 
formation. These results are consistent with a study 
using 19F NMR probes attached to the cytoplasmic 


ends of either helix 6 or 7, where the NMR signal 
originating from the labels can be decomposed into 
a sum of signals of active and inactive states. Specifi¬ 
cally, in the absence of ligand, the NMR signals from 
both helix 6 and 7 reflect mainly the inactive state while, 
upon binding of a full agonist, the receptor population 
shifts toward an active state (Liu et al. 2012). 

Physicochemical View of the Receptor 
Conformations 

Structural and biophysical evidence suggest that 
GPCRs can adopt several conformational states, 
including the active state, even in the absence of an 
agonist. This situation is schematically depicted as an 
energy diagram in Fig. 3. The energy difference 
between the states is probably relatively small, so 
that all states are sufficiently populated. The small 
fraction of the active state is responsible for the basal 
activity of the receptor. Binding of the agonist and the 
G protein leads to a change of the energy landscape of 
the receptor states, making the active state more stable. 
The height of the energy barriers between the states 
determines the kinetics of changes between states 
(reviewed in Deupi and Kobilka 2010). 

Outlook 

The current structures of GPCRs provide new insights 
into the activation mechanism leading to the coupling 
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a absence of agonist b agonist-bound 



G Protein-Coupled Receptor Activation Based on X-ray 
Structural Studies, Fig. 3 Schematic energy landscape of 
a GPCR. (a) In the absence of agonists, the most populated 
state is the low-energy inactive state (R). However, the energy 



G Protein-Coupled Receptor Activation Based on X-ray 
Structural Studies, Video 1 Morph between the inactive 
(pdb id: IGZM, Li et al. 2004) and light-activated conformation 
with bound G protein peptide (pdb id: 4A4M, Deupi et al. 2012) 
of the GPCR rhodopsin. 

and activation of the G protein. The future challenges 
include obtaining structural information about signaling 
complexes with other classes of G proteins, to under¬ 
stand the structural basis of G protein selectivity, and 
obtaining structures of the signaling complexes between 
receptors and kinases or arrestins (Video 1). 

Cross-References 

► NMR 

► Dynamics of Helix 8 in GPCR Function 



difference between the states is probably relatively small, so that 
intermediate (R') and active (R*) states are partially populated, 
(b) In the presence of agonist and the G protein, the active state 
(R*) is stabilized and, thus, becomes more populated 


► Fluorescence Resonant Energy Transfer (FRET) 

► Macromolecular Crystallography: Overview 

► Molecular Dynamics Simulations of Lipids 

► Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy 

► Rhodopsin: Stability and Structural Organization 
in Membranes 
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Gap Junction Proteins (Connexins, 
Pannexins, and Innexins) 

David C. Spray and Eliana Scemes 

Dominick P Purpura Department of Neuroscience, 

Albert Einstein College of Medicine, Bronx, NY, USA 

Gap Junctions 

Gap junctions are recognized structurally by roughly 
parallel appositional cell membranes separated by 
a “gap” of 2-4 nm; in freeze fracture, they appear as 
plaques of hexameric particles in the membrane 
(see Scemes and Spray 2(X)9). Three differences between 
gap junctions stractures in vertebrates and invertebrates 
were noted in early electron microscope studies (Epstein 
and Gilula 1977). Invertebrate gap junctions have wider 
“gaps,” larger particles and tend to cleave with the E face 
in freeze fracture replicas. The rather tight clustering of 
the particles at punctate junctional contacts is likely due 
to the strong and irreversible head to head binding of 
hexamers (termed connexons, innexons, or pannexons 
depending on protein composition, see below), that 
pulls appositional membranes closely together (Eig. 1). 

Gap Junction Proteins 

Connexins are only found in chordates, the most prim¬ 
itive example being tunicates. In rodents and man, 
there are 20 or 21 connexins, all with four transmem¬ 
brane domains and both amino and carboxyl termini 
within the cytoplasm. A list of these proteins, along 
with their tissue and cellular expression, and diseases 


associated with connexin deficiencies is shown in 
Table 1. The extracellular loops of connexins contain 
three cysteine residues each, and the disulfide bonds 
formed by these residues likely provide the secondary 
structure responsible for the tight seal and irreversibil¬ 
ity of the docking of one connexon to another. No 
homologous proteins have been found in invertebrates 
despite early reports that connexin antibodies 
disrupted intercellular communication in hydra (Eraser 
et al. 1987) and that antibodies to vertebrate connexins 
recognized sea anemone structures (Mire et al. 2000). 

Invertebrate gap junction proteins are formed by 
the innexin family of proteins, of which there are 8 in 
Drosophila and 25 in Caenorhahiditis. In vertebrates, 
there are weakly homologous proteins termed 
pannexins, one of which (pannexinl) has been demon¬ 
strated to form channels when expressed in Xenopus 
oocytes or in transfected mammalian cells. Parmexin 
sequences contain two cysteine residues in each extra¬ 
cellular loop and there also are glycosylation sites 
within the loops suggesting that such branched sugar 
oligomers might serve to physically separate pannexons 
from one another. Although a few reports have indicated 
that overexpressed pannexinl may form gap junction 
channels, and that deglycosylation might facilitate such 
gap junction formation, it seems unlikely that this 
occurs to any large extent (Sosinsky et al. 201 1). 

Gene Structures of Connexins and Pannexins 

Most gap junction proteins are encoded by genes where 
the entire coding sequence is contained within a single 
exon. Although there are a few exceptions, where coding 
exons are interrupted, in no case does the coding region 
span more than two exons. Pannexin genes are much 
more complicated, with hve exons for pannexinl. 

How Do Gap Junctions Form? 

It was first shown in Xenopus oocytes that gap junctions 
would rather quickly form if oocytes were manipulated 
into contact. Similarly in both insect and mammalian cell 
lines, formation of gap junctions between cells manipu¬ 
lated into contact is rapid. This has been interpreted as 
indicating that there is a large pool of connexons or so- 
called hemichannels in the surface membrane which are 
ready and able to join with partners across extracellular 
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Gap Junction Proteins (Connexins, Pannexins, and 
Innexins), Fig. 1 Gap junction structure. Schematic drawing 
of a longitudinal section of two gap junction channels formed by 
the docking of two hexameric oligomer of connexin proteins 
(connexons) at the appositional membranes of two cells. The 


carboxyl and amino terminal domains of each gap junction 
subunit (connexin) are located in the cell cytosol and the two 
connexons are joined by the two extracellular loops. Disulfide 
bonding in cysteines within connexins stabilizes the binding inter¬ 
face but does not directly link the connexin proteins to one another 


space to form the junctions. However, it is increasingly 
appreciated that most cells are highly polarized and that 
delivery to the site of the junctions may well be directed 
through cytoskeletal proteins attaching to adhesive junc¬ 
tions (Shaw et al. 2007) and including vesicular motors 
driving the directed mobility (Fort et al. 2011). It is now 
also increasingly appreciated that junctional proteins are 
added to the margins of the plaques and are destroyed 
following internalization of the connexons in both cells 
occurring at the plaque center (Laird 2006). 

Are There Functional Hemichannelsl 

It has been demonstrated that certain connexins 
expressed exogenously readily form channels in the 
nonjunctional membrane; this is an especially prominent 
characteristic of the lens gap junction proteins, Cx46 and 
Cx50. The manipulation generally used to achieve open¬ 
ing of channels includes lowering extracellular calcium 
concentration. It has been argued that “hemichannels” 
are formed by Cx43, although in most cases the possi¬ 
bility that other channels (including pannexinl) may be 
responsible has not been rigorously excluded. In fact, 
laboratories that originally attributed dye uptake and 
ATP release to opening of Cx43 hemichannels (Bennett 
et al. 2003) have now indicated that at least part of the 
exchange is through pannexinl channels. 


What Opens and Closes Connexin and 
Pannexinl Channels? 

Most of the original studies identifying agents that close 
gap junction channels were done using invertebrate 
cells because the method used (measurement of electri¬ 
cal coupling) was facilitated by having large cells to 
work with (insect salivary glands, giant axons of cray¬ 
fish and earthworm). Some of the agents used for 
uncoupling experiments likely act through depolarizing 
the cells, a type of voltage dependence that is largely 
absent in mammalian gap junction channels. Thus, it 
should be appreciated that the uncoupling caused by 
elevation in intracellular calcium in salivary gland 
cells could be reversed by membrane hyperpolarization. 

Carbenoxolone and flufenamic acid block pannexinl 
channels at concentrations lower than those needed to 
block connexin channels, whereas mefloquine is much 
more pofent on pannexinl than on any of the connexins 
(Bruzzone et al. 2005; Iglesias et al. 2009). There is 
literature indicating blockade of pannexinl channels by 
agents that activate P 2 X 7 receptors, which would be 
a protective effect against positive feedback on channel 
opening (Qiu and Dahl 2009) (Fig. 2). 

Voltage dependence is a prominent feature of gap 
junctions. In invertebrates the prominent voltage 
dependence is between the inside and outside of 
the cell, likely sensed across the channel wall. 
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Gap Junction Proteins (Connexins, Pannexins, and Innexins), Table 1 Tissue distribution and effects of deficiencies of 
connexins in mouse and man 

Gene/protein 

Distribution 

Human disease 

Phenotype of connexin-null mutant mouse 

G,Mi/Cx43 

Most tissues, esp. heart 

Oculodentodigital dysplasia (ODDD) 

Cardiac malformation (postnatal lethality) 

GJA3 /Cx46 

Lens 

Zonular pulverulent cataract-3 
(CZP3) 

Zonular nuclear cataracts 

GJA4ICx31 

Endothelium 

Predisposition to arteriosclerosis 

Female sterility 

G.M5/Cx40 

Heart, endothelium 

Atrial fibrillation 

Cardiac malformations 

GJA6/Cx33 

Testis 

No human orthologue 

Gja6ICx33 in mouse 

GJA8/Cx50 

Lens 

Zonular pulverulent cataract-1 
(CZPl) 

Microphthalmia, congenital cataracts 

GM9/Cx59 

Testis 

d 

No mouse orthologue 

GJA10ICx62 

Heart 

d 

Mouse orthologue is Cx57, horizontal cells 
not coupled 

GJB1/Cx32 

Liver, myelin 

X-linked Charcot-Marie-Tooth 
disease (CMTX) 

Late-onset neuropathy, liver 
carcinogenesis 

G./B2/Cx26 

Liver, skin, cochlea 

Deafness (DFNA3, DFNBIA), skin 
disease® 

Lethality at ED 11 

G./Bi/Cx31 

Skin 

Deafness'’, skin disease*’ 

Transient placental dysmorphogenesis 

GJB4ICx30.3 

Skin 

Erythrokeratodermia variabilis 
(EKV) 

Reduced vanilla scent, no hearing 
anomalies 

G./B5/Cx31.1 

Skin 

d 

Impaired placental development 

GJB6/Cx3Q 

Skin, brain, cochlea 
hydrotic Ectodermal 
dysplasia (Clouston 
syndrome) 

Autosomal and digenic (with GJB2) 
deafness 

Hearing insufficiency, no skin phenotype 

GJB7ICx25 

Placenta 

d 

No mouse orthologue 

GJCHCxAS 

Heart, neurons 

d 

Lethality at ED 10.5 

GJC2ICxAl 

Brain 

Hypomyelinating leukodystrophy 2 
(HLD2) 

Vacuolation of nerve fibers 

G./C5/Cx30.2 

Brain, others 

d 

Cx29 in mouse; myelinating cells 

GJD2/Cx36 

Neurons, fi cells 

Predisposition to juvenile myoclonic 
epilepsy 

Night blindness 

G./Di/Cx31.9 

Heart, neurons 

d 

Mouse 30.3; decreased AV conduction 
time 

G./D4/Cx40.1 

Various 

d 




G/£/Cx23 


d 

d 





“Palmoplantar hyperkeratosis, keratitis-ichthyosis deafness (KID), Vohwinkel syndrome 
'’Autosomal dominant 2B, autosomal recessive, and digenic (with GJB2) deafness 
‘'Erythrokeratodermia variabilis et progressive (EKV) 

‘'Unknown disease or tissue expression pattern 


Depolarization of one or both cells reduces junctional 
conductance. In vertebrate gap junction channels, 
there is a minor component, if any, to inside-outside 
voltage dependence which rather involves sensing 
the voltage through the gap junction channel between 
the cytoplasm of one cell with that of the other (see 
del Corsso et al. 2006). For certain connexins, 
the voltage dependence is rather steep, such that 
sustained uncoupling can occur if there is a prolonged 
transjunction voltage and a brief triggering event 
(Spray et al. 1981). Early modeling studies predicted 


the voltage sensitivity of the gates on each side of the 
gap junction and studies expressing one connexin in 
one cell and another in the other have confirmed such 
independent gates. Moreover, studies on Cx46 and 
Cx50 indicate that hemichannels have gating as 
predicted from the modeling experiments. Voltage 
dependence of pannexinl is such that under normal 
conditions of extracellular ions, channel opening 
begins at about 0 mV, but when extracellular K''' 
is raised activation threshold voltage is lowered 
(Silverman et al. 2009). 
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Gap Junction Proteins (Connexins, Pannexins, and 
Innexins), Fig. 2 Gating mechanisms of connexin channels. 
(a) Junctional current (Ij) recorded from one cell of a pair 
coupled via Cx43 gap junction channels obtained by applying 
transjunctional voltage steps, (b) Voltage dependence of Cx43 
gap junction channels showing that maximal conductance occurs 
when transmembrane potential is 0 mV and that conductance 


decreases at potential above 80 mV to a level that corresponds 
to the subconductance state of the Cx43 gap junction channel, 
(c) Total closure of gap junction channels is shown to occur 
following intracellular acidification by CO 2 application, (d) The 
pH dependence of Cx43 gap junction channels is illustrated. At 
physiological pH (7.2) gap junction conductance is maximal and 
at intracellular pH below 6.5 conductance is zero 


Finally, there are other mechanisms hy which 
pannexinl channels may be activated. First of all, it 
appears to be coupled to P 2 X 7 receptors forming the 
highly permeable pore that is opened by prolonged 
exposure to high concentrations of ATP or specihc 
agonists. Second, the channel may be opened by rais¬ 
ing intracellular calcium levels and finally, it may be 
opened by mechanical stretch. Voltage dependence 
in the invertebrate gap junction channels generally 
displays sensitivity to absolute potential (inside out) 
such that depolarization will close the channel. 
Examples in which this has been quantitatively exam¬ 
ined include Drosophila and midge salivary glands and 
crayfish. It is noteworthy that such depolarization 
could have been one mechanism by which calcium 
produced the uncoupling action that was thought to 
be a major control mechanism for gap junctions; it 
was shown that uncoupling by calcium could be 
reversed by hyperpolarization (Obaid et al. 1983) 


Perhaps the mechanism of gap junction closure with 
most relevance is that of intracellular acidification. All 
connexins display sensitivity to low intracellular pH 
where the apparent pK ranges from near physiological 
(7.3 or so) to as low as 5.8. For certain connexins, 
therefore acidification by only a few tenths of a pH 
unit may substantially reduce the number of open 
channels. Whereas channel closure in response to 
transjunctional voltage occurs via transition from 
a main state to a substate, closures in response to 
acidification are all or none (see Spray et al. 2002). 

The notion that junctional conductance could be 
decreased by elevated intracellular calcium was initi¬ 
ated by the finding that high calcium levels could cause 
cells to seal off from their neighbors following damage. 
This was followed by demonstration that injection of 
high concentrations of calcium into cells could uncou¬ 
ple. However, these early experiments did not consider 
secondary effects on pH or voltage gating and it is now 
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known that the calcium levels necessary to obtain 
uncoupling are likely very high (see Obaid et al. 1983). 


What Do Connexin and Pannexin 
Channels Do? 

It has long been appreciated that current flow through 
gap junctions in working myocardium mediates 
synchronous cardiac contractions, that gap junctions 
between neurons mediate rapid electrical transmission, 
that gap junctions in lens fibers facilitate transparency 
through efflux of metabolites, and that gap junctions in 
exocrine and endocrine glands facilitate synchronous 
secretion. Results from mutagenesis studies in mice 
and from human genetic analysis indicate that dysfunc¬ 
tion and disease accompanying reduced or altered gap 
junction function is widespread (see Table 1, modified 
from Dobrowolski and Willecke 2009, by inclusion of 
recent data obtained from OMIM [Online Mendelian 
Inheritance in Man]), varied, and, in many cases, totally 
unexpected. Mutation in Cx32, the major gap junction 
protein of liver and peripheral myelinating glia 
(Schwann cells), results in a progressive peripheral neu¬ 
ropathy, X-linked Charcot Marie tooth disease. Muta¬ 
tions in either of the lens gap junction proteins, Cx50 or 
Cx46, result in cataract formation, the difference being 
that Cx50 mutations/deletion leads to growth defects as 
well. Indications in the major gap junction protein of 
myelinating glia in the CNS, Cx47, also lead to demy- 
elinating diseases as well as the recently discovered 
lymphedema and mutations in Cx43 are now known to 
underlie ODDD (Pfenniger et al. 2011). 

Role of Connexin and Pannexin Channels in 
Calcium Wave Spread Among Astrocytes 

Intercellular communication can occur as a result of 
direct communication of intracellular signals through 
gap junction channels or through the release of transmit¬ 
ter molecules from one cell which diffuses across 
intracellular space, binding to receptors on another cell 
that respond by opening an ion channel or initiating 
a second messenger cascade (Scemes and Giaume 2006). 
Connexins in vertebrates and innexins in invertebrates 
are the only proteins known to form intercellular 
channels. They are permeable to molecules up to a 
molecular weight of about 1,000 permitting passage of 
ions and signaling molecules such as cAMP, IP 3 , and 


ATP. One impressive example of how gap junctions 
allow signaling between cells is the phenomenon of 
“calcium waves” where IP 3 liberated within one cell 
diffuses to an adjacent cell, evoking calcium release in 
this cell. Such waves generally have velocities on the 
order of 10-20 pm/s and passage can be to considerable 
distance from the site of initiation. An additional pathway 
of signaling between cells which can also play a role in 
calcium spread involves release of ATP from one cell 
and reception by a purinergic receptor in an adjacent cell. 
Pathways of ATP release can be pannexin 1 channels 
or vesicular release. Based on studies evaluating the 
contribution of the two pathways involved in the trans¬ 
mission of calcium waves, the emerging concept is 
that connexin gap junction channels define communica¬ 
tion compartments within a tissue, whereas pannexin 
channels provide amplification of signaling both within 
and between compartments (Scemes et al. 2007). 
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Synonyms 

Electron transfer 


Definition 


Basic Characteristics 

Gated electron transfer occurs when a relatively slow 
adiabatic (non-electron transfer) reaction precedes 
a faster nonadiabatic (true electron transfer) reaction, 
and the slower reaction is required to activate the 
system for electron transfer. If the rates of these two 
sequential reactions cannot be kinetically distin¬ 
guished, then the observed rate constant (/robs) for the 
redox reaction will not be the true electron transfer rate 
constant (^et)- Instead it will be the rate constant for 
the preceding slower reaction step. Gated electron 
transfer could occur in a bimolecular electron transfer 
reaction in which some rearrangement of reactants 
must occur after initial binding to optimize the system 
for electron transfer (Eq. 1), or it could occur during an 
intramolecular electron transfer if two or more states of 
the system exist with only one optimized for electron 
transfer (Eq. 2). In these schemes A and B are the 
reactants in the oxidized (ox) and reduced (red) states. 
The prerequisite adiabatic reaction step has a forward 
rate constant k,^, and since k^< < ^et then ^obs = ^x- 

Ka ^x(slow) /:£r(fast) 

AqxTB red ' Aox/Bred ^ [Aox/Bred]^ ^ Ajed/Box (f) 
k-x k-ET 

^x(slow) ^£j-(fast) 

Aox/Bred — [Aox/Bred]* — Ared/Box (2) 
k-X k^ET 

True electron transfer reactions are described by 
► electron transfer theory (Eq. 3) in which AG° is the 
driving force for the electron transfer reaction, X is the 
reorganization energy, and //ab is the electron cou¬ 
pling matrix element (Marcus and Sutin 1985). Since 
the rate constant for a gated electron transfer reaction 
{kjt) is unrelated to ^et, the observed rate constant for 
the gated reaction will not be dependent upon AG° for 
the redox reaction (i.e., the ► redox potential differ¬ 
ence of the redox centers). This distinguishes gated 
electron transfer reactions from true and ► coupled 
electron transfer reactions, the rates of which will 
vary with AG° since the observed rate constant is 
equal to k-grt:, or proportional to /:et> respectively. 


Gated electron transfer is a process in which a rela¬ 
tively slow non-electron transfer reaction is required to 
activate the system for a much faster true electron 
transfer reaction. 


kET — 




0.5 


exp 


-{^G° + if /AXKI 


( 3 ) 
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Conformationally Gated Electron Transfer 

The term “gated electron transfer” was coined by 
Hoffman and Ratner (1987) in the context of electron 
transfer reactions, the observed rate of which was 
controlled by conformational interconversion. Elec¬ 
tron transfer reactions in biological systems, especially 
those between proteins, may be gated by either a pro¬ 
tein conformational change or rearrangement of pro¬ 
teins relative to each other within an electron transfer 
complex. These structural changes occur prior to elec¬ 
tron transfer and optimize the positions of the redox 
centers, their geometries, and the intervening protein 
matrix for the electron transfer event. In conforma¬ 
tionally gated electron transfer, the effect of the pre¬ 
requisite activation reaction is primarily to optimize 
Hab for the electron transfer reaction. 

Chemically Gated Electron Transfer 

Electron transfer reactions may be gated by chemical 
adiabatic reaction steps (i.e., making or breaking of 
bonds). In contrast to conformationally gated electron 
transfer reactions which typically optimize Hj^g, the 
prerequisite activation step for a chemically gated 
electron transfer reaction typically lowers the activa¬ 
tion energy (i.e., AG° + 1) for the electron transfer. In 
chemically gated electron transfer reactions, in the 
absence of the preceding adiabatic reaction, the elec¬ 
tron transfer reaction may still occur but at a relatively 
slow rate which is insufficient for biological activity. 
After the chemical activation reaction, kgj- may be 
increased by orders of magnitude. Thus, even though 
these reactions are gated by a reaction step that is 
slower than the activated optimized kgr, the overall 
rate of the chemically gated redox reaction is much 
greater than the rate of the “ungated” true electron 
reaction in the absence of chemical activation 
(Davidson 2002). 

Evidence for Gated Electron Transfer 

In the experimental systems used to study electron 
transfer reactions and the kinetic models used to ana¬ 
lyze these data, the reaction steps which precede 
electron transfer in gated reactions may be spectro¬ 
scopically invisible or otherwise undetectable events 
(Davidson 2008). The ideal approach to ascertain 
whether the observed rate of the redox reaction is 
or k^ (see Eqs. 1 and 2) is to determine whether or not 
the reaction rate varies with AG°. For a gated electron 
transfer reaction in which k^ < ^et> the reaction rate 


does not depend on AG° because is not rate- 
limiting. Unfortunately, for protein electron transfer 
reactions it is typically difficult and sometimes impos¬ 
sible to experimentally determine the dependence of 
the kgr on zl G°. This is because it is difficult to alter the 
redox potential of a protein-bound redox center with¬ 
out otherwise affecting the protein. In such cases, 
evidence for gated electron transfer may alternatively 
be obtained by examining the effects on kohs of varying 
solution conditions such as ionic strength or viscosity. 
True electron transfer rate constants should not vary 
but those of non-electron transfer reaction steps, par¬ 
ticularly those for conformationally gated reactions, 
may exhibit a dependence on such reaction conditions. 
It should be noted, however, that rates of coupled 
electron transfer reactions may also vary with solution 
conditions. For chemically gated electron transfer 
reactions the observation of a kinetic isotope effect 
on kahs, when using isotopically labeled substrates of 
redox enzymes, may be useful in indentifying chemi¬ 
cally gated electron transfer, as the activating chemical 
reaction but not the true electron transfer reaction 
could exhibit such an effect. If relatively slow proton 
transfer is required for activation then it may be possi¬ 
ble to observe a solvent kinetic isotope effect (Bishop 
and Davidson 1997). 

For true electron transfer reactions it is possible to 
determine the values of the electron transfer parame¬ 
ters "k and Hab from analysis of the temperature depen¬ 
dence of kohs for ths reaction. If the temperature 
dependence of the kohs for a gated electron transfer 
reaction is analyzed by Eq. 3, then one would obtain 
apparent values of a and //ab which do not describe 
^et- However, such an analysis may be of diagnostic 
use. When the temperature dependence of a rate con¬ 
stant for a gated reaction is analyzed by Eq. 3, the 
fitted value of will often exceed the nonadiabatic 
limit (Winkler and Gray 1992). In fact, obtaining such 
unreasonably large fitted values of //ab from analysis 
of the temperature dependence of an electron transfer 
reaction has been taken as evidence for gated electron 
transfer (Davidson 2000). This approach has also 
been used to document that alteration in the rate of 
an electron transfer reaction that results from site- 
directed mutagenesis of amino acid residues is 
sometimes the consequence of a change in kinetic 
mechanism in which the mutagenesis has caused 
a true electron transfer reaction to become gated 
(e.g., Ma et al. 2006). 
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Cross-References 

► Coupled Electron Transfer 

► Dynamic Docking 

► Electron Transfer Proteins: Overview 

► Electron Transfer Theory 

► Electron Transfer Through Proteins 

► Proton-Coupled Electron Transfer 

► Redox Potential 
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Gated Stimulated Emission Depletion 
Microscopy (g-STED) 
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^Unit of Dynamic Neuronal Imaging, Department of 
Neuroscience, Institut Pasteur, Paris, France 

Synonyms 

Time-gated stimulated depletion microscopy 

Definition 

Time-gated stimulated emission depletion (STED) 
microscopy, usually referred as g-STED microscopy. 


uses the time-dependent behavior of fluorescence to 
further enhance the effective spatial resolution of 
STED microscopy (Vicidomini et al. 2011). 

In a nutshell, the fluorescence on-off contrast (the 
suppression of spontaneous emission) depends not only 
on the average STED intensity but rather on the amount 
of STED photons a fluorophore sees (on average) before 
a possible spontaneous emission. Based on this insight, 
the fluorescence on-off contrast, that is critical to 
attaining sub-diffraction resolution and ensures that 
fluorescence light is recorded mainly from fluorophores 
from the doughnut center (zero-intensity point), can 
be enhanced by detecting fluorescence only from 
fluorophores that have been exposed to the STED 
beam for at least a certain time-delay Tg after excitation. 

Another formulation is that the STED reduces the 
average time that a fluorophore spends in the excited 
state, i.e., its effective excited-state lifetime x is 
determined by the rate of stimulated emission, which 
increases linearly with the STED intensity. Conse¬ 
quently, in the doughnut-shaped STED pattern, 
the excited-state lifetime of a fluorophore changes 
according to its position. In particular, the excited- 
state lifetime decreases away from the doughnut center, 
reaching a minimum in the proximity of the maximum 
STED intensity (doughnut crest). This excited-state life¬ 
time signature can be used to reject photons emitted by 
short-lived fluorophores in the periphery and thus to 
select only those photons emitted by the long-lived 
fluorophores located close to the doughnut center. 

Selecting photons emitted only after a certain time- 
delay Tg from the excitation can be easily implemented 
using pulsed excitation beam in combination with 
time-gated detection. 

Importantly, g-STED can be integrated both in 
STED systems relying on pulsed and continuous 
wave (CW) STED beams. However, in the all-pulsed 
STED microscopy implementation the pulse width of 
the STED beam (f« 100-300 ps) is, in most of the case, 
much shorter than the unperturbed (no STED action) 
excited-state lifetime Xfj of the fluorophore (« 1^ ns), 
thereby, in general, time-gated detection hardly 
improves the effective spatial resolution of the all- 
pulsed STED modality. Roughly speaking, the proba¬ 
bility that a fluorophore spontaneous emits before all 
the STED photons (bounced in the short STED pulse) 
arrive is negligible. On the other hand, time-gated 
detection is extremely important for the CW-STED 
implementation. In this case, the STED photons are 
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spread over time and thereby, for a given STED aver¬ 
age intensity, the peak intensity and the instantaneous 
probability of stimulated emission is rather low com¬ 
pared to the all-pulse STED implementation. As a 
consequence, a nonnegligible part of the fluorophores 
emits fluorescence before having been exposed to 
much of the STED light. In terms of imaging, those 
early photons result in a residual fluorescence outside 
the doughnut center and in a reduction of the effective 
spatial resolution. Time-gated detection removes such 
early fluorescence and strongly confines the effective 
fluorescence probing area. 

Time-gated detection also rejects “desired” pho¬ 
tons, namely, those that are emitted during Tg from 
the doughnut center. Therefore, the increase in the 
effective resolution has to be pondered against the 
reduction in signal. Even so, it has been shown that 
for typical parameters, time-gated detection combined 
with CW-STED (namely, gCW-STED) greatly 
reduces the STED intensity demand to achieve sub¬ 
diffraction resolution. This makes gCW-STED highly 
appealing for live cell imaging, where lower light 
irradiation avoids potential phototoxic effects. 


Cross-References 

► Stimulated Emission Depletion (STED) Microscopy 
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Gating of Potassium Channels by Cyclic 
Nucleotide Binding 

Joao Henrique Morais-Cabral 
Instituto de Biologia Molecular e Celular, 
Universidade do Porto, Porto, Portugal 


Synonyms 

Cyclic nucleotides; Ion channels 


Definition 

Cyclic nucleotides (cAMP or 3'-5'-cyclic adenosine 
monophosphate; cGMP or 3'-5'-cyclic guanosine 
monophosphate) are signaling molecules that regulate 
the function of many different proteins including 
kinases, transcription factors, and ion channels. All 
these proteins share a common feature: Binding of 
the ligand to a cyclic nucleotide binding (CNB) 
domain induces a conformational change in the 
domain which alters the activity of the effector domain 
or protein. 


Basic Characteristics 

The superfamily of tetrameric cation channels (which 
includes potassium, sodium, and calcium channels) 
includes proteins that are regulated by cyclic 
nucleotides. Among these is a family of bacterial 
cyclic nucleotide regulated potassium channels 
(Nimigean et al. 2004) as well as the eukaryotic CNG 
(cyclic nucleotide gated) (Craven and Zagotta 2006; 
Mazzolini et al. 2010) and HCN (hyperpolarization- 
activated cyclic nucleotide gated) channels (Biel 
et al. 2009; Craven and Zagotta 2006). These channels 
are all tetrameric, with each subunit encompassing six 
transmembrane helices, a diverse N-terminal cytoplas¬ 
mic region and a C-terminal cytoplasmic cyclic 
nucleotide binding (CNB) domain. Other channels 



Gating of Potassium Channels by Cyclic Nucleotide 
Binding, Fig. 1 Structure of MlotiKl CNB domain; P-roll is 
shown in gray, aA in cyan, aB and aC in green, and C-linker in 
red. cAMP molecule in binding pocket is shown as a stick model 
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Gating of Potassium Channels by Cyclic Nucleotide 
Binding, Fig. 2 Ligand-binding pocket, (a) View of some 
residues in the PBC, glutamate, and arginine that interact with 
the ribose and phosphate moieties of cAMP; (b) view of residues 

have these same sequence characteristics, for exam¬ 
ple, the EAG (ether-a-go-go) potassium channels, but 
appear not to be regulated by cyclic nucleotides 
(Brelidze et al. 2009). 

The MlotiKl bacterial potassium channel has been 
a useful system for understanding structural aspects of 
cyclic nucleotide gating (Altieri et al. 2008). The CNB 
domain in MlotiKl is formed by 110 residues which 
adopt a very characteristic structural fold (Fig. 1). A set 
of eight P-strands assume a P-roll fold and form 
a shallow ligand-binding pocket. On the surface of 
the P-roll there are three helices which are important 
in the ligand-induced conformational change: aA helix 
is just before the first strand, while aB and aC are 
positioned after the last p-strand. The connection of 
the domain to the last transmembrane helix of the 
channel, and to the channel gate, is done through 
a stretch of residues known as the C-linker. Here lies 
one of the crucial differences between bacterial 
cyclic nucleotide regulated channels and their eukary¬ 
otic counterparts. In the MlotiKl channel the linker is 
^20 residues long, forming two helices. In contrast in 
HCN and CNG channels the C-linker is much 
longer, ~80 residues (Craven and Zagotta 2006). 
The functional implications of this difference are 
discussed below. 

There are several key residues in the cyclic nucleo¬ 
tide binding pocket (Fig. 2). The Phosphate Binding 
Cassette (PBC), between P-strands 6 and 7, includes 
the highly conserved sequence Phe-Gly-Glu, with the 
glutamate side-chain interacting with ligand’s ribose 
moiety. The PBC also includes the highly conserved 



that interact with the base of cyclic nucleotide, in particular the 
arginine in the aC helix. Dotted lines indicate interaction 
between atoms of residues and ligand 


arginine which neutralizes the ligand’s phosphate 
group. At the other end of the ligand-binding site, the 
aC helix acts as a lid and an arginine from the helix 
extends over one of the faces of the nucleotide base, 
establishing Van der Waals contacts with the ligand 
and hydrogen bonding the glutamate on the PBC. In 
addition, two or three residues from the hairpin formed 
by P-strands 5 and 6 interact with the other face of the 
nucleotide base. The determinants of nucleotide selec¬ 
tivity, cAMP versus cGMP, in channel CNB domains 
have been explored (Craven and Zagotta 2006) but 
are far from being fully understood since residues 
that are implicated in selectivity in a specific channel 
do not explain selectivity in other related proteins 
(Cukkemane et al. 2010). 

Ligand-Induced Conformational Changes 

The conformational change induced by ligand has been 
well described by a series of MlotiKl CNB domain 
structures in the bound and unbound states (Fig. 3) 
(Altieri et al. 2008). Upon ligand binding the pocket 
collapses around the ligand, and the interactions 
described above are established; the three helices of 
the domain are rearranged, and these changes are 
extended to the helices of the linker. In particular, the 
aC helix takes its position as the lid of the binding 
pocket by displacing the C-terminal end of the C-linker 
and consequently altering the position of the linker 
itself. Note that this overall rearrangement is also 
observed in the CNB domain from protein kinase 
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Gating of Potassium Channels by Cyclic Nucleotide 
Binding, Fig. 3 Superposition of MlotiKl CNB domain in 
the bound and unbound states. Color scheme is identical to the 
one used in Fig. 1 ; different tones of color are used to distinguish 
the same helix in different states, with more vivid tones indicat¬ 
ing unbound state. Arrows indicate the rearrangement of helices 


A (PKA) but not in the catabolite gene activator 
protein (CAP) (Sharma et al. 2009). PKA is, therefore, 
a better model than CAP to understand cyclic nucleo¬ 
tide activation in channels. 

The differences seen between the C-linkers of the 
bacterial and eukaryotic channels are reflected in the 
functional properties of these channels. In MlotiKl, 
it has become apparent that nucleotide activation 
is noncooperative and that structural and thermody¬ 
namic coupling between the conformational changes 
in the domain and channel gate is probably weak 
(Cukkemane et al. 2010). The X-ray and EM structures 
of the full-length MlotiKl channel suggest that the 
CNB domains do not interact directly with each other 
and that these C-linkers act just as a connector to the 
gate region; functional and biochemical data strongly 
support the idea of independence between domains. In 
contrast, in HCN and CNG channels coupling between 
CNB domains and coupling of CNB domains with 
other channel regions are important mechanistic 
features (Craven and Zagotta 2006; Cukkemane 
et al. 2010). The 80 residues composing the C-linker 
form an intricate helical arrangement (aATo aF') in 
the structures of CNB domains from mouse HCN2, 
human HCN4, and sea urchin SpIH (Fig. 4). A striking 
feature is the ring formed by the C-linkers from 
four CNB domains, with inter-subunit contacts 
established in an “elbow-on-shoulder” arrangement; 
no inter-subunit contacts are observed at the level of 
the CNB domains. Importantly, although the CNB 
domains in these structures are in the bound state 
(corresponding to a channel open state), it has been 


G 



Gating of Potassium Channels by Cyclic Nucleotide 
Binding, Fig. 4 CNB domain ring from FICN channel, (a) 
View, from the membrane, of the gating ring formed by the 
C-linker_CNB domain. The C-linkers from each subunit are 
shown in different colors. CNB domains are shown in the 



background in gray, (b) Side view of the ring. Each subunit is 
shown in a different color. C-linkers are shown in more vivid 
color tone. Flelices of linker are labeled. Arrows indicate the 
“elbow-on-shoulder” contacts established between neighboring 
linkers 
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demonstrated that the C-linker ring region has adopted 
a closed state conformation (Craven and Zagotta 2006). 
Nevertheless, it is thought that ligand-induced changes, 
as described above, are propagated to the ring probably 
destabilizing it and leading to channel opening. Aboli¬ 
tion of inter-subunit contacts present in the C-linker ring 
structure of HCN2 shifted the equilibrium toward the 
open channel (Craven and Zagotta 2006). It must be 
noted that the activation gates appear to be different in 
HCN and CNG channels; in HCN channel, the gate is 
thought to be at the intracellular crossover of S6 like in 
Shaker, while in CNG channels it is the changes in the 
pore loop or selectivity filter which result in channel 
gating (Mazzolini et al. 2010). 

The mechanism of cyclic nucleotide channel gating 
is still unclear but new approaches are opening 
up interesting possibilities. For example, in HCN2 
channels monitoring of channel currents and of ligand 
binding has revealed coupling between ligand binding 
and events prior to channel opening, most likely 
voltage sensor movement (Kusch et al. 2010), while 
new short distance FRET probes have allowed moni¬ 
toring of changes within the C-linker ring (Taraska 
et al. 2009). 

Cross-References 

► Calcium Activation of Channels: RCK Domains 

► Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET 

► HCN Channels: Biophysics and Eunctional 
Relevance 

► N-Type Inactivation in Voltage-Gated Potassium 
Channels 

► Potassium Channel Selectivity and Gating at the 
Selectivity Eilter: Structural Basis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Signal Transduction at the Intracellular Surface of 
the Membrane-Integral G-protein-Coupled 
Receptors 
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Synonyms 

Bacterial glutamate transporter homologue; Bacterial 
sodium/aspartate symporter; Excitatory amino acid 
transporter 


Definition 

Glutamate transporters, and their archaeal homologue, 
Gltph, are ► membrane proteins, which belong to the 
Dicarboxylate/Amino Acid: Cation (Na"^ or H"^) 
Symporter (DAACS) Family (2.A.23, Saier TCDB), 
also designated in eukaryotes as Solute Carrier Family 1 
(SLCl) (Saier et al. 2006). There are currently over 
1,000 sequenced members of the family derived from 
bacteria, archaea, and eukaryotes. The human glutamate 
transporters, also known as excitatory amino acid trans¬ 
porters (EAATs), are responsible for the concentrative 
uptake of glutamate and aspartate and are driven by the 
electrochemical potential of sodium, potassium, and 
hydrogen ions. Gltph is a transporter from the hyperther- 
mophilic archaebacterium Pyrococcus horikoshii and 
catalyzes concentrative uptake of aspartate driven by 
the electrochemical potential of sodium ions. 


Introduction 

The characterized members of the DAACS family 
catalyze Na"^ and/or symport together with a 
Krebs cycle dicarboxylate such as malate, succinate. 


or fumarate; a dicarboxylic amino acid glutamate or 
aspartate; or a small neutral amino acid. The general¬ 
ized transport reaction catalyzed by members of the 
DAACS family is: 


Substrate (out) + nM+(out) ^ Substrate(in)-|- nM+(in) 


where M"^ stands for or Na"^. However, the coupling 
mechanisms generally differ between family members. 
The mammalian EAATs comprise hve subtypes, 
which in humans are termed EAATl-5. Extensively 
studied homologues in other species are GLASTl (rat 
equivalent to EAATl), GLTl (rat equivalent to 
EAAT2), and EAACl (rabbit equivalent to EAATS). 
This family also contains two mammalian neutral 
amino acid exchangers termed ASCTl/2 and many 
prokaryotic members including the aspartate transporter 
from Pyrococcus horikoshii (Gltpi,) whose three- 
dimensional structure is known and discussed below. 


G 


Physiological Significance 

Glutamate is the major excitatory amino acid in the 
mammalian central nervous system (CNS) and medi¬ 
ates the transmission of information at most excitatory 
synapses. For neurons to transmit information effi¬ 
ciently and to avoid neurotoxicity, extracellular gluta¬ 
mate levels need to be tightly regulated. This role is 
carried out by the EAATs which, in the CNS, are 
located on neurons and glia surrounding synapses 
(Fig. la). The dysfunction or downregulation of the 
EAATs leads to an increase in extracellular glutamate 
and has been implicated in pathologies associated with 
Alzheimer’s disease, motor neuron disease, and ische¬ 
mia following stroke (Danbolt 2001; Beart and O’Shea 
2007). Glutamate transporters are also expressed in 
most other organs in the body and the main glutamate 
transporter in the kidney, EAATS, is responsible for 
the reabsorption of acidic amino acids from the glo¬ 
merular filtrate. Dicarboxylic aminoaciduria (DA) is 
a metabolic disease characterized by excessive excre¬ 
tion of glutamate/aspartate in urine and two loss-of- 
function mutations have been identified in SLClAl, 
the gene that encodes EAATS, in patients with DA 
(Bailey et al. 2011). The closely related neutral 
amino acid exchangers ASCTs mediate Na"^- 
dependent electroneutral exchange of neutral amino 
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Glutamate Transporter Family, Fig. 1 Glutamatergic syn¬ 
apse and stoichiometry of transport via the EAATs and Gltp^. 
(a) Cartoon of a glutamatergic synapse. Glutamate {black dots) 
is stored in synaptic vesicles in the presynaptic terminal. Upon 
release, glutamate diffuses across the synaptic cleft where it 
binds to and activates postsynaptic ionotropic (red) or 
metabotropic (green) glutamate receptors. Glutamate is cleared 
from the synapse by the EAATs (blue) located in glial 
cells (EAATl/2) or the postsynaptic neuron (EAAT3/4). 
EAATS is localized to the retina and not shown in this diagram, 
(b, c) Substrate specificity and coupling stoichiometry of EAATs 
and Gltph, respectively 

acids (Zerangue and Kavanaugh 1996a). ASCT2 is 
known to be upregulated in some cancers and is 
thought to act in concert with other amino acid trans¬ 
porters to maintain the large nutrient requirements of 
rapidly dividing tumor cells (Fuchs and Bode 2005). 
Remarkably, in addition to glutamate transport, the 
EAATs also mediate a glutamate-gated anion conduc¬ 
tance, which resembles an ion channel and under 


physiological conditions likely allows for chloride 
influx into cells (Fairman et al. 1995). This influx 
may lead to hyperpolarization of neurons and play an 
important role in regulation of neuronal excitability, 
particularly in retina (Veruki et al. 2006). 


The Stoichiometry of Transport 

The EAATs are secondary active transporters that 
require energy to pump glutamate against a steep elec¬ 
trochemical gradient into cells. This energy is derived 
from coupling glutamate transport to the movement of 
ions and by exploiting the preexisting Na"^ and 
gradients across the cell membrane that are maintained 
by the ► Na'^/K'^-ATPase. The transport of glutamate 
is coupled to the cotransport of 3 Na"^ ions and 1 and 
the counter-transport of 1 ion which is thought to 
facilitate reorientation of the transporter (Zerangue and 
Kavanaugh 1996b) (Fig. lb). This results in a net 
transfer of two positive charges into the cell per trans¬ 
port cycle. Theoretically, this coupling allows the 
transporters to maintain a 10°-fold gradient of gluta¬ 
mate across the membrane. Indeed, recent estimates 
predict the resting extracellular concentration of glu¬ 
tamate in the brain is as low as 25 nM (Herman and 
Jahr 2007), while the intracellular glutamate concen¬ 
tration is in the millimolar range. The characterized 
prokaryotic homologues demonstrate simpler cou¬ 
pling. For example, Gltph couples the transport of 
each substrate molecule to the symport of 3 Na"^ ions, 
but does not couple to the movements of other ions 
(Fig. Ic) (Ryan et al. 2009; Groeneveld and Slotboom 
2010). Other bacterial homologues are driven only by 
gradients (Slotboom et al. 1999). 

The chloride conductance mediated by the EAATs 
requires binding of Na"^ and substrate but is thermody¬ 
namically uncoupled from their transport. This means 
that once the Cl~ conductance is activated, its rate and 
direction is independent of glutamate transport. This 
Cl~ conductance has been observed in all five of the 
human glutamate transporters and also in the prokary¬ 
otic homologue Gltph suggesting that this process is 
evolutionarily conserved (Ryan and Mindell 2007). 
The substrate-activated anion conductance of the 
EAATs displays a chaotropic selectivity sequence 
where larger anions are more permeant (F“ < Cl“ 
< Br“ < r < NOs" < SCN“). The different EAAT 
subtypes exhibit differences in the proportion of the 
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current carried by anions compared to the current car¬ 
ried by the coupled transport process. For example, the 
conductance of EAAT2/3 is mostly due to coupled 
transport while the conductance of EAAT4/5 is mostly 
due to the uncoupled Cl~ conductance. The relative 
contribution of coupled transport and the uncoupled 
Cl“ conductance for EAATl is roughly equal. In the 
case of EAAT5, the rate of transport is so slow that it is 
thought to act almost like a glutamate-gated Cl“ chan¬ 
nel (Veruki et al. 2006). Eor a review of the different 
conductance states of the EAATs, (see Vandenberg 
et al. 2008). 

The Aspartate Transporter from Pyrococcus 
Horikoshii 

Gltph is an EAAT homologue from a hyperthermo- 
philic archaebacterium Pyrococcus horikoshii, and 
shows approximately 35% amino acid sequence iden¬ 
tity to the EAATs. Gltpi, has been identified as a 
structurally stable protein, which provides a conve¬ 
nient model system to study the structural underpin¬ 
nings of glutamate transporter function. In marine 
P. horikoshii, Gltph is likely responsible for the 
concentrative transport of the nutrient aspartate. 
P. horikoshii resides in the hydrothermal vents under 
the Sea of Japan, where the temperatures are near 
100°C and the pressure is around 200 atm, and hence 
in vitro analysis of Gltpi, under physiological condi¬ 
tions is not feasible. Instead, Gltph has been purified to 
homogeneity and reconstituted into proteo-liposomes, 
which were than assayed at ambient temperatures. 
These assays established the specihcity and the stoi¬ 
chiometry of Gltph (Boudker et al. 2007; Ryan et al. 
2009; Groeneveld and Slotboom 2010). The transport 
rate is slow with approximately 1 turnover per 3 min, 
but is steeply temperature dependent, suggesting that 
at 100°C, the turnover time may be in millisecond time 
range, similar to EAATs. 

The Structure of GItph 

The first crystal structure of Gltph was determined in 
2004 (Fig. 2a) and showed that Gltph, like other 
characterized glutamate transporters, assembles into 
a trimer (Yernool et al. 2004). The trimer forms 
a large extracellular bowl, lined by polar amino acids 
and reaching approximately half way across the lipid 
membrane. Overall, the structure of each Gltph 
protomer can be divided into two domains: the cen¬ 
trally located trimerization domain and the peripheral 


transport domain (Fig. 2). The trimerization domains, 
comprising transmembrane helices (TMs) 1, 2, 4, 5, 
mediate all inter-subunit interactions, while the trans¬ 
port domains coordinate aspartate and Na"^ ions 
(Fig. 2c). The transport domains consist of four TMs: 
3, 6, 7, and 8, and two reentrant helical hairpins (HPs): 
an intracellular HPl and an extracellular HP2 (Fig. 2d). 

The substrate-binding site is located immediately 
underneath the tips of HPl and HP2 (Figs. 2c and 3). 
The majority of the residues that are involved in coor¬ 
dination of substrate originate from either HP2, which 
forms the front of the binding site or from TM8, which 
forms its back. Many of them have been implicated in 
substrate binding from mutagenesis studies on the 
EAATs, where mutations of these residues led to 
loss-of-function proteins. In particular, an arginine 
residue in TM8 that was predicted to interact with the 
y-carboxylate of glutamate in EAATl is seen to inter¬ 
act with the P-carboxylate of aspartate in the Gltph 
structure. Interestingly, the equivalent residue in the 
neutral amino acid transporters ASCTl/2 is a threonine 
or cysteine residue respectively. Mutation of this argi¬ 
nine residue to a neutral amino acid in EAAT3 results 
in a transporter that no longer transports acidic amino 
acids but instead acts as an electroneutral exchanger of 
neutral amino acids (Bendahan et al. 2000). 

The resolution of the Gltph structures is not high 
enough to identify Na"^ ions bound to the protein; so the 
heavy atom thallium (TE) was used to probe cation¬ 
binding sites on Gltp^. Two TE-binding sites were 
identified and competition studies revealed that they 
were selective for Na"^ over K"^, suggesting that they 
were indeed Na’^-binding sites. The first, Nal site, is 
formed by elements of TM7 and TM8 and sits 
below the bound aspartate while the second, Na2 site, 
is formed by HP2 and TM7 (Eig. 3). Interestingly, 
similarly to other Na'^-coupled transporter such 
as LeuT-fold transporters, ► Na’^/K'^-ATPase, and 
Na'^/H'^ exchanger, the ion-binding sites are adjacent 
to non-helical elements within TMs. The main chain 
carbonyl oxygen atoms of the extended region of 
TM7 and the carboxyl terminal end of the first HP2 
helix provide most of the metal coordination in sites 
Nal and Na2. A single side chain of aspartate 405 
in TM8 coordinates Na"^ in Nal site. The mutation of 
this aspartate residue to an asparagine in Gltph reduces 
the coupling of Na"^ to aspartate binding and also 
leads to the loss of this binding site in the crystal 
structure (Boudker et al. 2007). The location of the 
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Glutamate Transporter Family, Fig. 2 The structure of 
an aspartate transporter from Pyrococcus horikoshii (Gltph). 
(a, b) Surface representation of Gltph trimer in the outward and 
inward facing states. The trimerization domain is colored wheat 
and the transport domains are colored blue. HPl and HP2 are 


yellow and red, respectively, (c) Structure of the outward facing 
Gltph trimer is tilted toward the viewer to reveal the surface of the 
extracellular bowl. The bound substrate and Na"^ ions are shown in 
ball representation, (d) The topology of the transporter protomer. 
Two inverted structural repeats are shaded light pink and blue 


third sodium ion in Gltpn is still ambiguous and sev¬ 
eral sites have been proposed through site-directed 
mutagenesis experiments and Molecular Dynamics 
(MD) simulations. One of the proposed sites is 
defined by residues asparagine 310 and aspartate 
312, which are part of a highly conserved sequence 
motif (NMDGT) in the non-helical region of TM7 
(Fig. 3b). Binding of Na"^ ions and L-aspartate 
to Gltph are tightly thermodynamically coupled, 


resulting in a steep dependence of substrate affinity 
on Na"^ such that a 10-fold increase in Na"^ concentra¬ 
tion leads to a 100-1,000-fold increase in L-aspartate 
affinity. This tight coupling is achieved even though 
the bound Na"^ ions that have been identified are 
not directly coordinated by the substrate, suggesting 
that their binding triggers conformational changes 
necessary to accommodate aspartate in its binding 
pocket. 
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Glutamate Transporter 
Family, Fig. 3 Substrate- and 
Na"^ ion-binding sites on 
Gltph- (a) The front and 
(b) side view of the substrate¬ 
binding pocket. Bound 
L-aspartate and residues 
involved in coordinating the 
substrate and Na"^ ions are 
shown in stick representation 
and colored by atom type with 
carbon atoms of L-aspartate 
and protein residues yellow 
and green, respectively. 

HPl and HP2 are yellow and 
red, respectively and TM7 and 
TM8 are light and dark blue 




The Transport Mechanism of GItph 

Two additional structures of Gltpn have shed light on 
the transport mechanism of the EAAT transporter fam¬ 
ily (Boudker et al. 2007; Reyes et al. 2009). One 
structure was determined in complex with a competi¬ 
tive blocker, DL-threo-p-benzyloxyaspartate (TBOA), 
which resembles aspartate with an additional benzyl 
group attached to the P-carbon. The bulky benzyl 
group props HP2 in an open conformation, which 
exposes the substrate-binding site to the extracellular 
solution (Fig. 4a). Supported by computational studies, 
this structure suggests that HP2 may serve as an extra¬ 
cellular gate of the transporter. Both structures of 
TBOA- and aspartate-bound Gltph likely represent so- 
called outward facing states of the transporter, in 
which the substrate-binding site is near the extracellu¬ 
lar solution and either open or occluded. The structure 
of an inward facing state of Gltph in complex 
with aspartate (Figs. 2b and 4a) reveals that the 
trimerization domain remains relatively rigid while 
the transport domains move by more than 15 A toward 
the cytoplasm across the lipid bilayer resulting in the 
proximity of the substrate- and ion-binding sites to the 
cytoplasm. The ligands in this structure are still 
occluded from the intracellular solution under the tips 
of HPl and HP2. At present, it remains unknown 
how they are released into the cytoplasm, and so the 


structural basis of an alternating access mechanism 
(Jardetzky 1966) is not yet complete. Combined, the 
structures of the outward and inward facing states not 
only provide a mechanism by which the substrate and 
bound ions are carried across the membrane, but also 
illustrate how each protomer within the trimer can 
function independently. The functional independence 
of protomers has been postulated for the FAATs 
and Gltph and is consistent with the distant substrate- 
and ion-binding sites within the cores of individual 
protomers and with the fact that the trimerization 
domain remains rigid during the transition. The latter 
property of the assembly may allow each transport 
domain to move across the membrane with little effect 
on the state of the other two transport domains. 

Inverted Structural Repeats 

Remarkably, the structure of Gltpi, contains two sets of 
the so-called inverted structural repeats. These are 
structural elements that are approximately related by 
a twofold symmetry and are positioned in the mem¬ 
brane in an antiparallel orientation. The first repeat 
includes the TMs comprising the trimerization 
domains and the lipid-facing “backside” of the trans¬ 
port domain: TMs 1,2, and 3 can be superimposed onto 
TMs 4, 5, and 6 by ^180° rotation around an axis 
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Glutamate Transporter Family, Fig. 4 (a) The structures of 
single Gltph protomers in the outward facing state bound to TBOA 
(open) or aspartate (occluded) and in the inward facing state bound 
to aspartate (occluded). The structures are shown in cartoon repre¬ 
sentation and colored to emphasize the internal symmeUy. TMs 1, 
2, and 3 are colored wheat and the structurally symmetric TMs 4,5, 
and 6 are colored orange. The core regions of HPl and TM7 and 


HP2 and TMS are colored as in Fig. 3. Regions of the protein that 
are not part of the symmetry are white. Bound substrate and Na* 
ions are shown as sphere.^, (b) The first set of the structurally 
related TMs is shown along with an approximate symmetry axis 
for each state, (c) The second set of the structurally related ele¬ 
ments is shown for each state. NB: the symmetry between HPl and 
HP2 is broken in the TBOA-bound stmcture, where HP2 is open 


parallel to the membrane (Fig. 4b). The second repeat 
comprises the substrate-binding core of the protomer: 
FlPl and TM7 can be superimposed onto HP2 and 
TMS by ^145° rotation (Fig. 4c). Interestingly, the 
conformational transition between the outward and 


inward facing states can be approximated as a switch 
of the relative orientation of the second repeat relative 
to the first (Crisman et al. 2009). Indeed, in the outward 
facing state, HPl and the hrst part of TM7 are approx¬ 
imately perpendicular to the membrane and pack 
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against TMs 2 and 5, while HP2 and the first part of 
TM8 lean away. In this state, HP2 forms the surface of 
the extracellular bowl and is positioned to serve as the 
extracellular gate. In contrast, in the inward facing 
state, HPl leans away from the trimerization TMs 
and forms the cytoplasmic surface of the transport 
domain while HP2 assumes a more perpendicular ori¬ 
entation and packs against TMs 2 and 5. It has been 
proposed that in this orientation HPl may play 
a functionally symmetric role and serve as an intracel¬ 
lular gate. However, it should be noted that the two 
HPs do not play equivalent roles in coordinating sub¬ 
strate, and the symmetry may break down at this higher 
detail level. 

Summary 

The EAAT transporter family contains over 1,000 
members found in a wide range of organisms from 
bacteria to humans and contains transporters for 
acidic and neutral amino acids and for Krebs cycle 
dicarboxylates. Most members of the family are sec¬ 
ondary active transporters that couple substrate move¬ 
ment to Na"^ and/or H"^ ions but some members are 
Na'^-dependent electroneutral exchangers. In the case 
of the EAATs, an additional ion is counter- 
transported and an uncoupled Cl“ conductance 
is activated by Na"^ and substrate binding. This 
uncoupled Cl“ conductance has also been observed 
in Gltph, a prokaryotic homologue of the EAATs. Sev¬ 
eral crystal structures of Gltpi, have revealed details 
about substrate- and coupled ion-binding sites and the 
conformational changes that occur during transport. In 
humans, the EAATs have been implicated in acute and 
chronic diseases of the brain, metabolic disorders, and 
some cancers. A detailed understanding of the mecha¬ 
nism of transport of this family of proteins will allow 
us to understand their role in normal physiological and 
pathological processes. 

Cross-References 

► Ei-ATPase: Eundamental Properties and Structure 

► Macromolecular Crystallography: Overview 

► Membrane Protein Eunction 

► Membrane Proteins: Structure and Organization 
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► Molecular Dynamics Simulations of Lipids 
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Synonyms 

y-glutamylcysteinylglycine 

Definition 

Glutathione is a tripeptide comprised of L-glutamate, 
L-cysteine, and L-glycine. It is an important thiol-hased 
redox buffer as well as an enzymatic cofactor in 
a variety of detoxification and biosynthetic pathways. 

Basic Characteristics 
Introduction 

Although observed in a select number of Gram¬ 
positive bacteria, glutathione is found primarily in 
eukaryotes and Gram-negative bacteria, and is often 
the most abundant redox buffer affording protection 
against ► oxidative stress. Glutathione is synthesized 
by the sequential action of two ATP-dependent 
ligases, glutamate cysteine ligase (also known as 
y-glutamylcysteine synthetase), and glutathione 
synthetase. Glutamate cysteine ligase catalyzes the 
phosphorylation of the y-carboxylate of glutamate. 
The a-amino group of the incoming cysteine then 
attacks the y-glutamylphosphate intermediate to 
generate y-glutamylcysteine. In analogous fashion, 
glutathione synthetase catalyzes the addition of 
glycine to y-glutamylcysteine, generating glutathione 
(Fig. 1). The unusual y-glutamyl peptide bond 



Glutathione, Fig. 1 Structure of glutathione. Glutathione is 
a tripeptide comprised of L-glutamate, L-cysteine, and L-glycine. 
Its y-glutamyl peptide bond, colored in red, is formed by the side 
chain carboxylate of glutamate and the a-amino group of cysteine 

protects glutathione from nonspecific proteolytic 
digestion. However, the glutathione salvage enzyme, 
y-glutamyltanspeptidase, can catalyze the hydrolysis 
of the y-glutamyl linkage (Meister and Anderson 1983; 
Dickinson and Forman 2002; Lu 2009). 

The thiol group of glutathione is critical to its 
numerous biological functions, and has an experimen¬ 
tally determined pA'^ of 9.42. This is significantly 
higher than that of free cysteine, which has a reported 
^Ka = 8.22 (Tajc et al. 2004). This increase in p/^„ can 
largely be attributed to the formation of the y-glutamyl 
peptide bond, as the thiol of y-glutamylcysteine also 
has an elevated pA'^. Glutathione exists in a dynamic 
equilibrium between its reduced form, GSH, and its 
most commonly observed oxidation product, GSSG, 
which is comprised of two glutathione molecules 
cross-linked via a disulfide bond. The GSH/GSSG 
redox system is of critical importance in numerous 
biological systems. The half-cell reduction potential 
(£hc) for the GSH/GSSG redox system has been esti¬ 
mated to be —262 mV in vitro (Millis et al. 1993). 
However, the Fhc for this redox couple can be dramat¬ 
ically shifted depending on the [GSH]/[GSSG] ratio 
and absolute concentration of GSH (Schafer and 
Buettner 2001). 

Glutathione is relatively abundant in biological 
systems; intracellular concentrations between 0.5 and 
10 mM are typically observed. However, extracellular 
glutathione concentrations are considerably lower, 
with estimated values in the micromolar range (Meis¬ 
ter and Anderson 1983). The ► nicotinamide adenine 
dinucleotide phosphate (NADPH)-dependent enzyme, 
glutathione reductase, maintains the vast majority of 
glutathione in its reduced state. Cytosolic [GSH]/ 
[GSSG] ratios generally range from 30:1 to >100:1, 
ensuring a reducing environment. Similar reducing envi¬ 
ronments are found in the nucleus and mitochondria. 
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Glutathione, 

Fig. 2 Ovennew of the 
glutathione redox system. 
Levels of reduced glutathione 
are maintained through its de 
novo biosynthesis and the 
action of glutathione 
reductase. Reduced 
glutathione can serve as 
a cofactor or substrate for 
several detoxification and 
biosynthetic reactions. It also 
transfers reducing equivalents 
to a number of cellular targets 
through redox cycling of 
glutaredoxins (Grx) 
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In contrast, values reported for the endoplasmic reticu¬ 
lum range from 1:1 to 3:1 reported, consistent with the 
need for a more oxidizing environment to promote pro¬ 
tein disulfide bond formation in this compartment 
(Meister and Anderson 1983; Dickinson and Forman 
2002; Lu 2009). 

Functions 

Glutathione facilitates the removal of ► reactive oxy¬ 
gen species and reactive nitrogen species, as well as 
exogenous electrophilic compounds. It also contrib¬ 
utes to signaling pathways and control of enzymatic 
function by modulating the accessibility of protein 
sulfhydryl groups (Dickinson and Forman 2002; 
Lu 2009). In addition, glutathione is critical for the 
storage and transport of cysteine, as well as metals 
such as mercury, lead, arsenic, gold, silver, zinc, and 
copper (Wang and Ballatori 1998). Given its abun¬ 
dance and central importance in a variety of biological 
process, changes in the £hc of the GSH/GSSG redox 
system have been shown to correlate with the overall 
status of the cell. The measured £hc is approximately 
—240 mV during cell proliferation, which increases to 
200 mV during differentiation and to 170 mV 
during apoptosis (Schafer and Buettner 2001). 

In addition to its functions as a redox buffer, gluta¬ 
thione is a cofactor for several detoxification enzymes, 
including glutathione peroxidases, peroxiredoxins, 
glyoxalase, and glutathione S-transferases. Glutathi¬ 
one peroxidases are a family of selenocysteine- 
containing enzymes that convert H 2 O 2 to water, or 
lipid peroxides to their corresponding alcohols, con¬ 
comitant with the generation of oxidized glutathione. 


Peroxiredoxins catalyze similar reactions by a differ¬ 
ent catalytic mechanism, but only some members of 
this family use glutathione as a reductant (Dickinson 
and Forman 2002). Glyoxalase I and glyoxalase II act 
sequentially to detoxify methylglyoxal, glyoxal, and 
other alpha-oxoaldehydes. Though dependent on 
reduced glutathione, the glyoxalase system does not 
consume the cofactor. ► Glutathione 5-transferases 
couple reduced glutathione to electrophilic substrates, 
decreasing the reactivity of these compounds. These 
glutathione conjugates often have increased solubility 
and are more readily expelled from the cell. In addition 
to these detoxification functions, several glutathione 
5-transferases are involved in leukotriene and prosta¬ 
glandin biosynthesis (Hayes et al. 2005). 

Glutathione is also an essential cofactor for the 
glutaredoxin enzyme family (Lillig et al. 2008). Over¬ 
all, the glutathione system, comprised of glutathione 
reductase, glutathione, and glutaredoxins (Grx), 
functions in a manner analogous to the thioredoxin 
system (Fig. 2). Similar to ► thioredoxins, dithiol 
glutaredoxins catalyze the reduction of protein 
disulhdes using an active site Cys-X-X-Cys motif. 
Two molecules of GSH reduce the resulting active 
site disulfide, leading fo fhe production of GSSG. 
Glutaredoxins catalyze the reversible glutathio- 
nylation of protein thiol groups, which regulates 
activity in a comparable manner to reversible 
protein phosphorylation. In addition, glutathione is 
used by glutaredoxins to catalyze the reduction of 
dehydroascorbate to ascorbate, and has recently been 
implicated in the formation of ► iron-sulfur cluster 
containing glutaredoxins. 
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Cross-References 

► Glutathione S-Transferase - Computational Studies 

► Oxidative Stress 

► Peroxidases 

► Reactive Oxygen Species 

► Redox Potential 

► Thioredoxins 
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Definition 

Glutathione S-transferases catalyze the conjugation of 
glutathione to electrophilic centers in a wide variety of 
substrates. 

Basic Characteristics 

Glutathione S-transferases (GSTs) make up a large 
family of enzymes and occur widely in nature. They 


catalyze the conjugation of (reduced) glutathione to 
a range of endogenous and xenobiotic substrates. One 
important physiological role is their contribution to 
phase II biotransformation of exogenous compounds; 
conjugation to glutathione typically makes these 
compounds more soluble, facilitating excretion from 
the body. ► QM/MM methods have been used to 
investigate the mechanism of several types of GSTs. 
For example, the Ml-1 glutathione S-transferase- 
catalyzed conjugation of glutathione to l-chloro-2,4- 
dinitrobenzene, a compound often used in activity 
assays for GSTs, has been studied (Bowman et al. 
2007). The reaction involves significant changes in 
solvation, which required > molecular dynamics sim¬ 
ulations: A static/geometry optimization modeling 
approach would not allow the necessary signihcant 
changes in solvation with an atomistic solvent model. 
This required the use of semiempirical QM/MM 
(AM1/CHARMM22) methods, which are less compu¬ 
tationally demanding than higher level calculations, 
and so allow molecular dynamics simulations to be 
performed. For accuracy, AMI was specifically 
reparameterized for the reaction. The reaction in the 
wild-type enzyme and three different mutants were 
investigated and the results agreed well with experi¬ 
mental data. A similar approach was used to model the 
epoxide ring opening of phenanthrene-9,10-oxide in 
Ml-1 glutathione S-transferase; comparison with the 
M2-2 enzyme suggested a single amino acid position 
as having a determining role in the diastereoselectivity 
of the ring opening reaction (Ridder et al. 2002). 
Glutathione S-transferase Al-1 has been investigated 
with different QM/MM methods (using the ► ONIOM 
method) (Dourado et al. 2008) and ► molecular 
dynamics simulations (Dourado et al. 2009), and sup¬ 
ports a glutathione activation mechanism in which 
a water molecule assists the proton transfer from the 
glutathione thiol group to the glutamyl a carboxylate. 
The arrangement of water around the glutathione thiol 
and glutamate a carboxylate groups was indicated to 
be of particular importance for catalysis. 
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Synonyms 

Cell surface glycoproteins; Glycosylation; 
Glycosylation sequon; Intracellular glycoproteins; 
Monosaccharide-protein linkage region; Protein 
glycosylation; Secreted glycoproteins 


Definition 

The major classification of protein glycosylation 
is by whether the glycan is linked through the nitrogen 
(N) of the acetamido side group of asparagine (Asn) 


amino acids in the protein backbone or via 
the oxygen (O) of the hydroxyl group side group 
of serine (Ser) and threonine (Thr) amino acids 
in the protein backbone, and are thus called 
N- and 0-glycosylation. In mammalian cell surface 
glycoproteins, the linkage monosaccharides are, 
respectively, A-acetylglucosamine (GlcNAc) and 
N-acetylgalactosamine (GalNAc). These are C-4 
hydroxyl-to-hydrogen epimers, the latter giving 
a more hydrophobic face on the glycosidic ring of 
GalNAc and allows for differentiation by the 
enzymes that catalyze the glycan-protein link so 
that GlcNAc is linked in (3 anomeric configuration 
and GalNAc in a. These linkage regions have dis¬ 
tinct stereo-conformations the former being rela¬ 
tively flexible around the GlcNAc pi-acetamido 
bond and the latter being relatively rigid around 
the GalNAca-hydroxyl. This, and the size and type 
of the oligosaccharide linked to protein, affects their 
whole biophysical relationship: 

N-linked chains are built up initially on a lipid template 
as Glc 9 Man 3 GlcNAcGlcNAc- and transferred to 
protein where a series of enzymes catalyze the 
transformation of Glcg to give high mannose, com¬ 
plex, or hybrid chains (► Molecular Dynamics Sim¬ 
ulations of Carbohydrates, ► Glycoconjugates in 
Cell Lunction and Therapeutics) of, normally, 
6-20 monosaccharides, but sometimes larger, all 
having Man 3 GlcNAcGlcNAc- as a common core 
which may or may not be fucosylated at the inner¬ 
most GlcNAc (Luca 1,6 in mammals, Luca 1,3 in 
plants and Luca 1,3 and Puctxl,6 in insects) with or 
without so-called bisecting GlcNAc in mammals 
and, in plants, xylosylation (Xyipi,2) at the proxi¬ 
mal, branch point Man. 

0-linked chains are built up stepwise on the initial 
GalNAc- via several different di- to tri-saccharide 
core regions with often only one or a few extra 
monosaccharides added, but again sometimes 
many more. 

The short a-linked 0-linked chains seem to intercalate 
with the attached and surrounding amino acids 
forming a glycopeptide epitope, whereas N-linked 
chains often form large, flexible, hydrodynamic 
domains covering the underlying protein and theo¬ 
retically altering the sol-solvent interface at the cell 
membrane. 

The differences in O- and N-linked glycosylation 
discussed above are further accentuated by 
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(a) the surrounding amino acids and (b) the types of 
monosaccharide and their linkages/branching patterns. 
For (a) in the case of N-linked the Asn to which the 
oligosaccharide chains is attached is always in the 
sequon AsnXxxSer/Thr which has a distinct conforma¬ 
tional shape. This may occur several times along the 
peptide backbone and each of these potential glycosyl- 
ation sites may or may not be glycosylated. Occupied 
sites are often those in a loop or turn and need to be 
accessible. In the case of 0-linked glycosylation, 
additional surrounding Ser, Thr, and Pro are overrep¬ 
resented near the linkage Ser/Thr. 

There are two more ubiquitous protein glycosyla¬ 
tion patterns: In mammals the polysulfated proteogly¬ 
cans have mostly 0-linked chains with different core 
regions (► Proteoglycans) providing the starting point 
for the biosynthesis of large, highly functional glyco¬ 
sylation chains (► Glycosaminoglycan (GAG)) and in 
bacterial proteins Asn, Ser, Thr, and Tyr amino acids 
can have added mono-oligosaccharide chains, some of 
which have now been ascribed several functions in 
bacterial protein stability and pathogenicity. There 
are also many less well represented glycosylation var¬ 
iants to which no specific function or biophysics have 
been described although the requirement for a specific 
enzyme for each glycan-protein and monosaccharide- 
monosaccharide linkage and the ubiquity of specific 
carbohydrate-binding proteins (► Lectins) hints at 
probable biological recognition. 

There are two glycan-protein linkages in particular 
that should be mentioned for their known involvement 
in pathology: in intracellular proteins the addition to 
Ser/Thr hydroxyl groups that are also targets for 
phosphorylation of (mostly) GlcNAcP- (0-linked 
glucosaminylation) (see ► (Glyco)protein Folding 
Disorders) and in secreted proteins the non-enzymatic 
addition to any amino group of circulating glucose in 
open chain form (called “glycation” to distinguish it 
from all the enzyme catalyzed reactions mentioned 
above and elsewhere). 
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Synonyms 

Protein folding disorders 

Definition 

The term “protein folding disorders” (PFD) was intro¬ 
duced for a small group of proteins that misfold after 
biosynthesis producing pathological symptoms, for 
example, by depositing in inappropriate places. Of 
these proteins, three can be normally or sometimes 
N-glycosylated hence the term “(glyco)protein folding 
disorders” (GFD). Flowever one of these, the Ap pre¬ 
cursor glycoprotein providing the 40-42 amino acid 
residue fragments called amyloid-P-protein (AP) that 
are found in the plaques of patients with Alzheimer’s 
disease (AD), is also 0-GlcNAcylated and many other 
proteins can also be, which results in misfolding and 
tissue deposition. 

Introduction 

The term “PFD” can also now be applied to many more 
proteins, as we find out more about their complicated 
life stories. For example, in Huntington’s Disease (HD) 
the addition at gene level of repeat CAG means that the 
protein can be biosynthesized with a long Gin tail. This 
is somewhat reminiscent of hemoglobin in sickle cell 
disease where a Glu to Val mutation results in a sticky 
end resulting in aggregation of hemoglobin and subse¬ 
quent changing shape of the red blood cells. However, 
generally PFD relates to the amyloidoses (Omtvedt 
et al. 2000), a heterogeneous group of disorders in 
which normally soluble proteins aggregate to form 
insoluble amyloid deposits resistant to proteolysis. 
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Amyloidosis 

As discussed in most biochemistry textbooks (e.g., 
Voet et al. 2008) most proteins were thought to main¬ 
tain their native conformations or retain their ability to 
undergo proteolysis and degradation, but some soluble 
proteins form insoluble fibrous aggregates. The aggre¬ 
gates are known as amyloids, a term that means starch¬ 
like because it was originally thought that the material 
resembled starch. However it is now known that these 
are largely protein aggregates that deal with plane 
polarized light-like starch. The diseases known as 
amyloidoses are a set of relatively rare inherited dis¬ 
eases in which mutant forms of normally occurring 
proteins, for example, lysozyme and hhrinogen, accu¬ 
mulate in a variety of tissues as amyloids. Associated 
with amyloid deposits are the amyloid P components 
and ► Glycosaminoglycans and such associations may 
be important in tissue and species specificity. Here and 
in Hounsell (2010, for NMR studies) I summarize the 
three specific proteins where glycosylation appears 
a major factor (GFD): 

• The prion, the PRoteinacious INfectious particle that 
is the pathological reagent in the encephalopathies 
affecting many species but including bovine 
spongiform encephalopathy (BSE) and in humans 
Creutzfeldt Jakob Disease (CJD) 

• Immunoglobulin light chains involved in systemic 
amyloidosis 

• Alzheimer’s glycoprotein which is a transmemhrane 
Type I glycoprotein that has N-glycans on the extra¬ 
cellular surface and 0-GlcNAc (O-GlcNAcylation) 
on the cytoplasmically oriented side 

The nonspecihc cases include many cytoplasmic 
proteins that have 0-GlcNAcylation and the secreted 
(glyco)proteins that are modified hy glycation. It could 
also be said that the many proteins that have O-linked 
glycosylation that is intimately involved in glycopep- 
tide conformation and antigenicity could also rate as 
GFDs, hut here we will stick to the three specihc cases 
outlined above. 


The Prion 

The GFD held really came alive (and dehned) when 
the structure of the prion was elucidated and also it was 
shown that blockade of glycosylation promotes acqui¬ 
sition of scrapie-like properties by the prion protein in 


cultured cells (Lehmann and Harris 1997). The natural 
prion protein always has two sequons for 
N-glycosylation at or near (depending on the species) 
Asn 181 and 197. These can be and are glycosylated 
with hundreds of different glycoforms and it is there¬ 
fore difficult to see the importance of any one, but more 
interestingly there is a preponderance for underglyco- 
sylation so that one can dehne three states having 
either one, both, or neither sites glycosylated. These 
can be seen, for example, by polyacrylamide gel 
electrophoresis as glycosylation gives a large 
additional molecular weight and the pattern of glyco¬ 
sylation occupancy dehnes different prion types 
(Wong et al. 2000). Another dehning feature of the 
prion is that it is anchored in the cell membrane by 
a Glycosylphosphatidyl (GPI) moiety which can theo¬ 
retically account for its infectivity as such a molecule 
may be able to move from cell membrane to cell 
membrane which is not allowed by integral transmem¬ 
brane glycoproteins. 


G 


Alzheimer's Disease (AD) 

There are some similarities of the AD glycoprotein 
and the prion. The former is a transmemhrane 
N-glycosylated glycoprotein where the transmem¬ 
brane region has some sequence similarity with 
a section of (non-transmemhrane) prion sequences 
(the prion is anchored not by this part, but by its 
GPI anchor) that is specifically cleaved by protease 
to give pathological peptides. Unlike the prion, 
N-glycosylation has not been implicated in the forma¬ 
tion and deposition of amyloid-P-protein (AP) that are 
found in the plaques of patients with Alzheimer’s 
disease (AD); however the 0-GlcNAcylation seems 
to he involved in pathogenesis. This is a common 
feature of many neuron-associated proteins, for exam¬ 
ple, neurofilaments, microtubule associated proteins 
such as Tau, and clathrin assembly proteins. In AD, 
Tau is hyperphosphorylated causing it to dissociate 
and self-assemble into paired helical filaments which 
are the major components of neurofibrillary tangles 
in the brain (Wang et al. 1996). Phosphorylation is 
reciprocal with 0-GlcNAcylation. 0-GlcNAc modifi¬ 
cation of the clathrin assembly protein AP-3 is 
also reduced in AD and this is associated with an 
increase in density of neurohbrillary tangle. A very 
large amount of glucose is converted to GlcNAc in 
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neurons. Also cellular conversion is an essential pre¬ 
requisite for the development of insulin-resistant or 
type 2 diabetes (Brooks et al. 2002). Advanced 
glycation end products (AGEs) contribute to amyloid¬ 
osis in Alzheimer’s disease (Vitek et al. 1994) as has 
been proposed in several other pathologies such as 
involvement of p-2-microglobulin modihed with 
AGEs in the pathogenesis of hemodialysis-associated 
amyloidosis (Miyata et al. 1994). 

Systemic Amyloidosis 

AL-amyloidosis is a fatal disease caused by deposition 
of Ig light chains in a fibrillar form (AL) in various 
organs. Hence this is also known as systemic amyloid¬ 
osis by comparison to those primarily associated with 
the brain. AL-amyloidosis presents in patients with 
multiple myeloma or other plasma cell dyscrasias 
where there is an increase in production and secretion 
of oligoclonal or monoclonal immunoglobulin (Ig) 
components. However, only 10-15% of patients with 
multiple myeloma develop AL and the reason for 
this selectivity is unknown (Omtvedt et al. 2000). 
Approximately 15% of circulating light chains 
from homogeneous Ig in patients with multiple mye¬ 
loma have covalently attached oligosaccharides and 
AL-associated proteins are four times more frequent 
than that reported for light chains from patients with 
myelomatosis without amyloidosis. Eurther there is an 
unexpected preponderance of the glycosylation sequon 
in the framework regions of amyloid light chains ver¬ 
sus the hypervariable regions (Omtvedt et al. 2000). 
A particular glycosylation pattern has been found with 
a bisecting GlcNAc (Omtvedt et al. 2000; Routier et al. 
1998) which may be involved in tissue deposition. In 
addition, glycosylation may be important in biosynthe¬ 
sis cellular trafficking and secretion, or have a role to 
play in peptide stability. 

The discussion above is for light chain glycosyla¬ 
tion. It has long been known that the heavy chains of 
Igs are glycosylated (Alavi and Axford 1995). In each 
heavy chain there is one glycosylation sequon for 
N-glycosylation (see ► Glycan-to-Protein Linkages) 
and this is always glycosylated, but with different 
glycoforms (Routier et al. 1998). In rheumatoid arthri¬ 
tis there is a particular pattern of chains lacking termi¬ 
nal sialic acid and subterminal Gal called GalO and it 
has been postulated that the exposed GlcNAc here 


could have a natural GlcNAc binding ligand. It has 
also been suggested that the differences in glycosyla¬ 
tion in patients with rheumatoid arthritis are responsi¬ 
ble for changes in the protein structure and/or the 
resulting glycoprotein may have a different overall 
3D structure with the glycans on each heavy chain 
interacting across the space between them. 

Summary 

There are several different modes of glycosylation/ 
glycation which are implicated in pathologies associ¬ 
ated with protein misfolding. 

Cross-References 

► Glycan-to-Protein Linkages 

► Glycosaminoglycan (GAG) 

► Glycosylation in Metabolism 

► Glycosylphosphatidylinositol 

► Proteoglycans 


References 

Alavi A, Axford JS. Glycoimmunlogy. New York: Plenum 
Press; 1995. Adv. Exp. Med. Biol. 376. 

Brooks SA, Dwek MV, Schumacher U. Functional and molecu¬ 
lar glycobiology. Oxford: BIOS Scientific; 2002. 

Hounsell EF. NMR spectroscopy of carbohydrates, lipids 
and membranes. In: Webb G, editor. Specialist reports in 
NMR spectroscopy. Cambridge: Royal Society of Chemis¬ 
try; 2010. 

Lehmann S, Harris DA. Blockade of glycosylation promotes 
acquisition of scrapie-like properties by the prion protein in 
cultured cells. J Biol Chem. 1997;272:21479-87. 

Miyata T, Inagi R, lida Y, Sato M, Yamada N, Oda O, Maeda K, 
Seo H. Involvement of P-2 microglobulin modified with 
advanced glycation end products in the pathogenesis of 
haemodialysis-associated amyloidosis. Induction of human 
monocyte chemotaxis and macrophage secretion of 
tumour necrosis factor-a and interleukin-1. J Clin Invest. 
1994;93:521-8. 

Omtvedt LA, Bailey D, Renouf DV, Davies MI, Paramonov NA, 
Haavik S, Husby G, Sletter K, Hounsell EF. Glycosylation of 
immunoglobulin light chains associated with amyloidosis. 
Amyloid Int J Clin Invest. 2000;7:227^4. 

Routier EH, Hounsell EF, Rudd PM, Takahashi N, Bond A, 
Hay EC, Alavi A, Axford JS, Jeffris R. Quantitation of the 
oligosaccharides of human serum IgG from patients with 
rheumatoid arthritis: a critical evaluation of different 
methods. J Immunol Methods. 1998;213(2):113-30. 



Glycerolipids: Chemistry 


907 


G 


Vitek MP, Bhattacharya K, Glendening JM, Stopa E, Vlassara H, 
Bucala R, Manogue K, Cerami A. Advanced glycation end 
products contribute to amyloidosis in Alzheimer’s disease. 
Proc Natl Acad Sci USA. 1994;91:4766-70. 

Voet D, Voet JD, Pratt CW. Principles of biochemistry. 
Hoboken: Wiley; 2008. 

Wang JZ, Grundke-Iqbal I, Iqbal K. Glycosylation of 
microtubule-associated protein tau: an abnormal posttransla- 
tional modification in Alzheimer’s disease. Nat Med. 
1996;2:871-5. 

Wong NKC, Renouf DV, Lehmann S, Hounsell EF. Glycosyla¬ 
tion of prions and its effects on protein conformation 
relevant to amino acid mutations. J Mol Graph Model. 
2000;18:126-34. 


Glycerolipids: Chemistry 

Robert Bittman 

Department of Chemistry and Biochemistry, 

Queens College of the City University of New York, 
Flushing, NY, USA 

Introduction 

Glycerolipids are a structurally heterogeneous group 
of lipids that play key structural and functional roles in 
bacterial, plant, and animal membranes. They all have 
at least one hydrophobic chain linked to a glycerol 
backbone in an ester or ether linkage. This entry 
discusses the chemistry of neutral (uncharged) 
glycerolipids (mono-, di-, and triacylglycerols), neu¬ 
tral glyceroglycolipids (mono- and digalactosyl- 
glycerides; sulfoglucolipids), and polar glycerolipids 
(phospholipids and betaine glycerolipids). 

Glycerophospholipids are amphipathic lipids, and 
are the abundant membrane lipids of bacteria and 
mammalian cells. Their structural role in forming the 
bilayer of biological membranes and the monolayer of 
lipoproteins is well known. They interact with other 
lipids and proteins; noncovalent lipid-lipid interac¬ 
tions are the basis of membrane dynamics, including 
the formation of nanoscale microdomains and phases 
in biological membranes with different degrees of 
molecular order. Many glycerophospholipids have 
other functions, such as providing sources of intracel¬ 
lular messengers that participate in the regulation of 
various key biological and physiological activities 
(► Chemical Diversity of Lipids; ► Fatty Acids, 


Alkanols, and Diacylglycerols; ► Lipid Shape and 
Curvature Stress; ► Lipid Domains; Lipidomics). 

Triacylglycerols (TAGs). Triacylglycerols are 
mainly used as storage fats, providing an efficient 
storage form of fatty acids. TAG metabolism also pro¬ 
vides precursors (fatty acids and diacylglycerol) for 
membrane lipid synthesis. In addition, TAGs are 
a sink for removing excess fatty acids from cells, 
thereby preventing fatty acid-induced lipotoxicity. 
TAGs are stored as oils in the seeds of plants and in 
the core of lipid droplets (lipid storage organelles) in 
the cytosol of prokaryotes and eukaryotes; these lipid 
droplets play important roles in lipid mobilization and 
membrane trafficking. In the bloodstream, TAGs are 
present in the core of lipoproteins, together with 
cholesteryl esters. TAGs consist of three fatty acids 
linked via ester bonds to a glycerol backbone. Vegeta¬ 
ble oils and animal fats are mainly TAGs. Oils and fats 
differ in their proportion of unsaturated versus satu¬ 
rated fatty acids. Animal fats have higher melting 
points than vegetable oils because of their higher con¬ 
tent of saturated fatty acids. The unsaturation in most 
naturally occurring fatty acids has the cis or Z stereo¬ 
chemistry. This geometry decreases the melting point 
of fatty acids and TAGs because a kink is introduced 
into the long aliphatic chain, hindering close packing 
of adjacent molecules. Unsaturation also renders 
the molecules susceptible to peroxidation reactions. 
Partial hydrogenation of dietary oils affords fats 
containing various trans fatty acyl chains; trans fats 
have been shown to constitute a greater risk for car¬ 
diovascular disease than saturated fats. Figure 1 shows 
the structure of a TAG that has two saturated fatty 
acids (stearic and palmitic acids) and one monounsat- 
urated fatty acid (oleic acid) esterified to glycerol. 
This unsymmetrical TAG molecule, which is named 
l-stearoyl-2-oleoyl-3-palmitoyl-OT-glycerol, is chiral. 
The prefix “sn” means “stereospecific numbering” and 
is used for naming glycerol derivatives. In the “sn” 
nomenclature system, the glycerol molecule is drawn 
as a Fischer projection with C-1 at the top and C-3 at 
the bottom, and with the C-2 hydroxy group of glycerol 
on the left-hand side of the Fischer projection. The 
enantiomer of this TAG (see Fig. 1) is named 
l-palmitoyl-2-oleoyl-3-stearoyl-OT-glycerol. 

Monoacylglycerols {MAGs). Three regioisomeric 
MAGs exist in which the fatty acid is esterified to the 
5 / 7 - 1 , 5/7-2, or 5/7-3 positions of the glycerol backbone. 
Since 5/7-2-MAG is less stable than sn-l- and 
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1 -Stearoyl-2-oleoyl-3-palmitoyl-sn-glycerol Enantiomer 



1-Palmitoyl-2-oleoyl-3-stearoyl-sn-glycerol 


Glycerolipids: Chemistry, Fig. 1 Structures of the TAG l-stearoyl-2-oleoyl-3-palmitoyl-CT-glycerol and its enantiomer, 
3-palmitoyl-2-oleoyl-1 -stearoyl-r;i-glycerol 
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Glycerolipids: Chemistry, 
Fig. 2 Structures of 1,2-DAG 
and 2-AG 


-OH —OH 

R = CH3(CH2)4CH=CHCH2CH=CHCH2CH=CHCH2CH=CH(CH2)3— 
(all cis) 


5«-3-MAG, it undergoes isomerization on storage, pro¬ 
ducing a mixture of all three positional isomers. An 
example of a polyunsaturated MAG that has attracted 
a great deal of recent interest is 2-arachidonoylglycerol 
(2-AG; see Fig. 2), which exerts numerous neuro- and 
immunomodulatory functions in the nervous system 
and in peripheral tissues. 2-AG, together with ananda- 
mide, is an endogenous ligand that activates the can- 
nahinoid receptors CBl and CB2. After its release, 
2-AG is inactivated by a specific monoacylglycerol 
lipase, which hydrolyzes 2-AG to arachidonic acid 
and glycerol, resulting in endocannabinoid deactiva¬ 
tion (Sugiura et al. 2006). 

Diacylglycerols (DAGs). 1,2-DAG (Fig. 2) is 
the first known lipid second messenger. Membrane- 
bound DAG is generated by hydrolysis of the 
phosphodiester bond of phosphatidylinositols (see 
below) via the well-known signal-induced activation 
of phosphatidylinositol-specific phospholipases C 
(PI-PLCs). DAG remains in the membrane and binds 
to a cysteine-rich domain in protein kinase C (PKC) 
isozymes, resulting in activation of PKC and subse¬ 
quent phosphorylation of target proteins (Nishizuka 
1992). PI-PLCs also generate another second 
messenger, the water-soluble molecule inositol-1,4,5- 
trisphosphate (InsP 3 ), which is released into the cyto¬ 
sol and induces Ca^"^ release from internal stores 
(Berridge 1993). DAGs derived from phospho- 
inositides are enriched with a polyunsaturated fatty 


acyl chain at the sn-2 position; an example is 
l-stearoyl-2-arachidinoyl-5n-3-glycerol, a potent acti¬ 
vator of PKC. Recent studies showed that DAG is 
an allosteric activator of other signal transducing 
proteins with DAG-binding motifs such as Ras gua¬ 
nine-nucleotide releasing proteins (RasGPRs) and an 
inhibitor of cyclic nucleotide-gated ion channels. 

Glycosylglycerolipids. DAGs bearing a carbohy¬ 
drate head group at the sn-3 position of glycerol are 
abundant lipids of plant membranes. Monogalacto- 
syldiacylglycerol (MGDG) and digalactosyldiacyl- 
glycerol (DGDG) are the most abundant and second 
most abundant lipids in nature, respectively. These 
galactosylglycerides are prevalent in photosynthetic 
membranes of higher plants, and are also components 
of plant leaves and nonphotosynthetic tissues such as 
seeds, fruits, and potato tuber. In these lipids, one or 
two galactose molecules are joined to the sn-3 position 
of 1,2-diacylglycerol in a p-anomeric glycosidic link¬ 
age (Fig. 3). In the disaccharide DGDG, the terminal 
galactosyl moiety is in an a-linkage to C6 of the inner 
P-galactosyl residue. Monoglycosyldiacylglycerol is 
also present in many gram-positive bacteria that do 
not contain phosphatidylethanolamine (PE). The gly- 
colipids monoglucosyl- and diglucosyl-DAG comprise 
about 50% by weight of the total lipids in mycoplasma 
and acholeplasma cell membranes (Bittman 1993). 
Glycoglycerolipids are devoid of phosphorus and, 
unlike phospholipids, have a neutral head group. 













Glycerolipids: Chemistry 


909 


G 



Glycerolipids: Chemistry, Fig. 3 Structures of MGDG, DGDG, and SQDG 


Galactolipids are highly enriched in polyunsaturated 
fatty acids. Linolenic acid (18:2A®’*^’^^) is the predom¬ 
inant fatty acid at both the sn-l and sn-2 positions of 
MGDG and DGDG in higher plants. Lower plants 
(green algae, mosses, ferns, conifers) and some angio- 
sperms have a different polyunsaturated omega-3 fatty 
acid (16:3 n-3) concentrated in the sn-2 position of 
galactolipids, whereas the galactolipids of red algae 
and photosynthetic diatoms have a high proportion of 
20:5 n-3 fatty acid. A sulfolipid (SQDG, 6-sulfo-6- 
deoxy-ot-D-glucopyranosyldiacylglycerol, Fig. 3) 
is also present in higher plant membranes. The 
head group of this sulfolipid, which is called 
sulfoquinovose, contains a sulfonate group (which 
has a negative charge) at the 6 position of glucose. 
In the thylakoid membranes of chloroplasts and 
cyanobacteria, about 85% of the lipids are MGDG, 
DGDG, and SQDG. A nonlamellar hexagonal II (Hu) 
phase can be formed by the cone-shaped MGDG 
because of the small size of its head group (the galac¬ 
tose residue) relative to the size of its polyunsaturated 
hydrocarbon chains. However, viable cells avoid the 
formation of a nonlamellar phase (which would disrupt 
the barrier function of the membrane); instead, 
a lamellar phase is formed when MGDG is present as 
a mixture with other membrane lipids having a larger 
head group, such as DGDG and phosphatldylglycerol. 

Glycerophospholipids. These are also triesters of 
glycerol, but the ester group at the sn-3 position of 
glycerol is a phosphate ester instead of a carboxylic 
ester. Thus, glycerophospholipids are amphipathic 
lipids, whereas triacylglycerols are neutral lipids. The 
simplest diacylglycerophospholipid molecule is phos- 
phatidic acid (PA or PtdOH). PA, which is present 
in very low amounts in membranes, is a key lipid 


metabolite and signaling lipid. PA can be converted by 
a phosphatase to DAG (which can then be converted to 
TAG by transesterification), or to lysophosphatidic acid 
by the action of a phospholipase A 2 , or to another 
glycerophospholipid by attachment of a head group. In 
PA, fatty acids are esterified to the sn-l and sn-2 posi¬ 
tions of i’«-glycerol-3-phosphate and phosphoric acid 
is esterified to the C-3 hydroxy group. PA differs 
from other glycerophospholipids because it is a phos¬ 
phate monoester; thus the protonation state of the phos¬ 
phate headgroup of PA is sensitive to the pH of 
the environment. All of the other phosphoglycerides 
in biomembranes (except phosphatidylinositol phos¬ 
phates) are phosphodiesters, which have a second phos¬ 
phate ester linkage to a polar alcohol. The alcohol may 
be neutral (as in serine, glycerol, and inositol) or ionic 
(as in choline, ethanolamine, and inositol phosphates). 
The structures of the major glycerophospholipids 
of biological membranes are displayed in Fig. 4. 
The sn-2 chain of diacyl phosphoglycerides is bent, 
making the first two carbons positioned approximately 
parallel to the lipid-water interface of the bilayer. 
Phosphatidylcholine (PC) and phosphatidylethanol- 
amine (PE) are the most abundant glycerophos¬ 
pholipids of mammalian cell membranes, whereas 
PE, phosphatldylglycerol (PG), phosphatidylserine 
(PS), and cardiolipin (CL or DPG) are the most 
common phospholipids in bacterial membranes. In 
Escherichia coli, PE and PG account for 70-80% and 
20-25% of the total phospholipids, respectively. PC 
and PE are both zwitterionic phospholipids; the head 
group of PE is smaller than that of PC, and can partic¬ 
ipate in intermolecular hydrogen bonding. PS, PG, and 
DPG are anionic phospholipids. PA, PE, and DPG can 
undergo a bilayer to nonlamellar transition, depending 
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on the content of unsaturated fatty acyl chains, temper¬ 
ature, and presence of cations. In many membranes, 
the sn-l chain of glycerophospholipids is a saturated 
fatty acyl chain (palmitic and stearic acids are com¬ 
monly found), and the sn-2 acyl chain is often 
unsaturated (with C16 to C20 fatty acids). An example 
of a synthetic phospholipid with this distribution 
of fatty acyl chains is l-palmitoyl-2-oleoyl-5’n-3- 
glycerophosphocholine, which is often abbreviated as 
POPC, where P stands for palmitoyl and O stands for 
oleoyl. A notable exception to this general composi¬ 
tion is dipalmitoylphosphatidylcholine (DPPC). This 
glycerophospholipid is composed of two palmitoyl 
chains and a phosphocholine head group, and is the 
major component of pulmonary surfactants, function¬ 
ing to lower the surface tension inside mammalian lung 
alveoli at the air-fluid interface. 

Glycerophospholipids spontaneously form a lipid 
bilayer in water, with the polar head groups facing 
outside and the two long hydrocarbon chains packed 
in the interior. They maintain the structural integrity of 
cell membranes, and have an asymmetric distribution 
in the two leaflets of biological membranes. In addition 
to providing the permeability barrier of cells and 
organelles, phospholipids have a wide range of other 
functions and participate in specific cellular activities. 
PS is more abundant in the inner leaflet than in the 
outer leaflet of the plasma membrane and endocytic 
membranes of eukaryotes; the loss of PS asymmetry 
resulting from its translocation to the outer leaflet is an 


early marker of apoptosis (Leventis and Grinstein 
2010). In addition, many glycerophospholipids are 
involved in the transduction of extracellular signals, 
providing second messengers that regulate distinct 
pathways. For example, phosphatidylinositol and its 
various phosphorylated derivatives are important sig¬ 
naling molecules in many eukaryotic cells, regulating 
diverse processes such as membrane traffic, ion chan¬ 
nel function, and cytoskeletal rearrangement (Di Paolo 
and De Camilli 2006). 

Diphosphatidylglycerol. DPG (l',3'-bis[l,2-diacyl- 
glycero-3-phosphosphoryl]glycerol) is also known 
as cardiolipin (CL) because it was first isolated from 
beef heart. As its name implies, DPG is an atypical 
glycerophospholipid because it contains four fatty 
acyl chains and consists of two molecules of phospha- 
tidic acid joined by a central glycerol moiety 
(Fig. 4). The secondary hydroxy group of the central 
glycerol moiety plays an important role, forming an 
intramolecular hydrogen bond with the two adjacent 
phosphate head groups and with water. This is postu¬ 
lated to raise the pKa value of the second phosphate 
head group relative to the pKa value of the first phos¬ 
phate head group of CL, thereby providing a source of 
protons for oxidative phosphorylation. DPG is found 
in the plasma membranes of gram-negative and gram¬ 
positive bacteria. DPG constitutes about 20% of the 
total phospholipid composition of the inner mitochon¬ 
drial membrane and chloroplast membrane of eukary¬ 
otes, where it binds to membrane proteins involved in 
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ATP synthesis and plays an essential role in many 
mitochondrial functions. For example, cytochrome c 
binds to DPG at the outer leaflet of the inner mitochon¬ 
drial membrane. Although the range of molecular spe¬ 
cies of the four fatty acyl groups of DPG is potentially 
complex, deacylation and reacylation reactions take 
place to remodel the molecular species of CL in 
eukaryotes. As a result, linoleic acid constitutes about 
80% of the acyl chain content of mammalian DPG; 
however, a notable exception occurs in patients with 
Barth syndrome (Schlame and Ren 2009). The small 
size of its head group relative to the size of its four 
hydrocarbon chains enables DPG, which has a cone 
shape, to undergo a lamellar to nonlamellar phase 
transition in the presence of Ca^"^. 

Betaine glycerolipids. These 1,2-diacylglycerolipids 
have a polar group linked to the sn-3 position in an ether 
bond. The polar group is charged or zwitterionic 
(having a quaternary ammonium group and a carbox- 
ylate group), but does not contain phosphorus or 
a carbohydrate. Figure 5 shows the structure of 
diacylglyceryl trimethylhomoserine (DGTS). DGTS 
is a polar lipid found in plant and algae cell and thyla- 
koid membranes. Its biosynthesis involves the reaction 
of DAG with S-adenosylmethionine, followed by three 
methyl transfer reactions from S-adenosylmethionine. 

Lysoglycerolipids. Lysoglycerophospholipids have 
only one long hydrophobic chain. Lysophosphatidic 
acid (l-acyl-OT-glycero-3-phosphate, LPA, Fig. 5) is 
the simplest lysoglycerophospholipid with regard to 
structure, since it consists of only a glycerol backbone, 
one acyl chain, and a phosphate head group. The sim¬ 
ple structure belies its biological diversity. This lipid 
second messenger exerts a myriad of biological 
effects; LPA signaling has been implicated in various 
disease processes and in normal physiological 
functions (Lin et al. 2010). LPA is present in blood 
plasma, and can be formed extracellularly by the 
lysophospholipase D activity of autotaxin, a secreted 
glycoprotein that is overexpressed in various tumors, 
on lysophosphatidylcholine. LPA interacts with and 
activates five specific G-protein coupled receptors 
(GPCRs), stimulating cell migration, proliferation, 
survival, and cytoskeletal reorganization (Mills and 
Moolenaar 2003). The intracellular LPA receptors 
include a member of the nuclear hormone receptor 
family called peroxisome proliferator-activated recep¬ 
tor Y (PPARy). LPA is produced intracellularly by 
various mechanisms, and is converted by the action 


of acyltransferases to phosphatidic acid, which is sub¬ 
sequently converted to other phospholipids. 1-0-Alkyl 
and 1-0-alkenyl ether forms of LPA and sn-2 acyl 
forms of LPA also exist. Cyclic phosphatidic acid 
(l-acyl-OT-glycero-2,3-cyclic phosphate, cPA) has 
a cyclic phosphate at the sn-2 and sn-3 positions of 
glycerol; it is generated from LPA by an intramolecu¬ 
lar transphosphatidylation reaction and has distinct 
bioactivities from those of LPA (Fujiwara 2008), 
including inhibition of PPARy. 

Other lysoglycerolipids. Bis(monoacylglycerol) 
phosphate (BMP). BMP is OT-glycero-l-phospho-1'- 
5/7-glycerol in which there is a fatty acyl chain at the 
sn-3 position of each glycerol (Amidon et al. 1995). 
It is found in trace amounts in many tissues of various 
cell types, but is concentrated in the membranes of 
alveolar macrophages, late endosomes, and lysosomes. 
BMP consists of two glycerol backbones with an acyl 
chain at the sn-3 and sn-3' positions (Fig. 5). The fatty 
acyl chain is predominantly oleoyl in most cell types. 
However, docosahexaenoyl chains are present in some 
cells and organs, and the presence of di-22:6-BMP 
may be a biomarker of phospholipidosis (excessive 
intracellular accumulation of polar lipids and lamellar 
bodies) in the etiology of drug-induced toxicities 
(Anderson and Borlak 2006). This lipid has an unusual 
stereochemistry, since the phosphodiester linkage is at 
sn- \ and sn-V positions rather than at the sn-3 position, 
as in other glycerophospholipids. Lysobisphosphatidic 
acid (LBPA, Fig. 5) is an isomer of BMP in which the 
two fatty acyl chains are at the sn-2 positions of the two 
glycerol backbones. However, intramolecular acyl 
migration is facilitated at the acidic pH of endosomes 
and lysosomes, resulting in an equilibrium mixture 
favoring the 1-acyl isomer (i.e., BMP). 

Ether species of glycerolipids. Almost all bacterial 
and animal cells (except most aerobic bacteria) 
contain glycerophospholipids with ether linkages. 
Plasmalogens or plasmenylphospholipids (l-G-alk-F- 
enyl-2-acyl-5«-3-glycerol phospholipids) have a vinyl 
ether bond at the ^n-l position (adjacent to the ether 
linkage) of glycerol, an acyl chain (primarily 16:0, 
18:0, and 18:1) at the sn-2 position, and either 
phosphoethanolamine or phosphocholine as the polar 
head group at the sn-3 position. These compounds 
are called ethanolamine plasmalogen (PlsEtn) and cho¬ 
line plasmalogen (PlsCho), respectively (Fig. 4). 
Although plasmalogens are minor constituents of 
many tissues, they account for more than 50% of the 
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total myocardial sarcolemmal and sarcoplasmic retic¬ 
ulum phospholipids and are also abundant in the 
tissues of the central nervous system. In higher organ¬ 
isms, the first step in the biosynthesis of plasmalogens 
takes place in peroxisomes. Displacement of a fatty 
acid from 1-acyl-dihydroxyacetone phosphate by 
a fatty alcohol is catalyzed by alkyl-dihydroxyacetone 
phosphate (DHAP) synthase, forming l-O-alkyl- 
DHAP. After acylation at the sn-2 position and instal¬ 
lation of a phosphoethanolamine head group, the cis 
double bond is introduced at the sn-1 position between 
C-1' and C-2' by a I'-alkyl desaturase, which requires 
NADPH and molecular oxygen. The 0-vinyl ether 
linkage of plasmalogen is easily cleaved by 
protons and mercuric ion, releasing a l-lyso-2- 
acylglycerophospholipid plus an aldehyde derivative. 
In fact, the name plasmalogen is derived from the 
discovery of the aldehyde (the suffix of which is “al” 
in organic chemistry nomenclature) in the plasma of 
cells. As plasmalogens are highly susceptible to attack 
by reactive oxygen species, one postulated function of 
plasmalogens is as putative scavengers of free radical- 
induced oxidative stress. Hypochlorous acid, which is 
generated by the myeloperoxidase-catalyzed reaction 
of hydrogen peroxide and chloride ion, converts 
plasmalogen to the corresponding 1-lysophospholipid 
(lysophosphatidylcholine or lysoethanolamine) and 
a 2-chloro fatty aldehyde (Lessig and Fuchs 2009). 
Several other roles of plasmalogens in animal cells 
have been proposed; for example, plasmalogens with 
arachidonic acid at the sn-2 position may release this 


fatty acid in the presence of plasmalogen-selective 
phospholipase A 2 . The abundance of plasmalogens in 
nervous and heart tissue suggests that they may have 
a role in the regulation of ion permeability. 

PAF. Another polar glycerolipid that contains 
a long ether chain is l-C)-alkyl-2-acetyl-OT-glycero-3- 
phosphocholine, which is known as platelet-activating 
factor, PAF. The alkyl group at the ^«-l position does 
not have unsaturation adjacent to the ether linkage; 
a saturated chain is generally present, especially 
a hexadecyl or octadecyl chain. An acetyl group is 
present at sn-2, and a phosphocholine moiety is located 
at sn-3 (Fig. 5). The name PAF is a misnomer because 
PAF is a phospholipid mediator with diverse effects 
that are not limited to platelets. Indeed, PAF was the 
first phospholipid known to serve as a lipid second 
messenger. PAF is synthesized, mainly from the pre¬ 
cursor 1 -(9-alkyl-2-acyl-in-glycero-S-phosphocholine, 
in two steps when cells are stimulated. Extracellular 
PAF initiates inflammatory responses in endothelial 
cells, leukocytes, and mast cells by stimulation of 
a specific GPCR; the PAF receptor is coupled with 
a G protein that activates PI-PLC (Prescott et al. 2000). 

Archaeal lipids. Archae are a third taxon, evolu¬ 
tionary distinct from Bacteria and Eukarya. The mem¬ 
brane lipids of the archae kingdom have unusual 
structures; they are typically diether and tetraether 
glycerolipids (Chong 2010). Since these single-cell 
organisms grow under extreme conditions, such as in 
high salt, high or low pH (from 0.5 to 12), or high 
temperature (up to 120°C), they require membranes 
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that can withstand harsh life conditions. They have 
very long, saturated hydrocarbon chains (with 32^0 
carbons) and methyl branches at every fifth carbon 
(which are presumed to maintain a fluid phase 
membrane). A variable number of cyclopentane rings 
(from 0 to 8), depending on the growth temperature, 
may be present within the isoprenoid chains of 
the ether lipid. Figure 6 shows the structure of a polar 
lipid in methanogens and thermoacidophiles known as 
a dibiphytanylglycerol-tetraether lipid; a biphytanyl 
chain contains 40 carbon atoms, and is formed by 
condensation of two phytanyl chains. The chains are 
linked in an ether bond at each end to two sn-2,3- 
glycerol backbones. The ether bonds represent 
a means by which the organisms adapted to grow, 
since ethers are much less susceptible than esters to 
lipases and acid- and base-catalyzed hydrolysis, 
enabling the organisms to survive under extreme con¬ 
ditions. Unsaturation is generally absent, making the 
organisms resistant to attack by oxidants. In the 
extended form, one biphytanyl chain is essentially 
the same length as the hydrophobic core of a bilayer 
membrane formed from a typical bacterial or eukary¬ 
otic glycerophospholipid, and thus can span the 
membrane from the inside to the outside of the cell. 
Each glycerol is linked to a polar head group, which 
may include a sugar, phosphate, or sulfate. For exam¬ 
ple, one glycerol hydroxy group may be coupled to 
a disaccharide composed of galactose and glucose and 
the other glycerol hydroxy group may be linked to 
a phosphate-containing head group. The stereochem¬ 
istry of the glycerol moieties in archaeal lipids is also 
different from the lipids of Bacteria and Eukarya. 
Their backbone is 2,3-0-alkyl-^w-glycerol, as in 
2,3-di-(9-phytanyl-OT-l-glycerol, whereas the glycerol 
backbone in the lipids of bacteria and eukaryotes has 
the OT-1,2-diacyl configuration. 


Summary 

Glycerolipids are a heterogeneous group of lipids that 
have a glycerol backbone linked to various fatty acids 
or fatty alcohols and diverse head groups. The focus in 
this review has been on the abundant glycerolipids that 
comprise archaeal, bacterial, plant, and mammalian 
membranes. The structures of some of the most impor¬ 
tant glycerolipids have been presented, together with 
a brief description of their occurrence, molecular prop¬ 
erties, and multiple roles in the regulation of biological 
and biochemical processes. 

Cross-References 
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Synonyms 

Cell surface glycoproteins; Glycolipids; Proteoglycans 


Definition 

The “glyco” part (see ► Carbohydrate Nomenclature, 

► Glycoproteins, ► Glycan-to-Protein Linkages, 

► Glycosphingolipids, ► Mucin Biophysics, ► Pro¬ 
teoglycans) of proteins and lipids inside the cell, at 
the cell surface, in the extracellular matrix, and on 
secreted proteins is made up of oligosaccharides that 
form a hydrodynamic shell and also have many spe¬ 
cific functions in recognition. 

Introduction 

In the past this shell has been called the glycocalyx 
suggesting a coating of cells by glycosylation. Thus the 
typical O- and N-linked oligosaccharide structures of 
glycoproteins (► Glycoproteins, ► Glycan-to-Protein 
Linkages) can be considered the first signals to be 
seen in cell-cell interactions and are targets of cell 
regulation. Most secreted glycoproteins have such 
typical O- and N-glycosylation but as reviewed, for 
example, in Brooks et al. (2002) there are many more 
unusual glycosylation patterns for which biophysics 
and function have not been clarified. In addition to 
the external surface of the mammalian cell and 
secreted glycoproteins, cytoplasmic proteins are also 
glycosylated giving a signal reciprocal with protein 
phosphorylation (see ► (Glyco)protein Folding Disor¬ 
ders and ► Glycosphingolipids) on internal organelles 
have been implicated in apoptosis, for example. 

Particular functions rely on the diversity of oligo¬ 
saccharides which biosynthetically is of two types 
(1) many enzyme catalyzed interconverted monosac¬ 
charides that are (2) linked together in different 
ways all precisely enzymatically controlled. The 
genes for the enzymes take up a large part of the 
genome of mammalian, bacterial, and all other organ¬ 
isms (except viruses that use their host glycosylation 
machinery). This has long been suggestive of their 
important function, some examples of which are 
given below in cancer, immunology, infection, and 
structural integrity. 

Mammals 

Aberrant glycosylation occurs in essentially all types 
of cancer and appears to be an early event such as 
in control of apoptosis as well as playing a key 
role in the induction of invasion and metastasis 
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(Ono and Hakomori 2004). A recent example of 
the latter is the heightened expression of a 
sialyltransferase, ST6GalNAcV, which specifically 
mediates brain metastasis of breast cancers (Bos et al. 
2009). The nomenclature here relates to O-linked 
glycosylation via GalNAc (see ► Glycan-to-Protein 
Linkages) that can have several mono-disaccharide 
additions defined as core regions I-VIII (Brooks ef al. 
2002) and fhat can be furfher glycosylated or 
sialylated, the latter by the addition of one of the family 
of sialyltransferase (ST) catalyzed sialic acids (in 
humans A-Acetylneuraminic Acid) to C-6 (designated 
by “6” in the enzyme nomenclature), C-4, or C-3. Also 
the expression of sialyl lewis x (SLex) on core II 
glycans is associated with hematogenous metastasis 
(Ono and Hakomori 2004; Kannagi et al. 2004) and 
the binding of this glycan to selectins (see ► Lectins) is 
the mechanism whereby leukocytes extravascate from 
the blood (McEver et al. 1995). This gives a possible 
mechanism for involvement of glycosylation patterns 
in metastasis. In addition to being highly represented in 
tumors, SLex and related glycosylation are prime can¬ 
didates as possible therapeutics in many aspects of 
inflammatory states (Hounsell 2001). 

Many of the possible short O-glycosylation patterns 
have been associated with different primary cancer 
states, but particularly with breast, gastrointestinal, 
and colorectal carcinomas. Monoclonal antibodies 
against many of these have been used in detection 
and therapy. As suggested by the conformation choices 
of O-glycosylation described in ► Glycan-to-protein 
linkages, monoclonal antibodies can also be raised 
against 0-glycopeptide epitopes and these have been 
associated with cancer (Large and Warren 1997; 
Haavik et al. 1999) and are potential diagnostic 
markers (Wandall et al. 2010). At the protein level, 
significant multivalency is given by clustered 
O-glycosylation sites on mucins and the MUG genes 
define a family of these described in ^ Mucin 
Biophysics. The O- and N-glycosylation of many 
other normal size proteins have been implicated, for 
example, osteopontin (OPN) that exemplifies the inter¬ 
connectedness of different cell surface components in 
cancer, as a phosphorylated glycoprotein that binds to 
cell surface integrins and CD44 in normal tissues and 
functions as a signal transducer to promote adhesion, 
motility, and survival (Bellahcene et al. 2008). OPN 
has been shown to facilitate the growth of established 
tumors linked to distant sites, but this is dependent on 


the cell-type expressing OPN and is hence thought to 
be related to the different cell-specific glycosyl- 
transferase levels mediating the different N- and 
O-glycosylation patterns observed. 

As implicated by the example of OPN, changes 
in glycosylation that occur in cancer can also alter 
molecular recognition with the immune system and 
receptor signaling (Bos et al. 2009) and interaction 
with the extracellular matrix (see ► Proteoglycans). 
Carbohydrate-protein interactions are mediated by 
lectins (as defined in ► Lectins). To choose just one 
example, C-type (for Ca-H- dependent) lectin domains 
(Carbohydrate Recognition Domains, CRDs) are pre¬ 
sent on a wide variety of cell surface glycoproteins 
including DC-SIGN (dendritic cell-specific ICAM-3 
grabbing non-integrin; CD209) and DC-SIGNR 
(DC-SIGN related; CD299) which are two type II 
transmembrane glycoproteins that possess C-terminal 
C-type CRDs. In addition to binding endogenous cell 
adhesion molecules (CAMs), DC-SIGN and DC- 
SGINR interact with carbohydrate structures on the 
surfaces of a broad range of lethal infectious reagents. 
Thus understanding of the interaction can lead to spe¬ 
cific therapeutics in infection and inflammation. To 
this end the CRDs at approximately 16 kDa have 
been studied with various carbohydrate ligands by 
X-ray crystallography and NMR (Feinberg et al. 
2007; Hibbert et al. 2005). X-ray studies cannot 
accommodate critical biophysical considerations such 
as dynamics, temperature, and kinetics in real time; 
hence NMR and modeling have also been important in 
elucidating their biophysics (see ► Molecular Dynam¬ 
ics Simulations of Carbohydrates, ► Carbohydrate 
NMR Spectroscopy, ► X-Ray Diffraction and Crystal¬ 
lography of Oligosaccharides and Polysaccharides). 

C-type lectins involved in immune regulation are 
described as selectins on lymphocytes (L-selectin), 
endothelial cells (E-selectin), and platelets (P-selectin) 
from the cells that they were first observed on, but 
this is an over simplification as is what follows: Lym¬ 
phocytes in the blood stream react to bacterial 
signal molecules and upregulate their L-selectin. 
Meanwhile, the endothelial cells near the site of infec¬ 
tion upregulate E-selectin as well as the carbohydrate 
ligand for L-selectin which is called P-selectin ligand 1 
(PSGL-1). Lymphocyte traffic is fhus slowed down 
enough to bind to the endothelial cells (a procedure 
called “rolling”) and then integrins tighten their hold 
so that the lymphocytes traverse (extravascate) across 
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the vascular endothelium to the site of infection. 
Various places along the line are targets for therapeutic 
intervention. Reference to the Leucocyte Handbook 
(1993, 1997) that defines the CD nomenclature 
referred to above also suggests roles of oligosaccha¬ 
rides in the overall structure of the cell surface 
biophysics as, for example, molecular spacers for 
helping coordinate the interactions of multiple glyco¬ 
proteins involved in T-cell recognition of peptide 
epitopes presented on MHC glycoproteins of B and 
dendritic cells. 

Bacteria 

Bacteria synthesize many glycoproteins, lipopolysac- 
charides (LPS), and polysaccharides (PS). LPS and PS 
are reviewed elsewhere (► Bacterial Lipopolysaccha¬ 
ride, OPS, and Lipid A, ► Bacterial Polysaccharide 
Structure and Biosynthesis). In addition bacterial pro¬ 
tein glycosylation (► Glycan-to-Protein Linkages) is 
described further in a good short review in Brooks et al. 
(2002). Among their oligosaccharide diversity are 
represented many of the carbohydrate epitopes also 
synthesized by humans, and this, for example, is 
thought to be the mechanism for our blood groups 
and other cross-reactivities involved in human pathol¬ 
ogy such as the Guillain-Barre syndrome. Human 
blood group oligosaccharide structures are shown in 
the entry on > Lectins. In short, people who are blood 
group A express an enzyme that catalyzes the addition 
of GalNAc to a backbone Gall,3/4[FucL2]GlcNAc 
structure called H. People who are blood group 
B express an enzyme that catalyzes the addition of 
Gal to H. Blood group A serotypes have circulating 
antibodies to blood group B antigen and vice versa 
because bacteria also synthesize the blood group 
A and B structures to which we make antibodies. 
However, immune tolerization mechanisms are bought 
into play so that those with blood group B antigens do 
not have antibodies to blood group B and those with 
blood group A antigens do not have antibodies to blood 
group A. During blood transfusion, if blood group 
B erythrocytes are given to blood group A people, or 
vice versa, the erythrocyte-antibody interaction causes 
hemagglutination. Blood group O, by the way, is the 
absence of the enzymes that catalyze either of the 
A and B antigens; hence this serotype has antibodies 
to both A and B and can only accept blood from other 
0-serotypes, but can donate their red blood cells to 
either A or B serotypes (a universal donor). 


Plants 

The lectins initially used in blood typing were initially 
found in plants, but since then have found many uses as 
reagents in mammalian biomedicine. However, the 
function of their binding to their natural ligands in 
plants is still not known, but is presumed to have 
some function in the “medical cabinet” of plants for 
reagents against viral, bacterial, and fungal pathogen¬ 
esis. In addition there is a huge diversity of plant 
natural products having many ► Glycosylated Natural 
Products that also function in Nature’s medicine cab¬ 
inet, one that we have amply raided for our own ther¬ 
apeutics. Plants are normally associated with the 
polysaccharides such as ► Pectin Biophysics that are 
involved in structure rather than recognition. 

Viruses 

Viruses use the glycosylation machinery of the host 
cell that they are infecting in order to add oligosac¬ 
charides to their external proteins. High mannose 
N-glycans on viruses in particular have been targets 
for therapeutic intervention in order to destabilize 
virus structure (e.g., HIV) and to inhibit, for example, 
the multiple modes of binding that enhance the affinity 
of DC-SIGN mentioned above (Feinberg et al. 
2007). It appears that changes in N-glycosylation pat¬ 
tern via changes at the genome level moving the 
N-glycosylation sequon AsnXxxSer/Thr (► Glycan- 
to-Protein Linkages) is also one of the mechanisms 
that helps the virus evade the immune system which 
is targeted at exposed protein epitopes (any carbohy¬ 
drate antigens being also those of the host and hence 
tolerated). Detailed epitope mapping on, for example, 
the highly glycosylated gpl20 of HIV and the hemag¬ 
glutinin of influenza virus has identified protein epi¬ 
topes and mapped changing glycosylation patterns. 

The mention of the word hemagglutinin of 
influenza virus alerts us to the fact that viruses also 
use the host glycosylation in their infectivity mecha¬ 
nisms. In short, the influenza virus has two glycopro¬ 
teins on its surface: the hemagglutinin that binds to 
host cell NeuAc2,6 Gal at the end of N- and 0-linked 
chains of the host cell surface glycoproteins of the 
epithelia of the human respiratory tract and 
a neuraminidase that cleaves NeuAc2,6 Gal bonds. 
Once the virus is bound, the cell is infected, 
more virus is synthesized, and this adheres to the 
cell surface until the neuraminidase cleaves the cell 
surface NeuAc2,6Gal bond releasing the virus to 



Glycolipid Analysis 


917 


G 


infect another cell. Newcastle disease vims, Sendai 
vims, and fowl plague virus similarly have a hemag¬ 
glutinin-neuraminidase system and polyomaviruses 
and coronaviruses also bind sialic acids. Other viruses 
may use sulfated glycoproteins as a mechanism of 
infectivity. As can be imagined, much effort has gone 
into characterizing at the biophysical level the specific 
interactions for the rational dmg design of inhibitors, 
the most successful of which are Relenza and Tamiflu. 


Summary 

There is an awe-inspiring diversity of the structure and 
mechanisms involved in glycoconjugate interactions 
that are only beginning to be explored as therapeutics. 
Inhibition of carbohydrate-protein interactions is 
a difficult area as many of the individual interactions 
are of relatively low affinity, but techniques to induce 
multivalency are being exploited. Elucidating the bio¬ 
physics of the enzymes involved has shown consider¬ 
able promise. 
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Glycoproteins 

Elizabeth Hounsell 

School of Biological and Chemical Sciences, Birkbeck 
College, University of London, London, UK 

Synonyms 

Carbohydrates; Cell Surface Glycoproteins; Glycan; 
Intracellular glycoproteins; Mucins; Oligosaccharides; 
Protein glycosylation 

Definition 

Glycoproteins are ubiquitous cell surface and secreted 
proteins that have covalently linked oligosaccharides. 
Indeed the majority of extracellular facing surface 
molecules are glycosylated as either glycoproteins or 
glycolipids (largely glycosphingolipids) embedded in 
the lipid micelle. All secreted proteins roughly larger 
than insulin are glycosylated except albumin. Some 
secreted and cell surface proteins called mucins are 
highly glycosylated (► Mucin Biophysics). A few 
intracellular signaling molecules are also glycosylated, 
particularly important are glucosaminylation signals 
reciprocal with phosphorylation (► (Glyco)protein 
Lolding Disorders, ► Glycosylation in Metabolism). 

The derivation “glyco” stems from “glucose,” the 
first monosaccharide to be characterized. Lrom the 
composition of glucose, being only carbon, hydrogen, 
and oxygen, comes the term “carbohydrate” 
(► Carbohydrate Nomenclature). Glucose with 
another monosaccharide, fructose, forms the disaccha¬ 
ride “sucrose” from where come the terms “saccha¬ 
ride” and “sugar” (often used synonymously for 
oligosaccharide and glycan). Oligo- and poly¬ 


saccharides are made up of a large diversity of mono¬ 
saccharides which require a large part of the genome 
of all organisms larger than viruses to encode enzymes 
that catalyze inter-conversion of the monosaccha¬ 
rides and their linkage together (► Carbohydrate 
Enzymology). 

Although considered highly hydroxylated, and 
hence providing hydrophilicity to the cell surface, 
for example, the large diversity includes amino 
groups, hydrophobic methyl groups, and hydropho¬ 
bic surfaces on the glycosyl rings. 0-Sulfation 
is highly represented in some glycoproteins 
(► Glycosaminoglycan (GAG), ► Proteoglycans) 
and glycolipids and 0-phosphorylation is also 
known. Each specific orientation of functional group 
can form a part of a recognition motif for cell-cell 
interactions. The orientation is controlled by the strict 
stereoconformations of 5-carbon (pentoses, ubiquitously 
ribose, xylose), 6 carbon (hexoses like glucose, galac¬ 
tose, and mannose and their various amino derivatives 
such as A-acetylglucosamine, A-acetylgalactosamine, 
A-acetylmannosamine, and dehydroxylated derivatives, 
rhamnose, arabinose, and fucose), and the C-9 family of 
sialic acids (► Glycosphingolipids, ► Carbohydrate 
Nomenclature, ► Anomeric Effect in Sugars) and have 
an equilibrium highly favoring cyclized forms (either 
pyranose or furanose, d or l, a or P anomericity) when 
in glycosyl linkage to either protein (► Glycan-to- 
Protein Linkages), lipid (► Glycosphingolipids), or 
other monosaccharide (making oligo- and poly-saccha- 
rides) (► Carbohydrate Enzymology, ► Carbohydrate 
Nomenclature). In some areas there is huge 
unclassifiable diversity such as bacterial polysaccharides 
(► Bacterial Polysaccharide Structure and Biosynthesis, 

► Bacterial Lipopolysaccharide, OPS, and Lipid A), but 
in mammalian protein and lipid glycosylation some 
favorites are apparent from which we get the names of 
the glycolipid families (► Glycosphingolipids), the gly¬ 
coproteins containing high mannose, hybrid, or complex 
type chains (► Molecular Dynamics Simulations of 
Carbohydrates, ► Glycoconjugates in Cell Eunction 
and Therapeutics), and N- and 0-glycosylation of pro¬ 
teins (► Glycan-to-Protein Linkages). 

Cross-References 

► Anomeric Effect in Sugars 

► Bacterial Lipopolysaccharide, OPS, and Lipid A 






Glycoside Hydrolases - Computational Studies 


919 


G 


► Bacterial Polysaccharide Structure and Biosynthesis 

► Carbohydrate Enzymology 

► Carbohydrate Nomenclature 

► Glycan-to-Protein Linkages 

► Glycoconjugates in Cell Function and Therapeutics 

► Glycosaminoglycan (GAG) 

► Glycosphingolipids 

► Mass Spectrometry of N-Linked Carbohydrates and 
Glycoproteins 

► Molecular Dynamics Simulations of Carbohydrates 

► Mucin Biophysics 

► Proteoglycans 


Glycosalinoglycan 

► X-Ray Diffraction and Crystallography of Oligosac¬ 
charides and Polysaccharides 


kingdom. Some glycosaminoglycans, through specific 
interactions with small extracellular proteins, are 
essential for embryonic development and play roles 
in the response of the organism to infection and injury. 
Hyaluronan has a very high molecular weight, and its 
rheological properties are important to its role in the 
extracellular matrix and in synovial fluid. 
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Glycosaminoglycan (GAG) 

Barbara Mulloy 

Laboratory for Molecular Structure, National Institute 
for Biological Standards and Control, Potters Bar, 
Hertfordshire, UK 

Synonyms 

GAGs; Mucopolysaccharides; Polysaccharides 

Definition 

The glycosaminoglycans are linear polysaccharides 
made up of a disaccharide repeat unit, in which one 
of the monosaccharides is a hexosamine and the other 
is either a hexose (in keratan sulfate only) or a uronic 
acid. Most glycosaminoglycans exist as the O-linked 
glycan part of protein glycoconjugates known as pro¬ 
teoglycans, and carry sulfate substituents, making 
them the most acidic class of biological macromole¬ 
cules. The exception is the GAG hyaluronan, which is 
not attached to a protein core and is not sulfated. 

Glycosaminoglycans are structural components 
in the extracellular matrix throughout the animal 


Glycoside Hydrolases - Computational 
Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

Glycosidases 

Definition 

Enzymes that catalyze the hydrolysis of glycosidic 
linkages to release smaller sugars. 

Basic Characteristics 

Glycoside hydrolases are common enzymes across all 
domains of life. Their roles include the degradation of 
biomass (e.g., cellulose from plants), defense against 
bacteria (e.g., lysozyme, involved in breaking down 
components of the bacterial cell wall), mechanisms of 
pathogenesis (e.g., viral neuraminidases that enable 
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the release of virus particles from the host cell), and in 
normally functioning cells (e.g., modifying glycans 
attached to proteins in biosynthesis). Computer simu¬ 
lations have been used to investigate several different 
glycoside hydrolase-catalyzed reactions. 

One of the first enzymes to be studied using com¬ 
puter simulation was also the first enzyme to have 
its structure determined, hen egg-white lysozyme 
(HEWL). This enzyme catalyzes the hydrolysis of 
1,4-p-linkages between W-acetylmuramic acid 
(NAM) and A-acetyl-D-glucosamine moieties in pep- 
tidoglycan, which forms the layer outside the plasma 
membrane of (Gram-positive) bacteria. The very first 
study of an enzyme with ► QM/MM methods was 
performed on HEWL and found that this enzyme 
provides good electrostatic stabilization of the 
oxocarbenium ion species that features (as a transition 
state or intermediate) in the glycosidic bond cleavage 
reaction (Warshel and Levitt 1976). Based in part on 
experimental crystallographic structural data, it was 
initially proposed by Phillips that the oxocarbenium 
ion was an intermediate in the reaction, formed by 
proton transfer from a catalytic glutamic acid (Glu35) 
to the glycosidic oxygen and the subsequent breakage 
of the glycosidic bond. Later experimental studies 
(using mutant enzymes or fluorinated substrates) con¬ 
cluded, however, that a general mechanism proposed 
previously by Koshland (1953), featuring a covalent 
intermediate with a bond between the catalytic aspar¬ 
tate (Asp52) and the NAM moiety, was preferred. To 
investigate this question, Bowman et al. (2008) calcu¬ 
lated free energy profiles for the protonation of the 
glycosidic bond by the catalytic glutamate and subse¬ 
quent cleavage, at the PM3/CHARMM22 QM/MM 
level, in umbrella sampling molecular dynamics simu¬ 
lations. These simulations indicated a spontaneous for¬ 
mation of the covalent aspartate-NAM intermediate. 
Corrections of the QM/MM energies at the B3LYP/ 
6-31 l-i-G(2d) and MP2 ab initio levels gave a free 
energy barrier for formation of the covalent intermedi¬ 
ate consistent with the experimental data, thereby pro¬ 
viding convincing evidence that the Koshland-type 
mechanism is indeed correct. The transition state was 
similar to an oxocarbenium ion, consistent with 
experimental kinetic isotope effects. 

Other examples of computational modeling of 
reactions in glycoside hydrolases include work on 
P-galactosidases found in E. coli (Bras et al. 2010) 
and in rice (Wang et al. 201 1). These studies used the 


► ONIOM QM/MM approach (using BB IK/AMBER 
for structural optimization and B3LYP/AMBER 
for energy calculations) and an additive QM/MM 
approach (geometries and energies at the B3LYP/ 
6-31G(d,p)/CHARMM27 level) to calculate minimum 
energy paths, respectively. Both studies concluded 
that an oxocarbenium cation-like transition state is 
formed, which is specifically stabilized by the enzyme 
(as was found for lysozyme). QM/MM methods 
have also been used to study the reaction in Golgi 
a-mannosidase II (Petersen et al. 2010), an enzyme 
involved in maturing the A-linked glycans in proteins. 
As unusual quantitative distributions of complex 
carbohydrate structures on the cell surface are associ¬ 
ated with several types of cancer, the enzyme is an 
anticancer target. The results, obtained using QM/MM 

► Car-Parrinello molecular dynamics simulation, help 
explain the inhibitory potency of recently developed 
anticancer drugs and drug leads and can contribute 
to further drug development. Different QM/MM 
methods have also been used in simulations to study 
substrate distortion in wild-type and mutant xylanases 
(Soliman et al. 2009). 

Cross-References 

► Car-Parrinello Molecular Dynamics 

► ONIOM 

► QM/MM Methods 
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Glycosphingolipids 
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^Institute for Hygiene, University of Munster, 
Miinster, Germany 

^Interdisciplinary Center for Clinical Research 
(IZKF), University of Munster, Munster, Germany 
^Institute of Medical Physics and Biophysics, 
University of Munster, Munster, Germany 

Introduction 

S. J. Singer and G. L. Nicolson introduced the fluid 
mosaic model for the structure of cell membranes in 
1972 that suggested the aggregation of some proteins in 
lipid bilayers (Singer and Nicolson 1972). 
As a theoretical consequence of this working hypothe¬ 
sis, components of biological membranes distribute lat¬ 
erally resulting in the creation of morphologically 
distinct domains with a heterogeneous molecular com¬ 
position and supramolecular organization. Although the 
existence of such domains in cellular membranes is still 
controversially discussed, most authors now accept the 
concept that membrane domains with physicochemical 
properties that differ from the surrounding membrane 
environment exist (Lingwood and Simons 2010; 
Simons and Gerl 2010). This review will focus on 
certain aspects of glycosphingolipids (GSLs) that par¬ 
ticipate as driving forces in the formation of membrane 


microdomains and exert specific functions related to 
their physicochemical characteristics through GSL- 
protein interactions in those domains (Sonnino et al. 
2006; Prinetti et al. 2009; Gupta and Surolia 2010). 

Chemical Structures and Biosynthesis 

The amphipathic GSLs are built up from a hydrophilic 
oligosaccharide chain and a hydrophobic ceramide moi¬ 
ety. In vertebrate GSLs the dihydroxylated and singly 
unsaturated long-chain amino alcohol sphingosine 
(4-sphingenine, dl8:l) represents the typical core struc¬ 
ture of the ceramide. The amino group of sphingosine is 
linked to a fatty acid which may vary in chain length most 
commonly from C16 to C24 and degree of unsaturation 
forming the GSL’s hydrophobic ceramide membrane 
anchor. The oligosaccharide is connected via glycosidic 
linkage to the primary hydroxyl group of the ceramide. 
GSL biosynthesis starts with the formation of the cer¬ 
amide moiety in the endoplasmic reticulum and continues 
in the Golgi apparatus by stepwise addition of non- 
charged monosaccharides and sialic acids being accom¬ 
plished by specific glycosyl- and sialyltransferases, 
respectively (Lahiri and Futerman 2007). Neutral GSLs 
and sialic acid-containing gangliosides constitute the 
majority of mammalian GSLs which in most cases belong 
to one of the four main structural families: the ganglio-, 
globo-, lacto-, or neolacto-series (Fig. 1). According to 
recommendations of the lUPAC-IUB Joint Commission 
on Biochemical Nomenclature the name contains the de¬ 
signation of the structure family, globo (Gb), ganglio 
(Gg), lacto (Lc), or neolacto (nLc). The number of mono¬ 
saccharide units is indicated by the suffixes “biaosyl,” 
“triaosyl,” “tetraosyl,” etc., and the ceramide is 
abbreviated by “Cer,” for instance, “globotriaosyl- 
ceramide,” GbSCer (for structure of Gb3Cer (dl8:l, 
C16:0) see Fig. 2a). All hexoses are in the 
D-configuration of the pyranose form and all glyco¬ 
sidic linkages originate from the C-1 hydroxyl group. 

The parent sialic acids of gangliosides 
are A-acetylneuraminic acid (NeuSAc) and 
N-glycolylneuraminic acid (NeuSGc), interestingly 
the latter is absent in normal human tissues. Ganglio¬ 
sides are termed as Y^Neu5Ac-(or Neu5Gc)-XxCer 
acronyms, where X stands for the root name of the 
GSL family to which the sialic acid is attached (e.g., 
Gg, nLc) and x for the number of monosaccharides 
(e.g., Gg4, nLc4, see Fig. 1). Y stands for the Roman 
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GalNAcl33Gala4Gall34Glcp1 Cer 
Gb4Cer 

t 

Gala4Gaip4Glcp1Cer 

Gb3Cer 


Gaip3GlcNAcp3Gaip4Glcp1 Cer 
Lc4Cer 


Cer—GlcpiCer- 


I Globo-series | 

t 

-Gaip4Glcp1Cer- 

LacCer 

i 


Lacto-series 




•GlcNAcp3Galp4Glcp1Cer 
Lc3Cer "x 


|Ganglio-series| 

GalNAcp4Gaip4Glcpi Cer 
Gg3cer 

i 

Gaip3GalNAcp4Galp4Glcp1 Cer 
Gg4Cer 


|Neolacto-series| 

Gaip4GlcNAcp3Gaip4Glcp1 Cer 
nLc4Cer 


Glycosphingolipids, Fig. 1 Biosynthesis flow diagram of 
neutral GSLs with tetrahexosylceramide core of the ganglio-, 
globo-, lacto-, and neolacto-series. Sialylation may occur at 
different stages of neutral GSL biosynthesis (e.g., LacCer ^ 
GM3 or nLc4Cer —> IV*’Neu5Ac-nLc4Cer, see Table 1) or 


ganglioside biosynthesis (e.g., GMl —> GDla or GM3 —> 
GD3) as reviewed by Lahiri and Futerman (2007). Further 
prolongation of nLc4Cer by 1, 2, or more Gaip4GlcNAc- 
repeats results in nLc6Cer, nLcSCer, or even longer nLc-core 
structures 


a Gala4Galp4GlcpiCer(cl18:1, C16:0) 


Fatty acid (Cl6:0) 



Glycosphingolipids, 

Fig. 2 Structures of Gb3Cer 
(dl8:l, C16:0) (a) and 
II^Neu5Ac-Gg4Cer (dl8:l, 
C18:0) (or GMl) (b) depicted 
in the Flaworth projection. 
Gb3Cer and GMl represent 
the specific GSL receptors of 
Shiga toxin and cholera toxin, 
respectively (Miithing and 
Distler 2010) 


b 


Galp3GalNAcp4Galp4Glcpi Cer (d18:1, 
3 

NeuSAca 


C18:0) 


Fatty acid (Cl 8:0) 



numeral indicating the monosaccharide residue to 
which the sialic acid is attached, and the Arabic 
numeral superscript y refers to the position within 
that residue to which the sialic acid is linked (e.g., 
II^Neu5Ac-Gg4Cer or IV®Neu5Ac-nLc4Cer; see 


Table 1). Neu5Ac is always linked via the C-2 
hydroxyl group in ot(2-3)- or a(2-6)-configuration to 
Gal and via the C-8 hydroxyl group for a(2-8)- 
bound Neu5Ac-Neu5Ac. The structure of II^Neu5Ac- 
Gg4Cer (or GMl) is depicted in Fig. 2b. 
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Glycosphingolipids, Table 1 Structures of GM3 and GD3 

and common gangliosides of the ganglio-, lacto-, and neolacto- 
series“ 


GSL-series 

Structure 

Symbol 


iTNeuSAc-LacCer 

GM3 


Il\Neu5Ac)2-LacCer 

GD3 

Ganglio 

lLNeu5Ac-Gg3Cer 

GM2 


lLNeu5Ac-Gg4Cer 

GMl 


lL(Neu5Ac)2-Gg3Cer 

GD2 


IV^Neu5Ac,Il’Neu5Ac-Gg4Cer 

GDla 


n\Neu5Ac)2-Gg4Cer 

GDlb 


IV^(Neu5Ac)2,lLNeu5Ac-Gg4Cer 

GTla 


IV^Neu5Ac,IL\Neu5Ac)2-Gg4Cer 

GTlb 


IV^(Neu5Ac)2,lL(Neu5Ac)2-Gg4Cer 

GQlb 

Lacto 

IV^"’Neu5Ac-Lc4Cer'’ 

IV“^^‘^Lc4 

Neolacto 

IV^''’Neu5Ac-nLc4Cer'’ 



vL''’Neu5Ac-nLc6Cer'’ 



“The nomenclature follows the lUPAC-IUB recommendations 
'’Binding of Neu5Ac in a(2-3)- or a(2-6)-conflguration 


The Lipid Raft Concept 

A range of biophysical and related techniques have 
attested that lipid raft microdomains do exist in 
cell membranes, which are characterized as small, 
heterogeneous, dynamic domains enriched in 
cholesterol, GSLs, sphingomyelin, and dipalmitoyl- 
phosphatidylcholine (one of the molecular species of 
phosphatidylcholine). This nonhomogeneous lateral 
distribution of membrane components is made possi¬ 
ble by the existence of lateral interactions which 
stabilize different membrane domains creating a 
second level of order in the molecular architecture 
of biological membranes (Prinetti et al. 2009). 
Sphingolipids that carry saturated acyl chains prefe¬ 
rentially assemble with cholesterol to form tightly 
packed lipid rafts which correspond to the liquid- 
ordered phase (lo), whereas the more fluid surrounding 
membrane is analogous to the liquid-disordered 
phase (lu), in which the lipid rafts can freely float. 
Glycerophospholipids and cholesterol constitute 
major lipids whereas GSLs are minor cell membrane 
compounds which preferentially reside in the outward¬ 
facing part of the bilayer with the oligosaccharide 
protruding into the extracellular environment. This 
hydrophilic headgroup as well as acyl chain structure 
of GSLs are considered functionally important 


modulators of lipid packing and thus of lipid raft 
formation (Maggio et al. 2006). Elemental rafts 
might be small, containing only several thousand 
molecules and, therefore, accommodating only a 
few proteins including glycosylphosphatidylinositol 
(GPI)-anchored proteins, doubly acylated proteins, or 
various transmembrane proteins, especially those car¬ 
rying palmitic acid (C16:0). 

Lateral Organization of GSLs in Phospholipid 
Bilayers 

At physiological temperature, i.e., above the melting 
temperature (T^) that is characteristic of each molecu¬ 
lar species of lipid, the bilayer exists in the Ij phase in 
which the lipid acyl chains are disordered and charac¬ 
terized by high fluidity. Acyl chains of lipids in the Iq 
phase are extended and ordered having high lateral 
mobility in the bilayer as in the 1^ phase. Cholesterol 
stabilizes the gap between the glycerophospholipid or 
GSL acyl chains in the Iq phase. The fact that lipids in 
different phases may coexist within the same mem¬ 
brane gave evidence for the concept of lipid domains. 
This lateral phase separation of complex lipids based 
on the incomplete miscibility within a single phase has 
been extensively studied by various biophysical strate¬ 
gies (Sonnino et al. 2006). The unique molecular struc¬ 
tures and conformational features of GSLs suggested 
the formation of segregated domains in phospholipid 
mixtures. As an example, phase separation can occur in 
mixtures of cholesterol with a phospholipid and 
a sphingolipid. This could be clearly shown in artificial 
membranes, for instance, in case of several neutral 
GSLs such as the monohexosylceramides GalCer and 
GlcCer and the tetrahexosylceramides Gb4Cer and 
Gg4Cer. The employment of negatively charged gan¬ 
gliosides revealed a more complex situation. Lateral 
cooperative interactions between these molecules 
reduced mobility in phospholipid bilayers most likely 
due to self-clustering of gangliosides. This effect could 
be enhanced by Ca^''' ions suggesting that polar head 
groups of gangliosides may participate in phase sepa¬ 
ration. Experiments obtained with semisynthetic gan- 
glioside variants differing in fatty acid acyl chain 
length, degree of unsaturation, and nature of the 
sphingoid base indicated that this behavior could be 
attributed to the differences in the ceramide moieties. 
Thus, as exemplarily shown for GMl species with 
homogenous sphingoid base composition, a decrease 
in the acyl chain length or an increase in its 
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unsaturation induced enhanced ganglioside distribu¬ 
tion to the Ij phase of the bilayer (Sonnino et al. 
2006). In conclusion, numerous studies of GSLs in 
artificial membrane models clearly demonstrated that 
sphingolipids have the tendency to form “clusters” or 
“lipid rafts,” the term most widely used in the current 
literature, in fluid lipid bilayers. 

The lipid raft concept has long been and is still 
controversially discussed, since techniques developed 
so far did not allow lipid rafts to be “viewed” in 
living cells. The existence of GSL clusters at the cell 
surface of natural cell membranes was demonstrated 
by immuno-electron microscopy, for instance, for 
Gb4Cer in human erythrocytes, GM3 in human lym¬ 
phoid cells, and polysialogangliosides in fish brain 
neurons. The implementation of novel microscopy 
and spectroscopy technologies (e.g., single-particle 
or single-fluorophore tracking microscopy, atomic 
force microscopy, fluorescence recovery after 
photobleaching, and fluorescence resonance energy 
transfer) has deepened our insight into the dynamics 
of membrane organization and impressively substanti¬ 
ated the lipid raft concept of membrane subcompart- 
mentalization (Lingwood and Simons 2010; Simons 
and Gerl 2010). This confirms dynamic nanoscale 
assemblies of cholesferol, sphingolipids, and proteins 
in living cells which can coalesce and form platforms 
that are important, e.g., in signaling, membrane traf¬ 
ficking, and viral infection. However, conflicting 
results were obtained in some cases such as in the 
determination of the average size of lipid rafts which 
ranges from 26 nm to about 2 pm. Since membrane 
lipid domains harboring cell lipids (including choles¬ 
terol and GSLs) are not solubilized in aqueous 
nonionic detergents under specific conditions, compo¬ 
sitional analysis of such domains is largely obtained 
from detergent-resistant membranes (London and 
Brown 2000). 

Detergent-Resistant Membranes 

The specialized lipid and protein composition of 
cholesterol-rich lipid rafts in the tightly packed 
lo phase make these domains relatively resistant to 
solubilization by nonionic detergents such as Triton 
X-100 in the cold allowing for the isolation of deter¬ 
gent-resistant membranes (DRMs) from low buoyant 
density fractions after sucrose density ultracentrifu¬ 
gation. Because the degree of lipid insolubility 
depends on the stability of lipid-lipid interactions 
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Glycosphingolipids, Fig. 3 A schematic illustration of the 
effect of temperature on solubilization by detergent. At T ^ Tn, 
lipids with long, saturated acyl chains are insoluble, and do not 
bind detergent well. At T close to Tn, detergents can bind and 
micelles form at low detergent concentrations due to lipid packing 
behaviour. At T > detergents bind well, but form mixed 
bilayers with lipid at subsaturating concentrations. At higher 
detergent concentrations micelles form. Reprinted from London 
and Brown (2000) with permission from Elsevier Science B.V., 
copyright 2000 


relative to lipid-detergent interactions, the quantitative 
relationship between lipid rafts and detergent- 
insoluble membranes is complex, and can depend on 
lipid composition, detergent, and temperature (see 
Fig. 3) (London and Brown 2000). The isolation of 
insoluble membrane fractions is a key factor in the 
ability to understand the origin and structure of tightly 
packed cholesterol-sphingolipid-rich membrane lipid 
rafts. However, the limitations of DRMs must be 
clearly understood to avoid experimental artifacts, 
and it should be stressed that DRMs should not be 
equated with lipid rafts (Lingwood and Simons 
2007). Nevertheless, when used carefully DRMs are 
an extremely helpful tool for studying the composition 
of lipid rafts and characterizing the functional interac¬ 
tions of their constituents. 
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GSL-Mediated Effects on the Function of 
Membrane-Associated Proteins 

Although neutral GSLs as well as gangliosides have 
emerged as membrane compounds with high potential 
for specific lateral interactions with membrane- 
associated proteins, the molecular and functional 
impact of GSL-protein interactions underlying 
modulatory effects of GSLs are still only poorly under¬ 
stood. Two examples of DRM studies are provided 
in the following sections that evidence the involve¬ 
ment of the hydrophobic lipid anchor of a neutral 
GSL as well as the hydrophilic headgroup of a 
ganglioside in specific funcfional modification of 
membrane proteins. 


C24 LacCER 



PHAGOCYTOSIS 


LacCer and Src Family Kinase Lyn 
How a neutral GSL can participate in the process of 
transduction of information across the membrane has 
been shown in DRM studies for LacCer (Sonnino et al. 
2009). This neutral GSL specifically interacts with 
the Src family kinase Lyn in plasma membrane 
microdomains of human granulocytes (Fig. 4). Lyn is 
activated by ligand binding to LacCer resulting in 
phagocytosis, superoxide generation, and migration 
of neutrophils. The long C24 fatty acid of LacCer 
was found to be functionally essential in addition to 
the hydrophilic Galp4Glc-disaccharide which is also 
necessary but not sufficient for Lyn activation. Thus, 
interdigitation of the ceramide is proposed to be the 
switch for the transduction of information through the 
membrane. 


^ glycerophospholipid 

H LacCER 

H sphingomyelin 

^ acyl-anchor 

^ cholesterol 

^ gangliosides 


Glycosphingolipids, Fig. 4 LacCer-mediated signaling 
through Src family kinase Lyn. In DRMs of human neutrophils 
Lyn was found to physically interact with C24-LacCer, likely via 
van der Waals forces between the hydrocarbon chains of LacCer 
and Lyn. Coupling of Lyn with C24-LacCer is required for 
LacCer-mediated signal transduction, leading to superoxide gen¬ 
eration, chemotaxis, and phagocytosis of neutrophils. Reprinted 
from Prinetti et al. (2009) with permission from Elsevier B.V., 
copyright 2008 


GM3 and Insulin Receptor 

How a ganglioside can influence the membrane loca¬ 
lization of a hormone receptor and the receptor- 
mediated responsiveness has been shown for GM3. In 
normal adipocytes (= fat cells) the insulin receptors 
reside in caveolae (a certain subtype of lipid rafts 
characterized by presence of caveolin-1) and are pre¬ 
sent in DRMs, where the P-subunit of insulin receptors 
interacts with caveolin-1 (Fig. 5). TNFa induces insu¬ 
lin resistance in adipocytes which is accompanied by 
an accumulation of GM3 in DRMs. This excess of 
GM3 induces the displacement of insulin receptors 
from the complex with caveolin-1 to the complex 
with GM3. The electrostatic interaction between the 
negatively charged Neu5Aca3Galp4Glc oligosaccha¬ 
ride headgroup of GM3 and the positively charged 


amino group of lysine 944, located closely to the trans¬ 
membrane domain sequence of insulin receptors, is 
essential for the formation of the GM3/insulin receptor 
complex. Thus, the regulation of insulin receptor/ 
caveolin-1 by GM3 is postulated to be responsible 
for the changed insulin response in insulin-resistant 
adipocytes (Prinetti et al. 2009). 

Basic Principles of Isolation and Thin-Layer 
Chromatography 

GSLs are usually extracted with chloroform-methanol 
mixtures with a recommended tissue-to-solvent ratio of 
1:20 to 1:40 (weight/volume). Anion-exchange chroma¬ 
tography is convenient to separate crude GSL extracts 
into neutral and negatively charged GSLs and 
high-performance liquid chromatography (FIPLC) is 
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Glycosphingolipids, Fig. 5 GM3-mediated change of insulin 
receptor responsiveness. In DRMs of insulin-resistant adipo¬ 
cytes, accumulation of GM3 was found to induce displacement 
of insulin receptors (IR) from the complex with caveolin-1 to the 
complex with GM3 leading to inhibition of downstream signal 


transduction. The electrostatic interaction between the 
sialyllactose chain of GM3 and lysine 944 (K) in IR is essential 
for the formation of the GM3/IR complex. Reprinted from 
Prinetti et al. (2009) with permission from Elsevier B.V., copy¬ 
right 2008 
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Glycosphingolipids, Fig. 6 TLC overlay assay schemes of 
antibody- (a) and toxin-mediated GSL detection (b). GSL mix¬ 
tures are separated by TLC on a silica gel-coated glass plate. 
After fixation of the silica gel with polyisobutylmethacrylate 
(plexigum) (Miithing 1998), the chromatogram is consecutively 
overlaid with (a) the GSL-specific primary antibody and 
secondary alkaline phosphatase-labeled antibodies or (b) the 


GSL-specific toxin (for example, an AB 5 -toxin like Shiga 
toxin), the antitoxin primary antibody, and secondary enzyme- 
labeled antibodies. Bound antibodies are detected by 
generating a blue precipitate from the chromogenic substrate 
5-bromo-4-chloro-3-indolylphosphate (BCIP). Reprinted from 
Miithing and Distler (2010) with permission from Wiley 
Periodicals, Inc., copyright 2009 


appreciated for preparative isolation of individual GSLs 
(Miithing 2002). High-performance thin-layer chroma¬ 
tography (TLC) in conjunction with immunostaining on 
the plate (overlay technique) using GSL-specific poly- 
or monoclonal antibodies or GSL-binding toxins 
(Fig. 6) has become the standard tool for analytical 
purposes (Miithing 1998). 


Mass Spectrometric Investigation of GSLs 

Structural details of the oligosaccharide portion and 
the ceramide moiety of GSLs are essential for under¬ 
standing the biophysical features of a given GSL and 
the prerequisite especially for studies on the function 
of GSLs in membrane microdomains. Numerous mass 
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spectrometry (MS)-based methods have heen devel¬ 
oped among which electrospray ionization (ESI) and 
matrix-assisted laser desorption/ionization (MALDI) 
MS have been found to especially allow highly sensi¬ 
tive characterization of GSL molecules by their mole¬ 
cular mass and sugar composition using fragmentation 
techniques (Levery 2005). In recent years the general 
potential of high-throughput GSL analysis based on 
automatization and miniaturization for rapid and sen¬ 
sitive analysis by implementation of liquid chromato¬ 
graphy (LC) MS^ has been demonstrated (Sommer 
et al. 2006) and as a next step chip-based technology 
employing microfluidic devices is believed to play an 
important role for future developments (Bindila and 
Peter-Katalinic 2009). However, as the workhorse of 
many laboratories, MALDI MS has become one of the 
major tools for the analysis of carbohydrates and 
glycoconjugates, including GSLs, as continuously 
and extensively reviewed for the last years (Harvey 
2011). However, exhaustive structural characteriza¬ 
tion of a GSL requires the determination of the (1) indi¬ 
vidual sugar monomers, (2) ceramide constitution, 
(3) monosaccharide sequence, (4) linkage positions, 
(5) anomeric configuration, (6) conformation of sugar 
rings, and (6) secondary structure. With the exception 
of nuclear magnetic resonance spectroscopy, MS alone 
can be used at the best to define (2)-(4), but a complete 
structural elucidation requires a multidisciplinary 
approach and has to resort to additional biochemical 
or immunochemical tools such as glycosidases or anti¬ 
bodies, respectively, or chemically modified monosac¬ 
charides like partially methylated alditol acetates for 
gas chromatography MS, just to mention a few (Geyer 
and Geyer 1998). However, structural analyses are 
often impeded by the limited amounts of sample avail¬ 
able and the vast structural heterogeneity of glycan 
chains. With respect to these restrictions, recent 
improvements were obtained by merging TLC separa¬ 
tion of GSLs, their detection with oligosaccharide- 
specific antibodies or toxins, and in situ infrared 
(IR)-MALDl MS analysis of protein-detected GSLs 
directly on the silica gel-coated glass plate as shown 
for antibody-mediated detection in Fig. 7, thereby 
delivering specific structural information of small 
quantities of GSLs with only limited investment in 
sample preparation (Miithing and Distler 2010). This 
is demonstrated for GbSCer (dl8;l, C24:l/C24:0) and 
Gb3Cer (dl8:l, C16:0) in Fig. 8a and b, respectively, 
and a mixture of the GMl variants II^Neu5Ac-Gg4Cer 




i 


Mass spectrum 



Glycosphingolipids, Fig. 7 Scheme of matching TLC 
immunodetection (a) with direct IR-MALDI MS (b and c). 
(a) A GSL mixture is separated on a silica gel-coated TLC 
plate (1) and the overlay assay (2 and 3) is performed as 
described in the legend to Fig. 6a. After removal of the plastic 
fixative the TLC plate is cut into appropriate pieces (b) and fixed 
on the MALDI target. The immunostained bands are sprinkled 
with the glycerol matrix and the target is inserted into the mass 
spectrometer. Desorption/ionization of GSLs by IR-photons is 
achieved directly from the immunostained bands resulting in the 
MALDI mass spectrum (c) (Miithing and Distler 2010) 

(dl8:l, C18;0) and lI^Neu5Ac-Gg4Cer (dl8:l, C20:0) 
in Fig. 8c. More precise in-depth structural information 
can be obtained by fragmentation analysis such as 
collision-induced dissociation (CID) as exemplarily 
shown for ESI quadrupole time-of-flight (Q-TOF) 
MS of immunostained IV”Neu5Ac-nLc4Cer species 
and subsequent MS^ of IV®Neu5Ac-nLc4Cer (dl8:l, 
C16:0) in Fig. 9. 
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Glycosphingolipids, Fig. 8 Direct TLC-IR-MALDI mass 
spectra of GbSCer detected with Shiga toxin (Stx; a and b) and 
of II^Neu5Ac-Gg4Cer (=GM1) detected with the cholera toxin 
B subunit (CTB; c). GSLs were chromatographed and stained 
with orcinol (Ore) or identified by overlaying the TLC plate with 
the respective toxin followed by immunostain. GSL species 
analyzed by MS in the positive ion mode are marked with 
arrowheads in each panel. Total amounts of 10 pg of neutral 
GSLs from human erythrocytes were applied for the orcinol and 
the Stx overlay assay (a and b); total amounts of 20 pg and 2 pg 
of human brain gangliosides were employed for the orcinol and 
the CTB overlay assay (c). In the upper Stx-positive band (a) the 


prevalent monosodiated [M^ + Na]"^ ions are indicative for 
GbSCer (dl8:l, C24:l/C24:0) and the minor [M* + Na]"^ ions 
for GbSCer (dl8:l, C22:0). The lower Stx-positive band 
(b) contains GbSCer (dl8:l, C16:0) evidenced by the [M + 
Na]"^ ions. In the CTB-positive band (c) the dominant disodiated 
[M' - H + 2Na]'^ and [M^ — H -r 2Na]'^ ions could be assigned 
to the GMl variants II^Neu5Ac-Gg4Cer (dl8:l, C18:0) and 
II^Neu5Ac-Gg4Cer (dl8:l, C20:0), respectively, flanked by 
their corresponding less abundant monosodiated [M* + Na]"^ 
and [M^ + Na]"^ ions. The symbolic representation system 
according to Varki (2007) is used in this figure: Gal, 

Glc, GalNAc, Neu5Ac 
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Glycosphingolipids, Fig. 9 (continued) 
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Conclusions 

Novel hyphenated MS strategies aimed at full struc¬ 
tural characterization of GSLs in small sized samples 
and the implementation of sophisticated microscopical 
techniques capable of viewing lipid rafts in living cells 
will contribute to better understanding of plasma mem¬ 
brane structure and function. This interdisciplinary 
approach together with improved microdomain prepa¬ 
ration techniques will help us to further exploring the 
largely enigmatic functional role of GSLs in lipid rafts. 

Summary 

In this mini review we at first give a short introduction 
on the concept of lipid membrane domains followed by 
some general remarks on the chemical structure, bio¬ 
synthesis, and the nomenclature of glycosphingolipids 
(GSLs), which are known as key structural elements of 
cellular membranes. We then refer to the supramolecu- 
lar organization of lipid membrane microdomains and 
the functional role which GSLs play owing to their 
specihc physicochemical structural features in those 
domains named as lipid rafts. Next a short survey 
on the core principles of isolation and basic separa¬ 
tion techniques of GSLs is provided. Since exact 
knowledge of GSL structures is required particularly 
to exploring their biophysical characteristics and 
functional roles in biological membranes, we finally 
report on some novel aspects of recent state-of-the-art 
mass spectrometry strategies especially applicable for 
microscale structural investigations of GSLs and close 
the mini review with a short outlook. 
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clonal antibody AB2-6. 20 pg of gangliosides were applied on 
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TLC plate, extracted, and investigated by ESI-Q-TOF MS. 


(b) MS* spectrum of immunopositive IV®Neu5Ac-nLc4Cer 
species in the negative ion mode. [M* — H]“ ions indicative 
for IV®Neu5Ac-nLc4Cer (dl 8 :l, C16:0) and [M^ — H]^ ions 
representing IV®Neu5Ac-nLc4Cer (dl 8 :l, C24:l) are 

detected as predominant species, (c) MS^ spectrum of 
IV'’Neu5Ac-nLc4Cer (dl 8 :l, C16:0) with miz 1516.77. 
(d) Corresponding fragmentation scheme and molecular 
structure of IV^Neu5Ac-nLc4Cer (dl 8 :l, C16:0). Y 4 ions 
and diagnostic ions resulting from ring cleavages are printed 
in boldface type (Miithing and Distler 2010). The symbolic 
representation system according to Varki (2007) is used in this 
figure: Gal, Glc, GlcNAc, Neu5Ac 
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Synonyms 

Glycosides 


Definition 

Glycosylated natural products are defined as natural 
compounds where the carbohydrate part (glycon), 
either monomers or oligomers of different structures, 
are covalently bound to another chemical structure 


(aglycon). The carbohydrate can be 0-, N-, S-, or 
C-linked to the aglycon. 0-glycosides are the most 
common ones. In nature, glycosylation is normally 
carried out via a nucleoside diphospho-sugar, i.e., 
UDP-glycose, an enzyme, (glycosyltransferase), and 
the natural product carrying a hydroxyl group being 
the site for glycosylation. Natural products that 
frequently exist as 0-glycosides are flavonoids, anthra- 
quinones, certain types of steroidal compounds like 
cardiac glycosides and saponins, proteins, lipids, and 
nucleosides (DNA and RNA). Cyanogenic glycosides 
(e.g., amygdalin from Prunus amygdalus var. amara 
(bitter almonds) and linamarin from Manihot esculenta 
(cassava)) are 0-glycosides that upon hydrolysis will 
give rise to HCN (hydrocyanic acid or prussic acid). 
Glucosinolates are S-glycosides (products found in 
mustard, Sinapis sp.). N-linked glycosides are 
found both in glycoproteins and in nucleosides, while 
C-glycosides most frequently are present in flavonoids 
and anthraquinones. 0-, N-, and S-linked glycosides 
are easily hydrolyzed by acid, while C-glycosides will 
not be hydrolyzed as this is a C-C linkage. 

Cross-References 

► Monosaccharide-Protein Linkage Region 


Glycosylation 

► Glycan-to-Protein Linkages 


Glycosylation in Metabolism 

Elizabeth Hounsell 

School of Biological and Chemical Sciences, Birkbeck 
College, University of London, London, UK 

Synonyms 

Biological half life; Drug metabolism; Excretion; 
Toxicity 

Definition 

Eor low-molecular-weight metabolites and drugs, 
metabolic conjugation in general increases polarity 
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Glycosylation Sequon 


and water solubility, resulting in facilitation of excre¬ 
tion and elimination from the organism. The reactions 
involved are classified as Phase 1 or Phase 2. The 
former are simple (although highly enzyme controlled) 
oxidation, reduction, hydrolysis, hydration and 
dehalogenation followed by one of sulfation, 
glucuronidation, glutathione conjugation, acetylation, 
and amino acid conjugation (acetylation decreases the 
solubility). Of the Phase 1 enzymes, the cytochrome 
p450 mono-oxygenase system epitomizes the products 
of a gene family that show genetic polymorphisms. 
Of the Phase 2 reactions glucuronidation is the addition 
of the monosaccharide, glucuronic acid (GlcA) to 
hydroxyl, carboxylic acid, amino and thiol groups 
resulting in the a-glucuronide. The reaction is cata¬ 
lyzed by glucuronosyl transferases, microsomal 
enzymes which utilize GlcAP-UDP cofactor as the 
source of GlcA. Less common are the conjugation of 
Glc, Rih, and Xyl, but all increase the hydrophilicity 
and increase elimination. 

The glycosylation pattern of proteins has been 
implicated in their removal from circulation. This 
relates to N-linked glycosylation where the 
Man 3 GlcNAc 2 core (► Glycan-to-Protein Linkages) 
in common complex-type chains (► Glycoproteins) 
has additional A-acetyllactosamine (Gal-GlcNAc) 
disaccharides attached that are usually sialylated. The 
main sialylation is with the sugar A-acetyl-neuraminic 
acid a C-9 pyranose form monosaccharide with a car¬ 
boxyl group designated as C-1, an acetamido group at 
C-5 and a glycerol side chain at C-6 (► Carbohydrate 
Nomenclature). It is thought that removal of this sialic 
acid results in a signal recognized by a Gal-binding 
lectin (the hepatic asialoglycoprotein receptor) on the 
surface of hepatic cells of the liver that attach to the 
glycan and initiate removal of the protein from circu¬ 
lation. The glycosylation of serum proteins certainly 
influences their survival or half-life times. The hepatic 
asialoglycoprotein receptor also importantly mediates 
the lysosomal degradation of many proteins. In bio¬ 
synthesis of proteins, correct glycosylation appears to 
be one of the signals for proper folding and intracellu¬ 
lar transport with incorrectly folded glycoproteins 
being degraded at an early stage. 

After biosynthesis, transport to intracellular organ¬ 
elles or the cell surface, and possible secretion, protein 
glycosylation affects recycling. Protein folding can 
also be affected by glycation where circulating glucose 
undergoes a Maillard reaction with protein amine 


groups followed by Amadori rearrangement that 
leads to Advanced Glycation End (AGE) products 
which severely changes protein conformation, deposi¬ 
tion, and catabolism. This is particularly relevant in 
patients with poorly controlled circulating glucose 
levels (i.e., in diabetes, but is also a phenomenon of 
old age), leading to deposition of “sticky” proteins in 
small blood vessels (such as in the eye, the kidney, and 
skin) (► (Glyco)protein Folding Disorders). The 
damage is monitored by the level of hemoglobin 
HbAic formed by glycation of the amino terminus 
NH 2 group, but many other proteins are affected. 
Conversion of glucose into glucosamine, a key 
metabolic precursor of UDP-GlcNAc, is also an essen¬ 
tial prerequisite for the development of type 2 diabetes. 
UDP-GlcNAc is the substrate for the enzyme that 
catalyzes glucosaminylation of cytoplasmic proteins 
acting reciprocally with phosphorylation as signals 
intimately involved in cell regulation and metabolism 
(► (Glyco)protein Folding Disorders). 

Cross-References 

► Carbohydrate Nomenclature 

► (Glyco)protein Folding Disorders 

► Glycan-to-Protein Linkages 

► Glycoproteins 
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Glycosylphosphatidylinositol 

Mike Ferguson 

Division of Biological Chemistry & Drug Discovery, 
The Wellcome Trust Biocentre, University of Dundee, 
Dundee, Scotland, UK 

Synonyms 

Glycoconjugates; Glypiated Proteins; GPI; Membrane 
anchor 
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Definition 

Glycosylphosphatidylinositol (GPI) is a generic term for 
compounds containing the structural motif: D-GlcN- 
(al- 6 )-D-» 2 yo-inositol-l-P 03 H-lipid, where the lipid 
is either a glycerolipid (OT-l, 2 -diacylgycerol, ot- 1 - 
acylgycerol, ot- 1 -alkyl- 2 -acylgycerol) or a ceramide. 
They can be further divided into three types, 
depending on the glycosyl substituents attached to the 
glucosamine (GlcN) residue. Those containing Man- 
{a 1 - 6 )-Man-(ot 1 -4)-GlcN are type-1 GPIs, those 
containing Man-(ot 1 -3)-Man-(oc 1 -4)-GlcN are type-2 
GPIs, and those containing Man-(Q[ 1 -3)- [Man-(oc 1-6)-] 
Man-(al-4)-GlcN are hybrid GPIs. 

The type-1 GPIs are the most widely distributed, 
and they include the protein-linked GPI membrane 
anchors that, with a few protist exceptions, appear to 
be ubiquitous among the eukaryotes. Mature protein- 
linked GPI membrane anchors have a common 
core structure of NH 2 CH 2 CH 2 P 03 H- 6 )-Man-(al- 2 )- 
Man-(al - 6 )-Man-(a I -4)-GlcN-(a I - 6 )-» 2 yo-inositol-1 - 
P 03 H-lipid, where the amine of the ethanolamine 
phosphate group is in amide linkage to the mature 
C-terminal a-carboxyl group of the GPTanchored 
protein. The common GPI core can be variously 
substituted in a species-, tissue-, and protein-specific 
manner, with additional ethanolamine phosphate 
groups (that are not involved in attachment to protein), 
with an additional fatty acid ester linked to the 
2 -hydroxyl of the myo-inositol residue, and with 
mono- and/or oligosaccharide side chains that can be 
attached to any of the three core mannose residues. 

The type-2 and hybrid GPIs are restricted to certain 
protozoa, like the leishmania parasites, where they can 
represent the most abundant surface molecules. Type-2 
and hybrid GPIs are not found attached to proteins 
and can range from simple small structures to large 
and complex glycoconjugates, like the leishmania 
lipophosphoglycans. 

The biosynthesis of the type-1 GPI protein 
membrane anchors is quite well understood. 
Assembly proceeds on the cytoplasmic face of the 
endoplasmic reticulum (ER) by the transfer of 
A-acetylglucosamine (GlcNAc) from UDP-GlcNAc 
to a phosphatidylinositol (PI) phospholipid, followed 
by removal of the acetyl group to yield GlcN-PI. This 
intermediate is translocated into the lumen of the ER 
where the myo-inositol residue is acylated, the GlcN 
residue tri-mannosylated (from dolichol-P-mannose), 


and ethanolamine phosphate residues(s) are added 
from phosphatidylethanolamine. Proteins destined to 
contain a GPI membrane anchor contain both an 
N-terminal signal peptide, for entry into the lumen of 
the ER, and a C-terminal GPI-addition signal peptide. 
The GPI anchor precursor is transferred to these pro¬ 
teins by a GPI-transamidase complex in exchange for 
the GPI-addition signal peptide. Processing of the pro¬ 
tein-linked GPI anchor can occur in the ER and in the 
Golgi apparatus. These processing events can include 
the addition of carbohydrate chains and the remodeling 
of the lipid component (e.g., from diacylglycerol to 
alkylacylglycerol, the exchange of unsaturated for 
fully saturated fatty acids and the exchange of 
diacylglcerol for ceramide). 

The functions of GPI membrane anchors are 
species-specific. Thus, in the protozoa, GPTanchored 
proteins tend to be the major surface proteins that are 
involved in survival and/or infectivity. In these cases, 
the GPI anchor may simply be a convenient insulator 
and space-saving mechanism, to allow high levels of 
surface protein packing. In the yeasts and fungi, 
many GPTanchored proteins are destined for cross- 
linking into the yeast/fungal cell wall. This occurs via 
a transglycosylation mechanism involving the GPI 
anchor itself. In higher eukaryotes, GPTanchored 
proteins tend to occur in dynamic sphingolipid and 
cholesterol-rich membrane microdomains known as 
“lipid rafts.” The function(s) of lipid rafts are 
under constant revision but they include acting as 
platforms for signal-transduction and in cell-cell 
communication. 


Cross-References 

► Carbohydrate Nomenclature 

► (Glyco)protein Folding Disorders 

► Glycoconjugates in Cell Function and 
Therapeutics 
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GPCR 


GPCR 

► Dynamics of Helix 8 in GPCR Function 

► G Protein-Coupled Receptor Activation Based on 
X-ray Structural Studies 


GPI 

► Glycosylphosphatidylinositol 


Green Fluorescent Protein (GFP) 

► Flavin Mononucleotide-Binding Fluorescent 
Proteins 


Ground State Depletion Imaging 

Francesca Celia Zanacchi and Alberto Diaspro 
Department of Nanophysics, Italian Institute of 
Technology (IIT), Genoa, Italy 
Department of Physics, University of Genoa, 

Genoa, Italy 

Definition 

Ground state depletion imaging (GSDIM) is a super¬ 
resolution imaging technique based on single molecule 
localization of synthetic fluorophores. 


The basic idea behind this technique relies on the 
possibility to drive the fluorophores into a “dark” meta¬ 
stable triplet state in order to obtain a sparse subset of 
active fluorophores. Thus the molecules returned into 
the ground state can be excited, collected, and their 
positions can be localized with a better precision than 
the microscope resolution. The observed fluorophores 
are driven to the metastable dark state and new 
markers, returned to the ground state, can he turned 
on and collected. The cycles are repeated until enough 
molecules are acquired and the final image provided by 
the sum of all the localized events can be 
reconstructed. Such a technique can he applied on 
a wide number of conventional dyes and it provides 
the possibility to perform multicolor and 3D super¬ 
resolution imaging. 

GSDIM belongs, together with ► photoactivation 
localization microscopy PALM, ► fluorescence 
photoactivation localization microscopy (FPALM), 

► stochastic optical reconstruction microscopy 
(STORM), to the wide family of ► individual mole¬ 
cule localization techniques (IML). The brightness of 
the employed dyes allows to achieve a localization 
precision (around 25 nm) comparable to the one pro¬ 
vided by STORM techniques. 

Cross-References 

► Fluorescence: General Aspects 

► Fluorescence Photoactivation Localization 
Microscopy (FPALM) 

► Individual Molecule Localization Techniques 

► Photoactivated Localization Microscopy (PALM) 

► Stochastic Optical Reconstruction Microscopy 
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H/D Exchange 

Kandala V. R. Chary and Atul K. Srivastava 
Department of Chemical Sciences, Tata Institute of 
Fundamental Research, Mumbai, India 


Synonyms 

Hydrogen exchange 


Definition 

Hydrogen exchange is an interchange of hydrogen 
atom of a molecule (protein, nucleic acid, etc.) with 
that of the solvent. It is catalyzed by both acid (HsO"^) 
and base (OH“). 

In general, the hydrogen exchange rate constants 
(kj-c) vary with temperature and pH. When log(^,.c) is 
plotted against pH, it gives rise to a V-shaped curve 
with a minimum around pH 3.0. In a random coil 
peptide, the k,.^. of an individual amide proton (’H^) 
further depends on the amino acid sequence (due to the 
inductive and steric effects of neighboring residues). 
All these effects can be taken together to predict 
respective k,.^ of any given protein sequence. 

In folded proteins, the observed *H^ exchange rates 
(kofos) are much smaller than the predicted , and such 
retardation in exchange process is expressed in terms 
of protection factor, P {k,.Jkohf) that can range from 
a value 1 (for completely unprotected) to 10*° or higher 
for signihcantly protected one. The slow exchange is due 
to the exclusion of solvent (and hydrogen exchange 
catalyst) from the protein core or/and hydrogen bonding. 


To exchange, the protected amide hydrogens must 
transiently expose to the solvent. This opening event 
may include small fluctuation, cooperative partial 
unfolding, or even global unfolding of the protein. 
These states, capable of exchange, represent a higher 
energy conformation within the native state ensemble, 
and this very reason makes the hydrogen exchange 
a powerful technique to study the details of protein 
native ensemble. The technique is also used to study 
the partially unfolded intermediates (kinetic and equi¬ 
librium), even when their population is too small to be 
detected by conventional biophysical methods. 

The exchange reaction for a protected *H*^ can be 
modeled using a microscopic two-state model; 


Closed 


Open Exchanged 


Under steady state approximation for the rare Open 
state, the observed exchange rate constant can be given 


as, IcqIjs — 


h 1c 
•^op ^rc 


-. If kci < krc, kohs = kop’, this case is 


kci kf^ 

called EXl mechanism. Contrary to that, if kd > k^c, it 
is called EX2 mechanism. In the later case, opening 
rarely results in exchange and making it a 
preequilibrium step. Assuming k^p < k^ (a rare 
population of Open state at equilibrium), the observed 


exchange rate is given by kphs = 


Jc Ic 

^op ^rc 


— ^on kp 


Kop is the equilibrium constant for the opening 
event. In the EX2 regime, the free energy change 
required for the opening event (AGhx) can be calcu- 

k h 

lated by AGhx = -RTlnKpp = -RTln — = RTlnP. 

krc 

Thus, hydrogen exchange can report on both the 
kinetic (EXl regime) and thermodynamic (EX2 
regime) aspects of any given protein. 


G.C.K. Roberts (ed.), Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 
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H^-Lactose Membrane Transport Protein, LacY 


Hydrogen exchange can be detected using different 
isotopes (^H and ^H). ^H 20 is mostly used because of 
its low cost and nonradioactive nature. The exchange is 
initiated either by dissolving the lyophilized protein in 
^H 20 or by rapidly exchanging the protons with deu¬ 
terium of the solvent. The incorporated into the 
molecules are detected by FTIR, UV, NMR, or Mass 
spectrometry. The latter two techniques are mostly 
employed to detect hydrogen exchange in biomole¬ 
cules. The hydrogen exchange data can reveal the 
details of structure and dynamics of a native protein 
and partially unfolded intermediates. It is also used to 
study the protein folding and various kinds of biomo- 
lecular interactions and aggregation. 

Cross-References 

► H/D Exchange 

► Infrared Spectroscopy of Protein Dynamics: H/D 
Exchange 

► Infrared Spectroscopy of Protein holding, 
Misfolding and Aggregation 

► Protein holding 


H^-Lactose Membrane Transport 
Protein, LacY 

Lan Guan* and H. Ronald Kaback^ 

'Department of Cell Physiology & Molecular 
Biophysics, Center of Membrane Protein Research, 
Texas Tech University Health Sciences Center, 
Lubbock, TX, USA 

^Departments of Physiology and Microbiology, 
Immunology & Molecular Genetics, Molecular 
Biology Institute, University of California, 

Los Angeles, Los Angeles, CA, USA 

Synonyms 

Lac carrier protein; Lactose permease 

Definition 

LacY {aka Lac permease) is the product of the lacY 
gene, the second structural gene in the lac operon, 
which encodes a single polypeptide that is solely 


responsible for all the translocation reactions catalyzed 
by the /1-galactoside transport system in E. coli. LacY 
(TCDB 2.A.L5.1) belongs to the oligosaccharide/H"^ 
symport subfamily of the major facilitator superfamily 
(MES), which contains >40,000 members found 
ubiquitously in all living organisms. LacY is selective 
for disaccharides containing a o-galactopyranosyl 
ring, as well as o-galactose, but has no affinity for 
D-glucopyranosides or o-glucose. LacY catalyzes the 
vectorial reversible reaction: 

H+(out) + Lactose(out) Lactose(in) -t- H+(in) 


CHpOH 



Structure of lactose 


Introduction 

The lacY gene is the first gene encoding a membrane 
transport protein to be cloned into a recombinant 
plasmid and sequenced as well as the hrst symporter 
to be purihed to apparent homogeneity in a completely 
functional state and to have a high-resolution crystal 
structure determined by X-ray diffraction (Kaback 
et al. 2001; Guan and Kaback 2006). The hydrophobic 
polypeptide is composed of 417 amino acid residues, 
has a mass of 46,517 Da, but electrophoreses with an 
M, of ^33 kDa due to excess dodecyl sulfate binding. 
As a particularly well-studied sugar/H"^ symporter, 
LacY is a paradigm for exploring the molecular mech¬ 
anism of solute/cation symport catalyzed by many 
secondary transport proteins. 

LacY is an effective chemiosmotic machine that 
catalyzes the coupled translocation of an H"^ and a 
galactopyranoside (galactoside/H"^ symport). Since 
translocation is obligatorily coupled, sugar accumulation 
against a concentration gradient is achieved by transduc¬ 
tion of the free energy released from the downhill 
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a Active transport 



H+ 

Lac 



H^-Lactose Membrane Transport Protein, LacY, 
Fig. 1 Lactose/H"^ symport. (a) Active transport. The electro¬ 
chemical gradient (A/ijj+) across the cytoplasmic membrane 
of E. coli is generated by pumping of via the respiratory chain 
or through the hydrolytic activity of FiFq ATPase. Free energy 
released from the downhill movement of Fl’^ catalyzed by LacY 
{sand color) is coupled to the uphill accumulation of lactose as 
indicated by the direction of the arrows and font size, (b) Influx 

movement of with the electrochemical gradient 
interior negative and/or alkaline) (Fig. la). Con¬ 
versely, downhill sugar translocation by LacY drives 
uphill translocation of Ff^ with the generation of Ap.^i+, 
the polarity of which depends upon the direction of 
the sugar concentration gradient (Fig. lb, c). Notably, 
LacY also catalyzes equilibrium exchange of internal 
radiolabeled galactoside for external unlabeled sugar 
substrate or the reverse reaction (i.e., counterflow) in a 
manner that is completely independent of Ap.n+ 
(Fig. Id). Thus, the primary driving force for the global 
conformational change involved in transport (alternating 
access of the binding sites to either side of the mem¬ 
brane) is due to association and dissociation of sugar 
by protonated LacY. Without a galactoside, LacY does 
not translocate in the presence of A/ijj+, and 
unprotonated LacY does not bind galactosides (i.e., bind¬ 
ing of both substrates is required for symport). 

X-ray Crystal Structure of the LacY Protein 

LacY is an ^86% a-helical membrane protein and 
contains 12 mostly irregular transmembrane helices 
traversing the membrane in zig-zag fashion connected 


and (c) Efflux. Energetically downhill lactose transport gener¬ 
ates Ajln+, the polarity of which depends upon the direction of 
lactose concentration gradient (influx generates a that 

is interior positive and acid; efflux generates a A/i[j+ that is 
interior negative and alkaline), (d) Equilibrium exchange. 
At equal intra- and extra-cellular lactose concentrations, 
lactose exchange across the membrane is catalyzed by proton¬ 
ated LacY 

by relatively hydrophilic loops with both N- and 
C-termini on the cytoplasmic side of the membrane. In 
the X-ray crystal structures (Abramson et al. 2003; 
Mirza et al. 2006; Guan et al. 2007; Chaptal et al. 
2011), LacY is in an inward-facing conformation. 
Viewed parallel to the membrane (Fig. 2a, c, d), the 
molecule is heart-shaped with a cavernous water-filled 
cleft 25 X 15 A and open on the cytoplasmic side only; 
the periplasmic side is tightly sealed. The largest dimen¬ 
sions of the molecule are 60 x 60 A. Viewed normal to 
the membrane from the cytoplasmic side (Fig. 2b, e), the 
molecule has a distorted oval shape with dimensions of 
30 X 60 A. The electrostatic surface potential calculated 
from the crystal structure reveals a positively charged 
belt around the periphery of the cytoplasmic opening 
(Fig. 2c, d, e) (Yousef and Guan 2009) with negatively 
charged residues distributed preferentially on the 
periplasmic side of the molecule (Fig. 2f). Thus, LacY 
follows the positive inside rule. 

Helix Packing and Pseudosymmetry 

LacY is organized into two 6-helix bundles connected 
by a long loop between helices VI and VII (Fig. 2a, b). 






H 


938 


H^-Lactose Membrane Transport Protein, LacY 



Cytoplasmic side 


Cytoplasmic side 
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Periplasm 


Cytoplasmic view 


Periplasmic view 
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H^-Lactose Membrane Transport Protein, LacY, 
Fig. 2 X-ray crystal structures (PDB, lpv7). Crystal structures 
of all LacY molecules determined thus far (PDB,lpv7, lpv6, 
2cfp, 2cfq, 2v8n, and 2y5y) exhibit an inward-facing conforma¬ 
tion. In the ribbon representations of LacY (a, b), the 
12 transmembrane helices are colored from the N-terminus in 
blue to the C-terminus in purple. The lactose homologue 
thiodigalactoside is shown as black spheres, (a) Viewed parallel 
to the membrane with helices V and VIII in front. The membrane 

The hydrophilic cavity is lined by helices I, II, IV, and 
V of the N-terminal domain and helices VII, VIII, X, 
and XI of the C-terminal domain. Helices III, VI, IX, 
and XII are largely embedded in the membrane. The 
cavity-lining helices I, IV, VII, and X are distorted with 
kinks, thereby providing a structural basis for the large- 
scale conformation changes observed in LacY. The 
N- and C-terminal 6 -helix domains have the similar 
topology and exhibit twofold pseudosymmetry. Within 
each domain, there are two 3-helix repeats with an 
inverted topology; these symmetrical features may pro¬ 
vide a foundation for the alternating access mechanism 
in LacY. 


Sugar Binding and Specificity 

A LacY mutant with a single native Cys residue at 
position 148 (helix V) binds the monosaccharide 


is represented in light gray, (b) Cytoplasmic view normal to the 
membrane. Helices are labeled with Roman numerals, (c-f) 
Electrostatic surface potential of LacY molecule. The scale 
indicates color-coded values of the electrostatic potentials (kT/ 
e). (c) and (d), viewed parallel to the membrane, where helices 
V/VIII and II/XI are in front, respectively, and the cytoplasmic 
side is on top\ (e) and (f), viewed from cytoplasmic and peri¬ 
plasmic sides, respectively 


galactose, the disaccharide lactose, or 4-nitrophenyl-a- 
D-galactopyranoside (a-NPG) with apparent dissociate 
constants (Xq) of 30 mM, 1 mM, or 20 pM, respectively 
(Sahin-Toth et al. 2000; Sahin-Toth et al. 2001; Guan 
and Kaback 2004). Galactose is the most specific 
substrate for LacY with lowest affinity, and various 
adducts - particularly if they are hydrophobic at the 
anomeric carbon C-1 of the galactopyranosyl ring, in 
the a configuration or both - increase affinity with no 
effect on specificity. The C 4 OH is unequivocally the 
most important determinant for specificity, and the C3, 
Ce, and C 2 OH groups on the galactopyranosyl ring play 
relatively minor roles in H-bonding. 

Cys-scanning and site-directed mutagenesis show 
that only a small number of amino acyl side chains 
located in transmembrane helices are irreplaceable 
with respect to active lactose transport (Frillingos 
et al. 1998). Among them, the charge pair Glul26 
(helix IV)/Argl44 (helix V) is essential for galactoside 
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binding and specificity. An aromatic side chain at 
position 151 (preferably Trp) in helix V is also 
critically involved in sugar affinity by stacking 
hydrophobically with the galactopyranosyl ring and 
forming an H-bond with Glu269 (helix VIII), another 
irreplaceable residue also critical for binding and 
transport. Replacement of Glul26 with Asp or 
Trpl51 with Tyr decreases affinity for galactoside 
significantly with little effect on apparent affinity 
for H-". 

These residues are located at the apex of the 
cavity approximately halfway through the membrane. 
Biochemical findings are consistent with the notion 
that Argl44 (helix V) may interact with the O 3 and 
O4 atoms of the galactopyranosyl ring and that Glul26 
is in proximity to the O4, O5, or Og atoms of the 
galactopyranosyl ring. Spectroscopic studies show 
that Trpl51 stacks with the hydrophobic face of the 
galactopyranosyl ring, and this interaction probably 
orients the ring so that important H-honds can be 
realized. Glu269 in the C-terminal domain is proximal 
to the O3 atom of the galactopyranosyl ring, and may 
form a salt-bridge or an H-bond with Argl44 or 
Trp 151, respectively. In the absence of ligand, 
Argl44 is displaced from Glu269 and forms a salt 
bridge with Glul26. Most residues involved in 
binding and translocation in the C-terminal domain 
also play a role in the affinity of LacY for sugar. 

The H'^-Binding Site 

LacY exhibits a remarkably high pA'a of 10.5 for sugar¬ 
binding affinity (Smirnova et al. 2009a). Therefore, at 
physiological pH, LacY is fully protonated, and galac¬ 
toside binds to protonated LacY only. The H'^-binding 
site is located in the C-terminal helix bundle, aligned 
parallel to the plane of the membrane at the similar 
level as the sugar-binding site (Figs. 3 and 4), and 
exposed to the water-filled cavity. Like the 
galactosidic sugar, the H"^ may bind and be released 
directly into the cavity during turnover. In any event, it 
is apparent that H"^ translocation through LacY does 
not involve a water-filled channel through the mole¬ 
cule. These structural features may also explain why 
LacY is able to catalyze lactose/H"^ symport in both 
directions across the membrane utilizing the same 
residues. Residues Tyr236 and Asp240 (helix VII), 
Glu269 (helix VIII), Arg302 (helix IX), and His322 



hf^-Lactose Membrane Transport Protein, LacY, Fig. 3 The 

cytoplasmic cavity viewed normal to the membrane with resi¬ 
dues important for galactoside and binding (PDB lpv7). The 
N- and C-terminal helices of LacY are colored green and sand, 
respectively, and labeled with Roman numerals. Important res¬ 
idues identified by biochemical and biophysical studies for the 
binding of sugar and/or are shown in cyan and magenta, 
respectively. Dotted lines show salt bridge/H-bond interactions. 
The lactose homologue thiodigalactoside is shown in green. The 
02-4 and 06 atoms on the galactopyranosyl ring are indicated 

and Glu325 (helix X) contribute to the H"^ binding and 
translocation (Kaback et al. 2001), and most are part of 
a charge-pair/H-bond network (Abramson et al. 2003; 
Mirza et al. 2006; Guan et al. 2007; Chaptal et al. 
201 1). Mutation of some of these residues also causes 
a marked decrease in sugar affinity (Smirnova et al. 
2009a). 

Mutagenesis studies demonstrate that Asp237 
(Helix VII) and Lys358 (helix XI), in the vicinity of 
the sugar- and H'^-binding sites, form a salt bridge 
that is not involved in sugar or H"^ binding but is 
important for insertion of LacY into the membrane 
and its stability (Kaback et al. 2001). 

Oligomeric State of LacY 

LacY is monomeric in the detergent dodecylmaltoside 
or in the solvent hexamethylphosphoms triamide, and 
the rotational diffusion coefficient determined from 
fluorescence anisotropy with eosinylmaleimide-labeled 
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H^-Lactose Membrane Transport Protein, LacY 



Periplasmic side 


Cytoplasmic side 


H^-Lactose Membrane Transport Protein, LacY, Fig. 4 The 

alternating access model. An inward-facing crystal structure 
(PDB, lpv7) and an outward-facing model of LacY are shown 
in dots colored as green or sand for the N- or C-terminal helix 


bundles, respectively. A sugar substrate is shown as black 
spheres', a red sphere indicates positions important for bind¬ 
ing. Black arrows indicate binding and release of galactoside and 
from cytoplasmic or periplasmic cavities as indicated 


permease indicates that the polypeptide reconstitutes as 
a monomer. Freeze-fracture electron microscopic stud¬ 
ies on proteoliposomes reconstituted with purified 
LacY demonstrate that the protein reconstitutes as 
a monomer and no change is observed in size or distri¬ 
bution of the particles when a A/(jj+ is imposed. Func¬ 
tionally, the initial rate of A/ifj+-driven lactose transport 
in proteoliposomes reconstituted at very low protein-to- 
lipid ratios varies linearly with the ratio of permease to 
phospholipid. An engineered fusion protein containing 
two LacY molecules covalently linked in tandem has 
high transport activity and does not exhibit negative 
dominance with either mutations or chemical modifica¬ 
tion of either half of the molecule. Finally, dimerization 
of single-Cys mutants in LacY in the plane of the mem¬ 
brane is a stochastic process. Thus, LacY is both strac- 
turally and functionally a monomer (reviewed in Guan 
and Kaback 2006). 

The Alternating Access Model 

As indicated, the sugar- and fL^-binding sites in LacY 
are located at the approximate middle of the molecule 
at the apex of a hydrophilic cavity and inaccessible 
from the periplasm. However, LacY is a highly 
dynamic molecule, and sugar/H"^ symport involves 
a global conformational change in which the cytoplas¬ 
mic cavity closes with the opening of a complemen¬ 
tary periplasmic cavity, thereby allowing alternating 
access of the sugar- and H'^-binding sites to either 


side of the membrane (Fig. 4). Experimental findings 
from site-directed alkylation of single-Cys replace¬ 
ment mutants, thiol cross-linking, single molecule 
fluorescence resonance energy transfer, double elec¬ 
tron-electron resonance spectroscopy, and Tip fluores¬ 
cence quenching and unquenching provide strong 
converging evidence that supports the alternating 
access mechanism (Smirnova et al. 2011). Like other 
MFS members, the internal pseudosymmetry within 
each helix bundle may provide the structural founda¬ 
tion for this mechanism. Thus, swapping the topology 
of two inverted 3-helix repeats in each 6-helix bundle 
may be the basis for the reciprocal formation of 
inward- and outward-facing conformers (Radestock 
and Forrest 2011). 

While all X-ray crystal structures obtained thus far 
reveal the same inward-facing conformation for LacY, 
results from double electron-electron resonance 
studies (Smirnova et al. 2007) and other findings 
reveal that LacY exhibits multiple conformers. In the 
absence of a sugar substrate, the inward-facing confor¬ 
mation dominates in the membrane and after 
solulization in detergent. Sugar binding increases the 
probability of forming intermediate and outward¬ 
facing conformations. 

The H'^-lactose Symport Mechanism 

Because the individual steps in the overall translocation 
cycle cannot be delineated by studying A/ijj+-driven 
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H^-Lactose Membrane 
Transport Protein, LacY, 
Fig. 5 The TT^-lactose 
symport mechanism. The 
cartoon representations for the 
N- and C-terminal domains of 
LacY molecule are colored as 
green and sand, respectively. 
Sugar and are shown as 
a black hexagonal and a red 
sphere, respectively. Sugar 
and hind in the middle of 
the molecule, and LacY moves 
around the binding sites to 
release the symported 
substrates on either side of the 
membrane as indicated by the 
arrows 



active transport, downhill lactose efflux, equilibrium 
exchange, and entrance counterflow (Fig. 1) are used 
to probe the mechanism in right-side-out (RSO) 
membrane vesicles (Kaczorowski and Kaback 1979). 
Efflux, exchange, and counterflow with wild-type LacY 
are explained by the simple kinetic scheme shown in 
Fig. 5. Lactose efflux down a sugar concentration 
gradient consists of eight steps (black arrows): 

1. Protonation in the inward-facing conformer 

2. Binding of a preloaded lactose on the cytoplasmic 
side of the membrane 

3. Formation of a loaded intermediate that is closed on 
both sides 

4. Opening of the periplasmic cavity 

5. Release of lactose 

6. Deprotonation 

7. Formation of an unloaded intermediate that is 
closed on both sides 

8. Opening of the cytoplasmic cavity 

Lactose influx down a sugar concentration gradient 
occurs in identical fashion as efflux; the reaction starts 
at step (6) and proceeds via the red arrows around the 
circle. For equilibrium exchange and counterflow, 
deprotonation does not occur, and only steps (1-5) 
are involved. The ordered binding and release of 
and galactoside are strongly supported by the behavior 
of mutants such as E325A that cannot catalyze any 


transport reactions that involve net translocation, 
but catalyze equilibrium exchange and counterflow as 
well or better than WT LacY (Carrasco et al. 1986). 

Certain enzyme reactions involve transfer as the 
rate-limiting step; as a result, these reactions may 
exhibit a solvent isotope effect when studied in deute¬ 
rium oxide (D 2 O). In brief, such reactions proceed 
slower in D 2 O because of differences in the 
zero-point stretch vibrations of bonds with protium 
relative to deuterium. With RSO membrane vesicles 
or proteoliposomes reconstituted with purified LacY, 
over threefold slowing of the rate of lactose influx or 
efflux from pH 5.5 to 7.5 is observed, with no effect on 
exchange, counterflow, A/ifj+-driven active transport 
or affinity for sugar (Viitanen et al. 1983). These and 
other observations indicate that reactions involved in 

transfer are rate determining when a lactose gradi¬ 
ent drives translocation. Conversely, transfer is 
not rate limiting for exchange or counterflow. Thus, in 
the absence of A/ijj+, the limiting step in the transport 
mechanism is deprotonation, step (6) during efflux and 
step (1) during influx. 

In the presence of A/ijj+ (interior negative and/or 
alkaline), LacY drives accumulation of lactose against 
a 50- to 100-fold concentration gradient (Robertson 
et al. 1980). Both A*L (interior negative) and ApH 
(interior alkaline) have quantitatively the same kinetic 
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effect on transport, a marked decrease in K^, as well as 
the same thermodynamic effect. However, in opposi¬ 
tion to the time-honored conjecture, the Aio of LacY for 
galactosides on either side of the membrane is similar 
and unchanged in the absence or presence of Apjj+ 
(Guan and Kaback 2004). In the membrane, an 
inward-facing conformation of LacY predominates 
(Smirnova et al. 2011). In the absence of an external 
galactoside, opening of protonated LacY on the 
periplasmic side occurs spontaneously, but with very 
low probability (low frequency). Thus, at any given 
time, only a small population of molecules in the 
membrane is open on the periplasmic side statistically. 
While the sugar-induced opening rate of periplasmic 
cavity is ^20 s~* (Smirnova et al. 2009b), the pre- 
steady-state conformation may not affect the steady- 
state transport rate. 

Active transport of lactose against a concentration 
gradient proceeds from step (6) (Fig. 5) via the red 
arrows in the same manner as lactose influx. Binding 
of galactoside from the periplasm (5) by protonated 
LacY (6) results in formation of a high-energy inter¬ 
mediate (4) in which sugar and are bound and both 
cavities are closed. In the presence of the inter¬ 
mediate with a bound sugar opens to the cytoplasm 
(3), sugar dissociates (2) followed by deprotonation 
(1) and the unloaded LacY returns to the outward¬ 
facing conformation (steps 8, 7) and protonates 
immediately. The turnover number for A/ijj+-driven 
lactose transport by LacY in RSO membrane vesicles 
or with the purified protein reconstituted into 
proteoliposomes is ^20 s“\ The reaction occurs 
without a change in Ko and is totally unaffected by 
D 2 O. Thus, when there is a driving force on the 
protonation or deprotonation is no longer rate limit¬ 
ing, and either dissociation of sugar or a conforma¬ 
tional change becomes the limiting step in the 
transport reaction. It is particularly noteworthy that 
a number of experimental findings indicate strongly 
that Apfj+ functions kinetically as a driving force 
on the with little or no direct effect on LacY 
(Guan and Kaback 2006). 

Summary 

LacY, a sugar/H"^ symporter that belongs to the major 
facilitator superfamily, is a particularly well-studied 


paradigm for exploring the molecular mechanism of 
solute/cation symport catalyzed by many secondary 
transport proteins. X-ray crystal structures show that 
LacY is organized into two pseudosymmetrical 6-helix 
bundles surrounding a cavernous cleft open to the 
cytoplasm only, and within each bundle, there are 
two repeats with an inverted topology. Positions 
responsible for binding of galactoside and are 
located in the approximate middle of the molecule 
halfway through the membrane. Structural symmetries 
allow LacY to open and close periplasmic and cyto¬ 
plasmic cavities reciprocally, thereby allowing alter¬ 
nating access of the binding sites to either side of the 
membrane and the coupled translocation of galactoside 
and in both directions. An electrochemical 
gradient (A/i].j+; interior negative and/or alkaline) 
dramatically decreases the for transport by 50- to 
100-fold with no effect on Ko from either side of 
the membrane. LacY is fully protonated under physi¬ 
ological condition, and the primary driving force for 
the global conformational change in the alternating- 
access mechanism is binding of sugar to protonated 
LacY. In the absence of A]lyi+, the limiting step for 
downhill lactose efflux and influx is transfer; in the 
presence of a driving force on the H"^, dissociation of 
sugar becomes limiting with a turnover number of 
-20 s“'. 
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► Membrane Transport, Energetics and Overview 
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Definition 

HADDOCK (High Ambiguity Driven biomolecular 
DOCKing) is an information-driven flexible docking 
approach for the modeling of biomolecular complexes 
(Dominguez et al. 2003). Docking is deflned as the 
modeling of the structure of a complex based on the 
known three-dimensional structures of its constituents. 
HADDOCK distinguishes itself from other docking 
methods by incorporating a wide variety of experimen¬ 
tal and/or bioinformatics data to drive the modeling 
(Melquiond and Bonvin 2010). This allows concen¬ 
trating the search to relevant portions of the interaction 
space using a more sophisticated treatment of confor¬ 
mational flexibility. 

Interface regions can be identified by mutagenesis, 

► H/D exchange, and chemical modifications (e.g., by 
cross-linkers or oxidative agents) detected by ► mass 
spectrometry, ► nuclear magnetic resonance, chemical 
shift perturbations, and cross-saturation transfer. When 
experimental data are unavailable or scarce, this infor¬ 
mation can be supplemented by bioinformatics predic¬ 
tions (de Vries and Bonvin 2008). These diverse 
information sources typically only identify or predict 
interfacial regions, but do not define the contacts 
across an interface. HADDOCK deals with this by 
implementing them as ambiguous interaction restraints 
(AIRs) that will force the interfaces to come together 
without imposing a particular orientation. 

HADDOCK can also incorporate classical NMR 
restraints such as distances from ► nuclear Overhauser 
effects and paramagnetic relaxation enhancement 
measurements, pseudocontact shift, dihedral angles, 

► residual dipolar coupling, and diffusion anisotropy 
restraints, the latter two providing valuable informa¬ 
tion about the relative orientation of the components in 
a complex. In addition, symmetry restraints can be 
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defined in the case of symmetrical homomeric sys¬ 
tems. Data derived from low-to-medium resolution 
techniques such as ► small angle X-ray scattering, 

► cryo-electron microscopy, and ion mobility mass 
spectrometry can provide information about the 
shape of a complex and be used to drive the docking 
process (Melquiond et al. 201 1). 

The HADDOCK protocol, which makes use of the 

► Crystallography and NMR System (CNS) package 
as computational engine, consists of three successive 
steps: (a) rigid-body energy minimization, (b) semi- 
flexible refinement in torsion angle space, and (c) final 
refinement in explicit solvent. By allowing for explicit 
flexibility during the molecular dynamics refinement, 
HADDOCK can account for small conformational 
changes occurring upon binding. Larger and more 
challenging conformational changes can be dealt with 
by starting the docking from ensembles of conforma¬ 
tions and/or treating the molecules as a collection 
of domains. The latter approach makes use of the 
unique multibody docking ability of HADDOCK, 
which can handle up to six separate domains or mole¬ 
cules at the same time. The selection of the final 
models is based on a weighted sum of electrostatics, 
desolvation, and van der Waals energy terms, along 
with the energetic contribution of the restraints used to 
drive the docking. 

HADDOCK has been extensively applied to a large 
variety of systems, including protein-protein, protein- 
nucleic acids, and protein-small molecule docking, and 
has shown a very strong performance in the blind 
critical assessment of the prediction of interactions 
(CAPRI). A considerable number of experimental 
structures of complexes calculated using HADDOCK 
have been deposited into the Protein Data Bank (PDB). 
HADDOCK is available as a web server (http:// 
haddock.chem.uu.nl/services/HADDOCK) (de Vries 
et al. 2010) offering a user-friendly interface to the 
structural biology community. 

Cross-References 

► CNS (Crystallography and NMR System) 

► Double-Mutant Cycle Analysis 

► Electron Microscopy 

► H/D Exchange 

► Ion Mobility Mass Spectrometry - Principles 

► Kinetics: Relaxation Methods 


► Lipid Lateral Diffusion 

► Mass Spectrometry 

► Molecular Dynamics Simulations of Lipids 

► NMR 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► Protein Complexes 

► Residual Dipolar Coupling 

► X-Ray Scattering of Lipid Membranes 
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HCN Channels: Biophysics and 
Functional Relevance 

Dario DiFrancesco and Mirko Baruscotti 
Laboratory of Molecular Physiology and 
Neurobiology, Department of Biosciences, University 
of Milan, Milano, Italy 

Synonyms 

“Funny” channels; f-channels; h-channels; Hyperpo- 
larization-activated channels; q (queer)-channels 

Definition 

Hyperpolarization-activated Cyclic Nucleotide-gated 
channels (HCN) are ion channels largely expressed in 
excitable tissues. In the heart, they generate the spon¬ 
taneous electrical oscillations of pacemaker cells, 
while in neurons they are involved in the control of 
cell excitability. 

Basic Characteristics 

Cardiac Pacemaking: Properties and Functions 
of the Pacemaker "Funny" (If) Current 

The heart of a healthy person beats on average 
2.5 billion times in a lifetime at a frequency ranging 
from ^40 bpm (beats per minute, sleep) to ^220 bpm 
(maximal exercise). 

This performance is generated by the activity of 
spontaneously beating (pacemaker) cells of the sino¬ 
atrial node (SAN). The ability to generate repetitive 


activity is evident from the electrical behavior of single 
SAN cells (Fig. 1, upper right). 

During the diastolic depolarization (pacemaker) 
phase, the membrane voltage slowly depolarizes from 
—60/—65 mV to —40 mV, until reaching the threshold 
for Ca^"^ current-driven initiation of a new action 
potential. This phase of the action potential is typical 
of pacing cells and lacks in working muscle. It is 
generated by the “funny” current, first described over 
30 years ago (reviewed by DiFrancesco 1993) and 
characterized in cardiac pacemaker cells (If) and neu¬ 
ronal cells (If,). 

If is a mixed Na’^/K'^ current activating on hyperpo¬ 
larization from a threshold of —40/—50 mV and a 
reversal potential of —10/—20 mV; its activation 
rate increases rapidly at more negative voltages 
(DiFrancesco 1981). These properties are well designed 
to initiate the process of diastolic depolarization which 
explains why If is designated as “pacemaker” current. 

As well as acting as a primary pacemaker generator. 
If also represents a physiologically relevant target for 
different homeostatic mechanisms that fine-tune the 
heart rate in accordance with the metabolic require¬ 
ment of the organism. Typically, If contributes to auto¬ 
nomic regulation of heart rate; as illustrated in Fig. 1. 
cAMP is synthetized by adenylate-cyclase, and its 
levels are finely regulated by a balance between sym¬ 
pathetic and parasympathetic stimuli. Noradrenaline 
increases cAMP, which binds directly to f-channels 
(DiFrancesco and Tortora 1991) and causes a right- 
ward shift of the activation curve, thus lowering the 
voltage hyperpolarization required for channel open¬ 
ing; this in turn increases inward current during dias¬ 
tole and accelerates pacemaking. Acetylcholine has 
the opposite effect (DiFrancesco 1993). If is also mod¬ 
ulated by several hormones and by the interaction with 
auxiliary or structural proteins, and all these mecha¬ 
nisms act synergistically to fine-tune the current acti¬ 
vation range and kinetics and, thus, set the amount of 
current that can be recruited during cell activity. 

Properties and Functions of the Neuronal 
Hyperpolarization-Activated (If,) Current 

If, currents similar to If have been described in several 
areas of the brain and in peripheral ganglia where 
they play important functions in cellular activity. In 
the brain. If, has been identified almost ubiquitously 
(neocortex, hippocampus, basal ganglia, thalamus, 
brainstem, hypothalamus, retinal cells, olfactory bulb. 
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HCN Channels: Biophysics 
and Functional Relevance, 
Fig. 1 Properties of funny 
channels and role in the 
generation of pacemaker 
activity and autonomic 
modulation of cardiac rate 
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cerebellum), while in the peripheral nervous system it 
has been investigated mostly in the dorsal root 
ganglion (DRG). 

Ih contributes to electrical activity by regulating 
resting voltage level and input resistance, thus setting 
cell’s excitability and firing rate, and by counteracting 
excitatory or inhibitory stimuli. 

As well as by voltage, Ij, is modulated by intracel¬ 
lular messengers and factors and is sensitive to external 
stimuli like pH. In sensory neurons, these properties 
help to mediate perception and transmission of exter¬ 
nal stimuli; Ij, is expressed in the retina; taste buds; 
thermal, olfactory, and stretch receptors; inner hair 
cells of the cochlea; and nociceptors (Robinson and 
Siegelbaum 2003; Biel et al. 2009). 

Ih affects not only the activity of individual cells, 
but also synaptic communication between neurons and 
signal processing. Ih expression in dendritic terminals 
of hippocampal CAl neurons increases with the dis¬ 
tance from the soma. This is thought to underlie the 
greater attenuation of temporal summation of distal 
versus proximal EPSP signals, which normalizes the 
process of signal integration over the entire length of 
the dendritic tree (Magee 1999). Ih plays a role in 
generating and/or controlling neuronal automaticity 
in several brain areas including the thalamocortical 
relay neurons, the cerebellar Purkinje cells, and neu¬ 
rons of entorhinal cortex (Robinson and Siegelbaum 


2003). It also exerts important physiological functions 
in cerebellar Purkinje cells, where it contributes to 
motor learning, and in prefrontal cortical neurons, 
where it contributes to working memory (Biel et al. 
2009). 

HCN Channels: Molecular Determinants of If/lh 
Currents 

Native f-channels are tetramers coded by subunits of 
the Hyperpolarization-activated Cyclic Nucleotide¬ 
gated (HCN) channel family. In mammals, this family 
includes four isoforms (HCNl^) which are variously 
distributed among excitable tissues. HCN proteins are 
formed by six transmembrane domains (S1-S6) and 
intracellular N- and C- termini. The C-termini include 
two functionally relevant regions: the C-linker, a 
membrane proximal region of about 80 amino acids, 
and the Cyclic Nucleotide Binding Domain (CNBD) 
hosting the binding site for the modulatory second 
messenger cAMP. 

The trasmembrane S1-S6 core contains structural 
elements controlling the permeability, the intrinsic 
voltage-dependence, and the pharmacological sensi¬ 
tivity of HCN channels. The S4 segment (rich in 
basic aminoacidic residues) is the voltage sensor ele¬ 
ment. This segment transfers its voltage-induced 
movements to the S4-S5 linker which in turn induces 
the channel gating by increasing/decreasing its 
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electrostatic interaction with the spatially adjacent 
C-linker. The S4-S5 linker and the C-linker are thought 
to contribute to the physical gate whose variable spa¬ 
tial configurations correspond to open/closed pore 
states. The channel permeability filter is located within 
the pore (P) region joining the S5 and S6 segments, and 
is partly determined by the presence in the pore 
sequence of a GYG triplet common to members of 
the superfamily of voltage-dependent and CNG 
channels to which HCN channels belong. The GYG 
motif is a known “signature” of K'^-permeable chan¬ 
nels, and it is curious that HCN pores are also perme¬ 
able to Na"^ ions (DiFrancesco 1981). 

The different HCN isoforms have different kinetic 
and modulatory properties. Typically, HCN4 have the 
slowest and HCNl channels the fastest activation 
kinetics, while HCN2 and HCN3 channels have inter¬ 
mediate rates of activation; the cAMP-induced shift of 
the activation curve is large (up to 15-20 mV) for 
HCN2 and HCN4, and small for HCNl (a few mV at 
most), while HCN3 does not appear to be cAMP sen¬ 
sitive. Values of kinetic and modulatory parameters 
can be found elsewhere in the literature (Baruscotti 
et al. 2005). 

Experimental evidence indicates that functional 
channels can result from both homotetrameric and 
heterotetrameric assembly. In the presence of different 
kinetics and cyclic-nucleotide sensitivity of individual 
isoforms, this feature allows a wide range of properties 
useful for a variety of cellular functions. 

Factors Modulating the Activity of HCN Channels 
Native If/Ih currents have variable properties in differ¬ 
ent tissues, and specific kinetic/modulatory features 
cannot be fully reproduced by heterologous expression 
of homomeric and/or heteromeric assembly of HCN 
isoforms. This has led to the idea that HCN channels 
are part of macromolecular complexes and that auxil¬ 
iary elements likely affect HCN function; many mod¬ 
ulatory factors have indeed been identified (Table 1). 
This ensures large modulating capability and adapta¬ 
tion to different physiological conditions. 

Clinical and Pharmacological Relevance 
of HCN Channels 

The search for substances that are able to selectively 
modify heart rate without side effects has long been an 
important pharmacological target. This search has 
identified drugs such as alinidine, zatebradine. 


cilobradine, ZD-7288, and ivabradine (“specific heart 
rate-reducing agents”), which slow heart rate by spe¬ 
cific block of the If current. Ivabradine is presently the 
only commercially available If inhibitor prescribed for 
the therapy of angina. Ivabradine enters the channels 
from the intracellular side and binds to a site located 
within the inner hydrophilic vestibule of the channel 
(Fig. 2). Block has the peculiar property of being 
current dependent (Bucchi et al. 2002). 

Interest for development of drugs interacting with 
specific HCN isoforms has increased recently also in 
view of growing evidence for the contribution of dys¬ 
functional HCN channels to diseases such as epilepsy, 
inflammation, and neuropathic pain (Dibbens et al. 
2010; DiFrancesco et al. 2011; Emery et al. 2011). 

Genetic investigation of cardiac rhythm disorders 
has so far identified six loss-of-function mutations of 
the HCN4 protein potentially associated with various 
forms of arrhythmias (Baruscotti et al. 2010; 
Laish-Farkash et al. 2010; Schweizer et al. 2010). 

Figure 3 illustrates the effects of the mutation 
S672R (a) in the HCN4 channel found in an Italian 
family with asymptomatic bradycardia. The pedigree 
(b) and the bar graph (c) show that mutated (filled 
symbols) and wild-type individuals (open symbols) 
had all rates lower and higher, respectively, than 
60 bpm, indicating high degree of co-segregation. 
Functional studies indicated that mutant channels are 
activated at more negative voltages than wild-type 
channels; these change slow the heart rate by decreas¬ 
ing the If current during diastole, and mimic those of 
mild vagal stimulation (Milanesi et al. 2006). 

The pathological consequences of the removal of If 
have been recently also investigated in vivo in a HCN4 
knockout mouse model, and data indicate that removal 
of If leads to extreme bradycardia and to complete block 
of the conduction through the AV node (Baruscotti 
et al. 2011). Knockout mouse models (of HCNl or 
HCN2) have also been critically relevant in suggesting 
a causative role for these isoforms in neuronal diseases 
such as ataxia, epilepsy, and impaired learning. 

More recent work has identified a mutation in the 
HCN2 channel associated with recessive generalized 
epilepsy due to a loss of function mechanism 
(DiFrancesco et al. 2011). Expression of homomeric 
mutant channels generates currents characterized by 
a large negative shift of the activation curve which 
essentially moves the voltage range for current activa¬ 
tion outside the physiological range. This change 
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HCN Channels: Biophysics and Functional Relevance, Table 1 Factors affecting properties and functions of HCN channels 


Mode of action 



Substance 

Conductance 

Kinetics/others 

Target 

Proteins 

MiRPl 

T 

Altered kinetics 

All isoforms 

Caveolin 3 

T 

Localization 

HCN4 



P-AR modulation 

HCN4 

Filamin A 


Slower kinetics 

HCNl 

TRIPSb 

T/i 

Slower kinetics 

HCNl 



Splice-dependent modulation 

HCNl, HCN2 



Impairment of cAMP modulation 

HCN2, HCN4 

Tamalin/Mint2 


Formation of multi-protein complex 

HCN2 



Regulation of expression in dendrites 

HCN2 

KCRl 

i 


HCN2 

Enzymes 

Src kinases 

T 

Faster kinetics 

HCN2, HCN4 

cGMP-dep PKII 


Slower kinetics 

HCN2 

MAP kinase 


Shifts right the AC curve 

Hippocampal HCNl 

Phosphatases calcineurin 


Shifts left the AC curve 

Hippocampal HCN 1 

Acidic lipids 

PI(4,5)P2 


Shifts right the AC curve 

All isoforms 

Arachidonic acid/phosphatidic acid 


Shifts right the AC curve 

HCN2 

pH 


Shifts left the AC curve 

HCN2 



Shifts right the AC curve 

HCN1,HCN4 


HCN Channels: Biophysics 
and Functional Relevance, 
Fig. 2 Funny channel 
inhibition caused by 
ivabradine-induced block 
causes slowing of heart rate, 
(a) Slowing of spontaneous 
rate in a SAN cells by 
ivabradine; (b) progressive 
inhibition of If induced by the 
same dose of ivabradine in 
a single SAN cell; (c) cartoon 
showing approximate 
positioning of an ivabradine 
molecule within the water- 
filled cavity underneath the 
pore of an HCN4 channel 
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HCN Channels: Biophysics and Functional Relevance, 
Fig. 3 S672R mutation of the HCN4 channel associated with 
bradycardia, (a) Cartoon of one subunit of HCN4 showing the 
approximate position of the mutation in the CNBD; (b) pedigree 


of a family with bradycardia (filled symbols: with mutation; open 
symbols: wild-type); (c) corresponding heart rates indicated as 
variations relative to 60 bpb (From Milanesi et al. 2006) 


occurs in homomeric mutant channels but not in 
heteromeric wild-type mutant constructs, which 
determines recessive inheritance and leads to strongly 
increased neuronal excitability, a condition predisposing 
to epileptogenesis. 

Cross-References 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 

► Gating of Potassium Channels by Cyclic Nucleotide 
Binding 
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Introduction 

Although DNA is most stable as a double-stranded 
helical structure, this must be unwound and the strands 
separated transiently for most DNA processing. 


The single-stranded intermediates are required for rep¬ 
lication (► Machinery of DNA Replication), repair, 
recombination, and DNA transfer during conjugation, 
and, in each of these processes, duplex DNA unwind¬ 
ing is catalyzed by ubiquitous enzymes known as 
helicases. These proteins cause the destabilization of 
the hydrogen bonds between the complementary base 
pairs and the stacking of adjacent bases as it trans¬ 
locates along the DNA. This separation is coupled 
to hydrolysis of nucleoside triphosphate, most fre¬ 
quently ATP. 

A number of enzymes have also been characterized 
that demonstrate RNA helicase activity (Bleichert and 
Baserga 2007; Jankowsky 2010). Structurally and 
functionally, RNA is a diverse molecule that is 
highly regulated. Helicases function in all aspects of 
RNA metabolism, such as transcription, editing, 
rearranging, and mRNA splicing. Specific helicases 
can also act on RNA-DNA structures, remove proteins 
from RNA without unwinding, and act as clamps. 
These roles include regulating transient associations 
of small noncoding RNAs and ribonucleoprotein 
complexes to target RNAs. This review mainly focuses 
on DNA helicases, but the mechanisms and methods 
described are also applicable to RNA helicases on 
the whole. 

Helicases are motor proteins and as such share some 
properties with other motor proteins, such as myosins 
and kinesins (Delagoutte and von Hippel 2002, 2003). 
They move along a linear track, driven by the 
hydrolysis of nucleoside triphosphate: this in turn 
means that some similar techniques are used to inves¬ 
tigate these varied types of linear motors. An important 
difference is that the distance between repeating units 
of nucleic acid (nucleotides) are much smaller than 
is the case for the protein track of actin or tubulin. 
However, some important properties of helicases are 
common to all motor proteins, including ideas of direc¬ 
tionality and processivity, defined as how far along the 
DNA, or RNA, a helicase moves on average before 
stalling and dissociating. The movement can generally 
be defined in terms of steps. The step size is the 
smallest distance of movement that can be detected, 
often by single-molecule techniques. The step size 
may be related to the coupling ratio, which is the 
distance moved per ATP hydrolyzed: often this is one 
or a few bases. However, in some cases, the kinetic 
step size does not correlate with movement during 
a distinct cycle of ATP hydrolysis. Another property 
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is the speed of movement along the DNA, which may 
vary with both nucleic acid structure and on whether 
the helicase is acting alone or as part of a larger com¬ 
plex with other proteins, as described later. 

Variety and Classification of Helicases 

Helicases were first identihed in 1976 and classified 
into families in 1993 (Gorbalenya and Koonin 1993). 
However, it appears that many putative helicases do 
not actually unwind nucleic acids, although they do 
harness the energy of ATP hydrolysis to translocate 
along the DNA. A more detailed discussion of 
what constitutes a helicase was given in the review 
by Singleton et al. 2007. The two largest superfamilies 
of helicases (SF) are typically monomeric or dimeric: 
SF-1 (containing some well studied DNA helicases, 
such as Per A, Rep, UvrD, RecB, RecD) and SF-2 
(containing DNA helicases RecQ, PriA, RecG, UvrB, 
and almost all RNA helicases) were initially classified 
on the basis of seven conserved regions of sequence 
homology. New motifs have been identihed, including 
TxGx, Q-motif, motif 4a, and TRG, which are specihe 
to each superfamily or subfamily. The superfamilies, 
SF-3 to SF-6, also contain hexameric or double 
hexameric helicases, which contain fewer conserved 
motifs. Despite the diversity, there are universal struc¬ 
tures across all of the superfamilies, such as the “core 
domains” that form the tandem RecA-like folds, either 
within the same polypeptide or between subunits. 
RecA is the prokaryotic DNA-strand exchange protein 
required for recombination. These core domains con¬ 
tain residues relating to the “A” and “B” motifs of 
the “Walker box,” a predictor of nucleoside-5'- 
triphosphate binding. 

A more recent and full classiheation was given by 
consideration of the sequence, as well as the oligo¬ 
meric state, translocation polarity, and DNA substrate 
(Singleton et al. 2007). With respect to the oligomeric 
state, helicases can be monomeric, dimeric, or 
hexameric. The SFl and SF2 members contain the 
RecA-like folds on a single polypeptide. Therefore, 
dimeric helicases from these superfamilies generally 
contain two functional motors, whereas hexameric 
helicases contain six individual RecA-like folds. 
Translocation by helicases can be dehned as 3'-5' or 
a 5'-3', with respect to the strand with the major 
interaction to the helicase motor elements. SFl, SF2, 


and SF6 contain examples of both forms of transloca¬ 
tion, whereas all characterized SF3 members are 3'-5' 
helicases. All members of SF4 and SF5 are 5'-3' 
helicases. Extra levels of complexity are given by 
accessory domains. These play roles in targeting 
motor activity to specihe substrates, such as forked 
junctions. Furthermore, novel enzymatic properties 
can be produced via the addition of complementary 
catalytic domains to helicases, allowing the creation of 
various nucleic acid processing machines. For 
instance, the cooperation of helicase and nuclease is 
common. Other domains or subunits may provide rec¬ 
ognition of particular sequences, such as occurs with 
RecBCD, where one subunit recognizes the short chi 
sequence, important for creating a DNA intermediate 
during recombination (Dixon and Kowalczykowski 
1991). Furthermore, helicases may form interactions 
with other enzymes, such as polymerases, that create 
a complex for unwinding and processing double- 
stranded DNA (dsDNA). 

Regulation 

Helicases are DNA modifying enzymes and as such, 
unregulated behavior is deleterious to the cell. One 
consequence of this is that defects in certain helicases 
can cause a variety of genetic disorders, as well as 
leading to a predisposition to cancers, through impair¬ 
ment of DNA repair pathways (van Brabant et al. 
2000). Helicases need to function at a defined time in 
precise places. Each helicase may require a specihe 
DNA structure as substrate and the formation of such 
structures may trigger helicase activity, for example, 
free ends or junctions appearing following DNA dam¬ 
age and breaks. Helicases may also have to switch 
between translocase and helicase activities. The evi¬ 
dence suggests that this is achieved by a change in 
oligomeric state (e.g., dimerization) and/or by interac¬ 
tion with accessory proteins. Several proteins have 
been shown to increase helicases activity. An example 
of this is PerA, whose helicase activity is greatly 
enhanced by the replication initiation protein, RepD, 
which also enables the helicase to be localized at the 
correct position to take part in asymmetric plasmid 
replication. The helicase can then unwind complete 
plasmids of several thousand bases pairs, although 
a single PerA on its own is unable to unwind more 
than a few base pairs. ► DNA polymerases have also 
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been shown to activate helicase activity, and many 
helicases will only work processively as part of larger 
complexes (Donmez and Patel 2006). Figure 1 shows 
some of the variety of biological roles of helicases and 
different DNA structures. 

Mechanism 

Flelicase mechanisms can be characterized using 
a variety of bulk solution and single-molecule 
approaches, some of which are outlined below but 
described in more detail in reviews: (Donmez and 
Patel 2006; Lohman et al. 2008; Pyle 2008). These 
methods have shown directionality, real-time translo¬ 
cation and unwinding rates, along with corresponding 
ATPase rates for a variety of nucleic acid substrates. 
Such details lead to an understanding of the dynamics 
of helicase activity and its coupling to ATP usage, and 
the mechanism by which ATP is hydrolyzed. 

There are several ways of considering the mecha¬ 
nism hy which helicases might separate strands and 
translocate. One way is to consider how changes in the 
interaction of a protein with DNA could, either 
actively or passively, act on DNA (Fig. 2). A passive 
mechanism would have an opportunistic protein 
waiting at the junction between single-stranded DNA 
(ssDNA) and dsDNA for a transient opening of the 
base pairs. When the opening occurs, the helicase 
advances forward, blocking the reformation of the 
base pairs. At the other extreme, the ATP-driven trans¬ 
location of the helicase forces its way into the junction 
and actively destabilizes the double-strand interac¬ 
tions. For many helicases, there remains uncertainty 
as to what extent these extremes of mechanisms con¬ 
tribute to the unwinding and translocation of DNA. 

This active mechanism sits well with the observa¬ 
tion of wedge domains in some helicase structures that 
sit at the ssDNA/dsDNA junction and point into the 
duplex with one strand of ssDNA lying either side of 
the wedge (Singleton et al. 2007). Detailed information 
supporting an active mechanism came from the crystal 
structures of Bacillus steawthermophilus PcrA in 
complex with a partial duplex DNA molecule in two 
nucleotide states (Dillingham et al. 2001; Singleton 
et al. 2007). The structures (Fig. 3) revealed a protein 
divided into four subdomains, of which two, lA and 
2A, are homologous to RecA. These are the core motor 
domains containing the ATP binding site at their 


interface. The ssDNA-binding site can be seen to 
run along the top of the motor domains in a groove 
which can accommodate ssDNA. By comparison with 
the apo structure, a transition of domain 2B upon 
binding to the partial duplex DNA is observed. The 
movement makes a new interface for the duplex 
DNA and may suggest a regulatory function for 
this domain. 

These structures suggested a translocation mecha¬ 
nism, in which domains lA and 2A were in different 
conformations depending on whether ATP or ADP was 
bound (Singleton et al. 2007). The substrate and prod¬ 
uct structures also showed differences in the contacts 
made between the protein and ssDNA. The transloca¬ 
tion reaction cycle proceeds through a mechanism 
similar to an inchworm. The core subdomains, lA 
and 2A, move relative to one another on ATP binding 
and hydrolysis. The two domains bind ssDNA bases in 
pockets that open and close during this conformational 
change. The binding site is composed of a series of 
aromatic amino acid residues that form pockets for the 
bases of the ssDNA. These residues make alternating 
contacts with individual bases. Initially, the ssDNA is 
bound tightly by domain lA. ATP binding causes the 
pocket in domain 2A to bind the ssDNA, while 
the pockets in domain lA release ssDNA. At the 
same time, the two domains move closer together. 
Upon ATP hydrolysis, the pockets in lA bind ssDNA 
and the pockets in 2A release ssDNA. The domains 
also move apart, causing the ssDNA to be pulled along 
the DNA binding channel relative to domain 2A, 
translocating the DNA. This resets the conformation 
for another cycle. 

Translocation does not require that the dsDNA- 
binding site is hlled. However, if the ssDNA, along 
which the enzyme is translocating, is attached to 
a duplex region ahead of the enzyme, the translocation 
process will remove the duplex, in essence stripping 
the strands apart, and the one strand is pulled across the 
protein as described above (Marians 2000). In one 
sense, DNA unwinding is a biproduct of ssDNA 
translocation. 

Many structures of helicase have now been described, 
including those of hexameric proteins (Donmez 
and Patel 2006; Singleton et al. 2007) and, particularly 
by having structures with different DNA and/or nucleo¬ 
tide present, these have often led to equal insight into the 
mechanism of action that has been related to solution 
studies and single-molecule measurements. 
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Helicases, Fig. 1 Cartoon of some the biological roles of DNA 
helicases and variety of DNA structures. Helicases are shown in 
blue and other proteins in red. (a) Helicase and polymerases, 
involved in bacterial DNA replication. Single-stranded DNA- 
binding protein (SSB) is in green. The arrows refer to the 
helicase movement. Parental DNA is colored black and daughter 
DNA is grey, (b) RecBCD helicase nuclease, cutting DNA as it 
translocates, (c) RecG involved in moving a 4-way junction 


Translocation and Unwinding 
Measurements in Bulk Solution 

One of the most important characteristics of a helicase 
is the unwinding activity, and there is a wide variety of 
rates observed from <1 base s~* to >1,000 bases s“\ 
Similarly, there is a wide range of processivities, from 
helicases that apparently unwind only a few bases to 
ones that translocate many thousands of bases without 
dissociation. The speed and processivity are likely to 
be optimized for their roles in the cell. For example, 
a replicative helicase will have to function rapidly and 
processively. A slow speed may delay cell division, 
while low processivity will lead to replication fork 
stalling or collapse. On the other hand, a repair helicase 
may only have to unwind a hundred base pairs at most, 
so a high processivity is not required. 


during repair of single-strand damage. Arrows indicate the 
pulling of DNA strands as the nascent DNA is unwound 
from the parental strands, (d) PcrA helicase, with polymerase 
and initiator protein RepD, perform asymmetric plasmid 
replication. A potential intermediate is shown, whereby one 
parent strand is being replicated, while the other remains as 
a single-stranded loop, awaiting interaction with the second 
polymerase 


Early methodology to follow unwinding by 
helicases focused on gel assays, which have the advan¬ 
tage of allowing observation and identihcation of the 
DNA product. However, such assays have the disad¬ 
vantage of a limited time resolution and being discon¬ 
tinuous, that is, each gel lane represents a single 
snapshot in time. However, a variety of fluorescence- 
based methods (► Fluorescence) are now available for 
these and other measurements of activity (Toseland 
and Webb 2010; Webb 2007). Fluorescence measure¬ 
ments can report on processes in real time due to 
the sensitive and rapid response of the fluorescence 
probe. When combined with a stopped-flow fluorime- 
ter (► Stopped-Flow Techniques), assays can be 
performed with millisecond time resolution. One type 
of measurement is sometimes described as “all or 
nothing,” in which a fluorophore is located at one 
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Helicases, Fig. 2 Cartoon representation of active and passive 
mechanisms of DNA unwinding, (a) The active mechanism 
shows a wedge domain at the junction, which aids the disruption 
of the dsDNA, as indicated by a slight separation of the DNA 
strands immediately in front of the wedge. In practice, there may 
be significant protein interactions with dsDNA, as shown in the 
structure in Fig. 3a. (b) In a passive mechanism, there is transient 
separation of strands ahead of the helicase, which then 
translocates into the space vacated and so prevents the DNA 
reannealing 

point on the DNA (►Fluorescence Labeling of 
Nucleic Acids): there is little or no change in 
signal while the translocation approaches the label, 
but the fluorescence changes when the helicase 
reaches that point. The label could be a single 
fluorophore, a fluorophore-quencher pair, or a FRET 


(Forster Resonance Energy Transfer) pair 
(► Fluorescence and FRET in Membranes). One of 
the simplest methods of this type to measure unwind¬ 
ing or translocation over short lengths of DNA is to use 
oligonucleotides, labeled with a fluorophore(s) at the 
far end of the strands. Assays of this type are likely to 
be more successful with DNA that is shorter than -100 
bases. If longer than that, the helicase translocation 
becomes desynchronized as a consequence of the large 
number of steps: the helicase molecules become distrib¬ 
uted over a wide length of DNA during the translocation. 

Alternatively, assays are available using fluorescent 
probes that differentiate between ssDNA and dsDNA, 
and these give a continuous change in fluorescence as 
the unwinding proceeds. There are a variety of fluores¬ 
cent dyes that bind to dsDNA with enhanced fluores¬ 
cence. They are released on helicase unwinding, so 
giving a signal change, allowing the extent of unwind¬ 
ing to be monitored, although the presence of the dye 
bound to the dsDNA may interfere with helicase activ¬ 
ity. A biosensor for ssDNA has been developed based 
on a single-stranded DNA-binding protein (SSB), 
which binds to -70 bases of ssDNA, although this is 
dependent on conditions (Webb 2010). The biosensor 
increases in fluorescence when bound to ssDNA, pro¬ 
viding a real-time signal during unwinding (Fig. 4). 
SSB has been shown to interact with a variety of other 
DNA-binding proteins, including helicases, so that 
binding and/or rearrangement of SSB on the ssDNA 
product can potentially modulate the helicase activity. 
This SSB-based biosensor has been successfully used 
to measure helicase-catalyzed unwinding of long lin¬ 
ear fragments and plasmids, and some of the factors 
that allow it to be used under a variety of conditions 
have been discussed (Webb 2010). 

Measurements of ATPase Activity 

An assessment of ATPase activity is part of a full 
characterization of a helicase and an understanding of 
the dynamics of the system. An immediate question is 
to what extent is ATP hydrolysis coupled to the trans¬ 
location: is there a fixed coupling ratio with unidirec¬ 
tional motion or is there a ratio that depends on 
conditions? The extent of coupling may differ with 
helicase and conditions, although for many it seems 
quite fixed with some helicases moving one base per 
ATP, others moving several. 
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Helicases, Fig. 3 Structure 
of a monomeric helicase, 
PcrA. (a) Crystal structure of 
the apoprotein and change in 
structure due to binding DNA 
and an ATP analogue 
(Singleton et al. 2007). 

(b) Cartoon representation of 
domain movements, (c) Bar 
representation of domain 
structure with color code as in 
panel A 
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Generally, ATP utilization during unwinding is 
measured using stopped-flow fluorimetry as the rate 
of hydrolysis is likely to preclude measurements fol¬ 
lowing manual mixing. This addresses the relationship 
between ATP hydrolysis and DNA unwinding, 
answering questions such as how many bases are 
unwound in a single ATP cycle? It is possible to 
measure ATP hydrolysis during DNA unwinding 
using the oligonucleotides and plasmid DNA sub¬ 
strates described above, measuring the products of 
ATP hydrolysis in real time. One way that achieves 
this is use of the fluorescent reagentless biosensor, 
MDCC-PBP (Webb 2007). This has the high sensitiv¬ 
ity and time resolution to measure the inorganic 
phosphate as it is released by the helicase and has 
been used in varied transient and steady-state kinetic 
measurements of ATPases and GTPases, such as 
myosins, kinesins, dyneins, small G proteins, as well 
as helicases. 


The ATPase rate, measured in this way (i.e., during 
unwinding), can then be compared to the unwinding 
rate. A coupling ratio is then established to determine 
the number of DNA bases unwound per ATP. This is 
an important parameter to determine how the helicase 
translocates the DNA strand. 

A more detailed understanding of the mechano- 
chemical coupling between ATP hydrolysis to DNA 
unwinding and the relationship between that coupling 
and protein structural changes requires information 
about the individual steps of the ATPase cycle. This 
will aid in the determination of which intermediates 
are most prevalent, and which steps are likely to be 
closely associated with the structural transitions. These 
methods, typically using stopped-flow fluorescence, 
are well established in the motor protein field, and 
fluorescent ATP analogues, such as mantATP, have 
been widely used (Jameson and Eccleston 1997). The 
fluorescence probe leads to signal changes upon 
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Helicases, Fig. 4 Example of continuous fluorescence data for 
time-resolved measurement of unwinding: DNA unwinding by 
AddAB helicase. The measurements use a fluorescent SSB, 
which binds to the ssDNA as it forms (Fili et al. 2010). 

(a) Bulk solution assay, using a coumarin-SSB. A linear 
biotinylated 1.5 kb DNA fragment was incubated with 
strepavidin to give unidirectional unwinding. A nuclease- 
inactive mutant of AddAB was used in these measurements. 

(b) The single-molecule equivalent measurement, using 
Cy3B-SSB. The biotinylated substrate was immobilized on an 
inert surface. The length of the ssDNA is limited to ~2 kB by the 
extent of the evanescent wave 

binding and possibly during conformation changes 
during the ATP cycle. Helicases typically bind nucle¬ 
oside triphosphates with the base and phosphate 
groups buried within the active site. Therefore, 
fluorophore attachment on the ribose is considerably 
more benign than modification of the base or triphos¬ 
phate moieties. A wide variety of adenine nucleotides 
with ribose-attached fluorophores have been devel¬ 
oped, enabling there to be some degree of optimization 
in terms of fluorophore and length of linker. Helicases 
may contain tryptophan residues which give a fluores¬ 
cence signal upon a non-fluorescent ATP binding, 
thereby preventing the need for non-natural nucleo¬ 
tides. Furthermore, by use of phosphate and/or ADP 
biosensors (Webb 2007), it is possible to measure the 
kinetics of product release under the same conditions 


as fluorescent ATP binding. In this way, it is possible 
to measure and compare the different steps of an 
ATP cycle. Quenched-flow gives similar time resolu¬ 
tion to stopped-flow and can be used to measure 
the cleavage of triphosphate to diphosphate, albeit 
discontinuously. 

Single-Molecule Methods 

Single-molecule methods have been applied widely to 
motor proteins as they allow real-time observation of 
movement and mechanics, as well as some of the 
biochemical processes (Ha and Lohman 2011; Rasnik 
et al. 2006). Single-molecule measurements using fluo¬ 
rescence, particularly by Total Internal Reflectance 
Fluorescence Microscopy ► (Total Internal Reflection 
Fluorescence Microscopy) (TIRFM), and force tech¬ 
niques, predominantly optical or magnetic tweezers 
(► Magnetic Tweezers; ► Optical Tweezers), have 
been applied widely to the study of helicases. 

Advantages of the single-molecule approach 
include the fact that observations of each complex 
may circumvent inhomogeneity of the substrates. 
With helicases, the DNA structures may be not be 
uniform (for example degree of supercoiling) and this 
would not be apparent in bulk solution studies. 
Furthermore, helicases unwind DNA with, perhaps, 
hundreds of discrete biochemical steps. This, in 
effect, desynchronizes the reaction of individual 
molecules in bulk solution. The observation of indi¬ 
vidual unwinding events overcomes this. For example, 
RecBCD exhibits a pausing behavior after reaching 
a chi sequence (Spies et al. 2003). This activity 
can only be observed at the single-molecule level 
because there is a large heterogeneity in the unwinding 
rates of the helicases population. Such asynchronous 
behavior means that pauses may not be detected in 
bulk studies because the events are averaged out 
over time. 

Disadvantages of the single-molecule approach, 
apart from the requirement for highly sensitive fluo¬ 
rescent probes, include issues related to attachment of 
the active complex to a surface and the potential effect 
of that on activity. It is important to assess if “novel” 
properties result from artifacts in the single-molecule 
assay due to fluorescent labeling, forces, or attach¬ 
ment. Particularly with TIRFM, the single-molecule 
data can be compared to the ensemble population 
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solution measurements. This may be relatively easy 
with fluorescent assays, but it is harder to compare 
bulk and single-molecule force measurements. 

Single-molecule fluorescence measurements can be 
applied to helicase unwinding and translocation activ¬ 
ity, and conformation changes (Ha and Lohman 2011; 
Rasnik et al. 2006). TIRFM in combination with an 
appropriate fluorophore label (or labels) provides 
a way to observe single events in real time. The surface 
attachment increases the signal-to-noise ratio as the 
evanescent wave of this technique excites fluorophores 
only very close to the surface. 

Several types of assays are available, which are 
suitable to measure unwinding of different lengths 
of DNA substrate, and, to some extent, these are 
related to the equivalent assays outlined for bulk mea¬ 
surements. Long, linear DNA substrates (>10,000 bp) 
can be measured using microfluidic techniques 
(► Microfluidics for Single Molecule Detection). One 
end of the DNA is attached to a surface or a bead held 
in an optical trap. Hydrodynamic drag (“flow”) 
stretches the DNA fragments, which can be visualized 
by an intercalating dye, such as YOYO. The dye is 
displaced as the helicases unwinds the DNA. 

Shorter linear DNA substrates (<3,000 bp) can be 
measured using a Cy3b labeled version of the ssDNA 
biosensor, described above (Fig. 4). This assay has 
a resolution of ~70 bp, the binding site of the biosensor. 
Multiple unwinding events can be visualized as spots 
of increasing intensity, as the biosensor binds the 
ssDNA. This assay can be performed with DNA, or 
helicases, immobilized on a surface. Importantly, the 
data can be readily compared to bulk measurements 
using the same type of biosensor. 

Single-molecule FRET ► (Single-Molecule Fluo¬ 
rescence Resonance Energy Transfer) (smERET) 
assays can typically be used with substrates up to 
^100 bp, but this approach has the highest potential 
resolution of <10 bp and potentially single-base. 
smERET assays can be used with ERET pairs between 
the DNA strands, or the protein and DNA. FRET pairs 
between the protein and DNA are particularly useful as 
it gives the possibility to measure conformation 
changes. Attaching fluorophores at several locations 
on the protein and DNA allows their precise positions 
to be triangulated and, therefore, determine domain 
rearrangements. 

Eorce measurements, optical trapping, and mag¬ 
netic tweezers are used to assay DNA unwinding. 


These experiments are important to determine the 
effect of topology and force upon helicase activity. 
With magnetic tweezers, DNA is tethered between 
a surface and a paramagnetic bead. During an unwind¬ 
ing assay, the bead is held under a constant force, so as 
the duplex is unwound, the DNA lengthens. The height 
of the bead is monitored using bright-fleld microscopy 
at a resolution of 10 bp. Optical tweezers provide the 
highest resolution, but each molecule is observed one 
at a time, increasing the time to acquire data from 
a significant number of complexes. Various assays 
can be performed, which include, immobilizing a 
helicase to a surface with DNA attached to a bead. 
The free end of the DNA can bind to the helicase. 
While the bead is held in the laser trap, it is 
possible to measure the force of the helicase unwinding 
the DNA. 

Summary 

Helicases are motor proteins that mostly separate dou¬ 
ble-stranded DNA, or RNA, while translocating along 
the duplex. This uses the energy of nucleoside triphos¬ 
phate hydrolysis (generally ATP). They are involved 
in most processing of DNA, including replication, 
recombination, and repair, and of RNA, including 
editing, rearranging, and splicing. Helicases are 
investigated using a variety of bulk solution and 
single-molecule techniques. These studies have 
reveals key aspects of the helicase functions and mech¬ 
anisms. Processivity of helicases varies between a few 
bases to many thousands of bases, and speeds vary 
from <1 base s“^ and >1,000 bases s“\ Structurally, 
there is also a wide variety from monomeric proteins to 
hexameric, although helicases often act as part of 
larger complexes that include other enzyme function. 

Cross-References 

► ATPase: Overview 

► DNA Polymerase 
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► Eluorescence and FRET in Membranes 

► Eluorescence Labeling of Nucleic Acids 

► Machinery of DNA Replication 

► Magnetic Tweezers 

► Microfluidics for Single Molecule Detection 
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► Optical Tweezers 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Stopped-Flow Techniques 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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Synonyms 

Haemagglutinin 


Definition 

A viral glycoprotein responsible for binding the virus 
to the cell that is being infected. 

Basic Characteristics 

When a virus enters a host, it needs to hind to host cells 
and subsequently induce fusion of its membrane with 
the host cell membrane. The hemagglutin protein on 
the viral envelope plays a crucial role in this mecha¬ 
nism: it binds to sialic acid analogues on the host cell, 
inducing endosomal uptake of the virus particle. Many 
computational studies, typically using ► Molecular 
Dynamics Simulations, have been used to investigate 
the recognition between different viral hemagglutinins 
and host cell glycans (Kasson et al. 2009; Newhouse 
et al. 2009). After binding to the host cell, the virus 





Heme Dioxygenases - Computational Studies 


959 


needs ensure that its contents, most crucially its 
genome, enter the host cell cytoplasm. Hemagglutinin 
plays a crucial role in this process by changing 
conformation and exposing a previously hidden hydro- 
phobic portion of its peptide chain, the so-called fusion 
peptide. This peptide inserts itself into the endosomal 
membrane and eventually causes the viral and 
endosomal membranes to fuse together. This in turn 
allows the virus contents to enter the cytoplasm. 

► Molecular Dynamics Simulations, including 
coarse-grained approaches, have been applied to inves¬ 
tigate the interaction between “fusion peptides” and 
membranes (Lague et al. 2005; Kasson et al. 2006; 
Fuhrmans et al. 2009). 

When the virus is replicated in the host cell, hem¬ 
agglutinin is initially produced as an inactive protein 
precursor (HAO) that is subsequently cleaved into two 
active parts, HAl and HA2, which mediate virus to 
host cell binding and membrane fusion, respectively. 
How this proteolytic activation is catalyzed by host 
proteases is not yet clear. Rungrotmongkol et al. there¬ 
fore used ► QM/MM Methods to study the reaction 
mechanism of the hrst step of the acylation process 
by the host protease furin, with the cleavage site 
of the avian influenza virus subtype H5N1 hemagglu¬ 
tinin as its substrate (Rungrotmongkol et al. 2009). 
B3LYP/6-31-HG* corrected PM3/CHARMM22 poten¬ 
tial energy surfaces were calculated and analyzed in 
detail, indicating the specific roles of residues in catal¬ 
ysis of this reaction. 

Cross-References 

► Lectins 

► Molecular Dynamics Simulations of Lipids 

► QM/MM Methods 
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Synonyms 

Indoleamine 2,3-dioxygenase 1 (IDO); Tryptophan 
2,3-dioxygenase (TDO) 


Definition 

Heme dioxygenase enzymes catalyze the conversion 
of L-Tryptophan to A(-Formylkynurenine via the inser¬ 
tion of both atoms of heme-bound molecular dioxygen 
at the C2 and C3 positions of the substrate (Fig. 1). 

One member of the family, indoleamine 
2,3-dioxygenase 1 (IDOl) has been implicated in 
tumor immune escape and is seen as a good target 
for anticancer drugs (Rohrig et al. 2010). An impor¬ 
tant criterion for design of IDOl inhibitors is that 
they should avoid interaction with another heme 
dioxygenase, tryptophan 2,3-dioxygenase (TDO). 
TDO has much higher specificity for L-tryptophan, 
although its active site is structurally very similar to 
that of IDOL Consequently, the elucidation of the 
substrate binding and reaction mechanisms of these 
two enzymes has been a topic of considerable interest 
to which computational investigations have made 
important contributions. 

Early mechanisms involved the abstraction of the 
indole N-H proton by either an active site base or by 
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Heme Dioxygenases - Computational Studies, Fig. 1 Reaction catalyzed by heme dioxygenase enzymes 


the heme-bound dioxygen as the initial step (Sono et al. 
1996; Sugimoto et al. 2006). This was considered 
essential because experimental evidence indicated 
that 1-methyl-tryptophan (1-Me-Trp), which does not 
have an indole N-H hydrogen, is a competitive inhib¬ 
itor of both enzymes. Subsequent steps were suggested 
to be nucleophilic attack on the distal oxygen at the C3 
position followed by product formation via either 
a dioxetane intermediate or a Criegee rearrangement. 

A density functional theory (DFT) study on a model 
system (comprised of a truncated oxy-heme group with 
imidazole representing the axial histidine ligand and 
3-methylindole representing the substrate) was the first 
computational study of the heme dioxygenase reaction 
mechanism (Chung et al. 2008). Whereas previous 
mechanisms had assumed the oxy-heme complex to 
be in the closed shell ferrous dioxygen state Fe(II)-02, 
Chung et al. found the open shell singlet state to be 
significantly lower in energy, which (based on Mulli- 
ken spin population analysis) was considered to a ferric 
superoxide complex Fe(III)-02 ~. It should be noted 
that DFT is known to favor a ferric superoxide 
description of oxy-heme while higher level multi- 
configurational calculations have shown a mixture 
of many different valence bond states, including the 
two mentioned above, to be a better model (Jensen 
et al. 2005). 

The results of the DFT study showed that mecha¬ 
nisms involving indole N-H proton abstraction by 
dioxygen are very unfavorable. Newly proposed 
mechanisms starting with electrophilic addition from 
the Fe(II )-02 closed shell singlet state or radical addi¬ 
tion from the Fe(III)-02 ~ triplet or open shell singlet 
states gave more reasonable barriers. Addition to the 
C2 carbon was found to be lower in energy than to 
the C3 carbon. Following addition, the Criegee 
rearrangement pathway was shown to be very 


unfavorable. Homolytic cleavage of the 0-0 bond 
leading to the formation of an epoxide and the high 
valent Fe(IV) = O species, compound II, was consid¬ 
ered to be feasible though higher in energy that the 
dioxetane pathway. Shortly afterward, compound II 
was identified by resonance Raman spectroscopy dur¬ 
ing IDOl turnover (although not during TDO turn¬ 
over) (Lewis-Ballester et al. 2009). 

The resolving of crystal structures for human IDOl 
and bacterial TDO provided a starting point for hybrid 
quantum mechanics/molecular mechanics (QM/MM) 
calculations which have been performed by three 
groups, one of whom studied both IDOl and TDO 
(Capece et al. 2012) while the other two studied TDO 
only (Guallar and Wallrapp 2010; Chung et al. 2010). 
Each used different software packages and procedures 
in their calculations, but the general approach was the 
same: using molecular mechanics force fields, several 
nanoseconds of molecular dynamics simulations were 
run from which structures were chosen as starting 
points to explore the DFT QM/MM potential energy 
surfaces (Fig. 2). 

These QM/MM studies all concluded that abstrac¬ 
tion of the indole N-H proton does not occur because 
no minimum is observed along any of the proton trans¬ 
fer reaction coordinates investigated. This is in agree¬ 
ment with experimental results that showed that the 
only candidates for the proposed active site bases 
(His55 in TDO and Seri67 in IDOl) are not essential 
to activity and that contrary to previous evidence, 
1-Me-Trp is turned over slowly by IDOl and some 
TDO mutants (Chauhan et al. 2009). Capece et al. and 
Chung et al. found the dioxetane pathway to be 
infeasible. 

In all these computational studies, the open shell 
singlet Fe(III)-02 ~ state was calculated to be the low¬ 
est energy state of the Michaelis complex and radical 










Heme Dioxygenases - Computational Studies 


961 


Heme Dioxygenases - 
Computational Studies, 

Fig. 2 Heme dioxygenase 
reaction mechanism proposed 
by Chung et al. based on QM/ 
MM calculations (Reprinted 
with permission from Chung 
et al. 2012. Copyright 2012 
American Chemical Society) 



addition to the C2 carbon was found to occur followed 
by the formation of an epoxide and compound II. 
Oxygen addition to the C3 carbon did not result in a 
minimum on any of the potential energy surfaces. 
All of the QM/MM studies found epoxide and com¬ 
pound II formation to be the rate-determining step, 
with energy barriers ranging from 8 to 20 kcal/mol. 
One notable difference between the studies is that the 
alkylperoxo species resulting from radical addition 
was identified as being a transition state by Capece 
et al., whereas the studies by Chung et al. and Guallar 
and Wallrapp found the alkylperoxo species to be an 
intermediate. 

In the calculations by Wallrapp and Guallar, 
following epoxide formation, oxo attack by com¬ 
pound II occurs at the C2 carbon concerted with 
C2-C3 bond breaking to afford an unstable 
six-membered ring. The transition state for this step 
has an energy of around 10 kcal/mol relative to the 
reactant complex. After the facile opening of this ring, 
the final step is the breaking of the Fe-0 bond, during 
which a change in spin state from singlet to triplet is 
thought to occur. 

Chung et al. calculated pathways in both the singlet 
and triplet states for the first two steps and found 
that the surfaces cross before the transition state to 
compound II and epoxide formation where the triplet 
surface becomes lower in energy. The rest of the reac¬ 
tion profile is given on the triplet surface only. 


although how or where a change in spin state may 
occur is not discussed. Regiospecific epoxide ring 
opening is assisted by transfer of a proton from the 
substrate NH 3 ’^ group on to the epoxide oxygen, with 
a transition state with energy of around -9 kcal/mol 
relative to the reactant complex (much more favorable 
than the third step in the calculations by Wallrapp and 
Guallar). The compound II oxygen then attacks at 
the unsaturated C2 position followed by the final 
step, which involves proton transfer back to the Tryp¬ 
tophan NH 3 '^ group, concerted with C2-C3 and Fe-0 
bond cleavage. 

The calculations of Capece et al. show that IDOl 
and TDO follow the same reaction mechanism, which 
aside from the concerted versus stepwise nature of the 
epoxide and compound II formation is qualitatively the 
same as that proposed by Chung et al. for TDO. 

Consequently, the experimentally observed dif¬ 
ferences in substrate selectivity between the heme 
dioxygenases may have their origins in differences in 
substrate binding rather than reaction mechanism. Sub¬ 
strate binding and selectivity in IDOl and TDO have 
also been investigated with classical molecular dynam¬ 
ics simulations (Capece et al. 2010). 

Cross-References 

► QM/MM Methods 
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Synonyms 

Peroxidases 


Definition 

Heme peroxidases catalyze the hydrogen peroxide- 
dependent oxidation of a variety of different substrates 
by means of oxidized Compound I and Compound II 
intermediates. 

Basic Characteristics 

Heme peroxidases have a long history and are widely 
distributed in biological systems. Horseradish peroxi¬ 
dase (HRP), which is one of the most famous members 
of the family, was first implicated as an oxidizing 
species in horseradish roots as early as 1810 (Planche 
1810) and in the years to follow the colorful spectra of 
its various reaction intermediates played a key part in 
the development of rapid kinetics methods (reviewed 
in Kresge et al. 2004). Entire books have been written 
on heme peroxidases, exploring their reactivity, prop¬ 
erties, and biological function (Everse et al. 1991; 
Dunford 2010). Peroxidases have been characterized 
which utilize both c-type hemes (bacterial diheme 
peroxidases) and redox-active selenocysteine residues 
(glutathione peroxidases). A separate group of vana¬ 
dium-containing peroxidases are also known which cat¬ 
alyze the H202-dependent oxidation of halides (Dunford 
2010). This article describes the structure and mecha¬ 
nism of the most common and widely distributed perox¬ 
idases - those which possess h-type hemes. 

Heme Structure 

Like the globins and the h-type ► cytochromes, heme 
peroxidases contain heme h as a prosthetic group. 
Fig. 1, which (unlike the cytochromes c) is bound to 
the protein through non-covalent interactions. In the 
resting state, this heme group is in the ferric (Fe*^^) 
form and is either five-coordinated, to an axial histi¬ 
dine residue, or six-coordinated with a weakly bound 
water molecule or hydroxide as the sixth ligand. 

Structure 

Structures for a number of peroxidase enzymes have 
been solved (reviewed recently in Poulos 2010). 
Despite attracting much of the early attention, its 
heavily glycosylated structure meant that HRP proved 
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edge 

Heme Peroxidases, Fig. 1 The structure of heme h 

very difficult to crystallize, and a structure did not 
emerge until fairly recently (Gajhede et al. 1997). 
Cytochrome c peroxidase (CcP), on the other hand, 
proved to he much more obliging and crystallizes 
readily. The structure of CcP was thus the first to 
emerge, and for this reason served as a benchmark 
for other peroxidases. Since then, structures for 
numerous other peroxidases have been solved, includ¬ 
ing, amongst others, manganese peroxidase (MnP), 
chloroperoxidase, ascorbate peroxidase (APX), and 
lignin peroxidase (Poulos 2010). 

Within the heme active site, there is high sequence 
homology across all peroxidase structures. The active 
site of cytochrome c peroxidase is shown in Fig. 2. 
There is a “proximal” histidine ligand (Hisl75) which 
is hydrogen bonded to Asp and Trp residues on the 
proximal side of the heme. On the distal side, there is 
a “distal” histidine residue together with distal Arg and 
Trp residues. In most other peroxidases, such as HRP, 
the proximal and distal Trp residues are replaced 
with Phe, Fig. 2. This replacement of Trp for Phe has 
consequences for the Compound I intermediate, as 
explained below. 

The heme active sites in the peroxidases are closely 
related to those in the globins, which have a similar 
proximal/distal histidine structure but which are gen¬ 
erally more hydrophobic than in the peroxidases. 
There has been ongoing interest in the factors that 
control O 2 binding and peroxidase activity in these 
proteins (see, e.g., Ozaki et al. 2001). In fact, the 
relationships and differences in reactivity across all 
heme proteins - including the ► cytochrome P450s, 


globins, and other heme oxygenases - has attracted 
considerable attention (Ortiz de Montellano 1987; 
Dawson 1988; Karlin 2010). 

Mechanism 

The reaction mechanism has been well-studied across 
a large number of heme peroxidases (Dunford 2010). 
The reaction proceeds by means of initial reaction of 
the ferric enzyme with FI 2 O 2 to form a two-equivalent 
oxidized heme intermediate, known as Compound I 
Eq. 1 and releasing one mole of water. Compound I 
is formally oxidized by two-electrons (compared to the 
ferric state), and in most cases contains a ferryl heme 
(Fe(IV = O) and a porphyrin tt-cation radical (see 
below). This is followed by two, sequential single 
electron reductions of Compound I by substrate, 
returning the enzyme to the resting, ferric state through 
a second intermediate, known as Compound II, which 
is formed from one-electron reduction of Compound I, 
Eqs. 2-3. The substrate varies across peroxidases, but 
here is represented as HS where each FIS donates an 
electron and a proton to Compound I, thus providing 
the protons necessary for release of the second mole of 
water from Compound II. 

Fe™ -f H 2 O 2 Compound I -f H 2 O (1) 

Compound I + HS Compound II -f S* (2) 

Compound II -|- HS Fe‘“ -f S* -f H 2 O (3) 

In HRP the formation of Compound I is accompa¬ 
nied by a color change of the ferric protein from red/ 
brown to green, with the green color characteristic of 
a Compound 1 species that contains both ferryl heme 
(Fe(rV) = O) and a porphyrin tt-cation radical. In 
many peroxidases, this Compound I species is observ¬ 
able by rapid ► stopped-flow methods. CcP, famously, 
does not use a porphyrin pi cation radical in its Com¬ 
pound I, but instead uses protein radical located on 
Trpl91 (Sivaraja et al. 1989). This was one of the 
first examples of a stable protein radical in any protein. 
Using mutagenesis, much attention has focused on the 
role of various active site residues on the reaction 
mechanism, in particular the reaction with H 2 O 2 and 
the intermediates involved (Erman and Vitello 2002). 
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Heme Peroxidases, 

Fig. 2 Comparison of the 
active site structures of CcP 
and HRP 
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Heme Peroxidases, Fig. 3 The two substrate binding 
locations observed in heme peroxidases. The S-heme edge is 
typically used by aromatic substrates, as typified in the HRP/ 
benhydroxamic acid complex {left, Henriksen et al. 1998). 


Substrate binding is also possible at the y-heme edge, shown 
here for the MnP/Mn^"^ (centre, Sundaramoorthy et al. 1994) and 
ascorbate peroxidase/ascorbate (right. Sharp et al. 2003) 
complexes. 


Substrate Binding 

Most commonly, the substrate for peroxidase enzymes 
is a small organic substrate. Identification of the sub¬ 
strate binding sites in heme peroxidases lagged a long 
way behind the elucidation of structures and mecha¬ 
nism as described above. The first structures that 
were solved for a peroxidase-substrate complex were 
for the CcP/cytochrome c and MnP/Mn^"^ complexes 
(reviewed recently Gumiero et al. 2010), but these 
enzymes have somewhat atypical substrates (i.e., not 
a small organic molecule). Across all peroxidases (and 
with the exception of CcP, with its anomalous sub¬ 
strate), it has emerged from structural analyses that 
there are two binding locations for the substrate - one 
close to the so-called §-heme edge of the heme 
group, which is typically used for aromatic substrates, 
and one close to the y-heme edge. Figs. 1 and 3 


(reviewed in Gumiero et al. 2010) see also below for 
figure legend. This ability to accommodate different 
kinds of substrates at different locations might well 
account for the tremendous versatility of these enzymes. 


Cross-References 

► Bacterial Globins 

► Cytochromes 

► Hemes 
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Synonyms 

Hemin; Metalloporphyrin; Porphyrin; Protoheme; 
Protoporphyrin IX 


Definition 

Heme, the ferrous iron complex of protoporphyrin IX, 
is a common prosthetic group in proteins including the 
electron transfer ► cytochromes, the respiratory com¬ 
plexes, and the oxygen carrier hemoglobin. 

Basic Characteristics 
Introduction 

Iron porphyrins, including iron (protoporphyrin IX) or 
heme h, are one of the most versatile and visible classes 
of redox cofactors utilized in biochemistry and bio¬ 
physics. Naturally occurring from archaehacteria to 
mammals, heme proteins carry out diverse biochemi¬ 
cal tasks such as electron transfer, substrate oxidation, 
metal-ion storage, ligand sensing/transport, and gene 
regulation. Thus, heme proteins are critical compo¬ 
nents in numerous biological processes including 
hormone and steroid biosynthesis, drug metabolism, 
aerobic respiration, and even programmed cell death. 
The dysfunction of natural heme proteins leads to 
a myriad of human diseases including sickle cell 
anemia, diabetes, and Alzheimer’s and coronary 
artery disease. 

Chemical Structure of Hemes 

The basic structure of a heme is the tetrapyrrole 
macrocycle with dimensions of about lOA x lOA x 
3A (Dolphin 1978). The aromatic nature of the por¬ 
phyrin encourages a planar conformation of the 
macrocycle and leads to significant absorption bands 
in the visible region. The most intense band, the Soret 
or y-band, typically has a molar extinction coefficient 
value greater than 100,000 M“' cm“* which makes it 
ideal for optical and ► resonance Raman spectros¬ 
copies. The center of the porphyrin contains two 
imine nitrogens (-C=N-C-) and two amine nitrogens 
(-C-NH-C-), the latter of which are deprotonated for 
iron incorporation. The four nitrogens of the porphyrin 
dianion serve as equatorial ligands to the bound iron. 
Heme iron commonly exists in either the +2 (ferrous) 
or +3 (ferric) oxidation states, the latter of which is 
active in X-band electron paramagnetic resonance 
(EPR) spectroscopy. Thus, the heme structure shown 
in Fig. 1 is formally neutral as an Fe(II) complex, and 
a monocation as the Fe(III) complex. Ferric hemes can 
be oxidized by one electron to the ferryl state, Fe(IV), 
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Hemes, Fig. 1 Chemical 
structure of biological hemes 
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with a further one-electron oxidation being centered on 
the porphyrin macrocycle, formally generating an Fe 
(IV) (porphyrin 7 t-cation radical) complex. 

Figure 1 also shows how the peripheral architecture 
of the macrocycle varies between biological heme 
types. Heme b, or protoheme, is the most common 
chemical structure observed in biology and Is the 
prosthetic group found in the oxygen transport 
globins, drug metabolizing ► cytochromes P450, and 
the electron transfer cytochromes h (Antonini and 
Brunori 1971; Ortiz de Montellano 1995). The periph¬ 
ery of heme h contains a methyl group at R 3 , and 
vinyl groups at positions Rg and R 13 . This protopor¬ 
phyrin IX macrocycle is itself biosynthesized from 
5-aminolevulinic acid (ALA) via six consecutive enzy¬ 
matic steps that are universal to all living organisms. 
The final step in heme h biosynthesis is the insertion of 
the iron by the enzyme ferrochelatase. Once assem¬ 
bled, heme h binds to proteins via one or two axial 
ligand(s) to the heme iron, hydrogen bonding interac¬ 
tions between the propionic acid groups and local 
amino acids, and hydrophobic interactions between 
the macrocycle and the hydrophobic core of the pro¬ 
tein. While the full range of heme protein biological 
functions are observed in the subset of heme proteins 
containing just heme h, biology utilizes several other 
porphyrin architectures in heme proteins. 

The second most common biological heme struc¬ 
ture is heme c which is found in cytochromes c (cyts c) 
and in cytochrome/of the ► cytochrome hg/complex 
(Scott and Mauk 1996). Heme c differs from heme h in 
that the porphyrin periphery is covalently linked at 
positions Rg and/or R 13 by thioether bonds to 
cysteine residues. These cysteines are often observed 


in a Cys-Xaa-Xaa-Cys-His sequence motif in cyt c in 
which Xaa represents any amino acid and the histidine 
serves as one of the axial ligands to the bound iron. The 
thioether linkages have been found to be critical to cyt 
c structure and function as their removal often results 
in heme dissociation with concomitant protein 
unfolding. The biosynthesis of heme c from its heme 
h precursor occurs by one of three distinct enzymatic 
pathways in biology. The existence of c-type cyto¬ 
chromes has produced many hypotheses concerning 
the advantages of covalently attaching the heme to 
the protein scaffold. The thioether linkages in cyts c 
may be an artifact of evolution, may guide protein 
folding, may allow for high heme to protein ratios 
which may facilitate their electron transfer function, 
or may be present to enforce the coordination of weak 
ligands, e.g., methionine to ferric heme, by increasing 
their effective concentration. 

While heme h and heme c constitute the majority of 
hemes in biological systems, other hemes including 
hemes a, d, o, siroheme, coproporphyrin, and P460 
are also observed. Heme a, structure in Fig. 1 , is vital 
to aerobic respiration in humans where it is solely 
found in the active site of the integral membrane pro¬ 
tein complex, cytochrome c oxidase (CcO). Heme a is 
biosynthetically derived from heme h in two enzy¬ 
matic steps. In the first step, heme h is transformed 
into heme o by converting the vinyl group at Rg into 
a hydroxyethylfarnesyl group. In the second step, the 
methyl group at R 3 in heme o is oxidized to a formyl 
group to generate heme a. The hydroxyethylfarnesyl 
group of heme a may perform one or more of the 
following roles in terminal oxidase function: acting 
as a lipophilic anchor, modulating electron transfer 
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pathways, functioning in the translocation of protons 
and aiding in exogenous ligand exchange reactions. 
The formyl group of heme a side chain raises its 
redox potential relative to heme h or o, making it 
a better electron acceptor. 

Heme Proteins 

Approximately 5% of all structurally characterized 
proteins in the RCSB Protein Data Bank contain 
heme (Reedy and Gihney, 2004). As mentioned ear¬ 
lier, the hemes interact with proteins via heme iron 
axial ligation, propionic acid hydrogen bonding, 
macrocycle hydrophobic interactions, and covalent 
linkage to the protein in cyts c. The prevalent protein 
secondary structure around hemes is the alpha helix, 
but there are instances where hemes are bound to beta- 
sheets. The heme iron may be bound by amino acids in 
the protein or by exogenous ligands and the protein 
may include additional hemes or other cofactors. 

Heme iron is known to bind a wide range of endog¬ 
enous and exogenous ligands. The heme iron is usually 
five- or six-coordinate in the protein scaffold. Histidine 
and methionine are the most common amino acid 
ligands to heme iron, with examples of lysine, aspara¬ 
gine, cysteinate, and tyrosinate ligation known. The 
heme coordination motif together with the constella¬ 
tion of amino acids surrounding the heme helps estab¬ 
lish the chemical reactivity of the heme and thus its 
biochemical function. 

The function of hemes that are six-coordinate in 
both oxidation states, Fe(III) and Fe(II), is often elec¬ 
tron transfer. An iron coordination motif that is invari¬ 
ant upon iron oxidation/reduction results in lower 
reorganization energies and more rapid electron trans¬ 
fer kinetics. The presence of six-coordination also 
serves to limit the ability of exogenous ligands to 
bind the heme that would alter the heme reduction 
potential, or driving force for electron transfer. Con¬ 
versely, the open coordination site in a five-coordinate 
heme protein facilitates the binding of an exogenous 
ligand. 

Numerous five-coordinate heme proteins bind an 
exogenous ligand that is critical to their biological 
function. These include heme proteins that sense, 
transport, and chemically modify exogenous ligands. 
Heme proteins involved in sensing small molecules 
include the 02-sensor FixL and the NO-sensor soluble 
guanylyl cyclase. The CO-sensor protein CooA is six- 
coordinate, but transient dissociation of the ligating 



Myoglobin 


Hemes, Fig. 2 Structure of myoglobin 



Hemes, Fig. 3 Active site structure of cytochrome c oxidase 


N-terminus provides the five-coordinate heme that 
binds CO. Five-coordinate heme proteins are also 
used for the transport of small molecules as exempli- 
hed by the 02 -transporter hemoglobin and the 
NO-transporter nitrophorin. The bound exogenous 
ligand may also be used for chemical catalysis. For 
instance, heme peroxidases oxidize organic substrates 
using hydrogen peroxide and the drug metabolizing 
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cytochromes P450 catalyze the oxidation of organic 
substrates by O 2 (Dunford 1999). In addition to these 
diatomic gases, live-coordinate heme proteins are 
observed to bind water (hydroxide), ammonia, cya¬ 
nide, thiocyanide, azide, acetate, and sulfate. 

Heme proteins may contain a single heme, multiple 
hemes, or hemes in conjunction with other cofactors. 
Figure 2 shows the structure of myoglobin, the proto¬ 
typical example of a water-soluble h-type heme pro¬ 
tein. As noted above, the alpha helical secondary 
structure and histidine coordination observed in myo¬ 
globin are common to many heme proteins. Multiheme 
proteins typically function as electron transfer conduits 
to buried active sites, e.g., hydroxylamine oxidoreduc- 
tase. Hemes are also found in proteins that contain 
other cofactors including iron-sulfur clusters and cop¬ 
per ions. In the case of cytochrome c oxidase, the 
heterobimetallic active site shown in Fig. 3 is com¬ 
posed of heme and Cub with the adjacent heme a 
serving an electron transfer role. 

Heme Reduction Potentials 

The midpoint reduction potential value, E^, of the iron 
in a heme protein establishes its driving force for 
electron transfer and is integrally related to its protein 
stability, protein folding pathway, and kinetics. In 
addition, the value represents the relative affinity 
of the protein for heme in the two oxidation states. 
Heme proteins with large negative E^ values bind Fe 
(III) heme tighter than Fe(II) heme while heme pro¬ 
teins with large positive E^ values bind Fe(II) heme 
tighter than Fe(III) heme. Figure 4 shows that the Fe 
(III)/Fe(II) midpoint reduction potentials for natural 
heme proteins span an 1,100 mV range. The extremes 
are heme acquisition system A (HasA) at —550 mV 
and diheme cytochrome c peroxidase at +450 mV. 


The reduction potential range of heme a in proteins is 
rather limited, while both heme h proteins and heme c 
proteins span a considerable range of reduction poten¬ 
tial values. 

Cross-References 
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Heteromeric Versus Homomeric 
Association of Protein Complexes 

Jacqueline M. Matthews 

School of Molecular Bioscience, The University of 
Sydney, New South Wales, Sydney, Australia 

Synonyms 

Protein-protein interactions; Subunit composition of 
protein complexes; Superfamily heterodimers 

Definition 

Many proteins form homodimers or other higher- 
order homo-oligomers (Marianayagam et al. 2004; 
Matthews 2012) and/or interact with other proteins to 
effect biological responses. Whereas prokaryotic 
multiprotein complexes tend to have a simple compo¬ 
sition, the equivalent assembly in eukaryotes can be far 


more complex. For example, the catalytic core units of 
proteasomes are made up of four seven-membered 
rings, comprising two alpha subunit and two beta 
subunit rings (Fig. 1). In bacteria and archaea, there 
is only one type of alpha and one type of beta subunit, 
but in eukaryotes, there are seven different types each 
of alpha or beta subunits with each ring containing all 
seven different subunits (Darwin 2009; Pearce et al. 
2008). Whereas the central core of voltage-gated 
sodium ion channels in bacteria generally comprises 
four identical subunits, that in bacteria comprises 
a single-chain protein with four homologous domains 
(Charalamhous and Wallace 2011). Proteins that 
regulate gene expression, which in prokaryotes are 
often homodimers or oligomers, tend to form 
multiprotein complexes in eukaryotes (Beckett 2001). 

Basic Characteristics 

Differences in protein complex composition are 
reflected in the sizes of the genomes and the complex¬ 
ities of the organisms involved. Prokaryotic genomes 
tend to be small compared to eukaryotic genomes, 
whereas eukaryotic genomes can support the develop¬ 
ment and maintenance of highly complex organisms. 
In part, the expanded sizes of eukaryotic genomes are 
likely to have been caused by genome duplication 
events. Over time, identical copies of genes acquire 
mutations allowing them to diverge in terms of both 
sequence and function to form paralogs. With the 
occurrence of additional genome duplication events, 
families of related proteins are gradually formed 
(Pereira-Leal et al. 2007). Thus, protein that formed 
homodimers in parent organisms could have evolved 
families of related proteins that can form homomeric 
and/or heteromeric interactions with other family 
members. These “superfamily heterodimers” include 
cell surface receptors, enzyme complexes, transcrip¬ 
tion factors, and ion channels. Proteins that have 
numerous interaction partners tend to be of increased 
importance to the viability of an organism (Aragues et al. 
2007), and many of these superfamily heterodimers are 
biologically important. 

Different combinations and permutations of sub¬ 
units in complexes tend to have different activities, 
such as transcription factor complexes targeting differ¬ 
ent DNA sequences or recruiting different cofactors. 
The exchange of a single component from the same 
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Heteromeric Versus Homomeric Association of Protein Compiexes 



Heteromeric Versus Homomeric Association of Protein 
Complexes, Fig. 1 Complexity of subunit composition in 
proteasomes and proteasome-like complexes, (a) The archaeal 
20S proteasome contains a single type of alpha (a) and a single 
type of beta (P) subunit arranged in seven-membered rings (PDB 


accession code IPMA). Here, the different rings are shown in 
different colors, (b) The eukaryotic 20S proteasome contains 
seven different types of a and P subunits (PDB accession code 
2NZJ). Each ring comprises one copy of each a subunit, or one 
copy of each P subunit 


family can transform a transcription complex from one 
that activates expression of a target gene to one that 
switches off expression (e.g., Jing et al. 2008). An 
ability to use regulatory proteins in a combinatorial 
fashion is generally thought to be a mechanism through 
which complexity in higher eukaryotes is achieved 
(e.g., Lange et al. 2011). By assembling multiprotein 
complexes with different activities, tight regulation 
of cellular events can be achieved through nonlinear 
responses to changes in protein concentration 
(Boulanger et al. 2003; Hansen et al. 2008). 

Prokaryotes are still able to maintain high levels 
of regulation over the activity of multiprotein com¬ 
plexes through mechanisms including the regulation 
of protein oligomerization through posttranslational 
modihcation, small ligand binding, and/or DNA 
binding (Beckett 2001). 


Cross-References 

► Mass Spectrometry: Mapping Large Stable Protein 
Complexes 

► Protein Structure 

► Proteins: Relationship Among Divergence of 
Sequence, Structure, and Function 

► RNA Polymerases and Transcription 
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Synonyms 

HMM; Markov model 

Definition 

A hidden Markov model (HMM) is a probabilistic 
model in which the system being modeled is assumed 
to be a Markov process with unobserved (hidden) states. 

Introduction 

The development of ► single-molecule spectroscopy 
has allowed for the investigation of a variety of bio¬ 
logical questions previously inaccessible by ensemble 
techniques. The strength of single-molecule tools 
comes from the high-resolution data extracted from 
such experiments. Proper interpretation of these 
data requires efficient, unbiased analysis routines that 
are able to distinguish relevant signals from the 


intrinsically noisy measurements. The hidden Markov 
model, a statistical algorithm initially developed for 
speech recognition, has been adapted for the analysis 
of a variety of single-molecule signals. In this article, 
we will give a general introduction to the theoretical 
basis of hidden Markov modeling and the various 
single-molecule techniques in which they have been 
co-opted for signal analysis. 

Basic Characteristics 

Signals collected from single-molecule experiments 
can be described as a series of discrete states governed 
by an underlying physical property of the molecule(s) 
being interrogated. The discrete states are often 
obscured (hidden) by noise that is inherent to the 
experimental technique, making the identification and 
characterization of these states difficult. The hidden 
states are no longer efficiently detected by visual 
inspection or simple algorithms and doing so can intro¬ 
duce bias and an incomplete characterization of 
the underlying behavior(s). Probabilistic maximum- 
likelihood algorithms, like a hidden Markov model 
(HMM), have become the preferred method of analy¬ 
sis; they provide a mathematically derived routine that 
limits the possibility of user bias as well as providing 
a theoretical framework with which to interpret the 
quantitative results extracted from single-molecule 
experiments. A HMM describes a stochastic progres¬ 
sion through a series of discrete states, where the 
likelihood of the next event in a series of observations 
can be predicted upon knowledge of the immediately 
preceding event and does not depend on knowledge 
of any of the prior events; that is, the process is 
Markovian (Fraser 2008). The main model assumptions 
of a HMM are as follows: 

1. Given the current state, the probability of the cur¬ 
rent observation is independent of states and obser¬ 
vations at all earlier times. 

2. Given the current state, the probability of the next 
observed state occurring is independent of earlier 
states. More simply put, the future does not depend 
on the past. 

HMMs are well suited for single-molecule analysis 
because of their ability to find discrete states, usually 
stable, biologically relevant conformations, within 
noisy time series data, and to reliably find the most 
probable path through these states. As a molecule 
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transitions from one stable conformation to another it 
is often the case that the process is Markovian and 
therefore governed by single exponential kinetics. 
Through the iterative optimization of the HMM param¬ 
eters - the probability matrices of transition, emission, 
and initiation - a model is derived that best approxi¬ 
mates the data. The transition probability matrix 
describes the probability of any one state changing to 
any other state or staying in the same state in the 
subsequent time step (Fig. 1). The emission probability 
distribution contains the probabilities of a specihc sig¬ 
nal value being emitted by each discrete state. Calcu¬ 
lating an emission probability often requires an 
assumption of the noise in the system, usually shot 
noise that is efficiently approximated by simple Gauss¬ 
ian distributions (Fig. 2). The initiation probability 
matrix gives the probabilities of starting at each of 
the possible discrete states. 

During the evaluation of single-molecule data 
one does not usually know what the idealized values 
of the states A and B or their probability matrices are. 
To begin the analysis, a Markov model is estimated 
with a given number of states and a probability tran¬ 
sition matrix. The model is then optimized by deter¬ 
mining the parameters that yield the maximum 
likelihood for the given trace. If the number of states 
has been determined, then the appropriate number of 
states can be entered and the rest of the parameters 
optimized. It is often the case that the number of states 
is also unknown, but can be determined by optimiza¬ 
tion of various Markov models with differing num¬ 
bers of states and the results compared through 
a criterion such as the Akaike Information Criterion 
(AIC) or the Bayesian Information Criterion (BIC) 
(Blanco and Walter 2010). Due to the iterative 
process required to find the most probable model 
parameters and sequence of states, HMMs are com¬ 
putationally expensive. The availability of a family of 
algorithms whose complexity scales only linearly 
with the length of the trajectory makes it possible to 
apply HMMs to time series that reach biologically 
relevant long time scales. The algorithms most com¬ 
monly used include: 

1. The forward algorithm, which calculates the condi¬ 
tional probability of being in a state ^ at time f given 
all of the observations up to that time. It also calcu¬ 
lates the conditional probability of each observation 
given previous observations. Using these terms 
it calculates the probability of the entire data 
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Hidden Markov Modeling in Single-Molecule Biophysics, 
Fig. 1 Simple hidden Markov model (a) An example of a hidden 
Markov model with two states with the transition probabilities 
presented above individual arrows to represent the likelihood of 
transiting from one state to another or staying in the same state 
within the next time step, (b) The transition probability matrix of 
the Markov model from (a), (c) A simulated single-molecule 
FRET trajectory using the two-state model with states A 
(FRET = 0.8) and B (FRET = 0.2) 
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Fig. 2 Emission probabilities for the Markov process. The 
emission probability distributions of the two states A and B 
from the simulated single-molecule FRET trajectory in Fig. 1 
are plotted. Here the emission probabilities are calculated by 
assuming Gaussian noise distributions around the discrete mean 
FRET values (A = 0.8, B = 0.2), modeling the shot noise of the 
signal collection instrumentation 


sequence given the model. This is also the first 
phase of the Baum-Welch algorithm (below). 

2. The Viterbi algorithm, used when one needs to 
estimate a sequence of states from a sequence 
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of observations. It finds the most probable state 
sequence. 

3. The Baum-Welch algorithm (forward-backward 
algorithm) calculates, given a given sequence of 
observations and an initial set of model parameters, 
in a single pass based on the forward algorithm 
a new set of parameters that has higher likelihood 
of being correct. Running many iterations of the 
Baum-Welch algorithm yields a sequence that 
approaches a local maximum of the likelihood. 
Although HMMs provide an excellent tool for ana¬ 
lyzing single-molecule data it must be noted that not 
all data fulfill the assumptions of a Markov process. 
For example, the changes of biomolecules can exhibit 
time-dependent transition probabilities due to 
molecular memory effects. In these cases HMMs 
can still approximate the data but care must be taken 
in the interpretation of the results. For an in-depth 
discussion of the mathematical foundations underly¬ 
ing HMM, we refer the reader to Rabiner (1989) and 
Fraser (2008). 

Applications of Hidden Markov Models in 
Single-Molecule Biophysics 

The general framework of HMMs has been adapted to 
a variety of single-molecule techniques. Due to the 
differing character of the signals acquired from these 
techniques there is not a single HMM that can be 
utilized for the analysis of all types of data collection. 
However, HMMs have been modified and improved in 
the various fields to better model the data and noise of 
each particular field. 

Ion Channel Recordings 

Among some of the earliest single-molecule experi¬ 
ments came from the electrophysiology field, where 
the action potential across a single ion channel can be 
recorded over time with the use of ► patch clamp 
techniques. These techniques allow for the direct mea¬ 
surement of ionic currents through a single channel 
protein molecule. The amplitude of the signal collected 
describes the permeability of ions through the channel 
and the change in this permeability can be recorded in 
real time. HMMs can approximate the open and closed 
states of these pores effectively. This HMM imple¬ 
mentation assumes the underlying signal Is a Markov 
process whose noise is assumed to be Gaussian. QuB 


(available at http;//www.qub.buffalo.edu/wiki/index. 
php/Main_Page) is a readily available software pack¬ 
age routinely used for the analysis of ion channel 
recordings (Qin et al. 2000). 

Fluorescence Microscopy 

Single-molecule fluorescence microscopy has 
become one of the most popular single-molecule 
techniques due to the wide range of biomolecules 
that can be studied with this technique. HMMs have 
been adapted for various types of fluorescence 
microscopy to better model the different forms of 
intensity time traces collected. 

Single-Molecule Fluorescence Resonance Energy 
Transfer (smFRET) 

► smFRET can provide a real-time view of dynamics 
of biomolecules ranging from small catalytic RNAs 
(ribozymes) to large RNA-protein complexes such as 
the ribosome. The distance-dependent interaction 
between two fluorophores can report on intra- and 
intermolecular conformational changes. The applica¬ 
tion of HMM to the analysis of smFRET data became 
more accessible after the release of HaMMy, a user- 
friendly analysis software specifically designed for the 
analysis of smFRET data (McKinney et al. 2006). 
Previously, less sophisticated algorithms such as 
thresholding techniques were used that could often 
not handle complicated trajectories. The adaptation 
of HMM to smFRET has been advantageous for the 
field and the tool continues to be developed for the 
analysis of systems with more complicated behaviors. 
An example of the power of HMM analysis of 
smFRET trajectories is the trajectory of a single pre- 
mRNA molecule imaged under splicing conditions 
(Eig. 3) (Blanco and Walter 2010). The large number 
of states and rapid kinetics of transitions makes this 
type of trace difficult to analyze without the use of 
HMMs. In addition to HaMMy, programs such as 
QuB and vbERET (Bronson et al. 2010) (available at 
httpV/vbfret.sourceforge.net) are available for 
smFRET analysis with advantages for more complex 
trajectories (Blanco and Walter 2010). 

Switchable FRET 

Switchable FRET is a combination of two techniques: 
smFRET and photoswitching, the reversible activa¬ 
tion and deactivation of fluorophores commonly 
used in super-resolution imaging techniques such as 
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Hidden Markov Modeling in Single-Molecule Biophysics, 
Fig. 3 A complex smFRET trajectoiy analyzed with HMM. 
HMM can be utilized to characterize the dynamics of single 
pre-mRNA molecules during splicing that exhibit rapid kinetics 
and a large number of states without the need for smoothing 
which would eliminate these small rapid conformational 
changes. In black is the raw FRET trajectory, and in magenta 
the five-state FIMM idealized fit. In this particular case, the 
QuB ion channel analysis software was utilized for the HMM 
analysis 

► stochastic optical reconstruction microscopy 
(STORM). This technique utilizes multiple donor- 
acceptor fluorophore pairs to sequentially probe and 
obtain multiple distances within a single molecule. 
Traditional HMMs for smFRET cannot incorporate 
the stochastic photoswitching of the acceptor dye and 
therefore a linked hidden Markov model was devel¬ 
oped where FRET and donor-acceptor stoichiometry 
are tracked (Uphoff et al. 2010). The linked HMM 
allows for the proper identification of states and deter¬ 
mination of their transitions. 

Multi-fluorophore Bleaching 

Some single-molecule experiments utilize multiple 
fluorophores to help determine the number of subunits 
assembled in a particular biological complex. The 
sudden drops in intensity when single fluorophores 
each undergo an irreversible ► photobleaching event 
can be used as a measure of the number of particles 
present. HMMs have been developed for the unbiased 
determination of the discrete steps in the fluorescence 
intensity traces. Up to 30 fluorophores can be reliably 
detected through the use of HMMs (Messina et al. 
2006). The ability to reliably visualize and count 
the number of single molecules present has applica¬ 
tions ranging from self-assembly of biomolecules to 
tracking the assembly of trans factors in multi- 
component systems. 


Molecular Motor Step Size 

HMMs have been developed for the case of fluorescent 
measurements of molecular motors. An HMM variant, 
the variable-stepsize HMM (vsHMM) where the posi¬ 
tion of the motor is modeled as a large number of 
states, has been developed to more accurately track 
the movement of these motors as a result of their 
reaction cycle. This model differs in that it allows 
for an arbitrary distribution of step sizes that allows 
it to run as a robust algorithm with little user input. 
The algorithm has also been extended to the variable- 
stepsize integrating-detector HMM (VSI-HMM) 
which serves to better model the variation in signals 
during data acquisition such as random baseline fluc¬ 
tuations. Together, these FlMMs have been utilized to 
characterize the movement of a myosin motor both 
in vitro and in vivo (Syed et al. 2010). 

Single-Particle Tracking 

► Single-particle tracking can be used to extract 
modes of diffusion for a single molecule. In the case 
of biomolecules diffusing through a cell this approach 
can provide insight into the regions of localized activ¬ 
ity, concentration gradients, or sites of modification. 
A single-particle track, with certain assumptions, can 
be modeled with a two-state HMM. The two-state 
HMM is optimized through the diffusion coefficients 
of the states and the rates of transition between them. It 
has been shown that this HMM is sufficient to extract 
multiple states of diffusion within a single trajectory in 
a practical manner (Das et al. 2009). 

Tethered Particle Microscopy (TPM) 

>■ TPM experiments use light microscopy to measure 
the position of a bead tethered to a microscope slide via 
a polymer to infer the behavior of the polymer. An 
example is the use of DNA as the tether to measure 
DNA folding/unfolding dynamics and the effects of 
DNA-binding proteins on those dynamics. HMM anal¬ 
ysis algorithms have been used to track the subtle 
changes in bead position and determine the relevant 
changes from those induced by Brownian motion by 
incorporating factors for the diffusive motion of the 
bead (Beausang et al. 2007). This approach has 
allowed for data analysis without the need for filtering. 
Another improvement to the HMM algorithms used for 
these experiments introduced factors to account for the 
nonlinear extension of DNA, allowing for a more accu¬ 
rate, quantitative assessment of the kinetics. 
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Limitations of Hidden Markov Models 

HMMs can be a powerful tool for the unbiased analysis 
of single-molecule data, but the resulting models need 
to be carefully inspected. As noted previously, not all 
biological processes studied under single-molecule 
conditions fulfill the Markov property, thus violating 
one of the assumptions of HMMs. This can affect the 
ability of the model to fully recapitulate the underlying 
behaviors. Additionally, as can be seen by the various 
adaptations of HMMs, it is necessary to define the right 
set of parameters to properly simulate the noise present 
in the system as well as the discrete number of states of 
interest. Finally, as with any fitting technique, when 
using HMM a rigorous test for model selection is 
required. This testing is often complicated by the lack 
of a clear and decisive way of selecting the proper 
number of states for the model. Although several 
methods have been presented for an unbiased approach 
at state selection, there is little consensus regarding 
which is best. 

Cross-References 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Photoactivated Localization Microscopy (PALM) 

► Single Fluorophore Photobleaching 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Single-Molecule Spectroscopy 

► Single-Particle Tracking 

► Stochastic Optical Reconstruction Microscopy 

► Tethered Particle Microscopy 
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Hierarchically Structured Lipid Systems 

Chandrashekhar V. Kulkarni 
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University of Cambridge, Cambridge, UK 

Synonyms 

Lipid phase equilibria; Lipid self-organization; Lipid 
superstructures; Multiscale structural ordering of 
lipids 

Definition 

Amphiphilic lipids self-assemble into various thermo¬ 
dynamically stable nanostructures in the presence of 
water, which can be kinetically stabilized into hierar¬ 
chically ordered lipid systems exhibiting multiple 
structural length scales. 

Introduction 

Amphiphilic molecules such as lipids have the inherent 
tendency of self-assembling in an aqueous environ¬ 
ment. The hydrophobic effect acts upon minimizing 
interactions between the water and hydrophobic parts 
of lipid molecules, thus having a prime contribution 
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toward their structural ordering. Lipid polymorphism 
spans thermodynamically stable and metastable forms 
of simple micelles as well as complex, yet well- 
ordered, one-, two- and three-dimensional assemblies 
(Seddon 1990; Hyde et al. 1997). They typically dis¬ 
play intermediate behavior between that of solids and 
liquids, and also, their dimensions lie in the range of 
^2.5-25 nm; therefore they are also referred as liquid 
crystalline (LC) nanostructures or lyotropic (phases 
induced by the addition of solvent) LC phases. 
Biological evidence of lipid self-assembly includes 
the plasma membrane, highly convoluted forms of 
ER (endoplasmic reticulum) membranes, inner nuclear 
membrane, mitochondrial inner membrane, chloro- 
plast thylakoid membranes, tubules, and liposomes. 
However, most of the biological architectures are com¬ 
prised of several molecular constituents rather than just 
one or a few simplistic lipid structural makeups. In 
other words, above biological entities would not be 
purely “self-assembled” and “thermodynamically sta¬ 
ble” lipid systems, but may well be “kinetically stable” 
and/or “metastable” ensembles, which are affected by 
various physiological and physicochemical parame¬ 
ters. It is also possible to design many more complex 
morphologies by artihcially modulating the basic lipid 
assembly at various hierarchical orders, and this is 
what is mainly focused on in the following sections. 

Lipid Phase Behavior 

Amphiphilic lipid molecules usually have slightly 
more hydrophobic character compared to that of sim¬ 
ple detergents and tend to form type 2 or “inverse” 
aggregates where the interfacial curvature is toward 
the aqueous region. The lipid molecular chain splay 
increases with increasing temperature (pressure has the 
opposite effect), which makes the molecule more 
inverse conical, diverting away from cylindrical 
shape. This in turn increases the mean curvature (the 
average of the two principal curvatures) of a phase, 
resulting in phases with greater (negative) curvature. 
The formation of phases is also governed by other 
important physicochemical factors such as the type of 
lipid, degree of unsaturation, head group area, lipid 
chain length, type of solvent, composition of solvent, 
volume fraction of lipid in a corresponding mixture of 
lipids, addition of other additives (like buffers, salts, 
other amphiphilic molecules), and pH. Some of these 


factors have been reviewed in the literature (Seddon 
and Templer 1995; Kulkarni et al. 2011b). The 
most simple lyotropic system is with two components, 
i.e., a pure lipid plus water, whose behavior is usually 
studied at different temperatures and represented on 
a temperature-composition phase diagram, as shown 
in Fig. 1. The monoelaidin-water system (Kulkarni 
2011 c), taken here as an example, shows three 
bicontinuous cubic phases, Ia3d, Pn3m, and Im3m, 
whose architecture is based on mathematical minimal 
surfaces G, D, and P, respectively. These intricate 
cubic phase assemblies exhibit hnely designed and 
ordered topology where an individual phase consists 
of a continuous lipid bilayer enclosing two interwoven 
but discrete aqueous channels. The numbers, dimen¬ 
sions, and angles at which the aqueous channels meet 
at single points in the unit cell of the cubic phases differ 
among the phases, which allows a certain degree of 
tunability with respect to their dimensions (Kulkarni 
et al. 2011b). 

The lamellar phase is quite common in many lipid 
systems and is an indispensable structural constituent 
of biological membranes. This phase consists of one¬ 
dimensional stacking of planar lipid bilayers, whose 
behavior may differ in the degree of chain fluidity, 
head group ordering, tilting and interdigitation of mono- 
layers, etc. The aforementioned parameters have vital 
significance in the functioning of biomembranes; 
another consequence is that they allow the observation 
of polymorphism in lamellar phases. Monoelaidin sys¬ 
tem (Fig. 1) shows two polymorphs: fluid (L^) and gel 
(Lp) lamellar phases; however, more are also possible 
at lower temperatures and hydration (not shown) 
(Kulkarni 2011c). The inverse hexagonal Hn phase, 
comprising of long water charmels enclosed by lipidic 
cylinders, is also common among many lipid systems. 
Other LC nanostructures such as micellar cubic-Fd3m, 
disordered bicontinuous - sponge and disk-shaped 
bicelles are observed when additional chemical or bio¬ 
logical component/s is/are added to the binary lipid- 
water system (Kulkarni et al. 201 lb). 

Hierarchical Structural Ordering of Lipids 

Lyotropic LC phases are known for their applications 
in various biotechnological fields, ranging from drug 
delivery to membrane protein crystallization (Kulkarni 
et al. 201 lb). However, direct use of bulk LC phases is 
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Fig. 1 Binary phase diagram 
of the monoelaidin-water 
system displaying lamellar 
(fluid lamellar, and 
lamellar gel, Lp) phases; 
bicontinuous cubic phases - G, 
D, and P phases - also 
indicated by corresponding 
crystallographic space groups 
as, la3d, Pn3m, and Im3m: and 
the inverse micellar fluid 
isotropic phase (L 2 ). (Figure 
modified from Kulkami et al. 
2010b) 



hampered due to their high viscosity and variable 
domain consistency. To overcome these problems, 
they are usually transformed or kinetically stabilized 
into other forms and/or hierarchical structures using 
some external stabilizer. Depending on the method 
of preparation and amount of dispersed phase, LC 
nanostructures reorganize into oil-in-water (0/W) or 
water-in-oil (W/0) emulsions (Fig. 2). The hydropho¬ 
bic phase (lipid or lipid-i-oil) itself displays a wide 
range of nanostructures; when dispersed in water in 
the form of particles, they are genetically known as 
“Isasomes” (Yaghmur et al. 2005), i.e., internally self- 
assembled somes, i.e., particles. More specifically, 
the particles with hexagonal and cubic internal struc¬ 
tures are termed, respectively, “hexosomes” and 
“cubosomes,” the latter of which was the first such 
emulsion, demonstrated more than two decades ago 
(Larsson 1989). The reverse type of emulsion, where 
a hydrophobic phase (with an LC nanostructure) forms 
a continuous wall-like architecture enclosing water 
droplets (up to ^95% by wt), is called a W/0 nano- 
structured emulsion, and more importantly, it does not 
require any external stabilizer (Kulkami et al. 2010). 

Recent research advances have extended the limits 
of internal nanostructures of Isasomes beyond the 
cubic (Pn3m, lm3m, or Fd3m) and hexagonal (H 2 ) 
phases so as to include the microemulsion (Yaghmur 
et al. 2005), lamellar (Battaglia et al. 2007), and even 
sponge phases (Angelov et al. 2011); the resulting 
Isasomes are called emulsihed microemulsions 
(EME), lamellarsomes, and spongisomes (name given 
here), respectively. Sometimes, the vesicles and 


liposomes were seen to coexist and/or enclose the 
Isasomes. Here, note that the lamellarsomes and lipo¬ 
somes are not one and the same, as the interior of 
liposomes is usually hlled with water whereas lamellar 
phase is enclosed by the lamellarsomes. Being similar 
in size as liposomes, the Isasomes are comprised of LC 
phases thereby providing considerably higher volume 
fraction of hydrophobic environment. The latter is 
already equipped with well-dehned nanostructural 
units facilitating loading of poorly water soluble and 
amphiphilic functional molecules. The same disper¬ 
sion allows loading of highly water-soluble molecules 
in the continuous aqueous medium. Increased diffusive 
path length provided by the internal nanostructures of 
Isasomes thus demonstrate an enhanced capacity for 
sustained release of the loaded components, which can 
be further improved by increasing the viscosity by 
adding hydrogelling agent in the aqueous region 
(Kulkami and Clatter 2012). In this manner, the 
Isasomes provide certain advantages over vesicle/ 
liposomal stmctures and conventional (nonstmctured 
oil phase) emulsions; the latter are most common for 
detergent-based amphiphiles. 

There are several other types of hierarchically 
ordered forms of lipids being developed and investi¬ 
gated by researchers for various applications, includ¬ 
ing poly-high internal phase emulsions (poly-HIPE); 
bicontinuous interfacially jammed emulsion gels 
(bijels); polymerized LC phases; lipid nanoparticles 
(LNP), i.e., liposomes loaded with functional 
molecules; lipid nanocapsules; etc. Self-assembled 
monolayers (SAM), Langmuir-Blodgett films, and 
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Hierarchically Structured Lipid Systems, 
Fig. 2 Hierarchical ordering of lipids in (a) OAV and (b) W/0 
nanostructured emulsions indicated with approximate length 
scales. The LC nanostructure shown here is “hexagonal,” but it 


can be easily tuned into other forms such as cubic, fluid isotropic, 
and sponge. Parts of the figure are obtained from (Yaghmur et al. 
2005) and (Kulkami et al. 2010a) 


► supported lipid bilayers (SLB) are some simple, 
commonly employed model lipid assemblies. Solid 
lipid nanoparticles (SLN) developed in the early 
1990s, and their successive variants, nanostructured 
lipid carriers (NLC), prepared by the high-pressure 
homogenization technique, are also under investiga¬ 
tion, particularly for drug delivery applications 
(Muller et al. 2002). Nonetheless, structured emul¬ 
sions, formed primarily from simple monoglycerides, 
have emerged to be quite successful lipid-based mate¬ 
rials that are utilized for various commercial applica¬ 
tions (Yaghmur and Glatter 2009). 

Isasomes: Nanostructured Lipid Particles 

Isasomes are kinetically stabilized O/W emulsions 
comprising internally nanostructured submicron¬ 
sized (typically 200-400 nm) lipid particles dispersed 
in excess water (typically 80-95%) as a continuous 


medium. Some groups define the Isasomes differently, 
e.g., nonlamellar lipid nanoparticles (Barauskas et al. 
2005), self-assembled lipid nanocarriers, nanostruc¬ 
tured lipid particles, or colloidal lipid nanostructures. 
The primary components of Isasomes are lipid, 
stabilizer, and water. Lipids that form cubic and/or 
hexagonal phases at room temperature are usually 
required for Isasome preparation; however, monoolein 
(MO) and monolinolein (ML) seem to be popular in 
their pure or commercial derivatives (Dimodan-U/J® 
(DU) and glycerol monooleate (GMO), respectively), 
while phytantriol (PT) and monoelaidin (ME) have 
also been used in recent years. Boyd and coworkers 
(2009) have compiled a list of lipids and lipid mixtures 
forming nonlamellar phases and their potential utility 
for the preparation of hierarchical structures. The lit¬ 
erature on various lipid phase diagrams is also indexed 
by Koynova and coworkers (Koynova and Caffrey 
2002). A range of polymers, surfactants, food hydro¬ 
colloids, and nanoparticles are utilized as emulsion 
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(Starting from LC phase) 


Intermediate Bottom UP 

(Starting at molecular level) 





1. Ultrasonication 

2. Shearing 

3. High pressure homogenization 


Dilution of Monoolein-Ethanol 
Mixture by stabilizer solution 
(Less energy and room temperature) 


Precursor preparation via: 

1. Spray drying 

2. Freeze drying 


Hierarchically Structured Lipid Systems, Fig. 3 Different approaches for the preparation of Isasomes, especially cubosomes 


stabilizers, which can be broadly categorized into the 

following categories: 

1. Polymers and Polymeric Surfactants 

The amphiphilic triblock copolymer, Pluronic® 
FI27 (PEO 99 -PPO 67 -PEO 99 ), has been most widely 
used for structured emulsions such as Isasomes; 
nevertheless, polysaccharide and polyethylene 
glycol (PEG)-based stabilizers, e.g.. Tween, 
poloxamers, and PEGylated lipids, are also used, 
with varying levels of success (Yaghmur and 
Glatter 2009). 

2. Solid Particles {Pickering Emulsion Stabilizers) 
Surfactant-based stabilizers are prone to affect the 
internal nanostructure of Isasomes; moreover, they 
can penetrate inside the lipid particles to disrupt the 
nanostructured interior. Instead of surfactants, some 
solid particles are exploited to obtain “surfactant- 
free emulsions” which are also known as Ramsden- 
Pickering or Pickering emulsions. An assortment of 
solid particles utilized for Pickering emulsions 
include latex particles, metal oxides, carbon and 
sulfate particles, polymeric rods and grafted poly¬ 
mer brushes, silica nano- or microcolloids, and clay 
platelets (Kulkami and Glatter 2012). The latter two 
are best suited for stabilizing Isasomes because they 
allow the emulsion stability to be controlled by 
changing one or more of the following parameters: 
charge, pH, and temperature; although it is also 
possible to alter conventional parameters, such 
as concentration and particle size and shape. The 
overall emulsion stability varies inversely with the 
particle size, as the smaller particles pack more 
efficiently. The particle size ranges from nanometer 
to micrometer, stabilizing up to millimeter-sized 


droplets, which is rather unusual for surfactant- 
based systems (Kulkami and Glatter 2012). 

3. Biomolecules 

Emulsions are also known to be stabilized by pro¬ 
teins, such as casein, albumin, and lactoglobulin 
(Dickinson 2009; Yaghmur and Glatter 2009). Pro¬ 
tein-stabilized food emulsions are sometimes called 
Pickering emulsions, even though most proteins 
used for this purpose are also amphiphilic. Other 
biomolecules including bile salts, ► cellulose and 
derivatives, modified forms of starch, etc., are also 
being explored for stabilizing structured emulsions 
(Kulkami and Glatter 2012). 

Various emulsifying systems and stabilizers includ¬ 
ing lipids, surfactants, proteins, hydrocolloids, and 
nanoparticles and the underlying principles have been 
recently reviewed (Hunter et al. 2008; Dickinson 2009; 
Yaghmur and Glatter 2009). 

A variety of methods have been developed for the 
preparation of Isasomes, requiring high or low energy 
input for their crude or fine production. There are two 
major approaches: one starting from the ready-made 
LC phases (“top down”), whereas the other starts 
from the molecular level (“bottom up”), as shown 
schematically in Eig. 3. In the first, the LC phases 
are dispersed via high energy techniques such as 
ultrasonication, high-pressure homogenization, or 
shearing in a couette-like device, whereas the second 
utilizes spray or freeze-drying to make dry precursors 
of lipid and starch or dextran to form Isasomes, simply 
by subsequent hydration. The latter is more useful for 
large-scale production, mainly for the food industry. 

An intermediate “hydrotrope”-based approach was 
also reported in the literature, by which Isasomes are 
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Hierarchically Structured Lipid Systems 

Hierarchically Structured Lipid Systems, Table 1 Physicochemical parameters for modulation of the structure and properties of 
Isasome systems. Compilation is based on the literature to date, more specifically from the following references (Barauskas et al. 
2005; Boyd et al. 2009; Yaghmur and Glatter 2009; Kulkami et al. 2011a, b; Kulkami and Glatter 2012) 

No. 

Parameter 

Change in the Isasome system 

1 

Concentration of dispersed phase or continuous phase 
(usually water) 

Viscosity, amount of Isasomes, sometimes the size of Isasomes 

2 

Composition of hydrophobic phase (e.g., lipid to oil ratio) 

Internal nanostmcture type and lattice parameters of LC phase, 
size of Isasomes 

3 

Type of hydrophobic additive (oil type - tetradecane or 
limonene, other lipid, glycerol, biomolecules, etc.) 

Internal nanostmcture type and lattice parameters of LC phase, 
stability of Isasomes 

4 

Type of primary lipid that forms (internal) LC nanostructure 

Internal nanostmcture type and/or lattice parameters of LC phase 

5 

Concentration of stabilizer 

Internal nanostructure type and lattice parameters of LC phase, 
size of Isasomes 

6 

Type of stabilizer (polymeric surfactants, solid 
nanoparticles, clay, proteins, etc.) 

Internal nanostructure type and lattice parameters of LC phase, 
size of Isasomes, mechanism of stability of Isasomes 

7 

Hydrogel formation in continuous phase 

Viscosity, (arresting) particle dynamics, sometimes internal 
nanostmcture, stability of Isasomes 

8 

Charged components 

Type of internal nanostructure, stabilization/destabilization of 
Isasomes 

9 

pH 

Type of internal nanostructure, stability of Isasomes 

10 

Temperature 

Internal nanostmcture type and lattice parameters of LC phase, 
stability of Isasomes, size of Isasomes 

11 

Pressure 

Internal nanostructure type and lattice parameters of LC phase, 
stability of Isasomes, size of Isasomes 

12 

Method of preparation (ultrasonication, shearing-varying 
shear rate) 

Stability of Isasomes, size of Isasomes, mechanism of stability of 
Isasomes 

13 

Intermixing two differently nanostructured Isasomes 

Type of internal nanostructure, kinetics of LC phase formation/ 
disappearance, size of Isasomes 


prepared via spontaneous emulsification upon blend¬ 
ing of two solutions, i.e., lipid plus hydrotrope and 
stabilizer solution. This method requires less energy 
compared to others; moreover, it allows Isasome pro¬ 
duction at room temperature. Some other methods with 
varying degrees of success include mechanical stirring 
during the hydration of lipid plus stabilizer film and 
microfluidization-based methods as revealed by 
Yaghmur and coworkers (Yaghmur and Glatter 2009) 
and Boyd and coworkers (2009). 

For laboratory scale Isasome production, the 
ultrasonication method is very convenient, primarily 
for lower concentrations of dispersed phase (^5-30% 
Isasomes in the total dispersion); however, for concen¬ 
trations above ^30%, the shearing method works better, 
by which one can produce highly concentrated Isasome 
dispersions (up to ca. 70% of dispersed phase) (Kulkami 
and Glatter 2012). The shearing technique can be easily 
scaled up for the continuous production of Isasome- 
like systems. Recently, it was also shown that by simply 


optimizing shear conditions, one can create W/0 nano- 
structured emulsions (Fig. 2b) without requiring any 
external stabilizer (Kulkami et al. 2010). 

Isasomes allow a great degree of tunability at 
every level of stmctural hierarchy, ranging from the 
lowest, molecular level, through an intermediate, 
nanostructural level, to the particle size, at the highest 
level. Various physical parameters, chemical addi¬ 
tives, and/or preparation methods can induce such 
morphological and behavioral tuning (Kulkami and 
Glatter 2012), some of which are listed in Table 1. 

Hierarchical systems like Isasomes have been 
characterized using a range of different techniques; 
some of them are compiled in Table 2. Small-angle 
► X-ray scattering, cryo-transmission ► electron 
microscopy (cryo-TEM), and ► dynamic light scatter¬ 
ing were found to be most accepted among the others, 
which provide morphological information whereas 
spectroscopic techniques reveal the physicochemical 
modulation of Isasome systems. 
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Hierarchically Structured Lipid Systems, Table 2 Various instrumental techniques used for characterization of lipid-based 
systems 

No. 

Characterization technique 

Information (most important/feasible) gained 

1 

Small-angle X-ray scattering (SAXS) 

Internal nanostructure type, coexistence and lattice parameters of LC phase, 
morphology of Isasomes, time-resolved studies - kinetic events, e.g., phase 
transitions or dynamic changes in particle morphology, after calibration for 
intensity - amount of dispersed phase 

2 

Cryo-TEM 

Internal nanostructure type, coexistence and lattice parameters of LC phase, size 
and morphology of Isasomes 

3 

Dynamic light scattering (DLS) 

Isasome size, size distribution, kinetic events, e.g., stability or aggregation 
phenomena 

4 

Optical light microscopy (polarized and 
cross polarized) 

LC nanostructure of bulk systems, especially to distinguish isotropic and 
anisotropic phases, texture studies, e.g., between lamellar and hexagonal phases, 
kinetics events such as formation and phase transitions to some extent 

5 

Cryo-fleld emission scanning electron 
microscopy (cryo-FESEM) 

3-D information on the particle structure 

6 

Environmental scanning electron 
microscopy (SEM) 

Particle morphology under hydrated conditions 

7 

Atomic force microscopy (AFM) 

Particle morphology 

8 

Confocal laser scanning microscopy 
(CLSM) 

Particle morphology 

9 

Small-angle neutron scattering (SANS) 

Information about internal nanostructure, morphology of Isasomes, some time- 
resolved studies 

10 

Spectroscopic techniques 

Physicochemical modulation of Isasome systems (e.g., effect of additives, existing 
components and physical parameters), diffusion studies 

11 

Rheological techniques 

Viscosity, viscoelasticity, texture/consistency 


The bulk LC phases and their hierarchical structures 
have similarities at the nanostructural level; however, 
the latter are more advantageous from an applications 
perspective. Isasomes are easy to handle, being rela¬ 
tively low viscosity materials, using conventional 
industrial designs customized for liquid handling. 
Use of a dispersed form of LC phases drops the overall 
cost, by utilizing less bulk material. The bulk LC phase 
is usually oil continuous; in contrast, Isasomes are 
internally oil continuous but externally water continu¬ 
ous, typically containing a large excess (80-95%) of 
water. This morphological and chemical combination 
facilitates their loading with hydrophobic and 
hydrophilic as well as amphiphilic functional mole¬ 
cules. These nanostructured assemblies are well- 
known vehicles for drug delivery as well as for the 
controlled release and uptake of active molecules. 
They find further applications in the fields of 
pharmaceuticals, foods, cosmetics, and bioadhesives 
given the possibility of their customization from food 
grade or biocompatible components (Yaghmur and 
Clatter 2009). 


Further Advances in Lipid Structural 
Hierarchy 

Isasomes can be embedded in hydrogels formed via 
addition of hydrogelling agents in the dispersion 
medium. The fluidity of the medium, and hence the 
particle dynamics, can be altered via temperature- 
induced solgel transitions or by altering the concentra¬ 
tion of hydrogelling agent (Kulkami and Clatter 2012). 
Dried thin films made from these hydrogels are capable 
of immobilizing the Isasomes and also drying them 
into fluid isotropic (L 2 ) nanostructures. On the other 
hand, the Isasomes can be regenerated by solubilizing 
such films; more interestingly, the Isasome nanostruc¬ 
ture and size remains practically the same, before 
and after loading/unloading in the films. This opens 
up new types of applications where Isasomes need to 
be stored for a long time or transported somewhere, 
along with functional molecules. Isasome-loaded gels 
and films can potentially be used as a fabrication 
material in the pharmaceutics and medicinal fields, 
where sustained release of components is required. 
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Hierarchically Structured 
Lipid Systems, 

Fig. 4 Thermoreversible 
hydrogel can be used to entrap 
the Isasomes, which can be 
further immobilized into dried 
films. The Isasomes can be 
restored by rehydration of the 
loaded films. In this manner, it 
is possible to store and 
transport the Isasome systems 
before their regeneration for 
specific application 
(Figure adopted from Kulkami 
et al. 2011a) 



Similar to the Isasome systems, the stability and vis¬ 
cosity of W/O-nanostructured emulsions can also be 
tuned by forming the hydrogel in its aqueous reser¬ 
voirs/droplets (Fig. 4). 

Loading of lipid structures with proteins and other 
biomolecules is also known, e.g., proteoliposomes, 
proteocubosomes, or lipid bilayer nanodisks (proteins 
with bicelles), which may sometimes lead to 
nanostructural transitions (Kulkarni et al. 2011b). 
Isasomes with dissimilar nanostructures, charges, or 
functional molecules can be intermixed or transformed 
into fascinating layer-by-layer assemblies which have 
great potential in diverse areas. There are enormous 
opportunities to further develop, explore, and utilize 
new and novel types of lipid-based hierarchical 
structures. 


Summary 

Lipids self-assemble into thermodynamically stable 
liquid crystalline phases of various sizes, shapes, and 
properties, and some of them also mimic biological 
structures. However, in order to ease the exploration 
and increase their applicability, it is customary to 
organize these lipid phases into higher-order architec¬ 
tures exhibiting multiple length scales. Accordingly, 
the O/W emulsions in the particulate form called 
“Isasomes” and W/0 type of emulsions in the 
continuous-hlm form called “nanostructured 
emulsions” appear to be very successful options. The 
essential components for the preparation of Isasomes, 


technical methodology of their preparation, character¬ 
ization techniques, and their tunability have been 
described in the current entry. Some more hierarchical 
level structures designed from lipidic systems are 
reported in the hnal section. Research into the explora¬ 
tion and applications of lipid-based systems is likely to 
continue to develop as a lively biotechnological area. 

Cross-References 

► Atomic Force Microscopy 

► Atomic Force Microscopy (AFM) for Topography 
and Recognition Imaging at Single Molecule Level 

► Cellulose 

► Dynamic Light Scattering 

► Electron Microscopy 

► Functional Roles of Lipids in Membranes 

► Glycerolipids; Chemistry 

► Lipid Mesophases for Crystallizing Membrane 
Proteins 

► Lipid Organization, Aggregation, and 
Self-assembly 

► Membrane Fluidity 

► Membrane Proteins; Structure and Organization 

► Neutron Scattering of Membranes 

► Phase Transitions and Phase Behavior of Lipids 

► Phospholipid Bilayer Nanodiscs: Application to 
Single-Molecule Measurements 

► Supported Lipid Bilayers 

► Vibrational Spectroscopy, A Short History of 

► X-Ray Scattering of Lipid Membranes 
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Synonyms 

Variable-pressure NMR 

Definition 

Pressure is a thermodynamic variable that acts on 
volume of the system without significantly perturbing 
its thermal energy, and therefore is more directly 
coupled to the “structure” of a biomacromolecule in 
solution than temperature. The volume of a biomacro¬ 
molecule (meaning partial volume in solution), nota¬ 
bly of a water-soluble protein, is intimately and 
dynamically coupled to its hydration, which brings 
about a dynamic variability in conformation under 
closely physiological conditions. Thus high-pressure 
(or variable-pressure) NMR, combining pressure 
perturbation with the high-resolution one- and 
multidimensional NMR spectroscopy, enables explo¬ 
ration of details of protein and other macromolecular 
conformational variation in solution in a much wider 
conformational space (from within the folded N to 
the fully unfolded U) than explored by conventional 
NMR spectroscopy. For naturally occurring proteins, 
this possibility is substantiated by the two practical 
rules: (1) (marginal balance) A variety of conformers 
exist between N and U in a narrow range of free 
energy (small AG°) often within several kcal/mol and 
(2) (volume theorem) Under normal conditions, the 
partial molar volume of a folded protein decreases 
(AV < 0) with the loss of the conformational order, 
that is, with the progress of hydration and unfolding 
(Akasaka 2006). 

Technically, high-pressure NMR originated in two 
different approaches: one pressurizing the NMR 
detection coil itself in an autoclave and the other 
pressurizing only the sample solution confined 
in a pressure-resistive cell. For biological macromo¬ 
lecular studies, the latter method combined with 
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a commercial NMR spectrometer is chosen for its high 
magnetic field homogeneity and versatility in pulse 
sequence. Typically, a range of pressure of 0.1^ 
a few hundred MPa has been used, the highest pressure 
being limited so far by the pressure-resistivity of the 
sample cell at about 400 MPa (Akasaka and Yamada 
2001). Various materials have been tested for the cell, 
including plastics, glass, quartz, sapphire, and zirconia. 
Handmade quartz cells (<1 mm in inner diameter) 
have been used successfully, but because of its diffi¬ 
culty in manufacturing, handling, and limited sample 
space, ceramic cells (1-3 mm in inner diameter) made 
of zirconia are beginning to be employed. 

Upon applying pressure p to protein solutions at 
constant temperature, all NMR parameters including 
chemical shift, signal intensity, NOE, line width, spin 
relaxation, and even J-coupling are subject to change, 
as the equilibrium population of a lower volume con- 
former (AV < 0) increases by exp (—pAVfRT)- Chem¬ 
ical shifts are particularly sensitive to pressure, 
representing the intrinsic fluctuations of the folded 
conformer N, including transitions to alternatively 
folded ones N’. With increasing pressure, signal inten¬ 
sities may start to decrease for certain residues, mean¬ 
ing local unfolding into an intermediate conformer I, 
and, at still higher pressure, even total unfolding into 
conformer U. All the above changes are essentially 
reversible with pressure and reflect a small-to-large 
amplitude fluctuations occurring in a time range of 
ps ^ ks, difficult to be detected unless “amplified” by 
pressure. High-pressure NMR shares a common target 
with H/D exchange and relaxation dispersion NMR 
methods in “high-energy” conformers, but is unique 
in that it can directly detect signals from “high-energy” 
conformers, allowing detailed analysis of their struc¬ 
tures and stability. 

Cross-References 

► H/D Exchange 

► J Coupling 

► Nuclear Overhauser Effect 

► Relaxation Dispersion 
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Synonyms 

Fast scanning force microscopy 

Definition 

High-speed AFM (HS-AFM) is a type of atomic force 
microscopy (AFM) that, unlike conventional AFM, 
can take an image very quickly and with relatively 
low imaging force, therefore allowing for visualizing 
the dynamic behavior of biomolecules. 

Introduction 

AFM can take high-resolution images of objects in 
various environments. In particular, its in-liquid 
imaging capability is useful for biological research. 
However, the imaging rate of conventional AFM is 
too low to capture dynamic processes of biomole¬ 
cules (DNA, proteins). HS-AFM has been developed 
to overcome this problem by optimizing all devices 
contained in the instrument for fast scanning and by 
newly developing fast and precise control techniques 
(Ando et al. 2001, 2008). The imaging rate of the 
current HS-AFM has reached 15-25 frames per sec¬ 
ond (fps) for a scan range of ^240x240 nm^ with 
100 scan lines. By further reducing the scan range, 
the imaging rate can exceed 50 fps. Importantly, 
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high-speed imaging with small cantilevers applies 
less force during its scans than conventional AFM 
and therefore, is less invasive. Thus, one can directly 
image functioning biomolecules in action at high 
spatiotemporal resolution, without disturbing their 
biological functions (Kodera et al. 2010; Shibata 
et al. 2010; Uchihashi et al. 2011). The resulting 
molecular movies provide much information on 
both structure and dynamics of a functioning mole¬ 
cule, which significantly facilitates a comprehensive 
understanding of the functional mechanism. 

Imaging Rate and Feedback Bandwidth 

When a sample is scanned over an area of WxW with 
scan velocity Vj in the x-direction and N scan lines, the 
image acquisition time T is given by T = 2WN/Vs. 
Assuming that the sample surface is characterized 
with a single spatial frequency l/X, then the feedback 
scan is executed in the z-direction with frequency 
/ = VJX to trace the sample surface. The feedback 
bandwidth /b of the microscope should be equal to or 
higher than /, and thus, we obtain the relationship 
T > 2WN/(Xfs). In the current implementation of 
FIS-AFM,/b"^ 1 10 kHz, achieving the aforementioned 
imaging rate. 

Devices for HS-AFM 

The tapping mode HS-AFM instrument developed 
at the Ando laboratory (Fig. 1) is commercially 
available worldwide from the Research Institute of 
Biomolecule Metrology, Co. Ltd. (Tshukuba, Japan). 

Cantilevers and related devices: The size of rectan¬ 
gular cantilevers for high-speed imaging is small; 
6-10-pm long, 2-pm wide, and 90-nm thick. They 
have resonant frequencies /c = 2-3.5 MHz in air 
and /c = 0.6-1.2 MHz in water, a spring constant 
kc = 0.1-0.2 N/m, and a quality factor Qc = 2-3 in 
water. Small cantilevers (BL-AC10DS-A2, Olympus: 
/c in air 1.5 MHz,/c in water 600 kHz, ^0.1 N/m) 
are commercially available (Atomic Force F&E 
GmbH, Manheim, Germany; Asylum Research, 
Santa Barbara, USA; Bruker, Calle Tecate, USA). 
Alternatively, small cantilevers with a sharp tip made 
by electron beam deposition are also available. 
Although not yet commercialized, small cantilevers 


will soon be available also from NanoWorld AG 
(Neuchatel, Switzerland). The deflection of a small 
cantilever is measured by an optical beam deflection 
(OBD) detector equipped with a long-working dis¬ 
tance objective lens, a quarter-wavelength plate, and 
a polarization beam splitter (Fig. 1) (Ando et al. 2001). 
The small cantilevers grant 10-20-times higher detec¬ 
tion sensitivity than conventional cantilevers. The 
oscillation amplitude is measured by a peak-hold 
method (at every half cycle of oscillation) (Ando 
et al. 2001) or a Fourier method (at every full cycle 
of oscillation) (Ando et al. 2008). 

Scanner: The maximum displacements of the high¬ 
speed scanner are 1-2 pm in the x-direction, ^3 pm in 
the y-direction, and ^1 pm in the z-direction. The 
displacements are large enough for imaging biomole¬ 
cules. The z-scanner can move at 200 kHz without 
a practical delay, while the x-scanner can move 
at >10 kHz in a triangle waveform. The generation 
of unwanted vibrations is effectively damped by 
analogue feedback Q-control of the z-scanner and by 
digital feed forward Q-control (inverse compensation) 
of the x-scanner (Ando et al. 2008). 

Dynamic PID controller: This controller allows one 
to use an amplitude set point (cantilever oscillation 
amplitude to be maintained constant during imaging) 
very close to the free oscillation amplitude Aq (i.e.. 
As ^0.9 xAq) (Kodera et al. 2006). This large set 
point minimizes the tapping force exerted by the oscil¬ 
lating tip on the sample. In conventional PID control¬ 
lers, this large set point often results in complete 
detachment of the cantilever tip from the sample 
surface at steep downhill regions of the sample 
(i.e., “parachuting”), resulting both in an inability to 
image until parachuting ceases and in a significant 
reduction of the feedback bandwidth. 

Drift compensator: The cantilever is excited by 
a piezoactuator placed at (or near) the cantilever holder 
(Fig. 1). To minimize the tapping force, Aq ^ 1-2 nm is 
used. Therefore, the difference between Aq and As is 
very small (^0.1-0.2 nm). However, Aq easily changes 
with time by>0.2 nm due to drift in the excitation 
power, and therefore, stable imaging is difficult 
under this condition. This problem was solved by 
maintaining the constant average amplitude of a sec¬ 
ond harmonic oscillation of the cantilever (Kodera 
et al. 2006). This drift compensator works effectively, 
and thus enables stable low-invasive and high-speed 
imaging when operating together with the dynamic 
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High-Speed Atomic Force 
Microscopy (AFM), 

Fig. 1 Schematic of an 
HS-AFM instrument. Unlike 
conventional AFM, the 
cantilever and sample are 
arranged upside-down so that 
the cantilever tip points 
upward to the sample 



High-Speed Atomic Force 
Microscopy (AFM), 

Fig. 2 AFM images of 
biomolecules captured by 
FIS-AFM (a) Myosin 
V (tail-truncated myosin V) 
walking along an actin 
filament (Modified from 
Fig. Id of Kodera et al. 2010). 
(b, c) bR under dark (b) and 
light illumination (c) 
(Modified from Fig. 1 a of 
Shibata et al. 2010). The white 
circles indicate trimers and the 
gray circles indicate triads of 
nearest neighbor bR molecules 
each belonging to adjacent 
trimers. (d) Anisotropic 
diffusion of single point 
defects in 2D crystal of 
streptavidin; (e) Process of 
planar lipid bilayer formation 
after placing liposomes on 
a mica surface (Figure 8 of 
Giocondi et al. 2010) 
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PID controller, even under the conditions of tIq 1 nm 
and As = 0.9-0.95 x Aq. 

HS-AFM Imaging of Biomolecules 

Substrate surface: To observe dynamically acting 
protein molecules under physiological conditions, the 
substrate surface must have an appropriate affinity 
for the molecules. To observe dynamic interactions 
between different proteins, selective protein attach¬ 
ment to a surface is required. Thus, we have to prepare 
an appropriate substrate surface depending on the 
dynamic biomolecular event to be visualized. As 
substrate, bare mica surfaces, planar lipid bilayer sur¬ 
faces, and the surfaces of two-dimensional crystals 
of streptavidin formed on a biotin-containing lipid 
bilayer have been used for dynamic AFM imaging 
(Yamamoto et al. 2010). 

Examples of dynamic imaging: Myosin V walking 
along an actin filament has been visualized (Fig. 2a) 
(Kodera et al. 2010). For this observation, partially 
biotinylated actin filaments were immobilized via 
streptavidin on a biotin-containing lipid bilayer 
surface. To facilitate weak sideways adsorption of 
myosin V on the surface, a small fraction of 
a positively charged lipid was included in the bilayer. 
The observed velocity is similar to that measured by 
fluorescence microscopy, indicating very small effects 
of the tip-sample and surface-sample interactions on 
motor activity. The high-resolution movies not only 
provide corroborative evidence for previously pro¬ 
posed or demonstrated molecular behaviors, includ¬ 
ing a lever-arm swing, but also reveal more detailed 
behaviors of the molecules. Likewise, the photo- 
activated dynamic structural change of bacteriorho- 
dopsin (bR) in the purple membrane has been 
visualized by HS-AFM (Shibata et al. 2010). A large 
structural change of bR is observed at the cytoplasmic 
surface; each bR molecule moves outward from the 
center of a trimer by ~0.8 nm upon light illumination 
(Fig. 2b, c). The following dynamic processes have 
also been visualized by HS-AFM: association and 
dissociation of bR trimers at the free interfaces of 
the crystalline phase (Yamashita et al. 2009); aniso¬ 
tropic diffusion of single point defects in 2D crystals 
of streptavidin (Fig. 2d) (Yamamoto et al. 2008); 
amyloid-like fibril formation by lithostathine (Milhiet 
et al. 2010); formation of planar lipid bilayers 


from liposomes (Fig. 2e) (Giocondi et al. 2010); and 
counterclockwise propagation of conformational 
states in Fi-ATPase ring without y subunit (Uchihashi 
et al. 2011). 

Cross-References 

► Atomic Force Microscopy 

► Atomic Force Microscopy (AFM) for Topography 
and Recognition Imaging at Single Molecule Level 

► Phospholipid Bilayer Nanodiscs: Application to 
Single-Molecule Measurements 
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Hilbert Phase Contrast (HPC) 


Hilbert Phase Contrast (HPC) 

► Phase Contrast Electron Microscopy 


Histone-Lysine Af-methyltransferase 

► Protein Lysine Methyltransferase - Computational 
Studies 


History of Infrared Spectroscopy 

► Vibrational Spectroscopy, A Short History of 


History of Raman Spectroscopy 

► Vibrational Spectroscopy, A Short History of 


HIV Protease - Computational Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Definition 

An aspartyl protease that is essential for the life cycle 
of HIV by cleaving newly synthesized polyproteins 
into the mature protein components of the infectious 
HIV virion. 


Basic Characteristics 

Three enzymes that are essential for the replication of 
the human immunodeficiency virus type 1 (HIV-1): 
a reverse transcriptase, a protease, and an integrase. 
These enzymes are therefore important targets in the 
development of anti-AIDS drugs. Common current 
treatment of HIV/AIDS involves combining drugs 


that inhibit the reverse transcriptase and the protease. 

► QM/MM methods have been employed to model all 
three enzymes, to help understand their mechanism, 
specificity, and effects of genetic variation. HIV-1 
protease/inhibitor interactions have been studied 
extensively with QM/MM methods. For example, the 
subtractive ► ONIOM QM/MM method was used to 
investigate the role of a conserved water molecule in 
six enzyme-inhibitor complexes (Suresh et al. 2008). 
This water molecule forms four hydrogen bonds to 
bridge the Ile50/50' NH groups in the active site cleft 
opposite from the aspartyl dyad (the so-called flaps of 
the enzyme dimer) and the CO groups at positions 
P2 and P'l of the inhibitor. The approved inhibitor 
tipranavir is designed to replace the water molecule 
and semiempirical QM/MM (AM1/CHARMM22) 

► Molecular Dynamics Simulations indeed indicated 
that tipranavir interacts with both Asp25/25' and IleSO/ 
50', explaining its potency (Hensen et al. 2004). Garrec 
et al. (2010) used QM/MM ► Car-Parrinello molecular 
dynamics to investigate a new class of HIV-1 protease 
inhibitors, based on the presence of a noncovalent 
interaction between a tertiary amine and a carbonyl 
(Gautier et al. 2006). By considering an amino- 
aldehyde peptide bound to the enzyme in all possible 
protonation states of the aspartyl dyad, they concluded 
that the interaction is disrupted through competition 
with the hydrogen bond network around the conserved 
water molecule. Inhibitors that combine the noncovalent 
tertiary amine-carbonyl interaction with replacing the 
conserved water may therefore be promising. A recent 
study further demonstrates the complementarity of 
QM/MM modeling, docking, and experimental methods 
in the analysis of another new class of HIV-1 protease 
inhibitors (Makatini et al. 2011). 

Cross-References 

► Car-Parrinello Molecular Dynamics 

► Molecular Dynamics Simulations of Lipids 

► ONIOM 

► QM/MM Methods 

► Semiempirical Quantum Mechanical Methods 
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Synonyms 

Reverse transcriptase 

Definition 

Fluman immunodeficiency vims type 1 reverse 
transcriptase (HlV-1 RT), a multifunctional enzyme, 
converts the single-stranded viral genomic RNA 
into the double-stranded DNA, which can then be 
integrated into the host cell’s chromosomes. 

Introduction 

Acquired immunodeficiency syndrome (AIDS), 
caused by the human immunodeficiency virus (HIV), 


has been a worldwide health problem for over 25 years. 
HIV-1 reverse transcriptase (RT), one of the three 
essential enzymes in the HIV replication cycle, is 
an important target for anti-AIDS dmg therapy. Two 
classes of FDA-approved HIV-1 RT inhibitors, 
non-nucleoside/nucleotide (NNRTI) and nucleoside/ 
nucleotide (NRTI), are important components of the 
combination therapy using the highly active anti¬ 
retroviral therapy (HAART). However, the emergence 
of drug resistance and their severe side effects limits 
the efficiency of these antiviral agents. Therefore, new 
potent and highly selective inhibitors are urgently 
required. 

HIV-1 RT functions by converting the single- 
stranded viral genomic RNA into double-stranded 
DNA, which can then be integrated into the genomic 
DNA of the host cell. The process requires at least three 
enzymes: RNA-dependent DNA polymerase, DNA- 
dependent DNA polymerase, and ribonuclease H 
(RNase H). The enzymatic activity of the RNA/DNA- 
dependent DNA polymerase is the copying of either an 
RNA or a DNA template, and that of the RNase H is the 
RNA degradation of the RNA/DNA heteroduplex 
(Skowron and Ogden 2006; Sarafianos et al. 2009). 

HIV-1 RT, an asymmetric heterodimer protein, is 
composed of 1,000 amino acids with two related 
subunits: p66 and p51 (Fig. 1). The larger p66 
subunit contains 560 residues and is responsible for 
the RT catalytic functions. The smaller p51 subunit 
possesses 440 amino acids and exhibits a structural 


thumb 



HIV-1 Reverse Transcriptase - Computational Studies on 
the Polymerase Active Site, Fig. 1 The crystal structure of the 
HIV-1 RT/dsDNA/dTTP ternary complex (Huang et al. 1998) 
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role. The p66 subunit comprises the DNA polymerase 
and RNase H domains. The inhibitor binding sites of 
NNRTI and NRTI are located in the polymerase domain, 
which is composed of four distinct subdomains: fingers 
(residues 1-85 and 118-155), palm (residues 86-117 
and 156-236), thumb (residues 237-318), and connec¬ 
tion (residues 319^26). The p51 also folds similarly to 
the four corresponding subdomains (finger, palm, 
thumb, and connection) but the relative orientations of 
its subdomains are substantially different from those of 
p66 (Sarafianos et al. 2009). 

Active Structure of the HIV-1 RT/dsDNA/ 
dNTP Ternary Complex 

The reaction center of polymerization takes place at 
the polymerase active site (Fig. 1) of HIV-1 RT in 
the palm subdomain harboring a catalytic triad of 
aspartates DUO, D185, and D186. Two bound 
magnesium cations (Mg^"^) are required to form 
a complete active site. By molecular dynamics (MD) 
simulations, both Mg ions were observed to be octa- 
hedrally coordinated by six ligands with the Mg-O 
distance of ^ 2 A (Rungrotmongkol et al. 2007), as 
shown in Fig. 2. The coordination of the AMg^"^ ion at 
catalytic site A is comprised of the carboxylate oxygens 
of D110, D185, and D186, the non-bridging Ola oxy¬ 
gen of the a-phosphate of dTTP, the 3'-OH group of the 
primer terminus, and a water molecule. Note that the 
last two ligands are not present in the crystal structure 
(Huang et al. 1998). At the nucleotide-binding site B, 
the fiMg ^ ion coordinates with the three non-bridging 
Ola, 02p, and Oly oxygens of the triphosphate moiety 
of dTTP, the carboxylate oxygens of DUO and D185, 
and the backbone carbonyl oxygen of V111. The incom¬ 
ing dNTP in the fully deprotonated form is more tightly 
bound to the active site and is likely to be more stable 
than the mono-protonated form due to its greater hydro¬ 
gen bond formation with the positively charged side 
chains of K65 and R72 and the peptide backbones of 
DU3 and A114 (Rungrotmongkol et al. 2007; Frank 
and Carloni 2000). 

Role of the Catalytic Magnesium Ion 

DNA polymerases require divalent metal ions for 
their catalytic function (Bloomfield et al. 2000). 


The functions of a DNA polymerase and its fidelity 
of DNA synthesis in vivo depend on the type of 
divalent cation (e.g., Mg^-", Mn^-", Ca^-", Zn^-", Ni^-", 
Co^"^ or Be^"^) occupying the polymerase active site. In 
the HIV-1 RT, the A^Vlg^"^ ion (at catalytic site A) is 
thought to decrease the pK?i of the 3'-OH group of the 
terminal primer, facilitating the nucleophilic attack on 
the a-phosphorus atom of the dNTP substrate. In con¬ 
trast, the nucleotide-binding eMg^"^ ion (at cite B) is 
believed to stabilize the triphosphate of the dNTP 
substrate and the pyrophosphate (PPi) group prior 
to and after the nucleotidyl transferase reaction, 
respectively. A Be^"^ ion substitution for the Mg ion 
at either the catalytic or the nucleotide-binding site, 
acting as a cofactor for the polymerization reaction 
of HIV-1 RT, has been evaluated theoretically using 
MD simulations in comparison with experimentally 
observed pre-steady-state kinetics (Mendieta et al. 
2008). The tetrahedral character of the Be^"^ coordi¬ 
nation was observed at both sites. Three conserved 
aspartates (DUO, D185, and D186) and a water mol¬ 
ecule act as the ligands for the aB^ ion complex, 
while the coordination environment depends 

on whether Be^"^ or Mg occupies the catalytic site 
A. Interest is focused on the critical distance for 
nucleophilic attack from the 3'-OH oxygen of the 
primer terminus to the a-phosphorus of the dTTP 
substrate, d[03'-Pa]. Having Be^"^ or no metal ion at 
the catalytic site A leads to a significant increase in d 
[03'-Pa] by at least 1.3 A or 2.3 A, respectively, 
relative to that of 3.2 A in the HIV-1 RT ternary 
complex containing two Mg^"^ ions. In contrast, the 
nucleophilic attack distance is not affected by 
replacing Mg^"^ with Be^"^ at the nucleotide-binding 
site. This finding supports the experimental data, i.e., 
that DNA polymerase activity is increased by Be^"^ 
ions at a relatively low concentration through binding 
with the dNTP at site B, while this divalent metal ion 
at high concentration could compete with the Mg 
ion in binding at the catalytic site and thus inhibit the 
DNA polymerization. 

DNA Polymerization 

RT is a member of the family of DNA polymerases 
that assists in DNA replication from the 5'- to 
3'-direction (Skalka and Goff 1993). The mechanis¬ 
tic reaction of DNA polymerization contains two 
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HIV-1 Reverse 
Transcriptase - 
Computational Studies on 
the Polymerase Active Site, 
Fig. 2 The proposed 
mechanism of DNA 
polymerization based on the 
modeled structure of the 
HIV-1 RT/dsDNA/dTTP 
ternary complex 
(Rungrotmongkol et al. 2007), 
where the metal chelation in 
the polymerase active site is 
also shown 


P site 

(priming site) 



steps: deprotonation of the 3'-primer terminus 
followed by the nucleotidyl transfer reaction. Although 
an overall mechanism of polymerization has been 
suggested, the general base in deprotonation is uncer¬ 
tain, e.g., it has been reported to be the active site 
aspartate in T7 DNA polymerase (Florian et al. 
2003), the a-phosphate of dNTP in polymerase (3 
(Bojin and Schlick 2007), and ordered water molecule 
in solfataricus DNA polymerase IV (Wang et al. 
2007). The only existing data for HIV-1 RT, obtained 
from the combined QM/MM MD simulations, demon¬ 
strates the DNA polymerization reaction pathway 
with the possibility of the conserved aspartate D185 
acting as a proton acceptor to abstract the hydrogen 
from the 3'-OH group of the primer sugar ring 
(Rungrotmongkol et al. 2004). Note that D185 is 
equivalent to D654, which seems to be the general 
base in T7 DNA polymerase. After deprotonation, 
the negatively charged hydroxyl moiety of the 3'- 
primer then attacks the a-phosphorus of dTTP with 
formation of the stable pentacovalent intermediate. 
Finally, cleavage of the Pa-03a bond occurs to 
leave pyrophosphate from the newly formed 3'-5' 
phosphodiester, leading to the elongation of the 
DNA primer by an added nucleotide. The hydrogen 
bond interactions with the s-amino group K65, the 
guanidinium group of R74 and the peptide backbone 
of D113 and A114 are maintained along the reaction 
pathway. 
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HMM 

► Hidden Markov Modeling in Single-Molecule 
Biophysics 


Homologue Scanning 

► Mutagenic Scanning to Define Binding Hotspots 


Homology Modeling of Protein 
Structures 

Torsten Schwede 

Swiss Institute of Bioinformatics, Biozentrum 
University of Basel, Basel, Switzerland 

Synonyms 

Biomolecular modeling; Comparative protein struc¬ 
ture modeling; Fold recognition; Protein structure 
modeling; Template-based modeling 

Definition 

The three-dimensional structure of a protein can be 
modeled starting from its amino acid sequence by 
using information from homologous proteins with 
known structure. 


Introduction 

Three-dimensional structures of proteins provide 
invaluable information for our understanding of 
macro-molecular function at an atomic level of detail. 
However, despite the tremendous progress in experi¬ 
mental techniques in recent years, structure solution by 
crystallography or ► NMR remains a complex process, 
and as a consequence for the majority of proteins 
in any given organism no direct experimental struc¬ 
tures are available today. Computational approaches 
for predicting the three-dimensional structure of 
a protein from its amino acid sequence appear, there¬ 
fore, as an attractive alternative (Schwede et al. 2008; 
Venselaar et al. 2009). Various approaches for struc¬ 
ture prediction have been developed in recent years, 
among which homology modeling is currently by far 
the most accurate and reliable method (Moult et al. 
2011). Protein structure homology modeling tech¬ 
niques are used to construct three-dimensional models 
for proteins from their amino acid sequences using 
information from homologous proteins with known 
structure. The expected accuracy of homology models 
depends on the evolutionary distance between the pro¬ 
tein of interest (“target”) and the homologous proteins 
used as structural “templates.” The relationship 
between sequence identity and structural similarity, 
which was first described by Chothia and Lesk for the 
globin family (Chothia and Lesk 1986), still provides 
a good first approximation today for the expected 
accuracy of a homology model. 

Template Detection and Target-Template 
Alignment 

The first step in building a homology model for a target 
protein consists of searching for homologous proteins 
with known experimental structures (Table 1, Fig. 1). 
Pair-wise sequence comparison methods are able to 
detect and correctly align protein sequences sharing 
high sequence similarity. For detecting remote homol¬ 
ogy relationships, more sensitive profile- or Hidden 
Markov Model (HMM)-based methods are applied 
(Remmert et al. 2011). Often, several alternative 
template structures are identified for a given target 
protein. Besides the evolutionary distance to the target, 
other factors such as experimental resolution and 
quality of the template, and presence/absence of 
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Homology Modeling of Protein Structures, Table 1 Resources on experimental structure information and structural annotation 
of genomes 

Resources on experimental structure information 

PDB: The worldwide Protein Data Bank (wwPDB; RCSB PDB (USA), PDBe (Europe), and 
PDBj (Japan)) maintains the single archive of experimental macromolecular structural data 

www.rcsb.org 

www.PDBe.org 

www.PDBj.org 

CATH: CATH is a manually curated hierarchical domain classification of protein structures in 
the Protein Data Bank 

www.cathdb.info/ 

Resources on structural annotation of genomes 

SUPERFAMILY is a database of structural and functional annotation for all proteins and 
genomes based on a collection of hidden Markov models, which represent structural protein 
domains at the SCOP superfamily level 

http://supfam.cs.bris.ac.uk/ 

Gene3D is a database of CATH protein domain assignments for ENSEMBL genomes and 
Uniprot sequences. Einer grained groupings of superfamilies based on function (FunEams) and 
structure (SCs) are also provided 

http://gene3d.biochem.ucl.ac.uk/ 


Target Protein Sequence (multi-domain) 
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1. Template selection & 
Target Template Alignment 


2. Model Building 
(alternative templates 
and alignments) 
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Homology Modeling of Protein Structures, Fig. 1 Typical 
homology modeling workflow. Starting from the amino acid 
sequence of the target protein, possible alternative template 
structures are identified by searching a library of experimental 
structures (1). Based on the aligned templates, typically one or 


relevant ligands or cofactors should be considered 
when selecting suitable templates. In many cases, 
template structures will cover only individual domains 
rather than the full length of the target sequence. 


more alternative models are constructed (2), and assessed for 
their expected accuracy using model quality estimation tools (i), 
resulting in a ranked ensemble of alternative models. Note that 
homology modeling techniques cannot generate models in the 
absence of template information 


The result of the template detection step is an 
alignment between the target sequence and the 
sequences of one or more homologous template 
protein structures. 
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Model Building 

In the model building step, structural information from 
the templates is used to construct a three-dimensional 
model for the target protein using the sequence 
alignment from the previous step to identify 
corresponding residues. Two main approaches are 
used for constructing the model backbone coordinates: 
Fragment-based methods assemble the core of the 
model from structural fragments extracted from 
the templates. Alternatively, restraint optimization 
methods derive spatial restraints from the template 
structures and construct a model which optimally 
satisfies these constraints (Sali and Blundell 1993). 
While this approach is typically applied to individual 
domains, the general concept is valid also for modeling 
multi-domain proteins and oligomeric assemblies - as 
long as suitable homologous template structures are 
available (see Table 2 for a list of commonly used 
modeling tools and resources). 

While structurally conserved regions of the target 
can be modeled using template structure information, 
variable regions have to be reconstructed “de novo” - 
specihcally structurally variable loop regions and 
segments corresponding to insertions and deletions in 
the target-template alignment. Typically sampling 
techniques are used to enumerate an ensemble of 
alternative loop conformations which are then scored 
by empirical scoring functions to select the most 
likely arrangement (Rohl et al. 2004). In general, 
reconstructed loop regions are the least reliable parts 
of a homology model (Fig. 2). 

Modeling the correct side chain packing is often 
(mainly for technical reasons) considered a separate 
step in homology modeling. Typically, alternative side 
chain conformations of amino acids are represented 
in so-called rotamer libraries as a function of back¬ 
bone geometry (Shapovalov and Dunbrack 2011). 
The representation of side-chain conformation as 
discrete states allows for numerically efficient methods 
for searching optimal conformations such as the 
dead-end elimination (DEE) algorithm. 

One of the remaining challenges in protein struc¬ 
ture modeling is the prediction of interactions with 
ligands, cofactors, and substrates. Most current 
methods are based on transferring information 
about ligands observed in homologous protein struc¬ 
tures to the target-binding site (Wass et al. 2011). 
While such comparative approaches work well for 


predicting evolutionary conserved interactions in 
well-characterized protein families, they are limited 
with respect to predicting changes in specificity and 
detecting novel, previously not observed, binding sites. 

Model Refinement 

One of the last steps in model building often consists in 
refining the coordinates of the initial model using 
energy minimization techniques. One aim is to correct 
unfavorable contacts or suboptimal conformations 
introduced during the modeling process by identifying 
energetically more favorable conformations. 

A second goal of model refinement is to move the 
model coordinates closer to the “real structure”: 
Comparative modeling techniques tend to construct 
models, which closely resemble the template struc¬ 
ture. Refining the models away from the templates 
closer to the target has been a question of intensive 
research, and many different solutions have been pro¬ 
posed to address this problem. However, as of today, no 
solution has emerged which was able to consistently 
improve the backbone geometry of models closer to 
the correct reference structure in controlled blind exper¬ 
iments (MacCallum et al. 2011). As a consequence, 
homology models are often not predictive for the struc¬ 
tural changes occurring as a result of sequence varia¬ 
tions, such as single point mutations and SNPs. 

Model Quality Assessment, Validation, and 
Quality Estimation 

Depending on the evolutionary distance between the 
target protein and the template structure, homology 
models can vary signihcantly in accuracy. While 
high-quality models can reach the accuracy of 
medium-resolution X-ray structures, low-quality 
models may contain significant errors and inaccura¬ 
cies. Model validation is, therefore, an essential com¬ 
ponent of the homology modeling process. 

The overall accuracy of different modeling tech¬ 
niques can be assessed retrospectively by comparing 
blind predictions for a large number of target proteins 
to their experimental reference structures. Within the 
biannual CASP experiment - Community Wide 
Experiment on the Critical Assessment of Techniques 
for Protein Structure Prediction (Moult et al. 2011), 
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Homology Modeling of Protein Structures, Table 2 Modeling tools and resources. The following list provides a selection of 
commonly used tools and publicly available resources. For a more complete listing, please refer to the annual web server issue of the 
journal Nucleic Acids Research 

Model portals and resource collections 

Protein Model Portal (PMP): a portal to protein structure information (both experimental and 
models), interactive services for model building, and quality assessment 

www.proteinmodelportal.org 

Protein Modeling 101: a collection of resources and tutorials related to protein structure modeling 

www.proteinmodelportal.org/ 

101/ 

Modeling servers and tools 

HHpred: homology detection and structure prediction by HMM-HMM comparison 

hhpred.tuebingen.mpg.de/ 

hhpred/ 

I-Tasser: service for protein structure predictions built based on multiple-threading alignments by 
LOMETS and iterative TASSER assembly simulations 

zhanglab.ccmb.med.umich.edu/ 

1-TASSER/ 

Modeller: software tool for comparative modeling by satisfaction of spatial restraints 

salilab.org/modeller/ 

ModWeb: comparative protein structure modeling server based on the large-scale protein 
structure modeling pipeline, ModPipe, and Modeller 

salilab.org/modweb 

Phyre2: Protein Homology/analogY Recognition Engine: a web server for fold recognition, 
protein modeling, and binding site prediction 

www.sbg.bio.ic.ac.uk/phyre2/ 

Robetta: protein structure prediction server building both ah initio and comparative models of 
proteins 

robetta.bakerlab.org/ 

SCWRL4: efficient software tool for prediction of protein side-chain conformations using 
a backbone-dependent rotamer library 

dunbrack.fccc.edu/scwrl4/ 

SWISS-MODEL: fully automated web-based service for protein structure homology modeling 

swissmodel.expasy.org 



Homology Modeling of Protein Structures, Fig. 2 Crystal 
structure of transaldolase from Cotynehacterium glutamicum in 
comparison with several homology models. As part of the 
CAMEO evaluation of structure prediction servers, the amino 
acid sequence of transaldolase from Cotynehacterium 
glutamicum was sent to several modeling pipelines, and the 
results were compared to the experimental reference structure 
after the coordinates were released (PDB: 3R5E; Samland 
et al. (2012)). The experimental reference structure is shown as 
ribbon representation, colored from N- to C-terminus from blue 
to red\ homology models are shown as gray tubes. While the 
structural core of the protein was modeled consistently and 
accurately by all modeling servers, significant variations are 
observed in the loop regions and at the surface of the protein, 
where also the largest deviations from the experimental refer¬ 
ence structure are encountered 


homology modeling techniques are evaluated in the 
category “template-based modeling” (Mariani et al. 
2011). The CAMEO project (Continuous Automated 
Model Evaluation, http;//cameo3d.org/) continuously 
evaluates the accuracy and reliability of protein struc¬ 
ture prediction services in a fully automated manner. 
One noteworthy outcome of these efforts is that the 
difference in accuracy between the best performing 
modeling methods is small compared to the differences 
observed between different target proteins. Validation 
and absolute model quality estimation for individual 
models is, therefore, essential. 

Since at the time of modeling, the “real structure” of 
the target protein is unknown, the expected accuracy of 
a model needs to be estimated. Two types of criteria 
are typically applied in model validation: “Hard 
criteria” evaluate the physical and stereochemical 
plausibility of the model’s coordinates - essentially 
applying the same validation criteria as for experimen¬ 
tal structures such as atomic distances, bond lengths, 
and angles. While these criteria are necessary for cor¬ 
rect model geometry, they are not sufficient to ensure 
that the model correctly represents the structure of the 
target protein. “Soft criteria” are used to estimate the 
expected quality of a model, or to rank models within 
an ensemble of alternatives. Widely used for this 
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Homology Modeling of Protein Structures, Table 3 Tools for model validation and quality assessment 

Protein model validation tools (stereochemical quality) 

PROCHECK: checks the stereochemical quality of a protein structure 
by analyzing its overall and residue-by-residue geometry 

www.ebi.ac.uk/thornton-srv/software/PROCHECK/ 

WHAT CHECK: protein verification tools from the WHAT IF 
package; detailed analysis of the stereochemical quality of a protein 
structure or model 

swift.crabi.ru.nl/gv/whatcheck/ 

MOLPROBITY : all-atom validation for macromolecular 
crystallography and NMR structures 

molprobity.biochem.duke.edu/ 

Model quality estimation tools 

QMEAN: quality estimation server for protein structure models using 
a composite scoring function, including distance-dependent interaction 
potentials of mean force, torsion angle potentials, and solvation 
potentials 

swissmodel.expasy.org/qmean/ 

ModEval: evaluates various aspects of model quality using the atomic 
distance-dependent statistical potential DOPE and statistical potential 
GA341 for assessing the reliability of a model 

salilab.org/modevaI/ 

ModFold: a quality assessment program used for global and local 
assessment of models, using the comparison of multiple models 
without structural alignments 

www.reading.ac.uk/bioinf/ModFOLD/ 

ProQres: a neural network predictor that predicts the quality of 
different parts of protein model based on a number of structural features 

WWW. she. su. se/~ bj omw/ProQ/ 

Model quality assessment 

GASP: the community wide experiment on the critical assessment of 
techniques for protein structure prediction aims at an in-depth and 
objective assessment of our current abilities and inabilities in the area 
of protein structure prediction 

predictioncenter.org 

CAMEO: the continuous automated model evaluation project 
continuously evaluates the accuracy and reliability of various protein 
structure prediction services in a fully automated manner 

cameo3d.org 


approach are statistical potentials of mean force for 
atomic interactions, backbone geometry, or solvent 
exposure, which are derived from high-resolution crys¬ 
tal structures (Table 3). In case a large number of 
models based on independent modeling methods are 
available, consensus approaches have been shown to 
be successful in selecting the most accurate models 
within these ensembles (Kryshtafovych et al. 2011). 
In the context of practical applications of models, error 
estimates for individual residues in single models 
would be required on an absolute scale, which is, 
however, a much more difficult task. 

Applications of Models 

With the development of stable, reliable, and fully 
automated techniques, protein structure homology 
models have found widespread use in biomedical 
research as source for structure information in cases 


where no suitable experimental structure of the target 
protein is available. Since different applications have 
various requirements on the quality of the structural 
information, ultimately the accuracy of individual 
models determines their usability for specific applica¬ 
tions (Petrey and Honig 2005; Schwede et al. 2009). 

Typical applications of homology models include 
experimental design for protein engineering and func¬ 
tional characterization studies by site-directed muta¬ 
genesis; the interpretation of disease-related mutations 
and SNPs to understand the molecular mechanisms 
of a disease; phasing of X-ray diffraction data in pro¬ 
tein crystallography by ► molecular replacement 
(Schwarzenbacher et al. 2008); or virtual ligand 
screening in structure-based drug discovery projects. 
Often, computational structure models are combined 
with other sources of low-resolution experimental con¬ 
straints, such as cryo-EM maps or proteomics cross- 
linking data, in an integrative approach to protein 
structure solution (Robinson et al. 2007). 
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Modeling Tools and Resources 

Protein structure homology modeling techniques have 
matured in recent years from research projects into 
stable and reliable automated pipelines. SWISS- 
MODEL was the first web server to offer homology 
modeling as a fully automated service to the research 
community (Guex et al. 2009). Today, a broad variety 
of software tools and web servers for protein structure 
homology modeling are available, which are easy to 
use also for the non-expert. Table 2 provides a selection 
of commonly used modeling tools and servers. 

Summary 

The amount of new protein sequences is growing at an 
astounding rate thanks to ongoing genome sequencing 
and meta-genomics efforts. As a consequence, the gap 
between the number of known protein sequences and 
experimental structures is widening continuously - 
despite the tremendous progress made in optimizing 
the experimental structure determination process, for 
example, as part of worldwide structural genomics 
efforts. Computational techniques for protein structure 
modeling allow extrapolating the available structure 
information to homologous protein sequences 
(Godzik 2011). Today, structural information - either 
experimental or homology models - is available for the 
majority of amino acids in well-characterized model 
organism proteomes like E. coli or T. maritima. Fully 
automated, reliable, and accurate homology modeling 
pipelines make structure information readily available 
for a wide spectrum of structure-guided applications in 
biomedical research. However, homology models still 
fall short of accuracy in comparison to experimental 
structures, and the development of reliable techniques 
for model refinement (Zhang et al. 2011) closer to the 
native structure remains a challenging research goal 
for the coming years. 

Cross-References 

► Alignment of Protein Sequences 

► Macromolecular Crystallography: Overview 

► Modeling of Antibody Structures 

► Molecular Dynamics Simulations of Lipids 

► Molecular Replacement 


► Protein Oligomers 

► Protein Structural Models - Evaluating Quality 

► Structural Genomics 

► Structural Impact of SNPs 
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HYDFIT and Related Packages for 
Linear Molecules 
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Synonyms 

Hydrodynamics of macromolecules 

Definition 

HYDFIT is the global term for the different software 
packages for the fitting of multi-property and multi¬ 
sample sets of experimental data to simple models like 
rigid rods, disks or ellipsoids, or semiflexible wormlike 
chains. 


Introduction 

The HYDFIT programs are intended for the determi¬ 
nation of shape and size, or flexible parameters of 
macromolecules and nanoparticles, which is done 
by comparison of computational predictions with 
experimental data. Single-HYDFIT is intended for 
the analysis of several solution properties of a single 
macromolecule or particle, Multi-HYDFIT works with 
values of properties for a series of samples with varying 
molecular weight or length. These programs are 
straightforward for the determination of the optimum 
values of the model parameters that fit best a series of 
experimental values. The optimization is done by min¬ 
imizing a target (error) function. The comparison 
between experimental and calculated quantities is 
made in terms of equivalent radii or ratios of 
radii which are most adequate for expressing the 
structure-properties relationship in a manner that is 
consistent and equilibrated, for the various properties 
and different macromolecular conformations. The 
equivalent radii depend on both size and shape, while 
the ratios of radii are only function of shape (dimen¬ 
sionless) indicators. Single-HYDFIT is oriented to the 
determination of structural parameters of ellipsoidal, 
cylindrical particles and the flexibility and structure of 
linear semiflexible chains, while Multi-HYDFIT 
focuses on macromolecules which can make a series 
of sizes or molecular weights, maintaining the same 
overall structural parameters, such as cylinders and 
wormlike chains. 

Application to Linear Molecules. Calculation 
of Properties 

The focus of this entry is centered in the application of 
HYDFIT programs to fully or semiflexible linear 
macromolecules. There is a variety of synthetic poly¬ 
mers and biological macromolecules which pos¬ 
sesses a semiflexible skeleton, and can be properly 
represented by the ► wormlike chain model. The 
model describes the whole spectrum of chains with 
different degrees of chain stiffness from rigid rods 
to random coils, and is particularly useful for 
representing stiff chains. In the literature this chain 
is sometimes referred to as a Porod-Kratky chain 
(McNaught and Wilkinson 1997). 
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The basics of HYDFIT falls on the comparison of the 
hydrodynamic properties calculated for a given structure 
or set of structural parameters and the experimental 
properties measured in the laboratory. About 40 years 
ago, Yamakawa and Fuji! presented a theoretical treat¬ 
ment of the solution properties of the wormlike model 
(Yamakawa and Fuji! 1973, 1974). Zimm and others 
(Zimm 1980; Hagerman and Zimm 1982), proposed to 
calculate hydrodynamic properties of flexible macro¬ 
molecules by the Monte Carlo rigid-body method. In 
the rigid-body treatment, conformations of the chain 
are generated by a Monte Carlo procedure. For each 
conformation, the properties are individually computed 
using hydrodynamically rigorous procedures, and the 
final values are the conformational averages. This treat¬ 
ment has been shown to predict correctly experimental 
results for fully flexible random-coil polymers in both 
ideal and good solvent conditions (Garcia de la Torre 
et al. 1984). The latter is the methodology applied by 
HYDFIT, the procedure requires rigid-body hydrody¬ 
namic calculation which will be performed on bead 
models of the ► wormlike chain. For such calculation 
advanced methodology embodied in bead-model hydro¬ 
dynamics (Carrasco and Garcia de la Torre 1999) is 
employed. 


Minimization Function. Equivalent Radii and 
Ratios of Radii 

For the purpose of handling the results of the Monte 
Carlo simulation - in regard to both their dependence 
on structural parameters and their use in the determi¬ 
nation of such parameters from experimental data - it 
is very useful the employment of the concepts of 
equivalent radii and ratios of radii (Ortega and 
Garcia de la Torre 2007). The equivalent radii are the 
values of the radius of a spherical particle that would 
have a given value of some property, they do depend 
only on the size and conformation of the solute, 
which in a ► wormlike chain is defined by the contour 
length, L, the persistence length, a and the diameter, d. 
For the solution hydrodynamic or ► light scattering 
properties, namely, the radius of gyration, Rq, the 

► translational diffusion, D,, or equivalently, the 

► sedimentation coefficient, s, etc. the equivalent 
radii can be defined with straightforward equations. 
Some examples are shown below: 
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HYDFIT and Related Packages for Linear Molecules, 
Fig. 1 HYDFIT contour plot of the percent typical deviation, 
lOOA, with fixed 4 = 23 A, allows to predict correct persistence 
length and of the double-stranded DNA molecule 
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The equivalent radii can be then combined into 
ratios of radii which do not depend on the absolute 
size of the particle but only on the shape or conforma¬ 
tion, therefore being perfect candidates for the goal of 
finding a minimization function which accounts for all 
the properties in a weighted manner and sensitive 
enough to structural parameters, such as shape and 
flexibility of the molecule. In the case of wormlike 
chains, GT, GI, and IT ratios are functions of the 
quantities that determine their conformation, which 
are the ratios of parameters, L/d and L/a, but not on 
the individual values of L, a, and d. The notation XY 
defines the ratio of equivalent radii Oxlay- With the 
results from the computer simulation for the Monte 
Carlo procedure, the ratios GT, GI, and IT are obtained 
for any wormlike molecule with given values of L, a, 
and d, then the equivalent radii aj, aj, and qq are 
extracted and finally D, and [ 77 ] are obtained, the latter 
requiring the value of the molecular weight, or that of 
the relative parameter Mj^. 

The ultimate goal of HYDFIT for studying the 
► wormlike model should be to provide a protocol 
to extract the structurally relevant information. 
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HYDFIT and Related Packages for Linear Molecules, 
Fig. 2 Plots of the experimental values for some hydrodynamic 
properties of double-stranded DNA (filled symbols) and 


calculated values from optimum parameters obtained by 
HYDFIT (line), a = 560 A, = 195.0 Da/A, and d = 23 A 


contained in parameters like a, Mj^, and d, from exper¬ 
imental data. Usually, experimental values of the solu¬ 
tion properties are available for a series of samples 
with varying molecular weight. In order to analyze 
simultaneously several properties of various samples, 
in a search for structural parameters, HYDFIT follows 
a strategy (Ortega and Garcia de la Torre 2007), 
implemented in both the Single-HYDFIT and Multi- 
HYDFIT computer programs, which is based on 
equivalent radii. 

Y denotes any of the observable properties, Rq, D,, 
s, [t]], etc. As it has been just noted above, previous 
results allow the prediction of these properties as 
functions of molecular weight from given values of 
the parameters. In the HYDFIT strategy, the properties 
are transformed into equivalent radii, ay, which are 
considered functions of molecular weight, with 


a dependence in turn on the model parameters. In the 
case of the ► wormlike chain, this can be expressed as 
ay (cal) (M; a, M/^, d). Then, the HYDFIT procedure 
optimizes the parameters seeking the minimization of 
a target function such as: 




, A, 

-F 



-1 


Y 


/ay (cal) — fly (exp) 
V fly (exp) 


where the outermost sum runs in the Multi-HYDFIT 
program over the samples with different molec¬ 
ular weight and the innermost sum runs over the 
available properties for each sample. In the case of 










HYDFIT and Related Packages for Linear Molecules 


1001 


Single-HYDFIT, the target function is the same 
containing only the innermost sums, as only one set of 
samples is analyzed. The advantage of using ratios of 
radii is that the ay are different but of the same order of 
magnitude, and in this way the properties are given 
a similar importance. Furthermore, the target function 
uses relative deviations, {ay (cal) — ay {exp))/ay (exp). 
The user of HYDFIT could optionally use different 
weights Wy for the various properties for specific pur¬ 
poses - for instance, to weight the properties according 
to their experimental precision. For more details and 
examples of this strategy, see the original reference 
(Ortega and Garcia de la Torre 2007). 

If the interest is the analysis of data for a single 
sample, there is also an alternative route based on 
the ratios of radii, which determines primarily the 
particle’s shape, described by the parameter p, instead 
of the two sizes L and d\ this is particularly useful, for 
instance, in the cylinder-shaped particle’s case. For 
this purpose another target function based on ratios is 
defined, and included in the program Single-HYDFIT: 

E >^xF[XT(cal) - YT(exp)]" 

V xr / XY 

Examples of HYDFIT Calculations 

To express the accuracy of HYDFIT software, some 
examples of the calculation of the parameters a, Mj^, 
and d of some synthetic and biological polymers are 
shown here. Results for the double-stranded DNA 
molecule, which is very well determined and whose 
experimental properties are available in large amount, 
are presented and, as an example of synthetic poly¬ 
mers, the calculations made for poly isobutylene (PIB), 
an extreme case of a very flexible chain are shown 
(Fig. 1). 

DNA is the paradigmatic example of the ► wormlike 
chain, its solution properties have been determined by 
many authors. To determine the mass per unit length 
{Ml) and the persistence length, a (which is an estimation 
of its rigidity), HYDFIT is run with the input of 147 
experimental property-molecular weight values. With 
one single execution of Multi-HYDFIT, results are 
obtained, in perfect agreement with the established values 
for this macromolecule. Figure 1 shows contour plots of 
the A function, indicating the minimum of this function at 
A/z. = 195.0 ± 4.0 Da/A and a = 560 ± 20 A (Fig. 2). 



HYDFIT and Related Packages for Linear Molecules, 
Fig. 3 HYDFIT contour plot of the percent typical deviation, 
lOOA, of the poly isobutylene molecule with fixed d = 6.9 A, 
showing the minimum of the deviation at values of persistence 
length, a = 7.1 A, and My = 20.5 Da/A 


To show the validity of the methodology 
implemented in HYDFIT for the whole range of rigid¬ 
ities, the calculation of the wormlike parameters of the 
PIB, a very flexible wormlike chain is also included, in 
contrast to the rigidity of the DNA molecule. As can be 
seen in Fig. 3, the parameters found for this molecule 
are — 20.5 Da/A and a = 7.1 A, being d = l.l A. 


Cross-References 

► Dynamic Light Scattering 

► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules 

► Sedimentation Velocity Analytical 
Ultracentrifugation 

► Worm-Like Chain (WLC) Model 

► X-Ray Scattering of Lipid Membranes 
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Synonyms 

Hydrodynamics of macromolecules 


Definition 

The HYDRO suite is a collection of computer tools for 
predicting hydrodynamic and other solution properties 


from macromolecular structures, and to gather struc¬ 
tural information from such properties. 

A collection of free-domain, user-friendly, and 
well-documented computer programs to predict prop¬ 
erties from the three-dimensional shape of rigid 
particles, with various level of resolution, from very 
coarse-grained models to atomic level. 

Introduction 

Hydrodynamic properties, like translational and rota¬ 
tional diffusion, intrinsic viscosities, relaxation times, 
and other properties like those related to scattering of 
radiation, for example, the radius of gyration or the 
distribution of distances, are, in the case of rigid parti¬ 
cles, directly determined from their size and shape, and 
are, therefore, potential sources of information about 
their overall structures. Thus, the validation of struc¬ 
tures obtained by, say crystallography, microscopy, 
etc., against experimental measurements of such solu¬ 
tion properties, or the determination of structural infor¬ 
mation by global fitting or optimization, requires the 
employment of such predictive computational tools. 

Bead Models and Computer Programs 

All the computational procedures in the HYDRO suite 
represent the structure of the rigid particle as a bead 
model, which is an array of N spheres such that its 
overall size and shape reproduces adequately those of 
the particle under consideration (See Fig. la). The 
reason for using beads as the building blocks is that the 
hydrodynamics of a spherical particle is simpler than that 
of any other shape. In an array of spherical elements, an 
important aspect is the hydrodynamic interaction (HI), 
which makes the friction experienced by any element 
dependent on those of the rest of them. The theoretical 
and computational treatment of HI is also simpler with 
spherical elements. The HI effect is the central aspect of 
the hydrodynamics, and it is remarkable that it deter¬ 
mines the computing time, which increases remarkably 
with the number of beads, being proportional to N^. 

The idea of using bead models to represent macro¬ 
molecules comes from classical works in polymer 
physics, which proposed to model chain macromole¬ 
cules, either coiled because of flexibility, or those 
appearing as rigid straight rods, as strings of beads. 
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HYDRO Suite of Computer 
Programs for Solution 
Properties of Rigid 
Macromolecules, Fig. 1 

(a) Scheme of a bead model, in 
strict sense, (b) Scheme of 
a bead/shell model 


a 



b 



The concept was adopted for modeling the com¬ 
plex shapes of rigid biological macromolecules by 
Bloomfield et al. (1967). (For a review see Garcia de 
la Torre and Bloomfield 1981). In the early applications, 
models were constructed as agglomerates of a moderate 
number (such that the computing requirements were 
accessible by them) of non-overlapping spheres. The 
more advanced procedures are based on the bead/shell 
model, in which a large number of small “minibeads” are 
placed on the surface of the particle (see Fig. lb). 

HYDRO+-I- 

This computer program implements the simplest bead 
model, such as that in Fig. la. A real example is shown 
in Fig. 2a. The simplicity of this kind of model allows 
for an extremely fast calculation of properties, but 
the initial versions of the computational procedures 
(program HYDRO) had some deficiencies regarding 
the prediction of rotational diffusion intrinsic viscos¬ 
ity. In the latest version, HYDRO-h- (Garcia de la 
Torre et al. 2007), these problems have been corrected. 
The user has to supply a main input file plus another 
file containing a list of the three coordinates and radii 
of the beads. This method is usually applied to low- 
resolution models, but the number of beads in the 
model is not limited, and it can be applied to any 
other instance like, for instance, elongated filamentous 
structures modeled as strings of beads. 

HYDROPRO and HYDRONMR 

These programs (Garcia de la Torre et al. 2000a, b) 
were intended for the calculation of solution properties 
of models with atomic resolution, starting from atomic 
coordinates contained in PDB-formatted files. In the 
primary hydrodynamic model, whose properties are to 
be calculated, each atom is replaced by a sphere. The 
radii of the elements are larger than the van der Waals 


atomic radii, including a contribution due to hydration. 
From a global fit for a large set of proteins this radius 
has been estimated to be = 3.0 A, which corresponds 
well to a typical van der Waals radius of = 1.8 A plus 
a monolayer of hydration of about = 1.2 A. Properties 
predicted by HYDROPRO with this setting agree 
well with experimental data, within statistical error. 
HYDRONMR predicts the correlation time of mac- 
romolecular structures and, for globular proteins, the 
sequence of T1 and T2 relaxation times for each amino 
acid residue. An example showing the quality of the 
prediction is shown in Fig. 3. 

Although HYDROPRO was designed for atomic- 
resolution elements, it has recently been noticed 
(Frembgen and Elcock 2009; Ortega et al. 2011) that 
it works fairly well with primary elements based on 
more coarse-grained models in which each bead rep¬ 
resents not an atom, but a full amino acid residue. In 
this case, the structural file would be a PDB-formatted 
list of the Ca coordinates, where the beads will be 
centered. The radius of the primary element should 
be in this case about = 5.1 A. The quality of the predic¬ 
tion is practically as good as that of fully atomic model, 
which indicates that such level of detail, although avail¬ 
able and of course useful for the HYDROPRO calcula¬ 
tion, is not fully necessary. Yet another case of pushing 
the HYDROPRO calculation for an even lower resolu¬ 
tion model is the possibility of applying this program for 
models derived from ah initio structural determination 
using ► small-angle scattering, which provides a model 
of large “dummy pseudo-atoms” (Ackerman et al. 
2003). For each individual stmcture, the radius of the 
elements in the primary mode has to be adjusted as to 
reproduce the hydrated volume estimated for the protein. 

HYDROMIC 

While atomic- and residue-level structures can be 
routinely determined for moderately small proteins 
and oligonucleotides, much larger macromolecules 
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HYDRO Suite of Computer 
Programs for Solution 
Properties of Rigid 
Macromolecules, Fig. 2 

(a) HYDRO++ bead model for 
an IgG3 antibody molecule. 

(b) HYDROPRO and 
HYDRONMR primary 
models for lysozyme 

(c) HYDROMIC model for 
CCT chaperonin 

(d) HYDROSUB model for 
wild type and mutant IgG3 
antibodies 



HYDRO Suite of Computer 
Programs for Solution 
Properties of Rigid 
Macromolecules, 

Fig. 3 Sequence of the ratio 
of NMR relaxation times 
T1/T2, calculated by 
HYDRONMR for the outer 
surface protein OSP, for amino 
acids 100-200, showing the 
agreement between 
predictions and experimental 
results 



and macromolecular complexes are investigated by 
► cryo-electron microscopy and related techniques, 
which supply three-dimensional reconstructions of the 
particle in the form of an electron-density map (see 
Fig. 2c). In the same way as HYDROPRO works with 


the atomic or residue coordinates in a PDB-formatted 
file, there is another program, HYDROMIC, which 
accepts the outcome of those microscopy methods for 
the prediction of solution properties. (For an example, 
see Garcia de la Torre et al. 2001) 
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HYDROSUB 

Large macromolecules and macromolecular com¬ 
plexes are often constituted by well-defined domains 
or subunits. Even if high-resolution information of 
those structures were available (which is a rare case, 
as these structures are not amenable to X-ray or 

► NMR determination), the main structural aspect is 
the overall arrangement of these suhunits. Modeling 
those subunits with simple shapes like ellipsoids or 
cylinders, allows the reduction of the structural 
description to such overall aspects, which may he 
those determining both the solution properties and the 
main features of their physiological function. This is 
the purpose of program HYDROSUB (Garcia de la 
Torre and Carrasco 2002), whose structural input file 
is a list of the sizes, positions, and orientations of 
the subunits. The size and shape of the individual can 
he obtained from the atomic-level structure, when it is 
available from crystallography (this is the origin of the 
“crystallohydrodynamics” term) or other higher- 
resolution model. Alternatively, the information on 
subunits can be gathered from if their experimental 
solution properties are available, and analyzed in 
terms of the classical theories for ellipsoids and rods. 
Then the HYDROSUB user would specify the Carte¬ 
sian coordinates of the center of each subunit, and the 
two polar angles (only two are needed thanks to the 
axial symmetry of the subunits) that determine the 
orientation of the subunit in space, all with reference 
to any conveniently chosen system of Cartesian axes. 

Other Programs 

This entry has concentrated on the HYDROxxx pro¬ 
grams that perform the basic task of predicting prop¬ 
erties from structure. The usual purpose of the users 
may go in the inverse direction: determination of struc¬ 
tural information from solution properties. For such 
“inverse problem” we have devised strategies based 
on optimization protocols - the global fitting approach, 
described in a separate article in this Encyclopedia. 
We just mention the names of the tools for such pur¬ 
pose: ► Single-HYDFIT, ► Multi-HYDFIT, and 

► HYDROFIT. The latter is closely related to the 
HYDROxxx programs and other tools, genetically 
named Multi-HYDROxxx, which facilitate the 
HYDROxxx calculations for a number of plausible 


structures (eventually taking advantage of multi-core 
computing platforms), from which the optimum, best 
fitting structure can be selected by HYDROFIT. 

This entry treats exclusively the tools for rigid par¬ 
ticles. Flexible entities require different procedures 
that, in addition to hydrodynamics, consider their con¬ 
formational variability. This is particularly important 
when considering not just overall coefficients like 
those of sedimentation or viscosity, but problems 
related to the internal dynamics of the flexible particle. 
Physical methods adequate for these problems are 
Monte Carlo and Brownian dynamics simulation. 
Computer programs for carrying out easily those 
simulations, for models quite similar to those of the 
rigid particles, are also available (Garcia de la Torre et 
al. 2010). 

Cross-References 

► Crystallohydrodynamics of IgG 

► Dynamic Light Scattering 

► Electron Microscopy 

► HYDFIT and Related Packages for Linear 
Molecules 

► NMR 

► X-Ray Scattering of Lipid Membranes 
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a class of glycoconjugate (poly-deoxyribose or poly- 
ribose backbones linked by phosphor-diester instead of 
glycosidic bonds). 

This means that compared with proteins, the gram¬ 
mar of enquiry is somewhat different: We can use 
hydrodynamic methods to ascertain the molecular 
weight distribution (or the ratio of molar mass aver¬ 
ages, such as M^/M^v), overall conformation type 
(between the extremes of globular/spherical, rod 
and random coil), flexibility (as manifested by the 
persistence length Lp), and quaternary interactions 
(self-association or interactions with other macromol¬ 
ecules or small ligands). 


Representation of Solution Conformation 
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Synonyms 

Hydrodynamics of macromolecules 


Introduction 

Polysaccharides, mucins, and other large molecular 
weight glycoconjugates tend to exhibit quite different 
hydrodynamic properties compared with proteins. 
This is not only because of their general larger size 
(reaching to molar masses >50 x 10® g/mol) for some 
polysaccharides like amylopectin, their greater non¬ 
ideality (through molecular co-exclusion and charge 
effects - as represented by virial coefficients), and 
greater diversity of shapes and flexibilities. The pri¬ 
mary structure of carbohydrate polymers is not coded 
by a genetic template; so they are also polydisperse 
(as represented by their molecular weight distribution 
or composition distribution). Nucleic acids are also 


The Haug Triangle 

In the simplest of terms, the solution conformation of 
polysaccharides can be described as spheres, coils, or 
rods (Smidsrpd and Andresen 1979). These three 
“extremes” of conformation are often represented as 
the points of the Haug triangle (Fig. 1). 

Conformation Zoning (Normalized Scaling 
Relations) 

An improvement on the Haug triangle is conformation 
zoning (or normalized scaling relations) which quali¬ 
tatively estimates the conformation of polysaccharide 
based on a series of hydrodynamic measurements 
(Fig. 2a). The sedimentation conformation zoning 
(Pavlov et al. 1999) plot kML versus [s]IMi^ enables 
an estimate of the “overall” solution conformation of 
a macromolecule in solution ranging from Zone 
A (rigid rod) to Zone E (globular or branched). The 
parameter [s] is related to the sedimentation coeffi¬ 
cient by the relation 


■S°20,w??20,w 

(l — vp20,w) 


( 1 ) 


P 2 Q,w and 772 o,w are, respectively, the density and vis¬ 
cosity of water at 20.0°C and Mj^ is the mass per unit 
length 


Ml = 


m 

1 


( 2 ) 





Hydrodynamic Modeling of Carbohydrate Polymers 


1007 



Fujita (1962) examined the possibility of 
transforming a distribution of sedimentation coeffi¬ 
cients into a distribution of molecular weights for 
linear polymers and concluded that the sedimentation 
velocity determination of g(s) allows the evaluation of 
the molecular weight distribution of a given polymer 
if information about the relation between s and M is 
available from other sources (e.g., Eq. 3b). In Fujita 
(1962), the sedimentation coefficient s was given for 
the case of random coils 5 = More generally 

s = where b = 0.4-0.5 for a coil, ^0.15-0.2 for 
a rod, and ^0.67 for a sphere. 


Hydrodynamic Modeling of Carbohydrate Polymers, 
Fig. 1 Schematic representation of the “extremes” of polysac¬ 
charide conformation (Hang triangle), where most polysaccha¬ 
ride conformations will be described by a position on either the 
rod-coil or coil-sphere edges of the triangle 

where m is the mass of an average monomer and / is the 
length of an average monomer ( ^0.5 nm). 

Pavlov et al. (1999) described a further procedure to 
represent the conformation of polymers in solution 
based on the relationship between their molar mass, 
intrinsic viscosity, and mass per unit length, M^, and 
has been recently applied to sugar beet pectin and its 
homogalacturonan (smooth) and rhamnogalacturonan 
(hairy) regions (Fig. 2b), and it has been demonstrated 
that the smooth regions tend to behave more like rods 
and the hairy regions more like flexible coils. 

For a homologous series of polysaccharides of dif¬ 
ferent molecular weights, the conformation can be 
estimated from the molecular weight dependency of 
a number of hydrodynamic parameters, for example, 
intrinsic viscosity ([?)]), sedimentation coefficient 
(‘i' 20 ,w)> root-mean-square “radius of gyration” (r^), 
and the translational diffusion coefficient 
(Fig. 3a-d). 


f(M) = g(s).b.k"‘/^s(’’'^)/'’ (4) 

To do the transformation, the conformation type or 
b needs to be known and at least one pair of s-M values 
is needed to define the ]<!'. This method only applies to 
the infinite dilution or non-ideality free sedimentation 
coefficient distribution and therefore valid for values 
of 5 (or a distribution of j values) extrapolated to zero 
concentration or s values measured at low enough 
concentrations where non-ideality effects are small 
(Harding et al. 2011). 

Tsvetkov, Eskin, and Frenkel Relations 

The validity of the MHKS parameters can be further 
explored by the calculation of their corresponding 
Tsvetkov, Eskin, and Erenkel (TEE) relations 
(Tsvetkov et al. 1970). 


a = 2-3h 

(5a) 

h=\-c 

(5b) 

Q. \ 

3 

(5c) 


[rj] = k'M^ 

(3a) 


(3b) 

II 

(3c) 


(3d) 

where the power law coefficients a, h, c 
on the polymer conformation (Table 1). 

, and e depend 


The p Parameter 

A further estimate of molecular conformation can be 
obtained by the p parameter which has theoretical 
limits of 0.78, 1.7, and 2 for hard spheres, random 
coils (0-conditions), and rigid rods, respectively 
(Burchard 1992). 



I'H 
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Hydrodynamic Modeling of 
Carbohydrate Polymers, 
Fig. 2 (a) The sedimentation 
conformation zoning plot. 
Zone A: rigid rod; Zone B: 
rod; Zone C: semiflexible coil; 
Zone D: random coil; Zone E: 
globular sphere (Adapted from 
Pavlov et al. 1999). 

(b) Normalized scaling plot of 
[tiIMl versus M„/Ml 
(A dapted from Pavlov et al. 
1999; Moms et al. 2010) 
where the solution 
conformations for pectin 
A (■) and its RG-I (•) and 
HG (A) fractions are 
indicated, a typical citrus 
pectin is shown for 
comparison (T). All other 
symbols are as defined 
previously (Pavlov et al. 

1999): schizophyllan (□), 
DNA (O), globular proteins 
(A), xanthan, 

poly( 1 -vinyl-2-pyrrolidone) 
(0), cellulose nitrate (+), 
pullulan (x), methyl cellulose 
(*), poly-a-methylstyrene (—) 
and polystyrene (I) 



3 4 

log(10®MJML) 


Translational Frictional Ratio and Perrin Function 

The translational frictional ratio, flfo, is a parameter 
which depends on conformation and molecular expan¬ 
sion through hydration effects (Tanford 1961). It can 
be measured experimentally from the sedimentation 
coefficient, hydrodynamic radius, or translational dif¬ 
fusion coefficient and molecular weight: 

/ ^ Af,,(l - vp2o„) f AnNA \ ^ 

/o (A^A67t7?20,„,‘y°20,H') 


f_ ^ ksT f AnNA \ 
fo (67t??20.H'^°20.H-) \3vM„) 

where is Avogadro’s number and kg is the 
Boltzmann constant./is the friction coefficient of the 
molecule and/o the corresponding value for a spherical 
particle of the same mass and (anhydrous) volume 
(Tanford 1961). It should therefore be noted that the 
translational frictional ratio has the same molar mass 
dependency as the hydrodynamic radius. 




L = (7d) 

./b 


3vM, 


(7b) 
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Hydrodynamic Modeling of Carbohydrate Polymers, 
Fig. 3 The Mark-Houwink-Kuhn-Sakurada (MHKS) plots 
for glycogen. The slopes of all four plots are consistent with 
a spherical (globular) conformation (Zone E). (a) the MHKS 



M„{gmol 


viscosity plot (a = —0.07); (b) the MHKS sedimentation plot 
(b = 0.71); (c) the MHKS r^ plot (c = 0.33); and (d) the MHKS 
diffusion plot (e = 0.40) 


Hydrodynamic Modeling of Carbohydrate Polymers, 
Table 1 The Mark-Houwink-Kuhn-Sakurada (MHKS) 
power law exponents (a, b, c, and e), Wales-van Holde (ks/[r|]) 
and p parameters for the five different conformations described 
by sedimentation conformation zoning (see Fig. 2a) 



Zone A 

Zone B 

Zone C 

Zone D 

Zone E 

a 

>1.4 

0.8-1.4 

0.5-0.8 

0.2-0.5 

0.0 

b 

<0.2 

0.2-0.4 

0.4-0.5 

0.5-0.6 

0.67 

c 

>0.8 

0.6-0.8 

0.5-0.6 

0.4-0.5 

0.33 

E 

>0.8 

0.6-0.8 

0.5-0.6 

0.4-0.5 

0.33 

kJhl 

<0.2 

0.2-0.4 

0.4-1.0 

1.0-1.4 

1.6 

II 

>2.0 

1.0-2.0 



0.78-1.0 


Knowledge of the hydration, (5, (g or solvent per g of 
macromolecule) allows the estimation of the Perrin 
(frictional ratio due to shape) parameter, P. 



V 

v/o; 

(v+(5) 


The axial ratio (a/b) can be calculated from the 
Perrin parameter (Fig. 4) using, for example, the 
ELLIPSI routine (Harding 1997). 
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a/b = 4.30 b/c = 1.00 

Hydrodynamic Modeling of Carbohydrate Polymers, 
Fig. 4 Schematic representation of the solution conformation 
of glycogen (Zone E) in terms of prolate ellipsoids (x, y, and 
z represent the orthogonal axes in which the ellipsoid lies, and a, 
b, and c are ellipsoid semi-axes (a > b > c) in the x, y, and 
z directions with c = a for an oblate ellipsoid and c = b for 
a prolate ellipsoid). The axial ratio of ~ 4.3 is calculated from 
P= 1.2 


Hydrodynamic Modeling of Carbohydrate Polymers, 
Fig. 5 The molar mass dependency of the shrinking factors 
g (■), g', (•) and h (A) for glycogen. The slopes for g, g', and 
h are —0.55, —0.73, and —0.17, respectively, which correspond 
to estimated degrees of branching of 40%, 9%, and 8.5%. N.B. 
the values for g, g', and h are very sensitive to the choice of 
model for the linear polymer of the same molar mass (in this case 
pullulan) and this may go some way to explaining the different 
estimations for the number of branches 


Wales-van Holde Ratio 

The Wales-van Holde ratio, R, is a hydration- 
independent estimation of conformation (Wales and 
van Holde 1954). 


ks 


h = 


fbranched ^ Ifo) branched 


fib 


(f/fo),h 


(lOc) 


linear 


(9) 


As with the Perrin function molecules, the axial 


The molar mass dependency of g, g', and h allows 
an estimation of the number of branch points per mol¬ 
ecule, n, and M^u i= the molar mass of the 

branching unit (Eq. 1 la, b, and c), M^u from which we 
calculate the number of monosaccharide residues per 
branch, for example, MbuH^I for glycogen. 


ratio (a/b) can be calculated from the Wales-van 

Holde ratio using, for example, the ELLIPS1 routine 
(Harding 1997). 

8 = 

(-7 

0.5 An 

h 

(lla) 

Shrinking Factors 

The degree of branching of a hyper-branched macro¬ 
molecule can be estimated from the shrinking factors 

8' = 

0-7 

0.5 An 

+ 97r_ 

h' 

(lib) 

(Burchard et al. 1980; Zimm and Stockmayer 1949), 
g, g', and h (Pig. 5). 

h = 

0 -;) 

0.5 An 

9n 

h" 

(11c) 


g = 


I'g^zhranched 

h'g^zlinear^ 


(10a) 


where h, h', and h" are the slopes of the molar mass 
dependencies of g, g', and h, respectively. 


8 = 


iv] 


branched 


linear 


Draining Effects 

(10b) researcher needs to be aware of the possible con¬ 

tribution from intrachain solvent draining effects 
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where some solvent is conjectured to freely flow 
through the interstices between the atoms or segments 
of a macromolecule, although these effects are usually 
small compared to the dominant hydrodynamic inter¬ 
actions between such segments (Tanford 1961). 




Estimation of Persistence Length 

The linear flexibility of polymer chains can also be 
represented quantitatively in terms of the persistence 
length, Lp, of equivalent worm-like chains where the 
persistence length is defined as the average projection 
length along the initial direction of the polymer chain. 
In the case of a theoretical perfect random coil, Lp = 0, 
and for the equivalent rigid rod (see Harding 1997) 
Lp = oo, although in practice, limits of ^ 1 nm for 
random coils (e.g., pullulan) and 200 nm for a rod (e.g., 
schizophyllan) are more appropriate (Tombs and 
Harding 1998). 

Burchard-Stockmayer-Fixman (BSF) Plot 

This is perhaps the simplest way of estimating the 
persistence length. It involves plotting 
versus and the persistence length is calculated 

from the intercept. Kg (Stockmayer and Fixman 1963) 
(Fig. 6 a). 


Yamakawa-Fujii Equation 

Hearst and Stockmayer (1962) first reported the sedi¬ 
mentation coefficient in relation to worm-like chain 
parameters, later refined by Yamakawa and Fuji! 
(1973). The original relation given by Yamakawa and 
Fujii relating the sedimentation coefficient: 


Ml( 1 - vpo) 
2>nr)pNA 


X 


1.843 


' \ 

2MlLJ 


1/2 

-f A2 + A3 


f M„, \ 



Yamakawa and Fujii (1973) showed that A 2 = 
ln(d/2Lp) and A 3 = 0.1382 if the Lp is much larger 
than the chain diameter, d. The persistence length can 
then be estimated from the slope of versus 
(Fig. 6 d). 


/2L \ 

where (P is the Flory coefficient ^ 2.86 x 10 ^® mor\ 

Hearst Approach 

The Hearst approach is similar to BSF and it involves 
the calculation of l/Kg from the slope of [r/]/M^. versus 
(Fig. 6 b) Hearst (1963). 

Bushin-Bohdanecky Method 

This is a popular method devised by Bushin et al. 
(1981) and Bohdanecky (1983) for estimating chain 
persistence lengths particularly for semiflexible poly¬ 
mers, and has been applied to a range of polysaccha¬ 
rides. In its simplest form, the Bushin-Bohdanecky 

1/2 

method involves plotting -jr-^ versus MJ and 

\[mJ 

from the slope Lp can be calculated using the following 
relation and tabulated values (Bohdanecky 1983) of 
the coefficient Bq (Fig. 6 c): 


The HYDFIT Algorithm 

The way these approaches are implemented can lead to 
a significant variability in the results, that is, contrary 
to expectation, Lp is model dependent (Ortega and 
Torre 2007). This is ably demonstrated by the different 
persistence lengths calculated by the Burchard- 
Stockmayer-Fixman, Hearst, Bushin-Bohdanecky, 
and Yamakawa-Fujii approaches (Fig. 6 a-d). The per¬ 
sistence length and mass per unit length can be esti¬ 
mated using Multi-HYDFIT program which considers 
data sets of hydrodynamic parameters for different 
molecular weights. It then performs a minimization 
procedure, finding the best values of Mi^ and Lp and 
chain diameter d satisfying the Bushin-Bohdanecky 
(Bushin et al. 1981; Bohdanecky 1983) and Yamakawa 
and Fujii (1973) equations (Eqs. 13 and 14). The Multi- 
HYDFIT program then “floats” the variable parame¬ 
ters in order to find a minimum of the multi-sample 
target (error) function, A (Ortega and Torre 2007) 
(Fig. 6 e). In this procedure, A is calculated using 
equivalent radii, where the equivalent radius (aj is 
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M,^''^2(gi/2n^Q|-i/2j 


e 



0.2200 

0.2200 

0.2373 

0.2560 

0.2761 

0.2978 

0.3212 

0.3465 

0.3737 

0.4031 

0.4348 

0.4690 

0.5059 

0.5457 

0.5886 

0.6349 

0.6849 

0.7387 

0.7968 

0.8595 

0.9271 

1.000 

1.000 


Lp (nm) 


Hydrodynamic Modeling of Carbohydrate Polymers, 
Fig. 6 The estimation of the persistence length, Lp, for konjac 
glucomannan (Zone C) using different approaches, (a) BSF plot 
where Lp = 3 nm from the intercept, (b) Hearst plot where 
Lp = 1 nm from the slope, (c) Bushin-Bohdanecky plot where 
Lp = 8 nm from the slope, (d) Yamakawa-Fujii plot where 
Lp = 34 nm from the slope, (e) Solutions to the Bushin- 
Bohdanecky and Yamakawa-Fujii equations using equivalent 


radii approach. The x-axis and y-axis represent Lp (nm) and Ml 
( g moP’ nm^’), respectively. The target function. A, is calcu¬ 
lated over a range of values for M^ and Lp. In these representa¬ 
tions, the values of A function are represented by the full color 
spectrum, from the minimum in the target function in blue 
(A = 0.220) to red (A > 1). The calculated minimum 
(Lp =13 nm and Ml = 330 g moP* nm“*) is indicated 
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Hydrodynamic Modeling of Carbohydrate Polymers, Table 2 Representative dilute solution conformation parameters of 
selected glycopolymers 


a 

h 

c 

8 

kJhl 

P 

f/fo 

Lp (nm) 

Zone 

Alginate 

0.73-1.31 

- 

0.52-0.54 

- 

0.6 

- 

9 

12-15 

B/C 

Capsular polysaccharides 0.4—0.9 

- 

- 

- 

0.8-L6 

- 

3-9 

2-20 

C/D 

Chitosan 

0.77-1.10 

0.24-0.25 0.55-0.56 

- 

0.16-0.73 

- 

11-16 

4-35 

B/C 

Galactomannan 

0.70-0.77 

0.12-0.65 0.54-0.57 

- 

0.15-0.41 

- 

8-17 

2-12 

C 

Glycogen 

-0.07-0 

0.71 

0.31-0.33 

0.38-0.40 - 

0.7-1.0 

1.7-2.8 

- 

E 

Heparin 

0.90 

0.38 

0.38 

0.62 

1.04-2.98 

1.34-1.52 

1-3 

4-6 

C 

K-cairageenan 

0.67-0.90 

- 

0.68 

- 

0.39-0.9 

- 

7-9 

2-3 

B/C 

i-carrageenan 

0.77 

- 

0.68 

- 

0.16 

- 

5 

4 

B/C 

Konjac glucomannan 

0.74-0.78 

0.32 

- 

- 

0.4 

- 

9-14 

1-34 

C 

Methyl cellulose 

0.83 

0.39 

- 

- 

0.30-0.75 

- 

10-12 

10-17 

c 

Mucin 

0.56-1.1 

0.4 

0.31-0.69 

- 

1.5 

1.7-2.4 

- 

- 

A/B/C 

Pectin 

0.62-0.94 

0.17 

0.57 

- 

0.10-0.85 

0.6-1.0 

7-10 

10-15 

A/B/C 

Pullulan 

0.66-0.67 

0.45 

0.58 

0.51 

1.27-1.49 

1.40-1.66 

2-5 

1-2 

D 

Xanthan 

1.23 

0.26 

1.00 

- 

0.28 

- 

14-19 

100-150 

A/B 

Xyloglucan 

0.55-0.67 

0.42 

0.51 

- 

0.12-1.44 

- 

2-6 

4-15 

C/D 


defined as the radius of an equivalent sphere having the 
same value as the determined property, for example, 
intrinsic viscosity (ai). A is thus a dimensionless esti¬ 
mate of the agreement between the theoretical calcu¬ 
lated values for the selected hydrodynamic parameter 
of a particular molar mass, persistence length, and 
mass per unit length and the experimentally measured 
parameter. 

Summary 

The solution conformation of polysaccharides, 
mucins, and glycopolymers can be estimated in a 
variety of ways, and estimates for a number of com¬ 
mercially important biomacromolecules are summa¬ 
rized in Table 2. We find that glycopolymers cross 
the whole range of conformations in solution (Zones 
A-E), but that Zone C (semiflexible coil) appears the 
most common. As we can see from Fig. 6, the estima¬ 
tion of conformation (especially in terms of persistence 
length) is very sensitive to the choice of model and it is 
therefore our belief that when trying to estimate solu¬ 
tion conformation of polysaccharides (or any other 
flexible macromolecule), the quality of the estimate is 
determined by the amount of experimental data avail¬ 
able - a combination of hydrodynamic data is always 
desirable. 
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► Dynamic Light Scattering 

► HYDFIT and Related Packages for Linear 
Molecules 

► Hydrodynamics of Macromolecules: Conformation 
Zoning for General Macromolecules 
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► Sedimentation Equilibrium Analytical 
Ultracentrifugation 

► Sedimentation Velocity Analytical 
Ultracentrifugation 
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St. Petersburg, Russia 

Introduction 

The hydrodynamic assessment of the overall confor¬ 
mation of macromolecules in solution using normal¬ 
ized scaling relations will be reviewed in this entry. 
Macromolecules in infinitely dilute solution occupy 
a volume in which the distribution of the polymer 
matter (substance) is determined by the chemical 
structure of the repeat unit, by the architecture/ 
topology of macromolecule, and by the interaction 
between monomers and solvent molecules. An indi¬ 
vidual macromolecule is regarded as a fractal object 
(cluster of connected repeat units, connected monomer 
clusters), which may be represented by the actual frac¬ 
tal dimension (de Gennes 1979; Mandelbrot 1982). 
This dimension df may be determined from the depen¬ 
dence of the number (N) of single particle clusters on 
the distance r for which the number calculated for 
identical particles (the repeat unit) N is directly pro¬ 
portional to the mass M: 

N - M - r" (1) 

More general characteristic dimensions of a linear 
polymer chain may also be represented in terms of the 
radius of gyration or the end-to-end distance. The 
fractal dimension characterizes the distribution of 
polymer matter in the volume occupied by a macro¬ 
molecule. The value of this volume depends on the 
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ability of a linear macromolecule to coil, on the cross- 
sectional diameter of the chain, and on the nature of the 
interactions between polymer matter with the solvent 
molecules. 

Information about macromolecular size may be 
obtained from hydrodynamic studies and/or by the 
study of scattered electromagnetic radiation or neutron 
scattering (Tanford 1961; Tsvetkov et al. 1970; Cantor 
and Schimmel 1980; Benoit and Higgins 1992) - see 
also ► HYDFIT and Related Packages for Linear Mol¬ 
ecules; ► X-Ray Scattering of Lipid Membranes; 
► Neutron Scattering of Membranes - but this para¬ 
graph is focused on interpretation of the hydrody¬ 
namic data. 

Highly useful hydrodynamic values are the transla¬ 
tional diffusion coefficient (Dq), the velocity sedimen¬ 
tation coefficient (Sq) - where the subscript “o” 
indicates the value at infinite dilution - and the intrin¬ 
sic viscosity ([r|]); they are related to the size or 
dimensions (Dq, Sq, and [t]]) and the molecular weight 
or molar mass (Sq and [r|]) of a macromolecule 
(Tanford 1961; Tsvetkov et al. 1970; Yamakawa 
1971; Cantor and Schimmel 1980; Fujita 1990): 

Do = kT/Pr?o < h2>'/2 (2) 

So = M(1 - Po«^)/NaP77o < (3) 

[ri] = O < h2>3/VM (4) 

In (2) (3), (4), <h^> is the average square of 
the end-to-end distance, Na is Avogadro’s number, 
4>(L/A,d/A, e) and P(L/A, d/A, s) are the Flory hydro- 
dynamic parameters, L is the chain contour length, d is 
the diameter of the chain, A is the statistical segment 
length (= 2Lp where Lp is the “persistence length”), 
and £ is a parameter characterizing the thermodynamic 
“quality” of a polymer-solvent system. 

To characterize a particular macromolecular com¬ 
ponent, the experimental values must be extrapolated 
to zero polymer concentration c (g/ml), using, for 
example, the following relations: 

D = Do(l-fkDC + kD'c2+ ...), (5) 

s = So(l -f kjC -I- ks'c^ + ■■•), and (6) 

7 ='7o(l+ki)C-f k/c^-f ...). (7) 


Equation (7) easily transforms into the classical 
Huggins equation (rjr — l)/c = k^ -i- k,.|'c -i-... and by 
definition k^ = [r|] when c ^ 0, where Tjr = r|/rio is the 
relative viscosity of solution (see ► Intrinsic viscos¬ 
ity). The values of ko and ks have the same dimensions 
or units (ml/g) as the k^ = [p] value. Often, 
the concentration dependence parameter of the sedi¬ 
mentation coefficient ks - also called the Gralen 
coefficient - is compared with [r|] giving the dimen¬ 
sionless ratio ks/[r|]. Finally for ks, a relation similar to 
(4) may be considered (Pavlov and Frenkel 1995): 

ks = B < h2>3/VM, (8) 

where B is a parameter with dimension or units moL*. 

The coefficient k^ depends in general both on k, and 
on the second virial coefficient A 2 (Tsvetkov et al. 
1970): 

ko = ks + u - 2 A 2 M, (9) 

where D is the partial specific volume of polymer. To 
eliminate the common solvent properties, the intrinsic 
values of velocity sedimentation coefficient [s] and 
translational diffusion coefficient [D] prove practical 
to use: [s] = So? 7 o/(l — vp^) and [D] = DoTIq/T. These 
“intrinsic” parameters, like the intrinsic viscosity, 
depend only on the main characteristics of a macromo¬ 
lecular component. 

In polymer science, Mark-Kuhn-Houwink- 
Sakurada (MHKS) type or hydrodynamic scaling rela¬ 
tionships equations are widely used (Fujita 1990; 
Brandrup et al. 1999). The comparison of hydrody¬ 
namic characteristics with molar mass allows 
obtaining canonical MHKS relationships: 

Pi = KiM“, (10) 

where Pi represents one of the hydrodynamic charac¬ 
teristics [rj]. Do, So, and k^. The following correlations 
exist in the polymer homologous series among scaling 
indices bi (Tsvetkov et al. 1970; Pavlov et al. 1990): 

|bD| = (l+b„)/3,|bD|-fb, = land (11) 

bks = (2 - 3b,)/b, (12) 
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For instance, between Sq and M and also between k, 
and So, there are the relations Sq = KsM°'’ and = 
Kk,sSo° The indices b^ and b^s are linked by (12). 
These relations follow from the comparison of the 
corresponding fundamental (2, 3, 4, 8) with (10) and 
the supposition that the sizes of macromolecules are 
the same value under the different experimental con¬ 
ditions (velocity sedimentation, translational diffu¬ 
sion, and viscous flow). It should be noted however 
that these relations are meaningful only for a homolo¬ 
gous series of polymers with different conformational 
status. For linear macromolecules, “homology” may 
be defined as a series of macromolecules which have 
the same mass per unit length for different chain 
contour lengths L. The mass per unit length M^, is also 
called a shift factor (Yamakawa 1971 ) or linear density 
of the polymer chain as defined hy = MIL — MJX 
with M the molar mass, Mq the molar mass of the 
repeat unit, and a a projection of repeat unit to the 
main direction of macromolecular chain. This value 
may he determined from X-ray scattering data, or be 
calculated in the simple case of aliphatic chains. For 
the very rigid polymer chains, the estimation of this 
value from the hydrodynamic data is possible using the 
model of a weakly bending rod or a cylinder (Tsvetkov 
1989). This value may change in the following limits: 
3 < 10“^ X g.moP^ cm“' < 22. A low value 
corresponds, for example, to flexible chains (polysty¬ 
rene, pullulan), and a high value corresponds, for 
example, to the triple helical polysaccharide from the 
Schizophyllum commune. 

First Normalization of the MKHS 
Equation for Linear Density of the Polymer 
Chain 

Mark-Kuhn-Houwink-Sakurada equation can be nor¬ 
malized to take into consideration the linear density or 
mass per unit length of the polymer chain M^. In fact, 
a lihrary of hydrodynamic data values for different 
types of linear polymer systems (including also their 
molar masses determined by one of various absolute 
methods - light scattering, sedimentation-diffusion 
analysis, or sedimentation equilibrium) has been col¬ 
lected over the last 50-60 years: from all this data, it is 
possible to make some representative generalizations. 
For instance, the data of [ry] values cover the range of 
contour lengths of linear chains varying by three orders 


of magnitude and the range of statistical segment 
lengths varying approximately 500 times. There are 
also plentiful data for the translational frictional prop¬ 
erties of polymer systems in dilute solution with 
regards the velocity sedimentation and translational 
diffusion coefficients (Brandrup et al. 1999; Pavlov 
2007; Pavlov et al. 1997, 1999) and covering both 
natural and synthetic polymers. 

Polymer systems may be tentatively divided into 
several classes: globular (such as globular proteins 
and glycogen), flexible chain under 0 conditions 
(polystyrene (Pst), poly-a-methylstyrene (Pamst), 
and polymethylmetacrylate (PMMA)), flexible chain 
in thermodynamically good solvents (Pst, Pamst, 
PMMA, poly isobutylene, pullulan, and polyvinylpyr¬ 
rolidone), rigid chains (cellulose derivatives), and 
extra-rigid chain polymers (xanthan, schizophyllan 
polysaccharides, and DNA). Data also used in this 
analysis were obtained for comb-like polymers and 
copolymers (molecular brushes) showing a large 
chain diameter. 

Data analysis can he carried out using plots of 
the MHKS type normalized with the M/^ value: 
log([7;]Mz,) = f(log(M/Mz,)) = f(logL) (Fig. 1). It 
follows from Equation (2) that ([ryJM^) ^ <h^> 

L ^ V/L where L = Ml Mi^ is the contour length of the 
macromolecule and V is the volume occupied by it in 
solution. Consequently, the value of [ryJM^ character¬ 
izes the volume occupied by the chain section 
corresponding to unit contour length of linear macro¬ 
molecule. This value will be larger, for greater equi¬ 
librium rigidity of macromolecule and better 
thermodynamic quality of the solvent. Hence, linear 
macromolecules in Fig. 1 are expanded into a spectrum 
by sizes which are mainly determined by the statistical 
segment length of the macromolecular chain. Zones 
that form linear macromolecules are designed in Fig. 1 
as follows: Zones l-II are the zones of rigid-chain 
macromolecules, III-IV are the zones of flexible- 
chain polymers, and V is the zone of the globular 
state. Notice that it is clear from this plot that the 
excluded volume effects (III-IV zone) have a smaller 
influence on macromolecular size or dimensions in 
comparison with the contribution of the equilibrium 
rigidity of a chain. In the case of absence of 
intrachain volume effects, this dependence may be 
predicted using the Yamakawa-Fujii intrinsic viscos¬ 
ity theory for wormlike cylinders (Yamakawa 1971; 
Tsvetkov 1989). 
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Hydrodynamics of Macromolecules: Conformation Zon¬ 
ing for General Macromolecules, Fig. 1 Normalized plot 
of [tiJMl versus L for the following polymer systems: 1 - 
schizophyllan, 2 - xanthane, 3 - DNA, 4 - cellulose nitrate, 
5 - methylcellulose, 6 - polyvinylpyrrolidone, 7 - pullulan, 8, 

9 - polystyrene and poly-a-methylstyrene in 6 conditions, and 

10 - globular proteins. Approximate curves have shown the 


following zones: I-II - extra rigid - rigid chains (400 nm > 
A >15 nm), III-IV - flexible chain in thermodynamically good 
solvents and in 0 conditions (A Ri 2 nm), and V-globular system 
(A < 1 nm). The references to the original publication 
concerning the compared polymer systems may be found in the 
following publications: Pavlov et al. 1997, 1999, 2008. The 
same concerns to Figs. 2, 3, 4, 5, and 6 


In a similar way, the dependence of velocity sedi¬ 
mentation coefficients normalized for M^, on L, 

will display the opposite trend (Fig. 2). According to 
Svedberg’s (3), the ratio [s]/M/, ^ Ll<h^> can be 
obtained and this ratio characterizes the degree 

of coiling of the macromolecule. The ratio will 

be greater for “compact” macromolecules occupying 
smaller volume and smaller for rigid macromolecules. 
Such arguments are also applicable to plotting the 
characteristic translational diffusion coefficient [£>] 
depending on L. 

What is very informative is a direct comparison of 
the concentration dependence coefficient ks (6 and 8) 
with the sedimentation coefficient Sq taking into 
account the mass per unit length Ml (Pavlov and 
Frenkel 1995). In fact, (k^Mz.) - <h^> and the 
slope of the dependence (k^M^) versus f([i’]/ML) will 
be proportional to A^, whereas the slope of a plot of [ry] 
Ml versus f(L) will be proportional to A^^^. This means 
that a plot of log(ksML) versus f(log([5]/ML)) will be 
more sensitive to the equilibrium rigidity than the plot 
of log([77]ML) versus f(logL). The conformational zon¬ 
ing of macromolecules using a plot of (ksM^) versus 


f([4’]/ML) was proposed in Pavlov et al. (1997, 1999). 
This once again underlines the fact that the study of 
velocity sedimentation of a homologous series of mac¬ 
romolecules is an informative way for investigating 
conformation and, in some cases, can be used as 
a self-contained method (Pavlov and Frenkel 1995; 
Pavlov 1997). 

Second Normalization of the MHKS 
Relationships for Statistical Segment 
Length A 

The next step is to take into account a parameter known 
as the statistical segment length in a further consider¬ 
ation of the hydrodynamic data obtained for homolo¬ 
gous series. 

Quantitative evaluation of the statistical segment 
length A (also known as the “Kuhn segment length”) 
can be estimated by Gray-Bloomheld-Hearst theory 
which considers the dependence of the translational 
friction coefficient of a wormlike chain on molar 
mass taking into account the influence of 
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Hydrodynamics of 
Macromolecules: 
Conformation Zoning for 
General Macromolecules, 
Fig. 2 Plot of the dependence 
of [s]/Ml on L on a double 
logarithmic scale for the same 
systems as in Fig. 1 
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intramolecular draining and excluded volume effects 
on chain size (Gray et al. 1967): 

[s]PoNa = [3/(1 - £)(3 - 

+ [Mi.Po/37tl [HA/d) - (l/3)(A/d)-' - ^(e)] 

(13) 

where Pq = 5.11 and e = 2ho-l = (2h^ — 1 )/3 is the 
parameter characterizing the thermodynamic quality 
of the polymer-solvent system, which may be calcu¬ 
lated from the indexes and hri, d is the hydrody¬ 
namic diameter of the chain (cross-sectional diameter), 
and (p(£) = 1.431 -i- 2.648 -i- 4.718^. 

The viscosity data analysis may be carried out for 
all ranges of M, supposing that the molecular size is 
the same in translational and rotational friction phe¬ 
nomena, and results in the following equation (Pavlov 
et al. 1990): 

{M^<Po/[r,]y^^ = [s]PoNa (14) 

The dependence of on L can thus be addi¬ 

tionally normalized for the segment length A. In this 
case, it follows from (2) and (4) that the generalized 
coordinates are which is a function of the 

relative contour length L/A = M/(M/,A): 

MMl/A^ = <l>(e)[l -h (5/6E-h (1/6)82)]-'(L/A)(^--''^1''^ 

(15) 


In Fig. 3 (based on data from Fig. 1), the data have 
been collected together and delineated by lines 1-2-3- 
4. Line 1-2 dehnes the region corresponding to 
polymers in the absence of intracoil excluded volume 
interactions and its contour represented by the follow¬ 
ing empirical expression: y^xp = —0.795 -i- 1.34x — 
0.141x^. The slope at each point is given by hrj = dy/ 
dx = 1.34 — 0.282x. varies between 1.62 and 0.5, 
while the number of Kuhn segments in the chain cor¬ 
respondingly varies between 0.1 and 1,000. This 
chain behavior without excluded volume effects is 
also predictable on the basis of the Yamakawa-Fujii 
theories of wormlike cylinders which gives compara¬ 
ble agreement through the following relationship: 
Ytheor = —0.865 -I- 1.24x — 0.094x^. 

It is clear from Fig. 3 that the branches 1-2 and 3-4 
approach each other in a region marked as “0.” The 
0 or “zero” region is not an actual single point but is 
instead represents a locality. 

Fujita (1990) generalized the behavior of flexible 
chains in a thermodynamically good solvent and found 
that, when L/A <50 (log(L/A) « 1.7), the excluded 
volume effects are no longer pronounced. The value of 
log(L/A) « 1.7 corresponds to this zero region. Branch 
3-0 limits the region of flexible-chain polymers with 
maximum volume effects (thermodynamically good 
solvent). Branch 4-0 limits the region corresponding 
to a polymer with maximally large chain diameter. 
Lines 3-0 and 4-0 may also be approximated by the 
same contour defined by y = 0.650 -i- 0.230x -i- 0.116x^. 
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Hydrodynamics of Macromolecules: Conformation Zoning 
for General Macromolecules, Fig. 3 Double normalized 
dependence of [tiJMl/A^ on (L/A) for the same polymer systems 
as in Fig. 1. In this plot, the data of polystyrene and poly-a- 
methylstyrene in thermodynamically good solvents and data of 
comb-like copolymer have been added. Branch 1-0-2 corre¬ 
sponds to chains without excluded volume effects, 3-0 corre¬ 
sponds to chains with excluded volume effects, 4-0 corresponds 
to chains with diameter which comparable with statistical seg¬ 
ment length, 5 - lg(L/A) = 1.7, and 6 - lg(L/A) = 2.1. See text 
for details 


This corresponds to the following expression for the 
slope: hrf = 0.230 -i- 0.232x. changes in this case 
from 0 to 0.9 as the number of Kuhn segments in the 
chain increases from 0.1 to 1,000, respectively. This 
indicates the similarity of contributions to the size of 
macromolecules - one part, the excluded volume 
effects in the range of high M, and by another part, 
the diameter of chain in the range of low M. All these 
data can be divided into two subspaces 1-0-4 and 2-0- 
3. The subspace 1-0-4 corresponds to polymers of 
low molar mass when intracoil volume interactions 
become negligible. This region corresponds to the 
case of draining chains when the contribution of diam¬ 
eter to the total chain size can be considerable. The 
region 1-0-4 is characterized by the value of parameter 
£ = 0 and a wide variation in the d/A value (0.005 < d/A 
<0.5). Subspace 2-0-3 corresponds to high molar 
mass polymers, i.e., corresponding to the case of 
nondraining coils when hydrodynamic draining is not 
manifest and the contribution of the chain diameter to 
total coil size is negligible. Region 2-0-3 is character¬ 
ized by a change in the thermodynamic quality of the 


solvent and corresponds to changes in the parameter £ 
in the range: 0 < £ < 0.25. 

The zero region corresponds to the parameter range 
1.7 < lg(LM) < 2.1 (Fujita 1990; Pavlov 2007). In the 
region of relative contour length less than L/A< 50, 
intracoil volume effects virtually do not appear in 
thermodynamically good solvents. The plot in Fig. 3 
is the locus of values of the intrinsic viscosity Flory 
parameter <d> corresponding to the set (or multitude) of 
linear macromolecules. 

In the case of translational friction, the double log¬ 
arithmic normalized plots will be in terms of the fol¬ 
lowing: [^]/M/^ or [D]M/RMi^ versus (L/A) for the 
sedimentation coefficient and translational diffusion 
coefficient, respectively, where R is the gas constant. 
Figure 4 demonstrates the double normalized plot of 
the sedimentation coefficient, which is the locus of the 
translational friction Flory parameter P corresponding 
to the set of linear polymers. The parameter P is anal¬ 
ogous to the Stokes coefficient 6n in the relation of the 
translational friction coefficient / with the end-to-end 
distance of linear chain <h^>:f= Prjo <h^> Lines 
1-2 and 3-4 in Fig. 4 have the same meaning as in 
Fig. 3, and virtually, all experimental data are situated 
inside these two curved lines. Hence, plots 1-4 show 
clearly that the main parameters determining the 
hydrodynamic characteristics of linear chains in solu¬ 
tion are the statistical segment length (or persistence 
length) and the linear density of the chain. The values 
contributing less are the hydrodynamic diameter - the 
effect of which is manifested in the range of low molar 
mass - and the thermodynamic quality of the solvent - 
manifested at high molar mass. It should be empha¬ 
sized that, to evaluate adequately the equilibrium 
rigidity of the chain from hydrodynamic characteris¬ 
tics, it is necessary to precisely take into account the 
effects of intramolecular draining and volume effects 
(Pavlov 2002). 


General Comparison of Hydrodynamic 
Properties of Macromolecules of 
Different Forms 

In Figs. 1 and 2, data have been presented for linear 
polymers for which the linear chain density can be 
determined. For nonlinear molecules, this approach is 
not suitable, because in this case, it is necessary to 
consider only the volume density of the polymer 
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Hydrodynamics of 
Macromolecules: 
Conformation Zoning for 
General Macromolecules, 
Fig. 4 Double normalized 
dependence of [s]/Ml on 
(L/A) for the same polymer 
systems as in Fig. 3. See text 
for details 



substance. Just this consideration makes it possible to 
compare both linear and general macromolecules in 
the system of coordinates [tyJM versus M (Fig. 5). The 
product ([tylM) is claimed to be proportional to the 
volume V occupied by the (macro)molecule in solu¬ 
tion. Note that ([tylM) is the key value in the Benoit 
universal calibration interpreting the results from 
exclusion gel permeation chromatography of poly¬ 
mers. In the first approximation, the slope of these 
dependences will be inversely proportional to the aver¬ 
age intracoil density (^log(l/p)). The upper curves are 
located along the ordinate, and the lower along the 
density of polymer substance in the volume limited 
by the polymer molecule. 

Star-like polystyrenes and short-branched mole¬ 
cules such as glycogen, lignins, and dendrimers can 
be added to those to linear systems as shown in Fig. 1. 
Data concerning supramolecular structures such as 
micelles formed by polystyrenes-polymethacrylic 
acid block copolymers and dendrigrafts can also be 
taken into account in this case. Finally, data on the 
intrinsic viscosity of low molecular weight chelate 
compounds (alkaline FeTaNa solution, cadmium- 
ethylene-diamine complex) and those of mineral salts 
(such as LiCl in dimethylacetamide, and LiCl in H 2 O) 
can also be taken into account. The limiting slopes of 
these dependences at high M range from 1.0 to 2.4 on 
passing from dendrimers, glycogen, and globular pro¬ 
teins to comb-like chain, then flexible linear chain, 


and, finally, to extra-rigid macromolecules (Fig. 5) in 
accordance with the fact that ([? 7 ]M) ^ When 

the molar mass decreases, the differences between 
different types of macromolecules also decreases. 

In the region of low molecular mass (M< 1,000), 
when the dimension of the moving macromolecules 
becomes comparable with those of the solvent, the [rj\ 
value often loses its simple physical sense. In this 
region of M, a negative value of [ 77 ] is possible, 
which obviously cannot be taken into consideration 
in this kind of plot. This means that, in this case, 
dissolved molecules disturb considerably the organi¬ 
zation and properties of solvent continuum, especially 
its dynamic viscosity. A large scattering of data is 
observed in this M region. 

Let us now compare the most compact molecular 
systems: globular proteins, dendrimers, short-branched 
molecules such as glycogen and lignins, molecular 
brushes (comb-like polymers and copolymers), star- 
like polystyrenes, and supramolecular structures such 
as micelles. The dependence for globular proteins, 
dendrimers, and glycogen is a straight line (Fig. 5, 
line 1 ) over the entire molecular mass range (four 
orders of magnitude) with a slope of 1.05 ± 0.01 and 
with a linear correlation coefficient r = 0.9984. 
This slope value means that, to a good approximation, 
y ^ M over the entire molar mass range. In other 
words, the average density of these molecular 
systems is virtually constant for the entire molecular 
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Hydrodynamics of Macromolecules: Conformation Zon¬ 
ing for General Macromolecules, Fig. 5 Dependence of 
[r|]M versus M on a double logarithmic scale for different 
polymer systems: 1 - globular proteins, dendrimers, and glyco¬ 
gen; 2 - comb-like polymers; 3 - polystyrene, poly-a- 
methylstyrene, and PVP in thermodynamically good solvents; 
4 - polysaccharides schizophyllan and xanthan; and 5 and 6 - 
supramolecular micelles formed by block copolymers of poly- 
styrene-polymethacrylic acid and polystyrene - block - poly 


(ethylene/propylene) copolymers. Approximate curves 
according to data shown for the following: 1 - globular and 
short-branched polymer systems (1.0, df = 3.0); 2 - molecular 
bmshes (1.4, df = 2.14); 3 - flexible chain in thermodynamically 
good solvents (1.75, df = 1.71); and 4 - extra-rigid polymers 
(2.4, df = 1.25). Values corresponding to limiting slopes (1 -I- b,^) 
in the region of high molecular mass and coiTesponding fractal 
dimension are shown in parentheses 


range. These molecular systems (globular proteins, 
dendrimers, and glycogen) possess higher average den¬ 
sities in solution than all other macromolecules. Other 
macromolecular systems of this series (comb-like and 
star-like polymers and copolymers) also give a single 
line (Fig. 5, line 2), the slope of which increases from 
1.3 (for logM = 4) to 1.5 (for logM = 6) moving to 
the region of high M. The average density of these 
macromolecular systems in solution is lower than 
those of globular proteins. Interestingly, supramolecu¬ 
lar systems such as micelles formed by polystyrene- 
polymethacrylic acid block copolymers are situated 
between the line (1) and (2) in the system of coordi¬ 
nates [r)]M versus M, and so have the intermediary 
polymer density in solution between the globular 
proteins/dendrimers and polymer brushes. 

The translational friction data of linear and 
branched macromolecules may be considered in the 
coordinates M/[s] = RI[D] = [/]N ,4 as a function of M, 


where [/] =fo/r]Q. This plot is presented in Fig. 6 for the 
same classes of polymers as in Fig. 5 and has the same 
features as Fig. 5. Figure 6 also provides the possibility 
of distinguishing macromolecules of different types 
following their translational friction properties in the 
region of high molecular mass. 

Examples of Normalized Hydrodynamic 
Plots in Interpreting Hydrodynamic Data 

Applicability of normalized hydrodynamic plots can 
be illustrated with the data obtained for the sodium 
poly(4-styrenesulfonate) (NaPSS) in aqueous solu¬ 
tions as studied by Takahashi et al. (1967), Hirose 
et al. (1999), and Pavlov et al. (2008) at various ionic 
forces and in a wide interval of molar mass. In salt-free 
solutions, NaPSS shows polyelectrolyte behavior, at 
addition NaCl polyelectrolyte effects suppress to be 
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Hydrodynamics of 
Macromolecules: 
Conformation Zoning for 
General Macromolecules, 
Fig. 6 Dependence of 
M/[s] = R/[D] versus M on 
a double logarithmic scale for 
the following: 1 - globular and 
short-branched polymer 
systems, 2 - molecular 
brushes, 3 - flexible chain in 
thermodynamically good 
solvents, 4 - extra-rigid 
polymers, and 5 and 6 - 
supramolecular micelles 
formed by block copolymers 
of polystyrene - 
polymethacrylic acid and 
supramolecular polystyrene - 
polyoxyethylene core-shell 
amphiphilic dendrigrafts 



3 4 5 6 

logM 


T 
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Hydrodynamics of 
Macromolecules: 
Conformation Zoning for 
General Macromolecules, 
Fig. 7 Dependence of [r|]M 
versus M on a double 
logarithmic scale for sodium 
poly(4-styrenesulfonate) at 
different ionic strength: 1 - in 
water salt-free solution, 2 - 
0.005 M NaCl, 3 - 0.2 M 
NaCl, 4 - 1 M NaCl, and 5-7 
4.17MNaCl. 1, 3, 4, and 7 are 
the data from Pavlov et al. 
(2008), 2 and 5 are the data 
from Takahashi et al. (1967), 
and 6 is the data from Hirose 
et al. (1999) 
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also attended by the decrease of the coil sizes. The data 
concerning values of intrinsic viscosity is considered. 
The molar masses have been determined by one of the 
absolute methods, for example, by sedimentation- 
diffusion analysis or by light scattering. 

The intrinsic viscosity in salt-free solutions is 
defined as the initial slope of the following depen¬ 
dences: Inrir versus f(c) at c^O, where 13 ^ is the relative 


viscosity of a solution at concentration c (Pavlov et al. 
2006). All over raw experimental data is analyzed in 
coordinates ([riJM) on M (Fig. 7). It is obvious that at 
transition from salt-free solutions to solutions with the 
moderate ionic strength (0.005 M, 0.2 M) and then to 
solutions with the high ionic strength (4.17 M), the 
conformation of macromolecules of NaPSS molecules 
undergoes changes from rodlike conformation up to 
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Hydrodynamics of 
Macromolecules: 
Conformation Zoning for 
General Macromolecules, 
Fig. 8 Normalized plot of 
[tiIMl versus L = M/Ml for 
sodium poly 
(4-styrenesulfonate) at 
different ionic strength which 
are the same as in Fig. 7 
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conformation of strongly dense coil. More quantitative 
comparison can be lead comparing data in coordinates 
[tiJMl from L (Fig. 8) which shows that in salt free 
solvent NaPSS molecules belong to extra-rigid zone, at 
0.005 M solvent NaPSS molecules be found in rigid 
macromolecules zone, at 1 M belong to the zone of 
flexible macromolecules, and at 4.17 M macromole¬ 
cules of NaPSS be found between the zone of flexible 
macromolecules in 0 conditions and the zone of glob¬ 
ular conformation. These conclusions are supported 
by estimation of the statistical segment lengths 
(Hirose et al. 1999; Pavlov et al. 2008). 

Conclusions 

The relations (2, 3, 4) and (1 and 10) allow to the 
possibility of relating the fractal dimension to the 
scaling indices of the MHKS relationships: 

df = 3/(1 + b„) = (1 - bs)-' = Ibor*. (16) 

Globular proteins, glycogen, and dendrimers are 
characterized by high fractal dimension df^ 3. These 
kinds of macromolecules resemble macroscopic parti¬ 
cles. The macromolecular brushes and other supramo- 
lecular systems like micelles and dendrigrafts have 
fractal dimension between 2 <df< 3, and linear mac¬ 
romolecules have df between \.\ < df< 2 following 
the equilibrium rigidity of chains (“soft matter”). 


It is clear that the macromolecular systems are ranged 
in Figs. 3, 4, 7, and 8, following their fractal dimen¬ 
sions (gradually decreasing or increasing). 

Consequently, the vast quantity of experimental 
data gives the possibility of arranging macromolecules 
of different topology accordingly to their hydrody¬ 
namic volume or size in the system of coordinates 
[rj\M versus M or M/[i’] versus M, respectively. The 
consideration of the mass per unit length leads to the 
size spectrum of linear macromolecules in the system 
of coordinates [t/JMl versus L. Further consideration 
of the statistical Kuhn segment length (or persistence 
length) shows that the principal parameters governing 
the sizes of macromolecules in solution are the statis¬ 
tical segment length (equilibrium rigidity of chain) 
and the mass per unit length. The second order param¬ 
eters are thermodynamic quality of solvent and 
hydrodynamic diameter of chain which determines 
the draining effects inside moving macromolecules 
in solution. 

Generalization of the available library of experi¬ 
mental hydrodynamic data suggests that, at a consid¬ 
erable degree of probability, the characteristics of 
a new linear polymer system, the fractal dimension of 
which lies between 1< 3, will be located inside the 

region determined by the lines I and V in a system of 
coordinates — L as shown in Fig. 1 . The char¬ 

acteristics of any new soluble polymer system will be 
located inside the region determined by lines 1 and 4 in 
a system of coordinates [r/]M — M (Fig. 5). 
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Summary 

There are fundamental relationships that link hydrody¬ 
namic parameters to the molecular weight or molar 
mass and size of polymer chains. These include scaling 
or power-law relations known in polymer science as 
the Mark-Kuhn-Houwink-Sakurada equations. These 
have been further developed using the mass per unit 
length of the polymer chain and a parameter known as 
the statistical segment length, and relations have 
been developed for representing the conformations of 
a variety of macromolecular conformation types, 
whether they be synthetically made or naturally 
occurring. 

Cross-References 

► Cellulose 

► Dynamic Light Scattering 

► Lipid Lateral Diffusion 

► Multiangle Light Scattering from Separated 
Samples (MALS with SEC or FEE) 

► Polysaccharides: Biophysical Properties 

► Sedimentation Velocity Analytical 
Ultracentrifugation 

► X-Ray Scattering of Lipid Membranes 
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Definition 

The shape of rigid structures can be represented by 
various models with the desired detail. The wormlike 
chain is a versatile model for the conformation of long 
nucleic acids. The overall hydrodynamic properties are 
the sedimentation and diffusion coefficients, intrinsic 
viscosity, and rotational and relaxation times. 
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Introduction 

The essential roles played by nucleic acids are, in many 
aspects, related to the overall conformation (i.e., size, 
shape, flexibility, etc.) and dynamics (intramolecular, 
diffusive, etc.) of the macromolecular chain. Although 
such roles will take place in the complex environment 
of the cell, for the understanding or characterization of 
such aspects, the proper way is the study of properties 
in dilute solution, where the features of the individual 
macromolecules are manifested without influence 
from their surroundings. Thus, solution properties 
like the hydrodynamic coefficients (diffusion, 
► sedimentation, viscosity), dynamic spectroscopies 
(fluorescence, NMR relaxation), flow behavior, and 
scattering (light, x-rays, neutrons) (Van Holde 
et al. 1998; Serdyuk et al. 2007) are primary sources 
of structural information. The relationship between 
such properties (and, ultimately, their biological 
functions), on the one hand, and the conformational 
and dynamical aspects, on the other, requires theoretical 
treatments and computational tools that would allow the 
calculation of properties from structure or for us to 
extract conformational information from properties. 

Although there exist other important topologies 
among DNA and nucleic acids, like circular or 
superhelical forms, hairpin loops, single-stranded chains, 
etc., the relevant structure, which is the subject of this 
article, is that of linear chains of double-stranded double¬ 
helical, linear DNA (dsDNA). The DNA double helix is 
a quite stiff structure, but still allows for some flexibility, 
whose influence on conformation and dynamics will 
depend on the size of the DNA. Thus, DNA can adopt 
a range of conformations, from that of small rigid oligo¬ 
nucleotides to that of very long, flexible native macro¬ 
molecules. Also, the range of level of detail in modeling 
can range from fully atomic to more coarse-grained 
descriptions and even to idealized polymeric chains. 

For the purpose of computing or simulating overall 
conformation and solution properties, hydrodynamic 
calculation procedures are available for rigid bead 
models (Carrasco and Garcia de la Torre 1999). 
Flexible molecules can be treated by Monte Carlo 
approaches when one considers only conformation 
and overall properties, and Brownian Dynamics with 
the inclusion of hydrodynamic interactions is the 
methods of choice for long-scale internal dynamics 
and single-molecule behavior (Schlick 2002; Garcia 
de la Torre et al. 2005, 2010). 


The Wormlike Chain Model 

The wormlike chain (WMC) model is the most widely 
used paradigm for describing the overall conformation 
and overall hydrodynamic properties of dsDNA. In 
simple words, the WMC is a filament of contour 
length L, diameter d, with a partial flexibility gauged 
by the so-called persistence length, a, which is 
interpreted as the part of the contour over which the 
filament can be considered as essentially rigid. The 
model includes the limit of the rigid rod (L/a <<1) 
and the fully flexible random coil (L/a >> 1). The 
relationship between L and molecular weight M is 
expressed by a constant, the mass per unit length 
As described in classical monographs (see Bloomfield 
et al. 1999), this model is adequate to describe the 
conformation and overall properties of dsDNA 
over the whole range of sizes, from small oligonucleo¬ 
tides to native DNAs. The usually accepted values of the 
WLC parameters are a = (50 ±5) nm, d — (2.3 ±0.3) 
nm, and M^, = (1,900 ±100) Da/nm (the indicated 
ranges are guessed from the author’s experience). 

The theoretical description of properties of WLCs 
has been the subject of intensive work for many years. 
Theoretical approximations, and separate treatments of 
the WLC for different Lja ranges, had been common 
until a recent, remarkable achievement: the develop¬ 
ment of a computational scheme based on Monte 
Carlo calculation and rigid-body hydrodynamics 
(► HYDRO) of properties from the WLC parameters 
(Amoros et al. 201 1). A global fit analysis (► HYDFIT) 
has been able to describe the various overall solution 
properties of dsDNA from 8 to 200,000 base pairs, in an 
unified manner, with results for the WLC that match very 
well the commonly accepted values for the double helix. 

A Simple Model for Very Short dsDNA: The 
Rigid Rod 

For very short dsDNA oligonucleotides (L/a <<1, up 
to, say, 20-30 base pairs), the WLC reduces to a rigid 
rod. In this range, L/d is indeed moderate (<10). The 
most widely employed computational approach for the 
solution properties for such short, thick rods is 
implemented in a scheme, which has been recently 
updated (Ortega and Garcia de la Torre 2003). Diffu¬ 
sion coefficients determined by ► dynamic light 
scattering and NMR relaxation have been successfully 
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interpreted in terms of this approach (Eimer and 
Pecora 1991; Boisbouvier et al. 2003). 


A Simple Model for Very Long dsDNA: The 
Bead-and-Spring Chain Model 

For very long dsDNAs {Lja >> 1, with a random-coil 
conformation), which are still amenable to conven¬ 
tional determination of overall solution properties 
(covering up to M = 10^, like that of DNA from T2 
bacteriophage), the classical Rouse-Zimm 
bead-and-spring chain (BSC) model, in which each 
element in the chain represents a long piece (subchain) 
of the DNA, has been the usual representation for 
theoretical and computational purposes (Rubinstein 
and Colby 2003). The WMC model still describes the 
overall properties in this range, but it is inadequate to 
describe the internal dynamics of the DNA chain, 
which must obviously play a role in the function of 
such long, native DNA molecules. The internal (intra¬ 
molecular) dynamics of DNA had been classically 
characterized by bulk properties of DNA solutions 
(Bloomfield et al. 1999), until it was observed directly 
in the behavior of single molecules during the 
pioneering study by Chu and coworkers (Perkins 
et al. 1997). These authors observed the stretching of 
single DNA molecules when subjected to an exten- 
sional flow and the relaxation to the coil conformation 
upon cessation of the flow. Using a modihed BSC, the 
finitely extensible (FENE) chain model, and a rigorous 
description of hydrodynamic interactions in Brownian 
dynamics simulations, this behavior can be quantita¬ 
tively predicted using Brownian dynamics simulation 
(Schroeder et al. 2004; Garcra de la Torre et al. 2009). 
In addition to predicting the essential quantity for 
internal dynamics, namely, the longest relaxation 
time, the so-called molecular individualism, i.e., the 
various paths and diverse instantaneous conformations 
of individual molecules during unfolding and 
refolding, can be simulated by this technique. 

More Detailed dsDNA Models 

Modeling a dsDNA with M = 10^ Da as a BSC with up 
to, say, 100 beads, means that each element represents 
a subchain of about 1 million Da. While this works for 
some aspects of overall dynamics, more detail may be 


needed in some cases. The level of detail may range 
from a description of the thickness of the macromolec- 
ular chain - which is ignored in the BSM - to an atomic 
description. Certainly, a higher level of detail requires, 
for a DNA piece of some given length, a higher 
number of elements in the computational model, 
and this brings the obvious consequence of a higher 
computational cost. Thus, high-detail models are 
evidently restricted to DNAs of moderate size. 

Touching Beads Model 

Modeling a polymer chain as a string of touching beads 
(looking like a pearl necklace) has also been common 
practice in polymer physics. The touching bead model 
(TBM) consists of a string of N touching beads whose 
diameter, h, is the same as the distance between the 
centers of neighbor beads. The fact that the TBM does 
not have a uniform cross section can have a remedy 
making some equivalence between this model and 
a cylindrical one. A string of touching beads has the 
same volume as a cylinder of the same total length and 
diameter d if one takes h — (3/2)'^^ d — 1.225 d. If one 
accepts d = 2.3 nm for dsDNA, the beads would have 
a diameter and bead-to-bead distance h = 2.8 nm. 
Thus, the ratio of number of beads to number of base 
pairs (nbp) is A/nbp = 0.12; one bead would then 
represent about eight base pairs or about My = 
M/N = 5,000 Da in molecular weight. 

Hagerman and Zimm (1981) first used this model in 
Monte Carlo (MC) simulations of conformational and 
overall hydrodynamic coefficients, and Allison (1986) 
showed how it can be implemented for Brownian 
dynamics (BD) simulation of internal dynamics. For 
more recent applications, see Garcia de la Torre 
(2007), Garcia de la Torre et al. (2009). The maximum 
number of beads typically accessible in present 
implementations is, approximately, 2,000 in MC and 
200 in BD; therefore, the TBM can be employed to 
represent dsDNA of up to 15,000 base pairs in MC or 
1,500 in BD simulations. 

An essential aspect when devising the TBM for 
dsDNA is the representation of the stiffness of the 
double-helical filament. The chain whose segments 
are the bonds between beads can be assimilated to 
a discrete wormlike chain if some bending potential 
is assigned to the angle, a, formed by two consecutive 
bonds. It has been shown (Allison 1986) that with 
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Hydrodynamics of Nucleic Acids: Modeling Overall Conformation and Dynamics, Fig. 1 (a) Scheme of the touching bead 
model (TMB) model of dsDNA. (b) Snapshot of a TBM with 29 beads, representing a model of dsDNA with 209 base pairs 


a quadratic potential y(a) = ViQa^, the chain of seg¬ 
ments represents a WCM with persistence length a if 
one takes V = (h/kTa). This bending potential and its 
associated forces are employed, respectively, in MC 
and BD simulations (Garcia de la Torre 2007; Amoros 
etal. 2011) (Fig. 1). 

Residue-Level Models 

A further level of detail in modeling the conformation 
(shape and flexibility) and dynamics of DNA can be 
achieved with mesoscale models in which one element 
in the model represents one repeating unit, i.e., nucle¬ 
otide in the case of nucleic acids. These are sometimes 
called coarse-grained models, in the sense that the 
detail is coarser than in a fully atomistic description 
(see below). For double helical B-DNA, the equilib¬ 
rium conformation of the model will reproduce the 
shape and primary dimensions (rise per base pair, 
diameter, and major-minor-groove asymmetry) of the 
double helix. For the calculation of hydrodynamic 
properties, the elements are regarded as spherical 
beads with a frictional radius adequately chosen. 

A rigid model suffices for the description of the 
translational and rotational (overall tumbling) of very 
short oligonucleotides (below, say, 25 base pairs). 
Flowever, for longer DNA pieces - or even if one 
wishes to describe the internal, torsional dynamics of 
the oligonucleotides - flexibility must be taken into 
account. This can be easily done by means of stiff but 
still elastic connectors (quadratic-potential springs) 
between the beads. In the spirit of the elastic network 
models, a minimal model can be constructed joining 
every bead with their closest neighbors: for instance, 
six on its own strand (three next neighbors on 
each side) and three in the opposite one (its pair and 


the pair’s next neighbors). With a proper choice of the 
stiffness of those springs, the well-known bending 
and torsional flexibility of the double helix can be 
modulated. This modeling scheme was early 
confirmed by predictions of both overall (transla¬ 
tional and end-over-end tumbling) and internal 
(NMR relaxation) dynamics of short oligonucletides 
(Huertas et al. 1997). 

For a Brownian dynamics simulation, the presently 
accessible number of beads in the model make it pos¬ 
sible to carry out simulations of these mesoscale 
models, with one bead per nucleotide, for DNA pieces 
having up to, approximately, 200-300 nucleotides 
(longer DNAs would require touching beads models). 
Nonetheless, this range can cover cases of present 
interest involving moderately long nucleic acids 
(Figs. 2 and 3). 

Models from Atomic-Level Structures 

With the present proliferation of atomic-level 
structures of biomacromolecules, the employment of 
such detailed structures in dynamic modeling is desir¬ 
able. Even if such high-level details are not significant 
for some overall and large-scale internal dynamics, 
certainly atomic structures should be valid starting 
points for computational predictions. In the field of 
rigid proteins, this scheme has been implemented in 
the program HYDROPRO and other programs of the 
► HYDRO suite. Each (non-hydrogen) atom in the 
structure is represented by one spherical element in 
the model, having some fixed hydrodynamic radius, 
which is practically the same, about 0.28 nm for both 
protein and nucleic acid atoms (Eemandes et al. 2002; 
Ortega et al. 2011). Hydrodynamic coefficients of rigid 
nucleic acids and nucleoprotein complexes can be 
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Hydrodynamics of Nucleic Acids: Modeling Overall Conformation and Dynamics 



Hydrodynamics of Nucleic Acids: Modeling Overall Con¬ 
formation and Dynamics, Fig. 2 (a) Scheme of model for 
dsDNA with one bead per nucleotide, shown in the straight 
(equilibrium) conformation. Such model, considered to be 
rigid, is valid for predicting overall solution properties of small 
dsDNA. (b) Scheme showing the stiff connections of each ele¬ 
ment in the elastic network model (Huertas et al. 1997) 



Hydrodynamics of Nucleic Acids: Modeling Overall Con¬ 
formation and Dynamics, Fig. 3 Snapshot of a dsDNA frag¬ 
ment at an instantaneous conformation during the simulation of 
the elastic network model (Garcia de la Torre et al. 2009) of 
a fragment with 135 base pairs 


predicted with uncertainties not larger than experimen¬ 
tal errors. 

Even if atomic-resolution structures are available, 
one may wish to devise the hydrodynamic model with 
a lower resolution, with one bead per residue. This has 
the advantage of decreasing the number of elements in 


the model, with the subsequent reduction in computing 
requirements. This approach performs practically as 
well as that with atomic models (Ortega et al. 2011). 


Computer Programs 

Computer programs for simulating the conformation, 
dynamics, and solution properties of nucleic acids 
are freely available from http://leonardo.inf.um.es/ 
macromol/. For rigid models, one can employ pro¬ 
grams in the ► HYDRO suite: 

• HYDRO-h-i- calculates solution properties rigid 
bead models in general. 

• HYDROPRO and HYDRONMR compute solution 
properties of rigid models with either atomic. 

For flexible models, ranging from the BSM to the 
more detailed TSM and the BPN, one can employ: 

• MONTEHYDRO, for Monte Carlo simulation of 
conformational and overall dynamic quantities 

• SIMUFLEX, for Brownian dynamics simulation 
(including hydrodynamic interaction) of any con¬ 
formational or dynamic aspect, including single¬ 
molecule event 

Furthermore: 

• WORMCYL is available for the calculation of solu¬ 
tion properties of rodlike and ► wormlike models. 

• ► HYDFIT provides tools to fit the parameters of 
rodlike and WLC models to experimental data. 


Summary 

Modeling, computing, and simulation procedures are 
available to describe the conformation and dynamics 
of nucleic acids, and particularly double-stranded DNA, 
from small oligonucleotides to large, native macromol¬ 
ecules. Various levels of detail can be adopted in the 
models, which can be adapted to the size of the case 
under study so that the computing or simulation 
becomes feasible with available computational 
resources. The problems that are covered range from 
bulk solution properties to single-molecule events. 

Cross-References 

► Dynamic Light Scattering 

► HYDFIT and Related Packages for Linear 
Molecules 
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► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules 

► Neutron Scattering of Membranes 

► Nucleic Acid NMR - Introduction 

► Protein Conformational Dynamics by Relaxation 
Dispersion 

► Sedimentation Velocity Analytical 
Ultracentrifugation 

► Single-Particle Tracking 

► Wide and Small Angle X-Ray Scattering 

► Worm-Like Chain (WLC) Model 

► X-Ray Scattering of Lipid Membranes 
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Hydrogen Atom Transfer (HAT) 

► Proton-Coupled Electron Transfer 


Hydrogen Bonding 

► Rhodopsins - Intramembrane Signaling by Hydro¬ 
gen Bonding 

► Ultraviolet Resonance Raman (UVRR) Spectros¬ 
copy Studies of Structure and Dynamics of Proteins 

► X-Ray Diffraction and Crystallography of Oligosac¬ 
charides and Polysaccharides 


Hydrogen Exchange 

► H/D Exchange 
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Hydrogen-Deuterium Exchange 


- field. They are related to geometrical parameters 

Hydrogen-Deuterium Exchange through the equation: 


► Infrared Spectroscopy of Protein Dynamics: H/D 
Exchange 


Hyperfine Shifts 

Ivano Bertini and Giacomo Parigi 
Magnetic Resonance Center (CERM) and Department 
of Chemistry, University of Elorence, Sesto Fiorentino 
(FI), Italy 


Synonyms 

Pseudocontact and contact shifts 


Definition 

Paramagnetic metal ions affect the NMR spectra of 
molecules. The hyperfine shift is the difference 
between the chemical shift of a nucleus in a 
paramagnetic molecule and that in the diamagnetic 
analog obtained after removal of the paramagnetic 
metal ion, its substitution with a diamagnetic ion, or 
reduction/oxidation of the paramagnetic metal to 
a diamagnetic state. The hyperfine shiff comprises the 
contact and the pseudocontact shifts. The contact shift 
is the result of the presence of unpaired electron spin 
density at the observed nucleus, which may occur 
through direct spin delocalization or spin polarization. 
Contact shifts are experienced by nuclei that are posi¬ 
tioned few chemical bonds away from the paramag¬ 
netic metal ion or even further away if there are n 
bonds, and they contain structural information. In few 
cases (Fe 4 S 4 ^'^ clusters, Fe(III) heme proteins), the 
contact shifts of some nuclei are related to angles 
between the metal ion and the ligand nuclei through 
heuristically obtained Karplus relationships. They 
may be as large as hundreds of ppm and of course 
may tend to zero. 

Pseudocontact shifts arise from the nonzero averag¬ 
ing upon rotation of the dipolar coupling between the 
nuclear magnetic moment and the average electron 
magnetic moment induced by the external magnetic 


dP‘='‘ = 


12nr^ 


AZajt(3cos^-!9- l) +-Axr/,sm^i9cos2ip 


( 1 ) 


where and Ax,./, are the axial and rhombic anisot¬ 
ropy parameters of the paramagnetic susceptibility 
tensor, given by 


. Zxv + Xyy 

^Xax Xzz 2 

A/,;, = - Xyy 

and r, r?, and ip are the spherical coordinates of the 
nuclei in the reference frame in which the paramag¬ 
netic susceptibility tensor x is diagonal. Pcs values 
depend only on the anisotropy of x and not on its 
magnitude; if the magnetic susceptibility is isotropic, 
the rotational average of the dipolar energy is zero, and 
no pcs occurs. They depend neither on the applied 
magnetic field, nor on fhe observed nucleus. They 
depend on the distance of the nucleus with respect to 
the metal ion and on its angular position. Hyperfine 
shifts of nuclei, which are expected not to experience 
contact shifts because of the distance from the metal 
ion, are due to pcs. They have been extensively used 
for the calculations of molecular structures. Pcs can be 
univocally calculated once the molecular structure and 
the magnetic susceptibility anisotropy tensor are 
known. The anisotropy tensor can be calculated from 
the experimental pcs (provided that at least five values 
are available, if the metal position is known) and the 
molecular structure. The molecular structure cannot be 
determined from pcs and the anisotropy tensor alone 
due to the large degeneracy in the nuclear positions 
resulting from Eq. 1 . However, the availability of pcs 
arising from multiple metal ions with different tensors 
and of other structural restraints (like NOEs) can solve 
this degeneracy. Pseudocontact shifts can be detected 
up to several tens of A, depending on the magnitude of 
the magnetic susceptibility anisotropies. 

A further contribution to the hyperfine shiffs of 
nuclei close to the paramagnetic metal ion, besides 
the contact shift and the pseudocontact shift described 
by Eq. 1, can arise from the ligand-centered 
pseudocontact shifts resulting from the presence of 
unpaired spin density within non-s ligand orbitals. 
Nuclei far from the metal ion experience only the 
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pseudocontact contributions, which allow to determine 
the parameters of Eq. 1 - if a structural model becomes 
available - and then to calculate pcs of nuclei close to 
the paramagnetic center. The latter nuclei experience 
also contact and ligand-centered pseudocontact shifts. 
Both of them can he estimated hy subtracting the 
pcs from the hyperfine shifts. The separation between 
contact and ligand-centered pseudocontact shifts 
is very difficult: often the latter are neglected or 
estimated through density functional calculations. 

Cross-References 


► Iron-Sulfur Clusters 

► Paramagnetic Metalloproteins 


Hyperpolarization-Activated Channels 

► HCN Charmels: Biophysics and Functional Relevance 


HyuP ("Permease" in the hyu Operons 
from Pseudomonas sp. NS671) 


► Cytochromes 

► Hemes 


► Mhpl, the Na’^-Hydantoin Membrane Transport 
Protein 
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IFNAR 

► Type I Interferon Receptor 


Immunoglobulin G 

► Crystallohydrodynamics of IgG 
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Introduction 

An implantable enzyme-amperometric biosensor is 
a bioanalytical technology that is intended to measure 


and often remotely transmit a record of specific molec¬ 
ular-level biological phenomena in a living subject. 
The basic function is to indwell a tissue and to detect, 
measure, and record the levels of a molecule of 
interest within that tissue. Being amperometric, the 
biotransducer generally comprises two or three elec¬ 
trodes; a working electrode rendered specific to the 
analyte of interest via immobilization of the appropri¬ 
ate enzyme, a counter electrode to support the ensuring 
current, and a reference electrode to provide a suitable 
voltage plane against which the interrogating voltage 
may be referenced. Two electrode biotransducers func¬ 
tionally cojoin the counter and reference electrodes. 
Implantable amperometric biosensors generally use 
enzymes as the biorecognition biomolecule to enable 
the detection of biochemicals of interest within the 
body (Guiseppi-Elie 2007). Enzymes catalyze bio¬ 
chemical reactions of specific substrates, the products 
of which may then be detected via electrochemical 
oxidation or reduction that occurs at the surface of 
the working electrode while under a suitable impressed 
potential. Bioimmobilization, a technical challenge in 
its own right, is often achieved by a number of different 
techniques, and the current generated by the electro¬ 
chemical oxidation or reduction of the enzymatic prod¬ 
ucts generates a measurable current. The potential 
applied may be with respect to a reference electrode 
with a known redox potential such as a silver/silver 
chloride electrode, although silver is never used 
in vivo, or with respect to a pseudo-reference electrode 
such as platinum, if the magnitude of current generated 
is small enough. 

Amperometry is an electrochemical technique wherein 
a fixed voltage is impressed upon the working electrode 
thereby generating an initial transient current, i{t), 


G.C.K. Roberts (ed.). Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 
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Implantable Amperometric Biosensors 



Implantable Amperometric Biosensors, Fig. 1 Voltage 
applied to cell begins at Ei where no reaction occurs and is 
stepped up to E 2 causing electrode process to begin and 
a current transient ensues. Current, I, drops off with time 
according to the Cottrell equation as substance, C*, must diffuse 
to the electrode surface in order to react 


C°* is a standard concentration), linear dynamic range 
(the range of concentrations over which the response of 
the biotransducer is linear), and the limit of detection 
[3(SD/s)] (where SD is the standard deviation of the 
blank or a measure of system noise) are important figures 
of merit in assessing bioanalytical performance of 
the biosensor (Theavenot et al. 1999). Oxidoreductase 
enzymes, such as the FAD-dependent (flavin adenine 
dinucleotide), glucose oxidase (GOx) (EC 1.1.3.4), and 
FMN-dependent (flavin mononucleotide) lactate oxidase 
(LOx) (EC 1.1.3.2.), are suitable candidates for the rec¬ 
ognition of important physiologic biomarkers related to 
diabetes care and management and to hemorrhage- 
associated trauma care and management, respectively. 
GOx is a stmcturally rigid glycoprotein with a molecular 
weight of ca. 160 kDa (Tsuge et al. 1975) and consists 
of two identical polypeptide subunits, each containing 
a EAD redox center. In Generation 1 biotransducers, 
these enzymes, in the presence of oxygen, produce equiv¬ 
alents of hydrogen peroxide that may be oxidized anod- 
ically at a potential which is positive (typically 0.60 V vs. 
Ag/AgCl) (Guilbault and Lubrano 1973) or cathodically 
reduced at a potential which may be negative (typically 
0.0 vs. Ag/AgCl) (Ruzgas et al. 1995) relative to the 
redox potential of hydrogen peroxide. 


related to the activity of redox species at the interface. 
The current decays to a steady state value, 4s, that 
is dependent upon the bulk chemical potential of 
electroactive species. Eor a single analyte at a macro 
electrode, the resulting current profile is given by 
Cottrell’s equation (Bard and Eaulkner 2001). At 
a microelectrode, the current profile, shown in Eig. 1, is 
given by a modified Cottrell’s equation. Table 1 provides 
a summary of possible geometries of amperometric 
electrodes, their associated temporal and steady state 
forms of Cottrell’s equation and schematics of the diffu¬ 
sive mass transport field associated with each type of 
electrode. 


/?-D-glucose + GOx/EAD 
^ D-glucono-^-lactone -f GOX/EADH 2 

GOx/FADHz -f O 2 ^ GOx/EAD -b H 2 O 2 (3) 
H 2 O 2 ^02-1- 2H+ -b 2e- (4) 

Enzyme kinetic parameters may be obtained from 
the steady state current of the amperometric enzyme 
biotransducer. A Flill or Lineweaver-Burk analysis 
using (5) allows calculation of the apparent enzyme 
kinetic parameters. 


nEAD,^//^C* vFADgppC* 

27jl/2fl/2 2;-o 


_L = ^_L +J_ 

^SS 7jnax ^5 


(5) 


The goal in the design of implantable amperometric 
biosensors it to make the steady state current, /'ss, directly 
proportional to the chemical potential of the in vivo ana¬ 
lyte that is recognized by the enzyme. The bioanalytical 
sensitivity [s = (/'ss- ibOAog C vs. log(C*/C° *)] (where 4s 
is the steady state current, /'bi is the base line current, and 


Here Iss is the steady state current or response after 
substrate, S, addition, C* is the bulk concentration of 
substrate, and is the maximum current or response 
measured under enzyme saturated substrate conditions. 
The maximum current, Imaxj and the Michaelis-Menten 
constant of the system, Km (mM), may thus be 
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Implantable Amperometric Biosensors 


determined, allowing evaluation of the contextual perfor¬ 
mance of the enzyme relative to some standard condition, 
such as in solution. 

Commonly known implantable biosensor systems 
include short-term indwelling continuous glucose 
monitoring systems such as Dexcom’s 2006 FDA- 
approved STS (U.S. FDA 2006) and 2007 FDA- 
approved STS-7 Continuous Glucose Monitoring 
System for diabetics (U.S. FDA 2007). Newer implant¬ 
able biosensor systems are being designed with 
intended use in trauma management (Guiseppi-Elie 
2010). This entry focuses on the multiple challenges, 
and the various attempts and approaches that are being 
forged to address them, in the development of longer- 
term indwelling bioanalytical biosensors for humans. 
Among these challenges are: (1) long-term biocompat¬ 
ibility, (2) in vivo bioanalytical performance, and 
(3) engineering and communications issues. 

Known Failure Modes 

In the now almost 50 years since amperometric enzyme 
biosensors were first described by Clark and Lyons 
(1962) (Clark and Sachs 1968). There has been little 
success in commercializing long-term, indwelling, 
implantable amperometric biosensors. Wang (2007) 
gives a detailed review on the various generations of 
electrochemical glucose biosensors. Indwelling ampero¬ 
metric biotransducers may fail because: (1) of the body’s 
foreign body response, wherefrom the device becomes 
encased within an avascular fibrous capsule of collagen 
that prevents access of analytes to the otherwise properly 
functioning device, (2) proteases of the extracellular 
matrix (ECM) infiltrate the device and degrade the 
immobilized enzyme, (3) the molecular environment 
of the immobilized enzyme promotes its denaturation 
by favoring unfolding of the protein leading to loss of 
biorecognition, (4) time-temperature dependent changes 
in the membrane properties lead to densification of 
the membrane that compromise transport properties, 
(5) materials failures such as delamination of membranes 
and breakage of leadwires and other connections. 

Biocompatibility 

Biocompatibility is the ability of a synthetic material or 
structure to perform as intended when implanted within 


a living host. Implantable enzyme-amperometric biosen¬ 
sors must recognize, transmute, and generate physico¬ 
chemical signals that are proportional to the chemical 
potential (concentration) of the analytes they are 
intended to measure They must do so without interfer¬ 
ence from endogenous (such as dopamine, urate, ascor¬ 
bate, and citrate) or exogenous (such as acetaminophen) 
reactive species, without cross reactivity with similarly 
stractured chemicals, without insult due to the changing 
physiologic conditions of the host, and do so repeatedly 
and reliably for their period of intended performance. 
The first concern for any designed in vivo device is the 
initial host inflammatory response following the trauma 
of implantation. The second concern is the subsequent 
long-term interaction with proteins, cells, and tissues of 
the host as the tissue remodels and the body responds 
during the device’s period of intended performance 
(Ratner and Bryant 2004). There are a number of gener¬ 
alized approaches to addressing biocompatibility 
(Donald 2005; Moon et al. 2006; Stevens and George 
2005). Among these are (1) device design approaches, 

(2) the use of biomimetic chemistry in membrane mate¬ 
rials, (3) the use of heterogeneous nano- and meso- 
stractures, (4) the release of dmgs into the implant 
environment, (5) the matching of mechanical properties 
of materials with the tissue bed of implantation, and 
(6) the pre-seeding of cells onto the device prior to 
implantation. It is clear that this problem will only be 
successfully addressed with a plurality of sophisticated 
approaches for which reductionist science may not be 
well suited. 

Bioanalytical Performance Issues 

Ultimately the implantable amperometric biosensor 
must perform its bioanalytical function to the benefit of 
the patient or clinician and do so without regard to the 
details of the technology that enables its performance. 
A major challenge facing indwelling biotransducers 
of the enzyme-amperometric type are time-dependent 
changes that accompany performance - drift. Among 
the challenges to bioanalytical performance are 
(1) enzyme stability, (2) biomolecular interferences, 

(3) the performance of molecular mediators used in 
Generation 2 biotransducers, and (4) internal calibration. 

Enzyme stability can be defined as the reproducibility 
of analytical response or signal to analyte (substrate) 
over a period of time arising from the changing 
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performance of the enzyme. Factors influencing signal 
reproducibility include the exact activity of immobilized 
enzymes (Gerard et al. 2002), possible enzyme denatur- 
ation during immobilization (Evtugyn et al. 1998), 
enzyme degradation by reactive oxidative species, and 
protease degradation. 

A major challenge with indwelling biotransducers 
of the enzyme-amperometric type is the need for and 
approach to in vivo calibration of the device. One- 
point calibration is generally based on blood derived 
from a single Anger prick. One-point calibrations are 
inherently fraught with error and are notoriously inad¬ 
equate when errors are more complex than systemic 
and/or random. Multipoint calibrations of in vivo 
biotransducers is possible by the administration of 
a bolus of ANALYTE, glucose or lactate, to the patient 
or animal subject, and is often done in animal studies 
and may be done in a clinical setting (Jeong et al. 2003; 
Rong et al. 2008). However, this is not generally pos¬ 
sible or recommended in the chronically monitored 
patent. Moreover, this approach assumes that intersti- 
tially or subcutaneously monitored analytes are at the 
same concentration as their systemic circulating coun¬ 
terparts, or at least they are consistently proportional 
due to the usual transport barriers. While under homeo¬ 
stasis, the latter may be true, however the dynamics of 
the physiological changes suggest otherwise, such as 
during hemorrhage (Guiseppi-Elie 2011). In vivo sen¬ 
sitivity can be determined as the ratio of the 
incremental increase in current corresponding to 
changes to systemic analytes (Rong et al. 2008). The 
equation for in vivo sensitivity (S) is given as follows 
(Rong et al. 2008): 


Where Ii and I 2 are the currents before and after an 
incremental increase in analyte concentration and Gi 
and G 2 are the analyte concentrations before and after 
the incremental increase. Eor convenience, Ii can be 
taken as the in vivo background current, Iq, such that 
Gi may be regarded as zero analyte concentration. 
Therefore I 2 becomes the stabilized initial current, Ij, 
of the biosensor after implantation. In vitro analysis 
with a single blood sample is used to determine G 2 as 
the initial analyte concentration, Gj. The previous 
equation then reduces to the following (Jeong et al. 
2003): 


5 = (7) 

Therefore this in vivo calibration technique could 
benefit from an additional, nonenzymatic electrode by 
which background current can be monitored. Back¬ 
ground current should be monitored anyhow, as it 
tends to fluctuate during biosensor implant duration 
(Jeong et al. 2003). Always knowing the background 
current will enable users of the biosensor system to 
determine possible enzyme activity loss. With sensi¬ 
tivity known and the background current, analyte 
levels (G) can be calculated with the measured current 
I as follows (Jeong et al. 2003): 

G = (8) 

Engineering and Communications Issues 

Implantation is motivated by the need to allow the patient 
or subject the freedom of motion without being confined 
to an instrument; implantation allows the subject to 
take the instrument with them. Central to an implantable 
amperometric biosensor is its potentiostat. However, for 
implantation, the potentiostat is closely associated with 
two-way telemetry and communications. Two general 
formats are being pursued. The first is an implantable but 
otherwise tethered bio transducer with externally located 
power, electronics, and communications components 
with the external components mounted outside but on 
the subject’s body. The second is a fully integrated 
discrete or application specific integrated circuit (ASIC) 
where all components are fully implanted. 

Ideally, an implantable biosensor system should 
be small, have very low power consumption, and be 
able to wirelessly transmit data as well as be remotely 
programmed while within living tissues (Earahi et al. 
2007). Major electronic components include the front- 
end potentiostat, power modules, microcontrollers, ran¬ 
dom access memory, flash memory, RE transmitters, and 
analog-to-digital converters (ADC) (Earahi et al. 2007). 
Miniaturized potentiostats have been developed which 
contain microprocessors, ADC, gain and low-pass Alters 
(Avdikos et al. 2005; Beach et al. 2005; Huang et al. 
2007; Kwakye and Baeumner 2007; Kwang-Seok 
et al. 2003; Lei et al. 2002; Martin et al. 2004; Rocchitta 
et al. 2007). Continued innovations in potentiostats are 
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needed. Miniature conformal antennae for implantable 
telemetry applications have also been developed (Merli 
et al. 2011). Bio-Nano-Sensors with electrochemical 
front-ends have been developed for continuous monitor¬ 
ing of glucose, lactate, glutamate, and ATP (Carrara et al. 
2011). As a whole, the entire package of the implantable 
Bio-Nano-Sensors systems is approximately 2,477 mm^ 
in volume (Merli et al. 2011). Subdermal identification 
(RFID - Radio Frequency Identification) chips such as 
the VeriChip® are approximately 12-14 mm^, approxi¬ 
mately two orders of magnitude smaller. However, while 
application specific integrated circuits (ASIC), these 
devices at not equipped for hioanalytical measurements. 
The holy grail of in vivo hioanalytical and clinical chem¬ 
istry is to develop and deploy physiological status mon¬ 
itoring hiochips of similar footprint, along the lines of 
a large grain of rice. 

Summary 

While the possible failure modes have been outlined 
and the formidable challenges identified, it is impor¬ 
tant to emphasize that the emerging approaches that 
have been highlighted here reflect a shift in collective 
thinking that represent bright lights on the horizon. 
Several of these approaches highlighted above have 
already been adopted and have demonstrated success. 
Continuous performance of implanted biosensors for 
up of 4 months in rats (Yu et al. 2006, 2007) and up to 
1 year in pigs (Gough et al. 2010) has been achieved. In 
both these studies the authors achieve success by inte¬ 
grating several of the approaches outlined above. 

Cross-References 

► Amperometric Biosensors 
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In Cuba Crystallization 

► Lipid Mesophases for Crystallizing Membrane 
Proteins 


In Meso Crystallization 

► Lipid Mesophases for Crystallizing Membrane 
Proteins 


In Vivo Imaging 

► Live-Cell Single-Molecule Imaging 


In-Cell NMR 

Hidehito Tochio and Masahiro Shirakawa 
Graduate school of Engineering, Kyoto University, 
Kyoto, Japan 

Synonyms 

NMR 

In-cell NMR is a multidimensional NMR spectroscopy 
of macromolecules such as proteins and nucleic acids 
in living cells, aiming to analyze their structures and 
functions. Techniques of isotope-aided solution NMR 
spectroscopy are used to selectively derive signals of 
macromolecules of interest from those of a large num¬ 
ber of other molecules present in cells. 

So far, E. coli cells, Xenopus laevis oocytes, human 
HeLa cells, and monkey COS7 and 293F cells have 
been used as “host cells” for the in-cell approach. 
For E. coli cells, proteins to be analyzed are 
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over-expressed in cells grown in isotope-enriched 
media, so that the proteins are semi-selectively labeled 
with NMR-active nuclei such as and *^C. 

The whole cells are subjected to NMR measurements, 
and appropriate pulse schemes are employed to allow 
only the signals of labeled molecules to be detected. 
The obtained spectra possibly suffer from NMR 
signals of intrinsic E. coli proteins and other endoge¬ 
nous molecules that can be isotope-enriched in some 
degree; however, when expression levels of target 
proteins are sufficiently high, these unwanted signals 
are negligible. 

For in-cell NMR using Xenopus laevis oocytes, 
rather straightforward method has been proposed: 
isotope-labeled proteins are directly injected to 
oocytes with microinjection technique. The oocytes 
are then subjected to NMR measurements. Because 
Xenopus oocytes are larger 1 mm in diameter) than 
mammalian somatic cells (^10 pm) or bacterial cells 
(1^2 pm), efficient injection is possible. 

Two approaches have been proposed for in-cell 
NMR using mammalian cells, in both of which 
isotope-labeled proteins are transduced into “host 
cells.” One utilizes cell penetrating peptides (CPPs) 
that promote protein transduction to cytosol. Proteins 
to be studied are engineered to be tagged with CPPs 
such as peptides harboring HIV-Tat or octaarginine 
sequence. Cells are exposed to CPP-tagged proteins 
for a short period, so that the proteins enter the cytosol. 
The cells are collected and subjected to NMR 
measurement. The CPP-tag is designed to be separated 
from the protein after its entry into the cytosol to 
escape from CPP-induced aggregates. The other 
reported method employed to transduce isotope-labeled 
proteins to the cytosol utilizes a bacterial toxin, which 
makes certain types of cells permeabilized by forming 
pores with a diameter of 35 nm on the plasma membrane. 
Isotope-labeled proteins passively enter the cells through 
the pores. 

Although in principle most techniques of solution 
NMR can be applied to proteins in living cells, low 
concentrations and slow tumbling rates of proteins in 
cells hamper the practical applications. Nevertheless, 
using a series of two- and three-dimensional NMR 
spectra, structure determination of a protein in E. coli 
cells has been achieved. Using an in-cell NMR-related 
method for analyzing hydrogen exchange rates, the 
folding stability of ubiquitin in human cells was 
investigated. 


Incoherent Inelastic Neutron Scattering 

► Vibrational Spectroscopy with Neutrons 


Indirect Spin-Spin, Scalar-Coupling, and 
Through-Bond Correlation Spectroscopy 

► J-Based NMR Correlation Spectroscopy of 
Biological Solids 


Individual Light Sheet Microscopy 

► Selective Plane Illumination Microscopy (SPIM) 


Individual Molecule Localization 
Techniques 

Francesca Celia Zanacchi and Alberto Diaspro 

Department of Nanophysics, Italian Institute of 

Technology (IIT), Genoa, Italy 

Department of Physics, University of Genoa, Genoa, 

Italy 

Synonyms 

Single-molecule imaging; Single-molecule switching 
microscopy (SMS) 

Definition 

Individual molecule localization is a family of super¬ 
resolution imaging techniques, which allows one to 
push the resolution below the diffraction limit, 
exploiting the detection and the localization of sparse 
single emitters. 

Basic Characteristics 

Individual molecule localization microscopy is a fluores¬ 
cence super-resolution imaging technique developed 
in the 2000s from single-molecule spectroscopy and 
microscopy (Rigler et al. 2001; Chirico et al. 2001). 
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Individual Molecule 
Localization Techniques, 
Fig. 1 Principle behind 
individual molecule 
localization techniques. Signal 
from a sparse subset of 
molecules is acquired (a) and 
the position of each single 
molecule is localized and 
identified with high accuracy 
(b). Molecules are then 
switched off and a new subset 
of events is collected. The 
cycles are repeated until 
enough molecules are 
collected and the super¬ 
resolution image provided by 
the sum of all the localized 
events can be reconstructed (c) 



This technique is used to perform super-resolution imag¬ 
ing of biological structures achieving a resolution below 
30 nm (Hell 2009). The basic idea behind this technique 
relies on the possibility to switch a fluorophore or 
a fluorescent protein between a “bright” and a “dark” 
state, thus obtaining a sparse subset of emitters distin¬ 
guishable in the image (see Fig. 1). Molecules driven into 
the bright state can be imaged and their positions can be 
localized with nanometer precision. After the readout 
phase the markers are switched off and new markers are 
turned on to be collected. The cycles are repeated until 


enough molecules are acquired and the flnal image pro¬ 
vided by the sum of all the localized events can be 
reconstructed. One of the first techniques demonstrating 
the capability to localize with nanometer accuracy and 
determine the position of single fluorescent probes was 
► fluorescence imaging with one-nanometer accuracy 
(FIONA) (Yildiz and Selvin 2005). Later, starting 
from 2006, several localization-based techniques, such 
as ► photoactivation localization microscopy PALM 
(Betzig et al. 2006), ► fluorescence photoactivation 
localization microscopy (FPALM) (Hess et al. 2006), 
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► stochastic optical reconstruction microscopy (STORM) 
(Rust et al. 2006), and ► ground state depletion imaging 
(GSDIM), have been developed providing super-resolution 
imaging capabilities. 

Variants based on the concept of FIONA were 
later developed, namely, in ► single-molecule high- 
resolution imaging with photobleaching (SHRIMP) 
the photobleaching process is used to turn off the 
molecules and ► single-molecule high-resolution 
colocalization (SHREC) is two color applications 
oriented. 

All these techniques are based on the localization of 
single sparse emitters and the difference between them 
is related to the optical system and to the 
photoswitching mechanism used. PALM exploits the 
photophysical properties of photoactivatable fluores¬ 
cent proteins and is implemented in a TIRF configura¬ 
tion, while in FPALM a wide-field optical scheme is 
used. The main difference between PALM, STORM, 
and GSDIM, relies on the transition used to switch 
between the dark and the bright state. In STORM 
paired photoswitchers based on synthetic fluorophores 
are used while in ground state depletion with 
single-molecule return (GSDIM) the transition to 
a metastable triplet state is used to drive molecules in 
a “dark” state. The rapid growth of far held optical 
super-resolution techniques led to a fast development 
of localization based techniques and their extension to 
multicolor super-resolution imaging (Bates et al. 2007) 
and three-dimensional imaging (Huang et al. 2008; 
Celia Zanacchi et al. 2011) recently extended to 
biological samples (Pavani et al. 2009). Localization 
based techniques (Moemer 2007 ; Biteen 2011; Patterson 
et al. 2010) have demonstrated a vast applicability to 
a wide range of biological contexts; from imaging of 
membrane protein distribution to imaging of nuclear 
complexes in both live and flxed samples. 

For more information on this topic, the reader is 
referred to the references listed below. 

Cross-References 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Photoactivation Localization 
Microscopy (FPALM) 

► Ground State Depletion Imaging 

► Photoactivated Localization Microscopy (PALM) 


► Single-Molecule High-Resolution Colocalization 
(SHREC) 

► Single-Molecule High-Resolution Imaging with 
Photobleaching (SHRIMP) 

► Stochastic Optical Reconstruction Microscopy 
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Induced Circular Dichroism 

► DNA-Ligand Circular Dichroism 


Induced Fit Binding 

► Plasticity at Protein-Protein Interaction Interfaces 

► Protein Dynamics and Allostery 


Induced Linear Dichroism - Ligand- 
Ligand Interactions 

► DNA-Ligand Flow Linear Dichroism 


Inelastic Neutron Scattering 

► Vibrational Spectroscopy with Neutrons 


INEPT 

Bryan VanSchouwen and Giuseppe Melacini 
McMaster University, Hamilton, ON, Canada 

Synonyms 

Insensitive nuclei enhanced by polarization transfer 

Definition 

INEPT is a method used in Nuclear Magnetic Reso¬ 
nance (NMR) to enhance the signal of a nucleus with 
low gyromagnetic ratio (“S,” e.g., *^C or *^N) by 
transfer of polarization from a nucleus with high gyro- 
magnetic ratio (“I,” often *H). The maximum signal- 
to-noise amplification is Yi/ys, where yi and ys are the 
gyromagnetic ratios of the high- and low-sensitivity 
nuclei, respectively. So, for instance, the maximum 
signal-to-noise enhancement for the 'H to *^C ('^N) 
polarization transfer is ^4 (^10). 


In the INEPT experiment, the I to S polarization 
transfer is obtained by exploiting the through-bond 
scalar coupling between the I and S spins (Jis) 
according to the following pulse sequence; 

90°(I) - t/2 - 180°(I), 180°(S) - t/2 - 90;(I), 90°(S) 

where ap(I or S) refers to a radio-frequency pulse on 
nucleus 1 or S resulting in a total rotation of a degrees 
around axis p. At the end of the spin-echo delay t, 
chemical shift evolution of the I spin is refocused by 
the 180° (I) pulse, while the Jjs scalar coupling Hamil¬ 
tonian evolves thanks to the additional 180°(S) pulse. 
When I = l/2Jis, the in-phase ly coherence generated 
by the first 90°x(I) pulse is quantitatively converted 
into antiphase I-coherence (i.e., 2IxSz) at the end of 
the T delay. The last two INEPT pulses then implement 
the actual coherence transfer into the antiphase S- 
coherence 2IzSy. Note the critical 90° phase shift 
between the two 90° (I) pulses of the INEPT. 

The generation of the antiphase S-coherence 2IzSy 
starting from the equilibrium Boltzmann polarization 
yJz explains why the theoretical maximum signal- 
to-noise enhancements is yi/ys- However, the evolution 
of passive coupling constants (i.e., Jihih) as well as 
transverse relaxation during the delay x leads to experi¬ 
mental signal-to-noise enhancements below the expected 
ji/js value. Nevertheless, a further potential benefit of the 
INEPT sequence is that the duration of the inter-scan 
delay is set according to the longitudinal relaxation rate 
(Ri) of the I rather than the S nucleus. 

The polarization enhancement provided by the 
INEPT pulse sequence is exploited in several two- 
and three-dimensional NMR experiments, in which 
insensitive nuclei, such as and ^^N, are indirectly 
detected. Eor instance, in the Heteronuclear Single- 
Quantum Coherence (HSQC) experiment the INEPT 
is followed by a frequency labeling delay for the 
S nucleus (i.e., and *^N), which is then reconverted 
through a reverse INEPT scheme into single-quantum in- 
phase I-coherence. As a result, initial creation and final 
detection of coherence both take advantage of the 
I-nucleus with high gyromagnetic ratio (i.e., *H), maxi¬ 
mizing sensitivity. Similar principles have been incorpo¬ 
rated into many multidimensional NMR experiments as 
means of coherence transfer between nuclei of interest. 

The INEPT scheme is useful also for spectral editing. 
When combined with phase cycling and/or pulsed field 
gradients, the INEPT pulse sequence efficiently selects 
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only for I nuclei bound to S nuclei and vice versa. Further 
editing based on multiplicity is possible through an addi¬ 
tional spin-echo. For instance, the signal of a CFI 2 
can have opposite sign relative to that of a CH or CH 3 . 

Cross-References 

► J Coupling 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR Methods for Kinetic Analysis 

► Protein NMR - Introduction 

► SOFAST HMQC 

► Triple Resonance NMR 

► TROSY 

► ZZ-Exchange 


Influenza Neuraminidase - 
Computational Studies 

Ornjira Aruksakunwong*, Thanyada 
Rungrotmongkor, Nadtanet Nunthaboor and 
Supot Hannongbua^ 

'Department of Chemistry, Faculty of Science, 
Rangsit University, Pathumtani, Thailand 
^Department of Biochemistry, Faculty of Science, 
Chulalongkom University, Bangkok, Thailand 
^Department of Chemistry, Faculty of Science, 
Mahasarakham University, Mahasarakham, Thailand 
"'Department of Chemistry, Faculty of Science, 
Chulalongkom University, Bangkok, Thailand 

Synonyms 

Glycohydrolase; Neuraminidase; Sialidase that 
cleaves the glycosidic linkages of neuraminic acids 

Definition 

Neuraminidase (NA) is one of the two glycoproteins on 
the surface of the influenza virus. Its roles occur after 
virus replication, by cleavage of the terminal sialic acid 
(SA) from the receptors of infected cells and releasing 
the progeny vims to infect other cells. If the activity of 
NA is blocked, the new virions cannot be released from 
the host cell and thus viral reproduction is prevented. 


Introduction 

Influenza neuraminidase (NA) has been recognized as 
an important target for antiviral drugs due to its essen¬ 
tial role in the viral replication cycle. NA has a total 
molecular weight of 240 kDa and consists of 
a mushroom-shaped head of four identical subunits in 
a square-planar arrangement (Taylor and Itzstein 
1994), as shown in Fig. la. Phylogenetic analysis of 
the NA DNA sequences recognizes nine distinct sub- 
types of NA, named N1-N9, and these are further 
classified into two groups based on the genetic and 
structure analysis: groupl (Nl, N4, N5, and N 8 ) and 
group2 (N2, N3, N 6 , N7, and N9). Significant confor¬ 
mational differences between groupl and group2 NAs 
have been reported (Russell et al. 2006). The group-1 
NA contains a flexible 150-loop region (residues 147- 
152) that adopts an open conformation in the apo form, 
but conformation changes from the “open” to the 
“closed” form through substrate or inhibitor binding 
at the active site. The open-loop conformation leads to 
formation of a 150-cavity adjacent to the active site. In 
the case of the group2 NA, its conformation is likely to 
be in the closed form in which the 150-cavity is absent 
from the active site, as shown in Fig. lb, c. Recently, 
we found that the 2009 HlNl pandemic vims (09N1) 
neuramidase was crystallized without the 150-cavity 
characteristic of group-1 NAs. However, from the com¬ 
putational simulations of apo- and laninamivir-bound 
form shown that 09N1 NA exists in solution preferen¬ 
tially with an open 150-cavity. The catalytic active site of 
all NAs is located in a concave cavity on the protein 
surface and contains a number of conserved amino acids 
(R118, D151, R152, R224, E276, R292, R371, and 
Y406) that directly interact with the terminal sialic acid 
(SA). In addition, the framework residues around the 
catalytic region (El 19, R156, W178, S179, D/N198, 
1222, E227, H274, E277, N294, and E425) stabilize the 
stmcture of the active site (Colman et al. 1983). 

Substrate Binding of NA 

Based on the available crystal structures of NA, the 
active site of all NAs contains three conserved argi¬ 
nines (Argil 8 , Arg292, and Arg371) which bind the 
carboxylate moiety of the terminal SA (Itzstein and 
Dyason 1999). The acetamido substituent is stabilized 
by Argl52 while Glu276 forms hydrogen bonds with 
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Influenza Neuraminidase - 
Computational Studies, 
Fig. 1 Three-dimensional 
structure of (a) the 
neuraminidase tetramer and 
the active site comparison of 
(b) group-1 and (c) group-2 
enzymes 




Influenza Neuraminidase - Computational Studies, 
Fig. 2 The active site of influenza neuraminidases 


the 8 - and 9-hydroxy groups as shown in Fig. 2. 
The NA substrate specificity was recently investigated 
using molecular dynamics (MD) simulation (Raab 
and Tvaroska 2011). The binding mode and behavior 
of three ligands (SA, methyl 3'sialyllactoside 
(SA-a2,3-Gal-Glc, 3SL) and methyl 6 'sialyllactoside 
(SA-a2,6-Gal-Glc, 6 SL)) binding to the active site of 
H5N1 NA were compared (Fig. 3). In the Nl-SA 
complex, the SA ring undergoes a transition from the 
62,5 to the ^Cs conformation. In contrast, the SA of 


both 3SL and 6 SL remained in the distorted boat con¬ 
formation. The binding mode of the SA unit in the Nl- 
3SL and N1-6SL complexes is much more stable than 
in the Nl-SA complex. The Glc unit of the N1-6SL 
complex has strong interactions with the 150-loop 
residues D151 and R152, whereas the Gal unit of the 
N1-3SL complex shows rather weak interactions with 
N247 and Y347. The MD results obtained imply that 
the N 1 enzyme of the avian influenza virus preferen¬ 
tially binds to the substrate with an a 2 ,3-linkage over 
that with an a 2 , 6 -linkage, which is consistent with the 
experimental data (Mochalova et al. 2005). 

NA Inhibitors and Mutations 

Structural analysis of NA in complex with the 
terminal SA led to the rational design of potent inhibitors. 
Currently, four approved NA inhibitors (zanamivir, 
oseltamivir, peramivir, and laninamivir) are available 
for treatment of influenza vims infections. Zanamivir 
(Relenza) and oseltamivir (Tamiflu) have been licensed 
for the treatment and prophylaxis of influenza (Jin and 
Kim 2004). Peramivir (Rapiacta) is a new NA inhibitor 
from BioCryst under FDA review in Phase 111 trials. 
However, the FDA had approved the use of Peramivir 
for the treatment of 2009 HlNl hospitalized patients 
under the Emergency Use Act, which terminated on 
June 23, 2010. Laninamivir (Inavir) has been recently 
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Influenza Neuraminidase - 
Computational Studies, 
Fig. 3 The chemical 
structures of sialic acid, 
methyl 3'sialyllactoside, and 
methyl 6'sialyllactoside 





methyl 3'sialyllactose 


OH 



approved for sale in Japan. This inhibitor potently inhibits 
the NA activities of various influenza A and B viruses 
including the pandemic 2009 HlNl virus as well as the 
oseltamivir-resistant strains (Kubo et al. 2010). 

The emergence of drug-resistant viruses has been 
shown to occur rapidly during influenza treatment. To 
date, high resistance and mutations against oseltamivir 
have been widely reported, not only in vitro but also 
in vivo including in clinical treatment. The H274Y and 
N294S mutations were detected in subtype Nl, 
whereas El 19V/I and R292K were noticed in subtypes 


N2 and N9. The H274Y mutation bearing H5N1-NA 
isoform in complex with oseltamivir was modeled and 
simulated (Malaisree et al. 2009), revealing that the 
source of the oseltamivir resistance is the reduction of 
the hydrophobicity and the size of the pocket in the 
area around an ethyl moiety at its bulky group. These 
changes were primarily due to the dramatic rotation of 
the hydrophilic —COO~ group of E276 toward the 
ethyl moiety. In addition, the hydrogen-bonding inter¬ 
actions with Nl residues at the —NH 3 ’^ and —NHAc 
groups of oseltamivir were weakened by the 
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R = sugar 






Sialic acid 


Influenza Neuraminidase - Computational Studies, Fig. 4 The general reaction pathway of glycohydrolase 
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Influenza Neuraminidase - Computational Studies, Fig. 5 The catalytic reaction steps and the corresponding intermediates of 
the substrate-neuraminidase complex 


replacement of a water molecule. On the other hand, 
the N294S mutation, which caused a slight change in 
the binding affinities of oseltamivir, maintained almost 
all of the intermolecular interactions with the N1 bind¬ 
ing residues except for the reduced hydrogen bonding 
between the —COO” group of oseltamivir and Y347 
(Rungrotmongkol et al. 2009). 


NA Enzymatic Reaction Mechanism 

NA acts as a glycohydrolase that catalyzes the hydrolysis 
of glycosidic linkages. NA specifically promotes the 
cleavage of terminal sialic acid a-ketosidically (A-acetyl 
neuraminic acid, Neu5Ac) from those glycoconjugates. 
The neighboring saccharide of SA is usually galactose. 
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When the glycosidic linkage is cleaved by hydrolysis and 
the receptors for the virus are destroyed, the SA falls off 
the cell surface. The general reaction is shown in Fig. 4. 

The catalytic pathways for sialidase consist of four 
major steps. The first step is the binding event in which 
the carboxylate group of the terminal SA transforms 
from the axial position into a pseudoequatorial position 
due to the charge-charge interactions with arginines 
Argils, Arg292, and Arg371, and steric constraints 
with residues at the base of the active site, such as 
Tyr406. The second-step reaction involves proton 
donation from the solvent and formation of the 
endocyclic sialosyl cation transition-state intermedi¬ 
ate. The final two steps of this mechanism are the 
formation and release of NeuSAc (Taylor and Itzstein 
1994). NMR experiments indicate that NeuSAc is ini¬ 
tially released as the a-anomer, which is consistent 
with the proposed SnI mechanism having a high 
degree of stereo facial selectivity. 

Considering the catalytic reaction, a water molecule 
is located in the active site adjacent to the reaction 
center of the enzyme-NeuSAc complex and between 
the charged residues Asp 151 and Argl52 with 
a position and orientation that is optimal for proton 
donation onto the glycosidic oxygen atom (Taylor and 
Itzstein 1994), as shown in Fig. 5. The enzyme- 
NeuSAc bonding distances observed over the simula¬ 
tion time indicate that Asp 151 and Argl52 have ter¬ 
minal side-chain atoms that are not directly associated 
with the binding of the substrate and the proposed 
transition-state intermediates. 

Cross-References 

► Glycohydrolase 
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Infrared Spectroscopy: Data Analysis 

Erik Goormaghtigh 

Structure and Function of Biological Membranes, 

The Free University of Brussels, Center for Structural 
Biology and Bioinformatics, Brussels, Belgium 

Synonyms 

2D correlation spectroscopy; Apodization; 
Asynchronous correlation; Baseline; Deconvolution; 
Derivation; Derivative; FTIR spectroscopy; Hetero¬ 
correlation; Infrared spectroscopy; Mie scattering; 
Multivariate statistics; Principal component analysis; 
Principal component regression; Supervised statistical 
method; Synchronous correlation; Unsupervised sta¬ 
tistical method; Water vapor 

Introduction 

FTIR data are particularly complex and intricate. Yet, 
the strength of FTIR spectroscopy precisely resides in 
this complexity. Most applications rely either on its 
unique capability to detect small differences in spectra, 
for instance, in the course of a chemical reaction (Barth 
and Zscherp 2002; Ritter et al. 2003; Liu et al. 2005; 
Stolz et al. 2005; Barth 2007) or among cell types or 
tissues. The quality of the FTIR spectrometers is such 
that biosensors or imaging methods sampling from 
10 ”^ to 10~^^ g yield spectra of sufficient quality to 
analyze chemical modifications or protein secondary 


structure (Goormaghtigh et al. 2009). Whether kinetics 
are monitored or images are recorded, modem FTIR 
spectroscopy generates thousands of spectra every day. 
The present challenge is to handle this mass of data. 
Preprocessing, quality assessment, and analyses must 
be automated to be of practical interest. This chapter 
singles out the importance of the main steps in data 
analysis and describes how data analysis can be han¬ 
dled when large numbers of spectra are recorded. 

"Quality" of the Spectra 

The recording of spectra is one of the most important 
steps prior to analysis. If the data do not contain the 
information or contain too much noise, there is no way 
to overcome the problem and no data processing can 
solve it. Several types of factors influence the quality 
of the spectra. Ideally, a high signal-to-noise ratio and 
the absence of optical distortion is the target. Practi¬ 
cally, visual inspection of spectra produced can be 
misleading in the absence of a complete knowledge 
and understanding of the data processing applied. 
Figure 1 illustrates this problem. In Fig. 1, the spec¬ 
trum of pure water vapor is reported at a resolution of 
2 cm“*. Such a spectrum is easily obtained by record¬ 
ing a background after purging the measurement 
chamber with dry air, then recording a spectrum after 
stopping purging for a few minutes. This type of con¬ 
tribution is always present in the FTIR spectra, as will 
be discussed below. After Fourier self-deconvolution 
with a Lorentzian line (FWHH = 20 cm”*) followed 
by apodization by a Gaussian line (FWHH = 36 cm”*) 
(Goormaghtigh and Ruysschaert 1990), the result 
is a spectrum that could be interpreted as the spectrum 
of a protein with a strong helical component at 
1,652 cm”* in the amide I and a corresponding amide 
II band located at 1,547 cm”*, which is quite expected 
for an a-helix, confirming the presence of a-helix 
secondary structure. Yet, there is no protein in the IR 
beam. This example illustrates the need for a good 
understanding of data processing and for its thorough 
description. 

The main factors that influence the quality of the 
spectra are briefly reviewed below. 

1. Spectrometer settings: The FTIR spectrometer set¬ 
tings (e.g., resolution or number of scans) largely 

contribute to the main characteristics of the spectra. 

Papers usually report the number of scans that are 
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Infrared Spectroscopy: Data Analysis, Fig. 1 FTIR spec¬ 
trum of water vapor at 2 cm“ * resolution (bottom) and the same 
spectrum after Fourier self-deconvolution with a Lorentzian line 
(FWHH = 20 cm^’) followed by apodization by a Gaussian 
line (FWHH = 36 cm^‘) 

averaged to obtain the final spectrum. It varies from 
one to several thousands. As the noise decreases as 
the square root of the number of scans, this is an 
important parameter, but not the only one. The 
scanner (mirror) velocity, the fact data are collected 
in the forward direction only or in the forward and 
backward directions are also important. High scan¬ 
ner velocities result in fast recording but also in 
poorer signal-to-noise ratios. In fact the time spent 
measuring a spectrum would be a more appropriate 
parameter. Furthermore, if the noise level is an 
important parameter, recording a high intensity sig¬ 
nal is as important to determine the signal-to-noise 
ratio. Detector type (MCT, DTGS), beam width, 
mirror alignments in accessories such as attenuated 
total reflection (ATR) largely define the final qual¬ 
ity of a spectrum. While it is expected that only 
30-50% of the light available in transmission is 
collected at the output of an ATR accessory, 
misalignment of the optics may bring this value to 
10% or less. In such a condition, when 90% of the 
dynamic range is lost before starting the experi¬ 
ment, the remaining signal has to be amplified, as 
well as the noise. 

2. Environment: A second key factor is the environ¬ 
ment which may vary between background and 
sample spectra. 

(a) Variations in water vapor content in the 
spectrometer must be reduced to a minimum. 
The proper corrections for water vapor contri¬ 
butions will be discussed later. 


(b) Temperature variations in the room may have 
dramatic impact on the spectra. They originate 
from sunlight, door opening, gradual heating 
in the sample compartment due to IR beam 
absorption and surrounding electronic environ¬ 
ment. Especially dramatic are temperature 
oscillations generated by air-conditioning. 

(c) Any vibration that disturbs the position of the 
interferometer mirror may distort the interfero- 
gram. They can be generated by hoods, centri¬ 
fuges, air-conditioning, compressors used for dry 
air production, and can be transported on long 
distance by building infrastructure and metallic 
pipes used for transporting gases, fluids, or part of 
a central heating. Sound waves may have signif¬ 
icant effects on the interferogram. 

3. Mode of recording: The third factor to be taken into 
account is obviously the type of spectra that are 
recorded. For instance, transmission spectra are 
not ATR spectra. ATR-FTIR is one of the most 
powerful methods for recording infrared spectra of 
biological materials in general, and for biological 
membranes in particular (Goormaghtigh et al. 
1999). The environment of the molecules can be 
modulated so that their conformation can be studied 
as a function of temperature, pressure, pH, as well 
as in the presence of specific ligands. In some con¬ 
ditions (high absorbance, incidence angle close to 
the critical angle) ATR spectra can be significantly 
different from their transmission counterparts with 
distorted band shapes displaying shoulders on the 
low wave number side (Goormaghtigh et al. 1999). 
An incidence angle of 45°, the most frequent 
design in ATR accessories, is close to the critical 
angle where distortions get very strong for internal 
reflection element with refractive index close to 
2.4 such as diamond (n = 2.38), ZnSe (« = 2.4), 
ZnS {n = 2.42), KRS-5 (« = 2.35), but not for 
germanium {n = 4). In imaging, other spectral 
distortions occur, in particular Mie scattering 
(Bassan et al. 2009; Bird et al. 2010). The number 
of infrared spectrum recording modes is so wide 
that it is not possible to address all the problems 
here. Each method needs specific consideration 
for its limitations and potential distortion of the 
spectra. 

Once high-quality spectra have been accumulated, 
evaluation of spectral quality and correction for known 
interference can begin (see below). 
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Spectral Processing 

1. Corrections for Atmospheric Water. The 
presence of sharp atmospheric water absorption lines 
superimposed on the sample spectra cannot be avoided 
in long-term experiments and become prominent 
when recording very small absorbances. As pointed 
out in a previous review (Goormaghtigh 2009), atmo¬ 
spheric water absorbance is best corrected when spec¬ 
tra are recorded at relatively high resolution for taking 
advantage of the linewidth difference existing between 
the atmospheric water vapor and the solid sample 
bands. Briefly, three levels of complexity in the cor¬ 
rections can be considered (Goormaghtigh 2009): 

(a) Direct correction: A reference water vapor spec¬ 
trum is subtracted in order to zero the area in 
a spectrum region where a prominent water 
vapor band is located. Typically, spectra are 
recorded at 2 cm” ^ resolution and the subtraction 
coefficient is computed from the atmospheric 
water band area between, e.g., 1,562 and 
1,555 cm”^ (a straight line is drawn between the 
spectmm points at these two wave numbers) on the 
sample spectmm and on the reference atmospheric 
water spectmm. This level of processing is gener¬ 
ally sufficient for most analyses. If an acceptable 
result is not achieved at this point, a further step 
must be undertaken. 

(b) In order to account for small shifts of the water 
vapor bands, the reference water vapor spectmm 
can be shifted by interpolation. Curve fitting of 
a Gaussian lineshape of the top of the water vapor 
band, between, e.g., 1,573 and 1,579 cm~^ on the 
reference and sample spectmm allow the accurate 
computation of the shift. Once the shift is deter¬ 
mined, the reference water vapor spectmm is 
shifted (typically by 0.1-0.2 cm” *) by linear inter¬ 
polation before subtraction as described above. 

(c) The experimental spectmm contains in fact sev¬ 
eral water vapor contributions (present at the time 
of background recording, of sample spectmm 
recording or reference water vapor recording) 
which may be shifted one with respect to the 
other. Subtraction coefficients and shifts of sev¬ 
eral water vapor spectra can be determined by 
a minimization procedure aiming at obtaining 
a corrected line as short as possible between, 
e.g., 1,700 and 1,600 cm”*. The detailed proce¬ 
dure has been reported in Goormaghtigh (2009). 



Infrared Spectroscopy: Data Analysis, Fig. 2 ATR-FTIR 
spectmm of bovine pancreas phospholipase A 2 as recorded 
at a resolution of 2 cm^* {green line), after Fourier self¬ 
deconvolution by a Lorentzian, FWFIFI = 18 cm“' followed by 
apodization by a Gaussian, FWHFl = 6.7 cm”’ (red line) and after 
Fourier self-deconvolution by a Lorentzian, FWFlFt =18 cm”’ 
followed by apodization by a Gaussian, FWHFl = 9 cm” ’ (black 
line). The spectmm of water vapor (blue line) is reported 


Finally, whatever the procedure followed, the 
corrected spectrum can smoothed to 4 cm”’ a resolu¬ 
tion high enough not to lose information in the IR 
spectrum of liquid or solid samples. 

Selection of the method essentially depends on the 
type of analysis and of the time available. Minimizing 
five coefficients for thousands of spectra recorded by 
imaging (one image recorded on a 128 x 128 focal 
plane array contains 16,384 spectra) takes about an 
hour if each step can be achieved in 0.2 s. 

Very good looking spectra still contain some water 
vapor residuals which may appear in some conditions. 
Figure 2 presents a good ATR spectrum of a phospholi¬ 
pase A 2 (green line) and a water vapor spectmm 
(blue line) for the sake of visual reference. Fourier self¬ 
deconvolution of the phospholipase spectmm (black 
line) (deconvolution by a Lorentzian, FWHH = 
18 cm”’, apodization by a Gaussian, FWHH = 
9 cm”’) (Goormaghtigh and Ruysschaert 1990) reveals 
features typical of protein amide contribution at 1,679, 
1,656, 1,636, and 1,547 cm”’. The band at 1,515 cm”’ 
can safely be assigned to tyrosine side chains. The band 
at 1,615 cm”’ is usually assigned to amino acid side 
chain contributions. Yet, another deconvolution with 
one parameter changed (red line) (deconvolution by 
a Lorentzian, FWHH = 18 cm”’, apodization by 
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a Gaussian, FWHH = 6.7 cm” *) reveals that many water 
vapor contributions are present and the 1,615 cm” * band 
in particular appears to be strongly related to a water 
vapor contribution, shedding doubts on its potential 
assignment to side chains. 

2. Corrections for buffer molecules and amino acid 
side chain contributions: Buffer molecules (tris, 
phosphate,...) may interfere with the analysis of 
the spectra and can be subtracted. This is not 
straightforward as subtraction of a buffer spectrum 
also subtracts the liquid water contribution. In turn 
a second round of correction for liquid water con¬ 
tribution may be necessary. It is important to 
remember here that the spectrum of water itself 
depends on the solutes present in the solution 
(Goormaghtigh et al. 1994). The problem is eased 
when working on dry or just hydrated films by 
ATR-FTIR. 

When the protein backbone spectrum must be 
examined, contribution from amino acid side 
chains can interfere, some of them being particu¬ 
larly problematic such as arginine. Correction 
for side chain contributions is complex, as it must 
take into account amino acid composition, pH, 
and scaling. Procedures have been reviewed 
previously (Goormaghtigh et al. 1994; Barth 2000; 
Goormaghtigh 2009). 

3. Corrections for Mie scattering: Mie scattering has 
recently become the subject of intense discussions 
in the field of FTIR imaging (Bassan et al. 2009; 
Bird et al. 2010), in particular among the numerous 
groups using a reflection mode for acquisition of 
infrared images. Mie scattering may result in severe 
distortions of the spectra and need correction. The 
references quoted above will give clues about the 
procedures to correct the spectra. 

4. Spectral Rescaling: Spectrum intensity depends on 
the amount of material and in ATR, on the way the 
material is spread on the ATR crystal. In turn, 
scaling is necessary to compare spectra, to average 
spectra, to run principal component analysis (PCA) 
or correlation analyses (see below). Typically, the 
area under a band or several bands will be set to 
a given value. An alternative method is to use the 
entire spectrum for scaling (vector normalization). 

5. Baseline correction: Baseline correction is usually 
required. Sophisticated curved baselines can be 
calculated and are offered by most of the current 
softwares. These baselines provide an easy way to 



Infrared Spectroscopy: Data Analysis, Fig. 3 ATR-FTIR 
spectrum of bovine pancreas phospholipase A2 as recorded at 
a resolution of 2 cm”* (green line) and after second derivative 
(blue line). As customary, the second-derivative spectrum has 
been multiplied by — 1 for an easier comparison with the absor¬ 
bance spectrum 

obtain good-looking spectra but two problems must 
be considered. The first one is that the user usually 
does not know what is calculated. The second one is 
that the computation may be sensitive to features 
that slightly vary from one sample to the next. In 
turn, subtraction of two baseline-corrected spectra 
will superimpose differences between spectra and 
differences between baselines. Until the user has 
a clear understanding of what the baseline is, it 
should be avoided before computing differences. 
Straight baselines are a simple alternative to present 
and examine spectra. 

6 . Derivation: It is common in the field of cell and 
tissue analysis to work on second-derivative spec¬ 
tra. The advantage is that the spectroscopist gets rid 
of the (linear) baselines and can compare more 
easily spectra from different origins or recorded in 
different conditions. Obviously second derivation 
does not add any information to the spectra but, as 
with Fourier self-deconvolution, spectral features 
visually appear more “resolved.” Furthermore, as 
derivatives by definition depend on variations in the 
spectra (the exact number of points and their weight 
depend on the level of smoothing which is system¬ 
atically applied), more stable features can be 
expected. Figure 3 presents the spectrum of phos¬ 
pholipase A2 before (green line) and after (blue line) 
taking the second derivative. The question raised 
above about the assignment of the 1,615 cm”^ band 
appears again here. The second-derivative spectrum 
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Data Analysis, 

Fig. 4 Preprocessing 
summary sheet from the 
Kinetics program written 
under MatLab in the 
laboratory of the author. 
Operations will be processed 
sequentially on all the spectra 
loaded and the corrected 
spectra appear in a new tab of 
the display figure 
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presents indeed the same type of apparent “resolu¬ 
tion” enhancement as the deconvolved spectram 
presented in Fig. 2. 

7. Practical management of several processings: The 
practical challenge is to handle thousands of spectra, 
each of them being subjected for instance to water 
vapor subtraction, apodization to 4 cm” * resolution, 
scaling and second-derivative computation. This is 
usually not conveniently provided by general soft¬ 
wares available with FTIR spectrometers. Several 
groups have developed their own software, usually 
in a MatLab environment, to carry out these various 
processings with one mouse click. An example of 
such an integrated processing sheet is presented in 
Fig. 4. One click on “Execute” will perform all the 
planned tasks on as many thousands spectra as loaded. 

Data Analysis 

The analysis of the spectra typically occurs in 

three steps: 


1. Visual observation of the data: Even when 10^ 
spectra are recorded and must be analyzed, it remains 
necessary to examine visually some selected spectra. 
The best softwares available cannot avoid misinter¬ 
pretation when flawed data are provided. Figure 1 
described earlier reports such an example. One of the 
most powerful tools for meaningful observation of 
the data is difference spectroscopy. 

2. Unsupervised statistical methods: Unsupervised 
methods usually provide a straightforward analysis 
of the variance present in the data. The most popular 
method is Principal Component Analysis (PCA) 
which, upon extracting eigenvectors from the 
covariance matrix of the data, provides a set of so- 
called principal components (PC) which fully 
describes the variance present in the data. The PC 
are orthogonal and sorted according to the amount 
of variance they describe. Usually 3-5 PC describe 
more than 95% of the variance, indicating that any 
of the thousands of spectra present can be described 
almost exactly by 3-5 numbers. Figure 5 illustrates 
the decomposition of a set of 50 protein spectra by 
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Infrared Spectroscopy: 

Data Analysis, Fig. 5 (a) 

Series of 50 ATR-FTIR 
spectra of different proteins 
sorted from [f-sheet rich 
proteins (blue) to a-helix rich 
proteins {red), (b) Mean of the 
spectra presented in (a) (blue) 
and the first two PC, PCI 
(green) and PC2 (red), (c) 
Projection of the spectra 
presented in (a) in the PC 1 
(describing 62% of the total 
variance) and PC2 (describing 
20% of the total variance) 
space. The color of the stars 
refers to the color of the 
corresponding spectrum in 
part (a) 
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PCA. Proteins have been sorted from low ot-helix 
content to high a-helix content. The major source of 
variance (62% of the total variance) is reported by 
PCI (Fig. 5b) which obviously mostly describes the 
difference between a-helix poor and a-helix rich 
proteins. In turn, projection of the 50 spectra on 
PCI (Fig. 5c) essentially separates the spectra on 
the basis of the a-helix content. 

3. Supemsed statistical methods'. Supervised methods 
suppose a training step to build a model that relates 
spectral features to a quality of the sample known 
a priori. In the previous example, it was observed, a 
posteriori, that PCI seems to be related to a-helix 
content but this can hardly be used to quantify 
a-helix content. In supervised methods, a model 
optimizing the relation between the PC or any 
rearrangement of the absorbance data will be built 
to best predict the a-helix content. Figure 6 presents 
a linear regression between PCA scores and a-helix 
content for the data presented in Fig. 5. 

The variety of supervised and unsupervised 
methods is so large that an entire book would be 
necessary to give only a broad overview of the field. 
The examples above aim at illustrating the principle 
rather than providing an exhaustive overview of the 
existing approaches. 
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Infrared Spectroscopy: Data Analysis, Fig. 6 Principal 
Component Regression (PCR) example. A linear relation between 
the scores of the spectra presented in Fig. 5 on the first 3 PC and the 
a-helix content has been obtained by the least square method. In the 
cross-validation mode used, the predicted a-helix content is 
computed from a model built in the absence of the spectrum to be 
analyzed. In turn, 50 different models were built. The spectrum 
numbers are the same as in Fig. 5. The central red line is the best fit 
between predicted and actual a-helix contents. The side red lines 
are located at ± the standard deviation 
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Infrared Spectroscopy: Data Analysis, Fig. 7 Synchronous 
(left panel) and asynchronous (right panel) correlations present 
in the series of 50 protein spectra presented at Fig. 5a. For the 
synchronous correlations presented in the left panel, the arrows 
are located at (from bottom to top) 1,690 cm^’ (P-sheet), 1,655 
cm^* (a-helix), and 1,630 cm^* (P-sheet), 1,580 cm^* 
(unknown), 1,545 cm“* (a-helix), and 1,530 cm“* (P-sheet). 
The main coirelation with their sign (negative, blue 


or positive, red) and assignments are indicated. For the asyn¬ 
chronous correlations presented in the right panel, the dotted 
lines indicate the position of the position of the a-helix and 
P-sheet structures (horizontally) and of an unknown component 
discussed in the text (vertically). To better visualize weaker 
correlations in both panels, level curves have been drawn more 
densely near the zero plane 


2D Correlation Analysis 

Over the last 20 years, 2D correlation, originally devel¬ 
oped in the field of nuclear magnetic resonance, has 
been adapted to infrared spectroscopy. The goal of 2D 
correlation FTIR spectroscopy, first introduced by 
Noda (Noda et al. 1993), is to spread data over 
a second dimension in order to establish correlations 
between pairs of wave numbers. This data processing 
opens the possibility to extract spectral information 
from highly overlapped contributions buried in the 
first dimension. 

2D correlation spectra are usually calculated 
according to the formalism developed by Noda 
(2000) that replaces Fourier transforms with Hilbert 
transforms. Briefly, an average spectrum is subtracted 
from a set of dynamic spectra before cross-correlation 
analysis. From the resulting set of dynamic spectra, 
synchronous, O (vi, V 2 ), and asynchronous spectrum, 
i’ 2 )> are calculated according to 


1 ” . 

./=i 

m m 

t=i 

where Ay(v) is the difference between the spectral 
absorbance at wave number v for a spectrum / and the 
mean absorbance at this wave number (for further 
details, see (Noda 2000), and corresponds to the 
jth row and Hh column element of the discrete Hilbert- 
Noda transformation matrix given by: 

0 , ify = k 

— --, otherwise 

n{k-j) 

Synchronous maps reveal spectral changes that 
occur at a similar pace and asynchronous ones 
point out spectral changes evolving at different rates 
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(Noda et al. 2000; Noda 2000, 2004). Figure 7 illus¬ 
trates the synchronous and asynchronous correlations 
that exist in the amide l-II region of the series of 
protein spectra presented in Fig. 5. The most prominent 
synchronous correlations are labeled on Fig. 7, left 
panel. Two bands, 1,655 and 1,545, can be assigned 
to tx-helix structures, and the 1,690, 1,630, and 1,530 
cm“* bands can be assigned to P-sheet structures. Off- 
diagonal contributions distinctly show the high wave 
number contribution of the P-sheet at 1,690 cm“*. The 
negative sign of the correlation (in blue) also explicitly 
indicates that the tx-helix and P-sheet content varies in 
opposite directions. This is due to the fact that in the 
protein database the sum of the two structure contents 
is rather constant for all the proteins. The 2D correla¬ 
tion analysis obviously reveals the different contribu¬ 
tions of the main secondary structures present in the 
protein database which could not be resolved by 
looking at the original data. Two negative correlation 
bands (1,570/1,640 and 1,585/1,635 cm”') remain 
unassigned. After changing the scaling of the contours 
(not shown), it can be observed that the 1,570 cm”* 
band is negatively correlated with the a-helix 
structure, and conversely, the 1,585 cm”* band is 
negatively correlated with the P-sheet structure. The 
major protein side-chain contributions in this area are 
Asp (ca. 1,579 cm”*) and Glu (ca. 1,560 cm”*). The 
cross-peaks shown on Fig. 7 could reflect the well- 
known predilection of Glu for a-helix structures and 
Asp for p-bends. The presence of a cross-peak in the 
2D asynchronous spectrum indicates that the two 
wave numbers are responding out of phase. The right 
panel of Fig. 7 illustrates that beyond some level of 
correlation, some variations happen out of phase. 
For instance, the 1,655/1,665 cm”* cross-peak indi¬ 
cates some heterogeneity among the a-helix area in 
amide I. It is known that a-helix structures absorb in 
a range of wave numbers according to various struc¬ 
tural parameters, their length in particular. This 
illustrates the particular power of the asynchronous 
analysis to evidence with a high sensitivity the het¬ 
erogeneity of a band composed of different compo¬ 
nents characterized by different behavior. Again an 
unknown contribution is revealed near 1,580 cm”* 
and could be assigned to lateral amino acid side 
chains. 

Further insight into the structure of a series of spec¬ 
tra can be obtained using 2D correlation spectroscopy. 
Three aspects can be further developed: 


(a) The synchronous maps examined here represent 
the covariance matrix of the system. A limitation 
of this approach is that value of absorbance 
changes determines cross-peak intensities. In turn 
highly correlated events can go unnoticed because 
of their weak amplitudes. In order to focus only on 
the quality of the correlation without interference 
from absolute intensity changes, normalization of 
the absorbance at each wave number by the stan¬ 
dard deviation can be carried out. The obvious 
advantage of the normalization is to give each 
wave number a similar weight. 

(b) Examination of the synchronous and asynchronous 
map allows the determination of the sequence of the 
changes. For instance, monitoring amide hydrogen/ 
deuterium exchange by FTIR spectroscopy results 
in complex modifications of the spectra. Simulta¬ 
neous consideration for both map with the so-called 
Noda’s rules (Noda 2007) allow defining which 
contribution exchanges before the other. This can 
be extended to any variation in the sample and had 
been used recently to determine the sequence of 
chemical changes in prostate cancer cells exposed 
to antitumoral dmgs (unpublished data). 

(c) Heterocorrelation between data obtained by two dif¬ 
ferent methods. For instance, lipid extracts from the 
prostate cancer cells mentioned above obtained after 
different time of exposure to anticancer drugs were 
analyzed by both mass spectrometry and infrared 
spectroscopy. Heterocorrelation maps reveal how 
FTIR choline characteristic peak around 1,490 
cm” * is correlated with different peaks of the mass 
spectrum, allowing a rapid identification of the asso¬ 
ciate acyl chains (Gasper et al. 201 1). 


Summary 

Modern recording techniques allow the recording of 
several hundreds or thousands of spectra every day. 
The present challenge is to handle these spectra. 
Corrections for water vapor, smoothing, detection of 
outliers must be automated to be of practical interest. 
A guideline is provided here to help handle data 
processing and analysis when large numbers of 
spectra are produced. The need for specific procedures 
to process and extract the relevant features is 
emphasized. 
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Cross-References 

► Infrared Spectroscopic Imaging 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Infrared Spectroscopy of Protein Reactions 

► Infrared Spectroscopy of Protein Structure 

► Membrane Proteins: Structure and Organization 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 
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Infrared Spectroscopy of Cells, Tissues, 
and Biofluids 

Dieter Naumann 

Robert Koch-Institute, Berlin, Germany 

Synonyms 

Biofluids; FTIR spectroscopy 


Definition 

Modern infrared spectroscopy has been developed into 
a useful biodiagnostic tool for the analysis of cells, 
tissues, and body fluids. Dedicated technologies have 
evolved for rapidly discriminating between diverse 
microorganisms, imaging tissue structures, testing sin¬ 
gle eukaryotic cells, and identifying various disease 
states in humans and animals from body fluids. To 
utilize the characteristic information contained in the 
complex, flngerprint-like infrared signatures of biolog¬ 
ical samples various spectroscopy-specific bioinfor- 
matic tools have been established. 
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Introduction 

Infrared (IR) spectroscopy is a relatively old spectro¬ 
scopic modality that provides pictures of the molecular 
vibrations performed by molecules. It has been only 
for the last two or three decades that this type of 
vibrational spectroscopy has been used systematically 
to investigate the complex building blocks of biologi¬ 
cal systems or, even more recently, intact cells, 
tissues, and biofluids (Diem et al. 2008; Lasch and 
Kneipp 2008; Barth and Haris 2009). The progress 
achieved and the practical applications realized until 
now have definitely disproved the notion that IR spec¬ 
troscopy is an “old-fashioned technology,” and will be 
summarized here. 

Infrared Spectra Are Fingerprint-like Signatures of 
Biological Structures 

Infrared spectra of cells, tissues, or biofluids are the 
expression of the sum of cellular chemistry/biochem¬ 
istry providing information on the total composition of 
complex biological material under investigation and 
thus give snapshots on cell division, differentiation, 
growth, and metabolism. In this view, biomedical 
infrared spectroscopy is an explorative analysis modal¬ 
ity that can be used to diagnose disease or dysfunction 
in cells, tissues, and biofluids via spectral fingerprints 
that may change as an indicator of the presence of 
a particular disease or in response to drug intervention, 
environmental stress, or genetic modification. One 
additional, very attractive peculiarity of vibrational 
spectroscopy is that it also provides diagnostic infor¬ 
mation on structural states of the molecules under 
study, since certain bands are sensitive, for example, 
to the secondary structure in proteins, while others 
report on the state of order of the membranes or the 
conformation of the nucleic acid structures. In this 
sense the total information content in vibrational spec¬ 
tra of biological materials is enormous. 

Figure la shows typical spectra of a microbial, 
a body fluid, and a tissue sample. While exact assign¬ 
ments to discrete molecular compounds are certainly 
not possible, many spectral features can be made vis¬ 
ible for the eyes by applying resolution enhancement 
techniques. In general, 40-60 spectral features are 
resolved by calculating, for example, second deriva¬ 
tives (see Fig. lb). Tentative assignments based on the 
comparison with spectra of the main building blocks in 
whole cells can be derived from Table 1. 


Measurement Techniques for Biomedical Analysis 

A major advantage of IR spectroscopy is that nearly 
any kind of material can be measured and that it is not 
limited to the physical state of the sample. Addition¬ 
ally, there are no principal restrictions to record 
infrared spectra of a given sample under different 
physicochemical conditions concerning temperature, 
pressure, state of dispersion, pH, etc. This is of rele¬ 
vance for biomedical analyses, since it is pertinent to 
test biological specimens under conditions that leave 
the sample’s structures preferentially unperturbed and 
non-disintegrated. 

It has been suggested that biological specimens can 
best be compared when using the traditional absor¬ 
bance/transmission (A/T) or the ► attenuated total 
reflection techniques (ATR). A/T spectra of biological 
specimen can be obtained from liquid solutions, dis¬ 
persions or suspensions, viscous solutions, tissue 
slices, single cells or from the equivalent dried samples 
using suitable IR transparent substrates. Different 
cuvette systems have been designed for IR measure¬ 
ments of biological samples. One technical solution for 
the rapid and automated analysis of many biological 
specimens as dried film samples is given in Fig. 2a 
(Naumann 2000). Figure 2b shows an ATR-cuvette, 
specifically designed for the consecutive measurement 
of several dried or fully hydrated samples deposited on 
one and the same ATR-crystal (Naumann 2000). 

In the literature, the notion is often found that infrared 
spectroscopy is generally not suitable for biological 
materials in aqueous surrounding. Using, however, com¬ 
mercially available, high-pressure stable flow-through 
cells (Fabian et al. 2005) together with sensitive broad¬ 
band MCT detectors and HPLC technologies, the quality 
of spectra from biofluids is excellent in terms of signal- 
to-noise ratio and reproducibility allowing even the cal¬ 
culation of second derivatives (see Fig. 3). Thus, drying 
of the liquid samples on infrared transparent optical sub¬ 
strates is not required, and there is no need to rigidly 
control the relative humidity of the dried samples. Thus, 
unfavorable light scattering at the sample due to 
heterogeneity of the dried film discs or cells can be 
minimized. 

Infrared microscopes coupled to modern IR spec¬ 
trometers and equipped with sensitive MCT-detectors 
pushed the detection limits of infrared spectroscopy 
down to the sub-nanogram level and opened the field of 
spatial resolution and imaging to infrared spectroscopy 
(Wetzel 2008). Using IR microscopes together with 
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Infrared Spectroscopy of 
Cells, Tissues, and 
Biofluids, Fig. 1 (a) Survey 
spectra obtained from 
different biological samples: 

(1) body fluid (synovial fluid); 

(2) microbial sample 
{Staphylococcus aureus)', 

(3) tissue section (from 
Scrapie-infected hamster 
brain), (b) Original 
absorbance spectrum as 
measured from a dried film 
sample of S. aureus with its 
second derivative spectrum 
overlaid 






Wave number / cm-i 


a computer-controlled x,y stage, complete images of 
the spatial distribution of chemical structures in tissue 
slices can be obtained in a mapping experiment. 
Recently, so-called focal plane array (FPA) detectors 
have been implemented in research grade infrared 
microscopes that substitute the single element detec¬ 
tors by a matrix of individual detector elements. These 
FPA detectors made it possible for the first time to 
measure many microscopically small spots in hetero¬ 
geneous biological samples simultaneously without 
the necessity to map step by step over the sample, 
thus paving the way for true infrared imaging of bio¬ 
logical specimen (Wetzel 2008). 

Sampling of Biological Material, Data Acquisition, 
Variability of Biological Specimen, the Problem of 
Reproducibility and Standardization 

The most serious problem when analyzing complex 
biological materials is the enormous diversity and com¬ 
plexity of cell types, tissue structures, or bodyfluids that 
makes it an absolute necessity to test statistically signif¬ 
icant numbers of samples in independent measurements. 
Reproducibility and repeatability of sampling, sample 
preparation, and measurement significantly influence 


data structure of the primary database. Thus, standard¬ 
ization efforts have been stimulated by the necessity 
to exchange spectral data between different laborato¬ 
ries and validate spectral reference databases. For 
microbial identifications, to give an example, stan¬ 
dardized sampling, sample preparation, data acquisi¬ 
tion, evaluation techniques, and rigid quality control 
measures have been described (Naumann 2000; 
Wenning et al. 2008). 

Data Treatment and Evaluation Techniques 

In biomedical infrared spectroscopy, not only a single 
spectrum has to be analyzed at a given time but 
thousands - and in case of spectroscopic imaging - even 
hundred thousands of spectra. Thus, the availability of 
sophisticated data evaluation concepts is a virtual 
necessity. These concepts should ideally include 
efficient data pretreatment algorithms such as quality 
testing, normalization, filtering, and adequate techniques 
for data reduction. The necessity to use multivariate 
pattern recognition methodologies when dealing with 
spectral data of complex biomedical materials has been 
realized already more than 20 years ago (Naumann et al. 
1991). 
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Infrared Spectroscopy of Cells, Tissues, and Biofluids, 
Table 1 Tentative assignment of some bands frequently found 
in infrared spectra of biological materials 


Frequency (cm 

*) Assignment 

-3,500 

0-FI str of hydroxyl groups 

-3,200 

N-H str (amide A) of proteins 

-2,955 

C-H str (asym) of -CH 3 in fatty acid chains 

-2,930 

C-H str (antisym) of > CH 2 

-2,918 

C-H str (antisym) of > CH 2 in fatty acid 
chains 

-2,898 

C-H str of C-H in methine groups 

-2,870 

C-H str (sym) of-CHj 

-2,850 

C-H str (sym) of > CH 2 in fatty acid chains 

-1,740 

>C=0 str of esters 

-1,715 

>C=0 str of carbonic acids 

-1,680-1,715 

>C=0 in nucleic acids 

-1,695 

Amide I band components 

-1,685 

resulting from antiparallel pleated sheets and 

-1,675 

6 -tums of proteins 

-1,655 

Amide 1 of a-helical structures 

-1,637 

Amide I of 6 -pleated sheet structures 

-1,550-1,520 

Amide 11 

-1,515 

Tyrosine 

-1,468 

C-H def of > CH 2 

-1,400 

C =0 str (sym) of COO^ 

-1,310-1,240 

Amide III band components of proteins 

-1,250-1,220 

P=0 str (asym) of > PO 2 ’ phosphodiesters 

-1,200-900 

C-0, C-C str, C-O-H, C-O-C def of 
carbohydrates 

-1,090-1,085 

P=0 str (sym) of > P 02 ^ 

-720 

C-H rocking of > CH 2 in fatty acid chains 

-900-600 

“Fingerprint region” 

str stretching, 
antisymmetric. 

def deformation, sym symmetric, asym 


Out of a large number of multivariate analysis tech¬ 
niques available, ► principal component analysis 
(PCA), hierarchical cluster analysis (HCA), and ► arti¬ 
ficial neural network (ANN) analysis have been fre¬ 
quently used in the literature (Diem et al. 2008; Lasch 
and Kneipp 2008; Barth and Haris 2009). Meanwhile 
nearly the whole arsenal of multivariate bio-informatic 
techniques is being employed and multivariate statis¬ 
tical analysis of spectroscopic data constitutes an 
own discipline within the realm of biomedical 
spectroscopy. 

Completely new approaches to data analysis were 
recently proposed using genetic algorithms in combina¬ 
tion with partial least square regression (PLSR) analysis 
techniques to correlate genes from gene expression pro¬ 
files (obtained by micro-array technologies) to metabolic 


markers (obtained from spectral data by IR spectros¬ 
copy) from the same samples. Covariance patterns in 
these very complex mixed data sets helped to rapidly 
recognize and visualize the interrelationships and trends 
in a developing and changing biological system that is 
not easily achieved by any other means (Kohler et al. 
2008). 

Characterization of Microorganisms 

Identification of microorganisms is a major task in 
medical, food, and environmental microbiology. In 
clinical microbiology, the unambiguous identification 
of a microorganism determines its pathogenic potential 
and is a crucial prerequisite for efficient therapy. It has 
been convincingly shown in the literature, that IR 
spectroscopy can be used to rapidly classify and iden¬ 
tify bacteria and yeasts and has developed into an 
attractive alternative to conventional techniques 
(Wenning et al. 2008). Figure 4 gives an example of 
its species and sub-species differentiation capacity 
using hierarchical cluster analysis. 

The main advantages of FT-IR spectroscopy in 
microbiology are extreme rapidity compared to con¬ 
ventional techniques, uniform applicability to very 
diverse microorganisms, and a high specificity that 
allows differentiations even down to subspecies level. 
Drawing upon the experience obtained to date, the 
serial type of a dedicated instrument for IR-based 
routine identifications of microorganisms is already 
available together with spectra reference libraries 
from an IR spectrometer producing company. 

Analyzing Biofluids 

Biological fluids (blood, serum, synovial fluid, amni- 
otic fluid, or urine) are routinely analyzed in clinical 
chemistry to identify ongoing pathological processes 
in the human body because biochemical changes asso¬ 
ciated with different diseases influence the composi¬ 
tion of body fluids and such fluids can be easily isolated 
from the body. 

When analyzing infrared spectra of biofluids, one 
possibility is to obtain quantitative sample informa¬ 
tion, typically the concentration of analytes such as 
cholesterol, glucose, urea, or albumin in human 
serum, or of glucose, lactate, and lipids in amniotic 
fluid, to give two examples (Shaw et al. 2008). The 
results obtained so far on various different biofluids 
demonstrate that mid-infrared spectroscopy in con¬ 
junction with sophisticated multivariate data analysis 
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Infrared Spectroscopy of Cells, Tissues, and Biofluids, 
Fig. 2 (a) FT-IR instrument IPS 28/B from Bruker Optics 
Inc. {bottom) and the technical drawing of a cuvette useful for 
measuring dry film samples {top), (b) Multisample ATR-cuvette 
for measuring dried or hydrated samples {bottom), and the 
technical drawing of this device {top). {1) ATR-crystal 


(single reflection, 8 cm long) with 10 marked sample positions S; 
(2) demountable cartridge carrying the ATR-crystal; (5) sliding 
carriage for moving the ATR-crystal through the IR-beam to 
measure the samples S; {4) cartridge drive; (5) housing with 
mirror optics; (6) IR-beam 


could become a novel practical application in clinical 
laboratories. This approach would beneht from its 
reagent-free way to get quantitative data on several 
analytes simultaneously from one single spectrum 
and its inherent speed. 

Another possibility relies on a more qualitative data 
analysis concept. This approach postulates that char¬ 
acteristic disease related compositional and/or struc¬ 
tural changes in body fluids cause a multiplicity of 
reproducible spectral alterations. It is assumed that 
the sum of these changes constitutes another spectral 
fingerprint which is characteristic of the disease. The 
classification of disease-related spectral pattern has 
been called “diagnostic pattern recognition” (DPR) 
(Petrich 2008). This analysis concept has been 


successfully applied in the last decade to establish an 
IR-based modality for the diagnosis of diabetes 
mellitus, metabolic syndrome, and rheumatoid arthritis 
and transmissible spongiform encephalopathies from 
serum (Lasch et al. 2000, 2003; Petrich 2008). 

Characterization of Eukaryotic Cells and Tissues 

The most important step forward in biomedical infra¬ 
red spectroscopy is certainly the advent of high-quality 
infrared microscopes which allowed to obtain spectral 
information on single cells and to achieve label-free 
imaging of tissues or even cells in a way that is familiar 
to cytologists or histopathologists. Today, bench-top 
instrumentation for routine IR imaging of diseased 
tissue sections is available. Equipped with focal plane 
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Infrared Spectroscopy of Cells, Tissues, and Biofluids, 
Fig. 3 (a, left panel), FT-IR cell from Microbiolytics GmbH, 
Germany; the cell has a path-length of approximately 7 pm, the 
optical material is CaF 2 . (a, right panel). Schematic diagram of 
the apparatus used for measurement of liquid serum samples: 
Injection port in fill position and stop-flow valve in flow position. 


sample is injected by the HPLC-pump into the flow-through cell, 
after switching the injection port valve into the volume stream, 
(b), FT-IR spectra obtained with the device of Fig. 3a, overlaid 
spectra from three independent “shots” obtained with a liquid 
serum sample after subtraction of a water spectrum, (c). Overlaid 
second derivatives of the same three spectra shown in Fig. 3b 


array detectors for mid-infrared imaging rapid seg¬ 
mentation of histological structures without any tissue 
staining and imaging of larger cells is possible under 
conditions of a routine laboratory. 

Spectral variations in single eukaryotic cells 
within the cell cycle have been investigated in 


depth by M. Diem and co-workers (Romeo et al. 
2008). The same group also published stimulating 
IR microspectroscopic data on cultured and exfoli¬ 
ated human cells including cancer, oral mucosa, 
canine, cervical, urothelial, giant sarcoma and fibro¬ 
blast cells, and detailed descriptions of technical 



































Infrared Spectroscopy of Cells, Tissues, and Blofluids 


1063 




Infrared Spectroscopy of Cells, Tissues, and Biofluids, 
Fig. 4 (a). Dendrogram of a hierai'chical cluster analysis 
perfoimed on 19 strains belonging to seven different species of 
the genus Enterococcus (with four independent measurements on 
each of the 19 different strains); clusters: (1) E. faecium 
(2) E. durans (5) E. hirae (4) E. faecium (5) E. faecalis 
(6) E. galUnarum (7) E. casseliflavusQ. (b), Spectral typing of 20 
closely related strains and isolates of the species Enterococcus 
feacium. Cluster analysis was performed using the first derivatives 
and considering the spectral ranges: (a) 1,500-1,200, 1,200-900, 
and 900-700 cm^* and (b) 1,200-900 cm^'. Ward’s algorithm 
was used for clustering the spectra. For calculating the distance 
matrix the so-called D-value (Naumann 2000) was used as distance 
measure. All spectra have been measured from dried film samples 
using the multisample cuvette shown in Fig. 2a 

details such as cell cultivation and sample prepara¬ 
tion for FT-IR measurement, data acquisition, and 
data evaluation. 

Segmentation of Histological Structures by 
Multivariate Imaging 

Infrared micro-spectrometry can be used to map the 
relative absorbances of particular functional groups 
across a sample. In this way chemical information is 


obtained on the molecular stmcture for which the func¬ 
tional group stands. That is why this approach was given 
the name chemical mapping or chemical imaging. 

Another option is the IR microspectroscopic mapping 
over a tissue section to attain segmentation of histolog¬ 
ical stractures. IR imaging using infrared microscopes 
equipped with single detector elements or FPA detectors 
has been successfully applied to various tissues and 
a considerable number of applications have been 
published in the last 10 years which address the problem 
of how to achieve segmentation of histological and par¬ 
ticularly diseased structures in tissues by multivariate 
imaging methodologies (Lasch et al. 2006; Bhargava 
et al. 2007; Bhargava and Levin 2008). 

The functional group or chemical mapping 
approach works nicely to get a quick overview of 
histological structures in a tissue slice without the 
necessity to stain or label the tissue sample. The iden¬ 
tification of diseased tissue structures in a complex 
environment of different other histological structures 
is, however, a more demanding problem. Indeed, the 
chemical imaging approach generally does not lead to 
a precise segmentation of histological structures in 
most cases. To improve discrimination capacity and 
enhance image contrast, multivariate data analysis 
techniques turned out to be more efficient (Lasch 
et al. 2006; Bhargava et al. 2007; Bhargava and 
Levin 2008; Diem et al. 2008; Lasch and Kneipp 2008; 
Barth and Haris 2009). One possibility is to use cluster 
analysis methodologies to achieve segmentation of 
histological structures as exemplarily shown in Fig. 5. 

Objective Identification of Histological Structures 
Based on Spectral Reference Data Bases 

Multivariate techniques such as PCA and HCA can be 
used to achieve segmentation of histological structures 
as is shown by the false color image in Fig. 5b. Ulti¬ 
mately, however, the objective identification of histo¬ 
logical structures in tissue samples of unknown patient 
material necessitates image reconstruction on the basis 
of a spectra reference database containing ideally spec¬ 
tra of all relevant tissue structures. 

There are only two well-documented proof of con¬ 
cepts on the examples of colon tissue (colon adenocar¬ 
cinoma) and prostate tissue (prostate cancer) in the 
literature, suggesting that objective identification of 
tissue structures is indeed feasible (Lasch et al. 2006; 
Bhargava et al. 2007). A test data base of colon refer¬ 
ence spectra was created by selecting carefully spectra 
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Infrared Spectroscopy of Cells, Tissues, and Biofluids, 
Fig. 5 IR image reconstruction for a hamster brain tissue thin 
section using multivariate clustering. IR microspectroscopic 
mapping (lateral resolution approximately 60 pm) yielded 
more than 900 spectra, subsequently subjected to PCA. 
(a) Dendrogram of a hierarchical cluster analysis with the 
spectral distances calculated in PCA space and using the first 


three factor loadings, (b) Nine clusters can be defined which 
contain the spectra corresponding to nine different histological 
structures. Nine different images each reconstructed from the 
pixel spectra of the nine clusters, respectively. The nine cluster 
images belong to the following main histological structures: 
1 to stratum ganglionare; 2 and 3 to stratum moleculare; 
4, 5, and 6 to stratum granulosum; 7, S, and 9 to substantia alba 


from thousands of spectra obtained from individual 
patient tissue samples, which were then assigned 
to individual histological structures with the help 
of a trained histopathologist and compiled into 
a reference data base for subsequent identification 
of histological structures in tissue slices of unknown 
patient samples. These spectral reference data were 
then used to train modular ANNs for objective identi¬ 
fication of the various tissue structures and the adeno¬ 
carcinoma in particular (Lasch et al. 2006). 

Summary 

The most important step forward in biomedical infra¬ 
red spectroscopy is the availability of high-quality 
FT-IR microscopes to obtain diagnostically useful 
information at the single cell level and to achieve 
label-free segmentation of tissue structures in a way 


that is familiar to cytologists or histopathologists. 
Equipped with focal plane array detectors rapid and 
objective segmentation of histological structures with¬ 
out any tissue staining and imaging of larger cells is 
possible under conditions of a routine laboratory. 

The attractiveness of FT-IR spectroscopy in 
microbiology are extreme rapidity compared to conven¬ 
tional techniques, uniform applicability to very diverse 
microorganisms, and a high specificity that allows dif¬ 
ferentiations down to the subspecies level. Drawing upon 
the experience obtained to date, dedicated FT-IR instru¬ 
mentation for routine characterizations of microorgan¬ 
isms is already available in the market. 

Cross-References 

► Infrared Spectroscopy: Data Analysis 

► Near-Infrared Spectroscopy in Biological 
Molecules and Tissues 
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► Vibrational Spectroscopy with Neutrons 

► Vibrational Spectroscopy, A Short History of 

► Vibrational Studies of Structure of Protein?Ligand 
Complexes 
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Infrared Spectroscopy of DNA 

Anthony William Parker* and Susan Jane Quinn^ 
'Central Laser Facility Research Complex at Harwell, 
Science & Technology Facilities Council, Rutherford 
Appleton Laboratory, Didcot, Oxfordshire, UK 
^School of Chemistry and Chemical Biology, 
University College Dublin, Dublin 4, Ireland 

Synonyms 

DNA; Infrared spectroscopy of nucleic acids; 
IR spectroscopy of DNA 

Definition 

The absorption of infrared light by DNA and its com¬ 
ponents provides rich information regarding its struc¬ 
ture and function and exotic time-resolved laser-based 
methods give solution phase dynamic behavior. 

Introduction 

Infrared (IR) studies have contributed to a wealth of 
information on the structural properties of nucleic 
acids (see also entry ► Machinery of DNA Replica¬ 
tion). IR provides complementary information to X- 
ray crystallography (see also entry ► X-Ray Diffrac¬ 
tion and Crystallography of Oligosaccharides and 
Polysaccharides), NMR (see also entry ► NMR 
Methods for Kinetic Analysis) and of course the sister 
technique of Raman spectroscopy (see also entry 
► Raman Spectroscopy) and several review articles 
give both a historical and technical perspective from 
the simple bases to complex polymorphic systems 
of DNA (Zhizhina and Oleinik 1972; 
Taillandier and Liqueir 1992; Banyay et al. 2003). 
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Infrared Spectroscopy of 
DNA, Fig. 1 (a) Nucleic 
acids and (b) infrared bands of 
polynucleotides and aqueous 
solvent 
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Recent developments in time-resolved infrared spec¬ 
troscopy, ► lasers, detectors and computational 
techniques have furthered our understanding of the 
time-dependent chemistry of DNA. 

Ground State Infrared Spectroscopy 

While the IR spectral region covers downward of 
3,400 cm“\ nucleic acids are typically considered 
between 1,800 and 700 cm”*, which comprises 


spectral regions associated with distinct structural 
components, from bases to sugar groups to phosphate 
stretches (Fig. 1). 

IR measurement of solids, liquids, gels, hbers, etc., 
is able to provide structural information on DNA, such 
as characterizing the double helix properties, under 
varied conditions. The strong absorbance at near 
1,640 cm“* of water (V 2 bending mode), necessitates 
the use of D 2 O which results in slight spectral shifts of 
the nucleic acid base modes due to exchange of labile 
protons. However, H 2 O has been used in the 
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1,300-1,575 cm“* region where IR frequencies asso¬ 
ciated with base-sugar bending modes sensitive to 
glycoside torsion angles occur. Furthermore, the 
3,000-4,000 cm“* region yields information on the 
solvation of the OH and NH 2 groups. 

In D 2 O, the 1,800-1,500 cm“* region is dominated 
by base vibrations from in-plane and double-bond 
stretching vibrations, including C=C, C=N and 
C=0. The electron-rich character of the 7t-double 
bonds means the IR signatures are sensitive to changes 
in base stacking and base pairing, as well as solvent 
interactions and binding to metal ions (Kornilova et al. 
1998; Hackl et al. 2005). The concomitant spectral 
shifts and changes in band intensity allow structural 
and conformational changes to be monitored. For 
example, upon Watson-Crick pairing of guanine to 
cytosine, the guanine C^—O^ vibration shifts from 
1,673-1,660 cm“^ to 1,689-1,678 cm“' with a simul¬ 
taneous decrease in intensity of the guanine ring vibra¬ 
tion doublet ate. 1,590-1,564 cm” . This sensitivity of 
IR spectroscopy to nucleic acid structure is exploited 
to (a) distinguish different secondary and tertiary 
forms of DNA (including A, B, and Z forms); 
(b) identify marker bands for novel structures such as 
triplex, quadruplex, and i-motif DNA; (c) report on 
how local environment influences structure; and 
(d) relate ligand-binding interactions. 

(a) A key application of IR spectroscopy has been to 
identify marker bands correlated to different DNA 
backbone structures, namely, A-, B-, and Z- DNA con¬ 
formations. One such marker band is the antisymmetric 
phosphate (PO 2 ”) stretch which is observed at 
1,215 cm”* in Z-form, 1,225 cm”* in B-form, and 
1,240 cm”* in A-form. The symmetric phosphate 
stretch at c. 1,090 cm” * is quite insensitive to the helical 
form present and acts as an internal standard for quan¬ 
titative analysis. Sugar-puckering modes between 865 
and 820 cm”* can be used to determine the relative 
amounts of N- and S-type sugars (Lindqvist and 
Graslund 2001). Furthermore, the melting of double¬ 
strand to single-strand DNA is readily monitored by 
a decrease in 1,690-1,680 cm”* band intensities (gua¬ 
nine C(,=Os stretch and C 2=02 stretch of thymine/ 
uracil) and an increase of band intensities at c. 1,680- 
1,650 cm”* (from the C=0 modes of the “free” bases). 
The ratio in the intensities of the two band sets allows 
the dynamics of the melting process to be followed 
(Banyay et al. 2003). The vibrations between 1,500 and 
1,250 cm”* are sensitive to glycosidic bond rotation. 


backbone conformation, and sugar-puckering modes 
localized to the base and base-sugar entities, and have 
been used to determine the presence of anti- or syn- 
conformations (Banyay et al. 2003). 

(b) Triple helical structures are of particular biological 
relevance as they are analogous to RNA-binding double- 
stranded (ds) DNA structures in vivo and have been 
implicated in inducing inhibition of the key DNA pro¬ 
cesses of replication, transcription, and recombination 
(Banyay et al. 2003). The association of a third base 
results in perturbation of the in-plane double-bond 
stretching vibrations of nucleic acid bases in the double 
helix. For example, the formation of the rC - 1 - *rG.rC base 
triplets (note: two types of triple helices may be formed, 
a pyrimidine*purine.pyrimidine or a purine*purine. 
pyrimidine, the asterisk denotes the third (Hoogsteen or 
reverse Hoogsteen) strand while the (.) corresponds to 
Watson-Crick base pairing of the initial duplex, the r is 
for ribose sugar as in RNA) as in poly rC - 1 - *poly rC.poly 
rC, gives a high wavenumber absorption assigned to 
a € 2=02 carbonyl vibration of protonated cytosine 
involved in Hoogsteen-type base pairing with guanine 
in an iC.rC Watson-Crick duplex. IR marker bands have 
also been identified for quadruplex and i-motif structures. 

(c) Condensed DNA in the cell nucleus is tightly 

packed, representing an enormous feat of polymer engi¬ 
neering. This arrangement is mimicked in solution for 
high concentrations of DNA in the presence of positive 
divalent and trivalent metal ions, which stimulate close 
packing of DNA around the ions through neutralization of 
the negative charges on the phosphate groups. The large- 
scale chiral structures are amenable to vibrational circular 
dichroism investigations. Cr^'*' ions in the presence of calf 
thymus (ratio of 0.36 [Cr]/[P]) causes only small IR 
changes in the nitrogen base and sugar-phosphate regions 
(1,086 cm”*) and carbonyl absorption regions 

(Andrushchenko et al. 2002). However, the vibrational 
circular dichrosim (VCD) spectmm reveals a four-times 
increase in amplitude of the main VCD couplets and this 
correlates with electronic circular dichroism (BCD) v|/ 
spectra, where v|/, FS/, is an acronym for v|/-DNA, Poly¬ 
mer and Salt-Induced DNA condensation produced in 
high-concentrated solutions of monovalent salts (NaCl). 
VCD spectra shows DNA condensation and formation of 
large and dense particles of regularly arranged double¬ 
helical DNA molecules of a size comparable to the wave¬ 
length of the probing IR beam (for VCD see also entries: 

► Vibrational Circular Dichroism of Biopolymers and 

► Polarized Infrared Spectroscopy). 
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(d) IR has also been used to characterize groove¬ 
binding molecule interactions. This work is challeng¬ 
ing as it requires deconvolution of any overlapping 
ligand vibration from that of the nucleic acid bands 
(Taillandier and Liqueir 1992). 

Work performed on 10 K solid samples in the far- 
infrared spectral region, 16-130 cm“* (0.5^.0 THz) 
identified a distinct set of resonances of the four 
individual bases which relate to vibrational motion 
of hydrogen bond systems responsible for the arrange¬ 
ment of the molecules in the microcrystalline 
structure. For the individual nucleosides (Fischer 
et al. 2002), these absorptions give additional narrow 
resonances at low frequencies tentatively assigned to 
sugar group modes (see also entry ► Far-Infrared 
Spectroscopy of Proteins). 

2D-IR 

Conceptually, 2D-IR is similar to 2D-NMR with the 
major difference stemming from the ultrafast time- 
scales observable through the optical time domain 
(femtoseconds) as opposed to magnetic resonance 
(microsecond) (Hochstrasser 2007; Hunt 2009) (see 
also entry ► Nonlinear IR Spectroscopy of Proteins). 
The “normal modes” of nucleic acid bases are heavily 
influenced by mechanical motions, for example, hydro¬ 
gen bonds and electrostatic couplings between the 
transition dipoles of adjacent bases (both intra- and 
inter-strand), and with solvent molecules. These cou¬ 
plings lead to anharmonicities that perturb the vibra¬ 
tional signatures. Thus, the ability to pump a specific IR 
band and follow the changes in other IR modes by 2D 
infrared spectroscopy allows molecular interactions to 
be profiled across the entire structural framework (see 
also entry ► Hydrodynamics of Nucleic Acids: Model¬ 
ing Overall Conformation and Dynamics). 

The interaction of DNA with surrounding water 
molecules can be investigated using ultrafast vibra¬ 
tional spectroscopy and the dynamics of hydrated 
DNA using this approach has been recently reviewed 
(Szyc et al. 2010). Highlighted therein was the TRIR- 
IR (time-resolved pump-probe experiment with IR 
pump and IR probe pulse) study of the self¬ 
complementary 23 base sequence 5'-T(TA)io-TT-3', 
in the 3,000-3,700 cm“* region, under different solva¬ 
tion conditions. The hydration level of high optical 
quality thin films of the base sequences was controlled 


carefully between 0% relative humidity (rh) (two water 
molecules per base pair) and 92% rh (20 water mole¬ 
cules per base unit) using a specialized IR humidity 
cell connected to an aqueous reservoir. At 92% rh the 
spectra contain an additional broad spectral feature due 
to the presence of the surrounding water molecules that 
interact with the 23 base sequences. The experiment 
probed the interaction between DNA modes and bound 
water molecules by pumped (exciting specifically) NH 
and NH 2 stretching bands of thymine and adenine 
(away from the absorption from OH water stretches) 
at 3,200 and 3,350 cm“\ respectively and probing the 
OH stretching mode of tightly bound water at 
3,075 cm“* as distinct from the broad OH stretching 
IR absorption which spans the entire spectral window 
and the bleached base modes initially pumped (see 
Fig. 2). Varying the time delay between the pump 
and probe laser pulses from 0 to 1,200 fs yields 
a cross-peak in the 2D experiment arising due to the 
coupling between the base and water modes. The 
experiment showed that the vibrationally excited states 
of 0% rh samples recover (i.e., the IR negative bands 
grow back) on the 500 fs timescale, however, at 
92% rh, the kinetics obtained from analysis of the 
3,075 cm“* region (OH stretching mode) shows 
a two-component decay due to the contribution of the 
NH/NH 2 stretching modes, in which IR absorption 
band is shifted to a lower wavenumber at higher rh 
because of hydrogen bonding to water molecules. By 
switching the IR pump excitation to an OH stretching 
absorption of water at 3,550 cm“\ the inverse interac¬ 
tion of water with the NH/NH 2 base-stretching modes 
was also interrogated. Because only water modes that 
couple to base NH/NH 2 vibrations are detected in this 
experiment, only water molecules in the hydration 
shell of DNA are monitored and bulk water does not 
contribute. Here for 0% rh, the v = 1 excited state of 
the OH stretching mode has a 500 fs lifetime which is 
longer than in bulk water (200 fs), for 92% rh there is 
faster recovery, on 200 fs, associated with secondary 
water layers not in direct contact with the base. Szyc 
et al. (2010) went on to investigate the way excess 
energy cascades through the DNA structure, funneling 
from higher to lower vibrational modes and outward to 
solvent molecules via phosphate-water interactions. 
This involved IR pumping the NH stretch (3,190 cm“^) 
and monitoring the changes to the Vantisymn,etric (P 02 )~ 
mode. At low hydration levels, the water molecules are 
held in a rigid geometry to DNA’s phosphate backbone 
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Infrared Spectroscopy of DNA, Fig. 2 Schematic represen¬ 
tation of the linear hydrogen hond of an N-H donor group and an 
oxygen (solvent) acceptor atom with the potential-energy dia¬ 
gram of an anharmonic oscillator with quantum states v = 0, 1, 
and 2. The experiment pumps one region of the IR spectra {black 
upward arrow, v = 0 —> 1) and probes (red arrows) at both 
frequencies of the v = 0 —> 1 and v = 1 —> 2 transitions. The 
kinetics obtained are given below (b-i) illustrate the change of 
absorbance AA at a probe photon energy of Epr = 3,075 cm“* 
after excitation of the DNA oligomers at Eex = 3,250 cm“* 
plotted as a function of pump-probe delay. Data are for 0% rh 
(solid circles) and 92% rh (open circles, parallel polarizations of 
pump and probe), (b-ii) Corresponding data for a probe photon 
energy of Epr = 3,205 cm“’ (AA values scaled for comparison 
of 0% and 92% r.h.) (Adapted from (Szyc et al. 2010)) 

and rotation of water molecules is suppressed. At higher, 
fully hydrated levels, moving toward bulk water the 
water shell is the main heat sink, energy transfer occur¬ 
ring in the femtosecond timescale while the transfer of 
energy within DNA occurs on a 20 ps timescale. Heating, 
however, reduces the efficiency of the process through 
a lessening of water-to-phosphate bonding. The work 


gives unique insights into the timescales of interactions 
of DNA and water through the vibrational manifold. 
Ultrafast infrared spectroscopy has also been used to 
monitor thermal rises in temperature brought about fol¬ 
lowing UV excitation (Schreier et al. 2007). 

Recent computational studies have aided interpre¬ 
tation of IR and 2D-1R spectra (Taniguchi and Saito 
2009) calculating IR frequencies in close agreement to 
experimental values obtained in solution (see also 
entry ► Quantum Mechanical Simulations of Biopoly¬ 
mer Vibrational Spectra). Such work builds on previ¬ 
ous theoretical studies (e.g., Miyamoto et al. 2006). 
Coupling between vibrations on different bases alters 
the vibrational spectrum of DNA with respect to that of 
the individual bases. In a comprehensive study (Lee 
et al. 2006a, b, c, 2007) the 1,400-1,800 cm“^ region 
has been analyzed to elucidate structural fluctuations 
and base pairing in nucleic acids. The vibronic cou¬ 
pling constants for both Watson-Crick pairs and mul¬ 
tilayer systems were calculated using a Hessian matrix 
reconstruction (HMR) method (using DFT geometry 
optimization and B3LYP/6-31 G* DFT method) and 
showed H-bonding interactions between the bases to 
be the most influential for mode coupling (Lee et al. 
2006a). Focusing on eight fundamental base modes, 
the study showed that the normal modes of base pairs 
(G:C and A:T) and multilayer base stacks could be 
described by linear combinations of the fundamental 
modes of individual bases. A follow-up article replaced 
labile protons with deuterium, allowing comparison 
with experimental data taken in D 2 O (Lee and Cho 
2006b). The exciton coupling model was able to emu¬ 
late experimentally determined IR spectra of double- 
stranded poly(dG):poly(dC) and poly(dA):poly(dT). 
For the GC pair, the C=0 stretching vibrations of 
G and C couple and two C=0 stretching normal vibra¬ 
tions with frequencies of 1687.6 and 1666.4 cm“* were 
identifled and assigned to the antisymmetric, GsCs(—), 
and symmetric, GsCs(-i-), modes, respectively. 2D-IR 
photon echo spectra of DNA oligomers have also been 
simulated (Lee and Cho 2006c). The IR and 2D-IR 
spectra of A-, B-, and Z-forms of DNA have also 
been simulated with the aim of identifying key marker 
IR bands associated with each structural form in the 
1,400-1,700 cm”' region (Lee and Cho 2007). The 
antisymmetric stretch for GsCs(—) corresponding to 
the highest frequency carbonyl stretch increases in 
frequency in the order A-form > B-form > Z-form. 
To illustrate the way 2D-IR can interrogate DNA 
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structure Fig. 3 shows spectra of d(GC)8, in the B-form 
secondary structure, recorded using a 4 fs pulse 
sequence with differing relative polarizations Firstly, 
Fig. 3a gives the linear IR spectra where the peak labels 
correspond to the cytosine ring mode (A), cytosine 
carbonyl modes (B and C), and guanine carbonyl (D). 
Figure 3b, c has differing relative polarizations of the 
IR pulse sequence 2b all parallel, 2c two pulses at 90° 
to the other two. Peaks E-H, which appear above and 
below the diagonal, are cross-peaks produced by cou¬ 
pling across guanine and cytosine. The diagonal peaks 
provide information to quantify how the guanine and 
cytosine interact with one another. For example, the 
difference in intensity between all parallel polarization 
and orthogonal polarization pulse sequence provides 
direct information on the relative angle between the IR 
transition dipoles for peaks B and C indicating they 
are orientated roughly orthogonally to one another. 
Figure 3d applies four-pulse sequence with 45°, 
—45°, 90°, 0° relative polarizations as a means to rela¬ 
tively suppress the diagonal peaks and enhance the 
cross-peaks to reveal the extent of the frequency shifts 
of the diagonal peaks and the corresponding intensity 
changes of cross-peaks for the three structures used to 
monitor structural exchange in solution. The right-hand 
side of Fig. 3 gives simulated spectra permitting the 
diagonal anharmonicities, magnitudes of the transition 
dipoles, and bonding angles to be estimated. 

Further 2D-IR spectroscopy studies of the carbonyl 
vibrational modes of guanine and cytosine bases have 
compared both A- and B-forms of DNA with regard to 
understanding the influences that secondary structure 
has on IR spectra of bases (Krummel and Zanni 2006). 
Studies on dCGsCs) and d(GC)g find that hydrogen- 
bonded guanine/cytosine base pairs are primarily elec¬ 
trostatic in origin. The workers have also characterized 
the intra-strand and inter-strand coupling of base vibra¬ 
tions for polydGdC systems (Krummel et al. 2003). 
2D-IR spectroscopy has also been shown to be able 
to follow DNA melting (Greetham et al. 2010) (see 
also entry ► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation). 

Time-Resolved Infrared Spectroscopy 

Intermediates and Electronic Excited States 

DNA, as the repository of our genetic code, requires 
resilience to chemical modification (see also entry 


► DNA Repair and Recombination). The most studied 
form of this change is photochemical and occurs across 
a wide range of timescales (picosecond to millisec¬ 
onds). In the last decade, TRUV-IR (time-resolved 
infrared spectroscopy with UV pump pulse and IR 
probe pulse) spectroscopy has investigated the forma¬ 
tion of excited state species, and generation of radical 
and anion intermediates through photochemical reac¬ 
tions. A summary article of TRIR studies on nucleic 
acid base systems up until 2009 is Towrie et al. (2009) 
and on the latest instrumental capabilities for TRIR, 
Greetham et al. (2010). One present limitation of this 
technique lies in the lack of availability of infrared 
diode array detectors that are sensitive below 
1,000 cm~* (see also entry ► Kinetics: Overview). 

TRUV-IR studies began with the investigation of 
the individual nucleic acid bases, nucleosides, and 
nucleotides following 267 nm excitation. In the case 
of 5'-dAMP, 5'-dGMP, and 5'-TMP, relaxation of IR 
modes occurred between 3 and 7 ps, which is too long 
to be attributed to an electronic excited state which for 
these mono-nucleosides is sub-picoseconds and 
assigned, therefore, to rapid vibrational relaxation in 
ground electronic excited state. The IR experiment 
captures and temporally resolves the extremely rapid 
energy propagation from the initially created excited 
singlet state to produce vibrationally hot ground 
state modes (u>l) via internal conversion. However, 
for 5'-dCMP and 5'-dCytidine, but not for the 
corresponding base, an additional transient band at 
c. 1,570 cm“^ is observed with a much longer lifetime 
of 33 ps and this is tentatively assigned to the InNtt* 
electronic excited state, (Quinn et al. 2007) see Fig. 4. 
This l,570-cm“* cytidine marker band has been useful 
for studying how lifetimes of excited states are 
influenced by base stacking and/or hydrogen bonding 
(Doorley et al. 2009). One of the conclusions from 
time-resolved IR experiments like this is that it directly 
quantifies the ultrafast internal conversion of the 
DNA bases, as seen through the eyes of the vibrational 
spectral fingerprint. This is a fundamental property 
of nucleic acid bases and what makes them so 
photostable, which is an essential requirement for the 
safe, long-term storage of genetic information. 

TRUV-IR was used to follow the reaction of two 
neighboring thymine groups in poly(dT) and (dT) 2 o, to 
form a cyclobutane dimer (Schreier et al. 2007). This 
UV-activated reaction is one of the major causes of 
photodamage of DNA occurring with a yield of ca. 3% 
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Infrared Spectroscopy of DNA, Fig. 3 Linear 2D-IR spectra 
of d(GC)8. Left Experimental and right calculated, (a) Linear-IR 
spectram; (b) 2D-1R spectrum, with relative polarizations of 
(0‘’,0°,0“,0°); (c) 2D-IR spectrum with relative polarizations of 



cog (cm 


(0°,90“,90°,0°). (d) 2D-IR spectrum with relative polarizations 
of (45°,—45°,90°,0°) (Reproduced from (Krummel and Zanni 
2006)) 























I 


1072 


Infrared Spectroscopy of DNA 


Infrared Spectroscopy of 
DNA, Fig. 4 Time-resolved 
IR difference spectra obtained 
upon UV excition of 

(a) 20 mM 5'dCytidine and 

(b) 20 mM Cytosine both in 
50 mM phosphate buffer pH 7 
with absorption spectra below. 
Delays at —50, —25 (green), 2, 
3, 4, 5, 6.5, 8 (red), 10, 12.5, 
15, 20, 35, 50, 100, 150, 200, 
500 and 1,000 ps (black). The 
black arrows indicate 
direction of intensity change 
with time (Taken from (Quinn 
et al. 2007)) 


a 0.0010 


- 0.0020 

1400 
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and has long been speculated to proceed through 
a triplet state. Ultrafast 2D-1R experiments performed 
by Schreier et al. showed the thymine dimer is formed 
in less than 1 ps - a timescale too fast for reaction via 
the triplet manifold. This conclusion was reached by 
comparison of the behavior of 5'-TMP with that of 
(dT)i8 in the 1,300-1,500 cm“* spectral region and 
the presence of characteristic photodimer bands at 
1,465, 1,404, and 1,320 cm“* in the latter, which are 
clearly resolved at early times (Fig. 5b). Such ultrafast 
timescale necessitates preorganiztion, indicating the 
reaction of the singlet state occurs via geometrically 
optimized structures. The thymine triplet state has also 
been studied by ns-TRUV-IR identifying it to be a ttti* 
state of thymine. The triplet state has characteristic 
bands at 1,380, 1,604, 1,682, and 1,704 cm“* 


that decay with a lifetime of 560 ns in deuterated 
acetonitrile. At faster, picosecond timescale, a broad 
band initially centered at 1,640 cm“* has been 
assigned to the vibrationally excited ground state 
(Hare et al. 2008). 

Radicals 

The guanine radical cation, G"^* and the guanine radical 
G*, also written G(—H)» to indicate proton loss from 
guanine, are important intermediates in DNA redox 
chemistry. Picosecond-TRIR has investigated the 
direct DNA damage process induced using 200 nm 
photoionization of guanine establishing an infrared 
marker band at c. 1,700 cm“* with a lifetime >1 ns; 
photoionization of [poly(dG-dC )]2 gives a similar tran¬ 
sient species (Kuimova et al. 2006). The assignment of 
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Infrared Spectroscopy of DNA, Fig. 5 Comparison of 
the femtosecond IR difference signals for the three 
dimerizable thymine samples (TlpTl: locked dinucleotide with 
locked C3' endo conformation of the sugar moiety; TpT: 
thymidylyl-(5'-3'-)-thymidine) and the nondimerizahle refer¬ 
ence sample TMP. All signals are to scale. Left. Time trace for 


a detection wavenumber of 1,404 cm^’. Right. IR difference 
spectra recorded 100 ps after photoexcitation. Signal strengths 
appear in the order of the dimerization yields. The arrow marks 
the spectral location for the time traces on the left (Taken from 
(Schreier et al. 2009)) 


this transient is controversial because the guanine rad¬ 
ical cation, produced immediately after electron loss 
may undergo rapid deprotonation to form the neutral 
radical. To resolve the controversy, steady-state IR 
experiments on guanine systems frozen in 77 K glass 
(Parker et al. 2010) were performed. The work irradi¬ 
ates (193 nm) a frozen sample to generate and trap the 
radical species for several hours, permitting IR spectra 
to be taken of any chemical changes at a leisurely pace. 
The sample must not be moved between UV irradiation 
and taking of the IR spectra because of the small 
absorbance changes observed and the need to obtain 
a consistent background reading (see entry “► Infrared 
Spectroscopy of Protein Reactions” for principles of 
IR difference spectroscopy). The experiments comple¬ 
ment much work using electron spin resonance spec¬ 
troscopy investigating DNA radical intermediates. The 
IR study is supported by density functional theory 
(DFT) calculations (EDF1/6-31 -i- G*) which repro¬ 
duce the IR spectral features observed permitting 
assignment of the guanine radical cation G"^', and its 
formation from 193 nm photoionization of DNA. 
Deprotonation of this species to produce the neutral 
radical G(—H)* does not occur in the 77 K glass. The 
IR spectra are accurately emulated by the DFT calcu¬ 
lations. Experiments and calculations indicate that the 
formation of G"^' within the double helix does not 
significantly perturb the geometry of the G/C pair, 
even though there is a significant movement of the 
N1 proton away from G toward C - perhaps nature’s 
way of stabilizing the chemistry of the native form 
of DNA. 


Future Expectations and Trends 

The ability of infrared spectroscopy to study solution- 
phase properties of nucleic acid bases has been put to 
full use over the years. Emerging technology for time- 
resolved and 2D-IR methods promises to give new 
insights into the dynamics of the mechanical and elec¬ 
tronic properties of DNA. Time-resolved studies push 
toward an ultimate goal to move away from a static 
picture of nature’s wonderful polymer to one where 
DNA’s interactions can be followed in real time and, 
with that, serve to promote our understanding of how 
molecule-molecule bonding directs biological interac¬ 
tions to yield tailored biological function. 
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Synonyms 

ATR; Attenuated total reflection; FTIR spectroscopy; 
Infrared spectroscopy; Infrared Spectroscopy of 
Oriented Samples; Lipids; Membrane lipids; Ordering 

Definition 

The infrared spectrum of lipids contains a wealth of 
information on lipid composition (head group and 
chain length/unsaturation) as well as on membrane 
physical state (chain ordering, phase transition). 
Furthermore, linear dichroism measures the orienta¬ 
tion of different molecule parts with respect to the 
membrane supporting internal reflection element. 
Some of these features can be observed in complex 
systems such as cells. 
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Introduction 

Lipids in their native environment are usually difficult to 
study. The scattering of light prevents meaningful stud¬ 
ies in a large part of the electromagnetic radiation range. 
Attenuated total reflection (ATR) approach solves the 
problem of the scattering and opens new avenues of 
research. Lipids are indeed extraordinarily amenable 
to form stacks of membranes on flat ATR supports. 
Multilayer stacks are usually stable in an aqueous envi¬ 
ronment and sufficiently permeable to solutes to allow 
the study of their interactions with the membranes. 

Infrared spectra contain a wealth of information on 
both the chemical structure of molecules and on their 
conformation. In the case of lipids, the infrared spec¬ 
trum of a lipid provides a unique signature of the lipid 
class. Furthermore, fine details of the structure can 
be monitored. The use of ATR-FTIR allows the mon¬ 
itoring of lipid nature and structure in close-to- 
physiological conditions. In turn, the environment of 
the lipids can be modulated so that their conformation 
can be studied as a function of different parameters 
such as temperature or the presence of specific ligands. 
Because of the long IR wavelength, light scattering 
problems are virtually nonexistent. A unique advan¬ 
tage of infrared spectroscopy is that it allows simulta¬ 
neous study of the structure of lipids and proteins in 
intact biological membranes without introduction of 
foreign perturbing probes. The most interesting protein 
amide I and amide II bands (Goormaghtigh et al. 2006) 
are indeed located in a spectral region where most 
lipids do not absorb (Tamm and Tatulian 1997). 

ATR-FTIR Spectroscopy of Lipids 

Attenuated total reflection (ATR) FTIR relies on the 
presence of an evanescent wave at the reflecting inter¬ 
face. In ATR-FTIR, the infrared beam is directed into 
a high refractive index medium which is transparent 
for the IR radiation of interest. Below a critical angle, 
the beam is completely reflected when it impinges on 
the surface of the internal reflection element (IRE). 
Several internal total reflections can occur within the 
IRE until the beam reaches the end. It can be shown 
from Maxwell’s equations that superimposition of 
incoming and reflected waves yields a standing wave 
within the IRE, established normal to the totally 
reflecting surface, and an electromagnetic disturbance 


exists in the rarer medium beyond the reflecting inter¬ 
face. This so-called evanescent wave is characterized 
by its amplitude which falls off exponentially with the 
distance from the interface (Goormaghtigh et al. 1999). 
Any molecule present in the evanescent wave will 
contribute to the spectrum. Usually, lipids are depos¬ 
ited as oriented multilayer stacks. Such an organization 
has the advantage to provide both the spectrum of the 
lipids but also the orientation of the different part of the 
lipid molecules by using linear dichroism. Good ori¬ 
entation is obtained only on clean IRE. When germa¬ 
nium or silicon is used as IRE, cleaning is best obtained 
with a high-pH detergent solution or in a plasma 
cleaner. Multilayer stacks of oriented membranes are 
obtained by evaporating the solvent from a solution or 
suspension. This step is essential to form a film in close 
contact with the IRE and to orient phospholipid- 
containing membranes parallel to the IRE surface 
(Eringeli and Giinthard 1981). If necessary, subsequent 
hydration can be obtained by flushing a chamber 
containing the film with a buffer. In the latter situation, 
the concentration of any chemical species which can 
interact with the protein can be modulated (Eringeli 
1992; Goormaghtigh et al. 1999). This technique has 
been pioneered by Baenziger on the acetylcholine 
receptor (Baenziger et al. 1992, 1993). 

Lipid Polar Head Groups 

Membrane lipids include a variety of species whose 
distribution between membrane organelles and 
between inner and outer leaflet is precisely regulated 
by the cell metabolism. One of the most abundant 
lipids is phosphatidylcholine (PC). The spectrum of 
dipalmitoylphosphatidylcholine is presented in Fig. 1. 
The spectrum includes two distinct regions. The high 
wave number part of the spectrum (3,100-2,800 cm“*) 
contains contribution from C-H stretching vibrations 
only. In turn, it mainly originates from the hydrocarbon 
chains. The low wave number region of the spectram 
(below 1,800 cm“^) is essentially related to the polar 
head groups of the lipids as indicated in Fig. 1. The ester 
v(C = O) is usually the strongest peak, followed by the 
phosphate contributions near 1,240 cm“^ (Va.s(P02~)) 
and 1,090 cm“* (Vs(P02~)). The hydrocarbon chains 
do contribute near 1,465 cm“\ but all-trans conforma¬ 
tions rather absorb near 1,470 cm~*. A full list of assign¬ 
ments can be found in (Fringeli and Giinthard 1981). 
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Infrared Spectroscopy of 
Membrane Lipids, 

Fig. 1 ATR-FTIR spectrum of 
dipalmitoylphosphatidylcholine 
between 3,050 and 800 cm^* 


DPPC IR spectrum 




SM 

DPPS 

DPPG 

DPPE 

DPPC 


Infrared Spectroscopy of Membrane Lipids, Fig. 2 ATR- 
FTIR spectrum of dipalmitoylphosphatidylcholine (DPPC), 
dipalmitoylphosphatidylethanolamine (DPPE), dipalmitoylpho- 
sphatidylglycerol (DPPG), dipalmitoylphosphatidylserine 
(DPPS), and sphingomyelin (SM) between 1,800 and 800 cm“' 

Interestingly, the FTIR spectra of the various lipid 
species are significantly different. Figure 2 reports the 
FTIR spectrum of five major phospholipids found in 
eukaryote cell membranes. It appears clearly from 


Fig. 2 that each lipid species has a unique FTIR spec¬ 
trum which could allow identification of the various 
species in a complex mixture. The analytical problem 
is, however, complicated by the fact that the spectrum 
also depends on the lipid physical state, as will become 
evident from the analysis of thermal-induced phase 
transitions (see below). Again, assignments can be 
found in (Fringeli and Giinthard 1981). 


Lipid Hydrocarbon Chains 

The contribution of the lipid hydrocarbon chains is 
present in various spectral regions. The most promi¬ 
nent ones appear between 3,050 and 2,800 cm“*. 
This region essentially contains C-H stretching bands 
from different vibrational modes: Vas(CH 2 ) near 
2,920 cm“', v,(CH 2 ) near 2,850 cm“', Vas(CH 3 ) near 
2,960 cm”*, VsCCHs) near 2,870 cm”*, and importantly 
Vas(CH = CH) near 3,010 cm”*. These vibrations pre¬ 
sent a number of interesting features: (1) the fact there is 
little overlap with other vibrations, including in complex 
systems such as cells and tissues; (2) these vibrational 
modes are largely uncoupled from other modes, i.e., they 
do not depend on lipid head group; and (3) they are 
sensitive to the stmcture (disordering) of the chains. 
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cm '' 

Infrared Spectroscopy of Membrane Lipids, Fig. 3 ATR- 
FTIR spectrum of 18-carbon long fatty acids with 0 (stearic 
acid), 1 (oleic acid), and 3 (linolenic acid) unsaturations. The 
area under the C-H stretching band from CH = CH groups at 
3,010 cm“* has been colored for better visualization 

Lipid phase transitions are, for instance, easily monitored 
by the position of these bands (see below): 

1. Chain Unsaturation: C-H stretching vibrations 
offer a perfect tool to evaluate lipid unsaturation 
levels. The principle is illustrated in Fig. 3. The 
ratio between v(CH 2 ) and Vas(CH = CH) bands 
immediately yield an unsaturation index which is 
relevant in membrane studies or in the food industry 
(Sherazi et al. 2007). Oxidation (Russin et al. 2003), 
peroxidation (van deVoort et al. 1994), hydroper¬ 
oxides (Ma et al. 1998), or cis/trans double bond 
content (van deVoort et al. 1995) are of immense 
interest for food quality monitoring. 

2. Chain Length: The ratio between CH 2 modes and 
CH 3 modes potentially offers an opportunity to 
investigate chain length in lipids. Figure 4 illus¬ 
trates the principle of such an evaluation. 
Obviously, chain length can be obtained by simple 
analysis of the Vas(CH 2 )/Vas(CH 3 ) ratio. 

Lipid Phase Transitions 

Lipid phase transitions can be monitored with a high 
accuracy by ATR-FTIR. Yet, the measurements need 
to be carried out in the presence of an excess of water 
(preferentially in the presence of liquid water) as phase 
transitions are highly dependent on full hydration (Pohle 
et al. 1998). As previously reported (Goormaghtigh et al. 
1999), at low hydration levels, drastic conformational 
changes may occur, as demonstrated by Fringeli for 
DPPC (Fringeli 1981), and a phase transition does not 


occur. It is also crucial to record a background for every 
temperature as the infrared spectrum of water depends on 
temperature. Even worse, transmission of infrared light 
through germanium is very strongly dependent on tem¬ 
perature. Figure 5 reports a series of DPPC spectra 
recorded as a function of temperature. In the experiments 
presented here, a trapezoidal germanium crystal (50 x 
20 X 2 mm^) was used as internal reflection with an 
aperture angle of 45° yielding 25 internal reflections. The 
crystal was then placed in an ATR holder for liquid 
samples with an inlet and outlet. A computer-controlled 
elevator allows the crystal to be positioned at different 
heights such that different lanes can be analyzed sepa¬ 
rately. In turn, the baseline and sample could be recorded 
on the same crystal after positioning the crystal such that 
either the lipid multilayer stack or a clean lane of the 
crystal was analyzed by the IR beam. For a 20-mm wide 
crystal, up to four lanes can be analyzed successively 
with no overlap with a 3-mm beam aperture. In the 
experiments described in Fig. 5, one of the two acyl 
chains of DPPC has been fully labeled with deuterium, 
shifting the major C-H stretching bands from ca 2,920 
and 2,850 cm” * to 2,195 and 2,088 cm” *. Such a labeling 
is particularly useful because molecules found in living 
cells do not absorb in this range of wave numbers. One 
molecule or one part of a molecule can therefore be 
studied in a complex environment, including in living 
cells (Moore et al. 1996, 1997). With the labeling 
described here, the phase transition can be monitored 
independently for each chain. Observing the spectra of 
Fig. 5 does not reveal any spectacular change near 41°C 
where the phase transition is expected to occur. Yet, 
a detailed analysis of almost any region of the spectrum 
reveals the presence of a transition. The most popular 
region analyzed for monitoring the gel-fluid phase tran¬ 
sition includes the C-H stretching bands whose charac¬ 
teristics are related to the number of gauche bonds in the 
chain (Davies et al. 1990; Mendelsohn et al. 1992; Moore 
et al. 1997; Kota et al. 1999; Mendelsohn and Moore 
2000; Szalontai et al. 2000; Mannock et al. 2001). 

As it can be observed in Fig. 5b, the accuracy of the 
wave numbers is better than 0.1 cm”'.This needs to be 
discussed in view of the nominal resolution of the 
spectrometer which was set at 2 cm”' and an encoding 
of the data every 1 cm”' in these experiments. In fact, 
O.I cm”' accuracy is reached by fitting a series of 11 
data points, i.e., 5 cm”' before the approximate max¬ 
imum and 5 cm”' after, by a third order polynomial 
and finding the roots of its derivative. 
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Infrared Spectroscopy of 
Membrane Lipids, 

Fig. 4 ATR-FTIR spectrum 
of a series of synthetic 
phosphatidylcholine lipids 
with two identical acyl chains 
containing one unsaturation 
each. The chain length was 
varied from 14 to 24 carbons 
as indicated on the figure. The 
inset reports the ratio between 
VasfCHz) and Vas(CH 3 ) as 
a function of chain length 



0.15 


0.1 


.Q 

< 


0.05 


0 

3500 3000 2500 2000 1500 1000 

cm^'' 



b 

2924 

'T 2923 
o 

I 2922 
E 

E 2921 

CO 
0 ) 

2920 

2919 

10 15 20 25 30 35 40 45 50 

Temperature (“C) 



Infrared Spectroscopy of Membrane Lipids, Fig. 5 (a) 

ATR-FTIR spectra of dipalmitoylphosphatidylcholine (DPPC) 
recorded in the presence of an excess of water as a function of 
temperature. From blue to red, temperatures are 15°C, 20°C, 
25°C, 30°C, 32°C, 34°C, 36°C, 38°C, 40“C, 41‘’C, 42°C, 43°C, 
45°C, and 48°C. Spectra have been offset for a better readability. 


A background spectrum recorded at the same temperature as the 
sample has been obtained on another lane of the same crystal and 
subtracted as described in the text. One of the two chains of 
DPPC has been fully labeled with ^H. (b) Position of the max¬ 
imum of the Vas(CFt 2 ) band reported as a function of temperature. 
The solid line is a spline fit of the data 


Lipid Orientation 

Polarized attenuated total internal reflection (P-ATR) 
infrared spectroscopy is presently one of the best tools 
available for the structural study of membrane-embedded 


peptides and proteins. The method is simple (oriented 
multilayer systems are easily obtained by drying 
a membrane suspension on the internal reflection ele¬ 
ment), and it does not require specific labeling, although 
labeling with *^C, **0, or allows the study of selected 
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Infrared Spectroscopy of Membrane Lipids, 
Fig. 6 Dichroic spectrum of DPPC obtained by subtraction of 
the perpendicular polarization from the parallel polarization. For 
details, the reader is referred to Goormaghtigh et al. (1999). The 
exact position of the peak has been obtained as described above 


for monitoring peak position in the course of the phase transition. 
Colored bands are discussed in the text. The dotted spectrum is 
a raw spectrum of DPPC reported here as a visual reference. 
Spectra have heen rescaled for the figure 


molecules or part of molecules. One of the main interests 
of polarized ATR-FTIR spectroscopy is that it yields 
insight into the orientation of the membrane molecules, 
including protein secondary structures. Orientational 
order parameters can indeed be derived from the dichroic 
ratio of the protein amide I or amide U bands. 

A prerequisite for measuring lipid orientation in 
a lipid multilayer stack is to obtain a good orientation 
of the lipid bilayer. This is easily obtained for pure 
lipids but raises questions when complex natural mem¬ 
branes are considered. This question has been 
addressed in the case of the tubulovesicle membranes 
present in parietal cells of the stomach which contain 
the IT^, K'^-gastric ATPase (Ivanov et al. 2004). It was 
found using atomic force microscopy that the align¬ 
ment of these natural membranes counting up to 100 
bilayers remained close to perfect. Figure 6 reports 
a dichroic spectrum of a DPPC multilayer stack. 

It appears from Fig. 6 that intense bands of the 
DPPC spectrum such as the v(C = O) are not neces¬ 
sarily strongly dichroic. On the contrary, the dichroic 
spectrum resolves small intensity bands such as the 
yw(CH 2 ) series which appears at 1,180, 1,199, 1,222, 
1,246, 1,267, 1,287, 1,310, and 1,330 cm”'. The rela¬ 
tively strong intensity of this series (with respect to 
alkanes) originates from a resonance with polar head 


group vibrations. It occurs only if the chain is all-trans 
down to the ester group and contains n/2 bands, n being 
the number of carbon in the chain. Gauche kinks would 
reduce the number of peaks in the progression 
(Chia and Mendelsohn 1992; Senak et al. 1992; 
Mendelsohn et al. 1992). In turn, these bands quanti¬ 
tatively describe the number of all-trans chains in 
lipids. This progression completely disappears above 
the phase transition. Furthermore, as the transition 
dipole is oriented parallel to the chain, it is an excellent 
indicator of the orientation of the all-trans acyl chain. 
The positive dichroism of the yw(CH 2 ) bands and the 
negative dichroism of the Vas(CH 2 ) at ca 2,920 cm“' 
and Vs(CH 2 ) at ca 2,850 cm” * whose transition dipole 
are perpendicular to the chains, along with the negative 
dichroism of the §(CH 2 ) at ca 1,468 cm” (opposite to 
the choline 1,492-cm”* band) indicates a good orien¬ 
tation of the lipids parallel to the germanium surface. 
This orientation can be quantified as described 
elsewhere (Goormaghtigh et al. 1999). 

Lipids in Cells 

It recently appeared from infrared spectra of cancer cells 
exposed to various anticancer drugs that the lipid 
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composition was modified (Gasper et al. 2009, 2010; 
Derenne et al. 2011). There is therefore an urgent need 
to study lipid composition in cancer cells. For instance, as 
reported in (Gasper et al. 2010), Fourier transform infra¬ 
red (FTIR) spectroscopy was used to investigate modih- 
cations of the prostate cancer PC-3 cell lipidome after 
exposure to sublethal concentrations of a cardenolide. 
FTIR spectroscopy offered an overview of the lipid clas¬ 
ses present in the whole sample. The achievements of 
FTIR spectroscopy were compared with data obtained 
by mass spectrometry on the same samples. FTIR spec¬ 
troscopy could identify content variations in some lipid 
classes, e.g., essentially these containing choline head 
groups such as phosphatidylcholine and sphingomyelin. 
FTIR spectra were also able to report changes in ester/ 
choline/phosphate ratios characterizing lipid changes 
induced by the cardenolide molecule. Furthermore, quan¬ 
titation of major lipid classes (PC, PE, PG, SM) could be 
obtained by curve fitting of the FTIR spectra. Yet, FTIR 
failed to resolve lipid classes for which the polar heads do 
not display specific IR features such as phosphatidyl- 
glycerol and cardiolipin. Quantihcation of lipids in com¬ 
plex mixtures by multivariate statistics might provide 
a useful tool in a near future (Dreissig et al. 2009). 

Summary 

Fourier transform infrared (FTIR) spectroscopy is a tool 
of choice for the study of lipids. In particular, when 
coupled with the attenuated total reflection (ATR) 
approach, a wealth of information can be obtained from 
lipids in the absence of any perturbating probe. Lipid 
unsaturation level, lipid chain length, and lipid chain 
ordering, including in lipid phase transitions, can be 
measured, including in the presence of membrane pro¬ 
teins. The orientation of different segments of lipid mol¬ 
ecules can be defined using polarized light by linear 
dichroism. Finally, some lipid bands can be monitored 
in entire cells or tissue. 

Cross-References 

► Atomic Force Microscopy (AFM) for Topography 
and Recognition Imaging at Single Molecule Level 

► Chemical Diversity of Lipids 

► Functional Roles of Lipids in Membranes 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 


► Infrared Spectroscopy of Protein Reactions 

► Infrared Spectroscopy of Protein Structure 

► Infrared Spectroscopy: Data Analysis 

► Lipids: Isolation and Purihcation 

► Membrane Proteins: Structure and Organization 

► Supported Lipid Bilayers 
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Infrared Spectroscopy of Nucleic Acids 

► Infrared Spectroscopy of DNA 


Infrared Spectroscopy of Oriented 
Samples 

► Infrared Spectroscopy of Membrane Lipids 

► Polarized Infrared Spectroscopy 


Infrared Spectroscopy of Protein 
Dynamics: H/D Exchange 

Erik Goormaghtigh 

Structure and Function of Biological Membranes, 

The Free University of Brussels, Center for Structural 
Biology and Bioinformatics, Brussels, Belgium 

Synonyms 

Hydrogen-deuterium exchange; Protein dynamics 

Definition 

Attenuated total reflection Fourier transform infrared 
spectroscopy (ATR-FTIR) is used to monitor hydrogen- 
deuterium exchange rate in proteins. Monitoring amide 
proton rate of exchange sheds light on protein stability and 
potential conformational changes induced hy protein 
ligands. 

The infrared spectrum of a protein displays two 
prominent hands called amide I and amide II. Amide 
carbonyl stretching is the major contribution to 
amide I, amide N-H bending is the major contribution 
to Amide II. In the presence of heavy water (D 2 O) the 
labile hydrogen on the amide N-H group is replaced by 
deuterium. The new N-D vibrator absorbs at much 
smaller wavenumbers than the original N-H group. 
In turn, the large amide II band located near 
1,540 cm“' completely disappears when exchange is 
complete (Goormaghtigh et al. 1999; Goormaghtigh 
2009). The rate of disappearance of the band sheds 
light on the stability of the protein structures. 
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Infrared Spectroscopy of Protein Dynamics: H/D 
Exchange, Fig. 1 Series of infrared spectra of a membrane 
protein presented between 1,800 and 1,400 cm~*. Spectra have 
been recorded as a function of the time of exposure to heavy 
water, from 0 to 250 min. Evolution of the area of the amide II 
band is reported in panel B as a function of the time of exposure 


to D 2 O. The decay has been analyzed as the sum of three 
exponential decays. The result of the analysis is indicated on 
panel B. Panel C reports the distribution of the time constants 
obtained by inverse Laplace transform of the data presented 
in panel B 


Structures that are very dynamic, that is, which folds 
and unfolds rapidly, do exchange rapidly, almost 
immediately with respect to the recording time of 
FTIR spectra. Stable structures exchange much more 
slowly, in minutes, hours, days, or much more. In turn, 
analysis of the exchange rate sheds light on protein 
structure stability. 


In the presence of specific ligands, for example, 
ATP or ATP analogs for an ATPase (Raussens et al. 
1996, 1997; Scheirlinckx et al. 2004), this dynamic is 
usually much affected as protein domains change 
their mutual interactions. It turns out that analysis of 
the H/D exchange rate is very sensitive to conforma¬ 
tional changes that are otherwise undetectable. 
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The number of peptide bonds whose exchange rate has 
been modified can be easily quantified. In addition, 
detailed analysis of amide II in the course of 
the exchange process indicates which the secondary 
structures are affected by the conformational changes. 

Practically, exchange can be induced by dilution of 
an aqueous sample in a large volume of heavy water. 
The exchange can then be monitored in an infrared 
liquid cell. With this set up, the first seconds of mea¬ 
surement are usually lost because of the time required 
to introduce the sample in the cell. Alternatively, 
films of dried samples can be exposed to heavy 
water vapor and the exchange is monitored by atten¬ 
uated total reflection on hydrated films. Spectra are of 
better quality and there is no lag at the beginning of 
the experiment. Yet, drying could affect some unsta¬ 
ble proteins. 

An example of H/D exchange measurement is 
reported in Fig. la for a membrane protein. The first 
band located at 1,740 cm”' is assigned to the ester 
carbonyl group of the lipids. The two other major 
bands are called amide I (near 1,650 cm”') and 
amide II (near 1,540 cm”'). It can be observed that 
amide II intensity decreases as a function of the time of 
exposure to heavy water. When exchange is complete 
(when all amide N-H become N-D), its intensity 
falls to near zero and the remaining absorption is due 
to side chains. Simultaneously, the band located near 
1,450 cm”' (called amide IT, mainly due to the C-N 
stretching vibration) increases. The decay of amide II 
intensity is reported in Fig. lb. As this decay is the 
superimposition of many exponential decays (as many 
as there are peptide bonds in the protein), it can be 
analyzed in two ways. First, it can be hypothesized that 
the behavior of the entire population of amide bonds 
can be approximated by a limited number of discrete 
time constant, for instance, 3. When the data of Fig. lb 
are fitted as a combination of three exponentials, 
three contributions with time constants of 1.9, 94, and 
990 min are found. Second, the exchange curve 
reported in Fig. lb can be submitted to inverse Laplace 
transform. In turn, the distribution of the time constants 
is retrieved without hypothesis on the number of 
components (Fig. Ic). This latter analysis is similar to 
the use of a Fourier transform to retrieve different 
frequencies in a signal. The analysis of the exchange 
rate can be further developed to provide information 
on the exchange of individual secondary structure 
types (Raussens et al. 2004). 


Cross-References 

► Far-Infrared Spectroscopy of Proteins 

► H/D Exchange 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 
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Synonyms 

Protein dynamics; Ultrafast infrared spectroscopy 


Definition 

Femtosecond time-resolved infrared (IR) spectroscopy 
after electronic excitation is used to investigate the 
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dynamics of the chromophore-protein complex in 
biologically relevant chromoproteins. 

Introduction: Why Ultrafast? 

Protein function is intimately related to both protein 
structure and dynamics. The fundamental concept of 
hierarchically ordered energetic protein substates, as 
introduced by Frauenfelder (cf. references in Lim and 
Anfinrud 2005), straightforwardly relates energy land¬ 
scape to conformational changes and respective rate 
constants. Thus, the occurrence of dynamic processes 
on a broad range of time scales, ranging from seconds 
to below the nanosecond regime and down to even 
femtoseconds, is an inherent property of proteins. 
Laser pulses with femtosecond duration allow to 
explore these processes in real time. In general, they 
act as an ultrashort perturbation that is coupled to the 
protein, e.g., via photon absorption by an adequate 
chromophore. The subsequent relaxation, i.e., recov¬ 
ery to an equilibrium state can then be observed 
in various ways by the interaction of the molecular 
system with temporally delayed laser pulses. 
The focus of this chapter lies on experiments 
involving linear infrared spectroscopy after 
electronic excitation of a natural chromophore in pro¬ 
teins with or without photobiological function (for 
two-dimensional IR spectroscopy cf. entry 
“► Nonlinear IR Spectroscopy of Proteins” by Hamm 
and Zanni). 

Prominent examples for photobiologically active 
chromoproteins are retinal-binding ion pumps and 
light sensors, bilin-binding red-light sensors (phyto¬ 
chromes), chlorophyll-based photosynthetic systems, 
flavin-binding blue-light sensors, photoactive yellow 
protein, and others (Briggs and Spudich 2005). In these 
systems, light absorption initiates a series of thermally 
driven “dark-reactions” which facilitate the specific 
biological function. A crucial step is the coupling 
between the electronically excited chromophore and 
its immediate vicinity, i.e., amino acid residues, pep¬ 
tide backbone, and protein-bound water molecules. 
Instead of mere dissipation, the absorbed energy is 
very quickly and progressively transformed into meta¬ 
stable intermediate protein states in a precise manner 
that best suits the specific purpose. The various cou¬ 
pling mechanisms involve photophysical processes as 
vibrational energy redistribution and cooling, Forster 


energy transfer and dipolar interactions, as well as 
photochemical processes as electron transfer, proton 
transfer, cis-trans isomerization, and breaking or for¬ 
mation of covalent and hydrogen bonds. Naturally, 
these competing deactivation mechanisms act on 
the femto- to picosecond time scale, inducing confor¬ 
mational alterations. In principle, all these processes 
can be monitored by transient IR spectroscopy via 
changes of the various vibrational states of either the 
chromophore or the protein moiety or both, specifi¬ 
cally via the vibrational frequency, absorption cross 
section, occupation number, spectral bandwidth, and 
change of transition dipole orientation. Thereby, 
a wealth of information is obtained. Current research 
activities are directed to characterizing the photoin- 
duced chromophore dynamics and its coupling to the 
protein moiety in order to understand the mechanisms 
that very efficiently exploit the absorbed photon 
for photobiological purposes on an ultrafast time 
scale. It is self-evident that ultrafast IR spectroscopy 
is embedded in other research activities and 
methods such as UVA^is spectroscopy for the elec¬ 
tronic state dynamics, ultrafast resonance Raman 
(RR) spectroscopy addressing solely chromophore 
vibrational dynamics (cf. entry “►Ultrafast Raman 
Spectroscopy” by Dasgupta, Frontiera and Fang), 
molecular dynamics simulations and quantum chemis¬ 
try, as well as structure determination, e.g., by X-ray 
diffraction and nuclear magnetic resonance 
spectroscopy. 

Techniques for Transient IR-Vibrational 
Spectroscopy After Electronic Excitation on 
Femtosecond to Picosecond Time Scale 

State-of-the-art methodology of transient IR spectros¬ 
copy (cf. references in Lim and Anfinrud 2005; Groot 
and Van Grondelle 2008) employs nonlinear optical 
techniques to generate synchronized and indepen¬ 
dently wavelength-tunable pump (electronic excita¬ 
tion) and probe (mid-lR) pulses, starting with 
sufficiently energetic ultrashort pulses at a given 
(fixed) wavelength. The latter, e.g., with pulse energy 
«1 mJ, wavelength «775 nm, duration wlOO fs and 
spectral width wlO nm (full width at half maximum for 
Gaussian pulse shape), are provided for example 
by commercially available Titanium-Sapphire (Ti:Sa) 
chirped-pulse regenerative-amplifier systems (Fig. 1). 
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Infrared Spectroscopy of Protein Dynamics: Ultrafast 
Kinetics, Fig. 1 Scheme of time-resolved spectrometer, 
employing excitation pulses in the UV-Vis and probe pulses in 
the mid-IR. CP-RA chirped-pulse regenerative amplifier, OPA 


optical parametric amplifier, DFG difference frequency genera¬ 
tion, NOP A noncollinear parametric amplifier, SHG second har¬ 
monic generation, SFG sum frequency generation 


The excitation pulses can be produced by noncollinear- 
parametric-amplification (NOPA) of a broad spectral 
continuum (generated from a small portion of the 
fundamental pulse at 775 nm) and pumped by a stron¬ 
ger pulse at 388 nm (frequency-doubled fundamental 
pulse). NOPAs offer broad spectral tunability 
(ca. 470-760 nm) with the opportunity to obtain a 
pulse duration of 50-20 fs or less. If necessary, spectral 
regions in the UV or near-IR can be accessed by further 
nonlinear mixing. Correspondingly, mid-IR probe 
pulses with duration <150 fs and related bandwidth 
are obtained at the difference frequency of two suitable 
near-IR pulses. This covers the spectral region of inter¬ 
est from above 10 pm to below 4 pm with most of the 
fingerprint vibrations of chromophore and protein, 
including amide I and amide II protein modes as well 
as low-energy hydrogen-out-of-plane modes and high- 
energy X-H(D) stretching modes. Since the IR probe 
and the excitation pulses are derived from the same Ti: 
Sa pulse, a well-defined time delay between them is 
easily set up by an optical delay line. 

The transmitted broad band IR pulses are spectrally 
resolved and detected, e.g., by mercury-cadmium- 
telluride (MCT) arrays. In an experiment, the pump- 
induced change of IR-absorbance is measured as 
function of delay time and IR wavenumber. If the 
experiment is performed in a polarization-resolved 
manner, the absorption anisotropy yields information 
on the angle between electronic and vibrational transi¬ 
tion dipoles as well as their temporal evolution and the 
related structural dynamics. 


Proteins and other biological molecules usually 
require an aqueous environment; however, water is 
a strong IR absorber. Thus, in order to achieve suffi¬ 
cient IR transmission of the sample, its optical path 
length has to be reduced which in turn requires an 
adequately high protein concentration. Other possibil¬ 
ities include the exchange of H 2 O with D 2 O or the 
reduction of the water content by preparing humidified 
protein films. 

Usually, the necessity of sufficient data averaging 
due to the very small pump-induced IR absorbance 
changes requires to repeat the experiment at relatively 
high repetition rates (e.g., 1 kHz). On the other hand, 
the long recovery time of the sample after photoexci¬ 
tation demands to move the sample laterally to the 
focused laser beams, thereby exchanging the exited 
and probed sample volume shot by shot (“fresh-sample 
condition”). 

Application to Chromoproteins 

After time-resolved IR spectroscopy explored the 
nanosecond regime, e.g., via the step-scan FTIR tech¬ 
nique (cf. references in Groot and Van Grondelle 2008 
and entries “► Infrared spectroscopy of protein fold¬ 
ing, misfolding and aggregation” by Hauser and 
“► Fourier Transform Infrared Spectroscopy for Bio¬ 
physical Applications: Technical Aspects” by Fahmi), 
the picosecond and then femtosecond time regime was 
tackled with a number of pioneering experiments 











































I 


1086 


Infrared Spectroscopy of Protein Dynamics: Ultrafast Kinetics 


(Hochstrasser 1998) on primary photoreactions in heme 
proteins, bacteriorhodopsin, and photosynthetic reaction 
centers. Since then, this technique has been developed 
much further and has been applied to numerous biolog¬ 
ical systems and many questions concerning ultrafast 
dynamics and reaction kinetics in protein environment. 
In the following, some results are summarized for 
a number of photobiological systems. It is far beyond 
the scope of this contribution to reference all research 
groups in this and strongly related fields; however, the 
work of most of them is dealt with in the cited articles. 

Retinal Proteins 

This widespread class of proteins realizes very 
diverse photobiological functions (Briggs and 
Spudich 2005). Important examples include the visual 
pigment in vertebrate rhodopsin (Rh), the proton 
pumps bacteriorhodopsin (BR), proteorhodopsin (PR) 
and xanthorhodopsin (XR), the chloride pump 
halorhodopsin (HR), the light sensors sensory rhodop¬ 
sin I and II (SRI, SRII), as well as the light-gated 
ion channel channelrhodopsin (ChR), and others. The 
primary reaction in all these systems is the all-fra«s to 
l3-cis photoizomerization (11-cw to aW-trans in Rh) of 
the covalently bound retinal chromophore (cf. 
chapter by Diller in Braun et al. 2008), although in 
slightly different binding pockets, specified for the 
respective function. Important questions concern first 
the course and kinetics of the respective isomerization 
reaction in the diverse systems and second the dynamic 
coupling between the excited chromophore and 
the protein moiety. Both questions are strongly related 
since trans-cis isomerization and conformational 
changes of both chromophore and binding pocket includ¬ 
ing protein-bound water molecules can be anticipated to 
occur in a concerted way. 

In contrast to UVA'^is spectroscopy, transient IR spec¬ 
troscopy is much better suited to give answers on stmc- 
tural grounds. Following the intensity of a 13-cw marker 
band in BR, i.e., a C-C stretching vibration at 1,190 cm” * 
(Figs. 2, 3 left), it has been demonstrated that in BR the 
l3-cis configuration is formed with a time constant of 
500 fs (Herbst et al. 2002). Similar experiments on HR, 
SR, and PR exemplified the diversity of the primary 
photoreaction in terms of excited state decay, isomeriza¬ 
tion timing, and kinetic reaction schemes. The protein 
response to the primary light-induced perturbation has 
recently been studied employing isotopically labeled BR 
and SRII samples, including SRII in D 2 O buffer, BR in 



wavenumber/cm '' 


Infrared Spectroscopy of Protein Dynamics: Ultrafast 
Kinetics, Fig. 2 Mid-IR difference spectra of bacteriorhodop¬ 
sin after excitation at 570 nm at delay times of 1 and 12 ps. 
Arrows indicate (1) the marker band of retinal in l3-cis config¬ 
uration, (2) and (3) non-chromophore vibrational bands in the 
amide II and amide I region, respectively 

H2'*^0 medium, SRII with ^^N-labeled protein, and BR 
with *^Ci 4 *^Ci 5 -labeled retinal chromophore (GroB et al. 
2009). Protein vibrational bands of amide and non-amide 
origin were observed to appear within 0.3 or 3 ps after 
chromophore excitation and to partially decay with 
a time constant of 9-18 ps (Figs. 2, 3 right). These results 
were discussed within the framework of various possible 
mechanisms, including steric effects, the transfer of elec¬ 
tronic excess energy from the chromophore to the pro¬ 
tein, and the effect of the strong change of electric dipole 
moment that is generated upon electronic excitation of 
the chromophore. Thus, in this primary reaction step, 
protein vibrational dynamics could be tracked, indicating 
an immediate response to the excited chromophore in the 
course of directed energy transformation. 

Phytochrome 

In plants, bacteria, and fungi, a number of light-related 
processes are controlled by phytochromes (Briggs and 
Spudich 2005), a class of photoreceptor proteins with 
two photochemically interconvertible states, Pr and Pfr, 
that absorb in the red and far-red spectral region, respec¬ 
tively. The primary processes involve photoinduced Z-E 
isomerization of the methine bridge between rings 
C and D of the linear tetrapyrrole chromophore (bilin), 
e.g., phytochromobilin (POB), phycocyanobilin (PCB), 
or biliverdin (BV). The photoinduced conformational 
changes of the chromophore translate into structural 
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Infrared Spectroscopy of Protein Dynamics: Ultrafast 
Kinetics, Fig. 3 Mid-IR absolution transients of bacteriorho- 
dopsin after excitation at 570 nm (cf. arrows in Fig. 2). Left. The 
rise of the positive signal at 1,185 cm^* with a time constant of 
0.5 ps displays the formation of the chromophore vibrational 


band around 1,190 cm”*, characteristic for its 13-c« configura¬ 
tion (cf. Fig. 2). Right: Transients at 1,550 and 1,660 cm”* 
display ultrafast protein response to chromophore isomerization 
within 0.3-3 ps with a slower relaxation (9-18 ps) 


alterations of the protein, which in turn control signal 
transduction cascades that steer various biological func¬ 
tions as, e.g., seed germination and flowering in plants. In 
analogy to the issues pertaining to retinal proteins, tran¬ 
sient IR spectroscopy can make valuable contributions to 
the description of conformational changes of both chro¬ 
mophore and protein moiety during the primary reaction. 
Interestingly, usually the photoreaction is relatively 
slow and at least biphasic, with time constants between 
^5 and ^100 ps, whereas the Pfr photoisomerization 
proceeds as well biphasic but much faster, i.e., within 
a few picoseconds (e.g., Schumann et al. 2008 and ref¬ 
erences therein). Since, in addition, in most phyto¬ 
chromes investigated so far, the P^ state is the signaling 
state and thermally metastable (showing “dark rever¬ 
sion” to Pr), several questions are currently under inves¬ 
tigation and debate: How does the conformation of the 
bilin chromophore during the P^ and Pfr reaction evolve? 
Are the kinetics best described by a scheme of first order 
reactions or by a distribution of reaction rates? Why are 
the reaction rates of Pr and P^ so different? Are they 
related to thermal stability, i.e., to dark reversion in the 
case of Pfr? Can the dynamic coupling between the 
chromophore and the protein moiety be described on 
a molecular level, e.g., by tracking hydrogen-bond pat¬ 
terns? With the aid of crystal structures that are currently 
emerging for phytochrome systems, these subjects either 
have been addressed aheady or can be addressed by 
ultrafast IR spectroscopy. As an example, for the agro- 
bacterial phytochrome AgpI-BV, transient absorption in 


the visible revealed only two kinetic time constants for 
each of the Pr and the Pfr reaction, whereas from transient 
IR data, an additional, third time constant was derived in 
each case (Schumann et al. 2008 and references therein). 
This complemental information allowed to construct 
more detailed reaction schemes. Thus, it was suggested 
that the Pr and the Pfr photoreaction follow almost iden¬ 
tical kinetic schemes, albeit with the Pr reaction roughly 
10 times slower than the Pr reaction. Moreover, the IR 
experiments served as an independent measure of the 
isomerization quantum yield in Pr and Pfr by determina¬ 
tion of the fraction of prompt ground state recovery. The 
results were cross-checked with the quantum yield of 
product state formation and gave indications for a shunt 
reaction in the Pfr ^ Pr half-cycle. 

Flavin Proteins 

In contrast to red-light sensing in phytochromes, 
flavoproteins provide blue-light sensing, e.g., in 
light-oxygen-voltage (LOV) domain-containing proteins, 
cryptochromes, and blue light using FAD (BLUF) pro¬ 
teins (Briggs and Spudich 2005). Here the divers primary 
reactions that constitute flavin photoactivation and finally 
lead to the respective signaling state include formation of 
a cysteinyl-flavin adduct (in LOV), one-electron reduc¬ 
tion of the oxidized flavin (in cryptochromes), and the 
rearrangement of a hydrogen-bonded network among 
FAD and nearby amino acid side chains (in BLUF). 
Following, for instance, the spectral position of C-0 or 
C=0 stretching vibrations of the flavin chromophore and 
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specific amino acid residues that are exposed to hydrogen 
bonding, it is possible to devise scenarios for the transient 
conformational changes and reaction mechanism for the 
photoinduced hydrogen-bond network switching in 
BLUF domains (Stelling et al. 2007; Bonetti et al. 2008 
and references therein). As an example, in the BLUF 
domain protein AppA from Rhodohacter sphaeroides, 
an absorption transient observed at 1,666 cm“* and 
assigned to the carbonyl stretch of a glutamine side 
chain (Q63) was suggested to indicate the prompt change 
of protein chromophore H-bonded interactions upon 
chromophore excitation as primary step in the BLUF 
domain photocycle (Stelling et al. 2007). 

Photosynthetic Systems 

The task of photosynthetic systems in higher plants, 
algae, and bacteria is the efficient transformation of 
photon energy into chemical energy. Appropriately 
designed antenna systems first funnel the absorbed 
photon energy to the reaction center (RC), which then 
accomplishes electron transfer across a cell membrane 
and fixes the energy in chemically metastable states on 
a sub-nanosecond time scale (cf. chapters in Braun 
et al. 2008). Participating chromophors and cofactors 
include pigments like bacteriochlorophyll-a (BChl-a), 
bacteriopheophytin-a (BPhe-a), quinone (Q), and 
carotenoids. With their much higher “fine-structure” 
in comparison to electronic spectra, transient IR spec¬ 
tra yield much more details about the electronic struc¬ 
ture in the various transient chromophore states during 
electron and energy transfer as well as their interaction 
with the scaffolding protein units on a microscopic 
level (Hamm et al. 1995; Groot and van Grondelle 
2008). This, combined with the known crystal struc¬ 
tures of the RCs and quantum chemical calculations, is 
used to better understand the mechanisms of sunlight 
energy conversion in nature. Along this line (Groot and 
van Grondelle 2008 and references therein), the 
energy-transfer dynamics in the core antenna CP47 of 
Photosystem II was linked to the arrangement of the 
individual chlorophylls in the CP47 complex. Combin¬ 
ing VisWis and Vis/mid-lR pump-probe experiments, 
time scales of 1^ ps and 12-28 ps were found for 
energy transfer within the complex and a correlation 
was derived between two pools of the (red-absorbing) 
CP47-chlorophylls and corresponding chlorophyll 
keto C=0 stretching frequencies between 1,696 and 
1,664 cm” , indicating quite different strength of 
hydrogen bonding to the protein moiety. 


Heme Proteins 

The iron-porphyrin (heme) group in a number of 
enzymes as myoglobin (Mb), hemoglobin (Hb), and 
others is not active photobiologically, but instead, as 
prosthetic group, serves oxygen storage, transport, and 
release. Photodissoziation of other reversible ligands 
as CO or NO from the heme substrate in conjunction 
with mid-IR probing and polarization anisotropy mea¬ 
surements is utilized to investigate dynamics of the 
protein-cofactor complex that facilitate ligand binding 
and release on an ultrafast time scale (Lim and 
Anfinrud 2005 and references therein). In this way, 
the ligand orientation in HbCO was characterized by 
an angle of 8.3° between the CO transition dipole and 
the heme normal. Likewise, the orientational distribu¬ 
tion of the photolyzed ligand in its docking sites of 
MbCO and the time of ligand trapping (0.2 and 0.52 ps) 
were determined. For the geminate rebinding of the 
NO ligand to Mb, a time-dependent rebinding rate 
(initial and final time constants of 7.7 and 150 ps, 
resp.) was derived, suggesting the relevance of protein 
conformational relaxation for the rebinding process. 
Finally, the escape of the ligand (MbCO) from the 
docking site to the solvent was found to occur on the 
microsecond time scale. 

Cross-References 

► Electron Transfer Through Proteins 

► Flavins 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► FTIR Spectroscopy 

► Hemes 

► Infrared Spectroscopy of Protein Reactions 

► Kinetic Isotope Effects 

► Polarized Infrared Spectroscopy 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 

► Ultrafast Raman Spectroscopy 
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Synonyms 

Amide I spectroscopy; Protein fluorescent dye labeling 


Definition 

Infrared spectroscopy is a powerful method to 
study the molecular mechanisms of protein folding, 
misfolding and aggregation on different time scales. 


Introduction 

Understanding the molecular mechanisms of protein fold¬ 
ing, misfolding, and aggregation is an active field in 
biophysical research. The biosynthesis of a protein is 
catalyzed by the ribosome, and the polypeptide chains 
must make a complex array of interactions in order to 
fold to the correct structures. Chaperones are assumed to 
regulate folding steps of nascent proteins and to prevent 
aggregation or degradation. The protein biogenesis is 
followed by manifold posttranslational modifications 
such as ubiquitylation, glycosylation, phosphorylation, 
and ADP-ribosylation that control the concentration, 
localization, assembly, and specificities of proteins. 
Misfolding can occur during the biogenesis or posttran- 
slationally. Misfolded proteins usually lose their biologi¬ 
cal function or even cause diseases. These diseases 
are also called “protein conformational diseases” includ¬ 
ing type U diabetes (islet amyloid peptide, LAPP), 
Alzheimer’s (AP-peptide), Parkinson’s (a-synuclein), 
and Chorea Huntington’s (PolyQ proteins) disease. 

Much of our knowledge about protein folding has 
been derived from analyzing isolated proteins in vitro. 
Studies started in the early 1960s with the famous exper¬ 
iments of Anfinsen and coworkers who showed that the 
small globular protein ribonuclease can fold reversibly, 
implying that the thermodynamic stable state is 
a conformation at the global minimum of its accessible 
free energy (Anfinsen et al. 1961). Some years later, in 
the late 1960s, Levinthal demonstrated that there are too 
many possible conformations for a protein to find its 
native structure in the conformational space just by ran¬ 
dom search (Levinthal 1968). This is also known as the 
“Levinthal’s paradox.” Levinthal concluded that pro¬ 
teins fold by specific folding pathways. A lot of experi¬ 
ments started searching for folding pathways and 
concluding the folding mechanism from intermediate 
states along the pathway. On-pathway and off-pathway 
intermediates have been distinguished, implying that 
only on-pathway intermediates give insight into the fold¬ 
ing mechanism whereas off-pathway intermediates do 
not. The interpretation of kinetic data is most often done 
using single- or multi-exponential time decays of optical 
properties that monitor structural changes of the protein 
stracture after a fast initiation of the folding or unfolding 
process (Dill and Chan 1997). 

Funneled energy landscapes have been accomplished 
in the last years for a further conceptual description of 
how a protein folds or misfolds (Dill and Chan 1997). 
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An energy landscape is the free energy of each confor¬ 
mation as a function of the degrees of freedom, e.g., 
the dihedral bond angles along the peptide backbone. 
Each conformation is represented by a point on the 
multidimensional energy surface, hills correspond to 
high energy conformations that are less favorable as, 
e.g., burying polar groups in hydrophobic cores, and 
valleys correspond to configurations that are more favor¬ 
able than others nearby. The native state is represented 
by the global minimum as the lowest energy state. The 
folding process can be described as path on the energy 
surface starting from the unfolded polypeptide chain 
toward the native, folded state and thereby passing sev¬ 
eral intermediate states. The polypeptide chain can also 
end up in a misfolded or aggregated state. The knowl¬ 
edge of the potential energy surface is important since it 
determines the dynamics of the polypeptide chain. 

Various biophysical methods like NMR, EPR, and 
optical spectroscopy techniques have been developed 
during the last years that facilitate the study of protein 
folding in vitro with high structural and kinetic resolu¬ 
tion. Experimental analysis of protein folding mecha¬ 
nisms requires methods providing fast initiation of 
folding/unfolding, sensitivity for the protein structure, 
and the appropriate time resolution. Most approaches 
use rapid-mixing or perturbation techniques in order to 
initiate the folding or unfolding event. IR spectroscopy 
has been successfully used and contributed signihcantly 
to the progress in studying folding dynamics and aggre¬ 
gation processes. 

Protein Conformation Studied with Infrared 
Spectroscopy 

The vibrational modes that provide insight into protein 
conformation and conformational changes are the 
polypeptide backbone vibrations. These are coupled 
modes. The normal mode analysis of the polypeptide 
backbone vibrations has been started in the 1970s with 
the model compound N-methylacetamide (NMA), 
a molecule with one frans-peptide group. The normal 
modes of the peptide groups are called amide modes 
and are distinguished according to the contributing 
vibrations (Krimm and Bandekar 1986). The amide 
I mode has been established as IR marker band for 
secondary structure determination. 

Amide I Spectroscopy. The amide I mode of the 
polypeptide backbone arises mainly from the CO 


stretching vibration of the peptide groups (~80%) 
with minor contributions from the out-of-phase CN 
stretching vibration, the CCN deformation, and the 
NH in-plane bend. All peptide groups contribute to 
the amide I band, and it absorbs between 1,700 and 
1,600 cm“*. The peptide oscillators couple, and it is 
the coupling that is responsible for the secondary struc¬ 
ture dependence of the amide I band. The coupling 
occurs in different manners: One is the through-bond 
coupling, i.e., the coupling of the neighboring peptide 
group along the chemical bond. Since the amide 
I vibrations do not involve large displacements of the 
Co: atoms, one amide I mode will interact only slightly 
with the neighboring amide I mode, and, in conse¬ 
quence, the effect on the vibration of the polypeptide 
system is minor. The dominant coupling mechanism 
that makes the amide I vibration sensitive to secondary 
structure and determines its frequency is the through- 
space transition-dipole coupling (TDC) (Krimm and 
Abe 1972; Kubelka and Keiderling 2001). The position 
of the amide I absorbance maximum of different sec¬ 
ondary structure elements also correlates with the 
strength of hydrogen bonding. In general, the amide 
I frequency decreases with increasing hydrogen bond 
strength (Barth and Zscherp 2002). The solvent expo¬ 
sure and the resulting hydrogen bonds between the 
amide groups and the solvent molecules also have an 
impact on the position of the amide 1 frequency. 

Amide I Band Assignments. The secondary structure 
elements of a protein absorb at characteristic positions 
in the amide I region. However, the amide I band of 
a protein is usually broad and consists of several 
overlapping band components resulting from the 
various secondary structure elements. Different struc¬ 
tural elements are often not resolved in the experimen¬ 
tally obtained spectra. Amide I band assignments to 
secondary structure elements are commonly made 
either by band narrowing and curve htting for 
deconvolution or by use of a calibration set of spectra 
from proteins and peptides with known structure for 
pattern recognition. Assignments have been collected 
and evaluated (Goormaghtigh et al. 1994; Barth and 
Zscherp 2002). Nevertheless, the amide I band assign¬ 
ment is not unique or straightforward. Eor example, 
a-helical and disordered structures show amide I bands 
in nearby frequency ranges what may cause complica¬ 
tions in the analysis. On the other hand, amide groups 
in antiparallel P-sheet structures give rise to two major 
diagnostic bands, one between 1,615 and 1,640 cm~* 
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and a weaker band between 1,670 and 1,695 cm” . 
This makes IR spectroscopy, in contrast to CD spec¬ 
troscopy, very useful in indicating the presence of, and 
to monitor changes in, P-sheet structures. P-sheets in 
ordered or amorphous aggregates often are character¬ 
ized by a significant increase in splitting between the 
low- and high-frequency p-components in comparison 
to that observed for p-sheets in natively folded proteins 
(Fabian and Naumann 2012). A shift of the low- 
wavenumber band of P-sheets is interpreted as 
structural change of the P-sheet, leading to increased 
hydrogen bonding (Torii et al. 1998), a more planar 
sheet (Kubelka and Keiderling 2001), or a larger num¬ 
ber of strands (Chirgadze and Nevskaya 1976). 

Choice of Solvent. Structural IR studies of peptides 
and proteins are usually performed in the solvent D 2 O 
instead of H 2 O since H 2 O has a strong absorption of 
the HO bending vibration at ^1,650 cm“* which over¬ 
lays the amide I region. Due to the mass effect on the 
vibrational frequency, this vibration is shifted to 
^ 1,200 cm” * in D 2 O and thus out of the amide I region. 
The amide I band is termed as amide F if the protein is 
dissolved in D 2 O. Since the amide I mode is sensitive 
to N-deuteration of the backbone, hydrogen/deuterium 
exchange induces a shift of the amide F mode to lower 
frequencies. However, compared to the large effect on 
the amide II band, the shift of the amide I band upon 
deuteration is relatively small (5-10 cm”*) (Fabian 
and Naumann 2012). 

Experimental Approaches 

Triggering the FoldinglUnfoldinglAggregation Reaction. 
There are different approaches to study protein folding 
and stability. In general, equilibrium and time-resolved 
measurements can be distinguished. In equilibrium mea¬ 
surements, the protein conformation is monitored in 
thermodynamic equilibrium, and a thermodynamic anal¬ 
ysis can yield properties that characterize the protein 
stability as, e.g., the transition temperature or the 
equilibrium constant for the folding/unfolding reaction. 
Time-resolved measurements focus on kinetic properties 
to obtain insights into folding mechanisms. Therefore, 
the triggering of the reaction should be faster than the 
initiated conformational dynamics under study. The 
folding/unfolding/aggregation reaction can be triggered 
in several ways, also depending on the analyzed protein. 
Raising the temperature, e.g., induces thermal unfolding 


for most proteins, but it can also initiate the folding 
reaction in case of cold-denaturing proteins. Changing 
the pH value may induce conformational changes or 
aggregation. Chemical denaturants can be used to induce 
unfolding. The most commonly applied chemical dena¬ 
turants are urea and guanidinium chloride (GdmCl). 
However, theses denaturants have broad and strong IR 
bands that overlap the amide I region. Isotopic labeling 
of the denaturant, e.g., use of *^C-urea or *^C-GdmCl, 
and measurements in D 2 O circumvent this problem by 
shifting the intense denaturant bands out of the amide F 
frequency range. Besides changing the temperature, the 
pH value, or adding chemical denaturants, also measure¬ 
ments under high pressure can yield useful information 
on the structure, energetics, phase behavior, and confor¬ 
mational dynamics of proteins. The extension of the 
characterization to the pressure axis increases the under¬ 
standing of proteins as well as the ability to manipulate 
conformational states of proteins. High hydrostatic pres¬ 
sure has also been shown to be effective for disaggrega¬ 
tion and refolding of proteins from insoluble aggregates 
(Winter et al. 2012). Conformational dynamics studies 
are performed, e.g., with stopped-flow techniques. These 
techniques are limited to analyze millisecond dynamics 
due to their sample mixing time. Faster processes in the 
sub-millisecond time domain are commonly investigated 
with perturbation techniques using pulsed lasers, e.g., 
laser-induced temperature-jump, pH-jump, photolabile 
caged compounds, or light-triggered molecular switches. 

IR-Prohing Techniques. ► Fourier transform 
Infrared (FTIR) spectroscopy is commonly used to 
probe IR spectra. Due to the underlying interferometer 
technique, complete spectra can be monitored fast and 
at high signal-to-noise ratio. FTIR also provides a high 
time resolution, in the millisecond time domain with 
the rapid-scan approach and in the nanosecond time 
domain with the step-scan approach. IR lasers are used 
for a variety of monochromatic IR techniques and 
facilitate even sub-nanosecond dynamics. If the 
studied protein conformational changes are small, 
amide I absorption spectra will be difficult to interpret. 
Second-derivative spectra and reaction-induced differ¬ 
ence spectra are often used to reveal small absorption 
changes. 

Isotopic Editing. The secondary structure depen¬ 
dence of the amide I band arises from the coupling of 
the polypeptide amide I modes. However, only the 
average backbone conformation can be determined 
from the amide I band. The delocalization of the 
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Infrared Spectroscopy of 
Protein Folding, Misfolding 
and Aggregation, 

Fig. 1 Spectral changes in 
the amide /’ region reflecting 
dijferent protein 
conformations of the f-barrel 
outer membrane protein porin 
from P. denitrificans upon 
heating and cooling. Second 
derivative FTIR spectra at 
different temperatures: (3- 
sheet structure at room 
temperature ( ), 

disordered structure upon 

heating to 75° C (.) and 

aggregated structure upon 
cooling to 30° C (— -) 
indicated by amide T 
frequency shifts from 1,625 to 
1,650 to 1,615 cm^* 



vibrational modes prevents the use of the amide I band 
to get structure information about individual residues 
within the polypeptide. In order to elucidate structure 
and dynamics of polypeptides on the residue level, 
isotopic labeling of individual carbonyl C=0 groups 
of the polypeptide backbone provides a powerful mean 
to isolate amide I modes of specific residues within the 
secondary structure. Efficient labels are '^C=0 and 
i3c_i8o, inducing a downshift of the amide 
I frequency of about 37 cm“* for *^C=0 (Tadesse 
et al. 1991) and of about 75 cm“* for 
(Nagarajan et al. 2011) if the mode would 
be completely localized. However, in a structured 
polypeptide, the amide I modes are coupled and the 
observable frequency shifts depend on the coupling 
mechanisms between the labeled carbonyls. Thus, the 
frequency shifts of carbonyl labels are position depen¬ 
dent. Isotopic editing permits structure resolution at 
the residue level. 

Folding - Misfolding - Aggregation 

The amide I region is indicative for the secondary 
structure composition of the protein. Changes in the 
protein conformation will lead to changes in the amide 
mode intensities and/or frequencies. Figure 1 shows 
an example for a thermally induced transition of the 
membrane protein porin from P. denitrificans from 


a P-barrel to a disordered structure upon heating and 
to an aggregated structure upon further cooling that 
can be followed by the frequency shifts in the amide T 
region. A misfolded protein can be identified by com¬ 
paring its amide I spectrum with that of the native 
protein. However, if the structural changes are small, 
the identification of the misfolded structure might 
be difficult. A structural change of a few individual 
residues may not result in a significant change of the 
amide spectrum, in particular if the protein has several 
secondary structure elements with overlapping amide 
modes. ► Hydrogen/deuterium exchange experiments 
can be performed to induce frequency downshifts of 
more solvent-exposed regions. An approach to get 
residue-specific resolution is the use of isotope labels. 
The induced frequency shifts and the intensity of the 
shifted amide I component depend on the labeling 
position within the secondary structure element. 
Studies with isotopically labeled a-helical 

peptides provided insights into helix conformation, 
stability, formation, and structural transitions. In the 
case of *^C=0 isotopically labeled p-sheet peptides, 
frequency-shifted amide I band components can 
be induced with very high intensity (Halverson et al. 
1991; Brauner et al. 2000). This anomalous high inten¬ 
sity is attributed to a mixing of '^C=0 and *^C=0 
modes between strands. Also, other mechanisms can 
contribute to the intensity enhancement like in-phase 
vibrations of interstrand-coupled modes, primarily 
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Infrared Spectroscopy of Protein Folding, Misfolding and 
Aggregation, Fig. 2 (a) Thermal unfoling of a 12 residue f- 
hairpin peptide (TrpziplC) with isotopic editing. Amide I’ 
absorption spectra measured with FTIR in thermal equilibrium 
and heated stepwise from 5°C {blue, (f-hairpin structure) to 85°C 
(red, disordered structure). Left, unlabeled; middle: two cross¬ 
strand *^C=0 labels at the P-hairpin termini; right: two cross¬ 
strand '^C=0 labels adjacent to the P-hairpin turn. The fre¬ 
quency shift and intensity of the *^C-amide T modes (green 
arrow) depend on the labeling position within the hairpin struc¬ 
ture. (b) Folding dynamics probed at single wavelengths in the 



Temperature (°C) 


amide T region after a laser-induced temperature jump 
of ^10°C. Left: Transients shown exemplarily for three different 
peptide temperatures probed at 1,625 cm^' representative for the 
decay of the P-haiipin (blue) and probed at 1,661 cm^* repre¬ 
sentative for the rise of the disordered structure (red). Right: 
Relaxation times for different peptide temperatures derived from 
the transients. Data points were fit to the Arrhenius equation. In 
the low temperature region, it becomes obvious that intermediate 
states exist, indicating that the P-hairpin is a multistate folder, 
and the loss of P-hairpin structure is faster than the formation of 
disordered structure 


from the central portions of the P-sheet (Kubelka and 
Keiderling 2001). Figure 2a demonstrates the position- 
dependent frequency shift and intensity of two cross¬ 
strand *^C=0 labels introduced at different positions 
within a 12-residue hairpin peptide. Since P-sheet 
structures have a higher tendency to aggregate than 
Qc-helical structures, the isotopic intensity enhance¬ 
ment is very useful in the study of aggregation 
processes. 


Folding Dynamics 

Protein folding involves several stmctural transitions, 
including ordering of the polypeptide backbone to form 
specihc secondary stmctures and the global topology and 
the packing of the side chains to form the stabilizing 
tertiary interactions of the hydrophobic core. The dynam¬ 
ics of these transitions and their relative order, whether 
they happen concurrently or sequentially or even with 
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more complex time dependence, are still only poorly 
understood (Dyer 2007). The early folding events include 
the formation of turns, loops, a-helices, and P-hairpins 
which occur on the nanosecond-to-low-microsecond 
time scale (Bieri and Kiefhaher 1999). The subsequent 
slower steps involve the stabilization of the secondary 
stmcture elements, formation of side chain contacts, and 
exclusion of water from the protein interior. Some pro¬ 
teins reach their native state within a millisecond, 
whereas for others, the complete folding process may 
require several hours. An ideal folding experiment should 
provide structural specificity on all time scales at residue- 
specific resolution using site-specific, nonperturbing 
spectroscopic probes. A lot of folding studies use fluores¬ 
cence-monitored methods to measure folding rates. Fluo¬ 
rescence spectroscopy can be a local probe that Is 
sensitive to structural changes of a chromophore or spe¬ 
cific quenching interactions, but it can be difficult to 
interpret in terms of the global folding reaction. The 
introduction of fluorescence probes info different parts 
of the stmcture risks perturbing both the stmcture and the 
dynamics being studied. The intrinsic chromophores in 
a protein that are mainly probed by fluorescence are the 
local tryptophan environments, whereas infrared and 
amide I spectroscopy probes the global backbone confor¬ 
mation. Figure 2b shows an example of probing confor¬ 
mational dynamics of a P-hairpin with laser-excited 
temperature jump IR spectroscopy. Isotopic labeling 
('^C=0) has been used to derive site-specific dynamics 
providing insight into the folding mechanism (Hauser 
et al. 2008). Other studies have used site-specific IR 
probes for the side chain (azide) and backbone 
(*^C=*^0) in order to study both the backbone and side 
chain dynamics separately with residue-specific resolu¬ 
tion (Nagarajan et al. 201 1). The interplay between order¬ 
ing of the backbone and ordering of the side chains is 
particularly important for the formation of P-sheet stmc- 
tures, as the polypeptide chain searches for the native 
stabilizing cross-strand interactions. 

Aggregation Processes 

Misfolding and aggregation are the origin of many 
diseases. The detailed study of these processes is very 
important in view of developing therapeutics that 
target specific aspecfs of fhe disease process. Alfhough 
differenf proteins are associated with different dis¬ 
eases, most of the aggregation processes are related 


with fibril formation. The transformation of normally 
soluble polypeptides and proteins into amyloid fibers 
has proven a central characteristic of many human 
diseases, including diabetes type II, Alzheimer’s, 
Parkinson’s, and Huntington’s disease. Thus, there is 
much interest in studying the aggregation process from 
its molecular origins and identifying the structures of 
transient intermediate states. For several amyloid dis¬ 
eases, there is evidence that the toxic species are actu¬ 
ally prefibril infermediafes. Although the aggregates 
and the aggregation process are in focus of numerous 
studies, molecular details about the critical intermedi¬ 
ate states are elusive since it is extremely difficult to 
obtain structural and kinetic information for aggregat¬ 
ing systems. Very little is known so far about these 
critical intermediates, and IR spectroscopy is a very 
powerful tool to reveal molecular details. It can be 
applied to diluted proteins as well as to protein aggre¬ 
gates. In combination with isotopic labeling, it has 
been used to get a residue-specific view of the aggre¬ 
gation pathway of amyloid fibril formation without 
structure-perturbing labels (Shim et al. 2009). 

Summary 

IR spectroscopy yields the required time resolution to 
study folding, misfolding, and aggregation pathways 
of proteins. It provides molecular sensitivity for 
changes of the secondary structure, of the environment 
of hydrogen bonding networks, and of the solvent 
accessibility. The combination with site-specific isofo- 
pic labeling facililafes a residue-level sfrucfural spec¬ 
ificity without perturbing spectroscopic probes. 

Cross-References 

► H/D Exchange 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► IR Spectroscopy of Protein Structure 

► Nonlinear IR Spectroscopy of Proteins 
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Infrared Spectroscopy of Protein 
Reactions 

► Reaction-Induced Infrared Difference Spectroscopy 


Infrared Spectroscopy of Protein 
Structure 

Parvez I. Haris 

Faculty of Health & Life Sciences, De Montfort 
University, Leicester, UK 

Synonyms 

FTIR spectroscopy; IR spectroscopy of proteins; 
IR spectroscopy of protein structure; Protein confor¬ 
mation analysis using infrared spectroscopy; Protein 
structure 

Definition 

Infrared (IR) spectroscopic analysis of protein struc¬ 
ture involves the use of infrared radiation to assess 
vibrational modes arising from atoms within protein 
molecules and relating this to the primary, secondary, 
tertiary, and quaternary structure of the protein. Infra¬ 
red spectral band frequency, band intensity, and band 
width can be used to determine protein structure and 
dynamics. The vibration of the protein amide bonds are 
most widely used for protein structure analysis using 
infrared spectroscopy. This entry gives examples of 
how infrared spectroscopy can be used to probe 
different levels of protein structure, both qualitatively 
and quantitatively. Particular emphasis is placed on 
discussion of systems, such as protein aggregates 
and protein-lipid complexes, which cannot be readily 
analyzed using other biophysical techniques. 
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Introduction 

Infrared spectroscopy is one of the earliest biophysical 
techniques to be used in the structural characterization 
of proteins (see Barth and Haris 2009). This biophys¬ 
ical technique can be used to assess all levels of protein 
structure (primary, secondary, tertiary, and quaternary) 
in diverse environments (Haris and Chapman 1992; 
Barth 2007). 

Infrared spectroscopy can provide a wealth of infor¬ 
mation regarding protein structure since the infrared 
spectrum contains vibration from every atom present 
in the protein molecule (Barth 2007). The theoretical 
basis of infrared spectroscopy has been covered by 
Barth in the article “►Molecular Vibrations and 
Their Interaction with Electromagnetic Radiation.” 
It can be stated that a combination of experimental 
and theoretical work has laid the foundation for relat¬ 
ing the structure of a protein with its infrared spectrum 
(Barth 2007; Choi and Cho 2009). Since each protein 
has a unique amino acid sequence, it will therefore 
have a unique infrared vibrational fingerprint that can 
be used to understand diverse aspects of its structure 
(see Barth 2007; Haris and Chapman 1995). This entry 
gives examples where infrared spectroscopy has been 
successfully applied to probe protein structure and 
dynamics. It also stresses issues that should be taken 
into consideration to avoid potential pitfalls and errors 
in the determination of protein structure from infrared 
spectra. 

Extracting Structural Information from 
Infrared Spectra of Proteins 

The use of infrared spectroscopy for protein structure 
analysis was severely hindered as the overlap of absor¬ 
bance from H 2 O limited the application of this tech¬ 
nique for protein structure analysis in aqueous media. 
Much of the early infrared studies on proteins were 
carried out in the dry state (thin films or KBr disks). 
Measurements in the solution state were carried out 
using heavy water (deuterium oxide, ^H 20 ) as the 
solvent. As can be seen from Fig. 1, the H 2 O absor¬ 
bance directly overlaps with the most important amide 
vibration (amide I band, 1,600-1,700 cm“') that is 
utilized for secondary structure analysis. The spectrum 
in Fig. 1 was obtained with a very thin path length 
(6 pm) CaF 2 cell. Despite this, the absorbance associated 



Infrared Spectroscopy of Protein Structure, Fig. 1 (a) 

Overlaid transmission infrared spectrum of water {solid line) 
and the spectrum of a protein dissolved in water {broken line). 
There is very little difference between the two spectra due to the 
overwhelming absorbance of H 2 O molecules, (b) This spectrum 
was obtained after digitally subtracting the H 2 O absorbance. 
This is done by cancelling the water overtone band at 2,100 
cm^' to obtain a flat baseline between 1,700-2,000 cm^l. The 
protein amide bands can be visualized in the absorbance (differ¬ 
ence) spectrum and their frequency, intensity, and band width 
can be used for structural analysis 

with O-H bending vibration of the H 2 O molecule results 
in an intense peak at 1,640 cm“*. To reduce water 
absorbance, short path length cells and measurement 
using attenuated total reflection (ATR) procedure is gen¬ 
erally used (Barth 2007; Goormaghtigh 2009). Thanks 
to computerization, it is now possible to record spectra 
of proteins in H 2 O and then digitally subtract the 
overlapping absorbance of H 2 O molecules, thereby 
revealing the absorbance from the protein molecule. An 
example of this is shown in Fig. 1, where the spectrum of 
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water and that of a protein dissolved in water is shown in 
the top panel (Fig. la). Very little difference can be seen 
between the spectrum of water and that of the protein 
dissolved in water. However, after digitally subtracting 
the water spectrum, it is now possible to see the peaks 
arising from the protein molecule. There are up to nine 
characteristic amide bands, named amide A, B, I, II, 

111.VII that have been identified in infrared spectra of 

proteins (Barth 2007). The most intense peaks seen in the 
protein spectrum in Fig. lb are those associated with the 
amide I and amide II vibrational modes. The amide 
I, amide II, and to a lesser extent, the amide III bands 
are commonly used for protein structure analysis (Barth 
and Haris 2009; Hering and Haris 2009). The amide 
I mode is mainly associated with C=0 stretching vibra¬ 
tion with some contribution from N-H vibration (Barth 
2007; Choi and Cho 2009). It is not surprising that since 
the amide I band is the most intense of the protein peaks 
(see Fig. lb), it is the most widely used band for protein 
secondary structural analysis. The vast majority of stud¬ 
ies have also identified it to be the most sensitive amide 
band for monitoring differences in protein secondary 
structure. Theoretical calculations have been used to 
identify amide band frequencies associated with differ¬ 
ent secondary structural elements (Barth 2007; Choi and 
Cho 2009). This has been combined with experimental 
studies on proteins and peptides of known structures to 
establish a correlation between secondary structure 
and amide I band frequencies that is quite consistent 
(Hering and Haris 2009). Assignments of protein amide 
I bands, based on theoretical and experimental studies, 
are presented in Table 1. As can be seen from this table, 
there is overlap of absorbance between different second¬ 
ary structures. Furthermore, some proteins contain 
secondary structural elements that deviate from the “stan¬ 
dard” type of structures, due to unusual arrangements, 
ranging from distortions in the organization of the struc¬ 
tural element, unusual hydrogen-bonding pattern to 
exceptional solvation (Haris and Chapman 1992). Thus, 
for example, studies have revealed that the helical struc¬ 
ture of calmodulin absorbs at 1,645 cm“* instead of the 
more commonly seen 1,650 cm“* in ^H20 and this was 
attributed to its highly flexible and solvated helical struc¬ 
tures (Haris and Chapman 1992). 

Figure 2 shows the absorbance spectrum of bacterio- 
rhodopsin and porin, after subtraction of overlapping 
water absorbance. Both of these are membrane proteins, 
and their high resolution X-ray crystal structures are 
known, the highly helical bacteriorhodopsin displays its 


Infrared Spectroscopy of Protein Structure, 
Table 1 Amide I bands attributed to different protein secondary 
structural classes. Note that the range given here is based on 
assignments published in the literature (Hering and Haris 2009) 



H 2 O (cm ') 

^H 20 (cm”‘) 

a-helix 

1,645-1,660 

1,645-1,660 

P-sheet 

1,620-1,640 

1,670“-1,695 

1,615-1,640 

1,670“-1,695 

Turns 

1,620-1,640 

1,650-1,695 

1,620-1,640 

1,650-1,695 

Other (unordered) 

1,640-1,657 

1,660-1,670 

1,640-1,654 

1,660-1,670 


“The presence of an additional band in the regionl,670-1,695 
cm^' is associated with the presence of antiparallel (l-sheet 
structure 


amide 1 band maximum at 1,662 cm~* whereas the 
P-sheet containing porin molecule shows its amide 
I maximum position at 1,629 cm“^ It is interesting to 
note that while the band frequency of porin is consistent 
with what is expected for P-sheet structure, the amide 
I band maximum for bacteriorhodopsin is rather higher 
than the frequency normally seen for a-helical structure. 
Helices in an aqueous environment are able to participate 
in inter-molecular hydrogen-bonding with water mole¬ 
cules, which results in a shift of their amide 1 band toward 
lower frequency. However, this not possible for a peptide 
inserted in the phospholipd membrane, hence the amide 
I band of membrane embedded helices occur at a higher 
frequency (Haris and Chapman 1992). In this respect, it 
has been previously pointed out (Haris and Chapman 
1992) that the amide 1 band of membrane proteins gen¬ 
erally occur at a higher wave number (1,656-1,662 
cm”*) compared to water soluble proteins (1,650-1,655 
cm”*). The highest amide I band position thus far 
detected for a membrane protein is 1,662 cm”* (for 
bacteriorhodopsin, see Figs. 2 and 3). Since the amide 
I band is broad, absorbance from minor structural ele¬ 
ments cannot be readily visualized. Hence, methods such 
as second-derivative and deconvolution procedures are 
applied to reveal the overlapping bands. Thus, for exam¬ 
ple, the second-derivative spectrum of bacteriorhodopsin 
(Fig. 3) shows additional peaks that are not easily seen in 
the original absorbance spectrum (Fig. 2). The negative 
bands in the second-derivative spectra correspond 
to component bands in the absorbance spectrum. 
For example, the second-derivative spectrum (Fig. 3) 
reveals the presence of a small proportion of (I-sheet 
structure (1,636 cm”*) in bacteriorhodopsin which is not 
readily evident from the absorbance spectrum (Fig. 2). 
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Infrared Spectroscopy of Protein Structure, Fig. 2 This 
shows the absorbance spectra of two membrane proteins. The 
absorbance spectra were obtained after the subtraction of the 
overlapping H 2 O absorbance. Bacteriorhodopsin (broken line) 
has seven transmembrane a-helices and its amide I band maxi¬ 
mum is at 1,662 cm“’, In contrast, porin (solid line), which 
has a predominantly P-barrel structure has band position near 
1,630 cm^'. The presence of antiparallel P-sheet structure in 
porin is evident from a weaker band near 1,690 cm^ *. These two 
proteins represent the two main types of folding motifs in mem¬ 
brane proteins. The amide II bands of the proteins are located 
near 1,550 cm“*. This band is less sensitive to secondary struc¬ 
ture differences 


The amide I band of rhodopsin, which has a similar struc¬ 
ture to bacteriorhodopsin, occurs at 1,657 cm“*. The 
reason for the unusually high amide I band frequency 
for the helical structure of bacteriorhodopsin has been 
debated for more than 2 decades. Recently, a highly plau¬ 
sible explanation for this has been achieved by calculating 
the amide I spectmm of bacteriorhodopsin using transi¬ 
tion dipole coupling model (Karjalainen and Barth 2012). 
This revealed that there is interhelical coupling which 
produces a shift of the amide I band toward higher fre¬ 
quency. This shift to higher frequency is particularly 
pronounced for bacteriorhodopsin as it forms oligomers 
in the membrane. However, for rhodopsin (see Fig. 3) this 


effect is less as there is only intramolecular coupling 
between the helices. 

In general, P-sheet structures absorb in the region 
1,640-1,620 cm“* but in some cases they can overlap 
in the region where random coil structure may occur 
(Bering and Haris 2009). However, this situation is rare 
and P-sheet structure can be reliably determined using 
infrared spectroscopy. Infrared spectroscopy can be 
effectively used to distinguish between parallel and anti¬ 
parallel P-sheet structure. Antiparallel p-sheet structure 
shows an additional band, which is much weaker, at 
1,690-1,695 cm~^ (see later). This is evident from 
Fig. 2 for the spectmm of porin which has a weaker 
band in this region and is consistent with the presence 
of antiparallel p-sheet stmcture. However, absorbance in 
this region is weak and overlap can occur with P-tum 
stractures and even non-hydrogen bonded polypeptide 
segments (Barth 2007). Thus, caution is necessary when 
making such interpretation, especially in the case of 
a protein with unknown stmcture. However, in those 
cases where the stmcture of the protein has been well 
characterized, using X-ray or NMR spectroscopy, the 
use of FTIR (Fourier transform infrared) spectroscopy 
for probing subtle changes in minor peaks can be very 
attractive as they can be much more easily interpreted by 
taking into consideration the known three-dimensional 
(3D) stmcture of the protein. 

Examples of studies that have exploited the Amide 
I band for investigating different aspects of protein 
structure are described below. 

1. Methods for qualitative and quantitative assessment 
of protein secondary structure 
The amide I band contains vibrational modes from 
each of the carbonyl groups that exist in a protein but 
they are highly overlapped due to the relatively small 
frequency range of the amide I band where protein 
secondary structures absorb (1,615-1,700 cm”'). 
This congestion of the amide I band can be 
partially relived through the use of “resolution 
enhancement” techniques such as second-derivative 
and deconvolution methods (Haris and Chapman 
1992; Bering and Haris 2009; Barth 2007). These 
methods enable (see Fig. 3) the identification of 
bands arising from different stmctural elements in an 
infrared spectrum. The position and intensity of these 
bands have been used for qualitative analysis of pro¬ 
tein secondary stmcture (Bering and Haris 2009). 
Quantitative estimation of protein secondary structure 
from infrared spectra has been most commonly carried 
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Infrared Spectroscopy of Protein Structure, Fig. 3 Second- 
derivative spectra of bacteriorhodopsin (solid line) and rhodop- 
sin (broken line). The spectra are for membrane protein samples 
suspended in aqueous (H 2 O) media. Both of these proteins have 
seven transmembrane helices and their X-ray structures are very 
similar. The infrared spectra of the two proteins are also virtually 
identical, with the exception of a shift of the amide I band toward 
higher frequency for bacteriorhodopsin. This can be attributed 
to different hydrogen-bonding pattern, lipid-protein, and 


out using curve fitting of the amide I region of the 
original absorbance spectrum or the second-derivative 
and deconvolved spectrum (Hexing and Haris 2009). 
Other methods developed include factor analysis and 
partial least squares method. More recently, artificial 
intelligence techniques such as neural networks and 
genetic algorithms for quantification of protein sec¬ 
ondary stracture have been developed (Hering and 
Haris 2009). Although based on different mathemati¬ 
cal principles, the content of secondary structure esti¬ 
mated from infrared spectra using these methods 
agrees well with values obtained from X-ray diffrac¬ 
tion data. The standard error of prediction obtained 
from analysis of infrared spectra, using different math¬ 
ematical and computational methods, ranges from 
2.1-12.14, 2.4-11.22, 3.4-8.07 for a-helix, p-sheet, 
and turn structures, respectively (Hering and Haiis 
2009). These are similar to those reported for Circular 


protein-protein interaction in the two proteins. However, more 
recently it has been suggested that interhelical coupling may be 
responsible for the shift of the amide I band toward higher 
frequency for bacteriorhodopsin (Karjalainen and Barth 2012). 
The hands near 1,636-1,639 cm^* may represent some fi-sheet 
structure although overlap from turn structures is also possible. 
The very weak bands in the region 1,681-1,691 cm^* may 
reflect the presence of turn structures and contribution from 
anti-parallel (l-sheet structure 


Dichroism (CD) and Raman spectroscopy (Hering 
and Haris 2009). 

The relatively recent emergence of 2D infrared 
(2D1R) spectroscopy has further improved the 
possibility of unmasking information hidden within 
the highly congested amide I band. 

2. Characterization of the secondary structure of 
a protein with unknown structure; 

Generally it has been popular to use CD spectroscopy 
to gain information on the structure of a protein whose 
structure has not been determined using X-ray dif¬ 
fraction or NMR spectroscopy. In recent times, infra¬ 
red spectroscopy is also being used in such situations. 
FTIR spectroscopy is used to analyze the purified 
protein dissolved in aqueous media (in H 2 O and/or 
^H 20 ) or sometimes in the solid state in the form of 
thin films or KBr disks. After subtraction of the sol¬ 
vent background absorbance, the amide I band is 
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further analyzed using second-derivative or 
deconvolution methods. For example, this approach 
was used to determine the secondary structure of the 
complement protein factor H (Haris and Chapman 
1992). The presence of bands at 1,635 cm~ was 
attributed to (l-sheet, and the presence of an additional 
band at 1,685 cm” * indicated the presence of antipar¬ 
allel P-sheet structure. This information was com¬ 
bined with data from theoretical secondary stmcture 
prediction (from the amino acid sequence) to propose 
a model for the 3D structure of factor H. The struc¬ 
tural model, including the presence of antiparallel P- 
sheet structure, determined by infrared spectroscopy, 
was subsequently proven to be correct using NMR 
spectroscopy (Haris and Chapman 1992). With the 
availability of methods for quantitative analysis of 
secondary structure, analysis of proteins, with 
unknown stmcture using infrared spectroscopy is 
increasingly becoming popular. This is especially 
the case with membrane proteins which cannot be 
easily analyzed using CD spectroscopy. Furthermore, 
the intensity of the CD signal often disappears when 
proteins and peptides aggregate. 

3. Infrared analysis of hydrogen-deuterium exchange 
in proteins 

Infrared spectroscopy can be used to probe 
hydrogen-deuterium exchange in proteins (Haris 
and Chapman 1992; Barth 2007; Goormaghtigh 
2009). This is a powerful approach for investigating 
protein structure and dynamics. The amide II band 
near 1,550 cm”^ is particularly useful for this since 
the replacement of the proton by deuterium in the 
peptide bond (CONH to CON^H) results in a large 
shift of this band (Haris and Chapman 1992). This is 
because the amide 11 band arises principally from 
N-H bending vibration strongly coupled to C-N 
stretching. The transition from N-H to N-H results 
in a shift in the vibration by almost 100 cm”* from 
approximately 1,550 cm”* (N-H) to approximately 
1,450 cm”* (C-N stretching vibration without 
N-^H contribution). H-^H exchange also effects 
the vibration of the amide I bond. This is because 
both the CO and N-H groups are part of the same 
peptide bond unit and there is weak coupling of CO 
stretching and in-plane bending vibration of the 
N-H groups. However, the shift observed is much 
smaller (5-15 cm”*). Nevertheless, this has been 
successfully used for establishing the relationship 
between H-^H exchange and different types of 


secondary structure. It has been demonstrated that 
H-H exchange can be used to detect subtle differ¬ 
ences in the tertiary structure of ribonuclease S and 
ribonuclease A (Haris et al. 1986). These two pro¬ 
teins have the same amino acid sequence and differ 
by the presence of a cleaved peptide bond in ribo¬ 
nuclease S. The X-ray structure of the two proteins 
are virtually identical; however, FTIR spectroscopy 
was able to reveal differences in H-^H exchange 
properties that can be attributed to differences 
in their tertiary structure (Haris et al. 1986). 
This approach is now very popular in the character¬ 
ization of protein structure and dynamics (Haris and 
Chapman 1995; Barth 2007; Goormaghtigh 2009). 
4. Characterization of protein conformational change: 
In these studies, the conformation of a protein (with 
known or unknown X-ray/NMR structure) is ana¬ 
lyzed to monitor changes in the structure of the 
protein resulting from its interaction with another 
molecule, changes in pH, temperature, pressure, 
physical state, etc. (Haris and Chapman 1995; 
Barth 2007). Some recent examples of studies in 
this area are presented below, particularly focusing 
on systems that are technically challenging for other 
biophysical methods. 

Studies of Protein Aggregates 

The ability to use FTIR spectroscopy to characterize 
proteins in “hard to study environments,” such as in 
the aggregated state or in membrane systems, makes 
it a particularly invaluable tool in many situations. 
The technique is being extensively used in the study of 
peptide aggregation, oligomerization and fibrillization to 
understand the mechanism underlying different protein 
misfolding diseases including prion disease, Alzheimer’s 
disease, Parkinson’s disease, etc. 

The Nobel Prize winning work on the discovery of 
prions had to rely on the use of infrared spectroscopy 
(Prusiner 1998) to demonstrate that the aggregated 
form of the prion protein (PrP®‘’) is different from the 
native PrP'^ protein and is dominated by P-sheet struc¬ 
ture. Scientists engaged in this project had little choice, 
the PrP®‘^ protein could not be crystallized for X-ray 
diffraction. Furthermore, since it exists in the form of 
insoluble aggregates, NMR and CD spectroscopy 
could not be used. The transmission FTIR spectrum 
of purified prion protein PrP*^ displayed an amide 
I band at 1,653 cm”* in ^H20 (Pan et al. 1993). 
This was assigned to a-helical structure. The authors 
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confirmed this assignment by also recording the CD 
spectrum of the protein. In contrast, the infrared spec¬ 
trum of PrP^*^ displayed amide I band frequencies that 
were consistent with the presence of p-sheet structure. 
Quantihcation of the secondary structure was achieved 
by least-squares iterative curve htting of each of the 
peaks to Lorentzian line shapes. This type of analysis 
revealed that PrP*^ protein contained 42% a-helical 
structure and only 3% P-sheet. In contrast, PrP®*^ was 
found to contain 30% a-helix and 43% P-sheet (Pan 
et al. 1993; Prusiner 1998). However, more recent 
studies carried out on PrP®^^ after extensive purification 
steps suggest that the p-sheet content is much higher 
than previously estimated and there is virtually no 
a-helix. This discrepancy has been attributed to 
overestimation of the helical structure in the earlier 
studies due to presence of impurities in PrP®"^ and 
problems with band assignments used in the curve 
fitting of the amide 1 band (Baron et al. 2011). 
This study illustrates why it is important to have 
a highly pure protein or at least have knowledge 
of the purity of the protein before carrying out 
quantitative estimation of protein secondary structure. 

Majority of the studies on protein aggregation in 
relation to protein misfolding and disease have focused 
on Alzheimer’s disease. The toxic action of P-amyloid 
(AP) peptide in the form hbrils or oligomers is thought 
to be important in the pathogenesis of Alzheimer’s 
disease. Studies with P-amyloid peptides have shown 
that those peptides adopting antiparallel structures 
show a band near 1,630 cm~* and an additional weaker 
band at approximately 1,695 cm“*. The intensity ratio 
of these two bands has been used to determine the 
proportion of the antiparallel p-strands in a P-sheet 
(Itkin et al. 2011). This approach has been used to 
assess quaternary structure of peptide aggregates 
when they assemble to form oligomers and fibrils. 
For example, in a recent study ATR-FTIR spectros¬ 
copy (Itkin et al. 2011) was used to investigate 
the role of calcium ions in the formation of amyloid 
P-Peptide (1-40) oligomers. The authors found that for 
the AP- (1^0) peptide, in the “no Csl'^ condition,” the 
ratio of the 1,695/1,630 cm“* dramatically decreases 
from 0.32 (t = 0 h) to 0.08 (t = 96 h). This large 
reduction in the ratio was interpreted as a decrease in 
antiparallel P-sheet structure due to the formation of 
fibrillar assemblies from oligomers, in the absence of 
added Ca"^. The authors confirmed their findings by 
PAGE analysis and ThT fluorescence measurements. 


Through monitoring changes in band frequency, inten¬ 
sity, and width, the authors were able to investigate 
secondary, tertiary, and quaternary structural changes 
associated with the aggregation of amyloid peptides. 
The authors concluded from their study that calcium 
ions promote the formation of oligomers of Ap (1^0), 
which are suggested to play a role in the pathogenesis 
of Alzheimer’s disease (Itkin et al. 2011). 

Conformational Change Associated with Protein 
Interaction with Metals and Small Molecules 

Many studies have investigated the interaction of sin¬ 
gle elements such as specific metal ions (as discussed 
above regarding amyloid peptide interaction with 
calcium ions) with proteins and their effect on protein 
secondary structure (Barth 2007; Haris and Chapman 
1995). The effect of pH has been investigated in many 
different studies. The binding of small molecules 
(drugs, amino acids, phosphate groups, nucleotides, 
vitamins, etc.) has also been studied. These studies 
have revealed conformational changes in proteins. 
For example, Barth and coworkers (Stolz et al. 2009) 
have used FTIR spectroscopy to monitor changes in 
the secondary structure and tertiary structure of NaV 
K'^-ATPase and Ca^’^-ATPase as a function of interac¬ 
tion with metal ions and ATP. In this study, they 
attributed a negative band at 1,659 cm“* and 
a positive band at 1,620 cm”*, in the difference spec¬ 
trum of the protein as being consistent with conversion 
of a-helices to p-sheets associated with interaction of 
ATP with Na-'/K'^-ATPase. 

Conformational Change Associated with Protein 
Interaction with Other Macromolecules 

Interaction of proteins with lipid membranes has been 
extensively studied since infrared spectroscopy is one 
of the few techniques that can be used for analysis of 
lipid-protein complexes. As the bands arising from the 
most commonly used lipids do not contain amide 
groups, they do not overlap with the amide I band of 
the protein. This enables the simultaneous monitoring 
of protein and lipid structure when the two molecules 
are involved in interacting with each other. 

In contrast to lipid-protein interaction, very little work 
has been carried out on protein-protein interaction using 
infrared spectroscopy. This is because the signals from 
the two interacting proteins overlap, making it almost 
impossible to distinguish the absorbance from the indi¬ 
vidual proteins (Haris 2010). To overcome this problem. 
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a new method for investigating protein-protein interac¬ 
tion, where one of the proteins is fully labeled using 
and amino acids was introduced in 1992 (Haris et al. 
1992). This method has been extended to the study of 
receptor-ligand interactions and is likely to be further 
used as it becomes easier to isotopically label proteins 
(Li and Arakawa 2009; Vogel et al. 2007; Haris 2010) 

Identifying Proteins Using Infrared Spectroscopy 

The promising role of FTIR spectroscopy in proteo- 
mics research, focusing on protein secondary structure, 
has been previously highlighted (Hering and Haris 
2009). Recently, a modified version of 2DIR (called 
electron-vibration-vibration 2D coherent spectros¬ 
copy) has been used to distinguish 10 different proteins 
based on their amino acid content (Fournier et al. 
2008). The approach first involves the identification 
of 2DIR vibrational signatures of the different amino 
acids. Subsequently, their corresponding signal inten¬ 
sities were used to quantitatively estimate their levels 
using the vibrational feature of a methyl group as an 
internal reference. The authors reported that successful 
differentiation of proteins can be achieved within 
4-5 min per protein (Fournier et al. 2008). This 
is a promising development which is likely to be 
increasingly applied. 

Polarized Infrared Spectroscopy of Proteins in 
Membranes 

Polarized infrared spectroscopy has been successfully 
employed in determining the orientation of a protein 
and its secondary structural elements with respect to 
the lipid bilayer (Goormaghtigh 2009; Haris and 
Chapman 1992). Isotopically labeled residues are being 
used for determining the orientation of specific residues 
in peptides and proteins and evaluating if they undergo 
any conformational changes due to interaction with 
other molecules or alterations in the functional state 
of a protein. In a recent study, ID and 2D infrared 
spectroscopy, in conjunction with polarized infrared 
spectroscopy, was used to determine the structure and 
orientation of the M2 protein, which is a useful model 
system for investigating the mechanism of H"^ gating in 
ion channels (Manor et al. 2009). Polarized infrared 
spectroscopy was used to determine the orientation of 
the M2 peptide with respect to the lipid bilayer. 
Information regarding specific residues lining the ion 
channel was determined using 2D infrared spectroscopy. 
Isotopically labeled amino acids were used to identify 


the role of specific residues in the stracture and function 
of the ion channel. The authors used amino 

acids to spectroscopically separate the signal of the indi¬ 
vidual residues. This results in a shift of the labeled 
amino acid by 60 cm“^ shift toward lower frequency, 
thereby separating its amide vibration from the broad 
amide I band corresponding to the unlabeled amino acid 
residues. Rotation of the transmembrane a-helices was 
associated with the operation of the ion channel. 

Effect of Environment on Protein Structure 
Investigated Using Infrared Spectroscopy 

The major attraction of infrared spectroscopy as 
a biophysical tool is its ease of application in the 
probing of conformational change in environments 
that are beyond the reach of other techniques. Proba¬ 
bly, the biggest advantage is the ability to use the 
technique to characterize protein structure in the 
presence of membranes (lipid-protein complexes) as 
well as monitoring changes in structure associated with 
conversion from one physical state to another includ¬ 
ing protein precipitates/aggregates. Figure 4 shows the 
changes in the amide I band of an ion channel peptide 
associated with transferring it from an aqueous media 
to trifluoroethanol (induces helicity) and phospholipid 
membranes. The shift in the amide I frequency 
from 1,641 cm“' (water) to 1,650-1,654 cm“' 
(trifluoroethanol and phospholipid membranes) can 
be attributed to a change from random coil structure 
(1,641 cm”*) to a helical structure (1,650 cm”*). 

The use of infrared spectroscopy to probe structural 
changes associated with application of high pressure 
and temperature is described with the example of 
transthyretin (TTR). This protein is amyloidogenic 
and its aggregation is associated with several familial 
amyloid diseases. Cordeiro et al. (2006) used FTIR 
spectroscopy to study the thermal denaturation of the 
protein. The unfolding of the protein was evident from 
the loss of intramolecular P-sheet structure and the 
simultaneous appearance of absorbance of nonnative 
structure that is often attributed to antiparallel P-sheet 
structure (bands at 1,616 and 1,686 cm”*). Pressure- 
induced denaturation studies revealed that even at very 
high pressures (12 kbar), only a small proportion of the 
P-sheet structure was lost. Comparison of the FTIR 
spectrum of the native tetramer form of the protein 
with the amyloid fibril form (produced at atmospheric 
pressure upon acidification at (pH 4.4), revealed that 
there is little change in p-sheets upon fibrillization. 
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Infrared Spectroscopy of Protein Structure, Fig. 4 Infrared 
spectra of the Shaker potassium channel S4 peptide in different 
media: ^H 20 (broken line), trifiuoroethanol (dotted line), and in 
dimyristoyl phosphatidylcholine lipid membranes (solid line). 
The amide I band at 1,641 cm^' represents random coil struc¬ 
ture, while the band at 1,654 cm^* represents a-helical structure. 
This figure is adapted from Haris et al. (1994) 


However, the alignment of p-sheets was found to be 
altered, as a band at 1,625 cm” was attributed to 
distinct P-sheet contacts. The conversion of the protein 
from tetramers to fibrils was associated with a loss 
in random coil content. The authors found that the 
TTR protein, recovered from a cycle of compression- 
decompression, displays an infrared spectrum that is 
quite similar to that of fibrils produced at atmospheric 
pressure. This was taken to suggest that high hydro¬ 
static pressure may convert tetramers of the TTR 
protein into an amyloidogenic conformation. This 
study was valuable in showing how the amide I band, 
which is sensitive to protein packing, provides 
a fingerprint of the fibrils versus amorphous aggregates 
(soluble and insoluble forms) of TTR. 

Potential Pitfalls 

Like all spectroscopic techniques, infrared spectroscopy, 
applied to protein structure analysis, is not free from 


problems and pitfalls that experimentalists should be 
aware of. When using infrared spectroscopy for protein 
stracture analysis in general, the following issues should 
be taken into consideration: 

1. Protein Purity: It is important to ensure the protein 
is not “contaminated.” Here, contamination refers 
to the presence of any other molecule in the protein 
solution that is not part of the protein. This could 
include the following: 

(a) Buffer: One needs to use buffers that do not 
absorb in the region of interest. Thus buffers 
containing carbonyl and amide groups should 
be avoided or used at a concentration that would 
not cause significant overlap with the protein 
signal. 

(b) Other proteins/peptides/amino acids: Obviously 
any other protein/peptide/amino acid present in 
the system could overlap with the absorbance of 
the protein being analyzed and therefore compli¬ 
cate the analysis. If any other protein is present 
(and this cannot be easily removed) then at least 
some knowledge of the quantity of this protein that 
is present should be known so that this could be 
taken into consideration during the analysis of the 
data. In any case, unambiguous interpretation of 
the secondary structure of a protein can only be 
done when one is certain that the solution being 
analyzed only contains the protein under investi¬ 
gation and no other proteins/peptides/amino acids. 

(c) Other molecules: Often during purification of 
proteins or synthesis of peptides, other molecules 
are introduced into the system. For example, in the 
purification of proteins, one may add detergents 
and salts that may contain carbonyl or amide 
groups that can absorb in the amide I region. 
Therefore, one should know what other molecules 
are present in the protein sample that could absorb 
in the amide I region. The problem with the over¬ 
lap of absorbance from trifluoroacetic acid (TFA) 
which absorbs at 1,674 cm”' with peptide amide 
I absorbance is now well known (Haris and Chap¬ 
man 1995), but in the past, several studies have 
erroneously attributed the TFA peak to protein 
secondary stmcture. A full knowledge of the 
different stages between the protein purification/ 
synthesis step and the infrared analysis must be 
known so that everything possible is done to 
ensure that there is no interference in the amide 
I band from an unwanted molecule. 
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2. Subtraction of overlapping absorbance'. When 
analyzing protein amide I band for a sample dissolved 
in a solvent, it becomes necessary to subtract the 
absorbance of the solvent to fully view the amide 

I band without interference from the solvent peaks. 
This is particularly important in the case of the sub¬ 
traction of H 2 O absorbance. Since the amide I band is 
directly overlapped by the intense O-H bending 
vibration (Fig. la), the subtraction of this peak can 
be rather difficult. Often over-subtraction or under¬ 
subtraction can occur due to slight differences in path 
length (which is more common when short path 
length transmission cells are used). Measuring sam¬ 
ples using ATR accessory is generally free from this 
problem. The temperature of the sample and the 
buffer (solvent) should be as identical as possible as 
the H 2 O spectrum is highly sensitive to temperature. 
The infrared spectmm of water can change due to pH, 
presence of other molecules, concentration of sub¬ 
stances dissolved in water, etc. Awareness about 
these issues is necessary to ensure accuracy and reli¬ 
ability of the water subtraction and the subsequent 
stracture analysis. When analyzing the amide I band, 
it is also important to consider that overlap from the 
absorbance of certain amino acid side chains, within 
the protein being analyzed, can also occur (Barth 
2007). Attempts have been made to subtract this 
overlapping absorbance from the amide I and amide 

II bands although it is not straightforward 
(Goormaghtigh 2009). 

3. Water vapor. Water vapor peaks can significantly 
overlap with the amide I region and make it difficult 
to carry out detailed analysis of the minor spectral 
features, especially when techniques such as sec¬ 
ond-derivative or deconvolution methods are used 
to identify minor bands. One solution is to purge the 
instrument with dry air or nitrogen to reduce the 
absorbance from water vapor. Also it is possible to 
subtract a prerecorded water vapor spectrum from 
the protein spectrum. However, this should be done 
with care as over-subtraction or under-subtraction 
could introduce artifacts that could cause misinter¬ 
pretation of the spectral data. It is worth keeping in 
mind that a corrected spectrum is never as good as 
a carefully recorded spectrum without water vapor. 

4. Further processing of spectral data using computa¬ 
tional and mathematical procedures: Besides 
second-derivative and deconvolution analysis, many 
different types of computational software are being 


used for spectral analysis. However, they should be 
carried out with care and only applied to high quality 
spectral data. 

5. The effect of sample environment and variation in 
sample physical state: Protein spectra can change as 
a function of environment (Haris and Severcan 
1999; van de Weert et al. 2001). Thus, for example, 
the absorbance of a secondary structural element in 
an organic solvent like methanol or in phospholipid 
membranes may occur at higher wave number due 
to the reduced potential for them to participate in 
intermolecular hydrogen-bonding, compared to the 
same structure in an aqueous environment where 
such bonding is possible. Likewise, the spectral 
band frequency, intensity, and width of a protein 
spectrum recorded in the solid state (with water 
removed by lyophilization) can be significantly 
different to that of the same protein in aqueous 
solution as has been previously reported (van de 
Weert et al. 2001). Thus, alteration in hydrogen¬ 
bonding potential and molecular packing density, as 
a consequence of altering sample physical state, 
should be taken into consideration in the interpreta¬ 
tion of spectral data. 

Summary 

Infrared spectroscopy is a powerful tool for 
protein structure characterization. It is widely used for 
protein secondary and tertiary structure determination 
through the analysis of the amide vibration, amino acid 
chain vibration, and hydrogen-deuterium exchange. 
This approach has also been used for the determination 
of protein quaternary stmcture through the analysis of 
protein aggregation, oligomer/fibril formation, as well as 
subunit association/dissociation. Recently, the technique 
has been extended to the analysis of protein primary 
stracture with the identification of proteins based on 
their amino acid composition. Structural changes in pro¬ 
teins, resulting from interaction with other proteins, 
membranes, DNA, RNA, and a wide array of small 
molecules has been valuable in advancing our under¬ 
standing of biochemical mechanisms. Infrared spectros¬ 
copy has also been valuable for investigating the effect of 
the surrounding environment on protein stracture. This 
has included studies investigating the effect of pressure, 
temperature, pH, and alteration in physical state of the 
protein. Secondary stracture of a protein can be 
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quantitatively determined from its infrared spectrum 
using different methods with a degree of accuracy that 
is comparable to other optical spectroscopic methods. 
Use of 2DIR, isotopic labeling, second-derivative, and 
deconvolution methods are helping to separate and iden¬ 
tify small spectral changes, enabling the determination of 
structural information at the level of single amino acid 
residues. However, there are potential pitfalls that one 
should be aware of when using infrared spectroscopy and 
some these have been highlighted. More emphasis 
should be placed on fundamental research in order to 
understand how diverse stractural and environmental 
changes are manifested in alteration in infrared band 
frequency, intensity, and band width. Armed with such 
knowledge, the true power of infrared spectroscopy in 
protein structure analysis can be fully realized. 

Cross-References 

► Far-Infrared Spectroscopy of Proteins 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Isotope Edited Raman and Infrared Difference 
Spectroscopy Studies of Protein Structure 

► Near-Infrared Spectroscopy in Biological 
Molecules and Tissues 

► Nonlinear IR Spectroscopy of Proteins 

► Protein Dynamics Studied with ID and 2D Infrared 
Spectroscopy 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Raman Spectroscopic Instrumentation, 
Experimental Considerations 

► Reaction-Induced Infrared Difference Spectroscopy 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 

► Rhodopsins - Intramembrane Signaling by 
Hydrogen Bonding 
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^SOLEIL Synchrotron, L’Orme des Merisiers BP48, 
Gif Sur Yvette Cedex, France 


Synonyms 

Fourier transform infrared spectroscopy; Infrared 
spectroscopy; Infrared spectro-microscopy; Infrared 
spectroscopic imaging; Synchrotron radiation; 
Synchrotron light 

Definitions 

A Synchrotron facility has, at its heart, a particle accel¬ 
erator that accelerates charged particles (typically 
electrons) in a circular path, where the magnetic field 
(to turn the particles so they circulate) and the electric 
field (to accelerate the particles) are carefully synchro¬ 
nized with the travelling particle beam. 


Synchrotron radiation is the light produced by 
a synchrotron, which spans a very wide electromagnetic 
spectrum, extending from radio waves to infrared light, 
visible light, ultraviolet light. X-rays, and gamma rays. 
It is distinguished by its characteristic properties: 
brightness, pulsed nature, polarization, and broadband 
spectrum. 

A heamline is an experimental station that utilizes 
a specific energy domain emitted by the synchrotron 
for research. 

Infrared spectroscopy exploits the far- to near- 
infrared region of the electromagnetic spectrum for 
determining the physical properties of solid state com¬ 
ponents, for surface science, and for identifying the 
chemical composition of compounds by exploiting 
the fact that molecules absorb specific frequencies 
that are characteristic of their structure. 

Introduction 

Since the early 1990s, synchrotron radiation (SR) has 
benefited the field of infrared (IR) spectroscopy in 
a wide range of scientific disciplines from condensed 
matter physics to medicine. While a synchrotron infra¬ 
red source does not typically produce more power than 
a conventional thermal (globar) source, its brightness 
(defined as the photon flux or power emitted per source 
area and solid angle) is 100-1,000 times greater. This 
advantage, arising from the small effective source size 
and narrow angular range of emission, has enabled 
a wide spectrum of throughput-limited experiments 
that were not possible with a conventional IR source 
(Carr 1999, 2001). 

Although the brightness advantage of the synchrotron 
source is its most important quality, it possesses other 
unique and useful features. For example, it is a continuum 
source with spectral coverage spanning the entire infrared 
spectmm, it is pulsed on the picosecond timescale, and it 
possesses well-defined polarization states (Carr 2001). 
The source can be used in pump-probe studies of dynam¬ 
ics and is particularly well suited to experiments where 
multiple and widely varying photon energies or wave¬ 
lengths are required. As an example, combined x-ray 
(e.g., fluorescence, absorption, scattering) and IR micro¬ 
scopic analysis on the same sample represents an activity 
of interest in the research community. 

The earliest efforts to utilize the synchrotron IR 
source were motivated by the need for a bright far-IR 
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Infrared Spectroscopy 
using Synchrotron 
Radiation, Fig. 1 Infrared 
beamline end stations at the 
national synchrotron light 
source. From Left to right: L. 
Mihaly recording far-IR data 
with high resolution, L.M. 
Miller, and G.P. Williams 
using the microscope end 
station, and G.L. Carr aligning 
a spectrometer with the 
synchrotron beam 



source for gas phase spectroscopy and for low- 
throughput experimental methods such as grazing 
incidence spectroscopy from surfaces. Although it 
was understood that the brightness advantage of the 
synchrotron source extended into the mid-IR, the ear¬ 
liest demonstrations of its use for microspectroscopy 
did not occur until the early 1990s a custom IR micro¬ 
scope was used to focus light through a high-pressure 
diamond anvil cell at the NSLS. Similar efforts to 
develop synchrotron-based IR microspectroscopy 
were ongoing at UVSOR in Japan at the same time. 
The first commercial infrared microscope was installed 
at the NSLS a few years later (Fig. 1). 

Facilities for performing IR spectroscopy and 
microspectroscopy have expanded throughout the 
world in response to an increasing demand for 
beamtime from the diverse scientific communities. 
Currently, there are 28 infrared beamlines worldwide, 
either operational or under construction (Table 1), 
and beamtime is provided free of charge based on 
a peer-reviewed proposal system. 

Applications of Synchrotron IR Spectroscopy 

FTIR Microspectroscopy and Imaging 

The high brightness of synchrotron IR light enables 
diffraction-limited spatial resolution with high-quality 
(signal-to-noise) spectra (Fig. 2). To date, FTIR 
microspectroscopy has been used to study the compo¬ 
sition of a wide range of materials, such as polymer 
distributions in copolymers, carbon sequestration in 


soils, fluid inclusions in minerals, and the organic 
content in interplanetary dust particles (Figs. 3-6). 
However, biological applications have seen the largest 
growth since the development of synchrotron-based 
FTIRM (Miller and Dumas 2006; Hamm 2009; Miller 
and Dumas 2010). For example, it has been used to 
probe newly deposited bone in osteoporosis and to assess 
the influence of dmg treatments. Bone composition was 
also determined within areas of microdamaged bone, as 
indicated by positive (i.e., magenta) fuchsin staining in 
Fig. 5. FTIR images demonstrate that microdamaged 
bone is chemically different from the surrounding 
undamaged tissue. Specifically, the results obtained 
show that microdamaged bone is not more mineralized 
than the surrounding tissue, but the mineral stoichiometry 
is altered, potentially compromising bone’s mechanical 
properties. Synchrotron FTIRM has also been used to 
study Alzheimer’s disease (AD). This disease is charac¬ 
terized by the misfolding and plaque-like accumulation 
of a naturally occurring peptide in the brain called amy¬ 
loid beta (AP). Specifically, it has been used to image the 
in situ secondary structure of amyloid plaques in 
brain tissue from AD brains with high spatial resolution. 
Figure 6 shows a FTIR spectrum obtained inside the 
plaque, which reveals that misfolded AP has a high 
P-sheet content. Some brain samples have been shown 
to exhibit additional absorption features in the FTIRM 
spectra, assigned to creatine (Fig. 6c). Chemical images 
of Ap and creatine distribution from a hippocampal sec¬ 
tion of Alzheimer-diseased human brain, obtained with 
an aperture of 3 x 3 pm and 1 pm step size, are displayed 
in Fig. 6d, e, respectively. 
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Infrared Spectroscopy using Synchrotron Radiation, Table 1 Actual status of synchrotoron infrared beamlines worldwide. 
Some are under design, or in construction. Many of them are operational. (From http://infrared.als.lbl.gov/content/web-links/45-srir) 

Facility name 

Location 

URL 

NSLS 

New York, USA 

http://infrared.nsls.bnl.gov/ 

ALS 

Berkeley, USA 

http: //infrared, als. Ibl. go v/content/ 

SRC 02 - IRENI 

Madison, USA 

http://www.src.wisc.edu/facility/complist.htm 

Diamond B22 

Oxfordshire, UK 

http://www.diamond.ac.uk/Horae/Beamlines/B22.html 

BESSY IRIS 

Berlin, Germany 

http://www.bessy.de/bit/bit show object.html.php?i bit id object=155 

SOLEIL SMIS 

Gif Sur Yvette, France 

http://www.synchrotron-soleil.fr/portal/page/portal/Recherche/ 

LignesLumiere/SMIS 

SOLEIL AIDES 

Gif Sur Yvette, France 

http://www.synchrotron-soleil.fr/portal/page/portal/Recherche/ 

LignesLumiere/AILES 

SLSIR 

Villigen, Switzerland 

http://sls.web.psi.ch/view.php/beamlines/ir/index.html 

Australian Synchrotron IR 
Microscopy 

Melbourne, Australia 

http://www.synchrotron.org.au/index.php/aussyncbeamlines/infrared- 

micro/beamline-team 

Australian Synchrotron Far- 
IR & High-res beamline 

Melbourne, Australia 

http://www.synchrotron.org.au/index.php/aussyncbeamlines/far-infrared- 

and-high-resolution-ftir/beamline-team 

DAFNE SINBAD 

Frascati, Italy 

http://www.lnf.infn.it/public/index.php? 

option=com_content&view=article&id=59:dane-luce&catid=40: 

tecnologie&Itemid=80 

ELETTRA SISSI 

Trieste, Italy 

http://www.elettra.trieste.it/experiments/beamlines/sissi/index.html 

CLS Mid-IR 

Saskatoon, Canada 

http://www.lightsource.ca/experimental/midir.php 

CLS Far-IR 

Saskatoon, Canada 

http://www.lightsource.ca/experimental/farir.php 

NSRRC BL14A 

Hinschu, Taiwan 

http://l 40.110.203.42/EFD.php?num=250 

ANKA IRl 

Eggenstein- 

LeopoldshafenGermany 

http://ankaweb.fzk.de/website.php?page=instrumentation_beam&id=l 

MAX-lab Beamline 73 

Lund, Sweden 

http://www.maxlab.lu.se/beamlines/bl73/ 

ESRFID21 SR-FTIR 

Grenoble, France 

http://www.esrf.eu/UsersAndScience/Experiments/lmaging/ID21 

UVSOR BL6A1 

Okazaki, Japan 

http://www.uvsor.ims.ac.jp/defaultE.htral 

SPring-8 BL43IR 

Hyogo, Japan 

http: //infrared43. spring8 .or. jp/ 

CAMD IR beamline 

Baton Rouge, USA 

http://www.camd.lsu.edu/index.htm 

JLab THz lab 

Newport News, USA 

http://www.jlab.org/FEL/terahertz/ 

NSRL U4 

Hefei, China 

http://www.nsrl.ustc.edu.cn/en/pages/Facilities/ 

SSLS ISMI 

Singapore, Singapore 

http://ssls.nus.edu.sg/facility/facility.htm 

SRLI IR 

Korat, Thailand 

http://www.slri.or.th/new eng/index.php 

SESAME 

Amman, Jordan 

http://www.sesame.org.jo/ 

ALBA IR beamline 

Barcelona, Spain 

http://www.cells.es/ 

Metrology Light Source 

Berlin, Germany 

http://www.ptb.de/mls/mls beamlines.html 

SSRF 

Shangai, China 

http://ssrf.sinap.ac.cn/english/ 


Other recent synchrotron FTIRM studies have 
investigated the structure of infectious prion proteins 
in scrapie, altered lipid and collagen content and struc¬ 
ture in heart disease, early detection of demyelination 
in an animal model of multiple sclerosis, the effects of 
drug treatment in wound healing, the differentiation of 
stem cells, and the nutritional value of cereal grain 
endosperm. 

In recent years, the demand for high spatial chemical 
imaging has grown rapidly. Most frequently, FTIR 
microscopes are equipped with single-pixel detectors 


of high responsivity, generally liquid nitrogen-cooled, 
broadband or narrow-band mercury cadmium telluride 
(MCT) detectors. However, the most recent develop¬ 
ment in IR detectors involves the coupling of two- 
dimensional arrays detectors to an FTIR microscope 
(Carr et al. 2007). Focal Plane Array (FPA) detectors 
consist of hundreds of infrared detector elements that 
enable spectroscopic imaging of large areas with high 
pixel resolution (5-20 pm), with significant time savings 
compared to conventional single-element IR detectors. 
Despite these advantages, FPA detectors typically have 
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Wavenumbers (cm-i) 

Infrared Spectroscopy using Synchrotron Radiation, 
Fig. 2 Infrared spectra recorded with a dual aperture of 6 x 6 
pm^, at 4 cm' resolution, coadding 128 scans with (a) the 
synchrotron source and (b) with the internal (Globar) source 
illustrating the improved S/N with the synchrotron IR source 



Infrared Spectroscopy using Synchrotron Radiation, 
Fig. 3 (a) Bright field-light microscope image of organic fluid 
inclusions (dark diagonal tracks indicated by arrows) in natural 
quartz, (b) Synchrotron FPA image of the aliphatic C-H absor¬ 
bance bands in the quartz crystal. The sample was illuminated 
with a low magnification objective (15X, 0.57 NA) for long 
working distance and collected with a high magnification 
(74X, 0.65 NA) objective. A 64 x 64 pixel MCT FPA detector 
was used to image the sample, where the resulting field of view 
on the FPA was 35 x 35 pm with 0.54 pm per pixel. The IR 
image (128 x 64 pixels) of the distribution of organics was 
generated in 16 min with a 64 x 64 pixel FPA detector (200 
scans, 6 cm' resolution). With a single-pixel detector, the same 
image would take more than 3 days to collect. Scale bar is 10 pm 



10 pm 


Infrared Spectroscopy using Synchrotron Radiation, 
Fig. 4 (a) Bright field light microscope image of individual 
human melanoma cells grown on a CaF 2 slide, fixed, and air 
dried, (b) Synchrotron FPA image (200 scans, 6 cm“' resolu¬ 
tion) of the asymmetric phosphate stretch at 1,240 cm“', 
representing the nucleic acid distribution in the cells. Scale bar 
is 10 pm (Adapted from Acerbo and Miller 2009) 


a lower signal-to-noise than conventional MCT detec¬ 
tors, and the spatial resolution can be compromised by 
pixel-to-pixel blurring. Improvements have been dem¬ 
onstrated recently, however, where FPA detectors have 
been coupled to synchrotron infrared beamlines, holding 
the promise of bringing this imaging approach below the 
diffraction limit with faster image acquisition times 
(Miller and Dumas 2006; Nasse et al. 201 1). 

Materials Under Extreme Conditions 

In geosciences, there is considerable interest in under¬ 
standing the material properties of Earth and planetary 
interiors. To simulate these environments, conditions 
of high pressure and temperature are required, which 
involve the use of diamond compression cells and laser 
heating. A broad spectral range, including x-rays and 
infrared light, are necessary to probe these systems. 
Synchrotron-based methods are ideal for combined IR 
and x-ray experiments performed within the confines 
of a diamond anvil cell (Hemley et al. 1996). An IR 
microscope with a long working distance objective has 
been used for near- to far-IR absorption and reflectivity 
measurements on samples under extreme conditions of 
pressure (to >200 GPa) and temperature (4 K < 
T< 1,000 K), as produced in a diamond anvil cell. 
Complementary studies utilizing x-ray absorption, 
x-ray diffraction, and Raman microscopy have also 
been demonstrated. Specific examples include high- 
pressure sfudies of fhe doming modes and dynamics 
of model heme compounds, anomalous Iransforma- 
tions in Ice VIII, effecls of femperafure and pressure 
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Infrared Spectroscopy using Synchrotron Radiation, 
Fig. 5 {Top) Visible image of a microcrack indicating the area 
imaged with the IR microscope. The fuchsin stain (magenta) 
highlights the microcrack. Data were collected within the crack 
and from the surrounding area. Panels A-F represent the 


corresponding IR images of the same area. The individual images 
illustrate the (A) carbonate/phosphate ratio, (B) carbonate/protein 
ratio, (C) acid phosphate/total phosphate ratio, (D) phosphate/ 
protein ratio, (E) crystallinity, and (F) collagen crosslinking. 
Scale bar is 25 pm (Adapted from Ruppel et al. 2006) 


on zinc dialkyl dithio phosphate (ZDDP), and the iden¬ 
tification of both solid and fluid nanoscale inclusions in 
microdiamonds. 

Condensed Matter Physics 

In condensed matter physics, time-resolved measure¬ 
ments over a broad IR spectral range are frequently 
used to determine the optical properties of materials. 
Taking advantage of the pulsed nature of the synchro¬ 
tron beam, the photoexcited state in superconducting 
metals and alloys can be probed via pump-probe spec¬ 
troscopy (Lobo et al. 2005). A pulsed laser is used 
to create nonequilibrium states and the infrared 
pulses from the synchrotron, to which the laser is 
synchronized, measures changes in the material optical 
properties. Both the time- and frequency-dependent 
photoinduced spectra of superconducting thin films 
can be measured in the far-lR regime. Systems that 
have been studied include Pb, Nb, NbN, Nbo sTio.sN, 
and Pbo. 75 Bio. 25 , a-MoGe, BaKBi 03 , and MgB 2 


superconducting thin films. Additional studies have 
focused on semiconductor systems such as manganese- 
doped III-V compounds. The measurement of magneto¬ 
optical properties has also been enhanced by the use of 
synchrotron-based IR spectroscopy, where materials are 
probed under high magnetic fields and low temperatures. 
Systems that have been studied include LaMn 03 , a well- 
known antiferromagnet and the parent compound of 
the so-called colossal magnetoresistance materials, the 
single molecular magnet Mn 12 -acetate, spin resonance 
on NaNi 02 , the investigation of correlated magnetic 
systems including LiCu 202 , and magneto-optical studies 
on superconductors, including carbon-doped MgB 2 . 

Surface Science 

Adsorption states on metallic surfaces have been stud¬ 
ied, especially in the far- IR region, to identify the 
adsorbate-substrate motion. Systems such as CO/Cu 
(100), CO/Cu(lll), CO/Cu(110), O2/Cu(100)CO/Pt 
(111), various organic molecules/Cu(110), C60 on 
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Infrared Spectroscopy using Synchrotron Radiation, 
Fig. 6 (a) Light microscope image of Alzheimer-diseased 
brain tissue, (b) Infrared spectra recorded inside and outside 
the plaque, showing the shoulder at 1,630 cm^*, assigned to 
P-amyloid. (c) Infrared spectra from inside the plaque. One of 


them displays additional absorption features, which are assigned 
to creatine, (d) Infrared image of amyloid plaque distribution, 
calculated as a peak height ratio of 1,630/1,655 cm“ *. (e) Chem¬ 
ical image of creatine distribution inside the plaque (Adapted 
from Gallant et al. 2006) 


Ag(lll), Ethylene on Cu(llO), NO/Cu(lll) K/Cu 
(111) have been studied. The most striking results 
were the appearance of anti-absorption bands, 
arising from frustrated rotational motion of the 
adsorbed molecules. Friction of the molecules was 
revealed and demonstrated a direct relationship with 
the D.C. resistivity (Dumas and Williams 2001). 

Cross-References 

► Fourier Transform Infrared Photoacoustic 
Spectroscopy (FTIR-PAS) 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Live-Cell Single-Molecule Imaging 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 
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Insulin-Degrading Enzyme - 
Computational Studies 

Tiziana Marino, Nino Russo and Marirosa Toscano 
Department of Chemistry, University of Calabria, 
Arcavacata di Rende (CS), Italy 

Synonyms 

Inverzincin 


Definition 

Human IDE is a Zn^'^containing protease involved in 
the clearance of insulin and amyloid-beta. 

Basic Characteristics 

Insulin-degrading enzyme («110-kDa), a metalloendo- 
peptidase located in cytosol, peroxisomes, endosomes, 
and on the cell surface, is able to accomplish its hydro¬ 
lase function on small proteins of diverse sequence (e.g., 
amyloid (I-protein (AP), insulin, glucagon, amylin, atrial 
natriuretic factor, and calcitonin) (Mirsky and Broth- 
Kahn 1949). IDE as a member of the inverzincin family, 
contains an inverted zinc-metalloprotease core motif 
(HXXEH) located within 200 residues of the N-terminal 
portion. 

This enzyme was first identified by its ability to 
degrade the A and B chains of the hormone insulin at 
the sites Leul3-Tyrl4 and Tyrl4-Glnl5 (A chain) 
and Ser9-Hisl0, HislO-Leull,Glul3-Alal4, Tyrl5- 
Leul6, and Phe25-Tyr26 (B chain) (Duckworth 1988), 
but more recently, its involvement in the catalytic cleav¬ 
age of amyloid-P(AP) peptide was also observed. In fact, 
it was demonstrated that enhanced IDE activity effec¬ 
tively reduces brain amyloid-P whereas loss-of-function 
mutations of IDE in rodents generate glucose intolerance 
and high amyloid-p peptide levels in the brain (Earris 
et al. 2003). 

Single crystal X-ray analysis showed that the active 
site containing the metallic center lies within 
a proteolytic cavity large enough to selectively bind 
and degrade substrates of different lengths (such as 
insulin (51 aa) and AP (39^3 aa)). Notwithstanding 
numerous studies devoted to elucidate different 
aspects of the IDE activity, the information concerning 
mechanistic details of the proteolysis performed by 
this enzyme is quite poor. 

Density functional theory (DET) based methods were 
used to explore the working mechanism at atomistic 
level followed by IDE in the cleavage of the peptide 
bond of a short peptide simulating a region of insulin 
B chain (CHaNH-Leu-Tyr-Leu-CONHCHs). The 
starting geometry of the cluster representing the active 
site of the enzyme was derived from X-ray structure 
deposited in Protein Data Bank (PDB code: 2 G54) 
(Shen et al. 2006). The cluster composed by 159 atoms 
reproduces the inner and outer coordination shell around 
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Insulin-Degrading Enzyme - Computational Studies, Fig. 1 Model for IDE active site 




TS2 


rotation 


Insulin-Degrading Enzyme 
- Computational Studies, 
Fig. 2 IDE investigated 
catalytic mechanism 
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Insulin-Degrading Enzyme - Computational Studies 


the zinc ion and includes the residues that are certainly 
involved in the catalytic cycle (Fig. 1). In particular, the 
zinc ion is linked to HislOS, Hisl 12, andGlul89, whereas 
Tyr831, Glulll, Thr220, Gly221, Glul82, and Arg824 
are included in the outer coordination sphere of the metal 
ion. In order to reduce the size of the model, every con¬ 
sidered residue is mimicked hy methyl-imidazole (histi¬ 
dine), acetate (glutamate), methyl-guanidine (arginine), 
and p-methylphenol (tyrosine). The B3LYP functional 

Protein environment (^4) 



Insulin-Degrading Enzyme - Computational Studies, 
Fig. 3 Potential energy profile obtained for the followed mech¬ 
anism including solvation effects 


associated to a differentiated basis set, (6-31 -i- G(d,p) 
and 6-31 G(d,p) to treat the first and second coordination 
shells of catalytic zinc ion respectively) was used (Amata 
et al. 2009). Vibrational frequencies’ calculation was 
carried out at the same level of theory of the geometries 
optimization in order to obtain the zero point energies 
(ZPEs) and to verify the nature of the various stationary 
points (minimum or transition state (TS)) present on the 
potential energy surface (PES). The polarization effects 
caused by the protein surrounding was taken into account 
by using the polarizable continuum model assuming that 
the protein environment around the active site is 
a homogenous polarizable medium having the dielectric 
constant (s) equal to 4 (Amata et al. 2009, 2011; Alberto 
etal. 2010). 

Starting from different mechanistic suggestions, pre¬ 
viously reported in the literature, a complete catalytic 
cycle was depicted and quantihed at a quantum chemical 
level (Eig. 2). The relative potential energy surface is 
reported in Eig. 3. Three fundamental steps are 
evidenced: first, the nucleophilic attack by a zinc-bound 
hydroxide on the substrate; second, a reorientation of the 
key Glul 11 residue to assume a position more suitable to 
act as acid, transferring the proton to the amide nitrogen 
of substrate, and, finally, the cleavage of the C-N 
peptide bond. 


Insulin-Degrading Enzyme 
- Computational Studies, 
Fig. 4 Optimized geometry 
of the first transition state, TSl 
(unimportant hydrogen atoms 
are not shown) 
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Calculations indicated the nucleophilic addition, 
occurring in the first transition state (TSl) with an 
expense of 15.6 kcal/mol in protein surrounding, was 
the rate-determining step of the reaction (see Fig. 3). 
TSl, whose optimized structure is depicted in Fig. 4, 
suggests a penta-coordinated zinc ion owing to the 
incipient formation of a bond with the nearest carbonyl 
oxygen of the substrate (Zn^"^ — O 2 = 2.225 A). 
The conhrmation of its hrst-order saddle point nature 
was given by the imaginary frequency associated to the 
stretching vibrational mode of the incoming Zn^"^ — O 2 
bond (120; cm”*). The characterization of this transi¬ 
tion state should be important in future design of new 
inhibitors for the IDE activity. 

The subsequent chemical steps along the path 
are fast and lead to a product more stable than the 
Michaelis-Menten complex (ES). 

The model used showed to be adequate enough 
since all the considered amino acid residues partici¬ 
pated to the catalytic process functioning as acid/base, 
or anchoring points for substrate and/or stabilization 
factors, as extensively supported by the experimental¬ 
ists’ community. 

Cross-References 

► Quantum Chemical Cluster Approach to Modeling 
Enzyme-Catalyzed Reactions 


References 

Alberto ME, Marino T, Russo N, Ramos MJ. Atomistic details of 
the catalytic mechanism of Fe(III)-Zn(II) purple acid phos¬ 
phatase. J Chem Theory Comput. 2010;6:2424-33. 

Amata O, Marino T, Russo N, Toscano M. Human insulin¬ 
degrading enzyme working mechanism. J Am Chem 
Soc. 2009;131:14804-11. 

Amata O, Marino T, Russo N, Toscano M. Catalytic activity of 
a ^-class zinc and cadmium containing carbonic anhydrase 
compared work mechanisms. Phys Chem Chem Phys. 
2011;13:3468-77. 

Duckworth WC. Insulin degradation: mechanisms, products, and 
significance. Endocrinol Rev. 1988;9:319^5. 

Farris W, Mansourian S, Chang Y, Lindsley L, Eckman EA, 
Frosch MP, Eckman CB, Tanzi RE, Selkoe DJ, Guenette S. 
Insulin-degrading enzyme regulates the levels of insulin, 
amyloid beta-protein, and the beta-amyloid precursor protein 
intracellular domain in vivo. Proc Natl Acad Sci USA. 
2003;100:4162-7. 

Mirsky LA, Broth-Kahn RH. The inactivation of insulin 
by tissue extracts. I. The distribution and properties of 


insulin inactivating extracts (insulinase). Arch Biochem. 
1949;20:1-9. 

Shen Y, Joachimiak A, Rosner MR, Tang WJ. Structure of 
human insulin-degrading enzyme reveal a new substrate 
recognition mechanism. Nature. 2006;443:870-4. 


Integral Membrane Protein Structure 

► Membrane Protein Structure 

► Membrane Proteins; Structure and Organization 


Intein 

► Protein Segmental Labeling 

► Protein Splicing - Computational Studies 


Interaction Interfaces 

► Mutagenic Scanning to Define Binding Flotspots 


Interactome Analysis 

► Protein-Protein Interactions - Interpreting Data 
from Large-Scale Screens 


Interface Engineering 

► Biofabrication 


Interferon a/p Receptor 

► Type I Interferon Receptor 









I 


1116 


Interspin Distance Determination by EPR 
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Richard Ward and Olav Schiemann 
Biomedical Sciences Research Complex, Centre of 
Magnetic Resonance, University of St Andrews, 

St Andrews, UK 

Synonyms 

Double electron-electron resonance (DEER); 
PELDOR and DEER; Pulsed electron-electron double 
resonance; Site-directed spin-labeling 

Definition 

PELDOR: A pulsed EPR technique that measures 
distances in the nanometer range between unpaired 
electrons. 

Introduction 

In order to understand how biomacromolecules, such 
as proteins and nucleic acids, function on the molecu¬ 
lar level, it is essential to obtain detailed structural 
information about them. Structural information on the 
nanometer scale can be obtained using EPR spectros¬ 
copy by measuring the dipolar coupling between 
unpaired electrons (Berliner et al. 2000). These may 
either be localized at metal ions or transient radical 
cofactors, or can be artificially introduced via site- 
directed spin labeling with nitroxides. The advantages 
of EPR are: 

• The large magnetic dipole moment of the electron 
enables long-range distance measurements of up to 
8 nm. 

• EPR has no size restriction with respect to the 
biomolecule. 

• EPR is rather sensitive, requiring usually only 5-10 
nanomoles of material. 

• EPR is tolerant of a great range of buffer 
conditions and allows measurements within 
membranes. 

This chapter will focus on distance measurements 
between nitroxides. 


Magnetic Dipole-Dipole Coupling and the 
Pake Pattern 

The interaction energy of the magnetic dipole-dipole 
coupling between two unpaired electron spins can be 
described as: 

E = — ( 1 ) 

where R and R are the distance and distance vector 
between the spins A and B, while Jig are the dipole 
moments of spins A and B. Converting this classical 
description into a quantum mechanical one yields the 
following relationship: 

(1 - 3cos^0) (2) 

where is the dipolar coupling frequency, D^ip is the 
splitting constant, R is the distance between the 
two spins, and 0 is the angle between the distance 

vector and the external magnetic field, see Pig. la. 

Rearranging (2) and assuming nitroxides (g = 2) as 
the spin centers lead to: 

^ ^ \lvdip{e ^ 90 °) 

Thus, (3) allows one to convert the measured dipo¬ 
lar coupling into the corresponding distance. For 
examples see Table 1. 

In CW EPR and for a fixed single spin pair or 
a single crystal of spin pairs, the dipole-dipole interac¬ 
tion leads to a splitting of the resonance line into 
a doublet centered about the resonance frequency of 
the observed spin. The magnitude of the splitting 
depends on the distance R and the angle 0. If the single 
spin pair or the single crystal is oriented in such a way 
that 6 = 0°, the doublet will be split by 2Ddip/R^, at 
0 = 54.7°, the two lines will coalesce and the dipolar 
frequency would be zero, and at 0 = 90°, the doublet 
will be split by DjjpIR^. For a frozen solution or 
a powder sample all orientations of 0 are present and 
the resulting spectrum is the so-called Pake pattern 
(Fig. lb). If the spin pair tumbles fast on the EPR 
timescale as in liquid solutions, the dipolar coupling 
is averaged to zero. That’s why measurements of the 
dipolar coupling are done under solid state conditions. 
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Vdip(0 = 90°) 


0 V 

Interspin Distance Determination by EPR, Fig. 1 (a) Sche¬ 
matic of the interacting spin pair A-B in a magnetic field Bq. (b) 
Fake pattern of a powder sample with 
Vd.p(e = 0°) = 2.Vdip(0=9O'‘) 


Interspin Distance Determination by EPR, 
Table 1 Distances converted to dipolar coupling frequencies 
for nitroxides 


Distance (nm) 

Dipolar coupling in MHz (gauss) 

1 52(18) 

2 

6.5 (2.3) 

3.7 

1 (0.35) 

8 

0.1 (0.036) 




If the distance between the two spins is very small 
(<1.5 nm) one might have to take into account the 
exchange coupling constant, especially in the case of 
conjugated bond systems. 
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Interspin Distance Determination by EPR, Fig. 2 Pulse 
sequences that have been used to determine the dipolar coupling 


frequency 


EPR Methods to Measure the Dipolar 
Coupling 

EPR spectroscopy offers a variety of methods to mea¬ 
sure the dipolar coupling between two unpaired elec¬ 
trons. Continuous wave (CW) methods are usually 
restricted to rather short distances whereas pulsed 
EPR experiments have been shown to reliably measure 
distances of up to 8 nm. 

Continuous-Wave EPR Experiments 

Precise interspin distances can be obtained from CW 
EPR spectra if the dipolar splitting is larger than the 
inhomogeneous linewidth of the radical centers. For 
nitroxide radicals, which have been greatly used in the 
investigation of biological systems, this linewidth is 
roughly 6-8 gauss in frozen solution. Thus, the CW 
EPR lineshape will only be broadened significantly at 


distances shorter than 2 nm. The dipolar coupling can 
be extracted from such CW EPR spectra by simulation 
or deconvolution techniques taking the lineshape of the 
singly spin-labeled biomolecule into account (Zhang 
et al. 2009). 

Pulsed EPR Experiments 

Figure 2 gives an overview over some of the pulsed 
EPR sequences used to measure distances (Schiemann 
and Prisner 2007). In a simple two-pulse Electron Spin 
Echo Envelope Experiment (ESEEM) the dipolar 
coupling frequency can be observed as an oscillation of 
the Hahn-echo intensity. However, hyperfine couplings 
to nearby nuclei usually obscure the dipolar frequency. 
More advanced but still single-frequency pulse sequences 
that suppress the hyperfine coupling contribution are 
“2-1-1,” SIFTER, double quantum coherence (DQC) 
EPR, and relaxation-induced dipolar modulation 
enhancement (RIDME). Out of these “2-1-1’’has still rather 
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strong hyperfine coupling contributions and RIDME 
relies on one of the spin centers having a much faster T i 
relaxation time than the other. SIFTER is based on the 
solid echo sequence known from NMR but has not been 
used often. DQC-EPR is based on a double quantum filter 
and is an elegant way to select the dipolar coupling; 
however, for radicals with large spectral width it is tech¬ 
nically rather demanding. Light-induced radical pairs 
provide additional means of measuring the dipolar cou¬ 
pling, for example, via the out-of-phase echo. 

An alternative strategy for removing unwanted hyper¬ 
fine interactions is used in Pulsed Electron-Electron 
Double Resonance (PELDOR or DEER). Here, two 
different microwave frequencies generated from two 
non-phase correlated microwave sources are used. Ini¬ 
tially a three-pulse experiment was established which 
was later on extended to the dead time free four-pulse 
PELDOR experiment, which allows a more accurate 
measurement of the dipolar coupling to be made. 

All the experiments above extract the dipolar cou¬ 
pling from oscillations in the corresponding time traces. 
Another possibility is to analyze changes in transverse or 
longitudinal relaxation times, for example, via 2-pulse 
ESEEM or inversion recovery experiments, respec¬ 
tively. If a non-dipolar coupled paramagnetic center 
relaxes with relaxation times Ti and T 2 then the change 
in these relaxation times in the presence of a second 
paramagnetic center can be used to calculate the dipolar 
coupling and thus the distance between the spin centers. 
These calculations require knowledge of the relaxation 
times in the non-dipolar coupled system under identical 
conditions, which may not always be easy to achieve. An 
advantage of these measurements is that they can be 
performed in the liquid state. In the following the focus 
of this chapter will be on PELDOR. 

PELDOR 

In the four-pulse PELDOR experiment (Jeschke 
2002), the detection sequence 7i/2—Xi— 71 —Ti — 
echo— 12 —71—T 2 — RE (see Fig. 2) is applied at 
a microwave frequency Va. This creates a refocused 
echo (RE) from the spins in resonance with Va (A 
spins). Introduction of the inversion pulse at fre¬ 
quency Vb during the time interval T flips spins, 
which are in resonance with Vb (B spins). 
Incrementing the inversion pulse from t = 0 to t = 
T leads to an oscillation of the amplitude V of the 
refocused echo in dependence of t if the spins A and 
B are coupled (Fig. 3a). The frequency of this 


oscillation is the dipolar coupling v^ip. Since 
PELDOR is not a single molecule experiment, usu¬ 
ally concentrations of about 100 pM are used, the 
PELDOR signal y(t) is the product of the interaction 
between two spins in one molecule y(t)intra and 
the interaction between spins on different molecules 

V(t)in,er (4). 

V(t) = V(t),,,,, X V(t),„,,,. (4) 

If the spins are randomly distributed in the sample, 
V(t)inter Can be written in the form of a monoexponential 
decay function which depends on the radical 
concentration. The intramolecular signal V(t)intra can be 
described by (5). 

^ £ n (1 - - cos(vdipt))) \ 

\ a =1 B =1 / 

B^A 

(5) 

Vdip is given by (2), n is the number of radicals per 
biomolecule, )^b is the fraction of B spins inverted by 
the pump pulse, t is the time delay of the pump pulse, 
and <.. .> the integration over all values of rAB and 0. 

Distance Extraction from PELDOR Time Traces 

The first step for the analysis of PELDOR data is to 
remove the contribution from the intermolecular spin- 
spin interactions. This is done by fitting the last part of 
the experimental time trace where all modulation is 
damped to a monoexponential decay function (see 
Fig. 3a). The original time trace is then divided by 
this function leaving the intramolecular contribution 
V(t)intra (Fig- 3b). Obviously, the fitting of the back¬ 
ground decay has to be done carefully and can lead to 
the introduction of artifacts, especially if modulations 
are not seen. Subsequently, the time trace can either be 
Fourier transformed and the distance calculated from 
the frequency read off at 0^ (Fig. 3c) or it can be 
directly transformed into a distance distribution using 
the regularization method from Tikhonov or maximum 
entropy methods. Programs that implement these 
approaches can be found on the websites of G. Jeschke 
and J. H. Freed. Since the extraction of the distance 
distribution from the time trace is an “ill-posed” math¬ 
ematical problem, obtaining spectra with good signal 
to noise and at least one full period of oscillation is 
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Interspin Distance 
Determination by EPR, 

Fig. 3 The PELDOR signal. 

(a) The green line is the 
convolute of intra- and 
intermolecular interaction as 
obtained in a PELDOR 
experiment. The red line is the 
fitted intermolecular 
contribution, (b) The wanted 
intramolecular signal obtained 
after dividing the original time 
trace {green line in a) by the 
intermolecular contribution 
(red line in a), (c) Pake pattern 
obtained after Fourier 
transformation of the signal in 

(b) (Schiemann et al. 2007) 





Frequency (v^d) 


required to avoid artifactual distance distributions. The 
need to resolve one full oscillation period for a precise 
distance determination requires that the time window 
T of the experiment is long enough to include a full 
oscillation, which in turn requires that the relaxation 
time Tm of the system is long enough. For example, to 
measure a distance of 80A, the longest distance 
measured so far, it was necessary to dissolve the 
bisnitroxide model system in a special matrix to 
make Tm long enough (Jeschke et al. 2004). Interest¬ 
ingly, a recent paper hy Ward et al. demonstrated that 
such long distances can even be accessed in aqueous 
buffer solutions and for large protein complexes such 
as the histone octamer, if the proteins and the buffer are 
fully deuterated (Ward et al. 2010). It should also be 
noted that the aforementioned programs assume full 
excitation of the Pake pattern. Thus for cases where 
specific orientations of 0 are selected, the PELDOR 
time trace has to be simulated (Denysenkov et al. 2006) 
or the Tikhonov regularization method extended 
(Marko et al. 2009). Analyzing the damping of the 
modulation has been shown to enable one to draw 
conclusions about molecular dynamics on the nanome¬ 
ter scale (Jeschke et al. 2010). This becomes even more 
powerful if done in combination with orientation selec¬ 
tive measurements (Margraf et al. 2007). 


In addition to the information about distance distribu¬ 
tions, the PELDOR signal also contains information on 
how many spins are coupled within one complex. This is 
encoded in the modulation depth V[ (Bode et al. 2007). 

Structure Generation Methods 

Having gathered the distance distributions and orien¬ 
tation information, the next step is to translate them 
into hiomolecular structures. In principle it should be 
possible to implement interspin distance data as 
restraints to evaluate a family of structural models for 
proteins or nucleic acids of unknown structure. The 
model starting structures can be obtained by homology 
modeling methods or by using a program such as 
Rosetta, which predicts de novo protein structure 
from the primary sequence. The use of interspin dis¬ 
tances obtained from nitroxide spin labels, however, is 
complicated by the flexible tether, which attaches the 
nitroxide to the protein. One group has shown that by 
assuming a motion-on-a-cone model for the spin label 
it is possible to determine a relationship between the 
C |3 atom of the amino acid labeled and the position of 
the unpaired electron (Alexander et al. 2008). They 
demonstrated that by using Rosetta and 25 interspin 
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Interspin Distance Determination by EPR, Fig. 4 Tandem 
POTRA domain’s orientation determined by PELDOR. (a) 
Comparison of average PELDOR-refined structures, red and 
magenta, with starting crystal structures, gray and black, (b) 


PELDOR time traces and extracted distance distributions for 
the three double spin-labeled POTRA samples measured 
(Reproduced with permission from Ward et al. 2009) 


distances, converted to Cu-Cu distances, they were able 
to determine a model structure of T4 lysozyme. In this 
way they achieved a root mean square deviation of lA 
resolution in the a-helical domain compared to its crystal 
stracture. An alternative approach is to attach the spin 
labels in silico to the model stracture and carry out 
molecular dynamics from which a large family of struc¬ 
tures with the label in different conformations can be 
obtained. By measuring the distances between the 
nitroxides for each of these conformations it is then 
possible to simulate a distance distribution for compari¬ 
son with the EPR data. This approach was employed to 
investigate the relative orientation of two tandem poly¬ 
peptide transport-associated (POTRA) domains for 


which different domain orientations had been found 
in X-ray structures (Ward et al. 2009). Three interspin 
distances were measured across the two POTRA 
domains, and each gave distance distributions that were 
consistent with the two domains being in a well-defined 
orientation. Interestingly, neither of the simulated dis¬ 
tance distributions obtained from two crystal structures 
agreed with the EPR data from the two POTRA domains 
in frozen solution. In order to determine the solution state 
orientation of the two domains, the crystal structures 
were refined using the EPR distance restraints (Pig. 4). 
This was achieved by generating a model stracture with 
a single “super residue,” which contained the coordinates 
of all the spin-label nitrogen positions from the initial 
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Interspin Distance Determination by EPR, Fig. 5 Wza pro- were attached. Example PELDOR time traces with distance 
tein studied by PELDOR. (a) Structure of the octameric Wza distributions (inset) for Wza labeled at (a) Q335 and (b) G58 
protein. The red spheres indicate where nitroxide spin labels (Hagelueken et al. 2009) 


molecular dynamics, for each label position. By 
performing further molecular dynamics, where both the 
super residue and the secondary structure of each domain 
was fixed, the two domains were allowed to move inde¬ 
pendently with respect to each other until the EPR dis¬ 
tance restraints were met. 

PELDOR-Based Distance Measurements in 
Biological Systems 

PELDOR measurements on biological samples are 
usually done in deuterated water and cryoprotectant 


(e.g., ethylene glycol or glycerol), both of which sig¬ 
nificantly slow the rate at which the Hahn echo decays 
thus providing better signal to noise data and permit¬ 
ting the measurement of longer distances. It has now 
been applied to peptides, soluble proteins, membrane 
proteins in membranes, and nucleic acids; examples 
for each will be given below. 

Herget et al. have demonstrated that the conforma¬ 
tion of spin-labeled peptides alters upon binding to the 
transport-associated antigen processing (TAP) protein. 
Based upon distances obtained from both CW and 
pulsed EPR this group was able to suggest the confor¬ 
mation of the bound state (Herget et al. 2011). 
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Interspin Distance Determination by EPR, Fig. 6 Spin- nitroxide spin labels, (b) PELDOR time traces illustrating the 
labeled tetrameric potassium ion channel KcsA studied by orientation selection in PELDOR time traces (Reproduced with 
PELDOR. (a) Structure of KcsA showing the positions of the permission from Endeward et al. 2009) 


For the protein visual arrestin, Hanson et al. dem¬ 
onstrated that its solution structure was not the same as 
in the crystal and that the tetramer is disrupted when 
bound to light-activated rhodopsin (Hanson et al. 
2007). Folding of proteins is another important area 
of structural biology. Dockter et al. were able to inves¬ 
tigate the refolding pathway of the integral membrane 
protein light-harvesting complex II with time-resolved 
PELDOR (Dockter et al. 2009). By monitoring two 
intramolecular distances with time they could show 
that two different domains of the protein refolded at 
different rates, which is confirmed by time-resolved 
FRET measurements. 

PELDOR has also been used to obtain structural details 
of membrane protein systems such as the octameric car¬ 
bohydrate exporter protein Wza (Hagelueken et al. 2009) 
(Fig. 5) and the tetrameric potassium ion channel KcsA 
(Endeward et al. 2009) (Eig. 6), either in detergent or 
membranes, respectively. 

In addition to these polypeptides also nucleic acids 
have been studied by PELDOR. For example, a 
PELDOR-based nanometer distance ruler for oligonu¬ 
cleotides has been developed (Schiemann et al. 2004) 
and Sicoli et al. could follow the conversion between 
an RNA hairpin-loop and duplex structure (Sicoli et al. 
2010 ). 

Summary 

EPR spectroscopy coupled with site-directed spin 
labeling with nitroxides allows for a very broad range 
of distance measurements in the nanometer range 


(1-8 nm). Out of these methods PELDOR is currently 
predominately used. Numerous studies have demon¬ 
strated the accuracy and applicability of this technique 
to biological systems. The ability to extract accurate 
nanometer-scale distances should allow the determina¬ 
tion of biomacromolecular structure and the elucida¬ 
tion of their interactions with other nanoscale objects. 

Cross-References 

► DEER of Metalloproteins 

► EPR Spectroscopy: General Principles 

► Spin Labeling of DNA and RNA 

► Spin-Labeling EPR of Proteins: Dynamics and 
Water Accessibility of Spin-Label Side Chains 
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Synonyms 

Hydrodynamics of macromolecules; Limiting viscos¬ 
ity parameter 

Definition 

The intrinsic viscosity as measured from a dilute 
solution of macromolecules contains information 
on the macromolecular shape, flexibility, and (for 
nonspherical particles) molar mass of macromole¬ 
cules. It is defined as the reduced specific viscosity in 
the limit of “infinite dilution” or zero concentration. 

Introduction 

Intrinsic viscosity measurement of macromolecular 
solutions has been described in some detail in an ear¬ 
lier publication in Progress in Biophysics and Molec¬ 
ular Biology (Harding 1997) although there have been 
some advances since then. The intrinsic viscosity is an 
intrinsic measure of the properties of a macromolecule 
in solution. It does not have the units of absolute 
viscosity (Poise or (SI units) Pa. s), but rather ml/g. 
It is measured by determining relative viscosities of 
solutions, where 
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Vr = Vho (1) 

The specific viscosity 

Vsp = - 1 (2) 

and the reduced viscosity (ml/g) or reduced specific 
viscosity is 

Vred = Vsp/c= iVr- ^)/c (3) 

where rj is the viscosity of the solution (or dispersion), 
is the viscosity of the solvent, and c is the (mass) 
concentration in g/ml. The c.g.s. system of units is pre¬ 
ferred, so the unit of reduced viscosity is ml/g. A related 
term is the inherent viscosity t^inh (or Ityl) defined by 

Vinh = (In Vr)/c (4) 

77j., rj, and rjo are zero shear viscosities. Where there is 
a dependence of the measured values on shear rate (for 
non-Newtonian fluids), an extrapolation to zero shear 
using appropriate equipment is required. 

Because of the effects of nonideality and/or asso¬ 
ciative phenomena, both Tyred and ryinh will be concen¬ 
tration dependent. The limit of 0 of both ryred and 
ryinh is defined as the intrinsic viscosity [rj], presumably 
so named because it is an intrinsic function of the 
dissolved/dispersed macromolecule: 

[?7] = limc^o(??,..d) = linic^o(?7.vp/c) (5) 

[t]] = limc^o?7mA = linic^o{(ln Vr)/c)} (6) 

Form of the Concentration Extrapolation 

The following equations have been given to describe 
the dependence of rjred and Iryl with concentration, 
correct to first order in concentration (i.e., dilute solu¬ 
tions). The most popular of these is the Huggins (1942) 
equation: 

??red = W(1 +^hH ■ C) (7) 

where is the (dimensionless) Huggins constant. 
is generally > 0 : that is, a plot of ? 7 red versus c usually 
has a positive slope. The equivalent concentration 
dependence relation to Eq. 7 for the inherent viscosity, 
(In Tyr)/c, is the Kraemer (1938) equation 


(ln? 7 r)/c= [t 7 ]( 1 -/<:kM- c) ( 8 ) 

with generally a negative slope and where A'k is the 
Kraemer constant. 

Solomon and Ciuta (1962) subsequently proposed 
a combination of Eqs. 7 and 8 to yield the approximate 
relation 

W - (1/c) ■ [2{Ty,p - ln(Ty,)]‘''^ (9) 

that is, if extrapolation to c = 0 of ryred or (In r],.)/c was 
not possible then to an approximation [ 77 ] could be 
evaluated at a single, finite concentration. 

Measurement of the Relative Viscosity and 
Intrinsic Viscosity 

In order to evaluate the intrinsic viscosity, the 
relative viscosity - using an appropriate viscometer - 
as well as the concentration c need to be determined. The 
latter is conventionally determined using either a spec¬ 
trophotometer for proteins or a refractometer for any 
macromolecule. Eor the former the extinction coefficient 
is required, for the latter the refractive increment dn/ 
dc. Tabulations of the latter are available but dn/dc 
values can not only vary between macromolecules 
(typically ^0.15-0.16 ml/g for polysaccharides and 
^0.18-0.19 ml/g for proteins, with glycoconjugates in 
between) but can also vary with solvent conditions. 
Relative viscosity is conventionally determined for 
Newtonian fluids using a U-tube or Ostwald viscometer 
in an accurately controlled water bath, controlled and 
measured to within at least 0.01°C. 

The relative viscosity can be determined from: 

(t/to) ■ (P/Po) ( 10 ) 

where t and t^, are the flow times for the biomolecular 
solution (at a particular concentration c) and solvent 
respectively p and are the corresponding solution 
and solvent densities. The relative viscosity without 
the density correction is known as the “kinematic” (as 
opposed to “dynamic”) relative viscosity 77 'fei = tito, 
subsequent derived parameters: 77 '^p, ?7'red> ln(^^ei)/c, 
and [ 77 ]' are the corresponding kinematic quantities. To 
a reasonable approximation, for concentrations < 1 mg/ 
ml ^rei Ptel- Although measurements at such low 
concentrations are possible with many solutions of, for 
example, polysaccharides and mucus glycoproteins, 
which have large relative viscosities, for globular 
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proteins and globular macromolecular assemblies (even 
large spheroidal plant viruses) this is not generally pos¬ 
sible since the relative viscosities are too small 1.003 
or less). However, in this case it is not necessary to 
measure solution density at each concentration since 
the correction of Tanford (1955) can be applied: 

W = W'+ [(1 - VPo)/Po] (11) 

or 

^red = <ed + [(1 “ ^Po)IPo] (12) 

where v is the partial specific volume of the macro¬ 
molecule. Of course, if this latter parameter is not 
known for the solvent conditions being used, or cannot 
be calculated from the chemical composition of the 
macromolecule (Perkins 1986), then solution density 
measurements are required; 

v= (1/Po) • (1(13) 

Po and p can be measured using a mechanical oscillator 
device as described by Kratky et al. (1973). A new 
generation of density meter such as the Anton-Paar 
(Graz, Austria) DMA4500 now makes measurement 
of Po and p relatively easy. 

There are two ways the precision of p, measurement 
can be increased, particularly for measurements at low 
concentration, both based on increasing the flow time 
(and hence flow-time difference t — fo) for solvent and 
solution. The first is the method of Szuchet-Derechin 
and Johnson (1966), which is to add a low concentra¬ 
tion of glycerol (^3%) to the solvent and solution: this 
has permitted the measurement of protein relative vis¬ 
cosities at concentrations <4 mg/ml. The second way 
is to use specially designed extended Ostwald viscom¬ 
eters (Fig. 1) (Holt and Creeth 1972), which increase 
the flow time difference (r — G) by extending the 
length of the capillary (the same result can, in princi¬ 
ple, be obtained by decreasing the capillary radius but 
this increases problems of capillary blockage). 

Another type of more sensitive viscometer is now 
becoming very popular - this is the Differential Pressure 
Viscometer (DPV) or “Ffiessure Imbalance” viscometer, 
which requires less volume (typically 0.1 ml compared 
with 2 ml): it involves capillary flow, but by a special 
design resembling a Wheatsone Bridge in electrical cir¬ 
cuits, relative viscosity is measured directly from the 



Intrinsic Viscosity, Fig. 1 Extended Ostwald viscometer. This 
has a very long capillary to maximize the flow time difference 
between solution and solvent 

pressure difference between solution flowing through 
one capillary and solvent going through another. These 
viscometers can be coupled online to a size exclusion 
chromatography system upstream and a concentration 
detector (uv/vis detector or refractometer) downstream, 
and with correct calibration if the time delay relative 
viscosities can be instantaneously converted to reduced 
viscosities. Also, since these viscometers are more sen¬ 
sitive than conventional flow-time viscometers, concen¬ 
trations sufficiently low that Ty^d [p] can be employed 
in many cases (if not the Solomon-Ciuta relation can be 
employed), permitting the measurement of intrinsic vis¬ 
cosities of components in a mixture. It is usual that 
a multi-angle light scattering (MALs) detector is also 
included between the columns and the DPV so that 
molecular weights can be measured simultaneously 
with [ry]. The DPV capillaries can be more sensitive to 
blockages than conventional Ostwald viscometers - 
a narrow piece of tubing incorporated between the 
MALLs and viscometer detectors helps to prevent such 
problems. Great care has to be expressed that samples 
are fully solubilized. This can be difficult for many 




I 


1126 


Intrinsic Viscosity 


polysaccharides and glycoconjugates, and in a series of 
papers Ross-Murphy, Harding, and coworkers have dem¬ 
onstrated the use of high-pressure methods (see Morris 
et al. 2008 and references therein). Figure 2 gives some 
examples of DPV determinations of intrinsic viscosity 
for two proteins (fibrinogen and immunoglobulin IgG3) 
and a mixture of polysaccharides (all pullulans). 

Interpretation 

This depends on the type of macromolecule. The depen¬ 
dence of [rj] on molecular weight for general conforma¬ 
tion types is conveniently represented by the Mark- 
Houwink-Kuhn-Sakurada or “MHKS” relation: 

[q] = K'M^ (14) 

with the limits of the MHKS coefficient a = 0 for spher¬ 
ical particles, 1.8 for rigid rods, and 0.5-0.8 for randomly 
coiled particles, depending on the extent of intramolec¬ 
ular exclusion effects and also possible contributions of 
solvent draining effects (see, e.g.. Berry 2010), although 
the latter are generally small compared with the strength 
of the hydrodynamic interactions between different seg¬ 
ments of a macromolecule (Tanford 1961). These effects 
are considered in more detail in the contribution 
by G. Pavlov (► Hydrodynamics of Macromolecules: 
Conformation Zoning for General Macromolecules). 
The pre-exponent factor K' also depends on the 
conformation as well as the solvent conditions. For 
linear, flexible molecules, more detailed relations are 
available such as the Bohdanecky (1973) or “Bushin- 
Bohdanecky” plot of versus which 

facilitates the measurement of the persistence length 
Lc - a measure of flexibility (practical limits are 
^2 nm for a random coil and ^200 nm for a very stiff 
rod) if the mass per unit length Mj^ is known from, for 
example, the primary structure of the polymer or from 
atomic force microscopy or electron microscopy (Stokke 
and Elgsaeter 1991). Similar representations are due to 
Burchard, Stockmayer, and Fixman, and also Hearst. 
If the mass per unit length is not known then the global 
or HYDFIT method involving combining the intrinsic 
viscosity with sedimentation and light scattering 
measurements is useful for estimating both Mi^ and Lp 
(► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules; Patel et al. 
2008). 


For quasi-rigid types of macromolecule it is possi¬ 
ble to probe the average shape of the molecule using 
whole-body ellipsoid or hydrodynamic bead modeling. 
The appropriate relation is 

[ 77 ] = V • Us (15) 

where v is a molecular shape parameter known as 
the Einstein-Simha (or Simha-Saito) viscosity incre¬ 
ment and Vs (ml/g) is the swollen specific volume: an 
anhydrous macromolecule will essentially expand when 
suspended or dissolved in solution, and Vs (= V • NJM) 
where V is the swollen volume (ml), M the molecular 
weight or molar mass (Da or g/mol), and Na is 
Avogadro’s number, is a measure of the (aqueous) 
solvent dynamically associated with the macromolecule 
on a time-averaged basis, and is defined as the volume 
of the macromolecule in solution per unit anhydrous 
mass of macromolecule. Vs can be related to a popular 
term called the “hydration” 8 , by the relation 

Vs = v + S/p^ (16) 

V is referred to as a “Universal shape function” since, 
unlike [q], it can be expressed as a function of shape of 
a particle independent of size or volume (Harding - 
ELLIPS and COVOL). Eor example, for rigid imper¬ 
meable spheres, it has the Einstein value for spheres of 
2.5, irrespective of size or volume. Eor prolate or oblate 
ellipsoids, exact and equivalent expressions have been 
given by Simha (1940) and Saito (1951) using different 
approaches. Eor general triaxial ellipsoids, exact and 
equivalent expressions have also been given (see Har¬ 
ding (1997) and references cited therein). The general 
ellipsoid is in fact the most sophisticated shape for 
which V has been specified exactly (► ELLIPS 
and COVOL). However, to a good approximation v 
(and hence [77]) can be calculated for bead models 
(► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules, ► US-SOMO: 
Methods for Construction and Hydration of Macromo- 
lecular Hydrodynamic Models). The more sophisticated 
the model becomes the more degenerate it becomes 
(i.e., the existence of many other models which also 
satisfy the measured data) - in which case intrinsic 
viscosity information must be combined with other 
hydrodynamic data (Serdyuk et al. 2007) such as 
from analytical ultracentrifugation, small angle x-ray scat¬ 
tering, fluorescence depolarization, and/or electro-optic 
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Intrinsic Viscosity, Fig. 2 Simultaneous [r|] and M„ measurement using a differential black line - light scattering profile (at a scattering angle of 90°). (b) Antibody IgG3 hinge 

pressure viscometer online with SEC-columns and a multiangle laser light scattering mutant in the presence of aggregate, data obtained with the Malvern Viscotek system 

(MALLS) detector upstream and a concentration detector (refractometer) downstream. (Longman et al. 2006). Blue line - viscometer profile; green line - concentration detector 

(a) Fibrinogen, data obtained with the Wyatt Viscostar system (Cardinal! et al. 2010). (refractometer) profile; red line - light scattering profile (at a scattering angle of 90°). 

Green line - viscometer profile; red line - concentration detector (refractometer) profile; (c) A mixture of four fractions of pullulans, analyzed with the Viscostar. Line colors - as (a) 
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measurements, and, if possible, stmctural information 
from x-ray crystallography or nuclear magnetic resonance 
(► Crystallohydrodynamics of IgG; ► Dynamic Light 
Scattering; ► Electro-optics and Macromolecular Hydro¬ 
dynamics; ► ELLIPS and COVOL; ► HYDRO Suite of 
Computer Programs for Solution Properties of Rigid Mac¬ 
romolecules; ► Hydrodynamic Modeling of Carbohydrate 
Polymers; ► Neutron Scattering of Membranes; ► US- 
SOMO: Methods for Constmction and Hydration of Mac¬ 
romolecular Hydrodynamic Models; ► X-Ray Scattering 
of Lipid Membranes). 

Perspectives 

Intrinsic viscosity is one of the most sensitive hydro- 
dynamic parameters to conformation. Eor Newtonian 
solutions (where the shear stress is directly propor¬ 
tional to shear rate) - for example, for solutions of 
globular proteins and also some polysaccharide solu¬ 
tions - conventional capillary viscometers can be used. 
If not an appropriate viscometer allowing extrapola¬ 
tion of the measured apparent viscosity or relative 
viscosity to zero shear is necessary (e.g., rolling ball 
viscometer with adjustable angle of fall or a cone and 
plate based rheometer). For polydisperse systems, 
where resolution of components is necessary (e.g., 
a protein in the presence of aggregates or dissociation 
products), differential pressure viscometers coupled 
downstream from SEC-columns and upstream from 
a concentration detector are useful. 

The intrinsic viscosity when used by itself can give 
some simple information about macromolecular con¬ 
formation, but is best used in conjunction with other 
measurements (from, e.g., analytical ultracentrifuga¬ 
tion, x-ray scattering, or light scattering). 

Cross-References 

► Crystallohydrodynamics of IgC 

► Dynamic Light Scattering 

► Electro-optics and Macromolecular Hydrodynamics 

► ELLIPS and COVOL 

► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules 

► Hydrodynamic Modeling of Carbohydrate Polymers 

► Hydrodynamics of Macromolecules: Conformation 
Zoning for General Macromolecules 


► Neutron Scattering of Membranes 

► US-SOMO: Methods for Construction and 
Hydration of Macromolecular Hydrodynamic 
Models 

► X-Ray Scattering of Lipid Membranes 
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G-Protein-Coupled Receptors: Recent 
Advances with Optical Biosensors 
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Science Center, San Antonio, TX, USA 

Introduction 

Given that ion channels underlie almost all the electri¬ 
cal activities of biological organisms, it is unsurprising 
that multiple mechanisms of ion channel regulation 
have evolved as a potent method of altering physiolog¬ 
ical states, from the cellular level to the behavior of 
the whole individual. A ubiquitous motif involves 
G-protein signaling pathways, in which stimulation 
of receptors by their cognate neurotransmitter or hor¬ 
mone results in alteration in ion channel activity, medi¬ 
ated by various intracellular signaling cascades. In 
some cases, the G-protein action is direct, in which 
active G-protein subunits directly interact with the 
channel to affect gating; however, more often is when 
intracellular 2nd-messengers are used to mediate the 
action. The use of optical biosensors is generally 
concerned with the latter motif, since it is awkward to 
design convenient optical sensors of direct G-protein/ 
target interactions. Usually, an optical sensor is made 
to detect the generation, depletion, or subcellular local¬ 
ization of the particular 2nd-messenger molecule 
involved in the signaling pathway, using ► live-cell 
imaging. A common strategy involves exploiting the 
binding motif of a protein that naturally interacts with 
the 2nd-messenger in question, and fusing that motif to 
some fluorescent protein(s) whose light can be easily 
detected in an appropriate microscope. In this review, 
we will focus on several types of optical sensors, and 
illustrate their mode of action in monitoring G-protein 
modulation of two different ion channels. Although 
intracellular Ca^"^ has a long history as a ubiquitous 
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2 nd-messenger that modulates a plethora of ion chan¬ 
nels, the literature concerning its optical monitoring is 
exceptionally rich, and will not be covered here. We 
will focus on two main G-protein signaling pathways, 
mediated by Gq/i i and Gs, and the optical probes that 
have been recently developed to track the molecules 
produced or consumed during those signaling cascades. 

Optical Reporters of Phosphoinositides and 
Other 2nd-Messengers of Gq/ii Signaling 

Over the past 15 years, a vast array of ion channels and 
transporters have been shown to be regulated by the 
abundance of the lipid phosphatidylinositol 4,5- 
bisphosphate (PIP2) in the plasma membrane (PM) 
(Gamper and Shapiro 2007). In the context of PIP 2 
regulation, we will refer here to “channels,” although 
transporters are similarly affected. In most cases, the 
presence of PIP2, presumed to directly bind to the 
channel proteins stabilizes the activity of the channels, 
allowing them to be activated by their physiological 
stimulus, such as voltage, heat/cold, G proteins, 
etc. Thus, PIP2 serves as a kind of “inverse” 2nd- 
messenger, with its unbinding constituting the signal, 
rather than binding. A common motif is the stimulation 
of receptors coupled to the Gq/i 1 family of G proteins, 
activation of phospholipase C (PLC), and hydrolysis of 
PIP2 into cytosolic inositol 1,4,5-trisphosphate (IP3) 
and membrane-bound diacylglycerol (DAG). For 
many channels, PIP 2 unbinding can occur by two 
mechanisms: either by depletion of PIP2 abundance 
from its consumption by PLC activity, or by reduction 
in the affinity of PIP 2 for the channels by other signals, 
such as Ca^Vcalmodulin, protein kinases A&C, pro¬ 
tons, Na"^ ions, G proteins, arachidonic acid, heat/cold, 
stretch, and others (Logothetis et al. 2007). Thus, the 
ability to optically monitor the abundance and subcel- 
lular localization of phosphoinositides such as PIP 2 in 
individual living cells has been a powerful tool in the 
field. 

The PLC8-PH probe of PIP 2 hydrolysis. The first 
PIP 2 probe developed consisted of the “pleckstrin 
homology” (PH) domain of the 5 isoform of PLC 
(Stauffer et al. 1998; Vamai and Balia 1998). This 
PLC5-PH probe binds with strong affinity to PIP 2 
molecules, but with over tenfold stronger affinity to 
IP 3 (Hirose et al. 1999). Figure la graphically shows 
how this probe functions. In resting cells, in whose 


cytoplasm [IP 3 ] is very low, the probe localizes to the 
PM, bound to PIP 2 . However, when PIP 2 hydrolysis 
occurs, the probe translocates to the IP 3 molecules 
accumulating in the cytoplasm, which can be optically 
monitored if PLC 8 -PH is fused to an appropriate fluo¬ 
rescent protein (FP) such as GFP. Besides use as 
a translocation reporter, another variant is to fuse the 
probe to two FPs that exhibit Forster resonance energy 
transfer (FRET), such as cyan and yellow FPs (CFP 
and YFP) (van der Wal et al. 2001). In this configura¬ 
tion, there will be robust FRET between the CEP and 
YEP in resting cells, since the probe molecules are 
crowded together in the PM, but ERET will sharply 
drop upon PIP 2 hydrolysis, since the CEP- and YEP- 
tagged PLC5-PH molecules bound to IP 3 in the cyto¬ 
plasm will now be much further apart. Eigure lb shows 
an example of the former approach in a sympathetic 
neuron cultured from rat superior cervical ganglia 
(SCG). These neurons express at least four well- 
characterized Gq/ 11 -coupled receptors, including the 
Ml muscarinic acetylcholine, B 2 bradykinin, P2Y 
purinergic, and ATi angiotensin II (Hernandez et al. 
2008). Shown are confocal micrographs of an SCG 
neuron transfected with the GEP-PLC5-PH probe 
using biolistic gene delivery. In control, the probe is 
localized to the PM, but stimulation of Mi, B 2 , or P2Y 
receptors with their agonists oxotremorine methiodide 
(oxo-M), bradykinin (BK), or uridine triphosphate 
(UTP) all causes a robust and reversible translocation 
of the probe from PM to the cytoplasm. Data from 
a group of such experiments (Pig. Ic) shows stimula¬ 
tion of four Gq/ 11 -coupled receptors in SCG neurons to 
induce PIP 2 hydrolysis, with BK and oxo-M being 
stronger than UTP or angiotensin 11. Shown in 
Fig. Id are traces from an SCG neuron of M current, 
a conductance that plays a dominant role in neuro¬ 
nal excitability. M currents are so named for their 
depression by muscarinic stimulation via activation 
of PLC and depletion of PIP 2 (Suh and Hille 2007). 
In addition, M current is also depressed by almost 
every other Gq/n-coupled receptor present in 
a neuron. Consistent with the obvious translocation of 
the PLC5-PH probe, both oxo-M and BK induced 
profound depression of M current (Fig. Id); however, 
the intracellular mechanism is distinct, which speaks to 
the precise event being reported by this probe (see 
below). 

Characteristics and Issues. It is clear that PLC5-PH 
binds to, localizes, and sequesters PIP 2 and that its 
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Ion Channel Regulation by G-Protein-Coupled Receptors: 
Recent Advances with Optical Biosensors, Fig. 1 The 

PLCd-PH probe reports PIP 2 hydrolysis upon stimulation of 
multiple Gii/ij-coupled receptors, (a) Schematic depiction of 
movement of the PLCi5-PH probe before {left panel) and after 
(right panel) activation of PLC via stimulation of Gq/i [-coupled 
receptors, (b) Confocal images from a SCO neuron sequentially 
stimulated by UTP, bradykinin (BK), and oxotremorine (Oxo- 
M). In the unstimulated cell (control), the probe was concen¬ 
trated mainly in the membrane, bound to PIP 2 . Upon stimulation 
by agonist, activation of PLC hydrolyzes PIP 2 and the probe 
translocates to the cytoplasm, bound to IP 3 . (c) Summarized data 


of the average fluorescence intensity, F, in a cytoplasmic region, 
normalized to the average intensity over 30 s before agonist 
application, F^, quantified as the percentage increase in F (AFI 
Ff} induced by the four agonists, (d) Example of M current 
regulation under whole-cell clamp before (a, c) and after (h, d) 
application of either bradykinin or oxo-M, and after complete 
blockade after application of linopirdine (e). GPCR G-protein- 
coupled receptor, PLC phospholipase C, DAG diacylglycerol, 
BK or hrady bradykinin, Oxo-M or 0 x 0 oxotremorine-M, LP 
linopirdine (G. P. Tolstykh, O. Zaika and M.S. Shapiro, 
unpublished observations; adapted from Hernandez et al. 
(2008)) 


translocation faithfully reports PIP 2 hydrolysis. The 
first uses of the prohe exploited all of these actions. 
Thus, PLC5-PH-GFP expressed in mammalian tissue- 
culture cells at rest localized to the plasma 


membrane, translocated to the cytoplasm upon 
stimulation of Gq/n-coupled receptors and lowered 
the abundance of free PIP 2 molecules available for 
cellular processes (Stauffer et al. 1998; Varnai and 
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Balia 1998; Raucher et al. 2000). Such experiments 
have proved invaluable in determining the role of PIP 2 
in regulation of ion channels. Translocation of PLC§- 
PH was shown to correlate with receptor-mediated 
inhibition of M current (Winks et al. 2005), voltage¬ 
gated Ca^"^ channels (VGCCs) (Gamper et al. 2004), 
Kir channels (Kobrinsky et al. 2000), Trp sensory 
channels (Runnels et al. 2002), and others. Since 
PLC5-PH-bound PIP 2 molecules are widely assumed 
to be neither susceptible to hydrolysis by PLC, nor able 
to interact with PIP 2 -sensitive transport proteins 
(Nahorski et al. 2003; Szentpetery et al. 2009), the 
probe acts as a “buffer” of PIP 2 abundance. Thus, 
expression of PLC5-PH in tissue-culture cells or neu¬ 
rons reduced IPs-mediated [Ca^"^]! rises, lowered tonic 
and Ca^'^-current amplitudes, and slowed depres¬ 
sion of currents by PLC-linked muscarinic receptors 
(Lei et al. 2001; Gamper et al. 2004; Jensen et al. 2009; 
Szentpetery et al. 2009). 

A critical issue with this probe concerns whether its 
translocation from membrane to cytoplasm reports PIP 2 
depletion in the PM, physiological accumulation of 
cytoplasmic IP 3 , or both (Hirose et al. 1999; van der 
Wal et al. 2001). For example, investigators have used 
the PLC5-PH probe as a reporter of PIP 2 depletion via 
PIP 2 dephosphorylation by sudden activation of a PIP 2 
phosphatase (Suh et al. 2006; Vamai et al. 2006; Jensen 
et al. 2009), which does not result in IP 3 production, but 
yet causes probe translocation. To answer such ques¬ 
tions, biophysically oriented cellular models incorporat¬ 
ing the steps of PIP 2 metabolism and PLC5-PH 
interactions have been constructed, using measured 
affinities of the probe for PIP 2 and IP 3 , known rates of 
PLC hydrolysis, diffusion constants of the molecules 
involved, and other parameters (Nash et al. 2001; Xu 
et al. 2003; Brown et al. 2008). These studies indicate 
that the event reported by PLC5-PH translocation likely 
depends on the numbers of probe molecules expressed 
in the cell, with weaker expression perturbing the 
PLC/PIP 2 system less, whereas the stronger expression 
typically used by investigators inducing greater arti¬ 
facts, for example, from the buffering effect noted 
above. Indeed, biophysical analysis using quantified 
parameters is strongly recommended in the interpreta¬ 
tion of such data. Finally, another probe based on the 
PIP 2 -binding domain of the tubby transcription factor 
has been used in an attempt to more faithfully track 
changing abundance of PIP 2 in the PM, since 
the tubby probe does not bind appreciably to IP 3 


(Quinn et al. 2008). Although the probe is useful in 
studying PIP 2 -mediated channel modulation, it appears 
to have several technical issues that reduce its efficacy 
(Szenfpetery et al. 2009). 

Other lipid-sensing probes used in channel modu¬ 
lation studies. Other PLC signaling probes focus on 
the production of DAG (Oancea et al. 1998) or IP 3 
(Matsu-ura et al. 2006) to monitor PIP 2 hydrolysis, 
likely activation of PKC or stimulation of IP 3 recep¬ 
tors. The former has been used in the study of 
M channels (Horowitz et al. 2005) and Trp channels 
(Delmas and Brown 2002). Finally, some channels are 
regulated by other phosphoinositides besides PIP 2 , 
such as phosphatidylinositol 3,4-bisphosphate and 
phosphatidylinositol 3,4,5-trisphosphate, which have 
been shown to regulate epithelial Na"^ (Pochynyuk 
et al. 2008) and Trp (Bezzerides et al. 2004) channels. 
For those channels, optical probes for those lipids have 
proved invaluable, such as those based on the PH 
domain of Akt kinase (Haugh et al. 2000), which is 
physiologically regulated by PI 3-kinase activity. This 
paradigm has been expanded to include sensors of the 
activity of monomeric GTPases, such as Ras, Rho Arf, 
and Rah (Heo et al. 2006), which have been shown 
involved in the regulation of multiple channels. 
Finally, visualization of PI(3)P, PI(4)P, and PI(5)P 
has been accomplished by careful choice of domains 
from proteins dedicated to specific recognition of, or 
subcellular targeting to, such singly-phosphorylated 
phosphoinositides (Balia 2007). 

Optical Reporters of cAMP and Protein 
Kinase A (PKA) 

After Ca^"^, cAMP was the hrst 2nd-messenger identi- 
hed, described as the signal linking adrenergic stimula¬ 
tion to intracellular actions, via P-adrenergic receptors 
(P-ARs), Gs and protein kinase A (PKA). Since then, 
cAMP/PKA has been shown to regulate many different 
channels, including VGCCs in the cardiovascular and 
nervous systems, CFTR CT channels in airway epithelia, 
and a variety of K"^ channels involved in cardiac pace¬ 
making, neuronal computation, and arterial tone. cAMP 
also directly regulates the activity of channels critical to 
vision, olfaction, and heart function, via binding to so- 
called cyclic nucleotide-gated channels, HCN K"^ chan¬ 
nels, and the channels responsible for currents called I^ 
or Iq. Thus, an optical probe of cAMP production would 
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Ion Channel Regulation by G-Protein-Coupled Receptors: 
Recent Advances with Optical Biosensors, Fig. 2 Emission 
of the Epacl-camps biosensor from a ventricular myocytes, and 
its concentration dependence versus isoproterenol, (a) Expres¬ 
sion pattern of CFP fluorescence from a guinea pig ventricular 


myocytes transduced with Epac2-camps using adenovirus, (b) 
Time course of the relative change in CFP:YFP emission inten¬ 
sity ratio (AR/Ro) produced by Epac2-camps during exposure of 
a ventricular myocytes to increasing concentrations of isopro¬ 
terenol (Iso) (Adapted from lancu et al. (2008)) 


be highly desirable. The first probes observed FRET 
between fluorescently tagged regulatory and catalytic 
subunits of PKA (Adams et al. 1991; Zaccolo et al. 
2000 ); however, those probes have suffered from several 
issues. Recently, FRET probes have been developed 
(DiPilato et al. 2004; Nikolaev et al. 2(X)4; van der 
Krogt et al. 2008) based on cAMP binding to the guanine 
exchange factor, Epac. These new probes have permitted 
the biophysical analysis of intracellular cAMP dynamics 
and observation of PKA microdomains and associated 
signaling complexes. Such probes have helped define 
such cAMP microdomains involved in modulation of 
“L-type” Ca^"^ channels in the heart (Warrier et al. 
2007; lancu et al. 2008). An example of this approach 
is illustrated in Fig. 2, in which guinea pig ventricular 
myocytes are infected with adenovirus expressing the 
cAMP biosensor, Epac2-camps (lancu et al. 2008). The 
ratio of CFP:YFP emission from excitation by 436 nm 
light reports intracellular cAMP concentration ([cAMP],). 


Shown in Fig. 2a are images of CFP emission (left) and the 
CFP;YFP ratio (right) before and after bath-application of 
isoproterenol, at a concentration which maximally stimu¬ 
lates the P-ARs of the cell. The probe reports tonic 
[cAMP]i to be very low in control, and then to rise strongly 
upon P-AR stimulation. The time course of probe emission 
in response to a range of isoproterenol concentrations is 
shown in Fig. 2b. Those responses display a useful 
dynamic range, allowing the investigator to quantifiably 
observe changes in [cAMP]; in real time, using ► live-cell 
imaging. 

Characteristics and Issues. As mentioned above, the 
probes based on FRET between the regulatory and cat¬ 
alytic fragments of PKA suffered from a kinetic lag from 
[cAMP];, and a restricted dynamic range, whereas the 
newer probes display a more faithful response. The 
probes based on Epac exploit a conformational change 
that occurs upon cAMP binding, which alters the FRET 
between donor and acceptor parts of the molecule 
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(typically reducing FRET when [cAMP] rises). Since the 
probe is genetically encoded, it can be intracellularly 
targeted to report [cAMP] at various organelles (DiPilato 
et al. 2004). It should be kept in mind that the probe 
reports changes in [cAMP], not the activation of PKA, 
which is considerably slower; thus, the diffusion of 
[cAMP] in the cytoplasm has been found to be much 
faster than the spatial spread of active PKA. Finally, the 
Epac-based sensors can be composed of a variety of 
Epac cAMP-binding domains from Epacl and Epac2 
with subtly distinct characteristics (Nikolaev et al. 
2004), allowing the investigator to tweak the experiment 
using the precise probe that is in accord with the goals of 
the experiment. There is also a robust choice in donor 
and acceptor FPs that work well in [cAMP] FRET 
probes (van der Krogt et al. 2008). For sure, the CEP 
and YEP pair enjoys robust versatility, as most labs are 
already equipped with the appropriate optics; however, 
substitution of YEP for Venus (which uses the same 
optics) has been found to reduce ultraviolet and pH 
artifacts and to increase the FRET range. Surprisingly, 
another useful pair is CEP and tdTomato (given the 
wider separation of their wavelengths), although the 
precise constract to use is critical. Although the full- 
length CFP-linker-tdTomato is reported to exhibit 
reduced FRET range and to have maturation problems 
of the tdTomato, the CFP-Alinker-tdTomato version 
demonstrates far superior characteristics, and is currently 
successfully in use in the author’s laboratory. Thus, 
investigators are advised to perform careful pilot exper¬ 
iments in their particular system before committing to 
a long-range research plan. 

Summary 

In summary, newly developed optical reporters of 
intracellular 2nd-messengers have revolutionized the 
study of channel modulation by G-protein signaling 
pathways. By allowing real-time fluorescent reporting 
of the creation, depletion, or intracellular movement of 
critical signaling molecules in individual living cells, 
investigators can elucidate the precise mechanisms, 
rates, and specificities of signaling cascades. Accom¬ 
panying their great potential, however, are caveats 
involving recognizing exactly which signaling mole¬ 
cule is being detected, knowing technically how to 
isolate the proper wavelengths of light in the system, 
and the necessary positive and negative controls for the 


experiment. However, if the experimental paradigm is 
designed correctly, a wealth of detailed new knowl¬ 
edge may be learned. 

Cross-References 

► Fluorescence and FRET in Membranes 

► Lipid Signaling and Phosphatidylinositols 

► Live-Cell Single-Molecule Imaging 
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Synonyms 

Cyclic nucleotides; Spatiotemporal organization of 
cyclic nucleotides; Ion channels as biosensors 

Definition 

Ion channels are used as biosensors in order to visual¬ 
ize the spatiotemporal changes that occur in cyclic 
nucleotides in living cells. 
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Introduction 

Ion channels are pore-forming proteins that allow the 
flow of ions down their electrochemical gradient thus 
helping to establish and control the small voltage gra¬ 
dient across the plasma membrane of cells. They are 
present in the membranes that surround all biological 
cells and can be distinguished based upon their ion 
selectivity, gating mechanism, and sequence similar¬ 
ity. Ion channels can be voltage gated, ligand gated, 
pH gated, or mechanically gated. These gating criteria 
along with a combination of sequence similarity 
and ion selectivity further subdivide ion channels into 
several subtypes: 

• Voltage-gated ion channels which open and close 
in response to membrane potential such as the volt- 
age-gated sodium channels, voltage-gated calcium 
channels, and some transient receptor potential 
channels, normally referred to simply as TRP 
channels. 

• Ligand-gated ion channels, also known as ionotropic 
receptors, open in response to specific ligand 
molecules binding to the extracellular domain of the 
receptor protein. Examples of such channels include 
the cation-permeable “nicotinic” Acetylcholine 
receptor, ionotropic glutamate-gated receptors and 
ATP-gated P2X receptors, and the anion-permeable 
y-aminobutyric acid-gated GABAa receptor. 

• Other gating includes the light-gated, the tempera¬ 
ture-gated, mechanosensitive ion channels, and the 
cyclic nucleotide-gated channels. This latter super¬ 
family of channels contains two families that 
are going to be the topic of this entry: the cyclic 
nucleotide-gated (CNG) channels and the hyperpo¬ 
larization-activated, cyclic nucleotide-gated (HCN) 
channels. 

The study of ion channels involves many scientific 
techniques such as voltage clamp electrophysiology 
(in particular patch-clamp), immunohistochemistry, 
and RT-PCR. 

State of the Art 

Cyclic adenosine 3',5'-monophosphate (cAMP) and 
cyclic guanosine 3',5'-monophosphate (cGMP) are 
ubiquitous second messengers regulating a myriad of 
cellular functions. In the heart, the force of contraction 
(inotropy) and beating frequency (chronotropy) are 


under the dual control of the sympathetic and parasym¬ 
pathetic systems. Both systems control in an opposite 
manner the synthesis of cAMP, and hence the activity of 
the cAMP effectors (Fig. 1). Catecholamines released 
into the synaptic cleft at sympathetic nerve terminals 
bind to Pi-adrenergic receptors (pi-ARs) on the cardiac 
sarcolemma, activate stimulatory G proteins (Gs), which 
in turn activate two isoforms of adenylyl cyclase (AC), 
ACS, and AC6, which catalyze the conversion of ATP to 
cAMP. cAMP exerts its effects then by activating the 
cAMP-dependent protein kinase (PKA), thereby pro¬ 
moting the phosphorylation and activation of key com¬ 
ponents of the excitation-contraction coupling (CEC) 
process. These include the sarcolemmal L-type Ca^"^ 
channels (LTCC), responsible for the initial Ca^"^ influx, 
the ryanodine receptors that allow Ca^"^ release from the 
sarcoplasmic reticulum (SR), troponin I, which controls 
myofilament sensitivity to Ca^"^, and phospholamban, 
which regulates Ca^"^ withdrawal from the cytosol and 
reuptake into the SR (Bers 2002). cAMP can also act in 
cardiac myocytes in a PKA-independent manner, 
through direct activation of Epac (Morel et al. 2005), 
a guanine nucleotide exchange factor for the small 
GTPase Rapl (Bos 2003), and HCN cyclic nucleotide¬ 
gated ion channels (Baruscotti et al. 2005). As a mirror 
image, acetylcholine released into the synaptic cleft at 
parasympathetic nerve terminals binds to the muscarinic 
M2 receptors and induces AC inhibition via inhibitory 
G protein (Gi), thus decreasing PKA activation (Fig. 1) 
(Dhein et al. 2001). 

It is generally accepted that cGMP opposes the 
effect of cAMP on cardiac function (Fischmeister 
et al. 2005). Intracellular cGMP production is achieved 
by two different forms of guanylyl cyclases: a soluble 
form (sGC), which is activated by NO; and a particulate 
form (pGC) which is activated by natriuretic peptides, 
such as ANP, BNP, and CNP. cGMP mediates its effects 
by activating the cGMP-dependent protein kinase 
(PKG), thus reducing cardiac contractility by inhibition 
of LTCC activity and reduction of myofilament Ca^"^ 
sensitivity (Mery et al. 1991; Layland et al. 2002; 
Schroder et al. 2003; Fischmeister et al. 2005). 

The intracellular level of these two second messengers 
is regulated by the balance between the activity of AC/GC 
and the cyclic nucleotide phosphodiesterases (PDEs) that 
degrade cAMP and cGMP to 5'AMP and 5'GMP, 
respectively. Cardiac PDEs fall into at least five fami¬ 
lies: PDEl, which is activated by Ca^'^-calmodulin; 
PDE2, which is stimulated by cGMP; PDE3, which is 
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Ion Channels: New Tools to Track Cyclic Nucleotide 
Changes in Living Cells, Fig. 1 Regulation of cardiac excita¬ 
tion-contraction coupling by cAMP pathways. Upon liberation 
of noradrenaline by sympathetic nerve terminals, pi-ARs acti¬ 
vate cardiac AC isoforms (ACS and AC6) via the stimulatory 
G protein (Gs). Cyclic AMP is being synthesized and activates 
PKA which phosphorylates different intracellular targets: 
LTCC phosphorylation leads to an increased Ca^"^ influx 
and RyR2 phosphorylation at the SR membrane to an increased 
Ca^"^ -induced Ca2-l- release, both acting to enhance the force 


of contraction; PLB phosphorylation leads to an increased SR 
Ca^"^ uptake by SERCA and Tnl phosphorylation to a reduction 
in Ca^"^ -sensitivity of the myofilaments, both acting to acceler¬ 
ate contractile relaxation; on the long term, CREB phosphoryla¬ 
tion in the nucleus activates transcription. These effects are 
antagonized by a liberation of acetylcholine from parasympa¬ 
thetic nerve terminals, through activation of muscarinic M2 
receptors and inhibition of AC activity via inhibitory 
G proteins (Gi) 


inhibited by cGMP; PDE4; and PDE5. Whereas PDEl 
and PDE2 can hydrolyze both cAMP and cGMP, PDE3 
preferentially hydrolyzes cAMP; PDE4 and PDE 5 are 
specihc for cAMP and cGMP, respectively. 

Although many studies have dissected cAMP and 
cGMP signaling pathways, there are a number of ques¬ 
tions that have always made us wonder but have only 
lately begun to receive the attention they deserve: how so 
many different receptors coupled to cAMP or cGMP 
signaling pathway manage to achieve specihc cellular 
responses? What is the purpose of the different adenylyl 
and guanylyl cyclases present in the same cell? Why do so 
many different cyclic nucleotide phosphodiesterases 
(PDEs) coexist to hydrolyze cAMP and cGMP? Do 
these cyclic nucleotides and their respective effectors 
freely diffuse inside the cell or are they localized? Some 
of these questions have received answers in recent studies 


combining molecular biology, huorescence imaging, and 
electrophysiological approaches. In particular, compel¬ 
ling evidence is now accumulating about the formation 
of molecular complexes (signalosomes) in distinct cellu¬ 
lar compartments that induce a localized cyclic nucleotide 
signaling in cardiomyocyte (Eischmeister et al. 2006). 

Methods to Study Cyclic Nucleotide 
Compartmentation in Intact Myocytes 

For 20 years, most of the evidence supporting a com¬ 
partmentation of cyclic nucleotide signaling in cardiac 
preparations was gathered using biochemical assays in 
fractionated dead tissues or cells. However, these 
methods do not allow understanding the spatiotemporal 
organization of cyclic nucleotide signals in real time and 
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Ion Channels: New Tools to Track Cyclic Nucleotide 
Changes in Living Cells, Fig. 2 CNG channels as cyclic 
nucleotide biosensors, (a) Expression of CNGA2 as revealed 
by confocal microscopy in native and Ad-CNG (adenovirus 
carrying CNG)-infected adult rat ventricular myocytes cultured 
for 24 h. CNGA2 mutant was detected using a mouse monoclo¬ 
nal anti-CNGA2 kindly provided by Dr F. Muller and Professor 
U.B. Kaupp (Institute for Biological Information Processing, 
Research Center Julich, Julich, Germany). No CNG channels 
were detected in noninfected (NI) cells, whereas Ad-CNG- 


in living cells. Hence, a number of sophisticated 
methods have been recently developed to monitor cyclic 
nucleotide changes using their own targets, such as 
CNG channels (Rich et al. 2000), HCN channels 
(Nikolaev et al. 2006), PKA (Zaccolo et al. 2000), PKG 
(Honda et al. 2001), and Epac (Nikolaev et al. 2004; 
Ponsioen et al. 2004) (Figs. 2 and 3). 

Some of these sensors, such as CNG channels, are 
based on their ionic channel properties: cAMP and or 


infected cells demonstrated a rather ubiquitous pattern of expres¬ 
sion with accentuation of the staining at the Z-lines. (b) Recom¬ 
binant CNG channels. Wild-type or genetically modified 
a-subunits of rat olfactory CNG channel (CNGA2) form a cat¬ 
ionic channel directly opened by cyclic nucleotides (CN). 
(c) Cardiomyocytes infected with an adenovirus encoding the 
native or modified chamrels elicit a nonselective cation current 
(Icng) only when CN concentration rises beneath the sarcolem- 
mal membrane 


cGMP levels are measured either by fluorescent cal¬ 
cium probes or by patch-clamp technique (Willoughby 
and Cooper 2008); whereas, the remaining probes are 
based on the use of their cAMP-and/or cGMP-binding 
domain between YEP and CEP to track cyclic nucleo¬ 
tide-dependent changes by FRET (Fluorescence Res¬ 
onance Energy transfer) between the two fluorescent 
proteins (Fig. 3). Properties of these indicators are 
summarized in Table 1. 























Ion Channels: New Tools to Track Cyclic Nucleotide Changes in Living Cells 


1139 



Ion Channels: New Tools to Track Cyclic Nucleotide 
Changes in Living Cells, Fig. 3 HCN channels as cAMP bio¬ 
sensors. (a) FRET-based imaging method. YEP and CEP pro¬ 
teins are fused to a cyclic nucleotide (CN)-binding protein (for 
instance, HCN, Epac, or a catalytie-inactive PKG). CN binding 
reduces FRET between CFP and YFP and the change in shape of 
the fluorescence emission spectrum allows CN concentrations to 
be visualized in real time, (b) CN imaging in a living cell (rat 
ventricular myocyte). CFPA'FP ratio is expressed in color code 
and is proportional to CN concentration within the cell 


Cyclic Nucleotide-Gated Channels: New Indicators 
to Track Subsarcolemmal cAMP in Single Living 
Cells 

A novel approach initially developed by the groups of 
Dermott Cooper and Jeff Karpen uses recombinant CNG 
channels as a probe to monitor cyclic nucleotide concen¬ 
trations generated beneath the membrane (Rich et al. 
2001b). CNG channels were identified in photoreceptors 
and olfactory sensory neurons, where they translate 
changes in intracellular cyclic nucleotide concentration 
triggered by light or odorants into variations of the mem¬ 
brane potential. Native CNG channels consist of a- and 
/(-subunits, with the a-subunit forming a nonspecific cat¬ 
ionic pore gated by cAMP or cGMP (Kaupp and Seifert 
2002). Upon binding of the cyclic nucleotide, CNG chan¬ 
nels allow Na"^, K"^, and Ca^"^ to enter the cell (Fig. 2) and 
this activity is easily detected using either patch-clamp or 
Ca^Nfiuorescence measurements (Rich et al. 2000). CNG 
channels display several characteristics that make them 
valuable cyclic nucleotide sensors: their gating is fast, 
they do not desensitize, they are virtually voltage- 
insensitive, and they display an excellent dynamic range 
(Rich and Karpen 2002). The rat olfactory a-subunit 


(CNGA2) makes a valuable cGMP sensor that was used 
recently to show cGMP compartmentation in cardiac 
myocytes (Castro et al. 2006, 2010). In order to study 
selectively cAMP dynamics. Rich et al. (2001b) have 
made point mutations in CNGA2 to decrease the sensi¬ 
tivity to cGMP and increase the sensitivity to cAMP. In 
particular, the E583M mutation confers on the channel an 
EC 50 for cAMP of approximately 10 pM, and the addi¬ 
tional C460W mutation further increases the sensitivity, 
to approximately 1 pM. These CNGA2 mutants were 
incorporated into adenovirus constracts to allow efficient 
expression in various cell types (Vandecasteele et al. 
2006). Adult cardiac myocytes infected with an 
adenovirus encoding the modified channels elicit 
a nonselective cation current that provides a reliable read¬ 
out of subsarcolemmal cAMP fluctuations (Rochais et al. 
2004,2006; Leroy et al. 2008; Abi-Gerges et al. 2009). 

Because CNG channels are blocked by external and 
internal Ca^"^ as well as by calmodulin (CaM), patch- 
clamp experiments should be performed at nominal 
external [Ca^"^] and internal pCa 8.5 (Rochais et al. 
2004; Castro et al. 2006). Since Ca^"^ regulates the 
activity of cardiac AC and PDEs, thus modulating 
cAMP signals, other CNG mutants (A61-90/C460W/ 
E583M or F68A/W81A/C460/E583M) that do not 
bind to Ca^VCaM complex have been generated later 
on (Rich et al. 2001b; Willoughby and Cooper 2007; 
Dubois 1985). 

The use of CNG channels in HEK cells showed that the 
activation of prostaglandin El (PGEl) receptors generates 
two types of cAMP signals: a rapid and transient increase 
of cAMP beneath the membrane while the increase in total 
cytosolic cAMP level is slow and sustained. The transient 
increase of subsarcolemmal cAMP was abolished after 
PDE inhibition (Rich et al. 2001a). 

In a series of studies performed in isolated rat cardiac 
myocytes, the use of CNG mutants showed the first 
evidence in real time that subsarcolemmal cAMP sig¬ 
nals generated upon activation of different GPCRs are 
spatially confined in specific microdomains and are 
highly controlled by a specific pattern of PDEs (Rochais 
et al. 2004, 2006). In addition, cAMP levels elicited 
upon P-adrenergic stimulation were shown to be nega¬ 
tively tuned by PKA-mediated activation of PDE3/ 
PDE4 (Rochais et al. 2004). Leroy and colleagues com¬ 
bined the use of the CNG channels with the FRET-based 
sensor Epac2-camps, in order to monitor respectively 
cAMP signals generated beneath the membrane and in 
the cytosol (Leroy et al. 2008). 
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Ion Channels: New Tools to Track Cyclic Nucleotide Changes in Living Cells, Table 1 Properties of cAMP sensors. Adapted 
from (Willoughby and Cooper 2008) 

Probe 

PKA 

Epac 

CNG channels 

HCN 

Methods 

FRET imaging 

FRET imaging 

Patch-clamp 

Calcium imaging 

FRET imaging 

Expression 

Microinjection 

Plasmid transfection; adenovirus 
infection 

Plasmid transfection; 
adenovirus infection 

Adenovirus infection 

Transgenic mice 
overexpresssing 
HNC2-camps 


Dynamic ~ 0.02-5 gM rs: 5-100 ^iM (Epac : Ri 1-50 pM (E583M) Ri 5 pM (in vitro) 

the whole protein) 


1-100 pM rsi 0.1-10 pM (Epac Ri 0.1-5 pM (C460W/ Excellent dynamic 


including cAMP-binding E583M, A61-90/C460W/ range 
domain) E583M) 


Localization 

Cytosol, plasmic membrane, 
binding to AKAPs (PKA- 
anchoring protein) 

Cytosol, plasmic 
membrane, mitochondria, 
nucleus 

Plasmic membrane 

Homogenous 
distribution within the 
cytosol 

Temporal 

resolution 

Limited (many seconds) 

Good (2 s) 

Excellent (<15 ms) 

Good 


Biochemical assays showed that the cAMP pathway 
is altered during cardiac hypertrophy and heart 
failure. Using the engineered CNG channels, we pro¬ 
vided the first evidence, in real time and on living, 
hypertrophied cardiac myocytes that subsarcolemmal 
cAMP signals generated upon the activation of 
P-adrenergic receptors are blunted compared to normal 
cardiac myocytes. The regulation of these cAMP 
domains by PDEs is also altered during hypertrophy 
(Abi-Gerges et al. 2009). 

Hyperpolarization-Activated, Cyclic Nucleotide- 
Gated Channels: A Novel Sensor to Monitor 
Cytosolic cAMP in Living Cells 

Hyperpolarization-activated, cyclic nucleotide-gated 
cation channels (HCN) are voltage-dependent chan¬ 
nels activated by hyperpolarization and sensitive to 
the cyclic nucleotides, cAMP and cGMP. They are 
permeable to the monovalent cations and Na"^. In 
vertebrates, the HCN family comprises four members 
(HCNl-4). While HCN3 is specifically expressed in 
neurons, the three other channels (HCN 1, 2, and 4) are 
detected in heart and brain. The HCN4 isoform is 
mainly expressed in sinoatrial cells while HCN2 is 
dominant in atrial and ventricular myocytes. HCN 
channels have, in their C-terminal part, a cyclic-nucle- 
otide-binding domain which is similar to the one that 
is present in PKA, PKG, Epac, and CNG channels. 
The current associated with these channels is known 
as the “funny” current (If) or the pacemaker current. It 
contributes to the generation of the cardiac automatism 


in the cells of the sinoatrial and atrioventricular node 
(Biel et al. 2002; Mangoni and Nargeot 2008). 

cAMP probes based on the use of PKA and Epac have 
the inherent disadvantage that their sensitivity is rela¬ 
tively high (affinity for cAMP « 1 pM). Therefore, the 
determination of cAMP with these constmcts is limited 
to the low range of cAMP concentrations, which is 
poorly adapted to cells in which cAMP can reach micro¬ 
molar concentrations, such as cardiac myocytes. There¬ 
fore, Nikolaev and colleagues have developed a novel 
sensor with optimized sensitivity to cAMP based on the 
use of a single cytoplasmic cAMP-binding domain of 
the HCN2 channel endogenously expressed in the heart, 
known as HCN2-camps (Nikolaev et al. 2006). Based on 
the crystal structure of HCN2, they fused its cAMP- 
binding domain between YEP and CEP to monitor 
cAMP-dependent conformational changes by ERET 
between the two fluorescent proteins (Fig. 3). This sen¬ 
sor, isolated from HEK293 cells, demonstrated micro¬ 
molar affinities for cAMP whereas cGMP activated the 
sensor only partially at high micromolar concentrations. 
HCN2-camps sensor was expressed in a cardiomyocyte- 
speciflc manner in the hearts of transgenic mice in order 
to study spatial and temporal cAMP dynamics after Pi- 
and P 2 -adrenergic subtype stimulation in freshly iso¬ 
lated adult cardiac myocytes. Using this probe, which 
is uniformly expressed in the cell, the authors showed 
that a localized Pi-adrenergic stimulation generated 
a cAMP gradient propagating throughout the cell, 
whereas local P 2 -adrenergic stimulation elicited con¬ 
fined cAMP pools. Furthermore, the activation of 
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Pi-adrenergic receptors elicited cAMP signals that were 
entirely controlled by PDE4, whereas the activation of 
P 2 -adrenergic receptors generated cAMP signals regu¬ 
lated by multiple PDE isoforms. Even after PDE inhibi¬ 
tion, the P 2 -adrenergic-mediated cAMP signals were 
still confined within the cells suggesting a role of the 
cell architecture in cAMP compartmentation (Nikolaev 
et al. 2006). 

Conclusion 

A number of different probes are now available for 
real-time measurements of cAMP (Nikolaev et al. 
2004, 2005; Zaccolo 2004; Ponsioen et al. 2004) and 
cGMP in living cells, including cardiac myocytes. The 
first approach is based on the properties of CNG chan¬ 
nels that are directly activated by the increase of cAMP 
and cGMP concentrations beneath the membrane. In 
order to visualize changes in cyclic nucleotide levels 
within the cytosol, another type of approach based on 
ERET imaging has been developed using cAMP and 
cGMP targets. Combining the two techniques allows 
a better understanding of the spatiotemporal organiza¬ 
tion of cyclic nucleotide signaling in many cell types. 

Summary 

Signaling cascades involving cyclic nucleotides play key 
roles in signal transduction in virtually all cell types. 
A current challenge in research today is to decipher the 
spatiotemporal regulation of cyclic nucleotide signaling 
that any given cell type has concocted to be able to 
discriminate among different external stimuli acting via 
a common signaling pathway. New approaches based on 
the use of ion channels as biosensors have been recently 
developed in order to track the dynamics of cAMP and 
cGMP as well as the activities of their regulators and 
effectors in the native biological context. This obviously 
applies to cyclic nucleotide-gated channels (CNG) 
(Pagan et al. 2001; Rich et al. 2001a, b; Brady et al. 
2004) and the hyperpolarization-activated cyclic nucleo¬ 
tide-gated channels (HCN) (Nikolaev et al. 2006) whose 
high sensitivity to cAMP and/or cGMP enable the mon¬ 
itoring, in real time and in living cells, of signals encoded 
by these second messengers (Rochais et al. 2004, 2006; 
Castro et al. 2006, 2010; Leroy et al. 2008; Abi-Gerges 
et al. 2009). This entry has focused on a series of 


genetically encoded probes based on the use of CNG 
and HCN channels for real-time monitoring of cyclic 
nucleotide signaling with a particular focus on their 
implementation in cardiac myocytes. 

Cross-References 

► Fluorescence and FRET in Membranes 
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Gating: VCF, FRET, and ERET 

► Gating of Potassium Channels by Cyclic Nucleotide 
Binding 

► HCN Channels: Biophysics and Functional 
Relevance 

► Ion Channel Regulation by G-Protein-Coupled 
Receptors: Recent Advances with Optical Biosensors 

► Katp and Sulfonylurea Receptor 

► Live-Cell Single-Molecule Imaging 

► Modeling the Heart 

► Molecular Dynamics Simulations of Carbohydrates 

► Nitric Oxide Synthase - Computational Studies 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► Protein Phosphorylation 

References 

Abi-Gerges A, Richter W, Lefebvre F, Mateo P, Varin A, 
Heymes C, Samuel J-L, Lugnier C, Conti M, Fischmeister R, 
Vandecasteele G. Decreased expression and activity of cAMP 
phosphodiesterases in cardiac hypertrophy and its impact on 
K-adrenergic cAMP signals. Circ Res. 2009;105:784-92. 
Baruscotti M, Bucchi A, DiFrancesco D. Physiology and 
pharmacology of the cardiac pacemaker (“funny”) cuirent. 
Pharmacol Ther. 2005;107:59-79. 

Bers DM. Cardiac excitation-contraction coupling. Nature. 
2002;415:198-205. 

Biel M, Schneider A, Wahl C. Cardiac HCN channels: structure, 
function, and modulation. Trends Cardiovasc Med. 
2002;12:206-12. 

Bos JL. Epac: a new cAMP target and new avenues in cAMP 
research. Nat Rev Mol Cell Biol. 2003;4:733-8. 

Brady JD, Rich TC, Le X, Stafford K, Fowler CJ, Lynch L, 
Karpen JW, Brown RL, Martens JR. Functional role of 
lipid raft microdomains in cyclic nucleotide-gated channel 
activation. Mol Pharmacol. 2004;65:503-11. 

Castro LRV, Verde 1, Cooper DMF, Fischmeister R. Cyclic 
guanosine monophosphate compartmentation in rat cardiac 
myocytes. Circulation. 2006;113:2221-8. 

Castro LRV, Schittl J, Fischmeister R. Feedback control through 
cGMP-dependent protein kinase contributes to differential 
regulation and compartmentation of cGMP in rat cardiac 
myocytes. Circ Res. 2010;107:1232-40. 



I 


1142 


Ion Mobility Mass Spectrometry - Principles 


Dhein S, Van Koppen CJ, Brodde OE. Muscarinic receptors in 
the mammalian heart. Pharmacol Res. 2001;44:161-82. 

Dubois J-M. Physiologie et pharmacologie des canaux Na et 
K des membranes axonales. J Physiol Paris. 1985;80:120-8. 

Fagan KA, Schaack J, Zweifach A, Cooper DMF. Adenovirus 
encoded cyclic nucleotide-gated channels: a new methodol¬ 
ogy for monitoring cAMP in living cells. FEES Lett. 
2001;500:85-90. 

Fischmeister R, Castro L, Abi-Gerges A, Rochais F, 
Vandecasteele G. Species- and tissue-dependent effects of 
NO and cyclic GMP on cardiac ion channels. Comp Biochem 
Physiol A Mol Integr Physiol. 2005;142:136-43. 

Fischmeister R, Castro LRV, Abi-Gerges A, Rochais F, 
Jurevicius J, Leroy J, Vandecasteele G. Compartmentation 
of cyclic nucleotide signaling in the heart: the role of cyclic 
nucleotide phosphodiesterases. Circ Res. 2006;99:816-28. 

Honda A, Adams SR, Sawyer CL, Lev-Ram V, Tsien RY, 
Dostmann WR. Spatiotemporal dynamics of guanosine 
3',5'-cyclic monophosphate revealed by a genetically 
encoded, fluorescent indicator. Proc Natl Acad Sci USA. 
2001;98:2437^2. 

Kaupp UB, Seifert R. Cyclic nucleotide-gated ion channels. 
Physiol Rev. 2002;82:769-824. 

Layland J, Li JM, Shah AM. Role of cyclic GMP-denendent 
protein kinase in the contractile response to exogenous nitric 
oxide in rat cardiac myocytes. J Physiol. 2002;540:457-67. 

Leroy J, Abi-Gerges A, Nikolaev VO, Richter W, Lechgne P, 
Mazet J-L, Conti M, Fischmeister R, Vandecasteele G. Spatio- 
temporal dynamics of 6-adrenergic cAMP signals and L-type 
Ca^"^ channel regulation in adult rat ventricular myocytes: role 
of phosphodiesterases. Circ Res. 2008;102:1091-100. 

Mangoni ME, Nargeot J. Genesis and regulation of the heart 
automaticity. Physiol Rev. 2008;88:919-82. 

Mery P-F, Lohmann SM, Walter U, Fischmeister R. Ca^"^ cuiTent 
is regulated by cyclic GMP-dependent protein kinase in 
mammalian cardiac myocytes. Proc Natl Acad Sci USA. 
1991;88:1197-20L 

Morel E, Marcantoni A, Gastineau M, Birkedal R, Rochais F, 
Gamier A, Lompre A-M, Vandecasteele G, Lezoualc’h F. 
The cAMP-binding protein Epac induces cardiomyocyte 
hypertrophy. Circ Res. 2005;97:1296-304. 

Nikolaev VO, Bunemann M, Hein L, Hannawacker A, 
Lohse MJ. Novel single chain cAMP sensors for receptor- 
induced signal propagation. J Biol Chem. 2004;279:37215-8. 

Nikolaev VO, Gambaryan S, Engelhardt S, Walter U, Lohse MJ. 
Real-time monitoring of live cell’s PDE2 activity: hormone- 
stimulated cAMP hydrolysis is faster than hormone- 
stimulated cAMP synthesis. J Biol Chem. 2005;280:1716-9. 

Nikolaev VO, Bunemann M, Schmitteckert E, Lohse MJ, 
Engelhardt S. Cyclic AMP imaging in adult cardiac 
myocytes reveals far-reaching Bi-adrenergic but locally con¬ 
fined B 2 -adrenergic receptor-mediated signaling. Circ Res. 
2006;99:1084-9L 

Ponsioen B, Zhao J, Riedl J, Zwartkrais F, van der Krogt G, 
Zaccolo M, Moolenaar WH, Bos JL, Jalink K. Detecting 
cAMP-induced Epac activation by fluorescence resonance 
energy transfer: Epac as a novel cAMP indicator. EMBO 
Rep. 2004;5:1176-80. 

Rich TC, Karpen JW. Review article: cyclic AMP sensors in 
living cells: what signals can they actually measure? Ann 
Biomed Eng. 2002;30:1088-99. 


Rich TC, Fagan KA, Nakata H, Schaack J, Cooper DMF, Karpen 
JW. Cyclic nucleotide-gated channels colocalize with 
adenylyl cyclase in regions of restricted cAMP diffusion. 
J Gen Physiol. 2000;116:147-6L 

Rich TC, Fagan KA, Tse TE, Schaack J, Cooper DM, Karpen 
JW. A uniform extracellular stimulus triggers distinct cAMP 
signals in different compartments of a simple cell. Proc Natl 
Acad Sci USA. 2001a;98:13049-54. 

Rich TC, Tse TE, Rohan JG, Schaack J, Karpen JW. In vivo 
assessment of local phosphodiesterase activity using tailored 
cyclic nucleotide-gated channels as cAMP sensors. J Gen 
Physiol. 2001b; 118:63-77. 

Rochais F, Vandecasteele G, Lefebvre F, Lugnier C, Lum H, 
Mazet J-L, Cooper DMF, Fischmeister R. Negative feedback 
exerted by PKA and cAMP phosphodiesterase on 
subsarcolemmal cAMP signals in intact cardiac myocytes. 
An in vivo study using adenovims-mediated expression of 
CNG channels. J Biol Chem. 2004;279:52095-105. 

Rochais F, Abi-Gerges A, Homer K, Lefebvre F, Cooper DMF, 
Conti M, Fischmeister R, Vandecasteele G. A specific pattern 
of phosphodiesterases controls the cAMP signals generated 
by different Gj-coupled receptors in adult rat ventricular 
myocytes. Circ Res. 2006;98:1081-8. 

Schroder F, Klein G, Fiedler B, Bastein M, Schnasse N, Hillmer A, 
Ames S, Gambaryan S, Drexler H, Walter U, Lohmann SM, 
Wollert KC. Single L-type Ca^’’’ channel regulation by cGMP- 
dependent protein kinase type I in adult cardiomyocytes from 
PKG I transgenic mice. Cardiovasc Res. 2003;60:268-77. 

Vandecasteele G, Rochais F, Abi-Gerges A, Fischmeister R. 
Functional localization of cAMP signalling in cardiac 
myocytes. Biochem Soc Trans. 2006;34:484-8. 

Willoughby D, Cooper DM. Organization and Ca^’’’ regulation of 
adenylyl cyclases in cAMP microdomains. Physiol Rev. 
2007;87:965-1010. 

Willoughby D, Cooper DM. Live-cell imaging of cAMP dynam¬ 
ics. Nat Methods. 2008;5:29-36. 

Zaccolo M. Use of chimeric fluorescent proteins and fluorescence 
resonance energy transfer to monitor cellular responses. Circ 
Res. 2004;94:866-73. 

Zaccolo M, De Giorgi F, Cho CY, Feng L, Knapp T, Negulescu 
PA, Taylor SS, Tsien RY, Pozzan T. A genetically encoded, 
fluorescent indicator for cyclic AMP in living cells. Nat Cell 
Biol. 2000;2:25-9. 


Ion Mobility Mass Spectrometry - 
Principles 

Jason M. D. Kalapothakis' and Perdita E. Barran^ 
'School of Physics and Astronomy, The University of 
Edinburgh, Edinburgh, Scotland, UK 
^The School of Chemistry, The University of 
Edinburgh, Edinburgh, Scotland, UK 

Synonyms 

Mass spectrometry 




Ion Mobility Mass Spectrometry - Principles 


1143 


Introduction 

The start of the analytical technique termed “ion 
mobility spectrometry” (IMS) can be traced to the 
beginnings of the twentieth century, when experimen¬ 
tal (Erikson 1927; Bradbury 1932) and theoretical 
(Langevin 1903) physicists became interested in the 
movement of ions in gases. In these early studies, as in 
all modem ion mobility methods, the transport proper¬ 
ties of ions are measured, as they “drift” under the 
influence of a weak electric field while experiencing 
collisions with a buffer gas. Such measurements allow 
the determination of ionic mobilities and diffusion 
coefficients. Following the experiments of Erikson 
(Erikson 1927) and Loeb and Bradbury (Bradbury 
1932), in the 1950s, McDaniel and Kebarle (Barnes 
et al. 1961; Mcdaniel et al. 1962) constructed low-field 
drift tubes which bear a great resemblance to drift-time 
IMS mobility instrumentation still used today. In 1965, 
Kebarle coupled such a drift tube to a mass spectrom¬ 
eter creating the hybrid technique of ion mobility mass 
spectrometry (IM-MS). (Kebarle and Hogg 1965) This 
work prompted the development of different configu¬ 
rations of ion mobility and mass spectrometry instru¬ 
mentation over the past 40 years and their application 
to an ever wider range of analyte species. Details of the 
early parallel development of ion mobility and of mass 
spectrometry and its applications to large macromole¬ 
cules can be found in an excellent review by Heck et al. 
(Uetrecht et al. 2010) focused on applications and one 
on instrumentation in a comprehensive summary by 
Hill et al. (Kanu et al. 2008) In the years 2005-2010, 
IM-MS has been developed into commercial instru¬ 
mentation and as such has become more widely 
applied as a research tool. 

Basics of an Ion Mobility Mass Spectrometer 

An ion mobility mass spectrometer usually comprises 
the following components: 

• An ion source, followed by 

• An accumulation!transfer region (not necessary if 
ions are produced directly in the drift region) 
guiding and pulsing ions into the 

• Drift tube, in which mobility separation occurs; ions 
are subsequently transported to a 

• Mass analyzer, after which ions reach an 

• Ion detector 


Different permutations of this basic architecture are 
possible, e.g., positioning a mass filter before the drift 
cell, instead of after. The modularity of mass spectrom¬ 
eters in general has allowed for a range of more complex 
designs to be realized (see (Wyttenbach and Bowers 
2003; Bohrer et al. 2008) and references therein). Ions 
once produced in the ion source must be transferred into 
the high-pressure drift region. If regions of lower pres¬ 
sure intervene between these two parts, ions must pos¬ 
sess sufficient energy to overcome the outward flow of 
gas from the drift cell’s entrance aperture. A potential 
difference between the accumulation region and the 
drift region must be imposed for such a scenario to be 
realized, which is termed injection energy. As it is 
practically difficult to balance the deceleration of ions 
due to the pressure differential with the acceleration due 
to injection energy, ions are likely to enter the drift cell 
with somewhat elevated kinetic energy. However, ther- 
malization becomes very rapid once the ions enter the 
low-field drift space (Mason and McDaniel 1988). 
Injection energy can also be utilized as a tool for struc¬ 
tural investigation of ions. By monitoring the injection 
energy ions can be collisionally activated by controlled 
amounts as they enter the drift cell, and resultant 
changes in mobility can be measured directly afterward. 

Despite differences in the configuration of mobility 
devices, there are some features common to all IM-MS 
experiments. A typical mobility experiment requires the 
injection of a pulse of ions into a chamber filled with 
a known gas at a known pressure across which an 
electric field is applied. The time taken for the ions to 
pass through the chamber is measured. Upon injection 
into the chamber or drift cell, the ions experience an 
electrostatic force pulling them through the cell; this 
force is countered by collisions between the ions and 
the buffer gas. The behavior of an ion moving through 
a gas under the influence of an electric field (E) is 
dependent on its energy which is determined by the 
ratio of electric field strength to buffer gas number 
density, E/N. (Mason and McDaniel 1988) At low 
E/N, the ions are said to be in the low-field limit; 
under this regime, the ions have low velocities which 
are linearly dependent of the field strength. At higher 
E/N, the ions may align in the field, and their mobilities 
become dependent on the field strength. In the low-field 
limit, the motion of the ions can be described in more 
simple terms, and most ion mobility measurements are 
therefore performed in the low-field limit. Under these 
conditions, we define the low field mobility, (K), 
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as the constant of proportionality between average drift 
speed, (vo) (defined by the drift time of ions (to) 
divided into the length of the drift tube), and the mag¬ 
nitude of electric field, E, as shown by Eq. 1 : 

v^ = i = KE. (1) 

fo 

Eor experiments performed at sufficiently high 
pressure, and low drift voltages, ions will quickly be 
thermally equilibrated, reaching a constant drift veloc¬ 
ity V(j which is determined by the magnitude of the 
electric field (E) and its influence on the charge of the 
ion as well as by the number of collisions that the ion 
makes with the buffer gas. Hence, the mobility, K, 
must contain terms relating to ionic shape, charge, 
and the buffer gas pressure. These latter properties 
combine to provide the rotationally averaged collision 
cross section (Q) for each ion which is critically depen¬ 
dent on the chosen buffer gas as well as on its temper¬ 
ature (see Eq. 2 below). Hence, determining Vj 
elucidates Q which can be interpreted as a conforma¬ 
tion dependent order parameter for the molecular ion 
under particular gas-phase conditions. 

To determine v^ and hence K and Q accurately with 
ion mobility instrumentation, the length of the drift region 
must be known precisely then the experimentalist records 
the average arrival times of ions at a detector. In 
IM-MS instrumentation, detection will also provide 
mass resolution, and so the arrival time for a given mass 
to charge ratio is found. 

There are three principal types of ion mobility 
instrumentation which have been successful combined 
with mass spectrometry traditional linear IM-MS, dif¬ 
ferential ion mobility (such as FAIMS), and travelling- 
wave mobility spectrometry, which makes use of 
temporally and spatially dependent fields. 


Types of Mobility Separation 

Drift Tube (DT) IM-MS 

Following from the apparatus used by Kebarle, the 
simplest configuration of IM-MS instrumentation 
consists of a tube located within a vacuum system 
filled with a buffer gas of interest across which is 
applied a weak electric field (5-100 V cm”*) is 
applied. Ions are steered and injected into a small 
orifice at one end of the drift tube. They drift due to 


the field and collisions with the buffer gas until they 
exit through another small orifice. By measuring the 
drift velocity of an ion through such a linear drift field 
and solving Eq. 1 above for the proportionality param¬ 
eter(the experimental ion mobility), it is possible 
to determine its collision cross section with some 
degree of accuracy (Mason and McDaniel 1988) 
according to the relationship in Eq. 2, an expression 
for the ion mobility at low fields from kinetic 
theory: 


^271 

N being the buffer gas number density, q the ionic 
charge, and nii, the masses of the analyte ion and the 
buffer gas, respectively, and Qq the diffusion momen¬ 
tum transfer integral or orientationally averaged cross 
section (which is a good indicator of the shape of the 
ions, on which the relative frequency and nature of the 
collisions with the buffer gas particles ultimately 
depend) kg is the Boltzmann constant, and T is the 
gas temperature. 

Several IM-MS instruments which incorporate drift 
cells over which a linear field gradient is applied have 
been reported. Notable efforts have been made in the 
“modern” era of IM-MS design by Bowers, Jarrold, 
Clemmer, Russell, and Hill. 

Fundamentals of Drift-Time Separation 
The relationship between drift velocity and mobility as 
described in Eq. 1 is fundamental for ions undergoing 
elastic collisions with neutrals, and a very similar 
expression can be derived by taking into account con¬ 
servation of momentum. This experimental collision 
cross section which is buffer gas dependent (Matz et al. 
2002) can be compared to cross sections derived from 
coordinates obtained from other structural investiga¬ 
tions (e.g., crystal structure coordinates) or from com¬ 
putational predictions used in this fashion IM-MS can 
provide a powerful method for “filtering candidate 
geometries.” 

The relations between collision cross section and 
mobility as described in Eq. 2 are fundamental for ions 
undergoing elastic collisions with neutrals, and a very 
similar expression can be derived by taking into account 
conservation of momentum. The experimentally mea¬ 
sured parameter is the arrival time 4 , the time between 
the creation of a pulse of ions and their arrival at the 
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detector, and includes the drift time and some “dead 
time,” i.e., time the ions spend outside the drift cell, Iq. 


— = —V • J — k^n = ■ n — v^V • n 

at 


k^n. (7) 


4 = + to- 


(3) 


Since to is an undesirable source of error, a method is 
needed to derive the mobility without including it to the 
drift time. For this reason, the reduced mobility, Kq, which 
is a function of temperature, pressure, and “standard” 
temperature and pressure, is typically defined: 


Ko = K 


P 

760Torr 


273.15K 

f 


(4) 


Obviously, at “standard” temperature and pressure, 
K and Kq coincide. Combining Eqs. 1 and 4 solving for 
fo and substituting into Eq. 3 gives: 

1 £2273.15KF 
'’'~Yo r760Torr V 

Thus, plotting the arrival time versus PfV (where V 
is the voltage) will yield a straight line with a slope 
inversely proportional to Kq intersecting the time axis 
at to, a convenient way of factoring out to- Thus, the 
mobility of the ions under study can be obtained 
experimentally, allowing the average collision cross 
section to be calculated from Eqs. 1, 2, 3, and 4 

The method of analysis outlined above can be 
applied using an average arrival time from experimen¬ 
tal arrival time distributions. However, when ions of 
identical m/z contain two or more subspecies partially 
resolved in the drift tube, an alternative method for 
obtaining ionic mobilities is called for. Eitting the 
arrival time distributions (ATDs) to an expression for 
ion flux offers a different methodology for deriving 
mobilities from experimental drift-tube data. 

Sharp ion pulses injected in drift tubes are subjected 
to two concurrent processes; drift along the direction 
of the applied hied and diffusion due to thermal 
fluctuations, allowing Fick’s law to be written as: 

J = Vd • « — HV • «. (6) 

J representing ion flux (current), n ion number 
density, and D the diffusion tensor. Assuming that 
depleting, but no accumulating, reactions may take 
place with rate k_, and ion transport can be described 
by this continuity equation: 


Note that in the above expression, D is conceived as 
being independent of ionic position; furthermore, lon¬ 
gitudinal and transverse diffusion coefficients are 
assumed to be identical. Both conditions are fulfilled 
when ions drift under the influence of a uniform and 
sufficiently weak field. At the vanishing-field limit, the 
mobility K and the diffusion coefficient D are related 
via the Nemst-Einstein-Townsend relation: 


K = 


keT' 


( 8 ) 


Moseley et al. (1969) have solved Eq. 7 for n using 
an initial condition of an infinitely thin disk of ions 
with radius same as the one for the entrance aperture ro 
of the drift tube and center at z = 0 (z being the 
direction of the field). Subsequently solving Eq. 6 for 
ions along the z-axis yielded: 


./ =■ 


ae 


-k_t 


As/ nDt 




(9) 


In the above expression, a represents the number of 
ions utilized for generating one ATD. Multiple species 
drifting independently can be modeled by Eq. 9 by the 
sum of currents resulting for ions with different drift 
speeds (v^) and diffusion coefficients (£>). 


Traveling Wave (TW) IM-MS 

So far the discussion has focused on drift tubes in 
which the potential varies linearly in space and is 
invariant in time. Nevertheless this need not be the 
case; travelling wave and FAIMS devices rely on spa¬ 
tially and temporally varying fields for separating ions. 
Waters MS Technologies introduced the first commer¬ 
cially available integrated IM-MS instrument the 
Synapt HDMS in 2006 (Pringle et al. 2007). With its 
origins in the pioneering work of Gerlich (Gerlich 
1992), this mobility separator consists of three stacked 
ring ion guides (called the Triwave) with RE applied to 
consecutive electrodes; to propel ions through the 
device, a travelling wave (TW) comprising a series of 
transient DC voltages is superimposed on top of the RF 
voltage. The TWIMS separation device is superficially 
similar to that of low and uniform-field IMS in that 
it comprises of a set of stacked ring electrodes. 
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In contrast to conventional IMS, the voltage “scans” the 
electrodes sequentially in “waves,” of predetermined 
amplitude and frequency, between which no field is 
applied; ions can be thought to be “surhng” on these 
waves. The potential inducing ionic motion must have 
the form of a periodic traveling wave along the longitu¬ 
dinal axis (z); V{x, y, kz — (at, t) (k being its wave number 
and (I) its frequency). Now, the motion of ions in a drift 
tube with no external force applied (i.e., near the trough 
of the wave) will be entirely thermal. As the wave crest 
approaches the ions, the latter will move proportionally 
to the spatial derivative of the wave assuming a suffi¬ 
ciently dense background gas; from Eq. 1 it can be 
clearly seen that the ionic speed as well will be a peri¬ 
odic travelling wave, since £ = - VE. Thus the drift 
speed can also be modeled as a travelling wave Vd(x, y, 
kz - (at, t). “Sharp” waves with high amplitude will 
result to a high ionic velocity at the direction of wave 
propagation. If the ionic velocity is high enough, a single 
wave will transfer the ions to the end of the drift tube and 
no further separation will occur. The presence of a 
buffer gas will provide some resistance to the directed 
motion of ions; broader waves with lower amplitudes 
and higher wave velocities will result in ionic speeds 
sufficiently lower than wave speeds, suggesting that ions 
eventually “tumble” at the other side of the wave and are 
decelerated in the direction of wave propagation until 
the next wave catches up. Given that this phenomenon 
occurs several times in the TM device, the average 
speed of the ionic distribution will converge to a value 
lower than wave speed. The phenomenon is essentially 
that of Stokes drift, resulting to a drift velocity that is 
non-lineaiiy dependent on wave amplitude and ionic 
mobility. This property allows the use of travelling- 
wave potentials for the construction of both analytical 
separation devices and simple high-efficiency ion trans¬ 
fer devices. Moreover, an equation for the ion density 
may be given using the simple approach described in the 
previous section, but accounting for the incori^ancy of 
the drift velocity and the diffusion tensor D: 


— = -V • / - = V • ( DV • n) 

dt ^ ’ (10) 

— V • {v£,{x,y, kz — cot, t) ■ ti) — k_n 

The main problem arising with the TW setup is, once 
again, that of field energy; during their peak velocity, 
collisions in the direction of wave propagation may 


induce the temperature of ions to increase above that 
of the buffer gas. 

Commercial distributions of this method have also 
included an additional RF-held to limit radial spread and 
thus increase ion transmission efficiency. In 2009, 
Waters launched the Synapt G2 System which improves 
some of the design features of the first-generation instru¬ 
ments, whilst essentially still a TW mobility separator. 
This second-generation instrument has a higher field 
pusher, into a dual-stage reflection, as well as the inclu¬ 
sion of a short helium-filled cell prior to the drift region 
to decelerate and cool ions, whilst retaining high trans¬ 
mission efficiency. Together these features have pro¬ 
vided higher sensitivity, improved mass accuracy, and 
an elevated data acquisition rate of 20 spectra per sec¬ 
ond. Moreover the ion mobility resolving power of the 
Synapt G2 is up to four times higher than in the original 
Synapt HDMS system. In the short time these instru¬ 
ments have been available; they have made a signihcant 
impact on the use of IM-MS for the study of macromo- 
lecular systems and in particular on the study of intact 
proteins and protein complexes. The mobility of ions 
through these travelling wave devices is not as directly 
related to collision cross section as in DT IM-MS 
instruments. Shvartsburg has made notable efforts to 
formally understand the relationship between collision 
cross section and mobility in a travelling wave device 
(Shvartsburg and Smith 2008), but as yet such 
approaches are unfortunately lacking within the IM-MS 
community. Instead the most fashionable method for 
estimation of collision cross sections from TW data 
involves calibration of mobilities measured with N 2 , as 
a drift gas in Synapt instmmentation against sets of stan¬ 
dards previously measured in he with linear drift tubes 
(Ruotolo et al. 2008). Ionic collision cross section values 
obtained by calibration are close to the values in helium 
rather than nitrogen, which is the experimental drift gas. 

Field Asymmetric Waveform Ion Mobility 
Spectrometry (FAIMS) 

The hnal type of ion mobility separation commonly used 
in conjunction with mass spectrometry, is held asymmet¬ 
ric waveform ion mobility spectrometry (FAIMS) or 
differential mobility spectrometry. First developed in 
the USSR in the early 1980s (Buryakov et al. 1993), it 
was successfully coupled with ESTMS by Purves, 
Gmevremont, and coworkers in the late 1990s. (Purves 
et al. 1998) The FAIMS/MS platform was made com¬ 
mercially available in 2003 as a frontend device that 
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could be coupled to a number of mass spectrometers. In 
this technique, ions pass between two electrodes in the 
presence of a tangential gas flow. FAIMS devices are 
usually placed at the source end of mass spectrometers 
where the gas flow may well be due to the drying gas 
requirements of an electrospray source. FAIMS exploits 
the nonlinear dependence of drift speed on acceleration 
field which becomes prominent at high fields for ion 
separation. The application of an asymmetric waveform 
to the electrodes will result in most ions discharging at the 
electrodes. Only slow ions for which the mobility is field 
invariant at the applied waveform amplitude will be 
transmitted through the FAIMS separator (Buryakov 
et al. 1993), whereas other species, especially those 
whose mobility is strongly field dependent, will tend to 
move toward the high-held plate. By scanning over 
a range of waveform amplitudes, separation between 
ions with different high-held mobilities becomes possi¬ 
ble. FAIMS adds to the power of gas-phase separation 
science and can make a major contribution in the analysis 
of complex mixtures by adding an extra dimension that is 
easily controlled. Instrumental developments have 
thrived is this area, with FAIMS devices being integrated 
with IMS-mass spectrometers**^, and an ESTFAIMS-MS 
employing a Paul trap mass analyzer has become com¬ 
mercially available from Thermo-Finnigan. However, 
high-energy collisions experienced during FAIMS have 
been shown to distort the stmctures of gas-phase peptide 
ions (Shvartsburg et al. 2007); therefore, this technique 
can be used for meaningful studies of molecular 
conformation only if detailed models of ion motion in 
this setup are taken into account. 

Resolving Power of IM Devices 

The resolving power of ion mobility apparatuses 
affects directly their applicability. Using a solution to 
the transport equation and an expression for the ionic 
mobility, it can be shown (Revercomb and Mason 
1975; Siems et al. 1994) that the resolution f/Af can 
be approximated by: 

( 11 , 

At \l6kBTln2j 

IMS resolution can be improved by decreasing the 
temperature (a feature of some temperature-controlled 
drift tubes), increasing the drift held, or increasing the 


length of the drift tube. The applied held, E, is limited 
by the density of the buffer gas, N, as the maximum 
held that can be applied must satisfy the low-held 
condition; to increase E further, N must be likewise 
increased. These considerations have lead to the devel¬ 
opment of two varieties of linear-held IMS devices 
coupled to mass spectrometers. Low-resolution drift 
cells operate at low pressures and necessarily propor¬ 
tionally low helds - they are more versatile and can be 
integrated with greater ease into novel arrangements or 
existing mass spectrometers. The device used for 
experiments described in the following chapters is 
one such low-resolution device. High-resolution drift 
cells (Dugourd et al. 1997) operate at pressures similar 
to atmospheric pressure and employ drift potentials of 
the kV scale. These devices cannot be inserted in 
a low-pressure chamber due to the high-pressure dif¬ 
ferential that would otherwise result. Therefore, the 
source must be intimately coupled to the drift region, 
an arrangement that also complicates buffer gas purity. 
Increasing the length of the drift region is an attractive 
approach for improving peak separation. However, the 
majority of the ion swarm becomes too dilute to be 
detected at achievable experiment times, as most of the 
ions discharge at the walls of the cell as radial spread 
surpasses by much the radius of the exit aperture in the 
absence of ion-focusing lenses inside the drift region. 
Nevertheless, with the pairing of such ion-focusing 
methods, such as ion funnels (Shaffer et al. 1997) or 
an RF-field applied simultaneously to the drift field’*, 
this approach came within reach. This advancement 
not only generated high-resolution designs which used 
lower pressures but also led to the development of long 
“tandem” IMS tubes (Koeniger et al. 2006) (with 
different sections of the tube being separated by an 
ion funnel) and circular drift-tube topologies (Bohrer 
et al. 2008) allowing theoretically infinite separation 
and separation-activation cycles. 

Summary 

Ion mobility (IM) spectrometry separates ions 
according their drift properties through a gas under 
the influence of an electric field. When analyzed 
under identical conditions, this behavior is dependent 
upon the charge and collisional cross section of the 
ions, in contrast to mass spectrometry which separates 
ions by mass/charge ratio. The coupling of an IM 
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device to a mass spectrometer provides an additional 
dimension of separation which greatly increases the 
analytical capability of each instrument acting alone. 

Cross-References 

► Ion Mobility Mass Spectrometry: Biological 
Applications 

► Mass Spectrometry: Methods of Ionization (Applied 
to Biopolymers) 
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Synonyms 

Mass spectrometry 


Introduction 

Ion mobility-mass spectrometry (IM-MS) is rapidly 
emerging as an effective tool for providing low- 
resolution structural data. Coupled with electrospray 
ionization (ESI), IM-MS has the potential to examine 
the dynamic and structural features of biological mol¬ 
ecules such as proteins, peptides, and oligonucleotides. 
Soft landing techniques have demonstrated that pro¬ 
teins can still perform their biological function post 
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MS analysis, suggesting that elements of protein struc¬ 
ture are maintained in the gas phase, or only reversibly 
disrupted. ESI-IM-MS allows analysis of small mono¬ 
meric proteins all the way up to large multimeric 
complexes. 

Peptides 

Short peptides demonstrate complex structural behav¬ 
ior in the gas phase. IM-MS, in conjunction with 
molecular modeling, can give insights into local sec¬ 
ondary structure. Jarrold and coworkers have shown 
that two peptides, A 4 G 7 A 4 -i-H'^ and Ac-A 4 G 7 A 4 -i-H'^, 
differing only by the presence of an N-terminal 
acyl-group, behave differently in the gas phase. The 
AC-A4G7A4 peptide was observed to adopt four differ¬ 
ent conformations between 100 and 230 K, including 
a- and 71 -helical structures, a partial helix and 
a globular conformation. The A4G7A4 peptide, how¬ 
ever, adopted a globular conformer only. The lack of 
helical structure was attributed to protonation at the 
N-terminus, destabilizing the helical dipole moment 
(Hartings et al. 2003). 

Peptides can adopt quite different structures in 
solution and in the gas phase. For example, the 
tryptic peptides, LLGNVLVVVLAR and 
HGTVVLTALGGILK, known to be helical in solu¬ 
tion, were observed to adopt a P-hairpin and random 
loop conformation in the gas phase, respectively. This 
was further confirmed by hydrogen-deuterium 
exchange (HDX), and highlighted the structural 
changes that can occur when the local environment is 
changed (Ruotolo and Russell 2004). 

Structural separation of complex mixtures using IM- 
MS has also been explored, enabling the “biomolecular 
inventory” of a specific sample to be analyzed with 
regard to both time (cell cycle, healthy/disease state) 
and space (tissue type, organism). The advantages of 
this method include the ability to distinguish isobaric 
species (i.e., peptides/lipids/carbohydrates of the same 
mass) from cell samples, due to their differing mobility. 
Perm and McLean were able to generate a 2D plot 
depicting the regions where specific types of biological 
molecule will appear, characterizing the conformational 
space indicative of peptides, lipids, carbohydrates, and 
nucleic acids. This method allows greater confidence 
when assigning the identity of molecules from complex 
mixtures (Fenn and McLean 2008). 


Protein Conformation and Dynamics 

Ion mobility lends itself well to studying the confor¬ 
mation and folding dynamics of proteins. Pioneering 
work in the mid-1990s demonstrated that cytochrome 
c exhibits multiple conformations in the gas phase. For 
high charge states, more extended structures were 
observed due to repulsive Coulombic effects. Lower 
charge states adopted compact, more native-like con¬ 
formations. To inform these measurements, theoretical 
Collisional Cross Section (Q) values for unfolded and 
compact structures can be determined from available 
NMR and crystal structures (Shvartsburg and Jarrold 
1996). In addition, molecular modeling can be used 
to generate intermediate folded states to aid in deter¬ 
mining the degree of gas-phase unfolding (Clemmer 
et al. 1995). 

Other early work observed low charge states of the 
100-residue bovine pancreatic trypsin inhibitor (BPTI) 
exhibiting an apparently “over-compact” form. This 
has since been recognized as a general phenomenon 
brought about by a degree of structural collapse in the 
absence of water. Higher charge states steadily pro¬ 
moted an increase in Q, but still only exhibiting 
a single conformation. Collisional activation of BPTI 
did not result in measurable unfolding, implying that 
the folded structure of BPTI is extremely stable in the 
gas phase (Shelimov et al. 1997). 

The gas-phase conformations of protein ions have 
been further probed using multidimensional IMS 
(IMS-IMS). These measurements allow a given con¬ 
formation to be selected and activated, followed by 
further mobility analysis to examine unfolding path¬ 
ways. Studies using this technique have probed the 
unfolding of ubiquitin and determined that partially 
folded intermediates, and even extended structures, 
retain some resemblance to their original structure, 
probably in the form of secondary structure elements 
(Koeniger et al. 2006). 

IM-MS has also been applied to studying protein 
refolding in the gas phase. Shelimov and Jarrold 
published two investigations examining the refolding 
of cytochrome c and apo-myoglobin in vacuo. Both 
proteins were sprayed from an acidified solution pro¬ 
moting protein unfolding, resulting in an extended 
structure, confirmed by high charge state distributions 
and increased drift times. Introduction of MTBD, 
a strong base, into the desolvation region allowed a 
proton stripping reaction to occur, lowering the charge 
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Ion Mobility Mass 
Spectrometry: Biological 
Applications, Fig. 1 Drift 
time distribution for 
cytochrome c (7+). Predicted 
drift times for conformations 
of cytochrome c are indicated 
by arrows and structures. 
More extended conformations 
take longer to traverse the drift 
tube than compact structures, 
resulting in a difference in 
drift time 
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state of the protein. Both cytochrome c and myoglobin 
presented with lower charge states after addition of 
MTBD, which were adopting structures reminiscent 
of a more compact state, confirmed by decreased Q 
and drift time values. Although cytochrome c appeared 
to adopt a compact structure, close to that of the native 
state (Shelimov and Jarrold 1996), myoglobin refolded 
to a more extended structure, requiring collisional 
activation to collapse to a compact conformer. It was 
therefore concluded that, for apo-myoglobin, there is 
an energy barrier that must be overcome to achieve 
the folded gas-phase conformation (Shelimov and 
Jarrold 1997) (Fig. 1). 

Recent studies have endeavored to apply ion mobil¬ 
ity of proteins in a more biologically relevant context. 
An elegant study utilized ion mobility and the calcium¬ 
binding protein, calmodulin, to demonstrate the 
conformational change in the protein upon calcium 
binding. Theoretical Q values from crystal structures 
for both compact and extended structures correlated 
almost perfectly with measured values (Wyttenbach 
et al. 2010). Another example of IM-MS applied to 
a biological setting is the detection of two gas-phase 
conformations of NADPH-cytochrome P450 reductase 
(CPR). In solution, CPR exhibits physiologically rele¬ 
vant conformational flexibility which permits both 
interflavin electron transport and efficient reduction 
of the cytochrome P450 substrate. Detection of com¬ 
pact and extended conformations of CPR was evident 
in 250 mM NH 4 OAC, with the extended species 
becoming further populated when sprayed from 1 M 
NH 4 OAC, demonstrating that the conformation of the 


enzyme could be perturbed by solution ionic strength. 
The relative proportions of these conformations were 
in good agreement with small-angle scattering (SAX) 
measurements (Jenner et al. 2011). 

The biological relevance of protein misfolding is 
apparent in diseases such as Parkinson’s disease, 
whereby the membrane-bound helical structure of 
a-synuclein can be released from the membrane 
into solution. This results in a relatively disordered 
structure which is capable of forming P-sheet-like 
oligomers that form the basis of Lewy bodies. Despite 
its unstructured nature, IM-MS analysis of a-synuclein 
revealed low charge state species ( 6 ~ to 8 ~) adopting a 
compact globular conformation. A transition between 
globular and extended conformations occurred 
between the 8 ~ and 9~ charge states, with an increase 
in Q of 50% (^800 A). The number of charges carried 
by the a-synuclein protein appeared to be an important 
determinant for its conformation. A method for reduc¬ 
ing charge is the ligation of metal ions, and with 
previous data suggesting that the formation of amyloid 
fibrils can be amplified in the presence of metal, the 
authors suggest that binding of a metal ion can induce 
structural collapse of the disordered form into 
a globular conformation capable of forming aggregates 
(Bernstein et al. 2004). 

Protein Assemblies 

Ion mobility can be used to probe the stracture and 
organization of noncovalent protein assemblies. The 
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combination of IM-MS and molecular modeling allows 
confident prediction of the structural organization of 
protein complexes. Ruotolo et al. applied traveling 
wave ion mobility spectrometry (TWIMS) effectively 
to examine the structural organization of the tryptophan 
RNA-binding attenuation protein (TRAP) complex. 
TRAP, which is an 88 kDa undecamer, is known to 
form a ring structure, based on X-ray crystallography 
investigations. The TRAP complex revealed three dis¬ 
tinct conformations in the gas phase: the native ring 
conformer, a compact ring, and a fully collapsed com¬ 
plex. Interestingly, the higher charge state species were 
observed to adopt the collapsed structures, whereas the 
lower charge states presented the native ring structure. 
This observation was thought to arise due to the 
increased internal energy of the high charge state spe¬ 
cies, generated by more energetic collisions with neutral 
buffer gas. It was suggested that the increased internal 
energy enabled the TRAP complex to relax into 
a compact conformation. Increasing the sample cone 
activation energy tested this hypothesis, and the low 
charge state ring conformers began to collapse down 
into compact conformations (Ruotolo et al. 2005). 

Another fascinating application of IM-MS to 
a biological system is the investigation of the GroEL 
molecular chaperone and the complex formed with its 
functional “lid” co-chaperonin GroES. The GroEL/ES 
system facilitates the correct folding of many proteins 
in an array of bacterial systems. Although much is 
known about the function and organization of the 
GroEL/ES complex, details regarding the exact con- 
formation-dependent mechanisms required for protein 
folding remain to be elucidated. It is thought that 
conformational changes within the GroEL core allow 
changes in the initial hydrophobic environment, pro¬ 
moting protein folding. 

GroEL forms a stacked-ringed, tetradecameric struc¬ 
ture, where two rings consisting of seven subunits sit on 
top of each other forming a cavity for protein folding. 
IM-MS revealed structural details of GroEL with a bound 
substrate (gp23), observing no substantial increase in Q 
upon substrate binding. A similar result was observed for 
the GroEL-GroES-gp23 complex, with the additional Q 
resulting from the presence of GroES only. However, 
addition of the charge-reducing solvent imidazole 
resulted in the observation of appreciably lower charge 
states, with a reduced Q value (^5%). 

The Heck lab has applied TWIMS to examine the 
structural organization of viral capsids, the largest 


structures to be examined with this technique. An illus¬ 
trative example of this work is the analysis of the hep¬ 
atitis B virus (HBV) capsid. HBV is known to form two 
types of icosahedral capsids: T = 3, (3 MDa) and T = 4 
(4 MDa), each consisting of different numbers of 
dimeric subunits. These structures can form in vitro 
making them available for MS analysis. IM-MS analysis 
revealed that both the T = 3 and T = 4 stoichiometries 
adopted two distinct conformations, the populations of 
which could not be altered by increased collisional 
activation or charge state effects, and it has been 
suggested that these are relics of the ionization process 
(Uetrecht et al. 2008). Eurther progression of this work 
led to the characterization of small oligomers from 
HBV, adopting the sheet-like conformation required 
for viral capsid assembly (Uetrecht et al. 201 1). 

Smaller noncovalent protein aggregates have also 
been observed by IM-MS; these are often proteins 
involved in protein misfolding diseases. Studies on 
P 2 -microglobulin (P 2 ni)> which is known to form amy¬ 
loid fibrils in individuals on hemodialysis, was 
observed to adopt three configurations by IM-MS, the 
native, partially folded, and acid-unfolded. The rela¬ 
tive abundance of each of these conformations could 
be dictated by changes in the pH value. A pH of ^6.28 
yielded a fully native P 2 m conformation, incapable of 
aggregating into fibrils. Adjusting the pH to more 
acidic conditions led to the observation of higher 
charge states with larger Q values. However, at pH 
4.28, a conformation with similar Q value, but with 
a higher susceptibility to protonation, was observed by 
IM-MS. This suggested the presence of partially folded 
intermediates, which are thought to be key to the 
formation of amyloid fibrils (Smith et al. 2007). 

Oligonucleotides 

Clemmer and coworkers first examined the conforma¬ 
tions of nucleic acids in the gas phase in 1997. Interest in 
the gas-phase configuration of such molecules stemmed 
from utilizing mass spectrometry to sequence lengths of 
nucleic acids by MS/MS techniques. It is proposed that 
the conformation of oligonucleotides affects the fragmen¬ 
tation sites within the molecules. The phosphodiester 
linkage in nucleic acids is electronegative by nature, 
and will typically produce negative ions. A simple 10- 
residue oligothymidine (T lo) sequence was measured by 
IM-MS, which highlighted extended conformations 
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occurring in the higher charge state species, which 
is presumably due to Coulombic repulsion effects. 
It was concluded from modeling that the unfolding 
profile observed htted best to distribution deprotonation 
sites, as opposed to neighboring sites (Hoaglund et al. 
1997). 

Summary 

IM-MS is rapidly developing as a powerful analytical 
tool for studying complex biological molecules. As 
more studies compare biomolecular structure in the 
solution and gas phases, a better understanding of the 
capabilities and limitations of IM-MS in structural 
biology is emerging. 


Cross-References 

► Comparative Protein Structure Modeling 

► H/D Exchange 

► Ion Mobility Mass Spectrometry - Principles 

► Mass Analysis 

► Mass Spectrometry: Application to Protein-Ligand 
Interactions 

► Mass Spectrometry: Methods of Ionization (Applied 
to Biopolymers) 

► Protein Circular Dichroism Analysis 

► Protein-Protein Interactions Studied by Mass 
Spectrometry 
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Synonyms 

FeS clusters 


Definition 

Iron-sulfur clusters are polynuclear inorganic cofac¬ 
tors composed of clusters of ferric (Fe^"^) and ferrous 
(Fe^"^) cations and sulfide (S^~) anions. 

Basic Characteristics 

The most common FeS cluster geometries are the 
rhomboidal [Fe 2 S 2 ] cluster and the cuhoidal [Fe 3 S 4 ] 
and [Fe 4 S 4 ] clusters (Beinert et al. 1997; Venkateswara 
Rao and Holm 2004). (Various notations commonly 
used to designate the nuclearity and charge of 
a cluster include; (Fe 2 S 2 )^'", [Fe 2 S 2 ]^'', or [2Fe-2S]^'". 
Explicit inclusion of anionic ligands would alter the 
stated charge, e.g., [Fe 2 S 2 (SCys) 4 ]^“ would designate 
a [Fe 2 S 2 ]^^ cluster coordinated by four cysteine 
thiolate ligands.) These clusters are coordinated within 
proteins by coordination bonds between the 
ions and various heteroatom-containing protein resi¬ 
dues, especially cysteine. FeS clusters are commonly 
found as an ► electron transfer cofactor in small 


proteins known as ferredoxins, as well as in ► electron 
transport chains or pathways within a variety of redox 
enzymes (Beinert 2000). Within a protein environ¬ 
ment, the clusters typically have only two accessible 
redox states and thus function as one-electron accep¬ 
tors and donors. FeS clusters can also have non-redox 
functions in proteins (Flint and Allen 1996), including 
chemical catalysis, signaling and transcriptional regu¬ 
lation, as well as general electrostatic and structural 
roles. 

Structures and Properties 

The structures of FeS clusters are largely dictated by 
the preference of ions for a tetrahedral coordi¬ 
nation geometry, in which is at the center and 

the S^“ ions and protein ligands are at alternating 
corners of a cube (Fig. 1). Although technically not 
an iron-sulfur cluster, the rubredoxin cofactor is a 
single ion coordinated by four cysteine thiolate 

ligands in a tetrahedral geometry, and is often 
referenced as a simple model for FeS cluster proper¬ 
ties. [Fe 2 S 2 ] ^ clusters contain two Fe^"^ ions and 
two S^~ ions in a planar rhombus, while the coordinat¬ 
ing protein ligands bond to Fe in a plane perpendicular 
to the rhombus. Symmetrical clusters with four cyste¬ 
ine thiolate ligands have Fe-S bond lengths in the 
range 2.2-2.3 A, while the Fe-Fe distance is in 
the range 2.70-2.75 A. One common variant found 
in nature is known as the Rieske iron-sulfur cluster, 
in which one Fe is coordinated by nitrogen atoms from 
two histidine residues, and the shorter Fe-N bonds 
(^2.1 A) result in a distortion of the otherwise sym¬ 
metric rhombus. 

Larger FeS clusters found in nature generally con¬ 
sist of one or more cuhoidal (or cubane-like) struc¬ 
tures. The [Fe 4 S 4 ]^'^ cluster consists of 2 Fe^"^ ions 
and 2 Fe^"^ ions in complex with 4 S^“ ions, with the 
atoms at the corners of a distorted cube, and the cluster 
is typically ligated by four protein residues coordinated 
to each of the Fe sites. Fe-S bond lengths and Fe-Fe 
distances are essentially indistinguishable from the 
range found in [Fe 2 S 2 ] clusters. The [Fe 4 S 4 ] cube 
can be thought of as a fusion of two [Fe 2 S 2 ]^ rhombs 
that have unique nondelocalized electronic properties 
that can be distinguished using Mossbauer spectros¬ 
copy. The [Fe 3 S 4 ]'^ cluster is generally found in 
a cuhoidal geometry that appears similar to a [Fe 4 S 4 ] 
cluster with one of the Fe atoms removed. 
Linear [Fe 3 S 4 ] clusters have also been noted as 
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Iron-Sulfur Clusters, Fig. 1 Depictions of common FeS clus¬ 
ter structures: Fe^’"^ from Clostridium pasteurianum rubredoxin 
(pdb 4RXN); [Fe 2 S 2 ]^'^ from Anahaena PCC7119 ferredoxin 
(pdb ICZP); [Fe 3 S 4 ]'^ from Azotohacter vinelandii ferredoxin 
I (pdb 6FD1); [Fe 4 S 4 ]^'^ ixom Azotohacter vinelandii nitrogenase 
Fe protein (pdb 1FP6). Only the side-chain (S-Cp-Co,) of each 
cysteine thiolate ligand is shown. Color legend: brown, iron; 
yellow, sulfur; cyan, carbon; hydrogen atoms are not shown 

products of oxidative degradation but are not com¬ 
monly found in native proteins. Larger FeS clusters 
and mixed clusters that incorporate other metals have 
been observed in specialized enzymes (Johnson et al. 
2005). For example, nitrogenase contains an [FegS?] 
cluster (the P cluster) and a [MoFe7S9(0/N)] cluster 
(the FeMoCo cluster), carbon monoxide dehydroge¬ 
nase contains a [NiFe 4 S 4 ] cluster, and acetyl CoA 
synthase contains an [Fe 4 S 4 ]^'^'^'^ cluster bridged via 
a cysteine thiolate ligand to a hinuclear Ni cofactor. 
In each of these examples, the remnant cuboidal 
[Fe 4 S 4 ] or [Fe 3 S 4 ] cluster is incorporated into a larger 
cluster by bridging sulhde or thiolate ligands. 

FeS clusters are predominantly found in nature as 
electron transfer cofactors. The typical range of redox 
potentials expected for various cluster types is given in 
Table 1 (Stephens et al. 1996). Most iron-sulfur clus¬ 
ters have low midpoint potentials that allow these 
cofactors to participate in electron transfer chains. In 
general, [Fe 4 S 4 ]^’^ clusters undergo reduction at very 
low potentials in the range —300 to —650 mV, while 
[Fe 2 S 2 ]^'^ and [Fe 3 S 4 ]’^ clusters are reduced at slightly 
higher potentials in the range from —100 to —450 mV. 
A small subset of [Fe 4 S 4 ] clusters, known as HiPiP 
clusters, instead undergo oxidation to an [Fe 4 S 4 ] 


cluster with potentials in the range from -1-120 to 
-h 360 mV. 

FeS clusters in which one of the coordination sites 
to an Fe atom remains open can participate in chemical 
catalysis, where the FeS cluster usually functions 
as a Lewis acid (Flint and Allen 1996). The most 
thoroughly characterized example is aconitase, 
a hydrolase that catalyzes isomerization of citrate to 
isocitrate via dehydration to cA-aconitate. The resting 
enzyme contains an [Fe 4 S 4 ]^'^ cluster coordinated by 3 
cysteine thiolate ligands and one hydroxide at the 
unique Fe site. The substrate also binds to the unique 
Fe site, where coordination of the citrate hydroxyl 
group facilitates Lewis acid catalysis of the dehydra¬ 
tion/hydration reaction. 

[Fe 4 S 4 ] and [Fe 3 S 4 ] clusters are typically degraded 
in air and aerobic protein purification will often result 
in depletion or complete loss of the native FeS cluster. 
This effect is more rapid if the cluster has an open 
coordination site, or if a reductant is present that can 
convert O 2 to the strong oxidant superoxide (O 2 " ). 
Furthermore, once the FeS cluster is degraded, the 
liberated cysteine thiolate ligands can he irreversibly 
oxidized to cysteine sulfenic and sulhnic acids. Since 
these oxidized cysteine residues will not bind FeS 
clusters, the result is loss of cofactor binding and 
protein function. 

Physical Characterization 

In a new system where an FeS cluster is observed or 
suspected, information regarding the cluster type 
and redox properties is needed to establish function. 
Various spectroscopic techniques have been applied 
to probe the electronic structure of FeS clusters 
(Spiro 1982; Que Jr 2000), and depending on instru¬ 
ment availability, many of these techniques can be 
used to establish cluster type and redox status (sum¬ 
marized in Table 1). 

Initial characterization of an FeS cluster should 
begin with a careful analysis of the and S^~ to 

protein stoichiometry. The UV/visible electronic 
absorption spectrum can also provide limited informa¬ 
tion. A protein containing only an [Fe 2 S 2 ]^^ cluster is 
brown, stable to air, and exhibits characteristic broad 
maxima near 330, 420, and 460 nm, although the 
relative intensities of these peaks can vary widely. 
Proteins containing either [Fe 3 S 4 ]’^ or [Fe 4 S 4 ]^'^ 
clusters exhibit very broad spectra with shoulders at 
380-430 nm. 
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Iron-Sulfur Clusters, Table 1 Typical physical constants and spectroscopic parameters associated with each type of FeS cluster. 
The range of values is based on studies of clusters found in bacterial ferredoxins. {n.d. not determined, n.s. no signal) 


Cluster type 

value (reduction) 
(mV) 

UV/visible absorption 
(2max, nm) 

EPR parameters 
g|| or gav) 

Mossbauer parameters 


S (mm/s) 

AEq (mm/s) 

[Fe^S^]"" 

-440 to-175 

330, 420, 460 

n.s. 

0.25-0.3 

0.5-1.0 

[Fe^S^]" 

n.d. 

550 

1.94, 2.02 

0.25, 0.5-0.7 

0.6, 2-4 

[Fe3S4]-" 

-425 to-130 

400-430 

2.01 

0.25-0.3 

0.5-0.6 

[Fe3S4f 

n.d. 

n.d. 

n.s. 

0.3, 0.45-0.5 

0.5, 1.5 

[Fe4S4f" 

+120 to +360 

310, 380, 450 

2.03, 2.14 

0.25-0.3, 0.4-0.45 

0.8-0.9, 0.9-1.1 

[Fe4S4f-" 

-650 to -280 

380-400 

n.s. 

0.45-0.5 

1.0-1.5 

[Fe4S4]'" 

n.d. 

n.d. 

1.94, 2.02 

0.45-0.5, 0.55-0.7 

1.0-1.5, 1.5-2 


Assuming the FeS cluster is stable to reduction or 
oxidation, electron paramagnetic resonance (► EPR) 
spectroscopy can provide complementary information. 
Neither [Fe 2 S 2 ]^'^ nor [Fe 4 S 4 ]^'^ clusters are detectable 
with standard perpendicular mode EPR spectrometers. 
Flowever, reduction of these clusters, for example, 
with sodium dithionite (Na 2 S 204 ), produces [Pe 2 S 2 ]^ 
and [Fe 4 S 4 ]'^ clusters that usually exhibit characteristic 
axial EPR spectra with gy « 2.02 and « 1.94. The 
[Fe 2 S 2 ]^ cluster usually undergoes slow magnetic 
relaxation, and as a result, the spectrum can be detected 
at temperatures up to 50 K. In contrast, the [Pe 4 S 4 ]’^ 
cluster is fast relaxing and can usually only be detected 
at <20 K. The as-isolated [Pe 3 S 4 ]’^ cluster exhibits 
a sharp nearly isotropic EPR signal centered at 
g « 2.01 that disappears upon reduction. The HiPiP 
[Fe 4 S 4 ] cluster usually exhibits an axial spectrum 
with g|| « 2.14 and gj^ « 2.03. Thus, using 
a combination of UV/visible and EPR spectra, one 
can often establish the major cluster type present. 

The preferred method for examining the complete 
Fe environment within a protein is Mdssbauer spec¬ 
troscopy, which requires ^^Fe enrichment, but can 
potentially provide a wealth of information regarding 
cluster type, oxidation state, and electronic structure of 
the cluster. A zero- or low-magnetic-held Mdssbauer 
spectrum appears as one or more overlapping doublets, 
with each doublet characterized by the centroid isomer 
shift ((5) and the quadrupole splitting parameter (AEq). 
Each unique Fe or valence-delocalized Fe system gives 
a unique characteristic doublet. The oxidation state of 
Fe can be assigned based on the isomer shift, with 
S « 0.25 mm/s for Fe^"^, 5 « 0.70 mm/s for Fe^"^, and 
with intermediate values for mixed valence systems. 
The specific cluster type can often be assigned based 
on the zero-field quadrupole splitting parameter, with 


[Fe 2 S 2 ]^^ and [Fe 3 S 4 ]'^ clusters exhibiting narrow split¬ 
ting (A£q « 0.5-1 mm/s) characteristic of high-spin 
Fe^"^, while [Fe 3 S 4 ]° and [Fe 4 S 4 ]^’^ exhibit broader 
splitting (A£q « 1-2 mm/s) due to the presence of 
a valence-delocalized Fe^'^'^Fe^'^'^ pairs. Detailed 
information regarding the electronic structure of FeS 
clusters can be obtained by analysis of Mossbauer 
spectra obtained in high magnetic fields. 

Other methods that can provide additional information 
regarding the electronic stmcture and ligand environment 
of FeS clusters include circular dichroism and magnetic 
circular dichroism spectroscopy, pulsed EPR methods 
including electron-nuclear double resonance (ENDOR) 
spectroscopy and electron spin-echo envelope modula¬ 
tion (ESEEM) spectroscopy, Raman or ► resonance 
Raman spectroscopy, and x-ray absorption spectroscopy. 

Once the cluster type and oxidation states have been 
established, the electrochemical midpoint potential for 
reduction or oxidation (E^, ► redox potential) of the 
cluster can be useful in assigning a redox or non-redox 
role for the cluster. Typically this is accomplished by 
recording UV/visible or EPR spectra of protein samples 
while slowly reducing with sodium dithionite or oxidiz¬ 
ing with sodium ferricyanide. The cell potential is deter¬ 
mined using microelectrodes, and various organic 
electron transfer mediators are added to facilitate more 
rapid redox equilibration of the system. Spectral 
changes as a function of cell potential can be analyzed 
using the Nernst equation to obtain the redox potential 
of the EeS cluster. 

Cross-References 

► Electron Transfer Cofactors 

► Electron Transfer Proteins: Overview 
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ISE (Ion-Selective Electrode) 


► Electron Transport Chains in Bacteria 

► EPR 

► Mitochondrial Electron Transport 

► NADH-Ubiquinone Oxidoreductase (Complex I) 

► Redox Potential 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 
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Synonyms 

Ab initio molecular modeling; Infrared difference 
spectroscopy of proteins; Infrared spectroscopy; 
Protein structure; Raman spectroscopy; Vibrational 
studies of structure of protein-ligand complexes 


Definitions 

Vibrational spectroscopy is a high-resolution tool 
when applied to protein structure. For many mechanis¬ 
tic problems, this approach offers unique and unbeat¬ 
able information. 

Introduction 

Insights into the structural properties of proteins often 
come from X-ray and NMR studies, which provide 
detailed and complete images of the protein’s atoms. 
For many problems, however, the resolution of these 
approaches is insufficient for detailed mechanistic stud¬ 
ies. Other spectroscopic methods, such as those 
described here, can often complement these methods 
in two different directions. First, FTIR and Raman spec¬ 
troscopic methods augment the resolution of protein 
images significantly with respect to critical specific fea¬ 
tures: critical bond lengths to very high resolution (to ca. 
±0.005 A), ionic states of molecular groups, and critical 
bond polarization. Second, fast time-resolved IR tech¬ 
niques under development can bridge the gap between 
these methods by using this structurally sensitive probe 
to determine the time-evolution of specific chemical 
bonds within protein-ligand complexes. These consid¬ 
erations are particularly relevant to studies of enzymatic 
catalysis. The studies on the dynamic attributes to the 
enzymatic catalysis are reviewed elsewhere (Callender 
and Dyer 2006). 

Here we focus on the determination of the protein- 
ligand structures in enzyme catalysis using isotope 
edited Raman and infrared difference spectroscopy. 
For a review on reaction-induced difference spectros¬ 
copy, see the chapter by Andreas Barth and references 
therein. For a reference on Raman difference spectros¬ 
copy, see Callender and Deng 1994. 

Vibrational Structure of Molecular Groups Within 
Proteins 

This is a two-step process; (1) Determine the vibrational 
spectrum from the specific molecular group(s) of inter¬ 
est. Since the Raman and IR spectra of proteins are 
“crowded,” some means must be devised to discern 
the embedded group’s spectrum from the protein 
background. A typical protein we study contains a few 
hundred amino acid residues and has an average molec¬ 
ular weight of about 20-80 kDa. To discern a particular 
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band from the protein background with a signal-to-noise 
ratio of 10:1 requires a signal-to-noise ratio in excess of 
300:1. This is possible employing isotope edited differ¬ 
ence Raman and FTIR spectroscopy. (2) Extract struc¬ 
tural information based on the observed vibrational 
bands, perhaps especially their changes upon some 
event, like ligand binding. For this, molecular modeling 
techniques based on ab initio normal mode analyses (cf., 
Deng et al. 1992a, b) and/or empirically derived struc¬ 
ture/frequency relationships are typically employed (cf., 
Deng et al. 1998c; Cheng et al. 2002). 

The reason that vibrational spectroscopy yields use¬ 
ful information on a quantitative basis is that the 
observed vibrational frequencies arise from interatomic 
force constants; these are a measure of the distribution of 
the electrons within the molecule. This electronic distri¬ 
bution is disturbed, in some cases, by protein-ligand 
interactions in direct proportion to the degree of inter¬ 
action. For example, the strengths of hydrogen bonds in 
model systems can be determined directly from linear 
free energy correlations between frequency measure¬ 
ments and the AH of formation of the H-bonds (Thijs 
and Zeegers-Huyskens 1984; Deng and Callender 
1999). Besides the determination of electron density in 
a chemical bond, vibrational spectroscopy can also be 
used to relate molecular structure/conformation to the 
size of interactions between molecular groups, espe¬ 
cially when complemented by vibrational analysis 
based on quantum chemical calculations. Thus, this 
spectroscopic method is well suited, for example, to 
assess substrate activation in enzyme catalysis. For 
example, in work on phosphoryl transfer processes, 
catalytic rate enhancements over 12 orders of magnitude 
are correlated with changes in bond length of a key bond 
of just 0.03 A (Cheng et al. 2002). Vibrational spectros¬ 
copy easily determines the ionization state of molecular 
groups, which, for example, determined the molecular 
mechanism in dihydrofolate reductase (Chen et al. 
1994). 

Isotope Edited Difference Raman and FTIR 
Spectroscopy 

Raman difference spectroscopy, where the vibrational 
spectrum of the bound ligand is obtained by subtraction 
of the apo-protein spectrum from that of the protein- 
ligand complex, has been used for various enzyme sys¬ 
tems. Isotope editing extends the application of Raman 
difference spectroscopy to systems in which measure¬ 
ments of the apo-protein spectrum is not a viable option 


because the bound ligand induces too many protein 
conformational changes that show up in the difference 
spectrum or in cases where the apo-protein is unstable. 
A difference spectram is formed between two protein- 
ligand complexes, one of which is specifically labeled 
with a stable isotope. Vibrational modes which are asso¬ 
ciated with the isotopic tag show as spectral shifts in 
the difference spectmm, while all other bands cancel. 
Isotope edited Raman difference techniques (and other 
difference approaches) have been reviewed previously 
(cf., Callender and Deng 1994; Deng and Callender 
1999). By employing isotope-editing techniques, single 
specific molecular groups embedded within a large pro¬ 
tein complex are easily determined (e.g., phosphate 
within a ca. 60 kDa protein, Deng et al. 1993). 

Described here in some detail are studies of isotope 
edited difference FTIR. One might expect to observe the 
bound ligand FTIR spectrum in a protein by using the 
difference between the spectram of the protein-ligand 
complex and the spectram of the protein alone. Unfor¬ 
tunately, as for Raman difference studies, that method 
only works when the ligand IR bands are in a region free 
of protein bands, such as that demonstrated in a classical 
FTIR study on the dihydroacetone phosphate bound 
in triosephosphate isomerase (Belasco and Knowles 
1980). In general, however, the ligand-induced protein 
band changes dominate in such difference spectra, and 
the IR bands from the bound ligand cannot be observed. 
This is illustrated in Fig. 1. Spectrum a in Fig. 1 shows 
the FTIR spectram of LDH/NADH binary complex 
(LDH: lactate dehydrogenase; NADH: reduced 
P-nicotinamide adenine dinucleotide) and b shows that 
of the LDH/NADH/oxamate ternary complex (oxamate 
is a substrate mimic of LDH). Their difference spectram 
is shown in c. The major band intensities in spectram c are 
about 50 mOD, at least 10 times more intense than those 
from the expected bound ligand bands (e.g., see Fig. 2). 
Spectram d was obtained when both positions of the 
sample shuttle (which enables alternating measurements 
between two different samples without opening the spec¬ 
trometer) were occupied by the same LDH/NADH/ 
oxamate complex. The resulting difference spectram 
serves as a control experiment for the expected noise 
level (about 0.2 mOD, Fig. 2d) in our isotope edited 
difference measurements. 

Although this attempt failed to observe the bound 
oxamate spectrum in c, there are several useful obser¬ 
vations can be made by the studies shown in Fig. 1. 
One is that longer signal averaging will not likely 
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Isotope Edited Raman and Infrared Difference Spectros¬ 
copy Studies of Protein Structure, Fig. 1 The FTIR spectra 
of (a) LDH/NADH binary complex; (b) LDH/NADH/oxamate 
ternary complex; (c) difference between b and a; and (d) differ¬ 
ence between two LDH/NADH/oxamate complexes, which 
should show up as a null. The intensity for spectrum (d) is 
multiplied by a factor of 100. All samples were prepared in 
D 2 O buffer with 50 mM triethanolamine, 5 mM fructose 1,6- 
bis phosphate, and 150 mM NaCl at pH7.4 (pH meter reading); 
the measurements were conducted at 295K. IR cells consisted of 
CaF 2 windows with 15-pm teflon spacers 


improve observation of the bound ligand IR bands 
since the noise level is already signihcantly lower 
than the expected ligand band intensities. The other is 
that the large protein bands shown in c indicate a strong 
interaction between the LDH/NADH complex and 
oxamate, resulting in profound changes of the protein 
bands due to the hydrogen bonding network changes, 
and concomitant shifts in frequencies of local groups, 
within LDH upon oxamate binding. For example, the 
negative peak at 1,659 cm“* in spectrum c (Fig. 1) 
likely contains contributions due to such effect. 

The results shown in Fig. 1 suggest that, in isotope 
edited difference FTIR studies, the focus should be on 


the reduction of the protein “artifacts” caused by 
unmatched protein-ligand concentrations of the two 
samples. Such artifacts in the difference can result 
from a number of factors yielding a signal imbalance 
between the two samples, such as a small difference in 
the oxamate and/or NADH concentrations, a small 
difference in the hydrogen concentration in deuterated 
samples, small temperature variations in the two sam¬ 
ples, and even equilibrium binding isotope effects. 
Although these “artifacts” may appear as noise in the 
difference spectrum, they cannot be reduced by 
increasing signal averaging time. Thus, we adopted 
a trial-and-error approach to reduce iteratively such 
protein “artifacts” until the IR bands from the 
bound ligand can be assigned conhdently. This is 
described below. 

An approach to ensure the correct assignments of 
the bands related with the motions of the labeled atom 
is to vary the subtraction factor in each iteration of 
the difference measurements, as illustrated in Fig. 2, 
which shows the isotope edited difference spectra 
between LDH/NADH/oxamate and LDH/NADH/ 
[*^Cl]oxamate (Callender and Deng 1994). Spectra 
from the unlabeled and labeled compounds are 
measured. Before subtraction, the intensities of the 
two spectra are normalized with respect to the domi¬ 
nating protein bands so that the trial subtraction 
factor is close to one. In Fig. 2, the top curve shows 
the slightly under-subtracted difference spectrum 
while the bottom curve shows the slightly over¬ 
subtracted difference spectrum. In these curves, the 
vibrational modes related with the C1 motions appear 
as positive-negative pairs; they do not change their 
intensities substantially or disappear with the variation 
of the subtraction factor, as demonstrated by the 
1,581-1,569 and 1,646-1,611 cm“' pairs. In addition, 
their band positions in all iterations were very repro¬ 
ducible, well within 0.5 cm”*, at least for the three 
sharp bands at 1,581, 1,569, and 1,646 cm”*. On the 
other hand, the protein bands generally change their 
appearance (frequency and/or intensity) greatly as sub¬ 
traction factor is slightly changed. Those protein “arti¬ 
facts” due to slightly unmatched samples, typically 
resulting from small differences in sample concentra¬ 
tion or optical alignment, most likely appear only in 
one side (unlabeled or labeled side) of the difference 
spectrum, and their intensities vary significantly from 
one iteration to the other even with the same subtrac¬ 
tion factor (e.g., the 1,545 cm”* band). Based on these 
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Wavenumber/cm“i 

Isotope Edited Raman and Infrared Difference Spectros¬ 
copy Studies of Protein Structure, Fig. 2 The difference 
FTIR spectra between LDH/NADH/oxamate complexes (rela¬ 
tive concentrations of the components are the same within each 
cuvette) with unlabeled and ’^Cl labeled oxamate using three 
different subtraction factors. Top cun’e: subtraction factor 0.99; 
middle curve: 1.00; and bottom cun’e: 1.01 


observations, it is argued that the hands due to bound 
oxamate in the LDH/NADH/oxamate complex can he 
assigned even when protein “artifacts” are still present. 

From Molecular Vibrational Spectra to Structure 

Due to specific protein interactions, the structure of 
a ligand can be greatly distorted upon binding. This has 
been demonstrated by the studies of phosphate bound 
in an enzyme transition state analog, PNP*ImmH*P 04 
complex (Deng et al. 2004; PNP, purine nucleoside 
phosphorylase; IMMH, a potent inhibitor of PNP). 
Figure 3 shows the isotope edited difference 
FTIR and Raman spectra of HP 04 ^" in solution and in 


the protein complex. For HP 04 ^“ in water, the three 
P»*0 stretches are coupled to form a symmetric stretch 
at 990 cm”*, which is active in both IR and Raman 
(Fig. 3a left and right). A twofold degenerate IR active 
asymmetric stretch mode appears at 1,084 cm”* 
(Fig 3a). Since the P-OH stretch is weak in Raman 
and water absorbs strongly below 900 cm” * in the IR 
spectrum, additional FTIR experiments on phosphate 
in D 2 O revealed the P-OD stretch mode for HP 04 ^” is 
at 834 cm”*, and it shifts slightly to 846 cm”* in the 
PNP*ImmH»P 04 complex (data not shown). 

The three bands observed in the difference IR spec¬ 
tra of the PNP*ImmH»P 04 complex shown in Fig. 3b 
(left), at 970, 1,016, and 1,119 cm”* for *®0-labeled 
phosphate and at 934, 991, 1,084 cm”* for *^0 
phosphate, arise from P»*0 stretches of the PO 4 moiety 
and lie in the frequency range consistent with dianionic 
phosphate (and not for H 2 PO 4 '). These three bands 
are highly unusual and indicate a highly distorted 
phosphate moiety in PNP»ImmH*P 04 . They can be 
assigned to an uncoupled P»*0 stretch mode and 
a coupled symmetric-asymmetric P**0 stretch pair. 
One of the P»*0 bonds is strongly polarized, causing 
its stretch motion to be decoupled from the stretches of 
the other two P»*0 bonds. The uncoupled P**0 stretch 
is active in both IR and Raman spectra, while the other 
two P»*0 stretches are coupled to form an IR active 
asymmetric stretch and an IR and Raman active 
symmetric stretch. 

The idea that decoupling of the P*»0 stretch 
motions requires strong interactions is tested quantita¬ 
tively by normal mode analysis using ab initio calcu¬ 
lations on dianionic phosphate with or without external 
hydrogen bonding/electrostatic interactions on one of 
the P»*0 bonds. The calculations indicate that one 
strong hydrogen bond with an 0-0 distance of 2.5 A 
can significantly change the asymmetric P»»0 stretch 
but not the symmetric P*»0 stretch. Similar results 
are obtained for electrostatic interaction of a cation 
(charge +1) 3.0 A from the P»*0 oxygen or two hydro¬ 
gen bonds with bonding distances of 2.5 and 2.8 A to 
one P*»0 oxygen. In these calculations, the bond 
length of the polarized P**0 bond is 0.02-0.03 A 
longer than the other two P»*0 bonds. When the inter¬ 
actions on one of the P»»0 oxygens include both a 2.5 A 
hydrogen bonding and a 3.0 A ion pair interaction, the 
stretch motion of this P»*0 bond becomes decoupled 
from the other two P»*0 bond stretches, and its fre¬ 
quency becomes lower than that of the symmetric 
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Isotope Edited Raman and 
Infrared Difference 
Spectroscopy Studies of 
Protein Structure, 

Fig. 3 The difference FTIR 
{left panel) and difference 
Raman (right panel) spectra 
between P *®04 and P **04 (a) 
in aqueous solution at pH 9.5 
(dianionic) and (b) in the 
PNP*ImmH*P 04 complex 
(4 mM:4 mM:4 mM, pH 7.2). 
The positive bands are from 
P *'’04 and the negative bands 
are from P’** 04 . The 
resolutions for IR and Raman 
spectra were 2 and 8 cm^*, 
respectively 




Wavenumber/cm-1 Wavenumber/cm-i 


stretch in diaionic phosphate. Here, the bond length of 
the polarized P»*0 bond is 0.04-0.05 A longer than the 
others. 


Summary 

This review describes the isotope edited Raman and 
FTIR spectroscopy techniques and some brief examples 
of their application in the structural determination of 
molecular groups in protein, in conjunction with molec¬ 
ular modeling based on ah initio normal mode analysis 
and empirical relationships. Examples are given to show 
that for molecular groups with high symmetry, such as 
phosphate, both Raman and FTIR measurements are 
required in the determination of the molecular structure 
under high stress in protein environment. A number of 
mechanistic studies on many enzyme systems have been 
performed using this approach, including lactate dehy¬ 
drogenase (Deng et al. 1994; Chen et al. 2002); 
dihydrofolate reductase (Chen et al. 1994; Deng and 
Callender 1998), ras (Wang et al. 1998), ribonucleoase 
(Deng et al. 1998a), PNP (Deng et al. 2010), and myosin 
(Deng et al. 1998b). 

Cross-References 

► Infrared Spectroscopy of Protein Structure 

► NMR-based Structural Proteomics 


► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 

► Vibrational Spectroscopy, A Short History of 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 

References 

Belasco JG, Knowles JR. Direct observation of substrate distortion 
by triosepbospbate isomerase using Fourier transform infrared 
spectroscopy. Biochemistry. 1980;19:472-7. 

Callender R, Deng H. Non-resonance Raman difference spectros¬ 
copy: a general probe of protein structure, ligand binding, enzy¬ 
matic catalysis, and tbe structures of other biomacromolecules. 
Annu Rev Biophys Biomol Stmct. 1994;23:215^5. 

Callender RH, Dyer RB. Advances in time-resolved approaches 
to characterize the dynamical nature of enzymatic catalysis. 
Chem Rev. 2006; 106:3031^2. 

Chen Y-Q, Rraut J, et al. Determination by Raman spectroscopy 
of the pKa of N5 of dihydrofolate bound to dihydrofolate 
reductase: mechanistic implications. Biochemistry. 1994;33: 
7021-6. 

Chen Y-Q, van Beek J, et al. Vibrational structure of NAD(P) 
cofactors bound to several NAD(P)-linked enzymes: an 
investigation of ground state activation. J Phys Chem. 
2002:106:10733^0. 

Cheng H, Nikolic-Hughes I, et al. Environmental effects 
on phosphoryl group bonding probed by vibrational spectros¬ 
copy: implications for understanding phosphoryl transfer and 
enzymatic catalysis. J Am Chem Soc. 2002;124:11295-306. 







Isotope Labeling 


1161 


Deng H, Callender R. The structure of dihydrofolate 
when bound to dihydrofolate reductase. J Am Chem 
Soc. 1998;120:7730-7. 

Deng H, Callender R. Raman spectroscopic studies of the struc¬ 
tures, energetics, and bond distortions of substrates bound to 
enzymes. Meth Enzymol. 1999;308:176-201. 

Deng H, Burgner J, et al. Raman spectroscopic studies of the 
effects of substrate binding on coenzymes bound to lactate 
dehydrogenase. J Am Chem Soc. 1992a;114:7997-8003. 

Deng H, Zheng J, et al. A vibrational analysis of the catalytically 
important C4-H bonds of NADH bound to lactate or malate 
dehydrogenase: ground state effects. Biochemistry. 
1992b;31:5085-92. 

Deng H, Ray WJB, John W, et al. Comparison of vibrational 
frequencies of critical bonds in ground state complexes and 
in a vanadate-based transition state complex of muscle phos- 
phoglucomutase. Mechanistic implications. Biochemistry. 
1993;32:12984-92. 

Deng H, Zheng J, et al. Source of catalysis in the lactate dehy¬ 
drogenase system: ground state interactions in the enzyme 
substrate complex. Biochemistry. 1994;33:2297-305. 

Deng H, Burgner J, et al. The structure of the ribonuclease 
uridine-vanadate transition state analog complex by Raman 
difference spectroscopy: mechanistic implications. J Am 
Chem Soc. 1998a;120:4717-22. 

Deng H, Wang J, et al. Raman difference spectroscopic studies 
of the myosin SI MgADP vanadate complex. Biochemistry. 
1998b;37:10972-9. 

Deng H, Wang J, et al. Relationship between bond stretching 
frequencies and internal bonding for [*'’ 04 ] and [** 04 ] 
phosphates in aqueous solution. J Phys Chem B. 1998c; 
102:3617-23. 

Deng H, Lewandowicz A, et al. Activating the phosphate 
nucleophile at the catalytic site of purine nucleoside 


phosphorylase: a vibrational spectroscopic study. J Am 
Chem Soc. 2004;126:9516-7. 

Deng H, Callender R, et al. Pyrophosphate activation in hypo- 
xanthine-guanine phosphoribosyltransferase with transition 
state analogue. Biochemistry. 2010;49:2705-14. 

Thijs R, Zeegers-Huyskens T. Infrared and Raman studies of 
hydrogen bonded complexes involving acetone, acetophenone, 
and benzopenone-I. Thermodynamic constants and frequency 
shifts of the vOH and vC = O stretching vibrations. 
Spectrochim Acta. 1984;40A(3):307-13. 

Wang JH, Xiao DG, et al. Raman difference studies of GDP and 
GTP binding to c-Harvey ras. Biochemistry. 1998;37: 
11106-16. 


Isotope Fractionation 

► Kinetic Isotope Effects 


Isotope Labeling 

► Isotope Edited Raman and Infrared Difference 
Spectroscopy Studies of Protein Structure 

► Protein Segmental Labeling 

► Stereo-Array Isotope Labeling (SAIL) Method 






J 


J Coupling 

Stefan Bagby 

Department of Biology & Biochemistry, 

University of Bath, Bath, UK 

Synonyms 

Scalar coupling; Spin-spin coupling 

Definition 

J or scalar coupling between magnetic nuclei 
(otherwise termed nuclear spins) is coupling via the 
intervening network of chemical bonds and depends on 
interaction between the nuclear spins and bonding 
electron spins. 

Introduction 

J or scalar coupling occurs between magnetic nuclei 
that are chemically bonded to one another (Cavanagh 
et al. 2007; Hore 1995; Rule and Hitchens 2006). 
Scalar coupling is important because many NMR 
pulse sequences used to investigate molecular 
structure, dynamics, and interactions depend on scalar 
coupling for magnetization transfer between nuclear 
spins. Scalar coupling, moreover, allows delineation of 
the chemical connectivity and stereochemistry within 
a molecule, be it a small molecule or biological 
macromolecule; together with through-space dipolar 
coupling, therefore, through-bond scalar coupling 


forms the basis of resonance assignment and NMR 
determination of 3D structures of molecules. Here 
I review some basic concepts, some methods to 
measure scalar coupling values in proteins, recent 
applications in protein studies, and briefly discuss 
scalar couplings in nucleic acids. Consideration is con¬ 
fined to the liquid state. 

Scalar Coupling Nomenclature 

Scalar coupling causes splitting of the spectral lines of 
two coupled spin Vi nuclei by an amount J (in Hz) called 
the coupling constant. The nomenclature for a scalar 
coupling constant is '’J(AX) where n is the number of 
chemical bonds connecting the coupled nuclei A and X. 
The letter “h” is included to denote a scalar coupling 
across a hydrogen bond, that is, ""J(AX). 

Physical Basis of Scalar Coupling 

Scalar coupling occurs via perturbation of electronic 
spin states that arises from overlap of nuclear spins 
with bonding electron orbitals. Scalar coupling is 
a molecular property; its magnitude depends on the 
product of the gyromagnetic ratios of the coupled spins 
and, unlike the resonant frequency, is independent of the 
magnitude of the external magnetic field, Bq. 

Weak and Strong Coupling and Effect on NMR 
Spectra 

The appearance of the spectrum arising from coupled 
spins depends on the magnitude of the coupling con¬ 
stant, the number and types of nuclear spins to which 
each of the spins is coupled, and the difference in 
resonant frequencies, Av in Hz, of the coupled spins. 
The spectrum due to a scalar coupled spin system 
becomes more complex as the number and types of 


G.C.K. Roberts (ed.), Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 




J 


1164 


J Coupling 


coupled nuclei increase. The simplest case is for two 
coupled spins A and X both with spin quantum number 
I — 16 when J^x << Av; this is described as weak 
coupling. In such a case, each signal is split into two 
lines, separated by J(AX), called a doublet; the chem¬ 
ical shift of each signal is exactly at the midpoint of 
each doublet. A spin 16 nucleus coupled to a spin 1 
nucleus will have its signal split into a triplet. The 
general rule is that a nuclear spin coupled to a spin 
I nucleus will have its signal split into 21 -i- 1 lines. As 
the coupling constant approaches Av (e.g., Av ten 
times greater than J), the coupling is no longer weak 
and the spectrum of the coupled spins can be predicted 
only by quantum mechanical analysis. Strong coupling 
occurs when the frequency separation Av and coupling 
constant J are comparable (Hore 1995); in this case, the 
spins are denoted by alphabetically adjacent letters 
(A and B) to highlight the closeness of their resonant 
frequencies. Coupling between nuclear spins of 
different elements is always weak due to the large 
resonant frequency differences involved. 

Relationship Between Scalar Coupling Constants 
and Dihedral Angles 

The magnitude of the scalar coupling constant across 
multiple bonds depends on the intervening dihedral 
angle(s) and can thus be used to establish local back¬ 
bone and side chain conformations. The magnitude of 
^J, for example, is a function of the dihedral angle 
formed by the three covalent bonds between the 
coupled spins, as first described by Karplus: 

= Acos^fl -I- Bcos0 -I- C 

where A, B, and C are constants that depend on the 
nuclei involved and 0 is the dihedral angle. Values for 
A, B, and C have been obtained by theoretical 
approaches, but semiempirical approaches have 
proved more workable and more recently the availabil¬ 
ity of large data sets has permitted more analytical 
approaches to Karplus equation parameterization 
(Cavanagh et al. 2007; Rule and Hitchens 2006). 

In proteins, ^J(*HN-'Ha) values for coupling 
between the backbone amide proton and a-proton 
within the same amino acid reflect the cj) dihedral 
angle and hence secondary structure elements: 
a sequence of consecutive ^J(*HN-^Ha) values less 
than 5.5 Hz is indicative of a-helical secondary struc¬ 
ture, whereas a sequence of ^J(*HN-*Ha) values 


greater than 8.5 Hz is indicative of P-strand. 
^J(*HN-*Ha) values in the range 6-8 Hz may 
reflect motional averaging of multiple conformations. 
^J(*Ha — *HP) values reflect Xi side-chain dihedral 
angles and hence side-chain rotamer configurations 
(Evans 1995; Rule and Hitchens 2006). 

The shape of Karplus curves means that up to four 
different dihedral angles can give the same value of ^J, 
but some of these may be sterically disallowed; the 
allowed dihedral angle space in proteins is depicted in 
the Ramachandran (j)-v|/ plot. Also, the degeneracy can 
be removed if the value of more than one coupling 
constant that reflects a particular dihedral angle is 
known. For example, up to six couplings depend 
on (j) and seven couplings depend on Xi (Vuister et al. 
1999). 

and couplings also depend on protein second¬ 
ary structure (Liu and Prestegard 2009; Schmidt et al. 
2009, 2010) and have been used in an index approach 
to annotate secondary structure elements (Schmidt 
et al. 2011). 

Experiments to Measure Scalar Coupling 
Constants 

Methods for measuring J values can be placed into two 
main groups: (1) frequency-based methods and 
(2) intensity-based methods (Liu and Prestegard 
2009; Vuister et al. 1999). Historically, *H-'H NMR 
was used to measure scalar couplings and tj) and Xi 
dihedral angles were derived from measurement of 
^J('HN-'Ha) and ^J('Ha-'HP) values, that is, 'H-'H 
couplings; subsequent developments have permitted 
measurement of ^H-'^N, 

and 'H-*^C coupling constants. Hence, the 
discussion below starts with consideration of 'H-*H 
NMR before moving on to heteronuclear experiments. 
For samples in anisotropic media that induce a degree 
of molecular alignment with the external magnetic 
field, the heteronuclear methods described below are 
often used to measure ► residual dipolar coupling 
values (symbol D) in addition to scalar coupling 
values. 

Frequency-Based Methods 

Here couplings are measured from the frequency sep¬ 
arations of lines in multiplet structures. values can 
be extracted from *H-^H ► COSY peak fine structure 
for spins that have one active coupling, for example 
^J(*HN-^Ha) of residues other than Gly in H 2 O and 
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^J('Ha-'HP) for residues with a single HP in D 2 O. For 
measurement of J values, however, COSY suffers from 
cancelation of positive and negative multiplet compo¬ 
nents as linewidth approaches or exceeds the value of 
J, resulting in intensity reduction and increase of sep¬ 
aration beyond the true value of J. 

Several variants of the ► COSY have been 

developed to simplify cross peak multiplet structure, 
particularly for ^J(*Ha-*HP) determination, including 
P-COSY, exclusive COSY (E-COSY), z-COSY, and 
primitive exclusive COSY (PE-COSY). The name 
E-COSY covers a number of methods in which the 
actively coupled spins are correlated without 
perturbing the passively coupled spins. The most com¬ 
mon approach to generating E-COSY data is by linear 
combination of pQF COSY (QF = quantum filtered) 
data with the order of filtration including p = 0, 1,2, 
.. .n. For a three-spin system, combination of 
2QF-COSY and 3QF-COSY is sufficient to generate 
an E-COSY pattern. Higher order QF-COSY data are 
required for longer spin systems. The E-COSY princi¬ 
ple has been extended to heteronuclear NMR methods, 
for example HNCA-J in which the *^Ca and *HN spins 
are correlated without perturbing the spin state of the 
*Htx nucleus, and HNCO-based E-COSY-type experi¬ 
ments (Cavanagh et al. 2007; Evans 1995; Rule and 
Hitchens 2006). 

The *J('H-*^N) coupling of ^93 Hz is typically 
larger than 'H/*^N linewidths in proteins, allowing 
relatively accurate direct determination of these 
amide couplings from frequency separations in 
multiplet structures. Eor weaker couplings such as 
'j(^^N-'^C’) -15 Hz, 'j('^N-^^Ca) -11 Hz, 
^J(^^N-*^Ca) —7 Hz, however, measurement by 
frequency separation is only possible in small proteins 
with small linewidths. In such cases, in-phase anti¬ 
phase (IPAP) (Permi and Annila 2000) or double IPAP 
(DIPAP) (Ldhr et al. 2011) approaches to separate 
multiplet components can be used. 

Intensity-Based Methods 

Quantitative J correlation Quantitative J correlation 
methods have been reviewed previously (Bax et al. 
1994; Vuister et al. 1999). In these methods, magneti¬ 
zation dephases for a fixed period under the influence 
of a scalar coupling, the magnitudes of the resulting 
in-phase and anti-phase terms are determined, and the 
J value is determined from the ratio of these terms 
(Vuister et al. 1999). In the 3D HNHA experiment. 


for example, probably the best known and most widely 
used quantitative J correlation method, ^J('HN-*Ha) 
values are extracted by quantitative analysis of the 
diagonal peak to cross peak intensity ratio in an 
*HN-*Ha J correlation experiment. In addition to 
HNHA, examples of quantitative J correlation experi¬ 
ments include HNHB, HN(CO)HB, HACAHB, 
and long-range carbon-carbon and long-range 
carbon-proton pulse sequences to measure 
^J('^C-'^C) and ^J('H-^^C) values (Bax et al. 1994; 
Vuister et al. 1999). 

More recently, (3,2)D UNHA and (3,2)D HNHB 
pulse sequences (Barnwal et al. 2007) have exploited 
G-matrix Eourier transform (GFT) NMR spectroscopy 
to facilitate fast recording of 3D data with high digital 
resolution and chemical shift dispersion for measure¬ 
ment of ^J(HNHa) and ^J(NHP). Compared with 
HNHA, (3,2)D UNHA and (3,2)D UNHB offer faster 
data acquisition, superior dispersion due to joint 
sampling of ^^N and *H shifts which can be tuned 
using a scaling factor, and lack of diagonal peaks 
which allows integration of peaks that would otherwise 
be impossible due to diagonal peak overlap. Eor the 
two proteins studied (mainly P-strand 9.5 kDa 
M-crystallin and mainly a-helical 16.2 kDa Eh- 
CaBP), the GET data allowed extraction of —96% of 
possible ^J('HN-'Ha) values and —90% of possible 
^J('^N-'HP) values. 

Quantitative J modulation Here a single 
J-modulated function is recorded but at multiple time 
points (Billeter et al. 1992). J modulation is typically 
used with 2D data sets so that sufficient time points can 
be acquired with acceptable spectrometer usage. In 
their paper describing a TROSY-based J-modulated 
method for measuring small *^N-*^C scalar (and dipo¬ 
lar) couplings in large proteins, Liu and Prestegard 
discuss the relative merits of quantitative 
J correlation and J modulation in more detail (Liu 
and Prestegard 2009). 

Scalar Coupling Constant Measurements on 
Structurally Disordered Proteins and Denatured 
States of Proteins 

There has been increasing awareness of the prevalence 
and importance of ► structurally disordered proteins or 
structurally disordered regions of proteins, not least 
because of their role in protein folding diseases 
(Boehr et al. 2009). Structurally disordered proteins 
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adopt a dynamic ensemble of conformational states 
and can have a propensity toward secondary structure 
formation and specific local interactions between side 
chains. NMR spectroscopy, including measurement of 
scalar coupling constants, is probably the most power¬ 
ful method for investigation of the structures and 
dynamics of structurally disordered proteins. Three 
bond scalar couplings involving peptide backbone 
nuclei are useful because they correlate with the dis¬ 
tribution of backbone dihedral angles. In particular, 
^J(^HN-*Ha) values, corresponding to the cj) dihedral 
angle, have been used frequently in structural analysis 
of structurally disordered proteins. 

Examples of NMR experiments designed to mea¬ 
sure ^J(*HN-*Ha) values in structurally disordered 
proteins include a phase-sensitive J-resolved sequence 
combined with SOFAST-HMQC that produces a 3D 
spectrum with E-COSY pattern arising from 
^J(^HN-'Ha) and ^J('^N-^Ha) couplings (Lendel and 
Damberg 2009), and a combination of HNCO and 
HMQC-based spin echo difference experiments 
(Otten et al. 2009). In the latter report, a small system¬ 
atic difference of about 0.4 Hz was noted between 3D 
HNHA and 2D spin echo difference measurements. 

Schwalbe and colleagues have developed the HN 
(COCA)HAHB experiment for measurement of 
^J(^Ha-*Hp) values in structurally disordered proteins 
and denatured states of proteins (Hahnke et al. 2010). 
When combined with ^J('^N-'^Cy) and ^J('^C'-'^Cy) 
values, the quantitative correlation HN(COCA)HAHB 
experiment can in addition permit stereospecific 
assignment of diastereotopic P-protons. 

Scalar Coupling Across Hydrogen Bonds 

Hydrogen bonds are fundamental forces in biomolec- 
ular structure and function. Through their dependence 
on hydrogen bond distances and angles, scalar 
couplings across hydrogen bonds in proteins and 
nucleic acids provide a very sensitive measure of the 
geometry of hydrogen bonds. Scalar couplings also 
provide important information for theoretical analyses 
of the nature of these crucial bonds. The properties of 
biomolecular hydrogen bonds and measurement of 
scalar couplings across hydrogen bonds in biomole¬ 
cules have been reviewed in detail, encompassing ana¬ 
lyses of the relationship between scalar coupling and 
bond geometry and distance, use for structure determi¬ 
nation, hydrogen bond formation during protein fold¬ 
ing, effect of ligand binding and changes in conditions 


on hydrogen bonding, and importance of time- 
dependent changes in hydrogen bond geometry 
(Grzesiek et al. 2004). Among more recent studies of 
hydrogen bonds involving scalar coupling is 
a relatively rare example involving side chains: 
Iwahara and colleagues combined long-range 
scalar coupling and relaxation measurements 
of NH 3 '^ groups with molecular dynamics simulations 
to probe the dynamics of lysine side chains in ubiquitin 
(Zandarashvili et al. 2011), showing that H bonds 
involving NH 3 '^ groups are transient and highly 
dynamic. A recent example of the use of hydrogen 
bond scalar couplings in structure determination 
involved the use of intermolecular hydrogen bond sca¬ 
lar couplings to determine to high precision the struc¬ 
ture of the ubiquitous bacterial second messenger 
cyclic diguanosine monophosphate (c-di-GMP) in its 
complex with bacterial c-di-GMP receptor pilZ 
(Habazettl et al. 2011); this complex is involved in 
controlling the switch from individual bacterial cells 
to biofilms. 

Scalar Couplings and Protein Dynamics 

Motion is an intrinsic feature of protein function. 

couplings can reflect conformational averaging of 
backbone cj) angles with interconversion rates 
extending into the millisecond range, similar to resid¬ 
ual dipolar couplings. Although this time dependence 
can hinder interpretation of scalar coupling, for exam¬ 
ple, whether a particular coupling constant reflects 
a single dihedral angle or an average of different 
angles, it can also be exploited to probe dynamics. In 
a study of the dynamics of protein backbone angles, for 
example, Markwick et al. introduced to the empirical 
Karplus relationship a time element corresponding to 
time and ensemble averaging (Markwick et al. 2009). 
These authors emphasize the complementarity of 
and orientation-dependent ► residual dipolar cou¬ 
plings to correlate motions of peptide planes via (j)- 
and v|/-dependent and ► residual dipolar couplings. 
Markwick et al. compared experimental values 
(^J('HN-'Ha), ^J('HN-'^C'), and ^J(*HN-'^Cp)) 
with those produced by standard and accelerated 
molecular dynamics-generated structural ensembles 
and by a single structure refined with ► residual dipo¬ 
lar couplings. The molecular dynamics structural 
ensembles produced values in slightly better agree¬ 
ment with experimental values than static protein 
structures, although it was noted that extraction of 
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dynamics data from values may be limited by the 
absence of independent reference parameters for the 
Karpins relationship and hy preincorporation of 
dynamics effects into empirical Karpins parameters. 

A similar approach has been adopted to assess the 
effect of protein dynamics on hydrogen bond scalar 
couplings. In one such study, agreement between 
experimental and predicted values for 

1.8-2.1 A resolution protein crystal structures was 
significantly improved by averaging over molecular 
dynamics trajectories with explicit water models. The 
agreement was only slightly improved for a higher 
resolution (1.1 A) structure, indicating that 
values are compatible with a static high 
resolution structure and could be used to assess struc¬ 
tural quality (Sass et al. 2007). 

Scalar Coupling in Nucleic Acids 

Certain properties of nucleic acids make high- 
resolution structure determination by NMR difficult 
relative to globular proteins; these properties include 
a lower proton density, a higher number of freely 
rotatable bonds, and often an elongated conformation. 
Sources of structural information other than NOEs 
therefore take on greater importance in structural- 
studies of nucleic acids hy NMR; these include scalar 
couplings, ► residual dipolar couplings, and ► cross- 
correlated relaxation rates. On the other hand, NMR 
studies of nucleic acids are aided by the fact that all of 
the constituent elements have spin Vi isotopes, with the 
exception of oxygen. 

Whereas just two dihedral angles, (j) and \|/, define 
the backbone conformation of a protein, six dihedral 
angles (a, (3, y, 5, e, and Q define the conformation of 
the phosphodiester backbone of a nucleic acid. 
Methods have been developed to measure 
^J(^H-^'P), and ^J('^C-^*P) values in nucleic acids and 
to relate these to dihedral angles via Karplus relation¬ 
ships. A high resolution structure of the 14mer 
cUUCGg tetraloop hairpin RNA provides a good illus¬ 
tration of the use of scalar coupling values in nucleic 
acid structural studies (Nozinovic et al. 2010). 

Decoupling 

We have seen that scalar coupling is essential to the 
NMR study of molecular properties. In many situa¬ 
tions, however, scalar coupling is undesirable for two 
reasons: scalar coupling increases the complexity of 
NMR spectra by generating multiplets that can be 


broad and complex; scalar coupling also reduces 
signal-to-noise by spreading a given signal intensity 
over all peaks in a multiplet. It is often desirable 
therefore to decouple nuclear spins from one another, 
that is, to reduce the scalar coupling-induced splittings 
to such a small value that they are no longer resolved 
and do not influence peak heights. 

The spectrum of nuclear spin A can be observed 
without the effect of scalar coupling to nuclear spin 
X by inverting the X spins at a rate that is fast com¬ 
pared to the magnitude of J(AX). A conceptually sim¬ 
ple way to invert the X spins is to apply a train of 180° 
pulses at the resonant frequency of X. This method 
becomes ineffective upon a small shift of the pulse 
train away from the resonant frequency of X. Due to 
the wide frequency range of some nuclear spins, par¬ 
ticularly *^N and more sophisticated decoupling 
methods have had to be developed. These involve 
repeated elements comprising sequences of pulses 
with particular angles, phases, and shapes and 
include MLEV-16, WALTZ-16, GARP-1, DIPSI, 
WURST, and SEDUCE (Freeman 1997; Rule and 
Hitchens 2006). 

Summary 

Along with chemical shift and dipolar coupling, scalar 
coupling is fundamental to the utility of NMR as 
a technique to study molecular structure and function. 
Many pulse sequences depend on scalar coupling 
to transfer magnetization between nuclear spins, and 
scalar coupling values convey information about 
molecular structure and dynamics. As a result, 
methods to measure scalar coupling values and to 
relate them accurately to structure and dynamics are 
an ongoing and important area of NMR research. 

Cross-References 

► Cross-Correlated Relaxation 

► Fluorescence Correlation Spectroscopy 

► Molecular Dynamics Simulations of Lipids 

► Multidimensional NMR Spectroscopy 

► NMR-based Structural Proteomics 

► Nuclear Overhauser Effect 

► Nucleic Acid NMR - Introduction 

► Protein NMR - Introduction 



J 


1168 


J-Based NMR Correlation Spectroscopy of Biological Solids 


► Residual Dipolar Coupling 

► Structurally Disordered Proteins 

► Triple Resonance NMR 

► TROSY 
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Definition 

A method for correlating resonance frequencies 
in groups of NMR active nuclei based upon a shared 
/-coupling (indirect spin-spin or scalar-coupling) 
interaction for the purpose of uniquely assigning 
nuclear magnetic resonance frequencies to specific 
backbone and side chain atoms in a solid-state protein. 

Basic Characteristics 

A necessary first step in the use of nuclear magnetic 
resonance (NMR) spectroscopy to answer questions of 
protein structure and function is the unique assignment 
of resonance frequencies to specific atomic nuclei 
within a macromolecule. Correlation spectroscopy 
(Ernst et al. 1987) aids this task in two significant 
ways. First, it provides a method for finding connec¬ 
tions between groups of lines (pairs, triples, etc.) based 
on spatial or chemical connectivity. Second, it greatly 
increases the effective resolution by expanding the 
dimensionality of the resulting spectrum. 

There are two primary interactions that are used to 
effect correlation spectroscopy in NMR - through- 
space, dipolar couplings, and through-bond, /-cou¬ 
plings. In solids, the dipolar-coupling interaction is 
orders of magnitude larger than the /-coupling inter¬ 
action (^2 kHz vs. ^50 Hz for directly bonded 
atoms), so many correlation methods favor the 
use of the dipolar interaction (Jaroniec 2009; Tycko 
2009). Over the last decade, however, it has become 
clear that /-couplings can be used to efficiently transfer 
magnetization between pairs of nuclear spins in solids 
and establish their through-bond connectivity 
and assignments via cross-peaks in 2D and 3D 
NMR experiments (Sakellariou and Emsley 2007; 
Lesage 2008; Mueller and Titman 2010). 

/-coupling-driven correlation offers several unique 
features that complement those of dipolar-driven 
methods. First, ./-based experiments provide a means 
to delineate through-bond connectivity, a critical step 
for establishing structure. Second, unlike dipolar inter¬ 
actions, which can average to zero under molecular 
motion even in the solid state, scalar couplings are 
relatively insensitive to molecular dynamics and can 
provide for correlation and spectral assignment in sit¬ 
uations where dipolar-driven experiments fail. Finally, 
despite the smaller size of the ./-coupling interaction. 


through-bond correlation in solids can be implemented 
in very efficient manners (e.g., constant-time experi¬ 
ments) that provide substantially increased spectral 
resolution without compromising sensitivity, which 
can be comparable to (or in favorable cases better 
than) that of dipolar methods. 

Solid-state NMR spectra of disordered biological 
solids are routinely acquired under conditions of 
magic-angle-spinning (MAS) and high-power proton 
decoupling (Mehring 1983; Schmidt-Rohr and Spiess 
1994), which effectively average interactions such as 
homonuclear and heteronuclear dipolar couplings 
to zero and the chemical shielding interaction to its 
isotropic value. The resulting line widths of and 
*^N resonances are typically reduced from many tens 
of kHz in static solids to often better than 100 Hz. Even 
with such improved resolution, through-bond homo- 
nuclear and heteronculear scalar couplings are often 
not resolved in biological solids. Yet, they are still 
present and provide a mechanism for magnetization 
transfer and correlation spectroscopy. 

There are two general subclasses of through-bond 
correlation spectroscopy experiments in solids 
(Mueller and Titman 2010). The first includes ana¬ 
logues of the COSY and INADEQUATE experiments 
in solution-state NMR, with coherences transferred 
via the weak scalar-coupling interaction, JIzSz- 
The second class, the analogues of the TOCSY exper¬ 
iment in solution-state NMR, makes use of the strong 
/-coupling (or isotropic-mixing) Hamiltonian, JI S, 
which is produced via rotor-synchronized radio¬ 
frequency nutation fields. 

In principle, many of the /-coupling-driven experi¬ 
ments developed in solution-state NMR (Ernst et al. 
1987; Cavanagh et al. 1996) could also be applied to 
solids, but special consideration must be given to the 
resolution and line shapes in these experiments. In 
solids, the resolution is often not adequate to prevent 
the cancellation of intensity in anti-phase lines, 
in which the multiplet components (split by the 
/-coupling) have opposite sign. This lack of resolution 
can be due either to a large inherent line width or, as is 
often the case in uniformly labeled proteins, moderate 
inherent line widths (40-70 Hz) that are broadened 
further by multiple passive couplings to additional 
spins. Anti-phase lines are a particular problem in 
correlation spectroscopy based on the weak /-coupling 
interaction. To combat this, such experiments often 
use a “refocused” format in which anti-phase 
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magnetization is brought back to in-phase magnetiza¬ 
tion via evolution under the /-coupling interaction 
during an additional echo period before detection. 
Introducing refocusing periods on the order of 1/2/ 
into the pulse sequence may seem incompatible with 
solid-state NMR, as relaxation during this time could 
lower sensitivity. The relaxation rate under echo 
sequences (often referred to as T 2 ), however, is signif¬ 
icantly slower than the relaxation rate of the directly 
observed magnetization and improves dramatically 
under fast MAS (>20 kHz) and high-power *H 
decoupling (>120 kHz). Scalar-coupling-driven cor¬ 
relation experiments based on the isotropic-mixing 
Hamiltonian similarly benefit from high-power 
decoupling during the magnetization transfer periods, 
hut have the benefit of cross-peaks that are inherently 
in-phase. 

Figure la shows an example of a 3D ./-based solid- 
state NMR correlation experiment, the weak-scalar- 
coupling-based NCOCA correlation experiment 
(Chen et al. 2007). This experiment is constant-time 
(Ernst et al. 1987) in both of the indirect dimensions, 
with the evolution and mixing periods combined into 
single intervals. This format improves resolution 
through homonuclear and heteronuclear decoupling 
in the indirect dimensions, giving increased sensitivity 
and resolution when the scalar couplings are partially 
or fully resolved. This increased resolution is evident 
in the experimental spectrum of the (11 immunoglobu¬ 
lin-binding domain of protein G (GBl) shown in 
Fig. lb, where all 55 expected cross-peaks are 
resolved. Each peak in this spectrum is defined by 
three NMR resonance frequencies, corresponding to 
a covalent chain of atoms along the protein backbone 
beginning with an amide nitrogen and extending to the 
neighboring carbonyl (; — 1) and its neighboring 
alpha-carbon as highlighted in Fig. Ic. For reference, 
representative /-couplings along a protein fragment 
are shown. When this NCOCA experiment is com¬ 
bined with NCACO and CANCO experiments, it is 
possible to unambiguously trace out and assign all of 
the resonance frequencies along the backbone of GB1. 
Part of this backbone assignment “walk” is shown in 
Fig. Id. 

2D and 3D /-based solid-state NMR experiments 
for homonuclear *^C correlation spectroscopy (using 
both the weak and strong isotropic spin-spin interac¬ 
tion) and heteronuclear ^^N-^^C correlation spectros¬ 
copy have been demonstrated for the assignment of 


resonance frequencies along the protein backbone and 
into the side chains (Mueller and Titman 2010). 
./-based heteronuclear correlation (/-HETCOR) 
involving protons in biosolids has also been developed 
for making assignments and for spectral editing 
(Sakellariou and Emsley 2007; Lesage 2008; Mueller 
and Titman 2010). A major hurdle to *H /-HETCOR 
experiments in solids remains the necessity to counter¬ 
act the strong homonuclear dipolar interactions 
between protons, which typically lead to significant 
line broadening. Great progress has been made, includ¬ 
ing improved homonuclear decoupling schemes, very 
fast MAS, and the use of deuteration. An important 
application is the correlation spectroscopy in mobile 
regions of proteins, such as the extracellular and cyto¬ 
plasmic domains of membrane proteins, where dipolar 
couplings (including those between protons) are 
greatly attenuated by motion. Recent experiments 
combine scalar-coupling-driven correlation with 
proton detection, which offers improved sensitivity 
compared to and ^^N detection (Reif 2009). 
In this approach, the sample’s proton density is typi¬ 
cally reduced by supplying deuterated growth media, 
with limited back exchange of amide protons. By com¬ 
bining with fast MAS (>20 kHz), proton line widths on 
the order of 20 Hz can be observed, providing *H T 2 
relaxation rates sufficient for/-transfers to be efficient. 
This opens up solution-state type assignment strategies 
using proton detection, such as 3D out-and-back 
HNCO, HNCA, and HNCACB experiments 
(Cavanagh et al. 1996). 

/-based methods for correlation spectroscopy in 
biological solids have evolved alongside significant 
progress in hardware, pulse sequence design, and sam¬ 
ple preparation. Despite being orders of magnitude 
smaller than the dipolar interaction, scalar-couplings 
can provide for high-sensitivity and high-resolution 
correlation spectroscopy. Dipolar and scalar coupling 
methods are complementary - the /-coupling interac¬ 
tion signifies through-bond covalent contact, while the 
dipolar coupling reports on spatial proximity. Delin¬ 
eating through-bond versus through-space contact is 
a necessary step for solving structures with NMR. 
Eurthermore, dipolar couplings are sensitive to molec¬ 
ular motion, while scalar couplings are robust, 
allowing for sequences to be developed that specifi¬ 
cally target static and mobile regions of the same 
protein. A number of 2D and 3D homonuclear and 
heteronuclear through-bond correlation experiments 
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J-Based NMR Correlation Spectroscopy of Biological Solids, Fig. 1 (continued) 
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set of atoms whose NMR frequencies are correlated under an NCOCA experiment. 
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have now been developed that allow backbone and side 
chain nitrogen, carbon, and amide proton resonance 
frequencies to be assigned in solid-state proteins and 
subsequently used as atomic-level probes of biomolec- 
ular structure, function, and dynamics. 
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Synonyms 

Ion channels; Potassium channels, K channels 

Definition 

ATP-sensitive potassium (Katp) channels are unique 
biochemical complexes of channel pore-forming 
subunit (Kirb.l or Kir6.2) and sulfonylurea receptor 
(SURl or SUR2) proteins. In responding to cytoplas¬ 
mic nucleotide levels, Katp channel activity provides 
a unique physiological link between cellular energetics 
and electrical excitability. Katp channels were discov¬ 
ered more than 25 years ago, and since then, the clon¬ 
ing, reconstitution, and molecular dissection of the 
channel complex has led to highly rehned biochemical 
and biophysical models of channel activity. 

Introduction 

Katp channels modulate electrical activity in multiple 
tissues (Ashcroft 1988). These channels are members 
of the potassium channel superfamily but uniquely are 
inhibited by non-hydrolytic binding of ATP and 


activated by interactions of Mg-nucleotides at distinct 
sites. The inhibitory effect dominates, and channels are 
closed, when cellular phosphorylation potential is high 
but, as metabolism decreases, the activating effects 
win out and channels open. In this way, these channels 
function as unique electrical transducers of the meta¬ 
bolic state of the cell. Cloning of the constituent sub¬ 
units in the 1990s and crystallization of structurally 
related proteins over the last decade have led to 
a detailed biochemical and biophysical understanding 
of how nucleotides interact with and modulate Katp 
channels (Nichols 2006). 

Composition of Katp Channel Complexes 

Katp channels are formed by the unique combination 
of two dissimilar proteins (Nichols 2006) (Fig. la): one 
(Kir6) is a member of the Kir channel family; the other 
(SUR) is a member of the ATP binding cassette (ABC) 
protein family, and structurally related to the cystic 
hbrosis transmembrane receptor. In the vertebrate 
genome, there are two Kir6 genes KCNJ8 (encoding 
Kir6.1) and KCNJll (encoding Kir6.2) and two SUR 
genes ABCCS (encoding SURl) and ABCC9 (encoding 
SUR2). Intriguingly, KCNJ8 is immediately down¬ 
stream of ABCC9 on human chromosome 12pl, and 
KCNJll is immediately downstream of ABCC8, on 
human chromosome lip 15, suggesting not only that 
one pair of genes arose as a duplication of the other 
pair, but also that within a pair, there may be common 
regulatory mechanisms that have provided an evolu¬ 
tionary pressure for the two genes to remain adjacent. 
However, it is clear that coexpression of any combina¬ 
tion of the two subunits can occur in both recombinant 
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Katp Bid Sulfonylurea Receptor, Fig. 1 The biochemical 
and biophysical basis of Katp channel activity, (a) Transmem¬ 
brane organization of Katp channel subunits. Kir6 subunits 
generate the channel pore, SUR subunits generate the regula¬ 
tory subunit. Each channel is a functional octamer of 4 Kir6 
subunits, each associated with 4 SUR subunits, (b) Cartoon 
model of Katp structure and gating. The metabolically con¬ 
trolled gate of the channel is located at the cytoplasmic end of 


the inner cavity of the Kir6.2-generated channels. ATP binding 
to the Kir6.2 subunits provides the energetic push to closure of 
the ligand-operated gate, perhaps through the physical link 
provided by the slide helix. MgATP binds to each of the ATP 
binding sites (ABSs) that are formed at the interface between 
NBFl and NBF2 in each SUR subunit. ATP hydrolysis at the 
second site results in a conformational “activated” state that is 
transduced to an “over-ride” of ATP inhibition at Kir6.2. 
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expression and in vivo. Biophysical and biochemical 
experiments on SUR1/Kir6.2 channels have 
established that four Kir6.2 subunits generate the chan¬ 
nel pore, each Kir6.2 being associated with one SURl 
protein (Fig. la). In recombinant cells, truncated 
Kir6.2 can form channels without SURl when 
C-terminally truncated, but in this case, truncation 
removes a trafficking sequence (Arg-Lys-Arg) that 
otherwise causes Kir6.2 retention in the endoplasmic 
reticulum (ER). SURl, which contains a similar ER- 
retention sequence, is also not efficiently trafficked to 
the surface membrane in the absence of Kir6.2, indi¬ 
cating that the two subunits normally act to mutually 
shield the trafficking signal on the other subunit. 

A High-Resolution Model of Kir6.2 

Crystallization of several bacterial Kir channels and 
most recently a full length eukaryotic Kir2.2 channel 
reveals a conserved architecture (Eig. Ic) with a trans¬ 
membrane domain consisting of two alpha-helices 
(Ml, M2) bridged by an extracellular loop that gener¬ 
ates the narrow portion of the pore and controls ion 
selectivity. Unique to the Kir channels is a conserved 
cytoplasmic domain that surrounds an extended cyto¬ 
plasmic pore. Consensus models of Kir6.2, based on 
such structures (Enkvetchakul and Nichols 2003; 
Haider et al. 2005) provide satisfying agreement with 
mutational studies of Kir6.2, particularly with regards 
to the inhibitory ATP binding site (Eig. lb, c, see 
below). At the membrane interface, where many 
ligands likely bind, crystal structures tend to be poorly 
resolved. This makes it harder to model with certainty 
and, while all euk-Kir channels are strongly activated 
by interaction with phosphatidylinositol-bisphosphate 

< - 

Katp and Sulfonylurea Receptor, Fig. 1 (continued) 
The “activated” state persists through MgADP dissociation, 
and can be maintained by MgADP rebinding. PIP2 at the inner 
leaflet also interacts with the channel subunit apparently in 
competition with ATP. (c) High-resolution model of Kir6.2 
tetramer structure with inset close-up view of the ATP binding 
site. Three subunits of a consensus Kir6.2 model (Haider et al. 
2005) are shown with the rear-most subunit removed for clarity. 
The subunit positioned to the front provides the bulk of one ATP 
binding site in its green C-terminal cytoplasmic domain. The N- 
terminal domain of this subunit provides the blue “slide helix” 
overlying the binding site to the right and then curves back to 
close the binding pocket around the adenine ring. Beyond the 
slide helix, at the turn of the N-terminus, residue R50 from the 
subunit to the left (yellow) is predicted to close the binding 
pocket at the y-phosphate of ATP. ATP is rendered by atom 


(PIP 2 ), KirBacl.l is confusingly inhibited by PIP 2 
(Enkvetchakul et al. 2005), but interestingly, the 
linkers between the cytoplasmic domain and the 
TMD are shorter in KirBacl.l than in euk-Kir chan¬ 
nels and this may be important in the detailed linkage 
of PIP 2 binding to the channel gate. 

Templates for SUR 

SURl and SUR2 each contain two six-helix transmem¬ 
brane domains (TMDl, TMD2), each followed by 
a nucleotide binding fold (NBEl, NBE2), as is found 
in all ABC proteins. SURs have an additional 
N-terminal TMDO domain consisting of five trans¬ 
membrane helices (Eig. la). TMDO is critical for traf¬ 
ficking Kir6.2 subunits to the surface membrane (Chan 
et al. 2002), and its role in controlling gating of Kir6.2 
suggests an intimate relationship with the pore-forming 
subunit. Low homology between the TMD regions of 
crystallized ABC proteins and SURs makes molecular 
modeling risky, although some insights have been 
made, including the likely location of binding sites for 
the channel opener diazoxide and the inhibitory sulfo¬ 
nylurea, glibenclamide (Bryan et al. 2004). 

There are now multiple crystal structures available 
for isolated NBEs from bacterial ABC proteins and 
although not all NBE structures are dimers, it is quite 
clear that this is the functional unit (Masia and Nichols 
2008): rather than each NBE containing a self- 
contained ATP binding site (ABS), each ABS is 
formed at the dimer interface and contains elements 
from each NBE. Modeling of the SUR NBEs as 
heterodimers (Fig. Id) seems reasonable given the 
high degree of conservation among NBE sequences. 
A complete Katp complex, generated from tandem 


type, oxygens in red, nitrogens in blue, (d) NBF1-NBF2 
heterodimeric structure of ATP binding sites (ABSs) in SURl 
(Masia and Nichols 2008). ATP is rendered in gray in ABS 1 and 
ABS2. Mutation of residues in dark and light blue cause reduced 
or slowed MgADP activation of channel activity, respectively. 
Mutation of residues in red causes enhanced MgADP activation, 
(e) Kinetic mechanism of Katp channel gating kinetics. Full 
allosteric (8-states per subunit, above) and reduced allosteric 
(6-states per subunit, below), models of tetrameric gating (see 
text), (f) Such models predict a characteristic nonlinear relation¬ 
ship between ATP sensitivity ([ATP] causing half-maximal 
inhibition, Ki/2,atp) and open probability (Op). Changes in 
intrinsic open state stability (L) or PIP 2 affinity (Kp) cause the 
channel to move up or down this nonlinear relationship whereas 
changes in ATP affinity (Ka) shift the channel onto a parallel 
relationship 
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SUR1-Kir6.2 constructs has been isolated and studied 
using electron micrography and 3D reconstruction 
(Mikhailov et al. 2005). The complex is a compact 
structure of ^18 x 13 nm. It implies intimate packing 
of 4 SURl subunits against 4 Kir6.2 subunits, with the 
TMDO domains nestled between TMDl, TMD2, and 
Kir6.2, and suggests that NBFl and NBF2 interactions 
may occur between subunits, a possibility yet to be 
experimentally tested. 

Mechanism of Nucleotide Gating of 
Katp Channels 

ATP (with or without Mg^"^) inhibits Katp channel 
activity, while PIP 2 activates channels, by direct inter¬ 
action with Kir6 subunits (Nichols 2006). Sulfonyl- 
ureas inhibit, and potassium channel opener drugs 
activate, channels by interaction with the SUR subunit. 
In addition, in the presence of Mg ATP and ADP 
can activate the channel through interaction with the 
NBFs of SUR. These actions have all been demon¬ 
strated and studied in depth using patch-clamp ana¬ 
lyses of inside-out membrane patches (► Patch-Clamp 
Recording of Single Channel Activity: Acquisition and 
Analysis). Inhibition by ATP binding to Kir6.2 and 
activation by Mg-nucleotides is almost certainly the 
primary physiological regulatory mechanism of the 
channel. 

Kinetic Mechanism of Katp Channel Gating 

Even in the absence of ATP, single Katp channel 
kinetics are complex, but key features lead to 
consistent models (► Fluorescence Techniques for 
Studying Ion Channel Gating: VCF, FRET, and 
LRET) that assume a single “fast” gate, and ligand- 
operated “slow” gates within each subunit 
(Enkvetchakul and Nichols 2003). Single channel ana¬ 
lyses (► Patch-Clamp Recording of Single Channel 
Activity: Acquisition and Analysis) consistently reveal 
one open time and multiple closed times. The “fast” 
gate gives rise to the short open and closed times (the 
“intra-burst” events), that are voltage-dependent, but 
ligand-independent, and are affected by mutations of 
residues in or near the selectivity filter. In addition, 
there are always several longer closed times that com¬ 
prise “inter-burst” closures. It is this “slow” gating that 
is modulated by nucleotides and membrane PIP 2 . The 
“slow” gate probably results from closing of the 


channel at the bottom of the inner cavity, at the narrow 
collar generated by the crossing of the bundle of M2 
helices (Pig. lb, c), similar to the main gate predicted 
for other cation channels. Mutations in M2, particu¬ 
larly near the cytoplasmic end, affect ligand-gating in 
multiple Kir channels, including Kir6.2. Reduction of 
single channel conductance by MTSEA’^-modification 
of introduced cysteines in this region (Loussouam 
et al. 2001) shows that this region can act as a significant 
barrier in the permeation process. Kinetic analyses show 
that the inner cavity is only accessible to cytoplasmic 
MTSEA"^ in the open state and that blockers are trapped 
in the inner cavity in the closed state (Phillips et al. 
2003), again consistent with the location of the gate 
being at the bottom of the inner cavity (Pig. lb). Molec¬ 
ular models of open- versus closed states suggest that 
very minimal backbone M2 movement (Kuo et al. 2005) 
is actually required to appropriately increase the diam¬ 
eter of the pore at this point. 

Tetrameric models (1) in which each subunit 
undergoes independent gating transitions not only rep¬ 
licate all essential features of ATP-dependent gating 
but also have important predictive properties 
(Enkvetchakul and Nichols 2003). The assumption 
that each of the four subunits can be in an open or 
closed conformation (in the absence of ligand), and 
that the channel conducts only if all subunits are in the 
open conformation automatically produces multiple 
closed states and implies that closure of any one 
subunit will be sufficient to close the channel. Assum¬ 
ing that the open channel can close by such individual 
subunit closures (O <-> C), or (at the selectivity filter) 
by a concerted event (i.e., “fast” gating, O <-> Cf), 
such a model produces a single short open time and 
multiple closed times with inter-burst closures domi¬ 
nated by subunit closures. The physical reality is that 
PIP 2 and ATP binding sites may be overlapping 
(Haider et al. 2005), consistent with negligible occu¬ 
pancy of CpA and OpA states predicted by the model. 
Furthermore, Oa and Cp states are rarely sampled in 
the wild-type models. For wild-type channels, the sub¬ 
units then primarily exist in only four states (2), and 
ATP primarily binds to the closed state. The important 
consequence of this last feature is that it predicts 
a characteristic nonlinear relationship (3) between 
ATP sensitivity ([ATP] causing half-maximal inhibi¬ 
tion, Ki/ 2 ,atp) and open probability (Op) (Fig. If, 
right), one that is very clearly adhered to by both 
wild-type and mutant channels under a range of 
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experimental conditions (Enkvetchakul and Nichols 
2003). Changes in intrinsic open state stability (L) or 
PIP 2 affinity (Kp) cause the channel to move up or 
down this nonlinear relationship, whereas changes in 
ATP affinity (Ka) shift the channel onto a parallel 
relationship. One consequence of these two different 
effects is that it is possible to reduce apparent ATP 
sensitivity (i.e., Ki/ 2 ,atp) to the same value (dashed 
line in Fig. If, right), by two different mechanistic 
routes (i.e., change in Ka or L), with very differing 
consequences for open probability. 

Biochemical Details of Nucleotide Gating 

The cytoplasmic domains of each Kir6.2 subunit pro¬ 
vide the binding sites for ATP and PIP 2 , which respec¬ 
tively stabilize closed or open states of the channel. 
Consistent with the above kinetic models (Fig. le), 
molecular modeling indicates four ATP binding sites 
on the channel (Fnkvetchakul and Nichols 2003; 
Haider et al. 2005), each formed at the Kir6.2 subunit 
interfaces (Fig. Ic). As discussed above, kinetic ana¬ 
lyses indicate that unliganded subunits (i.e., not bound 
to ATP or PIP 2 ) are relatively unstable and that ATP 
and PIP 2 binding are mutually excluded. Subunits will 
therefore be predominantly bound to one ligand or the 
other at any one time. In the absence of ATP, wild-type 
subunits will be bound to membrane PIP 2 (and hence in 
the permissive “open” state) ^90% of the time. Since 
all four subunits need to be in the permissive (“open”) 
state for the channel to be active, this results in the 
observed open probability of ^0.5 (0.9^). Binding of 
ATP to the unliganded subunit will trap it in the 
nonpermissive “closed” state. With Ka "^10 pM for 
WT subunits, and with [ATP] being 2-3 orders of 
magnitude higher than this in energized cells, each 
WT subunit is probably more than 90% bound to 
ATP most of the time. Trapping of any one subunit in 
the closed conformation is sufficient to close the chan¬ 
nel, and so for most cells under any but the most 
extreme conditions, the expected open probability 
would be <0.H, i.e., negligible, without the stimula¬ 
tory effects of Mg-nucleotides acting at the SUR 
subunit (see below). 

Docking of ATP in molecular models of Kir6.2 
(Fig. Ic) has been guided by the results of numerous 
mutagenesis studies that provide candidate residues for 
the ATP binding site. Many mutations actually alter 
ATP sensitivity independently of ATP binding affinity, 
by altering the intrinsic stability of the open state 


(i.e., they control the open-closed equilibrium constant, 
L, see above) and only a few residues (e.g., R50 in 
the N-terminus, and 1182, K185, R201, G334 in the 
C-terminus) actually control ATP affinity directly. 
These residues are well separated in the primary 
sequence, but are all very close to one another at the 
inter-subunit interface in the model (Fig. Ic). Potential 
PIP 2 interacting residues (R54, R176, R177, R206) are 
near the ATP site and appropriately located to interact 
with PIP 2 headgroups (Fig. Ic). Overlapping ATP- and 
PIP 2 -binding sites, and biochemical competition of the 
two ligands for the non-ligand bound state, are very 
consistent with the proposed “negative heterotropic” 
interaction of these two ligands in channel regulation 
(Fnkvetchakul and Nichols 2003). 

However, it is still unclear how the binding of ATP 
or of PIP 2 is coupled to the closing or opening of the 
gate. One interesting feature of Kir channel structures 
is the “slide helix” that lies along the plane of the 
membrane and physically connects the N-terminus - 
which in Kir6.2 forms part of the ATP binding site - to 
the cytoplasmic end of Ml. It is interesting that natu¬ 
rally occurring disease mutations which affect the 
channel open state stability cluster along the slide 
helix of Kir6.2, highlighting a role for this helix in 
controlling channel gating. 

How Do Nucleotides Activate the Channel? 

In the face of ATP inhibition at Kir6.2, physiological 
activation of the Katp channel arises acutely from the 
interaction of Mg-bound nucleotides with the ATP 
binding sites (ABSs) of SUR (Fig. Id) (Nichols et al. 
1996). In bacterial NBFs, dimerization is favored by 
presence of nucleotide, and ATP hydrolysis occurs in 
a cooperative manner leading to the hypothesis that 
the nucleotide-bound dimer is the catalytically active 
species. There is a consensus understanding that 
Mg-dependent ATP hydrolysis at the dimeric SUR 
NBFs (Fig. Id) also provides the “power stroke” that 
overcomes the inhibitory effect of ATP on Kir6.2 
(Fig. lb). Mg^'^-dependence of a^^P- and y^^P-azido- 
ATP labeling of SUR NBFs provides indirect 
evidence that SUR NBFs do hydrolyze ATP (Matsuo 
et al. 2000). Trapping of channels in an activated state 
by vanadate (a transition state analog that mimics 
a post-hydrolytic, ADP-bound state) and in an 
inactivated state by beryllium fluoride (a transition 
state analog that mimics the pre-hydrolytic, 
ATP-bound state) (Zingman et al. 2001) add further 
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consistency to the hypothesis that ATP hydrolysis 
underlies channel activation. 

However, there have been few direct hydrolysis 
measurements on SUR NBFs, and many analogous 
mutations to those that have drastic effects on ATP 
hydrolysis in bacterial NBFs have only minimal effects 
on Katp activation (Bienengraeber et al. 2000). In 
patch-clamp experiments, channel activation by exog¬ 
enous MgADP is typically more effective than activa¬ 
tion by MgATP and putative “hydrolysis” mutations 
lead to decreased MgADP stimulation of Katp chan¬ 
nels (Gribble et al. 1997; Shyng et al. 1997; 
Bienengraeber et al. 2000). Why MgADP should so 
strongly activate the channel in excised patches if 
a hydrolytic cycle is required for activation is not 
clear, and leads us to suggest that the “activated” 
state resulting from hydrolysis should persist following 
MgADP dissociation long enough for rebinding of 
exogenous MgADP to be able to maintain the state. 

How Biophysical Understanding Informs 
Disease Mechanisms - And Vice Versa 

The biophysical properties of Katp channels discussed 
above are very well illustrated in both human diseases, 
and animal models, in which the Kir6 and SUR sub¬ 
units are mutated. Katp channels are widely expressed 
in vertebrate tissues and genetic manipulation in mice 
has revealed key consequences of loss of channel sub¬ 
units in different tissues, highlighting the coordinate 
nature of the two subunits in generating tissue-specific 
channel activity (Seino and Miki 2004). For instance, 
the same tissue-specific phenotype is generated in sev¬ 
eral cases when one or the other partner is knocked out: 
both SUR1~^~ and Kir6.2“^“ reiterate a glucose- 
insensitive insulin secretory phenotype due to loss of 
Katp in the pancreas (Seino and Miki 2004). SUR2“^“ 
and Kir6.2“^“ animals both lack muscle Katp and 
have a phenotype mimicking human Prinzmetal angina 
(Suzuki et al. 2001; Chutkow et al. 2002), with spon¬ 
taneous coronary vasospasm leading to early death, 
probably due to vasoconstriction as a result of reduced 
vascular Katp current. 

Katp Defects in Hyperinsulinemia 

Katp channels suppress insulin secretion by 
hyperpolarizing the P-cell membrane, and so loss 
of Katp channel activity is expected to cause 


hypersecretion. Congenital hyperinsulinemia (HI), 
a genetic disorder characterized by dysregulated insu¬ 
lin secretion, is the most common cause of hypoglyce¬ 
mia in infancy. More than 50 loss-of-function HI 
mutations have now been recognized in Katp genes 
(Nestorowicz et al. 1998). While some mutations are in 
Kir6.2, the majority are in SURl and most cause one of 
two major defects: (1) biosynthetic or trafficking 
defects that lead to absent or reduced surface expres¬ 
sion of channels or (2) loss of MgADP activation of 
expressed channels, which could reflect direct effects 
on ATP hydrolysis or ADP binding at the NBFs, or of 
defects in the coupling to channel opening. As a result, 
P-cells remain depolarized, with chronically elevated 
[Ca], and therefore persistently secrete. One of the 
most common HI mutations (SUR1[AFI388]) causes 
both defective trafficking and lack of surface expres¬ 
sion of functional Katp channels and is retained in the 
ER, similar to the CFTR[AF508] mutation (Cartier 
et al. 2001). Another single amino acid mutation 
(R1394H) also causes defective trafficking in mamma¬ 
lian cells, but rather than being retained in the ER, the 
protein appears to accumulate in the Golgi (Partridge 
et al. 2001). Eor such mutations, manipulations that 
allow correction of the trafficking defects might be of 
therapeutic value for the disease. 

Katp Defects in Neonatal Diabetes 

While loss of Katp activity causes persistent 
hypersecretion of insulin, gain of function causes the 
opposite, i.e., lack of secretion and diabetes, and mul¬ 
tiple mutations in Kir6.2 and SURl have now been 
found to underlie neonatal diabetes mellitus (NDM). 
All cause enhanced activity at any given level of 
[ATP]:[ADP] (Koster et al. 2005a), by (1) decreased 
apparent affinity of the ATP binding pocket for the 
nucleotide; (2) increased intrinsic stability of the 
open state; or (3) enhanced Mg-nucleotide activation. 
Consistent with a direct effect on binding, residues 
R50, 1182, and R201, all identified as mutated in 
NDM patients, were all previously implicated as likely 
ATP binding residues in Kir6.2 and all are located in 
the ATP binding site in the model channel (Pig. Ic). 
Y330 and P333, also found to be mutated in NDM 
patients, are predicted to lie close to the phosphate 
tail in the binding pocket (Haider et al. 2005). Muta¬ 
tions that alter channel gating in the absence of ATP 
can occur throughout the Kir6.2 subunits, and in the 
TMO domain of the SURl subunits. By stabilizing the 
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open state of the channel (increasing L), these muta¬ 
tions reduce ATP sensitivity, without change in ATP 
binding affinity (Enkvetchakul and Nichols 2003). 
Loss of ATP sensitivity coupled to high open proba¬ 
bility in Q52R, C42R, Y330C, I1296L, and V59G 
mutations indicates this mechanism is operating in 
these NDM mutations (Koster et al. 2005b). Q52 and 
V59 are located within the “slide helix” region of 
Kir6.2, lending consistency to the suggestion that this 
“linker” serves to physically couple the ATP binding 
site to the gating region. An effect on the gate region 
itself might explain the diabetes-causing effects of 
mutations at residue K170 (K170N, K170R), located 
at the M2 bundle-crossing. 

The realization that neonatal diabetes results from 
gain of Katp channel function, has rapidly led to 
shifting therapy from injected insulin to the painless 
use of oral sulfonylureas (Sagen et al. 2004). However, 
the understanding of the molecular basis of Katp func¬ 
tion also immediately predicts an underlying caveat: 
sulfonylureas also stabilize the closed state of the 
channel and sulfonylurea sensitivity is also dependent 
on the open state stability of the channel, such that 
mutations which increase open state stability also 
reduce sulfonylurea sensitivity (Koster et al. 2005b). 
Since different NDM mutations alter ATP sensitivity 
by two distinct mechanisms (by reducing ATP affinity 
or by increasing the open state stability), they also 
show differential sulfonylurea sensitivity (Koster 
et al. 2005b), and this mechanistic information may 
help tailor individual treatment for the disease. 
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Ketosteroid Isomerase - Computational 
Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

KSI; 3-oxo-A5-steroid isomerase 


Definition 

One of the most efficient enzyme catalysts known, 
involved in the conversion of cholesterol into 
testosterone. 

Basic Characteristics 

Ketosteroid isomerase (KSI) catalyzes the isomeriza¬ 
tion of 3 -0X0- A5-steroids into their 3-oxo-A4-steroid 


conjugated isomers, such as 5-androstene-3,I7-dione 
to 4-androstene-3,I7-dione. The reaction starts with 
proton abstraction from a C-H group adjacent to 
a carbonyl carbon. Such carbon acid deprotonation 
reactions have high activation barriers in solution, but 
KSI catalyzes the reaction such that the enzymatic 
reaction rate is near the diffusion limit. Several other 
enzymes, such as ► triosephosphate isomerase and 

► citrate synthase, catalyze similar proton abstractions 
from carbon atoms. There is wide interest in how these 
enzymes achieve efficient catalysis. In addition, KSI 
has been the subject of debates on the role of electro¬ 
statics in enzyme catalysis. Computer simulation of 
reactions has been important in these debates. For 
example, ► empirical valence bond (EVB) simulations 
of the reaction indicated that electrostatic stabilization 
of the enolate intermediate in the reaction accounts for 
the majority of the reduction in the reaction barrier 
(Feierberg and Aqvist 2002). However, experimental 
results for binding of phenolates, thought to be transi¬ 
tion state analogues, were used to argue that electro¬ 
static effects are not so important for catalysis: This 
was because only a small change in the binding energy 
was found with changes in charge delocalization in the 
phenolates. Further computational studies were able 
explain this apparent contradiction (Warshel et al. 
2007): While the binding of the transition state ana¬ 
logues was calculated to have only a small electrostatic 
contribution, catalysis of the reaction, which involves 
a charge transfer process, had a significant electrostatic 
contribution from the enzyme active site. Later, exten¬ 
sive simulations using the ► empirical valence bond 
method confirmed the important electrostatic interac¬ 
tions between the enzyme and the enolate intermediate 
(Chakravorty et al. 2009). This study also calculated 
the contributions of nuclear quantum effects and bar¬ 
rier recrossing to the reaction rate, and similar methods 
were used to calculate the effect of mutations in active 
site residues, giving results in line with experimental 
data (Chakravorty and Hammes-Schiffer 2010). 

Cross-References 

► Citrate Synthase - Computational Studies 

► Empirical Valence Bond Methods 

► QM/MM Methods 

► Triosephosphate Isomerase - Computational 
Studies 
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Synonyms 

Molecular motors 


Introduction 

Molecular motors, especially microtubule motors, play 
a critical role in the intracellular transport of membra¬ 
nous organelles, protein complexes, and mRNAs, and 
they also have a role in physiological processes such as 
brain wiring, higher brain function, neuronal survival, 
tumor suppression, and the determination of left-right 
asymmetry (Hirokawa et al. 2009a). These functions 
are conveyed by microtubule motor proteins that 
belong to the kinesin superfamily proteins (KIFs). 
Most kinesins (kinesin 1-12) have the conserved 
motor domains, the minimal domain required for 
motility, in the N-terminal region and these are called 
N-kinesins. Most N-kinesins are motile and move 
toward microtubule plus ends (Lawrence et al. 2004; 
Miki et al. 2005; Hirokawa et al. 2009a). Kinesin 13 


(M-kinesin) have their motor domain in the middle, 
and depolymerize microtubules instead of moving 
actively along them (Moores and Milligan 2006). 
Kinesin 14 (C-kinesin) have their motor domain in 
the C-terminal region, and move toward microtubule 
minus ends (Endow 1999). 

Kinesin 1, the first kinesin to be discovered, was 
identified as the motor for vesicle transport in neuronal 
axons (Vale et al. 1985; Brady 1985). Kinesin 1 forms 
dimer composed of globular motor domain, stalk 
domain, and tail domain where the light chain is bind¬ 
ing (Hirokawa et al. 1989). The mammalian kinesin 1 
family members are called KIF5A, KIF5B, or KIF5C 
and here we refer to kinesin 1 family members as KIF5. 
KIFIA was the first kinesin 3 family member to be 
identified, and they transport synaptic vesicle precur¬ 
sors in axons (Okada et al. 1995; Hirokawa et al. 
2009a). Both KIF5 and KlFl A have evolved to convey 
long distance transport in neuronal axons, and there¬ 
fore they can both move further than 1 pm along 
microtubules without fully detaching from them. This 
high processivity enabled detailed single-molecule 
studies of their mechanism of motility. 

To explain the processive movement of KIF5, the 
hand-over-hand model has been proposed, in which 
kinesin dimers can use two heads to bind to microtu¬ 
bules (Block et al. 1990). The hand-over-hand model 
postulates that these two heads alternate so that the 
kinesin is always attached to the microtubule by at 
least one head during movement (Fig. la). On the 
other hand, studies of KIFIA reveal that a single 
KIFIA monomer can move along microtubules 
processively (Okada and Hirokawa 1999). Single¬ 
molecule studies of KIFIA demonstrate that its mov¬ 
ing monomeric head is weakly supported by an elec¬ 
trostatic interaction between the positively charged 
K-loop and the negatively charged C-terminal 
E-hook of the microtubule (Fig. lb). This enables 
KIFIA to move by one-dimensional diffusion to the 
next binding site without fully detaching from the 
microtubule. This diffusive monomeric movement 
was initially thought to be unique to KIFIA. However, 
KIF5 motor domains have the potential to move diffu¬ 
sively in a monomeric state, as monomeric KIF5 con¬ 
structs move diffusively when the K-loop of KIFIA is 
introduced into KIF5. Thus, kinesins have been 
thought to move by a conserved mechanism of motil¬ 
ity, rather than by mechanisms that are unique to each 
kinesin. 
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Fig. 1 Motility models of K1F5 and KIFIA. (a) Hand-over- 
hand movement of K1F5. Binding of ATP to the leading head 
pulls the lagging head forward in a movement known as the 
power stroke. The new leading head then finds the next micro¬ 
tubule binding site by diffusional search and binding triggers this 
head to release ADP. (b) Diffusion based movement of a KIFIA 
monomer. The power stroke by the binding of ATP can bias the 
position of the cargo to the plus end. Electrostatic interaction 
between the K-loop of KIFIA and the E-hook of microtubules 


enables one-dimensional diffusion to the next binding site with¬ 
out full detachment of the kinesin from the microtubule. Prefer¬ 
ential binding to the plus-end side of the microtubule biases the 
movement toward this end. It is noted that the motile property of 
each monomer both in KIF5 and KIFIA is quite similar; strong¬ 
binding in the ATP state with full-docking of the neck-linker, 
active detachment from the microtubule during the ATP hydro¬ 
lysis, diffusional binding in the ADP state, and then weak-to- 
strong transition during the ADP release (Images are modified 
from Hirokawa et al. (2009b)). 


The energy for the movement of kinesins is sup¬ 
plied by the hydrolysis of ATP. ATP is hydrolyzed in 
the “catalytic center” in the kinesin motor domain, and 
the liberated energy is somehow conveyed to the 
“actuator” to produce movement or mechanical work. 
Binding of ATP in the catalytic center leads to the 
strong-binding of kinesin to the microtubule (strong¬ 
binding state) through the conformational change of 
the actuator. ATP is then hydrolyzed in the catalytic 
center, somehow inducing the reverse conformational 
change of actuator to generate the weak-binding of 
kinesin to the microtubule (weak-binding state). The 
key to understanding the mechanism of kinesin motil¬ 
ity is thus to identify the catalytic center and the actu¬ 
ator, and the link between them, and to obtain 
“snapshots” of high-resolution structures for the 
many intermediate states of the ATP hydrolysis reac¬ 
tion. The conformational changes during the reaction 
would then be clearly understood. Recently, atomic 
structures of the motor domain were solved for 
many different kinesin families (Kull et al. 1996; 
Sablin et al. 1996; Ogawa et al. 2004), mostly in the 
post-hydrolysis MgADP-bound form. These structures 
are very similar even among the N-, M-, and 


C-kinesins, which are consistent with the conserved 
mechanism of kinesin motility. Up to now, only the 
structure of KIF1A has been solved for almost all of the 
intermediate states of the ATPase cycle (Kikkawa et al. 
2001; Nitta et al. 2004, 2008; Hirokawa et al. 2009b). 
In this entry, therefore, a structural model for how 
kinesins move by a general mechanism based on 
these KIFIA structures is discussed. 


Structure of the Conserved Kinesin Motor 
Domain 

The kinesin motor domain is composed of a single 
globular “catalytic core” and a short strand “neck- 
linker” (Fig. 2). The catalytic core domain contains 
both the ATPase catalytic center and the microtubule 
binding surface. The neck-linker extends from the 
C-terminus to connect the catalytic core to the rest of 
the molecule. 

Kinesins belong to the Walker-type ATPase family 
based on the following structural features of their cat¬ 
alytic core. A layer of central (3 sheets is sandwiched 
between two layers of a helices (Fig. 2b). The 
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Kinesin: Fundamental Properties and Structure, 
Fig. 2 Structure of the KIFIA motor domain seen from the 
bottom {microtubule binding side) (a) from the right side (b) 
and from the top (catalytic center side), showing the three- 

N-terminal half of the catalytic core domain forms the 
top a helix layer (green in Fig. 2b, c) and the central (3 
sheet layer (pink in Fig. 2b), and contains the ATP 
hydrolysis catalytic center. Loop L4 corresponds to the 
phosphate-binding loop (P-loop) and forms a shallow 
groove on the top surface of the catalytic core, which is 
the nucleotide binding pocket (catalytic center). The 
C-terminal half of the catalytic core domain contains 
a series of five structural elements: loop LI 1, helix tx4, 
loop L12, helix a5, and loop L13 (yellow in Fig. 2a, b). 
This region is known as “switch U” due to its analogy 
to the switch 11 structural element in G-proteins and it 
forms the microtubule binding surface on the bottom a 
helix layer. As described below, switch II, along with 
the neck-linker, serves as the actuator to produce 
mechanical work. The link between the P-loop and 
switch II is called “switch I” which locates around 
loop L9 and it catalyzes the hydrolysis of ATP by 
the nucleophilic attack of the y-phosphate of ATP 
(Fig. 2a, c). Thus, switch I moves and changes confor¬ 
mation according to the nucleotide state of the catalytic 
center. These conformational changes are accompa¬ 
nied by rearrangement of the salt bridges between 
switch I and switch II, which triggers the repositioning 
and the large conformational changes of switch 11. 
Switch I thus links the chemical states in the catalytic 
center to the actuator, switch II microtubule binding 
surface. 


layered structure of the nucleotide binding helices (green), the 
central P-sheets (pink) and the microtubule binding helices 
(yellow). Neck-linker is colored red. Nucleotides (ATP/ADP) 
are shown as the spacehlling models 


Switch II: Actuator for the Mechanical 
Movement 


K 


The key conformational change between the strong¬ 
binding state and the weak-binding state is the rota¬ 
tional movement of the switch II helix a4. This large 
movement is readily noticed when viewed from 
microtubule binding surface (Fig. 3a). The helix a4 
conformations in the strong-binding ATP state and in 
the weak-binding ADP state are so-called the “up” 
conformation and “down” conformation, respectively, 
by the analogy of myosin. Accompanied with this 
movement, the lengths of the loops at the left and 
right sides of this helix, the loop Lll and loop L12 
(K-loop), change. The implications of this conforma¬ 
tional change on the microtubule affinity were 
elucidated by docking these crystal structures into the 
cryo-EM structures of KIFlA-microtubule complex 
(hybrid method). 

In the “down” conformation (weak-binding ADP 
state), the end of a4 helix is partially melted to elon¬ 
gate the K-loop which interacts with the flexible acidic 
E-hook of microtubule (Eig. 3c). This interaction is 
responsible for the weak-binding in which KIFIA dif¬ 
fuses freely along the microtubule. Furthermore, the 
microtubule binding surface of KIFIA is flat and 
smooth with a center line of basic residues which is 
widely conserved among KIF motors (Fig. 3c). This 
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Kinesin: Fundamental Properties and Structure, 
Fig. 3 Switch II movement contributing to the alternation of 
the two binding modes, (a) Rotational movement of switch II 
helix a4 with the alternative elongation of the neighboring loop 
Lll and K-loop. The “up” and “down” conformations are 
colored red and yellow, respectively, (b). Pseudo atomic model 
of KIFlA-microtubule (white) complex. Helix a4 is fixed in 
the groove between a-tubulin and P-tubulin both in the strong¬ 
binding state and weak-binding state so that the nucleotide- 
induced conformational change of switch II rotates the motor 


domain around the helix 0(4. (c) KIFlA-microtubule binding 
during the weak-binding state. Elongated K-loop on the right 
side as well as the basic center line contribute to the 
one-dimensional diffusive-binding along the microtubule, 
(d) KIFlA-microtubule binding during the strong-binding 
state. Elongated loop Lll on the left side mainly contributes to 
the tight binding. Repulsive acidic cluster at the minus-end side 
might not only stop the diffusive movement, but also bias the 
binding of KIFIA toward the plus end (Images in the panels c 
and d are reproduced from Hirokawa et al. (2009b)) 


basic center line of KIFIA faces to the axial line on the 
surface ridge of microtubule protohlament, along 
which acidic residues are aligned (acidic ridge). The 
nonlocalized electrostatic interaction among the center 
line residues will restrict KIFIA to follow along the 
protohlament axis and to support the ID-diffusion of 
KIFIA without getting off the microtubule track like 
a linear-motor train. 

In the “up” form (strong-binding ATP state), the 
helix a4 shifts and rotates three dimensionally 
(Fig. 3d). The end of a4 is lifted up by the neck-linker 
detailed later, and a4 now protrudes on the microtu¬ 
bule binding surface. With this protrusion and rotation, 
the microtubule binding surface of KIFIA takes the 


zigzag shape, which perfectly hts to the surface of 
microtubule protohlament. The protrusion of 0(4 hts 
into the intra-tubulin-dimer groove (Fig. 3b). With this 
up conformation, the beginning of a4 helix is melted to 
elongate the loop Lll, which interacts with the Hll' 
and H3' helices of a-tubulin at the bottom of the intra- 
tubulin-dimer groove (Fig. 3b, d). KIFIA thus uses two 
microtubule binding loops, K-loop and Lll, alter¬ 
nately for the weak-binding and the strong-binding, 
respectively. This rotation also brings a cluster of the 
acidic residues to the minus-end side of the center line. 
This repulsive element at the minus-end side of KIFIA 
might bias the binding of KIFIA toward the plus end 
(Okada et al. 2003). 
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Kinesin: Fundamental Properties and Structure, 
Fig. 4 Schema to illustrate the chemo-mechanical coupling of 
kinesin ATPase. Chemical (nucleotide) states and mechanical 
states are shown in the top line and bottom line, respectively. 
Conserved Arg216 in switch I serves as a chemo-mechanical 


ATP Hydrolysis Generates Strong to Weak 
Transition of Microtubule Binding 

Transition from the strong-binding “up” conformation 
to the weak-binding “down” conformation is coupled 
to the ATP hydrolysis reaction in the catalytic center. 
The crystal structures of ATP hydrolysis intermediate 
states clarified the atomic link of this coupling. The 
“up” conformation of switch II is stabilized by the 
“back door” salt bridge between Glu253 of switch II 
at the beginning of LIl and Arg 216 of switch I 
(Fig. 4). This “back door” locates outside of the y 
phosphate of ATP, and serves as the gate through 
which the product phosphate is released from the cat¬ 
alytic center. The nucleophilic attack of the adjacent 
residue Ser215 to the y phosphate catalyzes the hydro¬ 
lysis reaction, and breaks the back door salt bridge 
interaction. The product phosphate is released through 
the opening of the back door. The breakage of the back 
door interaction liberates the L11 and triggers the 
conformational change of switch II. Lll is wound up 
into the helix (x4 so that it cannot reach the binding site 
on Qc-tubulin, resulting in the detachment of KIFIA 
from the microtubule. Rotational movement of a4 
then flattens the KIFIA surface and rearranges the 
distribution of the charged residues on its surface so 
as to prepare for the following diffusional mode. 
Finally, the C-terminal part of the helix a4 melts into 


coupler by making the two alternative salt bridge, namely, the 
“backdoor” and the “latch.” Switch I and switch II in the ADP 
state are also linked to the microtubule sensor which is involved 
in the microtubule activating pathway for the ADP release from 
kinesin (Images are modified from Nitta et al. (2008)) 


the K-loop, resulting in the elongation of K-loop to 
interact with E-hook of microtubule. Thus, the chem¬ 
ical reaction of ATP hydrolysis is coupled to the con¬ 
formational change of the microtubule binding 
element switch II: rearrangement of the zigzag shaped 
surface into the flat surface, and the switching from the 
Lll-mediated strong-binding to the K-loop-mediated 
weak-binding. 


K 


ADP Release Is Coupled to Weak-to-Strong 
Transition of Microtubule Binding 

ID-diffusional movement of KIFIA in the weak- 
binding state along the microtubule enables it to search 
for the next strong-binding site on the microtubule. In 
this mode, KIFIA traps ADP tightly in the pocket and 
this diffusion continues until the microtubule triggers 
the release of ADP from KIFIA. A high-resolution 
crystal structure of KIFIA with bound Mg ADP 
explains the atomic mechanism of slow ADP release 
(Nitta et al. 2008; Hirokawa et al. 2009b). ADP on the 
surface shallow groove formed by the P-loop is stably 
covered with a “Mg-water cap,” a layer of crystal ice 
formed by the hydrogen bond network around the 
essential cofactor Mg^"^ (Figs. 4 and 5a). This 
Mg-water cap also fills the gap between the 
P-phosphate of ADP and switches I and II. The 
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Fig. 5 Atomic mechanism of nucleotide exchange of kinesin. 
(a) Structures of the KIFIA motor domain before 
release. An oblique left-side view (left panel', a schematic in 
the inset shows the orientation) and a left-side view to show the 
key structural elements surrounding loop L7 (rightpanel). The 
Mg^"^ stabilizer interaction among Glul48 (in loop L7), 
Arg203 (in switch I) and Asp248 (in switch II) anchors the 
Mg^'^-water cap, which keeps ADP in the nucleotide binding 
pocket. Latch interaction among Tyrl50 (in loop L7), Arg216 
(in switch I) and Glu267 (in loop Lll) pulls up loop Lll and 
stabilizes switch II in the “down” conformation. Loop L7 is the 
pivot for these two sets of interactions. The charged tip resi¬ 
dues Glul52 and Argl53 of loop L7 are exposed to the 

nucleotide binding pocket is thus closed in the absence 
of microtubules. 

Structures of Mg "^-release intermediates of KIFIA 
explain the mechanism of microtubule-induced 


microtubule surface (Argl58 and Glul59 in helix H4 of 
P-tubulin). The attractive interaction between these dipole 
pairs pulls loop L7 toward the P-tubulin. Loop L7 can thereby 
sense the binding to the microtubule, hence its name of micro¬ 
tubule sensor, (b) Structures of the KIFIA motor domain after 
Mg^"^ release. An oblique left-side view (left panel) and a left¬ 
side view to show the key structural elements surrounding loop 
L7 (right panel). The downward movement of loop L7 (white 
arrow, right panel) breaks both the Mg^'^-stabilizer and the 
latch. The nucleotide binding pocket is thereby opened to 
allow the immediate exchange of ADP for ATP, and loop 
Lll extends to the groove between a- and P-tubulin to make 
a strong interaction (Images are reproduced from Hirokawa 
et al. (2009b)) 

kinesin activation. Switch I changes its conformation 
largely during Mg^"^ release. After ATP hydrolysis, 
switch I is linked to the L7 loop and switch II by two 
sets of hydrogen bonds (Figs. 4 and 5a). One set of 
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hydrogen bonds forms close to the N-terminal end of 
the switch II a4 helix, and stabilizes the conformation 
of the microtubule binding surface in the post¬ 
hydrolysis position, which is referred to as the 
“Latch.” The other set of hydrogen bonds forms at 
the root of the L7 loop. Glul48 in L7 makes 
a hydrogen bond with Arg203 in switch I, which inter¬ 
acts with Asp248 in switch II. These three residues not 
only link switches I and II to L7, but also make 
a hydrogen bond with the Mg-water cap, thus serving 
as the “Mg-stabilizer” or an anchoring point for the 
Mg-water cap. The release of Mg is accompanied by 
breakage of the two sets of hydrogen bonds between 
L7 and switches I and II. 

The intermediate structures of Mg release suggest 
that the downward movement (toward the microtu¬ 
bule) of loop L7 triggers the breakage of its hydrogen 
bonds to switches I and II (Figs. 4 and 5b). The tip of 
L7 has conserved residues, Glul52 and Argl53, which 
face the oppositely charged residues Argl58 and 
Glul59 in the helix H4 of P-tubulin that is located on 
the plus-end side of the microtubule (Fig. 3c). These 
tubulin residues are well conserved in P-tubulins but 
not in a-tubulins. This interaction, therefore, would 
contribute to the ability of kinesins to discriminate 
between P- and a- tubulin, and help to explain the 
directional bias for the diffusional movement of 
kinesins. 

Hence, the loop L7 might act as a triggering lever 
for Mg release and the nucleotide exchange that 
follows (Fig. 4). The root of L7 might have an essential 
role in bringing switch I, switch II and MgADP 
together to close the nucleotide binding pocket. Its 
tip might be the microtubule sensor that is selective 
for p-tubulin. If it is, the binding of P-tubulin 
would pull this sensor loop L7 downward to trigger 
breakage of the Mg-stabilizer and loss of the Mg-water 
cap. The nucleotide binding pocket would then be 
fully open (Fig. 5b), and the nucleotide readily 
exchanged without conformational changes. Consis¬ 
tent with this idea, the atomic structure of KIFIA 
immediately before and immediately after ADP-ATP 
exchange is essentially the same, yet the atomic struc¬ 
ture of KIFIA without bound nucleotide is still 
missing. 

After nucleotide exchange, the nucleotide binding 
pocket closes as a result of the formation of the back¬ 
door salt bridge between the L9 loop of switch I and the 
Lll loop of switch II. The conformational changes in 


the microtubule binding surface and the neck-linker 
enable this closure of the nucleotide binding pocket, as 
described below. 

Dual Roles of Neck-Linker: for the ATPase 
Regulation and for the Power Stroke 

The neck-linker of N-kinesins is located at the 
C-terminus of the catalytic core, next to the a6 helix, 
and it connects the catalytic core to cargo and to the 
partner head in kinesin dimers (Fig. la). Considering 
that the neck-linker is the sole element that links the 
motor domain to the cargo, or to another motor domain 
in the case of kinesin dimers, the neck-linker is 
expected to have important and conserved roles in 
linking the reactions in the kinesin catalytic core to 
outside elements. 

Previous cryo-EM studies suggest that the neck- 
linker has two conformations (Rice et al. 1999). The 
neck-linker is fully detached from the catalytic core in 
the ADP state. Upon nucleotide exchange from ADP to 
ATP, the neck-linker docks to the catalytic core, which 
is considered the power stroke for kinesin hand-over- 
hand movement. Crystal structures have revealed 
details of the neck-linker docking to the core (Fig. 6). 
The neck-linker is composed of three elements. A short 
segment called the neck-initial segment (NIS) is 
followed by two P-strands, p9 and piO. Among these 
three elements, the NIS is essential for the ATPase 
reaction cycle, while the latter two p-strands of the 
neck-linker are required for the power stroke. 

Before Mg-release, the neck-linker is not attached 
to the catalytic core (undocked) (Fig. 6b). The down¬ 
ward movement of the “microtubule sensor” loop L7 
triggers the opening of the nucleotide binding 
pocket. This conformational change is coupled to the 
rearrangement of the microtubule binding surface into 
the conformation with high affinity to microtubules, as 
described above. This high affinity conformation of a4 
is supported from the back side through the hydrophobic 
interaction between conserved Ile354 in the NIS and 
Leu285 in the a4 (Fig. 6a, c). A NIS deletion mutant 
severely abrogated the ATPase activity of the kinesin 
catalytic core and NIS alone, without the neck-linker 
elements P9 and plO, could restore most of the ATPase 
activity (Nitta et al. 2008), indicating that the docking of 
NIS is required for the ATPase reaction. Hence, the 
docking of NIS is required to stabilize the open 
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Kinesin: Fundamental Properties and Structure, 
Fig. 6 Conformational change of the neck-linker coupled to 
the switch II movement, (a) Structure of the neck-linker. Neck- 
linker is composed of the three elements, neck-initial segment 
(NIS), (39, and (310. (b) Undocked neck-linker in the weak- 


confomation of the kinesin catalytic center by 
supporting the protruded conformation of helix a4 and 
this regulation is applicable for the cooperative behavior 
between two motor domains moved in hand-over-hand 
motility. 

At the nucleotide exchange step, the middle neck- 
linker element P9 (pink in Fig. 6d) docks to the core 
and forms a rigid triple p-strand structure with the N- 
terminal strand PO (blue in Fig. 6d) and the L13 loop 
(orange in Fig. 6d). Then, the third neck-linker element 
PIO docks to the catalytic core (Fig. 6e), and switch 
I binds to the y phosphate of Mg-ATP by the closure of 
the nucleotide binding pocket (this step is called as 
“isomerization”). Mutation of these P sheets supports 
their role as the lever arm for the power stroke, as the 
ATPase activity is not affected when they are mutated. 


binding state with ADP. (c) Docking of the NIS which is coupled 
to the Mg^"^ release prior to the ADP release, (d) Docking of 
the (39 during the nucleotide exchange, (e) Docking of the (310 in 
the strong-binding state with ATP 


but the motility is severely impaired. The impairment 
of motility is most significant with P9 mutants, which 
suggests that P9 docking is the major power stroke. 
PIO docking might contribute more to the force pro¬ 
duction rather than the stroke size. 


Summary 

The structures of molecular motor KIFIA in the vari¬ 
ous nucleotide conditions gave an insight into the 
mechanisms of motility for both KIFIA and kinesins 
(KIFs) in general. Switch II makes the major binding 
site for microtubules and serves as an actuator for the 
mechanical movement of kinesin. The conformational 
change of switch II is driven by the hydrolysis of ATP 
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at the catalytic center. Switch I is situated between 
switch II and the catalytic center and links them. 
Switch I and switch II are further linked to the micro¬ 
tubule sensor loop L7 and the NIS of the neck-linker, 
both of which are key regulators for sensing kinesin- 
microtubule binding and the external load of the 
kinesin. This structural model could serve as the pro¬ 
totype for the atomic model of the regulatory mecha¬ 
nisms of kinesin motors. 


Cross-References 

► ATP-Driven Mechanical Work Performed by 
Molecular Motors 
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Synonyms 

Molecular motors 


Introduction 

Kinesin Superfamily Proteins (KIFs) comprise 
a family of closely related motor proteins that, in 
general, transport cargo to specific destinations within 
the cell using microtubules as rails and adenosine 
triphosphate, ATP, as the source of energy (reviewed 
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in Hirokawa et al. 2009b). Most KIFs move in either 
the direction of the plus or minus end of microtubules. 
The growing spectrum of cargo includes vesicular 
organelles, mitochondria, RNA-protein complexes, 
protein complexes, intra-flagellar rafts, and chromo¬ 
somes. Attributed functions other than transport range 
from microtubule depolymerization (Desai et al. 1999) 
to DNA repair and transcription regulation 
(Midorikawa et al. 2006), cell growth signaling regu¬ 
lation (Zhou et al. 2009), or mitotic spindle assembly 
(Vanneste et al. 2011). KIFs are expressed in all 
eukaryotes studied thus far, in many different cell 
types. There are 45 KIF gene loci in the human or 
mouse genome (Miki et al. 2001). A model plant 
Arahidopsis thaliana has 60, the fruit fly Drosophila 
melanogaster 25, nematode Caenorhahditis elegans 
19, budding yeast Saccharomyces cerevisiae 6, and 
a parasitic protozoa Leishmania major 50 (Miki et al. 
2005). The number of kinesins in each organism may 
vary among studies due to different data versions, dif¬ 
ferences in the definition of a KIF, use of loci number or 
expressed sequences, etc. The total number of KIFs in 
model organisms, such as mice, have long been pursued 
(Aizawa et al. 1992; Nakagawa et al. 1997). The fact 
that the cell uses a limited number of KIFs to transport 
innumerous cargo to specific destinations would indi¬ 
cate an elaborate system comprised of splice variants of 
motors and/or various adaptor molecules. Various KIFs 
are lethal when knocked out in mice, and the cause and 
time of death is diverse. Dysfunction of KIFs gives rise 
to neuronal migration defects caused hy uncontrolled 
microtubule branching during brain development 
(K1F2A; Homma et al. 2003), lack of oculomotor 
nerve innervation leading to congenital fibrosis of the 
extraocular muscle type 1 (K1F21A; Yamada et al. 
2003), or absence of nodal cilia resulting in randomized 
left-right asymmetry (KIF3; Nonaka et al. 1998) just to 
give some examples. The diversity of KIF function in 
critical biological processes indicates how important 
and involved this protein superfamily is in living organ¬ 
isms. As revealed by early electron microscopy studies, 
a typical KIF molecule has a globular head at one end, 
a fan-shaped tail at the reverse end, and a long filamen¬ 
tous region in between (Hirokawa et al. 1989). The head 
contains the microtubule-binding sites and ATP- 
hydrolyzing sites, generating force and is referred to as 
the catalytic core. The tail binds adaptor proteins such as 
light chains or binds directly to cargo. The filamentous 
region, the stalk, contains a hinge enabling the tail to 


interact with the head. The head contains ^360 amino 
acid residues that are highly conserved and defines the 
superfamily. The ATP-binding P-loop, ATP y-phosphate 
binding switch I, and force-producing switch II are espe¬ 
cially well conserved (Hirokawa et al. 2009a). Adjacent 
to the catalytic core is the family-specific neck which 
functions in generating a power stroke. The catalytic core 
and neck comprise the motor domain. The majority of 
KIFs possess the motor domain in the amino terminus; 
however, some KIFs such as members of the Kinesin-14 
family have motor domains close to their carboxyl ter¬ 
minal. Additionally, as some Kinesin-13 constituents, 
some have motor domains toward the middle of the 
protein. Using motor domain sequences, phylogenetic 
studies have been conducted. 

General Topology of the Phylogram 

Amino acid sequences of KIFs can be found in protein 
or genomic databases or research group home pages 
(e.g., http://cb.m.u-tokyo.ac.jp or http;//www.cellbio. 
duke.edu/kinesin/). Collected sequences were aligned 
using multiple alignment software such as MAFFT 
(Multiple Alignment using Fast Fourier Transform; 
Katoh et al. 2002). After manual inspection and editing, 
Bayesian Inference was used to calculate the relatedness 
of each sequence. Several research groups have pro¬ 
duced phylogenetic trees using hundreds of KIF 
sequences. The resulting topology of the respective 
trees is fundamentally identical though using slightly 
different data sets or methods of alignment and/or tree 
building. Branching within families is highly reproduc¬ 
ible as indicated by high bootstrap values. However, 
branches or nodes, between families are not supported. 
This suggests the relatedness between families is not 
certain. In 2004, the Kinesin superfamily research 
community proposed a standardized nomenclature for 
all Kinesin families (Lawrence et al. 2004). 
A phylogenetic tree, using 623 KIFs, was adopted as 
a foundation for the classification and can be viewed 
here: http://cb.m.u-tokyo.ac.jp (see also Fig. 1 and 
Miki et al. 2005). There are fifteen families. The fami¬ 
lies are numbered in an order in which Kinesin-1 
contains the first KIF identified and the final Kinesin- 
14A and 14B contain members with motor domains at 
the C-terminal end. Representative members are listed 
in Table 1. Basic domain structures are indicated 
in Fig. 2. 
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Kinesin Superfamily 
Classification, 

Fig. 1 Amino acid sequences 
of the catalytic core from 
623 KIFs were aligned using 
the MAFFT package, and the 
phylogenetic tree was inferred 
by Bayesian Inference. The 
classification is based on the 
branching of this tree and the 
class-specific features in the 
full length sequence 
(Fig. 2 and supplemental data). 
Fourteen families from 
Kinesin-1 to Kinesin-14B were 
identified. KIFs that do not 
group to these 14 families are 
classified as orphans. Taxa 
definition derives from the 
NCBI taxonomy page, v (red) : 
vertebrate, i (pink) : insect, 
n (purple) : nematode, 
p (green) : higher plant, 
f (blue) : fungi. Full tree and 
family-specific trees can be 
viewed at http://cb.m.u-tokyo. 
ac.jp (Modified from Miki 
et al. 2005) 
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Kinesin Superfamily Classification, 

Table 1 Representative family members 


Family 

Founding 

member(s) 

Representative members 

Reported function/structural 
features 

Member no.“ 

Kinesin-1 

LpKHC 

DmKHC 

KIF5B, KHC, NKin, KLPl, 

KinA, DdK5 

Vesicle transport, conventional 

3/1/1/4/3 

Kinesin-2 

MmKIF3A 

StrPuKRP85/95 

KIF3A/3B, KIF17, Kip85/95, 

Osm3, FlalO 

Vesicle-intraflagellar transport/ 
heterotrimeric 

4/3/3/0/0 

Kinesin-3 

CeUncl04 

MmKIFlB 

KIFIA, KIFIB, KIF13A, KIF16B, 
UNCI 04 

Organelle transport/monomeric 

8/4/2/0/1 

Kinesin-4 

MmKIF4A 

KIF4A, KIF21A/B, chromokinesin 

Organelle transport, 
chromosome movement 

5/3/2/3/1 

Kinesin-5 

AnBimC 

SchPoCut? 

KIFll,Eg5, BimC, CIN8, 

KIPl, Cut7 

Spindle formation/ 
homotetrameric, bipolar 

1/1/1/4 

Kinesin-6 

CgCHOl 

KIF20, KIF23, RabbKinesin, CHOI, 
MKLPl, Zen4, MPPl 

Cytokinesis, spindle polarity 

5/2/1/0/1 

Kinesin-7 

ScKip2 

HsCENP-E 

KIF 10, CENP-E, CMET, CANA, 

KIP2 

Kinetochore microtubule 
capture 

1/2/0/14/2 

Kinesin-8 

DmKLP67A 

KIF18A/18B KIF19A, KLP67A, 

KIP3 

Nuclear migration, 
mitochondrial transport 

3/2/1/2/0 

Kinesin-9 

CrKLPl 

KIF6, KIF9, KRP3, CrKLPl 

Unclear 

2/0/0/0/0 

Kinesin-10 

DmNod 

KIF22, KID, Nod 

Chromosome segregation/ 
Helix-hairpin-Helix DNA- 
binding motif 

l/l/O/l/O 

Kinesin-11 

ScSmyl 

KIF26A, KIF26B, VAB8, SMYl 

Signal transduction/divergent 
catalytic core 

2/1/1/2/0 

Kinesin-12 

XlXklp2 

KIF12, KIF15, HKLP2, KLP54D, 
Xklp2, PAKRPd 

Organelle transport/ 
homologous tail 

2/1/0/6/0 

Kinesin-13 

MmKIF2A 

CgMCAK 

KIF2A, MCAK, XKCMl, PfKinI 

Microtubule depolymerizing/ 
central motor 

4/3/2/1/1 

Kinesin-14A 

ScKAR3 

DmNCD 

KIFCl, CH02, Ned, Kar3, KatA 

Chromosome segregation/C- 
terminal motor 

1/1/4/4/1 

Kinesin-14B 

AtKCBP 

KIFC2, KIFC3, KatD, KCBP, KIF25 

Organelle transport/C-terminal 
motor 

3/0/1/16/0 

Orphans 


CeKLPlO, CeKLPlS, DdK9 

Ungrouped 

0/0/2/2/1 




Total 

45/25/19/60/10 


“Numbers of members in WamaaJDrosophilalC. elegans/ArabidopsisIDictyostelium 
Modified from Miki et al. (2005) 


Kinesin-1 

The first KIF to be reported, conventional Kinesin or 
Kinesin Heavy Chain (KHC), is contained within this 
family. Members of this family have a family-specific 
neck P-sheet and a highly conserved coiled-coil stalk. 
These KIFs are reported to function as intracellular 
organelle transporters or are involved in nuclear 
movement. 

Kinesin-1 is comprised of three subfamilies, animal 
KHC, fungal/plant KHC, and atypical KHC. Animal 
KHCs, such as KIFS A, KIF5B, and KIF5C, have both 


a cargo binding domain (CBD) in the stalk and light 
chain binding domain (LCBD) which binds to an 
adaptor-functioning kinesin light chain (KLC). In con¬ 
trast fungal/plant KHCs lack LCBDs. Animal KHC 
can form a heterotetramer with a pair of KLCs, while 
fungal KHCs are homodimers. 

Atypical KHCs lack both CBDs and LCBDs but 
contain a transmembrane domain or ARM-repeat 
domain, which may enable direct cargo binding. Atyp¬ 
ical KHCs in plants may substitute functions of the 
Kinesin-2 family, which is lacking in higher plants as 
described below. 
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Kinesin Superfamily 
Classification, 

Fig. 2 Structures and 
domains of representative 
members of each family and 
subfamily. Characteristic 
features of respective family 
and subfamily were deduced 
from amino acid alignments 
and results from the SMART 
server (Reproduced with 
permission from Miki et al. 
2005) 
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Kinesin-2 

These KIFs were identified secondly to conventional 
Kinesin. Three Kinesin-2 subfamilies exist: the 
KIF3A, KIF3B/C, and K1F17 subfamilies. The 
KIF3A subfamily contains clusters of positively and 
negatively charged residues just after the neck p-sheet, 
downstream of the catalytic core, whereas KIF3B/C 
molecules have clusters of oppositely charged residues 
in corresponding positions. This region enables mem¬ 
bers of these two subfamilies to heterodimerize. In 
contrast, KIF17 subfamily members are reported to 
homodimerize. KIF17 knockout mice have memory 
impairment (Yin et al. 2011). 

Family constituents are known to participate not 
only in organelle transport but also intraflagellar trans¬ 
port and spermatogenesis. Interestingly members of 
this family have not yet been described in fungi or 
higher plants; for example, they are absent from 
Magnoliophyta, plants which do not produce sperm 
or possess cilia or flagella. It would be interesting to 
know whether members of this family exist in plants 
such as Pterophyta or gymnosperms, which produce 
sperm. KIF3s are known to closely associate with 
Kinesin Adaptor Protein 3 or KAP3, an adapter mole¬ 
cule containing Armadillo (ARM) repeats. Atypical 
plant KHCs in the Kinesin-1 family contain ARM 
repeats which may function as an intrinsic adaptor 
molecules at the location of adaptor protein binding. 

Kinesin-3 

The founding member of this family was identified as 
the third N-terminal motor domain kinesin. Major 
characteristics of this family are the (I-sheet and helix 
in the family-specific neck region followed by 
a Forkhead-Associated (FHA) domain, a site of 
protein-protein interaction. This family is comprised 
of five major subfamilies, namely, the KIFl, KIF13, 
KIF14, KIF16, and KIF28 subfamilies, and an 
uncharacterized minor group, the NcKIFlC group. 

Members studied thus far are organelle transporters. 
This family includes KIFs found in animals, fungi, and 
multiple protozoa though not found in higher plants. 
Members are reported to be monomeric or 
homodimeric. The presence of monomeric motors is 
a major characteristic of this family and provides 
a starting point for analysis of the foundation of 


movement. Knockout mice which lack members of 
the KIFl subfamily die shortly after birth. Extensive 
studies have shown that some members play crucial 
roles in neurons by transporting synaptic vesicle pre¬ 
cursors down the axon (Niwa et al. 2008). 

Kinesin-4 

The kinesin-4 family represents an “extended” family 
of several kinesin subfamilies. The number “4” was 
taken from KIF4 as a mnemonic device. “KIN 
N-Chromo” or the “Chromokinesin” family are past 
references to this family, though not all members of 
this family bind to chromosomes, and not all KIFs that 
bind to chromosomes belong to this family. 

The neck (I-sheet and coiled-coil stalk are well con¬ 
served. Most KIFs in this family can be further classified 
into five subfamilies: the KIF4, KIF7, KIF21, KIF27, 
and NcKIF21A subfamilies. A relatively distant protein 
species is found in flagellates. 

Reported functions of members of this family range 
from organelle transport to chromosomal movement. 
Murine KIF4A is reported to regulate the activity of 
PARP-1, an ADP-ribose polymerase, resulting in con¬ 
trol of the survival or apoptosis of juvenile neurons in 
an activity-dependent manner. 

It is intriguing that the KIF4 subgroup, known to 
function both in the nucleus and cytoplasm, is con¬ 
served in higher plants and animals, whereas the 
KIF21 subfamily, reported to function in neuronal 
cells, is only present in animals. 

This family, as well as the Kinesin-2 and -3 fami¬ 
lies, has more family members in higher vertebrates, 
less members in lower vertebrates, and none or few in 
plants. In contrast, the Kinesin-5, -7, and -I4B families 
have multiple members from individual plant species 
but much fewer per animal species. 

Kinesin-5 

The family number “5” was taken from a member in 
animals, Eg5. This family is the most conserved, mono- 
phyletic family. The sequences of family members 
have not diverged in comparison to other families, and 
this family cannot be subdivided into smaller groups. 
Members have a family-specific neck adjacent down¬ 
stream of the catalytic core, a characteristic BimC box 



Kinesin Superfamily Classification 


1197 


domain close to the C-terminus and sparse coiled-coils 
in between. The BimC box motif is reported to be 
phosphorylated by p34/Cdc2, which localizes to micro¬ 
tubules and contributes to cell division. This kinase is 
found in species containing Kinesin-5 family members. 
Constituents are reported to form homotetramers and 
are known to be mitotic motors functioning in the 
formation of a bipolar spindle. 

Kinesin-6 

The designated family number “6” was taken from 
RabbKinesin, a member of this family and also from 
the past family name “N-6.” The family-specific neck 
region is loosely conserved in this family. A major 
characteristic is the long insert in loop L6 of the catalytic 
core. This loop may function in a putative regulatory 
mechanism of motor activity as it faces the cytoplasm, 
enabling access to cytosolic factors that may act as 
regulators. This family can be further subdivided into 
two subfamilies. Members of the KIF20 subfamily 
derive from fungi, animals, and a slime mold. The 
K1F23 subfamily is comprised of KlFs from animals. 

This family functions in cytokinesis or microtubule 
transport. Rab6Kinesin/KIF20A is the first KIF to be 
reported to bind to a Rah; however, later studies 
enlighten a role in cytokinesis. This latter function is 
shared among KIFs whose functions are known in this 
family and may explain why members of this family 
are not found in plants, which undergo cytokinesis in 
a different manner from animals. 

Kinesin-7 

The family designation “7” has been retained from the 
previous family number. 

Constituents possess a long, family-specific neck. 
The long protein species found in animals and plants 
are abundant in coiled-coil structure whereas other 
isoforms are shorter and contain less coiled-coils. 
This family is comprised of one major subfamily 
along with variants in plants. Though in animals this 
family is monophyletic, duplication and deviation is 
observed in plants. 

Members are reported to have a role in mitosis, such 
as microtubule capture at kinetochores and nuclear 
migration. Arahidopsis contains 15 members and rice 


has six members. Though conserved in a wide spectmm 
of species, abundance of members of this family in 
plants suggest fundamental but diverse roles in higher 
plants. A possible function of Kinesin-7 members in 
plants is the microtubule capture at phragmoplasts, the 
preprophase band or cortical arrays. 

Kinesin-8 

Constituents of this family have a helical family-specific 
neck. This family can be further divided into two: an 
animal-specific subfamily containing KIF18A/B and 
a ubiquitous subfamily that includes KIF19A. 

Various functions have been reported for this fam¬ 
ily from nuclear and cytoplasmic function of Kip3 in 
budding yeast, mitochondrial transport of Klp67A in 
Drosophila, and chromosome segregation during mito¬ 
sis in fission yeast by Klp5 and Klp6. Members of 
this family have been erroneously referred to as 
“Kinl” KIFs, a previous reference to the Kinesin-13 
family, due to their predicted microtubule 
destabilizing feature. Yeast Kip3 has been reported to 
possess both anterograde motility and microtubule 
catastrophe-inducing properties (Gardner et al. 201 1). 

Kinesin-9 

The designation “9” was taken from a family member, 
KIF9. This is a novel family identified by the family- 
specific neck adjacent downstream to the catalytic 
core. The amino acid sequence in the neck is conserved 
within this family. Catalytic core sequences also 
loosely establish this family. 

Members separate into two subfamilies and are not 
found in invertebrates, fungi, or higher plants. The 
precise function of family members is uncertain; how¬ 
ever, the absence of this family in higher plants sug¬ 
gests a function in cilia and flagella as in the case of 
Kinesin-2. Polymorphism in the KIF6 gene is impli¬ 
cated in cardiovascular disease risk (Bare et al. 2007). 

Kinesin-10 

This family contains KIF22, KID (for Kinesin-like 
DNA-binding protein), and Nod (from the phenotype 
when mutated, “No distributive Disjunction”). 
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Preservation of a family-specific neck sequence is 
weak contrary to most other families. However, 
a family characteristic Helix-hairpin-Helix DNA- 
binding motif class 1 (HhH) is present in the C-terminal 
tail. Members are predicted to bind directly to chro¬ 
mosomes and are identified in eukaryotes from all 
kingdoms. Inclusion of the tail sequence clearly iden¬ 
tifies Nod as a family member through the HhH motif. 

KIFs in this family have been reported to have 
a function in cell division, especially chromosomal 
movement which may be dependent on their HhH 
DNA-binding motif. 

Kinesin-11 

The founding member of this family is Smyl, which 
does not bind to microtubules perhaps due to a deviant 
catalytic core. The characteristic feature of this family 
is the highly divergent catalytic core. No conservation 
of a family-specific neck sequence is inferred. 

Family members are reported to participate in sig¬ 
nal transduction and may not be motile especially 
along microtubules, although the switch I and switch 
II motifs in the catalytic core, which function in ATP 
hydrolysis and are essential for motility, remain intact. 
Recently, a member from mouse, KIF26A, has been 
reported to play a crucial role in the cell growth 
signaling pathway. 

Kinesin-12 

The family number was taken from a member, KIF12, 
as a mnemonic device. This family is formed by the 
combination of families formerly known as “N-9,” 
“N-10,” and “AtPAKRPl” {Arahidopsis thaliana 
Phragmoplast-Associated Kinesin-Related Protein). 
This unification is justified by amino acid sequence 
homology in the C-terminal, which is abundant in 
coiled-coil structures. It is not unequivocal whether 
the KIF12 subfamily and animal and plant KIFI5 
subfamilies are phyletic or not. 

The three subfamilies share a family-specific neck 
P-sheet. The KIF15 and Plant KIF15 subfamilies have 
a weak homology in the C-terminal coiled-coil region. 
Conservation of the tail sequence within this family 
leans toward a common or related function for family 
members. 


Kinesin-13 

The first identified members of the family have cata¬ 
lytic cores in the middle of the molecule and have, 
therefore, been placed after N-terminal motors 
(12 families) and before C-terminal motors, Kinesin- 
14. The Kinesin-13 family, in the past, had been 
referred to as the M Kinesin family for “Middle type 
motor” or KinI family for “Internal type motor” since 
founding members of this family have catalytic cores 
at the center of the protein. 

This family possesses a conserved positively 
charged helical neck upstream of the catalytic core. 
Two subfamilies, the animal-specific KIF2 subfamily 
and the ubiquitous KIF24 subfamily, exist. The former 
has a catalytic core at the center, and the latter, at the 
N-terminal. 

The crystal structures of KIF2C and PfKinl, 
a protozoan homolog of KIF24 have been solved, 
elucidating the mechanism of the depolymerizing 
function. 

The limitation of the presence of the KIF2 subfam¬ 
ily in animals and the reported function implicate roles 
of these KIFs in animal-specific cell division associ¬ 
ated with the mitotic centromere and in neuronal 
development. 

Kinesin-14 

The position of the catalytic core places this group at 
the end of the current families. All members of this 
family have neck sequences N-terminal to the catalytic 
core, though some KIFs, such as AtKatD (KatD stands 
for Kinesin-like protein in Arahidopsis thaliana D), 
have catalytic cores approximately at the center. 
These structurally reversed KIFs have also a reversed 
motility, indicating the importance of the order of 
structural components for the direction of movement. 

Two distinct subfamilies comprise this family. 

Kinesin-14A 

Constituents of this family possess a family-specific 
neck helix before the catalytic core at the C-terminal. 
This family consists of one well-conserved subfamily 
deriving from organisms in all four kingdoms. 

Reports thus far imply these are mitotic motors 
functioning in nuclear fusion in yeast or multiple 
functions in higher organisms. 
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Kinesin-14B 

Members have a distinct family-specific helical neck 
region adjacent upstream of the catalytic core. Three 
major subgroups have been inferred. The KIFC2 
subgroup members are expressed in animals and 
participate in retrograde transport of vesicles. The 
KatD subgroup is comprised of multiple paralogs 
from plants, and members have a Calponin Homology 
(CH) domain. The possession of this domain suggests 
that these KIFs may retrieve endocytosed vesicles 
interacting with the actin cytoskeleton and then trans¬ 
port them along microtubules. 

The KCBP (Kinesin-like Calmodulin-Binding Pro¬ 
tein) subfamily is closely conserved in plants and has 
a relatively loosely conserved member in sea urchin. 
The abundance of members found in plants may relate 
to the lack of cytoplasmic dyneins in plants since 
both are retrograde motors, indicative of potentially 
redundant functions. 

K1F25 is assigned to the Kinesin-14B family by 
phylogeny and has a C terminal type neck upstream 
of the catalytic core at the genomic and cDNA level. 
Though orthologs exist in human and fish, there are no 
orthologs in rodents. 

Orphans 

Nine KIFs were not affiliated with any of the 14 fam¬ 
ilies. They are three KIFs from nematodes, one 
protochordate KIF, one slime mold KIF, one found in 
tetrahymena, two from Giardia, and one from 
Leishmania. Protozoan KIFs have distinct sequences. 
Many are not affiliated with any subgroup within 
a family. This represents the diversity and divergence 
the protozoans have undergone during evolution. 
As more protozoa are analyzed, more divergent KIFs 
can be expected. 

Summary 

The classification of Kinesin Superfamily proteins is 
presented. The founding study of KIF classification 
catagorized 98.5% of 623 KIFs into 15 families. As 
the organism evolves, the motor domain may also 
evolve and attain a new function or the molecule may 
stay relatively unchanged maintaining the original 
function. Essential in basic cellular functions, KIF 


families have constituents from the four species king¬ 
doms. Newly identified KIFs may be categorized into 
one of the 15 families. The function may be inferred 
from this classification. 


Cross-References 

► Domain Structure Classifications 

► Kinesin: Fundamental Properties and Structure 
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Definition 

Kinetic isotope effects reflect the consequences of the 
change in atomic mass on the kinetics of a reaction 
when a heavy isotope (e.g., deuterium) is substituted 
for a lighter atom (e.g., protium). The information 
obtained can help identify the contribution of bond¬ 
making and bond-breaking steps to the rate-limiting 
step of a reaction pathway, as well as assessing the role 
of quantum tunneling in a mechanism. 

Basic Characteristics 

Isotopes are widely used to study chemical mechanisms 
as they can identify the fate of atoms within a reactant 
as it is converted to product. A classic example is the 


identiflcation of the site of hydrolysis of a compound 
by determining the fate of an (heavy oxygen) 
atom when hydrolysis is carried out in H2**0. Heavy 
isotopes can also change the kinetics of a reaction 
and this may also provide valuable information 
(Cook 1991; Cook and Cleland 2007; Fersht 1999). 
In classical theory this effect is attributed to depen¬ 
dence on the mass of the frequency (v) of the funda¬ 
mental bond vibrations. 

V = (ks/p)*/^/27i (1) 

where kg = the spring constant of the bond and p = the 
reduced mass of the atoms (A and B) involved 
(= mA.mB/(mA -i- me)). 

The kinetic isotope effect (KIE) is deflned as the 
ratio of the observed rate constants for the natural and 
heavy isotopically labeled substrate, for example, 
using protonated (ka) compared with deuterated sub¬ 
strate (ko), KIE = ka/ko- The KIE is most marked for 
the hydrogen atom because the relative mass is 
increased approximately two- and threefold for deute¬ 
rium and tritium respectively, compared with the pro¬ 
tium. If the rate-limiting step of a reaction involves 
breakage or formation of a C-H bond, then substitution 
of D for H to give a C-D bond can lead to a 6-10 times 
slower reaction. Such an effect is termed a primary 
isotope effect in contrast to smaller secondary isotopes 
effects where the bond broken or formed is one or more 
atoms away from the site of isotopic substitution. 
A smaller primary kinetic isotope effect can also be 
observed if the transition state for transfer of hydrogen 
is not symmetric with respect to H. In addition, smaller 
primary KIEs for an overall reaction will be observed 
when the bond breaking or forming step is only par¬ 
tially rate limiting - a situation termed kinetic com¬ 
plexity. This problem may be resolved by using 
► transient-state rather than ► steady-state kinetic 
methods to follow the effect of isotopic substitution 
on individual steps. 

The magnitude of the primary KIE for H compared 
with D can be explained by the quantum mechanical 
minimum (zero point) energy = hv = 'A * 6.6 x 
Iq-m * y g ^ 10*^1 per molecule =15.7 kJ/mole for a 
typical R-H bond. Erom Eq. 1, Vq/Vh = (mH/mo)^^^ 
= 0.7; therefore, the zero point energy =11 kJ/mole 
for the R-D bond. Given an extra 4.7 kJ/mole is 
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required to break the R-D bond, the effect on the rate 
constant at 20° C according to the Arrhenius equation 
(k = will he: 

ln(kH/kD) = -(Ea*^ - Ea°) /RT = 4.7/2.4 
= 1.96, 

hence kn/ko = 7.1 at ambient temperatures. 

KIE values much greater than this may indicate 
a breakdown in the Arrhenius theory of activation 
energy and suggest that hydrogen can “tunnel” through 
the activation energy harrier. In this context, tunneling 
refers to the quantum mechanical concept that particles 
do not have a precise location and hence can exist 
with a small probability on the other side of the 
harrier (Allemann and Scrutton 2009). H atoms tunnel 
much more efficiently than D atoms and hence give 
rise to a large KIE. Eurthermore, tunneling is not 
temperature-dependent (cf. the Arrhenius equation 
above) and hence the KIE value is temperature- 
insensitive for this mechanism. From the analysis of 
secondary isotope effects and comparison of deute¬ 
rium with tritium, even primary deuterium isotope 
effects in the classical range of 6-10 are thought 
to reflect a large contribution from tunneling. For 
atoms other than hydrogen, kinetic isotope effects 
are much smaller and more difficult to measure from 
rate data per se. However, they can often be measured 
with high accuracy by determining relative concentra¬ 
tions of isotopes in dual-labeled samples (isotope 
fractionation). Tunneling distances are also much 
shorter for non-hydrogen atoms and therefore, data 
are currently interpreted using classical transition- 
state theory. 


Examples 

Alcohol dehydrogenase provides a classic example for 
the application of kinetic isotope effects. Early studies 
using deuterated substrates yielded substantial primary 
kinetic isotope effects, implicating hydride transfer as 
a major contribution to the rate-limiting step (Klinman 
1972; Shore and Gutfreund 1970). Subsequent second¬ 
ary isotope effect measurements and comparison of 
ka/ko with ka/kx values revealed further complexities 
which implied a tunneling mechanism for hydride 
transfer (Klinman 1989; Roston and Kohen 2010). 


KIEs have been widely used to study tunneling 
mechanisms in other enzymes (Allemann and Scrutton 
2009) and to assess the role of protein vibrations in 
bringing reactants to within a distance for efficient 
tunneling (e.g., morphinone reductase (Pudney et al. 
2009)). 


Cross-References 

► Steady-State Enzyme Kinetics 

► Transient-State Kinetic Methods 
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Synonyms 

Chemical kinetics 

Introduction 

The root of Kinetics comes from kinesis (movement) 
and, in general, it is a term applied to characterisation of 
phenomena with respect to time. Every measurement is 
carried out as a function of time and hence there is 
potentially a kinetic element to every experiment. This 
section focuses on chemical kinetics which involves the 
investigation and interpretation of reaction rates. Here 
the concentrations of chemical species or states are 
measured with time, principally for the determination 
of reaction mechanism. Kinetics may also be used 
empirically to determine the concentrations of species. 
This is the main-stay of many clinical assays. 

Kinetics is fundamental to many aspects of protein 
and nucleic acid chemistry, be it ligand binding, catal¬ 
ysis or folding (Goodrich and Kugel 2006; Gutfreund 
1995). Clearly kinetic studies are central to the charac¬ 
terisation of enzymes as catalysts and many of the 
principles of biochemical kinetics have been deter¬ 
mined using such proteins. Ligand binding is of more 
general significance; however this is most often studied 
under equilibrium conditions. There is, however, much 
to be gained by using kinetic approaches. Firstly, some 
kind of kinetic measurement is required to establish that 
an assumed equilibrium is attained. This is particularly 
important for high-affinity reactions (K^j < nM) where 
reactions may take minutes or hours to come to equilib¬ 
rium. Secondly, by determining the association (koJ 
and dissociation (koff) rate constants, an independent 
measure of K^ (= koff/koJ may be calculated 
which should confirm the value deduced from the 


concentration quotient determined at equilibrium. 
Thirdly, the biological importance of a reaction may 
be more dependent on its kinetics than the end-point 
concentration. Ca^"^ signalling is now known to involve 
intricate dynamics which tune into different receptor 
proteins and so determination of Kj values alone may 
miss many important functions. Finally, only kinetics 
can answer mechanistic questions (i.e., the pathway 
taken) because, by definition, the energetics of the start 
and end states are independent of the route taken. For 
example, ligand binding to a protein is often accompa¬ 
nied by a structural change of the protein. A substrate 
could bind to an open state of the active site which 
subsequently closes to make more favorable interac¬ 
tions (an ► induced fit mechanism). Alternatively, the 
apo enzyme could be in dynamic equilibrium with a low 
concentration of the closed conformation which selec¬ 
tively binds the substrate (► confomiational selection 
mechanism). These mechanisms are not mutually exclu¬ 
sive, but in practice one is likely to dominate. Equilib¬ 
rium measurements will only determine the 
concentration of the bound substrate - not how it was 
formed. Stractural methods may complement kinetic 
studies, but on their own they can also be ambiguous. 
Even if the apo enzyme takes on two conformations, the 
flux via each route cannot be determined from 
the structure. Of course, knowing the structure of the 
bound substrate-enzyme complex may demand that 
some kind of induced fit must occur if the substrate is 
found to be so occluded that it can only have bound 
there via a more open conformation. 


Kinetic Theory 

Chemical kinetics is based on fundamental rate laws 
derived from the ► Law of Mass Action. The simplest 
chemical reaction is a unimolecular conversion A ^ B 
which follows first-order kinetics i.e., the rate of for¬ 
mation of B is proportional to concentration of A. The 
proportionality constant is the so-called rate constant 
or rate coefficient, k. Hence, the rate for formation of 
B = d[B]/dt = k.[A]. 

Binding reactions, such as A H- B ^ C lead to 
second-order kinetics where the rate for formation of 
C = d[C]/dt = kon.[A].[B]. These expressions become 
complex when the reactions are reversible. 
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For illustration of the basic concepts, it is useful to 
consider the simplest mechanism for enzyme catalysis, 
where enzyme E catalyses the conversion of substrate 
S into product P: 


ki k2 

E + S ^ ES E + P (1) 

k-i 

There are three basic ways in which this reaction 
may be investigated and the rate constants calculated. 
S may be held at a known constant concentration in 
large excess over the enzyme concentration. Under 
these conditions ES will reach a steady concentration 
and product P will be produced at a constant rate. This 
is the basis of ► steady-state kinetics where d[P]/ 
dt = k 2 .[ES]. When [S] approaches infinity, all of the 
enzyme will be in the ES state (i.e., saturated) and 
hence [ES] = [E]o, the total (or initially added) enzyme 
concentration (Comish-Bowden 1995). Erom this, the 
catalytic rate, kcat may be calculated from the maxi¬ 
mum rate of product production per unit of enzyme. If 
the product and enzyme concentration are expressed in 
the same units of molarity, then kcat has the units of 
reciprocal time and provides a measure of k 2 . Steady- 
state kinetics cannot directly assess the value of the 
rate constants for the binding step (ki and k_i) but 
instead yields a quotient, the Michaelis constant. Km 
(= (k_i+k2)/ki). 

A second way to measure k 2 would be to suddenly 
prevent the formation of ES and allow it to decay to 
zero in real time. This is an example of ► transient 
kinetics (Gutfreund 1995; Johnson 2003). One way of 
doing this would be to mix a large excess of enzyme 
with substrate such that there is only sufficient reac¬ 
tants for a single turnover i.e., ES would quickly form 
via the first step and subsequently decay to product via 
k 2 . This is a more demanding experiment in that a high 
enzyme concentration is required and the reaction is 
likely to be complete within milliseconds, so demand¬ 
ing specialised equipment for its measurement. Under 
ideal conditions, when [E] is sufficiently high to ensure 
rapid and complete binding of S to give the ES com¬ 
plex, P will form exponentially according to the equa¬ 
tion [P] = [S]o (1 — One important feature of 

this equation is that k 2 is determined independently of 
[E] (provided the latter is known to be in excess of [S]). 
Compare this with the steady-state approach where an 


accurate estimate of the active enzyme concentration is 
required. If the steady-state and transient kinetic 
methods agree in their determination of k 2 then it is 
likely the enzyme is fully active. If the steady-state 
derived rate constant is slower, it suggests some of the 
enzyme is inactive. More complicated situations can 
arise where the enzyme is heterogeneous or the reac¬ 
tion mechanism is more complicated such that the first 
turnover differs from subsequent turnovers. ► Tran¬ 
sient kinetic analysis can reveal (or at least provide 
limits for) the values of ki and k_i. 

A final way to investigate this enzyme reaction 
would be to look at a ► single molecule of E under 
a microscope and measure how long S remains bound 
to it. This direct approach would have been just 
a thought experiment a few decades ago, but it has 
now been achieved in a number of favourable cases. 
However, even though this approach would be difficult 
to apply to all systems, it highlights some important 
concepts in kinetics; in particular that a rate constant 
provides a measure of a probability. Eor a first-order 
reaction such as ES ^ E -i- P, the concentration of ES 
decays exponentially with rate constant k 2 . The time 
for [ES] to decay to 1/e of its initial value is known as 
the time constant (= l/k 2 ) and represents the mean 
lifetime of the ES complex. Note this is slightly differ¬ 
ent from the half-time of the reaction which represents 
the time for the ES complex to decay to Vi its value 
(= 0.69/k2), and is the median time of the decay. Any 
particular ES molecule has a 50% probability of 
decaying within the half-time, but those that survive 
go on to have a 50% probability of decaying within the 
next half-time period and so on. The mean lifetime is 
longer than the half-time because it is weighted by the 
“tail” of molecules which, by chance, survive several 
half-time periods. In a single molecule assay, the 
observed lifetime of the ES complex will take on 
a range of values, whose distribution will be exponen¬ 
tial with a mean and median time corresponding to 
the ensemble value. Clearly, in a single molecule 
experiment, a large number of these stochastic 
events will need to be measured to build up such 
a distribution. The power of single molecule measure¬ 
ments becomes evident when more complex distribu¬ 
tions are obtained which reveal heterogeneity that 
may be masked in ensemble steady-state assays. Eor 
a simple reaction, longer and shorter event times 
should be randomly distributed whereas bunching of 
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these events indicates the enzyme is switching 
between active and less active states on a time scale 
slower than the individual catalytic cycles. 

Another important concept from this example is 
defining the actual time it takes for the reaction step to 
occur. There is a case for considering all reactions 
involving chemical bonds to occur on the time scale 
of their natural vibrational frequency i.e., picosec¬ 
onds (10“*^ s). However, the net (observed) reaction 
takes much longer than this owing to the unfavourable 
probability of all the chemical species involved being 
in the right place and orientation. This is akin to the 
basic concept of the transition state, where any par¬ 
ticular reaction is considered to comprise two steps; 
a highly unfavourable “equilibrium” leading to the 
transition state, followed by a rapid (10^^ s“') decay 
to product. In the case of proteins and other macro¬ 
molecules, reaction kinetics are likely to be limited by 
conformational rearrangements of their tertiary struc¬ 
ture, rather than the actual “chemistry” that occurs at 
the active site. This concept of conformational 
gating applies to both enzymes that adhere to the 
transition-state model, as well as redox enzymes for 
which tunnelling is implicated (Pudney et al. 2009). 
Identifying rate limiting steps is aided by the use 
of heavy isotopes which have intrinsically slower 
bond breaking kinetics (Cook and Cleland 2007). In 
addition to these ► isotope effects on the observed 
rate, isotopes are also useful in mechanistic studies to 
determine which bonds are cleaved in chemical 
reactions. 

Kinetic Assays 

► Steady-state kinetic assays are widely used to char¬ 
acterise enzymes because only catalytic amounts of 
enzyme are required (Engel 1981; Taylor 2002). 
Furthermore, only the disappearance of substrate or 
appearance of product is measured, making procedures 
generally applicable. However the information 
obtained is limited to a measure of the net flux through 
the pathway, kca,, which may reflect one or more rate- 
limiting steps. In addition a Kn, for each substrate can 
be determined, which provides information about the 
effective affinity of the substrate. In the case of multi¬ 
substrate reactions, steady-state methods can reveal 
information regarding the order of binding of sub¬ 
strates and release of products. Kj values of inhibitors 


and activator ligands may also be determined, as well 
as their mechanism of action (e.g., whether they bind to 
the same or different sites compared with the 
substrate). Measurement of substrate disappearance 
or product appearance can be followed continuously, 
typically using a spectrophotometric assay, or discon- 
tinuously by stopping the reaction and subjecting the 
quenched solution to analysis (e.g., chromatography). 
The former has the advantage of immediately reveal¬ 
ing the period when the reaction is in the steady 
state (i.e., the progress curve is linear), whereas the 
latter is more general and can provide a direct measure 
of [S] or [P]. 

In such assays three phases can be identified: (1) the 
pre-steady state refers to the initial transient phase 
which is usually undetected as it is complete within 
the first second and is of low amplitude (comparable 
to the enzyme concentration), (2) the steady-state phase 
which gives rise to a linear phase - the so-called initial 
rate, which is used for analysis, (3) the post steady-state 
phase where the reaction becomes non-linear due to loss 
of substrate and/or build-up of product. This phase may 
contain additional information but the contributions of 
the various factors may be difficult to untangle. In 
screening assays, a single time point may be measured 
in which it is assumed that the reaction is still in the 
steady-state, but it is good practice to measure more that 
a single time point to check this assumption (Fig. 1). 

It follows from point (1) above that detection of the 
transient phase generally requires higher enzyme 
concentrations than used in steady-state assays and 
sub-second time resolution. The latter requires dedi¬ 
cated instrumentation to achieve rapid mixing (e.g., 
► continuous-flow, ► stopped-flow and ► quenched- 
flow) on the millisecond time scale (Barman and 
Travers 1985; Eccleston et al. 2001; Hiromi 1979). 
However, such methodology reveals additional infor¬ 
mation about the binding steps and chemical transfor¬ 
mations that cannot be assessed by steady-state 
methods. With such instrumentation reactions can 
also be followed in which there is no steady-state 
phase (e.g., single turnover conditions). These 
methods are also applicable to ligand binding and 
protein folding assays in which there is no chemical 
transformation. Reactions are usually followed by 
optical spectroscopy (absorption, ► fluorescence) 
owing to good sensitivity and time resolution, but 
many other methods have been used, including mag¬ 
netic resonance methods. ► Nuclear magnetic 
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Kinetics: Overview, Fig. 1 Transient and steady-state phases 
of an enzyme-catalysed reaction for the scheme shown in (1), 
withkj = 1 pM"'.s“', k_i = 0.1 s“',k 2 = 2 s~* and [£]□= 1 pM 
and [S]o =10 pM. The concentrations are shown for the ES 
complex (black line), S (blue line) and P (green line). The 
transient phase is complete within about 0.2 s, after which 
point [ES] remains almost constant and S disappears linearly 
and P is produced linearly during the steady-state phase. To 
investigate the latter phase in practice, a larger ratio of [S]o to 
[E]o would be chosen 

resonance (NMR) itself has relatively low sensitivity 
and is therefore only comhined with rapid-mixing 
methods in specific cases. However, the NMR spec¬ 
trum may contain inherent kinetic information in the 
line widths, and can provide information about rate 
constants over a wide time scale (ps to s). Kinetics 
can also be followed by exchanging solutions that flow 
over a reactant that is immobilised on a solid support 
(Chaiken et al. 1992). While such methods may be 
limited to a time resolution of seconds, detection pro¬ 
cedures involving surface plasmon resonance and 
related techniques are sufficiently sensitive that low 
concentrations (< pM) may be used and enable bind¬ 
ing steps of high affinity reactions to be quantified. 

Transient phases can also be studied by perturbing 
a pre-existing equilibrium reaction (e.g., using temper¬ 
ature jump, pressure jump or electric field jump) which 
constitutes the topic of relaxation kinetics (Bemasconi 
1976). These have the advantage of better time resolu¬ 
tion than mixing methods, but also may reveal hidden 
transients between states in equilibrium. However, the 
amplitudes of relaxation transients are generally 
smaller than for the corresponding mixing method 
and their applications more restricted. In some reac¬ 
tions, transients can be initiated by a brief flash of light. 
Such ► flash photolysis experiments can have a time 
resolution well below nanoseconds. 


It is important to state that transient kinetic methods 
are not always synonymous with rapid reaction tech¬ 
niques. Some complex biological reactions occur on 
the seconds time scale or longer and can be analysed 
using non-specialised equipment. Indeed, high affinity 
interactions and enzyme-substrate complexes with low 
Kn, values (< pM) may be more easily characterised 
using transient kinetic approaches, because the 
assumptions associated with equilibrium and steady- 
state methods may be invalid. On the other hand, low 
affinity reactions (Kj > mM) are often associated with 
sub-millisecond kinetics and are beyond the scope of 
standard rapid-mixing methods. 

Ultimate sensitivity is provided by ► single mole¬ 
cule methods in which reactions are followed mechan¬ 
ically (optical trap, magnetic trap or atomic force 
microscopy) or optically (Gell et al. 2006; Selvin and 
Ha 2008). To achieve efficient detection, microscope- 
based optics are used. Single molecule methods have 
the advantage of not requiring a physical event to 
trigger the reaction, as is needed to synchronise mole¬ 
cules in ensemble methods. They also have the poten¬ 
tial to reveal rare events that would be masked in 
ensemble methods. However there remains a problem 
of determining if such rare events reflect the bulk 
population or a minor contaminant. Single molecule 
data are invariably accompanied by a large contribu¬ 
tion from stochastic noise arising from limited photon 
counts and thermal noise. Single molecule methods 
may involve following many molecules, each for 
a brief period of time with single molecule sensitivity. 
Non-scanning confocal microscopy provides a suitable 
optical arrangement. This approach is capable of better 
resolution of populations than ensemble methods. 
Alternatively one molecule may be followed for an 
extended period of time - although not always long 
enough for it to go through all chemical/conforma¬ 
tional states. To build up good statistics, it is preferable 
to obtain data in parallel from several tens to hundreds 
of molecules measured in parallel. This requires wide- 
field imaging but low background, such as provided by 
total internal reflectance fluorescence microscopy. 

Data Analysis 

Data analysis has two aspects. Firstly, the raw kinetic 
traces are analysed to extract values for rates, rate con¬ 
stants and associated parameters e.g., amplitudes and 
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stoichiometries. Secondly, these parameters are used to 
test models (reaction pathways). As is any quantitative 
science, only incompatible models can be ruled out. 
Compatible models cannot be proven to be correct; 
rather the simplest compatible model is accepted. 

One appeal of steady-state kinetics is that the raw 
rate data are, by definition, linear which simplifies data 
analysis. Furthermore, many of the derived rate equa¬ 
tions as a function of concentration can be rearranged 
into linear expressions. However, the latter approach 
for analysis is not desirable as it distorts the distribution 
of random errors around the mean. Fitting directly to 
non-linear expressions is therefore a better approach 
(Motulsky and Christopoulos 2004). Analytical expres¬ 
sions for transient kinetics are also routinely analysed 
using non-linear least-squares methods. However 
mechanisms do not need to be very complex before 
exact analytical expressions cannot be derived. In 
these cases, data are compared to simulated data 
derived for one or more models, and optimal values 
for the parameters are estimated by minimising the 
deviations between raw and simulated data. Care is 
required to make sure that the fitting procedure reaches 
a true minimum rather that a local minimum. Also some 
parameters may be poorly defined, suggesting the 
experimental data does not contain sufficient informa¬ 
tion for an exact solution to the model. Note that errors 
in parameters reported by fitting routines are only 
approximation. In particular, it is unlikely that errors 
will be symmetrical about the most likely value, so any 
programme in which parameter are returned with a ± 
error value should be viewed with caution. 

Model building usually starts with the simplest pos¬ 
sible one step reaction and building up the complexity 
if the data warrant additional steps to minimise sys¬ 
tematic deviations. However, adding more steps can 
always reduces the net variance and statistical tests are 
required to justify the additional complexity. The abil¬ 
ity to fit data directly to complex models does not 
render the use of analytical expressions redundant. 
Knowledge of the form of the kinetics profile for 
a given combination of states and steps helps to select 
the most likely minimal pathway. 

Finally, multiple sets of data may be fitted simulta¬ 
neously, a procedure known as global fitting. Provided 
that the data sets contain sufficient information, it is 
possible to extract unique values for some parameters 
which would be poorly defined for any single data set. 
However, such an approach is best avoided as a first 


step in the analysis; rather a minimal scheme is first 
derived with estimates of the dominant parameters that 
can be used as input values for subsequent refinement 
by global fitting. 

Summary 

Concepts from chemical kinetics find general applica¬ 
bility in biophysics and have the potential to yield 
unique information about a system that cannot be 
gleaned by equilibrium techniques alone. While the 
basic concepts and analytical approaches have a long 
history, new experimental approaches have continued 
to enrich the field for over two centuries. One of the 
remaining challenges is the measurement of kinetics 
within living cells where the effective concentrations 
may be very different from the test-tube (Minton 
2006). The overview provided here serves as an intro¬ 
duction to the other kinetic entries. 

Cross-References 

► Continuous Flow 

► Flash Photolysis 

► Fluorescence 

► Kinetic Isotope Effects 

► Law of Mass Action 

► Molecular Recognition: Lock-and-Key, Induced Fit, 
and Conformational Selection 

► NMR Methods for Kinetic Analysis 

► Nuclear Magnetic Resonance Imaging 

► Order of Reaction 

► Quenched-Flow Methods 

► Single-molecule Methods 

► Steady-State Enzyme Kinetics 

► Stopped-Elow Techniques 

► Transient-State Kinetic Methods 
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Kinetics: Relaxation Methods 
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School of Biosciences, University of Kent, 
Canterbury, Kent, UK 

Synonyms 

Perturbation methods; Rapid kinetic methods; 
Temperature jump 

Definition 

In relaxation methods a chemical or biological 
system at equilibrium is perturbed by a rapid change 


in external conditions (e.g., temperature); the 
response of the system as it relaxes to the new equilib¬ 
rium reveals the nature of the underlying reaction 
kinetics. 

Introduction 

Relaxation methods represent an alternative approach 
to the rapid mixing/flow methods (► continuous flow, 
► stopped flow, and ► quenched flow) for studying the 
dynamics of biological mechanisms. The main reason 
experimentalists turn to relaxation methods is for the 
faster time resolution which can extend the measure¬ 
ments from the ms to the ps time domain. However, the 
approaches have a much wider potential to define 
the dynamics and energetics (► Thermodynamics of 
Biomolecular Interactions) of biological reactions than 
simply the extended time scale. These methods are less 
widely used partly because much of the equipment is 
not commercially available and the studies can appear 
significantly more complex than the more common 
rapid mixing equipment. 

In relaxation methods a biological system of inter¬ 
est (e.g., a protein folding or ligand binding reaction) is 
set up and allowed to establish equilibrium (or a steady 
state). The system is then changed rapidly by altering 
conditions that perturb the equilibrium, for example, 
the temperature, pressure, or other physical parameter, 
and this induces the system to “relax” to new equilib¬ 
rium position. By monitoring the time scale of the 
concentration changes during the relaxation process it 
is possible to define the number and type of reacting 
intermediates, their rates of interconversion, and, from 
the amplitudes, the enthalpy change (for a temperature 
change) or the molar volume change (for a 
pressure perturbation). A typical reaction profile for 
a first-order reaction is shown in Fig. 1. 

Principle of the Approach and Underlying 
Theory 

For a simple first-order reaction 

A^ B (1) 

The equilibrium constant is defined as 
Keq = [B]/[A] =k+i/k_i, and k+i and k_i are the 
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Kinetics: Relaxation Methods, Fig. 1 A typical perturbation 
experiment. Here a step increase in pressure (335 bar) (a) is used 
to induce a perturbation of the equilibrium of a protein folding 
reaction. The step increase of hydrostatic pressure is complete in 


~100 ps and induces a relaxation of the reaction followed using 
tryptophan fluorescence over a time scale of 10 ms (c). Once the 
reaction has relaxed the pressure change is reversed (b) and the 
protein refolds (d) 


forward and reverse rate constants in the direction as 
written. 

Such reactions are accompanied by changes in the 
thermodynamic parameters such as enthalpy (AH°), 
molar volume change (AV°), or, if there is a change in 
the number or distribution of charge, the electric dipole 
moment (AM). The relationship between the enthalpy 
change for the reaction, the temperature, and the equi¬ 
librium constant is given by the van’t Hoff equation: 

AlnK/AT = AH/RT^ (2) 

and for small changes in K (AK < < K) 

AK/K = AHAT/RT^ (3) 

Hence the change in K (AK) is linearly related to 
AH and to AT. 

A similar set of equations describes the relationship 
of InK to pressure changes (AP) and the molar volume 
change for the reaction (AV°): 

AK/K = -AV°.AP/RT (4) 

And the change in AK is linear in AV° and AP. 


The relationship between the perturbation and the 
observed change in concentration of a reactant depends 
upon the value of K, as well as the applied perturbation 
(e.g., AP) and the thermodynamic parameter (AV°). 
This is most easily illustrated by a specific example 
such as that of (1) and is in Fig. 2. 

It is clear the system gives the largest change in 
the concentration of A when the equilibrium constant 
is close to one. The situation is more complex for 
higher order reactions but for a second-order reaction 
in which one reactant is in excess or is buffered 
then for 

E -f L ^ E.L (5) 

where K = [E][L]/[E.L] and [L] » [E]. 

Then the relationship is identical for the change in 
the concentration of E is plotted against K/[L], that is, 
the perturbation is largest when the [L] = K and thus 
[E]/[EL] = 1. The amplitude of the observed reaction 
as a function of [L] thus gives direct information about 
the equilibrium constant. Analysis of the amplitudes 
for more complex reaction schemes requires detailed 
treatment and the reader is referred to more complete 
texts (Bernasconi 1976; Connors 1990; Hiromi 1979); 
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Equilibrium constant (K) 


Kinetics: Relaxation Methods, Fig. 2 The amplitude of the 
relaxation as a function of the equilibrium constant K. The 
observed change in concentration of A in a first order, single 
step reaction A " B for a 10% increase in the equilibrium 
constant K = [B]/[A]. The relationship is identical for a second 
order reaction where E + L ** EL if [L] > > [E] and K = [E][L]/ 
[EL] and K/[L] are plotted against A[E]. Note the change in 
concentration is small ~2% at maximum requiring sensitive 
detection methods 

however, it remains a rule of thumb that the maximum 
amplitude for a perturbation is largest when the 
species being perturbed and observed are at their 
maximum concentrations on both sides of the 
equilibrium. 

Analysis of relaxation times are in principle more 
straightforward than for rapid mixing systems in that if 
the perturbation induced by the change in conditions is 
small, then the rate equations can always be reduced to 
a linear form, regardless of order. Thus even complex 
systems can be dealt with in a relatively straight for¬ 
ward way. 

The principle can be illustrated for the single 
step scheme of (1). If the concentration of A at time t 
is [A]t then the rate of change of [A] with time is 
defined as 

-d[A]/dt = k+i[A],-k_i[B], (6) 

Substituting for 

[A]. = [A],^ + A[A]., [B],= [B],q-A[A], 

and recalling that the equilibrium concentrations by 
definition do not change with time d[A]eq/dt = 0 
then from (6) 


LHS 

-d([A],q+A[A],)/dt 
= -d[A],ydt - d[AA],/dt = -d[AA],/dt 

RHS 

k+i[A],-k_i[B].= ([A^ + A[A],)k+i 
-k-i([B],q-A[A],) 

= k+i[A],q - k_i[B],^ + A[A].(k+i + k_i) 

= — A[A]j(k+i + k_i) 

Hence d[AA](/dt = — A[A]((k+i + k_i) 

On integration A[A]j = A[A]oe^('^+'+'^-dt 

The relaxation time (x) for the one step first-order 
system of (1) is given by 

l/T = k+i+k_i (7) 

By convention, relaxation times (x) are used in these 
perturbation studies, but 1/x is identical in form to the 
observed rate constant kobs more commonly used in 
rapid mixing methods. 

Table 1 lists the definition of the relaxation times for 
common single step systems. 

The relaxation times become more complex for 
multiple step systems. The simplest and most common 
two step systems are given in Table 2 and more com¬ 
prehensive listings are given by Bernasconi (1976). 
The systems are even more complex if the steps are 
not decoupled in time, and of course when consider¬ 
ation is given to the signal being observed, that is, 
whether the signal reports one step or both. 

In principle, any reaction scheme with n 
independent steps will yield relaxations consisting 
of n exponentials [F]j = Fqo X) exp(—t/x^j). How¬ 
ever, what can be observed will depend upon what 
signals are available, the sensitivity of each step to 
the perturbation, and the extent that the exponentials 
are separable in time. 

Data Collection 

A consequence of the perturbations to the equilibrium 
being small is that the relaxation methods require 
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Kinetics: Relaxation Methods, Table 1 Relaxation times for 
one-step systems 


System 

1/t 


B 

k+i -t- k i 



k+i([AL, + [BL,) 

1 -l-k_i 

2A^B 

4k+,[A],^ + k_i 


At- B ^ C + D 

k+i([AL, + [BL,) 

l+k i([CL, + [D]J 

A t- E ^ B -1- E 
Where E is an 
enzyme or catalyst 

(k+,+k_,)[E]e, 



sensitive detection methods, generally much more sen¬ 
sitive than required by rapid mixing methods. How¬ 
ever, a feature of relaxation methods applied at 
equilibrium is that the reactants are not consumed 
during the measurements. Thus provided the reactants 
are stable during the measurement (e.g., no significant 
► photobleaching by the measuring light), a large 
number of repeated measurements can be made on 
a single sample to allow averaging of the transients. 
Remembering that the random noise on a signal 
reduces with the square root of the number of obser¬ 
vation, then a 10 fold reduction in noise will be 
achieved by averaging 100 measurements of the same 
sample. 

Most relaxation methods system are built around 
optical detection, commonly absorbance and ► fluo¬ 
rescence, but many other signals have been used to 
address specific questions, for example, ► circular 
dichroism (Gruenewald and Knoche 1978), fluores¬ 
cence polarization, ► FTIR (Schiewek et al. 2007), 
conductivity (Stanley and Marshall 1994), ► NMR 
(Kitahara et al. 2002), low angle ► X-ray scattering 
(Knight et al. 1993), and light scattering (Geeves 1991; 
Davis 1985). In principle, any method for observing 
the reaction could be used as long as the data can be 
collected on a time scale compatible with the events 
being observed. 

Advantages and Disadvantages of 
Relaxation Methods 

Time scales can be very short psec or even shorter, but 
longer timescales can be limiting for some systems. 
Thus the jump in conditions can be fast, but long-term 
recording requires the conditions to be held constant 


after the jump. This is not always trivial, that is, with¬ 
out special measures temperature jumps tend to revert 
to the original temperature over a time scale of a few 
seconds. However, rapid temperature jumps have been 
applied followed by use of a temperature clamp (e.g., 
using a peltier device) to stabilize the system at the 
post-jump temperature (Ranatunga 1996). 

One of the advantages of relaxation methods is that 
they can access processes not accessible to rapid 
mixing systems. For example, in rapid mixing it is 
difficult to detect a fast event occurring after a slow 
event as the observed rate constant can be entirely 
limited by the slow event. Because perturbations are 
following all reactants at equilibrium, the order of fast 
and slow events is less important (Malnasi-Csizmadia 
et al. 2001). 

Being able to perturb the system in two directions, 
an increase in pressure or temperature and a decrease is 
possible for some perturbation systems and can pro¬ 
vide additional information on the pathways of com¬ 
plex reactions. 

Many perturbation methods can be applied to 
a variety of problems from purified biological mole¬ 
cules to complex systems. The application of 
a pressure or temperature change is not restricted by 
the complex architecture of biological systems 
(Ranatunga 1996). Studies have been reported of pro¬ 
tein polymerisations (e.g., cytoskeleton assembly, 
actin, microtubules filaments, and thick filaments 
(Davis 1985)) multi-protein complexes, micelles 
(Winter and Jeworrek 2009), organelles, and even 
whole cells. 

A disadvantage of the study of complex systems is 
that although relaxations can be observed the interpre¬ 
tation can be complex and large amounts of additional 
information are required to make a definitive assign¬ 
ment of the events observed. However, the great 
advantage of the relaxation system is that if the relax¬ 
ation occurs in the purified system, then the same 
relaxation should be present in a more complex sys¬ 
tem. Thus, if calcium binding to calmodulin occurs in 
solution with a significant AH or AV, then the same 
temperature and pressure dependence should be 
observed in the cell. This provides a way to attempt 
to correlate events in complex environments with 
known reactions. Indeed the amplitude dependence 
means that only reactants with significant occupancy 
will be perturbed in complex systems. Thus a moderate 
pressure change will perturb a protein conformational 
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Kinetics: Relaxation Methods, Table 2 Relaxation times 
relaxation times 

for simple two-step systems where the two steps have different 

System 

l/x short 

1/t long 

A ^ B « C 

k+i -H k i » k+2, k 2 

k+i -f k_i 

K-ik+2/(Ki -f 1) -f k_2 

A-f B « C « D 

(a) step 1 fast, step 2 slow 

(b) step 1 slow, step 2 fast 

k+i([AL, + [BL,)+k_i 

k+2 + k_2 

k+2Ki ([A],^ + [B],J / (Ki ([A],^ + [B],^) + l) + k_2 
k+i ([A]cq + [BJcq) + k ik 2/(k+2 + k 2) 


Kinetics: Relaxation Methods, Table 3 Major perturbation methods that have been used 

Perturbation 

Method 

Step size 

Time scale 

Sample 

volume 

Jumps in both 
directions 

Rapid repeat 
jumps 

notes 

Temperature 

Joule capacitor 
discharge 

1-10°C 

1 |rs-5 s 

50-1,000 pi 

N 

N 

a.b. 


IR Laser 

1-10°C 

10 ns—10 ms 

10-100 pi 

N 

N 

b. 


Microwave 


lps-10 ms 

100 pi 

N 

N 

b. 


Circulators, rapid flow 

>2°C 

>10 ms 

100-1,000 pi 

Y 

Y 


Pressure 

Peizocrystal stack 

Up to 500 bar 

>50 ps 

50 pi 

Y 

Y 



Oil chamber- valve 

Up to a few 
kbar 

>100 ps 

100-1,000 pi 

Y 

N 



Oil chamber - bursting 
disk 

Up to a few 
kbar 

>20 ps 

100-1,000 pi 

N 

N 


Electric field 

Cable discharge 


10 ns-10 ms 

100-1,000 pi 

Y 

Y 



Notes 

a. Requires reasonably high ionic strengths. 

b. Repeat measurements can be slow as temperature re-equilibration can take several seconds. 
Temperature clamps required for time longer than ~1 s 


K 


change at a poised equilibrium position but will not 
perturb protein folding under conditions where the 
protein is predominantly folded. 

Equipment 

An outline of the major methods that have been used is 
given in Table 3 and more details of specific method 
are given here. 

Temperature jumps. Most reactions take place with 
a finite AH° and can therefore be perturbed by temper¬ 
ature. This has been used widely for protein folding 
reactions, ligand binding reactions, and macromole¬ 
cule conformational changes. This is the only system 
which is readily commercially available. 

Pressure jumps require reactions with significant 
volume changes. These tend to be large changes 
in hydration shells round ionized groups 
(HA <-> + A ) or changes in water packing round 

exposed or buried hydrophobic groups. Thus volume 


changes are common for many biological reactions 
such as ionization, protein folding, protein conforma¬ 
tional changes, protein assembly, and for many ligand¬ 
binding reactions. Pressure can also be used to change 
the packing of lipids in membranes. 

Electric jield jumps. For a single step system as in 
(1), if A has a smaller dipole moment than B then the 
application of an electric field can perturb the equilib¬ 
rium. This is most common in reactions where the 
ionization state changes. This approach has been used 
for the conversion of an amino acid into its zwitterionic 
form, helix-coil transitions in macromolecules and 
other macromolecule conformational changes 
(Pdrschke 1982). A similar relationship holds as for 
other perturbations: 

AlnK/A|E| = AM/RT 

where lEI is the electric filed strength and AM is the 
molar change in the macroscopic electric moment or 
the molar polarization. 
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pH jumps linked with either T or P jumps. pH jumps 
can be a useful way to perturb reactions involving 
protons and can be readily achieved by linking tem¬ 
perature of pressure jumps to the appropriate buffer 
system, for example, phosphate buffers are very stable 
to temperature changes but are perturbed significantly 
by pressure. In contrast, buffers such as imidazole or 
Tris-HCl are independent of pressure with very small 
AV but are perturbed by temperature. The pH shift for 
such a buffer is very fast and normally follows closely 
the speed of the pressure or temperature change. In fact 
such buffers are often used to calibrate the speed of the 
perturbation in an optical cell using an appropriate 
absorbing or fluorescent buffer system. 

Sinusoidal and frequency domain perturbations. 
The discussion has been restricted here to step changes 
in the perturbing parameter but the use of sinusoidal or 
other type of perturbation have also been widely used, 
for example, via ultrasonic absorption or dielectric 
relaxation spectroscopy. In this respect, methods such 
as NMR and ESR can also be classified as perturbation 
methods and are dealt with elsewhere. 

Summary 

Relaxation methods represent an important alternative 
to rapid mixing as a means of measuring kinetic data. 
Relaxation methods include the use of rapid jumps or 
sinusoidal perturbations in temperature, pressure, 
electric field strength, or other extrinsic variables. 
The result of these changes is a change in the state of 
the system (the value of one or more equilibrium 
constants) causing a measurable parameter to relax to 
its new equilibrium position. Relaxation methods are 
notable for their ability to access intermediate steps 
and time ranges that are not measurable via rapid 
mixing. Modern equipment also often allows meaning¬ 
ful results to be obtained using relatively small 
quantities of samples and from complex samples 
which are unsuitable for rapid mixing approaches. 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Continuous Flow 

► Fluorescence 


► Law of Mass Action 

► NMR Methods for Kinetic Analysis 

► Order of Reaction 

► Quenched-Flow Methods 

► Single Fluorophores Photobleaching 

► Stopped-Flow Techniques 

► Thermodynamics of Biomolecular Interactions 

► Transient-State Kinetic Methods 

► X-Ray Scattering of Lipid Membranes 
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Introduction 

Data simulation and analysis bridge the gap between 
a mechanistic or kinetic model of a certain reaction and 
the corresponding experimental data. Since the tempo¬ 
ral behavior of most dynamic systems is nonlinear and 
highly complex, it usually cannot be deduced intui¬ 
tively. Although in some cases, kinetic models can be 
simplistic enough to allow for some intuitive insight, in 
most cases, specifically with nonlinear systems, the 
human brain is not adapted to compute the response 
of kinetic systems to perturbations of any kind. These 
perturbations could involve rapid mixing events or 
changes in equilibrium by pressure and temperature 
changes to obtain a response that could be recorded 
in time. Therefore, it is important to have tools 
for “what if’ analysis to get a prediction of the 
expected behavior of systems and ways to analyze 
data ideally without being misguided. The availability 
of immense computing power, even with ordinary 
personal computers in combination with major 
advances in simulation and fitting programs, allows 
for nearly unlimited complexity of analysis. However, 
it is still the quality of the data and good human 
sense that are crucial for scientific success. Therefore, 
it is still important to understand the basic principles 
of data analysis (of course including its limitations) 
and to look under the hood of the computational 
machines. 

The following sections describe in hierarchical, but 
also mostly functional order, the modules that are 
relevant for data simulation and analysis. 


Simulation 

Prior to the actual simulation of kinetic data, a mech¬ 
anistic model or scheme needs to be casted into math¬ 
ematical expressions that allow computing of data 
values. Usually this process generates differential 


equation systems (DES) that implicitly describe the 
kinetic model. Unfortunately, DES do not allow direct 
computing of simulated data. In order to be able to this, 
they need to be solved (integrated) to give so-called 
explicit equations, where the computed value y is 
a direct expression of time (t), the kinetic constants 
(parameters) and the initial concentrations of reactive 
species involved. Three major ways to achieve this are 
the Laplace transformation, the use of integration fac¬ 
tors, and the matrix method (Capellos and Bielski 
1972; Bernasconi 1976; Gutfreund and Ranatunga 
1999; Steinfeld et al. 1989). However, in all three 
cases, the DES can usually only be solved analytically 
if they are first order, that is, there are no terms in 
which a product of two concentrations appear. Conse¬ 
quently, analytical solutions exist for enzyme kinetic 
systems only with very specific boundary conditions. 
These solutions are combinations of exponential equa¬ 
tions or in extreme cases (e.g., steady state) simple 
equations that describe the kinetic behavior. For exam¬ 
ple, the following kinetic scheme that describes the 
reversible association of the species A and B to give 
the complex C is symbolized by the following simple 
notation (Eq. 1). 


K 


A+B^C ( 1 ) 

^'-1 


The corresponding differential equation set (DES) 
then reads (Eq. 2) 


d[A] 

dt 

d[B] 

dt 

d[C] 

dt 


-k+i[A][B] + k-i[C] 
-k+i[A][B]+k_i[C] 
k+i[A][B] - k^i[C] 


( 2 ) 


If the boundary conditions are chosen such that the 
initial concentration of B ([Bq]) by far exceeds that of 
A ([Aq]), that is, [Bq] >> [Aq] and also the initial 
concentration of C ([Cq]) at time zero is nil, that is, 
[Co] = 0, then the reaction becomes pseudo first order 
and [B](r) « [Bq]. This means that the concentration of 
B is approximated as being constant over time. We 
may thus lump the expression k+i[B] into the single 
pseudo constant U+i. The set of differential equations 
then reads (Eq. 3) 
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^ ^ 0 (3) 

dt 

This can be solved analytically to give the [C](f) 
(Eq. 4) with: 

[C](0 ^ ^ (l - (4) 

A solution to the daunting and often impossible task 
to reach analytical solutions occurred with the advance 
of computer power that allows for numerical integra¬ 
tion and thus computation of data values without 
restrictions to specific boundary conditions. Here, 
small intervals in time are considered in which a small 
change in the concentration of each species is calcu¬ 
lated from the differential equations (i.e., assumes AC/ 
Ar = dC/df) and the whole kinetic profile is built up 
from these small linear extrapolations. Popular integra¬ 
tion procedures are derivatives of the so-called Euler 
and more advanced Runge-Kutta method. However, if 
the rate constants involved in the kinetic system differ 
highly in magnitude (which is often the case), ordinary 
Runge-Kutta methods are numerically not stable 
enough and severe rounding errors occur. In this case, 
specialized algorithms that can deal with these so called 
“stiff’ DES need to be used. They were boosted mostly 
by the development of airplane simulation systems and 
an incredibly powerful algorithm still being used today 
was put forward by Gear (197 1). It should also be noted 
that among the first devices to simulate enzyme kinetic 
systems were analogue electrical circuits (e.g., dis¬ 
charge of a capacitor through a resistor) that were able 
to simulate kinetic systems in real time, yet without 
output of discrete numbers. 

Further advances stem from the fact that nowadays 
the user does not need to enter the DES that describe 
a particular kinetic model. Instead they may just enter 
the kinetic model itself and the computer program 
automatically generates the corresponding DES inter¬ 
nally, including the Jacobian Matrix that is normally 
needed for algorithms that compute stiff DES. 

After the model and assumptions about kinetic 
parameters including initial concentrations are 


entered, the program calculates the concentration of 
species as a function of time. These values can then be 
displayed and viewed. Initially one may probe with 
this procedure how the system behaves under certain 
conditions and whether the time traces are somewhat 
close to expected values. An extension to this proce¬ 
dure is to overlay real data with these simulated data to 
test for validity of assumed model and parameter 
values. This “By hand” fitting process can be an impor¬ 
tant step toward advancement of understanding of the 
system under investigation, but will finally be 
substituted by true data fitting where the parameters 
are optimized automatically. It does, however, also 
provide a good start for the actual fitting process and 
can be a very instmctive way of acquiring some intu¬ 
itive knowledge about the behavior of multistep 
mechanisms. 


Data Fitting 

Data fitting involves a series of computational pro¬ 
cesses that are outlined in Fig. 1 and described in the 
following paragraphs. 

Experimental data are obtained with kinetic 
methods such as ► steady-state enzymatic assays, and 
more importantly with transient kinetic methods like 
► stopped-flow, ► quenched-flow and ► relaxation 
methods (pressure or temperature jump) but also spe¬ 
cial ► NMR techniques or even time resolved 
crystallography. 

Simulated data are derived (computed) from 
a model that the researcher suggests as described 
above. The residuals constitute a central part of the 
fitting process since they compute the deviations of 
real and simulated points by simple subtraction. 
It should be noted that visual inspection of these 
residuals by the researcher can be an invaluable 
tool to judge the quality of the actual fit since system¬ 
atic deviations can be immediately recognized. For 
the fitting program, the sum of the squares of all 
residuals (Eq. 5) is calculated and the program 
attempts to minimize it as the target function of 
minimization routines. This gives the name “least 
squares” (LSQ). 

n 

X^ = Y, O^'-eal - (5) 
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It is noteworthy that here the squared difference for 
each data to simulated point can be weighted, for 
example, with known uncertainties or standard devia¬ 
tion of the individual experimental data points should 
they be known. 

The minimization routines are algorithms that intro¬ 
duce changes to the fitted parameters in a more or less 
directed way in order to achieve an improved ht. 
Prominent algorithms are the Levenberg-Marquart 
steepest decent method and Nelder-Mead Simplex, 
but also simulated annealing or evolutionary algo¬ 
rithms (Press et al. 1989; Price et al. 2005). These 
algorithms differ in robustness (the need for good ini¬ 
tial estimates of parameters), convergence speed, and 
also how sensitive they are to being trapped in local 
minima that represent wells of minimal LSQ but not 
the true global attainable minimum. This uncertainty in 
general increases with the complexity of the system 
and number of htted parameters. In general, a good 
practice, therefore, is to start the htting process with 
vastly different initial guesses for the fitted parameters 
to test if the fit always converges to the same minimum. 

A logical extension of data htting is the so-called 
global data ht (Fig. 2). Here instead of one data set, 
multiple data sets are provided from experiments, but 
also simulated from one kinetic model to produce sets 


Other factors that are important for data htting are 
proper scaling of data and introducing constraints for 
the parameter values that put high penalties on mean¬ 
ingless or impossible parameter values (e.g., negative 
rate constants). 

Model Discrimination 

Model discrimination is probably the most ambitious 
but also the most controversial topic to be considered 
since it lies at the heart of data analysis. Which models 
are compatible with the data, which models can be 
excluded? Certainly a kinetic model can never be 
proven. A general approach is to start with simple 
models that could be excluded until more complex 
models that are consistent with the data are reached. 
From models that are compatible with the data, in any 
case the simplest model that is also compatible with 
enzymological and mechanistic knowledge should be 
chosen according to Occam’s razor. It is, however, 
very difficult to put this into numbers. Possibly inspec¬ 
tion of the residuals is still the most powerful method 
to judge the quality of a ht. Likewise, judging how well 
parameters of a ht are determined is not trivial. 
Advanced programs like Dynaht or KinTekExplorer 
(see below) address this problem with sophisticated 
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parameter space searches that are based on Monte 
Carlo simulations (Press et al. 1989; Kuzmic 2009; 
Johnson 2009). Even if the analysis reveals that an 
extra phase is required to model the data, it may not 
have any “true” significance in terms of mechanism 
(e.g., a loss of fluorescence may indicate 
photobleaching rather than a change in concentration 
of a species due to a chemical reaction). Additional 
phases may also arise from heterogeneity of the sample 
and therefore it is important to check if the behavior is 
consistent between different sample preparations. 

Available Programs 

Computer programs that allow for simulation and anal¬ 
ysis of enzyme kinetic systems have come a long way. 
For a long time, the pioneering work of Carl Frieden 
with KinSim and later KinFit (http://biochem.wustl. 
edu/cflab/message.html) set the standard for complex 
systems (Zimmerle and Frieden 1989; Barshop et al. 
1983). Likewise, the program Enzfitter for (simple) 
explicit equations originated about 1986 and helped 
enzyme kineticists to analyze directly steady-state 
kinetics, ligand binding and kinetic systems that 
could be cast into explicit equations representing the 
analytical solution of differential equation systems 
where possible without data transformations. These 
programs were followed by a multitude of advanced 
software suites. Whereas the program Enzfitter from 
Robin Leatherbarrow has a direct in house successor 
(Grafit), KinSim/KinFit was followed by various inde¬ 
pendently developed software suites. 

Large mathematical program packages like Igor Pro 
(http://www.wavemetrics.com/), MatLab (Otto et al. 
2005) (http://www.mathworks.com/), or Mathematica 
(http://www.wolfram.com/) allow for complex analy¬ 
sis also of complex kinetic problems but are expensive 
and more generic, and hence not immediately accessi¬ 
ble (intuitive) to users who enter the field of enzyme 
kinetics. It should also be noted that spreadsheet pro¬ 
grams like Excel (Billo 2007) or Origin can be tailored 
for analysis of kinetic data, but here access is also not 
immediate and their capacities are limited. 

Computer programs that are tailored for simulation 
and analysis of kinetic data mostly involve commercial 
variants like Grafit (http://www.erithacus.com/grafit/), 
GraphPad Prism (http://www.graphpad.com/prism/ 
Prism.htm) for analytical equations or MicroMath 


(http://www.micromath.com/, discontinued) Berkeley 
Madonna by Robert Macey, George Oster and 
Tim Zahnley (http://www.berkeleymadonna.com/), 
KinTek Explorer (http://www.kintek-corp.com/ 
KGExplorer/index.php) (Johnson 2009; Johnson et al. 
2009a, b) if the system is too complex to be solved 
analytically and requires numerical integration. 

Notably at last two highly flexible and powerful 
software suites are free for academic use. The pro¬ 
grams Dynafit (http://www.biokin.com/dynafit/) by 
Petr Kuzmic (Kuzmic et al. 2009; Kuzmic 1996, 
2009) and Copasi by Pedro Mendes and Ursula 
Kummer (Hoops et al. 2006) (http://www.copasi.org/ 
tiki-index.php) provide exquisite and powerful simu¬ 
lation and data analysis capacities. It should also be 
mentioned that a free software exists to simulate single 
molecule or low ensemble number events from known 
kinetic (ensemble) parameters (Chemical Kinetics 
Simulation, CKS by IBM) (http://www.almaden.ibm. 
com/st/computational_science/ck/?cks). 

An additional source of software for analysis 
of kinetic data are the programs supplied by companies 
that supply instruments for transient kinetic measure¬ 
ments, e.g., TgK Scientific (http://www.hi-techsci. 
com/), Applied Photophysics (http://www. 
photophysics.com/), KinTek Corp. (http://www. 
kintek-corp.com/index.php), BioLogic (http://www. 
bio-logic.info/) and Olis (http://www.olisweb.com/). 

Summary 

Data simulation and fitting even of highly complex 
systems is now in reach for all scientists with the 
currently available software and computing power. 
However, still or even more so, many pitfalls exist 
that have to be avoided. Scientists who engage in 
complex simulations and data analysis should always 
be critical with the outcome and experiments still need 
to be planned carefully. Specifically systems should 
not be over fitted, and automation of data analysis 
should not be confused with the problem being solved. 
A still valid warning can be found in the classic text¬ 
book from William P. Jencks - “do not accept 
a solution that you do not (intuitively) understand” 
(Jencks 1987). However, it is indeed fascinating what 
possibilities now exist to deal with kinetic data, the 
limit is certainly set by the scientists, not the comput¬ 
ing power any more. 
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Synonyms 

Single-molecule methods 

Introduction 

Although single molecules have been imaged by elec¬ 
tron microscopy for many years, their detection by 
optical methods since the 1990s has allowed their 
kinetic, as well as structural, properties to be investi¬ 
gated (Weiss 2000). This advance has given new 
insights into mechanism by revealing details that 
are masked by traditional ensemble measurements. 
Furthermore, there is no need to initiate the reaction 
by mixing or perturbing the environment because an 
individual molecule can be observed as it spontaneously 
switches between states under equilibrium or ► steady- 
state conditions (Stone et al. 2007). This approach also 
has good dynamic range; for example, if a molecule 
exists in a particular state for just 1 % of its time, on 
average, then it may readily be observed at the single¬ 
molecule level but masked at the ensemble level because 
of the contribution of 99% of molecules in other states. 
Flowever, technical challenges remain because single- 
molecule measurements usually have low signal- 
to-noise ratios because of low photon counts or thermal 
noise. Furthermore, when a rare state is observed at the 
single-molecule level, it has to be established this 
reflects a natural state of the molecule of interest and 
not a contaminant, nor a molecule that has been modified 
(e.g., by immobilization), nor an intrinsic property of the 
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Time (s) 


Kinetics: Single-Molecule Techniques, Fig. 1 A simulated 
single-molecule record of a reaction A " B with a forward rate 
constant of 1 s“* and a reverse rate constant of 10 s“*. The 
stochastic jumps between the A and B states given an average 
lifetime of the A state of 0.74 s (1 s expected) and for the B state 
of 0.05 s (expected 0.1 s). The ratio of these times reflects the 
equilibrium constant of 0.068 (0.1 expected). The deviations 
from the expected values arise from the low number (13) of 
events that were counted. The reaction would need to be moni¬ 
tored for many hundreds of seconds to give an average value that 
accurately reflects the ensemble behavior 

probe (e.g., blinking of a fluorophore (Bagshaw and 
Cherny 2006)). Single-molecule measurements are 
dominated by stochastic (chance) events and therefore 
many hundreds to thousands of measurements are 
required to determine the average behavior. Rate con¬ 
stants for reactions become probabilities. For example, 
a rate constant of 1 s“* corresponds to a 50% chance of 
a molecule reacting within 0.69 s and the average life¬ 
time of the preceding state is 1 s (Fig. 1). 

Technical Considerations 

Single molecules have been monitored through their 
mechanical properties (using optical and ► magnetic 
tweezers or variations of ► atomic force microscopy) 
or optical methods, of which ► fluorescence is the most 
widely used detection method. Mechanical methods are 
appropriate to study mechanoproteins (e.g., molecular 
motors), nucleic acids, and other elongated polymers, as 
well as ► protein folding mechanisms. These techniques 
are covered elsewhere in this volume. This article 
focuses on the use of fluorescence methods for detection 
of transitions between states at the single-molecule 
level. A typical fluorophore with a high quantum yield 
and good photostability may emit, on average, around 
10-10^ photons before it photobleaches. After capture 


of this emission using a high-numerical-aperture micro¬ 
scope objective lens and filtering to remove the excita¬ 
tion light and Raman scattering, about 1-10% of these 
photons may arrive at the detector and generate a signal. 
Given many detectors have single photon sensitivity and 
high quantum yields, the detection of light from a single 
fluorophore is not particularly challenging (a dark- 
adapted human eye is sufficient). The key to observing 
single molecules is to reduce the background light so 
that the single fluorophore generates sufficient contrast. 
A good analogy here is the need for night skies away 
from human-generated light sources for a clear obser¬ 
vation of stars. Conventional epifluorescence micros¬ 
copy is capable of detecting single molecules in 
favorable cases, but it is better that the background 
from out-of-focus fluorophores is minimized by using 
confocal, multiphoton, near-field, or ► total internal 
reflection fluorescence microscopy. Even though 
10"^-10® photons may be detected before a fluorophore 
is bleached, when this signal is used to probe kinetics 
there is a problem. One hundred time bins may on 
average contain 100-1000 photons, but the former can 
only be counted to an accuracy of 10% (i.e., standard 
deviation = Vn). For this reason single-molecule fluo¬ 
rescence kinetic measurements are generally “photon- 
starved” and noisy. Invariably, a compromise must be 
reached between detection precision and time resolu¬ 
tion. Furthermore, a significant number of events will be 
terminated by photobleaching rather than the chemistry 
of interest (e.g., product dissociation) and it may not be 
possible to distinguish these events. In these circum¬ 
stances, the illumination intensity has to be tuned to 
the event of interest such that, on average, around 10% 
of the events are terminated by photobleaching, as deter¬ 
mined in a control system where there is no accompa¬ 
nying chemistry, leaving 90% of the events to report on 
the reaction itself Another way of overcoming this 
limitation is to design assays in which the fluorescence 
probe is on the substrate or product molecule such that 
new molecules will replace photobleached ones during 
the assay and many enzyme cycles can be monitored 
from a single enzyme molecule. This approach is 
exploited in ► single-molecule sequencing of DNA. 

Methodology 

Confocal and multiphoton microscopy methods 
achieve their low background characteristics by 
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selectively illuminating fluorophores at the focal point 
of the excitation light. They can be used in a non¬ 
scanning mode to excite single fluorophores present 
in dilute solution (Gell et al. 2006). It follows from 
Avagadro’s number (6 x 10^^ molecules/1) that 
a focused laser beam with a 1 femtoliter (10~*^ 1) 
volume (typical for a high-numerical-aperture objec¬ 
tive lens), illuminating a 1 nM (10~^ M) solution, will 
excite 0.6 molecules on average, that is, the occupancy 
distribution will be about 55% no molecules, 33% one 
molecule, 10% two molecules. A freely diffusing mac¬ 
romolecule may spend about 1 ms in the beam. Fluc¬ 
tuations in the emitted intensity due to this diffusion 
provide a measure of the diffusion coefficient (see 

► Fluorescence Correlation Spectroscopy). If the mol¬ 
ecule undergoes a change in state on a shorter time- 
scale than this, there will be a higher frequency 
component in the intensity fluctuations which will 
report on the kinetics of the interconversion. While 
the fluorescence emission event itself usually occurs 
on the nanosecond timescale, the limit on detecting fast 
processes is usually set by the fluorophore entering 
a non-emitting triplet state which may last for micro¬ 
seconds. Distinguishing chemical events from 
photophysical ones on the microsecond to millisecond 
timescale is quite challenging. If the kinetics of inter¬ 
conversion between molecular states takes longer than 
about 10 ms, then only a single state will be observed 
during a molecule’s transit through the beam. How¬ 
ever, individual molecules will be in different states, 
thus allowing the equilibrium distribution of states to 
be assessed by monitoring a population of molecules 
for several minutes, but one molecule at a time. This 
approach gives added resolution over ensemble 
methods with bulk detection as, for example, in 

► single-molecule fluorescence (Forster) resonance 
energy transfer (smFRET) measurements. By simulta¬ 
neous measurement of the donor and acceptor 
emission at the single-molecule level, which should 
be anti-correlated because of FRET, populations of 
molecules containing donor fluorophore or acceptor 
fluorophore alone can be resolved from those 
molecules of interest which contain the FRET pair 
(Selvin and Ha 2008). 

Confocal and multiphoton microscopy methods 
may also be used to image multiple single molecules 
by scanning the specimen; however this does limit the 
time resolution (often to seconds or more). Further¬ 
more, the molecules need to be immobilized so that 


they do not diffuse out of the field of observation. In 
these cases it is often better to use an inherently wide- 
field technique such as ► total internal reflection fluo¬ 
rescence microscopy (Funatsu et al. 1995; Selvin and 
Ha 2008). This technique relies on the thin excitation 
volume of an evanescent field to limit excitation to 
a depth of around 150 nm from the reflecting surface 
(typically the glassXwater interface of the coverslip or 
slide) to minimize the signal from out-of-focus 
fluorophores. With immobilized molecules, the time 
of observation is limited by photobleaching, which in 
turn depends on the laser power. For real-time contin¬ 
uous illumination and favorable fluorophores, this 
period may last lO-lOOs, which may be sufficient 
to capture the events of interest. Frequently, buffer 
conditions are chosen for reactions to occur on this 
timescale for convenience. For longer timescales inter¬ 
mittent illumination is required (Elenko et al. 2009). 

► Quantum dots are much longer-lived than organic 
fluorophores but may be bulkier and often suffer from 
blinking which confounds the interpretation of kinetic 
events. The recent description of suppression of 
blinking in quantum dots is a significant advance for 
these applications (Hamada et al. 2010). Once blinking 
is controlled, then fluctuations in emission intensity 
may reflect transitions between different chemical 
states of the molecule of interest, rather than the 
probe. In favorable cases, the switching between 
events is clear and abrupt and the time spent in each 
state can be measured directly from the trace (cf. 
Eig. 1). These times will be stochastically distributed 
around some mean lifetime for the state, which, in 
simple cases, will reflect the mean lifetime determined 
by ► transient-state methods (see ► Kinetics: Over¬ 
view). In the case of multiple states or distributed 
intensities, the transitions may not be so clear, but 
automated methods of finding transitions using 

► hidden Markov modeling can be adopted. Alterna¬ 
tively, autocorrelation analysis of a trace will reveal 
the timescales of fluctuations and yield mean transition 
times (Fig. 2). 


Single-Molecule Enzymology 

The catalytic activity of enzymes can be measured at 
the single-molecule level by examining the production 
of individual product molecules or by observing the 
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Kinetics: Single-Molecule Techniques, Fig. 2 Fluctuations 
in emission intensity from a single immobilized yellow fluores¬ 
cent protein (YFP) molecule due to slow proton-coupled chlo¬ 
ride binding that leads to a weakly fluorescent YFPH.Cl state, (a) 
The intensity trace indicates that the bright anionic YFP^state 
lasts for about 0.1-0.5 s but it would be difficult to measure the 
period directly for the individual transitions (cf. Fig. 1). (b) The 
autocorrelation function derived from this record shows an 


exponential decay with a time constant of 0.27 s. The 
corresponding rate constant (3.7 s“’) represents the sum of the 
forward and reverse rate constants for the YFP^** YFPH.Cl 
interconversion, but the forward rate constant has a large contri¬ 
bution from photobleaching due to the high laser power used 
(2.5 kW/cm^). See McAnaney for experimental details 
(McAnaney et al. 2005) 


individual enzyme molecules in a substrate-bound 
state (Chen et al. 2009). 

Myosin II ATPase 

The muscle protein, myosin, shows broad specihcity 
allowing the use of fluorescent ATP analogues as 
substrates to follow single turnovers (see ► Stopped- 
Flow Techniques section Fig. 4). It also has a low kcat 
(around 0.05 s~*) so that individual turnovers last for 


tens of seconds which can be recorded in real time with 
a standard video-rate camera. A single-molecule assay 
was developed in which myosin molecules were 
immobilized on a coverslip and a wide-fleld image 
recorded using ► total internal reflection fluorescence 
microscopy (Funatsu et al. 1995). Using a Cy3-ATP 
derivative, individual substrate molecules were 
imaged while bound to the myosin molecules during 
the turnover reaction. Free Cy3-ATP and the product 
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Kinetics: Single-Molecule Techniques, Fig. 3 Single- 
turnover assay of Cy3-EDA-ATP by skeletal muscle myosin 
subfragment 1. (a) TIRF image of single Cy3-EDA-ATP mole¬ 
cules bound to myosin subfragment 1. Each spot remained in 
a bright state for a variable period ranging from a few to many 
seconds, (b) The distribution of the lifetime of the bright spots as 
imaged in (a). Note that nearly half the spots lasted for up to 16 s, 
and about one quarter for 32 s. Fitting the distribution to a single 



Time (s) 


exponential yielded a half time of 14 s, shorter than that of an 
ensemble single-turnover measurement using a stopped-flow 
device (27 s; ► Stopped-Flow Techniques; Fig. 4), at least in 
part due to a contribution of photobleaching to the single¬ 
molecule measurement. Experiments performed in collaboration 
with Paul B. Conihear (Conibear et al. 1998) - see supplemen¬ 
tary online video file 


Cy3-ADP diffused sufficiendy rapidly so that they 
were not imaged individually, but gave rise to a weak 
background signal. The ► total internal reflection fluo¬ 
rescence detection method helped to minimize this 
signal by only detecting molecules within about 
150 nm of the surface. Thus the activity of individual 
myosin molecules was revealed by spots which fluctu¬ 
ated between dark (substrate-free) and bright (sub¬ 
strate/product-bound) states. The life times of the 
states were distributed exponentially (Fig. 3). The onset 
of a dark state could represent the release of Cy3-ADP 
product (the release of bound Cy3-ATP is very slow in 
myosins), or the photobleaching of the bound Cy3 dye. 
Control experiments conducted at different laser inten¬ 
sities allowed the photobleaching contribution to be 
assessed, and the mean time for Cy3-ATP turnover 
could be calculated. 

A further complication in this assay was that some 
myosin molecules showed rapid ► blinking, possible 
due to the fluorophore being transiently quenched by 
the glass surface (Bagshaw and Conibear 2000). Other 
dyes such as Cy5 suffer more problems with blinking, 
probably as a result of an inherent photochromic effect. 
Here, the excited state of the fluorophore relaxes back 
to give another photoisomer that no longer absorbs 
efficiently at the initial excitation wavelength and 


Rabbit SI * Cy3-ATP (i£ e inteival] 


Kinetics: Single-Molecule Techniques, Video. 1 Single- 
turnover assay of Cy3-EDA-ATP by skeletal muscle myosin 
subfragment 1 


hence the molecule remains in a dark state for many 
seconds. Excitation at other wavelengths can often 
reactivate such molecules so they return to their initial 
photocycle (Bagshaw and Cherny 2006). The problem 
of ► blinking means that it can be difficult to measure 
the true time of association of the substrate with the 
enzyme. On the other hand, reversible photobleaching 
can be expoited to give baseline signals in 
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► single-molecule FRET analysis and also find 
use in high-resolution imaging such as STORM 
(► Stochastic Optical Reconstruction Microscopy). 

p-Galactosidase 

The enzyme p-galactosidase catalyzes the hydrolysis 
of lactose but accepts artificial chromophoric ana¬ 
logues such as resofurin (3-galactose. This substrate 
has low fluorescence but the product resofurin is highly 
fluorescent and is readily detectable in a confocal 
microscope at the single-molecule level. Individual 
P-galactosidase molecules were immobilized on 
0.5 pm beads to aid imaging, and then the fluorescence 
emission was monitored at high speed (0.5 ms inter¬ 
vals) to count photons from individual product 
resofurin molecules, which diffused away from the 
enzyme within one or two frames (English et al. 
2006). The interval between photon bursts therefore 
represents the turnover and waiting times between 
catalytic cycles. At low substrate concentrations 
(<Kn,), the waiting time dominated and events were 
found to be randomly distributed in time with an over¬ 
all mono-exponential profile. On the other hand, at 
high substrate concentrations, the distribution of 
events was multi-exponential and events were not ran¬ 
domly distributed, that is, runs of short or long turnover 
times were clustered. These data indicate that the 
activity of individual P-galactosidase molecules fluc¬ 
tuated between periods of high and low values (English 
et al. 2006). However, when the population-averaged 
behavior was calculated, it showed an apparently sim¬ 
ple Michaelis-Menten-type dependence on substrate 
concentration, as observed in practice. In principle, 
heterogeneity in turnover times should be revealed by 
transient kinetic methods, but the dynamic range of the 
latter may not be sufficient to pick up small deviations 
from a single exponential function or small subpopu¬ 
lations of molecules. More importantly, heterogeneity 
in kcat measured by an ensemble single-turnover assay 
could not distinguish, in itself, between static and 
dynamic heterogeneity. In the single-molecule assay, 
individual enzyme molecules were observed to 
undergo periods of short or long turnover intervals 
and they switched between these phases on 1 ms to 
10 s timescale showing that the heterogeneity is 
dynamic (English et al. 2006). This characteristic has 
been termed a “memory effect” in that any particular 
turnover event is positively correlated with preceding 
turnover events (i.e., a fast turnover event is more 


likely to be followed by another fast event than 
a slow one and vice versa). Dynamic heterogeneity is 
likely to be a common feature of enzymes as many 
protein molecules are only marginally stable and will 
spontaneously interconvert between native and par¬ 
tially unfolded states. In the case of the 
single-molecule p-galactosidase assay, it is possible 
that further heterogeneity was introduced on immobi¬ 
lization such that the enzyme occasionally adhered to 
the surface by nonspecific interactions as well as 
through the engineered biotinylated tag. 
However, heterogeneity has also been seen in experi¬ 
ments with freely diffusible enzymes whose motion is 
restricted within femtoliter chambers (Chen et al. 
2009). 

Not all interactions reveal such complex behavior. 
GTP binding to and release from RNA aptamers occurs 
on the minutes timescale, but histograms of the life¬ 
times on the bound and free states follow a single 
exponential distribution that is consistent with ensem¬ 
ble measurements (Elenko et al. 2009). smERET 
methods have been widely used to study the kinetics 
of protein-nucleic acid interactions, as exemplified by 
the assembly of a telomerase ribonucleoprotein com¬ 
plex (Stone et al. 2007). 

Summary 

Measuring kinetics at the single-molecule level can 
reveal unique features of a mechanism that are masked 
by traditional ensemble methods. However, this 
characteristic also means that it is difficult to obtain 
independent evidence to support or refute novel con¬ 
clusions. At a minimum, the mean behavior of single 
molecules should be compatible with the ensemble 
data. Careful controls are required to ensure the signal 
measured corresponds to the proposed event. However 
this is a relatively new field which has much potential. 

Cross-References 

► Atomic Eorce Microscopy 

► Electron Microscopy 

► Fluorescence 

► Fluorescence Correlation Spectroscopy 

► Hidden Markov Modeling in Single-Molecule 
Biophysics 
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► Kinetics: Overview 

► Magnetic Tweezers 

► Optical Tweezers 

► Protein Folding 

► Quantum Effects/Tunneling 

► Single Fluorophore Blinking 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Single-Molecule Sequencing 

► Steady-State Enzyme Kinetics 

► Stochastic Optical Reconstruction Microscopy 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 

► Transient-State Kinetic Methods 


References 

Bagshaw CR, Cherny D. Blinking fluorophores: what do they tell 
us about protein dynamics? Biochem Soc Trans. 2006;34: 
979-82. 

Bagshaw CR, Conibear PB. Single molecule enzymology: crit¬ 
ical aspects exemplified by myosin ATPase activity. Single 
Molecules. 2000;1:269-275. 

Chen Q, Groote R, Schonherr H, Vancso GJ. Probing single 
enzyme kinetics in real-time. Chem Soc Rev. 2009;38: 
2671-83. 

Conibear PB, Kuhlman PA, Bagshaw CR. Measurement of 
ATPase activities of myosin at the level of tracks and single 
molecules. Adv Exp Med Biol. 1998;453:15-26. 

Elenko MP, Szostak JW, van Oijen AM. Single-molecule imag¬ 
ing of an in vitro-evolved RNA aptamer reveals homoge¬ 
neous ligand binding kinetics. J Am Chem Soc. 2009;131: 
9866-7. 

English BP, Min W, van Oijen AM, Lee KT, Luo G, Sun H, 
Cherayil BJ, Kou SC, Xie XS. Ever-fluctuating single 
enzyme molecules: Michaelis-Menten equation revisited. 
Nat Chem Biol. 2006;2:87-94. 

Funatsu T, Harada Y, Tokunaga M, Saito K, Yanagida T. Imag¬ 
ing of single fluorescent molecules and individual ATP turn¬ 
overs by single myosin molecules in aqueous solution. 
Nature. 1995;374:555-9. 


Gell C, Brockwell D, Smith A. Handbook of single molecule 
fluorescence spectroscopy. Oxford: Oxford University Press; 
2006. 

Hamada M, Nakanishi S, Itoh T, Ishikawa M, Biju V. Blinking 
suppression in CdSe/ZnS single quantum dots by Ti02 
nanoparticles. ACS Nano. 2010;4:4445-54. 

McAnaney TB, Zeng W, Doe CF, Bhanji N, Wakelin S, 
Pearson DS, Abbyad P, Shi X, Boxer SG, Bagshaw CR. 
Protonation, photobleaching, and photoactivation of yellow 
fluorescent protein (YEP 10 C): a unifying mechanism. 
Biochemistry. 2005;44:5510-24. 

Selvin PR, Ha T, editors. Single-molecule techniques: 
a laboratory manual. New York: Cold Spring Harbor Labo¬ 
ratory Press; 2008. 

Stone MD, Mihalusova M, O’Connor CM, Prathapam R, Collins 
K, Zhuang X. Stepwise protein-mediated RNA folding 
directs assembly of telomerase ribonucleoprotein. Nature. 
2007:446:458-61. 

Weiss S. Measuring conformational dynamics of biomolecules 
by single molecule fluorescence spectroscopy. Nat Struct 
Biol. 2000;7:724-9. 


Kratky-Porod Worm-Like Chain Model 

► Worm-Like Chain (WLC) Model 


KSI 

► Ketosteroid Isomerase - Computational Studies 


Kv Channel-Interacting Protein 

► N-Type Inactivation in Voltage-Gated Potassium 
Channels 








L 


Lac Carrier Protein 

► H'^-Lactose Membrane Transport Protein, LacY 


Lactate Dehydrogenase 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 


Lactate Dehydrogenase - Computational 
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Synonyms 

LDH 

Definition 

An enzyme that catalyzes the interconversion of pyru¬ 
vate and lactate. 


Basic Characteristics 

Several classes of lactate dehydrogenases exist, depen¬ 
dent on whether NAD(P)H or cytochrome c is used as 
cofactor and whether the enzyme acts on o-lactate 
or L-lactate. The most widely studied class is the 
NAD(P)H dependent enzyme that catalyzes conver¬ 
sion of pyruvate (the final product of glycolysis) into 
lactate in animal tissues when oxygen is limited, and 
the reverse reaction in the liver (in the so-called 
Cori cycle, or lactic acid cycle). Computer molecular 
simulation and modeling studies of the reaction have 
been used to investigate and illustrate general catalytic 
principles (Yadav et al. 1991; Wilkie and Williams 
1992). LDH has also been used as a test system in 
development of new methods for modeling enzyme 
reactions (such as QM/MM transition state optimiza¬ 
tion (Turner et al. 1999)). Ferrer et al. calculated 
two-dimensional potential energy profiles at the 
AM1/CHARMM22 level, with energy corrections at 
the MP2 ab initio level (Ferrer et al. 2005): They 
modeled the hydride transfer from the dihydronico¬ 
tinamide ring of NADH to the carhonyl C atom of 
pyruvate, as well as the proton transfer to the carhonyl 
O atom from a protonated catalytic histidine. As well 
as pointing out the importance of considering multiple 
protonation states of active site residues in QM/MM 
modeling, they established that the reaction probably 
occurs through an asynchronous concerted reaction 
pathway in which hydrogen transfer precedes proton 
transfer. This picture may change, however, when 
active site residues are mutated: They also modeled 
the effects of mutations (Ferrer et al. 2008). 


G.C.K. Roberts (ed.). Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
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► QM/MM Methods 


Laser Processing of Biomaterials and 
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List of Abbreviations 


Alq3 

tris(8-hydroxyquinolinato) 

aluminium 

ArF 

argon fluoride 

DNA 

deoxyribonucleic acid 

IR 

infrared 

KrF 

krypton fluoride 

MEH-PPV 

poly [2-methoxy-5-(2 ’ - 

ethylhexyloxy)-p-phenylene 

vinylene] 

PEG 

poly(ethylene glycol) 

PGA 

poly(glycolic acid) 

PLA 

poly(lactic acid) 

PLGA 

poly(lactic-co-glycolic acid) 

PMMA 

poly(methyl methacrylate) 

RNA 

ribonucleic acid 

Mn 12-acetate 

Mnl20i2(CH3C00)16 
(H20)4] • 2 CH 3 COOH 4 H 2 O 

NPP 

N-(4-nitrophenyl)-(l)-prolinol 

UV 

ultraviolet 


Synonyms 

_ Fabrication of biomaterials; Laser capture microdis- 

Laser Capture Microdissection Laser-based materials engineering of 

medically relevant materials 

► Laser Processing of Biomaterials and Cells 


Definition 


Laser Doppler Electrophoresis (LDE) 

► Electrophoretic Light Scattering 


Laser processing of biomaterials is the application of 
laser-based techniques for fabrication and/or analysis 
of medically-relevant materials and biological 
systems. 
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Introduction 

Light amplification by stimulated emission of radiation 
(laser) has been used for materials processing since the 
early 1960s. Lasers are employed for a variety of 
materials processing techniques, including coating, 
machining, polymerizing, melting, sintering, and 
welding (Narayan and Goering 2011). Through these 
unique capabilities, lasers may be used to form intri¬ 
cate geometries, alter material properties, and surface 
engineer complex materials. Laser-based surface engi¬ 
neering involves melting, heating, or modifying the 
surface of a given material; it may be used to prepare 
surfaces with micro- and nanoscale features (Prodanov 
et al. 2011). Lasers are highly advantageous instru¬ 
ments for materials engineering as they may be used 
for processing of a broad range of materials, including 
metals, ceramics, polymers, and biomaterials. Since 
these processes do not involve laser-material contact, 
contamination and substrate damage are minimized 
(Stratakis et al. 2009). 

Due to the unique capabilities offered by laser 
processing, laser-based methods have been widely 
used in clinical medicine for diagnosis and treatment 
of diseases as well as for fabrication of medical 
devices. The earliest efforts utilizing laser-material 
interactions for development of biomedical devices 
involved laser welding of dental biomaterials. 
For example, lasers have been used to join together 
components of dental appliances such as dental 
brackets, crown units, and partial dental prostheses 
(Narayan and Goering 201 1). Since this early research 
involving laser-based biomaterials, laser technology 
has rapidly evolved. Novel fabrication methods have 
been introduced, which provide a variety of laser 
processing capabilities. These laser-based techniques 
have become widely used for biomaterials fabrication 
as well as for analytical evaluation of biological struc¬ 
tures and processes. 

In this entry, three laser-based techniques for 
processing of biomaterials are described, including 
matrix-assisted pulsed laser evaporation, laser capture 
microdissection, and optical tweezers. Each of these 
processes provides unique capabilities for manipula¬ 
tion of biomaterials and tissues. The advantages and 
limitations of these processes are discussed as well as 
future prospects. 


Matrix-Assisted Pulsed Laser Evaporation 

Matrix-assisted pulsed laser evaporation (MAPLE) is 
a physical vapor deposition process that was created in 
the late 1990s at the US Naval Research Laboratory 
(Pique 2011). This process was originally developed to 
overcome the decomposition of polymeric target mate¬ 
rials associated with traditional pulsed laser deposition 
(PLD) (Caricato and Luches 2011). In MAPLE, the 
material to be deposited is diluted in a highly volatile 
solvent to form a matrix, which is flash frozen in liquid 
nitrogen. The frozen matrix is placed in a vacuum 
chamber and is used as a target. When the frozen 
matrix is irradiated with a pulsed laser beam, the vol¬ 
atile solvent absorbs the laser energy. This energy is 
converted into thermal energy, causing the solvent to 
vaporize. Kinetic energy from the collisions between 
the solvent and solute molecules causes the solute 
material to transfer into the gaseous phase. When 
both material species are expelled from the target, the 
volatile solvent is pumped away and the solute mate¬ 
rial is transferred onto the substrate, forming a thin film 
(Caricato and Luches 2011; Patz et al. 2007). 
A schematic of the MAPLE process is shown in Eig. 1. 

The primary goals of MAPLE are to preserve the 
structural and functional integrity of the materials, to 
control the thickness and uniformity of the film, to 
obtain desirable surface features, and to obtain ade¬ 
quate adhesion between the substrate and the deposited 
film (Caricato and Luches 2011). Control over the 
MAPLE processing parameters permits excellent con¬ 
trol over multiple film characteristics, including thick¬ 
ness, surface morphology, and homogeneity. 
An excimer laser (KrE; 248 nm or ArF: 193 nm) with 
a pulse width of 10-30 ns and a repetition rate between 
1 and 10 Hz is generally utilized for MAPLE 
processing. The laser is typically focused to a 
1-10 mm^ area, providing a laser fluence of 
0.01-1.0 ijcm. MAPLE is capable of depositing uni¬ 
form thin films over a broad range of thicknesses 
(10 nm to 1 pm) and over areas greater than 1 cm^ 
(Pique 2011). Operating conditions are dependent on 
the solute and solvent materials; numerous processing 
parameters may be optimized to obtain the desired 
thin film properties. Studies have demonstrated that 
the material properties of the deposited film are 
strongly dependent on matrix composition (e.g., solute 
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Laser Processing of Biomaterials and Ceils, Fig. 1 Schematic of the matrix assisted pulsed laser evaporation (MAPLE) process 
(With permission from Elsevier (Patz et al. 2007)) 


concentration and solvent properties), laser wave¬ 
length, laser energy density, laser pulse duration, and 
laser firing rate (Pique 2011). 

MAPLE is a versatile technique which presents 
numerous advantages over conventional physical or 
chemical vapor deposition methods. In traditional 
PLD, polymeric and organic materials may be dam¬ 
aged due to photochemical and/or photothermal pro¬ 
cesses; the resulting thin film may exhibit properties 
different than those of the starting material (e.g., lower 
molecular weight) (Pique 2011). By operating at rela¬ 
tively low fluences, MAPLE provides an appropriate 
process for deposition of polymer, organic material, or 
biomaterial thin films. Several other physical and 
chemical vapor deposition techniques decompose the 
target material and therefore cannot be used for such 
materials (Caricato and Luches 2011). Various types 
of polymers, organic substances, and biomaterials 
have been successfully deposited using MAPLE. For 
example, films containing biologically-relevant poly¬ 
mers (PEG, PGA, PLA, PLGA, PMMA, chitosan), 
electrically conductive polymers (polypyrrole), elec¬ 
troluminescent and photoluminescent polymers 
(MEH-PPV), as well as functionalized copolymers 
have been deposited using MAPLE (Pique 2011; Patz 
et al. 2007). In addition, films containing organic mol¬ 
ecules such as carbohydrates (glucose, sucrose, dextran, 
pullulan), organic magnetic materials (Mnl2-acetate), 
and other organic materials (NPP polypyrrole and Alq3) 
have been grown using MAPLE (Caricato and 


Luches 2011; Pique 2011). Films containing biomole¬ 
cules and active proteins have also been deposited using 
MAPLE; films containing horseradish peroxidase, col¬ 
lagen, bovine serum albumin, phospholipids, fibrino¬ 
gen, and mussel adhesive proteins have been grown 
using this process (Caricato and Luches 2011; Patz 
et al. 2007; Pique 2011). 

MAPLE has recently been applied for fabrication of 
nanoparticle/polymer film composites in which 
a colloidal nanoparticle solution is used as the target. 
MAPLE-deposited nanoparticle thin films provide bet¬ 
ter control over the film thickness and uniformity than 
those deposited by spin coating or drop casting. For 
example, tin dioxide and titanium dioxide nanoparti¬ 
cle/polymer film composites have been produced using 
MAPLE; it is expected that nanomaterials fabrication 
will become a more common application of MAPLE 
technology in the future (Caricato and Luches 201 1). 

Although conventional MAPLE provides several 
advantages over other thin film fabrication methods, 
there are a few limitations. The low concentration of 
solute molecules in the matrix solution results in a low 
deposition rate (Pique 2011). In addition, some solvent 
materials, such as chloroform, have been shown to 
release highly reactive Cl radicals, which may cause 
chemical degradation of the deposited films (Caricato 
and Luches 2011; Pique 2011). The most significant 
shortcoming of MAPLE technology is the difficulty in 
controlling surface morphology. While optimization of 
processing parameters has been shown to produce very 
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smooth and uniform coatings for some materials, many 
films exhibit rough surfaces with well-dehned aggre¬ 
gates or clusters (Pique 2011). 

Since the introduction of MAPLE, several varia¬ 
tions of the technology have been developed in order 
to reduce the limitations associated with conventional 
MAPLE technology. Resonant infrared MAPLE (RIR- 
MAPLE) is one variation that has shown great promise 
in providing increased deposition rates and decreased 
surface roughness values. RIR-MAPLE utilizes an IR 
laser rather than a UV laser to vaporize the target 
matrix. RIR-MAPLE may be carried out at larger 
laser fluences (3-5 J/cm^) and with higher solute 
matrix concentrations (15-20%) than traditional 
UV-MAPLE, enabling higher deposition rates (Pique 
2011). Off-axis MAPLE is another variation, which 
places the substrate parallel to the deposition axis rather 
than perpendicular to the deposition axis. This process 
has been shown to eliminate the presence of droplets 
that are associated with the aggregates in conventional 
MAPLE-deposited films (Pique 2011). A laser process 
known as MAPLE- direct write has the ability to 
fabricate free-standing patterns and structures of living 
cells. For example, Harris et al. created microscale 
patterns of MG 63 osteoblast-like cells, hydroxyapatite, 
an extracellular matrix using MAPLE-direct write 
(Harris et al. 2008). The cells within these patterns 
were viable and proliferated after MAPLE-direct write 
processing. MAPLE-direct write may be used for 
patterning biomaterials for a variety of applications, 
including use in drug delivery devices, artihcial, medi¬ 
cal implants, wound-healing adhesives, as well as 
chemical and biological sensors (Caricato and Luches 
2011; Riggs et al. 2011). The progression of MAPLE 
and its variants has enabled further control over thin 
film attributes and has become a standard technique for 
laser processing of polymeric, organic, and biological 
materials. It is expected that MAPLE will become 
a valuable tool for fabricating thin films of biological 
materials. 

Laser Capture Microdissection 

Laser capture microdissection (LCM), shown in Fig. 2, 
has become an essential instrument in the preparation 
of homogeneous cell populations for advanced cellular 
and molecular analysis of tissue microenvironments. 
LCM provides a fast, reliable method for capturing and 


isolating specihc cells from heterogeneous tissues 
while maintaining the molecular attributes of the 
cells (Espina et al. 2007). This unique preservation of 
the tissue environment is accomplished by combining 
direct microscopic imaging of the cells and activation 
of a thermoplastic membrane using pulsed laser 
energy. 

During LCM, a slide of sectioned tissue is placed 
onto a microscopic stage, where the cells of interest are 
identified. Near infrared (IR) laser pulses are then 
precisely applied to activate a thermoplastic polymer 
film adjacent to the tissue. The activated him embeds 
the cells of interest that are being targeted (Simone 
et al. 1998). The resulting polymer-cell composite is 
removed from the heterogeneous tissue, capturing only 
the targeted cells and leaving the remaining tissue 
intact (Espina et al. 2007). In many systems, the poly¬ 
mer him is on the bottom surface of a plastic cap, 
which is directly transferred to an extraction vial 
(Simone et al. 1998). The specimen may then be 
coupled with cell, DNA, RNA, or protein analyses. 

The LCM process is compatible with a variety of 
tissue types, tissue preservation methods, and cellular 
staining techniques. LCM has been applied to numer¬ 
ous sample types, including formalin-hxed or paraffin- 
embedded tissue sections, hash frozen samples, cell 
smears, and chromosome separations (Ladanyi et al. 

2006) . Standard staining protocols have been utilized 
for enhanced identihcation of cells during microdis¬ 
section. For example, immuno-LCM applies immuno- 
histochemical staining for more accurate identihcation 
of cells. This technique permits cells that exhibit dif¬ 
ferent immunologic phenotypes to be distinguished 
despite morphological similarities (Espina et al. 

2007) . Once the cells of interest have been targeted, 
the laser pulse may be applied. LCM uses short laser 
pulse durations, low laser power values, and long 
delays between laser pulses to reduce heat deposition 
at the tissue surface (Espina et al. 2007). This approach 
enables isolation of subpopulations of cells and indi¬ 
vidual cells without photochemical damage or degra¬ 
dation. By protecting the microenvironment of the 
tissue, LCM provides specimens that closely mimic 
the in vivo interactions of the native tissue (Espina 
et al. 2007). 

Preservation of the tissue specimen permits cou¬ 
pling of the samples with multiplex analysis of cellular 
and molecular interactions. Sensitive amplification 
methods based on polymerase chain reaction (PCR), 
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Laser Processing of 
Biomaterials and Cells, 
Fig. 2 Laser-capture 
microdissection (LCM) 
system. Through the use of 
a laser beam, tissue is 
embedded in a thermal 
polymer. The tissue is easily 
accessible to molecular 
analysis of any kind (With 
permission from Elsevier 
(Simone et al. 1998)) 
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reverse transcriptase PCR, or enzymatic function are 
used to analyze the tissues (Emmert-Buck et al. 1996). 
By contributing to the understanding and identification 
of genes, proteins, and cellular pathways associated 
with complex tissues, LCM may be used for various 
applications, including discovery of drug targets; anal¬ 
ysis of tumor microenvironment interactions; profiling 
of signaling pathways; real-time PCR; proteomic and 
genomic molecular profiling; forensic analysis; plant 
and cell biology analysis; biomarker discovery; and 
identifying molecular signatures of specific patholo¬ 
gies (Espina et al. 2007). 

LCM has significantly progressed since its commer¬ 
cial introduction by researchers at the National Insti¬ 
tute of Health in 1997 (Ladanyi et al. 2006). The 
primary limitation of LCM lies in the need for visual 
inspection of the cells. This step requires an individual 
trained in cell identification and can be time consum¬ 
ing. Cell recognition software and automated cell 
microdissection systems have gained significant atten¬ 
tion as approaches for overcoming the manual aspects 
of LCM technologies. Novel processes have been 
developed such as manual and automated platforms; 
in addition, systems containing combined IR- and UV- 
lasers may be used to provide additional UV laser 
cutting and laser catapulting features (Espina et al. 
2007). Future LCM technologies will likely permit 


microdissection of subcellular structures, such as 
organelles, and microdissection of viable cells. 
Advancements in tissue preservation and staining will 
contribute to improved downstream analysis of LCM 
specimens. 

Optical Tweezers 

Optical tweezers have been applied as quantitative 
tools to exert calibrated forces on molecules and struc¬ 
tures of interest as well as to measure forces and 
displacements generated by these systems. 
Nonbiological applications such as microrheology, 
particle sorting, microfabrication, colloidal hydrody¬ 
namics, and nonequilibrium thermodynamics have 
been investigated using optical tweezers (Moffitt 
et al. 2008); it should be noted that optical tweezers 
are most commonly applied for investigations of bio¬ 
logical systems. Optical tweezers have been used to 
manipulate individual cells, cellular systems, and bio¬ 
logical macromolecules, providing valuable insight 
into the mechanical properties of biological polymers 
and cells, the deformability of cellular vesicles and 
cells, and the dynamics of molecular motors (Moffitt 
et al. 2008; Zhang and Liu 2008). Investigations 
using optical tweezers have been invaluable in 
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comprehending the basic mechanisms of complex bio¬ 
logical structures and processes. 

Optical tweezers employ a highly focused laser 
beam to produce a field gradient trap for physically 
holding and manipulating microscopic particles. The 
laser beam is tightly focused using a high-numerical 
aperture microscope objective to form a stable optical 
trap (Moffitt et al. 2008). In conventional optical twee¬ 
zers, a laser beam that exhibits a Gaussian prohle is 
utilized (Zhang and Liu 2008). When the laser beam 
interacts with the particle, the particle experiences two 
kinds of forces: scattering forces and gradient forces. 
The scattering forces push the particle along the direc¬ 
tion of propagation of the laser and the gradient forces 
pull the particle along the spatial gradient of light 
intensity from the laser profile. The gradient force is 
a restoring force, which pulls the particle to the center 
of the beam. A balance of these forces will hold 
a particle at the focal point in a stable manner, forming 
an optical trap (Moffitt et al. 2008; Zhang and Liu 
2008). The two main trapping geometries include sin¬ 
gle-beam traps and dual-beam traps. In single-beam 
traps, only one single tightly focused laser beam is 
applied. Dual-beam traps enable trapping via two 
counterpropagating beams. Trapped particles may be 
translated in all directions by moving the laser or 
changing the focus of the laser (Nilsson et al. 2009) 

The force of the optical trap applied to the particle is 
proportional to the displacement of the particle from 
the center of the trap. The force generated by the 
optical trap (F) may be expressed as F = kx, where 
k is the trap stiffness and x is the distance between the 
center of the particle and the center of the trap (Collard 
et al. 2007). It is difficult to directly measure the 
trapping forces applied in optical tweezers; however, 
numerous methods have been developed for calibrated 
measurement of the displacement and the force. Cali¬ 
bration generally involves measuring the response of 
the trapped particle to a known force, such as a viscous 
force (Collard et al. 2007). Furthermore, the Brownian 
motion of the trapped particle may be detected to 
experimentally determine the trap stiffness (Zhang 
and Liu 2008). The trap stiffness may vary depending 
on the power of the laser source, the aperture of the 
microscope objective, and the size of the particle. The 
trap stiffness and the measured displacement of the 
particle with respect to the focal point of the laser 
may then be used to calculate the applied force. With 
standard commercial optical tweezer instruments. 


forces of up to 100 pN may be generated (Collard 
et al. 2007). The force exerted on the trapped particle 
is limited by the size and refractive index of the particle 
(Dienerowitz et al. 2008). 

In many optical tweezer applications, the magni¬ 
tude of the optical forces generated is inadequate to 
trap the molecules or particles of interest; therefore, 
dielectric particles, such as micrometer-sized polysty¬ 
rene beads, are applied as handles or probes (Moffitt 
et al. 2008). These beads may be biochemically linked 
with molecules of interest to facilitate characterization. 
To exert the necessary forces on these molecules for 
analytical evaluation, the other end of the molecule is 
typically bound. This end may be attached to the surface 
of the sample chamber or another bead may be linked 
and held via another optical trap or with a micropipette 
(Collard et al. 2007). The beads are then manipulated 
using the optical tweezers, which indirectly control the 
molecules of interest. As shown in Fig. 3, a common use 
of this technology for studying the deformation of 
red blood cells (Li and Liu 2008). By applying these 
concepts, optical tweezers have become an essential 
tool for examining the chemical, physical, and mechan¬ 
ical properties of molecular systems. 

Optical tweezers are a noncontact, low perturbation 
technology with high spatial and temporal resolution 
that enables researchers to quantitatively study the 
molecular kinetics of various molecules and structures 
(Long et al. 2011). Since the initial report of optic traps 
by Ashkin in 1970, there have been many major 
advances in the resolution and accuracy of optical 
tweezer technology (Moffitt et al. 2008). The spatial 
resolution has been reduced to the Angstrom length 
scale while the temporal resolution has been reduced to 
the microsecond time scale. Despite these achieve¬ 
ments, experimental noise remains a problem and 
becomes more limiting at longer time scales (>1 s) 
(Moffitt et al. 2008). Other significant limitations of 
optical tweezers include concerns over photodamage 
to biological samples and low throughput (Zhang and 
Liu 2008). Nonconventional optical tweezer proce¬ 
dures have been developed using non-Gaussian laser 
beams (e.g., Bessel mode or Laguerre-Gaussian 
mode), dual beams, and multiple optical traps (Zhang 
and Liu 2008). One popular method for producing 
multiple optical traps involves the use of holographic 
optical tweezers. Holographic optical tweezers utilize 
spatial light modulators to produce precise patterns 
of multiple optical traps; this approach provides 
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Laser Processing of 
Biomaterials and Cells, 

Fig. 3 Schematic drawings 
and microscopic images for 
the deformation of silica-bead- 
attached RBC stretehed by 
laser tweezers (Li and Liu 
2008) 
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versatility in terms of the size, geometry, three-dimen¬ 
sional orientation, and number of optical traps that are 
produced (Mofhtt et al. 2008). Holographic optical 
tweezer technology is expected to have signihcant 
biomedical applications as it may be used to spatially 
organize one or multiple cell types in a well-defined 
pattern; potential applications include use in tissue 
engineering and regenerative medicine (DaneshPanah 
et al. 2010). Several recent research efforts have 
involved expanding the capabilities of traditional opti¬ 
cal tweezers by identifying methods to control rotation 
and to measure torque (Mofhtt et al. 2008). In addition 
to the advancements in conventional optical tweezer 
technology and the development of multi-trap pro¬ 
cesses, novel hybrid instruments have been developed. 
These systems combine optical tweezers with other 
techniques such as rapid detection and position 
devices, Raman spectroscopy, confocal microscopy, 
and rapid imaging techniques (Zhang and Liu 2008). 
Optical tweezers have also been used in microfluidic 
platforms to analyze the response of cells to other cells 
and/or biomaterials. It is anticipated that further 
advancements in hybrid devices will expand the capa¬ 
bilities of optical tweezers and will permit manipula¬ 
tion of more complex biological systems. 

Summary 

Laser processing of biomaterials and cells has enabled 
fabrication of complex materials for biomedical 


applications. The extension of laser-based processes 
for polymeric, organic, and biological materials has 
facilitated use in tissue engineering, medical implant 
design, as well as biochemical and cellular analysis. 
MAPLE has demonstrated numerous advantages as 
a thin film deposition technique for complex materials. 
LCM and optical tweezers have become commonly 
used analytical tools. LCM has significantly contrib¬ 
uted to our understanding and identification of genes, 
proteins, and cellular pathways associated with tissues. 
Optical tweezers have also become important instru¬ 
ments for manipulation and quantitative analysis of 
complex biological systems. These laser-based tech¬ 
niques are relatively new and have rapidly evolved in 
recent years. It is expected that these laser processing 
methods will continue to be instrumental for future 
biomaterials fabrication as well as for cellular as well 
as biomolecular evaluation. 
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Law of Mass Action 

Clive R. Bagshaw 

Department of Biochemistry, University of Leicester, 
Leicester, UK 

Department of Chemistry and Biochemistry, 
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CA, USA 

Synonyms 

Chemical kinetics; Mass action 

Definition 

The Law of Mass Action is the foundation of chemical 
kinetics and states that the rate of an elementary reac¬ 
tion (i.e., one that effectively occurs as a single step) is 
proportional to the product of the concentration of the 
reactants. For a unimolecular reaction (i.e., one involv¬ 
ing a single species) the reaction rate increases linearly 
with the concentration of the reactant because each 
reactant molecule has the same probability of reacting 
within a fixed time interval. Therefore, the observed 
rate for the population of reactant molecules will be 
proportional to the number of molecules in the reaction 
vessel. For a bimolecular reaction, the rate is propor¬ 
tional to the product of the concentrations of the two 
reactants because the collision frequency increases 
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with concentration. The Law of Mass Action was orig¬ 
inally developed to describe an equilibrium system, 
A B C -I- D, where the quotient [A].[B]/[C].[D] is 
a constant, Kgq, but this reflects the balance of the rates 
in the forward and backward direction. In physical 
chemistry a distinction is often made between concen¬ 
tration and activity owing to nonideal behavior. How¬ 
ever, in biological contexts this difference is usually 
ignored as the discrepancy is within experimental error 
and reactants are usually at very low concentrations 
relative to the solvent molecules. However in an intra¬ 
cellular context, effective concentrations may be mod¬ 
ulated by “molecular crowding” which limits the 
diffusible space available to reactants. 

Cross-References 

► Kinetics: Overview 

LC-NMR 

► Flow NMR 

LC-NMR/MS 

► Flow NMR 


LDH 
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Lectins 

Albert M. Wu 

Glyco-Immunochemistry Research Lab, Institute of 
Molecular and Cellular Biology, College of Medicine, 
Chang-Gung University Kwei-San, Tao-Yuan, Taiwan 

Synonyms 

Adhesin; Agglutinin; Carbohydrate binding proteins; 
Oligosaccharide-protein interactions 


Definition 

Lectins are carbohydrate binding proteins or glycopro¬ 
teins, differing from (1) antibodies, (2) enzymes 
acting on glycans/glyco-ligands, and (3) sugar sensor/ 
transport proteins. The term “lectin” was coined by 
W. C. Boyd in 1954. Lectins are also termed: hemag¬ 
glutinins; adhesins; galectins; siglecs, .. .etc. The first 
alternative name was originated from the ability of 
proteins or glycoproteins to agglutinate the blood 
group ABO (H) glycotopes (epitopes) and sialic acid 
on red blood cells. On cell surfaces, they mediate cell¬ 
cell interactions by combining with complementary 
carbohydrates on opposing cells. Most lectin reagents 
were obtained from plants in the early 1970s and 
animal soluble tissue lectins became a wide attention 
in the life sciences in 1980s. The animal lectins were 
classified in to C-type lectins (selectins), S-type lectins 
(galectins) and other folds based on their amino acid 
sequences or Carbohydrate-Recognition Domain 
(CRD). Then, I-type lectins (siglecs) and more lectins 
were defined subsequently as detailed below. 


Basic Characteristics 

Lectins are found ubiquitously in plants, animals, and 
microbes. They play key roles in the control of various 
physiological and pathological processes in living 
organisms, such as fertilization, embryogenesis, cell 
migration, organ formation, inflammation, immune 
defense, and microbial infection (Sharon and Lis 
1989; Barondes 2003; Gabius 2009; Wu et al. 2009). 
Before the roles of carbohydrate-protein interactions 
had been explored, many plant lectins were identified, 
characterized and applied to glycobiology for 
the detection, isolation, and characterization of 
glycoconjugates and the examination of cell surface 
transitions during some living processes including cell 
differentiation to cancer as a crucial (Sharon and Lis 
2003). These plant lectins made valuable contributions 
to modern glycomics (Kamerling et al. 2007). 

The functions of the lectins vary from each other 
and defects in them are linked to many diseases 
(Gabius 2009; Varki et al. 2009). Studies on patho¬ 
genic lectin-glycan interactions, especially the viral 
agglutinins, can provide scientists clues toward thera¬ 
peutic strategies to prevent the infectious processes. 
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Lectins, Table 1 

Expression of binding properties of Gal/GalNAc reactive lectins by decreasing order of carbohydrate structural 

units 



Codes” 

Lectins 

Carbohydrate specificity” 

F/A 

Dolichos hiflorus (DBA) 

Fpenta > Ap” > A > Tn » P 


Helix pomatia (FIPA) 

Fpenta > A (>A|,“) > Tn, T » P 

A 

Soybean (SBA, Glycine max) 

A (>Ah“), Tn and I/II 

Tn 

Vida villosa B 4 (VVA-B 4 ) 

Two Tn ^ one Tn S> one or two T 


Salvia sclarea (SSA) 

Two Tn > single or three sequential Tn structures 

T 

Peanut (PNA, Arachis hypogaea) 

T » I/II » Tn 


Amaranthus caudatus (ACL) 

T » Tn » I/II 

T/Tn 

Agaricus hisporus (ABA) 

T„ and Tn > I 2 > GalNAc S> II, L 


Madura pomifera (MPA) 

T > Tn > > I/II and L 


Artocarpus integrifolia 

T„ > P„ > T, Tn, II > I » T3 

T/II 

Ricinus communis toxin (ricin, RCA 2 ) 

T > I/II and Tn 


Ahrus precatorius (APA) 

T > I/II > E > B > Tn 

I/II 

Ricinus communis (RCAi) 

Tri-II > II > I > E, B > T 


Etythrina cristagalli (coral tree, EGA) 

Tri-II > II > L, I 

6 

Grijfonia(Bandeiraea) simplicifolia-B^, (GSI-B 4 ) 

B > E > A 

E 

Ahrus precatorius toxin-a (Abrin-a) 

E, B > T, L, I/II 


Mistletoe lectin-1 (ML-1) 

E > II, L > T and I 


“The symbols in bold indicate the human blood group activity and/or lectin determinants,which are: F(GalNAcal^3GalNAc); 
Fpe„ta(GalNAcal^3GalNAcpi^3Galal^4Gaipi^4Glc); A(GalNAcal^3Gal); Ah(GalNAcal^3[LFucal^2]Gal); 
Tn(GalNAc—►Ser/Thr of protein core); T(Gaipi—>3GalNAc); T„(Gaipi—>3GalNAcal—>Ser/Thr of protein core); 
Tp(Gaipi^3GalNAcpi^.. .ceramide); I(Gaipi^3GlcNAc); II(Gaipi^4GlcNAc); Tri-II(Triantennary Gaipi^4GlcNAc); 
B(Galal^3Gal); E(Galal^4Gal); L(Gaipi^4Glc); P(GalNAcpi^3Gal); P„(GalNAcpi^3Galal^) 

'’Substitution of Lpucal—>2 to subterminal Gal is important for binding 
“Substitution of uFucal—>2 to subterminal Gal blocks binding 

C-type lectins expressed by dendritic cells have many 
functions - The £)endritic-Cell-5pecific /CAM-3 
Grabbing Aon-integrin (DC-SIGN) binds to HIV-1 
gpl20 in the viral attachments; Siglecs are distinct 
lectins with the immunoglobulin-fold that specifically 
bind to sialic acids. Their importance for disease 
has been revealed and deserves further exploration 
(Figdor et al. 2002). 

To provide a more valid and satisfactory depiction 
of the carbohydrate specificity of lectins in order to 
elucidate their functional roles and to optimize their 
biomedical applications, the following criteria should 
be taken into consideration: (1) the affinity for each 
monosaccharide and its derivatives; (2) expression of 
reactivities toward mammalian disaccharide structural 
units or their derivatives and finding the most 
active ligand; (3) simple oligovalent or cluster effect 
(e.g., multi-antennary oligosaccharides or Tn glyco- 
peptides); and (4) complex multivalent or cluster 
effects present in macromolecules with multiple 
known glycotopes and organized in microdomains on 
cell surfaces (Gabius et al. 2011). Example of item 


(2) is Table 1, in which the lectins and their carbohy¬ 
drate binding specificities are expressed by decreasing 
order of mammalian structural units. Most lectins are 
apparently specific for a monosaccharide but react with 
various oligosaccharide chains terminating with or 
containing this sugar with different affinities because 
the types of linkage and underlying sugar residues 
(glycotopes) also play significant roles in binding. 
For many lectins, e.g., Agaricus hispoms agglutinin 
(ABA), polyvalency of weak glycotopes (Galpl ^ 
3GalNAc<, (Ta) and Galpl ^ 3/4GlcNAcp (Ip/Hp)) 
are essential for lectin-glycan recognition and interac¬ 
tion. This phenomenon has a great biological impor¬ 
tance because polyvalent interactions between lectins 
and ligands on the cell surfaces are presumed to play 
critical roles in a wide variety of biochemical recogni¬ 
tion processes (Brewer et al. 2002; Wu 2011). Since 
the effect of polyvalency can be used to design poly¬ 
valent drugs with stronger therapeutic efficacy, it 
becomes pivotal to understand the polyvalent effects 
while establishing the binding profile of a lectin 
(Vance et al. 2008). 
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► FucP, the L-Fucose-H"^ Membrane Transport 
Protein and Related Transporters 
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Linear Dichroism 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 


Synonyms 

Differential absorption of linearly polarized light - 
anisotropic absorbance spectroscopy 


Definition 

Linear dichroism (LD) is the difference in absorption, 
A, of light linearly polarized parallel (//) and perpen¬ 
dicular (_L) to an orientation axis (Michl and Thulstrup 
1986; Rodger and Norden 1997; Norden et al. 2010). 

LD=A//-Ai (1) 

LD is used with systems that are either intrinsically 
oriented or are oriented during the experiment. The 
signal that is measured contains information about 
the orientation of subunits of the system relative to 
the orientation axis. 


The reduced linear dichroism of an electronic tran¬ 
sition that makes an angle a with the molecular orien¬ 
tation axis, z, (Fig. 1) is given by the equation (Norden 
1978; Norden et al. 1991, 2010) 

LD"^ =-= -5(3cos^a — l) (2) 

A/,yo 2 

where A,^o is the isotropic absorbance, S is referred to 
as the orientation parameter (5=1 for a perfectly 
oriented sample and 0 for an unoriented one). The 
molecular orientation axis, z, is the most oriented 
direction of the molecule. If Z is the laboratory orien¬ 
tation axis it is the parallel direction of Eq. 1 . z = Z 
when 5=1. Strictly, Eq. 2 holds only for uniaxial 
orientation, but in practice most molecules and orien¬ 
tation methods approximate to this case. 

A number of samples have the molecular orienta¬ 
tion axis perpendicular to the long axis of the sample. 
These include carbon nanotubes (Rajendra et al. 2004), 
flow distorted liposomes (Ardhammar et al. 1998; 
Rodger et al. 2002), and peptidoglycan sacculi. In 
these cases, there is an extra element of rotational 
averaging resulting in (Ardhammar et al. 1998; Rodger 
et al. 2002) the following equation: 

LD" = ^ = ^(1-3cos2^) (3) 

^iso ^ 

where /( is the angle between the liposome (or other 
structure) normal to the surface and the transition 
moment polarization as illustrated in Fig. 2. 

Basic Characteristics 

LD is related to the more commonly used technique of 
circular dichroism (CD), in that both require the dif¬ 
ference between the absorbances of different polarized 
light beams to be measured and CD spectropo- 
larimeters can be adapted to produce the required 
alternating beams of polarized light for LD. 

The LD experiment can be understood in the fol¬ 
lowing way. If one places one polarizer, such as 
a Polaroid film, on top of another and rotates it in 
front of a light source, the light coming through the 
polarizers goes from maximum to minimum every 90° 
of rotation. This is an LD experiment if one considers 
one polarizer to be the sample. In other words, the 
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Linear Dichroism, 

Fig. 1 The orientations of an 
electric dipole transition 
moment, ^ (whose length/ 
magnitude is ;i). The 
magnitude of the projection of 
onto the z axis is /i cosa and 
onto the xly plane is ji sina. 
The y component of this 
projection is /i sina siny. For 
a perfectly oriented molecular 
system Z = z and 5=1 


zj! 



X, Membrane normal 



Linear Dichroism, Fig. 2 Schematic diagram of the geometric 
paramaters relevant for liposomes distorted in shear flow 


oriented sample in an LD experiment has the proper¬ 
ties of a wavelength-dependent polarizer. Mains’ law 
(Kahr and Claborn 2008) states that the intensity of the 
light between crossed polarizers scales as the cosine 
square of the angle between their polarization direc¬ 
tions - hence the form of Eq. 1 . If the polarizers are 
perfect, no light would pass at the 90° angle between 
the polarizers, while maximum light intensity would 
be let through when they are set in parallel orientation. 
In practice, in an LD experiment, while we hope that 
the polarizer used in the manufacture of our spectrom¬ 
eter is perfect, the second “polarizer” - the sample - 
only absorbs a fraction of the incident photons and is 


seldom perfectly oriented, so even at wavelengths 
where it absorbs and thus operates as a polarizer, it is 
not a perfect polarizer. 

The main practical differences between CD and 
LD (apart from the polarizations of light used) are 
that LD signals tend to be orders of magnitude larger 
than CD signals, so the data are easier to collect, 
however, LD measurements are performed on systems 
that are either intrinsically oriented or are oriented 
during the experiment, so the samples are harder to 
prepare. 

Interpretations of LD data are based on Eqs. 1 and 2 
above. Erom Eq. I, if a = 0° (i.e., the transition of 
interest is oriented parallel to the orientation direction) 
then LD > 0; conversely, if a = 90° (i.e., the transition 
of interest is oriented perpendicular to the orientation 
direction) then LD < 0; and LD = 0 when a = 54.7°, 
the so-called magic angle. Orientations intermediate 
between these limits require Eq. 1 to be solved. With 
biomacromolecules, we often know the details of the 
spectroscopy of the chromophores (such as nucleic 
acid bases or amino acids) so the LD immediately 
gives us qualitative orientation information. 

Quantitative analysis of LD spectra requires knowl¬ 
edge of the orientation parameter S. In some situations 
it is relatively straightforward to determine S. One can 
often proceed by assuming that the largest positive 
peak is pure z-polarized (with a = 0) or the largest 
negative peak is pure xly polarized (with a. = 90°). Eor 
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example, if the 278 nm peak of tetracene in a stretched 
film has LD = 0.174 and = 0.13 from Eq. 1, we 
may deduce 


S = 



0.174 
3 X 0.13 


0.45 


(4) 


Thus, the orientation process for this long molecule 
is very efficient. S for flow-oriented DNA systems is 
typically 0.05-0.1 in magnitude. 


to single walled carbon nanotubes. J. Am. Chem. Soc. 
2004;126:11182-8. 

Rodger A, Norden B. Circular dichroism and linear dichroism. 

Oxford: Oxford University Press; 1997. 

Rodger A, Rajendra J, Mairington R, Ardhammar M, Norden B, 
Hirst JD, Gilbert ATB, Daffom TR, Halsall DJ, Woolhead CA, 
Robinson C, Pinheiro TJ, Kazlauskaite J, Seymour M, Perez N, 
Hannon MJ. Flow oriented linear dichroism to probe protein 
orientation in membrane environments. Phys. Chem. Chem. 
Phys. 2002;4:4051-7. 
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Linear Dichroism Spectra - Measurement 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Definition 

Linear dichroism (LD) is the difference in absorption 
of light polarized parallel and perpendicular to an 
orientation axis. The component parts of an LD exper¬ 
iment are: 

A source of linearly polarized light 
A means of detecting how much light is absorbed 
A way to change the relative orientations of sample 
and light beam polarization 
A method of orienting the sample 

Basic Characteristics 

This section outlines what is needed for collecting 
LD data. 

The Instrumentation 

There are two main methods for measuring LD spectra. 
The one requiring less specialized equipment is the 
two-spectra method. In this case, one inserts 
a polarizer into a normal absorbance spectrometer, 
measures a spectrum, and then rotates either the 
polarizer or the sample. If the sample is rotated, care 
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must be taken to have the same light intensity through 
the same part of the sample for both polarizations. If 
the polarizer is rotated care must be taken to ensure the 
same light intensity is present in both beams. 

The second method, the differential method, where 
the instrument produces alternating beams of the two 
polarizations, is easier for the user to implement and 
a wide range of sample orientation techniques may be 
used. However, much more sophisticated instrumenta¬ 
tion is required with the differential method. The dif¬ 
ferential method usually involves using a circular 
dichroism spectropolarimeter either with a quarter- 
wave plate to convert the circularly polarized light to 
linearly polarized light or alternatively a doubling of the 
voltage across the photoelastic modulator that produces 
the required oscillating linear polarizations directly. 
Both options for collecting LD data provide the 
required polarized light beams and detector (see defini¬ 
tion above). The differential method also automatically 
deals with the need to change the relative orientation of 
sample and light beam - as long as one knows which 
direction the instrument defines as parallel. 

Sample Orientation for LD Spectroscopy 

Once the instrumentation has been established, the 
continuing challenge for the user of LD spectroscopy 
is to produce macroscopically well-oriented samples 
appropriate for determining the information required. 
Which orientation method one should use depends 
on the sample and the question being asked. For 
example, long and relatively rigid polymers, such as 
double-stranded DNA or RNA, or molecular assem¬ 
blies of micrometer dimension, may be oriented in 
a fluid by shear flow whereas small molecules require 
a stronger orienting force. Some molecules which can¬ 
not themselves be oriented may be oriented when they 
hind to another molecule that is oriented. The rest of 
this entry summarizes various methods that may be 
implemented. More details of the underlying princi¬ 
ples of molecular alignment and its significance for LD 
are given references (Norden 1978; Norden and Seth 
1985; Norden et al. 1991, 2010; Rodger 1993; Rodger 
and Norden 1997; Rodger et al. 2006). 

Stretched Polymers as Matrices in Which to 
Orientate Small Molecules 

Small molecules can often be absorbed into polymer 
films. In 1934, A. Jablonski (1934) was the first to 
propose a method for orienting molecules hy adsorption 



Linear Dichroism Spectra - Measurement, Fig. 1 An LD 

spectrum of tetracene collected by the differential method. The 
base line was the stretched PE film to which dichloromethane had 
been added drop-wise and allowed to evaporate between addi¬ 
tions. The sample was dissolved in dichloromethane and added 
drop-wise to a 5x stretched polyethylene film. Note in this case 
that the short-axis polarized transition has a positive LD signal in 
its shorter wavelength vibronic components due to coupling with 
the much stronger, higher-energy long-axis polarized transition 

in anisotropic matrices. When the film is mechanically 
stretched, either before or after the small molecules are 
added, the analytes may align their long axes preferen¬ 
tially along the stretch direction. For a molecule to he 
aligned in a stretched polymer film it must be either an 
integral part of the film material or associated suffi¬ 
ciently strongly with the polymer chains that when the 
film is stretched the molecule follows the alignment of 
the polymer. The best baseline for such an experiment 
is either that collected for a piece of the same film that 
has been stretched but does not contain the molecule of 
interest, or the same film before adding the analyte or 
after washing it out. A typical film LD spectrum for 
tetracene is illustrated in Fig. 1. The two most popular 
films for LD are polyethylene for nonpolar molecules 
and polyvinyl alcohol for polar molecules. Polyvinyl 
chloride is also valuable in some applications. A film 
stretcher is illustrated in Fig. 2. Details of how to 
prepare films may be found in (Rodger and Norden 
1997; Ismail et al. 1998; Norden et al. 2010). 

Flow Orientation of Macromolecules 

If a rigid or semi-flexible polymer, such as DNA, is 
dissolved in a solvent, such as water, and then flowed 
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Linear Dichroism Spectra - Measurement, Fig. 2 A 

mechanical film stretcher showing oppositely threaded screws 
to ensure even stretching of the film 


past a stationary surface at about 1 m/s, then the mole¬ 
cules experience sufficient shear forces to give a net 
orientation of the long axis of the polymer along the 
flow direction. If light is incident on the sample perpen¬ 
dicular to the flow direction, then the absorbance paral¬ 
lel to the flow. An, and the absorbance perpendicular to 
the flow direction, , are different so an LD signal may 
be measured. If the cell components are quartz then data 
in the visible and UV regions may be collected. Other 
geometries for a flow LD experiment are also possible 
including measuring LD at ±45° to the flow direction 
(Rizzo and Schellman 1981). 

One method to produce the required flow rate is to 
use a linear flow-through system such as provided by an 
HPLC pump or a pair of syringes. However, such an 
open system has an inherently large sample requirement 
and tends not to be completely stable. Wada (1964, 
1972) solved these problems with the invention 
of a Couette flow cell for LD where the sample is end¬ 
lessly flowed between two cylinders one of which rotates 
and one of which is stationary. This is schematically 
illustrated in Fig. 3 and some cells are illustrated in Fig. 4. 

With a Couette flow system a number of possible 
baselines may be used. The simplest option is to stop the 
flow and measure the spectram. The validity of such 
a baseline relies on the non-rotating spectrum being 
independent of the position of the rotor. A baseline of 
the same cell rotating slowly, causing no measurable 
orientation of the sample, may be used if the motor is 
stable at low rotation speeds and the scan speed is slow 
compared with the rotation speed. Alternatively, 
a spectrum of only the solvent/buffer with higher rota¬ 
tion speed may be measured as the baseline. 

It is relatively easy to understand how a polymer 
such as DNA (Rodger et al. 2002) or a protein fiber 


such as tubulin (Marrington et al. 2006) can be oriented 
in flow. More surprising is the fact that spherical mem¬ 
brane bilayers called liposomes can be oriented. In this 
case the orientation arises because the forces of the flow 
distort the liposome shape as schematically illustrated 
in Fig. 5. This means that membrane peptides and pro¬ 
teins can also be studied by LD (Rodger et al. 2002). 

One of the key experimental parameters for flow 
LD is how fast the Couette cell spins. For an outer 
rotating Couette cell this is directly related to shear 
rate, G, by 


G/s-^ 


dv^ 

dx 


ItiRoQ. 
60(R„ - Ri) 


( 1 ) 


where the rate of rotation Q is in revolutions per minute 
(rpm), Rg is the outer cylinder radius, and R, is the inner 
cylinder radius. It should be noted that flow LD mag¬ 
nitudes are also dependent on solution viscosity, t], and 
temperature, T, as illustrated in Fig. 6. Small LD sig¬ 
nals can be enhanced by adding glycerol to the solution 
to increase its viscosity (Daffom et al. 2001; Hiort et al. 
1990). It must be noted that viscosity itself is a function 
of both temperature and shear rate: viscosity of water 
at 60° is half that at 20°. Further, viscosity of water at 
1,000 s“* is one third that at 0 s“'; higher shear rates 
have less effect (Rittman et al. 2009). 


Electric Field Orientation 

Effective orientation for polar or polarizable molecules 
may be achieved by setting up an electric field between 
two parallel plates. This produces uniaxial orientation 
of the molecules. Electric field orientation is concep¬ 
tually simpler than flow orientation, and, when 
performed in nonconducting solvents, is an equilib¬ 
rium orientation methodology whose effect can be 
predicted theoretically. 

Crystalline Samples 

Crystalline samples can provide perfect orientation if 
the sample is aligned so that the parallel direction is 
a unique axis in the crystal. However, for transmission 
spectroscopy, very thin slices are usually required 
because of high absorptivity. If the light is incidence 
along an axis about which the sample has threefold or 
higher symmetry, it will give LD = 0. Thus, to collect 
data for such a sample, the sample may be tilted from 
the perpendicular orientation and the equation for the 
magnitude modified accordingly (Norden 1978). 
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y, perpendicular direction 

X. propagation of light 
Z. orientation direction 

Linear Dichroism Spectra - Measurement, Fig. 3 Schematic of a Couette flow cell showing flow orientation in a coaxial flow cell 
with radial incident light. {X, 7, Z} denotes the laboratoiy fixed axis system 




Linear Dichroism Spectra - Measurement, Fig. 4 Large 
volume (2 mL) inner rotating cylinder Couette flow cell with 
500 pm annular gap (Rodger 1993) and microvolume (25-60 
pL) outer rotating (Marrington et al. 2004, 2005) Couette flow 


cell showing the outer quartz capillary (3 mm inner diameter) 
and inner quartz rod (2.4 mm outer diameter) which when 
assembled results in an annular gap of 250 pm 



Liposome 


Shear deformed liposome 


Membrane normal 



Linear Dichroism Spectra - Measurement, Fig. 5 Schematic diagram of liposomes distorted in shear flow 






























Linear Dichroism Spectra - Measurement 


1243 


L 


a 



Linear Dichroism Spectra - Measurement, Fig. 6 (a) Vis¬ 
cosity dependence of Couette flow LD (measured on a Jasco 
J-815 spectropolarimeter) at 260 nm for a solution of DNA 
(200 pM calf thymus DNA in water). The viscosity was deter¬ 
mined on the same solution as the LD using an Advanced 
Rheometer ARIOO. The viscosity was varied by changing the 

Orientation by Evaporation or Assembly 

Many molecules may be oriented simply by evaporat¬ 
ing them onto a surface that is transparent to the radi¬ 
ation. This method works particularly well for planar 
aromatic molecules or samples embedded in lipid bila¬ 
yers. However, the plate may need to he tilted for LD 
measurements (as discussed above for crystalline sam¬ 
ples) since the molecules usually orient preferentially 
parallel to the surface of the quartz resulting in the 
unique axis being perpendicular to the plate and all 
in-plane directions having the same distribution of 
molecular orientations. The extent to which the solvent 
needs to be removed depends on the sample. Most 
work with lipids usually proceeds by drying the sample 
and then adding a salt solution chosen to give the 
required humidity (O’Brien 1948). Some molecules 
such as Alzheimer’s fibers are sufficiently large and 
rigid that simply pipetting them onto a surface from 
solution may produce a significant degree of orienta¬ 
tion (Marshall et al. 2010). 

Summary 

In this entry, some options for instrument configuration 
and sample orientation methodologies have been 


b 



Temperature (oC) 

temperature from 20° C to 60° C. The shear rate for both LD and 
rheometry was 1,900 s“*. (b) Temperature dependence of the 
dynamic viscosity of DNA (200 pM calf thymus DNA in water). 
Viscosities were determined using a Cannon-Manning 25 E50 
semi-micro viscometer (Gilroy et al. 2011) 

discussed. Further information is available in the 
references. 
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► DNA-Ligand Circular Dichroism 
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► Linear Dichroism 

► Linear Dichroism Spectra - Measurement 

► Linear Dichroism Spectroscopy: Theory 

► Nucleic Acid Linear Dichroism 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 
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Coventry, UK 

Synonyms 

Linear dichroism spectroscopy 

Definition 

LD is defined as the difference in absorption of two 
linearly polarized light beams that are polarized at 
right angles to one another and to the direction of 
propagation: 

LD=A//-A^ (1) 

This notation is normally reserved for uniaxial sam¬ 
ples with // denoting the direction of the unique axis. 
So, although most LD experiments have (or are 
assumed to have) effective uniaxial orientation, it is 
better when deriving equations to use the more general 
definition 

LD = Az-Ay (2) 

where Az is the absorbance of Z-polarized light, and 
similarly Ay. {X,Y,Z} form what is known as 
a laboratory-fixed axis system since they are used to 
identify macroscopic aspects of the experiment. We 
shall use {x, y, z} for the molecule-fixed axis system 
where z is the molecular orientation axis. For any one 
molecule the two axis systems are related as illustrated 
in Fig. 1. Unfortunately, each molecule in a sample 
probably has different values of {c, Q. 
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Fig. 1 Macroscopic {X, Y, Z} 
and molecular axes {x, y, z} 
systems and the angles relating 
them. For a uniaxial rod-like 
system we write 6 = C 


Molecular 
orientation axis 


Molecular 
orientation axis 




Basic Characteristics 

The two key equations used in the analysis of 
biomacromolecules LD are derived below following 
the methodology of (Rodger and Norden 1997; Norden 
et al. 2010). 

LD of Uniaxial Systems 

Another key concept is that of the transition moment of 
a transition. It is a vectorial property having a well- 
defined direction (the transition polarization) within 
each molecule and a well-defined length (which is 
proportional to the square root of the absorbance). 
We denote the electric dipole transition moment by 
the vector /«,. Az is then proportional to the square of 
the Z-component of /u, so the LD of a sample is: 


the square of the component of the electric dipole 
transition moment, yu,, which lies along the direction 
of the electric field of the light, we may write 

Az = ku^cos^a 

4 , 2 . 2.2 ( 5 ) 

Ay = K/i sm asm y 

where a is the angle between the transition dipole 
moment and z (Fig. 1), and y is the angle the projection 
of this vector makes with the x axis. Thus 

LD = Az — Ay = kfi^ (cos^a — sin^asin^y) (6) 

Similarly, the isotropic absorbance is the average of 
the absorbance in the X, Y, and Z directions, so 


LD = k{,xl - ix\) (3) 

where () denotes an average over the orientational 
distribution of the molecules in the sample, and k is 
a positive constant. 

The main equation used to interpret LD spectra is 




^ (Az +Ay+Ax) = ^- (Az + 2Ay) 
(cos^a -I- 2sin^asin^y) 


(7) 


Since the experiment does not restrict the value of y, 
we average over sin^y. As <sin^y> = 1/2 we have 


LD 3 / T , 

LD' = — = -5(3cos^a - l) (4) 

^iso ^ 

where A,^o is the isotropic absorbance of the same sam¬ 
ple in the same path length, S is the orientation param¬ 
eter, and a is the angle between the molecular transition 
dipole moment and the molecular orientation axis. Its 
derivation proceeds from Eq. 3 as follows. 

Assume first that the molecule of interest is per¬ 
fectly oriented so that the molecular orientation axis 
lies along the macroscopic parallel direction, that is, 
z = Z, S — 1. Since the absorbance is proportional to 


LD = kfi ( cos a — - sin a 


cos^a — sin^a) 


2 ^ ' 

= (3cos^a — l) 


, kfJ-^ / 2 . 2 \ 

Al,o = ^ (cos a -F sm a) = — 


( 8 ) 


( 9 ) 
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X, Membrane normal 



Liposome 


Shear deformed liposome Model of a deformed liposome 

showing lipids with net orientation 


Linear Dichroism Spectroscopy: Theory, Fig. 2 Schematic of liposomes distorted in shear flow 


Equation 4 follows upon acknowledging that in 
reality the orientation is not perfect so we include S 
and get 

LD = ^Ai,oS{3cos^oi - 1) (10) 

The form of S is derived in the final section of this 
article. 

LD of Cylindrically Symmetric Systems 


orientation axis, to an extent accounted for by S. In 
this case, it is convenient to define the orientation of 
a transition moment, /ti, of an analyte by sitting on the 
analyte and defining x to be the normal to the surface of 
the cylinder that goes through the analyte. Let P be the 
angle between jt and fx (Fig. 2). The analyte orientation 
is not affected by the shear flow (the forces are too 
small), so on average any analyte transition moment, 
fjL, will be uniformly distributed about the x axis. Let i/r 
be the angle between the projection of fi onto the y/z 
plane and z. Thus in the local cylinder coordinate 
system 


There are a number of systems where the axis about 
which the chromophores of interest are oriented are 
perpendicular to the surface of a cylinder. Such sys¬ 
tems include flow distorted liposomes (as illustrated in 
Fig. 2), or ligands bound to the surface of carbon 
nanotubes. The equation analogous to Eq. 10 for such 
cylindrical systems is (Ardhammar et al. 1998; Rodger 
et al. 2002) 


fi = /i(cos P, sin/) smijj, sin^cos q (12) 

where /i is the magnitude of fi. The reduced linear 
dichroism is by definition 


LD' = 5 



35 




(13) 


LD'=^=^(1-3cos2jS) (11) 

^iso 4 

where fS is the angle between the lipid (or more gener¬ 
ally cylinder surface) normal and the transition 
moment polarization as illustrated in Fig. 2. A deriva¬ 
tion of this equation is outlined below. 

We assume the orientation is locally uniaxial. Let z 
be along the long axis of the cylinder as in Fig. 2. z is 
uniformly distributed about Z, the macroscopic 


Now, jj.^ = and jiy may be written as the dot 
product of the transition moment vector and the vector 
for the Y axis in the {x, y, z} coordinate system. Thus 

/iy = H ■ Y 

= ^i(cos fi, sin P sin if, sin/t cos iA){A-y z} 

• (siny,cosy,0){^_^_,j 

= ^i(cos/I siny-f sin/t sin ://cos y) (14) 
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where y takes values from 0 to 2ti. Thus, 

_ 2 ^ ( sin^^cos^i/r — cos^/lsin^y — sin^/lsin^i/rcos^y \ 
\ — 2cos/lsinysinjSsmi/rcosy J 

(15) 

Both ijj and y take values from 0 to 2n, so upon 
averaging over them: 

^ ^^ 2sin^;6-2cos^^-sin^;8 

4 (16) 

= ^‘5(1 - 3cos2/?) 

which is the required equation. 

Two special cases of Eq. 16 are: (i) when a 
transition moment is parallel to the cylinder normal 
(e.g., the lipids), for which /) = 0 and LD73 S = —1/2; 
and (ii) when it is perpendicular to the cylinder normal 
for which P = 90° and LD73 S = +1/4. The 210-nm 
transition of an a-helix inserted into the membrane is 
an example of case (i) - but this negative band usually 
only appears as a small dip between two larger positive 
signals. The same helix has fS = 90° for its 222 nm and 
lower wavelength 7t-7t* transitions. Conversely it is the 
210-nm transition of a helix lying on the surface that 
satisfies case (ii). 


In general, for the case of molecules oriented per¬ 
pendicular to the surface of a cylinder, the maximum 
positive LD' signal (3/4) is half that of the maximum 
negative LD'^ signal (—3/2). This is in contrast to the 
situation for the standard geometry of Eq. 4, such 
as flow-oriented DNA, where lower and upper limits 
are —3/2 and +3 respectively. 

The Orientation Parameter for a Uniaxial 
System 

The form of the orientation parameter for a uniaxial 
system can be derived as follows. Using the notation 
we developed above we may write the transition 
moment in the molecular coordinate system as 

fi = {^sin y sina, cos y sina., cosa)^^^ (17) 

This vector can be transformed into the laboratory- 
fixed axis system using Euler’s transformation where 
9 is the angle between z and Z, so 
(j) is the angle between x and a reference line defined fo 
be perpendicular to both Z and z, and 
ij/ is the angle between X and the reference line. 

Thus, 


f^{XYZ} — 


( coscpcosij/ — sincpsimJ/cosO 
—cos(l)siml/ — sin(l)cosil/cos6 
sin<j>sin9 


sin(l)cosil/ -f cos(l)simlicosd simj/sinO \ / sinysina \ 
—sin(l)sin\l/+ cos(l)cos\l/cos6 cosij/sinO cosysina i.i (18) 
—cos(f)sin6 cosO J \ cosol 


Upon squaring the components and averaging over y, 
(j), and xj/ we get 

l^z ~ 2 + '^cos^Ocos^a^jj^ 

= ^ ((l — cos^O) (l — cos^a) + 2cos^0cos^a)jj^ 
= ^ (3cos^9cos^a — cos^a — cos^O + l)/t^ 

= (^{'icos^O — l)(3co^^a— l)-f^^/t^ 

( 19 ) 


= - (sin^a + cos^6sin^oi + Isin^dcos^oi.) // 

= -((l — cos^a) +cos^6( \ — cos^a) +2(l — cos^9)cos^a)fi^ 
— ^ (—3cos^dcos^oc + cos^oc + cos^O +1 }m^ 

= l)(3co?a-+ 

( 20 ) 

Thus, 

LD = — fly 

= - (3cos^0 — l) - (3cos^a — l) (21) 


and 
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So 

5 = ^ {3cos^6 - 1) (22) 

The main message to take from this is that mole¬ 
cules in most oriented systems adopt a distribution 
about Z. 


Summary 

In this entry, the derivation of the two key equations 
most frequently used to analyze biomacromolecules 
linear dichroism has been given. Further information 
is available in the references. 


Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► DNA-Ligand Circular Dichroism 

► DNA-Ligand Flow Linear Dichroism 

► Linear Dichroism 

► Linear Dichroism Spectra - Measurement 

► Linear Dichroism Spectroscopy: Theory 

► Nucleic Acid Linear Dichroism 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 
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Linear Motifs in Protein-Protein 
Interactions 

Robert B. Russell 

Cell Networks, University of Heidelberg, 

Heidelberg, Germany 

Synonyms 

Protein complexes; Protein-protein interactions; 
Peptide motifs 

Definition 

Protein-protein interactions are highly diverse in terms 
of molecular mechanism. Perhaps, most typical are 
those where one large globular segment in one protein 
binds to another such segment in another. These kinds 
of interactions are seen everywhere in nature, particu¬ 
larly in large molecular machines such as RNA poly¬ 
merases or the exosome. Another major class of 
mechanism includes those interactions mediated by a 
globular domain in one protein binding to a short linear 
stretch - typically between 3 and 10 amino acids in 
length - in another. The peptide stretches seen to bind 
the same or similar globular domains are frequently 
observed to contain a repeating pattern of residues or 
motif. Short linear motifs (or SLiMs) play major roles 
in interactions, being involved in binding, modifica¬ 
tion, cleavages, or targeting events in a wide range of 
cellular processes, particularly signaling, transcription, 
and intracellular transport. For example, one of the 
best known motifs is the canonical SH3 PxxP binding 
motif, where the prolines indicate the core residues 
common to the majority of SH3 protein binding sites. 

Linear motifs stand apart from protein domains in 
a number of fundamental ways. First, their short length 
makes them virtually impossible to detect significantly 
by traditional sequence comparison methods. Second, 
they show a weaker pattern of conservation across 
orthologous proteins than protein domains and can, in 
contrast to domains, readily arise by convergent evo¬ 
lution. There are only a few hundred linear motifs 
currently known (in the ELM database as of early 
2012) compared to more than 10,000 globular protein 
domains. This suggests that there are perhaps dozens 
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or even hundreds of new linear motifs to be discovered, 
and indeed, a number of computational methods have 
been developed to uncover them. 

Significantly, the vast majority of linear motifs occur 
outside the globular parts of the proteome. At least 90% 
of known motifs occur within stretches of sequences 
that do not fold into tight compact units and are often 
intrinsically disordered, thus providing clues as to why 
so many eukaryotic proteins contain such regions. For 
example, the transcription factor p53 is largely disor¬ 
dered, and several known linear motifs lie within the 
disordered parts of its sequence. Given the roles of 
linear motifs in protein-protein interactions, this tallies 
well with the fact that p53 is one of the most promiscu¬ 
ous proteins in the cell with regard to interactions. 

Cross-References 

► Coupled Folding and Binding 

► Post-Translational Modifications 

► Structurally Disordered Proteins 
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Linear Prediction in NMR Spectroscopy 

Frank Delaglio 

Software Science Consultant, North Potomac, 

MD, USA 


Synonyms 

Autoregressive analysis; LPSVD 


Linear Prediction (LP) refers to methods which use 
autoregressive analysis to model one-dimensional 
time-domain NMR data. LP can be used to estimate 
signal frequencies, exponential decays, and amplitudes. 
More commonly, LP is used to extrapolate time-domain 
data, so that the corresponding Fourier transform spec¬ 
trum has greater apparent resolution and smaller trun¬ 
cation artifacts. The same approach can be used to 
replace missing or distorted points in time-domain data. 

LP methods determine a set of coefficients Cj ... Ck 
which model the points Xj ... Xn in a uniformly spaced 
time-domain data series. Depending on the specific LP 
method, the coefficients are adjusted and applied in 
different ways. In the LP model, a time-domain data 
point is expressed as a linear combination of the 
K points it follows (forward LP) or the K points 
which come after it (backward LP). The LP model 
has special significance for NMR, since in the absence 
of noise it is exact for exponentially decaying sinu¬ 
soids, the usual ideal form of NMR signals. 

The LP model leads to a series of linear equations in 
terms of the K LP coefficients and the N data points. If 
the number of coefficients is chosen so that K <= N/2, 
the coefficients can be estimated by conventional 
least squares methods, usually Singular Value Decom¬ 
position (SVD). Each LP coefficient encodes 
a frequency and decay, which can be extracted by 
finding the roots of a complex polynomial whose 
coefficients are the Ci ... Cn values. Once the 
frequencies and damping factors are determined, 
amplitudes can be extracted by fitting the original 
data to a corresponding sum of K damped sinusoids, 
again using conventional least squares methods. In 
practice, these parameter estimates are often of little 
direct use, since they encode combinations of true 
signal and noise. So, rather than using such parametric 
information directly, the LP coefficients are instead 
used to extend the time-domain data. 

In this extrapolation procedure, the LP coefficients 
are used to append a new synthetic point using a linear 
combination of the last K points in the original data. 
The new point can then be used along with the previous 
K-1 points to append yet another synthetic point. This 
process can be continued indefinitely, but in practice it 
becomes more unstable as additional points are 
predicted from previous synthetic ones. Therefore, 
LP is often limited to extrapolating data to about 
twice its original size, so that the synthetic data does 
not outweigh the original data. 
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Various refinements are used to improve LP 
performance in the presence of noise. In root-fixing 
approaches, decays encoded in the LP coefficients are 
adjusted so that they always correspond to decreasing 
signals. In forward-backward approaches, forward 
and backward coefficients are determined separately, 
and averaged in an attempt to generate a single 
improved set of coefficients. In mirror-image 
approaches, which require non-decaying signals, 
time-domain data is temporarily appended to its mir¬ 
ror image before LP, increasing the effective data 
size so that more linear prediction coefficients can 
be extracted. 


Cross-References 

► Absorbance Spectroscopy: Spectral Artifacts and 
Other Sources of Error 

► Maximum Entropy Reconstruction 

► NMR 

► Sparse Sampling in NMR 

► Triple Resonance NMR 


Lipid Bilayer 

► Alkanols - Effects on Lipid Bilayers 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal Titra¬ 
tion Calorimetry (ITC) of Lipid Bilayers 

► Diffraction Methods for Studying Transmembrane 
Pore Formation and Membrane Fusion 

► Fluorescence and FRET in Membranes 

► Fluorescence Correlation Spectroscopy of Lipids 

► Lipid Domains 

► Lipid Flip-Flop 

► Lipid Lateral Diffusion 

► Lipid Organization, Aggregation, and Self- 
assembly 

► Membrane Fluidity 

► Micropipette Manipulation of Lipid Bilayer Mem¬ 
branes 

► Neutron Scattering of Membranes 

► Pressure Effects on Lipid Membranes 

► Spin-Labeling EPR of Lipid Membranes 


► Sum Frequency Generation Vibrational Spectros¬ 
copy 

► Supported Lipid Bilayers 

► Thermodynamics of Lipid Interactions 


Lipid Bilayer Asymmetry 

Salvatore Chiantia and Erwin London 
Dept, of Biochemistry and Cell Biology, 

Stony Brook University, Stony Brook, NY, USA 

Synonyms 

Bilayer asymmetry; Membrane asymmetry 

Definition 

Lipid hilayer asymmetry refers to the difference 
between the lipid composition and/or physical proper¬ 
ties of the two lipid monolayers that make up a lipid 
bilayer. 

Basic Characteristics 

Lipid bilayer asymmetry refers in general to the 
vectorial nature of biological membranes, structures 
constituted by a bilayer of lipids into which 
► membrane proteins are embedded. Membrane lipids 
contain a hydrophilic/polar headgroup that contacts 
the aqueous environment as well as (in general) 
two hydrophobic, long hydrocarbon chains, each of 
which most commonly contain 16-24 carbon atoms 
(Gennis 1989; Heimburg 2007). The membrane 
bilayer is composed of two lipid monolayers (leaflets). 
In each monolayer, the lipids are oriented so that their 
polar headgroups interact with water, while the hydro¬ 
carbon chains from each monolayer form the core of 
the membrane. The hydrocarbon chains from each 
monolayer contact each other at the bilayer midplane. 
For the plasma membrane that surrounds the cell, the 
two monolayers are often referred to as the inner leaflet, 
which faces the cytoplasm of the cell, and the outer 
leaflet, which faces the external environment. However, 
because the cell has many internal membranes for which 
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the cytoplasm-facing leaflet is on the outside, it is less 
ambiguous to distinguish the lipid monolayers by the 
terms cytofacial/cytoplasmic (i.e., the leaflet in direct 
contact with the cytoplasm) and exofacial/exoplasmic 
(i.e., the leaflet opposite the cytofacial leaflet). 

While membrane proteins are also asymmetrically 
oriented across the bilayer, the expression bilayer 
asymmetry mainly refers to the difference in lipid 
composition of the two leaflets. There are in fact 
many different types of membrane lipids, which can 
vary in terms of both the chemical structure of the polar 
headgroups and/or that of the hydrocarbon chains, so 
that there can be asymmetry both of lipid headgroup 
type and hydrocarbon chain structure in the two mono- 
layers. This implies that the two sides of the membrane 
might have different physical properties and that 
thermodynamic forces can act across the bilayer due 
to gradients in electrostatic, elastic, or, in general, 
chemical potential (Heimburg 2007). As an example, 
transverse lipid asymmetry has been quantitatively 
clearly demonstrated for the plasma membrane of 
erythrocytes. In these simple membrane systems, 
sphingomyelin (SM) and phosphatidylcholine (PC) 
are enriched in the outer leaflet, while aminopho- 
spholipids like phosphatidylethanolamine (PE) and 
phosphatidylserine (PS) are found mostly in the inner 
leaflet (Rothman and Lenard 1977; Gennis 1989). 
Similarly, lipid asymmetry has been characterized in 
the outer membrane of gram-negative bacteria, 
viruses, fibroblast phagosomes, platelets, and other 
membranes of eukaryotic cells (Rothman and Lenard 
1977; Gennis 1989; Fadeel and Xue 2009). The extent 
to which there is asymmetry of cholesterol, which can 
undergo rapid transverse diffusion between the inner 
and outer leaflet, has proven difficult to measure 
and remains controversial. 

Although the presence of a compositional differ¬ 
ence between the leaflets seems to be a general feature 
of many naturale membranes, the exact details of the 
lipid distribution across the bilayers vary from case to 
case. The amount of available unambiguous data is 
rather limited, due mainly to the complexity of the 
methods used to determine membrane asymmetry 
and to the inherent artifacts (e.g., interleaflet contam¬ 
ination, difficult isolation of a single specific cellular 
membrane, perturbation of the bilayer by asymmetry 
measurement methods, and lipid pools inaccessible to 
detection (Rothman and Lenard 1977)). Nevertheless, 
several approaches have been proven useful for 


the determination of the lipid distribution across 
the bilayer, namely, selective chemical labeling, 
phospholipid transfer proteins, and digestion by phos¬ 
pholipases (Gennis 1989). The common basic feature 
of these methods is the presence of a water-soluble 
“probe” that cannot cross a closed bilayer and that 
can thus interact exclusively with the exposed leaflet. 

Generation and Maintenance of Lipid Asymmetry 

The origin of lipid asymmetry in biological membranes 
is based on the intrinsic asymmetric nature of lipid 
biosynthesis and metabolism. For biosynthesis, the 
needed enzymes, substrates, and energy sources are 
most frequently located in the cytoplasm of the cell 
and in the cytosolic side of its membranes. For example, 
PE, PC, and PS are synthesized on the cytofacial side of 
the endoplasmic reticulum (ER), with the aminopho- 
spholipids thus already on the side of the membrane 
where they will be finally enriched once transported in 
the plasma membrane. Similarly, SM and many 
glycosphingolipids are synthesized in the luminal sur¬ 
face (i.e., exoplasmic leaflet) of the ER or Golgi, 
although initial steps in their assembly occur in the 
cytoplasm (Daleke 2003). The presence of enzymes 
that degrade specific lipids on only side of a membrane 
can alter asymmetry by analogous principles. 

Biosynthesis and degradation are not the only 
factors involved in asymmetry because lipids, under 
certain conditions, can undergo transverse transloca¬ 
tion between leaflets. Spontaneous translocation of 
lipids across the bilayer is often extremely slow. The 
spontaneous transbilayer motion of lipids (flip-flop) 
has in fact been measured to happen at negligible 
rates, with half-times of several hours or days, in 
model membrane vesicles and certain natural mem¬ 
brane systems (e.g., influenza virus) (Gennis 1989). 
However, there are lipid transporters that specifically 
catalyze and direct the otherwise slow and random 
transbilayer dynamics. These transmembrane proteins 
can be generally divided into the following: 
(1) flippases (including phospholipid translocases), 
which catalyze the ATP-dependent transport into the 
cytoplasmic leaflet, (2) floppases, which catalyze the 
ATP-dependent transport towards the exoplasmic leaf¬ 
let, and (3) scmmhlases, which catalyze bidirectional 
lipid movement in an energy-independent fashion 
(Daleke 2003). Several recent studies identified a 
group of P-fype ► ATPases, the ATP-binding cassette 
transporter family, and a scramblase family of proteins. 
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as candidates for the these three categories of trans¬ 
porters, respectively (Fadeel and Xue 2009; Holzer 
et al. 2010). 

Biological Function of Lipid Asymmetry 

The considerable amount of energy invested by the cell 
in order to maintain and regulate the transbilayer 
lipid organization of its membranes is justified by the 
numerous normal and pathological processes connected 
to changes in membrane asymmetry. As already 
mentioned, the uneven distribution of lipid species 
across the bilayer determines its physical properties, 
such as permeability, local curvature, membrane shape 
or the negative surface charge of the cytosolic leaflet 
(Rothman and Lenard 1977; Holzer et al. 2010). 
The latter factor in particular influences the electrostatic 
interactions between the cell membrane and charged 
membrane-associated molecules. Also, the enrichment 
of aminophospholipids in the cytoplasmic leaflet and 
the breakdown of this asymmetry are connected to 
cellular processes such as apoptosis, neuronal develop¬ 
ment, cell differentiation, engulfment by macrophages, 
and platelets activation (Balasubramanian and Schroit 
2003). Perturbation of lipid asymmetry also appears in 
the context of several pathological conditions like 
tumors, thalassemia, and diabetes (Balasubramanian 
and Schroit 2003). As a further example, Scott syn¬ 
drome, a congenital bleeding disorder, is clearly caused 
by a deficiency of Ca^'^-induced phospholipid 
scramblase activity. Of interest, the underlying alter¬ 
ation of bilayer asymmetry control in Scott syndrome 
is not restricted to platelets but is also evident in other 
cell types, like lymphocytes (Fadeel and Xue 2009). 
Finally, it is well known that several cell types display 
a voltage gradient across their plasma membrane, the 
resting potential of a nerve cell being for example 
around —70 mV (Heimburg 2007). Recent studies sug¬ 
gest that the asymmetric distribution of lipids across the 
membrane and the intrinsic membrane potential are 
coupled (Fadeel and Xue 2009). 

Asymmetric Model Membrane Systems 

In order to understand the structure and function of 
complex biological membranes, artificial bilayers 
composed of membrane lipids have been used 
as simpler and more well-defined physical models. 
Biophysical investigation of such model membrane 


systems can be used to derive fundamental information 
regarding transbilayer dynamics, lipid domain forma¬ 
tion, membrane fusion, and the energetic interaction 
(coupling) between the two leaflets. They have proven 
invaluable for such studies, but it has been difficult 
to prepare asymmetric model membranes until 
recently. Important progress has been made in prepar¬ 
ing asymmetric supported bilayers using different 
approaches: vesicle fusion (Lin et al. 2006), sequential 
depositing of monolayers, or a combination of the two 
(Kiessling et al. 2009). Furthermore, methods have 
been developed to prepare asymmetric (free-standing) 
membranes based on redistribution of charged lipids 
via a pH gradient (Eastman et al. 1991), leaflet-by- 
leaflet assembly through an oil-water interface (Pautot 
et al. 2003; Hamada et al. 2008), Montal-Mueller 
assembly (Collins and Keller 2008), or outer leaflet 
exchange (Pagano et al. 1981). The latter approach is 
based on the transfer of certain lipids between the outer 
leaflets of two distinct vesicle populations. This pro¬ 
cess can be optimized using transporter molecules that 
facilitate the intermembrane dynamics, like phospho¬ 
lipid transfer proteins (Bloj and Zilversmit 1981) or 
cyclodextrins (Cheng et al. 2009), a family of cyclic 
oligosaccharides. The use of cyclodextrin has proven 
very useful in preparing stable asymmetric vesicles 
with different dimensions (from 10*“^ nm to 10^ pm) 
and a large variety of lipid species (Cheng et al. 2009, 
2011; Chiantia et al. 2011). 
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Synonyms 

Lateral stress profile; (negative) Lateral tension profile 

Definition 

The lateral pressure profile is the distribution of lateral 
stresses across the width of a lipid bilayer. It is com¬ 
posed of repulsive pressure components from interac¬ 
tions between the lipid molecules and a cohesive 
hydrophobic tension that favors segregation of the 
lipid chains from water. These components are distrib¬ 
uted inhomogeneously across the bilayer. 

Introduction 

The bilayer analog of the well-known surface pressure 
of a lipid monolayer spread at the air-water, or oil- 
water, interface is a negative tension. Thus, the net 
lateral pressure is zero in a lipid bilayer that is not 
under tension. The bilayer does, however, possess an 
internal lateral pressure that balances the hydrophobic 
tension, which is the driving force for self-assembly of 
amphiphilic lipid molecules in water. These different 
stresses are not distributed uniformly across the bilayer, 
and the resulting lateral pressure profile can affect the 
conformation and stability of membrane-embedded pro¬ 
teins or drive transitions to nonlamellar lipid phases. 

Lateral Pressure 

The lateral pressure in a hydrated lipid assembly is 
defined by the derivative of the free energy with 
respect to the area: 


tttot - 



( 1 ) 


where Agtot is the total elastic free energy and A/ is the 
surface area per lipid molecule. For a lipid monolayer 
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at the air-water or oil-water interface, this is the con¬ 
ventional surface pressure, and for a bilayer vesicle, it 
corresponds to a negative membrane tension. Units are 
force per unit length (= energy per unit area). 

At mechanical equilibrium (dAg^ot/dAi = 0), in the 
absence of external forces, the lateral pressure in 
a bilayer vesicle is zero, i.e., the natural state of the 
membrane is tension-free. The free energy of the 
bilayer can be partitioned into a net-repulsive compo¬ 
nent, Agint, that arises from the lipid-lipid interactions 
(headgroups and chains) and headgroup-water interac¬ 
tions, and a cohesive component, —YphotA/, that mini¬ 
mizes hydrophobic contact of the chains with water 
(see, e.g., Cevc and Marsh 1987). At equilibrium, the 
corresponding opposing forces balance giving: 

ttint = Tphob (2) 

where (= —OAgint/cM/) is the repulsive internal 
lateral pressure in the bilayer and Yphob is the hydro- 
phobic tension or free energy density. In terms of 
molecular surface, the surface density of the hydropho¬ 
bic interaction is «10-20 kJ moU'/nm^, depending on 
whether volume corrections are introduced (Cevc and 
Marsh 1987; Marsh 1996). Taking into account rough¬ 
ness of the molecular surface (a factor of two for 
a hemisphere, or of 1.2 for close-packed spheres), the 
hydrophobic free energy density at the planar polar- 
apolar interface corresponds to a hydrophobic tension 
of Yphob « 35-39 mN m“' at 300 K (Marsh 1996). It 
thus might be expected that a monolayer is equivalent 
to a bilayer at surface pressures in the region of 
35-39 mN m“*; at this point of the monolayer n-A 
isotherm, the area per lipid A/ is indeed similar to that 
in the bilayer (Marsh 1996, 2006a). 

Because the net tension in a bilayer membrane is 
zero, the repulsive internal lateral pressure, ttin,, cannot 
be measured directly and contributes only in second 
order to the elastic stretching energy via the lateral 
compressibility (Marsh 1997). The elastic modulus 
for area extension is given by: 


Ka — Ai 


/ 9^Agtot \ 
V dA] 



(3) 


Typical values obtained from unilamellar bilayer 
vesicles are Ka « 70-100 mN m~* per monolayer 


Lipid Bilayer Lateral Pressure Profile, Table 1 Area exten¬ 
sion modulus, of lipid monolayers and bilayers (Cevc and 
Marsh 1987; Marsh 2006a) 



Monolayer 


Bilayer 

Lipid 

T 

(°C) 

71 

(mN m“ 

A{dnldA)T 
*) (mN m^’) 

T 

(°C) 

KaI2 

(mN m^‘) 

(18:1cA'’)2PC 

21 

35 

78 

21 

118 it 8 

(12:0)2PE 

44.2 

32 

112 





transbilayer position, z (nm) 

Lipid Bilayer Lateral Pressure Profile, Fig. 1 Atomistic 
molecular dynamics simulations of the lateral pressure profiles 
in hydrated bilayers of dipalmitoyl phosphatidylcholine {solid 
line) or dioleoyl phosphatidylcholine {dashed line) at 50°C 
(Adapted from Ollila et al. 2007a, b) 


(Marsh 2012), which is similar to values obtained 
from the slopes of monolayer ti-A isotherms at 
a surface pressure of ^35 mN m“' (Cevc and Marsh 
1987; Marsh 2006a) (see Table 1). 

Lateral Stress Profile 

The lateral pressure enters in first order to the bending 
elasticity, but this is via the transmembrane profile of 
lateral pressure: 


P{^) 


dtltot 


(4) 


where z lies along the bilayer normal, rather than by the 
lateral pressure directly. The lateral stress profile,/9(z), 
has the dimensions of a three-dimensional pressure, 
i.e., force per unit area. Representative simulations of 
the transmembrane profiles of lipid bilayers that are 
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Lipid Bilayer Lateral Pressure Profile, Table 2 Moments of the lateral pressure profile from coarse-grained MD simulations and 
experimental values of bending elasticity 


Lipid 

Pi (pN) 

kcCo (pN) 

2SPi - P 2 (J) 

k, (J) 

References 

(18:1cA®)2PE 

-10.5 

-15.2 ±0.9“ 


-(3.0 ± 0.3) X 10"“'’ 

Marrink et al. (2007) 

(14:0)2PC 

-0.8 ±0.1 

KdO 

-(2.3 ±0.1) X 10"“ 

-(2.3 ± 0.2) X 10"“'’ 

Orsi et al. (2008) 


“Experimental value from Hn phases corrected to the neutral plane (Marsh 2012) 

'’Calculated from the experimental monolayer value of for (18:lcA^)2PE or (14:0)2PC (Marsh 2012), together with the experi¬ 
mental ratio kc/kc = —0.75 it 0.08 (Templer et al. 1998) 


obtained from molecular dynamics (MD) calculations 
are shown in Fig. 1 . These are characterized by a large 
negative (cohesive) component that is centered about 
the polar-apolar interface, and positive net repulsive 
components in the region of the lipid headgroups and 
lipid chains. The negative peak has an integrated 
area that is comparable to the hydrophobic tension 
(cf. above). The positive pressure in the headgroup 
region is the lateral hydration pressure (Cevc and 
Marsh 1985, 1987) plus possible electrostatic repul¬ 
sion, and that in the chain region is an entropic pressure 
driven by rotational isomerism in the lipid chains 
(Fattal and Ben-Shaul 1993). 

Moments of the Lateral Pressure Profile 

The spontaneous bending moment per unit length of 
one monolayer of a lipid bilayer is given by the first 
moment, Py, of the lateral pressure profile (see, e.g.. 
Marsh 2001; Marsh et al. 2003): 

i 

kcCo = J zp{z).dz = Pi (5) 

0 

where is the elastic modulus of mean curvature and 
Co is the spontaneous curvature (Marsh 2006b); I is the 
width of the lipid monolayer, and the origin of z is at 
the bilayer midplane. The first moment does not 
depend on the choice of the origin for z because 
fp(z).dz = 0. Spontaneous curvature expresses the 
intrinsic propensity of a lipid monolayer to bend and 
is determined from the radius of the pivotal plane 
(Co = llRo,p) of inverted hexagonal (Flu) phases of 
the lipid in question (Marsh 2011). Evaluation of the 
first moment of the lateral pressure profile simulated by 
coarse-grained molecular dynamics produces values 
for the spontaneous bending moment in each mono- 
layer that are of the same order as those found 



Lipid Bilayer Lateral Pressure Profile, Fig. 2 Saddle-splay 
bending of a lipid bilayer, where the principal curvatures are 
Ci(=l/Si) = —C 2 (= 1 /^ 2 ), giving zero mean curvature and 
a Gaussian curvature of Cq = C\C 2 ~ — I/R\ 

experimentally (see Table 2). For a symmetrical lipid 
bilayer membrane, the net spontaneous curvature is 
zero because the spontaneous bending moments in 
the two apposing monolayers cancel. Both monolayers 
are then in a state of curvature frustration that is 
imposed upon them by being forced into a lamellar 
configuration. The resulting inhomogeneity in lateral 
stress can control the folding, insertion, and activity of 
membrane-bound proteins (Marsh 2007; Pocanschi 
et al. 2006). 
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The second moment, P 2 , of the lateral pressure 
profile determines the Gaussian curvature modulus 
of the lipid monolayer (see Helfrich 1981; Marsh 
2006b): 

/ 

kc = — J (z — S)^p(z).dz = 26P\ — P 2 (6) 

0 

where z = (5 is the position of the neutral plane 
(or plane of inextension). Bicontinuous inverse cubic 
lipid phases are based on infinite periodic minimal 
surfaces that have zero mean curvature (ci + C 2 = 0) 
everywhere, and only the Gaussian curvature is non¬ 
zero (ciC2<0) - see Fig. 2. 

The ratio of Gaussian to mean curvature moduli, 
based on the swelling behavior with increasing 
water content of inverse cubic phases that are formed 
from a mixture of monooleoylglycerol, dioleoyl 
phosphatidylcholine, and dioleoyl phosphatidyletha- 
nolamine (58:38:4 mol/mol/mol) at 25°C, is 
kc/kc = —0.75 zkO.OS for a monolayer (Templer 
et al. 1998). Coarse-grained MD simulations produce 
moments of the transbilayer stress profile that are 
consistent with this ratio (see Table 2). The lateral 
pressure profile is, therefore, one of the crucial 
controlling factors in the formation of nonlamellar 
lipid phases. 


Summary 

The net lateral pressure in a tension-free lipid bilayer 
is zero, but the compensating lateral stresses are 
distributed nonuniformly across the bilayer width. 
The area derivative of the repulsive component of 
the internal lateral pressure (which is equivalent to 
a monolayer surface pressure) is proportional to the 
area elastic modulus. The first moment of the lateral 
pressure profile for one monolayer leaflet is equal to 
the product of the monolayer mean curvature modu¬ 
lus and spontaneous curvature. The second moment 
of the leaflet lateral pressure profile determines the 
Gaussian curvature modulus. Bilayer lateral pressure 
profiles are important for bending elasticity, 
nonlamellar phases, and conformational stability of 
embedded proteins. 
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Synonyms 

Lipid bilayer 

Introduction 

Biological membranes are not homogeneous along the 
plane of the membrane, but rather form clusters of 
molecules resulting in the formation of membrane 
domains. In eukaryotes, sterols, such as cholesterol, 
play a major role in driving the formation of lateral 
inhomogeneities in the membrane. However, domains 
also are present in bacterial membranes that are devoid 
of cholesterol. We will review some of the interactions 
resulting in the formation of domains and indicate 
how they can affect the biological properties of the 
membrane. 

Membrane Domains 

It is now appreciated that biological membranes and 
many model membranes are inhomogeneous in the 
plane of the bilayer. This is a more recent conceptual 
advance over the fluid mosaic model of membranes. As 
a consequence of this lateral heterogeneity, some 
molecular components will be enriched in certain 
regions of the membrane that are referred to as domains. 

Biological membranes are composed principally of 
phospholipids and proteins. ► Membrane proteins can 
also be reconstituted into a lipid membrane to study the 
properties of a membrane with a more limited number 
of components than biological membranes. Domains 
can be enriched in either lipid and/or in protein 
components in both biological and model membranes. 
If the system is at equilibrium, the molecular compo¬ 
nents of the membrane will rearrange themselves 
to achieve a state of minimum energy. The line 


tension that develops as a result of the formation of 
a boundary between a domain and the bulk lipid 
makes an important contribution to the thermodynam¬ 
ics of domain formation (Garcia-Saez et al. 2007) 
(► Thermodynamics of Lipid Interactions). Molecules 
that partition into the boundary interface can 
strongly affect lateral heterogeneity (Schafer and 
Marrink 2010). 

The most common arrangement of lipids in biolog¬ 
ical membranes is the bilayer. Model membranes can 
undergo morphological rearrangements to form other 
phases including cubic phases and the inverted hexag¬ 
onal phase (► Lipid Shape and Curvature Stress). This 
may also occur, particularly the formation of cubic 
phases, in biological membranes, but it is not 
a common situation. In this chapter on domains, we 
will only consider domains in bilayer membranes. 

Bilayers can form crystalline, subgel, gel, ripple, or 
liquid crystalline phases (► Phase Transitions and 
Phase Behavior of Lipids). Biological membranes are 
generally in the liquid crystalline phase. Organisms 
have homeostatic mechanisms to maintain their mem¬ 
branes in this physical state and avoid the formation of 
gel phases that may be too rigid to carry out certain 
biological functions, or inverted hexagonal phases that 
would destroy the permeability barrier of the mem¬ 
brane bilayer. The liquid crystalline phase can exist 
in two different states, the liquid-ordered or Lq phase 
and the liquid-disordered or Lj phase. These are both 
liquid crystalline phases in which the lipid components 
undergo rapid lateral diffusion, more rapid than is 
found with gel state lipid. However, the acyl chain 
geometry in the Lq phase is extended, resembling 
that of a gel phase, with saturated CH 2 -CH 2 groups 
forming trans rotomers. In contrast, the acyl chain 
geometry of the L^ phase has both trans and gauche 
rotomers and is less extended. Phase diagrams can 
represent transitions between phases. At a particular 
point in the phase diagram, defined as the critical point, 
the system becomes very sensitive to large fluctuations 
as a result of small changes in concentrations or in the 
environment (Honerkamp-Smith et al. 2009). 

Model Raft Domains 

The increased interest in membrane domain formation 
among biologists is a consequence of the growing 
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awareness of the existence of such structures in 
biological membranes. One type of domain has been 
termed the membrane “raft.” Raft domains in biolog¬ 
ical membranes are enriched with cholesterol and 
sphingomyelin (► Sphingolipids and Gangliosides) 
and are believed to be in the liquid-ordered (Lq) 
phase. Interestingly, model membranes composed of 
equimolar quantities of palmitoyl-sphingomyelin, 
dioleoyl-phosphatidylcholine, and cholesterol will 
spontaneously form sphingomyelin-enriched domains 
over a wide range of temperatures (Feigenson 2009). 

Raft Domains in Biological Membranes 

There has been an ongoing discussion about the extent 
of the relationship between model raft domains and 
cholesterol-dependent domains in biological mem¬ 
branes. There is a general agreement that the domains 
found in biological membranes are smaller in size and 
have a shorter lifetime. The most recent evidence 
(Brameshuber et al. 2011) indicates that there are 
mobile long-lived cholesterol-dependent domains in 
the plasma membrane of living CHO and Jurkat 
T cells in the resting state with a lifetime longer 
than several seconds, the upper limit of detection by 
this method (► Single Fluorophore Photobleaching). 
The size of these domains is estimated to be ^45 nm. 
Raft domains are associated with the actin 
cytoskeleton that is required to maintain the structural 
and functional (► Functional Roles of Lipids in 
Membranes) properties of rafts (Chichili and Rodgers 
2009). 

Proteins and Cholesterol-Rich Domains 

One of the major differences between model phospho¬ 
lipid membranes and biological membranes is the 
presence of a large fraction of proteins in the biological 
membrane. It Is known that certain proteins spontane¬ 
ously partition into raft domains and these proteins can 
stabilize rafts. However, not only proteins that 
partition into cholesterol-rich domains will favor the 
formation of these domains but also proteins that pref¬ 
erentially bind to phospholipids rather than cholesterol 
will also favor the clustering of cholesterol but in other 
regions of the membrane. This is a thermodynamic 
requirement, as long as the system is in equilibrium. 


There is particular interest in understanding which 
proteins partition into raft domains (Poveda et al. 

2008) . This is because there is evidence that these 
domains play an import role in signal transduction 
(Zech et al. 2009). Some of the features resulting 
in the partitioning of proteins in cholesterol-rich 
domains include the presence of certain types of 
lipidation, a segment of the protein with the amino 
acid pattern called the CRAC domain or a cluster of 
transmembrane helices with a particular structure 
termed the sterol-sensing domain (Epand et al. 2010). 
The CRAC domain is a protein segment with the 
sequence - LA^-Xi_ 5 -Y- Xi_ 5 -R/K-. The first residue 
can be either L or V and the last residue, either of 
the two basic amino acids, R or K. There are also two 
variable segments, Xi_ 5 , each of which can have any 
length between 1 and 5 residues, as well as any kind of 
sequence (Li and Papadopoulos 1998). In contrast to 
the short CRAC domain, the sterol-sensing domain is 
comprised of a cluster of five transmembrane helices 
and is found in several proteins that are involved in 
cholesterol homeostasis. There also appears to be 
a pattern of particular amino acids within transmem¬ 
brane segments of G-protein-coupled receptors 
(Paila et al. 2009) that results in the interaction of 
these proteins with clusters of cholesterol molecules 
(Cherezov et al. 2007), apparently modulating the 
function (► Functional Roles of Lipids in Membranes) 
of G-protein-coupled receptors. 

Caveolae 

Although there Is some uncertainty in the exact size 
and lifetime of biological rafts, there is much more 
certainty about the raft-like domains, the caveolae, 
that are abundant in certain types of cells (Parat 

2009) . Both caveolae and rafts are believed to be in 
the Lq phase and have similar lipid compositions. 
However, unlike rafts, caveolae are morphologically 
distinct 50-80 nm diameter, flask-shaped invagina¬ 
tions in the plasma membrane. The protein caveolin 
is highly enriched in this domain and plays an 
important function in maintaining the integrity of this 
domain (Parton and Simons 2007). Caveolae are 
believed to function in signal transduction but are 
also involved in a separate pathway for endocytosis, 
including the uptake of certain pathogens (Harris et al. 
2002; Parton and Simons 2007; Razani et al. 2002). 
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Another group of proteins, the cavins, are involved in 
regulation of caveolae hy promoting membrane 
remodeling and trafficking (Chidlow and Sessa 
2010). Caveolae have been implicated in chronic 
inflammatory conditions and other pathologies 
including atherosclerosis, inflammatory bowel disease, 
muscular dystrophy, and generalized dyslipidemia. 

Bacterial Domains 

There are several examples indicating that bacterial 
membranes have lipid domains (Matsumoto et al. 
2006). There is increasing interest in these domains 
because they can also be formed with the addition 
of certain antimicrobial agents. This phenomenon is 
correlated with the toxicity of these agents toward 
different species of bacteria (Epand and Epand 2010). 

There are bacterial domains that develop at the time 
of bacterial cell division (Mazor et al. 2008); however, 
there are also domains detected in anucleated bacteria 
that are apparently independent of cell division 
(Muchova et al. 2010). A feature of bacterial 
membranes that has allowed more facile imaging of 
lipid domains in these biological membranes is that 
most bacterial strains have a significant amount 
of cardiolipin in their membranes and clusters of 
cardiolipin will preferentially bind to 10-N-nonyl 
acridine orange (Mileykovskaya et al. 2001). 

Membrane Domains in Mitochondria 

Mitochondria, like Gram-negative bacteria, are 
enclosed by a double membrane. Eurthermore, the 
inner membrane of mitochondria, like most bacterial 
membranes, is the only membrane of eukaryotic 
cells that contains a high concentration of cardiolipin. 
The inner mitochondrial membrane also has 
a particular folded morphology that results in the 
formation of the mitochondrial cristae. An important 
role of mitochondria is the production of ATP by 
oxidative phosphorylation (► Mitochondrial Electron 
Transport). This ATP is produced on the inner mono- 
layer of the inner membrane of mitochondria, but the 
utilization of ATP occurs principally in the cytosol. 
There thus has to be a system for transporting this 
energy across two membranes. In many cells, this is 
accomplished indirectly through the phosphocreatine 


circuit (Wallimann et al. 2007). ATP is transported 
across the inner mitochondrial membrane by the 
ADP/ATP carrier protein that exchanges ATP in the 
mitochondrial matrix for ADP in the mitochondrial 
interstitial space. Most of the ADP formed in the 
interstitial space is a consequence of the transfer of 
the y phosphate of ATP to creatine, catalyzed by the 
enzyme mitochondrial creatine kinase, producing ADP 
and phosphocreatine. The ADP is shuttled into the 
mitochondrial matrix where it serves as a substrate 
for oxidative phosphorylation, while the phosphocrea¬ 
tine is exported across the mitochondrial outer 
membrane to be used as an energy source for the 
production of ATP in the cytosol, catalyzed by other 
isoforms of creatine kinase. This system requires 
a high degree of organization and interactions between 
lipids and proteins. It is also likely that in some cases 
these protein-lipid interactions lead to the formation of 
lipid domains. There is direct evidence that oligomers 
of the ubiquitous isoform of mitochondrial creatine 
kinase clusters cardiolipin (Epand et al. 2007). 

Mitochondria also play an important role in a major 
pathway of apoptosis as a result of the movement 
of cardiolipin from the mitochondrial inner membrane 
to the outer membrane where it participates in the 
activation of the proapoptotic protein Bid. It is 
suggested that in the outer membrane, cardiolipin 
forms a membrane domain where apoptotic proteins 
can bind and be activated (Sorice et al. 2009). 

Summary 

Lipid domains are found in both model and biological 
membrane. There are at least two driving forces for the 
formation of these domains. One is the preferential 
interaction of one component for another. The other 
is more specifically based on electrostatic interactions 
between a polycationic molecule and anionic lipids. 

In many cases, cholesterol is an important compo¬ 
nent in the formation of domains by the mechanism of 
preferential interactions. This is likely the case because 
the chemical structure and physical properties of cho¬ 
lesterol are very different from those of phospholipids 
and, therefore, there is frequently a preferential 
interaction of other lipid components or of peptides 
or proteins with cholesterol or with phospholipids. The 
phase behavior of lipid mixtures containing cholesterol 
that segregate into domains has been well studied and 
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is reasonably well understood. There is much evidence 
that domains with some relationship to those of 
cholesterol-rich domains of model membranes are 
also present in biological membranes. However, the 
biological membranes are more complex with many 
more molecular components as well as interactions 
with an underlying cytoskeleton that has also 
been shown to affect domain formation. Nevertheless, 
it is clear that there are domains in biological 
membranes, but a complete understanding of their 
composition, physical properties, and heterogeneity is 
still evolving. In biological membranes, raft domains 
play an important role in signal transduction by 
sequestering different proteins that can interact with 
one another or with the lipid components of these rafts. 
Despite the lack of complete knowledge about the rafts 
of biological membranes, there is a related domain, the 
caveolae that appears to have similar lipid composition 
to those of rafts. However, caveolae are larger 
structures with a long lifetime. Their morphology, 
size, and chemical composition are well established. 
Caveolae also play important biological roles both 
in signal transduction as well as in the movement of 
substances into and out of cells. 

The other mechanism for domain formation is 
the clustering of anionic lipids by polycationic 
molecules. This mechanism is particularly effective 
for the sequestering of the lipid phosphatidylinositol- 
(4,5)-bisphosphate that has several negative charges 
(McLaughlin and Murray 2005). Anionic lipid 
clustering also plays a role in the action of certain 
antimicrobial agents and in the formation of 
cardiolipin-rich domains in mitochondria. 


Cross-References 

► Lipid Lateral Diffusion 

► Thermodynamics of Lipid Interactions 
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Synonyms 

Transbilayer movement; Lipid bilayer 

Definitions 

Biogenic membranes are “self-synthesizing” mem¬ 
branes, capable of synthesizing and integrating their 
lipid and protein components. Examples are the endoplas¬ 
mic reticulum and the bacterial cytoplasmic membrane. 

Flippases are membrane proteins that facilitate vec¬ 
torial, ATP-dependent transport of lipids across mem¬ 
branes, or the ATP-independent bidirectional 
movement of lipids. ATP-dependent flippases are 
either P-type ATPases or ATP-binding cassette 
(ABC) transporters. The molecular identity of most 
ATP-independent flippases (also called scramblases) 
is not known. 


Flipping describes the reorientation of a lipid from 
one leaflet of a membrane bilayer to the other. It is used 
interchangeably with “flip-flop,” “transbilayer translo¬ 
cation,” or “transbilayer movement.” Flip refers to 
transport of a lipid from the exoplasmic or lumenal 
membrane leaflet to the cytoplasmic leaflet, that is, 
out ^ in, whereas flop designates in ^ out transport. 

Membrane proteins are integral to the membrane 
and contain one or more transmembrane domains com¬ 
prised of sequences rich in hydrophobic amino acids. 

Polar lipids are amphipathic molecules, with 
a hydrophobic unit derived from fatty acids, sphingo- 
sine, or isoprenoids (such as dolichol and undecaprenol), 
and a zwitterionic or charged hydrophilic headgroup. 
Examples are phosphatidylcholine, sphingomyelin, 
dolichyl phosphate mannose, and bacterial Lipid II. 

Introduction 

Polar lipids diffuse rapidly in the plane of the mem¬ 
brane, but their transverse diffusion - movement from 
one side of the bilayer to the other, or flip-flop - is 
intrinsically very slow (Komberg and McConnell 
1971). This is because the hydrophobic interior of the 
membrane presents an energy barrier to the transit of 
the polar lipid headgroup. For common phospholipids 
such as phosphatidylcholine (PC), the barrier is esti¬ 
mated at ^20-50 kcal/mol. However, rapid flip-flop of 
polar lipids is essential for numerous physiological 
processes, and lipids move rapidly across many types 
of biological membranes as a result of the action of 
lipid transport proteins termed flippases. It is important 
to note that for lipids without a substantial polar 
headgroup, spontaneous flipping is rapid and does not 
require flippases. Examples of such lipids include cho¬ 
lesterol, ceramide, and diacylglycerol whose 
headgroup is a simple hydroxyl moiety. 

Lipid flippases are membrane proteins that either 
use the energy of ATP hydrolysis to move lipids across 
the membrane against their concentration gradient, or 
facilitate bidirectional movement of lipids without 
metabolic energy input (Fig. 1). The former are some¬ 
times referred to as flippases or floppases depending on 
whether they move lipids toward the cytoplasmic or 
exoplasmic (or lumenal) leaflet of cellular membranes, 
respectively, whereas the latter are also termed 
“scramblases” because they dissipate transbilayer 
lipid asymmetry. 
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Lipid Flip-Flop, Fig. 1 Lipid 
flip-flop and the role of 
flippases 


ATP-independent 





flip-flop scramblase 


ATP-dependent 



extracellular 
(lumenal) side 


cytoplasmic side 


flippase floppase 


Lipid Transport Catalyzed by P-Type 
ATPases and ABC Transporters 

Flippases in the eukaryotic plasma membrane (PM) are 
generally membrane proteins that belong to the family of 
type IV P-type ATPases (P4-ATPases) or the family of 
ABC (ATP-binding cassette) transporters. These pro¬ 
teins use the energy of ATP hydrolysis to catalyze 
transbilayer transport of a variety of polar lipids. The 
P4-ATPases transport glycerophospholipids from the 
exoplasmic to the cytoplasmic leaflet of the PM. In 
many instances, transport is specific for certain phospho¬ 
lipids. Indeed, the first of the PM lipid translocation 
activities to be described was shown to be highly specihc 
for aminophospholipids, especially phosphatidylserine 
(Seigneuret and Devaux 1984). ABC transporters gen¬ 
erally catalyze transport from the cytoplasmic to the 
exoplasmic leaflet. The principle function of PM lipid 
transporters is to maintain the biologically significant 
transbilayer lipid asymmetry of the PM, hrst reported 
in the early 1970s by Bretscher, in which phosphatidyl¬ 
choline, sphingomyelin, and glycolipids are located in 
the exoplasmic leaflet whereas phosphatidylserine, 
phosphatidylethanolamine, and phosphoinositides are 
enriched on the cytoplasmic side. Loss of lipid asymme¬ 
try at the PM is a hallmark of programmed cell death 
where exposure of phosphatidylserine at the cell surface 
signals engulfment of the cell by macrophages. 

P4-ATPases commonly associate with a member of 
the Cdc50 family of membrane proteins. Complex for¬ 
mation must occur before the P4-ATPase can exit the 
endoplasmic reticulum (ER). P4-ATPase complexes 
have been isolated from yeast and retina and shown to 
have flippase activity in reconstituted vesicles. However, 
it is unclear whether the Cdc50 subunit is essential for 
flippase activity or whether it provides a regulatory func¬ 
tion. The specificity of P4-ATPases varies. Originally 
thought to be specihc for aminophospholipids in general 
and phosphatidylserine in particular, members of this 


transporter family can also transport other phospho¬ 
lipids. The activity of P4-ATPases is regulated. In Sac- 
charomyces cerevisiae, one regulatory mechanism is 
phosphorylation triggered by changes in membrane 
composition. 

The ability of ABC transporters to transport lipids 
was originally highlighted in studies of P-glycoprotein 
(also called MDRl). Studies in transgenic animals as 
well as in cultured cells showed that this protein could 
transport phospholipids from the cytoplasmic leahet to 
the exoplasmic leahet of the PM. ABC transporters 
also function in less obvious ways to transport lipids. 
For example, sterol movement across membranes is 
controlled in several physiologically important con¬ 
texts by ABC transporters (such as ABCAl) even 
though sterols can hip-hop across membranes at 
a rapid spontaneous rate. The role of ABC transporters 
in these cases may be to increase the chemical activity 
of the sterol molecule, thus facilitating its hand-off to 
an acceptor particle or protein. 

The PM of eukaryotes has an ATP-independent 
hippase activity that depends on an elevation in cal¬ 
cium ion concentration. This hippase, historically 
referred to as scramblase, is typically activated under 
conditions where the ATP-dependent hippase activi¬ 
ties of the PM are inactivated. Thus, in apoptotic cells, 
the P4-ATPase and ABC transporter activities of the 
PM are silenced, scramblase is activated, and 
phosphatidylserine is exposed at the cell surface. The 
protein responsible for calcium-induced phospholipid 
scrambling has not been convincingly identihed 
although candidates (PLSCRl, TMEM16F) (Suzuki 
et al. 2010) have been proposed. 

ATP-independent Lipid Flippases 

Glycerophospholipids are synthesized on the cytoplas¬ 
mic face of biogenic membranes such as the ER in 
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eukaryotes and cytoplasmic or inner membrane (CM) in 
bacteria. For normal expansion of the membrane during 
cell growth, half of the newly synthesized lipids must be 
flipped to the opposite leaflet of the bilayer. The rate of 
phospholipid synthesis and flipping must match the 
demand for new membrane as the cell grows. In 
a bacterial cell that doubles every ^30 min, at least 
^5,000 phospholipids must flip across the CM every 
second. For Gram-negative cells where phospholipids 
are also needed for outer membrane biogenesis, the 
demand for lipid synthesis and flipping is greater. Bio¬ 
chemical reconstitution studies indicate that both the ER 
and CM have specific transport proteins that facilitate 
ATP-independent bidirectional movement of phospho¬ 
lipids across the membrane. The flippases are 
nonselective with respect to glycerophospholipid stmc- 
ture and are capable of flipping different stereoisomers. 
Although their activity has been studied for over 
30 years, the molecular identity of these flippases is not 
known. It was recently shown that certain G-protein 
coupled receptors are ATP-independent phospholipid 
flippases (Menon et al. 2011). 

The ER also supports the biosynthesis of complex 
glycolipids required for A-glycosylation and glycosyl- 
phosphatidylinositol (GPI)-anchoring of proteins. The 
assembly of these glycolipids occurs via a multistep 
pathway that starts in the cytoplasmic leaflet of the ER 
and continues in the lumenal leaflet, necessitating flip¬ 
ping of lipid intermediates. For synthesis of the 
dolichol-linked oligosaccharide precursor of protein 
A-glycans, three different lipids must be flipped across 
the ER membrane. Biochemical reconstitution studies 
indicate that distinct ATP-independent flippases are 
required for these flipping events. Although the molec¬ 
ular identity of these flippases is not known, activity 
assays indicate that they are highly specific and capa¬ 
ble of distinguishing between structural isomers. 

Related flipping events occur in the CM. Thus, 
Lipid II, a glycopeptidolipid precursor of the bacterial 
cell wall, must move from the cytoplasmic side of the 
membrane to the periplasmic side to contribute build¬ 
ing blocks for cell wall assembly. The membrane pro¬ 
teins, EtsW (Mohammadi et al. 2011) and MviN, have 
both been proposed as being responsible for flipping 
Lipid II flippase. Lipopolysaccharide (EPS), a major 
component of the cell surface of Gram-negative bac¬ 
teria, is composed of lipid A, a core oligosaccharide 
and in some cases, a repeating oligosaccharide termed 
0-antigen. EPS biosynthesis is initiated on the 


cytoplasmic side of the CM and requires flipping of 
both Lipid A (by the ABC transporter MsbA) and lipid- 
linked O-antigen oligosaccharide units (by the Wzx 
family of proteins) before EPS can be assembled and 
delivered to the cell surface. 

A number of pathogenic bacteria repel cationic 
antimicrobial peptides by decreasing their cell surface 
negative charge. They do this by modifying phosphati- 
dylglycerol with lysine on the cytoplasmic face of the 
CM, then moving the positively charged, lysinylated 
phospholipid across the membrane. Lysinylated 
phosphatidylglycerol is synthesized and flipped across 
the membrane by MprE, a CM membrane protein. 


Cross-References 

► Bacterial Lipopolysaccharide, OPS, and Lipid A 

► Chemical Diversity of Lipids 

► Functional Roles of Lipids in Membranes 

► Glycerolipids; Chemistry 

► Glycoproteins 

► Glycosphingolipids 

► Glycosylphosphatidylinositol 

► Lipid Bilayer Asymmetry 

► Lipid Trafficking in Cells 

► Lipids: Isolation and Purification 

► Membrane Proteins; Structure and Organization 

► Sum Frequency Generation Vibrational 
Spectroscopy 
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Synonyms 

Bacterial lipids from Aclioleplasma laidlawii and 
Escherichia coli\ Cholesterol; Lateral phase 
separation; Lipid Bilayer; Lipid membrane; Lipid 
packing; Lipid phase equilibria; Lipid raft; Phospho¬ 
lipids; Sphingomyelin 

Definition 

Based on statistical thermodynamics, Einstein showed 
that Brownian motion, i.e., the stochastic motion of 
particles due to thermal collisions with the solvent 
molecules, is well described by an equation where 
the mean square of the translation distance of the 
particle, < >, is proportional to Dt, where D is the 

diffusion coefficient and t is the time. Lipid lateral 
dijfusion — the translational motion of lipids along 
the lipid hilayer or membrane characterized by a 
diffusion constant, and in two dimensions, 

the diffusion law assumes the form < > = 4Dit. 

Introduction 

The Singer and Nicholson model of the “fluid mem- 
hrane” 1972 aroused a great interest in the dynamics 
and in particular the two-dimensional diffusion of 
molecular components in biological membranes. 
Both lipids and proteins undergo random walks within 


the plane of the lipid bilayer. The theoretical and 
experimental aspects of lateral diffusion of lipids and 
proteins in lipid membranes have been reviewed by 
several authors, two of which can be mentioned here, 
namely, by Almeida and Vaz (1995) and by Saxton and 
Jacobson (1997). The former of these reviews dis¬ 
cusses lipid lateral diffusion in both homogeneous 
and heterogeneous bilayers composed of both one 
and several different lipids. It appears that the most 
interesting turn that the field has taken in recent years, 
both theoretically and experimentally, has been the 
study of lipid lateral diffusion in heterogeneous mem¬ 
branes - in multicomponent lipid bilayers, where two 
or more laterally separated phases coexist over a large 
range of temperatures. Then, a new fundamental prop¬ 
erty may appear, namely, the lateral connectivity of 
each phase, the so-called percolation. This property of 
membranes is discussed in Saxton’s review (“diffusion 
in an archipelago”), and the reader is directed to that 
review to learn more about this matter. To study the 
organization of biological membranes remains 
a challenge, because it is inherently difficult to char¬ 
acterize fluctuating lipid assemblies in the membranes 
of living cells. Lipid bilayer membranes and isolated 
plasma membranes, on the other hand, are successfully 
studied. Moreover, large-scale phase separation can 
occur in these systems. In particular, ternary mixtures 
of saturated and unsaturated lipids plus cholesterol can 
segregate into two coexisting fluid lipid domains, 
liquid-ordered (/q), and liquid-disordered (/j) phases. 
Such domains have been widely studied because they 
may be closely linked to lipid nanodomains in cell 
membranes. Here, we briefly discuss the lateral diffu¬ 
sion of the lipid components in bilayers containing one 
or several different lipids and in particular systems 
with cholesterol, leading to the formation of domains, 
often referred to as “lipid rafts” (Oradd and Lindblom 
2007). In recent years, a great number of publications 
on this subject has appeared, using various methods, as 
different fluorescence methods like fluorescence 
recovery after photobleaching (FRAP) or fluorescence 
correlation spectroscopy (PCS) and single particle 
tracking (SPT). These methods are treated at other 
places in this encyclopedia and will not be discussed 
any further. Here, results of lipid diffusion in model 
membranes measured by a powerful and recently 
improved technique of NMR spectroscopy will be 
discussed. This method is convenient, straightforward, 
and noninvasive; no probe molecule is necessary; and 
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the lipid lateral diffusion coefficients are directly 
obtained (for reviews see Lindblom and Oradd 1994, 
2009). 

NMR Measurements of Lipid Lateral 
Diffusion Coefficients 

The translational diffusion coefficient is obtained by 
monitoring the NMR signal in the time domain, 
a so-called spin echo, for a sample located in 
a magnetic field gradient. This so-called pulsed field 
gradient (pfg) NMR spectroscopy provides one of the 
most attractive techniques for studies of molecular 
transport, and lipid lateral diffusion coefficients (D^) 
are most conveniently measured on macroscopically 
aligned bilayers (Lindblom and Oradd 1994). Oriented 
samples with bilayer normals at an angle of 54.7° (the 
so-called magic angle) with respect to the external 
magnetic field are generally used in order to get well- 
resolved spectra. Nonoriented lipid crystalline samples 
usually show broad, featureless spectra in NMR spec¬ 
troscopy and are difficult to use for diffusion measure¬ 
ments. Generally, the *H NMR spectra for aligned 
samples show linewidths that are reduced by several 
kilohertz. 

For the diffusion measurements, the echo amplitude 
of each spectral component. A,, is given by (see 
Lindblom and Oradd 1994 for details): 

A; = Ao/ exp ^ ^ (1) 

where Aq, is a factor proportional to the content of the 
studied nucleus (usually proton) in the system, y is the 
gyromagnetic ratio, A is the time interval between two 
identical gradient pulses, (A—<5/3) is the diffusion time, 
5 and g are the duration and amplitude of the pulsed 
field gradients, respectively, and £), is the self¬ 
diffusion coefficient of spectral component i. The 
observed spectrum will be a superposition of all spec¬ 
tral components present in the sample. Varying one or 
more of the variables <5, A or g results in a series of 
attenuated spectra, from which D, can be extracted. 
A global analysis of the entire data set generally 
gives one fast component corresponding to the 
water and one or two lipid components, together 
with the corresponding spectral shapes (Oradd and 
Lindblom 2007). 


Diffusion Models 

As long as the lipid bilayer is laterally homogeneous, 
diffusion is described by one diffusion coefficient and 
< > = ADit, where < > is the mean-squared 

displacement and t is the diffusion time. The situation 
is more complicated when lateral heterogeneities are 
present, like if the lipid matrix contains diffusion obsta¬ 
cles. Here, we will only discuss fluid membranes, where 
domains or phase separation may occur on a length 
scale observed by the pfg method, i.e., the size of 
domains is of the order of micrometer, the time during 
which diffusion is observed is of the order of millisec¬ 
ond. Then, two or more diffusion coefficients may be 
detected, depending on the number of phases present in 
the lipid membrane. In principle, hindered diffusion, 
like in a smaller domain or cavity, may be observed as 
schematically shown in Fig. 1 , where a time-dependent 
diffusion coefficient is observed upon a change in the 
length of the diffusion time A—513. 

Although a good number of quantitative studies of 
lateral diffusion in several lipid systems have been 
published, the mechanism by which lipids migrate 
along the bilayer has remained unknown. There are in 
principle two different models proposed on this matter, 
namely, a free volume theory, discussed by Almeida 
and Vaz (1995), and a recent model from computer 
simulation studies where it is proposed that diffusion 
of lipids in fluid bilayers takes place through concerted 
lipid motions and tens or possibly hundreds of lipids 
move in concert as loosely defined clusters (for 
a recent review, see Apajalahti et al. 2010). This is in 
contrast to the free volume or free area model, where it 
is assumed that the lipids diffuse via thermally acti¬ 
vated, lateral displacements or jumps; these jumps 
occur as often as there is a large enough free volume 
pocket adjacent to the lipid. Further investigations, both 
experimentally and theoretically, are needed to finally 
settle this point, although the collective motion of lipids 
sounds like a very attractive mechanism judging from 
the many theoretical studies recently published. 

Lipid Packing: Influences of Lipid Head- 
Group Area on 

In general, a closer packing of the lipids in a bilayer 
results in a decrease in (Lindblom and Oradd 2009). 
Table 1 shows Djj, for a number of lipid bilayer 
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Lipid Lateral Diffusion, 

Fig. 1 A schematic drawing 
showing the effect of a change 
in the time during which 
diffusion occurs (A—S/3) on 
a diffusing molecule inside or 
outside a cavity 
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Lipid Lateral Diffusion, Table 1 Head-group areas and 
corresponding for some lipids in bilayers 


Lipid 

Area (nm^) 

DiCpm^s ') 

t(°C) 

DOPC 

0.72 

8.25 

25 

POPC 

0.68 

7.79 

25 

DPPC 

0.64 

17.8 

45 

DMPC 

0.61 

5.82 

25 

SOPC 

0.63 

6.6 

25 

SLPC 

0.66 

8.2 

25 

SAPC 

0.68 

10.0 

25 

SDPC 

0.70 

11.2 

25 

eSM 

0.53 

4.5 

50 

DOPG 

0.80 

15 

30 

A. laidlawii lipid extract 

ca. 0.6 

2.7 

30 

CHOL/(CHOL -t DMPC) 

0.0 

0.61 

9.0 

30 

0.1 

0.53 

4.9 

30 

0.2 

0.48 

2.6 

30 

0.3 

0.44 

2.0 

30 


DOPC dioleoylphosphatidylcholine, POPC palmitoyloleoyl- 
phosphatidylcholine, DPPC dipalmitoylphosphatidylcholine, 
DMPC dimyristoylphosphatidylcholine, SOPC stearoyloleoyl- 
phosphatidylcholine, SLPC stearoyllinoleoylphosphati- 
dylcholine, SAPC stearoylarachidonoylphosphatidylcholine, 
SDPC stearoyldocosahexaenoylphosphatidylcholine, eSM egg 
sphingomyelin, DOPG dioleoylphosphatidylglycerol, CHOL 
cholesterol 


systems and, in particular, an illustration of the effect 
of increased unsaturation in the sn-2 fatty acyl chain of 
phosphatidylcholines on the lipid lateral diffusion. Dj^ 
of phospholipids with an increasing number of double 
bonds from one (SOPC), two (SLPC), and four (SAPC) 
to six (SDPC), is shown. It can be seen that the lateral 
diffusion increases with increasing number of double 
bonds, as a consequence of the increased head-group 
area caused by the unsaturation. It can also be inferred 
from this table that decreases in the order DOPC > 
POPC > DPPC > DMPC again in line with the decrease 
in the head-group area, mainly caused by the degree of 
saturation/unsaturation of the acyl chains. Moreover, egg 
sphingomyelin (eSM) has a lower Dj^ than DPPC, and 
dioleoylphosphatidylglycerol (DOPG) with its repulsive 
charged head group exhibits a larger diffusion coefficient 
than DOPC, once more in agreement with what can be 
expected from the head-group area. 

Finally, the effect on Dj^ upon a change in lipid 
packing is shown by the classical systems of 
a mixture of phospholipids and cholesterol and the 
“condensing effect of cholesterol.” In Table 1, this 
is illustrated by an increasing content of CHOL in 
DMPC bilayers. The flat, rigid ring system of choles¬ 
terol is effective in tightening the packing of the 
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DOPC/DPPC/CHOL, 35:35:30 mol% 
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Lipid Lateral Diffusion, Fig. 2 D[^ in oriented bilayers of 
DOPC/DPPC-d 75 /CHOL-f 7 . By using differently labeled lipids, 
it is possible to study the lateral diffusion of each lipid species 
individually. As the temperature goes below 30°C, the two- 


phase area is entered and two separate diffusion coefficients 
are observed for DOPC and CHOL. DPPC, on the other hand, 
seems to partition only into the /q phase (Adapted from Oradd 
et al. (2005)) 


hydrocarbon chains, resulting in a reduced lipid lateral 
diffusion. 

The lateral diffusion has also been investigated for 
bacterial membranes, and it was found that for two 
different such species, the much more tightly packed 
lipid bilayers from the Acholeplasma laidlawii mem¬ 
brane has a much slower translational diffusion than 
membrane lipids from Escherichia coli. Thus, there is 
a striking difference in for membranes from 
A. laidlawii, which consists of 70-80 mol% 
glucolipids, and E. coli that consists of only phospho¬ 
lipids; the diffusional motion of the latter kind of lipids 
is much faster. Another interesting observation is that 
Dj^ for A. laidlawii membranes is independent of the 
growth temperature of the bacteria, because the lipid 
composition is regulated and substantially varied with 
temperature, so that the packing of the lipids in the 
membrane is the same at the different growth temper¬ 
atures (Lindblom et al. 2002). 

Lateral Phase Separation and Domain 
Formation in Bilayers 

The influence of domains on the lipid lateral diffusion 
enables pfg NMR to determine lateral phase separation 
in lipid bilayers. Lipids will diffuse either into or out 


from the separated phases in the fluid bilayer by some 
exchange mechanism or, provided that the border 
between different domains presents an obstacle to 
lipid diffusion, the lipids will encounter restrictions 
in their translational motion. 

Two illustrative examples will be given here: The 
ternary system DOPC/DPPC/CHOL, where large 
domains form, and the binary system eSM/CHOL, 
where smaller domains with lipid exchange is present. 

In the ternary system, the three components were 
isotopically labeled; DOPC contained *H, DPPC 
contained labeled acyl chains, and CHOL has 
at the 25-position on the hydrocarbon chain. By 
observing *H, ^H, or NMR on the labeled lipids, 
it is thus possible to separately study each individual 
lipid species as illustrated in Fig. 2. Lateral phase 
separation is observed for all the three different NMR 
active nuclei, and it is also found that the diffusion of 
the lipids is governed by the properties of the phase, in 
which they reside, rather than the properties of the 
molecules themselves. Thus, all lipid species, includ¬ 
ing CHOL, have the same diffusion coefficient as long 
as they are in the same phase! From the diffusion data, 
the partitioning of the lipids into the two phases could 
also be estimated. Note that at the phase separation 
temperature, the diffusion coefficient increases slightly 
in the disordered phase. 
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Lipid Lateral Diffusion, 

Fig. 3 Lateral eSM diffusion 
coefficients at different CHOL 
concentrations for the eSM/ 
CHOL system with 35 wt% 
^H20 and at 313 K {circle), 
318 K {triangle top down), 

323 K {square), 328 K 
{diamond), and 333 K 
{triangle top up). The thick 
solid lines are estimations of 
the extension of the two-phase 
area (with and l^i phases). 
The solid and dotted lines are 
linear estimations of in the 
one-phase and two-phase 
areas, respectively (Adapted 
from Filippov et al. (2003)) 



The temperature step is 5 K from 313 K (bottom) to 333 K (top) 


The system of eSM/CHOL exhibits domain forma¬ 
tion in the range of 6-22 mol% CHOL, and this is seen 
as a sudden break in the curve versus CHOL 
(Fig. 3). Due to fast exchange between the separated 
phases, /„ and 1^, the observed diffusion coefficient, 
will be a weighted average of the diffusion 
coefficients in the separate phases: 

Dz,(2(/.) = poDpiQ + (1 - Po)DL{ld) (2) 

in which po is the relative amount of the /q phase. As po 
increases, the diffusion will decrease as Di^lJ 
becomes dominant (note: lower diffusion because of 
closer packing in the /q phase). A fast exchange is an 
indication that the domains are smaller than the mean 
diffusion length (ca. 1 pm) calculated from r^ = TD^t. 
In the one-phase regions on each side of the two-phase 
area, the diffusion coefficients are almost constant. 
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► Lipid Domains 

► Lipid Organization, Aggregation, and 
Self-assembly 

► NMR 

► NMR Methods for Kinetic Analysis 


► Pulsed Field Gradient NMR 

► Supported Lipid Bilayers 

References 

Almeida PFF, Vaz WLC. Lateral diffusion in membranes. In: 
Lipowsky R, Sackmann E, editors. Structure and dynamics of 
membranes. From cells to vesicles. Handbook of biological 
physics, vol. 1. Amsterdam: Elsevier; 1995. p. 305-57. 

Apajalahti T, Niemela P, Govindan PN, Miettinen MS, 
Salonen E, Mairink SJ, Vattulainen I. Concerted diffusion 
of lipids in raft-like membranes. Faraday Discuss. 
2010;144:411-30. 

Filippov A, Oradd G, Lindblom G. The effect of cholesterol on 
the lateral diffusion of phospholipids in oriented bilayers. 
Biophys J. 2003;84:3079-86. 

Lindblom G, Oradd G. NMR studies of translational diffusion in 
lyotropic liquid crystals and lipid membranes. Prog Nucl 
Magn Reson Spectrosc. 1994;26:483-516. 

Lindblom G, Oradd G. Lipid lateral diffusion and membrane 
heterogeneity. Biochim Biophys Acta. 2009;1788:234M4. 

Lindhlom G, Oradd G, Rilfors L, Morein S. Regulation of lipid 
composition in Acholeplasma laidlawii and Escherichia coli 
membranes. NMR studies of lipid lateral diffusion at differ¬ 
ent growth temperatures. Biochemistry. 2002;41:11512-5. 

Oradd G, Lindblom G. Lateral diffusion coefficients of raft lipids 
from pulsed field gradient NMR. In: McIntosh TJ, editor. 
Methods in molecular biology, Lipid rafts, vol. 398. Totowa: 
Humana; 2007. p. 127M-L 

Oradd G, Westerman PW, Lindblom G. Lateral diffusion coef¬ 
ficients of separate lipid species in a ternary raft-forming 
bilayer; a pfg-NMR multinuclear study. Biophys J. 
2005;89:315-20. 

Saxton M, Jacobson K. Single particle tracking: applications to 
membrane dynamics. Annu Rev Biophys Biomol Struct. 
1997;26:373-99. 













Lipid Mesophases for Crystallizing Membrane Proteins 


1269 


L 


Lipid Membrane 

► Lipid Lateral Diffusion 


Lipid Mesophases for Crystallizing 
Membrane Proteins 

Charlotte E. Conn 

CSIRO Materials Science and Engineering, Clayton 
South, VIC, Australia 

Synonyms 

In cuho crystallization; In meso crystallization 

Definitions 

In meso aystallization uses nanostructured lipidic 
materials, in particular, bicontinuous cubic or sponge 
phases, as matrices for the growth of protein crystals, 
particularly membrane proteins. 

Lipid mesophases self-assemble when amphiphilic 
lipid molecules are mixed with an aqueous solvent, 
typically water. 

Bicontinuous cubic phases consist of a continuous 
lipid bilayer, which subdivides space into two 
interpenetrating, but not connected, water networks. 
The bilayer midplane may be closely described as 
a surface of constant zero mean curvature, known as 
a triply periodic minimal surface. 

Introduction 

Integral membrane proteins (MPs) are found embed¬ 
ded within the phospholipid bilayer of the cell mem¬ 
brane. More than 30% of the genome codes for MPs 
which are implicated in diseases ranging from cardio¬ 
vascular disease to neurological diseases such as 
schizophrenia and Parkinson’s disease. Determining 
the structures of MPs is important to determine their 
mode of action and for the rational design of drugs. 
However, the structures of many MPs remain elusive. 
This is reflected in the lack of specificity of current 
drugs; many commercially available drugs target 



Lipid Mesophases for Crystallizing Membrane Proteins, 
Fig. 1 The dopamine D2 long (D2L) receptor, implicated in 
Parkinson’s disease, is depicted within a cubic mesophase envi¬ 
ronment (Conn et al. (2010b) - Reproduced by permission of the 
Royal Society of Chemistry) 

several homologous receptors within each receptor 
subtype. The main obstacle in structural determination 
of MPs is in high-quality crystal growth for X-ray 
diffraction (XRD) analysis (DeLucas 2009). 

lust over 15 years ago, it was demonstrated that 3-D 
crystals of MP could grow within the matrix of 
a nanostructured lipidic cubic phase (Landau and 
Rosenbusch 1996). Bicontinuous cubic phases provide 
a unique environment for crystallization of MPs 
(Fig. 1) and fulfill two crucial roles. The hrst is that 
their fundamental bilayer structure mimics the pro¬ 
tein’s natural environment within the cell membrane, 
and secondly, the continuous nature of the bilayer and 
the water networks allows for lateral diffusion of the 
protein across the plane of the bilayer thus facilitating 
crystallization. MPs are more likely to retain their 
active conformation within a bilayer-based environ¬ 
ment, and crystals grown in lipidic mesophases have 
generally been robust, of high quality, and often 
superior to detergent grown crystals. Human 
G-protein-coupled receptors recently solved using 
in meso crystallization include the adenosine A 2 A 
receptor, the P 2 -adrenergic-receptor-Gs protein com¬ 
plex (Rasmussen et al. 2011), and the dopamine D3 
receptor. 
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Lipid Mesophases for Crystallizing Membrane Proteins, Fig. 2 A schematic representation of the (a) Qn*^, (b) Qn*", and (c) Qii° 
phases (Conn et al. (2010a) - Reproduced by permission of the Royal Society of chemistry) 


Lipid Mesophases for In Meso Crystallization 

Bicontinuous cubic phases, which are based around the 
fundamental structure of the lipid hilayer, are the pri¬ 
mary type of lipid mesophase used for in meso crystal¬ 
lization. The hilayer midplane is considered to he 
draped across a periodic minimal surface of zero 
mean curvature and subdivides space into two 
interpenetrating but not connected water networks 
(Hyde et al. 1997). There are three main types 
(Fig. 2): the diamond, Qn*^, phase (crystallographic 
space group Pn3m), the gyroid, Qu , phase (crystallo¬ 
graphic space group Ia3d), and the primitive, Qu*”, 
phase (crystallographic space group Im3m). The 
lipidic cubic phase is highly viscous and incompatible 
with standard crystallization robotics which has 
limited its use. A Plexus Crystal IMP robotic system, 
which dispenses nano volumes of viscous cubic phase 
directly onto a 96-well crystallization plate using a 
microsyringe pump, was developed to overcome 
some of these limitations (Caffrey and Cherezov 
2009). It offers reproducibility, a reduction in sample 
volume relative to manual setup and fast setup times 
(<10 min). However, the robot is expensive and 
specific to the technique of in meso crystallization. In 
addition, a microfluidic device, which mixes protein 
directly with preformed cubic phase, has recently been 
used for the growth of crystals of photosynthetic 
reaction center from Rhodohacter sphaewides and 
Blastochloris viridis (Li et al. 2010). 

There has been increasing interest in the use of 
lipidic sponge mesophases as a new generation of 
crystallization media for MPs (Cherezov et al. 2006). 


The lipidic sponge mesophase may he visualized as 
a melted cubic phase with a less curved lipid hilayer 
and two to three times larger aqueous pores. Such 
phases may he simply obtained by mixing the dry 
lipid with a buffer solution and a crystallization agent 
such as PEG, MPD, or jeffamine M600. Due to their 
high fluidity, sponge phases are compatible with con¬ 
ventional crystallization robots. In addition, the larger 
aqueous pore size makes such systems suitable for 
larger MPs containing large extra- and/or intracellular 
domains extending the range of proteins which may be 
investigated. 

Novel Lipids 

Virtually all crystallization trials to date have used the 
same lipid, monoolein (MO), or a very similar analog, 
despite recent research showing that physical parame¬ 
ters of the cubic phase can affect crystallization 
(Caffrey 2009). The MO molecule is susceptible to 
hydrolysis and transesteriflcation and MO is unsuitable 
for crystallization of more labile proteins as it does not 
form a cubic phase below room temperature. Due to 
the scarcity of suitable cubic phase matrices, in meso 
crystallization is currently incompatible with the 
majority of MPs and their functional complexes, due 
to their large size. Recently, some research has begun 
to investigate new cubic phases suitable for in meso 
crystallization. The Caffrey research group has 
focused on the synthesis and characterization of 
a homologous series of monoacylglycerols (MAGs) 
(Caffrey 2009). By variation of the hydrocarbon 
chain length, it is possible to exert some control over 
the bilayer thickness and water channel diameter. 
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although these parameters are relatively tightly 
coupled and cannot be varied independently of each 
other. This research has indicated that a reduction in 
bilayer thickness may promote crystal growth, and 
crystals of bacteriorhodopsin, cytochrome caa3 oxi¬ 
dase, and the cobalamin transporter BtuB were suc¬ 
cessfully grown within a shorter chain (Cl4) MAG 
(Caffrey 2009). However, the presence of the ester 
bond within MAGs increases their susceptibility to 
chemical degradation, which could potentially affect 
the cubic structure. 

Therefore, recent efforts have focused on the design 
and characterization of alternative lipids which form 
the cubic phase but are more chemically robust, retain 
the cubic phase down to lower temperatures, and may 
provide a more “natural” environment for the MP. The 
LCP (lipidic cubic phase) resources website run by the 
Cherezov Lab at the Scripps Institute currently lists 
only eight non-MAG molecules, many of which are 
sugar surfactants, as suitable matrices for in meso 
crystallization. Recent research by the Drummond 
research group has focused on endogenous lipids and 
their analogs, including those based on the biologically 
relevant monoethanolamide headgroup (Conn et al. 
2010a); many unsaturated monoethanolamides, 
including arachidonyl monoethanolamide (ananda- 
mide), are endogenous and have been shown in 
a number of studies to exhibit active biological and 
medicinal properties. 

Addition of Membrane Additives 

Most cubic phases consist of a binary lipid-water 
system. However, the native cell membrane consists 
of a range of lipids from three different classes: phos¬ 
pholipids, glycolipids, and cholesterols. The particular 
lipid composition is dependent on the cell type 
with hydrocarbon chain lengths typically between 
16 and 20, saturated or unsaturated, and usually con¬ 
tain an even number of C atoms. 

The cubic phase is fairly robust to addition of addi¬ 
tives which may provide an environment more similar 
to the native cell membrane environment of the partic¬ 
ular MP. Phospholipids (including phosphatidylcho¬ 
line (PC), phosphatidylethanolamine (PE), and 
phosphatidylserine (PS)), cardiolipin, 2-MO, and cho¬ 
lesterol are all tolerated by the MO cubic phase to 
varying degrees (Caffrey 2009). In particular, addition 
of cholesterol has been extremely successful in crystal 
growth, particularly for human GPCRs (Caffrey 2009). 


Lipid additives can affect both the local bilayer envi¬ 
ronment of the protein and the cubic nanostructure in 
a variety of ways. 

Membrane fluidity - The fluidity of the lipid bilayer 
may affect crystal growth by influencing the diffusion 
rate of proteins. Cholesterol decreases membrane flu¬ 
idity while unsaturated chain hydrocarbons increase 
membrane fluidity. 

Lateral pressure within the hilayer — The lipid 
bilayer exerts a lateral pressure on embedded MPs 
thought to be crucial for retention of the active confor¬ 
mational state. The lateral pressure depends strongly 
on the lipid composition within the bilayer. In general, 
more than 50% of a typical cell membrane will consist 
of type 0 lipids, which have a cylindrical molecular 
architecture, and typically form a flat bilayer structure. 
Increasing the composition of type 0 lipids may there¬ 
fore provide a more native environment for the incor¬ 
porated MP. 

Cubic nanostructure - The nanostructure of the 
lipid matrix strongly depends on the molecular archi¬ 
tecture of all lipids within the bilayer. Both cholesterol 
and type 0 phospholipids will act to swell the cubic 
nanostructure suitable for crystallization of larger MP 
complexes. Incorporated lipids may, however, pro¬ 
mote a transition to a noncubic architecture unsuitable 
for crystallization. 

Mechanism of Crystallogenesis 

The process of in meso crystallization is deceptively 
simple. The bicontinuous cubic phase spontaneously 
self-assembles upon addition of protein solution at an 
appropriate ratio. Crystallogenesis, or crystal growth, 
begins upon addition of a precipitant, known as 
a crystallization screen, normally a combination of 
polyethylene glycol (PEG), a salt, and buffer. It has 
been proposed that addition of the crystallization 
screen components triggers phase separation, with pro¬ 
tein nucleation occurring in the protein-rich separated 
phase. A local lamellar phase is proposed to act as 
a conduit-moving protein from the bulk cubic phase 
to the growing crystal face (Eig. 3) (Caffrey 2008). 

The proposed mechanism for in meso crystalliza¬ 
tion therefore depends, at least initially, on retention of 
the underlying cubic or sponge mesophase. However, 
a crystallization trial requires screening across a wide 
range of crystallization conditions, containing poly¬ 
mers, salts, buffers, and at varying pH, all of which 
are known to drive structural changes in lipid 
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Lipid Mesophases for 
Crystallizing Membrane 
Proteins, Fig. 3 A schematic 
representation of the proposed 
mechanism for membrane 
protein crystallization within 
a bicontinuous cubic phase. 
The protein is depicted 
embedded within the bilayer 
environment of the cubic 
phase. Addition of precipitants 
triggers phase separation. 

A local lamellar phase is 
shown acting as a conduit- 
moving protein from the bulk 
cubic phase to the growing 
crystal face (Reprinted with 
permission from Caffrey 
(2008). Copyright 2008 
American Chemical Society) 



mesophases. Each individual well within 
a crystallization trial will therefore contain a unique 
combination of lipid, MP and associated detergents 
and buffers, and the components of the crystallization 
screen. Any of these components could affect the 
structure of the underlying lipid mesophase, and this 
structure may evolve during the time period of crystal 
growth. Detergents and buffers associated with the 
MP, as well as the components of the crystallization 
screen, have been shown to destabilize the cubic phase 
matrix. In addition, incorporation of the MP itself can 
significantly impact the cubic phase. In particular, 
destruction of cubic phase was shown to occur when 
there was a large discrepancy in size between the 
hydrophobic and hydrophilic domain sizes of the MP 
and the bilayer thickness and water channel diameter 
of the cubic mesophase (Conn et al. 2010a). 

Despite recent advances, in practice, the technique 
of in meso crystallization is poorly understood, and it 
remains an essentially empirical technique. In particu¬ 
lar, virtually nothing is known about the factors that 
promote crystallization. Crystallization trials therefore 
require screening across a wide range of conditions; 
successful crystal growth invariably results from 
screening tens of thousands of conditions randomly. 
This situation is particularly unfavorable for crystalli¬ 
zation of MPs which are typically available in 


extremely small quantities; less than 1 mg of protein 
may result from months of expression and purification. 
The scarcity of MP has contributed to the dearth of 
knowledge in this area; available MP tends to be used 
directly in crystallization trials rather than in funda¬ 
mental characterization studies. 
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Synonyms 

Lipid bilayer; Polymorphic and mesomorphic behav¬ 
ior of lipids; Self-assembly 

Definition 

Lipids are a class of naturally occurring organic mol¬ 
ecules that perform essential cellular functions in liv¬ 
ing systems. They have been dehned as fatty acids, and 


their derivatives and substances are related biosynthet- 
ically or functionally to these compounds (Larsson 
1994). Lipids form the structural matrix of biological 
membranes, function as energy storing molecules in 
the form of fat, and are involved in signaling pathways. 
Self-assembly is a thermodynamically spontaneous 
process during which a disordered system composed 
of entities such as molecules comes together to 
form well-defined microstructures or macrophases as 
a result of local and long-range physicochemical 
interactions. 

Introduction 

Lipids (Mouritsen 2005) come in a large variety of 
sizes, shapes, and chemistry. The chemical classifica¬ 
tion of lipids categorizes them mainly into fatty acids, 
glycerolipids, glycerophospholipids, sphingolipids, 
sterols, and saccharolipids. Of these, glyceropho¬ 
spholipids, sphingolipids, sterols, and saccharolipids 
are the major components of biological membranes 
and the first three are commonly used in model 
membrane constructs. These four categories of lipids 
have a common underlying chemical property: 
amphipathicity, which enables them to inhabit 
oil-water interfaces. The lipids are composed of 
a hydrophilic head group that associates with water 
and one or more hydrophobic acyl tails which prefer 
to partition into an oily region. The chemical nature 
and size of the head group as well as the hydrophobic 
tails is very varied. On a molecular level, the richness 
of lipid phase behavior owes its existence to this 
chemical complexity of the individual heads and 
tails of the lipids as well as the balance between the 
sizes and shapes of the two. In living tissue, lipids 
are mainly organized into (1) 5-nm-thick lipid 
bilayers that form the cellular membrane and several 
intracellular structures such as the endoplasmic retic¬ 
ulum and the Golgi apparatus, (2) intracellular lipid 
droplets, and (3) lipoprotein particles which shuttle 
between cells. 

Outside a living cell, individual lipids and lipid 
mixtures can be observed to undergo a rich variety of 
phase behavior depending on lipid composition and 
external conditions such as pH, temperature, pressure, 
hydration, and ionic strength. The structures include 
planar lipid bilayers, uni- and multilamellar vesicles, 
giant unilamellar vesicles, lipid droplets and 
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Lipid Organization, Aggregation, and Self-assembly, 
Fig. 1 Snapshots of exemplary lipid structures. Left: Simulation 
snapshot of a self-assemhled triglyceride lipid aggregate inside 
a phospholipid bilayer (Khandelia et al. 2010). Shown here is the 
side view, depicting two periodic images along the bilayer 
normal (the bar is 10 nm). POPC tails are shown in blue, 
POPC phosphate beads are shown as green spheres, triglyceride 
tails are shown in red, triglyceride glycerol backbone as red 
spheres, and water is depicted in cyan. The blue lines represent 
the central simulation cell boundaries. Right. Molecular 

lipoproteins, and cubic, hexagonal, and inverted 
hexagonal phases (Hyde et al. 1997) to name a few. 
Figures 1 and 2 highlight some of them as typical 
examples. 

In this entry, we will present an overview of the 
structures and phases that model lipid mixtures can 
adopt in mostly aqueous phases and the biophysical 
methods that are commonly used to analyze lipid sys¬ 
tems. The physical and chemical principles that 
emerge from analysis of model lipid mixtures are 
often applicable to complex living systems and are of 
signihcant consequence to the science and art of the 
design of drug delivery vehicles, which increasingly 
exploit the rich environment-dependent phase behav¬ 
ior of lipid particles. 


Methods 

The structures of lipid systems and their dynamic 
properties can be determined and characterized by 
numerous techniques. Given that space is limited, 
here, we aim to provide some flavor for the techniques 
that are commonly used in the held, more extensive 
discussions being available elsewhere (Serdyuk et al. 
2007; Feller 2008). 


structure of high-density lipoprotein with two apoA-I proteins 
around the lipid droplet (Vuorela et al. 2010). Dark gray stands 
for POPC head group and dark brown for palmitoyl-PC 
(PPC) head groups, light gray for POPC hydrocarbon chains, 
light brown for PPC chains, light orange for cholesterol 
OH-groups, bright yellow for cholesterol body, dark orange for 
cholesteryl ester (CE) bond, orange for CE ester body and chain, 
dark green for triglyceride (TG) ester bonds, and bright green 
for TG chain. Proline residues in apoA-I sequences are shown 
in green 



Lipid Organization, Aggregation, and Self-assembly, 
Fig. 2 “Spheres” - liposomes of varying lipid and protein 
composition, the typical size being 40 pm. Courtesy of Luis 
Bagatolli and Jorge Bernardino de la Serna (Science in Your 
Eyes 2011) 

Experimentally, large-scale phase behavior is most 
often elucidated hy differential scanning calorimetry, 
which gauges changes in heat capacity to pinpoint the 
boundaries between different phases. Neutron and 
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small angle x-ray scattering are also commonly used 
for measuring phase behavior and lipid structures, as 
are also Langmuir pressure-area isotherms for lipid 
monolayers to consider different phases emerging for 
increasing surface pressure in the membrane plane. In 
fluorescence microscopy, one follows the partitioning 
of fluorescent probes into different membrane regions, 
and by knowing the preference of each probe for 
ordered versus disordered environments, one can get 
insight into the structures in a given lipid system. 
Meanwhile, nuclear magnetic resonance (NMR) is 
perhaps the most commonly used method to elucidate 
the ordering of lipid hydrocarbon chains, which corre¬ 
lates with the ordering of lipids in general. NMR also 
allows one to exploit lipids’ dynamic properties to 
unravel different phases: as the pace of lipid diffusion 
largely depends on how densely the lipids are packed 
with respect to one another, NMR can differentiate 
between long-range gel and fluid phases and also 
detect nanodomains with different packing properties, 
reflecting their ordered nature. 

For consideration of structures over smaller scales, 
the techniques mentioned above work quite well for 
nanodomains and can be complemented by, e.g., 
atomic force microscopy, which is an ideal technique 
to unravel nanometer-sized lipid structures, structural 
defects, and different lipid domains. 

The dynamics of lipid systems overall is a very 
exciting topic to explore (Vattulainen and Mouritsen 
2005), and NMR is just one of the many suitable 
techniques for this purpose. Considering again 
diffusion as an example, fluorescence recovery after 
photobleaching (FRAP) is a particularly often 
used technique, as is also fluorescence correlation 
spectroscopy (FCS) and single-particle tracking 
(SPT). The downside of all of these methods is the 
use of fluorescent probes, which affect the structural 
and dynamical properties of lipid systems in which 
they lie. Their use in studies of lipid systems and 
the interpretation of their results therefore warrant 
particular care. 

On the computational side, the current methods of 
choice are either atomistic or coarse-grained (CG) 
simulations usually based on classical molecular 
dynamics. The present models allow simulations of 
essentially all lipid types with all possible structures 
that one can think of. The main limitation of lipid 
simulations is the performance of the simulation 
engines and computers. Atomistic simulations are 


currently limited by system sizes of the order of 1 mil¬ 
lion atoms (roughly a system of size 20 x 20 x 20 nm^) 
and time scales of the order of 1 millisecond. The 
speedup of coarse-grained models compared to atom¬ 
istic ones varies quite a lot, but roughly, a speedup of 
100-10,000 is typical in the field. That is, for the same 
system sizes as in atomistic simulations, CG models 
can go beyond the time scale of seconds. If coarse- 
graining is extended over molecules to design models 
based on fields instead of molecules, such as in the case 
of phase-field models, the possible time and length 
scales increase even further. With the ever increasing 
scales considered in simulations, we also face the 
challenge of hydrodynamic interactions. Those inter¬ 
actions emerge from the conservation of momentum in 
local particle-particle interactions and play an impor¬ 
tant role especially at large length and time scales. 
However, currently, too few simulation techniques 
take hydrodynamics properly into account. The use 
of hydrodynamic methods such as dissipative particle 
dynamics and stochastic rotational dynamics are there¬ 
fore to be strongly encouraged in large-scale simula¬ 
tions of lipid systems. Altogether, the field of 
simulations has established itself as a remarkable 
means that not only complements experiments but 
also provides testable predictions. Examples below 
provide a taste of this fascinating novel field. 

Physical Principles of Lipid Organization and 
Self-assembly 

Self-assembly and organization of lipids are driven by 
forces that are surprisingly weak (Evans and 
Wennerstrom 1999). This is the essence of “soft 
matter” that lipid systems also are - in soft materials, 
the forces of interest are comparable to the thermal 
energy k^T, where is the Boltzmann constant and T 
is the temperature. 

The soft nature of lipid systems and its relation to 
thermal fluctuations has two important consequences. 
First, what matters in lipid organization are the inter¬ 
actions that are somewhat larger than k-^T but consid¬ 
erably smaller than the strength of covalent bonds, the 
latter being ^200 k-^T. In practice, the most central 
interactions are those due to dispersion (van der Waals, 
about 1-3 k^T), hydrogen bonds (^3-8 k^T), and 
electrostatics (with a wide range of interaction 
strengths depending on distance and solvent 



L 


1276 


Lipid Organization, Aggregation, and Self-assembly 


properties). These give rise to the enthalpic component 
(E) of free energy (F), which the system aims to min¬ 
imize. Second, in soft materials, one cannot neglect 
the entropic component of free energy, which aims 
to drive the system away from crystallization by 
keeping it fluid through thermal motion. The subtle 
competition of enthalpic forces with entropic ones 
in systems with amphiphilic molecules is often pic¬ 
tured as the so-called hydrophobic effect, which rep¬ 
resents the tendency of water to exclude nonpolar 
molecules such as oil. The effect originates from the 
inability of water to interact favorably with oil-like 
molecules, for which reason water has to restructure 
its hydrogen bonding network around oil-like particles, 
which in turn decreases entropy (S) and therefore also 
the entropic component (TS) of the free energy 
F — E—TS. Given this, it is easy to figure out what 
happens when lipid molecules are solvated in water; 
to minimize the free energy, above the so-called 
critical micelle concentration, lipids spontaneously 
aggregate Into spherical lipid structures known 
as micelles, where the contact between oil-like hydro¬ 
carbon chains of lipids and water is minimized and 
only the polar head groups of lipids face water 
molecules. 

At larger lipid concentrations, essentially the same 
principle drives the formation of other lipid structures, 
such as lipid droplets, liposomes, planar bilayers, 
cylindrical micelles, and the so-called plumber’s 
nightmare cubic phase (Hyde et al. 1997). What deter¬ 
mines the shape and the morphology of an emerging 
lipid structure depends on, e.g., lipid-lipid interactions, 
pH, salt, and distribution of lipids in a membrane. 
Commonly, the structure Is predicted by the surfactant 
packing parameter p = v/ 4 . qq, where v is the volume of 
a single lipid, Aq is the cross-sectional area of the head 
group region, and 4 . is the critical (maximum) length of 
the hydrocarbon region. The parameter describes in 
a simplified manner conditions for, e.g., micelles 
{p < 1/3) and planar bilayers {p ~ 1). However, its 
predictions should be considered as suggestive only, 
since v, Z^, and aq are not only difficult to measure but 
also depend on thermodynamic conditions and inter¬ 
actions between lipids, ions, and the lipid composition 
overall. In simplified model systems, the parameter 
p works reasonably well, but should be applied with 
great care in native systems. 


Lipid Structures with Varying Shapes 

Lamellar Lipid Phases 

At sufficiently high lipid concentration, an aqueous 
lipid dispersion forms unilamellar vesicles consisting 
of a single bilayer of lipids (packing parameter p k, 
or multilamellar vesicles comprised of concentric mul¬ 
tiple bilayers separated by a layer of water (Science in 
Your Eyes 2011), see Fig. 2. The essential structure 
here is a lipid bilayer, which consists of lipids arranged 
in two lamellae with the hydrophobic lipid tails 
pointing toward each other and the hydrophilic 
headgroups guarding the hydrophobic core of the 
bilayer from water (Bloom et al. 1991). Above the 
so-called main phase transition temperature T^, 
a single-component lipid bilayer with a thickness of 
^5 nm is in the fluid (liquid-crystalline, L^.) phase, 
which is liquid in terms of allowing lipids to diffuse 
freely in the plane of the membrane. Meanwhile, the 
bilayer is also liquid-crystalline as there is orienta¬ 
tional order due to the lipids being on average aligned 
along the bilayer normal. The E^ phase is also often 
denoted as liquid-disordered since the liquid- 
crystalline order in this phase is rather weak, the con¬ 
formations of lipid hydrocarbon chains being largely 
disordered. For decreasing temperature, the diffusion 
of lipids slows down and, at T^, eventually drops down 
by several orders of magnitude as the bilayer 
undergoes a phase transition from the fluid to the gel 
phase with a thickness of ^6 nm. The gel phase is 
broadly speaking quasicrystalline, characterized by 
tight packing of lipids, tilted lipid orientations with 
respect to membrane normal, strong conformational 
order in the hydrocarbon region, and quasi-long- 
range order in the membrane plane. In multilamellar 
vesicles, a “ripple” phase sometimes intervenes 
between the fluid and the solid-like gel phases, and 
the accompanying phase transition is referred to as the 
pretransition. Overall, the transitions are determined 
by soft interactions between adjoining lipids as briefly 
discussed in the previous section. Thus, the chemical 
nature of the lipid (size of head group, degree of 
unsaturation, length of the lipid tails, etc.) determines 
its T„j. For example, DPPC (dipalmitoylphosphati- 
dylcholine), which is a lipid with fully saturated tails, 
has above room temperature, while POPC 
(palmitoyl-oleoyl-PC), which is identical to DPPC 
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except for one slightly longer hydrocarbon chain that 
has a single double bond, melts well below room 
temperature. In native cell membranes comprised of 
hundreds or even thousands of different lipids, the 
main transition temperature is not well defined since 
reflects the properties of a single-component 
bilayer (see below). Nonetheless, there is 
a considerable amount of data suggesting that cell 
membranes at physiological temperatures operate in 
the fluid phase. The consequent ability of lipids to 
freely diffuse in fluid membranes enables them to 
quickly seal membrane defects. Given that one of the 
central tasks of membranes is to compartmentalize 
cells, this is also essential for their biological function. 

When several lipid types with different are 
mixed together, the mixture’s phase behavior becomes 
more complex with a range of phase transition temper¬ 
atures, and several phases can simultaneously coexist 
at a given temperature. Even for a two-component 
system, the resulting phase diagram can he rather com¬ 
plex. The phase behavior of the lipid mixture can then 
be described as a typical two-axis binary liquid mix¬ 
ture diagram such as that of ethanol water. Particularly 
relevant to mammalian membranes is a mixture of PCs 
with cholesterol, both lipid types being abundant and 
the molar cholesterol concentration in some related 
membranes being even ^50%. Cholesterol is in many 
ways fascinating, one of the reasons being that it 
induces a new liquid-ordered phase (Lq) to comple¬ 
ment the fluid and gel phases (Bloom et al. 1991). In 
the Lg phase, a membrane is liquid as it allows rapid 
diffusion of lipids as in the fluid phase, but it is also 
ordered due to the high conformational order as in the 
gel phase. Interestingly, while cholesterol slows down 
lipid diffusion and hence decreases membrane fluidity 
in the fluid phase, it increases the fluidity in the gel 
phase. 

One of the topics that have received substantial 
attention during the last 15-20 years is the relevance 
of lipids in biological functions. The lipid that has 
found its place in the spotlight is cholesterol. There is 
now a major body of evidence supporting the view that 
membrane proteins and lipids such as cholesterol form 
functional nanoscale complexes that work in unison 
(Lingwood and Simons 2010). Lipid rafts, as they are 
commonly called, are considered to be rich especially 
in cholesterol and sphingolipids, and the role of these 


nano-sized membrane proteins in cellular functions is 
currently investigated very actively. Lipid rafts seem 
to be particularly important in signaling, and they 
likely also play an important role in membrane- 
associated phenomena such as receptor trafficking 
and regulation of neurotransmission. 

At an air-water interface, lipids can also assemble 
into a monolayer, with the lipid tails pointing into the 
air. An appealing example of this structure is the lipid 
monolayer in the lung alveolae, which reduces the 
surface tension of the air-water interface and enables 
mammals to breathe without exerting otherwise over¬ 
whelmingly large forces that would be required to 
contract and expand the alveolar surface. 

Nonlamellar Phases 

Quite a few lipids are not characterized by an approx¬ 
imately constant cross-sectional area profile along 
their length. Instead, they can be cone-like and hence 
promote an inherent spontaneous curvature when 
inserted at water-lipid interfaces. Lipids in a bilayer 
containing two monolayers of finite spontaneous cur¬ 
vature are frustrated because of entropic reasons. If the 
curvature stress exceeds the noncovalent cohesive 
forces that keep a bilayer together, then the lipids 
form nonlamellar structures to release the stress, 
resulting in complex three-dimensional structures. 
Inverted hexagonal, cubic, and less common hexago¬ 
nal phases are examples of these structures. 
The formation of these microstructures is not only 
driven by lipid composition but also by environmental 
conditions. Lor example, the inverted hexagonal phase 
can be promoted from a lamellar phase by increasing 
the temperature, or by incorporation of a variety of 
amphipathic molecules such as drugs, detergents, and 
acylglycerols. Meanwhile, mammalian cells are 
endowed with two important nonlamellar lipid struc¬ 
tures; lipoproteins and lipid droplets (Vuorela et al. 
2010; Hevonoja et al. 2000), see Lig. 1. Lipoproteins 
known as carriers of cholesterol and its esters consist 
of a hydrophobic core comprised of disordered 
triacylglycerols and cholesteryl esters, surrounded by 
a lipid monolayer containing phospholipids and cho¬ 
lesterol. In the low-density lipoprotein, a single protein 
wraps the ^25-nm-sized lipid particle, while in the 
high-density lipoprotein, the number of proteins 
around the lipid droplet is greater. While the formation 



L 


1278 


Lipid Organization, Aggregation, and Self-assembly 


mechanism of lipid droplet is currently not understood, 
lipoprotein particles possibly emerge as globular tri¬ 
glyceride droplets that pick up phospholipids when 
released from the membrane of a cell or a cell organelle. 

Current State of the Art 

What is amazing in both experiments and simulations 
is their ability to pinpoint how individual molecules 
move in space, cluster together, form ordered struc¬ 
tures, and, by doing so, give rise to fascinating phe¬ 
nomena that describe the dynamics of a given system. 
In atomistic and CG simulations, these molecular- 
scale dynamic phenomena can be visualized more 
accurately than in experiments since, in quite a few 
experimental techniques, the low spatial resolution is 
a major limitation. However, as also in experiments the 
methodology has made substantial progress, character¬ 
ization of molecular-scale processes is becoming more 
and more possible. Thus, below, we briefly present 
a couple of examples for the state of the art in the 
field, highlighting the progress in both experiments 
and simulations. 

Experimental Means to Visualize Lipids in Motion 

The most appropriate means to visualize molecular 
motion in lipid systems is to use fluorescence. As 
natural lipids do not fluoresce, they are merged 
with fluorescent labels such as pyrene and BODIPY. 
The labels are most often attached to the lipids’ hydro- 
phobic chains, as in this manner, the perturbations 
induced by the labels are the weakest. The fluorescent 
probes are then excited by photons (light), and after 
a typical waiting time of a few nanoseconds, the probe 
emits part of its excess energy as light. Obviously, this 
reveals the location of the probe just like in the systems 
shown in Fig. 2. This simple principle is the core of 
a number of techniques based on fluorescence imaging, 
such as FRAP, FCS, SPT, and confocal microscopy. 

The main limitation of fluorescent techniques is 
their spatial resolution. For visible light, the smallest 
wavelength is about 400 nm, which implies that the 
smallest spatial scale that can be resolved is ^200 nm. 
Resolution of this size is rather poor considering that 
molecular phenomena take place in scales of nanome¬ 
ters and sizes of lipid domains with different physical 
properties range from a few to a few hundred nanome¬ 
ters. Improvements to enhance spatial resolution have 


yet been developed, one of the most remarkable tech¬ 
niques being stimulated emission depletion (STED) 
(Hell 2007; Huang et al. 2009). STED is based on 
using two laser pulses with matching focal points to 
excite and de-excite fluorescent dyes. Considering 
labeled lipids in a flat membrane, the first laser excites 
the dyes, while the second laser with a local intensity 
minimum such as a doughnut intensity profile (where 
the intensity is zero in the center of a circle and evolves 
like a hill for increasing radial distance from the focal 
point) inhibits the fluorescence of all the excited dyes 
rendering them nonfluorescent, except for those close 
to the focal center. By increasing the intensity of the 
second de-excitation spot, the region in a membrane 
where dyes remain fluorescent can be made smaller. In 
lipid membranes, the detected region of interest has 
been made as small as about 20 nm (Fig. 3), which has 
allowed mapping proteins in cells (Donnert et al. 2006) 
and studying nanoscopic details of lipid membrane 
diffusion in living cells (Eggeling et al. 2009). In 
solid-state applications, the detected region can be as 
tiny as ^5 nm (Rittweger et al. 2009). Concluding, 
STED has potential to study the details of molecular 
processes under physiological conditions at ^ nano¬ 
meter length scales. 

Lipid Dynamics Inside Computers 

Improved force fields, increasing computational 
power, and new multiscale modeling algorithms have 
not only enabled particle-based simulations to explore 
phenomena occurring over time and length scales once 
deemed inaccessible, but have also significantly 
boosted predictive ability. Domain formation in planar 
bilayers, dynamics of lipid-protein complexes, vesicle 
fusion, structural analysis of lipid droplets and lipopro¬ 
teins, and simulations of entire viral particles are some 
recent examples. 

A key goal in the field of particle-based simulations 
is to obtain simultaneous access to multiple time and 
length scales during the course of a simulation. Phe¬ 
nomena taking place over disparate time scales can be 
uncoupled to improve the computational efficiency of 
simulations without making overindulgent compro¬ 
mises in accuracy. In multiscale modeling, 
a phenomenon is investigated at various time and 
length scales using models of varying degree of coarse¬ 
ness. The procedure involves refining the courser 
model from simulations of finer models. Such 
multiscale models have facilitated simulations of 
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Fig. 3 STED microscopy on synaptotagmin I molecules on 
endosomes (Donnert et al. 2006). Left. STED recognizes sharp 
dots of 25^0 nm in size, highlighting its resolution and the 


spatial arrangement of synaptotagmin I on the endosome. 
Right: Corresponding intensity profile, showing the spatial res¬ 
olution to be in the order of ~20 nm (Permission granted; 
copyright (2011) National Academy of Sciences, USA) 


vesicles of the size of 100 nm with elastic and dynamic 
properties that match those in real systems (Izvekov 
and Voth 2009). What is more, although simultaneous 
temporal access to all length scales yet remains elu¬ 
sive, it is also possible to fine grain the coarse-grained 
system back to atomic description for detailed studies 
of molecular structures. At the moment, the develop¬ 
ment of accurate algorithms that allow shuttling 
between course-grained and fine-grained representa¬ 
tions of lipid systems is a very active field of work 
(Izvekov and Voth 2009; Rzepiela et al. 2010). 

Summary 

The structures of systems comprised of lipids are fas¬ 
cinatingly complex, and their role in biological func¬ 
tions is largely just beginning to be understood. What 
we do know is that lipids as such can function, e.g., as 
messengers in signaling. Lipoproteins as carriers of 
cholesterol and its esters include both lipids and pro¬ 
teins that work in unison. In cell membranes, integral 
proteins that take care of a great deal of cellular func¬ 
tions exist and work together with lipids. Meanwhile, 
the structure of a lipid system can also play a role, 
since, e.g., in liposomes the asymmetric pressure pro¬ 
file across a membrane contributes to the gating of 
mechanosensitive channels. These and many other 
phenomena observed in lipid systems have recently 
been elucidated by combination of experiments with 
simulations, which overall is an excellent strategy to 


gain better understanding of the many lipid structures 
and the roles of lipids in biological functions. We hope 
that this short overview attracts the readers to contrib¬ 
ute to this exciting field, to unravel the many open 
questions dealing with lipids’ biological significance. 

Cross-References 

► Atomic Force Microscopy 

► Chemical Diversity of Lipids 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 

► Dynamic Light Scattering 

► Fluorescence 

► Fluorescence Correlation Spectroscopy 

► Glycerolipids; Chemistry 

► Lipid Domains 

► Lipid Lateral Diffusion 

► Lipid Signaling and Phosphatidylinositols 

► Membrane Fluidity 

► Membrane Proteins; Structure and Organization 

► Molecular Dynamics Simulations of Lipids 

► NMR 

► Phase Transitions and Phase Behavior of Lipids 

► Pressure Effects on Lipid Membranes 

► Solid-State NMR 

► Stimulated Emission Depletion (STED) Microscopy 

► Thermodynamics of Lipid Interactions 

► X-Ray Scattering of Lipid Membranes 
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Lipid Protocells 

Anders N. Albertsen and Pierre-Alain Monnard 
Department of Physics, Chemistry and Pharmacy, 
FLinT center. University of Southern Denmark, 
Odense, Denmark 

Synonyms 

Chemoton; Minimal artificial cell; Minimal self- 
replicating chemical system 

Definition 

Protocells are self-assembled chemical systems that 
are proposed either as minimal models for living cell 
progenitors or as artificial cells. The reductionist view 
of life has lead to the idea that the cellular components 
of information, metabolism, and container are the 
essential pieces of living systems that work together 
through dynamic interconnections (Deamer 1997; 
Rasmussen et al. 2004; Szostak et al. 2001). These 
interactions lead to emergent properties within the 
system, such as self-maintenance and self-replication 
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of all parts. Thus, a protocell should be composed of at 
least these three components. 

The container is often viewed as a self-assembled 
stracture that has to be stable enough to contain metab¬ 
olism and information components, thereby defining the 
protocell identity (Matsuura et al. 2002). However, it 
must still be rather dynamic to allow the protocell life 
cycle (growth and division) to occur. The metabolism of 
a protocell provides the components, e.g., lipids or 
information polymers, necessary for maintenance and 
reproduction of the protocell by conversion of external 
resources. The information molecules perform the cru¬ 
cial role of regulating the functions of the protocell as 
well as storing protocell information. The primary com¬ 
ponent that needs to be regulated, in the protocell, is the 
metabolism. 

Certain chemical compounds named amphiphiles 
are capable of spontaneously self-assembling into struc¬ 
tures defined by a fine bilayer boundary, called vesicles 
or liposomes. These entities possess, to a certain extent, 
all the desired container properties. Among amphi¬ 
philes, lipids are a broad group of natural molecules 
that includes fats, waxes, sterols, fat-soluble vitamins, 
monoglycerides, diglycerides, and phospholipids. Thus, 
they have often been used in protocell container design. 

Introduction 

Protocells have been proposed and investigated as 
models for progenitors of contemporary living cells 
(Origin of Life) (Monnard and Deamer 2002; 
Walde 2006) or as artificial cells (Artificial Life) 
(Luisi et al. 2006). In the exploration of the Origin of 
Life, it is necessary to consider the environment on the 
early Earth (i.e., 4 billion years ago) which will define 
the availability of chemicals. In contrast, when design¬ 
ing artificial cells, it is possible to assemble systems 
that are not necessarily biomime tic. 

The importance of container structures in protocell 
design, as well as in contemporary cells, has been 
postulated very early on. Indeed, containers provide 
a rare opportunity to compartmentalize (i.e., spatially 
organize), regulate mixing of chemicals, and protect 
molecules from degradation. Thus, the properties of 
(pro)cellular networks are usually intimately linked to 
the presence of container structures. 

In the 1920s, Oparin and Haldane (independently) 
proposed that life began on the early Earth as 


self-assembled structures of organic material. How¬ 
ever, the former viewed cells as a collection of aggre¬ 
gated colloidal particles, while the latter suggested that 
cells consist of numerous half-living chemical mole¬ 
cules suspended in water and enclosed in an oily film 
or membranes. 

During the second half of the twentieth century, the 
research on protocells was greatly advanced by discov¬ 
eries in cellular architecture and function, as well as by 
an ever-increasing understanding of the environmental 
conditions and potential chemistry on the early Earth. 
For example, as the multiple roles of membranes in 
cells were elucidated, it became clear that all cellular 
membranes are predominantly composed of lipid 
bilayers and that phospholipid is a nearly ubiquitous 
amphiphilic component of these bilayers. In 1965, 
Bangham and coworkers demonstrated that phospho¬ 
lipids alone could spontaneously form bilayer vesicles, 
called liposomes, having dimensions in the range of 
bacteria. 

Analysis of chemical contents of carbonaceous mete¬ 
orites, such as the Murchison meteorite (which is 
considered representative of those that impacted Earth 
during the late accretion period) established the abundant 
presence of lipids. Monocarboxylic acids ranging from 
8 to 12 carbons in length were detected, which are now 
considered when protocells are constructed as models for 
progenitors of contemporary cells (Monnard and 
Deamer 2002). 

Lipids Commonly Used for Protocell 
Compartments 

Although lipids are a broad group of molecules, we will 
limit our enumeration to those commonly used in 
protocell design (see Fig. 1). These lipids originate 
predominantly from two types of biochemical subunits, 
ketoacyl and isoprene groups, and belong to the follow¬ 
ing categories: fatty acyls, glycerolipids, phospholipids, 
as well as isoprenol lipids. These molecules have 
amphiphilic character that allows them to spontaneously 
form various structures depending on the solubilization 
medium. Most significantly, they can form vesicles and 
liposomes (lipid bilayer membranes surrounding an 
aqueous core) which mimic cellular architecture. 

Although the ability of phospholipids to self- 
assemble into membranous vesicles is common knowl¬ 
edge, it is less well known that bilayer structures, this 
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Fig. 1 Examples of protocell 
lipids: (1) phospholipid: 1,2- 
dimysristoyl-sn-glycerol-3- 
phosphocholine (DMPC); 
(2-4) fatty acid and its 
derivatives, decanoic acid, 
decyl alcohol, and glycerol 
monodecanoate; and 
(5) isoprenols (which can be 
formed by multiple subunits) 




X = 0 to 3 




time simply called vesicles, can also be prepared from 
single-chain amphiphiles such as fatty acids, i.e., 
monocarboxylic acids, or mixtures of these with fatty 
alcohols, or monoglycerides (Walde 2006). Isoprenols 
can also form vesicles on their own, while sterols, such 
as cholesterol, will only contribute as additives to 
membrane formation, often altering the hilayer 
properties. 

Stability of Vesicles and Liposomes 

To construct a functional protocell, the stability of the 
container structure is paramount as the preservation of 
the interactions between metabolic and information 
components is the foremost task of the container and 
maintain its component concentrations. 

Fatty Acid Vesicles 

In suspensions of fatty acids at concentrations above the 
critical vesicle concentration (CVC) or critical hilayer 
concentration (CBC), the fatty acid hilayer membranes 
are stabilized by van der Waals and hydrophobic inter¬ 
actions between their hydrocarbon chains, as well as by 
hydrogen bonds, formed between deprotonated and 
protonated acid head groups. Thus, the formation of 
hilayer vesicles is highly sensitive to the pH of the 
solution. Below the apparent pK^ of the acid, fatty 
acid droplets will replace the hilayer structures and. 


above it, micelles will form. In addition, CBC increases 
with decreasing chain length and degree of unsaturation 
of the hydrocarbon chain (Monnard and Deamer 2003). 
For instance, octanoic acid (the shortest vesicle¬ 
forming carboxylic acid) and oleic acid (18 carbons, 
monounsatured fatty acid) form bilayers at concentra¬ 
tions higher than 145 mM (pH 6.5) and 0.19 mM (pH 
9.2), respectively (Maurer et al. 2009). 

Besides pH, environmental conditions such the salin¬ 
ity (type of salts, divalent, or monovalent), ionic strength, 
and temperature will affect hilayer stability. Fatty acid 
vesicles are unstable at sodium chloride concentrations 
near that of seawater. Furthermore, millimolar concen¬ 
trations of divalent cations cause fatty acid vesicles to 
precipitate as nonmembranous aggregates. 

High temperatures can also destabilize fatty acid 
membranes, leading to increased bilayer permeability 
followed by fatty acid precipitation. Single unsaturated 
and saturated long-hydrocarbon-chain amphiphiles, 
such as lauric and oleic acid, are known to form stable 
stractures at elevated temperatures. In fact, those satu¬ 
rated fatty acids with hydrocarbon chains longer than 
lie can only form stable vesicles when the temperature 
is above 30° C. But, the effect of long-term temperature 
exposure has only been investigated for some saturated 
short-chain fatty acids as well as oleic acid. As might be 
expected, even oleic acid vesicles were disrupted by 
24-h incubation at temperatures >65°C and decanoic 
acid vesicles collapsed at 45°C. 
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Vesicle-vesicle interactions (aggregation and 
fusion) can be triggered by temperature, and this is 
interesting from a protocell function perspective. 
While individuality of modern cells is necessary 
for evolution on a relatively long period, it may 
not be essential in primitive cellular systems. The 
exchange of internal contents as well as bilayer 
components could have led to increased complexity 
so that the emergence of new functions could be 
promoted. 

Phospholipid Bilayer Membranes 

Phospholipids, in particular phosphatidylcholines 
(PCs), have been used for protocell design. Their 
chemical or biochemical synthesis is rather complex, 
as it requires four enzymes, thus reducing the prebiotic 
likelihood. 

These amphiphiles have two acyl chains, a fact that 
leads to properties in marked contrast with fatty acid 
bilayers. PC liposomes spontaneously form at CBCs as 
low as 10^ to 10“® mM. These stractures are very stable 
over a broad pH range, resistant to high concentrations 
of divalent cations, and can withstand wide temperature 
variations. However, they tend to be rather imperme¬ 
able to solutes. 

Mixed Amphiphile Systems 

All modern organisms use lipid mixtures to construct 
cell membranes and to modulate membrane chemical 
and physical properties. Examples include tocopherols 
and carotenoids (vitamins E and A) that are functional 
components of membranes, as well as essential fatty 
acids such as linoleic and arachidonic acid. Cholesterol 
is also used by eukaryotic cells to modulate bilayer 
fluidity and permeability. 

Membranes composed of a pure amphiphile are 
often used in protocell designs although studies of 
mixed bilayers have demonstrated that modulation 
can be achieved by mixing single-chain amphiphiles. 
Eor instance, adding a secondary amphiphile, such as 
a fatty alcohol or a polyol (see Eig. 1(3) and (4)), to 
pure fatty acid bilayers can substantially lower the 
CBC and mitigate disruptive effects of salts, high pH, 
or temperature effects on fatty acid bilayers. When 
these vesicles are exposed to digestive enzymes, 
the encapsulated species remained functional, a result 
that establishes the relative stability of the mixed 
structures. 


Compartmentalization of Metabolitic and 
Information Machinery 

Protocell development is often focused on three types of 
reactions; the synthesis of biopolymers; the synthesis of 
other protocell building blocks, such as amphiphiles; 
and the uptake of energy. The biopolymer synthesis 
are central to the functionality of the protocell due to 
their ability to catalyze reactions under mild conditions, 
as well as encode the information needed to constmct 
a cell, while the synthesis of other building blocks is 
a prerequisite for self-maintenance and self-reproduc¬ 
tion. Energy uptake and transduction is essential to 
permit the emergence of (semi)autonomous protocell. 

Encapsulation 

The membranous boundaries of protocells should be 
sufficiently impermeable to polymeric materials so 
that complex networks of catalyzed reactions would 
remain enclosed within, even though they should be 
more permeable than contemporary cells, to permit the 
uptake of nutrients. 

A large number of methodologies have been devel¬ 
oped that are usually based on the random encapsula¬ 
tion of solutes (Monnard et al. 2008; Walde and 
Ichikawa 2001). This fact renders the encapsulation 
of functional multiple-component catalytic systems, 
such as transcription and translation machinery, 
increasingly difficult as the number of components 
increase, suggesting that the protocell metabolism 
should be kept simple, especially considering the 
need for replication. 

Membrane Permeability and Catalysis 

Protocells will lack specialized membrane transport 
systems to take up substrates from their environment 
so that simple mechanisms such as passive diffusion 
will play an essential role for transmembrane transport 
(Mansy et al. 2008; Monnard et al. 2007). Studies of 
cellular membranes have however demonstrated the 
role of the lipid bilayer as the primary permeability 
barrier to free diffusion of polar and ionic solutes. 
Recent observations have established that the perme¬ 
ability to ionic solutes is inversely proportional to the 
length and the degree of unsaturation of the hydrocar¬ 
bon chains, as well as the charge density of the solute. 
The permeability of PC bilayers to large ionic solutes 
depends on the phase of the bilayers (gel-like or 
liquid-crystalline phase), by extension on the 
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temperature and is generally lower than that of fatty 
acid bilayers. Permeability studies have shown that 
PC bilayers are not simply a barrier but also exhibit 
a surprising degree of selectivity. DMPC bilayers 
(see Fig. 1(1)) are permeable to ADP or even ATP, 
but impermeable to pApA. They were also more per¬ 
meable to ribose than to other pentoses and hexoses. 

The permeability can be further modulated by vary¬ 
ing bilayer composition by adding cosurfactants, such 
as fatty acids or detergents. Mixed PC bilayers have an 
increased permeability to large ionic solutes, in 
marked contrast to the stabilization of fatty acid bila¬ 
yers by alcohols. It would therefore be advantageous 
for protocells to have membrane boundaries composed 
of an amphiphilic mixture. 

Energy Uptake 

Three sources of energy can be harvested by 
protocells: chemical energy in the form of high-energy 
chemical bonds, oxidation-reduction reactions, and 
light energy (Deamer 1997). To perform synthetic 
reactions (e.g., to form a C-C bond), the uptake of 
energy and conversion into stable chemical gradients 
is necessary as these reactions usually require 
a vectorial two-electron transfer reaction. Here again, 
protocell membranes could play a crucial role, as their 
counterparts in cells do. 

The question is whether protocell structures can 
maintain gradients during their build-up until the 
amplitude is large enough to drive uphill synthetic 
reactions. Recent studies have shown that a chemical 
gradient across a membrane can be created using 
embedded, photosensitive polycyclic aromatic hydro¬ 
carbons (PAHs). Self-assembled bilayers can also pro¬ 
vide the framework necessary to achieve the vectorial 
organization in photosynthetic machinery. That is, the 
creation of a pH gradient strong enough to the permit 
ATP synthesis which requires 4 protons for each ATP 
molecule (Steinberg-Yfrach et al. 1998). 

Semiautonomous Metabolism 

A minimal transcription system could be composed of 
RNAs that has the dual functions of catalysis and 
information storage (hypothesized in the “RNA 
world” concept). But, the required molecules, an 
efficient RNA-dependent RNA polymerase remains 
elusive. Therefore, most of the research on genetic 
information transfer within vesicles has been 
conducted using enzyme systems (see reviews 


Monnard et al. 2008; Walde and Ichikawa 2001). 
Single polymerase enzymes, as well as transcription 
and translation systems (about 80 molecular compo¬ 
nents), have been successfully encapsulated in both 
liposomes and vesicles. These model systems can 
help determine how information transfer can perform 
in protocells. 

These designs aim to creating a fully functional cell 
with the help of reconstituted biological architectures 
composed only of the necessary components for 
protein production (Noireaux and Libchaber 2004), 
but they do not represent a fully independent metabo¬ 
lism; thus, the protocell has to gather the missing 
molecules from the external medium. Interestingly, 
protein production within liposome can be more effi¬ 
cient than in bulk solution. The simultaneous expres¬ 
sion of two membrane proteins associated with the 
production of a lipid, phosphatidic acid, was conhrmed 
as their products were positively identihed in the sys¬ 
tem boundaries (Kuruma et al. 2009). That is, con¬ 
tainer growth can in principle be linked with an 
internalized production of amphiphiles. 

Considering that the encapsulation procedures used 
to prepare these liposomes achieves small encapsula¬ 
tion yields, there is a negligible probability of a simul¬ 
taneous encapsulation of the many components (more 
than 80) in a spherical compartment with a 100-nm 
radius. Thus, the improvement of protein production 
observed is likely much higher locally. 

An alternate approach to colocalization of reaction 
components is to anchor the entire reaction network to 
the structure (Rasmussen et al. 2004), i.e., on the 
surface of a bilayer. The advantages of this system 
are numerous: Nutrients and waste products can freely 
diffuse to and from the metabolic/informational core. 
It also eliminates the problem of vesicle lysis, as only 
the bilayer itself needs to stay intact for the protocell to 
survive. This system is more flexible in terms of struc¬ 
ture types as oil droplets or reverse micelles can be also 
used. However, it should be noted that the type and 
complexity of the catalytic assemblies anchored in 
vesicle bilayers will be limited compared to the com¬ 
plexity of a transcription-translation apparatus. 

One such an “interfacial” system has recently been 
proposed, it envisions information and catalytic mole¬ 
cules anchored by aliphatic hydrocarbon moieties. 
Furthermore, while information is typically thought 
of as a distant control mechanism, the information 
molecules in the interfacial system are not true 
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encoding molecules, but rather as sequence-dependent 
actuators of the metabolic process. The nucleic acids 
would play the role of an electron donor/relay in the 
amphiphile-precursor photofragmentation catalyzed by 
a ruthenium complex. The effectiveness of the electron 
relay function should then depend on the base sequence 
of the information molecule, as investigation of electron 
transfer in DNA has established. This approach sim¬ 
plifies information utilization by a protocell and should 
ease protocellular reproduction as small DNA frag¬ 
ments can be nonenzymatically replicated. 

The current embodiment of this protocell with 
a single nucleobase, 8-oxoguanine instead of a DNA 
polymer, has already allowed the investigation of several 
processes essential for the function of a protocell: i.e., the 
production of building blocks from precursors using an 
information control and external energy (light) harvested 
by the protocell metabolism (Maurer et al. 2011). 

Self-Reproducing Compartments 

Functional protocells need to reproduce their three 
components in a coordinated manner. Indeed, if the 
rate of amphiphile formation substantially exceeds the 
reproduction rates of the metabolic and genetic 
network elements, these species will be rapidly diluted 
in the growing container. Consequently, newly formed 
vesicles will lose the parent characteristics. In contrast, 
an exponential replication of its internal content may 
lead to the release and loss of protocell identity upon 
bursting of the container boundary. 

Luisi and coworkers at the ETH in Zurich have 
approached the question of self-reproducing compart¬ 
ments by using vesicles formed from oleic acid 
(Walde et al. 1994). Oleic anhydrides are used as 
a water-immiscible source of amphiphile precursors. 
Their hydrolysis provides membrane building blocks 
that can be used to make bilayers. Significantly, when 
the hydrolysis is carried out with preformed vesicles, 
these structures induce an autocatalytic-like behavior 
in the system as hydrolysis rates increase with increas¬ 
ing vesicle concentration. 

Szostak and coworkers (Hanczyc et al. 2003) at 
Harvard Medical School were able to demonstrate 
not only growth of fatty acid vesicles containing 
encapsulated RNA but also a kind of division when 
the suspension was filtered through microscopic pores 
or submitted to an external shaking. They further 
established a detailed kinetic analysis of the growth, 
which demonstrated that vesicles could compete for 


membrane components present in their environment, 
and undergo a kind of evolutionary selection (Hanczyc 
and Szostak 2004). 

Attempts have been recently made to closely 
link the replication of containers to an endogenous 
production of amphiphiles either using simple metal 
catalytic systems or complex transcription and transla¬ 
tion machinery encapsulated within amphiphilic con¬ 
tainers, the so-called minimal cell concept (Luisi et al. 
2006). These experiments indicate that in principle an 
internalized production of amphiphiles at rates com¬ 
patible with the replication of the compartment is 
achievable. However, the system still lacks the possi¬ 
bility to replicate all its components, in particular the 
information part. 

Summary 

Much progress has been made implementing one or 
even multiple functions seen in living cells within 
artificial constructs called lipid protocells. However, 
the main obstacle to a truly functional protocell is 
the complete integration of all functions in a single 
self-maintaining, self-reproducing system capable of 
evolution, thereby possessing the emergent properties 
of life. Thus, it seems to be reasonable to attempt 
the design of protocells with a systemic approach, 
i.e., construct them with all their components present 
even in an extremely simplified manner. These self- 
replicating chemical systems will, however, likely 
remain inherently nonliving but allow us to define the 
requirements and limits of abridged metabolic, infor¬ 
mational, and structural interactions. 
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Synonyms 

Inositol lipids; Phosphatidylinositol lipids; 
Phosphoinositides 

Definition 

Phosphatidylinositols (PI) are a class of lipids charac¬ 
terized by their inositol head groups which can be 







Lipid Signaling and Phosphatidylinositols 


1287 


L 


phosphorylated at the 3-, 4-, and 5-positions to yield 
the seven naturally occurring species of phosphory¬ 
lated Pis. 

Basic Characteristics 

Phosphatidylinositol is an important phospholipid in 
the cytosolic side of eukaryotic cell membranes. It is an 
acidic phospholipid consisting of a o-myo-inositol-1- 
phosphate linked via its phosphate group 
to diacylglycerol. Phosphatidylinositol is synthesized 
hy the enzyme cytidine diphosphate (CDP)- 
diacylglycerol inositol phosphatidyltransferase from 
the precursors cytidine diphosphate diacylglycerol 
and inositol (McMaster and Jackson 2004). The inosi¬ 
tol head group can be reversibly phosphorylated to 
yield phosphatidylinositol monophosphates (PIP), 
phosphatidylinositol bisphosphate (PIP2), and 
phosphatidylinositol trisphosphate (PIP3). Seven phos¬ 
phorylated PI species have been documented in 
eukaryotic cells. Although they are quantitative 
minor components of the cell, phosphoinositides are 
important key players in a wide variety of cellular 
functions including membrane trafficking, cell prolif¬ 
eration, growth, and nuclear events (Di Paolo and De 
Camilli 2006). 

Phosphatidylinositol 3-Phosphate 

Phosphatidylinositol 3-phosphate (PI3P) is a mono- 
phosphorylated phosphoinositide at the 3-position of 
the inositol ring. It is predominantly found in early and 
multivesicular endosomes. P13-kinase phosphorylates 
PI upon stimulation of tyrosine kinase receptor and 
G-protein-coupled receptors to form PI3P. P13P can 
also be generated from PI(3,4)P2 by 4-phosphatase 
activity (INPP4) as well as from PI(3,5)P2 by 
5-phosphatase activity (SAC3) (Sasaki et al. 2009). 
PI3P is dephosphorylated by the family mammalian 
myotubularin lipid phosphatases. It is important for the 
recruitment of effector proteins that contain a specific 
PI3P-binding domain. Specific PI3P recognition mod¬ 
ule includes the zinc-finger domains which were ini¬ 
tially found inFabl, YOTB, Vac 1, and EEAl proteins. 
These domains are collectively termed as FYVE 
domains. These FYVE domains containing proteins 
are involved in endocytic trafficking (Tatiana 2006). 


PI3P plays also an important role in regulation of 
autophagy and cytokinesis. 

Phosphatidylinositol 4-Phosphate 

Phosphatidylinositol 4-phosphate (P14P), the most 
abundant PI lipid is the precursor of the important 
signaling molecule PI(4,5)P2, but it has its character¬ 
istic functions through interactions with a number of 
effectors as well. PI4P is synthesized from PI by type II 
and type III PI 4-kinases. Another biosynthesis route 
for PI4P is the turnover of PI(4,5)P2 by type II 
5-phosphatases like synaptojanin, oculocerebrorenal 
syndrome of lowe (OCRL), skeletal muscle and kid¬ 
ney-enriched inositol 5-phosphatase (SKIP), 1NPP5B, 
INPP5E, and INPP5J. 

PI4P is prevalently in the Golgi apparatus, but can 
also be found at the plasma membrane. In the Golgi 
complex, PI4P has important roles in biogenesis of 
transport vesicles as well as maintaining the structural 
and functional organization of the Golgi complex 
(Antonietta De Matteis et al. 2005). At the plasma 
membrane P14P serves mainly as the substrate for PI 
(4,5)P2 synthesis. Several trans-Golgi-network- 
associated proteins have been identified to bind PI4P. 
These proteins include EAPPl, FAPP2, EpsinR, and 
OSBP. EAPPl, FAPP2, and OSBP bind PI4P through 
their PH domain (Balia et al. 2009). These P14P effec¬ 
tor proteins are involved in membrane trafficking upon 
binding to PI4P. It has been shown that endogenous 
and recombinant FAPPs translocate to the Golgi com¬ 
plex as well as to the trans-Golgi-network where trans¬ 
port to the plasma membrane are carried out. FAPPs 
are essential components in controlling the PI4P- 
regulated machinery for generating of post-Golgi 
carriers to the plasma membrane. 

Phosphatidylinositol 5-Phosphate 

Phosphatidylinositol 5-phosphate (PI5P) is constitu- 
tively present in many cell types, at levels roughly 
100 -fold less than the most abundant mono- 
phosphorylated phosphoinositide PI4P. PI5P has been 
shown to be present in the nucleus, Golgi complex, ER, 
and in the plasma membrane. In contrast to other 
phosphoinositides, a specific PI 5-kinase to generate 
PI5P from PI has not been identified to date. 
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Phosphatases instead play an important role in the 
synthesis of PI5P. PI5P can be synthesized by phos¬ 
phatases of the myotubularin family from PI( 3 , 5 )P 2 . 
It can also be generated by dephosphorylation of PI 

(4.5) P2. Two human genes encoding for PI(4,5)P2 

4- phosphatases I and II were identihed to play a role 
in the synthesis of PI5P. In contrast to other phosphoi- 
nositides, PI5P metabolism is not controlled by 
dephosphorylation but phosphorylation mainly by 
PIP 4-kinases. PI5P plays a regulatory role in 
the nucleus and was found to bind the PHD hnger 
motif of the nuclear protein ING2, which regulates 
p53 acetylation. Proteins with a PHD domain like 
INGl, ACF, and ATXl interact with PI5P (Halstead 
et al. 2005). 

Phosphatidylinositol 3,4-Bisphosphate 

Phosphatidylinositol 3,4-bisphosphate (PI( 3 , 4 )P 2 ) is 
a minor lipid showing only trace levels in unstimulated 
cells. Upon stimulation the level of PI( 3 , 4 )P 2 can 
increase substantially, mainly by the action of 

5- phosphatases SHIP, SKIP and synaptojanin by 
dephosphorylating PIP 3 . PI( 3 , 4 )P 2 is mostly found at 
the plasma membrane and in the early endocytic 
pathway. Proteins possessing a PI( 3 , 4 )P 2 -binding 
PH domain includes TAPPl, TAPP2, and Akt/PKB. 
The interactions of these proteins with PI( 3 , 4 )P 2 
suggest the role of PI( 3 , 4 )P 2 in signaling stretching 
from plasma membrane to the nucleus. It has 
also been shown that the PX domain of p47PHOX is 
specihc for PI( 3 , 4 )P 2 . Disruption of the interaction 
between p47PHOX and PI( 3 , 4 )P 2 can cause chronic 
granulomatous disease (McMaster and Jackson 2004). 

Phosphatidylinositol 3,5-Bisphosphate 

Phosphatidylinositol 3,5-bisphosphate (PI( 3 , 5 )P 2 ) is 
mainly concentrated on late compartments of the 
endosomal pathway. The synthesis of PI( 3 , 5 )P 2 
requires PI3P as precursor. The type III PIP kin 
PIKfyve which contains a FYVE domain that is 
required for interaction with PI3P has been identified 
as the kinase for synthesizing PI( 3 , 5 )P 2 . Breakdown of 
PI( 3 , 5 )P 2 is catalyzed by phosphatases from the 
myotubularin family, which dephosphosphorylates PI 

( 3 . 5 ) P 2 at the 3-position of the inositol headgroup to 


form PI5P. PI( 3 , 5 )P 2 has important functions in 
multivesicular body sorting, endosomal dynamics, 
and autophagy. Deficiency in PI( 3 , 5 )P 2 synthesis due 
to mutations in the type III PIPkins, can lead to severe 
neurodegeneration and early lethality (Mayinger 
2012 ). 

Phosphatidylinositol 4,5-Bisphosphate 

Phosphatidylinositol 4,5-bisphosphate (PI( 4 , 5 )P 2 ) is 
enriched at the plasma membrane and is a precursor 
of second messengers and regulates a number of pro¬ 
cesses at the cell surface. PI( 4 , 5 )P 2 has also been 
detected at the Golgi as well as in the nuclear envelope. 
The main pool of PI( 4 , 5 )P 2 is synthesized by the action 
of two distinct classes of lipid kinases. Type I PIP 
kinase uses PI4P as substrate the main source of PI 

( 4 , 5 )P 2 biosynthesis. PI( 4 , 5 )P 2 is also being generated 
by type II PIP kinase using PI5P as substrate. It plays 
an important role in signal transduction through 
phospholipase C-mediated cleavage into IP 3 and 
diacylglycerol as well as substrate in the PI3-kinase- 
mediated synthesis of PIP 3 . PI( 4 , 5 )P 2 is also important 
for the regulation of the actin cytoskeleton and ion 
channels. Additionally, it regulates clathrin-mediated 
endocytosis and exocytosis which is tightly controlled 
by the dephosphorylation of PI( 4 , 5 )P 2 through the type 
II phosphatase synaptojanin (McLaughlin et al. 2002). 

Phosphatidylinositol (3,4,5)-Trisphosphate 

Phosphatidylinositol (3,4,5)-trisphosphate is a quanti¬ 
tative minor PI lipid mainly localized at the plasma 
membrane. In unstimulated or serum-starved cells, 
PIP 3 exist in only trace amount which can rapidly and 
transiently accumulate upon a variety of external 
stimuli. PIP 3 is synthesized by the action of class 
I PI3-kinase using PI( 4 , 5 )P 2 as substrate. The cellular 
balance is maintained by tumor suppressor 3- 
phosphatase PTEN and 5-phosphatases SHIPl and 
SHIP2. PIP 3 is an important effector of signaling cas¬ 
cade controlling cell growth and proliferation through 
the protein kinase Akt/PKB as a direct downstream 
effector. Akt possess a PIP 3 -binding PH domain 
which induces a conformational change in the kinase 
upon PIP 3 -binding and therefore activates the enzyme 
(Vanhaesebroeck et al. 2001 ). 
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BMP 

bis-Monoacylglycerophosphate 

CERT 

Ceramide transfer protein 

ER 

Endoplasmic reticulum 
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Glycosyl-phosphatidylinositol 

Insig 

Insulin-induced gene 
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Low-density lipoprotein 

NPC 

Niemann-Pick disease type C 

OSBP 

Oxysterol-binding protein 

PA 

Phosphatidic acid 

PC 

Phosphatidylcholine 

PE 

Phosphatidylethanolamine 

PI 

Phosphatidylinositol 

PI(3,4,5)P3 

Phosphatidylinositol-(3,4,5)- 
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PI(3,5)P2 

Phosphatidylinositol-(3,5)- 
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PI(4,5)P2 

Phosphatidylinositol-(4,5)- 
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PI3P 

Phosphatidylinositol-3-phosphate 

PI4P 

Phosphatidylinositol-4-phosphate 

PITPs 

Pl/PC-transfer protein 

PS 

Phosphatidylserine 

SCAP 

SREBP cleavage-activating protein 
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Sphingomyelin 
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Sterol regulatory element-binding 
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Steroidogenic acute regulatory protein 
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StAR-related lipid transfer 
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Definition 

The processes by which lipids move among organelles 
by vesicular trafficking processes or by diffusion 
through the cytoplasm in association with carrier 
proteins. 

Introduction 

Lipids play many important roles in cell physiology. 
Lipid bilayer membranes provide the main barrier 
separating the interior and exterior of a cell, restricting 
the transport of ions and polar molecules in and out of 
the cell. Lipid bilayers similarly provide a permeability 
barrier between the cytoplasm and the interior of 
organelles. The lipid composition of biological mem¬ 
branes must be maintained in a range that allows rapid 
diffusion of proteins within the bilayer while 
maintaining the barrier function. Several lipid species 
can be acted upon enzymatically to produce signaling 
second messengers that activate downstream signaling 
networks. The organization of lipids in a membrane 
bilayer can also affect the signaling properties of 
receptors and thereby modulating intracellular signal 
transduction. Organelles within a cell maintain lipid 
compositions that are individually distinct, and aspects 
of these lipid differences are an important component 
of the “identity” of the organelles that is used by 
proteins to interact selectively with certain organelles. 
The distinct lipid composition of these organelles is 
maintained even though there is a very robust transport 
of lipids between organelles. The processes regulating 
lipid traffic and the mechanisms for establishing and 
maintaining lipid composition of organelles are 
discussed herein. This review will focus on mamma¬ 
lian cells, but similar trafficking occurs in all eukary¬ 
otic cells. 

Membrane Composition Varies Among 
Organelles 

Each type of organelle maintains a distinct protein 
composition, which allows the organelle to carry out 
its specific functions. There has been considerable 
study of the mechanisms for trafficking membrane 
proteins to specific organelles, and fhis selecfive Iraf- 
ficking is often based on peptide sequences in the 


cytoplasmic domains of proteins that are recognized 
by other proteins that form part of the scaffolds for 
budding transport vesicles or tubules from a parent 
organelle (Palade 1975; Bonifacino and Traub 2003). 
Lipid compositions also vary significantly among 
organelles (van Meer et al. 2008), but as discussed in 
a later section, the mechanisms for sorting lipids are 
not as well understood as for proteins. Some of the key 
differences in lipid composition are illustrated in 
Fig. 1. The plasma membrane is relatively enriched 
in sphingomyelin (SM), which is on the extracellular 
leaflet of the bilayer, and cholesterol (^30% of the 
lipid molecules). Phosphatidylserine (PS) is found on 
many organelles; on the plasma membrane, it is nearly 
exclusively on the cytoplasmic leaflet. External PS is 
found on apoptotic cells, and it is recognized by recep¬ 
tors on macrophages that engulf the dying cells (Savill 
and Fadok 2000). Phosphatidylinositol (PI), the pre¬ 
cursor of phosphoinositides, is synthesized primarily 
in the endoplasmic reticulum (ER) and typically rep¬ 
resents less than 15% of the total phospholipids found 
in eukaryotic cells. Phosphatidylinositol-4-phosphate 
(PI4P) and phosphatidylinositol-(4,5)-bisphosphate 
[PI(4,5)P2] make up the bulk of phosphoinositides in 
mammalian cells, with PI(4,5)P2 constituting ^1% of 
the phospholipids in the inner leaflet of the plasma 
membrane (Di Paolo and De Camilli 2006). 

Many lipids are synthesized in the ER and then must 
be transported to other cellular organelles. The ER 
contains lipids with a much higher level of 
unsaturation than the plasma membrane, and it main¬ 
tains a low level of cholesterol (^5% of lipid mole¬ 
cules). The ER is unusual in that it does not contain 
a significant transbilayer asymmetry for most of its 
lipids, indicating that there are mechanisms for flip¬ 
ping lipids across this membrane relatively rapidly 
(Sanyal and Menon 2009). 

The lipid composition shifts along the secretory 
pathway, as sterols and sphingolipids increase in con¬ 
centration from the ER through the Golgi to the plasma 
membrane. For example, trans-Golgi network (TGN)- 
derived secretory vesicles in yeast contain 2.3-fold 
more ergosterol and sphingolipids than the TGN 
(Klemm et al. 2009). The overall lipid composition of 
early endosomes is similar to that found in the plasma 
membrane, with some differences in the fatty acid 
structure. Cholesterol and SM, as well as PS, are highly 
enriched in the recycling endosome. Late endosomes 
contain large amounts of neutral lipids such as 
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triglycerides and cholesteryl ester, which are in the 
core of lipoproteins that are being digested. The mem¬ 
brane of late endosomes has a complex structure, with 
inward budding invaginations and detached internal 
vesicles. Bis-monoacylglycerophosphate (BMP, 15% 
of total lipids) is selectively enriched in the internal 
membranes of late endosomes (Gruenberg 2001; 
Kobayashi et al. 2001). Glycerolipids and 
sphingolipids are digested on these internal mem¬ 
branes to avoid damage to the limiting membrane 
(Kolter and Sandhoff 2010). 

The lipid composition in the mitochondria varies 
little among different cell types, with phosphatidylcho¬ 
line (PC) and phosphatidylethanolamine (PE) being 
the most abundant phospholipids (40% and 30% of 
total mitochondrial phospholipids, respectively), 
followed by glycerophospholipid cardiolipin (which 
is only found in mitochondria) and PI (10-15% of 
phospholipids), and phosphatidic acid (PA) and PS 
(5% of total mitochondrial phospholipids). 
Sphingolipids and sterols are not present in large 
amounts in mitochondrial membranes, although 
cholesterol, the precursor for steroid hormones, is 
delivered to mitochondria in steroidogenic cells 
(Osman et al. 201 1). 

Basic Mechanisms of Lipid Transport 

There are two major methods for transport of lipids 
between organelles; they can move as part of a vesicle 
or tubule, or they can be transported by nonvesicular 
processes through the cytoplasm. Since lipids are 
poorly water soluble, nonvesicular transport is medi¬ 
ated by carrier proteins that can carry lipids between 
membranes. A specialized type of nonvesicular trans¬ 
port involves exchange of lipids between organelles at 
sites of close apposition (Fig. 1). Each of these trans¬ 
port mechanisms will be described briefly. 

Vesicular Sorting 

There is good evidence for sorting of lipids as vesicles 
and tubules bud off from a parent organelle. For exam¬ 
ple, fluorescent lipid analogs follow different 
endocytic routes after internalization from the 
plasma membrane depending upon the degree of 
unsaturation and the length of their hydrocarbon 
tails (Mukherjee et al. 1999). Lipid-anchored 
proteins, which are associated with membranes via 


a glycosyl-phosphatidylinositol (GPI) anchor, follow 
different intracellular itineraries than most other lipid 
molecules (Mayor and Riezman 2004). Additionally, 
toxins (e.g., shiga or cholera toxins) bound to glyco- 
lipids follow distinctive pathways after internalization 
that allow the toxins to reach the ER, where they are 
able to use the cell’s protein translocation machinery to 
cross into the cytoplasm (Sandvig et al. 2004). 

The molecular basis for the sorting of lipids during 
membrane-trafficking processes is not well under¬ 
stood. In several cases, e.g., sorting of fluorescent 
lipid analogs and GPI-anchored proteins, changing 
cholesterol levels in cells alters the trafficking, indicat¬ 
ing that membrane biophysical properties that are 
affected by cholesterol content are part of the sorting 
mechanism. One suggestion is that coexisting mem¬ 
brane microdomains are present in the parent organelle 
and that one type of microdomain might be preferen¬ 
tially incorporated into the budding vesicles or tubules. 
For example, if more ordered microdomains (some¬ 
times called rafts) are enriched in a subset of proteins 
that are recruited by cytoplasmic coats on the budding 
membranes, then order-inducing lipids such as choles¬ 
terol and lipids with mostly saturated acyl chains 
would be selectively enriched in the budding transport 
vesicles (Fig. 2a). This mechanism has been suggested 
for lipid sorting in the biosynthetic secretory pathway 
(Simons and Ikonen 1997). 

Another possibility is that curvature preferences 
play an important role in lipid sorting. Sites of vesicle 
and tubule budding often involve regions of high mem¬ 
brane curvature. Lipid molecules can be nearly cylin¬ 
drical in their overall shape (i.e., the same radius at the 
headgroup and at the end of the acyl chain), which 
would lead to a preference for planar bilayers. Cone- 
shaped lipids (with small headgroup area) or inverted 
cone lipids (with large headgroups compared to the 
acyl chain area) will have preferences to be in curved 
membranes (Fig. 2b). While the preference of individ¬ 
ual lipid molecules for curved regions does not impose 
a strong selection, curvature can induce phase separa¬ 
tion in lipid mixtures in which the composition is close 
to a phase separation boundary (Maxfield and van 
Meer 2010). Thus, curvature preferences and forma¬ 
tion of microdomains may work synergistically to 
promote lipid sorting during vesicle formation. 

A third mechanism for lipid sorting would involve 
specific binding of lipid species to proteins that 
are incorporated into (or excluded from) forming 
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Lipid Trafficking in Cells, Fig. 1 Lipid distribution and move¬ 
ment among intracellular organelles. Phosphoinositides and 
special lipids involved in signaling and organelle recognition 
pathways are shown in ovals. PC, PE, PS, PI, and PA are 
synthesized in the ER and leave mainly hy vesicular transport. 
SM and glycosphingolipids are synthesized in the Golgi and also 
move by vesicular transport. Each organelle is color-coded 
to reflect the relative level of cholesterol in its membranes. 
(The cholesterol levels for the limiting membrane of the late 
endosomes and lysosomes are uncertain because of the choles¬ 
terol associated with lipoproteins and internal membranes 
that are enriched in bis-monoacylglycerophosphate (BMP).) 
Circulating low-density lipoprotein (LDL) particles carrying 
cholesterol and cholesteryl ester are internalized through the 

vesicles. There is little evidence that such a mechanism 
plays a general role in lipid sorting. However, there 
is mounting evidence that lipid-modifying 
enzymes are incorporated into forming vesicles. In 
particular, phosphatidylinositol kinases and phospha¬ 
tases alter the phosphoinositide properties of the newly 
formed vesicles so that they no longer match the 
phosphoinositide signature of the donor membrane 
and begin to take on the characteristics of the target 
membrane with which they will fuse (Di Paolo and 
De Camilli 2006). 


LDL receptors and transported to sorting endosomes. Free cho¬ 
lesterol, which is released upon cholesteryl ester hydrolysis in 
late endosomes and lysosomes, requires proteins NPCl and 
NPC2 to leave these organelles. It then moves to other organelles 
by poorly characterized mechanisms. Excess free cholesterol is 
esterified, and fatty acid sterol esters are packed into lipid 
droplets (LD). Pathways of intracellular sterol transport, both 
vesicular and nonvesicular, are shown. Phosphoinositides serve 
as organelle identity molecules: PI3P, phosphatidylinositol-3- 
phosphate; PI4P, phosphatidylinositol-4-phosphate; PI(3,5)P2, 
phosphatidylinositol-(3,5)-bisphosphate; PI(4,5)P2, phosphati- 
dylinositol-(4,5)-bisphosphate; and PI(3,4,5)P3, phosphatidy- 
linositol-(3,4,5)-trisphosphate. ERC, endocytic-recycling 
compartment 

Nonvesicular Transport 

Lipid molecules are held in membranes by noncovalent 
interactions and by the free energy associated with 
sequestering the hydrocarbon tails away from water. 
However, lipids in biological membranes are very 
dynamic, and they can have motions that transiently 
remove them partially from the bilayer, but complete 
spontaneous desorption is energetically unfavorable. 
It requires ^50 kJ/mol to remove cholesterol from 
membranes (Zhang et al. 2008) and ^80 kJ/mol 
to remove phospholipids (Marrink et al. 2009). 
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Cylinder (roughly equal head group and tail cross-sectional areas) 
no curvature preference; 
e g. phosphatidylcholine, sphingomyelin 

Inverted cone (larger head group than tail cross-sectional areas) 
prefers positive curvature; 
e g. lysophosphatidic acid, glycosphingolipids 

Cone (larger tail than head group cross-sectional areas) 
prefers negative curvature; 

e g. chole^erol, phosphatidylethanolamine, diacylglycerol 


Lipid Trafficking in Cells, Fig. 2 Lipid sorting by vesicular 
mechanisms. This figure illustrates two putative mechanisms for 
sorting lipids during the vesicle-budding process in membrane 
transport, (a) Proteins associated with a particular type of 
microdomain, in this illustration cholesterol-enriched ordered 
lipid domains (pink), may also associate with coat proteins that 
form a budding vesicle. This would enrich the new vesicle in the 
types of lipids in the microdomain, (b) The shape of lipids can 


affect their ability to be accommodated in regions of high 
curvature present at sites of vesicle budding. Cone-shaped lipids 
(red) might be accommodated in the cytoplasmic leaflet of the 
neck region of a vesicle bud, whereas inverted cone lipids 
(green) might be better accommodated in the lumenal leaflet of 
the neck region. Different curvature (and the associated lipid 
preferences) are found in the more distal parts of the budding 
vesicle or tubule 


Because of their poor solubility in water, lipid mole¬ 
cules, including cholesterol, must be associated with 
carrier proteins for efficient nonvesicular transport 
between organelles. Several such transport proteins 
have been identified. Members of the steroidogenic 
acute regulatory (StAR) protein-related lipid transfer 
(START) domain family of proteins have been 
shown to bind and transport cholesterol and some 
other lipids. The founding member of this family, 
StARD 1, transports cholesterol to the inner membrane 
of mitochondria, where it can be converted to steroid 
hormones in steroidogenic tissues (Strauss et al. 2003; 
Miller and Auchus 2011). Other members of the 
START family have been shown to bind cholesterol 
(Alpy and Tomasetto 2005; Lavigne et al. 2010) 
and may play an important role in transport. Another 
member of this family, ceramide transfer protein 
(CERT), binds and transports ceramide (Hanada et al. 
2009). 


The oxysterol-binding protein (OSBP) was identified 
as a protein that binds 25-hydroxycholesterol. A related 
family of proteins (Osh proteins) contains OSBP 
homology domains, and many of these can bind sterols 
and transport them between membranes in vitro. The 
significance of this transport in cells is not known (Faim 
and McMaster 2008). A family of Pl/PC-transfer pro¬ 
teins (PITPs) can transfer lipids between membranes, 
but their main function is to coordinate lipid traffic and 
lipid metabolism (Mousley et al. 2010). 

With their lower free energy for desorption from 
the membrane, sterols are a good candidate for 
nonvesicular transport, and there is evidence for sub¬ 
stantial nonvesicular sterol transport in yeast and in 
mammalian cells. For example, sterol in the endocytic 
recycling compartment, which contains about 35^0% 
of the cholesterol in a cultured fibroblast line, 
exchanges with other sterol pools in the cell with a ti /2 
of about 2-3 min (Hao et al. 2002). This suggests that 
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nonvesicular transport is the largest component of 
intracellular sterol transport. Similar conclusions 
were reached hy studying the transport of newly syn¬ 
thesized cholesterol from the ER to the plasma 
membrane (Maxfield and Menon 2006). In yeast and 
in mammalian cells, treatments that block vesicular 
transport did not greatly alter transport of sterol from 
the ER to the plasma membrane. Unfortunately, the 
identity of the major cholesterol carriers has not been 
elucidated. 

This rapid equilibration among organelles raises 
a question about how large differences in the sterol 
content of different organelle membranes is 
maintained (Mesmin and Maxfield 2009). Other mem¬ 
brane components of the various organelles can stabi¬ 
lize sterols to varying degrees, and this allows different 
concentrations of sterol to coexist in different organ¬ 
elles. For example, lipids with saturated acyl chains 
stabilize cholesterol in the membrane, and this type of 
lipid is enriched in the plasma membrane (high cho¬ 
lesterol) while unsaturated lipids predominate in the 
ER (low cholesterol). 

The ER is the most important organelle for sterol 
regulation (Brown and Goldstein 2009). Cholesterol is 
synthesized in the ER. Esterification of cholesterol for 
storage in lipid droplets is the cellular high capacity, 
rapid response system for dealing with excess choles¬ 
terol, and acyl CoA cholesteryl acyltransferase, the 
enzyme that esterifies cellular cholesterol, resides in 
the ER. The complexes containing Insig, sterol regu¬ 
latory element-binding protein (SREBP), and SREBP 
cleavage-activating protein (SCAP) also reside in the 
ER, and these proteins sense cholesterol levels and 
regulate the transcription of proteins involved in the 
uptake and synthesis of sterols. In high-cholesterol 
conditions, cholesterol binds to SCAP and Insig, lead¬ 
ing to retention of SREBP in the ER. When cholesterol 
levels fall, SCAP and SREBP are recruited to transport 
vesicles and delivered to the Golgi apparatus. Two 
sequential proteolytic cleavages of SREBP in the 
Golgi lead to release of a cytoplasmic fragment of 
SREBP, which contains a transcriptional regulator 
that increases transcription of genes involved in cho¬ 
lesterol synthesis and the gene for the LDL receptor. In 
order for this system to work effectively, the ER must 
be able to sense the cholesterol levels in other organ¬ 
elles such as the plasma membrane. Since there is little 
vesicular transport from the plasma membrane to the 
ER, it is likely that nonvesicular sterol transport is an 


important component of this regulation, but the iden¬ 
tity of the sterol carriers remains uncertain. 

Endocytic Uptake of Lipoproteins 

Nucleated cells can synthesize cholesterol and take it 
up by endocytosis of lipoproteins. Outside the central 
nervous system, LDLs are the main cholesterol car¬ 
riers; the core of LDL is packed with cholesteryl esters, 
and the outer surface contains the protein (mainly 
apoB) and a monolayer of lipids and cholesterol. 
LDL binds to the LDL receptor and enters the cell by 
receptor-mediated endocytosis (Brown and Goldstein 
1986). The LDL dissociates from its receptor in an 
acidified endosome, and the LDL receptor is recycled 
to the plasma membrane while the LDL is delivered to 
late endosomes and lysosomes and is degraded. 
Cholesteryl esters are hydrolyzed by lysosomal acid 
lipase. The unesterified cholesterol, which is very 
poorly soluble in water, is transported out of the lyso¬ 
somes by a process that requires the proteins NPCl and 
NPC2. It appears that cholesterol is first bound by 
NPC2, a soluble protein in the lumen of late 
endosomes. The cholesterol is then transferred to an 
N-terminal cholesterol-binding site on NPCl, 
a polytopic membrane protein in the late endosomes. 
It is unclear how NPC1 then facilitates the transport of 
cholesterol out of the late endosomes and lysosomes, 
but defects in either NPCl or NPC2 cause Niemann- 
Pick disease type C, a severe lysosomal storage 
disorder in which cholesterol and other lipids accumu¬ 
late in lysosomal storage organelles (Mesmin and 
Maxfield 2009). 

Transport of Newly Synthesized Lipids 

Most lipids are synthesized in the ER in mammalian 
cells, although significant lipid synthesis and 
remodeling also occurs in the Golgi apparatus and in 
mitochondria. An excellent review of transport of 
newly synthesized lipids has been presented recently 
(Blom et al. 2011). The predominant transport pathway 
for newly made phospholipids is by vesicular trans¬ 
port. (This must operate in such a way that the lipids 
with more saturated acyl chains are continually 
returned to the ER to maintain that organelle’s mem¬ 
brane characteristics.) 
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An important exception is ceramide, which can be 
transported by CERT, which has a ceramide-binding 
START domain. CERT also has a pleckstrin homology 
domain for targeting to Golgi membranes and a EE AT 
motif that binds the ER protein, VAP. Thus, CERT is 
able to efficiently shuttle ceramide from the cytoplas¬ 
mic side of the ER, where it is made to the cytoplasmic 
side of the Golgi, where it can be converted to 
glucosylceramide. It is not entirely clear how the 
glucosylceramide is flipped to the lumenal leaflet of 
the biosynthetic organelles, which is essential to con¬ 
version to more complex glycolipids. One proposal 
(Halter et al. 2007) is that glucosylceramide is returned 
to the ER by nonvesicular transport on EAPP2 and that 
it is then flipped in the ER membrane and transported 
by vesicular transport to the Golgi. This somewhat 
baroque pathway may be used to regulate SM and 
glycolipid synthesis since the activity of the 
nonvesicular transport processes can modulate 
proteins that are involved in other aspects of lipid 
metabolism (D’Angelo et al. 2008). 

Summary 

Lipids move rapidly among cellular organelles by 
a combination of vesicular and nonvesicular transport 
processes. Despite all of this trafficking, organelles 
maintain very different lipid compositions. The main¬ 
tenance of the unique lipid compositions of organelles 
requires sorting of lipids at many steps of vesicle 
formation. The underlying mechanisms for this lipid 
sorting are only partially understood. Some lipids 
(especially cholesterol) are transported by soluble 
carrier proteins among organelles. 
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Synonyms 

Mass action; Mass spectrometry 

Definition 

Systems level analysis and characterization of lipids 
and their interacting moieties (Wenk 2005). 

Introduction 

Lipidomics is a mass-spectrometry-based systems 
level analysis of cellular lipids (Ivanova et al. 2009). 
This requires the identification and quantitation of 
thousands of lipid molecular species. In addition, func¬ 
tion, interaction, and dynamics of lipids are in the 
focus of this discipline and also the understanding of 
the changes in lipid profiles that occur in response to 
perturbation of the system, such as diseases, pharma¬ 
cological treatment, or the targeted disruption of 
certain genes. 

As a discipline of systems biology, lipidomics 
attempts to obtain, analyze, and integrate complex 
data on the lipidome of cells, tissues, organs, biofluids, 
or organisms (“systems”). Together with other metab¬ 
olites such as amino acids, carbohydrates, and nucleo¬ 
tides, lipids form the metabolome of the system 
(► Metabolomics). The knowledge of lipid profiles 
should allow correlations with genome (DNA), 
transcriptome (RNA), proteome (proteins), and other 
metabolites of a system. As a technological platform, 
lipidomics faces the problem to determine levels of 
a large and structurally very heterogeneous group of 
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Lipidomics, Fig. 1 Structures of selected lipids: a plasmalogen and a phosphatidylcholine as membrane lipids, a prostaglandin as 
a lipid mediator, and a cholesterol ester as a storage lipid 


metabolites (Fig. 1) that can differ drastically in 
their abundance and their physical and functional 
properties (► Chemical Diversity of Lipids). Also, 
the subcellular distribution of a certain lipid can be 
critical for its function (van Meer 2005), for example, 
it can be necessary to determine its level in an intra¬ 
cellular compartment like the endoplasmic reticulum, 
a nerve ending, or the nucleus. Even if it is present on 
the inner or outer leaflet of a membrane might he 
important. 

Lipids 

Lipids are usually defined by common solubility prop¬ 
erties: The majority of lipids is only slightly soluble in 
water and can be extracted from biological sources 
with organic solvents. Efforts for standardization hy 
the LIPIDMAPS consortium (www.lipidmaps.org) led 
to a definition of lipids according to their biosynthetic 
origin and to their classification into eight different 
groups: fatty acyls (► Fatty Acids, Alkanols, and 
Diacylglycerols), glycerolipids (► Glycerolipids: 
Chemistry), glycerophospholipids, sphingolipids 
(► Sphingolipids and Gangliosides; ► Glycolipids), 
sterol lipids, prenol lipids, saccharolipids, and 
polyketides (Fahy et al. 2005, 2009). 


Membrane Lipids 

Lipids differ not only in their chemical structures hut 
also in the types of aggregates that they form in the 
aqueous environment of a living system 
(► Hierarchically Structured Lipid Systems). Amphi¬ 
philic lipids with cylindrical shape like phosphatidyl¬ 
cholines and sphingomyelins form lamellar phases and 
are found together with cholesterol as components of 
biological membranes (► Functional Roles of Lipids 
in Membranes). Membranes can contain more than 
1,000 different lipid species, and different cellular 
membranes maintain different lipid compositions 
(van Meer 2005). When embedded in bilayers, cone- 
shaped lipids with small head groups such as phospha- 
tidylethanolamine (PE) or phosphatidic acid (PA) can 
induce concave phase frontiers and a more dense pack¬ 
ing in planar bilayers. Dependent on their species or 
subcellular origin (► Lipid Trafficking in Cells), mem¬ 
branes differ in their total lipid composition, but can 
show different lipid compositions of their two leaflets 
(► Lipid Bilayer Asymmetry; ► Lipid Elip-Elop) and 
also between the apical and basolateral membranes of 
a polarized cell. Lateral differences in lipid composi¬ 
tions give rise to transient microdomains that account, 
for example, for differences in bilayer thickness and, 
subsequently, protein transport (► Lipid Domains). 
Although information on the spatial distribution of 
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membrane lipids can be important, most attempts in 
lipidomics aim currently at the determination of the 
total lipid composition of cells, tissues, or organelles 
and apply techniques where this information is lost. 

Mediator Lipids 

Distinguished from membrane lipids are highly bioac¬ 
tive lipids that are transiently formed in often minute 
concentrations. These can be extracellular mediators 
such as the eicosanoids, lysophospholipids, platelet 
activating factor, or in part, structurally very complex 
metabolites that arise from the oxidation of fatty acids. 
Structure elucidation of these lipids and determination 
of their concentration in complex matrices is 
a technological challenge in lipidomics and usually 
addressed in so-called targeted approaches. Assign¬ 
ment of structures can often not be achieved by MS 
alone, such as absolute configurations or double-bond 
geometries. 

Storage Lipids 

Triacylglycerols are used as storage forms of meta¬ 
bolic energy, and cholesterol esters are the storage 
forms of cholesterol. These lipids aggregate to lipid 
globules in an aqueous environment, and within cells, 
they are deposited in specialized storage organelles, 
the lipid droplets. These occur in large amounts in the 
cytoplasm of adipocytes, differ in size and lipid com¬ 
position, and are highly relevant for human diseases 
such as obesity, diabetes, and metabolic syndrome. 

Lipid Analysis 

Especially, the new developments in mass spectrome¬ 
try (MS; ► Mass Spectrometry) had a huge impact on 
lipid analysis and enabled lipidomics as a large-scale 


approach. They allowed not only the rapid quantifica¬ 
tion of different lipid classes but also the determination 
of complex molecular lipid species that arise by com¬ 
bination of different acyl (more general: “radyl”) 
chains (Fahy et al. 2005). Structure determination and 
quantification by MS occurs after different steps of 
sample preparation, lipid extraction, and eventually, 
one or more separation steps (Fig. 2). 

Sample Preparation 

Sample preparation includes steps like determination 
of wet- or dry weights of organs or tissues, of cell 
numbers, or protein- or DNA content for normalization 
purposes. To ensure reproducible results, sample prep¬ 
aration should be standardized and might rely on 
advanced methods that can be conducted in parallel, 
like immunomagnetic cell separation. Various 
methods for homogenization and subceiiuiar fraction¬ 
ation are available. In special cases, also derivatization 
of the analyte can be carried out, for example, the 
formation of methyl esters of fatty acids for separation 
by GC. 

Lipid Extraction 

Since lipids can have different physicochemical prop¬ 
erties including polarities, a comprehensive extraction 
of all lipid classes is not possible with only one solvent 
system. A popular method for lipid extraction has been 
established by Folch in 1957: Lipids are extracted by 
a mixture of chloroform and methanol followed by 
a partition step against an aqueous solvent. Under 
these conditions, most lipids are found in the organic 
phase, but more hydrophilic lipids like gangliosides 
are separated into the aqueous phase. Another method 
for lipid extraction that was introduced by Bligh and 
Dyer in 1959 is carried out by a monophasic mixture of 
chloroform, methanol, and water (1:2:0.8 v/v/v). 
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Changing the ratio to 2:2:1.8 (v/v/v) leads to a phase 
separation, where most lipids can be recovered from 
the organic phase. Quantitative lipid extraction by 
either of these or other methods is not always achieved; 
for example, the recovery of acidic lipids like phos- 
phatidic acid (PA) and phosphatidylglycerol (PG) can 
be less than 30% in classical Folch and Bligh-Dyer 
extraction, where these lipids can become tightly 
bound to proteins. Lipids can also be covalently 
bound to proteins; for example, ro-hydroxylated 
ceramides are bound to protein of the cornified enve¬ 
lope of the human skin (► Skin Lipids). Such lipids are 
only released after chemical cleavage of the 
lipid-protein linkage. Special conditions are required 
for extraction of more polar lipids. For example, 
phosphatidylinositol-phosphates (► Lipid Signaling 
and Phosphatidylinositols) are not easily extractable, 
sensitive to harsh conditions, and adhere tightly to 
most materials used in the procedures of lipid analysis 
like glassware and metals. 

Lipid Standards 

The addition of internal standards can be required 
for lipid quantihcation or measurement of lipid dynam¬ 
ics (Postle and Flunt 2009). Standard lipids (Moore 
et al. 2007) that can be distinguished from endoge¬ 
nously occurring lipids contain rare alkyl chain 
combinations or the stable isotopes ^FI or so that 
their miz values do not overlap with those of the 
analyte. For the analysis of structurally complex lipids 
like complex glycosphingolipids or gangliosides, stan¬ 
dards have to be chemically prepared in a laborious 
manner and are still not available for a number of 
analytes. 

Separation Techniques 

Lipid classes can be separated from crude lipid extracts 
by different methods. If ESI-MS is used as the key 
analytical technique, also, the direct infusion of 
unseparated lipid extracts into the mass spectrometer 
is common in the so-called shotgun lipidomics 
approach. Separation can be achieved by chromato¬ 
graphic techniques which are coupled online or offline 
to MS, such as GC/MS, LC/MS, and LC/MS/MS. This 
allows identihcation of lipids by their accurately deter¬ 
mined masses and retention times. A very robust tech¬ 
nique is thin layer chromatography (Fuchs et al. 2011), 
by which lipid classes are easily separated and quanti¬ 
fied after staining by densitometry. 


Sample Storage 

Samples are stored at low temperatures in an inert 
atmosphere, lipid extracts in diluted solutions. Expo¬ 
sure to air or enzyme sources might lead to lipid 
alteration or decomposition; especially, cholesterol is 
highly sensitive toward oxidation by oxygen with for¬ 
mation of oxy sterols. 

Mass Spectrometry 
Instrumentation 

In MS, mass to charge ratios (mIz values) are deter¬ 
mined with instruments that contain an ion source, 
a mass analyzer (or combinations of them), and an 
ion detector (► Mass Spectrometry). Time-of-flight 
mass (TOE) spectrometers use a mass analyzer in 
which m/z values are determined by measuring their 
flight time through an evacuated tube of a defined 
length. Ion-trap mass spectrometers allow tandem 
MS and MS" analyses of lipids. Limitations of this 
technique, such as limited mass accuracy and resolving 
power, are in part overcome by Orbitrap mass spec¬ 
trometers. Triple quadrupole mass spectrometers were 
especially valuable for lipidomics since they allowed 
precursor ion- and neutral loss scanning and quantita¬ 
tive analyses using the multiple reaction monitoring 
(MRM) mode (Blanksby and Mitchell 2010). Limita¬ 
tions are low resolving power, precision, and accuracy 
in m/z determination and that they are not suitable to 
perform MS" experiments with n > 2. Also, hybrid 
combinations of analyzers such as quadrupole-time of 
flight (Q-TOE) are in use. Specialized instruments 
applied to lipidomics are, for example, Eourier trans¬ 
form ion cyclotron resonance spectrometers, instru¬ 
ments for secondary ion MS (SIMS), or for ion 
mobility MS, which gives information about the 
molecular shape of the analyte (► Ion-Mobility Mass 
Spectrometry: Biological Applications). 

Ionization Methods 

The ionization technique most frequently applied to 
lipid analysis is electrospray ionization (ESI), but also 
other methods are used. 

Electrospray Ionization (ESI) 

ESI is a “soft” ionization method that can operate 
without fragmentation of the analyte molecules. 
While it is suitable for the analysis of many 
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amphiphilic membrane lipids, less polar lipids show 
generally poorer ionization efficiencies in classic ESI 
so that additional measures like adduct formation, 
chemical derivatization, or application of nano-ESI 
are required. Typical drawbacks of ESI include signal 
suppression in complex matrices and varying adduct 
patterns. Nano-ESI is a miniaturization of ESI with 
enhanced sensitivity and reduced signal suppression. 
Many lipids have been analyzed using ESI and nano- 
ESI also in automated and high-throughput 
applications. 

Atmospheric Pressure Chemical Ionization (APCI) 
and Atmospheric Pressure Photoionization (APPI) 

In APCI, charge is transferred onto the analyte by 
reaction with ionized carrier gas molecules. The ioni¬ 
zation conditions are less mild than ESI, and sensitive 
molecules show in-source fragmentation. APCI is suit¬ 
able for lipids of lower polarity, and the solvent plays 
a less dominant role for ionization than in ESI. This 
allows measurement of lipids dissolved in organic 
solvents such as chloroform or hexane. APPI is 
a modification of APCI, which has been reported to 
be more sensitive and efficient than APCI. 

Matrix-Assisted Laser Desorption/Ionization 
(MALDI) 

In MALDI, the analyte is placed onto an UV-absorbing 
solid matrix of aromatic acids, which are 
cocrystallized with the sample. Irradiation of suitable 
positions on the target with a laser of appropriate 
wavelength causes analyte desorption and ionization. 
MALDI has also been applied to the analysis of lipids 
and in particular glycolipids (Euchs et al. 2010). 
MALDI is a qualitative method that normally provides 
no quantitative results. However, it is more tolerant to 
salts and other disturbing components and also allows 
imaging MS from tissue sections. 

Imaging Techniques 

Imaging MS is a technique to visualize the distribution 
of molecules within tissue sections. Disadvantages of 
MALDI are the limited lateral resolution of 20-50 pm, 
unfavorable signal to noise ratio for low molecular 
weight analytes, and long acquisition times. Secondary 
ion MS (SIMS) is suitable for imaging MS of lipids 
with lateral resolution in the submicrometer range. 


Desorption electrospray ionization (DESI) has 
been used to create two-dimensional images of the 
distribution of lipid species in human tissues. In addi¬ 
tion to MS approaches, also coherent anti-Stokes 
Raman scattering (CARS) microscopy (► Infrared 
Spectroscopy of Membrane Lipids) is used to deter¬ 
mine the special distribution of nonlabeled lipids 
(Rinia et al. 2008). 

Tandem Mass Spectrometry 

The generation of fragment ions for structure elucida¬ 
tion and quantification is called tandem MS. It can be 
carried out either as tandem-in-space using combina¬ 
tions of a quadrupole and another mass analyzer, such 
as a triple quadrupole, or as tandem-in-time using an 
ion trap. Ion traps allow multiple MS (MS"), and 
a triple quadrupole (TQ) allows different MS/MS 
scan modes, especially product ion scans, precursor- 
or parent ion scans, and neutral loss scans. TQ mass 
spectrometers have been frequently used for lipid anal¬ 
ysis. The second of their three quadrupoles can be 
filled with an inert collision gas for collision-induced 
dissociation (CID) of the analyte molecules. This setup 
allows product ion or daughter ion scans, which 
produce fragments from a defined parent ion. In 
a parent (or precursor) ion scan, all parent ions that 
give rise to a certain fragment are registered. This scan 
mode is used to identify lipids with common head 
groups or backbones (Eig. 3). In a neutral loss scan, 
miz values of parent ions that produce daughter ions by 
the loss of a defined uncharged molecule are regis¬ 
tered. This can be the loss of a certain fatty acid in 
the analysis of phospholipids, or the loss of a sugar in 
the analysis of glycolipids. These scan modes or com¬ 
binations of them are also used for quantification of 
lipids in comparison usually to isotope-labeled stan¬ 
dard substances of known concentration. 

Multiple parent ion scan (MPIS) of head group 
and backbone (fatty acids) fragments are used for 
the determination of phospholipid molecular species 
from crude lipid extracts. Another important feature 
for lipid quantification by LC/MS/MS approaches is 
multiple reaction monitoring (MRM). Here, multiple 
combinations of specific precursor ion/product ion 
transitions can be monitored and used for the 
rapid and sensitive quantification of molecular lipid 
species. 
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Fig. 3 Diagnostic fragments 
for the identification of 
selected phospholipid classes 
by tandem MS. Phospholipids: 
phosphatidic acid (PA), 
phosphatidylcholine (PC), 
phosphatidylethanolamine 
(PE), phosphatidylglycerol 
(PG), phosphatidylinositol 
(PI), and phosphatidylserine 
(PS). Ion mode (+ or —), scan 
mode (PIS, parent ion scan; 
NL, neutral loss scan), and 
structure of fragments are 
given. For details, see (Hsu 
and Turk 2005) 
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Shotgun Lipidomics 

The combination of electrospray ionization and 
tandem MS is frequently used for direct lipid quantita¬ 
tion from crude extracts. Changing the experimental 
conditions during ESI enables intrasource separation 
of lipid classes (Han and Gross 2005) and the structural 
and quantitative analysis of molecular lipid species. 
Limitations are signal suppression effects during MS 
ionization. 


Bioinformatics 

The complexity of the lipidome and its regulation 
requires bioinformatic approaches for data processing 
and lipid quantitation, statistical data analysis, path¬ 
way analysis, and modeling of systems (Niemela 
et al. 2009). For example, isotope correction of peak 
intensities is needed to get more accurate and precise 
data for monoisotopic masses. 
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Applications 

Although the “lipidome” comprises the complete 
lipid profile, it is still not possible to identify and 
quantify the whole structural variety of the different 
lipids present in most systems by one or few 
methods. Nearly complete lipid compositions were 
determined in synaptic vesicles or viruses, and 
a growing number of lipids were determined in 
yeasts and many other sources using shotgun 
approaches or LC/MS. Targeted lipidomics using 
LC/MS/MS in combination with totals synthesis (to 
determine stereochemistry) and bioinformatics led to 
the identification of specialized proresolving media¬ 
tors of inflammation, the resolvins, protectins, and 
maresins (Bannenberg and Serhan 2010). Many stud¬ 
ies have become available that correlate lipid profiles 
with human diseases and physiological processes 
(Oresic et al. 2008; Bou Khalil et al. 2010), and 
challenges for an improved understanding of the 
biological role of lipids are currently addressed 
(Wenk 2010). 
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Introduction 

Pure lipid preparations in the range of 1-100 mg are 
commonly required for biophysical studies. This 
contrasts with the microgram requirements of 
analytical and biochemical methods. Structure studies 
by X-ray (Caffrey 1989) and NMR (► X-Ray Scatter¬ 
ing of Lipid Membranes, ► NMR) are especially 
demanding regarding the lipid amounts. As a result, 
scarce lipid sources such as cell cultures, which are 
appropriate using the microscale procedures in bio¬ 
chemical studies, cannot be applied. The capability to 
purify abundant and sufficiently purified mg samples is 
a critical prerequisite for biophysical biological lipid 
studies. 

For amounts required by biophysical studies, the 
extraction of lipids from biological sources is 
challenged by chemical synthesis, which can be scaled 
up into the range of several hundreds of milligram. 
However, synthetic lipids form a single molecular 
species in contrast with biological lipids which are 


comprised of a variety of molecular species with 
different fatty acyl chains. Indeed, a single molecular 
structure exhibits clear-cut phase cooperativity at the 
thermotropic and lyotropic transition lines of the 
phase diagram, in contrast with the broad transitions 
undergone by polydisperse lipid classes extracted from 
biological sources (Koynova and Caffrey 1994; 
Caffrey 1989; for example, see www.lipidat.tcd.ie) 
(► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal Titra¬ 
tion Calorimetry (ITC) of Lipid Bilayers). The inter¬ 
pretation of biophysical observations for synthetic 
lipids takes great advantage of the characteristic 
values detected in the narrow intervals upon which 
the molecular regime is changed. However, studies 
conducted with synthetic lipids may not be reliable to 
judge the behavior of a biological material. Compari¬ 
son and confirmatory studies are required to establish 
the relevance of synthetic lipids to more complex 
mixtures. 

A first step in the preparation of biological lipids is 
the selection of appropriate sources. Ideally, the source 
should be abundant, commercially available, and 
sufficiently enriched in the lipid of interest (Christie 
1993). The conditions are fulfilled for most neutral 
lipids extracted from oil, milk, or storage tissues 
(e.g., white adipose tissue, oil processed from seeds 
and nuts). Polar lipids may be prepared as by-products 
of “degumming” these crude oils prepared for neutral 
lipids extraction in a prepurified grade on an industrial 
scale. Technologies of soybean, canola, and sunflower 
are examples of trait-enhanced oilseeds with alteration 
of the molecular species profile (Flickinger 2007). 
Preparation from animal tissues with a particular lipid 
metabolism may be also a potential source. For an 
example, fatty fishes of cold oceanic areas have been 
investigated for their rich content in polyunsaturated 
omega-3 lipids (► Essential Fatty Acids), lipophilic 
vitamins A and D, and antioxidants (Cabrini et al. 
1992). To identify a source of particular lipids, the 
empiric knowledge of an activity of many crude lipid 
preparations is generally taken in historical and medic¬ 
inal practices. Interesting physicochemical activities 
can be also traced back with the specialist eye in the 
empiric usages of the culinary practices and recipe, 
cosmetics, lubricants, and even as partially defined 
reagents in the laboratory. The use of lipids extracted 
from their biological source points to remarkable bio¬ 
physical properties. Rich sources have been identified. 
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and a list would provide a historical perspective on the 
efforts of the preparative biochemistry of lipids during 
the last two centuries. A few examples are illustrative 
of such efforts as egg yolk containing lecithin used 
as an emulsiher; spermaceti with cosmetics usage 
(spermaceti is a mixture of palmitoyl-cetyl ester 
and triglycerides which acts to modify the buoyancy 
in the sperm whale head during diving) (Clarke 1970; 
Clarke 1978); the central nervous system white matter 
used as a binder of sauces which contains the heat- 
resistant sphingomyelin of myelin sheath (their 
usage has been proscribed after the bovine spongiform 
encephalitis episode); the so-called cephalin, a mixture 
of phosphatidylethanolamine and phosphatidylserine 
extracted from the central nervous gray matter which 
was used mixed with kaolin or cholesterol for coagu¬ 
lation tests (Pohle and Stewart 1941; Chargaff 1937); 
gangliosides named after extraction from the spinal 
ganglia (Burmaster 1946); cerebrosides from a dry 
extract called “protagon” comprised in the “mechani¬ 
cal mixture” of lipids prepared from the ox brain 
(Gamgee and Blankenhorn 1879; Pearson 1914; 
Robson 1951) (► Glycosphingolipids); beef liver 
lipid extracts containing most of the glyceropho- 
spholipid classes (Spencer and Schaffrin 1964; Youngs 
and Comatzer 1963), sterol intermediate metabolites 
(e.g., the precursor of cholesterol, 7-dehydrocho- 
lesterol), and bile acids; and adrenal glands that have 
also been a long-time source for crude steroid prepa¬ 
rations cited in the pharmacopeia before the “BSE” 
epizooty. 

The selection of unexpected sources of lipids with 
a potential biophysical application requires a detailed 
review of the literature. For this purpose, the query in 
the large medical data banks for storage diseases 
(OMIM www.ncbi.nlm.nih.gov/omim and Orphanet 
http://www.orpha.net) is especially fruitful for identi¬ 
fying the multigram accumulation of glycolipids (e.g., 
preparation of GM2 from the spleen of Gaucher 
patients affected by lysosomal dehcit). Similarly, the 
accumulation of very long-chain fatty acids (24-26 
carbon atoms) or branched fatty acid such as phytanic 
acid can be exploited in the peroxisomal deficits. Fluo¬ 
rescence studies with lipid molecular probes (e.g., 
parinaric acid) have also benefited from the availabil¬ 
ity of botanical data banks to pinpoint a particular 


source (e.g., the nut of Parinarium glaheirimum from 
Fiji and Palau Islands as a rich source of the conjugated 
tetraene fatty acid (Sklar et al. 1981)). 

Hemi-synthesis starting from prepurified biological 
lipid is an alternative workflow combining the 
advantages of abundant starting material with 
efficient enzyme transformation. The method may be 
illustrated by two examples: (1) phosphatidyl-ethanol- 
amine, -glycerol, -ethanol, and -butanol can be pre¬ 
pared by trans-phosphatidylation from the abundant 
egg yolk phosphatidylcholine by phospholipase 
D (Yu et al. 1996) and (2) asymmetric molecular 
species of PC can be prepared by phospholipase A2 
deacylation followed by reacylation with a short-chain 
fatty acid (Huang 1990). The asymmetric phosphati¬ 
dylcholine shows various interdigitated bilayer 
arrangements (► Lipid Bilayer Asymmetry). Notice¬ 
ably, hemi-synthesis preserves the stereochemistry 
characteristics of the biological compound. 

Extraction using Soxhlet methodology is appropri¬ 
ate for milligram to gram lipid preparations. The 
derived methods are iterative processes where 
a solvent heated continuously to reflux is condensed 
and repeatedly drained through the lipid containing 
sample (Luque de Castro and Priego-Capote 2010). 
A broad variety of devices are available commer¬ 
cially to allow small and large solvent volumes and 
warm or hot extraction. Importantly, a variety of 
solvents with safe and minimum volumes (chloro¬ 
form, hexane or cyclohexane, petroleum ether, 
diethyl ether) can be used under an inert atmosphere 
preventing oxidation of the unsaturated lipids. The 
hydrolysis of the raw homogenized lipid-containing 
material as well as microwave- and ultrasound- 
assisted extraction can be processed in the same 
vessel that is eventually used for solvent extraction. 
The process can be automated and monitored by 
a computer to allow the unattended and time- 
consuming (several hours) repetitive operation. The 
functioning is depicted in the website of equipment 
providers (e.g., see http://www.buchi.com and http:// 
www.foss.dk). 

Methods derived from “solvent partition” allow 
many research laboratory-scaled preparations of 
microgram to mg lipid amounts with a clear-cut pref¬ 
erence. The partition into apolar solvent of amphiphilic 
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lipids (e.g., anionic phospholipids) is favored by 
changing pH and salt composition of the 
hydroalcoholic mixture separating after the more 
apolar solvent partition. Because a concentrated 
homogenate may be used as a starting material for 
a “quantitative” preparation, a difficulty arises if abun¬ 
dant tissue proteins accumulate at the interface (Folch 
et al. 1957). The method has been adapted to diluted 
aqueous suspensions (Bligh and Dyer 1959). Chloro¬ 
form (and chlorinated solvents) partition, although 
established as a reference method, requires unaccept¬ 
able volumes of toxic solvent for scaling up to 
a preparative scale (Matyash et al. 2008). An alterna¬ 
tive method has been proposed, using the 
nonchlorinated solvent methyl-rert/o-butyl ether as 
the apolar phase (Matyash et al. 2008). In contrast 
with chloroform, the low density of this solvent allows 
the extraction of lipids from the upper layer that can be 
collected easily by automated robot pipette without 
risk of a contamination by proteins around the needle 
pipette. 

Specific methods have been also set up for indus¬ 
trial preparation of particular lipids. For example, the 
soy lecithin fraction is obtained in the process of 
degumming crude soy oil with a purity around 20% 
phosphosphatidylcholine, 20% phosphatidylethanol- 
amine, and 10% phosphatidylinositol. The phospho¬ 
lipids are mixed with mucilaginous compounds to 
form the so-called gum of soybean oil. The world 
production of soy lecithin fraction is estimated 
between 200,000 and 300,000 t (http://www. 
soyinfocenter.com) with major industrial uses related 
to their physical property as a food additive in marga¬ 
rine for antispatter and as an emulsifier, in chocolates, 
caramels, and coatings, to control viscosity, crystalli¬ 
zation, and sticking, and in instant foods such as cocoa 
powders, coffee creamer, and instant breakfast for 
wetting, dispersing, and emulsifying. The crude leci¬ 
thin surfactant preparation is also used in cosmetics, 
pharmaceuticals, coatings (paints, magnetic tape coat¬ 
ings, waxes), plastic and rubber industry, glass and 
ceramic processing, paper and printing, masonry and 
asphalt products, petroleum industry, metal 
processing, pesticides, adhesives, textiles, and 
leathers. Such a broad spectrum of applications under¬ 
lines the usefulness of pilot biophysical studies 


conducted on the biological lipid extracts at a low 
grade of chemical purity, in contrast with research- 
oriented studies. 

Conclusions 

For detailed methodological information, the reader is 
referred to the book collection entirely dedicated to 
lipids such as the Oily Press Lipid Library (published 
by PJ Barnes & Associates, PO Box 200, Bridgwater 
TA7 OYZ, UK). A few websites are also carefully 
maintained and are highly recommended for their 
pages describing preparation methods: www. 
cyberlipid.org (Ed. C. Leray), www.lipidlibrary.aocs. 
org (Ed. W.W. Christie), www.lipidomicnet.org 
(European Lipidomics Initiative), and www. 
Iipidmaps.org (Ed. E.A. Dennis). 
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Lipopolysaccharides: Physical Chemistry 

Klaus Brandenburg and Thomas Gutsmann 
Research Center Borstel, LG Biophysics, Borstel, 
Germany 


Synonyms 

Endotoxins; LPS 


Definition 

Lipopolysaccharides (LPS) belong to the class of 
glycolipids and are membrane-forming constitu¬ 
ents of the outer membrane of Gram-negative 
bacteria. 


Basic Characteristics 

Basics 

Whereas in most cellular membranes the main amphi¬ 
philic compounds constituting the lipid matrix are 
phospholipids, in many membrane systems also, 
other lipid species are found. To this class belong 
glycolipids, which have a sugar moiety as head 
group, ranging from mono- to polysaccharides. 
Due to their particular physicochemistry, these mole¬ 
cules might have a variety of different functions 
such as cell recognition, adhesion, and signaling. 
One important glycolipid is lipopolysaccharide (LPS, 
endotoxin) from Gram-negative bacteria, which 
constitutes the outer leaflet of their outer membrane. 
LPS is made responsible for the high permeability 
barrier of the outer membrane, in particular against 
hydrophobic drugs, and is - after the release from 
the bacteria due to cell division or the action of com¬ 
ponents of the defense system or antibiotics - one of 
the strongest stimulators of the human immune 
system known. This property may be beneficial 
at low LPS concentrations but lead to severe patho¬ 
physiological effects at high LPS concentrations 
such as sepsis, severe sepsis, and endotoxic shock 
with high death rates at critical care units (Rietschel 
et al. 1994). 


LPS Chemistry 

The chemical structure of enterobacterial LPS 
can be divided into three parts, the lipid 
A moiety, the “endotoxic principle” of LPS, the 
oligosaccharide core differing for the various 
rough mutant LPS (R-LPS) and the 0-antigen 
(smooth form S-LPS, see Fig. 1 for strains from 
Salmonella enterica serovar Minnesota). The chem¬ 
ical heterogeneity is low for the first part (“conser¬ 
vative structure”), medium for the second part, and 
high for the latter. The immunostimulatory activity 
is nearly identical for all rough mutant and S-LPS, 
which implies that the lipid A moiety is the deci¬ 
sive unit for bioactivity (Rietschel et al. 1994). 
Lipid A from enterobacterial LPS consists 
of a diglucosamine moiety phosphorylated at 
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Lipopolysaccharides: 
Physical Chemistry, 

Fig. 1 Chemical structures of 
lipopolysaccharides from 
rough mutant Re through Ra 
from Salmonella enterica 
serovar Minnesota 
{parenthesis: strains). Linked 
to LPS is a polysaccharide 
portion (0-antigen) having 
a high heterogeneity for 
different strains and genera 



position 1 and 4' and acylated in positions 2, 3, and 
2',3' with up to seven hydroxyl fatty acids, mostly 
of length Cl4. 

LPS Physicochemistry 

The critical micellar concentration (CMC), i.e., the 
concentration at which the aggregates dissolve into 
monomers, can be estimated to be lower than the 
nanomolar range (Gutsmann et al. 2007). Since at 
these concentrations the biological activities start, it 
can be concluded that an aggregated state of LPS is the 
prerequisite for bioactivity. This has been proven actu¬ 
ally, as shown in Fig. 2, in which the tumor-necrosis- 
factor-a-inducing capacities in human mononuclear 
cells of lipid A (left) and deep rough mutant LPS 
(right) as monomers and as aggregates are compared, 
under different conditions of serum and also in 
the presence of lipopolysaccharide-binding protein 
(LBP) (Mueller et al. 2004). Differing results in 
literature, in which monomers were described to be 
the active unit, were obtained with inadequate 
methods, e.g., by using metabolically ^FI-labeled 
LPS, which has severe restrictions due to 
radiation damage. The details of these investigations 
are discussed in reviews (Gutsmann et al. 2007; 
Brandenburg et al. 2010). 


The number of acyl chains governs decisively the 
bioactivity of LPS, with complete activity for hexa- and 
hepta-acyl lipid A, only very low activity for penta-acyl 
lipid A and no activity for tetra-acyl lipid A. For an 
explanation of these surprising results, the molecular 
conformation of the different species must be consid¬ 
ered: It could be shown that hexa- and hepta-acyl lipid 
A adopt cubic inverted aggregate structures, leading to 
a conical shape of the single molecules within the 
aggregates - the hydrophobic part has a higher cross 
section than the hydrophilic part. The two lower acyl¬ 
ated lipid A adopt multilamellar aggregate structures 
characteristic for a cylindrical shape of the single mol¬ 
ecules, concomitant with absence of agonistic, but 
exerting antagonistic, activity against the action of 
agonistic LPS. In conically shaped lipid A, the 
diglucosamine backbone has a high inclination angle 
with respect to the direction of the acyl chains, whereas 
in cylindrically shaped lipid A, there is no or only 
a small inclination angle. Also the binding of the acyl 
chains to the two glucosamines is of relevance, 
the 4/2 binding pattern (4 to the non-reducing and 
2 to the reducing glucosamine part) leads to high, 
whereas the 3/3 configuration leads to no agonistic, 
but antagonistic, activity (Fig. 3). A summary of 
these interrelationships is demonstrated in Fig. 3 for 
selected compounds with variations in the lipid 
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Lipid A concentration / M 

Lipopolysaccharides: Physical Chemistry, Fig. 2 Induction 
of tumor-necrosis-factor-a in human mononuclear cells by 
lipid A and LPS Re from Salmonella enterica serovar Minnesota 
Re (strain R595) by monomers {dark bars) and aggregates 
(light bars) at concentrations of 0.1 and 1 mM. N. D. not 



control 10“'° 10“° 

Re LPS concentration / M 


detectable, (a) Serum-free, (b) in the presence of 100 ng/ml 
LBP, (c) in the presence of 4% AB serum. The monomers 
were produced by a dialysis chamber (With permission 
by ASBMB) 


A acylation pattern (Gutsmannetal. 2007; Brandenburg 30-37°C takes place for all enterobacterial LPS, 
et al. 2010). whereas it lies at 45°C for free lipid A. This behavior 

A further physicochemical parameter possibly could explain the observation that lipid A has a lower 
relevant for biological activity is the acyl chain melting bioactivity than parent LPS of one to two orders of 
behavior. It was found that a gel to liquid crystalline magnitude due to the much lower mobility or “fluidity” 
phase transition of the acyl chains in the range of its acyl chains at 37°C, impeding the interaction 
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Lipopolysaccharides: Physical Chemistry, Fig. 3 Chemical structure, small-angle X-ray scattering patterns, molecular confor¬ 
mation, and agonistic and antagonistic activity of various lipid A with different acylation patterns 


with binding proteins (Brandenburg et al. 2010). 
However, a high fluidity per se does not necessarily 
lead to biological activity; the basic prerequisite is 
the conical molecular conformation as illustrated 
in Fig. 3. 


Cross-References 

► Bacterial Lipopolysaccharide, OPS, and Lipid A 

► Lipid Organization, Aggregation, and Self- 
assembly 
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► Membrane Fluidity 

► Polarized Infrared Spectroscopy 

► X-Ray Scattering of Lipid Membranes 
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Synonyms 

In vivo imaging; Single-molecule imaging 

Definition 

Live-cell single-molecule imaging refers to the inves¬ 
tigation of single fluorescent molecules in living cells. 
This process can be used to determine the localization, 
dynamics, and structure of biomolecules in cells. 

Basic Characteristics 

The first single molecules were observed by absorption 
(Kador et al. 1999) and fluorescence emission 
(Orrit and Bernard 1990) over 20 years ago. The pos¬ 
sibility of examining one molecule at a time has 


enabled investigations of properties free from ensem¬ 
ble averaging in heterogeneous systems, as well as 
sub-diffraction-limited (superresolution) imaging of 
dynamics and structure. With information attainable 
on the scale of a few nanometers, one very important 
application of single-molecule fluorescence (SMF) 
imaging has emerged as the ability to investigate 
biological samples, and in particular, to examine 
molecules within live cells. 

Advantages 

Fluorescence imaging, which can be performed in an 
optical microscope under ambient conditions, is an 
important instrument for cell biology as light can be 
used to noninvasively probe a sample with relatively 
small perturbation of the specimen. A host of bright 
probes and attachment schemes have been developed 
to label biomolecules with high specificity (cf. ► Pro¬ 
tein Fluorescent Dye Labeling; ► Fluorescence Label¬ 
ing of Nucleic Acids); these labels include genetically 
encoded fluorescent proteins and organic dyes. Identi¬ 
fying specific molecules of interest is therefore 
relatively straightforward in fluorescence microscopy. 
Live-cell SMF imaging is often done in a ► total inter¬ 
nal reflection fluorescence microscopy (TIRFM) con¬ 
figuration, though bacteria cells, with their thinner 
diameter, can be visualized in a widefield setup. 

Live-cell imaging has several important advantages 
for biological studies. In particular, it enables investi¬ 
gations of dynamics within the physiologically 
relevant, crowded environment of the cell. The dynam¬ 
ics that have been studied in live cells range from 
tracking the movement of single proteins (► Single- 
Particle Tracking) to visualizing morphological 
changes in superstructures such as polymerized pro¬ 
teins (► Photoactivated Localization Microscopy 
(PALM)). By enabling dynamical investigations, 
live-cell imaging can also elucidate responses to stim¬ 
uli and expression of novel proteins during the course 
of the cell cycle. Furthermore, even when looking at 
quasi-static structures in cells, live-cell imaging is free 
from the artifacts that have been found to plague 
chemically fixed cells or the ice damage that can 
arise from plunge-freezing. 

The dynamics in a cell can be used to improve 
experimental conditions. When SMF imaging is com¬ 
bined with single-particle tracking, a mobile molecule 
is localized in several distinct positions over the course 
of an experiment, reducing the need for overexpression 
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Superresolution 
Single-particle Imaging of Quasi- 



Proteins 


Live-Cell Single-Molecule Imaging, Fig. 1 Relevant timescales for single-molecule fluorescence imaging in live bacterial cells 


of that molecule. In addition, the dynamics of mole¬ 
cules can he used in post-processing algorithms to 
differentiate between highly mobile particles (e.g., 
freely diffusing cytoplasmic proteins) and less rapidly 
moving particles (e.g., membrane- or structure-bound 
proteins). 

Challenges 

Live-cell SMF experiments require some particular 
controls and considerations. In particular, it is of 
utmost importance to verify that the labeling and imag¬ 
ing conditions in a living cell are non-perturbative. 
Significant overexpression must be avoided; fortu¬ 
nately this is simple in SMF experiments where only 
a small number of fluorescently tagged molecules 
is desired. However, even at low expression levels, 
labeling can lead to mislocalizations, which must be 
ruled out, at the very least, by comparing live-cell 
results to results obtained in bulk-level antibody- 
stained fixed cells, electron microscopy, biochemical 
studies, etc. SMF investigations generally use low- 
intensity continuous-wave (CW) lasers, but one must 
still determine that these conditions do not heat or 
damage cells. In prokaryotic cells, this can be done 
by verifying that the bacteria still grow and divide 
following imaging. 

While preparing a live-cell sample, the options for 
labeling are more limited than in vitro or in fixed cells. 
Fluorescent tags must be genetically encoded or pene¬ 
trate the cell membrane. In particular, antibodies, 
which are very large and can polymerize, are not 
generally suitable for live-cell imaging. Most work to 
date on SMF imaging in live cells is enabled by fusions 
between a protein of interest and a fluorescent protein 
tag. Such a labeling scheme provides high specificity. 


but careful attention must be paid to the effect of the 
fluorescent protein, which can be as large as the protein 
of interest, and which may tend to dimerize with other 
fluorescent proteins. Several small-molecule labeling 
schemes have emerged, including some that are com¬ 
patible with live-cell imaging, yet it certainly remains 
a challenge in live cells to investigate biomolecules 
other than proteins because of the limited number of 
labeling schemes. 

Timescales 

Because the sample is a dynamic entity, the timescale 
of a live-cell experiment must be taken into careful 
consideration. There exists a trade-off between the 
spatial resolution for single-molecule localizations, 
which improves beyond the standard diffraction limit 
roughly as one over the square root of the number of 
detected photons (Thompson et al. 2002; ► Fluores¬ 
cence Imaging with One Nanometer Accuracy), and 
the time resolution. This trade-off is further 
compounded for single-molecule-based superresolution 
imaging, such as photoactivated locaization microscopy 
(PALM), where the resolution of an image is defined as 
the separation distance between neighboring localiza¬ 
tion events; this resolution is improved by increasing 
the number of fluorophores that are localized and there¬ 
fore the number of imaging frames. Therefore, though 
the electron-multiplying charge-coupled detector 
(EMCCD) cameras typically used can operate at as 
fast as ^50 frames/s, very fast processes may not be 
resolved with SMF imaging. However, slower (longer- 
exposure) experiments are not possible in live-cell 
imaging as most processes are dynamic and a compro¬ 
mise must therefore be found. Figure 1 shows the time- 
scales of some typical processes in bacterial cell 
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imaging; SMF experiments can resolve events that 
occur on the timescale of approximately 10 ms -10 min. 

Applications 

Live-cell single-molecule imaging has been success¬ 
fully applied to prokaryotic and eukaryotic cells. 
Experiments on bacterial cell imaging (Biteen and 
Moemer 2010), protein expression in bacterial cells 
(Xie et al. 2008), and single-particle tracking in mam¬ 
malian cells (Patterson et al. 2010) have been reviewed 
extensively. The majority of these experiments are 
done on proteins genetically fused to a fluorescent 
protein, which may be photoswitchable to permit 
high-resolution structural reconstructions (Biteen 
et al. 2008) or tracking of multiple single particles 
(Shroff et al. 2008). These experiments have led to 
discoveries including quantifying protein motion and 
diffusion, measuring superstructures, investigating the 
expression of low copy-number proteins, and evaluat¬ 
ing transport in membranes and cells. 

Cross-References 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Labeling of Nucleic Acids 

► Photoactivated Localization Microscopy (PALM) 

► Protein Fluorescent Dye Labeling 

► Single-Particle Tracking 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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Synonyms 

Light, oxygen, or voltage domains (LOV) 


Definition 

LOV proteins are UVA/Blue-light photoreceptors 
employing LOV (light, oxygen, voltage) domains as 
photosensory modules. They are wide spread among 
plants, fungi, bacteria, and archaea. LOV domains are 
a/p folds of ca. 110 amino acids and bear a fully 
oxidized FMN (flavin mononucleotide) chromophore, 
noncovalently bound in the dark-adapted state of the 
protein. Upon photoexcitation, FMN forms a covalent 
bond with a nearby cysteine (between the cys-SH 
group and position 4a of the flavin) during the decay 
of the FMN triplet state (Fig. 1). According to its 
absorption maximum, the adduct is referred to as 
LOV 390 (Fig. 2) and is supposed to be the signaling 
state in vivo. In most LOV proteins, LOV390 thermally 
recovers to the dark-adapted state, LOV447. Signal 
propagation from the LOV core to effector domains 
or protein partners occur via the antiparallel P-scaffold 
and helical caps flanking the LOV core (Fig. 3). The 
peculiar photobiophysics of LOV proteins can be 
exploited for advanced biotechnological applications, 
such as fluorescence imaging, super-resolution micros¬ 
copy, and optogenetics (Fig. 4). 
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LOV Proteins: 
Photobiophysics, 

Fig. 1 Proposed photocycle 
of LOV domains after blue- 
light excitation of the dark- 
adapted state. Subscripts 
indicate the absorption 
maxima as shown in Fig. 2 
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LOV Proteins: Photobiophysics, Fig. 2 Spectral properties 
of a LOV domain (here the Bacillus suhtilis protein YtvA) in the 
dark-adapted state (dark line) and in the FMN-Cys adduct. In 
blue, \)a£ fluorescence spectrum of the dark-adapted state. Fluo¬ 
rescence is totally lost in the adduct 

Basic Characteristics 

The Photocycle of LOV Proteins 

A LOV (Light, Oxygen, Voltage) domain is a small¬ 
sized (ca. 110 aa) UVA/blue-light sensing module, first 
characterized in the plant photoreceptor phototropin. 


a light-activated kinase (Briggs 2006). LOV domains, 
photosensitive thanks to a bound flavin chromophore 
(Edwards 2006; Kritsky et al. 2010; ► Flavins), are 
largely distributed in the living world, identified in 
plants, bacteria, archaea, and fungi (Herrou and 
Crosson 2011). In prokaryotes they are present in 
10-12% of the sequenced genomes (Losi and Gartner 
2012). LOV domains, belonging to the PAS 
(PerArntSim) superfamily, present a characteristic 
a/p fold, where the secondary structure elements are 
named ApBpCpDaEaFaGpHpip, starting from the 
N-terminus (Zoltowski and Gardner 2011). The five 
P-strands form an extended and twisted antiparallel 
P-scaffold (Losi 2007), in most cases flanked by heli¬ 
cal regions, located N- and/or C-terminally to the LOV 
core. The photocycle of LOV domains starts from the 
dark-adapted state LOV 447 , in which flavin mononu¬ 
cleotide (FMN) is noncovalently bound in its fully 
oxidized state. The photocycle involves the formation 
a covalent bond between FMN (position4a) and 
a conserved cysteine (LOV 390 ), also referred to as 
adduct or photoadduct, via the short ps decay of the 
FMN triplet state. Formation of LOV 390 involves the 
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LOV Proteins: Photobiophysics, Fig. 3 Signal propagation 
mechanisms in LOV proteins, (a) in Aiphotl (Avena saliva 
phototropin 1), light relieves inhibition of the kinase activity 
by promoting undocking of the J-linker helix, mostly clamped 
at the P-scaffold of LOV2 in the dark; (b) in fi/LOV-HtH 
(Eiythwhacter litoralis LOV-HelixTumHelix) protein, dimer¬ 
ization is blocked in the dark by direct contact between the LOV 


and HtH domains. Light promotes dimerization by weakening 
this contact, allowing HtH binding to target DNA; (c) in the 
short-LOV protein VcVVD {Neurospora crassa Vivid), light 
promotes dimerization by partial undocking of the helical 
Neap; (d) In SiYtvA {Bacillus suhtilis YtvA) a torquing of the 
J-linker helix might activate the effector/secondary switch STAS 
domain in the protein dimer 


establishment of new C(4a)-S and N-H(5) bonds, most 
probably via the fast decay of a FMNH‘-H 2 CS‘ radical 
pair (Fig. 1). With few exceptions, for which the 
photoaddition reactions is virtually irreversible, 
LOV 390 thermally decays to the parent state with 
breakage of the covalent bond, on the s-to-h timescale 
at room temperature (Losi and Gartner 2012). The 
driving force for the dark-recovery reaction results 
from the high energy content of LOVLit, ca. 
110-140 kJ/mol, indicative for a strained protein con¬ 
formation. As an example. Fig. 2 reports photocycle 
and spectral parameters of the bacterial LOV protein 
YtvA. The dynamics of the photocycle can be regu¬ 
lated by introducing mutations in the vicinity of the 
chromophore, chiefly involved in the hydrogen bonds 
network stabilizing FMN in the cavity (Raffelberg 
et al. 2011 ), but also by changing more peripheric 
amino acids (Losi and Gartner 2012). 

Besides thermal recovery, there are evidence for 
photochromicity in LOV domains, whereby LOV 390 
can be photoexcited, yielding the parent state in a fast 
but poorly efficient process (vide infra). 

Signal Transmission in LOV Proteins 

LOV proteins are highly modular, with diverse effec¬ 
tor domains linked to the LOV core, including kinases. 


phosphatases, second-messenger regulators, or DNA- 
binding motifs (Herrou and Crosson 2011; Zoltowski 
and Gardner 2011; Losi and Gartner 2012). 
Standalone, extended LOV domains are also present 
in many instances. The photochemical event triggers 
intraprotein signal propagation via conformational 
changes that travel from the chromophore cavity to 
molecular surfaces, where they affect interdomain or 
protein-protein interactions. 

The extended, antiparallel (3-sheet of the LOV core 
is a chromophore/environment interface: on one side it 
hosts residues directly interacting with the isoalloxa- 
zine ring of FMN (Raffelberg et al. 2011), and on the 
other side it communicates with helical regions 
flanking the LOV core (N-terminal cap and/or 
Ja-linker) or it interacts directly with effector domains 
(Zoltowski and Gardner 2011; Losi and Gartner 2012). 

Photo-perturbation of the LOV core (3-sheet medi¬ 
ates signal propagation via different mechanisms, 
involving the N-terminal cap and/or Ja-linker to effec¬ 
tor domains by, e.g., undocking of the Ja-linker and 
torquing (Fig. 3). 

The P-scaffold surface also participates in LOV- 
LOV dimerization, which in solution may be a highly 
dynamic phenomenon (► Transient Grating Spectros¬ 
copy: Dynamics of Photoreceptors). For example, the 
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LOV Proteins: Photobiophysics, Fig. 4 LOV domains bio¬ 
technology applications. The fluorescence of the dark-adapted 
state can be exploited to engineer reporter genes. Blue-light 
excitation results in adduct formation, i.e., the signaling state, 
that may in turn regulate a variety of metabolic processes. This 
feature is exploited in optogenetics. The adduct is biologically 
active and nonfluorescent (FluooPF/BiooN). but the system can 
be driven into a photoequilibrium by means of v/o/etAJVA light, 
with a very low yield for recovery of fluorescence. This 
photoswitching feature has potential application in super¬ 
resolution fluorescence microscopy (nanoscopy) 


fungal LOV protein VVD (Vivid) is monomeric in the 
dark and tends to undergo a light-driven dimerization, 
hut the lit form itself appears to be a rapid monomer- 
dimer equilibrium (Fig. 3c). 

Advanced Biophysical Applications: Microscopy, 
Nanoscopy, and Optogenetics 

The fluorescence of the dark-adapted state, LOV 447 , 
allows applications of LOV proteins in fluorescence 
imaging (► Flavin Mononucleotide-Binding Fluores¬ 
cent Proteins). As fluorescence is lost upon formation 
of the LOV 390 adduct, this last process can be annihi¬ 
lated by mutating the reactive cysteine. Compared with 
green fluorescent proteins and its derivatives, where 
chromophore formation is oxygen dependent, LOV 
domains are advantageous for applications in anaero¬ 
bic or microaerobic environments as well as for inves¬ 
tigating viral infections of plants. Since the latter is 
considered to be the biologically active (signaling) 
state, it may be stated that fluorescence is ON when 


the biological photoresponses are OFF, and vice versa, 
i.e., LOVoark and LOV^t can be labeled FlnoowBiooff 
and FluooFpBiooN; respectively (Fig. 4). 

Interestingly, an early report on the LOV2 domain 
from the phy3 receptor of Adiantum indicated that for 
this protein the light adapted species can be 
photoconverted in good yield to the dark adapted, 
fluorescent state, by shining near UV-violet light on 
the photoreceptor (Kennis et al. 2004). The finding of 
photoswitching phenomena in LOV domain proteins 
went largely unnoticed and the potential of this prop¬ 
erty was almost completely unexplored. The availabil¬ 
ity of genetically encodable fluorescent reporters with 
photochromic properties is of fundamental relevance 
to super-resolution microscopy (nanoscopy) studies, 
based on the stochastic photoactivation of single mol¬ 
ecules, like in FPALM (Patterson et al. 2010; 
► Photoactivated Localization Microscopy (PALM)). 

Optogenetics indicates the use of light-gated pro¬ 
teins originally designed by nature as tools to 
photomodulate cell activities. Prompted by the success 
of the retinal-protein channelrhodopsin that provides 
a nontoxic, inheritable mechanism for the selective 
manipulation of cell membrane potential, optogenetics 
applications are now also becoming available with 
LOV proteins and other flavin-binding photoreceptors 
(Losi and Gartner 2012). Given the modularity of LOV 
proteins, a number of light-regulated functions are at 
hand, such as turnover of second messengers, kinase 
and phosphatase activities, gene activation. 

It is also noted that LOV proteins have the potential 
to function at the same time as optogenetic tools (ini¬ 
tiating a biological function, e.g., enzyme activities) 
AND as photoswitchable proteins in super-resolution 
microscopy, given the possibility to establish 
a photoequilibrium between the LOV447 and LOV390. 
Although this opportunity needs to be demonstrated in 
living cells, it represents a novel potential application 
of LOV proteins. 


Cross-References 

► Flavin Mononucleotide 

► Flavin Mononucleotide-Binding Fluorescent 
Proteins 

► Flavins 

► Photoactivated Localization Microscopy (PALM) 

► Photoadduct 
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Low Angle Neutron Scattering 


► Riboflavin 

► Riboflavin 5'-phosphate (FMN) 

► Transient Grating Spectroscopy: Dynamics of 
Photoreceptors 
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Lysozyme - Computational Studies 

Anna Louise Bowman 

Center for Drug Discovery, The Bouve College of 
Health Sciences, Northeastern University, Boston, 
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Synonyms 

Muramidase; A-acetylmuramide glycanhydrolase 

Definition 

Lysozyme is an enzyme (EC 3.2.1.17) which catalyzes 
hydrolysis of the P(l^) glycosidic bond between 
A-acetyl-muramic acid and A-acetyl-o-glucosamine 
and serves as an important model system for many 
fields of study. 

Introduction 

Lysozyme is a model system for many studies; hen egg 
white lysozyme (HEWL) was the first protein to be 
sequenced that contains together all the 20 amino 
acids, the first enzyme to have its 3D structure deter¬ 
mined by X-ray crystallography, the first enzyme for 
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Lysozyme - Computational Studies, Fig. 1 The catalytic mechanism for HEWL proposed by Phillips via an oxocarbenium ion 
intermediate 


which a detailed chemical mechanism was proposed, 
and the first enzyme to be studied with QM/MM 
methods. HEWL can be considered as one of the 
most influential enzymes in the field of biochemistry 
(Jolles 1996). 

HEWL consists of a single polypeptide chain 
containing 129 amino acids and is linked by four 
disulphide bonds. A deep crevice or cleft containing 
the active site divides the molecule into two domains; 
one of them is almost entirely p-sheet structure (resi¬ 
dues 40-85), whereas the other is comprised of the 
N and C-terminal segments (residues 1-39 and 
101-129) and is more alpha helical in nature. The 
two domains are linked by an a-helix (residues 89-99). 

Lysozyme hydrolyzes a component of the polysac¬ 
charide cell wall in Gram-positive bacteria, cleaving 
the P(l^) glycosidic bond between A-acetyl-muramic 
acid (NAM) and N-acetyl-o-glucosamine (NAG). The 
enzyme binds the polysaccharide in a cleft six saccha¬ 
ride units long, traditionally designated sites A-E (or in 
more systematic nomenclature —4 to -i-2). Erom X-ray 
crystallography (subsites A, B, and C) and model 
building studies (subsites E and E) most sugars were 
found to fit perfectly, however, the D (—1) site NAM 
suffered bad steric contacts with enzyme. It is thought 
that the D site NAM is distorted away from the chair 
conformation when bound to minimize this steric 
strain (Erey and Hegeman 2007). 

Proposed Mechanisms 

The determination of the structure of HEWL led 
directly to the proposal of a dissociative (SNl-type) 
mechanism involving an oxocarbenium ion 


intermediate; Eig. 1. The carboxylic acid group of 
glutamic acid 35 was proposed to donate a hydrogen 
to the glycosidic oxygen atom between rings D and 
E (—1 and -i-l). This results in cleavage of the glyco¬ 
sidic bond and, according to Phillips, yields an ionic 
(oxocarbenium) intermediate. The positive charge was 
proposed to be stabilized by the lone pair of electrons 
of the ring oxygen and electrostatically by the negative 
charge of the carboxylate group of the neighboring 
Asp52. The oxocarbenium ion then reacts with 
a solvent water molecule and Glu35 becomes 
reprotonated. This mechanism had been a textbook 
example mainly because historical importance of lyso¬ 
zyme and the coherent message the structure seemed to 
provide. 

Evidence from crystallography and electrospray 
ionization mass spectrometry indicates that catalysis 
proceeds via a covalent intermediate (Vocadlo et al. 
2001). However, these results required the use of 
a (catalytically less competent) mutant HEWL or an 
unnatural substrate. The mechanism is an SN2-type 
reaction: after cleavage of the glycosidic bond the 
D site NAM forms a covalent intermediate with 
Asp52. The transition state for this SN2-type reaction 
would be an oxocarbenium ion analogous to that envis¬ 
aged as the intermediate in the mechanism proposed by 
Phillips. In this transition state electron donation from 
the ring oxygen would stabilize most of the 
positive charge that would otherwise develop at CL 
This mechanism, originally proposed by Koshland, is 
the generally accepted mechanism for most retaining 
P-glycosidases; Pig. 2. 

A third mechanism for HEWL in which an 
endocyclic bond is broken was proposed by Post and 
Karplus, on the basis of a 55 ps molecular dynamics 
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Lysozyme - Computational Studies, Fig. 2 The catalytic mechanism for HEWL proposed by Koshland via a covalent 
intermediate 
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Lysozyme - Computational Studies, Fig. 3 The catalytic endocyclic mechanism for HEWL proposed by Post and Kaiplus 


Lysozyme - Computational 
Studies, Fig. 4 The covalent 
intermediate formed in HEWL 
from QM/MM molecular 
dynamics simulations 
(Bowman et al. 2008) 
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simulation of a lysozyme complex with NAMg. This 
mechanism does not require distortion of the NAM 
D site ring. The initial step is protonation of the ring 
oxygen 05 by Glu35. Cleavage of the endocyclic 
Cl-05 bond then forms the acyclic oxocarbenium 
ion intermediate, which is stabilized by Asp52 (Stole 
et al. 1997); Fig. 3. 

Reaction Modeling 

The first QM/MM study of HEWL focused on the 
oxocarbenium ion and concluded that electronic stabili¬ 
zation would be paramount for formation of this pro¬ 
posed intermediate (Warshel and Levitt 1976). Density 
functional theory calculations on a small model system 
did not support the mechanism proposed by Phillips, and 
suggested that both the mechanism proposed by 
Koshland and that by Post/Karplus were energetically 
feasible (Bottom et al. 2005). QM/MM molecular 
dynamics simulations on the wild-type HEWL with the 
natural substrate provided evidence that the reaction 
proceeds via a covalent intermediate (Bowman et al. 
2008); Eig. 4. The transition state to breakage of the 
glycosidic bond resembled an oxocarbenium ion. The 
endocyclic mechanism was also simulated, but was 
found to be prohibitively high in energy compared with 
the exocyclic protonation mechanism. 

The simulations of Bowman et al. indicate that the 
findings for a mutant HEWL with a fluorinated sub¬ 
strate (Vocadlo et al. 2001) are indeed relevant for the 
wild-type enzyme with its natural substrate (Bowman 
et al. 2008). Together experimental and computational 
methods indicate that the reaction in HEWL proceeds 


via a covalent intermediate contrary to the previously 
widely accepted mechanism. 

Cross-References 

► QM/MM Methods 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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M2 Proton Channel from Influenza A: 
Example of Structural Sensitivity to 
Environment 

Timothy A. Cross 

Department of Chemistry and Biochemistry, Institute 
of Molecular Biophysics and the National High 
Magnetic Field Laboratory, Florida State University, 
Tallahassee, FL, USA 

Synonyms 

Ion channels; Proton channel; Solid-state NMR 

Introduction 

Numerous structures of the M2 proton channel from 
Influenza A have been characterized using solution and 
solid-state NMR spectroscopy as well as X-ray 
crystallography. This provides a unique opportunity to 
study the influence of membrane mimetic environments 
on the stracture of a membrane protein. Indeed, many of 
the stmctural differences observed in this set of structures 
can be attributed to the influence of membrane mimetic 
environments that do not fully model the native mem¬ 
brane environment. The conclusion is that it is necessary 
to validate stractures that are not obtained in an environ¬ 
ment that is a good model of the native membrane. 

The M2 Proton Channel: A Proven Drug Target 

Amantadine is a drug approved for use by the US Food 
and Drug Administration in 1966 to combat Influenza 
decades before it was known that the drug functioned 


by inhibiting the M2 protein in the virion. In 1980, an 
open reading frame for the M2 protein was identified in 
segment 7 of the Influenza RNA and the sequence was 
determined (Lamb and Lai 1981). In the early 1990s it 
was discovered that the M2 protein formed a proton 
channel (Duff and Ashley 1992; Pinto et al. 1992) and 
that the protein was a tetrameric structure in which 
a single transmembrane (TM) helix was contributed 
from each monomer to form a pore. The TM domain 
separated an N-terminal segment of approximately 
25 residues on the external surface of the virus from 
a C-terminal sequence of 51 residues on the viral 
interior that, like the other viral coat proteins of Influ¬ 
enza, interacts with the Ml protein on the viral interior. 

The M2 Environment 

The lipid content of fowl plague virus from the influ¬ 
enza viral family has been shown to be high in choles¬ 
terol and sphingomyelin (Scheiffele et al. 1997) 
suggesting that viral budding takes place through 
a raft-like domain. This further suggested that the 
lipid environment for M2 should also be high in 
sphingomyelin and cholesterol. However, it was rec¬ 
ognized early on that while M2 was expressed at nearly 
the same level as hemagglutinin and neuraminidase 
that the amount of M2 protein in the viral particles 
was more than an order of magnitude smaller (Zebedee 
and Lamb 1988) suggesting that the M2 protein is not 
very soluble in the viral coat. In 2005, Schroeder and 
Lin (Schroeder and Lin 2005) suggested that M2 
associates with the periphery of raft-like domains and 
more recently Lamb and coworkers showed that M2 is 
associated with viral budding and that the M2 protein 
was localized to the base of budding filamentous 
virions (Rossman et al. 2010). 


G.C.K. Roberts (ed.), Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 
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Consequently, it is not entirely clear what the most 
appropriate lipid environment is for the M2 protein. 
However, it has recently been shown that amantadine 
does not bind to M2 when it is in a raft-like environ¬ 
ment (Cady et al. 2011) and that the histidine pKas are 
also not consistent (Hu et al. 2011) with the pKa 
measurements when M2 is solubilized in lipids that 
form liquid crystalline domains (Hu et al. 2006) and 
with the current functional mechanism (Sharma et al. 
2010). Therefore, it appears as if M2 is solubilized as 
has been suggested in the literature at the periphery of 
rafts in a lipid environment that is liquid crystalline. 

I\/12 Structural History 

Because the TM helix (residues 22^6) alone forms 
a tetrameric structure with proton conductance and 
sensitivity to amantadine, this construct was an early 
favorite for molecular dynamics simulations and for 
early structural studies. The first molecular dynamics 
effort was by Sansom in 1993 that packed the tetra¬ 
meric helices in a parallel bundle with the sidechains of 
Val27, Ser31, and His37 oriented toward the axis of the 
bundle (Sansom and Kerr 1993). This was followed by 
numerous molecular dynamics and other modeling 
efforts from a variety of labs consistently showing 
a left-handed bundle of helices with the hydrophilic 
residues on the interior, but with small helical tilt 
angles. 

In 1999, IR spectroscopy of uniformly aligned sam¬ 
ples suggested that the helix was tilt at 31° in DMPC 
bilayers based on amide I frequencies for carbonyl 
labeled Ala29 and Ala 30 (Kukol et al. 1999). In 2000, 
solid-state NMR data for multiple sites along the poly¬ 
peptide backbone in uniformly aligned DPMC bilayers 
suggested a somewhat larger tilt of 38° and a uniform 
helical structure was implied by the observation of 
a helical wheel observed directly in the NMR spectra 
(Wang et al. 2000) known as a PISA wheel for polar 
induced slant angles (Marassi and Opella 2000). Both 
the IR and solid-state NMR data clearly showed that 
the helical structure was fourfold symmetric. 

The complete backbone structure of the TM helix 
was published in 2001 (Wang et al. 2001) followed by 
the tetrameric structure in 2002 (Nishimura et al. 
2002). The structure shown in Fig. 1 (PDB: INYJ) 
displayed none of the coiled-coil nature that was antic¬ 
ipated for such a four-helix bundle. Indeed, the helical 
structure was very highly restrained as uniform helices 
with torsion angles (([) = —60° and \|/ = —45°) similar 


to the original helical models from the 1950s (Pauling 
and Corey 1950) that were calculated in vacuo 
(i.e., a very low dielectric environment). Such unifor¬ 
mity in the helical structures has recently been 
reviewed by Page et al. (2008). 

This initial structure was influenced by the sample 
preparation protocol in which the lipid and synthetic 
peptides were codissolved in an organic solvent prior 
to drying and hydration to form the uniformly aligned 
bilayers. In more recent years, the proteoliposomes 
have been dialyzed before preparing the NMR sam¬ 
ples. Such samples of the TM helices display consid¬ 
erably greater linewidths resulting from increased 
dynamics and conformational heterogeneity (Li et al. 
2007). While the fundamental uniformity of the helices 
dictated by the low dielectric environment remained, 
the helical tilt shifted from the upper to mid 30s to 32° 
(Li et al. 2007), a value very similar to that observed 
from the IR spectroscopy mentioned above. When 
amantadine was added to the preparations the 
linewidths sharpened considerably, but the helix 
kinked, such that the amino terminal half of the helix 
remained tilted at 31°, while the carboxy terminal 
portion of the helix had a tilt of 20° (Fig. 1, PDB: 
2H95refined) (Hu et al. 2007). 

The first structural studies of the full-length M2 
protein were performed in 2003 using amino acid 
specific labeling that resulted in a limited data set for 
the tilt of the TM helix and for the amphipathic helix 
(residues 47-62) that is just to the C-terminal side of 
the TM helix. The TM helix was shown to have an 
approximate tilt of 25° and the amphipathic helix a tilt 
of either 80° or 100° (Tian et al. 2003). Furthermore, it 
was shown by hydrogen-deuterium exchange that the 
slowest exchanging amides in the entire protein were 
not in the TM helix, but in the amphipathic helix that 
interacts with both the bilayer interfacial region and 
the hydrophobic interstices of the bilayer. 

In 2008, crystal structures of the TM domain were 
published both with and without amantadine (Fig. 1, 
PDB: 3C9J and 3BKD) (Stouffer et al. 2008). Once 
again the structure was a left-handed bundle of helices, 
but the bundle without drug displayed none of the 
symmetry that was so clear in the IR and solid-state 
NMR spectra of this domain in lipid bilayers. In par¬ 
ticular, one of the helices was tilted at a much greater 
angle than the others and there was no significant 
interaction between the His37 and Trp 41 residues 
critical for proton conductance and gating of the 
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M2 Proton Channel from 
Influenza A: Example of 
Structural Sensitivity to 
Environment, Fig. 1 M2 

structures in the Protein Data 
Bank (PDB) with their 
accession numbers. The 
structures in forest green were 
achieved by solid-state NMR 
in a lipid bilayer environment: 
INYJ (Nishimura et al. 2002), 
2H95 (Hu et al. 2007), 2KQT 
(Cady et al. 2010), 2L0J 
(Sharma et al. 2010). The 
structures in cyan are 
detergent-based X-ray crystal 
structures: 3C9J 
(Stouffer et al. 2008), 

3BKD (Stouffer et al. 2008), 
3LBW (Acharya et al. 2010). 
The structures in chartreuse 
are solution NMR structures in 
detergent micelles: 2RLF 
(Schnell and Chou 2008), 
2KWX (Pielak and Chou 
2010) 


2002: 1NYJ 


2007: 2H95refined 




2010:2L0J 





channel. This nonnative conformation appears to have 
been induced by substantial electrostatic crystal con¬ 
tacts between an antiparallel pair of four-helix bundles 
and hence the symmetry in the structure was broken 
(Fig. 2a) (Cross et al. 2011). A second structure in the 
presence of amantadine was also solved and while this 
structure also had severely splayed helices as in the apo 
structure, it is symmetric and the crystal contacts are 
much reduced in the amantadine bound crystals. This 
is clear evidence that the crystal packing can severely 
distort TM structures. The helix tilt is not so different 
from the solid-state NMR structure, but the crossing 
point for the helices instead of being in the center of the 
membrane is at the N-terminus of the TM helices. 

Octylglucoside and polyethylene glycol molecules 
penetrate into the pore and between the helices of the 


apo structure (Fig. 2b) (Stouffer et al. 2008). Indeed, 
the penetration of these organics included hydrogen 
bonding to His37, clearly interfering with the conduc¬ 
tance mechanism. Such organics were not observed for 
the lower resolution bound state, but may have been 
present resulting in the splaying of the helices. These 
splayed structures mean that the conductance pore is 
not contained by the protein. 

In the same issue of Nature that published the crys¬ 
tal structure, a solution NMR structure of a somewhat 
longer construct including the TM and amphipathic 
helices was published (Fig. 1, PDB: 2RLF) (Schnell 
and Chou 2008). This construct has conductance prop¬ 
erties that are indistinguishable from the full-length 
protein and is therefore referred to as the conductance 
domain (Ma et al. 2009). Again, the four-helix bundle 
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I\/I2 Proton Channel from Influenza A: Example of Struc¬ 
tural Sensitivity to Environment, Fig. 2 (a and b) Two M2 

tetramers in the crystal unit cell of 3BKD (Stouffer et al. 2008). 
(a) Salt bridge between helices and tetramers causing asymme¬ 
try in the tetramer. (b) Octylglucoside and polyethylene glycol 
in space-filling view splaying the helices apart, (c-e) Pores 


formed by 2RLF (Schnell and Chou 2008) (c), 2L0J (Sharma 
et al. 2010) (d), 2KWX, the V27A drug-resistant mutation 
(Pielak and Chou 2010) (e). The restriction for drug to bind in 
the pore is much greater in 2RLF than in 2L0J and is even less 
restricted in 2KWX where the off rate is probably too great to 
achieve effective binding 


formed as a left-handed structure, but here the tilt of 
the helices was less than that observed in the solid-state 
NMR structure. The amphipathic helices were shown 
to form a water soluble and fully exchangeable bundle 
as a result of a 90° rotation about the helix axes so that 
the hydrophobic surface of each helix interacted with 
the hydrophobic surface of the other helices. 
Undefined loops connected each TM helix with this 
water soluble helical bundle. Moreover, the drug bind¬ 
ing site was identified as being on the exterior of the 
structure and near this junction between the TM and 
amphipathic helices. 

In 2010, Hong and coworkers refined the amanta¬ 
dine bound solid-state NMR structure of the TM 
domain (Fig. 1, PDB: 2KQT) (Hu et al. 2007) by 
measuring distances from the helices to the bound 
drug (Cady et al. 2010). In fact, this effort described 
two different sites, a high affinity site in the pore of the 
channel and a low affinity site where the solution NMR 
spectroscopy had suggested the drug binding location. 
Also in 2010, a new solid-state NMR structure of the 
conductance domain was achieved that had the amphi¬ 
pathic helices imbedded in the bilayer interface (Fig. 1, 


PDB: 2L0J) (Sharma et al. 2010). The data for the TM 
domain overlaid well with the previous data using 
sample preparations that are dialyzed to remove all 
traces of organics. Moreover, the amphipathic helices 
were determined to have a tilt of 105°, very similar to 
the results obtained on the full-length protein (Tian 
et al. 2003). In this position two hydrophobic residues 
pack into the weak binding site for the drug thereby 
eliminating such a binding site once the amphipathic 
helix interacts with the bilayer. In addition, it was 
shown that the stoichiometry between drug and tetra¬ 
mer was 1:1 (Sharma et al. 2011). In 2010, two addi¬ 
tional structures were published: a G34A mutant 
designed to stabilize the TM domain was crystallized 
and the backbone superimposes very well with this 
most recent solid-state NMR structure (Fig. 1, PDB: 
3LBW) (Acharya et al. 2010). The other structure was 
a solution NMR structure and this was of the V27A 
drug-resistant mutant of the conductance domain 
(Fig. 1, PDB: 2KWX) (Pielak and Chou 2010). This 
structure also had the amphipathic helices interacting 
with the lipid interface, but not as deeply as the solid- 
state NMR structure. Furthermore, the TM helices had 
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a very similar helix tilt as in the first solution NMR 
structure, i.e., quite a bit less than the solid-state NMR 
structures. 

Environment Induced Structural Effects 

The structural differences in these many structures do 
not appear to result from numerous functional states, 
but are primarily the result of structural influences by 
the membrane mimetic environment (Cross et al. 
2011). Christian Anfinsen in 1973 stated that “the 
native conformation (of a protein) is determined by 
the totality of interatomic interactions and hence by 
the amino acid sequence in a given environment” 
(Anfinsen 1973). For membrane proteins the heteroge¬ 
neous environment can have a very significant 
impact on the membrane protein structure, especially 
for small TM proteins that have a greater percentage of 
interactions with the environment compared to the 
interactions within the protein structure. 

The detergent-based crystal structures of the TM 
domain illustrate two potential problems associated 
with such an environment (Stouffer et al. 2008). For 
the apo structure, 3BKD, crystal packing results in 
a Coulombic interaction between two tetramers 
involving Asp24 and Arg 45 (Fig. 2a) as well as exten¬ 
sive van der Waals interactions between the tetramers. 
For the amantadine bound structure, 3C9J, the Cou¬ 
lombic interaction between tetramers is no longer pre¬ 
sent, although van der Waals interactions remain 
(Stouffer et al. 2008). While the apo structure is an 
asymmetric tetramer, the drug-bound structure is 
symmetric. It appears that the Coulombic interactions 
associated with crystal contacts may be responsible for 
distorting the symmetry of this tetrameric structure. 
Although the drug-bound structure is symmetric it still 
has splayed helices that generate fenestrations from the 
pore into the would-be membrane environment. In the 
viral coat the splayed helices would generate a large 
aqueous vestibule open to the viral interior and open to 
the hydrophobic interstices of the viral coat. 

The resolution of the drug-bound structure (3C9J) 
was not good enough to define the coordinates for the 
detergent, (l-octylglucoside (OG) and the precipitant, 
polyethylene glycol (PEG) (Stouffer et al. 2008), but 
some of these molecules were defined in the apo struc¬ 
ture (3BKD) including two OG molecules and a PEG 
imbedded in the pore at the C-terminal end of the 
tetramer (Eig. 2b). Presumably, these same molecules 
are present in the pore of the drug-bound structure 


crystallized under similar conditions and yielding 
similar splayed helices. Unlike lipids that have nM 
monomer concentrations when bilayers form, 
detergents typically have mM concentrations above 
the critical micelle concentration (Cevc and Marsh 
1987). Here, these monomeric organics appear to be 
responsible for splaying apart the helices of these 
tetrameric crystal structures. 

The more recent crystal structure (3LB W) (Acharya 
et al. 2010) represents a crystallographic triumph for 
the smallest TM protein crystallized in a conformation 
that appears to be consistent with the membrane envi¬ 
ronment. While the only interactions between the TM 
helices in this construct are van der Waals interactions 
and maybe other weak electrostatic interactions, the 
interactions with the environment are limited to a few 
additional van der Waals interactions. Such interac¬ 
tions with the environment result in neither structural 
asymmetry nor splaying of the helices that had 
previously resulted in fenestrations into the membrane 
interstices. 

The first solution NMR structure (2RLP) (Schnell 
and Chou 2008) was of the conductance domain solu¬ 
bilized in diheptanoylphosphatidylcholine (DHPC) 
micelles. Three features of the micelle environment 
appear to have distorted this structure. Eirst, the 
micelle surface is highly curved potentially impairing 
the interaction of linear amphipathic helices. Further¬ 
more, the water concentration gradient is such that 
water penetrates much further into this environment 
than into a membrane environment (Menger and Boyer 
1980). Both of these factors will influence the interac¬ 
tion of amphipathic helices with this membrane 
mimetic. Indeed, for 2RLF the amphipathic helices 
choose to form a water soluble tetrameric helical 
bundle with a hydrophobic core as opposed to the 
helices becoming buried in the interfacial region of 
the membrane mimetic. 

The third feature of the micelle that appears to come 
into play here is that the hydrophobic dimension of 
detergent micelles is an adjustable parameter 
influenced by the TM protein. While the hydrophobic 
dimension of native membranes can also be influenced 
by TM proteins through the release of curvature frus¬ 
tration, the mechanism for hydrophobic expansion in 
a micelle involves the mere addition of detergent mol¬ 
ecules to the micelle in forming a somewhat prolate 
ellipsoid. Consequently, the native membrane con¬ 
strains the tilt of hydrophobic TM helices (de Planque 
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and Killian 2003) more than a detergent micelle. For 
the solution NMR structures (2RLF and 2KWX) 
(Pielak and Chou 2010; Schnell and Chou 2008) the 
tilt of the helices is significantly less than that seen in 
the structures obtained in synthetic lipid bilayers 
(Hu et al. 2007; Sharma et al. 2010) and in the recent 
crystal structure (Acharya et al. 2010). The result of 
a more parallel packed tetramer of helices is a smaller 
opening into the pore (Fig. 2c) compared to the recent 
solid-state NMR structure (Fig. 2d) (2L0J). This could 
explain the lack of drug binding, which is known to 
have a slow on rate in membrane and bilayer environ¬ 
ments, but here in detergent micelles the binding is 
observed to be negligible and may result from further 
restrictions by this secondary gate associated with 
Val27 that forms the entry point into the pore. 

The second solution NMR structure (2KWX) was 
obtained in a similar DHPC micelle environment on the 
same conductance domain but with the V27A mutation 
(Pielak and Chou 2010). The result is that the secondary 
gate normally formed by Val27 has been eliminated 
and one turn further into the pore the Ala30 residues 
now provide the greatest restriction for the entrance 
into the pore (Fig. 2e). The most interesting develop¬ 
ment with this structure is that the amphipathic helix is 
now rotationally reoriented by 90° with its hydrophobic 
surface interacting with the micelle interstices. While 
the position is not the same as in the bilayer bound 
structure, it raises the interesting possibility that two 
conformations for the amphipathic helices may be pos¬ 
sible. One, as observed in the synthetic bilayers would 
induce curvature in the membranes as needed for viral 
budding (Rossman and Lamb 2011), and the second 
conformation as observed in the micelles where the 
protein has far less influence on membrane curvature 
as seen in the host cell. 

Summary 

Clearly, the structure of TM a-helical proteins can be 
dramatically influenced by different membrane 
mimetic environments. Understanding this influence 
is going to be critical for the evaluation of all a-helical 
membrane proteins in the protein data bank and espe¬ 
cially those with a relatively small number of helices. 
Furthermore, it is clear that structural data from lipid 
bilayers is going to be critical for validating structures 
obtained from a detergent environment. 


Cross-References 

► Membrane Proteins in Phospholipid Bilayers: 
Structure Determination by Solid-State NMR 
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Synonyms 

DNA polymerase 

Introduction 

Since the discovery of the double helix more than 
60 years ago, extensive studies have revealed the 
basic mechanisms underlying one of the most essential 
processes at the core of every cell: the replication of 
genomic information. The basics of this process are 
highly conserved throughout all organisms from 
viruses to eukaryotes. The higher complexity of 
eukaryotic cells and the need to adjust to a wider 
spectrum of internal and external signals require addi¬ 
tional factors that fine-tune the replication machinery 
to respond specifically to the needs of the given envi¬ 
ronment. An additional distinction of eukaryotic repli¬ 
cation is the ► posttranslational modification of 
a number of replication proteins through phosphoryla¬ 
tion and ubiquitination, which promote responses to 
cell cycle-dependent signals and DNA damage. 

The fundamental steps of DNA replication include 
melting of the duplex at replication initiation sites, 
assembly of a multicomponent replication fork, and 
coordinated synthesis of the daughter strands. The 
core components required to accomplish this task 
include a DNA ► helicase that unwinds the parental 
DNA strands, a primase activity that initiates DNA 
replication by synthesizing short RNA primers, two 
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► DNA polymerases that extend these primers for 
processive synthesis, processivity clamps that stabilize 
the DNA polymerase at the fork, and a clamp loader that 
assembles the clamps at primed sites. Single-strand 
DNA-binding proteins coat the unwound single-stranded 
lagging strand template to prevent induction of a DNA 
damage response and remove secondary structures 
that may interfere with DNA synthesis (Fig. 1). 
This multiprotein assembly at a rephcation fork is called 
the “replisome.” 

The antiparallel nature of the DNA double helix and 
the 5' to 3' direction of nucleotide polymerization 
require that one polymerase extends the daughter strand 
continuously (leading strand polymerase) while the 
other polymerase (lagging strand polymerase) synthe¬ 
sizes short fragments of DNA in the opposite direction. 
These DNA fragments are referred to as Okazaki frag¬ 
ments and are initiated upon synthesis of RNA primers 
by primase. Thus, the lagging strand polymerase repeat¬ 
edly cycles from the end of one Okazaki fragment to the 
beginning of the next, while the leading strand polymer¬ 
ase extends DNA continuously in the direction of the 
moving replication fork (Fig. 2). Studies from multiple 
systems including Escherichia coli and bacteriophages 
have demonstrated that the two polymerases are 
coupled and physically connected via protein-protein 
interactions. A model that allows the coupling of two 
polymerases despite their opposite direction of DNA 
synthesis is referred to as the trombone loop model 
(Komberg and Baker 1992), where the unwound lag¬ 
ging strand template bends back to allow the two poly¬ 
merases to remain bound to one another (Fig. 1). The 
loop grows as leading strand synthesis progresses and is 
released upon dissociation of the lagging strand poly¬ 
merase at the end of an Okazaki fragment. 

The characterization of proteins at the replication 
fork has historically included the following steps: 
(1) the identification of individual components of the 
replisome, (2) the characterization of their enzymatic 
activities, (3) high-resolution studies of their three- 
dimensional structures, (4) understanding their interac¬ 
tion with DNA and other components of the replisome, 
and (5) the dynamic behavior of each of the components 
within a moving replication fork. Addressing these 
points required obtaining the proteins in quantity and 
the establishment of suitable model systems. E. coli and 
its bacteriophages T4 and T7 have proven to be excel¬ 
lent systems, most importantly because their complete 
replisomes can be reconstituted from isolated and 


purified subunits into functional replication forks. 
The identification of the individual components of 
replisomes involved numerous genetic studies, while 
biochemical and biophysical techniques such as 
► FRET, pre-steady state ► kinetics, or ultracentrifu¬ 
gation generated a vast amount of information regard¬ 
ing binding affinities or catalytic activities such as 
speed, processivity, and fidelity. ► Electron micros¬ 
copy (EM) and X-ray crystallography have generated 
detailed information, including three-dimensional 
structures of individual components and higher-order 
molecular complexes. Recently, great advances have 
been made in the development of new techniques, 
including single-molecule analysis and ► live-cell 
imaging that allow for monitoring dynamic interactions 
on a single-molecule level. 

Comparison of the sequence, structure, and function 
of many of the replication components has revealed 
high conservation in the major activities and 
overall organization, but has also revealed fascinating 
species-specific differences. This review focuses on 
the organization of the replication fork and the major 
components of both bacterial and eukaryotic 
replisomes. The structure and mechanism of 
replisomes and their similarities and differences will 
be discussed. Eurthermore, the recent developments in 
biophysical techniques, including single-molecule 
analysis and in vivo fluorescent imaging, and their 
potential roles in answering questions that have histor¬ 
ically been difficult to approach, will be explored. 

DNA Templates 

Reconstitution of the replisome and understanding 
how it works required the development of suitable 
DNA substrates. This will be discussed below in the 
context of the E. coli replisome machinery. DNA syn¬ 
thesis can be studied using large (5-7 kb) singly 
primed single-stranded (ss) DNA substrates such as 
M13mpl8 or (j)X174 circular bacteriophage genomes 
(Fig. 3a). Monitoring coupled leading and lagging 
strand DNA synthesis, however, requires all the 
replisome proteins to function together and needs 
more complicated substrates (Fig. 3b). One substrate 
that is widely used is a “rolling circle” (RC) substrate, 
which is a duplex circular DNA containing an inter¬ 
ruption in one strand and a 5' ssDNA tail. The E. coli 
DnaB hexamer ring can self-thread onto the 5' ssDNA 
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Machinery of DNA Replication, Fig. 1 Organization of cel¬ 
lular replicative complexes, (a) Bacterial replication fork. The 
parental duplex is unwound by the DnaB helicase (yellow) that 
encircles the lagging strand and travels ahead of the polymerase 
(blue) in the direction of the moving replication fork. Primase 
(purple) synthesizes short RNA primers to initiate Okazaki frag¬ 
ment synthesis on the lagging strand. The exposed single-strand 
lagging strand template DNA is covered by SSB (orange). The 
DNA polymerases are coupled through the clamp loader (green), 
which uses the energy of ATP hydrolysis to assemble the (5 
processivity clamp (red) around primed sites on the DNA. 
Although bacterial replisomes have recently been shown to 
contain three copies of the DNA polymerase, only two are 
shown here for clarity. The third polymerase may function to 
aid completion of Okazaki fragments from which the lagging 


polymerase dissociates prior to completion, (b) Eukaryotic rep¬ 
lication fork. The MCM2-7 helicase (yellow) is assembled on the 
leading strand ahead of the polymerase as it unwinds the duplex 
DNA. The helicase interacts with a number of additional factors 
that are needed for optimal stimulation of its activity as well as to 
bridge the helicase activity with Pola-primase (purple) through 
interactions that include MCMIO (beige) and GINS (brown)-Ctf4 
(green). Cdc45 (dark purple) makes contacts to both MCM and 
the GINS and stimulates helicase activity. The leading strand 
polymerase Pol s (blue) and the lagging strand polymerase Pol 5 
(turquoise) make tight contacts to PCNA processivity clamps 
(red). The unwound single-strand lagging strand template DNA 
is covered by RPA (orange). The clamp loader (green) uses the 
energy of ATP hydrolysis to assemble PCNA processivity clamps 
around primed sites 
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Machinery of DNA Repiication, Fig. 2 Cycle of lagging 
strand synthesis in E. coli. (a) As the replication fork moves, 
DnaB helicase recruits DnaG primase, which synthesizes short 
RNA primers on the unwound lagging strand, (b) While the 
lagging strand polymerase finishes synthesis of the current Oka¬ 
zaki fragment, the clamp loader displaces primase from the 
newly synthesized primer and places a P clamp around the 
primer/template junction, (c) The completion of the Okazaki 


fragment induces polymerase to dissociate from the P clamp 
and DNA and allows recruitment to the newly synthesized 
upstream primer through interaction with the r-subunit of the 
clamp loader, leaving the P clamp behind, (d) The cycle is 
complete upon association of the lagging strand polymerase 
with a new P clamp on an upstream RNA primer to begin 
synthesis of a new Okazaki fragment 


tail, circumventing the need for factors that assist 
helicase assembly onto DNA (Fig. 3b, I). The poly¬ 
merase and sliding clamp (i.e., E. coli P) assemble on 
the 3' terminus of the circle with the help of 
a multiprotein clamp loader. Provided only 2 dNTPs, 
assembly can proceed without elongation, and 


synchronous replication can be initiated by adding 
the remaining nucleotides and single-strand-binding 
protein SSB. DnaB helicase unwinds the product as 
the leading strand polymerase goes around the circle to 
provide a long ssDNA tail. DnaG primase synthesizes 
RNA primers on this ssDNA tail that are extended by 
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Machinery of DNA Replication, Fig. 3 Scheme of in vitro 
DNA replication assays, (a) Primer extension assay. A circular 
ssDNA substrate (e.g., a 7.2 kb M13mpl8 or 5.4 kb (t)xl74 
circular bacteriophage genome) is coated with single-strand- 
binding proteins and hybridized with a short DNA primer (>18 
nt), upon which the (3 clamp is assembled using the clamp loader. 
Addition of ATP and two dNTPs allows (3 clamp loading but 
prevents primer extension. DNA polymerase assembles with [3 
on the primed template and replication is initiated upon addition 
of the remaining two dNTPs. The DNA polymerase dissociates 

the lagging strand DNA polymerase. Alternative sub¬ 
strates used in the study of replisome action include 
forked linear substrates (Fig. 3b, II). 

Rolling circle substrates have been used to visualize 
replication forks in vitro by electron microscopy (EM). 
EM provides a powerful method for analyzing DNA- 
protein complexes that are too large to be visualized by 
X-ray crystallography or NMR methods, yet too small 
to be analyzed by light microscopy. The first direct 
evidence for the presence of loop intermediates during 
DNA replication arose from studies of the T7 
replisome on a rolling circle DNA substrate (Park 
et al. 1998). The replication products are clearly visible 
emanating from the circular substrate. Lagging strand 


rapidly upon completing the circle, (b) Rolling circle replication 
assays to monitor coupled leading and lagging strand DNA 
synthesis. A circular (I) or forked linear (II) double-stranded 
DNA substrate is used for assembly of a replication fork as 
described in (a). A 5' ssDNA tail allows for self-assembly of 
the DnaB helicase. Upon addition of the remaining nucleotides, 
SSB, and DnaG primase, both polymerases are engaged during 
processive DNA synthesis. The leading strand replication prod¬ 
uct in a rolling circle reaction produces the template for lagging 
strand synthesis 

DNA loops are visualized on many molecules, and 
ssDNA can be distinguished by coating with the 
ssDNA-binding protein. EM studies have also been 
extremely valuable for measuring the distribution and 
length of Okazaki fragments on individual replicating 
molecules. A study in the T7 system demonstrated that 
Okazaki fragments display broad size distribution, 
independent of primase concentration (Chastain et al. 
2000 ). 

DNA Polymerases 

The study of the structure and function of DNA poly¬ 
merases has revealed fascinating insights into this 
essential class of enzymes. In bacteria and their 
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phages, the two polymerases are identical subunits, Pol 
III for E. coli, and gp43 and gp5 for T4 and T7, 
respectively. Pol III core is a heterotrimer that contains 
the DNA polymerase (a-subunit), the proofreading 3'- 
5' exonuclease activity (e-subunit), and the 0-subunit. 
The a-subunit of Pol III has a hdelity of one 
misincorporated nucleotide every 10 ^ to 10 ® nucleo¬ 
tides, which is improved by a factor of 10 in the 
presence of the exonuclease activity of 8 (Kornberg 
and Baker 1992). 

Pol 111 core is assembled into a larger multiprotein 
complex called Pol III holoenzyme (Pol III H.E.). The 
holoenzyme contains ten different subunits that can be 
assorted into three major functional groups: ( 1 ) the 
catalytic Pol III core, (2) the processivity factor (3, 
which encircles DNA and binds Pol III core for high 
speed and processivity, and (3) the multiprotein clamp 
loader complex (yT 2 § 8 xv|/) that uses ATP to open and 
close the (3 clamp onto primed sites. The x-subunits of 
the clamp loader attach to Pol 111 core, and this con¬ 
nection allows the two polymerases of E. coli to be 
physically linked (Fig. 1). Recent data revealed that 
clamp loaders can contain three x-subunits, which 
allows for binding three polymerases (Mclnemey 
et al. 2007). The function of a third polymerase at the 
replication fork is still unknown. Bacterial polymer¬ 
ases, when coupled to their respective clamp, are very 
fast and are processive enzymes that can proceed at 
a rate of 1,000 nt/s. This requires rapid polymerase 
recycling during Okazaki fragment synthesis, where 
a new fragment must be extended approximately every 
1-2 s, generating Okazaki fragments in the range of 
1-2 kb. 

Eukaryotic replisomes have two different DNA 
polymerases: Pol s - which is primarily responsible 
for leading strand synthesis, and Pol § - for lagging 
strand synthesis (Johansson and MacNeill 2010). Both 
polymerases are composed of multiple subunits. Pol s 
contains four, and Pol § contains three (yeast) or four 
(human) subunits, respectively (Johansson and 
MacNeill 2010; Nick McElhinny et al. 2008). The 
largest subunit of Pol 5 and Pol 8 contains both the 
polymerase and exonuclease activity but the exact 
function of the smaller subunits are not yet understood. 
In contrast to bacterial replisomes, eukaryotic replica¬ 
tion forks move much slower with an average speed 
of approximately 50 nt/s. Also, eukaryotic Okazaki 
fragments are much shorter (only 100-200 nt long). 
Eukaryotic replisomes also contain proteins of 


unknown function that have no apparent homologues 
to bacterial proteins. 

There are many types of DNA polymerases involved 
in DNA replication and repair. The different polymer¬ 
ases are grouped into different classes based on the 
chain topology and secondary structural elements, as 
well as the organization of the catalytic center. Despite 
their differences, the mechanism of catalysis of the 
phosphoryl transfer reaction is highly conserved. All 
DNA polymerases utilize a two-metal ion-catalyzed 
mechanism for activating the 3' nucleotide of the 
preexisting DNA chain and to assist the pyrophosphate 
group of the dNTP being incorporated. The overall 
architecture of all polymerases is shaped as a right 
hand consisting of thumb, palm, and hngers domains. 
The catalytic center lies in the palm domain, while the 
fingers make important contacts to the incoming nucle¬ 
otides as well as the template base to which they are 
paired. The binding of nucleotides to the fingers domain 
causes a signihcant rotation, which closes a cavity to 
water and accelerates the phosphoryl transfer reaction. 
The thumb domain positions the duplex portion of the 
primed DNA and promotes processivity and transloca¬ 
tion along the template (Steitz 1998). 

Crystal structures of bacterial C-family replicative 
polymerases reveal interesting insights into the evolu¬ 
tion and unusual structural features of bacterial poly¬ 
merases (Bailey et al. 2006; Earners et al. 2006). The 
catalytic site is surprisingly related to the Pol (3-like 
nucleotidyltransferase superfamily X. The bacterial 
C-family polymerases are large and have many 
unusual appendages, including a much more extensive 
fingers domain. Interestingly, Pol III harbors a 
C-terminal OB domain, an ssDNA-binding motif 
which is located at a position near the active site. 
Biochemical and biophysical studies demonstrate that 
the OB domain binds ssDNA and stabilizes Pol III-(3 
binding needed for processive action. Upon reaching 
a downstream Okazaki fragment as ssDNA is 
converted to dsDNA, the OB domain is released from 
the template strand, which weakens the interaction 
between Pol III and the processivity clamp, allowing 
the polymerase to dissociate (Georgescu et al. 2009). 

Eukaryotic polymerases are members of the 
B-family, and are structurally different from the 
bacterial C-family polymerases. The first structural 
information of a eukaryotic member of the B family 
was obtained from the crystal structure comprising T7 
DNA polymerase with primed DNA and an incoming 
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dCTP (Doublie et al. 1998). Overall, the structure 
displays an organization of the subdomains and resi¬ 
dues involved in fidelity and proofreading that is well 
conserved between viral and eukaryotic DNA poly¬ 
merases. The incoming nucleotide is bound in 
a similar manner as all polymerase ternary complexes 
that have since been solved. The fingers and palm 
domains close together to form a tight buried pocket 
that only accommodates the geometry of a correct 
Watson-Crick base pair. It may seem perplexing that 
the exonuclease domain of DNA polymerases is dis¬ 
tant from the polymerase-active site, but a mismatch 
causes the polymerase to pause, giving sufficient time 
for the DNA to relocate from the polymerase site to the 
exonuclease site. 

Sliding Clamps and Clamp Loaders 

All replicative polymerases are intrinsically slow and 
non-processive in the absence of their processivity 
clamp. For instance, E. coli Pol III core synthesizes 
DNA at a rate of only 20 nt/s as compared to the rate of 
a replication fork that can reach 600-1000 nt/s. Thus, 
the attachment of the polymerase to a processivity 
factor provides crucial stimulation to both speed and 
stability of the polymerase on DNA. 

Despite large sequence differences, the structures 
and mechanism of sliding clamps and clamp loaders 
are highly conserved. Sliding clamps are donut-shaped 
ring structures with a central channel large enough to 
encircle duplex DNA. Clamps adopt a sixfold pseudo¬ 
symmetry with six domains that form the complete ring 
(Kong et al. 1992). The E. coli (3 clamp consists of two 
identical subunits, three domains each, that dimerize 
head-to-tail to form the ring-shaped structure (Fig. 4a). 
Eukaryotic PCNA is a homo-trimer with two domains 
per protomer, but each domain has a similar chain fold 
to E. coli P to form a trimer that adopts a similar six- 
domain ring structure (Fig. 4b). All sliding clamps con¬ 
tain a hydrophobic pocket on the C-terminal face to 
which the clamp loader as well as DNA polymerases 
attach (O’Donnell and Kuriyan 2006). 

The multi-subunit clamp loaders are circular 
heteropentameric complexes with sequence similari¬ 
ties among their subunits. Insight into the clamp loader 
mechanism has been obtained from crystal structures 
of bacterial and eukaryotic clamp loaders and clamp 
loader-DNA complexes (O’Donnell and Kuriyan 
2006). The E. coli clamp loader is a multimeric com¬ 
plex (yT2§§^x4') with the ATPase activities residing in 


the three y- and i-subunits. Both, y- and i-subunits are 
encoded by the same gene, dnaX. The y-subunit 
(47 kDa) is a truncated version of the full-length 
T-subunit (71 kDa), resulting from a translational 
frameshift. The crystal structure of the minimal 
(ysSfi') clamp loader complex (the x- and \|/-subunits 
are not needed for clamp loading) reveals a circular 
arrangement, where the C-terminal domains pack into 
a tight uninterrupted collar that holds the pentamer 
together (Fig. 4c, d). The N-terminal domains of all 
five subunits in both bacterial and eukaryotic clamp 
loaders adopt the chain fold of the AAAh- (ATPases 
Associated with a variety of Activities) family. Bind¬ 
ing of ATP to the clamp loader induces a conforma¬ 
tional change that increases the affinity of the clamp 
loader to the sliding clamp. The structure of the E. coli 
5-P complex has revealed details of the clamp-opening 
step and indicates that as a closed ring structure, the 
P dimer is under spring tension (O’Donnell and 
Kuriyan 2006). The N-terminus of 5 harbors the inter¬ 
action domain with p and is shaped as a triangular 
wedge, with a tip that is formed by two adjacent p 
strands and a loop preceding them. Two conserved 
hydrophobic residues fit into the highly conserved 
hydrophobic protein-binding pocket on the surface of 
p. A second important interaction site extends from the 
triangular wedge in §. The binding of 5 into the hydro- 
phobic pocket of P distorts the p dimer interface, and 
opening of the interface allows the domains to relax 
and the ring to spring open. This enhances the affinity 
of the clamp loader for primed DNA. The clamp is 
positioned below the clamp loader in alignment with 
the gap between the AAA-i- domains of 5 and 5'. The 
DNA-binding site is at the center of the clamp loader 
and requires a sharp bend in the DNA to enter. Duplex 
DNA is too rigid to enable this sharp bend but the 
ssDNA at a primed DNA junction provides the neces¬ 
sary flexibility for DNA binding. This requirement for 
DNA bending appears to underlie the specificity of 
clamp loading at a primed template junction. Binding 
of primed DNA stimulates ATP hydrolysis, allowing 
the clamp loader to release from the clamp and the ring 
to close around DNA. 

The five subunits of RFC (RFCl-5) are AAAh- sub¬ 
units that are arranged in a spiral circle similar to the 
E. coli clamp loader. The RFC-PCNA-ATPyS struc¬ 
ture shows that the clamp is located directly under¬ 
neath the AAA-i- domains of all five subunits similar 
as for E. coli (Fig. 4d). Remarkably, the spiral 
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Machinery of DNA Replication, Fig. 4 Structure of sliding 
clamps and clamp loaders, (a) Ribbon representation of the E. 
coli P homodimer (pdb code, 2pol). The two monomers (pink 
and blue) interact head-to-tail and form a highly symmetrical 
ring-shaped structure that encircles DNA. The three domains of 
each subunit have identical chain-folding topologies and form an 
outside perimeter of a continuous antiparallel P sheet. The inside 
cavity is lined with 12a helices, (b) Ribbon representation of the 
Saccharomyces cerevisiae PCNA homotrimer (pdb code, ISXJ), 
which displays the same chain-folding topology that is observed 
in the domains of the P clamp, (c) Left. Schematic representation 
of the arrangement of the E. coli Y358'clamp loader subunits 


demonstrating the circular orientation of the five subunits. The 
pentameric circular assembly is interrupted by a gap between the 
6 - and 5'-subunits, leaving space for the passage of DNA. The 
Xt|; subunits are thought to attach to the y-subunit via v|/. Right: 
Ribbon representation of the crystal structure of the minimal y 
complex clamp loader (y 3 S 8 ') (pdb code 1JR3). The C-terminal 
domains create a tight circular collar. The N-termini of each 
subunit contain the two AAA"^ domains and are suspended from 
the C-terminal domains. The C-terminal domains of the 8 - and 
8 '-subunits create a gap for DNA to enter the internal 
DNA-binding site. The P clamp interacts with the clamp loader 
N-terminal surface, (d) Left: Schematic representation of the 
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organization of RFl-5 matches the helical structure of 
a B-form DNA quite precisely, which allows RFC to fit 
tightly around the double helix. The RFC-PCNA com¬ 
plex binds to primed DNA, and the recognition of the 
double-stranded/single-stranded junction stimulates 
ATP hydrolysis by the clamp loader. This results in 
dissociation of RFC from the clamp, leaving PCNA 
encircling DNA and ready to receive DNA polymerase 
for DNA synthesis. 

DNA Helicases and DNA Primases 

Replicative helicases are circular hexamers that encircle 
one strand of DNA and use ATP to fuel translocation 
along it. The melting of the duplex DNA occurs as 
a consequence of excluding one strand of the DNA 
from the helicase ring as the helicase moves along the 
template (Fig. 1). The primase must associate with the 
helicase for activity thereby localizing RNA primers to 
the forked junction (Chang and Marians 2000; Komberg 
and Baker 1992). In fact, in T7 phage the helicase and 
primase are covalently connected by being encoded by 
the same gene. More typically, the two proteins tran¬ 
siently interact. The E. coli DnaB helicase is a ring of six 
identical subunits that encircle the lagging strand tem¬ 
plate. In contrast, the presumed eukaryotic replicative 
helicase is encoded by six different but homologous 
subunits of the minichromosome maintenance proteins 
(MCM2-7), which translocate along the leading strand 
ahead of the polymerase. 

In the case of E. coli, each DnaB monomer consists 
of two domains. The N-terminal domain (NTD) links 
to DnaG primase, and the larger C-terminal domain 
(CTD) exhibits a RecA-like fold and contains con¬ 
served sequence domains involved in nucleotide bind¬ 
ing and hydrolysis. The C-terminal face of the DnaB 
hexamer is directed toward the replication fork 
(Biswas and Biswas 1999). The two domains of 
DnaB form a double-layered ring structure in which 
the NTD packs into a tight triangular collar seated on 
top of a more loosely packed ring of CTDs. The diam¬ 
eter of the central channel varies between 25 and 50 A, 
which is wide enough to accommodate duplex DNA. 


The crystal structure of DnaB bound to the helicase- 
binding domain (HBD) of DnaG primase reveals bind¬ 
ing of three molecules of HBD to the NTD collar of the 
helicase, with one HBD forming contacts with two 
adjacent NTDs (Bailey et al. 2007). This arrangement 
stabilizes the threefold arrangement of the NTD collar 
and orients the primase-active site with the proposed 
ssDNA-binding site of DnaB, allowing access to prime 
the ssDNA extruding from the DnaB channel. This 
organization may stimulate DnaG activity by increas¬ 
ing the local concentration of ssDNA and ensuring that 
multiple DnaG subunits are in close proximity with 
each other. 

Insight into eukaryotic MCM helicases was 
obtained by crystal structures of archaeal MCMs, 
which are homologous to eukaryotic MCMs (Brewster 
and Chen 2010). Archaeal MCMs are homohexamers 
and like E. coli DnaB they harbor two domains: an 
NTD - which is used mostly for structural organiza¬ 
tion, and a C-terminal domain - connected by a flexible 
linker that harbors the AAA-i- ATPase and unwinding 
activity. Modeling of the crystal structures into 
a homo-hexamer reveals the shape of a dumbbell 
with a narrower waist between the N- and C-terminal 
domains, an organization that is supported by EM 
studies (Gomez-Llorente et al. 2005). Recently, EM 
studies of Saccharomyces cerevisiae MCM2-7 pro¬ 
vided evidence that MCMs are loaded as a dimer 
onto origin DNA in a head-to-head fashion, possibly 
to facilitate two bidirectional replication forks at ori¬ 
gins (Remus et al. 2009). 

In contrast to bacterial primases, the eukaryotic 
primase is a heterodimeric enzyme that forms a stable 
association with the Pol a DNA polymerase catalytic 
subunit, resulting in a tetrameric Pol a-primase com¬ 
plex. Thus, short RNA primers are synthesized by the 
primase activity and extended by Pol a DNA polymer¬ 
ase for 20-30 nt before replicative DNA polymerases 
take over for processive DNA synthesis (Johansson 
and MacNeill 2010). Structural information of eukary¬ 
otic Pol a-primase is still required for a more complete 
understanding of its mechanism. 


< - 

Machinery of DNA Replication, Fig. 4 (continued) arrange¬ 
ment of the RFC clamp loader subunits is similar to the bacterial 
clamp loader subunits shown in (c). The pentameric circular 
assembly is inteiTupted by a gap between the RFCl and RFC5 
subunits, similar to the gap between 5 and 8 ' in the E. coli clamp 


loader. Right: Crystal structure of the S. cerevisiae RFC-PCNA 
complex solved in the presence of ATPyS (pdb code 1SXJ). RFC2 
and RFC5 do not contact the closed PCNA clamp. The overall 
architecture adopts a very similar overall structure as the bacterial 
Y 3 S 8 'clamp loader complex 
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Additional factors present at eukaryotic replication 
forks but not in bacteria include Cdc45 and the GINS 
complex. These proteins interact with the MCM2-7 
helicases and are required for activation of the helicase 
and fork progression (Masai et al. 2010). In yeast, the 
checkpoint proteins Mrcl, Tofl-Csm3 are required for 
forks to pause at protein-DNA barriers. Mrcl has also 
been shown to directly interact with Pol e, thus cou¬ 
pling the polymerase with the helicase. On the lagging 
strand, Ctf4 bridges the helicase with Pol a-primase 
(Gambus et al. 2009). 

New Emerging Techniques 

The recent development of single-molecule tech¬ 
niques has allowed direct observation of dynamic 
processes at moving replication forks in real time. 
Single-molecule analysis uses DNA substrates that 
are attached to the surface of a flow cell. Different 
techniques have been established to directly monitor 
DNA replication. In one approach, the DNA is com¬ 
prised of a 48.5 kb-long lambda phage duplex DNA, 
which harbors a 5' biotin, allowing attachment of the 
substrate to streptavidin that coats the bottom of 
a flow cell. The other end is linked to a bead coated 
with an anti-digoxigenin antibody. Initiation of lead¬ 
ing strand synthesis converts the duplex DNA tem¬ 
plate into ssDNA replication intermediates. ssDNA 
tends to coil, which results in shortening of the DNA 
molecule compared to a duplex portion of the sub¬ 
strate. Thus, DNA replication can be monitored fol¬ 
lowing changes in the length of the template. This 
approach has been applied to study the T7 and E. coli 
system and revealed insight into replication loops 
during processive DNA synthesis (Van Oijen and 
Loparo 2010) (Fig. 3). An alternative approach uses 
5' biotinylated minicircle DNA substrates, which are 
immobilized through a streptavidin interaction. Lead¬ 
ing strand DNA synthesis generates a long ssDNA 
substrate, which can be visualized using complemen¬ 
tary fluorescently labeled oligonucleotides. Coupling 
of leading and lagging strand synthesis converts the 
ssDNA template into duplex DNA, which can be 
monitored using fluorescent intercalating agents 
(Yao et al. 2009). The advantage of this system lies 
in its ability to reveal the growth of replication forks 
on individual molecules in real time. Furthermore, 
application of a constant flow during the reaction 
removes any excess proteins and prevents them 


from re-binding to the replication fork. Thus, direct 
information on the stability of individual replication 
proteins within a replisome is readily available. The 
constant flow also allows for supplementing soluble 
proteins to the reaction, permitting the characteriza¬ 
tion of their function at a moving replication fork. 
This approach has provided information on the 
processivity of the replisome and revealed that the 
replisome travels for an average of 86 kb before 
dissociation of the DNA polymerase. The 
processivity of a single DnaB helicase reaches an 
average of 140 kb. 

Recent advancements have also enabled the visual¬ 
ization of the fate and dynamics of individual replica¬ 
tion forks within E. coli in vivo using fluorescently 
labeled proteins and their visualization by live-cell 
imaging (Reyes-Lamothe et al. 2010). In this 
approach, data points were taken in very short time 
frames of 3 ms, which allows for the visualization of 
events at high resolution within the cell. This method 
has provided important information on the organiza¬ 
tion of sister replication forks with respect to genetic 
loci and shown that independent replication forks fol¬ 
low the path of chromosomal DNA. These data are in 
contrast to a model in which replisomes act within 
a fixed replication factory. Using fluorescently labeled 
replication proteins, this approach has also provided 
insight into the organization and stoichiometry of rep¬ 
lication fork components during the course of active 
DNA synthesis in vivo. This technique has great poten¬ 
tial to visualize dynamic interactions of proteins within 
the replisome but also their interactions with compo¬ 
nents of other cellular pathways including DNA repair 
machineries and transcriptional activities. 

Future Directions 

Our understanding of the composition of replication 
forks and the function of the core components of the 
replisome within the past few decades has provided 
tremendous insight into the process of DNA replica¬ 
tion. The greatest source of information has derived 
from understanding basic mechanisms in bacterio¬ 
phage and E. coli systems, where many of the princi¬ 
pal mechanisms have been found to be conserved in 
eukaryotes, although many questions about bacterial 
replisome functions remain. Among the interesting 
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questions that remain to be addressed in bacterial 
systems such as E. coli include how a processive 
lagging strand polymerase is able to dissociate from 
DNA and bind new RNA primers within a very short 
time frame in order to keep up with the speed of the 
leading strand polymerase. The mechanisms of poly¬ 
merase recycling are far from understood and pose 
a challenging situation for such an efficient and 
processive replisome. Also important are questions 
that address how the replisome deals with template 
lesions and protein blocks and how they resolve these 
adverse encounters. Recent data have provided 
insight into the events of a collision of an RNA poly¬ 
merase with the DNA replication machinery in E. coli 
and have revealed that upon a co-directional colli¬ 
sion, the DNA polymerase can take over an RNA 
template synthesized by the RNA polymerase. In the 
opposite direction, the DNA polymerase stalls and 
requires additional factors, including the transcrip¬ 
tion-repair coupling factor Mfd, to remove the RNA 
polymerase in order to continue. Other helicases, 
including Rep and UvrD, may also be involved in 
helping the replisome deal with protein blocks 
(Pomerantz and O’Donnell 2010). 

One of the greatest challenges is the reconstitution 
of a eukaryotic replisome in vitro. This feat will ulti¬ 
mately open the doors to detailed mechanistic insight 
into the events at a eukaryotic replication fork. Func¬ 
tional eukaryotic replication forks require posttransla- 
tional cell cycle-specific modifications, as well as 
additional factors present at replication forks. The 
recent purification and reconstitution of a pre¬ 
replication complex from S. cerevisiae and the expres¬ 
sion of the two replicative polymerases Pol § and Pol e 
in vitro have already provided insight into the nature of 
these enzymes and will direct future studies (Dua et al. 
2002; Langston and O’Donnell 2008). 

Single-molecule studies will provide invaluable 
new tools for visualizing the dynamics and stabilities 
of individual components upon collision with protein 
blocks or sites of DNA damage and will allow for the 
dissection of the steps that follow upon these events. 
This might include the collapse of replication forks, 
exchange of the replicative polymerase with other 
types of polymerases involved in bypass of lesions 
and repair, or the recruitment of a number of acces¬ 
sory factors. Future research is also needed to under¬ 
stand the functional importance of posttranslational 


modifications of eukaryotic proteins and their role 
during the process of DNA replication. Clearly there 
are numerous important and exciting questions that 
await the application of novel biophysical 
technologies. 

Cross-References 

► ATPase: Overview 

► DNA Polymerase 

► DNA Repair and Recombination 

► Electron Microscopy 

► Fluorescence 

► Fluorescence and FRET in Membranes 

► Helicases 

► Kinetics: Overview 

► Live-Cell Single-Molecule Imaging 

► Post-Translational Modifications 

► Single-Molecule Methods 

References 

Bailey S, Eliason WK, Steitz TA. Structure of hexameric DnaB 
helicase and its complex with a domain of DnaG primase. 
Science. 2007;318:459-63. 

Bailey S, Wing RA, Steitz TA. The structure of T. aquaticus 
DNA polymerase III is distinct from eukaryotic replicative 
DNA polymerases. Cell. 2006;126:893-904. 

Biswas EE, Biswas SB. Mechanism of DnaB helicase of 
Escherichia coli: structural domains involved in ATP hydro¬ 
lysis, DNA binding, and oligomerization. Biochemistry. 
1999;38:10919-928. 

Brewster AS, Chen XS. Insights into the MCM functional mech¬ 
anism: lessons learned from the archaeal MCM complex. 
Crit Rev Biochem Mol Biol. 2010;45:243-56. 

Chang P, Marians Kl. Identification of a region of Escherichia 
coli DnaB required for functional interaction with 
DnaG at the replication fork. J Biol Chem. 2000;275: 
26187-195. 

Chastain PD 2nd, Makhov AM, Nossal NG, Griffith ID. Analysis 
of the Okaaki fragment distributions along single long DNAs 
replicated hy the bacteriophage & 4 proteins. Mol Cell. 
2000;6:803-14. 

Doublie S, Tabor S, Long AM, Richardson CC, Ellenberger T. 
Crystal structure of a bacteriophage T7 DNA replication 
complex at 2.2 A resolution. Nature. 1998;391:251-8. 

Dua R, Levy DL, Li CM, Snow PM, Campbell JL. In vivo 
reconstitution of Saccharomyces cerevisiae DNA polymer¬ 
ase epsilon in insect cells. Purification and characterization. 
J Biol Chem. 2002;277:7889-96. 

Gambus A, van Deursen F, Polychronopoulos D, Foltman M, 
Jones RC, Edmondson RD, Calzada A, Labib K. A key role 




1338 


Macromolecular Crowding: Effects on Association Equiiibria and Kinetics 


for Ctf4 in coupling the MCM2-7 helicase to DNA polymer¬ 
ase alpha within the eukaryotic replisome. EMBO J. 
2009;28:2992-3004. 

Georgescu RE, Kurth I, Yao NY, Stewart J, Yurieva O, 
O’Donnell M. Mechanism of polymerase collision release 
from sliding clamps on the lagging strand. EMBO J. 
2009;28:2981-91. 

Gomez-Llorente Y, Fletcher RJ, Chen XS, Carazo JM, San 
Martin C. Polymorphism and double hexamer structure in 
the archaeal minichromosome maintenance (MCM) helicase 
from Methanobacterium thermoautotrophicum. J Biol Chem. 
2005;280:40909-15. 

Johansson E, MacNeill SA. The eukaryotic DNA polymerases 
take shape. Trends Biochem Sci. 2010;35:339-47. 

Kong XP, Onrust R, O’Donnell M, Kuriyan J. Three- 
dimensional structure of the beta subunit of E. coli DNA 
polymerase III holoenzyme: a sliding DNA clamp. Cell. 
1992;69:425-37. 

Komberg A, Baker TA. DNA replication. 2nd ed. New York: 
W.H. Freeman; 1992. 

Earners MH, Georgescu RE, Lee SG, O’Donnell M, Kuriyan 
J. Crystal structure of the catalytic alpha subunit of 
E. coli replicative DNA polymerase III. Cell. 2006; 126: 
881-92. 

Langston LD, O’Donnell M. DNA polymerase delta is highly 
processive with proliferating cell nuclear antigen and 
undergoes collision release upon completing DNA. J Biol 
Chem. 2008;283:29522-531. 

Masai FI, Matsumoto S, You Z, Yoshizawa-Sugata N, Oda M. 
Eukaryotic chromosome DNA replication: where, when, and 
how? Annu Rev Biochem. 2010;79:89-130. 

Mclnemey P, Johnson A, Katz F, O’Donnell M. Characteriza¬ 
tion of a triple DNA polymerase replisome. Mol Cell. 
2007;27:527-38. 

Nick McElhinny SA, Gordenin DA, Stith CM, Burgers PM, 
Kunkel TA. Division of labor at the eukaryotic replication 
fork. Mol Cell. 2008;30:137-144. 

O’Donnell M, Kuriyan J. Clamp loaders and replication initia¬ 
tion. Curr Opin Struct Biol. 2006; 16:35^1. 

Park K, Debyser Z, Tabor S, Richardson CC, Griffith JD. For¬ 
mation of a DNA loop at the replication fork generated by 
bacteriophage T7 replication proteins. J Biol Chem. 1998; 
273:5260-70. 

Pomerantz RT, O’Donnell M. What happens when replication 
and transcription complexes collide? Cell Cycle. 2010;9: 
2537^3. 

Remus D, Beuron F, Tolun G, Griffith JD, Morris EP, Diffley JF. 
Concerted loading of Mcm2-7 double hexamers around 
DNA during DNA replication origin licensing. Cell. 2009; 
139:719-30. 

Reyes-Lamothe R, Sherratt DJ, Leake MC. Stoichiometry and 
architecture of active DNA replication machinery in 
Escherichia coli. Science. 2010;328:498. 

Steitz TA. A mechanism for all polymerases. Nature. 1998; 
391:231-2. 

Van Oijen AM, Loparo JJ. Single-molecule studies of the 
replisome. Annu Rev Biophys. 2010;39:429^8. 

Yao NY, Georgescu RE, Finkelstein J, O’Donnell ME. Single¬ 
molecule analysis reveals that the lagging strand increases 
replisome processivity but slows replication fork progres¬ 
sion. Proc Natl Acad Sci USA. 2009; 106:13236^1. 


Macromolecular Crowding: Effects on 
Association Equilibria and Kinetics 

Allen P. Minton 

Laboratory of Biochemistry and Genetics, National 
Institute of Diabetes and Digestive and Kidney 
Diseases, U.S. Department of Health and Human 
Services, National Institutes of Health, Bethesda, 

MD, USA 

Synonyms 

Molecular crowding and binding affinity 

Definition 

Macromolecular crowding is a term used to denote the 
effect of high fractional volume occupancy hy nomi¬ 
nally inert solutes upon the equilihrium and transport 
properties of the same or other solutes. It is thought to 
be of biological significance, as most or all physiolog¬ 
ical fluid media contain a significant volume fraction 
of soluble macromolecules - proteins, nucleic acids, 
and polysaccharides - ranging from around 0.05 
(hlood plasma, lymph) to as great as ca. 0.4 (eye lens, 
mitochondrial inner matrix). Theory predicts that in 
the absence of compensating attractive interactions 
between “inert” volume-occupying and reactant spe¬ 
cies, macromolecular associations between reactants 
are enhanced in highly volume-occupied, or crowded, 
media relative to those in dilute solution. The extent of 
enhancement depends strongly upon the sizes and 
shapes of reactant and product species relative to 
those of predominant crowding species. 

Thermodynamic Formulation 

The effect of crowding on association equilibria may 
be illustrated using a simple prototypical binding reac¬ 
tion: the association of two macromolecules to form 
a heterodimer: A -i- B ^ AB. A thermodynamic cycle 
is constructed as illustrated in Fig. 1. In this figure, the 
standard state free energy change accompanying for¬ 
mation of the dimer in dilute solution is denoted by 
and that in crowded solution by AFab, and the 
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Macromolecular Crowding: Effects on Association Equilib¬ 
ria and Kinetics, Fig. 1 Thermodynamic cycle relating 
heteroassociation of A and B in dilute solution (upper row) and 
crowded solution (lower row). Indicated standard state free 


energy changes refer to Helmholtz free energy in a constant 
volume system or Gibbs free energy in a constant pressure 
system (figure reproduced from Zhou et al. (2008) with 
permission) 


Standard state free energy change accompanying trans¬ 
fer of solute species i from dilute to crowded solution is 
denoted by Since free energy is a state func¬ 

tion, it follows that 

AAFab = AFab - AF% 

= - (AF7"''* -f AF’7"''^) (1) 

The effect of crowding on the association equilib¬ 
rium constant is quantified as the “crowding factor” 

F = Kab/K% = cx^{-AAFabIRT) ( 2 ) 

If the free energy change accompanying transfer of 
reactants into the crowded medium is more positive 
than the free energy change accompanying transfer 
of product, then AAFab < 0 and crowding will 
enhance heteroassociation. Conversely, if the free 
energy change accompanying transfer of reactants 
into the crowded medium is more negative than the 
free energy change accompanying transfer of 
product, then AAFab > 0 and crowding will inhibit 
heteroassociation. 

Effects of Volume Exclusion 

Early theoretical and experimental studies of the effect 
of crowding on macromolecular associations focused 
on the role of volume exclusion arising from the 
mutual impenetrability of macromolecules, since 


steric repulsion is present independent of, and in addi¬ 
tion to, any other reactant-crowder interaction. In order 
to estimate the contribution of steric repulsion to 
AE^owd^ it is useful to employ hard particle represen¬ 
tations of native globular proteins in conjunction with 
established theory of hard particle fluids (Minton 
1998). A general prediction of such treatments is that 
for a particle of given volume, AF“°"‘* increases with 
surface area (Minton 1981). Hence, steric repulsion in 
crowded media is expected to enhance any reaction 
that results in a reduction of solvent-exposed surface 
area, such as the formation of compact aggregates of 
macromolecules, including soluble oligomers, fibrils 
and crystallites, or the folding of a polypeptide or 
polynucleotide chain into a compact globule (Zhou 
et al. 2008). For example, the effect of excluded vol¬ 
ume on the equilibrium association constant for dimer¬ 
ization may be estimated by modeling the crowding 
species as a hard sphere of volume v^, dilute monomer 
as a hard sphere of volume Vi, and the dimer as a hard 
sphere of volume 2vi. The dependence of the crowding 
factor for formation of dimer r{(f)) — K 2 {(t>)/K 2 { 0 ) 
upon the volume fraction of crowder (j), calculated 
using the scaled particle theory of hard sphere mixtures 
(Lebowitz et al. 1965), is plotted in Fig. 2. Simple 
excluded volume models have been shown to account 
semiquantitatively and, in some cases, quantitatively 
for a variety of experimentally observed effects of 
crowding upon association equilibria (reviewed in 
Zhou et al. 2008). 

The effect of volume exclusion upon reaction kinet¬ 
ics is somewhat more complex. For slow reactions 
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Macromolecular Crowding: Effects on Association Equilib¬ 
ria and Kinetics, Fig. 2 Scaled particle theory calculation of 
the dependence of the crowding factor for dimer formation upon 
fractional volume occupancy of crowder and upon relative sizes 
of tracer and crowder. Curve a\ Vi = Vet curve h\ Vi = 10 Vc 

obeying transition-state kinetics, excluded volume 
would be expected to stabilize the more compact tran¬ 
sition state relative to the separated reactants and, 
hence, accelerate association (Minton 1983). How¬ 
ever, the rate of fast association reactions is limited 
by the rate with which reactant molecules encounter 
each other and attain the appropriate mutual orienta¬ 
tions necessary for docking, and hence the rates of 
translational and rotational diffusion. Both transla¬ 
tional and rotational diffusions are slowed by 
crowding, so one would expect excluded volume 
effects to decelerate fast associations (Minton 1981, 
1990). Taking both factors into account, the associa¬ 
tion rate constant of a particular reaction that is slow in 
the absence of excluded volume effects Is expected to 
undergo an initial increase with increasing volume 
occupancy by inert macromolecules, reach 
a maximum, and then decrease with further increase 
in volume occupancy, as schematically indicated in 
Fig. 3. 

Effects of Attractive Interactions Between 
Crowding and Test Molecules 

Early theoretical and experimental studies of 
crowding-induced phenomena employed theoretical 
and experimental models of systems in which excluded 
volume was assumed to provide the dominant mode of 



Macromolecular Crowding: Effects on Association Equilib¬ 
ria and Kinetics, Fig. 3 Effect of increasing volume occu¬ 
pancy upon reaction rate constant for association. Long dashed 
line', transition state limited rate constant. Shoil dashed line'. 
diffusion limited rate constant. Solid line', overall rate constant 
(figure reproduced from Minton (1990) with permission) 

interaction between crowding and test molecules. Sys¬ 
tems selected for experimental study contained 
a single “crowder,” or concentrated macrosolute (pro¬ 
tein or polymer) in addition to trace species whose 
reactions were being studied. Some, but not all, of 
these Crowders did in fact behave as inert particles. 
For example, the effect of volume occupancy upon the 
thermodynamic properties of several concentrated pro¬ 
tein solutions is well described by simple hard sphere 
models (Minton 2007). However, other proteins 
thought to be monomeric in solution show evidence 
of reversible weak or indefinite self-association at suf¬ 
ficiently high concentration (see, e.g., Jimenez et al. 
2007), suggesting that nonspecific “stickiness” deriv¬ 
ing from electrostatic attraction between oppositely 
charged regions and hydrophobic attraction between 
nonpolar regions of protein surfaces may be a common 
property of proteins. In addition, the interaction 
between some polymers and some proteins are not 
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well described by excluded volume theory (Minton 
1983; Phillip et al. 2009; Jiao et al. 2010) prompting 
recent investigations into the role of nonspecific attrac¬ 
tive interactions between crowding proteins or poly¬ 
mers and trace proteins. 

The change in free energy of association associated 
with the addition of solute, AAF^^, may be 
decomposed into contributions arising from excluded 
volume (steric) interactions and “soft” attractive and 
repulsive interactions between crowding agent(s) and 
reactants and/or product. 


r indicates that AA//^‘ « 0, which could mean in 
principle that soft interactions between the crowder 
and reactants are identical to those between the 
crowder and products but more likely indicates that 
soft interactions between crowder and both reactants 
and products are energetically negligible in compari¬ 
son to the corresponding excluded volume 
interactions. 

When > 0, Eq. 6 predicts that at a partic¬ 

ular temperature called the theta or compensation tem¬ 
perature, defined as 


AAFab = AAE“™' + AAF^f (3) 


The excluded volume contribution to changes in the 
free energy of association is purely entropic, whereas 
the soft contribution will consist of both energetic (or 
enthalpic) and entropic components: 

AAE“'°' = -TAASS^'^’' (4) 

AAF^f = AA//-i' - TAAS^‘ (5) 


and the crowding factor defined in Eq. 2 rewritten as 


lnr = - 


AAFab 

RT 


AA//|^'‘ 1 
R f 


AAA^"°' -f AAS^°ft 
R 


(6) 


A fundamental distinction between these two con¬ 
tributions is that while the excluded volume contribu¬ 
tion to a crowding effect is due entirely to the sizes and 
shapes of reactants and crowding species, soft interac¬ 
tions are likely to depend upon the chemical composi¬ 
tion and details of the primary structure of the 
interacting molecules, such as the locations of charged 
and hydrophobic regions of the solvent-accessible 
surface. 

It follows from Eq. 6 that the temperature depen¬ 
dence of r {not Kab) is a diagnostic for the presence of 
soft interactions between crowding agents and reac¬ 
tants and/or products. If F increases with increasing 
temperature, then AA//|‘^’^ > 0, indicating that 
crowding agents are preferentially attracted to reac¬ 
tants. If r decreases with increasing temperature, 
then AA//^^* < 0, indicating that crowding agents are 
preferentially attracted to product. The absence of 
a measurable effect of temperature variation on 


Te 


AAS^”°‘ -f AA5^°ft 


(7) 


the effects of attraction and excluded volume repulsion 
between reactants and crowders will cancel each other, 
and no net crowding effect will be observed (Jiao et al. 
2010). If excluded volume theory predicts that 
a particular association would be enhanced by addi¬ 
tion of a high concentration of crowding agent, the 
absence of an observable enhancement would indi¬ 
cate that the experiment was carried out at or near the 
compensation temperature characteristic of the par¬ 
ticular experimental system - reactant(s), product(s), 
and crowding agent(s) - studied. Characterization of 
the respective contributions of excluded volume and 
attractive interactions to the total effect of crowding 
upon association equilibria requires measurement of 
the dependence of F upon both crowder concentration 
and temperature. 


Cross-References 

► Thermodynamics of Biomolecular Interactions 
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Macromolecular Crystallography: Crystal 
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Uppsala University, Uppsala, Sweden 

Synonyms 

Macromolecular crystallization 


Definition 

► Macromolecular crystallography requires that the 
macromolecules (proteins, nucleic acids, viruses, ribo¬ 
somes, protein complexes, etc.) be prepared as 
ordered, three-dimensional crystals. 


Introduction 

Nucleation is the formation of the first-ordered aggre¬ 
gates that act as a template upon which more of the 
molecules accrue. This leads to the growth of an 
ordered three-dimensional solid phase (a crystal). 
Nucleation and crystal growth can only occur in 
a supersaturated solution. Thus, the first step in crystal 
preparation is to create a supersaturated solution. 

Crystallization Setup Basics 

While numerous techniques exist for macromolecular 
crystallization trials, the vapor-diffusion setup is the 
most popular one (McPherson 1999). In this setup, 
a small droplet of the macromolecule solution is 
mixed with a precipitant in a 1:1 ratio. (Other ratios 
are also possible.) A larger volume of the precipitant is 
placed in a reservoir outside the droplet and the 
experimental setup is sealed (Fig. 1). If a 1:1 ratio of 
macromolecule to precipitant is used, then the initial 
concentration of the precipitant in the droplet can be 
assumed to be one-half of that in the reservoir solution. 
As a consequence, the droplet has a higher vapor 
pressure of water than the reservoir solution. This 
inequilibrium causes water and other volatile species 
(in the form of a vapor) to leave the droplet until both 
the droplet and reservoir solution have equal vapor 
pressures of water (Bergfors 2009). 

As water vapor leaves the droplet, the droplet 
decreases in volume and as a result, the concentration 
of both the macromolecule and the precipitant 
increases. Ideally, this drives the macromolecule 
solution into a state of supersaturation. The supersatu¬ 
rated state is a prerequisite, with no guarantee, for 
nucleation to occur. Usually many experiments are 
done to determine the right conditions to bring about 
supersaturation and nucleation. Finding the right ones 
is still mostly a trial-and-error process. A well-ordered 
crystal is only one of many possible solid states that 
can evolve from a supersaturated solution. Disordered 
precipitates and other solid phases, like dense oils and 
gels, are far more likely. 

The most commonly used precipitants in the 
context of macromolecular crystallization are: 
polyethylene glycols (molecular weights from 
400 to 20,000), salts (e.g., chlorides, sulfates, citrates, 
phosphates), and organic solvents (e.g., ethanol, 
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Fig. 1 Vapor-diffusion setup in a plastic tray designed for 
crystallization experiments. The setup is sealed with 
a transparent, airtight seal. The droplet volume ranges from 
100 nl up to a few microliters. The reservoir volume is much 

isopropanol, 2-methyl-2,4-pentanediol). The precipi- 
tants can also be used in combination with each other 
and contain a buffer. Thus, a reservoir solution might 
consist of 20% (w/v) polyethylene glycol 4000 (as the 
main precipitant) and 0.2 M sodium chloride (a salt 
and co-precipitant) in a 0.1 M buffer, pH 6.5. The 
temperature plays an important role in the rate of 
equilibration between the droplet and the reservoir 
solution. It also affects the solubility of the macro¬ 
molecule. The crystallization setups are stored at dif¬ 
ferent temperatures, from 4°C to 37°C, with the 
majority of them being tested at room temperature 
(20°C), mostly for the convenience of the 
experimenters. 

Initial Screening for Crystallization 
Conditions 

Many factors affect the solubility of a macromole¬ 
cule and thus can be exploited to drive it into 
a state of supersaturation. Some of these are: the 
concentrations of the macromolecule and precipi¬ 
tant, the type of precipitant, the pH and type of 
buffer, the ionic strength, the temperature, as well 
as ligands, cofactors, and small-molecule additives, 
detergents, etc., that may bind to the macromole¬ 
cule. Given the large number of parameters, at 
present there are no algorithms to reliably predict 
which factors or combinations of them will 


airtight seal 


vapor space 




droplet 

reservoir 

solution 


greater: 50-1,000 gl. (a) Water vaporizes (symbolized by the 
arrow) from the droplet and condenses onto the surface of the 
reseiwoir solution, (b) Evaporation of water from the droplet 
leads to a decrease in droplet volume 

crystallize a given macromolecule. Initial screen¬ 
ing experiments often rely on commercial kits 
containing ready-made mixtures (conditions) with 
variable concentrations of precipitants, salts, and 
small-molecule additives in different buffers. The 
selection of conditions is loosely based on various 
incomplete factorial designs and other types of 
► sparse sampling or databases of previously 
successful conditions. 

To minimize the amount of materials (in particular, 
the macromolecule) in the crystallization trials, many 
laboratories use robotic workstations. Robotic liquid 
dispensers can accurately dispense hundreds of 
nanoliter-sized droplets in a few minutes. Some robots 
also dispense the reservoir solutions. More advanced 
robotic processes incorporate hardware and software 
for monitoring the crystallization experiments at regu¬ 
lar intervals. 

Optimization of the Crystals 

Crystals can appear in the initial screens within hours 
to days to months, but it is estimated that only 
0.2-0.5% of the experiments yield a result that can be 
regarded as “crystalline.” The crystals that appear in 
the initial screen are usually not of high enough quality 
for diffraction studies, which means that subsequent 
rounds of experiments are required to optimize 
them (Fig. 2). 
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Macromolecular Crystallography: Crystal Preparation, 
Fig. 2 Crystals from an initial screen and after optimization, 
(a) This shower of needle-like crystals was obtained from 
an initial screen. The crystals are too thin (<2 pm) for X-ray 
diffraction. Nonetheless, the condition is a starting point 


The researcher might also want to study potential 
inhibitors or substrates of the macromolecule. 
For example, small-molecule ► fragment screens or 
molecules from other screening procedures are 
sometimes used to create new drug leads. 
The fragments/inhibitors/substrates, etc. can be dif¬ 
fused into existing crystals (“soaking”), or they can 
be added to the macromolecule solution before the 
crystallization trials (“co-crystallization”). Soaking is 
the easier of the two and is therefore usually tried first 
(Fig. 3). Unlike crystals of small molecules, macromo¬ 
lecular crystals contain anywhere from 20% to 80% 
solvent (Giege and McPherson 2001). The solvent 
channels make macromolecular crystals quite fragile 
to manipulate but they are also the reason why it is 
possible to diffuse molecules into the crystals for 
binding studies. 

Cryoprotection of the Crystals 

Once the crystal growth conditions have been 
optimized, the crystal must be removed from the drop¬ 
let and analyzed in an X-ray beam. The researcher uses 
specially made loops to remove the crystal from the 
droplet after which the loop with its crystal is 
immediately plunged into a cryogen (usually liquid 
nitrogen) for transport to the X-ray source. While in 
the X-ray beam, the crystal is kept continuously under 
a stream of cold nitrogen gas. Collection of ► X-ray 
diffraction data from the crystal is done at cryogenic 
temperatures to minimize radiation damage to the 


for designing the optimization experiments, (b) To be suitable 
for testing in the X-ray beam, the crystal must be single 
and of a certain volume (at least 0.001 mm^). The crystal 
shown here is quite large (0.2 x 0.25 x 0.6 mm) and diffracted 
to 2.7 A 
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Fig. 3 A crystal soaking experiment. This crystal of lysozyme, 
0.3 X 0.3 X 0.3 mm, has been grown by vapor diffusion. The 
droplet volume (after equilibration against the reservoir solu¬ 
tion) is approximately one-half of the initial volume of 4 pi. 
A volume (0.2 pi) of a red dye (Eosin scarlet) has been carefully 
pipetted into the droplet. Due to its limited solubility in the 
constituents of the droplet, much of the dye has precipitated, 
but some of it still appears to have diffused into the crystal. After 
about 4 h, the crystal had turned completely red 

crystal (Garman and Owen 2006). It is usually possible 
to collect a complete data set from a single crystal 
before its internal order is destroyed by the energy of 
the X-rays. 

Cross-References 

► Fragment Screen 

► Lysozyme - Computational Studies 
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► Macromolecular Crystallography: Overview 

► Sparse Sampling in NMR 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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Macromolecular Crystallography: Future 
of FELs for Structural Biology 

Colin Nave 
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Synonyms 

Coherent diffraction imaging (CDI) or coherent 
diffractive imaging 

Definition 

Free Electron Laser (FEL) - An instrument which 
amplifies coherent light via interactions with free 
(i.e., unbound) electrons passing through a periodic 
magnetic field. 

Basic Characteristics 

In a free electron laser (EEL), relativistic electrons pass 
through a long periodic magnetic array (an undulator) 
and interact with an electromagnetic radiation field 
causing an amplification of the radiation (Saldin et al. 
1999; Paganin 2006). The initial radiation field can be 
an external one or be produced spontaneously within 
the undulator. The electrons bunch up during the 


amplification process resulting in the production of 
intense beams of coherent radiation with very short 
pulse lengths. Since the first demonstration of a PEL 
operating in the infrared (Madey 1971), a large amount 
of effort has gone into developing reliable devices oper¬ 
ating at shorter wavelengths including the x-ray region. 
One of the drivers behind the developments is to obtain 
structural information from single, or small assemblies 
of, biological macromolecules via x-ray scattering. 

PEL radiation provides a number of advantages for 
such studies. Due to the short pulse length 
(e.g., 10-100 fs), it is possible to record scattering infor¬ 
mation before radiation damage occurs. Some aspects of 
sample preparation (e.g., the need to obtain crystals of 
sufficient size) are simpler with a pulsed source as the 
individual molecules or molecular assemblies can be 
sprayed into the beam. Pinally, it should be possible to 
examine fast structural changes with such a source by 
varying the delay between initiating a reaction (e.g., with 
a pump laser) and the probe x-ray pulse. 

In crystallography, the Bragg diffraction does not 
normally sample the scattering from the contents of the 
unit cell at sufficiently fine intervals to reconstruct an 
image from the intensities alone. It was pointed out 
(Sayre 1952) that, for a centrosymmetric structure, it 
should be possible to obtain a unique image purely 
from the intensities if the transform is adequately sam¬ 
pled. More recently, developments have occurred to 
apply the oversampling to the more general case of 
noncentric structures. Prom the experimental point of 
view, the data needs to be collected by a technique 
known as coherent diffraction imaging (CDI). In this 
technique, the sample needs to be illuminated 
coherently. A useful tutorial on coherence for the 
x-ray case can be found in van der Veen and Pfeiffer 
(2004) with a more comprehensive treatment in 
Paganin (2006). The intensity of the scattered x-rays 
is recorded on a detector which needs to have 
a resolution capable of recording the fringes in the 
scattering pattern from an object of the size studied. 
Many scattering patterns with different orientations are 
required to build up a complete set of 3D data. The 
method is therefore suitable for the study of objects 
which are robust to radiation damage effects (where 
multiple views can be obtained) or for cases where 
multiple copies of identical particles are available. 
The latter cases include structure determination of 
biological macromolecules, macromolecular assem¬ 
blies (including those with internal symmetry such as 
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some viruses), and small protein crystals. The method 
is also being developed to examine single biological 
cells using low-dose methods on non-pulsed sources or 
using FELs. In the latter case (Bergh et al. 2008), 
images corresponding to projections through the 
sample would be obtained. It might also be possible 
to obtain some 3D information using simultaneous 
beams from different directions. 

The prospect of the availability of EEL radiation in 
the x-ray region has stimulated several developments 
necessary for biological structure determination. 
Detailed simulations of the interaction between the 
x-ray beam and the sample have been carried out in 
order to determine the beam properties required to 
record CDI data before the sample is destroyed by the 
intense x-ray pulse (e.g., Neutze et al. 2000). To recover 
the images, algorithms are being developed based on 
iterative transforms between real and reciprocal space 
with constraints applied at each stage (Marchesini et al. 
2003; Paganin 2006). Methods of delivering hydrated 
samples in to the beam are also being pursued. 

The recent availability of an operational EEL work¬ 
ing in the x-ray region (the LCDS at Stanford) has 
brought together these developments to produce initial 
results at 1.8 keV photon energy. X-ray diffraction 
patterns were obtained from protein nanocrystals of 
photosystem 1 showing clear fringes between the 
diffraction spots due to the coherent illumination (Chap¬ 
man et al. 201 1). By processing data from over 15,000 
of the best images (out of more than three million 
recorded) it was possible to obtain a crystallographic 
data set. Using standard crystallographic techniques, 
a corresponding electron density map was calculated 
showing the expected features at 8.5 A resolution. In 
another example, (Seibert et al. 2011) diffraction pat¬ 
terns of mimivims were recorded and images of the 
0.75 pm virus particles were obtained at 32 nm resolu¬ 
tion. In this case, the oversampled scattering from the 
particle was recorded and the images of the particle 
recovered using the iterative transform technique. 

In both cases, the radiation dose used to obtain the 
diffraction patterns was over 20 times the maximum 
tolerable dose which has been found for the case of 
protein crystallography at cryo-temperatures. This indi¬ 
cates that the expected benefits of the short EEL pulses 
occur under real experimental conditions. The field is 
likely to develop rapidly following the initial success on 
nanocrystals and mimivims particles. Eurther develop¬ 
ments in experimental procedures (e.g., sample delivery. 


background reduction, detectors) as well as analysis pro¬ 
cedures (e.g., recovering low-contrast images from the 
scattering data) can be expected. It is therefore hoped 
that the present brief review of the use of x-ray FELS for 
stractural biology will soon be out of date. 
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Definition 

Given its capability of delivering atomic-level detailed 
models of large and complex protein structures, macro- 
molecular crystallography, based on X-ray diffraction, 
is the dominating structure determination technique in 
structural biology and dmg discovery. 

Introduction 

Diffraction methods using X-rays, neutrons, and elec¬ 
trons are powerful techniques delivering information 
about macromolecular structure. The motivation to 
use X-ray crystallography as the primary means of 
macromolecular structure determination is founded 
on the fact that accurate and precise molecular models 
of large and complex structures - often revealing 
details at the atomic resolution level - can be obtained 
rapidly and reliably by means of X-ray crystallogra¬ 
phy. About 90% of all structure models deposited in 
the Protein Data Bank (PDB) are determined by X-ray 
crystallography, while the remaining 10% are deter¬ 
mined by solution Nuclear Magnetic Resonance 
(NMR) spectroscopy, largely in the 5-25 kDa range 
of molecular weight. 

Advantages and Limitations of the Method 

X-ray crystallography is the most important struc¬ 
ture determination technique in structural biology 
and drug discovery, given its powerful capability of 
delivering atomic-level detailed models of protein 
structures (Rupp 2009). Elucidating the atomic 
details of molecular interactions is particularly 
important for example in the clarification of enzy¬ 
matic mechanisms and is essential for drug target 
structures serving as leads in rational drug design. 
In addition to providing accurate models of molecu¬ 
lar structures, another advantage of crystallographic 
structure determination is that no principal difficulty 
limits the accurate description of very large molec¬ 
ular structures and molecular complexes, as 
evidenced in the nearly 2 MDa structure of the 50S 
ribosomal suhunit 3 containing 27 different proteins 
(^4,000 residues) and the rihosomal 5S and 23 S 
RNA, of together 2,833 nucleotides (Ban et al. 
2000 ). 


Relevance to Biological State 

Despite the fact that proteins must self-assemble into 
solid crystals, their biological relevance with respect to 
their native solution state is generally maintained. 
Crystals are formed through a loose periodic network 
of weak, non-covalent interactions between molecules, 
and they contain large solvent channels which allow 
relatively free diffusion of small molecules through the 
crystal. This solvent access maintains a solution envi¬ 
ronment and also accommodates conformational free¬ 
dom for surface-exposed side chains or loops as well as 
some “breathing” motions of the structure core. Com¬ 
parison of many Nuclear Magnetic Resonance (NMR) 
solution structure ensembles with crystallographic 
structure models has shown that the core structure of 
protein molecules remains unchanged compared to the 
solution state during crystallization (Brunger 1997). In 
addition, enzymes packed in crystals even maintain 
biological activity. The maintained activity in crystals 
actually creates a challenge for the crystallographer, 
often necessitating the design of inactive enzyme sub¬ 
strate analogues or substitutes in order to dissect the 
molecular reaction mechanisms. 

Flexibility, Dynamics, and Disorder 

The maintenance of the core structure and of enzy¬ 
matic function demonstrates that crystals structures are 
a very good approximation of the native protein struc¬ 
ture. Nonetheless, highly flexible or mobile regions, 
frequently the amino- or carboxyl-termini of the pro¬ 
tein chain or flexible loops connecting secondary struc¬ 
ture elements, can be poorly defined or even absent in 
the electron density and thus can be modeled only with 
limited confidence. In certain situations, flexible and 
dynamic regions of a protein molecule can be rigidly 
fixed in a specific conformation due to crystal packing 
interactions. This situation represents in most cases 
just a snapshot of one possible conformation out of 
many, and such a specific conformation may locally 
not represent the protein structure in solution. 
Although no large-scale dynamic movements can 
take place in a crystal, there is in fact quite some 
information present in the crystal structure about the 
dynamic behavior of the molecules. High B-factors for 
an atom (a generic displacement measure indicating 
how well the model is defined in that region) are 
indicative of flexibility or disorder in that local region. 
Analysis of concerted movements of entire regions of 
the molecule by Translation-Libration-Screw (TLS) 



1348 


Macromolecular Crystallography: Overview 


parameterization (Painter and Merritt 2006) during 
refinement can give indications for the propensity 
toward even large domain movements. 

Experiment 

The fundamental experiment in crystallography is 
deceptively simple: A crystal of the material of interest 
is placed into a finely focused brilliant X-ray beam, 
diffraction patterns are recorded, and the atomic struc¬ 
ture of the molecules giving rise to the diffraction 
pattern is reconstructed from the diffraction data 
(Fig. 1). While the process description appears quite 
straightforward, there are a number of practical as well 
as principal challenges that need to be overcome. 

Practical Challenges 

ftoteins, nucleic acids, or molecular complexes thereof 
- by their nature as large and flexible macromolecules - 
seldom self-assemble readily into regular, periodically 
repeating arrangements typical for crystals (Bergfors 
2009). Growing well-diffracting protein crystals can 
prove quite challenging. Obtaining the material to be 
crystallized is nontrivial, and more often than not the 
protein needs to be modified or engineered so that it in 
fact can crystallize. An additional complication during 
the data collection process originates from the fact that 
protein crystals are highly sensitive to radiation damage 
by ionizing X-ray radiation. To prevent decay of the 
crystals through radiation damage, flash-cooling of crys¬ 
tals to cryogenic temperature is near-universal practice 
(Holton 2009). However, many crystals are difficult to 
flash-cool, and it is hard to predict which cryo-protection 
conditions will work. 

Phase Problem 

Visible light scattered from objects can be focused 
through refractive lenses to create a magnified image 
of the object. This is not the case for X-rays, which are 
also electromagnetic radiation but of several orders 
shorter wavelength - the refractive index of X-rays in 
different materials is essentially equal and close to 
unity, and no refractive lenses can be constructed for 
X-rays. Instead, the electron density of the scattering 
molecular structure must be reconstructed by Fourier 
transform techniques. In itself, the Fourier reconstruc¬ 
tion from reciprocal diffraction space back into direct 
molecular space poses no principal difficulties, except 



Macromolecular Crystallography: Overview, Fig. 1 The 

principle of X-ray structure determination. A crystal mounted 
on a goniostat with at least one rotatable axis is exposed to 
a finely collimated, intense X-ray beam in the 5-20 keV energy 
range (~2.3-0.6 A wavelength). Individual diffraction images 
containing hundreds of reflections (diffraction spots) are 
recorded during small rotation increments of the crystal and 
combined into a diffraction data set. The basic mathematical 
tool of back-transformation from reciprocal diffraction space 
into direct space is the Fourier transform (FT), which together 
with separately acquired phases for each diffraction spot allows 
synthesizing or reconstructing the electron density {blue grid) of 
the molecules self-assembled into the diffracting crystal. An 
atomic model of the structure is then built into the 
three-dimensional electron density (Figure © 2011 from Bio- 
molecular Crystallography by Bernhard Rupp. Reproduced by 
permission of Garland Science/Taylor & Francis LLC) 


that for reconstruction of electron density, two terms 
are needed as Fourier coefficients: the structure factor 
amplitudes - readily accessible in form of the square 
root of the measured and corrected diffraction spot 
intensities - and as a second term for each observed 
diffraction spot, its relative phase angle. These phase 
angles are not directly accessible and must be supplied 
by additional phasing experiments (Fig. 2). 

Phase Determination 

A critical principle of any diffraction experiment is 
that during the recording of the diffraction pattern 
information is lost, and the back-transform of the 
data into a molecular model is not possible from 
diffraction data alone. While the intensities of the 
X-rays scattered from a crystal in a diffraction exper¬ 
iment can be readily determined (Dauter 2010), the 
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Macromolecular Crystallography: Overview, Fig. 2 The 

crystallographic phase problem. The measurable component of 
the Fourier transform of the crystal is only the scalar structure 
factor amplitude F(h) proportional to square root of the diffrac¬ 
tion spot intensity /(h). The missing phases (p(h) must be sup¬ 
plied by additional phasing experiments or in form of model 


phases via molecular replacement. The two necessary Fourier 
coefficients in the back-transform formula are emphasized in 
blue (Figure © 2011 from Biomolecular Crystallography by 
Bernhard Rupp. Reproduced by permission of Garland Sci¬ 
ence/Taylor & Francis LLC) 


phase relations between the X-rays in the diffraction 
data are lost. The phase angles, however, contain 
substantive information required to reconstruct 
a protein structure from the diffraction data. This 
resulting problem is aptly called the “phase problem” 
in crystallography, and numerous strategies to obtain 
the missing phases (to solve the structure) have been 
devised (Taylor 2003). The most generally applicable 
de novo phasing techniques do not require any prior 
structural information. De novo phasing techniques 
are based on determining the heavy atom substructure 
of a labeled protein crystal, from which phase infor¬ 
mation can be derived. If a model of a structurally 
similar protein or substantial parts of the structure are 
already known, molecular replacement techniques 
can be used to obtain starting phases. After either 
phasing method has succeeded, the resulting models 
are subsequently refined and rebuilt until the best 
correspondence between diffraction data, model, 
and prior chemical knowledge is obtained. 

Substructure Methods 

In the absence of a suitable known structure model, 
phases must be determined de novo by a separate 
phasing experiment, hence also the name experimen¬ 
tal phasing. Experimental phasing methods are gen¬ 
erally applicable and depend on the determination of 
a marker atom substructure by exploiting - often very 


weak - intensity differences between isomorphous 
data sets (i.e., maintaining the same crystal structure). 
Isomorphous differences are found between native 
and derivative crystals that contain a heavy atom 
soaked into the crystal, forming the basis for tradi¬ 
tional isomorphous replacement phasing methods. 
Anomalous differences in reflection data sets from 
crystals containing anomalous scatterers - generally 
atoms heavier than H, C, N, O - are most frequently 
used for experimental phase determination. Even the 
very small anomalous signal from natively present 
sulfur can be used in certain cases and when measured 
carefully to determine the sulfur substructure 
(Sheldrick 2010). The heavy atom substructure then 
provides the initial phases required to reconstruct the 
electron density. One popular phasing technique 
exploits the fact that the amino acid methionine can 
be replaced with seleno-methionine by 
overexpressing the protein in a suitable expression 
system. The Se atoms then provide a site-specific 
source of anomalous phasing signal. Together with 
density modification techniques even a single anom¬ 
alous data set may suffice to solve the enantiomorph 
ambiguity inherent in the marker atom substructures 
and solve a protein structure. Experimental phasing 
accounts for about one fourth of all structures depos¬ 
ited in the PDB, and anomalous phasing methods 
dominate. 
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Macromolecular Crystallography: Overview, Fig. 3 The 

concept of isomorphous difference data. The top line shows the 
“Gedankenexperiment” in real space of subtracting a native 
protein crystal from an exactly isomorphous derivative crystal. 
The light atoms “cancel” out, and only the heavy marker atom 
remains in the difference crystal. While one cannot produce 
a real difference crystal, a difference diffraction pattern from 


the differences between experimental data of the derivative and 
the native protein can be obtained. The difference diffraction 
pattern has the same reciprocal dimensions and thus the same 
number of reflections, but approximates the much simpler sce¬ 
nario of the “difference crystal.” (Figure © 2011 from Biomo- 
lecular Crystallography by Bernhard Rupp. Reproduced by 
permission of Garland Science/Taylor & Francis LLC) 


In almost all cases, the initial protein structure 
phases obtained during the substructure phasing stage 
are further improved by various density modification 
techniques providing substantially improved electron 
density maps into which the initial protein stmcture 
model is built (Fig. 3). 

Electron Density Interpretation and 
Model Building 

Once an interpretable electron density map (a 3-D contour 
grid of the electron density, as shown in Figs. 1 and 2) is 
obtained from improved experimental phases, a model of 
the protein stmcture must be built into the electron den¬ 
sity. The model building is carried out using programs 
that graphically display the electron density and allow 
placement and manipulation of protein backbone markers 
and residues. Various electron density fitting and geome¬ 
try refinement tools as well as automated model building 
programs greatly accelerate the process. Automated 
model building programs often provide a quite reasonable 
starting model, which can then be completed and polished 
by hand. Model building constitutes the most intense 
involvement in the protein structure, and with poor or 
low-resolution electron density, a large degree of chemi¬ 
cal intuition and experience are required. 


Refinement and Model Correction 

Despite most careful model building, the initial struc¬ 
ture model will, in addition to missing certain parts, 
contain many small errors such as incorrect bond dis¬ 
tances and angles, poor backbone geometry, or improb¬ 
able torsion angles. These small errors are corrected 
during the course of model refinement (Tronmd 2004). 
The reciprocal space rehnement program adjusts gen¬ 
eral scaling parameters to match observed and calcu¬ 
lated diffraction data overall, and then refines the atomic 
position coordinates of the initial model so that the 
differences between observed and calculated diffraction 
data are minimized - hence the name reciprocal space 
rehnement. With improving model quality, individual 
atomic B-factors (a general measure for positional dis¬ 
placement) are rehned. Bulk solvent model corrections 
are also applied during the rehnement. The global mea¬ 
sure of the agreement between calculated and observed 
structure factor amplitudes is a linear residual between 
experimentally observed structure factor amplitudes 
and calculated model structure factor amplitudes, the 
crystallographic R-value. 

A general problem in protein structure rehnement is 
the low data to parameter ratio. Such rehnements are 
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generally not stable against experimental data alone. 
The number of measured reflections seldom sufficiently 
exceeds the number of adjustable model parameters to 
allow free refinement, and prior stereochemical knowl¬ 
edge must nearly always be Incorporated into macro- 
molecular refinement in the form of stereochemical 
restraints that keep the structure model within physi¬ 
cally reasonable bounds. Additional restraint terms cre¬ 
ating penalties for deviations from known geometry 
target values are used, and safeguards against overfitting 
are necessary. Overfitting is the introduction and varia¬ 
tion of additional model parameters, which artificially 
improves the fit between observed and calculated data, 
but actually does not improve the structure model. 
Common sources of overfitting are introduction of too 
many water molecules or placing various kinds of sol¬ 
vent molecules (or ligands) into spurious density. 
Crossvalidation against a small subset of unused (free) 
data, monitored by the R(free)-value for the free data 
set, is standard practice. During refinement and rebuild¬ 
ing, the model is thus constantly subjected to an array of 
validation techniques, ranging from basic geometry 
checks to detailed chemical and folding plausibility 
checks based on prior knowledge. To fix the errors, the 
electron density around the questionable residues is 
inspected again, necessary corrections are made, and 
the structure is again refined. This process is repeated 
until all significant errors are corrected and the differ¬ 
ence electron density shows no more offending high- 
level features. Once reasonably polished, the structure is 
ready for detailed analysis, and the structure factor 
amplitudes and the model coordinates are deposited 
with the Protein Data Bank and released to the public. 

Structure Validation and Analysis 

Even after careful model correction and refinement, 
certain errors may persist or are not reported by the 
refinement program’s diagnostics. Validation is gener¬ 
ally necessary before using or analyzing a structure 
model. The quality of a protein structure is a local 
property, and global agreement indicators such as the 
R-values cannot be specific as far as the local quality of 
a structure model is concerned. Local geometry vali¬ 
dation programs evaluate on a per-residue basis the 
local geometry and flag outliers. Extended stretches 
of consistently high deviations in either case are indic¬ 
ative of serious “problem zones” within the model. 


A very powerful method of assessing the local quality 
of a protein structure is the real space correlation of the 
model against a bias minimized electron density map. 

Analysis details depend to a large degree on the pur¬ 
pose of the stmcture study and on which hypothesis the 
stmcture model was intended to test. The stmcture model 
itself is a hypothesis - it must withstand scmtiny against 
both the experimental evidence in the form of electron 
density and the entire pool of established prior knowledge 
and laws of nature (Rupp 2010). Unconventional and 
unusual features always require strong and convincing 
support through clear electron density, often further 
supported by biochemical evidence that may exist in 
addition to the crystallographic stmcture model (Eig. 4). 

Crystallographic Models and 
Coordinate Files 

Protein structure models at the minimum must contain 
the atomic coordinates of each atom of the asymmetric 
unit cell contents, generally proyided in Cartesian world 
coordinates in dimensions of A. In addition, the basic 
crystallographic information in the form of cell param¬ 
eters (unit cell dimensions a, b, c, and angles a, (3, y) and 
space group allows a graphical display program to gen¬ 
erate also all symmetry-related molecules to assemble 
the packing environment of the molecules in the crystal 
structure. The coordinates and additional information 
are deposited in the Protein Data Bank (PDB), which is 
the authoritative public repository and archive for exper¬ 
imentally structures and data files (Berman 2008). The 
PDB archives contain atomic coordinates, bibliographic 
citations, primary and secondary structure information, 
as well as crystallographic structure factors and NMR 
experimental data. In addition to collecting, armotating, 
and curating data files, the PDB provides deposition and 
validation services, cross-links to other databases, and 
also provides a variety of analysis tools and structure 
viewers. The PDB, however, is not responsible for the 
correctness of deposited structures. 

Summary 

Diffraction methods using X-rays, neutrons, and elec¬ 
trons are powerful techniques delivering information 
about macromolecular structure. X-ray crystallogra¬ 
phy is the most important structure determination 
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Macromolecular 
Crystallography: Overview, 
Fig. 4 Key stages in the 
structure determination 
process. The flow diagram 
provides an oveiwiew about 
the major steps in a structure 
determination project. Blue 
shaded boxes indicate 
experimental laboratory work, 
while all steps past data 
collection are conducted in 
silico (Figure © 2011 from 
Biomolecular Crystallography 
by Bernhard 
Rupp. Reproduced by 
permission of Garland 
Science/Taylor & Francis 
LLC) 



technique in structural biology and drug discovery, 
given its powerful capability of delivering atomic- 
level detailed models of protein structures. Primary 
challenges are the preparation of soluble protein and 
its crystallization, which are often nontrivial, partic¬ 
ularly for membrane proteins. Technique and instru¬ 
mentation are sophisticated but mature, with data 
collection frequently automated at synchrotron user 
facilities. Computational methods for the determina¬ 
tion of the structures including phase determination, 
model building, and refinement are well developed. 
Care must be taken to validate the resulting structure 
model locally against electron density, with special 


consideration given to biologically relevant regions 
such as binding sites and ligands. 
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Introduction 

Macromolecular single-crystal neutron diffraction 
techniques allow direct determination of the proton 


and deuteron positions of a macromolecule and its 
solvent at resolutions of ^ 1.5 and 2.5 A, respectively, 
thereby providing complementary information to that 
gained via X-ray ► macromolecular crystallography. 
By locating the positions of the protons and deuterons, 
the protonation states of key amino acid residues, sub¬ 
strates, or inhibitors can be determined, along with the 
positions and orientations of individual water mole¬ 
cules, providing details that can often be crucial toward 
understanding the macromolecule’s specific function 
and behavior (Blakeley 2009; Niimura and Podjarny 
2011 ). 

X-ray ► macromolecular crystallography is the 
method of choice for determination of biological struc¬ 
tures. The availability of intense synchrotron radiation 
sources combined with advances in detection systems 
allows the use of tiny crystals and rapid data collection 
rates. Many tens of thousands of macromolecular 
X-ray structures (>70,000 as of 2012) have been deter¬ 
mined, contributing signihcantly to the comprehension 
of a vast array of ever more complex biological 
systems and processes. The great majority (^98%) of 
these structures, however, provide us with no, or 
very little, information relating to the positions of the 
hydrogen (H) atoms, owing to the fact that, with only 
one electron, hydrogen scatters X-rays very weakly. 
The difficulty in identifying hydrogen atoms using 
X-rays is further exacerbated by their degree of 
thermal motion, such that labile hydrogen atoms 
(B-factors > ^10 A^) are extremely difficult to 
visualize even when ultrahigh resolution X-ray 
data (i.e., better than 1.2 A) are available. The basic 
principles of ► neutron scattering are similar to those 
of ► X-ray scattering in many respects (Sivia 2011); 
however, the key difference is that for neutrons the 
scattering centers are the atomic nuclei, with the total 
scattering by a nucleus being the sum of the coherent 
and incoherent scattering terms. The coherent scatter¬ 
ing gives rise to Bragg reflections, while the incoherent 
scattering does not produce any interference effects but 
rather a uniform background radiation. As the scatter¬ 
ing centers are the atomic nuclei, the coherent scatter¬ 
ing lengths show no correlation to the number of 
electrons but rather depend on the nuclear forces, 
which can even vary between different isotopes of 
the same element. Positive values of coherent scatter¬ 
ing length h reflect a 180° change in phase between the 
incident and the scattered neutron waves, while nega¬ 
tive values of h are associated with a resonance level 
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Macromolecular Neutron Diffraction, Table 1 Neutron coherent scattering lengths and incoherent scattering cross sections, and 
► X-ray scattering lengths, for the common elements of a macromolecule 


Isotope 

Atomic number 

Neutron incoherent cross 
section (barns, 

1 bam = 10 “^^' cm“^) 

Neutron coherent 
scattering length, h 
(lO-'^cm) 

X-ray scattering lengths 
( 10 ^'^ cm) 

sin 0 = 0 sin 0/k = 0.5 A“’ 

>H 

1 

80.27 

-0.374 

0.28 

0.02 

"H (D) 

1 

2.05 

0.667 

0.28 

0.02 

‘^C 

6 

0.00 

0.665 

1.69 

0.48 

14n 

7 

0.50 

0.937 

1.97 

0.53 

160 

8 

0.00 

0.580 

2.25 

0.62 

""Mg 

12 

0.00 

0.549 

3.38 

1.35 


16 

0.00 

0.280 

4.50 

1.90 


19 

0.25 

0.379 

5.30 

2.20 


25 

0.40 

-0.375 

7.00 

3.10 


26 

0.00 

1.010 

7.30 

3.30 


that produces an extra 180° phase shift. Hydrogen, and 
its isotope deuterium (D), can be more readily located 
using neutron diffraction techniques because their 
respective coherent scattering lengths are comparable 
to those of the other atoms of a macromolecule 
(Table 1). This “high visibility” of protons and deu- 
terons allows their positions to be determined at reso¬ 
lutions of ^1.5 and 2.5 A, respectively. Unfortunately, 
as hydrogen has a very large incoherent scattering 
cross-section in neutron diffraction, the study of mac¬ 
romolecules to high resolution is hampered by the 
extremely large incoherent signal that originates from 
the considerable hydrogen content (^50% of the atoms 
of a macromolecule are hydrogen) of the crystals. In 
practice, therefore, it is advantageous to exchange the 
hydrogen atoms in the crystal for deuterium atoms, as 
deuterium has a much lower incoherent scattering 
cross section. ► H/D exchange methods have tradition¬ 
ally involved growing crystals from, or soaking the 
crystals in, D 2 O solutions. This has the effect of 
exchanging solvent-accessible groups (e.g., water, 
amino, and hydroxyl group hydrogen atoms), 
amounting to ^15-20% of the hydrogen content of 
the macromolecule. 

Macromolecular Neutron Diffraction 
Applications 

A necessary step in the study of an enzyme mechanism 
is to determine the protonation states of key active site 
amino acid residues such as histidine (Fig. 1), arginine. 


aspartic acid, glutamic acid (Fig. 2), and lysine, and if 
present, the substrate, or inhibitor. These details cannot 
be reliably determined by consideration of pAl^ values 
alone, as it is well established that pA'^ values in the 
environment of an enzyme active site can differ signif¬ 
icantly from that in solution. Using neutron diffraction 
data it is possible to directly determine the protonation 
states, providing evidence that can help strengthen, or 
rule out, a proposed catalytic mechanism (Blakeley 
et al. 2008; Kovalevsky et al. 2010a; Tomanicek 
et al. 2011). Furthermore, knowledge of the positions 
of the protons and/or deuterons allows us to visualize 
the hydrogen bonding interactions that occur in the 
active site. In a similar way, neutron diffraction data 
can be used to elucidate the hydrogen bonding inter¬ 
actions that occur for ligand binding, or in protein- 
nucleic acid or protein-protein complexes. 

Neutron diffraction is particularly useful for the 
study of solvent structure (Blakeley et al. 2004; 
Niimura 2006; Howard et al. 2011). It is well known 
that water molecules can play diverse and important 
roles for the efficient functioning of a macromolecule, 
and yet, using X-rays only the electron density for the 
oxygen atoms of water molecules is typically 
observed. With neutrons, all atomic nuclei of 
a deuterated water molecule (i.e., D 2 O) scatter with 
approximately equal strength and as such allow deter¬ 
mination of the deuterated water molecule positions 
and orientations (Fig. 3). These details provide us with 
distance and angle information relating to hydrogen 
bonding, allowing us to visualize the networks formed. 
Moreover, it is possible to further distinguish D 2 O 
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Macromolecular Neutron Diffraction, Fig. 1 Histidine and Ne2. Nuclear scattering density maps generated from Fcaics 

shown in (a) the singly protonated form at N51, (b) the singly to 1.9 A resolution and contoured at 1.8 rms 

protonated form at Ne2, (c) the doubly protonated at both N51 



Macromolecular Neutron Diffraction, Fig. 2 Glutamic acid/ 
glutamate shown in (a) the protonated state, i.e., glutamic acid, 
(b) the deprotonated (charged) state, i.e., glutamate. Nuclear 
scattering density maps generated from Fcaics to 1.9 A resolution 
and contoured at 1.8 rms 


molecules from deuteroxide ions (OD~) and tri- 
deuterium ions (D 3 O- 1 -), which can be important in 
deciphering the catalytic mechanism of certain 
enzymes (Kovalevsky et al. 2011). 

Neutron diffraction data allow us to easily deter¬ 
mine the correct orientation of the carboxamide group 
of asparagine (Fig. 4) and glutamine residues, as the 
combined neutron coherent scattering signal from an 
ND 2 group ((tND 2 = 0.937 - 1 - (2 x 0.667)) is signifi¬ 
cantly larger than that from a single O atom 
{ho — 0.580). In addition, neutron diffraction data 
allow us to determine the orientations of the hydroxyl 
groups of tyrosine (Fig. 5), serine and threonine 
residues, and methyl groups of alanine, leucine 
(Fig. 6 ), isoleucine, valine, threonine, and methionine 
residues. These details provide us not only with 
structural information, but also an insight into the 
preferred orientations of these rotors and hence 
information relating to protein dynamics (McDowell 
and Kossiakoff 1996; Ostermann and Parak 2005). 
Moreover, by exploiting the fact that hydrogen and 
deuterium have different scattering magnitudes and 
phases, neutron diffraction provides us with a tool for 


identifying isotopically labeled structural features and 
determining the pattern and extent of H/D-isotope 
substitution, providing information on the solvent 
accessibility of individual amino acid residues 
(Fig. 7), on the mobility and flexibility of interesting 
domains and on the ► H/D exchange dynamics them¬ 
selves (Niimura et al. 2005; Ostermann and Parak 
2005; Niimura and Podjamy 2011). 

Finally, an important difference between X-ray and 
neutrons is that while X-rays are ionizing radiation 
and cause damage to the crystals, mostly through 
the generation of free radicals, neutrons are a non¬ 
destructive probe. X-ray radiation damage can lead to 
significant changes in the structure and so diffraction 
data are routinely collected at 100 K, often 
using several cryo-protected crystals. In contrast, 
neutron diffraction data are generally collected at 
room temperature, avoiding difficulties associated 
with cryo-cooling (e.g., increased mosaicity) and 
allowing determination of macromolecular structures 
close to physiological temperatures. Nevertheless, 
neutron diffraction data collection at cryo-tempera- 
tures (Blakeley et al. 2004; Meilleur et al. 2005) 
remains of interest as reduced thermal motion should 
lead to lower overall B-factors and perhaps better 
visualization of the more mobile parts of the system, 
as was shown by the 15K neutron diffraction study of 
concanavalin A (Blakeley et al. 2004) for which twice 
as many D 2 O water molecules were observed in the 
15K structure than for the room temperature structure. 
Furthermore, the ability to collect neutron diffraction 
data at cryo-temperatures permits the possibility to 
cryo-trap intermediate states during an enzymatic reac¬ 
tion, thereby providing details of their protonation 
states, which can help decipher the pathway for 
catalysis. 
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Macromolecular Neutron 
Diffraction, Fig. 3 Water 
molecules (as D 2 O) from 
perdeuterated type-III 
antifreeze protein (PDB code 
3QF6, Howard et al. 2011). 
Nuclear scattering density 
maps generated from F^aics to 
1.9 A resolution and contoured 
at 2.0 rms 






Macromolecular Neutron Diffraction, Fig. 4 The orienta¬ 
tion of an asparagine residue side chain is easily determined 
using neutron diffraction data. Nuclear scattering density 
maps generated from F^aics to 1.9 A resolution and contoured 
at 1.8 rms 

The Development of Macromolecular 
Neutron Diffraction 

Despite the wealth of information that can be gained 
from neutron diffraction studies, the number of 
macromolecular neutron structures determined is 
very small (less than 60 as of 2012). There are several 
reasons for this. Firstly, neutrons are much more 
difficult to produce than X-rays, in terms of cost and 
particle flux. Indeed, there are presently only a handful 
of locations in the world at which macromolecular 



Macromolecular Neutron Diffraction, Fig. 5 A tyrosine res¬ 
idue showing the orientation of the hydroxyl group (as 0-D). 
Nuclear scattering density maps generated from F^aics to 1.9 A 
resolution and contoured at 1.8 rms 

neutron diffraction studies can be performed (Niimura 
and Podjarny 2011). Moreover, the fluxes from these 
neutron facilities are many orders of magnitude less 
than the corresponding fluxes of X-ray sources, a fact 
that forces neutron diffraction studies to take more 
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Macromolecular Neutron Diffraction, Fig. 6 A leucine res¬ 
idue showing the orientation of the methyl groups (as CD 3 ). 
Nuclear scattering density maps generated from Fcaios to 1.9 A 
resolution and contoured at 1.8 rms 

time and require much larger crystal volumes. Over the 
years, however, improvements have been made in all 
aspects of the technique - from the neutron source to 
the sample itself, and in data analysis thereafter. In 
particular, improvements in sample preparation, 
neutron instrumentation and detection systems, and 
data collection methods have dramatically reduced 
the crystal volumes and data collection times required, 
and in doing so have permitted the study of ever larger, 
more complex macromolecular systems (Blakeley 
2009). 

Sample Preparation: Growth of Large Single 
Crystals and Perdeuteration 

The major bottleneck for macromolecular neutron 
diffraction studies is the need for large crystal 
volumes - the actual crystal volume required 
depending on the sample unit-cell volume. Clearly 
growth of large crystals is a challenge. Fortunately, 
for all neutron diffraction studies, the macromolecule 
has already been crystallized and the X-ray structure 
determined, such that the initial crystallization 


Macromolecular Neutron Diffraction, Fig. 7 Two adjacent 
backbone amide groups from a D 20 -exchanged protein single¬ 
crystal showing the effect of ► H/D Exchange. In the first 
amide group (to the left of the image), the hydrogen atom has 
remained un-exchanged and is clearly visible as a negative 
nuclear scattering density peak (in red), while in the second 
amide group (to the right of the image), the hydrogen atom has 
been exchanged for deuterium and so is clearly visible as a 
positive nuclear scattering density peak (in blue). Nuclear scat¬ 
tering density maps generated from Folios to 1.5 A resolution, 
positive nuclear scattering density map contoured to 1.5 rms, and 
negative nuclear scattering density map contoured to —3.5 rms 

conditions can be used as a guide for growing 
the larger crystals needed with neutrons. Methods 
for growing large crystals are explained in detail in 
Neutron Protein Crystallography (Niimura and 
Podjarny 2011), but briefly the key is to use knowl¬ 
edge of the crystallization phase diagram (CPD) of the 
macromolecule. A CPD consists of three regions: 
(1) the undersaturated region, a solution phase in 
which crystals dissolve; (2) the metastable region, in 
which nucleation of new crystals does not occur but 
macromolecules crystallize on the seed crystal; and 
(3) the spontaneous nucleation region, in which both 
crystal growth and nucleation occur. Determination of 
the CPD thus provides the location and boundaries of 
these regions, such that, by placing a seed crystal 
within the metastable region and maintaining it 
there via adjustment of various parameters of the 
crystallization (e.g., protein concentration), large 
crystals can be grown. 

In order to reduce the extremely large incoherent 
scattering signal that originates from the many hydro¬ 
gen atoms of a macromolecule, it has been common¬ 
place to grow crystals from, or soak the crystals in, 
D 2 O solutions. However, by this method those 













1358 


Macromolecular Neutron Diffraction 


hydrogen atoms attached to carbon atoms (^80-85% 
of the hydrogen content of the macromolecule) remain 
un-exchanged, and as such, a substantial incoherent 
signal is still observed. To substitute all of the hydro¬ 
gen atoms within a macromolecule for deuterium 
atoms, i.e., perdeuteration, requires gene expression 
in a deuterated growth media (Shu et al. 1996; 
Blakeley 2009). For perdeuterated crystals, the inco¬ 
herent scattering signal is significantly reduced (up to 
40-fold), while the coherent scattering signal is also 
increased, such that the signal-to-noise ratio of the data 
is vastly improved. The availability of perdeuterated 
crystals has been of major importance for macromo¬ 
lecular neutron diffraction, as it permits the use of 
smaller crystals by at least an order of magnitude 
(Blakeley et al. 2008; Howard et al. 2011) relative to 
protiated crystals (grown or soaked in D 2 O). 
Perdeuteration offers several other advantages for neu¬ 
tron data collection, such as the ability to collect 
higher-resolution data, use shorter data collection 
times, or study larger, more complex systems. Further¬ 
more, perdeuteration helps reduce the number of spa¬ 
tially overlapped reflections (on a hxed-radius 
detector) and aids the efficient cryo-cooling of the 
crystal, due to the larger surface-area-to-volume ratio 
(Blakeley et al. 2010). Perdeuteration also simplifies 
the structure refinement of the neutron data, as cancel¬ 
ation of nuclear scattering density between protons and 
their neighboring atoms is avoided. Moreover, the 
problem of partial occupancies of hydrogen and deu¬ 
terium, which can occur at certain sites when using 
D20-exchanged protiated crystals, is removed. In 
2003, the Institut Laue-Langevin (ILL) created the 
first “Deuteration Laboratory” (http://www.ill.eu/ 
sites/deuteration/) aimed at maximizing the effective¬ 
ness of biological ► neutron scattering work. Since 
then, other neutron facilities have created laboratories 
for the partial or complete deuteration of macromole¬ 
cules, such as the Bio-Deuteration Laboratory at Oak 
Ridge National Laboratory (ORNL) and the Biological 
Deuteration Laboratory at the Los Alamos Neutron 
Science Center (LANSCE). 

Key Developments at Reactor Neutron Sources 

For many years macromolecular neutron diffraction 
studies were restricted to just a handful of cases for 
which (1) very large crystal volumes (i.e., several 
mm^) could be grown and (2) possess relatively small 
unit-cell volumes (e.g., hen egg-white ► lysozyme). 


This was not only due to the inherent low fluxes of 
neutron sources, but also to the lack of optimized 
instrumentation and data collection methods for such 
studies. Since the 1990s, however, great progress has 
been made in developing instrumentation that is spe¬ 
cifically designed for neutron macromolecular crystal¬ 
lography and in utilizing data collection methods that 
aim to maximize the flux on the sample. At reactor 
neutron sources, the development of quasi-Laue data 
collection methods, in which a narrow wavelength 
band pass (e.g., §L/L = 25%) is extracted from the 
original broadband spectrum of wavelengths (a white 
beam), provides large gains in flux relative to mono¬ 
chromatic methods, while reducing background scat¬ 
tering and reflection overlap compared to the use of the 
full white beam. Quasi-Laue methods thus permit the 
use of much smaller crystal volumes than if monochro¬ 
matic data collection methods are used. In addition, the 
use of cylindrical neutron-sensitive image-plate (NIP) 
detectors that completely surround the sample provide 
much greater coverage of reciprocal space (>27t sr), so 
that the large number of Bragg reflections produced 
can be measured simultaneously, thereby reducing the 
data collection times necessary (Cipriani et al. 1996; 
Meilleur et al. 2005; Blakeley 2009). These develop¬ 
ments have been incorporated into the design of new 
instrumentation, exemplified by the quasi-Laue dif¬ 
fractometer LADI-III (Blakeley et al. 2010), located 
at the ILL. LADI-III includes a cylindrical NIP detector 
with data collected using quasi-Laue methods (typically 
Grange ~ 3^ A) via the use of multilayer band-pass 
filters. Various filters available allow one to optimize 
the data collection strategy by selecting the wavelength 
range and wavelength that are best suited to the sample 
(in terms of the unit-cell volume, crystal volume etc.). 
LADI-III is therefore able to rapidly collect diffraction 
data (^3-14 days) to resolutions of 1.4-2.5 A from 
macromolecules covering a wide range of complexity 
(unit-cell volumes ^50,000-1,500,000 A^). Moreover, 
LADI-III is well suited to harness the advantages that 
perdeuteration offers for data collection, so that rela¬ 
tively small crystal volumes of ^0.1-0.2 mm^ can be 
used (Blakeley et al. 2008; Howard et al. 201 1). 

Key Developments at Spallation Neutron Sources 

Until fairly recently, all macromolecular neutron 
diffraction studies had been performed at reactor 
neutron sources. Although the advances at reactor 
sources have been impressive, there is little prospect 
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Macromolecular Neutron Diffraction, Tabie 

neutron sources 

2 Neutron diffractometers currently 

in operation 

at reactor 

and spallation 

Diffractometer name (neutron 
source, power reactor or power 
spallation source target, country) 

Detector type 

Data collection 
method 

Typical crystal 
volume (mm^) 

Cell edges 
(A) 

Days to 
collect 

Maximum 
resolution (A) 

BIX-3 

(JRR-3M, 20 MW, Japan) 

Cylindrical 

NIP 

Monochromatic 

(L = 2.9 A) 

>1 

<100 

>20 

1.4 

BIX-4 

(JRR-3M, 20 MW, Japan) 

Cylindrical 

NIP 

Monochromatic 
(L = 2.6 A) 

>1 

<100 

>15 

1.4 

LADI-III 

(ILL, 58 MW, France) 

Cylindrical 

NIP 

Quasi-Laue 
(typically L,ange = 

3^ A) (6L/L options 
from 5% to 25%) 

>0.1 

(perdeuterated) 
>0.5 (DgO- 
exchanged) 

<160 

>3 

1.4 

D19 

(ILL, 58 MW, France) 

Area PSD 

Monochromatic 
(L choice from 0.8 to 
2.4 A) 

>1 

<100 

>7 

1 

PCS 

(LANSCE, 0.1 MW W target, 
USA) 

Time- 

sensitive area 
PSD 

Pulsed (20 Hz) white 
TOF Laue 

(W = 0-6-7 A) 

>0.3 

<180 

>4 

1 


Macromolecular Neutron Diffraction, Table 3 Neutron diffractometers currently being built, or under commissioning, at reactor 
and spallation neutron sources 


Diffractometer name 
(neutron source, power 
reactor or power spallation 
source target, country) 

Detector type 

Data collection method 

Estimated 
typical crystal 
volume (mm^) 

Estimated 
cell edges 

(A) 

Estimate 
of days 
to collect 

Estimated 
maximum 
resolution (A) 

BioDiff (FRM-II, 20 MW, 
Germany) 

Cylindrical NIP 
(plus an 
additional 
CCD-camera) 

Monochromatic (L choice 
from 2.4 to 5.6 A) 

>1 

<120 

14 

1.4 

Bio-C (HANARO, 30 MW, 
S. Korea) 

Cylindrical NIP 

Monochromatic 

>1 

<100 

14 

1.4 

IMAGINE (HFIR, 84 MW, 
USA) 

Cylindrical NIP 

Quasi-Laue (AL = 1 A, 
Grange Options; 2-3, 3^, 
3.5^.5 A) 

>0.1 

(perdeuterated) 
>0.5 (DgO- 
exchanged) 

<160 

>3 

1.2 

TOPAZ (SNS, 2 MW Hg 
target, USA) 

14 time- 
sensitive Anger 
cameras 

Pulsed (60 Hz) white TOF 
Laue (7,,„j,g0 — 0.5 

3.85 A) with polarized 
neutron option 

>1 

<50 

>1 

1 

MaNDi (SNS, 2 MW Hg 
target, USA) 

Several time- 
sensitive Anger 

Pulsed (60 Hz) white TOF 
Laue 

>0.1 

<150 

1-7 

1.5 


cameras 

Adjustable wavelength 
range AL ~ 2.16 A 

>1 

<300 

1-7 

2.5 

iBIX (J-PARC, I MW Hg 
target, Japan) 

14 time- 
sensitive 
wavelength- 
shifting fiber 
detectors 

Pulsed (25 Hz) white TOF 
Laue (L,ange = 0.7-7.8 A) 
Adjustable wavelength 
range AX ~ 4.0 A 

>0.1 

<150 

>3 

1.2 


of any large increase in the flux of reactor neutron 
sources because of inherent limitations imposed by 
the fission process. However, a spallation neutron 
source offers fewer flux limitations, as neutrons are 


produced by bombarding a metal target with pulses 
of high-energy protons. Neutrons produced by proton 
pulses are “time-stamped” such that by recording time- 
of-flight (TOF) information (Schoenbom and Pitcher 
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1996; Wilson 2000; Niimura and Podjarny 2011), the 
corresponding energy and wavelength of each neutron 
can be calculated. TOP techniques in combination with 
large position-sensitive detectors (PSDs) allow wave- 
length-resolved Laue patterns to be collected using all 
the available neutrons. The TOP Laue method there¬ 
fore has all of the advantages of quasi-Laue methods 
employed at reactor sources, but does not suffer in the 
same way from reflection overlap and a buildup of 
background scattering over the wavelength range. In 
2002 , the first diffractometer dedicated to macromo¬ 
lecular neutron diffraction studies at a spallation 
source was built at LANSCE. The instrument PCS 
(Kovalevsky et al. 2010b) uses pulsed neutrons in the 
wavelength range 0.6-7.0 A, with data recorded as 3D 
TOP Laue patterns on a large PSD. Due to the high 
peak flux obtained from the TOP Laue method, crys¬ 
tals as small as 0.3 mm^ have been used to collect 
diffraction data to medium resolution (1.5-2.5 A). In 
the last few years, two new spallation neutron sources 
have been constructed; the Spallation Neutron Source 
(SNS) at ORNL and the Japan Proton Accelerator 
Research Complex (J-PARC) of the Japanese Atomic 
Energy Agency (JAEA). SNS and J-PARC both have 
higher peak fluxes than at LANSCE, and so the new 
TOE Laue diffractometers currently being commis¬ 
sioned should provide even greater reductions in 
sample volume requirements and data collection 
times, extending yet further the range and complexity 
of systems that can be studied (Niimura and Podjarny 
2011). See Table 2 for diffractometers currently in 
operation at both reactor and spallation neutron 
sources, and Table 3 for those diffractometers 
currently being built or commissioned. 

Summary 

Neutron diffraction techniques allow direct determina¬ 
tion of the proton and deuteron positions of 
a macromolecule and its solvent, providing informa¬ 
tion on the protonation states of amino acid residues, 
substrates, or inhibitors, along with the positions and 
orientations of individual water molecules. Recent 
advances for neutron macromolecular crystallography 
have led to an expanding field addressing larger, more 
complex problems. In particular, improvements in 
neutron instrumentation, data collection methods, and 
sample preparation have dramatically lowered the 


sample volume requirements to around 0.1-0.2 mm^ 
and data collection times to just a few days. This trend 
toward the study of larger systems using smaller 
samples is set to continue with new instrumentation 
coming online at reactor and spallation neutron sources 
and upgrades being made to existing instrumentation. 

Cross-References 

► H/D Exchange 

► Lysozyme - Computational Studies 

► Macromolecular Crystallography: Overview 

► Neutron Scattering of Membranes 

► X-Ray Scattering of Lipid Membranes 
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Synonyms 
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Introduction 

Solid-state ► Nuclear Magnetic Resonance (NMR) 
spectroscopy is an emerging technique in structural 
biology that allows to study protein structures with 
atomic resolution that are inaccessible by other methods 
such as liquid-state NMR or X-ray crystallography. 
Indeed, solid-state NMR does not require proteins to 
be soluble or crystallized. Examples of such proteins are 
protein fibers (e.g., amyloids, silk, and keratin), micro¬ 
crystalline proteins, protein aggregates, proteins in fro¬ 
zen solution, and membrane proteins inside a bilayer. 

An important technique that is routinely used to 
improve the resolution in solid-state NMR is Magic 
Angle Spinning (MAS). MAS is a fast sample rotation 
that averages orientation dependent (anisotropic) spin 
interactions, which lead to line broadening in the static 
case (Fig. 1). However, these anisotropic interactions 
contain important structural information and can, 
therefore, be reintroduced with radio-frequency (rf) 
pulse sequences under MAS. Proton spectroscopy on 
fully protonated systems is difficult even under MAS 
but can be facilitated with proton decoupling 
techniques and deuteration. 

In the following, 1 will discuss the most important 
classes of pulse sequences that are used to investigate 
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proteins under MAS. I will give important applications 
of these experiments, show how they can be used to 
measure proteins dynamics, and how they can be 
combined to determine protein structures in the solid 
state. 

Obtaining High Spectral Resolution 

In liquid-state NMR, the molecule under investigation 
generally tumbles isotropically with a correlation time 
that averages all anisotropic spin interactions. This 
averaging leads to greatly simplified NMR spectra, 
that, in the case of a spin Vi, only reflect the isotropic 
chemical shift i.e., the change of the local mag¬ 
netic field due to the electronic environment), and the 
isotropic J-coupling (Hj, i.e., the spin-spin coupling 
mediated through the electrons of a chemical bond). 
See also ► Solution NMR. In static solid-state NMR 
and in the absence of any motional averaging, the 
spectra reflect all anisotropic interactions, in addition 
to the liquid-state NMR interactions described above. 
For a spin Vi these include the chemical shift anisot¬ 
ropy (Hcsa, i-E-, the anisotropic part of the local mag¬ 
netic field changes) and the dipolar spin-spin 
interactions (Ho)- Spins larger than Vi (e.g., deuterium) 
also have a quadrupolar coupling {Hq). However, the 
following discussion will mostly focus on spins Vi. 
These anisotropic NMR interactions lead to a line 
shift (CSA) and line splittings (dipolar coupling, 
quadrupolar coupling) that depend on the orientation 
of the molecule with respect to the external magnetic 
field. In the common case of a powder, in which all 
molecular orientations are present at the same time, 
CSA and dipolar coupling lead to massive line broad¬ 
ening as illustrated in the static powder spectrum 
shown in Fig. lb. Although CSA and dipolar couplings 
contain important chemical and structural information, 
they lead to spectra that are difficult to interpret, 
especially if the number of atoms increases and the 
resulting spectral overlap makes it impossible to iden¬ 
tify individual resonances. 

MAS addresses this problem by rotating the sample 
around an axis tilted by 54.7° (the Magic Angle) with 
respect to the static magnetic field (Fig. la). This 
sample rotation can lead from partial to complete 
averaging of the anisotropic interaction depending on 
sample rotation frequency and strength of the 
corresponding interaction. Figure lb, c illustrates the 


effect of MAS on the *^N spectrum of glycine. Low 
spinning frequencies lead to the appearance of 
spinning sidebands that roughly follow the shape of 
the static powder spectrum and are spaced by the 
spinning frequency (Fig. lb). Higher MAS frequencies 
that exceed the static linewidth and, therefore, strength 
of the interaction result in an almost complete averag¬ 
ing of dipolar couplings and CSA (Fig. Ic). High MAS 
frequencies can lead to spectra that are of similar 
quality as in liquid-state NMR. For a more rigorous 
treatment of the theory of MAS, please refer to, for 
example, Mehring 1983. The current MAS technology 
with rotation frequencies of up to 80 kHz is very 
effective for averaging the (homonuclear) dipolar cou¬ 
plings and the CSA of most spin Vi nuclei found in 
biological samples such as carbon (^^C), nitrogen 
('^N), and phosphorus (^'P). For other nuclei, espe¬ 
cially protons, which have very strong homonuclear 
and heteronuclear dipolar couplings, MAS is often not 
sufficient to obtain high-resolution spectra. 

For routine solid-state NMR investigations of 
proteins, MAS removes all anisotropic interactions 
besides the dipolar couplings involving protons. The 
heteronuclear proton dipolar couplings can be 
removed by rf pulses on the protons in addition to 
MAS (see red spectrum in Fig. Ic). This technique 
called (heteronuclear) proton decoupling leads to 
high-resolution spectra of nuclei such as *^C and 
*^N (Ernst 2003). Decoupling protons while 
detecting protons is possible, but technically more 
demanding and does not lead to linewidths that are 
comparable to those routinely observed in liquid- 
state NMR. 

Another way to remove proton dipolar couplings is 
to replace protons with deuterium. Protein samples 
with low proton content allow the application of 
NMR experiments very similar to those used in 
liquid-state NMR under MAS without the need of 
additional proton decoupling sequences (Reif 2012). 

A final obstacle to achieving high-resolution spec¬ 
tra lies not in NMR interactions but in the sample 
itself. High resolution is only archived in samples 
of high structural homogeneity, i.e., in samples in 
which all molecules adopt the same conformation. 
Structural heterogeneity (i.e., static or dynamic vari¬ 
ability of the sample) often leads to severe broadening 
of the resonances and careful sample preparation is 
necessary to obtain optimal resolution in solid-state 
NMR. 
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Magic Angle Spinning Solid-State NMR on Proteins, 
Fig. 1 (a) Schematic of a MAS rotor {red) inside a MAS stator 
(Artwork: Courtesy of Dr. Rene Verel). (b) Spectra of an isolated 
spin with CSA. Low MAS frequencies lead to spinning 
sidebands. At high MAS frequencies only one narrow line 
remains, (c) *^N spectrum of uniformly labeled glycine. 

The static spectrum is severely broadened due to CSA and 


dipolar couplings to carbons and protons. The line gets 
narrower and the spinning sidebands disappear with higher 
MAS frequencies. However, maximum resolution at 10 kHz 
MAS is only achieved with additional proton decoupling {red). 
Spectra were simulated using the program SPINEVOLUTION 
(Veshtort and Griffin 2006) 


Resonance Assignment 

MAS in combination with proton decoupling or partial 
deuteration leads to high-resolution solid-state NMR 
spectra. The current state of the art makes it possible to 
assign individual resonances of proteins of the order of 
200 amino acids (Shi et al. 201 1). But having achieved 
atomic resolution, how can site-specific structural 
information be extracted? 

An essential first step of most studies of proteins by 
solid-state NMR is, as it is the case in liquid-state 
NMR, the ► resonance assignment (see also ► Protein 
NMR - Introduction). This assignment focuses mainly 
on *^N and unless protons are highly diluted. Even 
a relatively small protein such as ubiquitin has about 
378 resonance lines in the carbon spectrum. The reso¬ 
nance assignment links every carbon resonance fre¬ 
quency observed in the spectra to the corresponding 
carbon atom in the primary sequence of the protein. 

Resonance assignments of proteins are usually done 
with a set of two- to four-dimensional correlation 
experiments. These experiments are composed of 
building blocks that transfer magnetization between 
spins (e.g., from *^N to *^C) and building blocks that 
detect magnetization. The resulting spectra contain 
cross-peaks that indicate the proximity of two atoms 
(see ► Multidimensional NMR Spectroscopy and 
► Triple Resonance NMR). The smart combination 
of this proximity information with the prior knowledge 


of the protein’s primary sequence, and educated 
guesses (based on their resonance frequency) to what 
amino acid types these resonances belong, makes it 
possible to solve the resonance assignment puzzle. In 
contrast to liquid-state NMR, solid-state NMR reso¬ 
nance assignments are often done with experiments 
that transfer magnetization by reintroducing dipolar 
couplings (for a recent review about dipolar recoupling 
sequences see Nielsen et al. 2012). There is a large 
variety of recoupling sequences for homonuclear and 
heteronuclear dipolar couplings. Besides using dipolar 
couplings, magnetization transfer can also be accom¬ 
plished via J-couplings similar to liquid-state NMR. 
An example of an assigned 2D spectrum of an 

amyloid hbril is shown in Fig. 2a. 

Structural Information 

The second step in structural studies with solid-state 
NMR is to measure structural information such as 
distances and dihedral angles, which can be used as 
conformational restraints in subsequent structure cal¬ 
culations. Already the chemical shift assignment itself 
contains valuable structural and dynamical informa¬ 
tion. The extent of assignment can be used to distin¬ 
guish static from very dynamic protein domains. 
Furthermore, the chemical shifts of some nuclei, espe¬ 
cially of C’, Ca, and C(3 are highly dependent on the 
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Magic Angle Spinning 
Solid-State NMR on 
Proteins, Fig. 2 (a) 2D 

DREAM correlation 
spectrum of the HET-s prion 
protein fragment 218-289 in 
its infective amyloid 
conformation recorded at 
a proton frequency of 
600 MHz. (b) 2D DARR 

correlation spectrum 
of a seven-helical membrane 
photoreceptor, sensory 
rhodopsin from Anabaena sp. 
PCC 7120 recorded at a proton 
frequency of 800 MHz. These 
spectra illustrate the degree of 
resolution that can be achieved 
with modem solid-state NMR 
technology to obtain specific 
resonance assignments of 
proteins that could not be 
studied otherwise with atomic 
resolution, ((b) reproduced 
with permission of PNAS 
from Shi et al. (201 1)) 
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backbone dihedral angles and can be used to predict 
secondary structure elements (see ► Chemical Shift 
Index). One example is the amyloid fibril formed by 
the HET-s prion protein. Here, the solid-state assign¬ 
ment alone helped identify the static core of the amyloid 


fibril and showed that this core was in a predominantly 
P-sheet conformation (Ritter et al. 2005). 

The chemical shift assignment can also be used to 
directly predict the dihedral angles of the protein 
backbone. Such dihedral angles restraints, which are 


[uidd] Oe^9 [uidd] q. 
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Magic Angle Spinning Solid-State NMR on Proteins, 
Fig. 3 (a) Structure of the HET-s(218-289) amyloid fibril 
(PDB code 2RNM). Three monomers inside the fibril are 
shown as ribbon diagram. The individual monomers are depicted 
in red, green, and blue. This solid-state NMR structure revealed 
that HET-s(218-289) forms a P-solenoid in its infective amyloid 
conformation, (b) GBl structure calculated by combining pre¬ 
cise distances with accurate dihedral angle information from 
CSA measurements. The 10 lowest energy structures are 

important for structure calculation, can be determined 
with higher precision when measuring not only the 
chemical shift, but also the corresponding CSA. 
Consequently, there are multidimensional NMR exper¬ 
iments that precisely measure site-specific CSAs (Wylie 
and Rienstra 2008). There are also other types of exper¬ 
iments that measure dihedral angles, for example, hy 
correlating CSA or dipolar tensors (Ladizhansky 2010). 

Distance information is the foundation of structure 
determination in NMR. The dipolar coupling Ho 
between two spins is proportional to 1/r^ and can, 
therefore, be used as a measure for the distance 
between pairs of spins (see also ► Interspin Distance 
Determination by EPR). In liquid-state NMR, dipolar 
couplings are usually averaged due to the molecular 
tumbling and distances have to be measured via 
dipole-dipole cross-relaxation (NOE’s, see ► Nuclear 
Overhauser Effect). In solid-state NMR under MAS, 
dipolar couplings can be measured directly via specific 
dipolar recoupling sequences. 

There are several dipolar recoupling techniques that 
work best on isolated spin systems, for instance, on 
samples in which only two spins are isotopically 
labeled. Although a new sample has to be made for 
every distance, these distances can he measured with 
very high precision. A popular pulse sequence for 
selectively labeled samples is REDOR, which mea¬ 
sures the distances between two different nuclei 
(heteronuclear distance, e.g., between ^^N and *^C). 
REDOR is simple, robust, and has been used to study 
many biological systems such as the different amyloid 


presented in blue with 2QMT crystal structure represented in 
red. (c) The lowest 10 energy structures are presented in CPK 
to illustrate the overall heavy atom order. The all heavy 
atom rmsd is 0.72 A, the backbone rmsd 0.16 A, and the 
agreement with the crystal structure is 0.51 A, showing that 
solid-state NMR provides the means to calculate structures 
of a quality comparable to those from X-ray diffraction 
((b) and (c) reproduced with permission of PNAS from Wylie 
et al. 201 1) 

P-sheet structures adopted by the ccP peptide (Verel 
et al. 2008). A popular sequence to measure homonu- 
clear distances is rotational resonance (RR). RR has 
for example been used to determine the structure of 
acetylcholine inside the membrane-imbedded acetyl¬ 
choline receptor (Williamson et al. 2007). 

RR and REDOR are limited to selectively labeled or 
very small molecules and other techniques have to be 
used to measure distances in uniformly labeled pro¬ 
teins. 3D TEDOR is a technique that measures precise 
*^N-*^C distances similar to REDOR, but is applicable 
to uniformly labeled proteins. TEDOR works best on 
well-resolved spectra and has, for instance, been used 
to determine the structure of the TTR amyloid 
(Jaroniec et al. 2004). 

A problem that occurs when measuring homonu- 
clear dipolar couplings in fully labeled proteins is 
that weak dipolar couplings corresponding to long 
distances are suppressed in the presence of strong 
dipolar couplings from directly bonded nuclei. This 
effect, termed dipolar truncation, is the reason why 
many homonuclear dipolar recoupling sequences 
such as DREAM, RFDR, or R and C sequences are 
not suitable for structure determination. One solution 
to avoid dipolar truncation is to make protein samples 
that have a reduced number of spin labels (sparse 
labeling). Sparse labeling can be achieved by growing 
bacteria on carbon sources such as [2-*^C] glycerol 
(LeMaster and Kushlan 1996). 

Another way to avoid dipolar truncation is to use 
second-order recoupling sequences that are less 
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sensitive to this effect (PDSD, DARR, MIRROR, 
PAR, PAIN, etc.). These methods provide distances 
of low precision since the cross-peak intensity and 
buildup also depend on several factors other than the 
distance (see also ► Spin diffusion). Distances that are 
determined that way enter the structure calculation in 
the form of relatively loose distance constraints, for 
example, as lower and upper distance limits. 

Structure Calculation 

The first two steps of structure determination with 
NMR are resonance assignment and measurement of 
conformational restraints. In a third step, a computer 
program calculates the three-dimensional structure 
using distance and dihedral angle information, and 
other known conformational restraints as input. Other 
structural information includes the primary sequence, 
disulphide bonds, the oligomerization state and sym¬ 
metry of the protein, etc. Most programs used for NMR 
structure determination in the solid state were devel¬ 
oped for liquid-state NMR and are based on simulated 
annealing. See also ► NMR-based Structural Proteo- 
mics and ► CYANA. 

An interesting example of a solid-state NMR struc¬ 
ture that could not have been determined otherwise is 
the amyloid fibril formed by the HET-s prion protein 
(see Fig. 3a). In this study, sparsely labeled samples and 
spin-diffusion techniques were used to measure 
and *H-*H distances within a monomer. A mixture of 
either or *^N labeled monomers and CP experiments 
were used to determine the contacts between the mono¬ 
mers. An example in which very precise conformational 
constraints were combined to get a high quality struc¬ 
ture is shown in Fig. 3b and c (Wylie et al. 201 1). In this 
study, 3D TEDOR distances and distances from to 
spin-diffusion experiments were combined with 
dihedral angle constraints coming from CSA measure¬ 
ments. The result is a structure that rivals X-ray crystal¬ 
lographic structures in quality. 

Protein Dynamics 

There are two ways to characterize protein dynamics 
with solid-state NMR. One way is to measure the 
dynamic averaging of known anisotropic interactions, 
which are sensitive to dynamics in the ms-ps 


timescale. The other way are to measure dynamics 
via relaxation, which accesses ps-ps timescales. 

Anisotropic interactions become partially averaged 
and decrease in strength with increasing dynamics. In 
an extreme case, a molecule in a macroscopically solid 
sample can be so dynamic that the dipolar are virtually 
absent and dipolar-based techniques fail. A good 
example of a highly dynamic protein domain is the 
HET-s prion protein. The parts of HET-s that are 
framing the static core of this amyloid fibril are so 
dynamic that they can only be observed using I-cou- 
pling-based pulse sequences under MAS (Siemer et al. 
2006). 

In the case of partial averaged anisotropic interac¬ 
tions, the extent of dynamics can be quantified 
(Torchia 2011). For example, the strength of a H-C 
bond dipolar coupling is known for the static case. 
Motions that happen on a timescale faster than the 
coupling lead to scaling of this coupling that is usually 
expressed as an order parameter S^. This order param¬ 
eter can be used in conjunction with a motional model 
to describe the timescale and extent of motion. A good 
example of this approach is a recent study of the 
backbone dynamics of ubiquitin via scaled *H-'^N 
couplings. Here, *H-*^N couplings were measured 
using a 3D variant of the REDOR experiment giving 
site-specific order parameters for the entire protein 
backbone (Schanda et al. 2010). 

Order parameters can also be calculated from par¬ 
tially averaged CSAs (Wylie and Rienstra 2008) and 
quadrupolar coupling tensors. The quadrupole used 
most frequently to study dynamics in proteins is 
deuterium. Static solid-state NMR spectra of site spe¬ 
cifically deuterium labeled samples have long been 
important to study molecular dynamics of biopolymers 
(Palmer et al. 1996). However, deuterium quadrupolar 
tensors can also be measured under MAS and 
in the context of multidimensional experiments on 
perdeuterated proteins (Hologne et al. 2005). 

The quantification of protein dynamics via relaxa¬ 
tion in solid-state NMR is similar to liquid-state NMR 
(see ► Protein Conformational Dynamics by Relaxa¬ 
tion Dispersion). The main difference is that in liquid- 
state NMR, relaxation is determined by local structural 
fluctuations and by the tumbling of the molecule in 
solution. In solid-state NMR, there is no molecular 
tumbling and local structural fluctuations are respon¬ 
sible for the bulk of the relaxation. However, other 
phenomena such as spin diffusion or unwanted dipolar 
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Magic Angle Spinning 
Solid-State NMR on 
Proteins, Fig. 4 (a) Selected 
R|p dispersion-curve data 
measured in GB1 at 60 kHz 
MAS, no *H decoupling was 
used during the spin lock. 

(b) Comparison of amide 
R|p and Rp Under the 
assumption of motional 
models, these relaxation data 
can be transformed into 
motional timescales and 
amplitudes (Reproduced with 
permission of ACS from 
Lewandowski et al. (201 1)) 


a 


16 
12 
8 

„ 4 

m 0 
16 
12 
8 
4 
0 


cc 



12 16 20 24 12 16 20 24 12 16 20 24 

spin-lock field amplitude (kHz) 


Pi 


Pz 


Pa P4 



10 20 30 

residue 


40 


50 


recoupling can obscure the true relaxation rates in 
solid-state NMR. Careful estimation of these side 
effects or the removal of couplings that could interfere 
with the relaxation measurement (e.g., by using high 
MAS frequencies or by perdeuterating the protein) 
allows the measurements of relaxation rates in the 
context of multidimensional NMR experiments that 
provide site-specihc dynamical information from uni¬ 
formly labeled proteins. An interesting example of 
this, shown in Fig. 4, is the study of backbone dynam¬ 
ics of crystalline GBl by using fast MAS and site- 
specific *^N Ri and Rip measurements (Lewandowski 
etal. 2011). 

Summary 

MAS solid-state NMR is a very powerful technique to 
study solid proteins that are not accessible to other 


structure biological methods. MAS is a fast sample 
rotation method that averages most of the anisotropic 
NMR interactions, thereby improving the spectral 
resolution. Since MAS is insufficient to fully average 
couplings involving protons, proton decoupling or 
deuteration is necessary to achieve optimal resolution. 
The clever combination of MAS with rf pulse 
sequences that recouple specific anisotropic interac¬ 
tions makes it possible to measure structural informa¬ 
tion while maintaining atomic resolution. There are 
recoupling sequences that measure distance and dihe¬ 
dral angle restraints from which protein structures can 
be determined. Similar recoupling methods or the mea¬ 
surement of relaxation rates can be used to measure 
protein dynamics. The applications of these techniques 
is vast and can be used to study the structure and 
dynamics of amyloid and membrane proteins and pro¬ 
tein binding, diffusion, and water accessibility to name 
just a few. 
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Definition 

Magnetic Resonance Imaging (MRI) is the process 
of spatial mapping of processes related to Nuclear 
Magnetic Resonance (NMR). In its commonly 
accepted definition, it is the formation of images 
related to water content of tissues in a human body. 
These images give good spatial morphological 
information but the power of MRI as a biophysical 
tool is the ability of NMR to reveal molecular behavior 
of atoms. The images can reveal diseased states of 
tissue and measure molecular diffusion or flow, for 
example. 

Introduction 

MRI grew out of the established field of NMR which 
had, by the early 1970s, already established itself as 
a significant analytical technique in chemistry and 
physics. In 1973 Paul Lauterbur (Lauterbur 1973) and 
Peter Mansfield (Mansfield and Grannell 1973) inde¬ 
pendently published papers outlining the rudiments of 
the imaging process using NMR. The popularity of 
MRI has increased exponentially with scanners 
becoming commonplace in hospitals and clinics 
worldwide. In 2010 there were, on average, over 40 
exams per 1,000 population in the OECD. In the 
research environment, MRI may be combined with 
spectroscopy (NMR spectroscopy) and other imaging 
modalities. The ubiquity of MRI finds this technique 
used on a range of length scales and materials - not just 
living tissue (Callaghan 1995; Kimmich 1997; 
McRobbie et al. 2007). In this section the instrumen¬ 
tation is described together with general principles 
involved in image formation. The definitions attempt 
to give the reader some insight into the terms and 
concepts used in MRI which are used in literature 
on the subject. Further applications of the MRI 
methodology may be found in entries ► CPMG, 
► Spin Diffusion, ► Flow NMR, ► NMR, ► NMR 
Methods for Kinetic Analysis, ► Pulsed Field 
Gradient NMR. 


The Magnetic Resonance Spectrometer 

The principal hardware elements of a MR Spectrome¬ 
ter are shown in Fig. 1 and will be referred to in the 


following sections. Central to the spectrometer is the 
host control computer which provides the operator 
interface, data acquisition, image reconstruction, and 
image archiving. On clinical systems, these functions 
would be carried out by separate computer subsystems. 
The timing and control of an MR spectrometer has to 
be precise and this function is carried out by 
a dedicated sequence controller (Callaghan 1995). 
This controls the timings and waveform shapes for 
the radio-frequency (RF) pulses, gradient switching, 
and acquisition periods. It is essential in NMR systems 
that frequency and timing stability are maintained so 
all RF generation and waveform generation are locked 
to a master clock frequency. 

Magnets for MRI 

MRI requires high magnetic fields which are typically 
of the order of 0.5-10 tesla (T) for human imaging. The 
static magnetic field (Bq) determines the polarization of 
the spins which, in general, increases linearly with field 
strength (Haacke et al. 1999). The polarization of the 
spins determines the amount of magnetization available 
in an imaging or spectroscopy acquisition. The 
magnetic field should be temporally stable and have 
a high degree of spatial uniformity over the region of 
interest. In the case of a typical clinical human body 
scanner, a 3 T magnet has a field uniformity of better 
than 1 part in 10® over a 0.6-m diameter sphere. These 
requirements are usually only met by the use of super¬ 
conductive magnet technology requiring cryogenic 
cooling of the electromagnet and these are found in 
the majority of hospital radiology departments in the 
developed world. Field strengths for whole-body 
systems used in research environments can now be 7 
T or greater. Such a system is shown in Fig. 2a and is 
used for functional brain imaging, spectroscopy, or 
ultrahigh resolution structural imaging. It is also possi¬ 
ble for imaging systems to be based on permanent or 
electromagnet designs. These more accessible magnets 
will have field strengths up to 0.5 T but a limited spatial 
uniformity. They can be used for low resolution, inter¬ 
ventional or routine extremity imaging. Magnets for 
MRI are continually developing and an open 0.5 
T cryogen-free magnet based on magnesium di-boride 
(MgB 2 ) superconductor technology is shown in Fig. 2b. 
Microscopy imaging systems based on narrower bore 
magnets can utilize magnets of up to 17 T field strength. 
To improve the spatial homogeneity of any magnet a set 
of shim coils may be utilized which provide slight 
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Fig. 1 Schematic diagram 
showing principal elements of 
a typical MRI scanner 




Magnetic Resonance Imaging (MRI) Methodology, Fig. 2 

(a) A 7 T superconductive cylindrical geometry magnet showing 
local RF transmit and receive coil placed in close proximity to 
the subject’s head. The subject is being prepared for a functional 
imaging brain scan. The subject will subsequently be moved to 
the center of the magnet, (b) A low-held 0.5 T magnet with 


corrections to the main magnetic field. The required 
corrections are sample and subjecl dependenl al very 
high magnetic fields requiring the spectrometer to 
automatically measure and correct the field before 
imaging commences. For lower field magnets small 
pieces of ferrous “shim” are placed around the rim of 
the main magnet at installation time to give the 


cylindrical pole pieces allowing freer access for subjects and 
staff. The magnet exploits cryogen-free magnesium di-boride 
superconductive technology. The subject is being prepared for 
a head scan and a local multichannel receiver coil is being placed 
around her head (Photographs courtesy of (a) Philips Healthcare 
and (b) Paramed Medical Systems) 


optimized field. This type of shimming can only be 
carried out by the manufacturer of the scanner system. 

Radio-Frequency Coils 

The radio-frequency (RF) coil (or probe) has the func¬ 
tion of generating (transmitting) an oscillating mag¬ 
netic field for excitation of the spins in the sample and 
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subsequently receiving the signals generated by the 
spins in the sample (Mispelter et al. 2006). The RF 
coils are tuned to the Larmor frequency of the NMR 
spin as described in the ► NMR entry. For a 3 
T scanner this corresponds to around 128 MHz for 
hydrogen nuclei (protons) which are exploited in the 
bulk of imaging applications. Should imaging of other 
nuclei (e.g., sodium or fluorine (► *'^F NMR)) be 
desired then usually a different set of coils is required. 
The transmit and receive functions may be combined 
into a single physical unit but may also be separate 
units. For example, on a clinical body scanner, the 
transmit coil is built into the bore of the magnet. This 
type of scheme is depicted in Fig. 1 where the transmit 
pulses are generated by modulating the RF with the 
pulse shape required and amplifying it before passing it 
to a whole-body transmit coil. On a 3 T scanner, the 
power amplifier may be rated at up to 20 kW - however 
only short pulses are used which reduces the power 
(and hence heating) delivered to the subject to within 
regulatory limits (3.8 W/kg for the human head). 
While the transmit coil may receive the signal for 
whole-body imaging it is usual to arrange a set 
(upwards of 8) of more sensitive receiver coils close 
to the desired imaging volume such as a head or car¬ 
diac array coil (Roemer et al. 1990). Making smaller 
and closer coils has a number of advantages in that the 
images will have a higher signal to noise ratio (SNR) or 
else the coils may provide some spatial discrimination 
which can be used to speed up the image acquisition 
process. The receive signal from the coils is first ampli¬ 
fied and then passed to the receiver subsystem which 
acquires and demodulates the signal, passing the out¬ 
puts to the image reconstruction processing computer. 

Gradients 

Magnetic held gradients are required for the spatial 
encoding of the signal in order to generate the image. 
In essence there are three gradient electromagnetic 
coils providing a linear variation in the longitudinal 
component (parallel to the main magnetic field) in the 
three spatial directions (G^, Gy, and Gz). These coils 
give just a spatially dependent and small addition 
or subtraction (of order mT) to the main held, 
thereby giving a nuclear spin, in a particular location 
within the imaging space where the gradients are 
linear, a frequency offset (Morris 1986). This process 
is illustrated in Fig. 3. If the magnitude of gradient field 
is zero then the frequency of spins at iso-center is given 


by the frequency of the Larmor precession (coo = yBo 
where y is the gyromagnetic constant for the nuclear 
spin). At a distance z from iso-center, the spins in the 
desired slice of the object will have a frequency 
cUj = coo + yGzZ (Mansfield and Morris 1982). Hence 
determining the frequency will define the position if Gz 
is known. For whole human body applications the 
electrical power required to switch the gradient coils 
is considerable (e.g., 600 V and 300 amps to generate 
a gradient of 20 mT/m switching on in a few hundred 
microseconds on a whole-body system), requiring 
cooling of both the gradient coils and amplifier units. 
Microscopy or other systems with a smaller volume of 
imaging have lower electrical requirements although 
higher gradient strengths of 1,000 mT/m may be gen¬ 
erated (Glover and Mansfield 2002). 

Slice Selection 

The first step in image encoding is an excitation of 
the spins in the volume of interest, or slice for a 
two-dimensional image. Unlike other imaging 
modalities which are dependent on wave effects or 
geometry of sample and detector, the resolution and 
slice can be set arbitrarily within the sensitive volume 
of the magnet, coil, and gradients. Figure 3 shows 
the selection of spins in a finite slice of width Az. 
Applying a band-limited pulse at a frequency offset 
of cOs and having a bandwidth of Am = jG^Az will 
excite the spins in only the required slice. This is 
accomplished by using a modulated amplitude RF 
pulse similar in form to a sine (sin(x)/x). RF pulses 
may be further optimized to give, for example, more 
homogeneous selection profiles, refocusing pulses, 
adiabatic pulses for the inversion of spin populations, 
or three-dimensional volume selection for spectros¬ 
copy (Haacke et al. 1999). 

Imaging 

Mapping (or imaging) of the spin density could be 
carried out by selecting a volume element of interest 
and recording the NMR signal. Moving the sensitive 
volume around the object builds up a two- or 
three-dimensional representation of the object. Early 
scanning methods were variations upon this theme. 
However, this type of encoding is highly inefficient 
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Magnetic Resonance Imaging (MRI) Methodology, 
Fig. 3 Relationship between position and frequency developed 
by gradient as used to define imaging slice 


and slow. It is much quicker to exploit the link between 
the frequency representation and the spin density of 
physical object caused by the application of a gradient 
during the acquisition of the signal. The Fourier Trans¬ 
form of the acquired signal with a gradient applied in 
a particular direction gives the spin density profile 
projected onto that axis. Imaging can therefore 
be achieved by stepping the gradient angle round the 
object and reconstructing the profiles as would 
be carried out in a computed tomography scanner. 
Again, while conceptually straightforward this is 
not always the most efficient method of encoding 
the image although radial scan geometry is used 
in many current applications (Lauterbur 1973; 
Morris 1986; Callaghan 1995; Haacke et al. 1999; 
McRobble et al. 2007). 

k-Space 

The MR signal collected during the application of 
gradients is in the reciprocal space (or spatial 
frequency) domain. As frequency is dehned as a rate 
of change of phase then this domain can be thought of 
as a “phase” space and is commonly termed k-space. 
This reciprocal correspondence of the two domains 
gives the MRI user a valuable insight into the process 
of image formation and is used to design pulse 
sequences and determine contrast weighting of images. 
This relationship is shown in Fig. 4 which depicts the 
spatial domain (the image space) and the frequency 


(or k-space) domain. For example, the extent or max¬ 
imum phase evolution employed yields the resolution 
(or point spread function) of the final image and the 
sampling rate (assuming matched bandwidth of 
the signal) yields the field of view of the image. 
Hence, the k in k-space is akin to a wave-vector 
although it can be arbitrarily chosen by setting 
appropriate gradient values and readout times. It is 
not related to the wavelength of the radiation or the 
detector geometry as would be the case in most other 
imaging modalities. Although the user has control over 
the rate and extent of the coverage of k-space, there 
are constraints which bound the choice of imaging 
parameters. These are, of course, the length of time 
available to encode the image governed by the relaxa¬ 
tion of the spin system as well as the amount of noise in 
the hnal image largely governed by the bandwidth of 
the measurement: The noise sources are Gaussian 
and of thermal origin (proportional to the square root 
of the bandwidth) and arise from the conductivity of 
the sample and the coil plus electronics. In general, the 
faster the coverage of k-space, the more noise is 
present in the hnal image. Again the user has control 
over the rate of collection of the k-space data and many 
different strategies may be employed which can vary 
from a single line for each excitation (high SNR but 
slow acquisition) to the whole of k-space in one exci¬ 
tation and acquisition. The latter may be of only a few 
milliseconds in the case of echo-planar imaging (EPI) 
used for high-speed imaging or functional MRI 
(Mansheld 1977). 

k-Space Encoding 

Because the data collected in k-space preserves both 
the phase and amplitude information of the signal, it is 
theoretically possible to reconstruct the image from 
just half of the full k-space data set. This is due to the 
symmetry of the phase space. However, this is rarely 
carried out in its purest form and slightly more than 
half of the full k-space is encoded. The evolution of the 
sample point location in k-space is given by the time 
integral of the applied gradients. If all positions in 
k-space have to be encoded, then the gradients may 
be switched and reversed such that the acquired 
k-space point traverses k-space in an ordered manner. 
The strategy employed is termed the “pulse sequence.” 
Coverage could be a raster scan, a radial scan, or 
a spiral, for example (Haacke et al. 1999). An example 
sequence to record one line of k-space is shown in 
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Fig. 4 Reciprocal link via 
Fourier Transform (FI) of 
spatial (LHS) and frequency 
(k-space) (RHS) domains. 
Data are collected in the 
k-space domain and Fourier 
Transformed to give the 
spatial map or image. The 
inset graphics show the 
k-space representation of a 
2D slice from a head MRI scan 



Fig. 5. The sections of the sequence are termed slice, 
phase, and read which is a common terminology used 
for defining sequences. For this example the gradient 
axes are labeled Z, Y, andX respectively. This is merely 
for clarity of example but selection of gradient direc¬ 
tion is arbitrary and could even be in combination 
during all the periods which would give oblique imag¬ 
ing planes of any orientation. The slice period defines 
the imaging plane of interest as described above with 
a band-limited RF pulse applied during the slice 
gradient. A “refocusing” slice gradient reversal is 
used to cancel the accumulated phase in the slice 
gradient direction such that the effective k-space posi¬ 
tion at time point lis = ky = 0. This is shown in the 
corresponding point on the k-space representation. 
Application of a phase gradient (Gy) causes a phase 
evolution down the ky axis to point 2 which is going to 
be the desired row of k-space to sample. A negative 
gradient in the read direction (GJ sets the extent of k^ 
to use (point 3). The read gradient is reversed and the 
k-space point traverses the k^ axis while the signal is 
recorded, through point 4 and ends at point 5. The read 
gradient can sometimes be thought of as a frequency 
encoding as the signal recorded will contain frequency 
components corresponding to the spatial profile of the 
object. This property is similar to that of slice selection 
as shown in Fig. 3 where the gradient profile maps 
position onto frequency. Hence, a Fourier Transform 
of the signal gives the spatial profile of the object 
directly. Having obtained one line of k-space, the 
process may be repeated for this slice (as shown in 
Fig. 6a), another slice, or the read gradient may 
be inverted and another line of ^^.-space acquired 


(using a small phase gradient to go to another ky 
position (as shown in Fig. 6b). A sequence which 
collects a complete plane of k-space data in a raster 
scan using one excitation pulse is known as 
echo-planar imaging (EPI). Once all the desired lines 
for the slice (or volume) have been acquired, then the 
image can be reconstructed using an algorithm based 
on a two-dimensional (or 3D for a volume acquisition) 
Fourier Transform. 

Gradient Echo Imaging 

The above example is usually termed a “gradient 
echo” sequence. The term comes from the use of 
gradient reversals to encode the image. The time 
delay between the RF excitation and point 4 in 
Fig. 5 (where the k^ value is zero) is termed the 
time-to-echo (TE). This point defines the maximum 
coherence of the signal and the term “echo” arises 
from the observation that there is usually a maximum 
amplitude of the signal. 

Spin-Echo Imaging 

The spin-echo sequence is a modification of the above 
definition. An 180° pulse (see ► NMR) is used to 
“refocus” the coherence of the spins at the point 
defined by TE such that incoherences arising from 
magnetic fields not related to the applied gradients 
may also contribute to the signal. The signal 
(and hence the resulting image) is then 
rendered immune to these incoherent effects and 
hence the contrast in the image is affected. In 
k-space, the application of a refocusing pulse would 
take the sampling position to a position with an 
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Magnetic Resonance 
Imaging (MRI) 
Methodology, Fig. 5 A 

simplified pulse sequence to 
read one line of k-space. The 
gradient sections (slice, phase, 
and read) would usually be 
separately played out on three 
orthogonal gradient axes (e.g., 
X, Y, and Z). The trajectory of 
the acquisition point in 
k-space is shown. Only the 
data between points 3 and 5 
are acquired for image 
reconstruction 



Magnetic Resonance 
Imaging (MRI) 
Methodology, Fig. 6 The 

k-space trajectories used to 
encode a complete image for 
(a) a 2DFT gradient 
echo sequence and (b) an 
echo-planar sequence 



2D FT Gracdient Echo 


Echo Planar Imaging 


inverted and ky value. Spin-echo-related images will 
tend to have less T 2 * (see ► NMR) weighting than is 
the case in gradient echo imaging. Spin-echo imaging 
inevitably requires more RF power which may result in 
slower acquisition of the images (because the maxi¬ 
mum allowed average RF power deposition level may 
be reached) but also may be used to characterize the 
true T 2 values for the tissues. Imaging sequences based 
on a series of inversion pulses such as RARE (and 


variants) are related to the CPMG sequence (Haacke 
et al. 1999). 


Contrast Enhancement, Encoding, and 
Quantification 

The power of NMR and therefore MRI is the ability of 
the image to reflect the state of the spin system under 
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observation. In the case of MRI, this is usually the 
protons in water. The water environment in the body 
is found in many states: bound to molecules, unbound, 
intracellular, and extracellular. These environments 
will have different characteristics in terms of their 
water relaxation times. Any one voxel in an image 
records the total of all the states. MRI voxel size is 
usually much greater than cellular length scales 
(except for the special case of MR microscopy) and 
hence attribution of a single uniform relaxation or 
tissue parameter can only be an average of the observ¬ 
able distribution of value. However, full or even partial 
quantification of these relaxation parameters can 
discriminate between disease and damaged tissue. 
Almost any NMR or spectroscopic parameter can be 
“imaged” or at least localized to a volume of interest 
(Gadian 1995). For most imaging needs, the images are 
“weighted” with respect to a particular parameter such 
as the relaxation parameters (Tj, T 2 , or T 2 *), diffusion 
(D), or magnetization transfer (MT), for example 
(Kimmich 1997; McRohbie et al. 2007). As described 
above, the T 2 and T 2 * parameters can be deduced by 
use of a comparison between gradient and spin-echo 
images of the same slice, or using a long value of TE 
which allows short T 2 * components of the signal to 
decay and not contribute to the image. The repetition 
time (TR) plays an important role in the determination 
of image contrast. It is defined as the time between 
excitations (and therefore acquisition) of previously 
utilized magnetization. Taking the simple example of 
Fig. 5, if the next row in the same slice was acquired 
immediately using another excitation before the mag¬ 
netization had time to recover, then there would he 
little signal to acquire. Hence the magnitude of the 
recovery curve following an excitation, or set of exci¬ 
tations, determines the T 1 contrast in the image. This 
contrast is a complex interplay of the repetition time 
(TR) and flip angle of the excitation(s) used in the 
sequence. Use of a flip angle equal to the Ernst angle 
yields the maximum signal in the image for a particular 
highlighted Ti value. To determine the Ti value in 
a quantitative way or to improve T 1 contrast still fur¬ 
ther, an inversion pulse (180°) can precede the imaging 
sequence. The user can then pick a time interval (TI) to 
read out the available net magnetization available as 
the signal recovers (see ► Kinetics: Relaxation 
Methods). Choosing a value of TI corresponding to 
the zero crossing of the recovery curve can effectively 
cancel out (or filter out) a specific value of Ti. Fluid 


attenuated methods (such as FLAIR) can be used to 
“remove” free water (e.g., the CSF) contribution from 
the image, thus highlighting just gray and white matter 
in brain tissue (McRohbie et al. 2007). 

Other Preparation Sequences 

The imaging readout may be preceded by a variety 
of preparation sequences or integrated with the 
sequence to encode other information within 
the image. In general, the minimum requirement for 
any quantification would be two sequences, one with 
no encoding taken as a reference image and 
one with encoding present. The difference or ratio of 
the image magnitudes can then be used to determine 
a specific parameter. For full quantification of 
some parameters, many images might be needed 
with a progression of weightings used to model the 
parameter to be measured. 

Flow quantification may be used by exploiting the 
phase evolution during a gradient pulse in a specific 
direction and its reverse evolution. If a spin has moved 
along that axis, then there will be a loss of coherence 
directly related to the velocity component along that 
axis. Diffusion is a similar process although the motion 
of the spin is random (or perhaps restricted as in 
a nerve fiber). This random loss of phase leads to 
a reduction in image intensity due to applied gradients. 
The complete diffusion tensor may be measured 
by recording a complete set of images with different 
diffusion direction gradients applied. 

Although imaging may be integrated with spectros¬ 
copy, the most usual and simplest form is in the form of 
fat-water suppression or imaging. The excitation 
sequence can either be set to edit out one or other of 
the contributions from fat or water due to its difference 
in chemical shift (see ► NMR). Two images can 
be obtained independently or, as is more usual, 
acquiring the second image with the fat peak 
selectively inverted. The sum and difference of the 
two then gives discrimination of the water and fat 
images (Haacke et al. 1999). 

Magnetization transfer (MT) can be employed 
to reveal further information about the exchange of 
water with molecules in cells, for example. 
Fast exchange leads to very broad line-widths 
(short T 2 components) which are usually inaccessible 
to conventional imaging as the signal decays before 
it can be used for imaging. In MT, an off-resonance 
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pulse is used to saturate the spins which are strongly 
bound but have some rate of exchange with the 
visible pool of spins. Taking images with a variety 
of offsets gives an MT curve which can be related 
to the exchange rate and line-width of the short T 2 
environment (Kimmich 1997). 

Optimization of Imaging Methods 

The encoding of the image can be carried out in many 
different ways in order to meet a set of criteria and 
constraints. These criteria may be speed or quantihca- 
tion, two or three dimensions, image weighting. 
However, all have a common thread of desiring the 
highest quality information, lowest uncertainty in 
parameter measurement for minimum time. The latter 
can certainly be an overriding aspect of human body 
imaging but less of a consideration for in vitro or 
prepared samples and materials. Full use of the 
available time can be made by interleaving scans, 
slices, and different weightings. Encoding multiple 
lines of k-space for each excitation is efficient with 
the complete encoding of k-space in one acquisition 
called echo-planar imaging (EPI) introduced by 
Mansfield (1977). 

More recently, with the use of non-Fourier-based 
reconstruction methods, the amount of k-space 
required for image formation may be reduced while 
maintaining image quality and quantification (Hoge 
et al. 2005). These sparse or partial k-space methods 
exploit the apparent redundancy in the image where 
there is little signal contributing to the image. Hence, 
these data are omitted from the acquisition process and 
reconstruction is based on a minimization process 
within given constraints. The reduction in data 
required can be considerable, leading to a faster 
acquisition - particularly if other a priori information 
can be included in the model, for example, an inversion 
recovery model. 

Use of multiple local receive-only coils (Roemer 
et al. 1990) can be used to regain some of the spatial 
information lost when a reduced k-space set of data is 
used. The image is deliberately subsampled such 
that the image would wrap round and overlap if 
reconstructed normally. However, the location of 
a particular spin can be fully determined by the rela¬ 
tive signal strengths received by the separate coils. 
Two examples based on this type of processing are 


SMASH (Sodickson and Manning 1997) and 
Sensitivity Encoding (SENSE) (Pruessmann et al. 
1999, 2001). 


Summary 

Magnetic Resonance Imaging is the spatially 
resolved aspect of Nuclear Magnetic Resonance 
methods. MRI maybe synonymous with imaging of 
the human body in a clinical setting, but because of its 
NMR sensitivity to molecular environment on 
a nano-, micro-, and macro-scopic scale, the images 
contain a rich array of information. Whereas NMR 
in vitro methods generally feature a high level of 
spectral resolution unavailable in MRI, the latter 
technique can probe form and function at the organ¬ 
ism level. This characteristic of MRI is giving 
researchers a richer understanding of brain, cardiac, 
and other function as well as highly resolved structure 
in vivo. MRI methodology is continuing to develop 
both in hardware technology as well as increased 
speed and resolution afforded by computational 
processing and parallelism. 

Cross-References 

► ‘‘^E NMR 

► CPMG 

► Elow NMR 

► Kinetics: Relaxation Methods 

► MRI and PET Imaging: Clinical Applications 

► NMR 

► NMR Methods for Kinetic Analysis 

► Pulsed Eield Gradient NMR 

► Spin Diffusion 
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Synonyms 

Magnetic field constant force clamp; Single-molecule 
methods 


Definition 

A magnetic tweezer is an instrument using a magnetic 
field to exert a force on a superparamagnetic particle. 

Introduction 

In essence, as used in molecular biology, a magnetic 
tweezer is a tethered ball on a string attached at one 
end to a surface. The ball happens to be a superpara¬ 
magnetic bead, and the string is a biopolymer such as 
a molecule of DNA. The surface is a glass microscope 
slide so the bead can be observed with a microscope. 
Magnetic forces from external magnets are applied to 
the bead, and video microscopy is used to follow the 
motion of the bead in the x, y, and z planes to understand 
the properties of the biopolymer. 

The properties of untethered superparamagnetic 
beads, say within a membrane or a cell, subjected to 
a magnetic field have also been studied to learn more 
about the properties of the biological system (Ziemann 
et al. 1994; de Vries et al. 2007). 

Some highlights that have used magnetic tweezers 
on a DNA tethered superparamagnetic bead are the 
quantitative measurement of the elasticity of individ¬ 
ual DNA molecules, the observations of plectoneme 
formation in individual DNA molecules, the activity of 
topoisomerase II on an individual DNA molecule, 
bubble formation by an initiating RNA polymerase, 
transcription elongation, the mechanism of action of 
topoisomerase lA, action of RuvAB, the mechanism of 
gyrase, proteins affecting DNA braiding, and the effect 
of chromatin. The reader is directed to numerous 
reviews and references therein (Strick et al. 1998; 
Zlatanova and Leuba 2003a, b; Leuba et al. 2004, 
2009; Neuman and Nagy 2008; Kapanidis and Strick 
2009; Meglio et al. 2009; Einzi and Dunlap 2010; 
Koster et al. 2010). 

Determination of the Stretching Force 
Acting on a Biopolymer 

For the discerning student we go into detail how the 
forces are determined. A typical magnetic tweezers 
system under consideration consists of a biopolymer 
molecule of length L attached on one end to a magnetic 
sphere and on the other end to a glass slide surface. The 
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Magnetic Tweezers, Fig. 1 Schematic of forces in a tethered 
magnetic tweezers setup 

diameter of the sphere usually is a few micrometers, 
and the length of the biopolymer is several times lon¬ 
ger than the sphere diameter. From Fig. 1 we can see 
that the magnetic force Fm maintains the biopolymer 
molecule oriented perpendicularly to the surface. The 
bead constantly receives random impacts from the 
solvent molecules that create a force component per¬ 
pendicular to the external field orientation (here taken 
as the vertical axis), making an angle 0. The solvent 
fluctuations at the microscopic level may be 
represented mathematically by a fluctuating force 
X(t). The tangential component of the magnetic force 
is the restoring force Ft- According to Fig. \,Fj = Fm 
sin (s/L) ^ Fm (xIL) = kx, where k is the force con¬ 
stant. For small angles, 0, the arc may be replaced for 
the distance, x (Fig. 1). Thus, when the amplitude of 
the motion of the pendulum is small, the equation that 
describes its dynamics is mathematically identical to 
that of a harmonic oscillator. 

Fluctuation-Dissipation Theorem (FDT) 

The Langevin equation provides the correct descrip¬ 
tion of the motion of Brownian particles 
(Chandrasekhar 1943; Berg 1993; Nelson 2004). As 
we shall see in the next section, one of the terms in the 
Langevin equation is a fluctuation force X(t), which 


represents the random impacts that the microsphere 
receives from the solvent molecules. By definition, 
the average of the fluctuating force is zero, 
{X{t)) — 0. In addition, X(t) satisfies the fluctuation- 
dissipation theorem (FDT), {X{t)X{f)) = 

73(1 — /). This theorem establishes a connection 
between X(t), and the drift coefficient and the thermal 
energy. For the particular case of a sphere the drift 
coefficient is ; = 6nr]a, where rj is the viscosity of 
the solvent, and a is the radius of the sphere. The factor 
k^T is the thermal energy; the small magnitude of the 
Boltzmann’s constant (kg = 1.38 x 10”^^ J K“*) indi¬ 
cates that the fluctuations are very small. According to 
FDT, the correlation time of the fluctuating force is 
initially i^ksT, but decays rapidly to zero, indicating 
that dispersion forces are highly uncorrelated. 

We can expand our discussion of the FDT 
employing the Einstein-Smoluchowski equation for 
the diffusion coefficient of a spherical particle, 
D — ksT /;. We can write the FDT in terms of rj, a, 
and D, as follows: 

(X(f)A(f')) = C^Dd{t-/) = SeTL^rfa^DSit - /) 

The correlation of the fluctuating force is proportional 
to the square of the viscosity of the medium as well as to 
the square of the radius of the sphere, but varies linearly 
with the diffusion coefficient of the sphere. 

Measurement of the Force Applied to 
a Biopolymer (e.g., DNA) 

The inverted pendulum undergoing small displace¬ 
ments about its equilibrium position may be described 
employing the Langevin equation for a harmonic 
oscillator: 

X(t) was originally called “complementary force” 
by P. Langevin (Langevin 1908) and satisfies the prop¬ 
erties already described. The second term on the left- 
hand side of Eq. 1 is the viscous resistance, which is the 
mechanism that the system employs to dissipate 
energy, and assumes that friction is proportional to 
the velocity. The third term is the force due to the 
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displacement of the particle from the equilibrium 
position. Multiplying Eq. 1 by jr we obtain: 


cPt l dr 2 k 

Try -^ ® 

dt^ m dt m 


( 8 ) 


d^x ^ dx , ,,, , 

mx^^+ (x-h Kx = xX{t) 


dfi 


dt 


( 2 ) 


dx d^x 

We can eliminate the x— and x—^ terms from 
„ - , . dt dt^ 

Eq. 2 by using: 


This is a second order equation with constant coef- 
hcients that has the solution / /, = Substituting this 
solution we obtain the indicial equation, 
(y^ + yC/wt + 2K/OT)r/, = 0. The two roots of this 
quadratic equation are: 


dx 1 d{x^) 

dt 2 dt 


(3a) 


and then taking another derivative with respect to time: 

1 c/2(x2) 


d 

dt 


dx 

dt 


= X 


d^x 

dfi 


2 dfi 


(3b) 


we obtain: 

m d^{x^) 
2 dfi 



C d{x^) 

2 dt 


+ KX^ 


= xX{t) 


( 4 ) 


Taking time averages, we arrive at: 


m d^{x^) 
~2 df- 





{x){X{t)) 

( 5 ) 


where the angle brackets represent averages. Now, we 
apply the property, {X{t)) — 0 and Eq. 5 simplihes to: 


md^{x^) (d{x)\ 

2 ) 




Using 

d{x] 


the 

1 


equipartition principle. 


-ml I = TzksT, and dehning the fluctuation in 

the mean-square displacement as, r = (x^), Eq. 6 
simplifies to: 


d^r C dr 2 k hgT 

Try 3-y 3— ^ ~ — 

dt^ m dt m m 


( 7 ) 


This is an inhomogeneous equation in which kgT jm 
is constant. 

Solution of the Homogeneous Equation. The 
standard strategy to solve Eq. 7 is to first solve the 
homogeneous equation: 


-c + - 8mK ^ -c - - 8mK 

y+ =-^, and y_ =--- 

2m 2m 

( 9 ) 

Thus, the general solution of the homogeneous 
equation is: 

rh{t) = c+ exp(y+f) -h c_ exp(y_f) (10) 

If we impose the boundary condition, 
77i(0) = c+ + C- = 0, then Eq. 10 simplifies to: 

ri,{t) = c+(exp(y_^r) - exp(y_r)) (11) 

Solution of the Inhomogeneous Equation. Notice 
that if rp = K, and r'^ = r" = 0, then, K — kgT jK, and 
the solution is: 

(x^(t)) = I'h + i-p = c+(exp(y+t) - exp(y_t)) + IcbT/k. 

( 12 ) 


Now let us discuss two limiting cases: 

(a) < 8mK: Inertial forces are important. The roots 
will be complex and the system will oscillate. This 
situation is of little interest for scientists interested 
in magnetic tweezers. 

(b) > Sm/c: Inertial forces are negligible. When 
a particle satisfies this condition, it displays 
Brownian motion. The solutions of the homoge¬ 
neous equation are real and will approach zero at 
long times, lim r/j(f) ^ 0, so that (x^) = ksT/K. 
Because the mass only enters in the solution of the 
homogeneous equation, for a long enough time 
inertial forces are not important (i.e., the mass 
dependence disappears). From a practical point of 
view, this signifies that in order to obtain meaning¬ 
ful/reliable data, we have to wait until the system 
samples the whole phase space. Substituting for 
the force constant, k = Em/L, we arrive at the key 
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equation (Strick et al. 1998) that is employed to 
measure the force: 


Fm = 



(13) 


extremely powerful single-molecule biology 
approach, and the use of this instrument has only just 
begun. Based on the limitation of references for this 
entry, we apologize to all authors whose contributions 
we could not cite. 


Einstein’s Equation. Einstein’s formula for the 
probability of a fluctuation for a closed system is 
p oc , where AS is the change in entropy associ¬ 
ated with the fluctuation (Landau and Lifshitz 2008). 
We are interested in the dependence of the entropy on 
the displacement of the sphere from its equilibrium 
position. Since the entropy in the equilibrium position 
must be a maximum, the first derivative of S is zero 
and the second derivative is negative. Thus, we can 
expand the entropy around the equilibrium position: 

AS = S(0) — - {S^S)x^ = S(0) — 2 ^^' Substituting 

this result in Einstein’s formula and using the normal¬ 
ization condition, we obtain a distribution function: 
p{x)dx = We can employ this 

equation to obtain the mean-square displacement: 



/ OO 

g-lr _ 2kB/ A. 

•OO 


(14) 


Combining Eqs. 13 and 14, we can find an expres¬ 
sion for the parameter L, 1 = 8^S = 2Fm/TL. Next, we 
substitute this result into the distribution function: 


p{x)dx= ^J^^^sxp{-{Fm/L)x^/ kBT)dx. (15) 

Equation 15 is the distribution function for fluctua¬ 
tions in the displacements in terms of the applied force, 
the length of the biopolymer, and the temperature. 


Summary 

In this entry, we have briefly described the magnetic 
tweezers and mentioned certain highlights that have 
been achieved with this instrument. We have explained 
the theoretical basis of the instrument. We hope that we 
have piqued the curiosity of the reader and given them 
enough of an understanding to further pursue knowl¬ 
edge of this instrument. The magnetic tweezers is an 


Cross-References 

► Dual-Beam Optical Tweezers 

► Force-Eluorescence Spectroscopy 

► Optical Tweezers 

► Single-Particle Tracking 

► Tethered Particle Microscopy 
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Computational Studies 
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Arcavacata di Rende (CS), Italy 

Definition 

Arginase is a metalloenzyme that hydrolyses 
L-arginine to L-omithine and urea. 

Introduction 

The metalloenzyme arginase catalyzes the hydroly¬ 
sis of L-arginine to L-ornithine and urea. The enzyme 
is mainly found in the liver, where it catalyzes the 
final cytosolic step of the cycle of urea, which pro¬ 
vides the principal route for the disposal of nitroge¬ 
nous waste from protein catabolism (Krebs and 
Henseleit 1932). 

Arginase activity is also found in nonhepatic cells 
like the red blood cell, the lactating mammary gland, 
the kidney, the myometrium, and the macrophage. 
Here, the function of arginase is to modulate arginine 
and ornithine concentrations for biosynthesis. 
L-ornithine serves for the production of growth factors 
polyamines. An overexpression of arginase in cells 
enhances poly amines’ production required for prolif¬ 
eration and neoplastic transformation of mammalian 
cells (Xie and Fidler 1998). Thus, a role in promoting 
tumor cell proliferation is revealed for arginase. 

Arginases can be activated for catalysis by Mn(II), 
Co(II), Ni(n), Fe(II), and Cd(n), although the 


physiological activator is accepted to be Mn(II) 
(Hellerman and Perkins 1935). 

Arginase from rat liver is a homotrimeric protein. 
Each of the three identical subunits exhibits 
a characteristic a/P fold and possesses a pair of anti- 
ferromagnetically spin-coupled Mn(II) ions neces¬ 
sary for optimal activity. Ligands are HislOl and 
Asp 128, that coordinate to MnA, and His 126 and 
Asp234, that coordinate to MnB. Asp 124, Asp232, 
and a water molecule/hydroxyl group are bridging 
ligands for the two Mn ions (Ivanov and Klein 
2004). The water molecule is involved in a strong 
hydrogen bond with Asp 128 (Khangulov et al. 

1998) . The substrate binds to cations, while the bridg¬ 
ing H 20 / 0 H“ is located in front of the substrate 
carbonyl carbon atom. The Asp 128 is strongly 
interacting with the hydroxyl/water (Bewley et al. 

1999) (see Scheme 1). 

The structure of metal-depleted arginase, 
containing only one Mn(II) per subunit, reveals that 
MnA is the dissociated metal ion in the half-active 
state. A fully occupied binuclear manganese 
metal cluster is required for optimal catalysis and 
thermostability (Scolnick et al. 1997). 

Catalytic mechanism (Scheme 2) starts with the 
binding of L-arginine to the Mn ions through the -NH 2 
groups. It undergoes a nucleophilic attack by the 
bridging H 20 / 0 H“, leading to a tetrahedral intermedi¬ 
ate, whose protonation mediated by the Asp 128 residue 
entails products formation. 

The catalytic cycle reported in the Scheme 2 and 
widely accepted as the more reliable one has been 
validated by means of density functional-based 
methods (Leopoldini et al. 2009). 

The model cluster used to simulate the active site of 
arginase (see Scheme 1) is made up by the hrst shell 
coordination sphere of the two Mn^"^ cations (OH/H 2 O, 
Aspl24, Asp232, Asp234, HislOl, Hisl26, Aspl28), and 
by some nearby residues (Glu277, Aspl83, Asnl30, 
Serl37, and Hisl41). Ligands are represented by the 
functional part of the side chains only. The presence of 
a metal-bonded HO” is justified considering that the 
metal-bound water molecule is deprotonated by aspar¬ 
tate/glutamate residues, providing the nucleophile agent. 
This model is investigated within the mechanism 1. 
However, also a model with an unprotonated Asp 128 
(replaced by CH 3 COO”) is considered because of the 
absence of sure indication about the protonation state of 
the active site. 
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Manganese-Containing 
Arginase - Computational 
Studies, Fig. 1 Potential 
energy surfaces for the Mn 2 
(black line) and Mn (red line) 
clusters 



reaction coordinate 


Two exchange-correlation functionals, B3LYP 
and MPWBIK, are employed to perform geometry 
optimization. The 6-31 -i- G* basis set and the 
LANL2DZ pseudopotential are chosen to describe 
the C, N, O, and H atoms, and the metals, respectively. 
Frequency calculations are performed to establish the 
character of the stationary points as minima or saddle 
points. Potential energy surfaces (PESs) are given as 
6-311-i-i-G** single point energy calculations on 
the 6-31 -I- G* optimized geometries (Leopoldini 
et al. 2009). 

The Mn-Mn arginase-catalyzed hydrolysis of argi¬ 
nine has been shown to be pFI dependent (Leopoldini 
et al. 2009). Particularly, density functional computa¬ 
tions involving a water molecule rather than 
a hydroxide ion in the active site have provided 
a very high energy barrier for the rate-determining 
step. The very high relative energy (41.6 kcal/mol) 
can be due to the fact that it incorporates the energy 
cost for both nucleophile addition on substrate and 
deprotonation of the catalytic water by Aspl28. 
Results clearly have suggested that the reaction involv¬ 
ing a water molecule nucleophile is too slow to be 
compatible with an enzymatic catalysis. This mecha¬ 
nism has been also quite questionable about the behav¬ 
ior of Glu277 as a ligand to MnA, in disagreement with 
all the available X-ray structures. For all these reasons, 
a reaction path according to the mechanism involving 
a water molecule seems unlikely. 


In the Fig. 1, the potential energy surfaces (PESs) 
obtained through DFT computations are reported for 
both the binuclear (black line) and mononuclear (red 
line) manganese cluster of the active site. In both cases, 
the presence of a hydroxide ion as nucleophile is 
considered. 

Eirstly, the presence of a hydroxide ion (and 
a negative Asp 128 amino acid) modifies the reactivity 
when compared to the results involving a water mole¬ 
cule (Leopoldini et al. 2009). The lone pair carried by 
the —OH nucleophile is directly available to perform 
the addition on the guanidine carbon atom. The sub¬ 
strate binds to the di-metal site through weak interac¬ 
tions. The Glu277 in this case is not a ligand to the 
MnA site, but helps in substrate binding. 

The activation energy for the rate-limiting step that 
is represented by the nucleophilic OH addition on 
substrate carbon atom is computed to be very low 
(17.6 kcal/mol) with respect to the value computed 
for the mechanism involving a water molecule 
(41.6 kcal/mol). 

The former mechanism seems to be more energeti¬ 
cally reliable considering that a kcat of 250 ± 20 s“* 
(Cavalli et al. 1994) and thus a barrier of « 14 kcal/mol 
are proposed experimentally, together with the evi¬ 
dence of an optimum pH range between 9 and 9.5 for 
the enzyme (Kuhn et al. 1991). 

The effect on catalysis of the depletion of one 
metal from the active site has been investigated and 
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Manganese-Containing 
Arginase - Computational 
Studies, Fig. 2 Optimized 
geometries of the enzyme- 
substrate complex (ES) and 
the first transition state (TSl) 
for the Mn 2 and Mn clusters 
(unimportant hydrogen atoms 
are not shown) 



Mnj cluster 



the influence on energetics can be inferred from the 
Fig. 1 (red line). 

Even if elementary steps are different for the two 
clusters (Leopoldini et al. 2009), the rate-determining 
step is always the —OFI nucleophilic addition. But, in 
the case of the Mn cluster, it requires an activation 
energy of 29.6 kcal/mol. This value is higher than the 


one coming from the Mn 2 cluster. Explanations can be 
argued from the optimized geometries for the enzyme- 
substrate complex (ES) and the transition state for the 
OH addition (TSl) reported in the Eig. 2, for the Mn 2 
and Mn clusters. Arginine, because of the absence of 
MnA, is not correctly oriented to be attacked by the 
nucleophile, so the energy barrier in the mononuclear 
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cluster incorporates also the cost for right location of 
the substrate. 

These theoretical results are quite consistent 
with the experimental indications about the reduced 
catalytic activity of the metal-depleted enzyme 
(Scolnick et al. 1997). 

Thus, arginase enzyme presents a higher catalytic 
activity in alkaline medium. An intact binuclear manga¬ 
nese cluster and a ligand field are required for optimal 
catalysis. In fact, MnA cation favors the correct binding 
and orientation of the substrate, as the higher activation 
energy obtained for the MnA-depleted cluster confirms. 

Cross-References 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 
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Markov Models of Molecular Kinetics 

Frank Noe 

Institute of Mathematics, FU Berlin, Berlin, Germany 

Introduction 

Markov (state) models (MSMs) are an approach to 
understand conformational dynamics of molecules 
using computer simulations. An MSM consists of 
(1) a subdivision of the state space into a discrete set 
of microstates, often using some clustering method, 
and (2) a Markovian model to describe the transition 
dynamics amongst these microstates, usually 
a transition probability matrix or rate matrix. 

MSMs are especially useful when studying com¬ 
plex macromolecular changes, such as folding, native- 
state transitions, and binding. Such systems are often 
metastable, that is, the protein(s) fluctuate within a set 
of structures for a long time before enough thermal 
energy is accumulated to leave this set and transition to 
another metastable set (Frauenfelder et al. 1991; 
de Groot et al. 2001). It is the interest of chemical 
physicists and biophysicists to identify the essential 
metastable states, quantify their free energies or prob¬ 
abilities, the kinetics arising from the transitions 
between them, and the structural mechanisms 
involved. 
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In order to overcome the limitation of indirect 
observability of experiments, molecular dynamics 
(MD) simulations are becoming increasingly accepted 
as a tool to investigate structural details of molecular 
processes and relate them to experimentally resolved 
features. MSMs are a systematic framework for ana¬ 
lyzing and also for driving molecular dynamics simu¬ 
lations. Compared to standard analyses of molecular 
dynamics simulations, MSMs have a number of useful 
features: 

1. Long-term molecular kinetics may be predicted 
from short-time simulations. 

2. Great amounts of simulation data can be analyzed 
with relatively little subjectivity of the analyst. 

3. Stationary and kinetic quantities can be calculated, 
such as conformational free energy differences, 
metastable states, and the ensemble of transition 
pathways. 

4. Simulation data and measurement data can be 
reconciled in a rigorous and explicit way. 

5. Statistical information contained in MSMs can be 
used to allocate new simulations adaptively. 

Due to the advent of large-scale distributed com¬ 
puting frameworks and the recent performance 
increase of computer clusters, large numbers of short 
trajectories are becoming more and more easy to gen¬ 
erate (Voelz et al. 2010; Noe et al. 2009; Shaw et al. 
2010). MSMs and other methods based on trajectory 
ensembles are thus increasingly useful and important 
in the process of investigating conformational dynam¬ 
ics with simulations. There are currently two relatively 
complete software packages for building and analyz¬ 
ing MSMs: MSMbuilder (Beauchamp et al. 201 1) and 
EMMA (Senne et al. 2012). 

Markov Model Theory 

Basics 

The dynamics of the molecular system considered is 
given by trajectories of a stochastic process x(t) in the 
continuous state space consisting of positions and 
momenta. The stochasticity of x(t) comes through cou¬ 
pling the system to a thermostat. Following properties 
are assumed for x(t): (1) x(t) is Markovian in full state 
space, (2) x(f) is ergodic and states are visited with 
a frequency given by the Boltzmann distribution 
p(x) = Z(^)“^ exp(—/?//(x)) (3) the dynamics are in 
thermal equilibrium and thus x(r) fulfills microscopic 


detailed balance. See Prinz et al. (2011) for a more 
extensive description. These conditions are fulfilled by 
not all, but many dynamical models frequently used to 
simulate molecular dynamics. Even for setups violat¬ 
ing these conditions, MSMs are often useful, although 
they are then not justified by a solid theory. 

Let the state space with coordinates x be discretized 
into “microstates” {Si,. ..,Sn). Ty(T) represents the 
time-stationary probability to find the system in state 
j at time t + z given that it was in state i at time t: 

Tij^z) = P [x{t + z) € Sj\x{t) G Si], 

defining a transition matrix T(t) G ". Note that t 
can be orders of magnitude shorter than the longest 
timescales of the system. The transition matrix can also 
be written in terms of correlation functions (Swope 
et al. 2004): 


where ti, is the stationary probability to be in set S,: 

71/ = P [x(f) G Si] 

and Cij°'^(T) = KjTjj (z) is an unconditional transition 
probability. Suppose that p(f) G M" is a column vector 
whose elements denote the probability to be within 
a sety G {1, ..., 77 } at time r. After time z, the proba¬ 
bilities will have changed according to: 

pr(f+T) = p^(t)T(T) (2) 

The stationary probabilities of discrete states, tt,, 
yield the unique discrete stationary distribution of T: 

7r^ = 7i^T(T) (3) 

Clustering, Estimation and Statistics 

The transition probabilities T/j are usually estimated from 
molecular dynamics simulations. Suppose a trajectory 
x{t) is given. The simulation data is first discretized 
onto a microstate discretization (Sj ,..., S„). This is usu¬ 
ally done with clustering methods such as density-based 
clustering, k-medoids or k-means, using RMSD, Euclid¬ 
ean positions, or internal coordinates as metric (Voelz 
et al. 2010; Chodera et al. 2007; Prinz et al. 2011). 
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The count matrix C(t) is then defined by counting 
the number of transitions between discrete sets along 
the trajectory: 

Cy(T) = |{x(f) e Si, x(f+T) e S;,}|. (4) 

If multiple trajectories are available, then the count 
matrices of these trajectories are simply added up. 
Based on C(t), the transition matrix can be estimated 
with maximum likelihood by Prinz et al. (201 1): 


Provided that the trajectories x(f) are started from 
a local equilibrium within the set 5, that contains the 
starting structure x(0) (Prinz et al. 2011), the estimator 
Eq. 5 is asymptotically unbiased, that is, for a long 
enough trajectory, T(t) will converge to the correct 
transition matrix T(t). It is important to note that Ty as 
given by Eq. 5 does not necessarily fulfill the detailed 
balance equations: 7i,Ty = njTji, but generally 
7 t,Ty ^ HjTjj. This is a result of limited statistics and 
is usually accounted for by using a maximum likeli¬ 
hood estimator that makes sure that the detailed bal¬ 
ance equations are fulfilled (Prinz et al. 2011). 

Since simulation data is finite, all validation pro¬ 
cedures (either consistency checks or comparisons to 
experimental data) need to account for statistical 
uncertainties. Standard deviations or confidence inter¬ 
vals of the transition matrix elements and of properties 
computed from the transition matrix can be calculated 
from the count matrix C(t). See Singhal and Pande 
(2005), Noe (2008) for details (Pig. 1). 

Predicting Long-Term Kinetics from Short 
Simulations and the Systematic Error Caused by 
This 

Markov models are an approximation of molecular 
kinetics. The discretization of state space into sets 
{S],..S„) erases the information where exactly the 
continuous process x(t) was. As a result, the jump 
process on {Sj ,..., S„) is no longer Markovian even if 
x(f) was; nevertheless, it is approximated by a Markov 
chain. What are the consequences of this approxima¬ 
tion? The following two quantities are obtained from 
Markov models without systematic error: 

1. The propagation of transition probabilities by one 
step T, p^{t -I- t) = p^(f)T(T). 


2. Stationary properties, such as the stationary distri¬ 
bution n and associated expectation of state func¬ 
tions Eh (a) = {ti, a). 

However, state space discretization introduces sys¬ 
tematic error in the reproduction of long-time kinetics, 
that is, the prediction: 

P^{t+kT) « p^(f)T^(T), (6) 

is only approximately true. However, good approxi¬ 
mation of this equation is essential, because it repre¬ 
sents one of the main advantages of Markov models, 
namely, to predict long-time kinetics by using short 
trajectories of length order t. Based on rigorous theo¬ 
retical results (Sarich et al. 2010; Prinz et al. 2011), it is 
now known that the error of Eq. 6 decreases toward 
zero with increasingly fine discretization and increas¬ 
ingly long lagtime t. 

A practical way to test the quality of a specific state 
space discretization is the Chapman-Kolmogorow Test 
(Prinz et al. 2011), which tests the approximate valid¬ 
ity of Eq. 6. Eigure 2 shows the results of such a test for 
a two- and a six-state partition of a diffusion in 
a double well. The six-state partition clearly outper¬ 
forms the two-state partition. 

Eigenvalues and Eigenvectors 

The transition matrix T(t) can be written as a linear 
combination of its eigenvalues 1, and left eigenvectors 1,: 

P^(A:t) = 7t^ + ^ if (T)a,l/’. (7) 

1=2 

with coefficients a, that depend on the initial distribu¬ 
tion p(0). The first eigenvector is equal to the station¬ 
ary distribution 1; = ti and has the eigenvalue = 1. 
All other eigenvalues are smaller than one, hence 
lim^H^oo = TT. The terms with i > 2 indicate 

exponential relaxation processes with a timescale 
implied by the eigenvalues: 


Since the relaxation timescales ti are physical prop¬ 
erties of the dynamics, they should be invariant 
under change of the lag time t used to parametrize 
the transition matrix (Swope et al. 2004). For large 
enough t, t, should converge to their true value 
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Markov Models of Molecular Kinetics, Fig. 1 (a) Potential 
energy function with 100 microstates and four metastable sets 
and corresponding stationary probabilities tt,-. (b) Density plot of 
the transition matrix for a simple diffusion in the potential. The 
matrix is nearly block-diagonal, with the transition probability 
being large within blocks allowing rapid transitions within meta¬ 
stable basins, and small or nearly zero for transitions between 
different metastable basins, (c) Eigenvalues of the transition 


matrix. The gap between the four slow processes (/!,, Ri 1) and 
the fast processes is clearly visible, (d) The four dominant 
eigenvectors, r,, .. .r 4 , which indicate the associated dynamical 
processes. The first eigenvector is associated to the stationary 
process, the second to a transition between A-f B ^ C-f D and 
the third and fourth eigenfunction to transitions between 
A B and C D, respectively, (e) The left eigenvectors I;, 
..., I 4 (Figure adapted from Prinz et al. (201 1)) 


(assuming sufficient statistics). Therefore, the conver¬ 
gence of t, with increasing t has often been employed 
as an indicator for selecting t (Swope et al. 2004; 
Chodera et al. 2007; Prinz et al. 201 1) (see Fig. 3). 


The relevance of the eigenvectors is illustrated in 
Fig. Id, showing the four dominant eigenvectors for 
the diffusion in a four-well potential. The first eigen¬ 
vector corresponds to the stationary distribution. The 
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Markov Models of Molecular Kinetics, Fig. 2 Chapman- 
Kolmogorov-Test for MSMs of a diffusion in a double-well 
potential (a), (b, c) compare the probability of being in the left 
minimum over time, given that the dynamics starts in the left 
basin. The test was done for the two-well potential using 



a trajectory of length 10*’ steps. Tested are Markov models that 
use lag times t = 100, 500, 2000 and (b) 2-state discretization 
(split atx = 50), (c) 6-state discretization (split atx — 40,45,50, 
55, 60) (Figure adapted from Prinz et al. (2011)) 


second eigenvector corresponds to the slowest process 
and has positive signs in regions A and B and negative 
signs in regions C and D, thus corresponding to the 
transition between (A,B) and (C,D). The third eigen¬ 
vector corresponds to the transition between A and B, 
while the fourth corresponds to the transition between 
C and D. 

Metastable States 

Markov models from clustered molecular dynamics 
data often require thousands of microstates. It is 
thus desirable to find a simplihed representation that 
communicates the essential properties of the kinetics. 


Let us consider the coarse partition of state space 
Q. — I Cl, C 2 , ■ ■ C„] where each cluster C, contains 
multiple microstates S,. We are interested in hnding 
a clustering that is maximally metastable. In other 
words, each cluster C, should represent a set of struc¬ 
tures that the dynamics remain in for a long time before 
jumping to another cluster Cj. Thus, each cluster C, can 
be associated with a free energy basin. 

Schiitte and coworkers proposed that the metastable 
states could be identified by grouping microstates 
according to the signs in the dominant eigenvectors 
I 2,13 etc. (Schiitte et al. 1999; Weber 2003). Based on 
that, Weber (2003) developed PCCA-I-, an optimal 
method for identifying metastable sets. When plotting 
the values each microstate has in its m — I right 
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Markov Models of Molecular Kinetics, Fig. 3 Convergence 
of the slowest implied timescale t 2 = —t/ In X 2 {t) of the diffu¬ 
sion in a double-well potential depending on the MSM 
discretization. The metastable partition (black, solid) has greater 
error than the finer partitions {blue, green) (Figure adapted from 
Prinz et al. (201 1)) 

eigenvectors r 2 , r 3 , ..Tm, these values lie in 
a simplex whose vertices correspond to metastable 
states and the most metastable partition is found by 
assigning microstates to their closes vertices. See 
Fig. 4 for an illustration, and Weber (2003) for 
a more detailed description. 

Note that metastable states are very useful for illus¬ 
trative purposes. If the dynamics are very metastable, 
they may even serve as MSM microstates because then 
the dynamics loses memory before exiting to another 
metastable state, yielding an effectively Markovian 
partition de Groot et al. (2001). In general, this is not 
the case, and for quantitatively modeling the system 
kinetics, it is thus recommeded to maintain a fine 
discretization as the MSM discretization error will 
increase when states are lumped (see section “Long- 
Term Kinetics from Short Simulations”). 

Figure 5 shows metastable states of the folding 
dynamics of Pin WW Noe et al. (2009). 

Transition Pathways 

Understanding the mechanisms of conformational 
transitions, such as protein folding, RNA folding, 
native conformational transitions in proteins, or 


binding of ligands to proteins, is one of the grand 
challenges in biophysics. Let A and B be two subsets 
of state space (e.g., denatured and folded state), and 
let all remaining states be “intermediate” states I. What 
is the probability distribution of the trajectories leaving 
A and continuing on to B1 That is, what is the typical 
sequence of states I used along the transition path¬ 
ways? When an MSM is available, these questions 
can be answered by Transition Path Theory (TPT) 
(Weinan and vanden-Eijnden 2006; Metzner et al. 
2009; Noe et al. 2009). 

The TPT equations are derived for rate matrices in 
Metzner et al. (2009) and for transition matrices in Noe 
et al. (2009). The essential ingredient required to com¬ 
pute the statistics of transition pathways is the 
committor probability, qf ■ qf is the probability 
when being at state i, the system will reach the set 
B next rather than A (Du et al. 1998). The committor 
can be calculated from the equations: 

qi = Q IgA 

qi = I i G B 

^keiTikq^= —HkesTik i G I 

The backward-committor probability, q” is the 
probability, when being at state i, that the system was 
in set A previously rather than in B. For dynamics 
obeying detailed balance: 

9r=i -9,^- 

Consider the probability flux between two states 
i and j, given by niTij. TPT only considers trajectories 
that successfully move from A to B without recurring 
to A beforehand. The flux pertaining to these reactive 
trajectories only is given by multiplying the flux by the 
probability to come from A and to move on to B: 

fij = mTTijq+. 

Furthermore, contributions from recrossings or 
detours are removed. Thus, the net flux is defined by 
fij = va?iX {Q, fij — fji}. Considering detailed balance 
dynamics and when ordering states along the reaction 
coordinate qf such that q^ < qf, an equivalent 
expression is (Berezhkovskii et al. 2009): 

f;^ = 7liTij{q+-ql). 
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Markov Models of Molecular Kinetics, Fig. 4 Metastable 
states of the one-dimensional dynamics (see Fig. la) identified 
by PCCA-t. (a), (c), (e): Plot of the eigenvector elements of one, 
two, and three eigenvectors. The colors indicate groups of ele¬ 
ments (and thus conformational states) that are clustered 
together, (b), (d), (f): Clustering of conformation space into 


two, three, and four clusters, respectively. Each of these parti¬ 
tions is a valid selection in a hierarchy of possible decomposi¬ 
tions of the system dynamics. Moving down this hierarchy 
means that more states are being distinguished, revealing more 
structural details and smaller timescales (Figure adapted from 
Prinz et al., Phys. Chem. Chem. Phys. 13, p 16912 (2011)) 


fi]' defines a network of fluxes leaving states A and 
entering states B. It is similar to an electric network 
where the “voltage” ) across a “conductivity” 

TiiTij gives rise to a “current”/!^. The total flux created 
in A and consumed in 5 is: 

^ = E E ^‘Tijql = E E - <?,+). 

ieAj^A if^Bj&B 

Of special interest is the reaction rate constant Noe 
et al. (2009): 

kAB=F j ^ 71,'^7 ^ . 

The fluxes /+ can be coarse grained by summing 
fluxes crossing the boundaries of metastable states (see 
Fig. 5). This yields a simplified view on the transition 


investigated. The flux /+ can be decomposed into 
individual pathways (Metzner et al. 2009) and their 
relative contribution to the A ^ B process can be 
evaluated (Noe et al. 2009). As an example, the folding 
pathways for the Pin WW protein are shown in Fig. 5. 

Experimental Observables/Dynamical 
Fingerprints 

Biophysical experiments measure one or multiple 
observables a(x) which are functions of the high¬ 
dimensional macromolecular coordinates, a could be 
a fluorescence or transfer efficiency in a fluorescence 
experiment, an NMR chemical shift, the intensity of an 
IR spectral peak, the distance in a pulling experiment, 
etc. Let a, be the mean value of observable a over the 
state Si- Given the observable vector a = [a,], various 
experimental measurements can be expressed in terms 
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Markov Models of Molecular Kinetics, Fig. 5 Flux of the folding transitions among the metastable states of the PinWW protein 
(Figure adapted from Noe et al. (2009)) 


of the transition matrix T and linked to its eigenvalues 
and eigenvectors (Noe et al. 201 1). 

In stationary equilibrium experiments, the mean value 
of an observable tr, E^[a], is recorded. This may be either 
done my measuring E^i [a\ directly from an unperturbed 
ensemble of molecules, or by recording sufficiently many 
and long single molecule traces a{t) and averaging over 
them. The expected measured signal is: 


(=1 

where (x, y) denotes the scalar product between two 
vectors x and y. 

Kinetic information is available from time- 
correlation experiments. These may be realized by 
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computing time-correlation functions from single mol¬ 
ecule trajectories (e.g., fluorescence correlation spec¬ 
troscopy) or by scattering techniques (e.g., inelastic 
neutron scattering). The time cross-correlation of two 
observables a and b can be computed as: 

E [a{t) h(t+kx)] = EE 

1=1 f=i 

= (a,7t)(b,7t) -f^exp (a, 1) (b, b). 

( 10 ) 

For the autocorrelation of a\ 

E [a(r) a{t + ^t)] = (a, n)^ 

+ ^exp (a, 1,)' 

In relaxation experiments, the system is allowed to relax 
from a nonequilibrium starting state with probability 
distribution p(0). Examples are temperature-jump, pres¬ 
sure-jump, or pH-jump experiments, rapid mixing 
experiments, or experiments where measurement at 
r = 0 starts from a synchronized starting state, such as 
in processes that are started by an external trigger like 
a photoflash. After time f = 0 the conditions are 
governed by a transition matrix T(t) with stationary 
distribution ti ^ p(0). The ensemble average Ep( 0 )[a 
(t)] is recorded while the system relaxes from the initial 
distribution p(0) to the new equilibrium distribution tt: 

n 

Ep(o) [a{kz)\ = ^ aiPi{kx) 

/-I 

n / k \ 

= (a, Tt) +^exp (p'(0):b)(a:>/)- (H) 

where p\{0) = pi{Q)/ni. Both Eqs. 10 and 11 have the 
form of a multiexponential decay function with 
implied timescales of the transition matrix. Each 
timescale enters the observation with an amplitude 
that depends on the overlap between the 
corresponding eigenvector Eq. 9 and the observable 
(s), and in relaxation experiments also on the initial 
conditions of the experiment. Eor any given experi¬ 
mental observable, many amplitudes will be near 
zero; thus, even complicated kinetics may have the 


signature of two- or three-state systems in a single 
given kinetic experiment. 

The ability to link experimentally measurable 
relaxation timescales to individual eigenvalue/eigen¬ 
vector pairs allows structural processes to be 
assigned to these timescales via the eigenvector 
(see “Interpretation of Eigenvectors” above). This 
reconciliation of simulations and experiments is 
described in detail via the concept of dynamical fin¬ 
gerprints. Eurthermore, this approach permits to 
design experiments that are optimal to probe individ¬ 
ual relaxations (Noe et al. 2011). 

Summary 

Markov modeling is a theoretical framework suitable 
for analyzing molecular dynamics or any other sto¬ 
chastic process that is ergodic and Markovian in full 
state space. Markov (state) models (MSMs) approxi¬ 
mate the complex original dynamics by transition 
probabilities between discrete subsets of the possibly 
high-dimensional state space. In molecular dynamics, 
these subsets may correspond to molecular conforma¬ 
tions, rotamers, foldamers, or binding states. 
A sufficiently fine clustering in the MSM will retain 
the relevant details of the complex energy landscape, 
specifically the information which states are kineti- 
cally connected and which are not. This allows rela¬ 
tively detailed analyses such as using transition path 
theory to calculate the ensemble of transition pathways 
between two subsets of state space, or the assignment 
of structural processes to the kinetic features of exper¬ 
imental observables. 

It has been intensively debated whether it is gen¬ 
erally feasible to approximate the high-dimensional 
continuous dynamics of macromolecules by 
a discrete Markov process on relatively few (typi¬ 
cally 10-10^) discrete states. A number of theoret¬ 
ical developments between 2000 and 2010 have 
shown that this is indeed feasible if the system has 
relatively few slow relaxation processes, typically 
arising from the transitions between metastable 
states. This makes MSMs especially interesting 
to biological macromolecular processes, such as 
conformational changes, folding, binding, and 
oligomerization of peptides, proteins, and nucleic 
acids. Whether MSMs can also be practically 
useful to investigate processes with combinatorially 
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exploding state spaces, such as spin systems, 
remains a subject of ongoing research. 

Current challenges of Markov model methodology 
lie especially in the development of robust adaptive 
methods for discretization and sampling: (1) The basic 
mathematical relation between a state space 
discretization and the quality of an MSMs is now 
understood. The translation of the mathematical 
insight of discretization quality into a robust adaptive 
discretization algorithm is an important step toward 
efficient construction of MSMs for complex systems. 
(2) Enhanced sampling methods to explore the state 
space such as metadynamics and multi-ensemble 
methods are complementary to MSM modeling of the 
equilibrium dynamics. Consistently integrating these 
approaches is an important step toward efficient simu¬ 
lation (Sriraman et al. 2005). (3) It has been demon¬ 
strated on simulation models that the statistical 
uncertainties of the MSM transition matrix and quan¬ 
tities calculated from it can be used to allocate new 
simulations so as to speed up the convergence (Singhal 
and Pande 2005). These approaches need further 
development and are likely to significantly influence 
the molecular dynamics field. 

Cross-References 

► Molecular Dynamics Simulations of Lipids 
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Mass Action 

► Law of Mass Action 


Mass Analysis 


a fixed distance L from an ion source to a detector. 
The accelerating potential V at the source exit is pulsed 
in order to produce discrete packets of ions (usually 
with kinetic energy of several keV). The miz value is 
related to t hy Eq. 1 . 


t = L 



( 1 ) 


Neil J. Oldham 

School of Chemistry, The University of Nottingham, 
Nottingham, UK 

Synonyms 

Mass measurement 

Definition 

The analysis of ions hy measurement of their mass-to- 
charge ratio {mIz). 

Introduction 

Mass spectrometry is concerned with measurement of the 
mass-to-charge ratio (miz) of ions. In the case of bio¬ 
polymers, ions are most frequently generated by 
electospray ionization (ESI) or matrix-assisted laser 
desorption/ionization (MALDI). Measurement of miz is 
conducted in a region of the mass spectrometer known as 
the mass analyzer. There are numerous designs of mass 
analyzer, but all rely on the use of electric or magnetic 
fields to manipulate ions in a manner that leads to deter¬ 
mination of their miz. A comprehensive treatment of the 
principles of mass analysis, in the various analyzers 
known, is beyond the scope of this entry. Instead, 
a brief description of the methods most commonly used 
for the analysis of biomolecules is provided. Eor a more 
detailed account, the reader is directed to de Hoffmann 
and Stroobant (2002) and Watson and Sparkman (2007). 

Time-of-Flight (TOF) Analyzers 

Time-of-flight (TOE) analyzers (Wiley and McLaren 
1955) measure the time t taken for an ion to travel 


Measurement of t for a series of calibrated reference 
ions of known miz allows the miz of an unknown to be 
determined. The simplest TOE analyzer is composed 
of a linear flight tube (1-2 m in length) with the 
detector placed at one end. This design gives 
a resolving power (R) of approximately 1,000 (using 
the full-width half-maximum (EWHM) definition) and 
a theoretically unlimited miz range. A number of mod¬ 
ifications may be employed to increase resolving 
power: the most common being orthogonal accelera¬ 
tion (oa) and the use of a reflectron ion mirror. oaTOE 
analyzers sample a continuous ion beam using a high 
voltage pusher to send ion pulses at right angles down 
a flight tube. Because the continuous ion beam can be 
radially confined, orthogonal sampling results in ion 
packets with little dispersion in kinetic energy (KE). 
This leads to a significant increase in resolving power 
of the TOF. The addition of a reflectron further 
increases resolving power by focusing ions onto the 
detector, again reducing KE dispersion. An oa-reTOF 
analyzer, combining both technologies, increases 
resolving power over a linear TOF by a factor 
of >10. The use of multipass reflectrons can result in 
R values in excess of 20,000 EWHM. 

Quadrupole Mass Analyzers 

The quadrupole mass analyzer (or quadrupole mass 
filter) is composed of four parallel rods to which 
radio frequency (RE) and direct current (DC) fields 
are applied. Ions, accelerated into the region between 
the rods by the application of a low potential (ca. 10 V), 
experience a hyperbolic electric field as they travel 
through the quadrupole toward the detector. Only 
ions of a specific miz value have stable trajectories 
for a given combination of DC potential and RE ampli¬ 
tude. Thus, with suitable calibration, the quadrupole 
can be used to measure the miz of a sample ion by 
scanning these fields. Quadrupoles generally operate at 
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relatively low resolving powers (< 1,000) and have 
limited miz range (<4,000 for most commercial 
instruments). They are particularly powerful if used 
in combination with a TOP analyzer to produce 
a Q-TOF mass spectrometer. When operated in the 
RF-only mode, the quadrupole acts as an ion guide 
and transmits ions of all m/z, allowing an oaTOF to 
act as the mass analyzer. Flowever, when a DC voltage 
is applied, the quadrupole can be used to select precur¬ 
sor ions for MS/MS analysis (see below). 

Quadrupole Ion Trap Analyzers 

The quadrupole ion trap (QIT) was originally devel¬ 
oped by Paul in the 1950s (Paul and Steinwedel 1953) 
and made first available as commercial mass spectrom¬ 
eter In the 1980s. The 3D version is composed of a pair 
of domed end caps sandwiching a central ring elec¬ 
trode, which produces a hyperbolic cross section, not 
unlike that of a quadrupole mass analyzer. Also in 
common with a quadrupole, a combination of DC and 
RF fields is applied to the electrodes. However, due to 
the geometry of the device, this results in a trapping 
event (aided by a low-pressure buffer gas). Initially, 
ions of all mIz can be held in the trap, but as the RF 
amplitude applied to the ring electrode is increased, the 
trajectory of ions with successively higher miz values 
becomes unstable and they fly toward the end caps. 
Placing an opening in one end cap allows ions to leave 
the trap and strike an external detector. Calibration of 
the RF amplitude with ions of known miz allows a mass 
spectrum to be recorded. The QIT has several advan¬ 
tages over the quadrupole, including a much faster 
duty cycle (resulting in high sensitivity), and the ability 
to conduct MS/MS analysis by temporal separation of 
a selected precursor and subsequent fragmentation by 
collisions induced with the trap buffer gas (usually 
helium). Like the quadrupole, it has a rather limited 
effective miz range. 

More recently, the linear (or 2D) QIT has overtaken 
the 3D version in commercial popularity. The physical 
appearance of the linear QIT is rather like that of the 
quadrupole, but it includes end caps to allow axial 
trapping. The linear QIT has a higher sensitivity and 
capacity than the 3D QIT, making it more attractive in 
hybrid instruments, where the trap is placed in front of 
another mass analyzer, such as an orbitrap or ion 
cyclotron resonance analyzer. 


Orbitrap Analyzers 

The orbitrap is the latest design of mass analyzer to 
enjoy widespread use. It was developed by Alexander 
Makarov (2000), building on the Kingdon trap (1923). 
It consists of a pair of elongated cup-shaped outer 
electrodes containing a central core electrode. Ions 
injected into the evacuated space between the outer 
and core electrodes orbit the core due to a balance of 
electrostatic and centrifugal forces. Although m/z is 
related to frequency of rotation about the core, this 
property is not used in the orbitrap, as ion velocity 
distribution results in poor resolution. Instead, the rela¬ 
tionship between axial harmonic motion of the ions 
and m/z is exploited. Ions in the orbitrap experience 
a quadro-Iogarithmic field which causes them to oscil¬ 
late in the axial direction with a frequency which 
is related to m/z by Eq. 2, where k is field curvature. 

The axial frequency is independent of ion energy 
and position within the trap, meaning that ion kinetic 
energy dispersion has little or no effect on resolving 
power of the analyzer. Motion is detected by an image 
current generated as the ions oscillate axially from the 
left-hand to the right-hand outer electrodes. Fourier 
transform of the composite signal allows extraction 
of individual frequencies and therefore individual m/z 
values. The more oscillations detected, the higher the 
resolving power, with R values in excess of 200,000 
possible. This high-resolution capability has made the 
orbitrap an attractive option for mass analysis. Its 
performance is only surpassed by FT-ICR-MS, but 
comes without the need for a superconducting magnet. 

Ion Cyclotron Resonance Analyzers 

Ion cyclotron resonance (ICR) mass analyzers 
(Comisarow and Marshall 1996) utilize a strong, 
homogeneous magnetic field (usually supplied by 
a superconducting magnet) to trap ions in the x, 
y plane using the Lorentz force. Axial trapping 
(in the z direction) is achieved by the presence of 
a pair of end plate electrodes held at repulsive poten¬ 
tial, with the front plate gated to allow ion entry. The 
remainder of the ICR cell is composed of a pair of 
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transmitter and a pair of receiver plates. Ions trapped 
by the Lorentz force undergo cyclotron motion around 
the B field lines with angular frequency co^., which is 
related to their m/z hy Eq. 3. Thus, if RF oscillation of 
corresponding frequency is applied to the transmitter 
plates, a resonance condition is met and the ions absorb 
energy and move with an increased cyclotron radius. 
This brings them in proximity to the detector plates and 
results in an image current. 


m 

By applying a broadband RF pulse it is possible to 
bring ions of a wide miz range into resonance and 
utilize Fourier transform (FT) methods to extract indi¬ 
vidual frequencies and thus mIz values. This is com¬ 
monly known as FT-ICR-MS and is an extremely 
powerful method of mass analysis. It offers unparal¬ 
leled resolving power (R > 1,000,000) as well as the 
opportunity for MS/MS by collisional activation, laser 
irradiation, and electron capture (see below). Fike 
the orbitrap, the resolving power is proportional to 
the number of harmonic oscillations of the ions, so 
ultrahigh resolution measurements can take 10-20 s 
per scan. 

Tandem Mass Spectrometry (MS/MS) 

Tandem mass spectrometry (MS/MS) is the ability to 
isolate an ion from a mass spectrum and generate and 
analyze fragments of that species. It is often accom¬ 
plished hy placing one mass analyzer in front of 
another and introducing ion activation between the 
two. The first mass analyzer can he used to select the 
precursor ion and the second to analyze the product 
ions. Activation is most commonly achieved by 
collision-induced activation/dissociation (CIA/CID). 
A collision cell, containing a low-pressure gas (gener¬ 
ally argon), is housed between the first and second 
mass analyzers, and a voltage offset between the first 
analyzer exit and cell entrance is applied. The 
increased kinetic energy of ions accelerated into the 
cell is converted to internal energy by multiple colli¬ 
sions with the gas resulting in fragmentation. Q-TOF- 
type instruments commonly employ this approach. 
CID of peptide ions, for example, generally produces 
a series of N- and C-terminal fragments (usually b- and 


y-type) which can be used to derive sequence informa¬ 
tion for use in proteomics applications. 

In QIT mass analyzers it is possible to isolate 
a selected precursor ion by the application of special 
RF amplitude ramp. The RF amplitude is increased 
from high to low and then low to high miz leaving 
a gap at the appropriate frequency, which corresponds 
to the precursor miz. This has the effect of sweeping all 
ions out of the trap except for those of the desired miz. 
Activation of the precursor is achieved by applying 
a low-amplitude RF signal to the QIT end caps, at the 
resonant frequency of the precursor ion, which causes 
it to absorb energy and collide with the He buffer gas. 
CID gives rise to fragment ions which immediately fall 
out of resonance (due to their new miz values) and are 
cooled and held in the trap. A standard scan is then 
performed to determine the miz of the product ions. 
Due to the nature of this process, it is possible to 
perform multiple rounds of ion isolation and activation 
in a QIT leading to MS". 

CID can - in principle - be performed in an orbitrap 
or an FT-ICR cell, using resonance excitation. How¬ 
ever, the orbitrap suffers from relatively inefficient 
trapping of product ions, and both analyzers require 
high vacuum for optimum performance, which is 
compromised by injection of a collision gas. One solu¬ 
tion is to produce an orbitrap, or FT-ICR, hybrid with 
a quadrupole or QIT. CID can then be performed 
outside the orbitrap or ICR cell. Alternatively, other 
methods of ion activation can be employed for MS/MS 
applications. 

Although CID is the method most commonly used 
in MS/MS, precursor ion activation can be achieved by 
other means. Infrared multiphoton dissociation 
(IRMPD), for example, uses IR laser irradiation to 
induce fragmentation of (usually) trapped ions. Acti¬ 
vation occurs in a stepwise manner, by the absorption 
of multiple photons, and is therefore thermal in nature, 
producing fragmentation similar to that seen for CID. 
The technique is often utilized in combination with 
FT-ICR-MS as an alternative to CID. Electron capture 
and electron transfer dissociation (ECD and ETD) are 
relatively new methods of ion activation. Here, elec¬ 
trons are captured by multiply charged analyte cations 
either directly or by transfer from a radical anion 
reagent. In both cases, the result is an analyte radical 
cation, possessing less charge than the original precur¬ 
sor. The high energy released by neutralization of 
a charge induces rapid fragmentation in a nonergodic 
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manner. When applied to multiply charged peptide 
ions, ECD and ETD usually result in the production 
of c- and z-type ions. A nonergodic fragmentation 
mechanism means that ECD and ETD can lead to 
cleavage of strong bonds in the presence of weak 
bonds - a feature that can be exploited in mapping 
protein ► phosphorylation and noncovalent 
interactions. 


Summary 

Numerous designs of mass analyzers exist, with 
a range of resolving powers, duty cycle times, miz 
ranges, and cost. For biomolecular analysis, Q-TOFs 
and QITs offer broadly similar performance for pep¬ 
tide analysis. Orbitraps and FT-ICR-MSs provide 
ultrahigh resolving power and mass accuracy. TOF- 
based instruments have the advantage of near limitless 
m/z range, which is particularly important for studying 
intact protein-protein complexes by MS. Hybrid 
instruments, composed of two or more analyzer 
types, can incorporate the advantages of each method 
of mass analysis. 
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Mass Spectrometry in Proteomics 
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Nottingham, Nottingham, UK 

Introduction 

Proteomics, the study of the proteome, is 
a postgenomic discipline. The word proteome, which 
is an amalgamation of “protein” and “genome,” was 
first introduced by Marc Wilkins in the mid-1990s 
(Wilkins et al. 1996). The proteome refers to the com¬ 
plete profile of proteins expressed in a tissue, cell, or 
biological system at a given time. Other definitions of 
proteomics include: the study of all the proteins 
expressed in a cell, the analysis of proteins and their 
functions, large-scale analysis of the structure and 
function of proteins as well as protein-protein interac¬ 
tions, and the systematic analysis of the protein expres¬ 
sion of healthy and diseased tissues. Essentially, 
proteomics is the study of multiprotein systems 
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where the main purpose is the characterization of the 
behavior of the system as opposed to the behavior of 
single components. 

A wealth of information can be provided by prote¬ 
omics data, since proteins have structure, sequence, 
biological, and physiological functions and interact 
with other molecules. The challenge of proteomics 
lies in the complexity of the proteome, which unlike 
the genome, is a dynamic system that constantly 
changes. The vast number of proteins involved is prob¬ 
lematic as proteins may be present in many forms due 
to posttranslational modifications. The vastly different 
concentrations over which proteins are expressed 
(<1 pg mL~* to > mg mL“') add to the challenge. 

The aims of proteomic research may be classified 
into four main categories: Protein mining is the iden¬ 
tification of as many proteins as possible in a sample. 
Protein expression profiling is the comparison of spe¬ 
cific proteins/peptides in healthy and diseased states. 
Protein network mapping is the identification of 
interacting proteins, and protein modification mapping 
is the identification of protein posttranslational modi¬ 
fications. A combination of advances in mass spec¬ 
trometry, protein chemistry, protein separation 
techniques, and bioinformatics has enabled proteomics 
to become an established scientific field. 

Protein Identification 

Mass spectrometry is particularly suited to the identi¬ 
fication of proteins due to its high sensitivity, low 
sample volume requirements, and the ability to provide 
accurate mass measurements and sequencing informa¬ 
tion. The most common mass spectrometric platforms 
for proteomics include MALDI-TOF MS, ESTQ-TOF 
MS, and ESTion trap MS. There are two main 
approaches to protein identification, namely, hottom- 
up and top-down. 

The Top-Down Approach 

In the top-down method, intact protein ions are intro¬ 
duced into the mass spectrometer and subjected to gas 
phase dissociation. This technique was first demon¬ 
strated for ESI generated ribonuclease A ions, using 
a triple quadrupole instrument (Loo et al. 1990). Accu¬ 
rate mass measurement of the whole protein and of the 
resulting fragment ions allows identification of the 
protein. As the resulting product ion spectra consists 


of multiply charged ions this can lead to problems 
determining the masses of the fragments. For this rea¬ 
son top-down proteomics is usually carried out using 
an FT-ICR mass spectrometer which, due to its high 
resolving power, is capable of determining product ion 
charge state. FT-ICR instruments are also able to carry 
out electron capture dissociation (ECD), the preferred 
technique for fragmentation of intact proteins 
(Zubarev et al. 1998). ECD generates extensive frag¬ 
mentation, with more random peptide backbone cleav¬ 
age than that achieved through collisionally induced 
dissociation (CID), or infrared multiphoton dissocia¬ 
tion (IRMPD). ECD is also useful for preserving post¬ 
translational modifications allowing them to be 
mapped. To date top-down proteomics is not as widely 
used as bottom-up techniques. The main reason for this 
is that the mass analyzers most suited to this type of 
analysis (FT-ICR MS, or hybrid ion trap-orbitrap) are 
expensive to purchase. Also, due to the complexity of 
data generated from multiply charged proteins, the 
technique is generally limited to the analysis of single 
proteins or relatively simple mixtures. Until recently, 
large proteins could not be easily dissociated in the gas 
phase. Han et al. reported the use of “prefolding disso¬ 
ciation” to analyze a glycoprotein in excess of 200 kDa 
(Han et al. 2006). Karabacak et al. extended this upper 
mass limit to 669 kDa (Karabacak et al. 2009) 

There are several benefits to the top-down 
approach, namely, as MS/MS is carried out on the 
intact protein the entire sequence is available allowing 
complete sequencing of the protein and any associated 
modifications. There is also no need for lengthy diges¬ 
tion methods, and molecular connectivity is retained 
enabling structural characterization. 

The Bottom-Up Approach 

Protein identification can also be carried out using the 
bottom-up techniques. Protein mixtures can be either 
pre-separated prior to proteolytic digestion or digested 
followed by separation of the resulting peptides. In 
both cases, the peptides are analyzed by mass spec¬ 
trometry. Peptide mass fingerprinting (PMF) is the 
most commonly used strategy. This technique was 
simultaneously described by several groups in 1993 
(Pappin et al. 1993; Yates et al. 1993; lames et al. 
1993). Typically, the protein mixture is separated 
using ID or 2D sodium dodecyl sulfate polyacryl¬ 
amide gel electrophoresis (SDS-PAGE). The protein 
spots of interest are excised and subjected to in-gel 
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enzymatic digestion usually with trypsin. The resulting 
peptides create a unique fingerprint (Patterson and 
Aebersold 1995) and the masses of these peptides are 
determined using mass spectrometry. PMF database 
searches compare the experimental peptide masses 
with calculated peptide masses resulting from the theo¬ 
retical digestion of protein sequences in databases. This 
results in a number of possible proteins that the PMF 
peptides may be derived from. Scoring algorithms are 
applied to rank these proteins and to indicate the most 
significant match. Incomplete protein separation is the 
main drawback to the PMF approach. As this leads to 
peptide fragments from two more proteins in the mass 
spectrum, it can result in false-positive protein identifi¬ 
cations. To overcome this, tryptic peptides of interest 
may be isolated and fragmented by tandem mass spec¬ 
trometry (MS/MS). The resulting b and y fragment ion 
masses are correlated with theoretical peptide sequences 
using MS/MS protein identification algorithms. The 
main advantages of utilizing MS/MS analysis are 
increased confidence of protein identihcation, detection 
of possible posttranslational modifications, and the abil¬ 
ity to analyze complex protein mixtures. 

Shotgun proteomics is a bottom-up method which is 
used for the analysis of these complex protein mixtures 
(Link et al. 1999). In this approach, the proteolytic 
digestion is carried out without prior protein separation 
and the resulting peptides are separated by liquid chro¬ 
matography followed by MS/MS analysis. The main 
benefit of shotgun proteomics is that a vast number of 
proteins can be identihed in a single analysis. How¬ 
ever, as the complexity of the sample increases, incom¬ 
plete separation of all the peptides may occur and low 
abundant peptides may be suppressed. This usually 
leads to a decrease in the sequence coverage 
obtained due to a decreased ability to identify peptides 
for MS/MS analysis during data-dependent acquisi¬ 
tion. Shotgun proteomics can also result in the acqui¬ 
sition of product ion spectra that lack sufficient ions for 
protein identihcation. Although the identihcation of 
a single tryptic peptide can be sufficient to unambigu¬ 
ously identify the protein, providing the peptide is 
unique to that protein. 

Multidimensional protein identihcation technology 
(MudPIT) is a shotgun proteomics technique (Washburn 
et al. 2001). This method uses a biphasic column to 
separate peptides prior to online tandem mass spectro- 
metric analysis. The column consists of fused capillary 
packed with a strong cation exchange (SCX) resin in the 


proximal end and reversed phase resin in the distal end. 
MudPIT is a cyclic procedure with fractions eluted step¬ 
wise from the SCX phase into the reversed phase using 
increasing salt concentrations. A hydrophobic gradient is 
used to elute peptides from this second phase directly into 
the mass spectrometer and the process is repeated. 
Extremely complex mixtures can be separated by 
increasing the number of gradients. MudPIT is able to 
detect and identify a wide range of proteins including 
those with extremes in molecular weight, isoelectric 
point, abundance, and hydrophobicity. The method also 
has the advantage of being automated. 

Middle-Down 

Middle-down proteomics is an amalgamation of top- 
down and bottom-up proteomics whereby the protein 
is subjected to limited proteolysis, usually using Lys-C 
or CNBr, to produce polypeptides in the range 
3-20 kDa (Forbes et al. 2001; Wu et al. 2006). Exact 
mass measurement of the resulting large peptides can 
be determined followed by MS/MS analyses to gain 
sequence information. As longer peptides carry higher 
charge this enhances fragmentation resulting in 
improved sequence coverage compared with bottom- 
up proteomics. Other advantages of the middle-down 
approach include the ability to be coupled with online 
chromatographic separation and retention of posttrans¬ 
lational modifications. 

Characterization of Posttranslational 
Modifications 

The ability to provide accurate mass measurement 
along with sequencing capabilities has made mass 
spectrometry an essential tool for the identification 
and site mapping of posttranslational modifications 
(PTMs). When a protein is modified, the molecular 
weight of the protein differs from that of the 
unmodified sequence by the mass of the modification. 
Therefore, PTMs can be detected by obtaining an 
accurate mass measurement for the intact protein and 
comparing this with the theoretical mass of the native 
protein sequence. Subsequent tandem mass spectrom¬ 
etry (MS/MS) of the modified peptides enables the 
modified amino acid residue to be identified. Alterna¬ 
tively, a top-down approach can be used where the 
intact protein is fragmented in order to map the 
modification. 
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Phosphorylation 

Phosphorylation is one of the most common and 
important forms of PTM. The study of phosphopro- 
teins is of particular relevance as they are involved in 
many cellular processes including cell regulation and 
signaling. Phosphorylation causes a mass increase of 
80 Da per phosphorylation site. This mass increase can 
be detected by mass spectrometry and the number of 
modihcation sites can also be determined. However, 
due to the low abundance of most phosphoproteins, 
samples need to be enriched prior to MS analysis. This 
is usually carried out using immunoaffinity chroma¬ 
tography or immobilized metal affinity chromatogra¬ 
phy (IMAC) (Porath 1992). 

Phosphopeptide mapping can be carried out in order 
to detect the location of the modification. The phos- 
phoprotein is enzymatically digested and 
phosphopeptides can be identified by the increase 
caused by the addition of the phosphate. Confirmation 
of the identification of the phosphopeptide can be 
carried out through the addition of phosphatase. This 
cleaves the phosphate group resulting in a mass 
decrease of 80 Da which can be detected by comparing 
mass spectra before and after treatment. CID MS/MS 
in negative ion mode can be used to produce phos¬ 
phate-specific fragment ions, such as H 2 P 04 ~, P 03 ~, 
and P 02 ~ with miz values of 97, 79, and 63, respec¬ 
tively. The presence of these diagnostic ions indicates 
protein phosphorylation. Neutral loss scan mode can 
also be used since the phosphoester bond fragments 
leading to the loss of HPO 3 {m/z 80) or H 3 PO 4 (m/z 98) 
as neutral species with limited protein backbone frag¬ 
mentation. The neutral loss product ion can then be 
isolated and fragmented in positive ion mode to yield 
sequencing information. Extensive fragmentation can 
be obtained by using ECD to cleave the polypeptide 
backbone. This has the added advantage that PTMs are 
preserved allowing the exact sites of phosphorylation 
to be mapped. Figure 1 shows CAD and ECD product 
ion spectra for the fragmentation of a phosphopeptide 
(Stingl et al. 2006). ECD provides unambiguous 
assignment of the phosphate groups to SerlO and 
Serl3. 

Glycosylation 

Glycosylation, the attachment of oligiosaccharides to 
a protein, is a common but complex PTM. The com¬ 
plexity arises from the branching nature of glycans, 
which leads to a vast number of isoforms. 


Glycoprotein analysis is extremely challenging since 
it is desirable to have information on peptide 
sequence, saccharide composition, and modification 
sites. Mass spectrometry can be used to gain insights 
into all of these. Carrying out MS/MS at increasing 
CID energies enables the protein and carbohydrate 
sequences to be determined. At low collision ener¬ 
gies, the sequence of the sugars in the glycan moiety 
can be established. At higher collision energies, pep¬ 
tide backbone fragmentation allows protein sequenc¬ 
ing information to be obtained. Further increase of the 
CID energy leads to cross-ring cleavage, which pro¬ 
vides structural information of the oligiosaccharide 
(North et al. 2009). When glycoproteins fragment 
they generate oxonium ions, such as Hex"^ (m/z 163), 
HexNAc-" (m/z 204), HexHexNAc-" (m/z 366), 
NeuNAc"^ (m/z 292). These marker ions can also be 
used to confirm the presence of glycosylation. The 
glycosylation site can be detected using peptide map¬ 
ping. Samples are analyzed before and after the 
removal of the carbohydrates from the glycoprotein 
by treatment with deglycosylation agents. The modi¬ 
fied and unmodified proteins are tryptically digested 
and subjected to mass spectrometric analysis. Com¬ 
parison of the two data sets leads to the detection of 
glycosylated peptides. Like phosphorylation, glyco¬ 
sylation is a labile PTM and therefore site mapping is 
generally carried out using ECD. 

Other Posttranslational Modifications 

Mass spectrometry can detect many PTMs, includ¬ 
ing acetylation, methylation, and oxidation which 
cause mass increases of 42 Da, 14 Da, and 16 Da 
respectively. Comprehensive lists of mass differ¬ 
ences due to various modifications can be found in 
the literature (Wilkins et al. 1999; Seo and Lee 
2004). MS/MS analysis enables the modification 
site to be identified. Ubiquitination can be detected 
by the presence of a Gly-Gly tag (m/z 114), which 
remains attached to the lysine residue of the target 
protein following tryptic digestion (Peng et al. 
2003). The detection of sumoylation is more chal¬ 
lenging not only because of the absence of arginine 
or lysine residues in the carboxyl terminus but also 
because SUMO undergoes fragmentation along with 
the target protein. Sequencing software has been 
developed to help overcome these problems for the 
identification of sumoylation sites (Pedrioli et al. 
2006). 
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Mass Spectrometry in 
Proteomics, Fig. 1 CAD (a) 

and ECD (b) product ion 
spectra for the fragmentation 
of the triply charged precursor 
ion [M + at mh 625.2 of 
the doubly phosphorylated 
peptide C. lOP (Reprinted from 
reference Stingl et al. 2006) 




m/z 


Quantification 

Strategies for quantification of proteins in biological 
samples have progressed vastly in the last 20 years - 
from the use of fast atom bombardment mass 
spectrometry and labeling synthetic peptides with 
deuterium to labeling with specific reagent tags and 
isotopically labeled essential amino acids. Advances in 
quantification technologies now allow the relative and 
absolute amounts of multiple proteins in complex bio¬ 
logical samples to be determined. The ability to quan¬ 
tify changes in the abundance of proteins between 
samples results in precise measurement of cellular 


state at the molecular level. Quantification of pro¬ 
teins/peptides using mass spectrometry is typically 
carried out with the use of isotope tagging or, synthetic 
reference peptides, as internal standards. 

SILAC 

Stable isotope labeling with amino acids in cell culture 
(SILAC) is a relatively new technique for protein 
quantification (Ong et al. 2002). It builds upon previ¬ 
ous work where the use of stable isotope-enriched 
media for bacterial and yeast growth was described. 
Ong et al. were the first to describe the application of 
the technique to study differential protein expression. 
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SILAC is dependent upon the metabolic uptake of 
essential amino acids. This method of quantification 
is useful when trying to observe differences between 
two populations. One cell culture is labeled with stable 
isotopic nuclei, such as or whilst the other 
(treatment or control) is unlabeled. As the cells grow in 
the culture media, the heavy isotope gradually 
becomes incorporated into all newly synthesized pro¬ 
teins. Mixing the labeled and unlabeled cell cultures 
together, digesting and analyzing by mass spectrome¬ 
try using bottom-up approaches allows relative quan¬ 
tification of the protein to be carried out. 

The advantages of SILAC are that the labeled pro¬ 
tein will behave in much the same manner as the 
unlabeled native protein and the label is applied as 
cells are grown in culture, which means all proteins 
become fully labeled (some chemical modihcations do 
not give this level of labeling efficiency). As cells have 
to be grown in media, this means that SILAC is 
unsuitable for tissue samples. It is important to select 
a labeled amino acid that does not undergo metabolism 
which could lead to isotope scrambling. The use of 
arginine, leucine, lysine, serine, methionine, and tyro¬ 
sine has been described by numerous groups within 
published literature. Arginine and lysine are particu¬ 
larly useful in work where proteins of interest will be 
digested with trypsin, as tryptic peptides will contain 
one of these residues in the C-terminal position. 

iTRAQ™ 

iTRAQT^* reagents are a set of isobaric tags that are 
used to differentially label and quantify up to 
either four or eight different samples simultaneously 
(Ross et al. 2004). iTRAQ™ tags react with the protein 
N-terminus and Lys side chains. They have many 
applications in gathering quantitative information - 
from a labeling method for use with specific bio¬ 
markers, to monitoring protein-protein interactions, 
to an area as specialized as drug-induced protein 
expression (Zieske 2006). 

iTRAQTM is similar in experimental design to the 
sulfhydryl-labeling technique known as ICAT, but 
iTRAQT^* labels the majority of tryptic peptides, 
whereas ICAT only label peptides with a Cys residue 
(Thelen and Peck 2007). These specific mass tags label 
at the N-termini and lysine side chains of peptides in 
a digest mixture. A reactive ester group binds to free 
primary amines in the side chains of lysine. Due to the 
overall isobaric nature of the iTRAQ™ tag, labeled 


peptides appear as a single precursor ion peak in 
the MS. Upon MS/MS analysis, the tags report the 
low-mass ions that are used for quantitation. Succes¬ 
sive reporter groups increase by 1 Da, with signature 
ions at miz 114.1, 115.1, 116.1, and 117.1 for the 
4-plex reagent. Additional ions at mIz 118.1, 119.1, 
120 .1, and 121.1 are observed for the 8-plex reagent. 
Quantitation is calculated by measuring the relative 
areas of these peaks. Because of the specific nature 
of the amine labeling, it allows peptides that are in 
complex signaling pathways, involving phosphoryla¬ 
tion, to be labeled, thus investigating the complex 
nature of PTMs. 

AQUA 

AQUA peptides are synthetic peptides intended to be 
internal standards for absolute quantihcation. They con¬ 
tain a heavy amino acid at one of more positions and are 
designed to behave in the same manner as native pep¬ 
tides (Gerber et al. 2003). If there are multiple proteins of 
interest multiple peptides can be synthesized and used to 
determine absolute levels of each protein. Covalent 
modifications such as phosphorylation, methylation, 
etc., can be considered in peptide design, allowing the 
peptide to be chemically identical to any naturally occur¬ 
ring posttranslationally modified peptides. 

The method is split into two stages, the first being the 
design and synthesis of a peptide from the protein of 
interest. It is at this point where the stable isotopes are 
incorporated at a single amino acid residue. The frag¬ 
mentation pattern of this internal standard is analyzed 
by MS/MS and the transition of a single precursor-to- 
product ion is chosen for selective reaction monitoring 
(SRM). The second stage is the addition of this peptide 
to the sample at a known concentration and volume, 
prior to digestion with trypsin. Using LC-SRM-MS, the 
presence of both the fragment ions from the native and 
AQUA peptide can be measured. The relative values of 
ion abundance allow absolute quantification. 

Label-Free Quantitation 

Label-free quantitation (Old et al. 2005) can be 
achieved in two ways: one method uses peak integra¬ 
tion to compare the signal intensity of peptide ions 
from an MS scan with data from multiple liquid chro- 
matography-mass spectrometry runs; the other uses 
spectral counting, comparing the number of MS/MS 
spectra assigned to an individual protein. Liu et al. (Liu 
et al. 2004) found that standard proteins added to yeast 
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extracts, quantified by this method, resulted in linearity 
over two orders of magnitude and showed good corre¬ 
lation to the relative protein concentration. 

Label-free quantitation relies on software correctly 
to match peptides that elute from a LC gradient. In 
simplistic terms, label-free quantitation aligns separate 
LC-MS/MS runs of peptide mixtures and calculates the 
difference in intensities of the same peptide found in 
each run. This label-free approach may not quite have 
the same accuracy as methods using stable isotopes, 
but it could be considered simpler (in experimental 
design) and overcomes labeling problems with certain 
cells types, for example, tissue samples and cells that 
cannot be cultured. 

Biomarker Detection and Identification 

Information provided through analysis of the proteome 
is important in the identification and screening of diag¬ 
nostic markers, known as biomarkers. These markers 
are often low molecular weight proteins secreted into 
the blood stream, or other fluids as a result of 
a biological or disease process. Biomarkers not only 
help to enhance the understanding of disease states but 
allow early diagnosis, therapeutic and remission mon¬ 
itoring of a wide variety of conditions. Mass spectrom¬ 
etry has been successfully applied to the discovery of 
many disease-specific biomarkers, ranging from those 
indicative of a wide variety of cancers through to 
neurological conditions (Pavlou and Diamandis 2010; 
Kroksveen et al. 201 1). 

Typically, protein expression profiling is carried out 
to identify patterns in the mass spectral data which are 
capable of discriminating between control and disease 
state samples (Petricoin et al. 2002). SELDI and 
MALDI-TOF are two of the most widely used 
approaches for initial screening studies. Bioinformat¬ 
ics is subsequently employed to identify the optimal 
subset of predictive biomarkers indicative of 
a particular state. The successful identification of 
these targeted biomarkers can be determined by bot- 
tom-up or top-down proteomic strategies. 

Summary 

Mass spectrometry is a key technology in accelerating 
the field of proteomics. This entry highlights some of 


the key aspects in which MS has made significant 
contributions. This includes unambiguous protein 
identification, a variety of quantification techniques, 
preservation and identification of posttranslational 
modification sites, as well as facilitating biomarker 
discovery programs. 

Cross-References 

► Mass Spectrometry of A-Linked Carbohydrates and 
Glycoproteins 

► Mass Spectrometry: Mapping Large Stable Protein 
Complexes 

► Mass Spectrometry: Methods of Ionization (Applied 
to Biopolymers) 

► Post-Translational Modifications 

► Post-Translational Modification of Histone Proteins 

► Protein-Protein Interactions Studied by Mass 
Spectrometry 
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Mass Spectrometry of ^/-Linked 
Carbohydrates and Glycoproteins 

David John Harvey 

Department of Biochemistry, Oxford Glycobiology 
Institute, Oxford, UK 

Synonyms 

N-Glycan analysis 


Definition 

Use of various types of mass spectrometry to deter¬ 
mine the detailed structure of the carbohydrates 
attached to asparagine in glycoproteins. 

Introduction 

Ai-linked glycans are those attached to proteins with an 
amide bond to asparagine in an Asp-Xxx-Ser 
(or -Thr or, occasionally -Cys) motif where Xxx is 
any amino acid except proline. Their structures are 
generally well defined (Varki et al. 2008), unlike the 
O-linked glycans (those attached directly to serine or 
threonine) which, although usually smaller, are more 
difficult to analyze. Typical structures of A-glycans are 
shown in Fig. 1. All have a common trimannosyl 
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Mass Spectrometry of Af-Linked Carbohydrates and Gly¬ 
coproteins, Fig. 1 Biosynthesis of W-glycans (reaction path¬ 
ways are illustrative and do not necessarily imply the occurrence 
of specific enzymes except where specified). Key to symbols 
used to draw the M-glycan structures: ■ = GlcNAc, 
O = mannose, <3> = fucose, 9 “ galactose, □ = glucose, 
□ = sialic acid. The angle of the lines connecting the symbols 
shows linkage with full lines indicating P-bonds and broken lines 
showing ot-bonds. For further details see (Harvey et al. 2009). 
The first stage in the biosynthesis of W-glycans is the attachment 
of the tetradecyl glycan (I) to the developing protein followed by 
removal of the glucose residues by two enzymes, glucosidases 
I and II give the “high-mamrose” glycan known as “Man-9,” II. 
Following this step, mannosidases trim the outer four mannose 
residues to give Man-5 CVI) via III, IV, and V. Man-5 is 
a substrate for the enzyme GlcNAc-transferase I which then 
adds a GlcNAc residue to the 3-antenna to give VII which is 
then a substrate for another exomannosidase which trims the 


remaining two outer mannose residues from the 6-antenna to 
give VIII. This glycan is a substrate for GlcNAc-transferase 2 
which adds a second GlcNAc residue to the 2-position of the 
remaining mannose of the 6-antenna (IX). This compound is the 
base for biosynthesis of the “complex” glycans of which only 
a few are shown in the figure. The scheme shows addition of two 
galactose residues to give the “biantennary” complex glycan, X 
and addition of a third antenna (in several stages) to give the 
“triantennary” glycan XIV and a fourth to give the tetra- 
antennary glycan, XIX. A bisecting GlcNAc residue can be 
added (XV) and fucose can be added at the 6-position of the 
reducing-terminal GlcNAc to give compounds such as XI ,XII, 
and XVIII. Antennae are frequently terminated with sialic acid 
(XIII, XVI, XVII, XIX, and XX). As an alternative to the 
cleavage of mannose residues from VII, the 3-antenna can be 
built up as in the complex glycans to give “hybrid” glycans. 
Additional information can be found in standard glycobiology 
textbooks, for example (Varki et al. 2008) 
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chitobiose (Man 3 GlcNAc 2 ) core with several attached 
antennae, and their analysis therefore requires the 
identification of factors such as: the type of glycan 
(high-mannose, hybrid, or complex [see Fig. 1]); the 
number and composition of each antenna; number and 
location of fucose residues; presence or absence of 
a bisecting GlcNAc residue (pi ^ 4-linked to the 
core mannose residue); and the presence of any further 
substitution. Flowever, because of the branched nature 
of these compounds, their structural characterization 
has not yet achieved the sophistication of that of the 
protein moiety of glycoproteins whose primary struc¬ 
ture consists only of a linear chain of amino acids. This 
entry describes the mass spectrometric methods that 
are most frequently used for the structural determina¬ 
tion of these compounds. 

Mass Spectrometry of Intact Glycoproteins 
and Glycopeptides 

Glycoproteins can contain up to several tens of 
glycosylation sites, each of which can be variously 
occupied from 0% to 100% and contain anything 
from 1 to over 100 different structures. The total 
number of glycoforms (individual molecular species) 
can, therefore, be very large making their resolution by 
mass spectrometry difficult or impossible. Neverthe¬ 
less, glycoforms of relatively small glycoproteins such 
as ribonuclease B (15 kDa) that have a single site 
occupied by a limited set of glycans can be resolved 
reasonably well and their masses determined 
accurately. If the mass of the protein is known from 
its sequence, and the protein contains a single occupied 
A-link site and no additional posttranslational 
modifications, the glycan masses can be obtained by 
subtraction of the protein mass from the experimen¬ 
tally determined mass. The glycan compositions, in 
terms of the constituent isobaric monosaccharides, 
can then be calculated with reasonable accuracy 
because A-linked glycans contain only a limited num¬ 
ber of generic monosaccharides such as hexose 
(mannose, glucose, galactose) A-acetylaminohexose, 
deoxyhexose (fucose), and sialic acid. Combinations 
of the masses of these monosaccharides give unique 
molecular weights for most combinations in the mass 
range of the common A-linked glycans. Table 1 lists 


Mass Spectrometry of Af-Linked Carbohydrates and Gly¬ 
coproteins, Table 1 Residue masses of common monosaccha¬ 
rides found in M-glycans 


Monosaccharide 

Residue formula 

Residue mass“ 

Pentose 

C 5 H 8 O 4 

132.042 

132.116 

Deoxyhexose 

C 6 H 10 O 4 

146.078 

146.143 

Hexose 

CeHioOs 

162.053 

162.142 

Hexosamine 

QH„N04 

161.069 

161.157 

HexNAc 

C 8 H 13 NO 5 

203.079 

203.179 

Hexuronic acid 

C 6 H 8 O 5 

176.032 

176.126 

iV-acetyl-neuraminic acid 

CuH^NOg 

291.095 

291.258 

A-glycoyl-neuraminic acid 

CuHnNO, 

307.090 

307.257 


“Top figure = monoisotopic mass (based on C = 12.000000, 
H = 1.007825, N = 14.003074, O = 15.994915), Lower 
figure = average mass (based on C = 12.011, H = 1.00794, 
N = 14.0067, O = 15.9994) 

the masses of the common monosaccharide 
constituents. For more complex glycoproteins, where 
resolution of the glycoforms cannot be achieved, the 
mass spectrum only indicates the presence of 
posttranslational modifications, such as glycosylation, 
by the presence of broad unresolved peaks with 
a mass higher than that predicted from the amino 
acid sequence. To obtain the glycan composition in 
these cases, it is necessary to reduce the mass by, for 
example, digesting with trypsin. 

The masses of the intact glycans can be obtained by 
addition of the residue masses given above, the mass of 
the terminal group (H 2 O for an unmodified glycan) 
18.011 (monoisotopic) and 18.152 (average), and the 
mass of any reducing-terminal or other derivative. 

Determination of Site Occupancy 

Site occupancy refers to the percent of each glycosyl¬ 
ation site that is occupied and the glycans that are 
attached at that site. To obtain this information, the 
glycoprotein is digested to glycopeptides, commonly 
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by trypsin, with the aim of isolating each glycosylation 
site to an individual glycopeptide. These glycopeptides 
are then commonly separated by HPLC and analyzed 
by online electrospray ionization mass spectrometry 
(ESI-MS) or off-line matrix-assisted laser desorp¬ 
tion/ionization time-of-flight (MALDI-TOF) MS. 
Glycopeptides, however, often give weak signals 
in the presence of peptides. Several methods have 
been developed to overcome this problem. In one 
such method (Carr et al. 1990), high-energy frag¬ 
mentation is used to produce fragment ions that are 
characteristic of the glycan (e.g., miz 163 (hexose), 
204 (HexNAc), and 366 (Hex-HexNAc)) and the 
presence of these ions is then mapped against the 
profile of the glycopeptides. 

Release of Carbohydrates 

Although detailed structural analysis of the glycan can be 
performed on the glycopeptide, it is more common to 
remove the peptide and perform the analysis on the 
released glycan. Such structural determinations often 
entail monitoring sugar profiles by chromatographic 
techniques that require labeling the glycan with 
a chromophore of fluorophore, and thus glycan release 
methods must produce a glycan with an intact reducing 
terminus (the site of attachment of the label). Such 
methods are also ideal for structural determination by 
mass spectrometry. Two methods, chemical or enzy¬ 
matic, are available. The most common chemical method 
employs anhydrous hydrazine (Patel et al. 1993), 
a reagent that cleaves amide bonds and which is capable 
of releasing all of the glycans as intact reducing sugars. 
Its disadvantages are that the protein is totally destroyed, 
thus removing any information on site occupancy and 
also that the acetyl groups are removed from N- 
acetylamino-sugars, necessitating a re-A-acetylation 
step. Concomitant O-acetylation often occurs as a side 
reaction leading to a complicated spectrum from the 
resulting glycan profile. Enzymatic release is generally 
more satisfactory and uses one of a range of 
endoglycosidases but is most often performed with pro¬ 
tein A-glycosidase E (PNGase F) This enzyme releases 
most glycans except those containing a 3-linked fucose 
attached to the reducing-terminal GlcNAc residue. In 
these cases, PNGase A is effective. Both enzymes are 
amidases and cleave the glycans as glycosylamines that 
must be converted to the reducing sugars, usually by 


treatment with a weak acid. In a recent method, glycans 
have been released from glycoproteins separated by, and 
retained in, SDS-PAGE gels (Harvey 2009a). In other 
methods, glycan release has been performed directly on 
the target of a MALDI-TOF mass spectrometer. Another 
common endoglycosidase, endoglycosidase H (endo H) 
cleaves between the GlcNAc residues of the chitobiose 
core but has the disadvantage that information on the 
reducing-terminal GlcNAc residue, such as the presence 
or absence of fucose, is lost. 

Cleanup of Glycans Prior to Mass 
Spectrometry 

Sample purity is critical for obtaining satisfactory mass 
spectra and, unsurprisingly, many methods are avail¬ 
able. The most common involve the use of HPLC, ion- 
exchange resins to remove salts, graphitized carbon 
mini-columns, or drop dialysis. The use of a Nafion 
117 membrane (Bornsen et al. 1995) has proved to be 
very useful, both for removing salts and for introduc¬ 
ing a specific cation (see below) for MALDI ioniza¬ 
tion. Recently, ► ion mobility mass spectrometry has 
been used by making use of the differences in mobil¬ 
ities between the glycans and contaminants (Fenn and 
McLean 2009; Harvey et al. 2011). 

Detailed Structural Identification by Mass 
Spectrometry 

Use of Exoglycosidase Digestion. One very popular 
method for obtaining detailed structures of the glycans 
is to use a series of exoglycosidases to remove glycans 
from the nonreducing terminus and to use the enzyme 
specificity to provide information on the number of 
glycans removed, their type, and their linkage. Moni¬ 
toring of the profiles before and after enzyme treatment 
is often performed by HPLC, hence the need to label 
the glycans but can equally well be monitored by mass 
spectrometric techniques such as MALDI-TOF 
(Fig. 2). In fact, the latter technique has the advantage 
of higher resolution (in terms of the glycan separation) 
than HPLC and is also much faster. It suffers, however, 
from the inability to resolve isomeric structures. 
Nevertheless, such information can sometimes be 
obtained by fragmentation, particularly in the negative 
ion mode (see below). 
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Mass Spectrometry of Af-Linked Carbohydrates and Gly¬ 
coproteins, Fig. 2 Exoglycosidase sequencing of a mixture of 
M-glycans from IgG by MALDI-TOF MS. (a) MALDI-TOF 
spectrum of a mixture of A-glycans (Symbols for the mono¬ 
saccharides are defined in the legend to Fig. 1). (b) MALDI-TOF 
spectrum of the glycans after incubation with Arthrohacter 
ureafaciens sialidase showing removal of sialic acid, (c) 


m/z 

MALDI-TOF spectrum after removal of sialic acid and incuba¬ 
tion with bovine testis (I-galactosidase to remove galactose 
residues, (d) MALDI-TOF spectrum following further digestion 
with (I-A-acetylglucosaminidase. These enzymes reduce the 
complex glycans to the two core structures (with and without 
fucose) leaving the high-mannose glycans whose nature could be 
proved by additional digestion with a mannosidase 


Use of Combined Gas-Liquid Chromatography! 
Mass Spectrometry (GC/MS). Although GC/MS is 
confined to the analysis of relatively small carbohy¬ 
drates, particularly monosaccharides, it provides 
information on the nature of the constituent monosac¬ 
charides, a property not available from techniques that 
record the intact glycan and provide only an intact 
mass. Glycans need derivatization for GC/MS analy¬ 
sis; common derivatives are permethyl, per-acetyl, or 
per-trimethylsilyl (TMS) ethers (Harvey 2011). Much 
composition, and particularly linkage information, can 
be obtained by a combination of permethylation, 
hydrolysis, and acetylation, a technique known as 
“methylation-, or linkage-analysis” (Carpita and Shea 
1989). Although many modifications are available, the 
basic procedure involves permethylation to convert all 


free hydroxyl groups to their methyl esters. The 
molecules are then hydrolyzed to generate monosac¬ 
charides with additional free hydroxyl groups at the 
sites of linkage. The monosaccharides are reduced to 
avoid the production of two peaks on chromatographic 
analysis due to a- and P-anomers and the newly gen¬ 
erated hydroxyl groups are derivatized with a reagent 
such as acetic anhydride to produce, in this case, 
partially methylated alditol acetates, known as 
PM A As. The positions of the various substituents are 
subsequently located by GC/MS to provide the linkage 
information. 

For complete analysis of A-glycans, it is usually 
necessary to employ exoglycosidase digestion, hydro¬ 
lysis, and HPLC or GC/MS linkage analysis in order to 
obtain the information of the nature of the 
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monosaccharides and then use fragmentation spectra 
to determine how the monosaccharides are linked in 
the parent glycan. The choice between these methods 
frequently depends on the quantity of the analyte avail¬ 
able. Linkage analysis involves four steps of chemical 
modification and is difficult to perform on small 
amounts of material. Exoglycosidase digestion on the 
other hand is ideally suited to analysis of small 
amounts of material but is critically dependent on the 
availability and purity of the necessary 
exoglycosidases. 

Ionization of Intact A/-Glycans for Mass 
Determination and Fragmentation 

Three common methods, fast-atom bombardment 
(FAB), MALDI, and electrospray are available for 
ionization of intact A-glycans although only the latter 
two are encountered in most recent work. 

FAB. Although FAB, introduced in the early 1980s, 
has mainly been replaced by MALDI and ESI, it has 
nevertheless played an important role in the analysis of 
intact glycans (Dell et al. 1993). To obtain a spectrum, 
the glycan must first be derivatized, usually by 
permethylation, and then suspended in a suitable 
matrix such as thioglycerol. The sample is then 
bombarded with ions or atoms. Generally, under 
these conditions, the glycan produces [M -i- Na]"^ ions 
although small amounts of [M -i- K]"^ ions are often seen 
in addition. The spectra also contain abundant 
fragment ions which means that the method is gener¬ 
ally unsuitable for obtaining glycan profiles. In 
addition, the matrix tends to produce ions throughout 
the A-glycan mass range resulting in poor signal- 
to-noise ratios and comparatively poor sensitivity. 
Nevertheless, permethylation, combined with 
MALDI or electrospray mass spectrometry remains a 
popular technique because of enhanced information 
that can be obtained by fragmentation; specifically, 
linkage sites are revealed by production of OH groups 
at the site of cleavage. 

Matrix-Assisted Laser Desorption!Ionization 
(MALDI). MALDI mass spectrometry (Hillenkamp 
et al. 2007), usually implemented on time-of-flight 
(TOF) mass spectrometers, on the other hand is 
much more sensitive and the matrix ions usually 
only occupy the lower portion of the spectrum 
(below about miz 500). The method, as applied to 


carbohydrates, has been the subject of a series of 
comprehensive reviews ((Harvey 2012) and earlier 
references cited within). Like FAB, MALDI ioniza¬ 
tion produces predominantly [M -i- Na]"^ ions from 
neutral glycans although other cations can be intro¬ 
duced by doping the matrix with an appropriate salt or 
by ion-exchange using the Nafion membrane men¬ 
tioned above. Only singly charged ions are produced 
and the technique is ideal for obtaining quantitative 
profiles of neutral glycans. Sialylated glycans produce 
a mixture of [M-H]“, [M -i- Na]"^ and [M-nH -i- (n -i- 
INa)]"^ ions, the latter being due to salt formation, but 
the main problem with these compounds is their ten¬ 
dency to fragment with loss of sialic acid. Neverthe¬ 
less, stabilization can be achieved by suitable 
derivatization such as methyl ester formation (Powell 
and Harvey 1996). Spectra are obtained by 
co-crystallization of the glycan with a suitable matrix 
of which many have been described. The most com¬ 
mon matrix is 2,5-dihydrobenzoic acid (DHB) with 
targets usually prepared by simple evaporation of the 
glycan matrix solution (the dried droplet technique). 
This method gives rather inhomogeneous targets but 
the situation can be improved by recrystallization 
from ethanol or by use of mixed matrices by the 
addition of compounds such as 2-hydroxy-5- 
methoxy-benzoic acid, 1-amino-Ao-quinoline, or 
spermine. 2,4,6-Trihydroxyacetophenone (THAP) is 
more appropriate for sialylated glycans, particularly 
when mixed with ammonium citrate because it mini¬ 
mizes laser-induced loss of sialic acid (see (Harvey 
1999) for more information on matrices). 

Electrospray. Solutions of the glycans are sprayed 
through a narrow orifice under a relatively high voltage 
with the production of several types of ion, depending 
on the ion-source conditions and additives to the sol¬ 
vent. In positive ion mode, [M -i- Na]"^ ions can be 
obtained, as in MALDI, at high cone voltages but 
these ions are usually accompanied by a considerable 
amount of in-source fragmentation. Both [M -t- H]"^ and 
doubly charged ions produced by protonation are 
formed under milder ion-source conditions. Prominent 
[M-H] “ ions are formed in negative ion mode but these 
ions frequently produce in-source fragments. 
However, stable negative ions of the type 
[M -I- A]“, where A = an anion such as S 04 ^“, Cl“, 
Br“, H 2 P 04 “, or N 03 “, can be produced by adding the 
appropriate salt to the electrospray solvent (Harvey 
et al. 2008). An advantage of electrospray over 
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MALDI is that the milder ionization conditions cause 
little, if any, sialic acid loss but, on the other hand, the 
production of several types of ion and some cross-ring 
in-source fragmentation limits the technique for glycan 
profiling. In addition, sensitivity generally falls as 
a function of increasing mass such that, in positive 
ion mode, it is difficult to record [M -i- Na]"^ ions from 
the larger glycans. Nevertheless, the method is well 
suited for producing ions for subsequent 
fragmentation. 

Mass Spectrometric Fragmentation 

Fragmentation of carbohydrates yield predominantly 
two types of ion, glycosidic fragments that are formed 
by cleavages between the sugar rings and cross-ring 
fragments that are the result of cleavages of two bonds 
within a ring. The former fragments provide informa¬ 
tion about the sequence of the carbohydrate whereas 
the latter provide more detailed information of linkage 
and branching. The nomenclature for describing these 
ions (Fig. 3) was introduced by Domon and Costello in 
1988 and has been widely adopted. 

Although these features are common to all fragmen¬ 
tation spectra, the major types of ion and general 
appearance of the spectra are strongly influenced by 
the type of ion that is fragmented, the energy imparted 
during fragmentation, and the ionic charge. 

Fragmentation of[M + H]^ Ions. Fragmentation of 
[M -I- ions from underivatized A-linked glycans is 
relatively simple and not particularly informative 
because it produces mainly B- and Y-type glycosidic 
cleavage ions. Cross-ring cleavage ions are generally 
of low abundance or absent. The presence of ions 
produced by two or more cleavages, leading to prod¬ 
ucts of equal mass, often results in spectra that are 
difficult to interpret. Another complicating factor is 
the appearance of rearrangement ions, particularly 
from glycans derivatized at the reducing terminus and 
containing fucose. Glycans that are derivatized at the 
reducing terminus by reductive amination such that the 
derivative contains a quaternary ammonium group or 
an amine that can be protonated, retain the charge on 
the derivative and, consequently, their fragmentation 
spectra are dominated by Y-type cleavages that pro¬ 
vide predominantly sequence information. 

Fragmentation of [M + Alkali Metal]^ Ions. In 
contrast to the fragmentation of [M -i- H]"^ ions. 



Mass Spectrometry of Al-Linked Carbohydrates and Gly¬ 
coproteins, Fig. 3 The Domon and Costello nomenclature for 
describing fragment ions from carbohydrates. Ions retaining the 
charge on the nonreducing terminus are named A (cross-ring), B, 
and C (glycosidic) whereas those ions retaining charge on the 
reducing terminus are X (cross-ring), Y, and Z as outlined in 
Fig. 3. Numbering (as subscripts) is from the reducing terminus 
for the X, Y, and Z ions and from the nonreducing end for the 
others. Cross-ring cleavage ions additionally have a superscript 
number defining the bonds that are broken 


fragmentation of [M + Li]"^ or [M -i- Na]"^ ions from 
A-linked glycans are much more complicated but at the 
same time contain more structural information (Fig. 4). 

Internal fragments (losses from two or more termi¬ 
nal sites) are again common but the spectra usually 
contain A-type cross-ring fragments in low abundance 
giving linkage information. The most significant of 
these ions are the °’^Ar and ^’^Ar ions from the reduc¬ 
ing terminus and °’^A, ’^A, and ^’^A ions from the 

branching mannose residue; the latter two ions can be 
used to define the composition of the individual 
antennae attached to the core mannose. An additional 
advantage of the fragmentation of the [M + metal] 
ions is that the rearrangement ions reported from the 
[M + H]"^ ions are absent. Under high energy condi¬ 
tions, such as those produced by magnetic sector or 
TOF/TOF-type instruments (Mechref et al. 2003; 
Stephens et al. 2004), additional abundant X-type 
cross-ring fragment ions are produced. It is increas¬ 
ingly difficult to fragment adducts of the higher alkali 
metals to the extent that [M + Cs]"^ ions yield practi¬ 
cally no fragments other than Cs"^. Further information 
on glycan structure and the fragmentation mechanisms 
leading to the diagnostic ions in their MS/MS spectra 
can be obtained by performing additional successive 
stages of fragmentation with instruments such as the 
ion trap, the so-called MS" technique. Web-available 
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Mass Spectrometry of Af-Linked Carbohydrates and Glycoproteins, Fig. 4 Positive ion fragmentation spectrum of the high- 
mannose glycan Man 5 GlcNAc 2 


software enables all such fragment ions to be 
calculated from the given glycan structures (Ceroni 
et al. 2008). 

Fragmentation of Negative Ions. Although gener¬ 
ally less popular than positive ion fragmentation, 
negative ion spectra of neutral A-glycans provide 
much more structural information than positive ion 
methods and, in particular, provide direct information 
on features such as the location of fucose residues and 
the presence or absence of “bisecting” GlcNAc 
residues that are difficult to obtain by either positive 
ion methods or exoglycosidase digestions (Harvey 
et al. 2008). Although both glycosidic and cross-ring 
fragments are produced, as in positive ion spectra, the 
glycosidic ions tend to be of the C-type, and A-type 
cross-ring fragments often dominate the spectra. 
These fragments are generally the result of an initial 


loss of a proton (or acid if the glycan is ionized by 
ionic attachment) and subsequent electron 
rearrangement (Fig. 5). 

Acidic carbohydrates, such as those containing sialic 
acid or carboxyl-containing derivatives, for example, 2- 
aminobenzoic acid (2-AA) are less satisfactory because 
they ionize by loss of one or more protons from acidic 
groups to give [M-nH]"“ ions with localized charges 
and, consequently, restricted fragmentation. 

Of particular interest in the negative ion fragmen¬ 
tation of neutral glycans is an ion termed ion 
D (see Fig. 5) that is produced by formal loss of the 
chitobiose core and the 3-antenna and which leads 
directly to the composition of the 6-antenna and 
provides information on isomers. If the glycan 
contains a bisecting GlcNAc residue, this ion elimi¬ 
nates the GlcNAc residue to give a prominent fragment 
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Mass Spectrometry of Af-Linked Carbohydrates and Gly¬ 
coproteins, Fig. 5 Negative ion fragmentation spectra of 
M-linked glycans (phosphate adducts). The glycan symbols are 
as defined in the legend to Fig. 1 (a) High-mannose glycan 
Man 5 GlcNAc 2 . Ions at m/z 1072 and 869 are cross-ring 
cleavages of the core GlcNAc residues. Ions at m/z 647, 629, 
and 575 are the D, [D-18]^ and fragments that define 
branching of the 6-antenna, (b) Biantennary glycan. The pattern 
of ions at the high mass end of the spectrum (m/z 1478, 1418, and 
1275) define the reducing terminus and parallel those ions in the 
spectrum above reflecting the same core structure. The D and 


[D-18]^ ions are at m/z 688 and 670, respectively. The ion at m/z 
424 is a cross-ring cleavage ion (* ’^A 3 ) defining the composition 
of the antennae, (c) Fucosylated biantennary glycan. The spec¬ 
trum is virtually identical to that of the biantennary glycan 
(spectrum b) except for the 405 mass unit interval between the 
[M + H 2 P 04 ]^ ion at m/z 1883 and m/z 1478, which indicates 
fucose attached to the core GlcNAc residue, (d) Bisected 
biantennary glycan. The defining [M-221]^ ion (see text) is at 
m/z 670. (e) Triantennary glycan. The cross-ring cleavage ion at 
m/z 831 defines branching of the 3-antenna. Other ions parallel 
those in the spectrum of the biantennary glycan (spectrum b) 


221 mass units lower (Fig. 5d). Cleavages of the 
GlcNAc residues of the core provide information of 
fucose substitution and the presence of a fragment at 
m/z 831 (Fig. 5e) defines the presence of a branched 
3-antenna in the spectra of complex glycans. These and 
other diagnostic fragments are summarized in Table 2. 

Summary 

Mass spectrometry is an essential technique for the 
structural determination of carbohydrates and A-linked 
glycans in particular. Spectra of intact glycoproteins or 


derived glycopeptides provide information on the site of 
attachment of the glycans, and fragmentation of the 
released glycans, particularly in negative ion mode, 
gives a considerable amount of information on the 
detailed structure of the compounds. However, the 
most widely used techniques, MALDI and ESI mass 
spectrometry, do not provide information of the 
constituent monosaccharides; this information must 
still be obtained by orthogonal techniques such as GC/ 
MS or exoglycosidase digestion. Methods for glycan 
analysis are still being developed with ion mobility 
mass spectrometry currently proving to be the most 
promising for both isomer separation and for extracting 
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Mass Spectrometry of Af-Linked Carbohydrates and Glycoproteins, Table 2 

ion spectra of W-linked glycans ionized by anion addition 

Ions defining structural features 

in the negative 

Structural feature 

Ion“ 

Ionic composition 

m/z 

Composition 

Molecular 

[M -t X]^ 

- 

Antenna sequence 

C (glycosidic) 

Gal, Man, Glc 

179 



GlcNAc, GalNAc 

220 



Fuc-Gal 

325 



Gal-GlcNAc 

382 



Gal-[Fuc]GlcNAc 

528 



aGal-Gal, Man-Man 

341 



Man-[Man] Man 

503 



GalNAc-GlcNAc 

423 

Antenna composition 

F (Cross-ring) 

Man 

221 



GlcNAc 

262 



Gal-GlcNAc 

424 



Gal-[Fuc]GlcNAc 

570 



aGal-Gal 

383 



aGal-Gal-GlcNAc 

586 



GalNAc-GIcNAc 

465 



(Gal-GlcNAc )2 

789 



(Gal-GlcNAc )3 

1154 


B 3 (Glycosidic) 

Neu5Ac-Gal-GlcNAc 

655 



Neu5Gc-Gal-GlcNAc 

671 

Fucose at 6 -position of reducing 

(Cross-ring) 

[M-Cl-307]^ 

[M-342]- 

terminus 


[M-N03-307]^ 

[M-369]- 



[M-H2P04-307]^ 

[M-405]^ 

Absence of fucose at 6 -position of 

(Cross-ring) 

[M-Cl-161]^ 

[M-196]^ 

reducing terminus 


[M-N03-161]^ 

[M-223]^ 



[M-H 2 P 04 - 161 ]^ 

[M-259]^ 

Composition of 6 -antenna 

D and [D-18]“([D-36]“)(Probably glycosidic)' 

= GlcNAc 

526, 508 



Gal-GlcNAc 

688 , 670 



Gal-[Fuc]GlcNAc 

834, 816 



(Gal-GlcNAc )2 

053, 1035 
(1017) 



(Gal-GlcNAc) 2 Fuc 

1199,1181 
(1163) 



Man 3 

647, 629 



Man 4 

809, 791 



Man 5 

971,953 


°'^Ar .2 and °’"^Ar. 2 '’ (Cross-ring) 

GlcNAc 

292, 262“ 



Gal-GlcNAc 

454, 424 



Gal-[Fuc]GlcNAc 

600, 570 



(Gal-GlcNAc )2 

819, 789 



(Gal-GlcNAc) 2 Fuc 

965,935 



Man 3 

251,221 



Man 4 

413, 383 



Man 5 

575,545 


(continued) 
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Table 2 (continued) 

Structural feature 

Ion“ 

Ionic composition 

m/z 

Composition of 3-antenna 

°’"'Ar. 3 '“ (E) ion (Cross-ring) 

Gal-GlcNAc 

466 



G1cNAc2 

507 



Gal-GlcNAc 2 

669 



(Gal-GlcNAc )2 

831 



(Gal-GlcNAc) 2 Fuc 

977 

Presence of bisect 

Abundant [D-221]^ ion Ion at [M-221-18]^Missing 

GlcNAc 

508 


Ion D (Probably glycosidic)“ 

Gal-GlcNAc 

670 



Gal-[Fuc]GlcNAc 

816 



(Gal-GlcNAc )2 

1035 



(Gal-GlcNAc) 2 Fuc 

1181 



Man3 

629 



Man 4 

791 



Man 5 

953 

Presence of sialic acid 

Bi (Glycosidic) 

Neu5Ac 

290 



Neu5Gc 

306 

Presence of a.2 6-linked sialic 

“■'^A2-C02 

Neu5Ac 

306 

acid 

(Cross-ring) 

Neu5Gc 

322 


“Demon and Costello nomenclature 
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tion of mass spectra of glycans. J Proteome Res. 
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hydrate fragmentations in FAB-MS/MS spectra of 
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Fenn LS, McLean JA. Simultaneous glycoproteomics on the 
basis of structure using ion mohility-mass spectrometry. 
Mol Biosyst. 2009;5:1298-1232. 

Harvey DJ. Matrix-assisted laser desorption/ionization mass 
spectrometry of carbohydrates. Mass Spectrom Rev. 
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'’“R” is used to denote the reducing-terminal? GlcNAc residue 
“Mechanism not determined 


the spectra of glycans from complex or contaminated 

samples where other methods have failed. 
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► Mass Spectrometry: Methods of Ionization (Applied 
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Mass Spectrometry: Application 
to Protein-Ligand Interactions 

Jonathan T. S. Hopper 

Department of Physical and Theoretical Chemistry, 
University of Oxford, Oxford, UK 

Synonyms 

Mass spectrometry 


Introduction 

The study of protein-ligand (P-L) interactions is cru¬ 
cial for understanding many biological processes at the 
molecular level. These may include the mechanism by 


which an enzyme turns over its substrate, or the bio¬ 
logical activity of exogenous molecules such as drugs 
and toxins. Indeed, fundamental processes in the cell 
depend on the continuous formation and dissociation 
of noncovalent P-L interactions. The application of 
mass spectrometry (MS) to the study of protein mole¬ 
cules was revolutionized by the introduction of soft 
ionization techniques in the 1980s. Ganem and Chait 
provided the first demonstration that electrospray ion¬ 
ization (ESI) could be used to preserve noncovalent 
P-L complexes within a mass spectrometer (Ganem 
et al. 1991; Katta and Chait 1991). Since then, 
a variety of mass spectrometric techniques have been 
developed that allow important information to 
be obtained concerning these complexes. This entry 
highlights the application of MS in studying P-L 
interactions and modern techniques that are currently 
used by researchers in this field. 

Characterizing Protein-Ligand Complexes by 
Mass Spectrometry 

Detection of Binding and Determining 
Stoichiometries 

The preservation of noncovalent interactions through 
the use of “soft” ionization techniques allows 
researchers to directly observe intact P-L complexes 
by way of mass spectrometry detection methods. Ulti¬ 
mately, the data obtained from MS yields information 
concerning the species mass and therefore allows the 
presence of a P-L-binding event to be identified as 
a discrete increase in mass, illustrated in Ligure 1 . As 
a consequence of this ability, MS is well suited to 
determining stoichiometries in cases where multiple 
species can interact with one protein (or indeed multi¬ 
ple proteins interacting about a single ligand). The 
range of complexes that are now amenable to analysis 
by MS-based approaches is rapidly expanding. This 
has been driven by the introduction of techniques that 
aid the survival of P-L interactions that are destabilized 
in a desolvated environment (Sun et al. 2007). Conse¬ 
quently, the number of examples of P-L interactions 
being studied by MS is now vast and have been 
highlighted in recent texts (Lerguson et al. 2009). 

Diffusion Measurements 

Diffusion measurements can be used to monitor 
P-L-binding events. The theoretical basis of this method 
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Mass Spectrometry: 
Application to Protein- 
Ligand Interactions, 

Fig. 1 An ESI-MS spectrum 
of FKBP-FK.506 complex. 
Protein images generated from 
PDB file IFKJ using PyMOL 
software. Protein (FKBP) is 
displayed as the blue structure 
and the ligand (FK.506) is 
colored red. Charge states are 
also indicated above each 
species 



relies on monitoring the translational diffusion of 
a protein and its ligand within solution. The diffusion 
coefficient (D) of a molecule within a solution can be 
defined by the following relationship: 



6nrjRs 


where k is the Boltzmann constant, T is the tempera¬ 
ture, rj is the viscosity of the solution, and is the 
Stokes radius (a hard sphere representation of the 
structure). This relationship illustrates that large mac¬ 
romolecules such as proteins will diffuse at a slower 
rate than their ligands. Therefore, the binding of 
a ligand molecule to a protein will reduce the diffusion 
rate of the former. This effect can therefore be utilized 
to monitor P-L-binding events directly from solution. 
The ESI-MS is positioned at the end of the flow tube to 
detect the analytes as they exit the capillary and allow 
dispersion profiles to be generated. In the absence of 
a P-L interaction, the dispersion profile of the ligand 
will be notably different to that obtained for the pro¬ 
tein. Conversely, if the two species do interact in 
solution, the profiles will appear similar. This 
technique is most effective when the ligand’s physical 
dimensions are significantly smaller than the 
dimensions of the P-L complex. 

Unlike experiments which rely upon the ability to 
preserve noncovalent interactions in the gas-phase, 
diffusion measurements monitor P-L binding within a 
solvated environment (prior to ESI-MS analysis). 


Therefore, the technique is well suited to determining 
dissociation constants for complexes which are 
unstable within the desolvated environment of a mass 
spectrometer (Clark and Konermann 2004) 

Hydrogen/Deuterium Exchange (HDX) 

The use of hydrogen/deuterium exchange (HDX), 
in conjunction with MS detection, is a powerful 
tool for investigating structural characteristics and 
dynamics of proteins. HDX involves rendering the 
protein or protein complex in an environment com¬ 
posed of deuterating reagent, commonly deuterium 
oxide (D 2 O). Hydrogen atoms located on the back¬ 
bone amides and polar side-chain heteroatoms of 
the protein exchange with the deuterium atoms of 
the surrounding solvent. HDX can be considered an 
energy neutral process (ignoring any isotope 
effect), and therefore allows the opportunity to 
study the reaction surface of a protein without the 
product ions being subject to any energy-induced 
changes. Mass spectrometry is then well suited to 
determining the amount of exchanges that have 
occurred. 

The exchange of specific amide hydrogen depends 
on its solvent accessibility, hydrogen bonding, and its 
spatial location in relation to secondary structure, 
including helices and p-sheets. These aspects of 
exchange mean that it is useful in probing changes in 
protein conformation, since buried amides will become 
available for exchange as the protein structure unfolds. 
HDX also provides a useful method for monitoring 
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noncovalent ligand-binding events. The introduction of 
a ligand to the binding interface of a protein will render 
this region of the protein structure less solvent accessi¬ 
ble, and will therefore result in a lower HDX rate. 
Ligand binding also reduces the protein flexibility and 
will alter the structural dynamics of the protein. Since 
this method is indirect, that is, the P-L complex is not 
required to survive within the MS, it can also he applied 
to complexes that are unstable in the gas phase. 

Enzymatic cleavage of the deuterated protein with 
pepsin can be used to provide information on site- 
specific exchange within the protein sequence. Pepsin 
is chosen as it is active under the acidic conditions used 
to quench exchange. If significant conformational 
rearrangement does not occur as a result of ligand 
binding, the binding energy of an interaction can be 
estimated by differences in the H/D exchange rates 
of the apo- and holo-protein. The interested reader is 
directed to a comprehensive description of this tech¬ 
nique (Green and Lebrilla 2006). 

Hydroxyl Radical Footprinting 

Hydroxyl radical footprinting assays are designed to 
monitor protein conformation by labeling protein res¬ 
idues. Principally, as with the use of deuterating sol¬ 
vent, these assays give direct measurements of protein 
structural dynamics and solvent accessibility due to the 
size similarity of the hydroxyl anion and water. 
Hydroxyl radicals may be generated in several ways, 
including electrochemistry, radiolysis of water, photo¬ 
dissociation of hydrogen peroxide, or Fenton chemis¬ 
try. The exposure time of the protein samples is 
dictated by the rate of hydroxyl radical formation (dic¬ 
tated by the method utilized). The resulting oxidized 
protein can then be analyzed by MS to determine the 
change in mass. Information based on the site-specific 
modifications can be obtained using either a “bottom- 
up” (whereby the protein is digested then subjected to 
LC-MS) or a “top-down” (via tandem MS/MS) 
approach. 

A major drawback to using hydroxyl radicals is that 
the protein conformation can be significantly altered 
during the labeling reaction. Innovative methods have 
been introduced where the reaction times are less than 
the time required for structural rearrangement. An 
example of this uses a 248 nm KrF excimer laser to 
induce hydroxyl radicals from hydrogen peroxide, in 
the presence of a scavenger, with lifetimes less than 
1 ps (Hambly and Gross 2005). Apomyoglobin was 


analyzed as a model protein and results were in accord 
with ► NMR data demonstrating the potential of the 
method for studying protein structures and interac¬ 
tions. A key advantage to hydroxyl labeling approach 
is the irreversible nature of the covalent modification. 
This is best demonstrated when it is utilized in 
top-down or bottom-up techniques of peptide map¬ 
ping. Scrambling effects that are problematic when 
conducting tandem MS analysis of deuterated proteins 
are absent in this technique. Additionally, digested 
samples do not require specialized chromatographic 
conditions that are essential in HDX experiments to 
limit back exchange. 

Electron Capture Dissociation 

Electron capture dissociation (ECD) has emerged as 
a powerful method for fragmentation of multiply charged 
biomolecules. The process involves the capture of an 
electron by a precursor ion to generate an excited radical 
species, which subsequently undergoes rapid dissocia¬ 
tion. ECD is a nonergodic rapid fragmentation process, 
typically occurring on timescales of < 10“*"* s, and capa¬ 
ble of breaking strong bonds in the presence of weaker 
interactions (Hoffmann and Stroobant 2007). 

ECD has found useful applications in areas such as 
protein sequencing, characterizing post-translational 
modifications (PTMs) and in studying protein fold¬ 
ing/unfolding in the gas phase. The ability to specifi¬ 
cally cleave the covalent backbone of a protein or 
peptide, without destroying weaker noncovalent inter¬ 
actions with a ligand, allows exciting “top-down” 
opportunities. The fragmentation technique has dem¬ 
onstrated the capability of mapping ligand-binding 
sites in protein structures (Xie et al. 2006). New 
supercharging reagents, such as m-nitrobenzyl alcohol 
and sulfolane, have also recently shown great potential 
in this area (Yin and Loo 201 1). 

The nature of the technique also means it has great 
value in HDX experimental procedures. Following 
digestion of a deuterated protein, the resulting peptides 
can be fragmented by ECD MS/MS without significant 
scrambling of deuterium atoms. Thermal ion activa¬ 
tion MS/MS methods, such as CID and IRMPD, induce 
unwanted H/D scrambling resulting in loss of site- 
specific labeling patterns. 

Ion Mobility Spectrometry (IMS) 

► Ion mobility is a gas-phase electrophoretic 
technique, which is becoming increasingly used 
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together with MS to provide complimentary size/ 
charge information. 

Concerning its application in P-L complexes, ► ion 
mobility can be used to determine conformational 
rearrangements that result from binding (Atmanene 
et al. 2010). ► Ion mobility has also been used in 
conjunction with collision induced activation to detect 
differences in structural stability of bound and 
unbound protein-ligand complexes. This work shows 
that the additional intermolecular interactions pro¬ 
vided by the ligands confer structural stability to the 
proteins in the gas phase (Hopper and Oldham 2009), 
similar to the effect of ligand-binding events in solu¬ 
tion. This type of experimental procedure has also been 
used to investigate the unfolding pathways of proteins 
in the presence of ligands (Hyung et al. 2009). 

Quantifying Protein-Ligand Interactions 

Using mass spectrometry to determine quantitative 
information on P-L binding is an attractive concept. 
MS offers several advantages over other biophysical 
approaches, including sensitivity, speed, and the 
requirement of only trace amounts of material. 

Gas-phase dissociation techniques allow P-L inter¬ 
actions to be probed within the mass spectrometer, 
although this yields information of the complex in the 
desolvated/ionized state. Often, such measurements do 
not correlate with solution-based methods, as the rela¬ 
tive contributions made by different interactions 
change upon desolvation. Alternatively, MS can be 
used as a “detection method” for monitoring the appar¬ 
ent solution equilibrium between the protein and its 
binding partner. The latter is clearly more informative 
for understanding biological molecules in their native 
environment and numerous examples demonstrate 
good correlation to established solution-based meth¬ 
odologies, highlighted in Mathur et al. (2010). 

Titration Experiments 

Separation of analytes according to differences in miz 
means that MS is able to detect the increases in mass 
that result from ligand interactions with a protein. 
Therefore, the mass spectrum can be utilized to detect 
the amount of bound complex in relation to the amount 
of protein that remains unbound. 

Titration allows determination of the apparent solu¬ 
tion affinity constant {K^ or Kj) of a P-L system. 
Usually, the protein concentration is kept constant 
and the ligand is titrated into the solution. ESI-MS is 


used to monitor the intensity of the P-L complex and 
the intensity of the free protein at each ligand concen¬ 
tration. By summing the intensities of free and bound 
protein peaks and expressing this as a ratio (R), the 
affinity constant {Ka) of the P-L interaction can be 
calculated from the following expression: 


Ka 


R 


[i^]o 


l+R 


Y. hp~‘ 

Yi 


Where [P]o and [L]o are the concentrations of protein 
and ligand respectively. 

It should be noted that nonlinear curve htting 
approaches have also been employed. The success of 
this direct observation approach relies on the ability to 
minimize the amount of dissociation within the MS, so 
as to gain an accurate representation of the solution 
binding equilibrium. Buffer considerations are also 
important. Ammonium salts such as ammonium 
acetate act to keep the solution at near biological pH 
and are sufficiently volatile to be removed during the 
ESI process. More fundamental questions concerning 
the impact of the ESI process on the solution equilibria 
are also important, and these have been discussed in 
detail by Mathur et al. (2010). 

Competition Approaches 

Competition methodologies are based on observing var¬ 
ious different ligands simultaneously as they compete 
for a binding site on a specific protein. The approach 
provides direct information regarding the relative affin¬ 
ities of these ligands toward the target protein. The 
relative binding affinities of the ligands are reflected in 
the intensity of their signal relative to each other. The 
speed of this approach, as with many MS-based exper¬ 
iments, is the main advantage to this technique. In order 
for these measurements to be valid requires the ioniza¬ 
tion efficiencies of the host and its complexes to be 
similar. Incorporating reference ligands of known affin¬ 
ities allows absolute binding affinities of other ligands to 
be determined (Kempen and Brodbelt 2000). 


SUPREX 

SUPREX (Stability of t/npurified Proteins from Pates 
of H/D Exchange) is a mass spectrometric-based tech¬ 
nique that allows the thermodynamic information of 
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P-L interactions to be determined. It utilizes a HDX 
labeling approach to monitor the degree of unfolding 
induced by varying degrees of solution-based denatur- 
ant, commonly guanidine hydrochloride (GdmCl) or 
urea. Due to the high GdmCl salt content of the resulting 
denatured samples, the analysis is best suited to MALDI 
(matrix-assisted laser desorption ionization), which has 
a much better salt tolerance for such samples. This said, 
a recent approach introduced in 2008 termed “SPROX” 
utilizes an irreversible oxidation methodology (Fitzger¬ 
ald and West 2009). The stability of the oxidized pro¬ 
teins allows subsequent desalting techniques to be used 
that render the samples suitable for ESTMS analysis. 

A SUPREX protocol involves diluting the protein 
or P-L samples with a deuterated buffer solution 
containing varying concentrations of a chemical dena- 
turant. Exchange is then allowed to proceed for 
a specific amount of time, after which the sample is 
added to a sinapinic acid matrix and analyzed via 
MALDI-MS to determine the amount of deuterium 
uptake. A SUPREX curve can then be constructed by 
plotting the shift in mass (relative to the protonated 
protein mass) as a function of denaturant concentra¬ 
tion. The denaturant concentration at the midpoint of 

the curve c^uprex j^g applied in obtaining 

2 

the solution phase folding free energy (AG/) and m 
(defined as (5AG//(5[denafurant]) for the protein. 

- AGf = -t- RT In 

2 {Mpr') 

where R is the gas constant, T is the temperature, ki„, is 
the average exchange rate of an amide proton, t is time, 
n is the number of protein subunits, and [P] is the 
protein concentration. If this is also applied to the 
ligated protein, the differences in the folding free 
energy between the apo- and holo-forms, allow the 
P-L-binding constant to be obtained. 


AACj \ 

g-^sr — 1 j 

A major advantage in using SUPREX approaches is 
that it can be used to analyze unpurified proteins. An 
exceptional application that emphasizes this beneht 
involved the in vivo determination of a protein’s 


stability within the Escherichia coli cytoplasm 
(Ghaemmaghami and Oas 2001), which demonstrates 
the future potential of this technique. The technique 
has also demonstrated excellent potential as a tool in 
high-throughput P-L screening applications, achieving 
a sample rate of 3 minAigand using a conventional 
MALDI instrument; however, the authors postulate that 
a rate of <20 sAigand could be achieved using high- 
throughput MALDI equipment (Hopper et al. 2008). 

A thorough description of this technique is provided 
by Kaltashov and Eyles (2005), including mathemati¬ 
cal interpretations and recent examples. 

PLIMSTEX 

PLIMSTEX (Protein-L/gand Interactions in solution by 
MS, Titration and HAA EXchange) can be used to char¬ 
acterize the conformational change that results from 
ligand binding, along with the stoichiometry of the 
interactions and the corresponding affinity constants. 

In PLIMSTEX experiments, the protein samples are 
prepared in aqueous buffer containing different concen¬ 
trations of ligand, which are then diluted in D 2 O to 
allow the exchange to commence. The exchange time 
is an important consideration in these experiments and 
is selected based on the time that results in the largest 
difference in deuterium uptake between the protein and 
the P-L complex. ESI-MS analysis is used to determine 
the number of deuterium atoms incorporated into sol- 
vent-accessible amide sites. This can be done using LC- 
MS approaches on the intact protein or on proteolytic 
fragments. PLIMSTEX curves can then be constructed 
by plotting the deuterium uptake as a function of ligand/ 
protein concentration ratio. Binding constant informa¬ 
tion can be extracted from the data fitting function. 

Although similar to SUPREX, PLIMSTEX has 
some notable advantages. First, no denaturant is 
required for these experiments which may prevent the 
P-L interaction forming. This also means that it is 
compatible with electrospray ionization-based tech¬ 
niques. PLIMSTEX allows multiple binding events to 
be monitored and can be used to assess stoichiometry. 
This said, however, the technique is strictly limited to 
binding events that result in a change in the number of 
protected amide protons (Eitzgerald and West 2009). 

Blackbody Infrared Dissociation (BIRD) 

Obtaining energetic information on P-L systems in the 
gas phase via tandem MS experiments rely on the 
ability to characterize the internal energy imparted on 
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the parent ion as well as the time over which the 
dissociation takes place. Such parameters are difficult 
to derive from many dissociative techniques, such 
as CID, since the internal energy distributions and 
effective temperatures of the ions are difficult to 
predict due to the complex collision processes and 
their dependence on many instrumental parameters. 

BIRD requires low background pressures to remove 
any excitation through ion-molecule collisions as well 
as the ability to carry out the activation over long 
activation times. Such criteria can be attained within 
an FTICR cell and ensures that the transfer of energy to 
the precursor ion proceeds via absorption of blackbody 
photons emitted from the chamber walls. Infrared pho¬ 
tons that are emitted from the cell walls result in 
vibrational excitation of the analyte ion; however, 
this is a slow process and typically excitation experi¬ 
ments are carried out on the timescale of seconds/ 
minutes. Another experimental consideration is to 
accurately determine and control the temperature of 
the cell. Several methods have been adopted to achieve 
this and are discussed in detail in other texts 
(Hakansson and Klassen 2010). 

For large molecules such as proteins, the rates of 
energy equilibration with the surroundings are much 
greater than the rate of dissociation, and a limit of rapid 
energy exchange (REX) is attained. Under these cir¬ 
cumstances, time-resolved measurements allow disso¬ 
ciation kinetics to be calculated according to the 
Arrhenius relationship. Further details of BIRD meth¬ 
odology can be found in more comprehensive texts 
(Wesdemiotis and Wang 2006). 

Summary 

Mass spectrometry is now a well-established technique 
for studying P-L complexes, capable of providing 
a wealth of information regarding the binding and 
structure of such biomolecules. This entry highlights 
methods that are useful in modem research, which 
contribute to a greater understanding of these intricate 
and dynamic molecular entities. 

Cross-References 

► H/D Exchange 

► Ion Mobility Mass Spectrometry - Principles 

► NMR-based Structural Proteomics 
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Mass Spectrometry: Mapping Large 
Stable Protein Complexes 
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Synonyms 

Characterizing protein-protein interactions; Mass 
spectrometry; Protein quaternary structure; Protein- 
Protein Interactions 

Definition 

Large, stable protein complexes are assemblies of two 
or more polypeptide chains, linked by specific 
noncovalent protein-protein interactions, which are 
stable on a biologically relevant timescale. 

Mapping protein complexes is the process of deter¬ 
mining noncovalent binding interactions and quater¬ 
nary structure of protein subunits in a multiprotein 
assembly. 

Introduction 

The large majority of cellular processes are carried out 
by a sophisticated network of protein machinery. 
Knowledge of these molecular interactions is vital to 
understanding and controlling the biological processes 
of the cell. Therefore, closely connected to the study of 
protein function is the analysis of protein structure and 
identification of binding partners (Waksman 2005). 


In many contexts this is described as “mapping protein 
complexes,” which currently forms a challenging yet 
rich area of biophysical research. 

Protein-protein complexes form as a result of col¬ 
lective weak, noncovalent binding interactions 
between polypeptide subunits, which include hydro¬ 
gen bonding, electrostatic interactions, and van der 
Waals forces. The extent of these interactions can 
determine the shape, stability, and lifetime of the com¬ 
plex. If the binding interactions are sufficiently strong 
to maintain intermolecular association over 
a significant time period, the complexes are said to be 
stable, and can range from dimeric assemblies on the 
order of tens of kiloDaltons to extensive multimers 
with megaDalton molecular masses. 

Different approaches have been employed to iden¬ 
tify protein-protein interactions for structural analysis 
and to characterize protein complexes. In those cases 
where high-resolution methods (such as X-ray crystal¬ 
lography and NMR spectroscopy) are applicable, the 
solved three-dimensional structure gives atomic level 
insights into stable interactions and subunit arrange¬ 
ment within a protein assembly. However, obtaining 
structures of large, multiprotein complexes is chal¬ 
lenging, laborious, and in some instances not possible 
by such traditional approaches. Furthermore, given the 
structural complexity and extreme number of different 
assemblies in cells, these methods are not well adapted 
to the large scale mapping of protein complexes. 

The first interactome maps were described using the 
yeast-two-hybrid system, a genetic assay well suited 
and highly utilized for the large scale screening of 
protein-protein interactions (Suter et al. 2008). In this 
approach, two fusion proteins are co-expressed in 
yeast, one fused to a DNA binding domain, the other 
to a transcription activation domain. If the two proteins 
interact, then the activation and binding domains of the 
transcription factor are indirectly connected, resulting 
in activation of selected reporter genes. In this way, 
a successful interaction between the fused proteins is 
inferred from a change in cell phenotype. Mapping 
networks of interactions can be achieved by screening 
a library of proteins, in some cases for entire organ¬ 
isms; however, this approach provides limited quater¬ 
nary structure information. Furthermore, some 
interactions may be under represented as a protein 
complex may require higher order interactions 
between multiple subunits to be stabilized. Finally, it 
is now known that this type of screening approach can 
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Mass Spectrometry: Mapping Large Stable Protein Com¬ 
plexes, Fig. 1 Scheme for a mass spectrometry-based 
approach to the determination of protein complex architecture. 
Following affinity purification, a combination of proteomics and 

often yield an undesirable level of false-positive 
results, and thus various complementary methods are 
required to confirm putative binding interactions. 

In recent times, a more focused approach toward 
mapping large, stable protein complexes has primarily 
been achieved by the combination of affinity purifica¬ 
tion in conjunction with mass spectrometric tech¬ 
niques. Continued developments in instrumentation 
and analysis have seen mass spectrometry (MS) 
emerge from a technique used to provide protein pri¬ 
mary sequence to a structural biology tool for higher 
order protein structure determination, and establish 
itself as key technology for the in-depth analysis of 
proteins and protein complexes (Downard 2007; 
Gingras et al. 2007). The emergence of sensitive and 
high-throughput MS methods has further encouraged 
the development of techniques for the biochemical 
purification of stable protein complexes in their native 


native mass spectrometry can provide complex composition and 
subunit interaction maps. In combination with ion mobility and 
computational methods, a well defined model of an intricate 
protein assembly can be generated 

form. This entry describes the mapping of large, stable 
protein complexes with a primary focus on MS and 
complementary techniques (Fig. 1). 

Affinity Purification of Large, Stable Protein 
Complexes 

Major progress in the isolation of protein assemblies 
for structural mapping has been achieved primarily 
utilizing affinity purihcation approaches (Bauer and 
Kuster 2003). By these techniques, large, stable 
multiprotein complexes can be isolated directly from 
cell lysates at endogenous levels through one or more 
affinity-based purification steps. Affinity purification 
can be performed under near physiological conditions 
directly from the cell type of interest, thereby 
maintaining native structures, and has the ability to 
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preserve and identify relevant post-translational 
modifications. 

In a classic co-immunoprecipitation experiment, 
a protein complex is affinity captured from cell lysates 
using an immobilized antibody that recognizes 
a specific epitope on a protein of interest. The retrieved 
complex is washed extensively to remove nonspecific 
binding partners, and eluted for analysis. More gener- 
ically, antibodies can be used to target proteins of 
interest expressed with an epitope tag, which is then 
used as an analytical handle to purify the protein and its 
interaction partners. A variety of different epitope tags 
(e.g., Myc, HA, Flag, KT3) have been used success¬ 
fully and antibodies against these tags are commer¬ 
cially available. 

A dual purification strategy known as tandem affin¬ 
ity purification (TAP) offers further improvement over 
traditional single step methods and is now widely used 
for high-throughput or large scale protein complex 
mapping studies. In this experiment, a protein is 
expressed containing a tag consisting of two sections, 
joined by a cleavable linker. In the original TAP tag, 
a proximal calmodulin-binding peptide tag is separated 
from a distal protein A tag by a TEV protease cleavage 
site. During purification, the complex is first isolated 
by affinity purification on IgG-sepharose. The tag is 
then cleaved with TEV, and the released protein com¬ 
plex (now fused only to the proximal tag) is 
immobilized for further purification on calmodulin 
sepharose in the presence of calcium. Addition of 
EGTA depletes calcium ions essential for calmodulin 
binding, releasing the recombinant protein and its 
interacting partners for analysis. The two subsequent 
purification steps improve specificity of the procedure. 

Identifying Protein Complex Components 

Eollowing isolation and purification, mass spectrome¬ 
try is currently the method of choice for identification 
of proteins from a multiprotein complex since it offers 
sensitive ► mass analysis, routinely using quantities of 
protein in the femtomole range. MS sequencing of 
peptides can be achieved within seconds and is easily 
automated, thereby being compatible with high- 
throughput strategies. Eurthermore, it also allows 
for the characterization of post-translational modifica¬ 
tions, exogenous modifications such as chemical cross- 
linking, and can be adapted for quantification. 


Typically, these analyses follow protocols similar 
to those derived for ► mass spectrometry in proteo- 
mics, utilizing a bottom up approach (Bauer and 
Kuster 2003). Protein complexes are isolated and the 
individual proteins, often separated by gel electropho¬ 
resis or liquid chromatography, are enzymatically 
digested. The resulting peptides are identified by 
a combination of accurate peptide mass fingerprinting 
and sequence profiles generated from tandem MS, and 
can be searched against protein sequence databases 
to provide a list of proteins present in the sample. 
An overview of the TAP-tag-MS experiment is given 
in Pig. 2. 

Since the initial landmark TAP-tag-MS experi¬ 
ments were reported, numerous studies have provided 
valuable data on the composition of protein complexes 
and global organization of proteins for several organ¬ 
isms including yeast and human cells. Por instance, 
this strategy has been used to generate 
a comprehensive map of the yeast interactome, com¬ 
prising 7,123 protein-protein interactions (Krogan 
et al. 2006). This approach has also been utilized in 
a more focused manner to identify proteins involved in 
single signaling pathways, single regulation systems, 
or single protein complexes. As an example, smaller 
protein networks have been mapped in mammalian 
cells, such as the Par interactome involved in cell 
polarity (Brajenovic et al. 2004). 

From Interactions to Complexes 

Affinity purification-MS approaches are also having 
important implications for structural genomics 
(Benesch et al. 2007). By MS analysis of intact protein 
complexes and subcomplexes, stoichiometry, binding 
interactions, and a high-confidence model for the 
subunit architecture of a noncovalent assembly can 
often be proposed. 

In recent years, mass spectrometers have been mod¬ 
ified to overcome the hurdles associated with transmit¬ 
ting and detecting large ions of macromolecular 
assemblies up to the MDa range. Furthermore, soft 
ionization techniques, in particular electrospray ioni¬ 
zation from the volatile aqueous buffer ammonium 
acetate, have allowed for the generation of gas phase 
ions of large, nonvolatile molecules such as proteins 
and their noncovalent complexes in a manner that 
avoids structural rearrangement or dissociation. Once 
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Mass Spectrometry: 
Mapping Large Stable 
Protein Complexes, 

Fig. 2 General overview of 
a TAP-tag-MS experiment for 
identification of proteins in 
a stable protein complex. 
Isolation of the native complex 
is achieved in a two-step 
affinity purification protocol, 
utilizing protein A and 
calmodulin-binding peptide 
tags, separated by a TEV 
cleavage site, which bind to 
IgG and calmodulin beads, 
respectively. Proteins in the 
complex can be separated by 
SDS-PAGE and enzymatically 
digested for proteomics 
analysis and protein 
identification through 
sequence database searching 
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the components of a protein complex have been iden¬ 
tified and their masses determined, accurate measure¬ 
ment of the intact mass of a protein complex can be 
used to define the stoichiometry of each component. 

Further structural information for protein complex 
mapping can be obtained by inducing dissociation of 
the assembly using gas or solution phase approaches, 
allowing insight into subsets of interacting compo¬ 
nents within the complex. In the gas phase, this is 
most commonly achieved by collision induced disso¬ 
ciation (CID), whereby ions of interest are selected 
(typically by a quadrupole mass analyzer) and accel¬ 
erated into a gas-filled collision cell (Benesch et al. 
2007). Ion-neutral collisions activate the protein com¬ 
plex, causing dissociation, and the product ions are 
subsequently analyzed to provide a tandem (MS/MS) 
spectrum from which structural information can be 
inferred. The typical dissociation pathway for 
a noncovalent protein assembly results in ejection of 
a highly charged subunit with concomitant formation 
of a “stripped complex” containing the remainder of 
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Mass Spectrometry: Mapping Large Stable Protein Com¬ 
plexes, Fig. 3 Tandem mass spectrum of the 4S* charge state 
(m/z 10,700) of the intact 517 kDa RNA polymerase II complex. 
Eliminated subunits such as Rpb4 and Rpb7 are visible at low 
m/z while charge distributions observed above m/z 12,000 are 
concomitant high mass fragments, wherein A indicates elimina¬ 
tion of that subunit from the precursor assembly. As indicated in 
the structural cartoons, peripheral subunits are predominantly 
eliminated (Reproduced from Heck 2008) 
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Mass Spectrometry: 
Mapping Large Stable 
Protein Complexes, 

Fig. 4 Model of the human 
eIF3 derived from 27 
subcomplexes, IP data, and 
interactions identified in the 
yeast complex. The complex 
dissociates into distinct 
modules in response to 
changes in ionic strength; in 
this case, the spectmm shown 
was recorded at intermediate 
ionic strength (350 mM 
AmAc). Arrows denote 
additional interactions not 
readily represented in this 
model. eIF3 subunits are 
represented as spheres scaled 
according to their mass 
(Reproduced from Zhou et al. 
2008) 
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the assembly. The CID process facilitates stoichiomet¬ 
ric determination since the product ions are distributed 
over a wider miz range, and hence more easily 
resolved. Furthermore, it helps to identify peripheral 
subunits which dissociate more readily than core com¬ 
ponents of the protein assembly, as seen in the tandem 
mass spectrum of RNA polymerase II given in Fig. 3 
(Heck 2008). 

Additional solution-based dissociation methods 
are also employed to investigate subunit binding 
interactions and define the arrangement of a protein 
complex (Hernandez et al. 2006). Controlled manip¬ 
ulation of solvent conditions, typically by changing 
ionic strength or addition of chaotropic agents, can 
result in an ensemble of protein subcomplexes which 
can be subsequently analyzed by MS. This process 
appears to primarily dissociate weak subunit inter¬ 
faces within the assembly, and retain much of the 
noncovalent bonding and conformational properties 
of the original complex. Characterization of these 
subcomplexes by MS makes it possible to infer 
a subunit interaction map, and in combination with 
complementary techniques such as ion mobility and 
computational modeling, help to elucidate the three- 
dimensional architectural organization for the native 
assembly. An example of this approach can be seen in 
Fig. 4 for the 13 component human eukaryotic trans¬ 
lation factor eIF3 (Zhou et al. 2008). 


Ion Mobility Mass Spectrometry for Protein 
Complex Structure 

While MS is a powerful tool for protein structure 
investigation, its usefulness for mapping protein com¬ 
plexes can be enhanced when combined with ion 
mobility (IM) analysis (Uetrecht et al. 2010). The 
► principles of ion mobility spectrometry are 
described elsewhere, but in brief are based upon the 
rate at which an ion will migrate under the influence of 
an electric field through a region of neutral buffer gas, 
which is in turn related to cross section and therefore 
molecular structure. This methodology is not only able 
to identify and characterize components of protein 
complexes by MS, but incorporate conformational 
information by IM measurement of cross section. 
Application of IM-MS for the study of large protein 
assemblies is relatively recent, with the introduction of 
accessible commercial instrumentation (Waters 
Synapt HDMS in 2006) providing impetus for rapid 
development in the held. 

The hrst dehnitive evidence that the quaternary 
topologies of protein complexes can be maintained in 
the gas phase came from IM-MS studies of the 
undecameric tryptophan RNA-binding attenuation 
protein complex (Ruotolo et al. 2005), and numerous 
reports have followed. Furthermore, a recent study 
designed to investigate the topology of subcomplexes 
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generated in solution suggests that the suhcomplexes 
retain much of their native conformation upon solution 
phase disruption and therefore could be considered as 
“building blocks” for the generation of model struc¬ 
tures (Pukala et al. 2009). Consequently, IM-MS data 
in combination with solution phase disruption 
approaches and computational tools are emerging as 
a promising approach to define topological models of 
protein complexes. 

Since IM-MS is not a high-resolution technique, it 
relies on modeling protein complexes to create a link 
between measured collision cross-section data and 
structural composition. IM-MS data can provide 
a series of distance and connectivity constraints for 
model generation. Recent studies have utilized 
a combination of IM-MS constraints, high (NMR spec¬ 
troscopy and X-ray crystallography) and low (electron 
microscopy, small angle scattering methods) resolu¬ 
tion structural data with coarse-grained and homology¬ 
modeling methods to define models of complex protein 
assemblies. 

Chemical Cross-linking 

Although protein complexes can be purified to homo¬ 
geneity in vitro often the necessary conditions for 
purification interfere with the detection and analysis 
of protein interactions. In addition, mapping protein 
complexes poses a challenging task since many protein 
interactions are transient. The use of chemical cross- 
linking agents is a valuable way to stabilize in vivo 
protein interactions, and gain further insight into the 
quaternary arrangement of subunits in a multiprotein 
assembly (Sinz 2006). Chemical cross-linkers possess 
two reactive groups that can covalently link protein 
subunits, joined by a spacer arm. The range of chem¬ 
ical selectivity for cross-linking reagents toward cer¬ 
tain functional groups (e.g., primary amines, 
sulfhydryls, or carboxylic acids) and the variety in 
spacer arm lengths offer the possibility for optimiza¬ 
tion of cross-linking reactions. 

Once covalently linked, MS can be used to analyze 
the intact complex and determine assembly composi¬ 
tion and stoichiometry. This is a particularly useful 
approach for revealing interactions between mem¬ 
brane proteins, since detergent concentrations used to 
solubilize membranes and extract proteins typically 
disrupt protein-protein binding. Alternatively, by 


analyzing protein complexes following proteolysis, 
structural information from the cross-linked peptides 
can be obtained. The spacer arm defines the maximum 
distance between the two cross-linked moieties, and as 
such, identification of the cross-linking sites provides 
distance constraints and information on protein inter¬ 
faces which can be incorporated into structural models. 
Chemical cross-linking can also be used in combina¬ 
tion with other MS-based strategies such as limited 
proteolysis, HD exchange, and radical footprinting to 
further map subunit binding interfaces and protein 
dynamics (Downard 2007). 

Putting It Together - Defining Architecture 
of Multiprotein Complexes 

Employing a multifaceted MS-based approach has 
allowed researchers to map and propose low resolution 
structural models for large, stable protein complexes, 
combining valuable information regarding protein 
composition, stoichiometry, post-translational modifi¬ 
cations, subunit interactions, and topology. 

The yeast exosome represents the first endogenous 
complex targeted by structural MS (Taverner et al. 
2008) and provides a useful summary of the method¬ 
ology. Affinity purification and MS analysis of the 
exosome complex revealed a ten subunit cytosolic 
species, and identified a number of peripherally 
located subunits (notably Csl4). Dimeric and trimeric 
subcomplexes formed by addition of chaotropic agents 
were further defined by tandem MS to generate a 3D 
subunit interaction map which could be aligned with 
the ring structure identified by electron microscopy. In 
this manner, a model of the quaternary structure was 
proposed based around a six-subunit ring, where all 
three RNA-binding motifs exist on one face of the ring 
and the largest subunit, Dis3 RNase, sits on the oppos¬ 
ing face. In this case, the resulting structural model was 
refined and corroborated with homology modeling and 
X-ray crystallographic data for the human homologue. 

Integration of IM with MS has also helped to refine 
structural models of protein complexes for which there 
is limited high-resolution data, as recently demon¬ 
strated for components of the Escherichia coli 
replisome (Politis et al. 2010). In this study, IM-MS 
was used to record collision cross-section measure¬ 
ments for relevant subunits, subcomplexes, and com¬ 
plexes which established distance constraints for 
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downstream topology searches. An integrative compu¬ 
tational method utilizing molecular modeling, IM-MS, 
and incomplete atomic structures (principally from 
X-ray crystallography) was used to generate models 
of the subunit architecture of multimeric protein com¬ 
plexes within the replisome. 

Quantitative Mapping of Protein Complexes 

To date all high-throughput protein interaction maps that 
have been generated represent a static view. However, 
since protein complexes vary in order to regulate func¬ 
tion, it has become crucial to develop methods that can 
probe dynamic changes in the composition of protein 
complexes as a function of external stimuli or cell cycle. 
Quantitative proteomics methods, including stable iso¬ 
tope incorporation in addition to label-free strategies, are 
primarily utilized for this task (Oeljeklaus et al. 2009). 

Quantitative MS techniques allow for simultaneous 
identification of proteins and determination of protein 
abundance, or more commonly, differences in abun¬ 
dance between distinct samples. In stable isotope 
labeling experiments, isotopic mass tags are incorpo¬ 
rated (either metabolically during expression or chem¬ 
ically following isolation) into proteins of samples 
derived from different experimental conditions. Dif¬ 
ferentially isotope-labeled samples are combined for 
MS analysis, with the predictable mass shifts resulting 
from incorporated mass tags facilitating direct com¬ 
parison of signal intensities in the same analysis. 
Label-free approaches relying on the comparison of 
proteolytic peptide abundance across different liquid 
chromatography MS/MS runs have also been utilized 
in proteomics experiments; however, reports of their 
use for the analysis of protein complexes are still rather 
limited. Determination of protein abundance by quan¬ 
titative MS also enables accurate mapping of distinct 
protein complexes, which can be incorporated into 
entire protein interaction networks. 

Summary 

Mapping large, stable protein complexes remain one of 
the key challenges in the post-genomic era. In recent 
years, affinity purification combined with MS-based 
technologies have been utilized successfully in both 
large- and small-scale mapping of stable protein 


complexes. These approaches have the potential to 
define composition, binding interactions, dynamic 
changes, and structure, from primary through to qua¬ 
ternary organization, in protein assemblies. It is likely 
that complete maps of large, stable protein complexes 
defined by these and other complementary strategies 
will provide a starting point to address current ques¬ 
tions of protein biological function and regulation. 

Cross-References 

► Ion Mobility Mass Spectrometry - Principles 

► Mass Analysis 

► Mass Spectrometry in Proteomics 

► NMR 

► Protein-Protein Interactions Studied by Mass 
Spectrometry 
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Mass Spectrometry: Methods of 
Ionization (Applied to Biopolymers) 

Neil J. Oldham 

School of Chemistry, The University of Nottingham, 
Nottingham, UK 

Definition 

Ionization: the process of converting atoms or mole¬ 
cules to ions. In the context of hiopolymer mass spec¬ 
trometry this usually involves the addition or removal 
of protons. 


Introduction 

The mass spectrometric analysis of biopolymers such 
as proteins, peptides, oligonucleotides, and carbohy¬ 
drates has been revolutionized by the development of 
two key ionization methods; electrospray ionization 
(ESI) and matrix-assisted laser desorption ionization 
(MALDI). Both methods facilitate the transfer of large, 
polar biomolecules from the condensed phase in to the 
gas phase, permitting determination of their mass/ 
charge ratios (m/z). In the case of ESI, ionization of 
the biomolecules takes place in solution, under the 
influence of a strong electric field, and before evapo¬ 
rative desolvation of the analyte. MALDI is achieved 
by embedding the biomolecule in a crystalline matrix 
capable of absorbing energy from laser irradiation, 
which causes both desorption and ionization. 
Eor large biomolecules, MALDI and ESI spectra differ 
markedly in their degree of analyte charging. MALDI 
produces principally singly charged ions, even for 
relatively large proteins (70 kDa), whereas ESI gives 
rise to a distribution of multiply charged molecular 
species. 

The basic principles of ESI and MALDI are 
described below. Eor detailed descriptions of both 
techniques and their applications, please see 
(Cole 2010). 

Electrospray Ionization 

The practical development of ESI can be traced back to 
Dole in the 1960s (Dole et al. 1968), building on 
the early experimental work of Zeleny (1914) and the 
theoretical work of Rayleigh (1882). It is the 
pioneering work of John Eenn, however, which has 
led to modern ESI and its application in biomolecular 
analysis (Eenn et al. 1989). The basic ESI setup is 
composed of a conductive capillary, containing 
a sample in a flowing protic solvent, held at a high 
potential relative to the entrance to the mass spectrometer 
(MS). In positive ion mode, this leads to an accumulation 
of positive ions within the solvent at the tip of the 
capillary. This “Taylor cone” is unstable and produces 
a fine spray of like-charged droplets which repel each 
other and produce a plume. The charged droplets migrate 
through the atmospheric pressure region of the ESI 
source toward the counter electrode, undergoing evapo¬ 
ration and Coulombic explosion as they shrink, often 
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under the influence of a nebulizer and/or drying gas. 
Small droplets and partially desolvated ions pass, through 
guiding electric fields, into the intermediate vacuum 
stages of the MS either via a series of cones or through 
a heated capillary. On emergence from this differentially 
pumped region, the ions are totally desolvated and their 
mass/charge (mlz) ratio can be measured by an appropri¬ 
ate mass analyzer, usually held under high vacuum. 

The precise mechanism by which gas phase ions are 
released from ESI droplets remains unresolved. 
Two prominent theories exist: the charge residue 
model (CRM; Dole et al. 1968) and the ion evaporation 
model (lEM; Iribame and Thomson 1976). In the 
former, the analyte remains in the shrinking droplet, 
finally freed by solvent evaporation, while in the latter, 
desolvated, or almost desolvated, ions are ejected from 
the droplet. It may be that both mechanisms occur, 
depending upon the size of the analyte ion. 

ESI is very intolerant of salts and involatile buffers, 
and these should be minimized by thorough desalting 
into water, a mixture of water/organic solvent/formic 
acid (0.1%), or volatile buffers such as ammonium 
acetate. The appearance of the ESI spectrum of 
a protein is highly dependent on the solvent from 
which it is sprayed. Denaturing solvent systems such 
as water/acetonitrile/formic acid produce very highly 
charged species, as the protein is unfolded in solution 
and its basic amino acid residues protonated. Spraying 
a protein from ammonium acetate at near-neutral pH 
leads to fewer charges, as only the surface basic resi¬ 
dues of the native protein are protonated. The ability of 
ESI to produce multiply charged protein ions means 
that even large proteins can produce signals with the 
m/z range of many ► mass analyzers. 

Because of its requirement for a flowing solvent, 
ESI is ideally suited for coupling to liquid chromatog¬ 
raphy (EC), especially reverse phase EC, where the 
solvents are generally polar and protic. The ESTbased 
LC/MS has become a major tool in chemical and 
biochemical analysis. ESI is a very soft ionization 
method, typically transferring 1 eV or less to the 
sample ion. Because of this, little or no fragmentation 
of the molecular species is observed. 

MALDI 

MALDI developed out of laser desorption ionization 
(EDI); a technique where energy transferred from the 


absorption of radiation is utilized both to desorb 
a sample molecule from a surface and to ionize it. 
EDI is limited in its application to those molecules 
with a significant extinction coefficient at the wave¬ 
length of laser irradiation. The breakthrough in gener¬ 
alizing EDI came in the 1980s when it was recognized 
that analytes without a strong absorption could be 
desorbed and ionized in the presence of a large excess 
of absorbing material or “matrix” (Karas et al. 1985). 
Erom initial applications using small sample mole¬ 
cules, it was quickly realized that large proteins could 
be ionized by this approach (Karas et al. 1989; Tanaka 
et al. 1988). 

The precise mechanism of MALDI is a matter of 
ongoing discussion. The process is usually divided in 
to two steps: primary and secondary ion formation 
(Knochenmuss 2010). Primary ion formation concerns 
generation of the initial ionic species, formed almost 
immediately after laser ablation. These are generally 
thought to be matrix-derived, but may be analyte ions, 
depending on the mechanism involved. Secondary ion¬ 
ization results from the ion-molecule reactions which 
occur in the desorption plume following primary ion 
formation. 

The choice of matrix is crucial for successful 
MALDI measurement. Many matrices commonly 
used for biomolecules are derivatives of benzoic or 
cinnamic acids. a-Cyanohydroxycinnamic acid is 
frequently used for peptides, whereas sinapinic acid 
is generally preferred for proteins. Dihydroxy benzoic 
acid is often the matrix of choice for oligonucleotides 
and oligosaccharides, and also for negative ion appli¬ 
cations. For positive ion measurements, trifluroacetic 
acid (0.1%) may be added to promote ionization. 
Matrix and sample are usually premixed in a 10^:1 
M ratio before spotting on the target plate, with the 
matrix at near saturation concentrations (dried droplet 
method). As an alternative, the sample can be depos¬ 
ited on to a preformed microcrystalline matrix surface 
(overlayer method). The tendency for MALDI to 
generate singly charged ions (and little fragmenta¬ 
tion) results in a relatively simple spectrum. This 
makes the technique suitable for analysis of mixtures, 
such as those produced by tryptic digestion of 
a protein. The analysis of large intact proteins by 
MALDI usually requires a time of flight (TOE) 
► mass analyzer operating in the linear mode, as the 
m/z values produced often exceed the upper limit of 
many other analyzer designs. 
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The power of MALDI has been expanded by its 
application in imaging (Brunelle and Laprevote 
2010). Prepared sections of biological material can 
imaged by rastering the laser over the sample surface. 
By viewing specific miz values with an intensity heat 
map, an image of the distribution of biological mole¬ 
cules within the tissue can be built up. 


Summary 

ESI and MALDI have revolutionized the mass 
spectrometry of biomolecules. By allowing large pro¬ 
teins and other biopolymers to be ionized and trans¬ 
ferred into the gas phase with little or no degradation, 
they have produced a rapid and sensitive method for 
biomolecular analysis. A number of variants of ESI 
and MALDI have been developed recently, including 
desorption ESI (DESI), laser desorption ESI (ELDI), 
and MALDI-ESI (MALDESI) (Cole 2010), which 
continue to expand the applications of these powerful 
ionization techniques. 


Cross-References 

► Mass Analysis 

► Mass Spectrometry in Proteomics 

► Mass Spectrometry: Mapping Large Stable Protein 
Complexes 

► Protein-Protein Interactions Studied by Mass 
Spectrometry 
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Maximum Entropy Reconstruction 
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Definition 

Maximum entropy (MaxEnt) reconstruction is 
a technique for computing frequency spectra from time 
series data, for example, a free induction decay. It is 
based on the principle of information entropy introduced 
by Claude Shannon (1948). The MaxEnt spectrum con¬ 
tains the smallest amount of information consistent with 
the experimental data. In contrast to the discrete Eourier 
transform (DFT), MaxEnt references the measured data 
indirectly, enabling it to handle nonuniform sampling 
(NUS, also called sparse sampling). Thus, an important 
application of MaxEnt is the computation of 
multidimensional NMR spectra from NUS data. 

Mathematically, MaxEnt reconstruction is formu¬ 
lated as 

Maximize S(f) subject to C(d,ni) < Co (1) 

where f is the frequency spectrum (of arbitrary 
dimension), d is the measured data, and m is the 
inverse discrete Eourier transform of f. S(f) is 
the entropy, C(d, m) is a measure of consistency, and 
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Co is an estimate of the noise amplitude. The Shannon 
entropy functional 

M 

Ssh annon (f) = ^/;iog,/;- (2) 

/=i 

is not applicable to phase-sensitive spectra containing 
positive and negative components. A complex entropy 
functional suitable for NMR spectra has been derived 
(Hoch et al. 1990, Daniell and Hore 1989). C(d, m) is 
typically the unweighted chi-squared statistic, 

N 

C(d, m) = ^ {di - niif (3) 

/=i 

The constrained optimization of Eq. 1 is solved by 
converting it to an unconstrained optimization, 

maximize 0(f, d) = S(f) -f /lC(d, m) (4) 

where is a Lagrange multiplier, adjusted to achieve 
C(d, m) = Cq. Efficient algorithms for MaxEnt recon¬ 
struction have been developed that enable large 
multidimensional MaxEnt reconstructions to be 
performed using laptop computers (Skilling and 
Bryan 1984; Hoch and Stem 1996). 

An important property of MaxEnt reconstraction is 
that it is nonlinear. An implication is that metrics such 
as signal-to-noise ratio are no longer reliable indicators of 
sensitivity, and more robust metrics must be used to 
distinguish sensitivity from purely cosmetic reductions 
in noise level (Donoho et al. 1990). Several approaches 
have been developed for calibrating or minimizing the 
nonlinearity so that MaxEnt can be used in contexts where 
quantifying magnitudes of signal components 
is important, such as nuclear Overhauser or metabolomics 
experiments (Hyberts et al. 2007; Schmieder et al. 1997). 

Application of MaxEnt to NUS data requires only 
that the time values not sampled be excluded from the 
sum in Eq. 2, provided that the data are collected on 
a subset of the normal grid defined by the Nyquist 
condition. A comparison of MaxEnt reconstruction 
with linear prediction (LP) extrapolation demonstrated 
that the ability of MaxEnt to provide high resolution 
from short data records mainly derives from the ability 
to use NUS data (Stem et al. 2002). MaxEnt can 
be applied to data collected using coupled evolution 
periods (radial sampling), demonstrating a fundamen¬ 
tal connection between reduced dimensionality 


methods (RD (Ding and Gronenbom 2002), back pro¬ 
jection (Kupce and Freeman 2003), or GET (Kim and 
Szyperski 2003)) and more general NUS approaches 
(Mobli et al. 2006). The development of efficient sam¬ 
pling strategies for multidimensional NMR experi¬ 
ments is now an active area of research. Reductions 
in experiment time of an order of magnitude or more 
and vast improvements in resolution along indirect 
dimensions compared to conventional uniform sam¬ 
pling have made NUS approaches routine in biomo- 
lecular NMR, especially at ultrahigh magnetic fields 
where the sampling problem is more acute due to the 
shorter sampling interval imposed by greater spectral 
dispersion and the Nyquist condition. 

Cross-References 

► Linear Prediction in NMR Spectroscopy 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR Methods for Kinetic Analysis 

► Nuclear Overhauser Effect 

► Sparse Sampling in NMR 
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Synonyms 

Ion channels; Mechanosensitive ion channels; 
MS channels; Stretch-activated ion channels; 
Stretch-gated ion channels 

Definition 

Mechanosensitive ion channels are sensitive to 
mechanical stresses on the lipid bilayer and thus 
are activated by stretching of the cell membrane in 
response to a variety of mechanical stimuli. 


Mechanosensitive ion channels are important 
molecular transducers in living organisms and 
play important roles in a variety of physiological 
processes including hearing, touch, and blood pres¬ 
sure regulation, to name a few. Multidisciplinary 
studies on mechanosensitive ion channel proteins 
are facilitating an understanding of their structure, 
function, and mechanisms of mechanosensory 
transduction. 

Introduction 

The first mechanosensitive (MS) channels were 
reported in embryonic chick skeletal muscle (Guharay 
and Sachs 1984) and frog muscle (Brehm et al. 1984). 
Over the last 25 years, this type of ion channel has been 
identified in a variety of living cells from all evolu¬ 
tionary provenances including Bacteria, Archaea, 
yeasts, green algae, plants, animals, and humans. 
Today, many channels have been recognized as being 
mechanosensitive or stretch activated, and include the 
prokaryotic MS channels of large and small conduc¬ 
tance (MscL and MscS, respectively), the eukaryotic 
transient receptor potential (TRP) type cation chan¬ 
nels, 2P-type potassium channels (TREK-1, TREK-2, 
TRAAK) channels, and sodium channels from the 
MEC/ENaC family (ENaC, MEC4/MEC10), to name 
but a few (Hamill and Martinac 2001). Please see 
Table 1 for a non-exhaustive list of MS channels that 
have been cloned to date. 

The structure and function of prokaryotic MS 
channels MscL and MscS have provided researchers 
with much of what is known today as to how 
MS channels function and are regulated (Hamill 
2007a, b). In bacteria, these channels serve to protect 
the cells from osmotic shock. In conditions of hypoto- 
nicity a bacterial cell will typically expand or swell. 
This swelling is a result of turgor pressure, which in 
turn causes the cell membrane to stretch. This stretching 
of the bacterial cell membrane is what is responsible for 
the opening of these MS channels leading to a release of 
ions and other small osmolytes from within the cell 
cytoplasm. The turgor pressure then becomes reduced 
and the bacteria can continue to function in their new 
hypotonic environment. Here, an outline of the various 
methods that have been employed to identify the struc¬ 
ture and function of these unique types of channels is 
given. 
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Mechanosensitivity of Ion Channels, Table 1 Cloned mechanosensitive channels to date (non-exhaustive) 

Channel 

Provenance 

Gating mechanism 

Physiological role 

Book/review/reference 

MscL 

Bacteria 

Bilayer stretch 

Turgor control 

Hamill (2007); Martinac (2008) 

MscS 

Bacteria 

Bilayer stretch 

Turgor control 

Hamill (2007); Martinac (2008) 

MscK 

Bacteria 

Bilayer stretch 

Turgor control 

Kubalski and Martinac (2005) 

MscM 

Bacteria 

Bilayer stretch 

Turgor control 

Schumann et al. (2010) 

MscCG 

Bacteria 

Bilayer stretch 

Turgor control 

Bomgen et al. (2010) 

TRPYl 

Yeast 

Bilayer stretch 

Turgor control 

Hamill (2007); Martinac (2008) 

MSLl 

Plant 

Bilayer stretch 

Cell morphology 

Hamill (2007) 

MSL2 

Plant 

Bilayer stretch 

Cell morphology 

Hamill (2007) 

MSL3 

Plant 

Bilayer stretch 

Cell morphology 

Hamill (2007) 

TREKl 

Mammalian 

Bilayer stretch 

Resting membrane potential 

Hamill (2007) 

TREK2 

Mammalian 

Bilayer stretch 

Resting membrane potential 

Hamill (2007) 

TRAAK 

Mammalian 

Bilayer stretch 

Resting membrane potential 

Hamill (2007) 

TRPC6 

Mammalian 

Bilayer stretch 

Control of vascular smooth 
muscle tone 

Martinac (2008) 

TRPCF 

Mammalian 

Bilayer stretch/tether 

Fluid flow in kidney epithelia 

Hamill (2007); Nilius (2007); Martinac 
(2008) 

TRPV4 

Mammalian 

Bilayer stretch/tether 

Osmoregulation and touch 

Hamill (2007); Nilius (2007) 

TRPP 

Mammalian 

Tether 

Fluid flow in epithelia 

Hamill (2007) 

TRPM3“ 

Mammalian 

Bilayer stretch 

Osmoregulation 

Nilius (2007) 

NMDA 

receptors 

Mammalian 

Bilayer stretch/tether 

Learning and memory 

Martinac and Kloda (2011) 

ENaC 

Caenorhahditis 

elegans 

Bilayer stretch/tether 

Maintenance of body salt/ 
water homeostasis 

Hamill (2007); Martinac (2008); 

Martinac and Kloda (2011) 

MEC4/ 

MECIO 

C. elegans 

Tether 

Touch 

Hamill (2007) 

MscMJ 

Archaea 

Bilayer stretch 

Turgor control 

Martinac and Kloda (2011) 

MscMJLR 

Archaea 

Bilayer stretch 

Turgor control 

Kloda and Martinac (2002); Martinac and 
Kloda (2011) 
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Patch Clamp Recording of Bacterial 
Mechanosensitive Channels 

The dimensions of Escherichia coli, a Gram-negative 
rod-shaped bacterium, are about 0.8 pm wide and 2 pm 
long, which is almost the same size as the diameter of 
a patch pipette. Therefore, a larger bacterial object 
(with cell wall removed) is necessary for patching the 
inner membrane. This is accomplished by generating 
giant spheroplasts formed from bacterial cells (Blount 
et al. 1999) (Fig. 1). The ease of patch clamping giant 
spheroplasts will depend on the strain used. For exam¬ 
ple, AW737 E. coli strains are more accommodating 
than MJF strains. 

Purified MS channels can also be patched from 
artificial liposomes described in a separate section 
below. To determine whether MS channels are present 
typically requires the membrane patch to be stretched, 
which is achieved by applying a negative or in some 


cases positive hydrostatic pressure to the membrane 
patch by using a pressure pump connected to the 
pipette holder of the recording apparatus. At certain 
pressure thresholds the channels will open. Patch- 
clamp traces for MS ion channels typically exhibit 
a square wave pattern (Fig. Ic). 

Reconstituting Mechanosensitive Channels 
in Lipids 

Given that they are the most widely studied MS channels, 
both MscL and MscS have also been reconstituted into 
a wide range of lipid systems. A dehydration/rehydration 
(D/R) method for channel incorporation into liposomes 
was the first developed that gave good reconstitution 
efficiencies when studied using the patch-clamp 
technique. This method (Martinac et al. 2010) requires 
2 days preparation time but gives very good reconstitu¬ 
tion efficiencies with MscL. The D/R method has also 
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Mechanosensitivity of Ion Channels, Fig. 1 Generation and 
patching bacterial giant spheroplasts. (a) Escherichia coli cells 
stained with green-fluorescent SYTO 9 dye which is membrane- 
permeable and binds to nucleic acid. Scale bar = 5 pm. (b) The 
patch pipette is shown approaching a giant spheroplast. (c) 
Channel activity of wild-type (strain AW737) MscS and MscL 


(upper), and coiTesponding negative pressure applied through 
a patch pipette to the inside-out membrane patch of giant sphe¬ 
roplasts (bottom) are shown. The insets show the magnification 
of the MscS (box) and MscL (oval) traces, respectively. Pipette 
potential was held at 4-20 mV 


been used to successfully incorporate other bacterial MS 
channels including MscK (MS channel of potassium) and 
those from Haloferax volcanii and Methanocaldococcus. 
jannaschii (Martmac 2007). 

Another method used is the sucrose method 
(Martinac et al. 2010), which yields good reconstitu¬ 
tion efficiencies for both MscS and MscL. Both 
methods have been successful using wild-type (WT) 
and mutant channels. 

An electric field method, using a solvent-free pro¬ 
cess, is available using Vesicle Prep Pro® (Nanion 
Technologies, GmbH Miinchen, Germany). This sol- 
vent-free method results in giant unilamellar lipo¬ 
somes that can be used to incorporate MS channels 
(see Fig. 2a). Functional MscL activity using this 
method has been demonstrated by patch-clamp 
(Fig. 2b). 

Despite the plethora of literature available on the 
bacterial MS channels, there have been comparatively 
few reports on the reconstitution into liposomes of other 
MS channels, in part due to their difficulty in cloning. 
MS channels that have, however, been reconstituted 
using the D/R method include the A-methyl-D-aspartate 


(NMDA) receptor. Gramicidin A, and the Canonical 
Transient Receptor Potential protein (TRPC1) (Martinac 
andKloda 2011). 

Opening Channels with Amphipaths 

Amphipaths are molecules that possess both hydropho¬ 
bic and hydrophilic properties, such as those found in 
detergents, in active lipid substances such as fatty acids, 
and the phospholipids of biological membranes. Amphi- 
pathic molecules are capable of altering the curvature of 
the plasma membrane of cells due to their biochemical 
and organizational properties, such as shape, size of the 
polar head group, degree of hydration, and charge. The 
hydrophobic (lipid-like) group typically consist of 
a hydrocarbon moiety, whereas the hydrophilic region 
has typically either polar (uncharged) or ionic (anionic or 
cationic) groups. The hydrophilic elements have 
a bilayer-coupling effect such that differential incorpo¬ 
ration of an amphipathic molecule into the respective 
bilayer leaflets can change the bilayer intrinsic curvature, 
imitating membrane stretch. 

Studies focusing on dissecting the molecular mecha¬ 
nism of MS channel function are often based on the use 
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Mechanosensitivity of Ion Channels, Fig. 2 (a) Giant from GUVs prepared using Vesicle Prep Pro®. The pipette 
unilamellar liposomes (GUV) containing MscL prepared using voltage was +30 mV 
the Vesicle Prep Pro®, (b) Patch-clamp recording of MscL 


of amphipaths. Amphipathic drugs strongly support the 
bilayer model of MS channel activation in which the 
mechanical gating force comes from the surrounding 
lipids. In the lipid bilayer model, asymmetrical effects 
of amphipaths may be explained in terms of compensa¬ 
tory conformational changes due to differential lipid acyl 
chain extension and/or compression that create the inner 
or outer monolayer distortion (Hamill and Martinac 
2001). An MS channel that displays asymmetrical 
mismatch with respect to each monolayer might there¬ 
fore be expected to respond differentially, depending 
upon the direction of membrane curvature (Hamill and 
Martinac 2001). For example, small changes in phos¬ 
pholipid acyl chain length (i.e., PC-20 to PC-18) can 
switch gramicidin A from a stretch-activated to a stretch- 
inactivated channel (Martinac and Kloda 2011). 

This bilayer-protein hydrophobic surface matching 
is one hallmark of MS channel activation and provides 
a hint for their modulation by amphipaths. Although 
fatty acids have been proposed to affect the gating 
properties of some ion channels by direct binding to 
the channel proteins, the perturbation of the plasma 
membrane induced by exogenous lipids also plays an 
important role in this process. MscL and MscS are 
activated by amphipaths such as chlorpromazine 
(CPZ), trinitrophenol (TNP), local anesthetics, and 
lysophospholipids (Hamill and Martinac 2001). Simi¬ 
larly, the activity of eukaryotic MS ion channels 
can be dramatically modulated by amphipathic lipid 


molecules such as phosphatidyl inositol-bisphosphate 
(PIP2), diacylglycerol (DAG), arachidonic acid, and 
lysophospholipids (Martinac and Kloda 2011). 

High Hydrostatic Pressure Recording 

Life does exist under high hydrostatic pressure (HHP). 
Animals of varying organizational levels have been dis¬ 
covered near the bottom of the Mariana Trench 
(>80 MPa) (Petrov et al. 201 1). How have they adapted 
to such harsh conditions? MS charmels, reconstituted into 
artihcial membranes (such as Azolectin) or present in 
native E. coli, can be used as a biophysical tool to 
study the effects of HHP on membrane bound proteins. 
Technically this is done by using a patch-clamp facility 
equipped with a “flying-patch” device (Fig. 3). This 
device provides conditions for the safe transfer of 
a patch pipette to a recording chamber under high 
pressure. This then allows the currents of MS channels 
to be recorded while various levels of HHP are applied 
(Petrov et al. 2011). The key element of the device is 
a cylindrical plastic protecting chamber. Once a 
giga-Ohm seal patch has been formed inside the patch 
pipette, the experimenter can slide down the cylindrical 
protection chamber toward the tip of the pipette. A few 
microliters of solution from the recording chamber will 
be captured inside and held in place by capillary forces. 
Thus, the most fragile part - the electrode tip with mem¬ 
brane patch - is isolated, surrounded by bath solution, 
and can be positioned inside a high pressure chamber. 
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Mechanosensitivity of Ion 
Channels, Fig. 3 High 
pressure setup (cartoon) and 
“flying-patch” device (photo). 
OP oil pump, CV check valve, 
OR oil reservoir, M 
manometer, HPC high- 
pressure chamber, FFD 
“flying-patch” device, HS 
patch-clamp headstage, T tap 




The design of the high pressure setup makes it 
impossible to apply negative pressure to the patch 
membrane in order to stretch it. This restricts its use 
to MS channels which have a lower activation thresh¬ 
old (such as MscS and MscK), or to using specifically 
mutated MS channels, which are able to spontaneously 
open. Both approaches have successfully been utilized. 

Forster Resonance Energy Transfer (FRET) 
Characterization of MS Channels 

To better understand the structural properties and 
mechanisms of action of MS ion channels, fluorescence 
spectroscopy methods have become more widely used. 
Forster resonance energy transfer (FRET), can be used to 
measure a change in inter- or intra-MS channel distances. 
For MS channels, fusion sites for fluorescent protein 
(e.g., green fluorescent protein [GFP]) linkage are 
limited to the intracellular regions of the protein, and as 
many MS channels contain multiple identical subunits 
(i.e., they are homomeric), each complex could be fused 
with 4, 5, or even 6 fluorescent proteins. This would 
mean that in order to produce a protein with an 
appropriate FRET pair, a cell might need to be 
co-transfected with two plasmids. How the various 
subunits come together to form a functional MS channel 
(if they do at all) could then be entirely up to chance. 

Though labeling the intact multimeric form of the 
protein with organic fluorophores can overcome the 
problem of protein formation, there is still the problem 
of an unknown labeling efficiency with respect to the 


number of donor and acceptors bound to the protein. 
To undertake intra-channel distance measurements by 
measuring FRET efficiency then would require molec¬ 
ular dynamic simulations to take into account the var¬ 
ious possible combinations of donor/acceptor labeling 
arrangements. As well, labeling with organic 
fluorophores can typically require mutagenesis of the 
protein to provide residues, such as cysteine, where the 
fluorophores are able to chemically bind. The selection 
of such residues needs careful consideration if the 
function of the MS channel is not to be affected. 
Endogenous reactive residues might also need to be 
mutated out to prevent labeling in unwanted regions of 
the protein. 

Keeping in mind all these considerations, it might 
be more advantageous to create a FRET pair where the 
donor fluorophore is itself an endogenous amino acid. 
Many studies use tryptophan for this purpose. Using 
tryptophan as the donor means that only an acceptor 
fluorophore would need to be attached to the protein 
under study. 

The other consideration with undertaking FRET- 
based distance measurements of homomeric proteins 
concerns the orientation factor (referred to as in the 
literature). Determining this value accurately is vital to 
interpreting distances from FRET efficiency measure¬ 
ments. If multiple donors and multiple acceptors are 
attached to the protein, calculating the orientation fac¬ 
tor accurately is problematic, and molecular dynamic 
simulations might again be required. 
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Liposome Efflux Assays 

Membrane-bound MS ion channels that have been 
cloned, recombinantly expressed and purified can nor¬ 
mally be readily reconstituted into artificial liposomes 
(see Reconstituting Mechanosensitive Channels in 
Lipids above). By filling these liposomes with a high 
concentration of a self-quenching fluorophore such as 
calcein or carboxyfluorescein, channel activity can be 
quantified according to the leakage of these 
fluorophores from their liposomal compartments into 
the surrounding media. Once free of the conhnes of the 
liposome, overall fluorescence is increased as the dye 
becomes dispersed and is no longer self-quenched. The 
method of liposome creation for this assay typically 
involves extruding lipids containing reconstituted pro¬ 
tein in a solution of very highly concentrated 
fluorophore (typically of the order of 50 mM) through 
a membrane. This typically produces unilammelar ves¬ 
icles of a size corresponding to the pore size of the 
membrane used. Extruders for this application are 
commercially available (AVESTIN). Following extru¬ 
sion, the liposomes can be separated from their 
fluorophore containing media by using a Sephadex 
G25 gravity fed column. 

After the separation of free dye, the liposomes are 
ready to be used for whatever experimental assay is 
required. Results can be easily measured using 
a fluorescence plate reader. As a positive control, the 
detergent Triton-X 100 is typically added to break open 
all the liposomes and release all the fluorophore to the 
surrounding media causing a considerable increase in 
overall fluorescence. 

Electron Paramagnetic Resonance (EPR) 

Site-directed spin labeling (SDSL) and electron para¬ 
magnetic resonance (EPR) spectroscopy presents an 
attractive approach for the study of the structure and 
molecular dynamics of MS charmels. In SDSL, a single 
cysteine is introduced into a protein with all non- 
disulhde bonded cysteines removed by site-directed 
mutagenesis. This unique cysteine is modified with 
a nitroxide spin label, characterized by an N-O 
group containing the unpaired electron necessary to 
produce the EPR signal. Measurement of the distance 
between the spin label probes in multimeric MS 
channels, and the proteins in general, allows for the 
determination of the spatial orientation of the protein’s 
secondary structural elements. This enables modeling 
of the MS channel structure with a spatial resolution at 


the level of the backbone fold. Solvent accessibility 
(aqueous vs membrane environment) and the polarity 
of the spin label microenvironment contrast transmem¬ 
brane versus extracellular channel domains, as well as 
their movements between the two environments. The 
distances between the structural channel domains 
can be measured in the range ^0.5-8 nm by the 
combination of continuous wave (CW) and pulsed 
EPR methods. Moreover, the structural dynamics of 
the channel domains can, by EPR spectroscopy, be 
detected on the millisecond timescale (McHaorab and 
Perozo 2000). 

SDSL EPR spectroscopy was instrumental to 
unravel the nature of the conformational transition 
from the closed form of MscL of E. coli to the open 
form that X-ray crystallography has, thus far, not been 
able to characterize. Changes in membrane intrinsic 
curvature induced by the asymmetric addition of the 
amphipath lysophosphatidylcholine (LPC) (see Open¬ 
ing Channels by Amphipaths above), can create large 
EPR spectroscopic changes. In the open state, the 
residues forming the channel gate are further apart, 
resulting in an EPR spectrum that appears sharper, 
and exhibits an overall change in line shape relative 
to the closed state. MscS of E. coli is another MS 
channel whose structural dynamics has been deter¬ 
mined in a similar way using CW-EPR spectroscopy 
(Kubalski and Martinac 2005). 

Molecular Dynamic Simulations 

To perform molecular dynamics (MD) simulations of 
a protein, such as an ion channel, requires knowledge 
of its 3D structure at an atomic resolution, usually 
obtained by X-ray crystallography. To date, MscL 
and MscS are the only MS channels fulfilling this 
requirement (Kubalski and Martinac 2005). MD sim¬ 
ulations of both channels have aimed primarily at 
unraveling the gating pathway of these channels, and 
at how the channel proteins detect changes in mem¬ 
brane bilayer tension or composition. These studies 
focus on molecular interactions such as van der 
Waals forces and hydrophobic matching, as well as 
hydrogen bonds with the lipid head groups. Simula¬ 
tions have also been employed to address the question 
as to why changes of the lipid headgroups, or the length 
of the acyl chains, alter the membrane tension required 
to gate an MS channel (Corry and Martinac 2008). The 
hydrophobic lock of MscL and MscS has also been the 
focus of molecular dynamic simulations, particularly 
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with regard to the hydration properties of the channel 
(Anishkin et al. 2010). 

Large conformational changes in MscL have only 
been observed when the protein is acted upon by an 
external force. Consequently, most of the MscL simu¬ 
lations have been carried out to investigate the nature 
and sequence of the channel structural changes under 
the influence of a force applied externally. The only 
difference between these studies consisted in the way 
the external force was applied to the MscL protein. The 
manner of the external force application ranged from 
surface tension applied to MscL without a lipid bilayer, 
direct force application to the outer TM2 helix, aniso¬ 
tropic pressure coupling to a bare MscL protein, and 
pressure applied to the bilayer around MscL. 

In the MscS case, most of the computational simu¬ 
lations have dealt with the question about the channel 
conduction state represented by its crystal structure 
which was published in 2002 (Kubalski and Martinac, 
2005). All MD simulation studies suggest that the 
crystal structure does not represent the fully open 
state of the pore, but rather indicate that the structure 
may represent an inactive state of the channel. As the 
nature of both the open state and closed state crystal 
structures of MscS has not been completely resolved, it 
is difficult to determine the MscS gating pathway from 
simulation studies. This is largely due to the short 
nanosecond timescales of molecular studies. 

Summary 

This entry presents an introduction into the special 
scientific techniques used to identify and characterize 
MS ion channels, both in vitro and in vivo. Since the 
first MS channel was identified in 1984 (Guharay and 
Sachs 1984), many research hours have been devoted 
to better understanding these channels and their 
mechanisms of action. There is much, however, that 
is still mysterious about these channels in general, and 
it is hoped that the brief descriptions presented here 
will encourage further research into the mechanosen¬ 
sitivity paradigm. 

Cross-References 

► EPR 

► Fluorescence Resonant Energy Transfer (FRET) 


► Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and FRET 

► Macromolecular Crystallography: Overview 

► Molecular Dynamics Simulations of Lipids 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 
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Membrane Fluidity 
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Synonyms 

Mobility of membrane components 

Definition 

The variety of anisotropic motions which contribute to 
the mobility of components in the biological membrane 

Introduction 

In 1972, Singer and Nicolson (Singer and Nicolson 
1972) suggested the so-called fluid mosaic model of 


the biological membrane (Fig. 1). This useful hypoth¬ 
esis explained many phenomena occurring in model 
and biological membranes. According to this model, 
membrane proteins and other membrane-embedded 
compounds are suspended in a two-dimensional fluid 
formed by phospholipids. This fluid state of membrane 
lipids is critical for membrane function. It allows, for 
example, free diffusion and equal distribution of new 
cell-synthesized lipids and proteins; lateral diffusion of 
proteins and other molecules in signaling events and 
other membrane reactions; membrane fusion, that is, 
fusion of vesicles with organelles; separation of mem¬ 
branes during cell division; etc. 

Membrane fluidity, which describes the ease of 
movement for molecules in the membrane environment, 
is a general concept that lacks a precise definition. It is 
much broader than the strict physical definition of flu¬ 
idity as the reciprocal of viscosity in the case of isotropic 
liquids. In general, “membrane fluidity” implies various 
anisotropic motions, which contribute to the mobility of 
components of a membrane. 

The lipid membrane, as a whole, shows a unique 
combination of fluidity and rigidity. In terms of the 
solubility and diffusion of small nonpolar molecules, 
the membrane behaves very much like an oil drop. In 
contrast, the translational diffusion constants of lipids 
and proteins in membranes are characteristic of media 
with the viscosity over two orders of magnitude greater 
than that of oil such as hexadecane. Also, in most 
cases, the membrane represents an impermeable 
barrier for ions and other hydrophilic compounds. 

Figure 2 shows characteristic frequencies (reciprocal 
of characteristic times) of different kinds of molecular 
motions in the membrane in comparison to frequency 
ranges in which various spectroscopic techniques are 
sensitive to molecular motion (Cennis 1989). 

Physical Parameters Associated with 
Membrane Fluidity 

Diffusion Constants 

Diffusion is the random movement of a particle due to 
exchange of thermal energy with its environment. 
Membrane lipids and proteins participate in highly 
anisotropic translational and rotational diffusion 
motion. Translational diffusion in the plane of the 
membrane is described by the mean square lateral 
displacement after a time At: = ADjtst. Lipid 







Membrane Fluidity 


1441 


M 



OLIGOSACCHARIDE 
SIDE CHAIN 


PHOSPHOLIPID 


CHOLESTEROL 


GLOBULAR 
PROTEIN 

HYDROPHOBIC 
SEGMENT OF 
ALPHA-HELIX PROTEIN 


ALPHA-HELIX PROTEIN 


GLYCOLIPID 


Membrane Fluidity, Fig. 1 Singer-Nicolson model of fluid membrane (From Bretscher (1985)) 


I-1 Lipid “flip-flop” 

I-1 Protein rotation 

Motion ' -' Lateral “hopping” (lipids and proteins) 

I-1 Lipid rotation (about long axis) 

I-1 Trans/gauche isomerization 
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Membrane Fluidity, Fig. 2 The characteristic frequencies of molecular motions of membrane proteins and lipids compared with 
the frequency ranges in which various spectroscopic techniques are sensitive to molecular motion (Modified from (Gennis 1989)) 


lateral diffusion coefficients in fluid phase bilayers are 
typically in the range D-y ^ 10“** to 10“^ cm^/s (Marsh 
1990). 

Methods for measuring translational diffusion coef¬ 
ficients for lipids in membranes can be classified into 
two distinct categories. In the first one, which corre¬ 
sponds to short-range diffusion measurements, D is 


obtained from determination of frequencies of 
bimolecular collisions within the membrane, through 
fluorescence quenching, ESR, etc. In the second one, 
which corresponds to long-range diffusion measure¬ 
ments, D is usually determined from the time required 
to fill a defined region of the membrane. The 
most popular fluorescent technique for studying 
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long-term diffusion is fluorescence recovery after 
photohleaching (FRAP) (Gennis 1989) which can 
measure translational diffusion coefficients in the 
range from about 10”^ to 10” ^ cm 7 s. Among mag¬ 
netic resonance techniques for measuring long-range 
diffusion are dynamic imaging of diffusion by ESR 
(DID-ESR) (Freed 1994), which has been used to 
measure 10 ”^ > Dj > 10 ”*° cm^/s, and pulse field 
gradient (PFG) NMR spectroscopy (Price 1997). 

Rotational diffusion is characterized by the mean 
square angular deviation during the time interval At: 
(0^) = ^D^At. Highly anisotropic motion, typical for 
lipid molecules in the membrane, is usually described 
by two rotational diffusion coefficients /)r|| and 
which correspond to diffusion about the long diffusion 
axis and perpendicular to it, respectively. The diffusion 
coefficients are related to corresponding rotational cor¬ 
relation times measured by NMR, ESR, fluorescent 
depolarization, etc., as: Tr|| = 1 / 6 £>r||; trx = 1 / 6 £^R 4 - 
For fluid phase bilayers, the typical rotational diffusion 
coefficients are of the order of T)r|| ^ ( 1 ^) 10 ** s”* and 
75 rj_ (l^)TO^ s ”* (Marsh 1990) This example can 

help in visualizing lipid motion in the membrane: for an 
area per lipid of 60with DrN and Dj of 4-10“ s” * and 
110 cm/s, respectively, one can see that a lipid 
molecule travels a distance of approximately its width, 
while rotating once around its long axis. A value of Dj 
of 10 ”* cm^/s, typically measured for lipids in 
biomembranes, corresponds to a net distance traversed 
of about 2 pm in 1 s. 

Flip-Flop Diffusion 

Lipid molecules, in principle, can exchange between 
the two monolayers of the bilayer. For polar lipids, it is 
an extremely slow process with characteristic times of 
hours or even days (Homan and Pownall 1988). For 
membrane proteins, no appreciable flip-flop mobility 
has yet been observed, in good accord with the fact that 
inner and outer leaflets of natural membranes are usu¬ 
ally asymmetric with respect to their protein and lipid 
composition. On the other hand, cholesterol has 
a relatively high rate of spontaneous flipping between 
two membrane leaflets (ti,.^ ^ 1 s) (Lange et al. 1981). 

Order Parameters 

The membrane lipid layer is an ordered liquid crystal 
phase with the preferred orientation of the lipid mole¬ 
cules perpendicular to the membrane plane. By 


definition, if 0 is the angle of the long molecular axis 
with respect to the bilayer normal, the order parameter 
S, which is a measure of the orientation distribution, is 
given as the average of P2(cos9), the second Legendre 
polynomial: S — (R 2 (cos 0 )) = ^(Scos^H — l). One 
can see that S varies between —1/2 and 1. 
These limiting cases have the following meaning: 
when S = 1, all molecules are exactly perpendicular 
to the membrane plane. When S = —1/2, 9 = 90°, and 
all molecules have their long axis parallel to the mem¬ 
brane surface. The case of S = 0 usually corresponds to 
a random distribution of molecular axes relative to the 
membrane normal. In energy terms, the rotation of the 
molecular long axis in the liquid crystal is restricted 
within an orienting potential that is simply approxi¬ 
mated as U{9) = 1 ■ cos^d, where 1 is the strength of 
the potential. The ordering of the lipid chain relative to 
the bilayer normal can then be expressed as: 

Szz = 5(3cOS^0 — l) 


IS (3cos20 - 

1 ) exp 

m 

kT 

sin 9d9 

fo exp 

u(0y 

kT 

sin 9d9 


A variety of physical techniques allow for determi¬ 
nation of order parameters, separately (NMR), or 
simultaneously with mobility parameters (fluores¬ 
cence depolarization, fluorescence correlation spec¬ 
troscopy, ESR). 

Permeability 

The passive permeability of lipid membranes is 
another fluidity related parameter. For many nonpolar 
molecules, the predominant permeation pathway is 
solubility-diffusion, a combination of partitioning 
and diffusion across the bilayer, both of which depend 
on lipid fluidity. 

Phase State and Membrane Fluidity 

A remarkable property of lipid bilayers is their struc¬ 
tural phase transitions (thermotropic polymorphism). 
For example, fully hydrated pure diacyl- 
phosphatidylcholines exhibit one fluid phase, Lo,, and 
three crystalline phases, Pp, L'p, and (Gennis 1989) 
(Table 1). Because of the high degree of disorder 
caused by defects, the Pp and L'p phases are usually 
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Membrane Fluidity, Table 1 Translational diffusion coeffi¬ 
cients of lipids and order parameters in some membrane phases 


Phase 

Df 

S“ 

Liquid-crystalline (L^) 

10^** cm^/s 

0-0.2 

Gel (Lp) 

10^" cm^/s 

0.2-0.9 

Gel (Pp) 

Similar to Lp, but the bilayer 
is rippled 


Gel (L'p) 

The same as Lp, but the 
chains are tilted 32“ 


Liquid-ordered (Lq) 

10^** cm^/s 

0.2-0.9 


“Measured by NMR 


called gel phases. The Pp phase is sometimes called 
a “ripple phase” because the surface of the bilayer is 
rippled and presents a wave-like appearance in elec¬ 
tron micrographs (Fig. 3). Depending on the nature of 
the lipid and presence of additional components (cho¬ 
lesterol, etc.), the Pp phase may be present or absent in 
the phase diagram, and a tilted gel L'p could be 
replaced by the Lp phase, having similar physical 
properties, but no tilt of the hydrocarbon chains. 

In the gel phase, lipid chains are usually well 
aligned with little rotation around the C-C bonds 
which are predominantly in the trans position. The 
lipid chains are tightly packed, the chain ordering is 
high, the bilayer thickness is maximal, and the surface 
area per lipid headgroup is relatively small. The phys¬ 
iological importance of gel-like phases is limited. 

At the “main transition” temperature, T^ the gel 
phase undergoes a transition to the (liquid crystal) 
phase. At the transition point, the surface area 
increases, and the bilayer thickness and chain order 
decrease. In the fluid phase, hydrocarbon chains tend to 
contain a larger number of gauche isomers (Yeagle 
2005). At this transition, the DSC (differential scan 
calorimetry) shows a sharp peak in the heat capacity 
occurring over a narrow temperature range. The tran¬ 
sition between L'p and Pp phases also can be detected 
by DSC and is called the pretransition. The transition 
between Lg and L'p phases is in most cases hard to 
observe due to typical supercooling of the L'p phase. 
Depending on the membrane composition, hydration, 
and temperature, a number of 2D and 3D nonlamellar 
lipid phases are possible, including the well-studied 
hexagonal (Hj and Hu) and cubic phases. 

Natural biomembranes contain a complex mixture 
of various phospholipids with cholesterol and 
sphingomyelins. In general, they exist in the fluid 
phase. Maintaining membrane fluidity seems to be 
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Membrane Fluidity, Fig. 3 Organization of the lamellar 
bilayer phases of DPPC in the fluid (Lo,), ripple (Pp), gel (L'p), 
and pseudocrystalline (Lo) states. A top view of the packing of the 
hydrocarbon chains is shown in the last column (From (Jain 1988)) 


extremely important for the survival of the cell and 
the whole organism. It is well known for model mem¬ 
branes that a decrease in the chain length or the intro¬ 
duction of unsaturation into the hydrocarbon chain 
causes a decrease in the main transition temperature. 
Consistent with this observation, microorganisms, 
plants, and animals (poikilotherms or hibernating 
mammals) are acclimated to low temperatures by alter¬ 
ing their membrane lipid composition, increasing the 
degree of lipid unsaturation or decreasing the average 
chain length (McElhaney 1984). 

Cholesterol and Membrane Fluidity. Cholesterol, 
which is an important constituent of cell membranes, 
plays a crucial role in maintaining membrane fluid¬ 
ity. It effectively inhibits the transition to the gel 
phase. Even though some plasma membranes, such 
as nerve myelin membranes, contain a high concen¬ 
tration of lipids that form gel phase bilayers, the 
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presence of cholesterol keeps these membranes in 
a fluid phase. However, interaction with the rigid 
cholesterol ring affects hydrocarbon chains of lipids 
in the liquid crystal phase (Lot) and leads to formation 
of a new phase, the liquid-ordered (Lq) phase (Quinn 
and Wolf 2009). The phase is well characterized by 
a variety of physical methods and does not exist in 
pure lipids or their mixtures. In the liquid-ordered 
phase, the long axis rotation and lateral diffusion 
rates are similar to the L„ phase, but the acyl chains 
are predominantly in an all-trans conformation, and 
hence, the order parameters are similar to the Lp 
phase (Table 1). Recently, the cholesterol-rich Lq 
phase has been strongly associated with 
microdomains in live cells - the so-called lipid rafts 
(Lingwood and Simons 2010). 


Fluidity Versus Mosaicity: Beyond 
the Singer-Nicolson Model 

The Singer-Nicolson model of the membrane 
played a very important role in understanding 
membrane structure and function. However, many 
properties of biomembranes are not consistent with 
the model. In recent years, a growing consensus 
points at more complex membrane structure, 
which can be characterized as “dynamically struc¬ 
tured fluid mosaic.” Compared to the original fluid 
mosaic model, the emphasis has shifted from fluid¬ 
ity to mosaicity. Experimental observations have 
led to the “membrane microdomain” concept, 
describing compartmentalization/organization of 
membrane components into nonrandom, well- 
defined patterns. 

One observation, which is inconsistent with the 
simple fluid mosaic model, is the reduced diffusion 
coefficients of membrane molecules in the plasma 
membrane compared to model membranes and 
biomembranes deprived of their cytoskeleton 
(blebs). Another observation is the oligomeriza¬ 
tion-induced slowing of diffusion (Kusumi et al. 
2005). It manifests itself in much greater effect of 
diffusant size on the translational diffusion rate than 
predicted by the theory of Saffman-Delbriick (SD) 
based on the Singer-Nicolson model (Saffman and 
Delbriick 1975). If a transmembrane protein is 
approximated as a rigid cylinder of radius r and 
height h, floating in a two-dimensional liquid of 


viscosity 77 with matching thickness (/ 1 ) surrounded 
by an aqueous medium of viscosity 77 ^, the SD theory 
gives the following expression for its translational 
diffusion coefficient 0^: 


Dj — 


keT 

Anr]h 



where y is the Euler constant («0.577216). This for¬ 
mula predicts a very weak dependence of translational 
diffusion on the size of diffusant. For example, for 
a tenfold increase in the protein radius, from 5 to 
50 A, the theory predicts for a 50 A-thick bilayer 
a decrease in the diffusion rate by a factor of 1.57. 
Although this insensitivity looks somehow counterin¬ 
tuitive, the Saffman-Delbriick formula gives good 
results for proteins incorporated into model 
membranes. 

On the contrary, for the plasma membrane, the 
effect of oligomerization on the diffusion rate is 
much stronger than predicted by the formula. For 
example, for linked couples of GFP (green fluorescent 
protein) - E-cadherin - oligomerization with an aggre¬ 
gation number between 2 and 10 slows down the 
diffusion up to 40 times (lino et al. 2001). Also, 
single-particle tracking experiments carried out by 
Kusumi and coauthors showed that in natural 
biomembranes, the diffusion does not follow usual 
Brownian patterns but consists of a series of random 
Brownian walks within confined areas (compartments) 
followed by longer-distance hops between compart¬ 
ment (see trajectory in Fig. 4). The compartment size 
varies depending on the cell nature but is relatively 
insensitive to the diffusant, showing the same size for 
transmembrane proteins and phospholipids. For exam¬ 
ple, in the case of normal rat kidney epithelial (NRK) 
cells, the average compartment size is 230 nm with an 
average residency time within the compartment of ^ 
11 ns (Kusumi et al. 2005). 

The hop-diffusion pattern cannot be found in 
liposomes or membrane blebs. In these membranes, 
the membrane molecules show simple Brownian 
diffusion with a single diffusion coefficient (Fujiwara 
et al. 2002 ). 

Because the assumption of simple Brownian 
diffusion breaks down, the diffusion in biomembranes 
cannot be described by a single diffusion coefficient. 
For instance, FRAP experiments in the plasma mem¬ 
brane showed that the observed translational diffusion 
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Fig. 4 Membrane skeleton 
fence and anchored 
transmembrane picket model 
(From (Ritchie et al. 2003)) 



Membrane-Skeleton 
“Fence” Model 


Anchored-Transmembrane 
“Picket” Model 


k Transmembrane protein 


Membrane skeleton (MSK) 


proteins 


rates depend on the size of the initial photohleached 
spot, which is also inconsistent with a simple Singer- 
Nicolson model. 

Accumulating evidence clearly points at involve¬ 
ment of the cell cytoskeleton in the compartmentaliza- 
tion of the membrane, in particular the fine 
cytoskeleton filaments formed by actin in most eukary¬ 
otic cells or spectrin in mammalian red blood cells. 
However, single-particle tracking experiments show 
the same patterns of hop-diffusion for lipid molecules 
located in the extracellular leaflet of the plasma mem¬ 
brane. How can the membrane skeleton, which is 
located only on the cytoplasmic surface of the mem¬ 
brane, suppress the motion of lipids on the extracellu¬ 
lar side? 

To reconcile this apparent contradiction, the mem¬ 
brane skeleton fence and anchored transmemhrane 
picket model was proposed (Ritchie et al. 2003). 
According to this model, transmemhrane proteins 
anchored to and lined up along the membrane skeleton 
(fence) effectively act as a row of posts for the fence 
against the free diffusion of lipids (Fig. 4). This model 
is consistent with the observation that the hop rate of 
transmemhrane proteins increases after the partial 
removal of the cytoplasmic domain of transmemhrane 
proteins but is not affected by the removal of the major 
fraction of the extracellular domains of transmemhrane 
proteins or extracellular matrix. Within the compartment 
borders, membrane molecules undergo simple Brownian 
diffusion. In a sense, the Singer-Nicolson model is ade¬ 
quate for dimensions of about 10 x 10 nm, the special 
scale of the original cartoon depicted by the authors in 
1972. However, beyond such distances, simple exten¬ 
sions of the fluid mosaic model fail and a substantial 
paradigm shift are required from a two-dimensional 
continuum fluid to the compartmentalized fluid. 


Summary 

Membrane fluidity is a general concept. It implies 
various anisotropic motions, which contribute to the 
mobility of components of a membrane. The so- 
called fluid mosaic model of the biological membrane 
(Singer and Nicolson 1972) assumes that membrane 
proteins and other membrane-embedded compounds 
are suspended in a two-dimensional fluid formed by 
phospholipids. This useful model explained many 
phenomena occurring in model and biological mem¬ 
branes. However, in recent years, a growing consen¬ 
sus points at more complex membrane structure, 
which can be characterized as “dynamically struc¬ 
tured fluid mosaic” with the cell cytoskeleton playing 
crucial role in the compartmentalization of the 
membrane. 

Cross-References 

► EPR Spectroscopy: General Principles 

► Functional Roles of Lipids in Membranes 

► Fluorescence and FRET in Membranes 

► Interspin Distance Determination by EPR 

► Lipid Flip-Plop 

► Lipid Lateral Diffusion 

► NMR 

► Phase Transitions and Phase Behavior of Lipids 

► Pulsed Pield Gradient NMR 

► Single Particle Tracking 
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Membrane Ionic and Capacitative 
Currents 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 


Membrane Lipid Electrostatics 

Gregor Cevc 

The Advanced Treatments Institute, Gauting, 
Germany 

Introduction 

lonizable polar residue(s) on lipid membrane 
headgroups can gain net charges in polar solvents, 
such as water, dependent on the aggregated lipids as 
well as the dissolved proton concentration (see the 
Table 1). Ions also interact with lipid charges and 
thus affect membrane properties. The right panel of 
Fig. 1 illustrates, for example, sodium chloride effects 
on some phase transitions in fully hydrated zwitter- 
ionic phosphatidylcholine bilayers. The left panel 
reveals charge-driven accumulation of sodium coun¬ 
terions near the corresponding anionic phosphatidyl- 
glycerol bilayers. Electro-osmotic phenomena 
involving living organism membranes are more com¬ 
plex but fundamentally similar. 

Membrane-Solvent Interactions 

Even the simplest boundary devoid of charges 
perturbs proximal solvent. Hydrogen bonds or 
their restrictions orient solvent molecules and 
enforce local density and polarization oscillations. 
The quasi-exponential envelope of the positive 
oscillations decays on a length scale of Aw 
0.1 nm. The short range of solvation allows spatial 
distribution and thermal motions of lipid polar res¬ 
idues to average-out the oscillatory solvent structure 
effects, which are consequently unobservable near 
lipid bilayers (Cevc 1990), unlike near hard, smooth 
surfaces. The small value of Aw also helps efficient 
screening of lipid partial charges hy the atomic 
partial charges on each solvent molecule. 
Consequently, the polar moieties orientation and 
motion, partial charges density distribution, and 
the restricted, quasi-exponentially changing, sol¬ 
vent-accessible space within each lipid-solvent 
interphase together govern membrane lipid electro¬ 
statics and solvation. 
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Membrane Lipid Electrostatics, Table 1 Negative logarithm of the protonation constant, —log A„ = pA„ (or its apparent value 
pA^ipP) Pqj. main dissociable groups of ionic and zwitterionic lipids. The column “charges” specifies the headgroup associated 

charge(s), resulting from increasing pH of an aqueous medium 

Substance/lipid 

Charges 

P^X04.1 

P^X04,2 

P^COOH 

P^NH3 

R-NH 3 (above CMC) 

+;0 




10.6 (<9.5)“ 

R-COOH (above CMC) 

0 ;- 



4.8'’ (>6.4, 

)“ 

A-acetylneuraminic acid 

0 ;- 



3.9/2.6“ 


Ganglioside GM3 

0 ;- 



3.8 


Ganglioside GMl 

0 ;- 



4.7 


R-HSO 4 (above CMC) 

0 ;-; 2 - 

-1.7** (>-0.5)“ 




R-H 2 PO 4 (above CMC) 

0 ;-; 2 - 

1.6“(>2.5“) 

6 . 6 “ (>7.5)“ 



R,R’-Ptdlns 

0 ;- 

>2.7* 




R,R’-PtdGro 

0 ;- 

>2.9* 




R,R’-Ptd-OH 

0 ;-; 2 - 

>3.5* 

>9.5 



R,R’-PtdSer 

+;+-; 2+-;2 

!- <L 


>5.5 

>11.5® 

R,R’-PtdEtn 


<3^ 



>11.25® 

R,R’-PtdCho 

+;+- 

< 1 ' 





“For molecules with R equal or greater than C7 (for monocharged compounds: >C10) pST''*’’’ changes with the number of carbon 
atoms in the chains due to the increasing likelihood of self-aggregation above the critical micelle (or aggregation) concentration, 
CMC. This creates interfacial (electrostatic) effects, and — pK^ can then be as large as it 5 pH units. An approximate, but 

normally satisfactory, estimate of the effect is pA'"'®’’’ = pA^„ ± ApA^cCrtc) i ApA'p-(r 7 p,a) — ApA'dehyd. with ApArc(crc) ~ il/cJl.iksT, 
ApKp. ^ 1 and ApTfjgj^yd — AGdehyd/^'^^sT^' rfc and ftp depend on lipid aggregation/packing, s is a function of the titratable group s 
location/lipid aggregation. ApA^c is positive for anionic and negative for cationic membranes. ApA^p' has the same sign as ApA'c when 
protonation increases the total interfacial charge. It has the opposite sign otherwise. The magnitude of ApA^p- increases with 
interfacial depth of binding, as does, effectively, ACjehyd 

'’More precisely, formic, acetic, and propionic acids have pA'a = 3.8, 4.8, 4.9, respectively 

“The published values differ: the nonbracketed value is from the same source as for GM3 = NeuSAc(a,2 ^ 3)Gal(P,l ^ 4)Glc(P,l 
^ l’)Cer and GMl = Gal(P,l ^ 3)GalNAc((P,l ^ 4)[Neu5Ac(a,2 ^ 3)]Gal(P,l ^ 4)Glc(P,l ^ l’))Cer 
‘'Sulfuric acid dissociates with pAj = —2.8; 1.9, i = 1;2, its methyl- or alkyl-ester withpAj = —1.9 or — 1.7, and benzenesulfonic acid 
with pAj = —2.8; a plant sulfolipid, sulfoquinovosyldiacylglycerol, has pA“'’'’ < 0; cholesterol sulfate has pA“'’'’ > 0 (<3.5) 

“More specifically, beta-glyceryl-, methyl-, ethyl-, and «-propyl-phosphoric acid have pAj = 1.4; 6.2, 1.5; 6.3, 1.6; 6 . 6 , and 1.9; 6.7, 
for, i = 1; 2, respectively, whereas phosphoric acid has pAj = 2.15; 7.2; 12.4, i = 1; 2; 3 
'^Intrinsic pA„ 

®Phosphorylserine has pApo 4 ,i < 1 and pApo 4,2 = 5.8, pAcoon = 2.6, and pA^HS = 9.9, whereas phosphorylethanolamine has 
P^po 4 = 5.8 and pAnhs = 10.2. N-acetylneuraminic acid = Neu5Ac = sialic acid, Ptdins = phosphatidylinositol, PtdGro = 
phosphatidylglycerol, Ptd-OH = phosphatidic acid, PtdGroSer = phosphatidylserine, PtdGroEtn = phosphatidylethanolamine, 
PtdGroCho = phosphatidylcholine. R and R’ can be identical 


Membrane Dipole Potential 

Membrane-solvent interactions are coresponsible for 
the so-called dipole potential, i/tp, which for phospho¬ 
lipids changes from positive to negative on moving 
from the solvent toward the bilayer interior. The over¬ 
all i/tp value first increases steeply with partial charges 
area density, Cp, but then saturates rapidly and 
becomes nearly salt independent as i/tp(z = 0) = 2 
sinh“*(AwMwp) ~ 24 wMwp> according to the simplest 
mean-field description of a planar polar surface (z = 0). 
Awp = 2£oo£o^Br/ewt7p is the thermal energy {=kQT) 


excitation limit for the surface-bound water molecules 
with proton/defect charge evv = 8.915 x 10 “^°As.£s is 
the solvent relative permittivity, eo = 8.85 x 10”^^ As 
(Vm)“* is the permittivity of free space and Sao ~ 3.4. 
The best measurements suggest for pure diacyl- 
phosphatidylcholines with molecular area 
^0.65 nm^: ij/p ^ 0.4 V, corresponding to Iwp 
^0.45 nm. Lipids or lipid mixtures with smaller 
summed solvent-accessible partial charges have 
lower i/tp-values, and vice versa (fluid-chain diacyl- 
phosphatidylethanolamines or dialkyl-phosphatidyl- 
cholines: ipp ^ 0.25 V), unless their molecular area is 
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Membrane Lipid Eiectrostatics, Fig. 1 Electrolyte solution 
effects on the partial phase diagram of 1,2-dipalmitoyl-in-3- 
glycero-phosphatidylcholine (right panel) and the 
corresponding phosphatidylglycerol (left panel) multibilayer 
vesicles at pH ~ 7. Lipid chain-melting phase transition temper¬ 
ature, (Pp’ to Lc,: full symbols) and pretransition temperature, 
Tp (Lp’ to Pp’: open symbols) of zwitterionic phosphatidylcho¬ 
line both increase linearly with the negative logarithm of NaCl 
solution activity coefficient, The more complicated ion 
dependence of anionic phosphatidylglycerol phase transitions 
reflects the Coulombic electrostatic as well as the 


0 2 4 6 



non-Coulombic interfacial solvation effects. The former cause 
only 1.5°C, due to saturation of the innermost interfacial 

(“Stern”) layer with Na"^ counterions (ij/c ~ 0 V for [NaCl] > 
1.5 M). The non-Coulombic surface potential (ij/p ~ 0.4 V 
for [NaCl] ~ 0 M) is incompletely screened even in a saturated 
NaCl solution ([NaCl]buik ^ 6-3 M), as is evident from 
the fact that Pal 2 PtdGro headgroup protonation, which elimi¬ 
nates ij/c and diminishes ij/p, causes AT^ ~ 13°C. Inset: lipid 
charge-dependent accumulation of Na"^ counterions near an 
anionic Pal 2 PtdGro“ bilayer surface, as derived from Tp-shift 
analysis 


appreciably smaller (ordered-chains diacyl-phosphati- 
dylcholine: i/tp ^ 0.45 V). Atomistic computer models 
confirm lipid and solvent partial charges, rather than 
lipid headgroup dipoles, to be the main sources of i/tp. 
Such models need further improvement, however, as 
they currently overestimate the non-Coulombic poten¬ 
tial two- to threefold; they moreover suggest alkali 
metal ion binding near the yrt-2-carbonyl groups on 
uncharged phospholipids at low salt concentration, in 
conflict with experimental findings. 

Membrane Dielectric Profile 

Atomistic models lead to a description of interfacial 
partial charge distribution in terms of a displacement 
field; such a field is then expressible via a multipole 
moment expansion and a density distribution function. 
Phenomenological mean-field models rely on an 
interfacial relative permittivity function instead (Cevc 


and Marsh 1983). This function changes over the water- 
lipid interphase from the bulk electrolyte solution value, 
£s 5; 80, down to the value measured for the lipid bilayer 
interior, ep « 2.5 (Marsh 2001). For each bilayer half one 
can conveniently parameterize the one-dimensional 
relative permittivity dependence as E(z,Zi„io,lipf) = El + 
(£s - £l){1 - exp[(zinto - z)Mim]}, for Z > Zinto and 
£(z,Zinto>'^int) = for z < Zjnto. Experiments suggest that 
the inner interphase thickness is Zjn(o = 0.4 nm (relative to 
the Gibbs dividing plane) and lint = 0.256 nm for fully 
hydrated fluid-chain phospholipid bilayers. Increasing 
lipid headgroup accessibility or size and/or decreasing 
the bulk electrolyte concentration or composition modify 
these parameter values moderately; however, the relative 
permittivity function (i.e., total effective interphase) 
width remains around 1 nm for all typical phospholipid 
model membranes. On the other hand sterol incorporation 
or lipid headgroup elongation widens the interphase (for 
gangliosides to around 2.5 nm, for erythrocyte 
membranes to 3.5 nm, for the bacterial outer wall up to 
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20 nm). Lower local relative permittivity within the lipid- 
solution interphase (e(z) < 8s) enhances electrostatic 
effects and the energetic cost of ion dehydration within 
the interphase, AGdehyd> proportional to the ion’s own 
hydration energy. Local solvent polarization can either 
support or hamper interfacial ion accumulation, 
depending on the relative sign and atomic distribution 
of the ionic charge(s). 

The Coulomb's, or Net Electrostatic, 
Potential of a Membrane 

Net charges on a membrane create the Coulomb’s 
electrostatic potential, ipc, which is initially propor¬ 
tional to the surface net charge density, Oq, but finally 
saturates at a solution-dependent value. The latter is for 
a uniformly charged planar surface in the Gouy- 
Chapman approximation given by i/^c,gc(z = 0) = 2 
sinh“* (-^dMgc)- The Gouy-Chapman length, 
Aqc = '^-Es^oksTIZeffc. defines the thermal energy dis¬ 
tance limit for ions carrying Ze charges. For example, 
a bilayer of fully ionized, fluid-chain, monocharged 
lipids with a molecular area around 0.65 nm^ has Gq 
^ 0.25 C m~^ and Aqc "^0.15 nm in 0.1 M monovalent 
ion solution at T = 298 K. This yields i/tc,Gc(z = 0) 
^ 0.12 V, and can keep only the innermost ionic layer 
thermally unexcited (see the Fig. 1). Further away from 
a charged membrane the so-called ionic diffuse double 
layer prevails, which is akin but not identical to the 
thermally smeared ionic clouds that screen ion charges 
in solution. Its thickness, the so-called Debye-Hiickel 
length, depends on the bulk ion concentration, c, the 
ion valence, Z, and £s, as well as T\ = (1/Zc) 
(£s£o^Br/2,000iVAc)°if = Z“; Na stands for the 
Avogadro’s constant. At room temperature and for 
c = 0.1 M, one has — 0.95 nm, but increasing 
corrections to the value are needed for c > 0.15 M 
(Varela et al. 2003). The range of membrane lipid 
electrostatics is thus not extendable by extra charging 
(4 gc ‘C Xo) but rather by lowering the bulk ion con¬ 
centration or else by widening the lipid net charge 
distribution. Near a charged membrane counterions 
(i.e., ions with the charge Ze having different sign 
than the membrane lipids net charge) are exponentially 
accumulated (cx exp \Ze\\ic^z)jk-^r\), whereas the con¬ 
centration of ions with a similar sign (coions) is 
depleted relative to the bulk concentration (see inset 
to Fig. 1). 


Ion Binding to and Distribution near 
a Membrane 

Ion binding to a lipid membrane is also possible. 
Higher ion valence and lipid charge both raise the 
probability and/or strength of such binding, via local 
ion concentration increase. Detailed experiments sug¬ 
gest that alkali metals (Z = -i-l), with the exception of 
protons and lithium ions, do not bind to uncharged 
phospholipids. The latter on the contrary bind negative 
monovalent anions (Z = —1), such as halides, in the 
order of the indirect Hofmeister series. In partial con¬ 
trast, some atomistic computer simulations suggest 
cation binding to uncharged lipids. A detailed com¬ 
puter simulation confirmed that negative polar or pos¬ 
itive apolar (i.e., hydrophobic) surfaces generally 
follow the indirect anionic Hofmeister series; the 
study correctly predicted the shift to the direct 
Hofmeister series for negative hydrophobic or positive 
hydrophilic surfaces and partial reversal of the series 
for surfaces of intermediate polarity or charge 
(Schwierz et al. 2010). With increasing ion valence 
(|Z| > 2), lipid net charge density and/or adjustability 
of partial charge distribution promotes lipid-ion bind¬ 
ing. Most alkali earth and other divalent or polyvalent 
ions therefore avidly associate with lipids, the smaller 
their radius the more so (Cevc and Marsh 1983; Cevc 
1990). Many highly polarizable organic ions tend to 
associate tightly with lipids too. Conversely, a high 
lipid headgroup polarizability can, but need not, 
suppress tight lipid-ion association. 

Membrane condensation, which sterically restricts 
ion penetration into the lipid-water interphase, nor¬ 
mally restrains ion-lipid binding, except when several 
lipid molecules are coinvolved. A lipid phase transi¬ 
tion therefore often releases ions or triggers extra ion 
binding to a membrane. For example, a charged lipid 
membrane typically expels small interfacially bound 
ions during a membrane disordering phase transition; 
ion pulses may thus be generated. However, organic 
ions that typically insert themselves deep into the 
interphase bind more easily and stronger to a lipid 
membrane with fluid than with ordered lipid tails. 
A membrane-condensing phase transition conse¬ 
quently expels such ions from the interphase. 

In turn, lipid membranes respond to changes in 
interfacial ion concentration. For example, ion binding 
typically increases packing density of initially charged 
and decreases packing density of initially neutral 
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lipids. Raising the bulk ion concentration condenses 
charged as well as uncharged lipid membranes, even 
without ion binding. Despite this, most lipid bilayers 
swell more in more concentrated (electrolyte) solu¬ 
tions, owing to ion-dependent screening and weaken¬ 
ing of the van der Waals’ attraction. Relatively large 
and hydrophilic ions bound to a membrane moreover 
increase interphase thickness, lipid mobility, mem¬ 
brane solvation, and membrane repulsion - especially 
when the bound ions are organic. Some such ions can 
even solubilize lipid membranes. On the contrary, 
small inorganic ions, which bind tightly to the main 
membrane water-binding sites, typically diminish lipid 
headgroup hydrophilicity and rigidify lipid bilayers. In 
parallel, such bound ions slow down lipid lateral 
diffusion. Tight ion-lipid complexes coalesce into 
(quasi-)anhydrous aggregates in excess electrolyte 
solution. Increasing the bulk ion concentration or 
replacing ions with a low affinity for lipids with higher 
lipid-affinity ions can therefore induce isothermal 
phase transitions in lipid membranes (cf. Fig. 1). The 
resulting phase transition temperature shift is inversely 
proportional to the phase transition enthalpy and there¬ 
fore typically smaller for the longer-chain lipids. 

The Nernst's, or Transmembrane, 
Electrostatic Potential 

Separation of different electrolytes (e.g., with different 
concentrations Ci and C 2 ) by a membrane creates 
a transmembrane ion activity gradient and an electro¬ 
static tmnsmemhrane potential “jump,” the so-called 
Nernstpotential. = (k^TIZe) In (C 1 /C 2 ). Membrane 
lipid asymmetry also generates a transmembrane poten¬ 
tial gradient, di/'c 'C owing to transbilayer elec¬ 
trostatic coupling. In either situation, the resulting total 
potential gradient influences transmembrane ion trans¬ 
port depending on the sign of the gradient. Ion flow (i.e., 
electrical current) across a lipid bilayer is mainly driven 
by and is roughly proportional to the electrostatic trans¬ 
membrane potential difference, zIi/'n + The mem¬ 
brane permeability barrier, which includes lipid- 
solution interphase and thus ij/p, opposes the transport. 
Steric effects and membrane surface hydration conse¬ 
quently influence transmembrane ion migration. Lipid 
membrane low permeability to ions that do not dehy¬ 
drate in the interphase reflects the high energetic cost of 
ion-charge transfer from the polar bulk phase into the 


apolar bilayer interior. Transmembrane pores, mem¬ 
brane defects, or ion-carrier complexes can lower this 
energy barrier by up to 3-5 orders of magnitude. Effec¬ 
tive lipid dehydration, in turn, increases the barrier. An 
ordered-chains bilayer is approximately one order of 
magnitude less permeable than a bilayer with fluid 
chains. A lipid bilayer undergoing an order-disorder 
(chain-melting) phase transition, which creates numer¬ 
ous domain boundaries, membrane defects, or pores, is 
10-100 times more permeable to ions than a uniformly 
fluid bilayer. Lipid rafts are effective transport pro¬ 
moters, for similar reason, as are certain single-chain 
lipids (such as fatty acids) that can act as transmembrane 
ion carriers. Membrane electrostatics modulates all of 
these phenomena by affecting interfacial ion concentra¬ 
tion and by increasing the probability of pore and ion- 
carrier formation. 

Membrane Electrostatics Modeling 

Electrostatics of a lipid bilayer represented as a uniformly 
charged planar surface is exactly described by the 
“dressed-ion” diffuse double layer theory (Kjellander 
and Mitchell 1997). Despite this, the classical analytical 
solution to the Poisson-Boltzmann equation for such 
surface - the so-called Gouy-Chapman’s approximation 
- remains the most popular tool for theoretically describ¬ 
ing membrane electrostatics (Cevc 1990; Langner and 
Kubica 1999), although it neglects charge- and solvent- 
size, as well as polarization effects. Refinement by inclu¬ 
sion of counterion binding and adsorbed ion-size effects 
yields the Gouy-Chapman-Stem model, which affords 
a better, but for lipid membranes little more realistic, 
data fit, owing to the additional adjustable parameters. 
Any electrostatic data description excluding intrinsic 
lipid headgroup and solvent structure is intrinsically 
unreliable. One should therefore account for interfacial 
solvation at least phenomenologically, e.g., within the 
framework of a nonlocal electrostatic mean-field approx¬ 
imation (Dogonadze et al. 1985). This then affords an 
effective, spatially variable, interfacial dielectric constant 
profile that allows incorporation of lipid headgroup struc¬ 
ture and distribution effects, but involves solving two 
coupled Poisson-Boltzmann-like equations (Cevc and 
Marsh 1983). Solving a larger set of differential nonlocal 
electrostatic equations can even describe electrostatic 
properties of complex molecular or segmental ensem¬ 
bles, including membrane proteins (Weggler et al. 2010). 
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Combining an atomistic molecular dynamics simulation 
with the mean-field Poisson-Boltzmann equation for the 
solvent provides an attractive hybrid description of sur¬ 
face electrostatics (Schwierz et al. 2010). Such theoreti¬ 
cal approaches allow describing membrane lipid 
electrostatics to an increasing level of detail and enable 
new information extraction via versatile experimental 
data analysis, given sufficient computing power. The 
biggest present issue is a lack of a consensual and robust 
set of realistic force-field parameters, which still pre¬ 
cludes similarity of different groups’ or software package 
results (Demchenko and Yesylevskyy 2009). 

Thermodynamics of Membrane 
Electrostatics 

A membrane and its suspending electrolyte are one 
thermodynamic unit. Changing properties of one system 
component therefore influence characteristics and/or 
behavior of the rest. Most influential are modifications 
within the interfacial region, where ions and/or water 
molecules can bind to the polar lipid residues. This is 
reflected in around a ten times higher hydration-free 
energy compared with the corresponding Coulomb’s 
free energy (10 > 1 kJ mol“') and in the short range of 
solvation phenomena. Intermembrane pressure (which 
is identical to the spatial derivative of membrane lipid 
free energy) or the lateral membrane pressure (which 
corresponds to the free energy derivative with regard to 
the surface area) senses and reflects this large difference. 
Near a lipid membrane, the repulsion and lateral pressure 
of solvation are thus normally orders of magnitude stron¬ 
ger than the corresponding Coulomb’s pressure. At 
greater separations (>2-3 nm), the Coulomb’s repulsion 
prevails, due to its relatively long range ^ iw)- 
Attractive electrostatic intermembrane interactions 
stemming from charge-charge correlation effects can 
generate intermembrane attraction that exceeds the stan¬ 
dard van der Waals’ attraction up to a distance of several 
nanometers. 

Experimental Studies of Lipid Membrane 
Electrostatics 

Many modem membrane electrostatics assays are specific 
(such as anomalous X-ray diffraction/refraction, surface- 
enhanced spectroscopy), highly resolving (such as atomic 


interfacial force microscopy), label-free (such as lipid 
properties titration), but not free of pitfalls (such as sub¬ 
strate or local perturbation effects). Older tests of lipid 
membrane electrostatics are less direct and thus often 
subject to misinterpretation. Some follow colloidal prop¬ 
erties of lipid vesicles (such as mobility in external elec¬ 
trical held (zeta potential) or response to such held (cyclic 
voltagram)) or ion effects on membrane disjoining and/or 
intermembrane pressure. Others measure charged label 
binding or transport, etc. Any experimental data analysis 
is anyhow delicate when it comes to an individual elec¬ 
trostatic parameter derivation: correctness of the conclu¬ 
sions relies on allowing for all the relevant phenomena in 
data analysis. At minimum, a good model should account 
for the net- (and ideally partial) charge distribution and 
relative permittivity prohle (lowered dielectric constant 
near the polar lipid groups) within the lipid-solution inter¬ 
phase. Presumption of a uniformly charged, inhnitely 
narrow interface, as in the simple Gouy-Chapman 
(-Stem) model, for (phospho)lipid bilayer membranes 
produces fortuitous data hts at best. Label-based results 
can, but need not, rehect more than generally local, or the 
locally perturbed, bilayer lipid electrostatics. 

Summary 

Electrostatics plays an important role in controlling 
lipid membrane properties. It is involved in charged 
lipid-lipid, lipid-solvent, and lipid-ion interactions, as 
well as the resulting molecular distributions. Electro¬ 
static contributions - and, in turn, sensitivity - to the 
membrane permeability barrier are also essential for 
biological membrane functioning, given that 
a membrane and its bathing solution are thermodynam¬ 
ically nonseparable. Proper modeling of bilayer and 
solvent electrostatics is thus the key to understanding 
such complex systems, ideally based on the detailed 
experimental data gathered with progressively better 
resolving experimental techniques and combined with 
sufficiently detailed theoretical modelling. 
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Membrane Protein Function 

Alan Goddard, Joanne Oates and Anthony Watts 
Biomemhrane Structure Unit, Department of 
Biochemistry, University of Oxford, Oxford, UK 

Synonyms 

7-Transmembrane Domain Receptors 

Definition 

The varied functions performed by specific membrane 
proteins and peptides including, for example, enzy¬ 
matic reactions, membrane transport, cell communica¬ 
tion, light harvesting, membrane disruption, and cell 
adhesion. 

Introduction 

Membrane proteins account for approximately 30% of 
the proteins encoded by a typical genome (Wallin and 
von Heijne 1998). This abundance gives some indica¬ 
tion of the diversity of function of this class of protein. 
Typically, they are involved in regulating the cell’s 
response to, and interaction with, the environment. 
Each of these processes is critical to correct cellular 
function, and the examples below serve to highlight the 
importance of this class of proteins (Smith 2010). 

Receptors 

Receptors on the cell surface transduce signals from 
the external environment across the cellular mem¬ 
brane, leading to downstream responses. They detect 
a wide variety of stimuli and result in an equally 
diverse array of cellular responses. Due to the diffi¬ 
culty in obtaining structural data on membrane- 
spanning proteins, receptors are often classified by 
their predicted topology - the simplest consists of 
a single transmembrane helix while G protein-coupled 
receptors (GPCRs) have seven transmembrane helices. 
Receptors can be broadly divided into three domains. 
The extracellular domain detects (normally be direct 
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binding) the stimulating molecule. The transmem¬ 
brane domain transduces the signal across the mem¬ 
brane, possibly involving a conformational change. 
Finally, the intracellular domain interacts with down¬ 
stream signaling components within the cell (e.g., 
GPCRs) or possesses an intrinsic enzymatic activity 
which induces the signaling cascade (e.g., insulin 
receptor which functions via tyrosine kinase activity). 

G protein-coupled receptors (GPCRs) are the larg¬ 
est family of cell surface receptors and are character¬ 
ized hy seven transmembrane alpha-helices. They 
detect a wide variety of ligands and signal via 
heterotrimeric guanine nucleotide-hinding proteins 
(G proteins). In the unstimulated state, the Ga, Gp, 
and Gy subunits form a heterotrimeric G protein in 
which the Ga subunit is bound to GDP. When the 
receptor is stimulated, it undergoes a conformational 
change, leading to association with the G protein and 
a concomitant change in the Ga structure leading to the 
release of GDP. This GDP is replaced by GTP, and the 
Ga subunit dissociates from the GPy dimer and the 
receptor. The Ga and GPy subunits activate a wide 
variety of effectors, depending on their exact identity. 
Signaling is terminated hy hydrolysis of the GTP to 
GDP and reformation of the heterotrimeric G protein. 

GPCRs can be divided into six classes by sequence 
identity and ligand: 

Class A - Rhodopsin like 

Class B - Secretin 

Class C - Metabotropic glutamate 

Class D - Fungal pheromones 

Class E - cAMP 

Class F - Frizzled/Smoothened 

Rhodopsin is found in the retina and is involved in 
transduction of light signals and is the best character¬ 
ized GPCR (Smith 2010). In contrast to many GPCRs, 
it does not detect its ligand directly. Instead, it contains 
a chromophore - retinal - which lies horizontally in the 
protein and is photohleachable by light. Photons cause 
the isomerization of 11-cw-retinal into all-trans-retinal 
and an associated conformational change in rhodopsin 
structure. This leads to activation of transducin. Rho¬ 
dopsin absorbs most strongly in the green-blue region 
of the spectrum and is therefore purple in color. 

Bacteriorhodopsin is found in a variety of archaea 
including Halohacteria. Although this protein consists 
of seven transmembrane domains and contains the 
retinal chromophore which detects light, it is not 
a GPCR. Instead of coupling to a G protein, stimulation 


by light results in isomerization of the retinal and 
a conformational change which allows protons to 
move across the membrane through bacteriorhodopsin. 

Ion Channels 

Ion channels are complexes of integral membrane pro¬ 
teins, present in all cell types, which facilitate the flow 
of ions such as sodium or potassium down an electro¬ 
chemical gradient. Ion channels form aqueous pores 
across the hilayer which are highly conductive to the 
passage of various ion species both into and out of the 
cell. These proteins have a central role in cell pro¬ 
cesses, including transmission of nerve impulses, mus¬ 
cle contraction, and insulin release, and are therefore 
clinically important as their malfunction leads to 
a number of disease states, the most well known 
being cystic fibrosis which affects a chloride channel. 
The mechanism by which a channel is opened and 
closed is normally referred to as gating, and gating 
mechanisms are used in a classification system. Volt- 
age-gated channels are controlled by the membrane 
potential and include those for sodium and calcium. 
Ligand-gated channels respond to small molecules 
which bind to specific sites in the extracellular part of 
the channel and include the nicotinic acetylcholine 
receptor and y-aminobutyric acid-gated GABAa 
receptor. Other gating mechanisms include mechani¬ 
cal stress, temperature, and second messenger binding 
on the cyctoplasmic side of the channel. Channel struc¬ 
ture varies with ion type and gating mechanism but 
typically consists of several transmembrane helices 
surrounding a central channel; a loop region is inserted 
into the channel in a number of cases which confers 
specificity. Most research with ion channels uses the 
technique of patch clamping which measures the flow 
of ions through a single channel; the development of 
which was awarded the Nobel Prize for medicine 
in 1991 (Neher et al. 1978). Determination of the 
structure of the voltage-gated potassium channel won 
the Nobel Prize for chemistry in 2003 (MacKinnon 
et al. 1998). 

Translocons 

Translocons are membrane protein complexes which 
regulate and facilitate both the transport of proteins 
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across, and the integration of membrane proteins into, 
the cell membrane. In bacteria, protein transport across 
the cytoplasmic membrane occurs by two major 
systems, the most common being the general secretory 
pathway or Sec system and the twin-arginine 
translocase or Tat system for the transport of folded 
proteins. In eukaryotes, the Sec61 channel translocates 
proteins into the endoplasmic reticulum from the cyto¬ 
sol. Translocation is coupled to translation in eukary¬ 
otes with the nascent chain emerging from the 
ribosome and being translocated immediately through 
the channel. Proteins which are to be transported usu¬ 
ally contain a specific sequence at their N-terminus 
which may be directly recognized by the translocon 
or by specific chaperons which guide the protein to the 
translocon (examples include the SecA component of 
the bacterial Sec system and the signal recognition 
particle of the eukaryotic system). In general, the trans¬ 
location of proteins requires an input of energy in the 
form of ATP hydrolysis. The bacterial and eukaryotic 
Sec systems are involved in the insertion of membrane 
proteins into the bilayer; hydrophobic segments are 
recognized by the translocon, and translocation is 
arrested to allow the lateral movement of the segment 
out of the channel and into the bilayer. Other protein 
translocons include the Toe and Tic complexes of the 
chloroplast membranes and the Tom and Tim 
complexes of the mitochondria, both of which couple 
translocation across two bilayers. 

Photosynthesis 

The light-dependent reactions of photosynthesis, 
whether in bacteria or chloroplasts, take place at the 
plasma membrane. In chloroplasts, the thylakoid mem¬ 
brane separates the lumen from the stroma (the site of 
the dark reactions). This membrane contains light¬ 
harvesting complexes, photosystems I and II (PSI and 
PSII), cytochrome fig/, and ATP synthase. Light¬ 
harvesting complexes (antenna complexes) comprise 
the light-harvesting system which surrounds the reac¬ 
tion center of photosystems and transfer electrons to 
a single chlorophyll a molecule within it. Light¬ 
harvesting complexes contain a number of molecules 
such as chlorophyll b, carotenes, and xanthophylls 
which enable the plant to absorb a wide wavelength 
of light. These complexes effectively funnel energy to 
the reaction center of the photosystems. Photosystem 


II splits water into electrons, protons, and molecular 
oxygen. The electrons are transferred to plastoquinone 
(a mobile electron carrier in the membrane). Electrons 
then pass to cytochrome fig/, a transmembrane proton 
pump, and then on to plastocyanin (a water-soluble 
electron carrier). The proton gradient established by 
cytochrome fig/ is then used to generate ATP via the 
transmembrane ATP synthase which exports protons 
and generates ATP. The electrons from plastocyanin 
pass to photosystem I where they are transferred to 
NADPH or back to cytochrome fig/. ATP and 
NADPH are then used to synthesize organic molecules 
from CO 2 . The light-dependent reactions of photosyn¬ 
thesis involve a variety of different membrane proteins 
to move electrons and protons and to perform chemical 
reactions. 

Transport Proteins 

Transport across biological membranes may be pas¬ 
sive (no energy required) or active (energy required). 
Primary active transport utilizes chemical energy, e.g., 
ATP, whereas secondary active transport involves the 
use of electrochemical gradients. Transport proteins 
can function via a variety of mechanisms. Uniporters 
transport a single substrate in one direction. Examples 
of such transporters are GLUTl (involved in the move¬ 
ment of glucose across cell membranes in mammalian 
cells) and valinomycin (a nonpeptide potassium trans¬ 
porter derived from Streptomyces). Symporters trans¬ 
port two or more different molecules across 
membranes. A prime example of this is lactose perme¬ 
ase (LacY) which transports lactose, along with pro¬ 
tons, into bacterial cells. In humans, the glucose/Na"^ 
transporter moves glucose across the luminal mem¬ 
brane of endothelial cells to allow it to be taken up 
into the bloodstream. Antiporters move one substrate 
across the membrane in each direction - the electro¬ 
chemical gradient of one substrate is used to drive the 
transport of the other. Examples of this type of trans¬ 
port mechanism include the ADP/ATP exchanger and 
the Na'^/H'^ exchanger. 

ABC Transporters 

ABC (ATP-binding cassette) transporters use ATP to 
enable active transport of a range of substrates across 
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membranes. They are found in both prokaryotes 
(where they may import or export substrates) and 
eukaryotes (where they only export). ABC transporters 
consist of two domains - the transmembrane domain 
(TMD) and the nucleotide binding domain (NBD). The 
minimal unit of the ABC transporter is two TMDs and 
two NBDs. The TMD consists of six alpha-helices 
such that the dimer has 12 in total. This domain can 
be classified as type 1, 11, or 111 depending upon the 
fold. The NBD is cytoplasmic and consists of two 
domains. The first is the catalytic core which is 
involved in ATP hydrolysis and consists of beta sheets. 
The second domain is a unique alpha-helical 
subdomain. The genes encoding ABC transporters 
often produce a fusion polypeptide TMD-NBD- 
TMD-NBD which ensures correct orientation and 
stoichiometry of the subunits. 

Membrane Pumps 

Membrane proteins involved in the active transport of 
solutes across cell membranes in an ATP-dependent 
manner are known as pumps. ATP-dependent pumps 
are classified by type (type-P I-V, type-F, and type-V) 
based on sequence homology and structure. Type-P 
pumps function by phosphorylation of the alpha subunit, 
leading to a conformational change. Examples of type-P 
pumps are the sodium potassium ATPase which func¬ 
tions as an antiport ion pump in the plasma membrane, 
actively transporting sodium ions out of the cell in 
exchange for the entry of potassium, and the calcium 
ATPase which serves to remove calcium from the cell 
cytoplasm to enable its continued use as a signaling 
molecule. Type-V pumps are not phosphorylated; they 
are found in the endosomal, lysosomal, and vacuolar 
membranes where they are responsible for maintaining 
the proton gradient. Type-F pumps again only transport 
protons and are not modified by phosphate; they are 
present in the membranes of bacteria, mitochondria, 
and chloroplasts. This type of pump is can also be 
known as an ATP synthase as it utilizes the energy 
from the electron transport chain to generate ATP. 

Porins 

Porins are transmembrane proteins arranged in a beta- 
barrel structure forming a cylindrical aqueous channel 


to aid the diffusion of small molecule solutes (typically 
less than 1,500 Da) into and out of the cell, typically 
sugars, ions, and amino acids. Porins are highly 
represented in bacterial outer membrane, with the 
best studied example being OmpF (Nikaido 2003), 
and are also present in chloroplast and mitochondrial 
outer membranes. The structure of the channel is 
formed by a varying number of beta strands which 
defines the size of solutes which can pass through the 
pore, and an alternating pattern of hydrophobic and 
hydrophilic amino acids which, interact with the lipid 
membrane and form the lining of the pore, respec¬ 
tively. An external loop between the strands is believed 
to form an “eyelet” which additionally restricts the size 
of the channel regulating solute passage. Porin mono¬ 
mers are typically arranged as trimers in the membrane 
but with each monomer functioning as an individual 
pore. Both general and substrate-specific porins are 
found, and expression of different porins is controlled 
by the cell in response to environmental stress such as 
osmotic pressure and temperature (OmpC/OmpF) as 
well as chemicals such as phosphate (PhoE). 

Aquaporins 

Aquaporins are integral membrane proteins regulating the 
passage of water across the membrane. They are highly 
selective for water and are controlled by the osmotic 
gradient. They have a six-transmembrane-domain archi¬ 
tecture, with the N- and C-terminal halves of the protein 
being sequence related. Each half consists of three trans¬ 
membrane domains with the loops between helices two 
and three, in one half, and five and six in the other forming 
hemipores with each traversing one leaflet of the bilayer. 
The pores close together during water transport to form 
the full pore defined as an hourglass stmcture. This 
arrangement was confirmed by cryo-electron microscopy 
and atomic force microscopy. The narrowest diameter of 
the channel was found to be 2.8 A - just large enough for 
a water molecule to pass. Aquaporins form a tetrameric 
assembly in the membrane, but each monomer alone is 
capable of acting as a channel. 

Antimicrobial Peptides 

Antimicrobial peptides act as effective broad-spectrum 
antibiotics. They generally consist of a short 
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polypeptide (12-50 amino acids) with a high hydro- 
phobic content. Members of this family can adopt 
four different conformations: alpha-helical, e.g., 
magainin; extended, e.g., indolicidin; beta sheet sta¬ 
bilized by disulphide bonds, e.g., defensin; and 
mixed, e.g., protegrin-1. Many of these peptides are 
actually disordered in solution and only order upon 
partitioning into membranes. These proteins tend to 
be amphipathic to enable partitioning into the mem¬ 
brane. They act via two mechanisms, either the for¬ 
mation of a pore or penetration into the cell and 
subsequent interaction with essential cellular compo¬ 
nents. These peptides are relatively selective for pro¬ 
karyotic cells due to the differences in membrane 
composition when compared to eukaryotes. Prokary¬ 
otes typically have many zwitterionic phospholipids 
in their membranes, creating an overall charge and 
allowing electrostatic interaction with the peptide. 
However, eukaryotic cells generally have an 
uncharged outer membrane, prohibiting hydrophobic 
interactions with the cationic peptides. 

Fusogenic Peptides 

As their name suggests, fusogenic peptides promote 
fusion of lipid membranes. A prime example is hem¬ 
agglutinin from influenza virus. Hemagglutinin is 
a spike-shaped protein found on the viral surface. As 
expected, the exposed portion is largely hydrophilic 
and recognizes sialic acid on the surface of host cells. 
Binding of hemagglutinin to sialic acid causes the viral 
particle to be endocytosed into the host cell. As part of 
the host defense, the endosome is then acidified. This 
causes partial unfolding of hemagglutinin and exposes 
a hydrophobic region of the protein. This region then 
inserts into the endosomal membrane. As the rest of the 
protein adopts its new conformation at lower pH, the 
viral and endosomal membranes are drawn closer 
together, resulting in fusion. 

Cell Adhesion Molecules and Scaffold 
Proteins 

Cell adhesion molecules (CAMs) are displayed on 
the cell surface and interact with other cells or 
components of the extracellular matrix. They are 
tethered to the cytoskeleton on the intracellular 


side and are classified according to calcium depen¬ 
dency. The immunoglobulin superfamily and 
integrins are both calcium independent, while the 
cadherins such as snares and selectins are calcium 
dependent. 

Some proteins act as a scaffold on to which other 
proteins can be tethered and localized to the plasma 
membrane. These proteins usually form part of signal¬ 
ing cascades and include receptors such as the epider¬ 
mal growth factor receptor and the platelet-derived 
growth factor receptor. Upon activation by extracellu¬ 
lar ligand binding, the membrane scaffold protein 
becomes phosphorylated on the cytoplasmic side and 
is then able to recruit downstream signaling compo¬ 
nents. In this way, the scaffold protein orchestrates the 
signaling cascade through the formation of membrane- 
localized complexes. Scaffold proteins play a role in 
regulating and coordinating signal transduction by 
reducing or enhancing the downstream signal, through 
the recruitment of specific proteins, and are therefore 
central to feedback loops. 

Cross-References 

► Ion Channels 

► Membrane Protein Structure 
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► Membrane Transport, Energetics and Overview 
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Synonyms 

Seven-Transmembrane Domain Receptors; Integral 
membrane protein structure 

Definition 

Membrane proteins are proteins with at least one 
domain that is physically embedded into a biological 
membrane. Protein structure is defined as the three- 
dimensional configuration of the protein amino acid 
sequence. 

Introduction 

The term “membrane protein” is widely used to 
describe proteins that are inserted in, anchored to, or 
otherwise associated with a biological membrane. 
From a structural point of view, however, only mem¬ 
brane-inserted domains present characteristics that are 
clearly different from those of their soluble counter¬ 
parts. These structural features stem from evolutionary 
adaptation to the peculiar membrane environment. In 
contrast, overall structural architecture of membrane- 
associated or membrane-anchored domains presents 
features similar to those of soluble domains. 
This entry focuses on the structure of integral mem¬ 
brane proteins (IMPs), or proteins whose polypeptide 
chain spans the whole width of the membrane at least 
once. In the following, the term “membrane protein” 
will refer to IMPs unless stated otherwise. 

IMPs are estimated to constitute between 20% and 
30% of the protein endowment of an organism, both in 
prokaryotes and in eukaryotes (Liu and Rost 2001; 
Fagerberg et al. 2010). Functionally, they constitute 
the main communication channel between the cell and 
the outside world and additionally, in eukaryotes, 
between different membrane-delimited cellular com¬ 
partments. Accordingly, signaling and transport are 


prominent among the diverse functions of membrane 
proteins. IMPs are also directly or indirectly impli¬ 
cated in a number of diseases including cancer, diabe¬ 
tes, cystic fibrosis, and Crohn’s disease. They play an 
important role in viral infection and are the target of 
numerous toxins. In fact, some IMPs are of outstanding 
medical importance, with G-protein coupled receptors 
constituting a large part of current drug targets (Jacoby 
et al. 2006). Another important drug target for which 
a structure is available is the M2 proton channel of the 
influenza virus, recently co-crystallized with two of its 
inhibitors (PDB codes; 3BKD, 3C9J, and 2RLF). 

Due to their membrane localization, generally high 
flexibility and naturally low expression levels, IMPs 
are among the most challenging proteins when it 
comes to experimental characterization. In particular, 
any study requiring extraction of the protein from the 
membrane is bound to encounter enormous difficulties. 
Structure determination using X-ray crystallography 
or NMR falls exactly into this category. While the 
first high-resolution structure for a globular water- 
soluble protein, myoglobin, was solved in 1958 
(PDB code: IMBN), it was not until 1985 that the 
first atomic-level resolution structure of an IMP was 
determined, namely, the structure of the photosyn¬ 
thetic reaction center (PDB code: IPRC, Deisenhofer 
et al. 1985). Even more significantly, in the 10 years 
after the seminal paper by Deisenhofer et al., only 
a handful of other membrane protein structures were 
determined, including bacterial porin (PDB code: 
2POR), matrix porin (PDB code: 20MF) and 
phosphorin (PDB code: IPHO), Fl-ATPase (PDB 
code: IBMF), prostaglandin H2 synthase-1 (PDB 
code: IPRH), LamB maltoporin (PDB code: IMAL), 
and cytochrome c oxidase (PDB code: lARl). 

Over the last decade, however, significant improve¬ 
ments to the experimental techniques used to manipu¬ 
late membrane proteins have advanced the knowledge 
of IMP structure in important ways. In particular, this 
has led to a better understanding of how IMP structure 
relates to sequence and, ultimately, to function. It has 
to be emphasized, however, that the number of struc¬ 
tures known today still covers only a small part of the 
IMP sequence space, suggesting that much is left to be 
discovered. The present entry provides an introduction 
to what is the current knowledge of membrane protein 
structure. The three distinct IMP structural classes, 
namely, a-helical, P-barrel, and a-helical barrel mem¬ 
brane proteins, are discussed separately. More space 
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and attention is given to a-helical membrane proteins, 
which represent the most abundant IMP class in living 
cells. 

Proteins and Membranes 

Most biological membranes, including the plasma 
membrane of eukaryotic and prokaryotic cells and the 
membranes of mitochondria and chloroplasts, are con¬ 
stituted of a bilayer of phospholipids. The specific type 
of constituent phospholipids is diverse among different 
membranes (van Meet et al. 2008). Also, some mem¬ 
branes are formed by other molecules than phospho¬ 
lipids. A prominent example is the outer membrane of 
Gram-negative bacteria, where an inner phospholipid 
layer is paired to an outer layer of lipopolysaccharides. 
Despite these differences, a feature common to all 
biological membranes is the presence of a hydrophobic 
core delimited by charged head groups that interact 
with the aqueous solution. In order to be stable and 
functionally active in this heterogeneous environment, 
IMPs have developed specific structural features that 
differentiate them from their water-soluble globular 
counter parts. 

a-Helical Integral Membrane Proteins 

Membrane localization: a-Helical integral membrane 
proteins (aIMPs) (Fig. la, b) are found in the plasma 
membrane of eukaryotic and prokaryotic cells as well 
as in the inner membrane of mitochondria and chloro¬ 
plasts. Additionally, they have been reported in the 
outer membrane of mitochondria (Burri et al. 2006). 
They are estimated to constitute up to 30% of all pro¬ 
teins in genomes. Most aIMPs are co-translationally 
inserted into the membrane, in prokaryotes via the 
SecY complex (PDB code: IRHZ) located in the 
inner membrane and in eukaryotes via the Sec61 com¬ 
plex located in the endoplasmic reticulum membrane 
(Zimmer et al. 2008). 

Secondary structure in the membrane region: The 
most striking structural feature of aIMPs is represented 
by their long, hydrophobic transmembrane helices 
(TMHs) (Fig. la, b). Helical conformation in the mem¬ 
brane-spanning segments of these proteins is instru¬ 
mental in achieving membrane solubility. Indeed, 
intra-helical hydrogen bonds help minimizing the 


cost of insertion of the polar backbone into the highly 
hydrophobic membrane interior. For the same reason, 
TMHs feature a predominantly hydrophobic amino 
acid sequence. Polar and charged amino acids, which 
are less commonly found within the membrane core, 
often fulfill important functional roles. For example, 
a series of membrane-embedded, charged residues is 
known to be responsible for voltage sensing in voltage¬ 
gated potassium channels (see the structure of the 
Kvl.2 potassium channel, PDB codes: 2A79, SLUT); 
also, polar and charged residues in the uracil trans¬ 
porter have recently been shown to directly bind the 
substrate (PDB code: 3QE7). 

Depending on the number of TMHs in their struc¬ 
ture, aIMPs are classified as either single-span or 
multi-span. Single-span or bitopic aIMPs have one 
TMH, as depicted in Fig. la; multi-span or polytopic 
aIMPs have more than one TMH. The TMHs of multi¬ 
span proteins are assembled into multi-helix bundles 
(Fig. lb). The single multi-span protein subunit of 
known structure with the highest number of TMHs is 
the sodium/sugar symporter vSGLT with 15 TMHs 
(PDB code: 2XQ2). 

Given that TMHs have to cross the membrane from 
side to side, their preferential orientation is along the 
normal to the membrane plane, that is, perpendicular to 
the lipid bilayer. Tilt angles with respect to this direc¬ 
tion, however, span a rather wide range of values, with 
increasing tilt angles generally corresponding to longer 
TMHs. The minimum length needed for a helix to fully 
cross the membrane is between 15 and 20 amino acids. 

Topology: TMHs in aIMPs may be connected by 
short loops or separated by long water-soluble regions 
that fold into full intracellular, extracellular, or peri- 
plasmic globular domains. One example of an aIMP 
with large soluble domains is the endoplasmic reticu¬ 
lum calcium ATPase (PDB code: 1SU4). The number 
and position of the different transmembrane segments 
along the protein sequence and their direction of inser¬ 
tion into the membrane, from inside to outside or vice 
versa, is usually referred to as “membrane protein 
topology”. Most proteins have one dominant topology, 
where the location of TMH connecting loops with 
respect to the membrane follows the so-called positive 
inside rule (von Heijne and Gavel 1988). According to 
this observation, loops with an excess of basic residues, 
i.e., arginine and lysine residues, are typically located 
in the cell cytoplasm, although the rule becomes less 
stringent with increasing loop length. A small number 
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Membrane Protein 
Structure, Fig. 1 Structural 
classes of membrane proteins. 
(a) Human CD4, an example 
of a bitopic ot-helical integral 
membrane protein (PDB code: 
2KLU), (b) pi-adrenergic 
receptor, an example of 
a polytopic a-helical integral 
membrane protein (PDB code: 
2YCZ), (c) transmembrane 
domain of a P-barrel integral 
membrane protein (OmpA, 
PDB code: 2JMM), and 
(d) Wza, the only known 
structure of an a-helical barrel 
integral membrane protein 
(8 subunits, PDB code: 2 J58). 
Protein images were created 
using the following 
visualization programs: VMD 
and PyMOL (Humphrey et al. 
1996; Schrodinger 2011). 
Color code: (a-c) in purple 
a-helices, in yellow P-strands, 
in cyan turns, in white coils, 
(d) Different colors 
correspond to different 
subunits. The dashed lines 
indicate the membrane region, 
in red plasma membranes, in 
blue outer membranes. 
Membrane boundaries are 
taken from the online database 
0PM (Lomize et al. 2006) 



of membrane proteins have been suggested to have 
dual topology, that is, to have no preferential orienta¬ 
tion inside the membrane (Rapp et al. 2006). In several 
instances, membrane proteins present an internal sym¬ 
metry that likely derives from ancient gene duplication 
events. Interestingly, when the domain undergoing 
duplication features an odd number of TMHs, the two 
homologous parts of the resulting protein are embed¬ 
ded in the membrane with opposite topologies (von 
Heijne 2006). 

Memhrane-water inteiface versus membrane core: 
The membrane-water interfacial region is generally 
defined as the area delimited on one side by the 
charged heads of the lipids and extending for 5-10 A 
into the water medium. At the interface, aIMP struc¬ 
ture is characterized by coil and, to a lesser extent, 
helical regions. The latter are often amphipathic 


helices lying parallel to the membrane plane. Notably 
this region appears to be devoid of P-strands (Granseth 
et al. 2005). Contrary to what is observed for the 
protein sequences embedded in the membrane core, 
polar and charged amino acids are more common in 
the interfacial region. Glycine and proline residues, 
which are classic helix breaker residues, are also com¬ 
monly found in the interfacial coil regions. Snorkeling 
and anti-snorkeling effects have been described, with 
side chains of polar and charged amino acids located in 
the membrane core pointing toward the water medium 
and side chains of hydrophobic amino acids located at 
the water-membrane interface pointing toward the 
membrane. The aromatic residues tryptophan and tyro¬ 
sine snorkel or anti-snorkel depending on whether they 
are found in the membrane core or at the interface, 
respectively. 
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Membrane Protein Structure, Fig. 2 Structural motifs of 
a-helical membrane proteins, (a) Reentrant helix-coil region 
(PDB code: 1BL8), (b) intramembranous loop (PDB code: 
3PJZ), (c) the unusually extended, largely membrane-embedded 
arm of the GlpG rhomboid protease (PDB code: 2IC8). (d) The 
intramembranous P-sheet of the E. coli uracil transporter UreA 
(PDB code: 3QE7). Protein images were created using the 


Structural motifs: An increasing number of 
intramembrane structural motifs are being discovered 
in aIMPs (Fig. 2). 

In many instances, TMHs are interrupted by kinks, 
with proline residues often found at or close to the kink 
(Langelaan et al. 2010). Kinks are believed to provide 
flexibility needed for function. One example involving 
a different residue is the glycine-mediated gating 
mechanism proposed for potassium channels. 

Reentrant regions, depicted in Fig. 2a, were first 
observed in the structure of the KcsA potassium chan¬ 
nel where they line the potassium selectivity filter 
(Doyle et al. 1998). As the name implies, these seg¬ 
ments penetrate the membrane, span part of its hydro- 
phobic core, turn and exit on the same side of the 
membrane. Reentrant regions have been classified 
into three different groups according to their secondary 
structure composition: helix-helix, helix-coil (or coil- 
helix), and regions with no secondary structure 


b 



visualization program VMD (Humphrey et al. 1996). Color 
code: in violet, specific structural motifs are highlighted; in 
green, the rest of the protein is shown. Note that for the sake of 
clarity some of the TMHs in the above proteins have been 
omitted from the visualisation (d). Red, dashed lines indicate 
the plasma membrane region. Membrane boundaries are taken 
from the online database 0PM (Lomize et al. 2006). 


(Viklund et al. 2006). These regions have been related 
to several functional aspects, from ion selectivity to 
substrate binding. 

Intramembranous loops such as the long glycine- 
rich loop of the TrkH potassium transporter (PDB 
code; 3PJZ) can interrupt TMHs, increasing protein 
flexibility as well as playing more specific functional 
roles (see Fig. 2b). 

Some structural motifs have so far been observed 
only in individual IMP structures. These include the 
membrane-inserted coil-helical extended arm of the 
GlpG rhomboid protease (PDB code: 2IC8) depicted 
in Fig. 2c and the recently observed intramembranous 
P-sheet of the Escherichia coli uracil transporter UreA 
(PDB Code: 3QE7) shown in Fig. 2d. The UreA 
P-sheet is formed by two short P-strands located on 
the third and tenth transmembrane segments of the 
protein. This unusual motif represents the first trans¬ 
membrane P-strand regions to be observed in an aIMP. 
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Membrane Protein Structure, Fig. 3 Examples of membrane 
protein structures, (a) Bacterial photosynthetic reaction center, 
the first IMP structure solved at atomic resolution (4 subunits, 
PDB code: IPRC), (b) endoplasmic reticulum calcium ATPase 
with a large soluble domain (PDB code: 1SU4), (c) Leucine 
transporter LeuT, an example for an internal symmetry. Two 
transmembrane regions, TMHs 1-5 and 6-10, likely derived from 
a gene duplication event and have opposite topologies in the 
membrane region (PDB code: 2A65), (d) Aquaporin-1, a homo- 
tetramer (4 subunits, PDB code: 1J4N), (e) autotransporter EstA, 
a p-barrel with a large soluble domain (PDB code: 3KVN), and 


(f) usher translocation pore, a P-barrel with a globular domain in 
the pore lumen and a gap in the P-barrel (PDB code: 2VQI). 
Protein images were created using the visualization programs 
Chimera (Pettersen et al. 2004). Color code: (b, e, f) secondary 
structure colored as in Fig. la-c; (a, d) different colors corre¬ 
spond to different subunits; (c) in purple and cyan, respectively, 
the two transmembrane regions with opposite topology are 
shown, in white the rest of the protein. The dashed lines indicate 
the membrane region, in red plasma or endoplasmic reticulum 
membranes, in blue outer membranes. Membrane boundaries are 
taken from the online database 0PM (Lomize et al. 2006) 


Local structural motifs often contribute to the crea¬ 
tion of a non-hydrophobic environment within the 
transmembrane region of alMPs. These regions can 
more easily accommodate water and other polar or 
charged molecules that are being transported across 
the membrane. A prototypical example is the water- 
filled cavity located in the interior of several potassium 
channels and transporters, which is cracial for 


potassium permeation. The cavity is lined by four reen¬ 
trant regions and by the surrounding TMHs (Doyle et al. 
1998). Another example is aquaporins, which are water 
channels abundant in prokaryotes and eukaryotes. In 
aquaporins, six TMHs and two reentrant regions form 
a pore that allows water molecules to cross membranes 
(PDB code: 1J4N). In Fig. 3a-d, structures for some of 
the proteins discussed above are shown. 
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P-Barrel Integral Membrane Proteins 

Membrane localization'. p-Barrel integral membrane 
proteins (piMPs), see Fig. Ic, are found in the outer 
membrane of Gram-negative bacteria, Gram-positive 
mycobacteria, mitochondria, and chloroplasts and are 
in fact often referred to as “outer membrane proteins.” 
It has to be noted, however, that other membrane 
protein types are found in outer membranes, including 
Qc-barrel membrane proteins that are described below, 
lipid-anchored proteins (not discussed here) and, in 
mitochondria, a-helical membrane proteins. 

piMPs are estimated to constitute between 2% and 
3% of proteins in the genomes of Gram-negative bac¬ 
teria (Wimley 2003). In contrast, only a handful of 
them are believed to exist in the mitochondrial outer 
membrane of eukaryotes (Imai et al. 2011). Function¬ 
ally, these proteins fulfill many different tasks, and 
include transporters, enzymes, and receptors. 

Secondary structure in the membrane and mem¬ 
brane topology. Similar to aIMPs, piMPs use second¬ 
ary structure as a mean to minimize insertion cost of 
backbone polar groups inside the membrane hydro- 
phobic core. pIMPs transmembrane segments, how¬ 
ever, take the form of P-strands, rather than a-helices, 
with the strands arranged into a barrel-shaped P-sheet 
(Ic). The up-and-down P-barrel carves a pore into the 
membrane, with the interior of the pore exposed to the 
aqueous solvent. As a consequence, the membrane- 
inserted portion of the protein is exposed to an amphi- 
pathic environment, facing water on the internal side of 
the p-barrel and lipids on its external side. For this 
reason, and contrary to TMHs, transmembrane 
P-strand sequences alternate between hydrophilic 
(pore- or water-facing) and hydrophobic (membrane¬ 
facing) residues (Fairman et al. 2011). In available 
structures the number of transmembrane P-strands 
ranges from 8 to 24. With the sole exception of the 
voltage-dependent anion channel 1 (VDACl) featur¬ 
ing 19 transmembrane P-strands (PDB codes: 3EMN, 
2K4T), known P-barrels are constituted of an even 
number of strands. VDACl is also the only P-barrel 
membrane protein from mitochondria for which 
a structure is currently available. 

Transmembrane P-strands are most often connected 
by coil regions. These regions are generally long on the 
extracellular side and short on the periplasmic side. 
In some cases, in place of coil regions, fully structured 
water-soluble domains can be observed. One example 


is observed in the structure of protein esterase EstA 
(PDB code: 3KVN, Eig. 3e). Another example is the 
usher translocation domain, where a globular domain 
is located directly in the pore lumen (PDB code: 2VQI, 
Pig. 3f). Remarkably, this plug domain disrupts the 
continuity of the membrane domain within the protein 
sequence, with transmembrane P-strands found both 
N- and C-terminal with respect to the globular inser¬ 
tion. Another peculiarity of the usher structure is the 
gap found in the structure of the transmembrane p- 
barrel, a structural motif that has to date not been 
observed in any other piMP. 

A Novel Structural Class of Membrane 
Proteins 

Until very recently, it was assumed that only two 
structural classes of IMPs existed: a-helical and 
P-barrel membrane proteins. In 2006, structure deter¬ 
mination of the Wza capsular polysaccharide 
translocon revealed the existence of what appears to 
be a completely new class (Dong et al. 2006; Fig. Id). 
Wza has been referred to as an a-barrel IMP. While the 
transmembrane region of Wza is embedded in the 
outer membrane of E. coli, its transmembrane seg¬ 
ments adopt an a-helical conformation, in contrast to 
the p-barrel domains commonly observed in outer 
membrane proteins. At the same time, Wza TMHs 
assemble into a pore-like structure and are exposed to 
an amphipathic environment, mimicking the trans¬ 
membrane p-strands of piMPs. Discovery of the 
novel architecture of Wza suggests that, notwithstand¬ 
ing undisputable progress in recent years, knowledge 
of the membrane protein structural universe is still far 
from complete. 

Summary 

This entry provides an introduction to the three- 
dimensional structure of membrane-inserted, integral 
membrane proteins. Integral membrane protein struc¬ 
tures are unique since they had to adapt to the highly 
heterogeneous environment imposed onto them by the 
membrane. Here are described the main structural ele¬ 
ments found in three distinct classes of membrane 
proteins: a-helical, P-barrel, and a-barrel integral 
membrane proteins. 
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Synonyms 

Solid-state NMR 


Introduction 

Solid-state NMR is capable of determining the struc¬ 
tures of membrane proteins in their native environment 
of liquid crystalline phospholipid bilayers. The highly 
asymmetric bilayer environment is problematic for 





1464 


Membrane Proteins in Phospholipid Bilayers: Structure Determination by Solid-State NMR 


experimental structure determination. First, any 
change in the environment, such as using nonlipids, 
has the potential to alter the proteins. Second, the 
bilayer environment effectively slows the reorientation 
of the protein so that it behaves like a solid on the NMR 
timescales; with conventional solution NMR methods, 
the spectra are very broad with distinguishable signals 
only from mobile residues at the N- and C- termini 
(Park et al. 201 1). Flowever, it is not possible to crys¬ 
tallize proteins from phospholipid bilayers for struc¬ 
ture determination by X-ray diffraction, and the lipids, 
phases, and the protein sequences are altered to 
obtain crystals suitable for structure determination 
(Rosenbaum et al. 2007; Caffrey 2009). Fortunately, 
solid-state NMR spectroscopy is well suited for study¬ 
ing membrane proteins in phospholipid bilayers. 
Although they behave like solids because of the 
immobilizing effects of the bilayers, they undergo 
rotational diffusion about the bilayer normal, and this 
can be effectively incorporated into a method for struc¬ 
ture determination (Edidin 1974). 

Solid-State NMR Spectroscopy 

In the late 1960s and early 1970s, NMR spectroscopy 
was revolutionized by the advent of methods that gave 
high-resolution spectra of single-crystal and powder 
samples. Previously, high-resolution NMR was 
restricted to solutions of relatively small molecules 
that undergo rapid isotropic reorientation in solution. 
This provided narrow line widths at the expense of the 
structural information inherent in the anisotropic spin 
interactions. Multiple-pulse methods provided homo- 
nuclear ^H-^H decoupling while retaining differences 
among chemical shift frequencies (Waugh et al. 1968). 
Double-resonance methods provided high-resolution 
spectra of “dilute” nuclei, such as and *^N, by the 
combination of cross polarization for sensitivity 
enhancement and heteronuclear decoupling for high 
resolution (Pines et al. 1972). Importantly, in single¬ 
crystal or uniaxially aligned samples, the observed 
frequencies provided valuable angular information 
for sites with respect to the direction of the applied 
magnetic field. With the addition of the third compo¬ 
nent of high-resolution solid-state NMR, magic angle 
spinning (MAS) (Schaefer and Stejskal 1976), it 
became possible to obtain spectra of powder samples 
similar to those that resulted from dissolving the same 


compounds in solution. However, MAS solid-state 
NMR spectra do not retain the angular information 
present in those obtained from stationary samples. 
Regaining this information has been subject of recent 
developments in the field of solid-state NMR spectros¬ 
copy and is an essential aspect of the method for 
protein structure determination described in this 
article. 

Following the initial results on single crystals, 
powders, and synthetic polymers, the goal was to 
extend solid-state NMR to biopolymers. Applications 
to supramolecular assemblies, such as membranes and 
virus particles, indicated that the basic building blocks 
of solid-state NMR experiments, multiple pulse, dou¬ 
ble resonance, and magic angle spinning, yield high- 
resolution NMR spectra of immobilized proteins. 
However, following the initial demonstrations, MAS 
solid-state NMR (Opella et al. 1979) and oriented 
sample (OS) solid-state NMR (Cross and Opella 
1983) approaches were developed largely along sepa¬ 
rate paths. Both yielded three-dimensional structures 
of proteins, but membrane proteins in phospholipid 
bilayers remained extremely challenging for both 
approaches. An alternative approach has been devel¬ 
oped, rotationally aligned (RA) solid-state NMR (Park 
et al. 2010; Marassi et al. 2011; Das et al. 2012), which 
combines the samples and experiments of MAS solid- 
state NMR with the measurements of orientationally 
dependent frequencies of OS solid-state NMR to yield 
a method precisely tailored for membrane proteins in 
phospholipid bilayers. 

Rotationally Aligned Solid-State NMR 
Spectroscopy 

The rotationally aligned (RA) solid-state NMR 
approach to structure determination of membrane pro¬ 
teins is generally applicable, since it depends on 
a fundamental property of membrane proteins in liquid 
crystalline phospholipid bilayers, which is their rota¬ 
tional diffusion about the bilayer normal. Once the 
protein has been shown to undergo the requisite rota¬ 
tional diffusion about the bilayer normal, it is possible to 
determine its structure by RA solid-state NMR in five 
steps (Marassi et al. 2011): (1) Express, purify, and 
refold the membrane protein. (2) Resolve the signals 
from the protein by applying a series of two- and three- 
dimensional, double- and triple-resonance experiments 
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on the uniformly ^^C/^^N-labeled protein sample. 

(3) Assign the signals to individual sites in the protein. 

(4) Recouple the rotationally averaged powder patterns, 
and the frequencies associated with the discontinuities 
of the axially symmetric powder patterns are measured. 

(5) The structure of the protein is calculated using the 
orientational constraints based on the angle between the 
principal axis of the spin interaction tensor and the 
bilayer normal. 

Sample Preparation 

The protein is expressed in bacteria in media suitable 
for uniform labeling with both and *^N. The poly¬ 
peptide sequence of interest is attached to 
a hydrophobic fusion protein to ensure that the 
expressed polypeptide accumulates in inclusion bodies 
rather than the cell membrane, where large amounts of 
an unnatural protein are often lethal to the rapidly 
growing bacteria. The fusion protein assists with the 
purification process before it is cleaved off. Final puri¬ 
fication is accomplished with HPLC or FPLC so that 
the membrane proteins yield one band on an SDS gel 
within the limits of detection. After transfer to phos¬ 
pholipids, typically DMPC, the residual detergent is 
carefully removed so that there is no detergent present 
in the sample within the limits of detection. The deter¬ 
gent-free proteoliposomes containing the membrane 
protein are concentrated by centrifugation and trans¬ 
ferred to the rotor of a magic angle spinning probe 
(Park et al. 2012). 

Resolve Individual Signals with 
Multidimensional Solid-State NMR 
Experiments 

Physically rotating an unoriented, immobile sample at 
the “magic” angle of 54.74° relative to the magnetic 
field, in combination with radio frequency irradiations, 
yields spectra where each chemically distinct *H, *^C, 
or site in the protein provides a single signal 
(Schaefer and Stejskal 1976). The MAS solid-state 
NMR spectra, by themselves, can be used for resolu¬ 
tion and assignment purposes, even though the thou¬ 
sands of signals from a protein are highly overlapped in 
one-dimensional spectra. Fortunately, two- and three- 
dimensional spectra are capable of resolving most or 


all of the resonances present in a uniformly *^C/^^N- 
labeled protein. 

In the early 1970s, it was realized that membrane 
proteins undergo rotational diffusion about the bilayer 
normal at a frequency that is fast compared to the 
frequency spans (^10^ Hz) of the heteronuclear dipo¬ 
lar and chemical shift powder patterns (Edidin 1974). 
This is readily demonstrated by comparing the breadth 
and shape of the chemical shift powder patterns 
from *^C'-labeled proteins in bilayers at temperatures 
above and below the gel to liquid crystalline phase 
transition temperatures of the lipids in Fig. 1. It is 
found that the most convenient way of verifying that 
a membrane protein undergoes the required rotational 
diffusion in the bilayer under the experimental condi¬ 
tions is to observe the spinning side band patterns with 
relatively slow (5 kHz) magic angle spinning. The low- 
temperature sample, where the lipids are in the gel 
phase, has many spinning sidebands that trace out the 
broad static *^C' powder pattern. In contrast, at a higher 
temperature where the phospholipids are liquid crystal¬ 
line, the rotation of the protein is fast enough to average 
the powder pattern to that expected for carbonyl groups 
in a helix aligned at an angle close to that of the bilayer 
normal. Since the resulting powder pattern is axially 
symmetric and narrow, no sidebands are observed 
even under the slow magic angle spinning conditions. 
This qualitative difference is readily observed and pro¬ 
vides assurance that the protein is undergoing fast rota¬ 
tion about the bilayer normal, which is a requirement for 
these experiments. RA solid-state NMR is based on the 
measurements of angles relative to the fixed axis of the 
bilayer normal that are revealed by the motional aver¬ 
aging of powder patterns. 

Recent developments in decoupling and recoupling 
methods rely on suspending the averaging effect of 
magic angle spinning so that they retain the anisotropic 
information in the spin-interaction tensors without 
sacrificing the high sensitivity and resolution provided 
by MAS. Polarization transfer techniques incorporated 
into multidimensional experiments provide coherence 
transfer pathways that can be used to make resonance 
assignments and to measure structural constraints. 
Heteronuclear dipole-dipole interactions can be 
recoupled in a number of ways. Symmetry-based 
pulse sequences developed by Levitt and coworkers 
are of particular importance; heteronuclear dipolar 
recoupling using symmetry-based rf pulses on 
a single channel can restore the effect on the other 
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Membrane Proteins in Phospholipid Bilayers: Structure 
Determination by Solid-State NMR, Fig. 1 solid-state 
NMR spectra of uniformly '^C/*^N-labeled MerFt in 
proteoliposomes. The majority of the resonance intensity, cen¬ 
tered near 175 ppm, is from the '^C' backbone sites. The spectra in 
panels (c) and (g) were obtained under experimental conditions 
(sample temperature of 25° C) where the protein undergoes fast 
rotational diffusion about the bilayer normal in proteoliposomes. 
The spectra in panels (d) and (h) were obtained under experimen¬ 
tal conditions (sample temperature of 10°C) where the protein is 
immobile on the timescale of a static ’^C' chemical shift powder 
pattern (~10^ Hz). The simulated spectra are for a single ’^C' 
group in a transmembrane helix aligned approximately parallel to 
the lipid bilayer normal, (b) and (f) are for a static protein, (a) and 
(e) are for a protein undergoing fast rotational diffusion about the 
bilayer normal. The experimental spectra in (g) and (h) were 
obtained from samples undergoing slow (5 kHz) magic angle 
spinning. The corresponding simulated spectra in (e) and (f) 
confirm that a family of sidebands is observable in the absence 
of rotational diffusion of the membrane protein. Comparisons of 
breadth (a vs. b and c vs. d) or presence of spinning sidebands (e 
vs. f and g vs. h) are diagnostic for the presence of fast rotational 
diffusion of the protein under the experimental conditions used to 
measure the chemical shift anisotropy and heteronuclear dipolar 
coupling powder patterns (From Das et al. 2012) 


nuclei, such as and during detection (Xhao 
et al. 2001). In particular, dipolar frequency 

measurements with high sensitivity detection is 
a crucial development in application to proteins. 


In solid-state NMR, depending upon the correlation 
time of motion of molecules, resonance assignment 
protocols are based on bond J-coupling and space 
dipole-dipole interactions. The dipolar coupling 
decreases with the third power of the internuclear 
distance. Due to the phenomenon of dipolar truncation, 
the polarization transfer in recoupling experiments on 
uniformly labeled proteins is limited to one-bond 
transfers; this makes it particularly well suited for 
assignment experiments. 


Assignments 

Unidirectional polarization transfer techniques can be 
used to make backbone and side chain assignments. 
Resonance assignments are made based on two- 
dimensional correlation, homonuclear spin 

exchange, and heteronuclear correlation spec¬ 

troscopy. These experiments also serve as basic build¬ 
ing blocks for higher dimensional experiments used in 
assignment protocols. Two-dimensional cor¬ 

relation experiments can be performed in several ways: 
(1) proton-driven spin diffusion (PDSD) (Szeverenyi 
et al. 1982; Frey and Opella 1984), (2) dipolar-assisted 
rotational resonance (DARR) condition (Takegoshi 
et al. 2001), and (3) homonuclear dipolar recoupling 
schemes using zero or double quantum filtering tech¬ 
niques (Hohwy et al. 1999). Carbon-carbon spin diffu¬ 
sion through Z-magnetization in the laboratory frame in 
PDSD and by proton irradiation in DARR is widely 
used in carbon-carbon spin exchange. Mixing with a 
short period up to 50 ms essentially correlates nuclei 
with one-bond, and a longer mixing period up to 500 ms 
provides total intraresidue and interresidue correlations. 

Triple resonance experiments utilizing nitrogen, 
carbonyl carbon, and alpha carbon nuclei present in 
peptide planes require efficient polarization transfer 
from nitrogen to carbon in a unidirectional manner. 
The spectral editing via nuclei simplifies the spec¬ 
trum and leads to a high-resolution multidimensional 
experiment. This is achieved in a two-stage polariza¬ 
tion transfer pathway: (1) magnetization transfer from 
amide proton to nitrogen and (2) amide nitrogen to 
either carbonyl carbon or alpha carbon. This is 
obtained by setting the appropriate spin lock field 
strength on the and ramping through the 

SPECIFIC CP match condition on the ^^C channel 
(Fig. 2). 
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Membrane Proteins in Phospholipid Bilayers: Structure 
Determination by Solid-State NMR, Fig. 2 Examples of 
two- and three-dimensional spectroscopic data obtained on uni¬ 
formly '^C/'^N-labeled MerFt in phospholipid bilayers with 
'^C-detected MAS solid-state NMR. (a) Two-dimensional 
dipolar coupling/'^N chemical shift separated local 
held spectrum, (b)-(c) Two-dimensional dipolar 

coupling/15N chemical shift spectral planes at selected 15N 


chemical shift frequencies in a three-dimensional ’^C-detected 
dipolar coupling/'^N chemical shift spectrum. The sig¬ 
nals coiTesponding to Leu 31 and Asp 69 are marked in the two- 
dimensional spectrum (a) and the signals corresponding to Gly 
14, Leu 31, and D 69 are marked in the two-dimensional spectral 
planes at the '^N chemical shift frequencies of 119.6, 117.0, and 
108.8 ppm, respectively (From Das et al. 2012) 


Addition of extra evolution of chemical shift fre¬ 
quency followed by coherence transfer pathway 
between homonuclei or heteronuclei leads to two- 
dimensional spectrum and further resolution enhance¬ 
ment. Total correlation of carbon nuclei was achieved 
using 50 ms mixing. The heteronuclear corre¬ 

lation spectrum has a single signal between each 
bonded amide nitrogen and alpha carbon in the protein. 
This can be verified in expanded spectra of relatively 
small membrane proteins. Larger proteins require 
three-dimensional spectra. 

In addition to resolution, three-dimensional experi¬ 
ments such as NCACO, NCOCA, and CANCO enable 
the assignment of the backbone resonances (Baldus 
2007). In NCACO-type three-dimensional experi¬ 
ments, both heteronuclear polarization transfer and 
homonuclear spin exchange are utilized. The 
chemical shift interaction evolves before SPECIFIC 
CP (Petkova et al. 2003) transfer to the *^CA nuclei. 
Adding *^CA chemical shift evolution followed by 
*^C/^^C spin diffusion mixing period yields a three- 
dimensional correlation spectrum. In a similar fashion, 
the NCOCA three-dimensional experiment is 
performed, except that SPECIFIC CP is from to 
*^C' followed by ^^C/'^C spin diffusion. The connec¬ 
tions between CAi and COi chemical shift frequencies 
are encoded in the intraresidue NjCAjCOi spectra by 
the interresidue Nj+iCOiCA; correlation experiments. 
This provides an efficient means for identifying amide 
pairs in neighboring residues. In crowded spectrum, 


well-dispersed chemical shifts of side chains can be 
added for assignment with high-resolution data acqui¬ 
sition. When three-dimensional NCACX data are 
available, side chain resonance patterns can often be 
observed in the direct CX dimension, enabling residue 
types to be identified. Similarly, a three-step cross 
polarization transfer used in CAN(CO)CA-type exper¬ 
iments can also be highly effective in making assign¬ 
ments in crowded spectra. 

Measuring Orientationally Dependent 
Frequencies 

Recently developed pulse sequences enable recoupling 
of undistorted powder line shapes of chemical shift 
anisotropy (Chan and Tycko 2003) and dipolar powder 
patterns in MAS solid-state NMR spectra. 
Heteronuclear dipolar couplings are recoupled under 
MAS in two ways, either by proton-detected local field 
(POLE) or by separated local field (SLF) spectroscopy. 
Levitt and coworkers (Xhao et al. 2001) have shown 
that a series of symmetry pulses for recoupling of 
heteronuclear dipolar couplings R18i^ and R 182 ^ are 
particularly useful. R18i^ consists of a pair of 
I 8 O 70180-70 pulse pairs, where each 180° pulse 
occupies exactly 1/18 of a rotor period. A total of nine 
repetitions of pulse pairs are applied over one rotor 
period and this leads to recoupling of dipolar 

couplings and *H chemical shift anisotropies (CSA) 
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Membrane Proteins in Phospholipid Bilayers: Structure 
Determination by Solid-State NMR, Fig. 3 Examples of 
spectroscopic data for residue L31 obtained from NMR spectra 
of uniformly *^C/'^N-labeled MerFt in proteoliposomes at 25°C. 
A-C Simulated powder patterns for a static peptide bond. A 
amide CSA. B heteronuclear dipolar coupling. 

C heteronuclear dipolar coupling. D-F Experimentally 

measured powder patterns from L31 in MerFt undergoing fast 
rotational diffusion. The dashed vertical lines mark the parallel 
(II) and perpendicular (±) edge frequencies of the powder 


patterns. As described in the Experimental Methods section, 
the DC and DC frequencies entered in 

Table SI were derived from the perpendicular edge frequencies 
of the rotationally averaged powder patterns; the measured 
values of the DCs were multiplied by 4 to obtain the full DC 
value corresponding to the twice the parallel edge frequency. For 
the *^N CSA, the perpendicular edge frequency was reduced to 
its traceless value and then multiplied by 2 to obtain the full CSA 
value (From Das et al. 2012) 


when applied on the 'H channel while homonuclear 
*H/^H dipolar couplings are suppressed. A two-pulse 
phase modulation with 70° and —70° phases for each 
pulse length of 5 ps was used in R18i^ cycle under 
11.111 kHz spinning (Fig. 3). 

Two-dimensional *^C/*^N-detected separated local 
field (SLF) experiments that correlate 'H-^^C/*H-'^N 
DC frequencies with isotropic chemical shifts 

were performed in two steps. First, magneti¬ 

zation was generated by cross polarization from *H 
prior to dipolar evolution. Second, or 

dipolar evolution was carried out under R18i^ symme¬ 
try pulse sequences to assist in the simultaneous 
recoupling of the heteronuclear dipolar couplings and 
decoupling of homonuclear dipolar couplings. 

dipolar coupling frequencies were 
encoded in a constant time manner with refocusing of 
chemical shifts and rapid dephasing of the 


signals. Free evolution of isotropic chemical 
shift frequencies are measured under heteronuclear 
decoupling of and Following dipolar evo¬ 

lution, magnetization was transferred to 
using SPECIFIC CP, and the '^C signals were acquired 
with *H decoupling. Three-dimensional experiments 
were carried out in a similar fashion with incorporation 
of a chemical shift evolution period in the third 
dimension; nine repetitions of the paired pulses over 
one rotor period and a dwell time of 90 ps. 

Previously, stationary sample and MAS solid-state 
NMR (McDermott 2009) were used to study mem¬ 
brane proteins in lipid environments using uniaxially 
oriented and unoriented samples, respectively. The 
MAS experiments were performed primarily at low 
temperatures, which suppressed global motions, but 
enhanced sensitivity. The global dynamics in the 
form of fast rotational diffusion about the bilayer 
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Membrane Proteins in 
Phospholipid Bilayers: 
Structure Determination by 
Solid-State NMR, 

Fig. 4 Three-dimensional 
structures of two membrane 
proteins determined by 
rotationally aligned solid-state 
NMR in phospholipid 
bilayers, (a) MerFt with 60 
residues and 2 transmembrane 
helices, (b) CXCRl with 350 
residues and 7 transmembrane 
helices 
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normal are essential for RA solid-state NMR; it has 
been studied in magnetically aligned bicelles as well as 
bilayers (Nevzorov et al. 2005; Park et al. 2011). The 
angular measurements obtained from ^H-^^N/*H-*^C 
dipolar couplings and chemical shift anisotropy 
are equivalent, whether made from unoriented samples 
in stationary or magic angle spinning (MAS) solid- 
state experiments or from magnetically aligned sam¬ 
ples using oriented sample (OS) solid-state NMR 
methods. Pulse sequences optimized for oriented and 
MAS solid-state NMR experiments for structural char¬ 
acterization based on the dynamics of the membrane 
proteins in their native-like environment with mea¬ 
surement of heteronuclear dipolar couplings and struc¬ 
ture of a 60-residue mercury transporter protein have 
been demonstrated (Marassi et al. 2011) These exper¬ 
iments can be used to recouple motionally averaged 
powder patterns from a *^N/^^C-labeled membrane 
protein undergoing uniaxial rotational diffusion in 
phospholipid liposomes. The and *H-*^N 

dipolar coupling and and ^^N chemical shift fre¬ 
quencies measured from the edges of these powder 
patterns provide orientation-dependent structural 
restraints for protein structure determination. 

Calculation of Protein Structures 

The three-dimensional structures of the proteins are 
calculated using a two-step procedure. First, the 
fragment-replacement approach embodied in the pro¬ 
gram Rosetta (Das and Baker 2008) is used to generate 
at least 10,000 structural models for the protein 


sequence in an implicit membrane environment. 
Second, the lowest energy model that emerges from 
Rosetta is subject to refinement by simulated annealing 
in XPLOR-NIH (Schwieters et al. 2006) using the 
experimental constraints. About 100 refined structures 
are calculated and the 10-20 lowest energy structures 
used for analysis (Fig. 4). 

Summary 

The combination of MAS solid-state NMR and OS 
solid-state NMR on uniformly *^C- and *^N-labeled 
protein samples provides a general approach to structure 
determination of membrane proteins, which is referred 
to as rotationally aligned (RA) solid-state NMR 
(Marassi et al. 2011; Das et al. 2012). There are several 
advantages to this method. They include that the protein 
sequences are unaltered; there are no mutations, tmnca- 
tions, insertions of other proteins, or the addition of 
stabilizing antibody fragments or small molecules. The 
proteins reside in their native environment in phospho¬ 
lipid bilayers under physiological conditions of temper¬ 
ature and pH. Ligand, including other proteins, 
antibodies, small molecules, and changes in the lipids 
of the membrane bilayers can be added to the samples. 
NMR is adept at describing the structure and dynamics 
of proteins in an integrated manner. 

At present, solid-state NMR is the only method 
capable of determining the three-dimensional struc¬ 
tures of membrane proteins in their native phospho¬ 
lipid bilayer environment. The structures have atomic 
resolution and have the property that any experimental 
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errors do not accumulate since all measurements are 
made relative to a single external axis, the bilayer 
normal, assuring the accuracy of the structures. Spec¬ 
troscopic and structural differences between samples 
of the proteins alone in the bilayers and in the presence 
of proteins or drugs that they bind to provide the 
possibility that previously inaccessible information of 
biological and biomedical interest will emerge from 
these studies without concerns for the distortions of 
nonnatural lipids or protein sequences. 
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Synonyms 

Lipid, protein interactions; Membrane protein biogen¬ 
esis; Protein Fluorescent Dye Labeling; Self-assembly 

Definition 

Folding of amino acid sequences to stable, functional 
proteins that are integral to cellular membranes and 
traverse the membrane and perform essential roles in 
cell communication, transport, and ATP production. 

Introduction 

Biological membranes are complex environments, 
where membrane proteins are surrounded by a bilayer 
composed of many different types of lipids. The phys¬ 
ical properties of the bilayer influence protein structure 
and folding, and must be taken into account when 
working with integral membrane proteins in vitro. 

Biological Membrane Organization 

Biological membranes perform a number of roles that 
are crucial to organism survival. At a basic level, they 
provide a permeability barrier to prevent proteins, ions, 
and metabolites from leaking out of the cell and 
unwanted toxins leaking in. Functionally, membranes 
are involved in nutrient uptake and waste export, 
osmotic homeostasis, cell-to-cell signaling, and motil¬ 
ity. They also maintain electrochemical gradients and 
compartmentalize the cell. As such, biomembranes 
have important roles in respiration and photosynthesis 
and support considerable enzymatic activity. To accom¬ 
plish these diverse tasks, membranes have 
a correspondingly complex organization. A lipid bilayer 


forms the basic fabric of a biological membrane. A wide 
variety of lipids are present in nature, with differing 
distributions between organisms and membranes within 
organisms and between the inner and outer leaflets of 
the constituent membrane bilayers. Gram-negative bac¬ 
teria such as Escherichia coli are surrounded by two 
membranes, the inner membrane containing a high per¬ 
centage of lipids with a phosphatidylethanolamine (PE) 
headgroup, together with phosphatidylglycerols (PG) 
and cardiolipin. The second, outer membrane is highly 
asymmetric, with its two monolayer leaflet having dif¬ 
ferent lipid compositions; the inner leaflet is similar to 
that of the inner membrane, but the outer leaflet is 
largely composed of lipopolysaccharides. Contrast¬ 
ingly, Gram-positive bacteria have only one cytoplas¬ 
mic membrane that is predominately composed of PG- 
and diphosphatidylglycerol-based lipids with mainly 
saturated and branched acyl chains (Wilkinson 1988; 
O’Leary and Wilkinson 1988). Mammalian cells have 
several distinct membrane environments, with the lipid 
composition of subcellular membranes varying consid¬ 
erably from one another. For example, cytoplasmic 
membranes contain mostly phosphatidylcholine (PC), 
PE, phosphatidylserine, sphingomyelin, and choles¬ 
terol, while in the endoplasmic reticulum and mitochon¬ 
drial membranes, cholesterol is present at much lower 
concentrations (Yorek 1993). Most of these biolipids 
are examples of type II lipids, which form monolayers 
that have a tendency to curve toward water, though 
different lipids can form either lamellar (bilayer) or 
nonlamellar (nonbilayer) phases (Israelachvili 2011). 
However, the phase properties of biological membranes 
are far more complex as a result of the broad mix of 
lipids, the monolayer asymmetry, and the presence of 
proteins embedded within the membrane. Additionally, 
microdomains within the membrane, known as lipid 
rafts, have been more recently observed. These rafts 
are proposed to be transient structures that are generally 
enriched in cholesterol, sphingolipids, and saturated 
phospholipids compared to the surrounding bilayer, cre¬ 
ating localized areas of increased order and bilayer 
thickness. 

Membrane Proteins 

The fluid mosaic model, originally proposed in 1972, 
describes the membrane as a 2D lipid matrix within 
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Membrane Proteins: 
Folding and Stability, 

Fig. 1 Schematic of 
membrane proteins in 
commonly used in vitro 
systems, (a) The beta-barrel 
outer membrane protein PagP 
surrounded by detergent 
molecules in a micellar 
structure (protein data bank 
file ITHQ). (b) The alpha- 
helical bundle 

bacteriorhodopsin in a bicelle, 
with lipids forming a central 
bilayer structure that is edged 
by detergent molecules (pdb 
file 3NS0). (c) The two 
domain alpha-helical 
transporter LacY in 
a unilamellar liposome, 
a sphere formed from a single 
bilayer with an internal 
aqueous cavity (pdb file 
2V8N) 



which proteins can freely diffuse (Singer and Nicolson 
1972). This concept has been further developed, both 
by the concept of lipid microdomains and by the dis¬ 
covery that there are more proteins within the mem¬ 
brane than originally estimated, with anywhere from 
20 % to 80% of the membrane fraction, by weight, 
being composed of protein (Engelman 2005). Current 
genomic analysis suggests that around 30% of the total 
protein content of a cell is membrane protein. Many of 
these proteins are bound only peripherally to the mem¬ 
brane and can be relatively easily dissociated from it. 
Proteins can interact with the bilayer in several ways. 
Some bind nonspecifically to the membrane surface 
due to electrostatic interactions between the protein 
and anionic lipid components, other proteins have 
domains that directly bind specific lipids, for example, 
many proteins involved in signaling have one or more 
pleckstrin homology (PH) domains that interact with 
phosphatidylinositol lipids. Another method of driving 
protein-membrane association is by the posttransla- 
tional covalent modification of proteins with a lipid 
group, commonly myristoyl or palmitoyl acyl chains, 
that inserts into the bilayer and anchors the protein to 


the membrane (Luckey 2008). Integral membrane pro¬ 
teins are fully embedded in, and traverse, the bilayer 
and typically require the complete disruption of the 
membrane to extract them. In contrast to water-soluble 
proteins, integral membrane proteins must reside in 
a strongly hydrophobic environment, where the energy 
cost of transferring completely unfolded peptides with 
their exposed, polar peptide bonds into the nonpolar 
bilayer is high. To reduce this energy, two basic struc¬ 
tural motifs have emerged where the secondary struc¬ 
tural elements result in complete internal hydrogen 
bonding of the main peptide backbone. Beta-barrels 
are found in the outer membranes of bacteria and 
mitochondria, where an antiparallel beta-sheet curves 
round in a circular pattern to form a stable and rigid 
structure with all the intrastrand hydrogen bonding 
requirements satisfied. In other membranes, integral 
membrane proteins comprise the other major structural 
motif, the alpha-helical bundle (Fig. 1). Despite 
the more limited structural space, there is considerable 
variety within these two basic categories, with classifi¬ 
cation systems for membrane transporters (http://www. 
tcdb.org), G-protein coupled receptors (GPCRs), 
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and ion channels (http://www.iuphar-db.org) being 
developed on a rational basis. Classes of transporters 
include porins, electrochemical potential-driven trans¬ 
porters, primary active transporters and transport elec¬ 
tron carriers. The major class of GPCRs is the Class 
A Rhodopsin-like receptors. Despite the biological 
importance of membrane proteins, there have been 
relatively few 3D structures solved compared to that 
of soluble proteins, with around 300 distinct stmctures 
discovered to date (see updates at http://blanco.biomol. 
uci.edu/mpstruc). 

Working with Membrane Proteins In Vitro 

One of the primary difficulties in working with mem¬ 
brane proteins is obtaining sufficient quantities of puri¬ 
fied protein for study. With a few exceptions, 
membrane proteins tend to be present in relatively 
low levels in their native bilayer, and heterologous 
overexpression readily leads to misfolded protein in 
the form of inclusion bodies in the cytoplasm. There¬ 
fore, high yields of functional protein are rarely 
obtained. The heterogeneric and dynamic membrane 
environment is also not amenable to many of the stan¬ 
dard biophysical techniques for investigating structure 
and function. Most studies therefore require the protein 
to be extracted from its bilayer environment into aque¬ 
ous solution. The high hydrophobicity and correspond¬ 
ingly low solubility of membrane proteins mean that 
they have a strong tendency to aggregate in solution; 
therefore, the protein has to be reconstituted into 
another model system that can act as a mimic to the 
native bilayer (Seddon et al. 2004, Fig. 1). Detergents 
have for a long time been used for this purpose. Deter¬ 
gents are amphiphilic surfactants, consisting of a polar 
headgroup and hydrophobic tail, that self-assemble 
into spherical micellar structures in aqueous solutions 
above a characteristic concentration, known as the 
critical micelle concentration. They are particularly 
important for the crucial stage of solubilizing the pro¬ 
tein from the bilayer. Detergent added to cell mem¬ 
branes will first integrate into the bilayer, then at higher 
concentrations disrupt the membrane to form protein- 
lipid-detergent complexes, and eventually displace the 
lipid molecules to form protein-containing micelles. 
This allows for purification of the target protein and 
is a suitable system for many standard downstream 
biophysical applications, such as fluorescence, circular 
dichroism, and isothermal calorimetry. Flowever, the 
micelle is a relatively poor mimic for the bilayer 


environment, and many membrane proteins are inac¬ 
tive or unstable under these conditions. Alternative 
model systems that more closely resemble the native 
environment are therefore desirable. One such system 
is the bicelle, a discoidal structure formed when certain 
short-chain lipids or detergents (DHPC or CHAPS/ 
CHAPSO) and long-chain lipids (DMPC) are mixed 
under defined proportions, concentrations, and temper¬ 
ature. In these conditions, the long-chain lipid parti¬ 
tions into a central flat bilayer structure with the 
detergent/short-chain lipid forming a solubilizing ring 
around the edge. These bicelles have several advan¬ 
tages over traditional detergent micelles. They provide 
a more natural bilayer environment for the protein 
without the extreme curvature of the micelles, they 
have a much lower total detergent concentration, and 
they can be magnetically aligned, therefore lending 
themselves to NMR. They have been used as 
a substitute for detergent in x-ray crystallography stud¬ 
ies. More recently, similar structures, known as 
nanodiscs, have been developed that comprise 
a similar central bilayer with the edge provided by 
two copies of the amphipathic membrane scaffold 
protein (MSP). Different sized nanodiscs are made by 
using different lengths of MSP. Another common 
approach to working with membrane proteins in vitro 
is to reconstitute the protein into liposomes. Lipo¬ 
somes are closed spherical vesicles with an internal 
aqueous compartment surrounded by, usually, a single 
phospholipid bilayer. They can range in size from 
small unilamellar vesicles of 20-50 nm in diameter, 
through large unilamellar vesicles of 50 nm-1 um, to 
giant vesicles of greater than 5 um. Proteins are 
reconstituted into the liposome bilayer by several tech¬ 
niques, most commonly by dilution or detergent 
presaturation. A dilution approach involves adding 
the detergent-solubilized protein to the vesicles such 
that the concentration of detergent drops below 
the CMC. As the micelle becomes destabilized, the 
protein partitions into the bilayer. With detergent 
presaturation, low concentrations of detergent are 
added to the liposome, disrupting lipid-lipid contacts 
and facilitating the uptake of the protein. Excess deter¬ 
gent is subsequently removed by dialysis, absorption 
by hydrophobic beads, or chromatography. Liposomes 
provide a good bilayer mimic compared to micelles, 
and can be made using a much wider range of lipids 
and conditions than bicelles, allowing for the study of 
both specific and global lipid effects on the proteins. 
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Membrane Proteins: Folding and Stability, Fig. 2 (continued) 
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However, the comparatively high lipid to protein ratio 
and background light scattering can make this system 
difficult to use with some, especially spectroscopic, 
techniques. Liposomes are a particularly good system 
for studying transport proteins, as the isolated internal 
compartment provides the directionality required for 
measurements of activity. 

Membrane Protein Stability 

In all these in vitro systems, the surrounding hydro- 
phobic environment can have a profound effect on the 
stability of integral membrane proteins. For proteins in 
micelles, the choice of detergent can substantially alter 
stability. The GPCR rhodopsin is significantly more 
thermostable in nonionic dodecyl maltoside rather than 
zwitterionic lauryldimethylamine oxide. Bicelle struc¬ 
tures were more stabilizing than either micellar system 
(McKibbin et al. 2007). Some proteins have lipids 
specifically bound that increase thermostability, 
such as cardiolipin bound to the purple bacterial reac¬ 
tion center or cholesterol bound to some GPCRs. How¬ 
ever, the global bilayer environment can also 
contribute toward protein stability. Bacteriorhodopsin 
reconstituted into vesicles composed of the saturated 
lipid 1,2-dimyristoyl-jn-glycero-S -phosphocholine 
(DMPC) denatured at a lower temperature than 
when the unsaturated l,2-dioleoyl-^n-glycero-3- 
phophocholine (DOPC) was added. The addition of 
the unsaturated lipid increased both the curvature elas¬ 
tic stress and the bilayer lateral pressure in the acyl 
chain center of the bilayer, leading to an increase in 
thermostability in the protein (Allen et al. 2004). 
Changes in curvature stress can also affect beta-barrel 


proteins. The bacterial outer membrane porin OmpA 
can be fully unfolded from liposomes by the com¬ 
monly used denaturant urea. Increasing the fraction 
of nonlamellar PE lipids in PC/PG bilayers was found 
to stabilize the protein, as was increasing the bilayer 
thickness by increasing the length of the lipid acyl 
chains (Tamm et al. 2004). 

Protein Folding In Vitro 

The process of folding and assembly of a protein from 
a polypeptide chain to its native 3D structure is of 
fundamental importance, yet the general principles of 
protein folding have emerged from extensive investi¬ 
gations on small soluble proteins. The study of mem¬ 
brane protein folding has lagged far behind, and it is 
unclear how these water-soluble principles will trans¬ 
late, particularly as integral membrane proteins must 
expose, rather than hide, their hydrophobic surfaces to 
the solvent environment. The two-stage model for 
alpha-helical membrane protein folding suggests 
a simplified process with two distinct stages of protein 
insertion and then folding (Engelman et al. 2003). In 
the first stage, individual transmembrane helices insert 
stably in the bilayer and then form the important helix- 
helix associations and tertiary structure in the second 
stage to give the final protein fold. This model is 
oversimplified as not all transmembrane helices are 
independently stable; nonetheless, the model can pro¬ 
vide a useful basis for in vitro folding studies. Investi¬ 
gating folding first involves denaturing the protein. 
With membrane proteins, thermal denaturation mostly 
results in irreversible unfolding and aggregation and so 
chemical denaturants, such as SDS, urea, and GuHCl, 


< - 

Membrane Proteins: Folding and Stability, Fig. 2 Sche¬ 
matics of proposed membrane protein folding models, in vivo 
and in vitro, (a) Cotranslational expression of a helical bundle 
integral membrane protein. The translated peptide passes 
directly from the ribosome into the translocon, from which the 
highly hydrophobic transmembrane segments exit through the 
lateral gate into the surrounding bilayer. After exiting the 
translocon, the helices assemble into the final fold (pdb file 
3NS0). (b) Posttranslational expression of an outer membrane 
beta-barrel protein. The chaperone protein SecB binds trans¬ 
lated, unfolded protein in the cytosol and transfers it to SecA. 
ATP is used to drive transport through the translocon into the 


periplasmic space where the protein associates with the inner 
leaflet of the outer membrane. Membrane insertion and 
folding into the final barrel structure happen simultaneously 
(pdb file ITHQ). (c) In vitro folding of a helical membrane 
protein. Protein partially unfolded in SDS is mixed with 
lipids to dilute out the denaturant. In the bilayer, the original 
secondary structural content is recovered and the final fold is 
formed (pdb file 3NS0). (d) In vitro folding of an outer 
membrane protein. Fully denatured protein in urea is diluted 
into lipids where it passes through a membrane-absorbed state 
before folding and inserting across the bilayer simultaneously 
(pdb file ITHQ) 
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are generally used. Unlike water-soluble proteins, 
alpha-helical membrane proteins tend not to unfold 
fully in these denaturants but retain significant second¬ 
ary structure. This corresponds to the first stage of the 
model, with substantial helical elements remaining as 
a folding core that seems to be critical for successful 
folding in vitro. Refolding these partially denatured 
stages into detergent or lipid systems allows the study 
of the critical helical packing of the final stage of 
folding (Fig. 2). Several proteins have now been stud¬ 
ied in this way including diacylglycerol kinase, KscA, 
EmrE, and GalP. However, the first and most detailed 
investigations have involved the light-driven, 
archaebacterial proton pump bacteriorhodopsin (bR). 
The protein has seven transmembrane helices and is 
partially denatured by SDS with the loss of about 30% 
of its secondary structure and its natively, covalently 
bound retinal cofactor. The unfolding is fully revers¬ 
ible by dilution of the SDS state into DMPC/CHAPS 
mixed micelles, allowing the calculation of the ther¬ 
modynamic properties of the protein, including the 
free energy for unfolding. SDS-denatured bR can also 
be refolded into lipid vesicles with very high yields of 
regeneration. It was found that the refolding yield 
could be adjusted in a predictable manner by changes 
to the bilayer curvature stress (Allen et al. 2004). The 
addition of nonlamellar lipid to DPoPC vesicles 
increased the curvature stress and decreased recovery 
of functional protein, while, correspondingly, relaxing 
the bilayer by the addition of saturated or single chain 
lipids increased the folding yield. The folding pathway 
for beta-barrel proteins is very different to that of 
helical bundle membrane proteins (Tamm et al. 
2001). They are much less hydrophobic and can gen¬ 
erally be fully unfolded by high concentrations of 
either urea or GuHCl; they are then proposed to absorb 
onto the bilayer, and then fold and insert in one step. 
Reversible folding systems have been developed for 
several outer membrane proteins including OmpA, 
OmpG, and PagP. 

Transition States 

To fold correctly, proteins must overcome an activa¬ 
tion barrier to pass through a high-energy transition 
state. Understanding this transition state is vital to 
resolving how a protein folds to its native structure. 
Phi-value analysis is a method combining site-directed 
mutagenesis, equilibrium thermodynamics and kinetic 
data to probe the transition state structure and 


energetics during folding in vitro. To use this method, 
the folding system must be microscopically reversible 
and ideally two-state, as well as described experimen¬ 
tally by linear free energy changes with denaturant 
concentration. A phi-value is a measure of the change 
in activation energy relative to the change in the over¬ 
all free energy of folding, induced by the directed 
mutation of a single amino acid. Phi-values tend to 
fall between zero and one, where a value of zero 
means that the mutation has changed the overall free 
energy but not the activation energy, whereas a value 
of one means that the change in the free energy of 
activation is the same as the overall change in free 
energy. These values can be interpreted in terms of 
the structure of the transition state, where high values 
close to one indicate native-like interactions near the 
mutation site, and low values close to zero indicate 
a denatured structure. This method has been applied to 
many small water-soluble proteins, but to date, only 
two membrane proteins have been analyzed this way, 
bR and PagP. Mutations have been carried out along 
two of the seven helices of bR, together with some 
double mutants to probe specific hydrogen bonds, and 
phi-values calculated. The values on helix B were 
mostly close to one, with some intermediate values 
measured on the cytoplasmic end of the helix where 
it packs against other helices (Curnow and Booth 

2009) . In contrast, mutations along helix G were 
mostly low or intermediate. Taken together, these 
results suggest that helix B folds early and is part of 
a stable folding nucleus with native-like structure in 
the transition state, while helix G is more unstructured 
and constitutes a relatively late folding event (Cumow 
et al. 2001). Phi-value analysis has been extended to 
the beta-barrel membrane protein PagP. In an appro¬ 
priate lipid environment, PagP could be fully and 
reversibly unfolded in urea in a two-state transition 
from an unstructured lipid-associated state to the 
native structure. Nineteen mutations were made that 
spanned the eight beta-strands that comprise the barrel. 
Phi-value analysis of these mutations implied that the 
transition state is polarized, with the C-terminal part of 
the protein more structured, suggesting a concerted 
tilted folding-insertion mechanism (Huysmans et al. 

2010) . Several intermediate phi-values were observed 
in both the PagP and bR studies. These values can arise 
from multiple contacts, transition states, or folding 
paths or could potentially suggest contributions from 
the surrounding bilayer environment. 
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Protein Folding In Vivo 

The mechanisms for membrane protein folding in vivo 
are also becoming clearer. Most membrane proteins 
are exported from the cytosol through a highly con¬ 
served membrane protein complex called the 
translocon. This protein-conducting channel is also 
responsible for the secretion of proteins targeted for 
extracellular locations, the periplasm or the outer 
membrane (du Plessis et al. 2011). It must therefore 
operate in two dimensions, opening horizontally for 
secreted proteins and laterally for membrane proteins. 
The core of the membrane complex is the 
heterotrimeric SecYEG complex in bacteria (Sec61 
in eukaryotes). Crystal structures show a “clamshell”- 
like structure, showing both a central peptide channel 
and a potential lateral gate, allowing access to the lipid 
bilayer. In cotranslational translocation, the most com¬ 
mon method for membrane proteins, the ribosome 
binds directly to the translocon such that the growing 
peptide passes directly into the complex. Nascent poly¬ 
peptide chains are targeted to the translocon by 
a cleavable N-terminal signal sequence or the first 
transmembrane segment of a membrane protein, 
known as a signal anchor. During translocation, open¬ 
ing of the lateral gate exposes the transmembrane 
segments to the lipid bilayer, where their length and 
hydrophobicity cause them to partition into the bilayer. 
Once the helices have been inserted into the mem¬ 
brane, the final folding can take place (Fig. 2). In this 
way, the in vitro model can be seen to reproduce this 
posttranslocon phase of folding. Beta-barrels interact 
with the translocon slightly differently. The unfolded 
protein is bound posttranslationally in the cytosol by 
the chaperone SecB, which then transfers the peptide 
to the ATPase SecA, complexed with SecYEG. SecA 
drives the translocation, where the low hydrophobicity 
of the extended beta-strands allows the protein to pass 
through into the periplasm. Although chaperones are 
likely to be involved in preventing aggregation, the 
unfolded protein can bind and fold directly into the 
outer membrane in a similar manner to the in vitro 
folding studies. 

Summary 

Biological membranes are complex environments, 
reflected in the large variety of lipid composition and 


organization and the range of membrane protein struc¬ 
tures. Lipid-protein interactions profoundly influence 
the folding and stability of integral membrane proteins. 
When working with membrane proteins in vitro, it is 
important to be aware of the potential influence of the 
bilayer. Synthetic lipid systems mimic the native mem¬ 
brane environment while creating bilayer conditions 
that can be simply, and increasingly predictably, 
manipulated. In vitro folding systems are beginning 
to be established to enable the detailed mechanistic 
study of membrane protein folding of both helical 
and barrel-type structures and to allow comparisons 
with in vivo folding pathways. 

Cross-References 
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► Critical Fluctuations in Lipid Mixtures 

► Functional Roles of Lipids in Membranes 

► Homology Modeling of Protein Structures 

► Lipid Bilayer Asymmetry 

► Lipid Bilayer Lateral Pressure Profile 

► Lipid Domains 

► Lipid Shape and Curvature Stress 

► Lipids: Isolation and Purification 

► Phospholipid Bilayer Nanodiscs: Application to 
Single-Molecule Measurements 
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Membrane Proteins: Structure and 
Organization 

Alan Goddard, Joanne Oates and Anthony Watts 
Biomembrane Structure Unit, Department of 
Biochemistry, University of Oxford, Oxford, UK 

Synonyms 

Integral Membrane Protein Structure 


Definition 

Membrane proteins are those which are integral to, or 
associated with, the various lipid membranes within 
and around cells. 


Introduction 

Membrane proteins are classified by their method of 
attachment into four main groups: integral, peripheral, 
partially penetrating, and lipid-anchored. Additionally, 
integral proteins can be further divided by the type of 
fold they possess - either alpha-helical or beta-barrel. 
Traditionally, these proteins have been challenging to 
study due to their hydrophobic nature and low natural 
abundance. 

Integral Membrane Proteins 

Integral membrane proteins constitute a significant 
proportion of the total protein of most cells and are 
characterized by insertion through the plasma mem¬ 
brane, resulting in permanent attachment. As such 
they can only be removed by relatively harsh 
methods including detergent solubilization or the 
use of nonpolar solvents. The proteins are 
surrounded in the membrane by annular lipids - 
those that are in direct contact with the protein. 
Such lipids have been implicated in the physiologi¬ 
cal activity of certain integral membrane proteins. 
As expected, the regions of the protein in contact 
with the lipids tend to contain hydrophobic residues. 
Integral membrane proteins can be divided into two 
classes, based upon the precise nature of their inter¬ 
action with the membrane. Monotopic membrane 
proteins are those which are attached to the mem¬ 
brane from one side only. These proteins/domains 
carry out their function on the side of the membrane 
in which they are located. Examples of these kinds of 
proteins include carnitine palmitoyltransferase-2 
from rat, which is located to the inner leaflet of the 
mitochondrial inner membrane; prostaglandin H2 
syntheses 1 and 2; microsomal prostaglandin 
E synthase; and the monotopic domains of fatty 
acid amide hydrolase and monoamine oxidase B. 
The latter two examples also contain transmembrane 
(TM) domains; however, the majority of the proteins 
including the active site are located monotopically, 
on one side of the membrane. In contrast, polytopic 
or transmembrane proteins span the membrane 
completely, and are able to carry out functions on 
both sides of the membrane. They comprise over 
one-quarter of the total protein within an organism 
and have many important cellular roles: acting as 
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receptors, forming ion channels and translocons, as 
well as energy-generating pumps and reaction cen¬ 
ters. These proteins can have varying numbers of 
transmembrane domains ranging from one, such as 
the receptor tyrosine kinase family, to several, such 
as the seven TM G protein-coupled receptor and are 
classihed as either single- or multi-pass. Integral 
membrane proteins can adopt two types of fold: 
alpha-helical or beta-sheet. Alpha-helices are by far 
the most represented structure for a protein in 
a bilayer due to energetically favorable hydrogen- 
bond formation within the protein backbone. Beta- 
barrels are also low energy structures present in 
bacteria, mitochondria, and chloroplasts, although 
they have only been studied in detail in the outer 
membrane of Gram-negative bacteria. 

Peripheral Membrane Proteins 

Peripheral membrane proteins are loosely associ¬ 
ated with the polar head group region of the lipid 
bilayer in a non-covalent manner. Unlike integral 
membrane proteins they do not contact the hydro- 
phobic interior of the bilayer. Typically peripheral 
proteins are maintained in close proximity to the 
bilayer through interaction with other membrane 
proteins. The loose membrane association means 
that this group of proteins can be isolated from 
the bilayer using relatively mild conditions without 
the need for detergents. Cytochrome c is one such 
protein which interacts with components of the 
electron transport chain but does not penetrate the 
bilayer. 

Partially Penetrating Membrane Proteins 

In addition to those proteins which are peripheral 
or integral to the membrane are a class of proteins 
which associate with both the polar heads of the 
bilayer and also with the hydrocarbon tails of the 
lipids. These are described as partially penetrating 
membrane proteins. These proteins may use 
a surface loop, exposed via a conformational 
change, to interact with the membrane. Addition¬ 
ally, membrane-targeting domains and peripheral 
membrane proteins may also be partially penetrat¬ 
ing during the interaction process. Examples of 


this type of protein include the antimicrobials 
magainin and dermaseptin. 

Lipid-Anchored Membrane Proteins 

Many proteins associate with the membrane via cova¬ 
lent lipid modihcations. 

Fatty acids can be directly attached to the pep¬ 
tide backbone and can also be easily incorporated 
into lipid bilayers. The two most common exam¬ 
ples are palmitoylation and myristoylation. Modi¬ 
fication with fatty acids increases the overall 
hydrophobicity of the protein and mediates mem¬ 
brane attachment. S-palmitoylation involves that 
attachment of fatty acids, most commonly palmitic 
acid, to cysteine residues via an ester bond. Due to 
the nature of this bond, the attachment is revers¬ 
ible and the precise palmitoylation state of the 
protein may be key in modulating activity. The 
reverse reaction is catalyzed by palmitoyl protein 
thioesterases. Myristoylation is the irreversible co- 
translational attachment of the myristate-derived 
myristoyl group via an amide bond to the alpha- 
amino group of the N-terminal amino acid of the 
peptide. The most common site for such attach¬ 
ment is glycine, although attachment at other 
amino acids occurs. The process is catalyzed 
by N-myristoyltransferase (NMT). Prenylation 
involves the attachment of farnesyl or geranyl- 
geranyl to the C-terminus of a protein via 
a conserved CaaX motif, where “C” is the cysteine 
residue that is prenylated and “a” is an aliphatic 
amino acid. The identity of residue “X” determines 
the nature of prenylation by determining which 
enzyme acts upon the protein substrate. 
Prenylation is catalyzed by farnesyl and geranyl- 
geranyl transferases, CaaX protease, and methyl 
transferase. 

Proteins can also be attached to membranes via 
a GPI anchor (glycophosphatidylinositol). This is 
attached posttranslationally to the C-terminus of 
the protein. Firstly, GPTanchored proteins contain 
a signal sequence which directs the protein to the 
endoplasmic reticulum (ER). The C-terminus is 
hydrolyzed and amino acids remain inserted in 
the ER membrane. The hydrolyzed end is cleaved 
and replaced by the GPI anchor. The protein is then 
targeted to vesicles, then golgi, and finally to the 
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extracellular membrane, where it remains attached. 
The GPI anchor can be cleaved by phospholipases 
for controlled release. 

Alpha-Helical Membrane Proteins 

The vast majority of transmembrane proteins are 
contained in this group. Alpha-helical proteins can he 
further classified according to orientation and number of 
membrane spans. Type I proteins are single-pass with an 
extracellular or luminal N-terminal domain. Type II pro¬ 
teins are single-pass but with the N-terminus located on 
the intracellular or cytoplasmic side. Type III proteins or 
multi-pass proteins contain more than one transmem- 
hrane domain and can be in either orientation. Type IV 
membrane proteins consist of complexes of multiple 
domains from separate polypeptide chains. The mem¬ 
brane-spanning helix forms due to energetically favorable 
hydrogen-bond formation within the protein backbone 
and can be either right or left handed. A typical helix 
contains 3.6 residues per turn, although helices with 3.10 
residues are also found, and is around 30 A in length. The 
majority of residues within a transmembrane helix are 
hydrophobic, and membrane-spanning regions can be 
identified using computational algorithms/hydrophobic- 
ity plots to identify regions of hydrophobic sequence. 
Examples of alpha-helical transmembrane proteins 
include G protein-coupled receptor, ion channels, 
translocons, and light-harvesting complexes. 

Beta-Barrel Membrane Proteins 

A transmembrane protein which spans the bilayer as 
a beta-barrel structure comprised of antiparallel 
beta-strands forming a rigid and stable barrel shape 
which usually forms a channel for substrates. Known 
structures of beta-barrels contain between 8 and 22 
strands, allowing flexibility in the size of the barrel 
channel. In general, beta-barrels can be identified by 
their alternating pattern of lipid-exposed hydrophobic 
residues and interior hydrophilic residues with both 
the N- and C-termini located on the intracellular side. 
The tilt or inclination of the barrel in the membrane 
can be defined by the number and stagger of the 
strands. Both oligomeric and monomeric beta-barrels 
are found in biological membranes. Beta-barrel 


membrane proteins commonly form pores known as 
porins in the outer membrane of Gram-negative bac¬ 
teria, chloroplasts, and mitochondria where the vari¬ 
ation in barrel structure enables selectivity of a wide 
range of substrates. They are also found in microbial 
toxins. 

Membrane Protein Insertion and Folding 

The process by which membrane proteins insert into the 
bilayer and assemble to form stable structures has been 
studied in greater depth for alpha-helices than beta- 
barrels. In both cases membrane insertion can occur 
spontaneously; however, alpha-helices usually require 
the assistance of a translocon and an input of energy. In 
order to conceptualize the membrane protein folding 
process, a number of models have been proposed 
which consider isolated alpha-helical segments. The 
two-stage model of membrane protein folding was pro¬ 
posed by Popot and Engelman in 1990 (Popot and 
Engelman 1990) and describes the establishment of 
stable individual helical domains across the bilayer as 
the first step followed subsequently by lateral associa¬ 
tion of the helices. This model was later refined to 
include a third step describing the insertion of cofactors 
and formation of quaternary complexes (Engelman 
et al. 2003). In 1999 a further model was proposed by 
Wimley and White which added additional stages to 
describe the formation of helical domains which 
involves partitioning of the helical segments into the 
bilayer interfacial region proceeded by helix formation 
and then bilayer insertion (White and Wimley 1999). 
This four-stage process of partitioning, folding, inser¬ 
tion, and association was described using thermody¬ 
namic principles. The development of hydrophobicity 
scales, to describe the energetic cost of inserting each 
amino acid into the bilayer, has greatly improved our 
understanding of this aspect of membrane protein fold¬ 
ing. The lateral association of helices within the bilayer 
has also been extensively explored leading to the dis¬ 
covery of specific residues and motifs that promote 
helix association such as GxxxG motif, where two 
small glycine residues are separated by three amino 
acids, thus placing the two glycines on the same face 
of the helix where their small size facilitates an interac¬ 
tion. This motif was first characterized in glycophorin 
A which is the model for association of transmembrane 
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domain (Engelman and Russ 2000). Other determinants 
of interaction include motifs of small polar residues 
such as serine and threonine, as well as leucine zippers 
similar to those found in soluble proteins. 

Cross-References 

► Ion Channels 

► Lipids 

► Membrane Protein Function 

► Protein Folding 

► Rhodopsin - Stability and Characterization of 
Unfolded Structures 

► Rhodopsin: Stability and Structural Organization in 
Membranes 
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Definition 

At least one member of each transporter family listed 
below with an asterisk* has a 3D stracture similar to 
that of at least one protein from another family. Further¬ 
more, according to Saier et al. (2009), each of the addi¬ 
tional families has members with sufficiently similar 
aligned sequences of amino acids between different fam¬ 
ilies for them all to have a common evolutionary ancestor. 
Each family is therefore thought to be part of a transporter 
superfamily variously known as the amino acid-poly- 
amine-organocation (APC) superfamily (Chang et al. 
2004; Saier et al. 2009), the LeuT superfamily 
(Krishnamurthy et al. 2009), or the 5-helix inverted repeat 
(SHIR) transporter superfamily (Adelman et al. 201 1). 

Introduction 

The general transport reactions of this superfamily 
(Fig. 1) can be represented as follows: 

Substrate(out) + cation* (out) ^ Nucleobase or Vitamin(in) 

+ cation* (in) OR Substrate 1 (out) [ication* (out)] 

+ substrate2(in)[±cation*(in)] ^ substrate 1 (in)[ication*(in)] 
+ substrate2(out)]±cation*(out)] 
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Acid Osmo- 

resistance regulation 


Membrane Transport Proteins: The Five-Helix Inverted 
Repeat Superfamily, Fig. 1 Scheme for the energized trans¬ 
port of substrates of the SHIRT superfamily proteins. The large 
oval represents the cytoplasmic membrane of a microorganism. 
In bacteria, a transmembrane electrochemical gradient of pro¬ 
tons is generated by respiration or ATP hydrolysis, as shown on 
the left and transmemhrane gradient of sodium ions by an H"^/ 
Na"^ antiport (right). In higher organisms, a transmemhrane 


gradient of protons would be generated by an FF^-ATPase and/ 
or a transmembrane gradient of sodium ions by Na'^,K’^-ATPase. 
The gradients may he used to drive cation-nutrient symport, and 
substrate/substrate antiport secondary active transport systems 
shown around the circumference, with NCS-1 family members 
and their substrates along the top and other 5H1RT superfamily 
members along the bottom. Each is a single protein indicated by 
a circle 


The first protein of this superfamily to have its 
structure determined, LeuT, unexpectedly turned 
out to be similar in structure to proteins from 
the following families indicated by an asterisk*, 
originally classihed as unrelated according to the 
dissimilarities of their amino acid sequences (Saier 
et al. 2006, 2009): 

*2.A.3 - The Amino Acid-Polyamine-Organocation 
(APC) Family 

*2.A.15 - The Betaine/Camitine/Choline Transporter 
(BCCT) Family 

2.A.18 - The Amino Acid/Auxin Permease (AAAP) 
Family 

*2.A.21 - The Solute: Sodium Symporter (SSS) Family 
*2.A.22 - The Neurotransmitter: Sodium Symporter 
(NSS) Family 

2.A.25 - The Alanine or Glycine:Cation Symporter 
(AGCS) Family 

2.A.30 - The Cation:Chloride Cotransporter (CCC) 
Family 


*2.A.39 - The Nucleobase: Cation Symporter-1 
(NCSl) Family 

2.A.40 - The Nucleobase:Cation Symporter-2 (NCS2) 
Family 

2.A.42 - The Hydroxy/Aromatic Amino Acid Perme¬ 
ase (HAAAP) Family 

2.A.53 - The Sulfate Permease (SulP) Family 


Similarity of Crystal Structures of the 
Transport Proteins 

Each protein within each of the above families is 
related to the others in the same family by statisti¬ 
cally robust similarities of their amino acid 
sequences. At least one member of the APC, BCCT, 
SSS, NSS, and NCS-1 families has a 3D structure 
similar to that of at least one protein from another 
family (Fig. 2). Taken together with sequence simi¬ 
larities between at least one pair of proteins from 
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BetP 


vSGLT 


Membrane Transport Proteins: The Five-Helix Inverted 
Repeat Superfamily, Fig. 2 Comparisons of the protein fold 
of members of the LeuT superfamily viewed parallel to the 
membrane. Structure of Mhpl (PDB accession code: 2JLN. 
pdb). The transmembrane helices (TMs) 11 and 12 are colored 
in gray. Structure of LeuTAa (2A65.pdb). TMs 11 and 12 are 

different families (Saier et al. 2009; Ren et al. 2007; 
Ren and Paulsen 2010) and with similarities in 
hydropathic profiles of proteins otherwise thought 
not to be related (Lolkema and Slotbloom 2005), all 
these families are conceived to be part of 
a transporter superfamily. 

The crystal structure of the LeuT membrane trans¬ 
port protein from the neurotransmitter-sodium- 
symport (NSS) family was solved (Yamashita et al. 
2005). It is composed of 12 transmembrane helices 
(TMs) (Fig. 2). The hrst ten TMs are arranged in an 
inverted repeat arrangement of helices 1-5, and 6-10. 
A similar arrangement of (not necessarily contiguous) 
helices was then observed in a number of other struc¬ 
tures that span numerous secondary transporters from 
diverse families (above), which, owing to the dissim¬ 
ilarity of their sequences, were not expected to have 
a similar fold. In addition to LeuT these include: the 
sodium-galactose symporter vSGLT (Faham et al. 
2008) from the sodium-solute symporter (SSS) family; 
Mhpl from the nucleobase cation symport-1 (NCSl) 
family (Weyand et al. 2008, 2010, 2011); the sodium- 
betaine symporter BetP (Ressl et al. 2009) from the 
betaine/choline/carnitine transporter (BCCT) family; 
CaiT (Tang et al. 2010; Schulze et al. 2010), 


colored in gray. Structure of vSGLT (3DH4.pdb). TMs 1, 12 
and 13 are colored in gray. Structure of the betaine or proline 
transporter BetP (2WIT.pdb). CaiT. Structure of the arginine/ 
agmatine transporter AdiC (3HQK.pdb). Structure of an amino 
acid or polyamine transporter Ape (3GIA.pdb). The coloring 
scheme of the rest of the ribbon is as Fig. 1 

an antiporter from the same family; and two members 
of the amino acid-polyamine-organocation (APC) 
family AdiC (Fang et al. 2009; Gao et al. 2009; 
Kowalczyk et al. 2011) and ApcT (Shaffer et al. 
2009). This common fold has been described as 
a 5-helix inverted repeat in which the first five TMs 
are topologically equivalent to the second five and 
are related to one another by a pseudo twofold axis 
that runs through the center of the membrane 
(Krishnamurthy et al. 2009; Abramson and Wright 
2009; Boudker and Verdon 2010; Weyand et al. 
2011) (Fig. 2). The two halves of the protein are 
completely intertwined with both contributing to the 
ligand-binding site, which is situated buried in the 
membrane approximately at the centers of the respec¬ 
tive proteins. 

Since the inverted repeat occurs within each of the 
known protein structures and is apparently expected 
for all (Lolkema and Slotbloom 2005; Abramson and 
Wright 2009; Krishnamurthy et al. 2009; Boudker 
and Verdon 2010), the question arises as to whether 
there are features of the molecular mechanism of 
transport common to all of these proteins. The 5HIR 
transporter superfamily contains proteins with very 
different substrates that can be either symporters or 
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antiporters (Fig. 1). For the sodium-coupled 
symporters, it seems likely that the switching between 
the outward- and inward-facing structures is based on 
a similar principle. The hrst protein for which the 
structure was determined in three conformational 
states was the sodium-benzylhydantoin transporter, 
Mhpl, from Microhacterium liquefaciens', a mecha¬ 
nism was derived for switching from the outward¬ 
facing open conformation through an occluded struc¬ 
ture to the inward-facing open state (Weyand et al. 
2008, 2011; Shimamura et al. 2010). In this a bundle 
of the four helices 3, 4, 8, 9 known as the hash motif 
moves as a relatively rigid body with respect to 
a bundle of helices 1, 2, 6, 7 to provide the major 
element, the thick gate, of the switch from open-out 
to open-in. Additionally a reciprocating movement of 
helices 10 and 5 closes an external thin gate and opens 
an internal thin gate (cf Adelman et al. 2011; see 
► Mhpl, the Na'^-Hydantoin Membrane Transport 
Protein). A similar mechanism of the switching 
between outward- and inward-facing conformations 
caused by a rigid-body movement of the four-helix 
bundle relative to the rest of the protein was, in fact, 
first proposed for LeuT based on the asymmetry of the 
crystal structure and investigated more thoroughly 
using a mutational analysis of the serotonin transporter 
(Forrest and Rudnick 2009). 

Can the fundamentals of the mechanism proposed 
for Mhpl be extended to all the members of the SHIRT 
(5-helix inverted repeat transporter) superfamily? This 
question is to some extent addressed by the recent 
structures of the LeuT transporter, which show out¬ 
ward-, occluded, and inward-facing conformations of 
this protein (Krishnamurthy and Gouaux 2012). Like 
the mechanism above for Mhpl, the major change 
between outward- and inward-facing conformations is 
a rotation of the bundle relative to the hash motif. 
However, the details of the conformational changes 
are rather different between the two proteins. In LeuT, 
there is no independent movement of TM5 or TMIO and 
instead the bundle helices are more mobile with TMs 1 
and 6 flexing upon ligand binding and TMl bending 
considerably as the protein changes from outward to 
inward facing. Similarly, by comparing the outward¬ 
facing AdiC with the inward-facing ApcT from the 
APC family or the occluded BetP with the inward¬ 
facing CaiT from the BCCT family, the trend of the 
hash motif moving relative to the bundle can still be 
observed but the mechanism of occlusion is different. 


For example, in AdiC, which has been solved in the 
outward-facing and occluded states, TMIO, TM2, and 
TM6 move upon binding the ligand. Thus, the mecha¬ 
nism for Mhpl (Shimamura et al. 2010; Weyand et al. 
2011 ) is to some extent observed in other members of 
the superfamily but the details vary, presumably 
reflecting the different substrates as well as the archi¬ 
tecture of the noncore helices. Parallel studies of these 
proteins involving higher-resolution structures and 
mutational analysis combined with kinetic measure¬ 
ments are required if we are really to understand the 
general principles of the mechanism in this superfamily. 

Summary 

Eleven families of membrane transport proteins, pre¬ 
viously classifled as separate because of the apparent 
dissimilarities in their amino acids sequences, now 
comprise one Superfamily because of similarities in 
their protein folds, hydropathic profiles, and residual 
sequence similarities. 

Cross-References 

► Membrane Protein Structure 

► Membrane Transport, Energetics and Overview 

► Mhpl, the Na'^-Hydantoin Membrane Transport 
Protein 

► Nucleobase-Cation-Symport-2 Family and the 
Uracil: Proton Symporter, UraA 
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Synonyms 

Purine-Related Transporter (PRT) 

Definition 

The NCSl family designated A.2.39 (Saier et al. 2006, 
2009) consists of over 1,000 currently sequenced pro¬ 
teins derived from Gram-negative and Gram-positive 
bacteria, archaea, yeast, fungi, and plants (Fig. 1; Ren 
et al. 2007; Ren and Paulsen 2007,2010). Proteins of the 
NCSl family are 419-635 aminoacyl residues long and 
putatively possess 12 transmembrane a-helices (TMHs). 
The NCS-1 transporters are essential components of 
salvage pathways for nucleobases, vitamins, and related 
metabolites, and their generalized transport reaction is: 

Nucleobase or vitamin(out) + cation+(out) ^ 
Nucleobase or vitamin(in) -F cation+(in) 

Introduction 

In fungi, members of the NCS-1 family have variously 
been identified as transporters of nucleobases. 
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Membrane Transport Proteins: The Nucleobase-Cation- 
Symport-1 Family, Fig. 1 Unrooted phylogenetic tree illus¬ 
trating relationships between selected NCS-1 family members 
from eubacteria (black), archaea (red), green plants (green), 
and fungi (blue). Transporters that have been functionally char¬ 
acterized are indicated by the names used in the literature, 
followed by species designation (Sc Saccharomyces cerevisiae. 


Sp Schizosaccharomyces pombe, Ca Candida albicans. An 
Aspergillus nidulans, Ec Escherichia coli. Ml Microbacterium 
liquefaciens) and are boxed. Other putative transporters are 
indicated by their UniProt accession numbers. The alignment 
was made using Clustal X and then manually adjusted 
using BioEdit, and the phylogenetic tree was created using the 
neighbor-joining method 


nucleosides, coenzymes, and vitamins (see 
Pantazopoulou and Diallinas 2007; Diallinas 2009; 
Weyand et al. 2011 for reviews and detailed references). 
Phylogenetically, they fall into two distinct subfamilies 
(Fig. 1). 

One subfamily is typified by the well-characterized 
transporter Fcy2p of Saccharomyces cerevisiae, which 
transports the purine nucleobases adenine, guanine, and 


hypoxanthine, together with the pyrimidine nucleobase 
cytosine. Its close homologues in S. cerevisiae, Fcy21p 
and Fcy22p, are likely to have similar permeant selec- 
tivities (Paluszynski et al. 2006), and functionally sim¬ 
ilar transporters have also been identified m Aspergillus 
nidulans (FcyB) as well as in Candida albicans 
(Fcy21p). In contrast to these nucleobase transporters, 
subfamily member Tpnlp of S. cerevisiae instead 
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Allantoin Hydantoins Cytosine Uracil Thiamine 



Membrane Transport Proteins: The Nucleobase-Cation- 
Symport-1 Family, Fig. 2 Scheme for the energized transport 
of substrates of the NCS-1 supeifamily proteins. The large oval 
represents the cytoplasmic membrane of a microorganism. In 
bacteria, a transmembrane electrochemical gradient of protons is 
generated by respiration or ATP hydrolysis, as shown on the left, 
and a subsequent sodium gradient by a H’*^/Na’^ antiport shown 
on the right. In higher organisms, a transmembrane gradient of 


protons would be generated by an TF^-ATPase and/or a gradient 
of sodium ions by a Na'^,K'^-ATPase. The gradient may be used 
to drive proton/sodium antiport, proton-nutrient symport, and 
sodium-nutrient symport secondary active transport systems 
shown around the circumference, with NCS-1 family members 
and their substrates shown along the top. Each is a single protein 
indicated by a circle, and the Mhpl protein of known structure is 
indicated in red 


transports vitamin Bg (pyridoxine, and probably also 
pyridoxal and pyridoxamine). Each of the yeast trans¬ 
porters also represents a route of uptake for the clini¬ 
cally used antifungal agent 5-fluorocytosine (5FC; 
Ancotil®), while Fcy2p, Fcy21p, and Fcy22p also trans¬ 
port the toxic adenine analogue 8-azaadenine 
(Paluszynski et al. 2006). The importance of Fcy2p for 
this process is highlighted by the fact that nonsense 
mutations in the homologous transporter FCY2 have 
been shown to be responsible for 5FC resistance in 
clinical isolates of Candida lusitaniae. 

Members of the second subfamily of functionally 
characterized NCS-1 transporters from fungi have 
been identified as transporters of an even more structur¬ 
ally diverse family of permeants than the first, ranging 
in size from uracil to thiamine. The best characterized 
member of the family, Fur4p from S. cerevisiae 
(Yoo et al. 1992), is a uracil transporter. Functionally 
similar homologues have also been described from 


Schizosaccharomyces pomhe (Fur4p) and from 
Aspergillus nidulans (FurD). In contrast, the 
S. cerevisiae protein Dal4p (Yoo et al. 1992), which is 
68 % identical in sequence to Fur4p, transports allantoin, 
as does the FurA protein from Aspergillus nidulans. 
Interestingly, phylogenetic analyses have suggested 
that within this subfamily of fungal transporters conver¬ 
gent evolution toward uracil and allantoin transport 
activity has occurred multiple times independently. 
Also part of the fungal transporter subfamily related to 
Fur4p is the nucleoside transporter Fuilp of 
S. cerevisiae, which is selective for uridine rather than 
uracil. More distantly related members of the subfamily 
are the S. cerevisiae transporters, Thi7p and Nrtlp, 
which primarily transport thiamine and nicotinamide 
riboside respectively, although thiamine is also a 
low-affinity permeant of Nrtlp. Nicotinamide riboside 
is a precursor of NAD"^ and plays a key role in calorie 
restriction-mediated life span extension in yeast. 
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In the bacteria, the best characterized NCS-1 family 
member is Mhpl from Microhacterium liquefaciens, 
the structure of which has been determined (Weyand 
et al. 2008, 2010, 2011; Shimamura et al. 2008, 2010). 
Mhp 1 is a hydantoin transporter, with a preference for 
permeants with a hydrophobic substituent at position 5 
of the hydantoin such as 5-indolyl methyl hydantoin 
(IMH) and 5-benzyl hydantoin (BH), and often 
a preference for the L-isomer over the o-isomer (Suzuki 
and Henderson 2006); neither allantoin nor cytosine 
(Fig. 2) is a substrate for Mhpl (Suzuki and Henderson 
2006). A similar transporter for hydantoins occurs in 
Arthrobacter aurescens. Other characterized bacterial 
NCS-1 family members are the transporters for allan¬ 
toin in Bacillus suhtilis (designated Pud, Schultz et al. 
2001), for cytosine in Escherichia coli (designated 
CodB, Danielsen et al. 1992), and putatively for 
hydroxyl-methylpyrimidine in Pseudomonas putida 
(CytX, Rodionov et al. 2002). 

Members of the NCS-1 family that transport 
hydantoins have importance in biotechnology for con¬ 
version of L- or D-hydantoin derivatives to 
corresponding compounds, of signihcant added value 
for biosynthesis of amino acids or drugs (Suzuki et al. 
2005; Javier Las Heras-Vazquez et al. 2009). 

The secondary active transport of the permeants by 
the NCS-1 family is driven by the transmembrane cation 
gradient of Na"^ or H"^ (Fig. 2) with a stoichiometry that 
probably varies with individual proteins, depending 
upon the presence or absence of positive or negative 
charges in the organic substrate (Pantazopoulou and 
Diallinas 2007; Weyand et al. 201 1). 

Summary 

The Nucleobase-Cation-Symport-1 (NCS-1) family of 
membrane transport proteins consists of over 1,000 
currently sequenced proteins derived from Gram¬ 
negative and Gram-positive bacteria, archaea, yeast, 
fungi, and plants categorized as Family 2.A.39 in the 
Transport Classihcation Database. Known substrates 
include nucleobases, nucleosides, allantoin, deriva¬ 
tives of hydantoin and vitamins. Generally, they are 
electrochemical potential-driven transporters where 
translocation of substrate is coupled to the prevailing 
gradient of Na"^ or H"^. The Microbacterium hydantoin 
permease, “Mhpl,” is the only member of the NCS-1 
family for which the structure has been determined. 


Cross-References 

► Membrane Transport, Energetics and Overview 

► Mhpl, the Na'^-Hydantoin Membrane Transport 
Protein 

► Membrane Transport Proteins: The Five-Helix 
Inverted Repeat Superfamily 
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Synonyms 

2.A.17; Peptide Transporter (PTR); PF00854; PTR2; 
sue 15 


Definition 

The Proton-dependent Oligopeptide Transporter 
(POT) family of membrane transporters, designated 


2.A.17 in the Transporter Classification Database 
(TCDB; Saier et al. 2009), PTR2 (PF00854) in the 
Pfam protein family database (Punta et al. 2012) and 
Solute Carrier (SLC) family SLC15 in humans, are 
distributed almost ubiquitously in living organisms: 
release 26.0 of the Pfam database contains more than 
5,000 POT sequences. Functionally characterized 
POTs variously transport a wide range of substrates, 
including oligopeptides, amino acids, and nitrates, 
across cell membranes using energy provided by trans¬ 
membrane proton gradients. 

Introduction 

POTs constitute a subfamily of the Major Facilitator 
Superfamily (MFS) of membrane transporters, 
although they are only distantly related to the other 
subfamilies. The transporters are predicted to contain 
either 12 or 14 transmembrane (TM) helices, with 
cytoplasmic N- and C-termini (Daniel et al. 2006), 
a topology that has been experimentally confirmed by 
the recent determination of the crystal structure of the 
transporter PepTso from the bacterium Shewanella 
oneidensis (Newstead et al. 2011). Mammalian POTs 
possess an additional, large extracellular hydrophilic 
domain, of unknown function, between TMs 9 and 10 
(Fig. 1). POT family members are widely distributed in 
metazoa, plants, fungi, and eubacteria (Daniel et al. 
2006), but to date only a single example (UniProt 
accession FOTBIl) has been identified in archaea. 

The transporters function in cell nutrition, typically 
importing a broad range of dipeptides and tripeptides 
(Fig. 2), but not larger peptides or amino acids. Glycyl- 
sarcosine (glycyl-A-methylglycine; Fig. 2) is typically 
used as a non-hydrolyzable dipeptide analogue for 
their experimental characterization. However, in 
humans, two family members, SLC 15 A3 and 
SLC15A4, also transport histidine (Daniel and Kottra 
2004). In plants, some family members function as 
transporters and/or sensors of nitrate, while others 
transport nitrite, histidine, dicarboxylates, or, at least 
in one case, the auxin indole-3-acetic acid (Fig. 2) 
(Hammes et al. 2010; Dechorgnat et al. 2011). Like 
many other transporters in eubacteria, fungi, and 
plants, transport is energized by the transmembrane 
proton gradient, established by the respiratory chain 
or by proton-pumping ATPases (Harder et al. 2008; 
Hammes et al. 2010). In metazoans, homologues of 
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Membrane Transport Proteins: The Proton-Depen¬ 
dent Oligopeptide Transporter Family, Fig. 1 Membrane 
topology of POT family proteins. The transmembrane helices 
labeled “A” and “B” are present only in a subset of transporters 
from bacteria, and the large extracellular domain linking trans¬ 
membrane helices 9 and 10 (in green) is found only in mamma¬ 
lian peptide transporters 

bacterial transporters are typically energized by trans¬ 
membrane sodium ion gradients. However, in mam¬ 
malian POTs, transport of peptides is linked to co¬ 
transport of protons, as in lower organisms (Daniel 
et al. 2006). The electrochemical gradients of protons 
required for such uptake arise from the action of 
sodium/proton exchangers which create a localized 
acidic microenvironment at the apical brush border 
membrane of cells in the small intestine and kidney 
proximal tubule (Fig. 3) (Daniel et al. 2006). 

Biological Roles and Medical Significance in 
Humans 

In eubacteria and fungi, POTs allow the exploitation of 
peptides as carhon and nitrogen sources, while in 
plants members of the subfamily designated NTRl 
(Nitrate Transporter 1 family) play roles not only in 
nitrogen uptake from the soil in the form of nitrate, but 
also in its distribution within the plant via loading to or 
unloading from xylem vessels (Dechorgnat et al. 
2011). The two best characterized POTs in humans, 
PepTl (SLC15A1) and PepT2 (SLC15A2), are located 
most abundantly in the apical, brush border mem¬ 
branes of cells in the jejunum and kidney proximal 
tubule respectively, and are involved in uptake of 
dietary peptides and in reabsorption of peptides from 
the glomerular filtrate (Daniel and Kottra 2004). 
PepTl is also found in kidney proximal tubules and 
in epithelial cells of the extrahepatic biliary duct, while 


PepT2 is found in the choroid plexus. Following cellu¬ 
lar uptake, peptides are largely broken down by pepti¬ 
dases into amino acids that can then exit the cell via 
basolateral amino acid transporters. Both PepTl and 
PepT2 can transport many different, although not all, 
natural dipeptides and tripeptides. However, they dif¬ 
fer in their affinities for these substrates, PepTl having 
approximately tenfold higher affinity for many, but not 
all, of its substrates than PepT2 (Brandsch 2009). Per¬ 
haps surprisingly, mice bearing targeted disruptions in 
either the PepTl or PepT2 genes show no obvious 
phenotypes on a normal diet, although investigation 
of PepTl-deficient animals has revealed an apparent 
role for PepTl in food intake regulation in mice fed 
a high-protein diet (Nassl et al. 2011). Similarly, 
PepT2-deficient mice exhibit changes in drug disposi¬ 
tion and toxicity (Kamal et al. 2008). No genetic dis¬ 
eases involving POTs are known, although a number of 
single nucleotide polymorphisms have been reported, 
including one in PepT2 (R57H) that yields an inactive 
protein (Daniel et al. 2006). Abnormal expression of 
PepTl in the colon of patients with inflammatory 
bowel disease has been associated with aggravation 
of inflammation via import of bacterial di/tripeptides 
into the colonic cells (Ingersoll et al. 2012). 

Although PepTl is not directly involved in genetic 
disease, it plays key roles in disease treatment because 
it represents a route of uptake for several orally admin¬ 
istered therapeutic drugs. Similarly, through its activ¬ 
ity in renal reabsorption, PepT2 influences the 
pharmacokinetics of such drugs. The latter include 
beta-lactam antibiotics such as the aminocepha- 
losporin cephadroxil and the heme precursor 
5-aminolevulinic acid, which is used as a prodrug in 
photodynamic therapy (Brandsch 2009) (Fig. 2). The 
ability of these compounds to act as substrates proba¬ 
bly reflects their steric resemblance to the peptide 
backbone and stems from the fact that the substrate 
specificity of PepTl/2 is very broad, although in the 
case of peptide substrates L-amino acids are preferred. 
These transporters can even transport omega amino 
fatty acids, provided that the charged termini are sep¬ 
arated by at least four methylene groups (Daniel and 
Kottra 2004). In the case of peptides, extensive inves¬ 
tigations of substrate specificity have revealed the 
importance of a free N-terminal a-amino group, an 
acidic C-terminal group (i.e., carboxylate), and the 
carbonyl group of the N-terminal peptide bond for 
high-affinity transport (Brandsch et al. 2008). 
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Membrane Transport 
Proteins: The Proton- 
Dependent Oligopeptide 
Transporter Family, 

Fig. 2 Substrates of POT 
family proteins, (a) Peptides 
and peptide analogues; (b) 
natural non-peptide substrates 
transported by some 
mammalian and plant POT 
family proteins; (c) 
therapeutic drugs transported 
by human peptide transporters 
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Dipeptides of hydrophobic/aromatic amino acids typi¬ 
cally have higher apparent affinities than more hydro¬ 
philic peptides. Many attempts have been made to 
exploit the broad substrate specificity of the peptide 
transporters to increase the oral bioavailability of drugs 
by synthesis of amino acid and dipeptide ester prodrugs 
(Brandsch 2009). The most successful of such 
prodrugs to date has been valacyclovir (Fig. 2), the l- 
valyl ester of the antiviral drug acyclovir; valacyclovir 
is currently used for oral treatment of several types of 
viral infection. 

Molecular Mechanism 

An Alternating Conformation Model for Transport 

Uptake of peptides by POTs is driven by the electro¬ 
chemical gradient of protons across the membrane and 
is electrogenic. In most cases, the translocation of 
peptides is tightly coupled to that of protons, although 


the neuronal transporter OPT-3 from the nematode 
Caenorhabditis elegans appears to function predomi¬ 
nantly as a proton channel (Daniel et al. 2006). Exten¬ 
sive electrophysiological investigations of human 
PepTl and PepT2 expressed in Xenopus oocytes have 
demonstrated that the stoichiometry of neutral peptide 
to proton transport is 1:1, although a 2:1 stoichiometry 
has been reported for rat PepT2 (Daniel and Kottra 
2004; Sala-Rabanal et al. 2006, 2008). Such studies 
have led to the proposal of seven-state kinetic models 
for the transport process, in which for cellular uptake 
there is ordered binding of the proton followed by the 
peptide substrate and in which both proton binding 
from the extracellular medium and relocation of the 
unloaded substrate-binding site from the cytoplasmic 
to extracellular side of the membrane are voltage- 
dependent (Sala-Rabanal et al. 2006, 2008). The kinet¬ 
ics are consistent with an alternating conformation 
model for transport, in which the substrate-binding 
site is alternately accessible to the extracellular and 
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Membrane Transport Proteins: The Proton-Depen¬ 
dent Oligopeptide Transporter Family, Fig. 3 Role of 
POT family members in peptide uptake in the intestine and 
kidney. Uptake of dipeptides (A 1 -A 2 ) and tripeptides (A 1 -A 2 - 
A 3 ) from the lumen of the mammalian small intestine or renal 
proximal tubule is brought about primarily by PepT 1 and PepT2 
respectively (PEPT), located at the brush border membrane of 
polarized epithelial cells. The proton gradient required for uptake 
is generated by proton-sodium antiporters (NHE3) also located at 
the brush border, which exploit the transmembrane sodium ion 
gradient established by sodium-potassium ATPases located in the 
basolateral membrane 


cytoplasmic side of the membrane, or is occluded from 
both (Fig. 4). Such a model for transport by members 
of the MFS is supported by the results of extensive 
biochemical investigations of proteins such as the lac¬ 
tose transporter LacY from Escherichia coli 
(Smirnova et al. 2011) and by elucidation of the struc¬ 
tures of E. coli MFS proteins LacY, EmrD and FucP in 
inward-facing, occluded and outward-facing states 
respectively (Smirnova et al. 2011; Dang et al. 2010). 

Structure of a Bacterial Peptide Transporter 

Further insights into the mechanism of transport have 
recently been provided by the crystal structure of the 
oligopeptide transporter PepTso from S. oneidensis 


(Newstead et al. 2011). Functionally, this transporter 
resembles human PepTl and PepT2 in the proton- 
dependence of transport, in its apparent affinity for 
the substrate glycyl-sarcosine and in its selectivity for 
di- and tripeptides (Newstead et al. 2011). Moreover 
the protein is more closely related to its human coun¬ 
terparts than most other bacterial peptide transporters, 
exhibiting ^30% sequence identity to PepTl and 
PepT2, and in particular shares most of the conserved 
sequence motifs of the latter. These include a histidine 
residue (F161) in TM2 and an aspartate residue (D316) 
in TM7, which are almost absolutely conserved in 
metazoan PepTl/2 homologues but not in most other 
bacterial, fungal, or plant POT family members. The 
structure represents an inward-facing but largely 
occluded state, with a proposed peptide-binding cavity 
located at the interface between two 6-helix bundles 
derived from the N- and C-terminal halves of the 
protein: as in other MFS proteins, the latter are similar 
in structure and are predicted to have arisen from an 
intragene duplication event (Fig. 5a, b). Structural 
similarities between TMl-3 and TM4-6 in the N- 
terminal domains of MFS proteins, and between 
TM7-9 and TMlO-12 in the C-terminal domains, 
which represent inverted repeats, suggest that each 
domain similarly originated from intragene duplica¬ 
tion (Radestock and Forrest 2011). Unlike most other 
POT family members, in PepTso the sequence linking 
the two 6-helix bundles contains a transmembrane 
helical hairpin of unknown function, located at the 
periphery of the structure (Fig. 5a, TM “A” and “B”). 

In the partially occluded structure, access to the 
peptide-binding site from the periplasmic space is 
prevented by the close association of TMs 1, 2, 7, and 
8 (Fig. 5). A salt bridge formed between R32 in TMl 
and D316 in TM7, both conserved residues in meta¬ 
zoan POTs, likely contributes to this association. At 
the cytoplasmic side of the membrane, access to the 
central hydrophilic cavity is likewise occluded, 
although only in part, by the apposition of bulky side- 
chains in TMs 4 and 5 in the N-terminal domain with 
side-chains in TMs 11 and 10 in the C-terminal 
domain. A hydrophilic cavity at the approximate cen¬ 
ter of the membrane between the N- and C-terminal 
domains of PepTso has been proposed to be the pep- 
tide-binding site for two reasons: analogy with sub¬ 
strate-binding sites in other MFS transporters and the 
presence there of electron density in the crystal struc¬ 
ture representing bound ligand. The identity of the 
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Membrane Transport Proteins: The Proton-Depen¬ 
dent Oligopeptide Transporter Family, Fig. 4 The alternat¬ 
ing conformation model for peptide transport. The results of 
electrophysiological investigations (Sala-Rabanal et al. 2006, 
2008) and other approaches suggest that during the transport 
cycle conformational changes involving the N- and C-terminal 
6 -helix domains (“N” and “C” respectively) render a central 
peptide binding cavity alternately accessible from the extracel¬ 
lular and cytoplasmic sides of the membrane. Occluded states, in 


inward-facing 

which the cavity is inaccessible from either side, are also 
involved. Red arrows show those steps proposed to be influenced 
by the transmembrane voltage in mammalian peptide trans¬ 
porters. The cavity contains conserved basic (“-r”) and acidic 
(“—”) residues from both domains, likely to be involved in 
binding the peptide (blue oblong). Binding of protons (red cir¬ 
cle) may involve conserved histidine (His) and/or glutamate 
(Glu) residues located in the N-terminal domain 


latter remains unknown (Newstead et al. 2011). 
Surrounded by TMs 1, 2, 4, and 5 from the N-terminal 
domain of the transporter and TMs 7, 8, 10, and 11 
from the C-terminal domain (Fig. 5), its dimensions of 
^13 X 12 X 11 A would he sufficient to accommodate 
di- and tripeptides. A second hydrophilic cavity, 
extending from the periplasmic surface of the mem¬ 
brane toward the central cavity and lined by residues 
from TMs 2, 7, 11, and 12, may represent a vestige of 
the entry route for peptides present in the outward- 
open state of the transporter. 

The Peptide-Binding Site 

Supporting its identification as the substrate-hinding 
site, the central hydrophilic cavity in the PepTso struc¬ 
ture is lined by conserved residues known to be impor¬ 
tant in peptide translocation in mammalian PepTl and 
PepT2 transporters (Fig. 5c). On the N-terminal side, 
these include three basic residues, R25 and R32 in TMl 


and K127 in TM4. The positions corresponding to R25 
and K127 are almost absolutely conserved throughout 
the POT family, while that corresponding to R32 is only 
conserved in bacterial and metazoan peptide trans¬ 
porters. While the roles of R32 and K127 have not yet 
been investigated, R25 appears to play an essential role: 
In human, PepT2 mutation of the corresponding residue 
(R27) to histidine inactivates transport (Daniel et al. 
2006). On the C-terminal side the side-chain of TMIO 
glutamate residue E419 likewise extends into the cavity. 
An acidic residue is almost invariably found at the 
corresponding position throughout the POT family. Its 
functional importance has been demonstrated by the 
fact that mutation of human PepTl residue E595 to 
cysteine, lysine, or arginine inactivates transport, while 
mutation to aspartate does not affect activity (Xu et al. 
2009). Mutation of the corresponding residue (E388) in 
the bacterial peptide transporter YjdL to alanine simi¬ 
larly inactivates transport, while a mutant with aspartate 
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Membrane Transport 
Proteins: The Proton- 
Dependent Oligopeptide 
Transporter Family, 

Fig. 5 Structure of the 
peptide transporter PepT 
Cartoon representation of the 
structure, viewed (a) from the 
plane of the bilayer, showing 
the locations of the 
periplasmic and central 
hydrophilic cavities, and (b) 
from the cytoplasm. A space¬ 
filling model of the dipeptide 
alaninyl alanine has been 
included in the central cavity. 
Transmembrane helices are 
numbered as in Fig. 1 , and 
evolutionarily related helices 
from the N- and C-terminal 
halves of the protein are 
colored identically, except for 
helices A and B, which are 
both shown in teal. 

(c) Detailed view of the central 
cavity, showing conserved 
residues likely to be involved 
in substrate binding. A model 
of alanyl alanine, in a possible 
binding pose, has been 
included to provide scale 



at this position retains substantial activity (Jensen et al. 
2012). While the crystal structure of a POT family 
member with bound peptide ligand has not yet been 
obtained, it is likely that the conserved basic and acidic 
residues located on opposite sides of the cavity contrib¬ 
ute to the binding of the charged carboxylate and amino 
groups of the peptide respectively. Additional contribu¬ 
tions to binding are likely to be made by conserved 
uncharged residues on the cavity surface, including the 
tyrosine residues Y29 and Y68 and the hydrophobic 
residues 1157, W312, F315, and W446. The presence 
of the latter is consistent with the reported higher affin¬ 
ity of peptides with hydrophobic side-chains (Daniel 
and Kottra 2004). 

Potential Mechanism of Proton Translocation 

The structure of PepTso also reveals the identity of 
residues that might potentially be involved in proton 


translocation. One of these is TM2 residue H61, 
corresponding to a conserved position in metazoan 
POTs that has been suggested to represent a primary 
site of protonation in human PepTl and PepT2 
(Newstead et al. 2011). It lies at the base of the extra¬ 
cellular cavity, buried in the interface between TM2 
and TM7. Mutation of H61 in PepTso to cysteine 
inactivates the transporter, a finding consistent with 
the importance of this residue as a potential site of 
protonation in metazoan and some bacterial peptide 
transporters. However, the absence of a corresponding 
histidine residue in most other bacterial, plant, and 
fungal POT family members indicates that other resi¬ 
dues must also contribute. These may include TMl 
residues E21 and E24, which are highly conserved 
throughout the POT family and so likely to play key 
roles. Lying within salt-bridging distance of the pro¬ 
posed substrate-binding residue R25, disruption of salt 
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bridge(s) upon protonation of these acidic residue(s) 
might well play a role in the mechanism by which 
proton and substrate translocation are coupled. In sup¬ 
port of a role in proton translocation, mutation to 
glutamine of the residue (E20) corresponding to 
PepTso residue E24, in the E. coli peptide transporter 
Yjdl, inhibited transport by 85% and rendered the 
activity pH-independent (Jensen et al. 2012). 

Summary 

lYoton-linked oligopeptide transporters are widely dis¬ 
tributed and play roles not only in nutrition but also in the 
uptake of therapeutic drugs in humans. Transport is an 
electrogenic process, driven by the transmembrane pro¬ 
ton gradient. Insights into the mechanism of transport 
have been provided by extensive studies of substrate 
selectivity in mammalian POTs, and by the results of 
site-directed mutagenesis. These studies have recently 
been complemented by determination of the crystal 
structure of a bacterial family member, PepTso- 
A central hydrophilic peptide-binding cavity, at the inter¬ 
face between N- and C-terminal bundles of 6 transmem¬ 
brane helices, contains conserved basic and acidic 
residues likely to play key roles in substrate recognition. 
Conserved histidine and glutamate residues are similarly 
predicted from the structure to be involved in the proton 
translocation that is coupled to peptide movement across 
the membrane. Like other MFS transporters, the mecha¬ 
nism of translocation probably involves alternate access 
of the binding site to the extracellular and cytoplasmic 
sides of the membrane, via a conformational change. 

Cross-References 

► fT^-Lactose Membrane Transport Protein, LacY 

► Membrane Protein Eunction 

► Membrane Proteins: Structure and Organization 

► Membrane Protein Structure 

► Membrane Transport, Energetics and Overview 
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Membrane Transport: Energetics and 
Overview 

Peter J. F. Henderson 

Astbury Centre for Structural Molecular Biology, 
Institute of Membrane and Systems Biology, 
University of Leeds, Leeds, UK 

Overview 

Cells and subcellular organelles are surrounded by 
a lipid-protein membrane designed to retain their 
contents and separate biological processes from their 
environment to maintain homeostasis. The membranes 
are comprised of lipids and proteins. It is an inherent 
physical property of the lipids that they form 
a hydrophobic bilayer essentially impermeable to the 
vast majority of compounds, except for a few, small, 
electrically neutral molecules, of which oxygen and 
carbon dioxide are particularly important and enter/ 
leave by simple diffusion. Nevertheless, hydrophilic 
nutrients have to enter the cell and waste products 
have to leave. Generally, for each individual type of 
nutrient/waste molecule there is a membrane protein 
thalfaciiitates diffusion. The high degree of specificity 
that each membrane transport protein {transporter) 
usually exhibits for a substrate is reminiscent of the 
nature of enzymes. In free-living single-cell species, 
the rate of transport of essential metabolites that can be 
achieved by simple and/or facilitated diffusion down 
their electrochemical gradients is insufficient for com¬ 
petitive metabolism, and active transport mechanisms 
have evolved that divert part of the cell’s energy bud¬ 
get into capturing molecules, or driving them out, 
against their prevailing electrochemical gradients. 
Primary active transport systems can utilize energy 
directly from light, oxidation, or ATP hydrolysis 


reactions. Secondary active transport systems use 
a preexisting electrochemical gradient, usually of 
ions such as and/or Na"^. A few transporters effect 
a chemical change of the substrate during transport, 
which is known as group translocation. 

Proteins that facilitate transport must be distin¬ 
guished from those that create a pore, or channel, in 
the membrane, which permits very rapid (often near 
the diffusion limit) transfer of small molecules, 
sometimes with little specificity (e.g., bacterial and 
mitochondrial porins), and sometimes with exquisite 
specificity (e.g., or Na"^ channels). Transporters and 
channels may have a common evolutionary origin, but 
their biological functions are quite different, and 
channels will not be considered further here. 

Once the occurrence of transport proteins was real¬ 
ized and accepted in the 1950s, their characterization 
became a matter largely of cataloguing substrate spec¬ 
ificity and kinetics, relying heavily on the availability of 
radioisotope-labeled substrates (Stein 1986). Rarely 
could the protein itself be studied. There were several 
reasons for this. First, there was very little information 
about their amino acid sequences. Second, they were 
very often of low abundance in the membrane, making 
them difficult to identify, let alone purify. Third, their 
hydrophobic nature made them refractory to the usual 
methods of protein purification. The advent of DNA 
sequencing started to change this, first when individual 
genes were associated with particular proteins, so their 
complete amino acid sequence could be predicted; this 
revealed, as expected, a high proportion of hydrophobic 
residues as befit a membrane protein, but also that a high 
proportion of secondary structure, either alpha-helix or 
beta-sheet, was common, which could often be neatly 
arranged in sequential helices (12 was a common num¬ 
ber), internally duplicated motifs, or beta-barrels. 

Alignments of the amino acid sequences of 
identified transport proteins started to reveal their 
evolutionary relationships, leading to the Transporter 
Classification Database (TCDB) of Saier (2012) and 
colleagues as more and more sequences of entire 
genomes accrued (discussed below). Transporters 
could be grouped in families, such as the mitochon¬ 
drial transporters related by sequence and therefore, 
almost certainly, structure (Kunji 2009) 
(► Mitochondrial Transport Protein Family). Another 
example is the Aucleobase-Cation-Symport family 
(NCS-1, ►Membrane Transport Proteins: 
The Nucleobase-Cation-Symport-1 Family), which. 
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because of the unexpected similarities in structures of 
transporters from different families, became classified 
into a “Superfamily” (► Membrane Transport Proteins: 
The Five-Helix Inverted Repeat Superfamily). Simi¬ 
larly, there are contributions on the “ATP-5inding-Cas- 
sette” superfamily (► Substrate Capture by ABC 
Transporters), and “Sugars Mil itventually be Exported 
Transporters” superfamily (SWEET, ► SWEET Gluco- 
side Transporter Superfamily). 

Also out of recombinant DNA technology arose 
ingenious techniques for, not just amplifying expres¬ 
sion of individual proteins, but also adding short amino 
acid sequences - tags - that enormously facilitated 
purification. In parallel, an industry arose around the 
development of mild detergents that solubilized 
a protein from the membrane without denaturing it. 
The combination of amplified expression, scaled-up 
production, efficient solubilization, and purification 
led to a surge in numbers of proteins (especially from 
bacteria where starting material is cheap), in a ht 
state for crystallization trials to determine three- 
dimensional (3D) structure hy X-ray crystallography, 
or more rarely by electron microscopy. 

X-ray crystallography of membrane proteins is 
still challenging, despite advances in technology 
(Bill et al. 2011). Nevertheless, there is a steady 
increase in the numbers of structures being deter¬ 
mined, including transport proteins (White 2005). 
Once structure is determined, then insights into the 
structure-activity relationship inevitably follow. One 
underlying theme in the examples of structures of 
transporters contributed in this section of the Ency¬ 
clopedia is how such insights into molecular mecha¬ 
nism are being achieved. In particular, the ways in 
which transport proteins effect alternating access of 
suhstrate(s) to each side of a membrane (Jardetsky 
1966) without making it “leaky” constitute a theme in 
many of the articles. 

Despite the occurrence in all organisms of a huge 
number of membrane proteins designed to transport 
an extremely wide range of substrates both into and 
out of cells and organelles, the bioenergetics of the 
transport processes follow a few straightforward pat¬ 
terns, which will now be characterized in some detail. 
The related experimental techniques are not 
described here, hut examples can he found in Ency¬ 
clopedia articles, particularly those by Niang and by 
Guan and Kaback. 


Bioenergetics of Membrane Transport 

Passive Diffusion 

Passive (simple) diffusion is the translocation of 
a solute across a membrane down its electrochemical 
gradient without the participation of a transport pro¬ 
tein. The process follows Tick’s law, and so obeys the 
relationship below in which the velocity has a linear 
relationship to the [solute] (Fig. la velocity vs. [S]) 

V = PAc 

where v = velocity, P = permeability coefficient for 
the particular solute, A = area, and c = difference in 
solute concentration across the cell membrane. 

Diffusion has a low temperature coefficient (v a 
absolute temperature) and is nonspecific. Typical 
biologically important compounds that follow this 
mechanism are O 2 , CO 2 , NH 3 HCO 2 H, CH 3 CO 2 H, 
and CH 2 OH.CHOH.CH 2 OH; they are small, neutral 
molecules that are soluble in lipid membranes. 

Diagnostic features are the low temperature coeffi¬ 
cient and the linear relationship between velocity and 
substrate concentration (Fig. la). 

Facilitated Diffusion 

In the early 1950s, a saturable catalytic carrier 
transport mechanism was proposed in order to explain 
the observation (Widdas 1952; Boudker and Verdon 
2010 ) that measured rates of transport were not 
linearly related to substrate concentration as predicted 
by Fick’s Law (see above). This process of 
“facilitated diffusion” results from the translocation 
across a membrane of a solute down its electrochemi¬ 
cal gradient catalyzed by a transport protein. The 
Michaelis-Menten relationship often adequately 
relates the initial rate of transport (v) to initial substrate 
concentration ([S] = c at zero time) (Fig. lb) 

v = V„ax.[S]/(K™+[S]) 

(Vmax = maximum velocity. Km = [S] where v is V^ax/ 
2). As with enzyme reactions, there is a high temper¬ 
ature coefficient and, usually, strong substrate speci¬ 
ficity. Biological substrates that follow this mechanism 
are typically charged and/or larger than about the size 
of glycerol, with a very low inherent solubility in 
biological membranes. Mitchell classified such trans¬ 
port of a single substrate as “uniport” (Mitchell 1963, 
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Js = -Ds(d[S]/dx) 


Jgl Rate of diffusion of soiute ‘S’ 

Dg; Diffusion coefficient (dependent on the 

chemicai nature of the soiute moiecuie) 


c/[S]/dx; Concentration gradient (change in cone. S 
per unit distance, x) 


Concentration of soiute outside ceii 
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^max[S] 

Js = 

+ [S] 


Jgl Rate of transport of soiute ‘S’ 


Jmax', Maximum rate of transport 

K^; Soiute concentration at which transport is 

haif the maximai rate 


Concentration of soiute outside ceii 

Membrane Transport: Energetics and Overview, Fig. 1 (a) Simple diffusion or non-mediated transport, (b) Protein-mediated 
transport 
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thickness of the barrier 
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1967, 1973, 1977), and glycerol transport is an 
example of such facilitated diffusion in E. coli (Heller 
et al. 1980). 

Importantly, in free-living single-cell organisms 
such as bacteria, yeasts, and algae, and in certain 
plant and animal tissues, the rate of capture of nutrient 
from the environment by this mechanism is too slow at 
the dilute concentrations that prevail in their normal 
environments to support competitive growth. Thus, 
shortly after facilitated diffusion was proposed, it 
was discovered that coupling of transport of an organic 
solute to transmembrane gradients of protons or 
sodium ions occurred (Mitchell 1963, 1967). Indeed, 
we often find that transport of vital nutrients is coupled 
to consumption of metabolic energy by active transport 


(see below) rather than facilitated diffusion. Presum¬ 
ably, during the course of evolution the expenditure of 
precious energy reserves on transport has been a very 
significant survival factor. 

In contrast, transport of solutes between intracellu¬ 
lar compartments in eukaryotes, or into tissue cells 
from the blood, often occurs by facilitated diffusion 
since high concentrations of solute are already 
established, for example, by the Na'^-glucose symport 
system (below) so that facilitated diffusion by the 
tissue glucose uniporters is subsequently sufficient to 
support cell metabolism. The seminal example of such 
facilitated transport, known as uniport, is the family of 
GLUT glucose transport proteins in mammalian 
tissues (Baldwin and Henderson 1989). 
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Active Transport 

Active transport is the movement of a molecule from 
a region of lesser concentration to an area of greater 
concentration (against its electrochemical concentra¬ 
tion gradient), requiring the presence of specihc inte¬ 
gral membrane transport proteins and the input of 
metabolic energy. This type of transport can be divided 
into distinct groups based upon the energy used to 
drive transport. A summary of active transport systems 
is shown in Fig. 2. 

Primary Active Transport 

Direct coupling of substrate transport to energy- 
producing processes such as respiration or photosyn¬ 
thesis or ATP hydrolysis enables transport of 
a substrate against the concentration/electrochemical 
gradient. The central examples in biology arise from 
the chemiosmotic hypothesis of oxidative and photo¬ 
synthetic phosphorylation, proposed in 1961 (Mitchell 
1961, 1977), suggesting that energy from light, or 
released by oxidation of NADH and FADH 2 , is used 
to transport protons across the cell membranes of 
bacteria, the thylakoid membranes in plants, and mito¬ 
chondrial inner membranes in both plants and animals. 
Energy release occurs in the form of electrons pro¬ 
duced from the breakdown of hydrogen originating 
from energy-rich molecules such as glucose. Energy 
flow down the electron transport chain to an electron 
acceptor enables the transport of protons across 
a closed membrane, thus generating potential energy 
in the form of a pH gradient and an electrical potential 
across the membrane. This gradient is known as the 
proton motive force (PME), and is a store of energy 
that is used for the flow of protons back across the 
membrane, down their electrochemical gradient, via 
a complex enzyme called F-type ATP synthase (Fig. 2) 
(Abrahams et al. 1994; Watt et al. 2010). This enzyme 
is found in all organisms and uses energy stored in the 
electrochemical gradient to generate ATP from aden¬ 
osine diphosphate (ADP) and organic phosphate (Pi). 
The overall process is made possible by the imperme¬ 
ability of the membranes to and OH^ ions. In fact, 
the protein is itself a proton pump operating in reverse 
as an H'^-ATPase. 

The coupling of primary active transport of protons 
driven by electron transfer during photosynthesis or 
respiration to the reverse of primary active transport 
by the F-type H'^-ATPase is, of course, the basic 


mechanism of photosynthesis and of oxidative phos¬ 
phorylation in most organisms. The corresponding 
enzymes in some organisms utilize a Na"^ gradient 
rather than the gradient. 

Another example of this type of transport system is 
the P-type Na’^/K’^-ATPase, involved in maintaining 
the high levels of intracellular ions and extracellu¬ 
lar Na"^ ions (Fig. 2) in eukaryotic organisms (Kaplan 
2002; Morth et al. 2007, 2011). The hydrolysis of one 
adenosine triphosphate (ATP) molecule is coupled to 
the movement of three Na"^ ions to the outside of the 
cell, and two ions to the inside of the cell, against 
their concentration gradients. This is the origin of the 
electrochemical gradient of high Na"^ out and high 
in, that is a vital feature of numerous cell phenomena, 
most evident in the electrical impulses occurring in 
nerve tissue. 

Prokaryotes have a similar ratio of K'^/Na'^, but the 
gradients are maintained by a P-type K'^-ATPase and 
a H'^/Na'^ exchange protein described below. 

Secondary Active Transport 

Mitchell realized that membrane transport proteins 
might obligatorily couple movement of a cation to 
movement of an organic substrate such that the poten¬ 
tial energy of the ion gradient (generated by primary 
active transport) could energize movement of the sub¬ 
strate against its electrochemical potential (Fig. 2). 
Proteins thus take advantage of the PMF formed by 
the chemi-osmotic process by using the energy stored 
in the electrochemical gradient to drive the trans¬ 
port of a substrate against its concentration gradient. 
Evidence for such secondary active transport was 
found for a number of transport systems (West and 
Mitchell 1970; Henderson 1990). In addition to the 
PME, other ion electrochemical gradients that are 
formed in cells can be used to drive secondary active 
transport systems, such as the Na"^ electrochemical 
gradient (Eig. 2; West and Mitchell 1974; Hunte et al. 
2005). 

Symport, Antiport, and Uniport 

Secondary active transport proteins can be divided into 
two types: symporters, in which the substrate and the 
ion are moved in the same direction; and antiporters, 
where the substrate is transported in the opposite direc¬ 
tion to that of the ion (Fig. 2) (Mitchell 1967, 1973). 
Some of these have evidently lost the coupling to 
movement of a cation or another molecule and effect 
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Secondary active transport 


Lactose Leucine Arginine 



Membrane Transport: Energetics and Overview, Fig. 2 The 

large oval represents the cytoplasmic membrane of a cell from 
any organism. In bacteria a transmembrane electrochemical 
gradient of protons is generated by respiration or by a F-type 
ATPase, as shown on the left and transmembrane gradient of 
sodium ions by a H'*'/Na'^ antiport (top left). In higher organisms 
a transmembrane gradient of protons would be generated by a 
P-type Ff^-ATPase and/or a transmembrane gradient of potas¬ 
sium/sodium ions by a P-type Na'^.H'^-ATPase (bottom left). 
The gradients may be used to drive cation-nutrient symport, 
and substrate/substrate antiport secondary active transport 


systems shown along the top, particulary by members of the 
Major Facilitator Superfamilies (MFS) and 5-Helix Inverted 
Repeat Transporters (SHIRT) illustrated. Each is usually a single 
protein indicated by a circle, but homotrimeric examples occur. 
The trimeric AcrB-rTolC antiport efflux system is also illustrated 
(top right). Along the bottom are also illustrated ATP-Binding- 
Cassette (ABC) systems for uptake of nutrients or efflux of 
wastes/toxins, and the phosphotransferase (PTS) mechanism of 
group translocation (see text), which is not found in higher 
organisms. On the right is shown facilitated diffusion by a 
uniport protein. 


movement of one solute only; they are known as 
uniporters (Fig. 2); for example, the “Glut” family of 
transporters for sugars, particularly glucose (Baldwin 
and Henderson 1989). Many of these proteins are 
monomeric polypeptides comprising 12 (some¬ 
times 10-14) transmembrane alpha-helices (Henderson 
1993; Lolkema and Slotboom 2005). This conclusion 
is usually based indirectly upon analysis of the hydro¬ 
pathic profile of the amino acid sequence of each 
protein predicted from its DNA sequence (Henderson 
1993; Griffith and Sansom 1998; Lolkema and 
Slotboom 2005). In a few cases, the prediction is 
reinforced by genetic, immunochemical, or other 
types of topological experiments. There is also 
structural information consistent with the 12-helix 
composition from electron diffraction analyses of 


two-dimensional protein crystals (e.g., Williams 
1999; Hirai and Subramaniam 2004) ultimately con¬ 
firmed by X-ray crystal structures (see below). In some 
cases, transporters are comprised of multimeric sub¬ 
units, three in the case of the Na’^/betaine symporter 
(BetP, Ressl et al. 2009), Na^glutamate (Gltpu, 
Boudker and Ryan) symporters, and HVmultidrug 
antiporter (AcrB, Nakashima et al. 2011). 

Classification of Transporters by Their 
Amino Acid Sequences 

The Transporter Classification Database 

Membrane proteins are classified according to the 
alignments of their amino acid sequences in the 
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Transporter Classification Database (TCDB) devised 
by the Saier Lab Bioinformatics Group (www.tcdb. 
org), and maintained by them and the Paulsen lab 
(Saier et al. 2006; Ren and Paulsen 2010). The TCDB 
is a curated database containing factual information 
from over 10,000 published references, and has classi¬ 
fied over 550 different transporter families. All mem¬ 
brane transport systems are classified on the basis of 
five criteria and are represented as a number which 
appears as follows: TC #V.W.X.Y.Z, where the com¬ 
ponents V.W.X.Y.Z. correspond to: V (a number), the 
transporter class (i.e., channel, porter, primary active 
transporter, or group translocator); W (a letter), the 
transporter subclass (which, in the case of primary 
active transporters, refers to the energy source used to 
drive transport); X (a number), the transporter family 
(sometimes a superfamily); Y (a number), the 
subfamily in which a transporter is found; and Z, the 
substrate(s). 

The MFS, ABC, SHIRT, and SWEET Superfamilies of 
Membrane Transport Proteins 

From the genome sequencing exercises it was recog¬ 
nized that membrane transport proteins in many cells 
fall predominantly into two classes: One of these uti¬ 
lizes ATP to energize the transport of substrates across 
the membrane - the “ATP-Binding Cassette” (ABC) 
superfamily (Sharom 2011, Fig. 2), and the second, the 
“Major Facilitator 5uperfamily” (MFS) of proteins 
similar by aligned amino acid sequences (Pao et al. 
1998; Fig. 2) is usually energized by the electrochem¬ 
ical gradient of protons (Fig. 2), or sometimes gradi¬ 
ents of sodium (Fig. 2) or phosphate ions. 

Members of the ABC Superfamily hydrolyze ATP 
directly to energize transport, and therefore implement 
“primary” active transport (Sharom 2011; ► Substrate 
Capture by ABC Transporters), while many of 
the MFS proteins are dependent on a preexisting elec¬ 
trochemical gradient of ions and so are “secondary” 
active transport proteins. In bacteria, each ABC 
transporter is often comprised of several polypeptides, 
though they tend to become fused during 
evolution to higher organisms. Nevertheless, the 12- 
helix theme is common for the membrane proteins 
of the ABC superfamily (Procko et al. 2009; 
Sharom 2011; ► Substrate Capture by ABC 

Transporters). 

The 5HIRT superfamily of proteins (► Membrane 
Transport Proteins: The Five-Helix Inverted Repeat 


Superfamily) was only recently recognized by the sim¬ 
ilarities of 3D structures of proteins previously catego¬ 
rized as belonging to different families, since there 
were no recognizable similarities in their aligned 
amino acid sequences. They are characterized by 
a 5-helix inverted repeat within their structures, an 
internal pseudosymmetry, which is different from 
the pseudosymmetry found within MFS proteins 
(Dang et al. 2010; Boudker and Verdon 2010; 
Radestock and Forrest 2011). While perhaps not as 
abundant as ABC and MFS transporters, they equally 
include proteins of profound importance in 
metabolism. 

The SWEET superfamily is newly discovered and 
reviewed by Frommer (► SWEET Glucoside Trans¬ 
porter Superfamily). 

Clearly, an overall structural fold for each of the 
ABC, MES, 5HIRT, or SWEET classes of transport 
protein arose early in evolution of primitive organisms 
(Saier et al. 2000), and has been retained with 
enormous numbers of subtle variations to accomplish 
differences in mechanism (symport, antiport, uniport), 
substrate specificity, cation recognition (or not), and 
kinetic parameters as higher organisms arose. 

Structures of Membrane Transport Proteins 
and the Alternating Access Model of 
Substrate Translocation 

By analyzing the alignments of the amino acid 
sequences of MFS proteins it became clear that 
symporters were sufficiently closely related to 
uniporters and antiporters for their structures to be 
similar. Indeed, this turned out to be the case for the 
symporters - LacY, FucP, and PepT, and antiporters - 
GlpT and EmrD, once their structures became 
apparent. Furthermore, transport proteins in higher 
eukaryotes including man were sufficiently related to 
those from primitive prokaryotes like enterobacteria 
that their structures and mechanisms were going to be 
very alike indeed. 

The spectacular confirmation of the 12-helix motif 
so far comes from X-ray diffraction analysis of three- 
dimensional crystals of five MFS proteins from E. coli, 
the lactose/H"^ symporter, LacY (Abramson et al. 
2003), the glycerol-P/Pi antiporter, GlpT, (Huang 
et al. 2003; Lemieux et al. 2004) the erythromycin/ 
H"^ antiporter, EmrD (Yin et al. 2006), the fucose/H"^ 
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symporter, FucP (Dang et al. 2010), and the polypep- 
tide/H"^ symporter (Newstead et al. 2011) all from 
different subfamilies of the MFS. Importantly, while 
some of the MFS protein structures were in the “open- 
inward” conformation (LacY, GlpT) or “closed” 
(EmrE), FucP was in the “open-outward” conforma¬ 
tion. By overlapping the structures of LacY and FucP, 
Dang et al. (2010) were able to hypothesize 
a mechanism for the alternating access of substrates 
from one side of the membrane to the other, without 
creation of a “leak” pathway for ions that would have 
uncoupled energy generation. The role of the sequence 
and structural pseudosymmetry observed in these pro¬ 
teins in the translocation mechanism has been 
discussed (Maiden et al. 1987; Boudker and Verdon 
2010; Radestock and Forrest 2011), and a number of 
experimental approaches have elucidated the step 
changes in conformation that could effect alternating 
access in the LacY protein (Kaback et al. 2011; Guan 
and Kaback, ► H'^-Lactose Membrane Transport 
Protein, LacY). 

The first transport protein for which structures of 
three conformations “outward-open,” “closed,” and 
“inward-open” were achieved was Mhpl (Cameron 
et al. ► Membrane Transport Proteins: The Five- 
Helix Inverted Repeat Superfamily). This provided, 
for the first time, the structural basis for the conforma¬ 
tional changes that effect alternating access. 

Structures of several ABC transport proteins have 
been achieved, both those energizing import and those 
energizing export, some of which are involved in resis¬ 
tance to antibiotics (Procko et al. 2009; Sharom 2011). 
Changes in their conformations during the transport 
cycle can also be interpreted in terms of the alternating 
access mechanism (Poolman and Slotbloom, 
► Substrate Capture by ABC Transporters). 

Membrane Transport Proteins in Health and 
Disease 

Currently, it is technically easier to solve structures of 
proteins from bacterial sources, and some of these are 
important in infectious diseases (Langton et al. 2005; 
Nakashima et al. 2011). Owing to their sequence and/ 
or structural homology to human transporters many 
bacterial proteins can serve as paradigms for the 
molecular mechanisms of mammalian proteins. Nev¬ 
ertheless, human transporters have their individual 


functional and regulatory quirks, and the focus in the 
field is now moving toward elucidation of their struc¬ 
ture-activity relationships targeting those implicated in 
diseases (e.g., Hediger et al. 2004; Sharom 2011). 

Summary 

The historical development of the membrane transport 
field is briefly reviewed, ranging from characferization 
of fransport reactions, through the influence of recom¬ 
binant DNA technologies that enabled identification 
and isolation of the proteins involved, to current deter¬ 
minations of 3D structures and illumination of the 
molecular events of substrate binding, translocation, 
and release. The classification of transport reactions 
according to the bioenergetics of the substrate move¬ 
ments is described. The advent of genome DNA 
sequences led to a global perspective of the evolution 
of transport proteins from microbe to man, as they 
could be categorized into families and superfamilies 
defined by statistically robust similarities in amino 
acid sequences. Recently, three-dimensional structures 
of transport proteins representing different families 
have started to emerge, and in some cases, unexpected 
relationships are found that transcend the absence of 
sequence homology. Furthermore, the structural basis 
of the alternating access concept of substrate transport 
is at last being revealed. 

Cross-References 

► ATPase: Overview 

► Bacterial Respiratory Chains 
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► H'^-Lactose Membrane Transport Protein, LacY 
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Membranes: A Field-Theoretic 
Description 

Michael Schick 

Department of Physics, University of Washington, 
Seattle, WA, USA 

Introduction 

Theoretical methods used to descrihe biological mem¬ 
branes are distinguished by the different length scales on 
which they examine the system. The method which has 
been used to a great extent in the past is 
a phenomenological one in which the length scale of 
interest is sufficiently large that all the details of the 
molecules which make up the membrane can be ignored. 
Consequently, the membrane is described as a two- 
dimensional sheet of which the free energy can be char¬ 
acterized by a few elastic constants. The description has 
been quite successful in the regime of its validity. The 
text of Safran (2003) can be consulted for a good descrip¬ 
tion and some applications. Its limitations are that 
changes in the membrane on small scales, typically the 
thickness of the membrane itself, are not described reli¬ 
ably, and the effects of molecular architecture are not 
clear. Presumably the architecture affects the elastic con¬ 
stants, but the calculation of the latter from the former is 
a separate one, and replete with its own difficulties. 

At the other extreme is the description of the mem¬ 
brane as an assembly of molecules which are described 


in atomic detail. The system is then studied by some 
method of computer simulation, such as “Molecular 
Dynamics.” This has been an extremely successful 
method, particularly for examining the effects on the 
membrane of particular biological molecules. As com¬ 
puters become ever more powerful, more and more 
problems in biology will be examined by simulation. 
Their limitation is, of course, in the time interval over 
which the membrane can be studied. The existence of 
thermodynamic phases has to be inferred from small 
samples, and the calculation of thermodynamic quan¬ 
tities is rather difficult. Some of these problems can be 
overcome by coarse-graining the description of the 
molecules and replacing them with a few interaction 
centers. One thereby trades atomic detail for an 
increase in the time over which a process can be 
examined. The interested reader is referred to the 
review by Saiz and Klein (2002). 

Field-theory methods are, in principle, completely 
equivalent to particle-based methods. But rather than 
using the discrete positions r,, i = 1,2.. .N of all N 
particles as basic variables, the held theory describes the 
system in terms of a held, or helds, which are functions 
dehned over all space r. By adjusting the scale over 
which one chooses to characterize these helds, one 
adjusts the scale over which one examines the features 
of the system. Thus, the description has great hexibility. 
Like its particle-based counterpart, the held theory can 
be simulated. Often one solves the theory within the 
self-consistent held approximation, discussed below, 
which provides the ensemble average of quantities of 
interest, thereby sacrihcing a knowledge of the effects 
of huctuations. But if such effects are not of primary 
interest, then the approximation provides a relatively 
direct method for following processes over long times 
and for obtaining various thermodynamic quantities. 
This means, inter alia, that the free energies of different 
conhgurations or phases can be calculated and those of 
lowest free energy can be determined. To make clear the 
means by which a held theory is constructed, it is useful 
to consider a simple example. This one, and several 
others, are discussed in the excellent monograph by 
Fredrickson (2006). 

What Is a Field Theory? 

The held theory which is most familiar is that which 
describes electromagnetism at zero temperature. 
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Suppose one has a number, N, of discrete charges q where /i = llkgT and the Hamiltonian H[p] is 
located at position r,. Then the total energy of the a functional of the particle density, Eq. 1. We insert 
system, U, can be written in terms of the particle density the identity 


/=1 


( 1 ) 


JVwVpexpi^fS J iw{r) (p(r) — p(r))<3fr| = 1 (8) 


as 



q^p{r')p{r" ) 
47 teo|r' — r" 


dr'dr" 


( 2 ) 


However, one invariably introduces the electric 
potential 


E(r) 


47 ieo|r — r'l 


dr'. 


which satisfies the Poisson equation 


VV(r) 


eo 


so that the energy becomes 

^ = 1^1 ‘?P(r)V(r)c/r. 


( 3 ) 


( 4 ) 


( 5 ) 


After using the Poisson equation to eliminate the 
charge density in terms of the potential and integrating 
by parts once, one obtains 


which expresses the energy entirely in terms of the 
field y(r). This is often a far more convenient starting 
point for any calculation than the particle-based 
expression Eq. 2. 

A similar change in perspective can also be brought 
about in statistical mechanics where one is to calculate 
the configurational partition function of the N particle 
system in a volume V at temperature T: 


Z = exp{-PF{T,N,V)} 


= IT / / H\p]dr 


( 7 ) 


where Vw and Vp denote functional integrations 
(Binney et al. 1992), to obtain 


where 


exp(—/IF) = J DpDw’JJdr, exp i/i J w(r)p(r)4rj 

exp{-/i/(//[pl-/w(r)p(r))rfr} 

./b,iv..xp{-«n/h 

+ j Wp] - TO’(r)p(r))dr) 


= J VpVwexpl^- liT{T,N,V, Kp])| (10) 


exp(-;S/[;H']) = J £/r, exp|-i/i J w{r)p{r)di 


( 11 ) 


and/[;w] is the free energy of a single object subject to 
the field iw{r). The above transformations are exact, 
but the integrals over the fluctuating fields w and p 
cannot be carried out in general. The mean field, or 
self-consistent field, approximation consists of approx¬ 
imating the exact partition function Z of Eq. 10 by the 
value of the integrand at which it is stationary with 
respect to variations in the fields; that is, the exact free 
energy F(T,N,V) is approximated by 


T{T,N,V, [w*,p*]) = Nfiiw*] + I {H[p*] 

— ;w*(r)p*(r))fifr 

where w* and p* are the solutions of the equations which 
express the condition that T should be an extremum: 


0 or iw*(r) 
dp 



p=p* 


( 13 ) 
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6(0 


= 0 


or 


p*{r)=N 



iw=iw* 


(14) 


As the right-hand side of Eq. 13 is real, one sees that 
the extremum of T occurs at a value of w on the 
imaginary axis of the complex vr plane. Second, one 
sees from Eq. 11 that: 

s nf f dripexp{-t^w*(r)p(r)dr} 

nf /t/r,exp{-;^/vr*(r)p(r)£/r} 


= N 


exp{—if)w*(r)} 
f dr exp{—z/?w*(r)} 


(16) 


= < P(r) > ■ 


(17) 


Thus, p*(r) is simply the average density calculated 
in the ensemble of a single particle in the real field 
iw (r). But, from Eq. 13, this field depends upon p 
itself. Thus, Eqs. 13 and 14 are a set of self-consistent 
equations for the fields. Once they are known, the 
mean-field free energy of Eq. 12 is also known. It is 
essentially N /, that is, N times the free energy of the 
single-particle system in the self-consistent field w* 
corrected for some simple overcounting of the interac¬ 
tions that the estimate Ay contains. 


Results 

Lipid Membranes 

The mean-field theory description of membranes has 
a long history. As noted above, mean-field theory 
essentially approximates the problem of calculating 
the partition function of a system of many interacting 
components as one of calculating the partition function 
of a single component in an external field. This field 
depends upon the average properties of the compo¬ 
nents as calculated from the single component parti¬ 
tion function. The approximation is obviously better 
the better the description is of the single components 
and the better the calculation of the single-component 
partition function. Biological lipids present a problem 
in that they are much larger than simple atoms or 
molecules for which the partition function can be 


easily calculated, and they are much smaller than 
very long polymers for which the partition function 
can, in many cases, also be easily calculated. There are 
few alternatives to describing the many configurations 
of the molecule by means of some coarse-grained 
model and actually enumerating all possible configu¬ 
rations of the lipid and evaluating the partition function 
directly. One common method, pioneered by Szleifer 
et al. (1986), is to ignore the head groups of the lipids 
and to focus only on their chains, to describe the 
thermal excitations in the chains by means of Elory’s 
rotational isomeric states model (Flory 1969), and to 
describe the orientation of the chain by the values of 
three Euler angles. Such a description can encompass 
on the order of ten million configurations. Clearly, the 
calculation of the partition function of the system in an 
external field is the necessary and difficult part of the 
approximation scheme. Lastly, Szleifer et al. modeled 
the hydrophobic part of the bilayer as a liquid, 
replacing all of the interactions in the system by 
a constraint that the density of this liquid be constant. 
This constraint is enforced by a field, which is the field 
in this field theory. As in any mean-field theory, it must 
be determined self-consistently; that is, the field 
depends on the average densities which are determined 
from the partition function containing the field itself. 
There are alternative formulations of the basic ideas 
which are also useful (Leermakers and Scheutjens 
1988; Khelashvili et al. 2005). 

Because the description of the lipid chains includes 
the angles between the carbons along the chain, one 
can readily obtain the average value of the angle 
between the normal to the bilayer and the vector from 
carbon number k lo k + 1. An orientational order 
parameter directly related to this angle is measured 
by nuclear magnetic resonance, so that theoretical 
predictions can be directly compared to experiment. 
Further, the effect of the addition of double bonds on 
the order parameter can be calculated and the order 
parameter of the two different tails of common 
two-tailed lipids determined. These and other applica¬ 
tions can be found in the review by Fattal and 
Ben-Shaul (1996). 

Several years ago, the above theory was extended to 
include a second field. Just as the field introduced by 
Szleifer et al. was coupled to the number density, so 
a second field was introduced (Elliott et al. 2005) 





Membranes: A Field-Theoretic Description 


1507 


M 


which coupled to the density of bonds of a specified 
angle. This permitted one to distinguish phases not 
only by their density, but also by the degree of order 
in their tails; that is, to distinguish the gel phase from 
the liquid phase in a one-component system. It also 
permitted a calculation of the phase separation 
between liquid and gel phases which occurs in 
a mixture of saturated and unsaturated lipids. With 
the addition of a model cholesterol, the field theory 
was employed to obtain the phase diagram of the 
ternary mixture which displays, not only a gel phase, 
but two liquid phases (Elliott et al. 2006). All three 
such phases are commonly observed in such 
ternary mixtures (Veatch and Keller 2005). Such 
a phase diagram has only been obtained by field-theory 
methods. Again the calculation of thermodynamic 
quantities is precisely one of the strengths of these 
methods. 

Polymeric Membranes 

The fusion of biological membranes is a crucial pro¬ 
cess which takes place whenever anything leaves the 
cell or enters it, whenever material is exchanged 
between vesicles within the cell, during infection by 
viruses, during fertilization, and many other biologi¬ 
cal processes. It is not well understood, however. 
Some aspects are clear. Because the lipid head groups 
interact favorably with water, it costs energy to 
remove the water between two vesicles in order to 
bring them close to one another. This has to be done 
by some outside agent. Because the removal of the 
water increases the free energy per unit area of the 
vesicle bilayers, that is, increases its surface tension, 
the free energy can decrease if the vesicles can 
decrease their area, which they can do by fusing. 
But again, how does this come about? Over 20 years 
ago, it was proposed that the process began when the 
tails of some opposing amphiphiles flip over to forge 
a connection, denoted a “stalk” which was modeled as 
axially symmetric (Kozlov and Markin 1983). Due to 
the surface tension, the stalk was assumed to spread 
radially, as shown in Fig. 1, leaving behind 
a hemifusion diaphragm consisting of the inner leaves 
of the opposing bilayers. Eventually a hole formed in 
this diaphragm producing the fusion pore. One way to 
investigate this process was to employ a field theory 
description of the bilayers, cause the system to follow 



Membranes: A Field-Theoretic Description, Fig. 1 Density 
profiles of the stalk-like structures shown in the r, z plane of 
cylindrical coordinates, (a) Two bilayers in solvent. There is no 
stalk between them, (b) The limit of stability of the initial stalk, 
(c) The metastable stalk itself, (d) The unstable transition state 
(saddle point) between the metastable stalk and the hemifusion 
diaphragm, (e) A small hemifusion diaphragm. Rg is the polymer 
radius of gyration 
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the proposed pathway, and to monitor the free energy 
along the way. Again this can be done with field- 
theory methods without difficulty. 

It would be advantageous, however, if this could be 
carried out in a system for which the evaluation of the 
single component partition function could be evaluated 
more easily than by the direct numeration of configu¬ 
rations described above for lipids. Amphiphilic lipids 
form bilayers because they afford a structure by which 
the hydrophobic tails can be shielded from unfavorable 
contacts with water by hydrophilic head groups which 
have favorable contacts. However, there are many 
other amphiphiles which form bilayers, and one 
suspects that many behaviors of membranes are uni¬ 
versal in that they are displayed by all membranes, and 
not just the biological ones. Furthermore, some amphi¬ 
philes lend themselves to theoretical calculation. That 
is the case with long, flexible, block copolymers which 
consist of a homopolymer of “A” monomers chemi¬ 
cally joined to a homopolymer of “B” monomers. In 
a solvent of A homopolymers, which interact favorably 
with other As and unfavorably with Bs, these 
block copolymers form bilayer vesicles. Their advan¬ 
tage for theory is that the single component 
partition function can be evaluated by solving 
a relatively simple, modified diffusion equation 
(Fredrickson 2006). 

By utilizing such a theoretical system, and the field 
theory outlined above, one can relatively easily bring 
about the formation of all of the configurations 
involved in the formation of a stalk, the hemifusion 
diaphragm, and a pore (Katsov et al. 2004). Indeed, 
Fig. 1 shows the average density, p{r), which was 
generated by the field theory. The free energy barriers 
encountered in the standard stalk-hemifusion 
diaphragm path to fusion could be compared to the 
barriers encountered in other proposed, nonstandard, 
paths. Such a comparison showed that the barriers in 
the nonstandard pathway were, in fact, smaller 
(Katsov et al. 2006). 

Summary 

This example shows both the limitations and the 
advantages of the field-theoretic approach. As Fig. 1 
was generated from a mean-field approach, one sees 
only the average densities, and none of the detail which 
would be produced by a simulation. On the other hand. 


it was relatively easy to calculate the free energies of 
different fusion pathways which had been proposed, 
and thereby determine which was the most favorable. 
For situations in which thermodynamic properties of 
membranes are paramount, as opposed to their micro¬ 
scopic structure, field-theoretic methods will continue 
to play an important role. 
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Synonyms 

Metabonomics 


Definition 

Metabolomics aims to characterize and quantify all 
low-molecular-mass metabolites found in an organ¬ 
ism. It is one of many “omics” whole system 
approaches that have been introduced in the genomics 
era. Together with proteomics, transcriptomics, and 
genomics, metabolomics is considered an integral 
part of systems biology. Analysis of the metabolome 
provides a snapshot of the cellular phenotype under 
specific conditions including genetic modifications or 
developmental differentiation. Unique patterns of gene 
expression combined with environmental influences 
result in a specific metabolic profile for each organism. 
The terms “metabolomics” and “metabonomics” are 


often used interchangeably. While the former empha¬ 
sizes attempts to analyze all small molecules in 
a complex biological sample, the main purpose of 
metabonomics is to investigate the metabolic changes 
that occur in response to biological or toxicological 
stimuli. Comprehensive and systematic profiling of 
numerous metabolites and their concentrations 
requires the extensive use of chemometrics techniques 
such as multivariate statistics. 

Several analytical methodologies are well 
established in metabolomics studies. Mass spectrome¬ 
try (MS) and nuclear magnetic resonance (NMR) spec¬ 
troscopy are the two main detection technologies used 
for metabolite profiling, but more specialized tech¬ 
niques such as infrared spectroscopy, electrochemical 
detection, or radiolabeling have also been used. 
NMR spectroscopy is quantitative but suffers from 
lower sensitivity. However, it does not require prior 
separation of the analytes, allowing the sample to be 
recovered. All other methods require efficient chro¬ 
matographic separation of the metabolites. Normally, 
MS is coupled to either gas chromatography (GC-MS) 
or liquid chromatography (LC-MS) or to capillary 
electrophoresis (CE-MS), although a few successful 
direct injection MS studies have been reported as 
well. Historically GC-MS has been, and continues to 
be, one of the most widely used methods, even though 
chemical derivatization pretreatments are required to 
make polar metabolites volatile. LC-MS has a higher 
sensitivity, and holds much future potential, but exten¬ 
sive use of stable isotopes is required to obtain quan¬ 
titative data, which influences the cost of the analysis. 

Metabolomics studies are generally applied to 
biofluids such as plasma, serum, urine, saliva, cerebro¬ 
spinal fluid, synovial fluid, etc. Studies of human 
biofluids deliver an integrated view of the system, rather 
than a reductionist picture of a few isolated metabolites. 
The same methodologies can also be used to study cell or 
tissue extracts, a strategy that is often referred to as 
“metabolite fingerprinting.” Moreover, “metabolic 
footprinting” studies of the “exometabolome” in cell 
culture media have also been shown to be quite informa¬ 
tive. Information about metabolites and their involve¬ 
ment in metabolic pathways is organized in several 
public databases, such as the Kyoto Encyclopedia of 
Genes and Genomes (KEGG) Pathway Database and 
the Human Metabolome Database (HMDB). These 
resources combine chemical, clinical, biochemical, and 
functional information. Other dedicated databases 
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contain experimental data that are helpful for the analysis 
of NMR or MS data. In biotechnology, metabolomics 
will help to revolutionize the field of metabolic engineer¬ 
ing. In a clinical setting, metabolomic analysis of human 
biofluids holds great potential for detecting physiological 
changes, which provides useful information for disease 
diagnosis, prognosis, organ dysfunction, toxicity assess¬ 
ment, or for monitoring drug therapy. 

Cross-References 

► Capillary Electrophoresis of Cell Metabolism 

► Drug Metabolism - Computational Studies of 
Mechanisms 

► In-Cell NMR 
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► Mass Spectrometry 
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► Total Correlation Spectroscopy (TOCSY) in NMR 
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► Rhodopsin - Stability and Characterization of 
Unfolded Structures 
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Synonyms 

Amine dehydrogenase 

Definition 

Methylamine dehydrogenase (MADH) (EC 1.4.99.3) 
is a tryptophan tryptophylquinone (TTQ)-dependent 
enzyme that catalyzes the oxidative deamination of 
a primary amine to an aldehyde and ammonia. 
The primary H/D kinetic isotope effect (KIE) for 
the rate-limiting proton transfer step indicates that 
► quantum effects/tunneling plays a role in the 
reaction. 

Introduction 

MADH and the related enzyme aromatic amine 
dehydrogenase (AADH) belong to a family of dehy¬ 
drogenases that catalyze the conversion of primary 
amines to the corresponding aldehydes and ammonias 
(Davidson 1993). Reaction occurs at the TTQ cofac¬ 
tor, a prosthetic group formed by posttranslational 
modifications of two tryptophan residues (see 
Fig. 1). Bond cleavage and reduction of the TTQ 
cofactor occur simultaneously on the timescale of 
experiments, with the quinone oxygen of the modified 
tryptophan gaining a formal negative charge. Using 
stopped-flow spectrophotometry, the rate of TTQ 
reduction can be monitored experimentally and the 
chemical step isolated kinetically, making this class 
of enzyme particularly suitable for the study of 
tunneling in enzymic proton transfers. The availabil¬ 
ity of kinetic data and crystal structures has made 
MADH an important model for studying tunneling 
in enzymes of this class. 
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Methylamine 
Dehydrogenase (MADH): 
Studies with Variational 
Transition State Theory 
(VTST), Fig. 1 The reductive 
half-reaction for the reaction 
of methylamine in MADH, 
showing the rate-limiting 
proton transfer step which 
involves quantum tunneling 




VTST/MT Calculations 

Several different computational simulation methods have 
been developed to model quantum effects in enzyme 
reactions (Ranaghan and Mulholland 2010). Variational 
transition state theory with multidimensional tunneling 
corrections (VTST/MT) (Truhlar et al. 2002, 2004; 
Truhlar 2010), originally developed to treat reactions in 
the gas phase, has been used to predict thermal 
rate constants for enzymic H-transfer reactions with 
the inclusion of quantum effects such as tunneling. For 
application to reactions in enzymes, ► QM/MM methods 


have typically been used in these calculations to calculate 
energy profiles and the required Hessian matrices 
(matrices of the second derivatives of the potential 
energy surface). With this type of method, the effect of 
tunneling is included in a separate step, making it 
possible to quantify the contribution of quantum 
tunneling to the reaction separately (see Eqs. 1 and 2). 
There are several models available to calculate the 
tunneling component of the reaction (see references in 
Trahlar (2005)), giving an overall “phenomenological” 
activation free energy that may be compared with 
experiment (when converted to a rate constant). 
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= -/?rin[y"^(7’)] (1) 

= A^G^““(7’) + A^G^^'^(r) (2) 

The VTST/MT method has been successfully applied 
to several enzymic reactions, but there are some 
drawbacks to the method. Due to the computational 
demands of calculations of this type, the level of QM in 
the QM/MM calculations is often restricted to ► semi- 
empirical quantum chemical methods (such as AMI or 
PM3) which are typically less accurate than more sophis¬ 
ticated QM methods, and may give a poor description. 
Semiempirical methods can be reparameterized to 
improve their description of a particular, specific reac¬ 
tion, however (i.e., using specific reaction parameters). 
VTST/MT approaches, as usually applied to enzymes, 
also do not directly consider the full dynamics of the 
system: Tunneling corrections are typically calculated 
for a small number of representative reaction pathways 
with the protein environment treated as fixed. Classical 
molecular dynamics simulations are often nowadays 
used to generate configurations of the enzyme for 
VTST/MT reaction modeling, i.e., stmctures for reaction 
path optimization. 

Insight into Tunneling in MADH from VTST 
Calculations 

Several studies of MADH have been carried out in 
order to explain the inflated value of the primary H/D 
KIE for the proton transfer step of the reaction 
{Methylophilus methylotrophus MADH 16.8 ± 0.5 at 
298 K (Basran et al. 1999), Paracoccus denitrificans 
MADH 17.2 ± 0.6 (Brooks et al. 1993)). In the earliest 
such study of the reaction, Faulder et al. (2001) calcu¬ 
lated a single minimum energy path (MEP) and the 
primary H/D KIE for the proton transfer step at the 
PM3/AMBER QM/MM level. Their calculated KIEs 
of 6.1 (without inclusion of tunneling), and 11.1 (when 
tunneling is included) show that tunneling needs to 
be included to obtain KIE comparable to the experi¬ 
mental values. In subsequent studies of MADH, 
Tresadern et al. (2003a, b) investigated the effect of 
protein conformation on the calculated KIE for the 
MADH/methylamine system and also investigated 


ethanolamine as an alternative substrate for the 
enzyme (Tresadern et al. 2003a, b). Different reactiv¬ 
ities were found for MEPs for the reaction of methyl- 
amine generated from molecular dynamics snapshots 
of the enzyme in the substrate complex and in the 
product complex, showing the importance of including 
some conformational sampling of the protein in such 
modeling studies. 

Alhambra et al. (2001) used a different variation of 
the VTST/MT framework, with ► QM/MM methods, 
to study the reaction of methylamine in MADH. The 
classical free energy barrier was calculated by umbrella 
sampling molecular dynamics simulation techniques. 
Multiple MEPs, and the tunneling contribution for each, 
were then calculated for different protein configurations 
to evaluate the tunneling contribution to the free energy 
barrier. All of these calculations were carried out using 
specific reaction parameters (SRPs) developed by this 
group for the PM3 semiempirical QM method to give 
a more accurate potential energy surface for the reaction. 
These calculations predicted a primary H/D KIE of 18.3 
at 298 K in excellent agreement with the experimentally 
determined value of 17.2 ± 0.6 for MADH from the 
species Paracoccus denitrificans. 

Ranaghan et al. (2007) also calculated classical free 
energy profiles and quantum corrections using VTST/ 
MT calculations and ► QM/MM methods. These calcu¬ 
lations compared two possible routes for reaction, 
namely proton transfer to either ODl or OD2 of the 
aspartate base for the MADH/methylamine system. 
Both proton transfers were calculated to have similar 
barrier heights and tunneling contributions, leading to 
a phenomenological activation free energy of 16.5 ± 
0.9 kcal mol” * (Ranaghan et al. 2007). The same workers 
also studied the related enzyme AADH (with tryptamine 
as a substrate), and found important differences. In 
AADH, a clear preference for proton transfer to ODl 
was found (in terms of activation and reaction free ener¬ 
gies). QM/MM MD simulations of the substrate complex 
show that ODl of the catalytic aspartate in MADH is less 
able to interact with the proton to be transferred than the 
equivalent oxygen atom in AADH. One possible cause 
for this observed difference is that MADH contains 
a non-conserved tyrosine residue (Tyr47ip in MADH 
which is equivalent to Phel69P in AADH). As a result, 
Thr474p forms hydrogen bonds with both the catalytic 
aspartate (Asp428P) and Tyr47ip, reducing the mobility 
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of Asp428P and Thr474p. These interactions are likely to 
affect significantly the ability of ODl to approach closely 
enough, frequently enough, to form a good hydrogen 
bond with the substrate proton to be abstracted, hindering 
the reaction involving this oxygen. The relative strengths 
of the hydrogen bonds formed by the two oxygens of the 
catalytic base are also likely to influence their reactivity. 
ODl forms a hydrogen bond to Thr474p which is signif¬ 
icantly stronger than the hydrogen bond formed between 
OD2 and Trp460p (Ranaghan et al. 2007). 

Summary 

VTST/MT calculations, with ► QM/MM methods, 
have provided a robust framework for the investigation 
of quantum effects in MADH and other enzymes. 
Several studies with these methods have highlighted 
the importance of using multiple structures of the 
enzyme-substrate complex. Therefore, it is advisable 
to generate multiple structures from molecular dynam¬ 
ics simulations, which can also be run with semiem- 
pirical QM/MM methods. An important technical 
point is that it is usually necessary in VTST/MT cal¬ 
culations to constrain most of the protein environment 
(i.e., with atoms of the enzyme apart from those 
directly involved in the reaction at the active site held 
fixed in position). The fact that KIEs can be calculated 
in good agreement with experimental data with 
a frozen protein environment shows that protein 
dynamics do not directly “drive” tunneling in this 
enzyme (or the related AADH). Hydrogen bonds in 
the active site of MADH have an important influence 
on the reactivity of the catalytic base. 

Cross-References 

► QM/MM Methods 

► Quantum Effects/Tunneling 

► Semiempirical Quantum Mechanical Methods 
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Synonyms 

Ahpl (originally Aureohacterium hydantoin perme¬ 
ase); Arihrohacter aurescens DSM 3747; HyuP 
(“Permease” in the hyu Operons from Pseudomonas 
sp. NS671); Mhpl {Microhacterium hydantoin perme¬ 
ase); Microhacterium liquefaciens AJ 3912 (24); 
Sodium-Hydantoin membrane transport protein, Mhpl 

Definition 

Mhpl is a member of the Nucleobase-Cation- 
Symport-1 (NCS-1) family designated A.2.39.5 
(Saier et al. 2006, 2009; Ren and Paulsen 2010), 
which carries out transport across membranes of 
hydantoins substituted with aromatic groups at the 
5-position. The wild-type protein contains 489 amino 
acids (Suzuki and Henderson 2006), modestly 


modified in a genetic construct at the N-terminus and 
C-terminus, where a (His )5 tag is incorporated to facil¬ 
itate amplified expression, purification, and crystalli¬ 
zation (Suzuki and Henderson 2006; Shimamura et al. 
2008). The transport reaction of Mhpl is: 

Hydantoin(out) -f Na^(out) ^ Hydantoin(in) 

-I- Na+(in) 

This reaction is of commercial interest, because of 
the potential for converting waste hydantoins to com¬ 
pounds of added value, for example, L-amino acids 
(Suzuki et al. 2005; Javier et al. 2009). Established 
substrates of Mhpl are: 

Introduction 

The first protein of the NCS-1 family to have its struc¬ 
ture determined, Mhpl, unexpectedly turned out to 
be similar in protein fold to the LeuT protein 
from the neurotransmitter-sodium-symport (NSS) 
family (Yamashita et al. 2005) and the vSGLT 
sugar-sodium-symport (SSS) family (Faham et al. 
2008), originally classified as unrelated according to 
the dissimilarities of their amino acid sequences (Saier 
et al. 2006). The NCS-1 family is, therefore, part of 
a Transporter Superfamily that has since been termed 
the Amino Acid-Polyamine-Organocation (APC) 
superfamily, the LeuT Superfamily, or the five-helix 
inverted repeat (5HIR) superfamily (Adelman et al. 
2011). The structure of Mhpl has now been deter¬ 
mined in four conformations, including open-out, 
occluded-with-substrate, and open-in, together with 
an additional form predicted by molecular dynamics 
simulations, which illuminate the structural and 
molecular basis of the alternating access mechanism 
(Weyand et al. 2008, 2011; Shimamura et al. 2010). 

X-ray Crystal Structure of Mhpl 

Helix Packing and Pseudosymmetry 

The Mhpl transporter is comprised of 12 transmem¬ 
brane helices (Fig. 1). The first 10 helices form a struc¬ 
tural motif, which is conserved in this superfamily 
of proteins found to contain the “LeuT fold” 
(Weyand et al. 2008; Abramson and Wright 2009; 
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Hydantoin substituted at the 
5-position with an aromatic group 



Krishnamurthy et al. 2009; Shimamura et al. 2010; 
Weyand et al. 2011), The arrangement consists of 
helices 1-5 and 6-10, which show an inverted repeat 
with a twofold axis approximately parallel to the mem¬ 
brane (Fig. 1). The 12 transmembrane helices can be 
grouped into three segments, which are referred to as 
the 4-helix bundle (transmembrane helices (TMs 1 and 
2 and their symmetry partners TMs 6 and 7) colored 
red in Fig. 1); the hash motif, which is another 4-helix 
bundle with the appearance of a hash sign ((TMs 3 and 
4 with TMs 8 and 9) colored yellow in Fig. 1); and the 
C-terminal TMs 11 and 12 (colored gray in Fig. 1) 
(Shimamura et al. 2010; Weyand et al. 2011). The 
first two motifs are linked by TM 5, and the non¬ 
membrane helices IN 2-3 and OUT 7-8; TM 10 links 
the hash motif with the C-terminal TMs 11 and 12 
(Fig. 1). Within helices 5 and 10 (colored blue in 
Fig. 1) there are significant movements during confor¬ 
mational changes (Fig. 1), so they are also referred to 
as “flexible helices.” 

The Na"^ Binding Site 

Na"^ is difficult to resolve by X-ray crystallography, its 
density being similar to that of FI 2 O. Nevertheless, 
finding potential chelating elements (-C=0, -OH) 
arranged in space in such a way as to present an ideal 
coordinating site for Na"^ (Weyand et al. 2008, 2011; 
Shimamura et al. 2010), taken with biochemical evi¬ 
dence of the participation of Na"^ in binding (Weyand 
et al. 2008, 2011; Jackson et al. unpublished data) is 
accepted as sufficient identification of a binding site 
for this cation. 

The proposed Na"^ binding site in Mhpl is located at 
the interface of TMs 1 and 8 (Weyand et al. 2008, 


2011; Shimamura et al. 2010; Fig. 1). In both the 
outward-open and outward-occluded structures 
(Fig. 1) the cation is modeled to interact with the 
carbonyl oxygen atoms of Ala 38 and He 41 in TMl 
and of Ala 309 in TM8 and with the -OH groups of Ser 
312 and Thr 313 in TM8 (Fig. 2). In the inward-facing 
structure (Shimamura et al. 2010; Weyand et al. 201 1), 
on the other hand, TMS has moved ^4.5 A away from 
TMl so that the Na"^ binding site is no longer 
intact (Fig. 2), which is reminiscent of the situation 
in the inward-facing vSGLT and LeuT structures 
(Faham et al. 2008; Watanabe et al. 2010; 
Krishnamurthy and Gouaux 2012). 

The Substrate Binding Site 

The location of the substrate in the Mhp 1 molecule was 
identified in electron density maps of the Mhpl 
crystals with L-benzylhydantoin, L-indolylmethyl- 
hydantoin, and a brominated derivative (Brueckner 
and Cameron unpublished data). An extended electron 
density was clearly observed at a position almost iden¬ 
tical to that of the leucine in the LeuT structure and 
close to that of the galactose in vSGLT. This site is 
located at the breaks in the discontinuous TMs 1 and 6 
and facing TMs 3 and 8 (Figs. 1, 2). It coincides with 
the foot of the outward-facing cavity (Fig. 1), which is 
composed of the neighboring surfaces of TMs 1, 3, 6, 
8, and 10 and allows access of the substrate to the 
binding site (Fig. 1). An extended structure of L-5- 
indolylmethylhydantoin determined by small mole¬ 
cule X-ray crystallography (Patching unpublished 
data) was consistent with the shape of this density 
and with molecular dynamics simulations of the most 
plausible minimal free energy poses of each of the 
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Mhp1, the Na^-Hydantoin Membrane Transport Protein, 
Fig. 1 Structure of the Mhpl hydantoin transport protein from 
Microhacterium liquefaciens (a) Mhpl topology. The positions 
of the substrate and the cation binding sites are indicated. The 
membrane is shown in gray and the inward-facing cavity 


observed in the structure is highlighted in light blue. The hori¬ 
zontal helices on the “IN” and “OUT” sides of membrane are 
indicated as “IN” and “OUT.” TMs 3 and 8 are packed onto each 
other in three-dimensional space, (b) The structures of Mhpl 
viewed in the plane of the membrane. A, B and C show the 
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Mhpl, the Na^-Hydantoin Membrane Transport Protein, 
Fig. 2 The sodium and substrate binding sites in the occluded 
and inward-open structures taken from Shimamura etal. (2010). 
The carbon atoms of the amino acids have been colored as in 

substrates (Beckstein unpublished data). The substrate 
molecules were placed between Tip 117 (TM3) and 
Trp220 (TM6) without requiring any modifications of 
torsion angles (Fig. 2). 

The hydantoin moiety forms a pi-stacking interac¬ 
tion with the indole-ring of Trpll7 and is within 
hydrogen bonding distance of Asn318 and Glnl21 
(Fig. 2). Trpll7 and Asn318 are conserved amongst 
all the transporters in the family and Glnl21 only 
varies in the uridine transporter, Fuil. Another con¬ 
served residue, Asn314, is within hydrogen bonding 
distance of Asn318 such that it may hold the aspara¬ 
gine side chain in position to interact with the sub¬ 
strate. The aromatic substituent of the hydantoin 
substrate is situated next to Trp220 and Gln42. The 
side chain of Trp220 moves into the binding site with 
respect to its position in the substrate-free structure 
(Fig. 2) and forms a pi-stacking interaction with the 
aromatic moiety (Fig. 2). 

< - 

Mhp1, the Na^-Hydantoin Membrane Transport Protein, 
Fig. 1 (continued) arrangements of the helices in the outward- 
open (h), closed (B), and inward-open (C) structures. D, E, and F 
depict slices through the surface of each of the three structures to 
reveal the aperture to the outside (D), the closed form (£) with 
trapped benzylhydantoin substrate shown in cyan, and the 
aperture to the inside (F). The Connolly surface of Mhpl is 


Fig. 1. The sodium ion is represented as a magenta sphere and 
the benzylhydantoin with cyan carbon atoms in the occluded 
structure. In the inward-open structure where these entities are 
not present they are represented in white 

The Structural Basis of the Alternating Access 
Mechanism 

The Mhpl protein has been solved in three different 
conformational states by X-ray crystallography 
(Weyand et al. 2008, 2010, 2011, unpublished data; 
Shimamura et al. 2010) (Fig. 1). The outward-facing 
open and the occluded state show very high similarity, 
whereas neither of those conformations superimposes 
well with the inward-facing open state (Fig. 1), leaving 
the question, whether or not there is only one occluded 
state. Indeed, molecular dynamics simulations do sug¬ 
gest the occurrence of an additional inward-facing 
occluded state (Beckstein unpublished data). The sig¬ 
nificant movements derived from all known structures 
appear to be an overall rigid body rotation of the hash 
motif (helices 3, 4, 8, 9) relative to the bundle (helices 
1,2,6, 7), which functions in a reciprocating manner 
around the pseudosymmetrical structural elements of 
the proteins (cf. Forrest and Rudnick 2009; Boudker 


shown in yellow (calculated with a probe radius of 2 A). The 
ribbon representation of Mhpl is shown in blue and red 
(Reproduced by permission of the American Association for 
the Advancement of Science. The programs used to make these 
and all the structural figures are described in Weyand et al (2008, 
2010, 2011; Shimamura et al. 2010) and their supplementary 
information) 
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Mhpl, the Na^-Hydantoin Membrane Transport Protein, 
Fig. 3 Accessibility of the binding sites of Mhpl in three con¬ 
formational states. Computer simulations of the Mhpl transport 
protein {helices in cartoon representation, front cut away) in 
a lipid membrane (gray) were carried out to determine the 
functional states of three crystallographic structures of Mhpl 
((a) outward-facing open, 2JLN; (b) outward-facing occluded, 
2JLO; and (c) inward-facing open 2x79). Water molecules (red! 


white) can only access the substrate and sodium binding site in 
the open conformations, and the binding sites are sealed in the 
occluded state. This behavior is predicted by the alternating 
access mechanism. Repeated simulations showed that the 
inward-facing conformation (c) readily released the bound 
Na"^ ion; the positions of the ion (magenta spheres) from six 
simulations were all overlaid in panel c and define the exit 
pathway 


and Verdon 2010). The C-terminus (helices 11, 12) 
moves relatively little in comparison. 

Using a novel simulation method (dynamic impor¬ 
tance sampling), we were able to visualize the func¬ 
tional states of the crystallographic structures of Mhpl 
(Fig. 3) and an additional hypothesized conformation 
in the membrane. The transition between the states can 
be described as the interplay of a buried thick gate, an 
external thin gate, and an internal thin gate. The thick 
gate is regulating the conformational change from the 
outward-facing occluded state to the inward-facing 
open state and obstructs the ligand binding site toward 
the intracellular space, when open to the outside. In 
contrast, the external thin gate comprises only some 
residues in helix 10, which orchestrate the opening and 
closing of the outward-facing substrate and Na"^ bind¬ 
ing sites. A complementary region of residues in helix 
5 comprises the internal thin gate, which allows open¬ 
ing and release of substrate and Na"^ toward the inside 
when the external thin gate and thick gates are closed 
to the outside. The interplay of these events to effect 
transport are illustrated schematically (Fig. 4). During 
the conformational changes the occluded state of the 
protein always prevents net movement of either cation 
or substrate without the other - their movement 
becomes “coupled.” Molecular dynamics simulations 
(Shimamura et al. 2010; Adelman et al. 201 1) showed 


evidence for a sequence of events apparent from the 
crystallographic structures and fully compatible with 
the alternating access mechanism of transmembrane 
transport (Jardetzky 1966) as follows. Starting from 
the outward-facing open state, the thin extracellular 
gate formed by TMIO shuts to form the occluded 
state. A large conformational change takes place dur¬ 
ing which the hash motif (the thick gate) swings around 
an axis roughly parallel to TM3 to form the inward¬ 
facing conformation. Through subsequent opening of 
the intracellular thin gate (formed by TM5) the sodium 
and substrate binding sites become connected to the 
cytosol, allowing egress of the transported ion and 
substrate. 

Recently the structures of LeuT in substrate-free 
outward-open and apo inward-open states have been 
determined (Krishnamurthy and Gouaux 2012), 
adding to the outward-closed form published previ¬ 
ously (Yamashita et al. 2005). Its similarities and dif¬ 
ferences to Mhpl are discussed by Cameron et al. 
(2012, Encyclopedia). 

The Kinetic Features of Alternating Access 

While three conformations of Mhpl have been 
resolved, the complete process of transport could 
involve as many as 12 intermediate states (Fig. 4, 
Forrest et al. 2011). Of course, some of these may be 
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Mhp1, the Na^-Hydantoin Membrane Transport Protein, 
Fig. 4 Schematic illustration of the possible conformations of 
an NCS-1 membrane transport protein. The bundle helices 1, 2, 
6, 7 are in red. The hash motif helices 3, 4, 8, 9 forming the 
internal thick gate are in yellow, the external thin gate 


containing part of helix 10 is in blue. The internal thin gate 
containing part of helix 5 is also in blue. The rotation axis of the 
thick gate, running approximately parallel to TM3, is shown as 
a filled black circle on TM3. The process of transport from 
outside to inside follows the steps from 1 to 10 


unstable and short-lived, and so unlikely to be detected 
with available biophysical techniques. For example, 
molecular dynamics simulations suggest the occur¬ 
rence of an additional conformationally different 
form of the inward-facing occluded state (Beckstein 
unpublished data). Nevertheless, establishing the num¬ 
ber of these states and the kinetic constants for their 
interconversions is the next stage in understanding the 
molecular mechanism of Mhpl, and dehning the 
differences between members of the NCS-1 family 
itself and between other members of the SHIRT 
superfamily. 

Molecular dynamics simulations (Shimamura et al. 
2010; Beckstein unpublished data) also suggest that 
the external and internal gates oscillate rapidly (on the 
sub-microsecond timescale), and very much faster than 
hypothesized changes in the conformation of the inter¬ 
nal gate or, indeed, observed rates of transport. It is 
therefore unlikely that any one step involving 
just a change in the gates limits the rate of the translo¬ 
cation process. This could simplify the kinetic model 
(Fig. 4) to a smaller number of relatively stable states. 

Measurements of the rates and extents of ligand 
binding to Mhpl can be accomplished by following 
changes in the inherent tryptophan fluorescence of puri¬ 
fied Mhpl (Fig. 5). Binding of a hydantoin is markedly 
enhanced in the presence of Na"^ ions to include both 
a binding step and conformational change (Fig. 5, 


Ivanova and Jackson unpublished). The reciprocal 
enhancement of Na"^ binding by benzyUiydantoin is 
also apparent (Fig. 5) - the binding of cation and sub¬ 
strate is coupled. The kinetics of these changes can be 
profoundly altered by mutating some of the key residues 
identified above (Jackson and Ivanova unpublished). 

Summary 

The structures of the Na'^-hydantoin transport protein, 
Mhpl, in outward-open, occluded, and inward-open 
conformations are summarized. A working model of 
the alternating access mechanism of transport is 
derived from detailed examination of the ligand bind¬ 
ing sites in the different conformations allied with 
molecular dynamics simulations. 

Our working model for secondary transport in 
Mhpl contains the following steps: 

1. In equilibrium and in the absence of substrate and 
ions, the protein switches spontaneously between 
the inward-facing and the outward-facing state. 

2. Once Na"^ binds to the outward-facing state, it 
stabilizes it and allows the substrate (here 
benzylhydantoin) to bind, too. 

3. The extracellular thin gate closes. 

4. The thick gate switches from outward facing to 
inward facing. 
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Mhpl, the Na'^-Hydantoin Membrane Transport Protein, 
Fig. 5 Tryptophan-fluorescence-quenching by L- 
henzylhydantoin. The Mhpl solution was titrated by 
L-benzylhydantoin and/or NaCl and the decrease in the trypto¬ 
phan fluorescence at 348 nm was monitored. The measurements 
were performed with {triangles) and without (circles) 2mM 
L-benzylhydantoin (upper) or 15 mM NaCl (lower) in the solu¬ 
tion (For details, see Weyand et al. (2008)) 

5. The intracellular thin gate opens and substrates are 
released to the intracellular space. 

6. Go back to 1. 


Cross-References 
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Synonyms 

Digital PCR 


Definition 

The detection of single DNA molecules has been made 
possible by the use of what is known as a digital 


polymerase chain reaction (digital PCR) approach. Con¬ 
ventionally, the process of digital PCR has involved 
sequential limiting dilutions of target DNA followed 
by PCR amplification (Simmonds et al. 1990; Vogelstein 
and Kinzler 1999). In theory, when the target DNA is at 
limiting dilution levels, each of the dilutions contains 
a single copy, or fewer, of the target DNA template. PCR 
is then conducted; a positive amplification signal is only 
observed if a DNA molecule is present. 

The use of valved microfluidics has enabled 
a different approach to digital PCR by partitioning 
DNA molecules instead of diluting them. In this con¬ 
text, the original sample is effectively split into multi¬ 
ple partitions prior to PCR. The concentration of any 
sequence in a DNA sample (copies/pl) can be calcu¬ 
lated using the numbers of positive partitions that 
contain at least one copy of that sequence. 

Basic Characteristics 
Digital Array™ IFCs 

The Fluidigm Digital ArrayIFC (Ottesen et al. 2006; 
Oehler et al. 2008; Qin et al. 2008) is a nanofluidic 
biochip where digital PCR reactions can be performed 
with isolated individual DNA template molecules. Inte¬ 
grated Fluidic Circuits (IFC) are arrays of nanoscale 
reaction chambers, channels, and valves integrated 
together to create individual digital PCR reactions 
(http://www.fluidigm.com/biomark-system.html). 

There are two key advantages of using a valved 
nanofluidic device for digital PCR. First and foremost, 
setting up a digital PCR experiment is significantly 
faster, simpler, and as a result more reproducible in 
comparison to conventional PCR. Another advantage 
is that a fraction of the chemistry utilized in a standard 
digital PCR experiment is used, as it is prepared only 
once, and the volume of each reaction is orders of 
magnitude smaller than conventional methods. 

The beauty of the Digital Array IFC design is its 
simplicity. Essentially, the design consists of hundreds 
of reaction chambers that are isolated by pressurized 
valves. A Digital Array IFC can be configured in 
a variety of ways to optimize for throughput, i.e., 
more sample inlets, or sensitivity, providing more 
reaction chambers per sample. The simplicity of the 
design provides both flexibility and robustness. Two 
forms of commercial Digital Array IFCs are currently 
available, which have configurations known as 12.765 
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Microfluidics for Single Molecule Detection, Fig. 1 The 

Fluidigm 48.770 Digital Array IFC 


and 48.770 (with 9,180 and 36,960 reaction chambers 
respectively), and utilize an instrument called 
the BioMark^™ system http://www.fluidigm.com/ 
biomark-system.html (Fig. 1). 

Fluidigm’s 48.770 Digital Array IFC has 48 indi¬ 
vidual panels with 770 reaction chambers per panel for 
digital PCR. Four microliters of sample-assay mix is 
pipetted into each inlet; up to 48 different samples may 
be tested in a single Digital Array IFC. The mixture 
fills the panel with the help of a specialized device 
known as the MX-IFC Controller, which pressurizes 
the inlet with air, forcing the liquid into the array. The 
material used to manufacture the chips is permeable, 
which allows the air in the chip to be displaced by fluid 
rapidly. Of the 4 pi of fluid pipetted into the inlet, 
0.65 pi is contained in the reaction chambers (the 
“analyzed volume”), with the remainder of the mixture 
left (“dead volume”) in the channels and inlet ports 
used to deliver the mixture. Each reaction chamber is 
approximately 0.84 nl in volume. The reaction cham¬ 
bers are isolated from each other by pressurized valves 
which are controlled by an on-board pressurized accu¬ 
mulator. These valves partition the sample into 770 
digital reactions per panel. The input concentration of 
the sample must be sufficiently low to create a digital 
signal, so that a percentage of the panel reaction cham¬ 
bers contain no DNA. 

If the sample concentration is known, one can cal¬ 
culate how much DNA should be added to the mix 
to result in the optimal range for positive reaction 
chambers (200-700, Dube et al. 2008). For more 


information, see http://www.plosone.org/article/info 
%3Adoi%2F10.1371%2Fjournal.pone.0002876. For 
the standard protocol, where 30% of the reaction mix 
is DNA, the approximate input sample concentration is 
4-32 ng/pl. If the sample concentration is not known, 
one can use multiple panels with different dilutions 
within the same chip. 

Programmable Digital Array IFC 

A chip almost identical in structure to the 48.770 has 
been developed that allows the user to configure the 
chip for alternate throughput and sensitivity require¬ 
ments, depending on their needs, at the time of use. 
A small change in protocol that involves a different 
sequence of valve actuation and alternate inlets for the 
sample pipette can result in either a 48.770 configura¬ 
tion as previously described, a 12.3080 configuration 
(12 samples, 3,080 reaction chambers per sample), or 
a 1.36960 (1 sample, 36,960 reaction chambers per 
sample). At the cost of throughput (1 sample versus 
48 samples) much higher sensitivity can be achieved 
with the 1.36960 configuration as shown in Fig. 2. 

Regardless of the configuration, after the array has 
been loaded and partitioned, it is thermocycled to 
amplify the target DNA when present. Chambers that 
contained at least one copy of target DNA at the start of 
the reaction amplify, creating a measureable difference 
between those with target DNA (amplified fluorescent 
signal), and those without. 

Digital PCR and Copy Number Variation Studies 

One of the major sources of variation in the human 
genome is caused by changes in the number of copies 
of DNA segments present on individual chromosomes. 
This form of copy number variation (CNV) has also 
been associated with a number of genetic disorders 
(Lupski 2007). It is therefore very important to study 
this form of variation in order to understand its impact 
on human health. 

Current CNV analysis is mainly dependent upon 
microarray-based SNP analysis, comparative genomic 
hybridization (CGH) platforms, DNA sequencing, or 
qPCR (quantitative polymerase chain reaction) which 
are low-sensitivity, low-resolution approaches. The 
use of digital PCR using microfluidics provides an 
approach which combines high sensitivity with supe¬ 
rior resolution (Lo et al. 2007; Qin et al. 2008). 

Using the Huidigm 12.765 Digital Array IFC, it has 
been shown that it is possible to differentiate as little as 
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Microfluidics for Single 
Molecule Detection, 

Fig. 2 Assuming that 80% of 
reaction chambers are positive 
for a single panel (770, 3080, 
or 36960), the chart shows the 
difference in uncertainty (95% 
confidence) for each 
configuration. At the cost of 
throughput, precision can be 
greatly enhanced (uncertainty 
greatly reduced) 


1.2 



48.770 12.3080 1.36960 


Chip configuration 



Microfluidics for Single Molecule Detection, Fig. 3 The 

prototype Ultra-High Density Digital Array IFC has an input/ 
output frame with two sample inlets and an integrated fluidic 
circuit (IFC) comprising two panels of 100,893 reaction cham¬ 
bers each. PCR mixes with DNA samples are loaded into each 
panel and the single DNA molecules are randomly partitioned 
into 100,893 chambers. After sample loading, the chip is 
thermocycled, imaged, and analyzed on a specially modified 
BioMark system 

a 15% difference in gene copy number, or between 6 and 
7 copies of a target gene (Qin et al. 2008). The system 
was easily capable of distinguishing between normal 
levels, deletions and duplications of the CYP2D6 gene, 
which is a highly polymorphic gene, and is of great 
clinical importance. More details can be found at http:// 
nar.oxfordjoumals.org/content/36/18/ell6.long. 


Ultra-High Density Digital Array IFC 

Finally, we have developed a prototype chip known as 
the Ultra-Fligh Density Digital Array IFC (Fig. 3). It 
has 201,786 reaction chambers assembled in 507 rows 
and 398 columns on a 32 mm-by-32 mm detection area 

200 PCR reactions/mm^). Each reaction chamber 
measures 40 pm x 40 pm x 80 pm, yielding a chamber 
volume of 128 picoliters. The chip is divided into two 
equal size panels (100,893 chambers each) that can be 
loaded with two different samples, with each panel 
having a total reaction volume of 12.9 pi. 

We again studied copy number variation by exam¬ 
ining two genes, one of which was the gene of interest 
(RPP30) and the other the reference gene (RNase P). 
A normal human genomic DNA sample was mixed 
with PCR reagents (master mix, primers, and fluores¬ 
cent probes for the two genes). The RNase P probe 
was FAM-labeled and the RPP30 probe was CAL 
Fluor Orange 560 (CFO)-labeled. The multiplex 
PCR reaction was performed separately on two Dig¬ 
ital Array IFCs on the BioMark system. Molecules of 
the two genes were independently amplified. FAM 
and CFO signals from all chambers were recorded 
by the CCD camera at the end of each PCR cycle. 
Figure 4 shows a computer-generated call map from 
part of the chip, showing the chambers with positive 
signals for either or both genes, each labeled with 
a different fluorescent dye. 

The ratios of the counts of the RNase P gene to the 
RPP30 gene were found to be very close to 1.0, as 
expected, since both are single copy genes (1.004 and 
1.000 for two independent chip runs). Our results dem¬ 
onstrate that the determination of copy number on the 
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Microfluidics for Single Molecule Detection, Fig. 4 Soft- 
ware-generated composite callmap from a prototype Ultra-High 
Density Digital Array IFC after PCR showing the chambers with 
positive signals for either or both genes, each labeled with 
a different fluorescent dye: red for RNase P (FAM) and blue 
for RPP30 (CFO) 

prototype Fluidigm Ultra-High Density Digital Array 
IFC can be extremely precise in comparison to con¬ 
ventional methods. 


Summary 

It must be stressed that while we have focused on the 
ability of digital PCR and Digital Array IFCs to detect 
differences in gene copy numbers, the same approach 
has been used for absolute quantitation (“molecule 
counting”) of genomic DNA and cDNA (Bhat et al. 
2009), and digital haplotyping (Menzel et al. 2010). 
We have also shown that Digital Array IFCs are 
between 50- and 200-fold more sensitive in detecting 
the Abl-T315I mutation in messenger RNA than con¬ 
ventional techniques (see http://www.ncbi.nlm.nih. 
gov/pubmed/18615105). Our microfluidic approach 
to examining single molecules is an extremely power¬ 
ful tool for both basic and clinical research. 


References 

Bhat S, Herrmann J, Armishaw P, Corbisier P, Emslie KR. 
Single molecule detection in nanofluidic digital array IFC 
enables accurate measurement of DNA copy number. Anal 
Bioanal Chem. 2009;394(2):457-67. 

Dube S, Qin J, Ramakrishnan R. Mathematical analysis of copy 
number variation in a DNA sample using digital PCR on a 


nanofluidic device. PLoS ONE 2008;3:e2876. http://www. 
plosone.org/article/info%3Adoi%2F10.1371%2Fjournal. 
pone.0002876 

Lo YM, Lun FM, Chan KC, Tsui NB, Chong KC, Lau TK, 
Leung TY, Zee BC, Cantor CR, Chiu RW. Digital PCR for 
the molecular detection of fetal chromosomal aneuploidy. 
Proc Natl Acad Sci USA. 2007;104:13116-21. 

Lupski JR. Genomic rearrangements and sporadic disease. Nat 
Genet. 2007;39:S43-7. 

Menzel S, Qin J, Vasavda N, Thein SL, Ramakrishnan R. Exper¬ 
imental generation of SNP haplotype signatures in patients 
with sickle cell anaemia, Sept 24. PLoS One 2010; 5(9): 
el3004. doi:10.1371/journal.pone.0013004 http://www. 

plosone.org/article/info:doi/10.1371/joumal.pone.0013004 

Oehler VG, Qin J, Ramakrishnan R, Facer G, Ananthnarayan S, 
Cummings C, et al. Absolute quantitative detection of ABE 
tyrosine kinase domain point mutations in chronic myeloid 
leukemia using a novel nanofluidic platform and mutation- 
specific PCR. Leukemia 2008;23:396-9. http://www.ncbi. 
nlm.nih.gov/pubmed/18615105 

Ottesen EA, Hong JW, Quake SR, Leadbetter JR. Microfluidic 
digital PCR enables multigene analysis of individual envi¬ 
ronmental bacteria. Science. 2006;314(5804):1464-7. 

Qin J, Jones RC, Ramakrishnan R. Studying copy number varia¬ 
tions using a nanoiluidic platform. Nucleic Acids Res 2008; 
36:el 16. http://nar.oxfordjournals.org/content/36/18/el 16.1ong 

Simmonds P, Balfe P, Peutherer JF, Ludlam CA, Bishop JQ, 
Brown AJ. Human immunodeficiency virus-infected individ¬ 
uals contain provirus in small numbers of peripheral mononu¬ 
clear cells and at low copy numbers. J Virol. 1990;64:864-72. 

Vogelstein B, Kinzler KW. Digital PCR. Proc Natl Acad Sci 
USA. 1999;96:9236^1. 


Micropipette Manipulation of Lipid 
Bilayer Membranes 

David Needham 

Department of Mechanical Engineering and Materials 
Science, Duke University, Durham, USA 

Introduction 

The micropipette manipulation technique was initiated 
by Rand and Burton (Rand and Burton 1964) and was 
later perfected by several groups, most notably, Evans 
and Hochmuth, and Chien and Skalak (Eig. 1). For 
almost 50 years these micropipette techniques have 
provided a unique ability to apply well-defined stresses 
for basic modes of membrane and cellular deforma¬ 
tion, and to simultaneously measure the strain, and 
rates of strain, resulting from these applied stresses 
(Evans and Skalak 1980). It was then this ability to 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 1 The Author working at a micromanipulation station 
creating microcrystals (Photo, Duke News Service) 


measure directly these stress and strain parameters 
which has allowed it to characterize the material 
behavior of individual erythrocytes (Evans and 
Hochmuth 1978), leukocytes (Meiselman et al. 
1984), and cancer cells (Needham 1991) in terms of 
elastic moduli and viscous coefficients as well as inter¬ 
surface interactions (Evans 1988). Starting in the 
1980s the technique was adapted and developed hy 
Evans and Kwok (Kwok and Evans 1981; Evans and 
Kwok 1982) and then by Evans and Needham (1987) 
to study the simple lipid bilayer membrane as, the 
so-called. Giant Unilamellar Vesicles (GUVs). 

In this section, the development and use of several 
micropipette techniques and analyses (Evans and Need¬ 
ham 1987; Needham and Zhelev 1996, 2000) will be 
described that characterize the two molecule-thick 
membrane as a soft elastic material that: is a very soft 
material with a compressibility somewhere between 
that of a bulk liquid and a gas; is stiffened considerably 
by the inclusion of cholesterol to levels equivalent to 
polyethylene; is permeable to water in relation to its 
compliance; can exchange small amounts of other lipids 
and surfactants with its surrounding milieu; displays 
a 25% change in area when taken through its main 
acyl chain freezing transition; and, as a solid material, 
shows the yield shear and shear viscosity of a Bingham 
plastic. When manipulated in pairs, lipid vesicles have 
also been found to be subject to the same range of 
attractive and repulsive colloidal interactions as many 
other particles, including: 


van der Waals attraction, limited by a very short-range 
hydration repulsion and the variable-range power law 
of electrostatic repulsion; steric repulsive barriers due 
to the presence of bound aqueous polymers like poly 
(ethyleneglycol); depletion flocculation by the same 
molecules when free in surrounding solution; and they 
are subject to intimate mixing when manipulated into 
contacts that reduce the hydration barrier and allow 
membrane-membrane fusion. 

Modes of Deformation 

As introduced and discussed at length by Evans (Evans 
and Skalak 1980), reviewed previously (Bloom et al. 
1991; Needham and Zhelev 1996), and shown in Eig. 2, 
three independent shape changes are usually considered 
in the analyses of vesicle membrane deformations, such 
as area expansion, in-plane shear, and bending. 

Together they characterize the deformation and rate 
of deformation of the lipid bilayer. These independent 
shape changes are produced by the application of 
external forces to the membrane elements, such as 
the hydrostatic pressure provided by the micropipette 
aspiration of a single lipid vesicle (see later. Fig. 6). 
Acting normal to the membrane the induced internal 
vesicle pressure is balanced by membrane tension 
components multiplied by the curvature of the mem¬ 
brane vesicle (a form of the Law of Laplace). 

Proportionalities between intensive forces and 
static deformations describe the three modes of defor¬ 
mation and are used to characterize the membrane’s 
material properties and give rise to first-order consti¬ 
tutive relations. For a liquid lipid bilayer (which does 
not support elastic shear stress), the two important 
elastic relations are area dilation and bending. 

Area dilation is characterized by an isothermal area 
expansion (or compressibility) modulus K^, given 
by the equation. 

At = Ka0£ (1) 

where a is the fractional change in membrane area 
(AA/Ao) produced by isotropic membrane stress Ax; 
and Ao is the initial vesicle area at a tension of 
^0.5 mN/m (below which membrane undulations are 
the dominant contribution to tension-induced area 
change (Evans and Rawicz 1990), see later. Single 
Vesicle Experiment, Bending). 
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Micropipette Manipulation 
of Lipid Bilayer 
Membranes, Fig. 2 Modes 
of deformation adapted from 
Evans and Skalak (1980). 

The mechanics of a cube are 
collapsed into an infinitely thin 
2D membrane. Normal, and 
in-plane stresses become 
isotropic dilation, shear, and 
cutting or bending stresses 
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Membrane bending is characterized by the bending 
rigidity kc, which is the ratio of the change in mem¬ 
brane bending moment AM to changes in total 
membrane curvature Ac, given by 

ke = AM/Ac (2) 

At a given position in the membrane, curvature 
change is the change in the principal radii of curvature 
Ri and R 2 , that is, 

Ac = A(1/Ri + 1/R2) (3) 

Ka and kc are related when a certain distribution of 
stress is assumed across the bilayer (Evans and Skalak 
1980). In both modes of deformation, the area per 
molecule at the membrane interface changes: 
For expansion, there is an increase in area per molecule 
in both monolayers of the bilayer, and for bending, the 
area per molecule increases in one side of the bilayer 
and decreases in the other. 

While liquid bilayers by definition do not support 
elastic shear, solid (or gel) phase lipid bilayers do 
support shear stress elastically and so, for this solid 
bilayer state, shear deformation is characterized by the 
Surface shear rigidity for a membrane element p, 
which is the ratio between surface shear stress and 
shear deformation e^. 


Ts = 2pes (4) 

Shear deformation, e^, is given by the in-plane 
extension as, 

es = (42 - 4 - 2)/2 (5) 

Viscous coefficients characterize the liquid behav¬ 
ior (viscosity) for each of these three modes and are 
given by proportionalities between forces (including 
moments) and rates of deformation (Evans and 
Skalak 1980; Bloom et al. 1991). For the liquid lipid 
bilayer though, time constants for dilational and 
bending deformation are on a “molecular time 
scale” (10~^ s to lO^'^^s (Bloom et al. 1991)) that is 
associated with acyl chain conformational changes. 
These time scales are not measurable in micropipette 
experiments where observation is limited to times 
^10“^ s due to the l/60th s speed of video signals. 
Viscous resistance to shear for liquid bilayers has, 
however, been measured by pulling bilayer tethers 
from a vesicle held under known tension (Waugh 
et al. 1992). For solid bilayers a shear viscosity has 
been measured using the micropipette technique 
(Evans and Needham 1987). 

Following a brief description of the apparatus and 
technique, this entry will conclude with a series of 
images and videos of some of the experiments that 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 3 (a) The Micropipette Manipulation Station, showing two 
manometers for pressure control from microatmospheres to 10s 
of milliatmospheres, three plate-mounted micromanipulators, 
and two pneumatic joy sticks for fine positional control of two 


of the pipettes, all centered on an inverted optical microscope; 
(b) Closeup view of a double microscope chamber showing three 
micropipettes entering the chamber from three plate-mounted 
micromanipulators (Olbrich 1997) 


have been carried out using the micropipette technique 
to study lipid bilayers and their interactions. 

Micropipette Manipulation Apparatus 

As shown in Fig. 3, the micropipette manipulation 
technique is centered around an inverted microscope 
that can mount up to four micromanipulators directly 
on the microscope stage plate (Olbrich 1997; Needham 
and Zhelev 2000). Control over micropipette suction 
pressure is in the range of microatmospheres to tenths 
of atmospheres (0.1 N/m^ to 10,000 N/m^) and is 
achieved by a water-hlled manometer equipped with 
a sensitive micrometer-driven displacement and 
coarser syringe control; positive and negative pres¬ 
sures are recorded by in-line pressure transducers 
(Validyne, Northridge, CA). A temperature control 
chamber allows the sample temperature to range from 
15°C to 70°C. All experiments, along with the temper¬ 
ature and pressure data, are recorded on a video tape 
and analyzed after the experiment. Recent increases in 
data storage have now allowed the use of digital 
recording. 

Single Vesicle Experiments 

Analysis of micropipette suction pressure combined 
with vesicle and pipette geometry give membrane 


stresses and resulting deformations; they provide 
direct measures of the elastic coefficients listed in 
Fig. 2. For micropipette suction pressures (AP) in the 
range of microatmospheres to tenths of atmospheres 
(0.1 N/m^ to 10,000 N/m^), the force (Fp) at the pipette 
tip can be estimated from the pipette cross-sectional 
area Ap, given by, 

Fp = AP/Ap (6) 

And so for a modest 8-pm-diameter pipe, it is possible 
to deliver forces as small as 5 pN, up to 750 nN. At the 
lowest end, this is equivalent to the force developed by 
a molecular Kinesin motor walking on microtubule. It is 
quite possible to make a 2-pm-diameter pipette which 
delivers a force of only 0.3 pN - just over twice the force 
needed to stretch double strand DNA to 50% relative 
extension. In bending, the membrane is just as soft as 
these materials. 

Bending 

Figure 4, and accompanying Video 1, show a simple 
micropipette experiment in which a pipette suction at 
the limit of control and resolution (2 patm, 0.2 N/m^, or 
2 dyn/cm^) is applied to a single lipid vesicle. 

The 4-nm-thick lipid bilayer membrane is viewed 
in the interference contrast optical microscope. 
The bending modulus, kc, is calculated from the 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 4 Membrane bending, (a) GUV at the pipette tip with zero 
applied suction pressure; (b) Application of suction pressure; 
(c) AP of 0.2 N/m^ bends the membrane into the micropipette 

suction pressure AP required to bend the membrane 
into the micropipette mouth to just 1 pipette radius 
(Rp). kc is approximately given by the equation 
(Evans and Needham 1987), 

k, = APRpVS (7) 

and so, with a suction pressure of only 0.2 N/m^, only 
a membrane tension on the order of 2.5 mN/m x 10“"* 
is being applied - in the range of ultra-low interfacial 
tensions usually associated with microemulsions. 
The value calculated for the bending stiffness for this 
lipid bilayer is 1.6 x 10“** N.m. This is actually an 
energy unit, and since 1 N.m = 1 J, the energy required 



Micropipette Manipulation of Lipid Bilayer Membranes, 
Video 1 Membrane bending 


to bend the membrane is 1.6 aJ and is about the same 
energy as a single covalent bond (^10 eV). Subse¬ 
quently, Evans and Rawicz (1990) showed that in the 
expansion experiment, below a threshold level of ten¬ 
sion that would create actual lipid-lipid area change of 
the bilayer, expansion is largely due to extending mem¬ 
brane undulations. This is an entropy-driven tension 
characteristic of the bending elasticity of the condensed 
fluid membrane. Thus, more sensitive measurements 
(Evans and Rawicz 1990; Waugh et al. 1992; Zhelev 
et al. 1994) give kc 0.1-0.5 x 10“** N.m. And so the 
earlier value is actually an overestimate of the bending 
stiffness of the lipid bilayer membrane, that is, despite 
applying only 5 pN of force to the membrane, this force 
is greater than what it takes to bend it. All this underlines 
the fact that as thin flexible sheets, that surround every 
cell on the planet, lipid bilayers are extremely flexible 
materials. 


Area Expansivity and Tensile Failure 

When much larger suction pressures are applied to 
a single vesicle, the membrane can be made to expand 
and eventually fail. In this experiment, the parameters 
needed are the membrane tension and the resulting 
area change, that is, tensile stress x and area change 
AA in order to calculate the Elastic Area Expansivity 
modulus Ka. As shown in Fig. 5, analyses of the 
pipette (Rp) and vesicle (Ry) radii give relationships 
for these tensile stress and area-strain parameters. 
Membrane tension is given by. 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 5 Dilation, (a) GUV (25 pm diameter) aspirated into the 
micropipette (8 pm diameter) ready for the expansion 
experiment (Note: The vesicle has a sucrose solution inside 
and a salt solution outside, creating a refractive index for better 
visualization.); (b) Application of intermediate suction pressure 
(~ 10,000 microatmospheres) producing an ~ 1.5% area change; 
(c) Further expansion, just before tensile failure at >20,000 
microatmospheres at a AL that corresponds to ~3% area change 


T = APRp/2(l - Rp/R,) (8) 

To measure the change in vesicle area, the pipette 
offers a unique sensitivity. With area changes of only 
a few percent before the vesicle membrane breaks 



Micropipette Manipulation of Lipid Bilayer Membranes, 
Video 2 Dilation 


under tension, the area change is linearized as changes 
in length, AL, of the membrane in the pipette, and is 
given by. 


AA = 27rRp(l-Rp/Rv)AL (9) 

The typical aspiration of a single 30-pm-diameter 
vesicle by an 8-pm-diameter pipette at suction pres¬ 
sures in the thousands of microatmospheres represents 
the application of forces in the hundreds of 
nanoNewtons. As shown in Fig. 5b, and Video 2, 
using (8) and (9), a suction pressure of 10,000 patm 
produces a tension of ^3 mN/m and a corresponding 
area change of ^1.5%. 

Increasing the suction pressure to ^20,000 
microatmospheres. Fig. 5c, brings the vesicle to the 
brink of tensile failure at ^ 6 mN/m and only ^3% 
area dilation. But note, as discussed by Evans and 
Ludwig, strengths of molecular anchoring and material 
cohesion in fluid-lipid membranes increase with rate of 
force and tension loading (as loading rate). Using 
a range of force rates, they pulled single 
molecules from membranes with force rates from 1 to 
10"^ pN s~* and ruptured giant membrane vesicles with 
tension rates from 10”^ to 10^ mN m“* s“^ (Evans and 
Ludwig 2000). At a loading rate of ^10”^ mN 
m“* s“\ the tensile strength for an SOPC membrane 
was measured to be ^10 mN/m, but at 100 times 
greater loading rate (10 mN m“' s“'), the tensile 
strength had doubled to 20 mN/m. They conclude 
that, “under dynamic loading, rupture events below 
the peak in the tension distribution are likely to be 
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governed by collective behavior, whereas beyond the 
peak events are likely to reflect microscopic 
peculiarities.” 

When carried out in small increments, the elastic 
area expansion modulus is obtained from the slope of 
this series of membrane tension plotted against the 
corresponding area strain. For this particular experi¬ 
ment and (relatively slow) rate of membrane loading, 
Ka is ^200 mN/m. Note that, as discussed by Evans 
and Rawicz, microscopic fluctuations persist at all 
levels of tension which leads to a slight 
“renormalization” of the direct elastic compliance 
(Evans and Rawicz 1990). 

When converted to an equivalent bulk modulus by 
dividing by the thickness of the membrane (4 nm) 
this expansivity becomes 50 x 10® N/m^ that is, 
50 MPa - somewhere between that of a liquid (bulk 
hydrocarbon = ^1 GPa) and a gas (Air = 0.1 MPa). 
Thus, the lipid bilayer membrane is incredibly soft and 
expandable. However, it can only be expanded to a few 
percent before it fails in tension. The incorporation of 
cholesterol is the single most effective way to increase 
this resistance to expansion. As shown by Needham 
and Nunn (1990), and reviewed by Kim and Needham 
(2001), when taken to the limits of composition, that is, 
50 mol% cholesterol in a bilayer composed of a long 
(diC20) saturated chain, phospholipid elastic moduli 
are now several thousands mN/m, and the tensile 
strength is 40 mN/m, equivalent to the compressibility 
of bulk hydrocarbons and polyethylene. 

Permeability to Water 

These strong and very tight cholesterol-rich lipid bila¬ 
yers also resist transport of water across them (Kim 
et al. 2003). In fact, the two properties of compliance 
and permeability are inversely proportional and inex¬ 
tricably linked, since they both originate in the strong 
bonding between all-trans lipid chains and the flat 
cholesterol rings, where, on average, the cholesterol 
ring structure is believed to associate with the first 
8-10 carbons of the phospholipid acyl chains. In addi¬ 
tion to the all-trans acyl chain-cholesterol ring interac¬ 
tions, Sphingomyelin-cholesterol interactions seem to 
be strengthened by additional hydrogen bonding in the 
head group region. As reviewed by Bloom et al. 
(1991) the result is that the highly incompressible 
bilayer of Bovine Sphingomyelin in a 1:1 ratio with 


cholesterol (Ka of 1,800 mN/m) shows a water perme¬ 
ability that is only 0.3 pm/s - some 230 times slower 
than that for liquid Lo( phase bilayers, such as DMPC 
(Ka =144 mN/m), that have a water permeability of 
69 pm/s. The “biologically average di chain 
(C18:l C18:0) lipid” SOPC (Ka - 200 mN/m) with 
a water permeability of 23 pm/s shows how a longer 
chain (Cl8 vs C14) with one double bond can retain 
a modest water permeability. However, highly unsat¬ 
urated lipids like diC18:3 have a modulus similar to 
SOPC (244 mN/m) but their permeability can still be 
relatively high at 144 pm/s. Olbrich et al. (2000) 
interpreted this in terms of free volume acquired in 
thermal expansion above the main gel-liquid crystal 
transition of a bilayer, indicating that major changes 
occur in chain packing and cohesive interactions when 
two or more cis-double bonds alternate with saturated 
bonds along a chain. 

Thermal Transitions 

As shown in Pig. 6, at the acyl chain melting phase 
transition a typical lipid bilayer undergoes a decrease 
in thickness of ^25% (from 54 A to 40 A) and an 
increase in area per molecule of ^60% (from 40 A^ 
to 65 A^). The bilayer also undergoes a volume 
change of about 4% (^1,170 A^/molecule-1,220 A^/ 
molecule for DPPC (Dipalmitoylphosphatidylcholine) 
(Tristram-Nagle and Nagle 2004)). 

This large change in vesicle area, therefore, needs to 
be supported or the vesicle membrane will simply bud off 
into (attached) daughter vesicles under the action of 
thermal motion. Under hyperosmotic conditions that are 
used to reduce vesicle volume, the pipette can support the 
membrane with low suction pressures, and hold the sin¬ 
gle vesicle as it undergoes freezing and then melting at its 
main crystallization phase transition. As shown in Pig. 7 
and Video 3, reducing the temperature of the bathing 
chamber from 18°C to 15°C for a CUV composed of 
a mixture of two phospholipids (SOPC:POPE 40:60 
(SOPC, Stearoyloleoylphosphatidylcholine; POPE, 
Palmitoyloleoylphosphatidylcholine)), the vesicle area 
change can be measured for this lipid mixture which is 
^26% normalized to the liquid phase vesicle area. Data 
for the whole series of SOPC:POPC mixtures is given 
in Kim and Needham (Kim and Needham 2001), and 
other mechano-calorimetry experiments on DMPC 
(Dimyristoylphosphatidylcholine) are in Needham and 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 6 Lipid hilayerphase transition, structural characteristics. 
At the melting transition, the lipid bilayer typically goes from 
a thickness and area per molecule of 54 A and 40 A^, to 40 A 
and 65 A^, respectively 


Evans (Needham et al. 1988). Here not only were the 
area changes measured at the DMPC transition but the 
structure and mechanical characteristics of the Pp’ ripple 
phase were also determined, showing just how versatile 
the micropipette technique can be. 


Molecular Exchange 

Using three micropipettes positioned around the 
microscope stage, one to hold the vesicle (right), and 
two to deliver test and bathing solutions (left), Need¬ 
ham et al. (1997) showed how a water-soluble 
lysolipid (MOPC Monooleoylphosphatidylcholine) 
would partition into a liquid lipid bilayer membrane 
(SOPC) and expand it. As shown in Fig. 8 and Video 4, 
this expansion (at constant holding suction pressure) is 
readily measured by the micropipette and vesicle 
geometry. For MOPC at its critical micelle concentra¬ 
tion of 1 pM, the initial maximum area change 
(Fig. 8c) was ^3% and this occurred in ^200 s. 
When the lower flow pipette is repositioned to again 
deliver the bathing solution the lysolipid is readily 
washed out as the surrounding solution is exchanged 
for a lysolipid-free media. 

When this change in area is converted to mol% 
lysolipid, from the known areas per molecule of 
MOPC (35 A^) and SOPC (67 A^), this experiment 
shows that lysolipid saturates the outer monolayer at 



Liquid La 
freezing 


Solid Lp 
remelting 


Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 7 Thermal transitions, (a) Vesicle held at 18.7°C, 
a temperature that is in its liquid L^, phase above its phase 
transition temperature, with enough excess membrane area to 
accommodate the reduction in area per molecule as it freezes; 
(b) Same vesicle now at 15.3°C in its solid Lp phase, ready to be 
reheated and undergo the reversible melting at the transition 


^6 mol% MOPC. At higher lysolipid solution concen¬ 
trations the membrane rapidly takes up the lysolipid 
and fails at ^16 mol% MOPC largely accumulated in 
the outer monolayer of the bilayer (Needham et al. 
1997). In similar vesicle micropipette experiments, 
Evans studied and developed a thermodynamic theory 
for partitioning of bile acid into the bilayer predicated 
on entropy of mixing and the activity contributed by 
mechanical tension (Evans et al. 1994). With this more 
complex ionic surfactant he found that energetic bar¬ 
riers to uptake can arise from electrical charge on the 
acid group and restriction of bile acid movement 
across the bilayer. However, increase in vesicle area 
correlated well with predictions for ideal thermody¬ 
namic mixing of glycocholate into both layers of the 
SOPC lipid bilayer. Carried out under controlled ten¬ 
sion, the micropipette measurements of area change 
also yielded several important molecular-scale proper¬ 
ties as follows: the area per molecular complex in the 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Video 3 Thermal transitions 



bilayer; the ratio of aggregate size in the bilayer to that 
in aqueous solution; the small degree of ionization at 
the surface; and an estimate of molecular charge 
(Evans et al. 1994). 

Yield Shear and Shear Viscosity (Solid Phase) 


Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 8 Molecular exchange, (a) Starting position: Vesicle 
exposed to bathing solution from lower flow pipette; (b) Vesicle 
exposed to a solution of lysolipid from the upper flow pipette 
creating a change in vesicle membrane area, measured from the 
change in projection length in the micropipette; (c) Maximum 
area change under lysolipid flow (1 pM); (d) Re-establish bath¬ 
ing solution flow and wash out the lysolipid from the membrane, 
again measured as a reduction in projection length 


By aspirating a solid phase vesicle with excess 
membrane area (over a sphere of the same volume) 
into the micropipette the solid membrane can be 
formed into a solid replica of the pipette aspirated 
geometry. When this replica is ejected and re-aspirated 
under low holding pipette suction pressure, the vesicle 
is now in the starting position for a yield shear and 
shear viscosity experiment. As shown in Fig. 9 and 
Video 5, the suction pressure necessary to achieve 
a length of membrane in pipette equal to 1 Rp is the 


point at which the membrane yields. Below this point, 
the membrane behaves elastically. 

Once this suction pressure has been determined, 
a step increase in suction pressure, to say 6x the 
yield pressure, is applied and the membrane is seen to 
flow into the micropipette at a steady rate, exhibiting 
positive and negative shear as depicted in Fig. 9b. The 
“nose” of the vesicle and the projection in the micro¬ 
pipette undergo simple plug flow. This excess pressure 
is maintained until the solid vesicle membrane is 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Video 4 Molecular exchange. (Top) Flow of lysolipid solution; 
(Bottom) Flow of bathing solution 


completely re-aspirated into micropipette (Fig. 9d). 
This experiment was carried out on solid phase 
DMPC bilayers (Evans and Needham 1987). Results 
show that solid phase lipids display Bingham plastic 
material behavior and are highly viscous in shear. As 
was stated at the time (Evans and Needham 1987), 
‘"yield shear and shear viscosity reflect density and 
mobility of crystal defects (grain boundaries and!or 
intra-grain dislocations), in the solid bilayer 
membranes.” In its Pp’ phase, yield shear values for 
DMPC go from 0.001 to 0.003 mN/m with reduction in 
temperature just below its phase transition temperature 
(20-13°C). Upon going through its next solid-solid 
phase transition, to Lp phase there is a corresponding 
increase in the yield shear by a factor of 10 to 0.036 
mN/m. Similarly, the surface shear viscosity in flow 
increases from, again, close to zero in the liquid phase 
(liquid bilayers have a viscosity similar to 


a 

Yield shear at 
length in pipet 
equal to 1 Rp 


b 

Application of 
suction pressure at 
6x excess of yield 
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Maintaining 
excess suction 
pressure 
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Complete re¬ 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 9 Yield shear and shear viscosity of a solid DMPC hilayer. 
(a) Yield shear at length in pipette equal to 1 Rp; (b) Application 
of suction pressure at 6x excess of yield causes membrane to 
flow; (c) Maintaining excess suction pressure at 6x yield 
pressure; (d) Complete re-aspiration of solid vesicle membrane 
into micropipette 


a 2-Poise oil) covering 0.003-0.026 mNs/m over the 
same temperature range. These surface shears repre¬ 
sent equivalent bulk yield shears from 250 to 
750 N/m^, (0.25-0.75 MPa) reaching 9,000 N/m^ 
(9 MPa) in the Lp phase, and the surface shear viscosity 
ranges from 750 to 6,500 Ns/m^ (0.75-6 kPa.s). These 
values are comparable to yields for polypropylene of 
^20-80 MPa, and plastic viscosities of 6 kPa.s at 
relatively low shear rates. Once again this hydrocarbon 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Video 5 Yield shear and shear viscosity of a solid hilayer 


membrane, despite being only 2-molecule thick, 
behaves much like common hydrocarbon materials. 


Vesicle Pairs: Vesicle-Vesicle Adhesion 

By manipulating two GUVs into initial contact and 
then allowing one to spread on the other, the micro¬ 
pipette technique allows one to measure 
a fundamental energy of nature, the van der Waals 
attraction energy. In this experiment, shown for two 
SOPC vesicles in Fig. 10 and Video 6, the right hand 
vesicle is held under high constant suction pressure to 
create a spherical well-defined adhesion surface. The 
adherent (left hand) vesicle is brought into initial 
contact and the starting position is one in which the 
relatively high suction pressure in the left hand 
pipette induces a membrane tension that just balances 
the spreading tension in the left hand vesicle mem¬ 
brane. This tension is given by (8) above. The extent 
of adhesion is controlled via tension in left hand 
adherent vesicle membrane, and so reductions in ten¬ 
sion allow the vesicle to spread in discrete steps until 
completely spread yet still held at 1 Rp length in the 
left hand pipette. This equilibrium spreading energy 
is simply given by the Young Dupre equation, as the 
work of adhesion Wa, dependent on the membrane 
tension x and the contact angle 0 between the vesicles 
at spread contact (Evans 1980), 


( 10 ) 


a 

Starting Position 


b 

Allowed to spread 


c 

Adherent vesicles 


d 

Evans’ original 
analysis of spread 
contact 



Micropipette Manipulation of Lipid Bilayer Membranes, 
Fig. 10 Vesicle-Vesicle adhesion, (a) Starting position where 
membrane tension just balances the spreading tension in the left 
hand vesicle membrane; (b) Vesicle allowed to spread by reduc¬ 
ing membrane tension; (c) Vesicles in equilibrium spread adher¬ 
ent contact; (d) Sketch of Evans’ original analysis of spread 
contact 


In this one experiment, lipid bilayer vesicles have 
provided unique and versatile systems for the study 
and evaluation of a range of colloid and surface 
phenomena at and between surfaces (Evans and Need¬ 
ham 1987, 1988), including the following: van der 
Waals attraction limited by hydration (+ thermal) 
repulsion for neutral bilayers; the classic DLVO inter¬ 
action of van der Waals attraction limited by electro¬ 
static repulsion for charged bilayers; depletion 
flocculation caused by non-adsorbing polymers (like 
PEG (Poly(ethyleneglycol)) and Dextrans) in the bath¬ 
ing solution, limited again by hydration repulsion 


Wa = t(1 — COS0) 
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Micropipette Manipulation of Lipid Bilayer Membranes, 
Video 6 Vesicle-vesicle adhesion 


(Evans and Needham 1988); depletion flocculation 
that can overcome electrostatic repulsion; steric repul¬ 
sion offered by grafted PEG polymer at the vesicle 
surfaces that oppose the inherent weak van der Waals 
attraction; the delicate balance between depletion floc¬ 
culation of PEG in solution and the steric repulsion of 
the same molecule grafted to the bilayer surfaces 
(Evans et al. 1996); and Anally the breakdown of 
membranes in contact (van der Waals vs hydration) 
producing membrane fusion. 

Adhesion tests for bilayers composed of neutral PCs 
(egg PC, DMPC, SOPC) in the Lq, state yield comparable 
values (0.01-0.015 erg/cm^) of free energy reduction per 
unit area of contact (Evans and Needham 1987). Vesi¬ 
cles made from the less-hydrated phospholipid POPE 
(Palmitoyloleoylphosphatidylethanolamine) exhibit 
much stronger adhesion with energies that are an order 
of magnitude greater (0.12-0.15 erg/cm^). Even larger 
free energy potentials (0.22 erg/cm^) are measured for 
adhesion with DGDG (Digalactosyldiglyceride) (sugar- 
lipid) bilayers. 

Regarding attraction between lipid bilayer mem¬ 
branes in concentrated solutions of non-adsorbing 
polymers, Evans and Needham provided experimental 
data and compared mean-field theory with measure¬ 
ments of adhesion energy using the micropipette tech¬ 
nique (Evans and Needham 1988; Evans 1989). They 
made direct measurements of the free energy potential 
for adhesion of synthetic lipid bilayers in aqueous 
(0.1 M NaCl) solutions of dextran (polyglucose) over 
a wide range of volume fraction (0.01-0.1) and 


molecular weight (10,000-150,000). Well modeled 
by a Flory interaction parameter of 0.43 (characteristic 
of a “good” solvent), these polymer solutions induced 
adhesion energy for neutral (phosphatidylcholine) 
bilayers, which was found to steadily increase 
(from 0.01 to >0.2 erg/cm^) with polymer volume 
fraction - without any indication of a plateau - and 
with little dependence of polymer size. For equilibrium 
exchange of the polymer between gap and bulk 
regions, the attractive stress on the membrane surfaces 
is simply the osmotic pressure at the midpoint of the 
gap relative to the bulk region. Furthermore, incorpo¬ 
ration of electric (phosphatidylserine) charges in the 
lipid bilayer surfaces stabilized adhesion at larger 
membrane separations, and the presence of polymer 
was able to overcome electrostatic repulsions that 
would normally separate membranes due to DLVO 
theory, that is, when repulsion is only balanced by 
van der Waals attraction. 

Finally, for some lipid systems, like the ones 
containing large amounts of phosphatidylethanol- 
amine (as POPE), contact can induce membrane- 
membrane fusion. In these unique experiments 
(Fig. 11 and Video 7) of vesicles in very close 
interbilayer contact, it appears that the high adhesion 
energy (because of the close apposition due to a much 
shorter-range hydration repulsion of the PE head- 
groups) and the tendency for POPE to form an inverted 
phase help to initiate intramembrane and 
intermembrane instability. In fact in at least one of 
these experiments (Fig. lid), the vesicles were 
touched, separated, and returned to adherent contact, 
inducing a small tether that was pulled at the point of 
contact. Then under an applied tension, the vesicles 
fused. 

Micropipette Manipulation of Other Particles 

The micropipette technique has then allowed many 
properties and phenomena to be investigated in-depth 
for both cells and for GUVs. It is now being adapted 
further to manipulate a range of microparticles as gases 
(gas bubbles in water (Duncan and Needham 2004)), 
liquids (oil in water and water in oil emulsions 
(Duncan and Needham 2006; Su et al. 2010)), and 
solids (crystallized, glassified, and precipitated mate¬ 
rials like salts, proteins, peptides, and polymers 
(Rickard et al. 2010)). These single particle science 
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Micropipette Manipulation 
of Lipid Bilayer 
Membranes, 

Fig. 11 Vesicle-Vesicle 
FUSION Four examples of 
vesicles maneuvered into 
contact resulting in fusion of 
their membranes into a single 
vesicle in (a), (b), and (d) or 
the formation of an internal 
vesicle as in (c) 
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and engineering experimental techniques have started 
to open up additional fields of surface and colloid 
science and their study for applications particularly in 
drug delivery. 

Summary 

To summarize, the micropipette manipulation tech¬ 
nique provides a unique ability to characterize the 
deformation of single cells (erythrocytes, leukocytes, 
and cancer cells) and giant unilamellar vesicles, in 
terms of well-defined material constants, including 
the elastic area expansivity and bending moduli, and 
elastic yield and viscosity in shear for solid phase 
membranes. Results show that lipid bilayers are 
extremely soft in bending and dilation, but that with 
the inclusion of cholesterol along with fully hydroge¬ 
nated acyl chains for the host lipid, the elasticity and 
strength approach values equivalent to bulk material 
properties of 10^ N/m^ (1 GPa), and ^10^ N/m^ 


(10 MPa) respectively, that is, comparable to the 
Young’s modulus of polyethylene. As a simple liquid 
phase lipid bilayer, such as egg PC, expansion elastic 
moduli are some ^40 x smaller, equivalent to a bulk 
modulus somewhere between that of a bulk liquid and 
a gas. This astonishing range of values (of 40 x) for 
a single material property, especially for a structure 
that is only 2 molecules thick, rivals the only modest 
elastic ranges for crystalline metals from lead (16 Gpa) 
to tungsten (411 GPa) of 25x; the range for ceramics 
from ice (1.2 GPa) to diamond (1,220 GPa) is l,000x. 
For solid phase membranes, the soft mechanics of the 
Pp’ ripple phase of DMPC was also revealed, and shear 
rigidity and shear viscosity of the solid phase mem¬ 
branes, equivalent to common hydrocarbon polymers 
just below their melt temperature, have also been mea¬ 
sured. Mechanical strength and elasticity also correlate 
with permeability to water (high modulus-low perme¬ 
ability), although slight exceptions have been shown 
for lipids with multiple double bonds in their acyl 
chains. 

























Micropipette Manipulation of Lipid Bilayer Membranes 


1537 


M 



Micropipette Manipulation of Lipid Bilayer Membranes, 
Video 7 Vesicle-Vesicle FUSION 


Beyond strength measurements, the micropipette 
technique has allowed studies of thermal transitions 
measuring area changes of 25-30% compared to the 
liquid phase that correspond exactly to the changes in 
area per molecule when the bilayer freezes. Using flow 
pipettes directed at a single vesicle, molecular 
exchange with small molecules like lysolipids and 
bile acids has shown the membrane’s propensity to 
take up these molecules from surrounding solution 
and create a corresponding membrane area change of 
^6% as the molecule (lysolipid) becomes incorporated 
in the bilayer. If incubation is continued, MOPC will 
cross the bilayer with a half life of tens of minutes, one 
of the fastest translocations measured for a lipid. 
Returning the lysolipid-free bathing medium rapidly 
washes this soluble surfactant out of the bilayer. 

When manipulated in pairs, the work of adhesion 
between two lipid vesicles has been measured to be 
0.01 ergs/cm^, when van der Waals attraction is limited 
by a very short-range hydration repulsion and the 
variable-range power law of electrostatic repulsion; steric 
repulsive barriers due to the presence of bound aqueous 
polymers like poly (ethyleneglycol); depletion floccula¬ 
tion by the same molecules when free in surrounding 
solution; and they are subject to intimate mixing when 
manipulated into contacts that reduce the hydration bar¬ 
rier and allow membrane-membrane fusion. 


M 
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Mitochondrial Electron Transport 
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Synonyms 

Mitochondrial respiratory chain 

Definition 

The redox components localized in the mitochondrial 
inner membrane which couple the oxidation of meta¬ 
bolic substrates such as reduced ► nicotinamide ade¬ 
nine dinucleotide (NADH) and reduced ► flavins (e.g., 
FADH 2 ) to the reduction of molecular oxygen to water, 
while conserving the released redox-linked energy into 
vectorial translocation of protons to establish 
a transmembrane protonmotive force for subsequent 
ATP synthesis. 

Introduction 

Mitochondria are subcellular organelles in eukaryotic 
cells which use most of the cell’s oxygen (95%) to 
provide chemical energy for the cell by the synthesis of 
adenosine triphosphate (ATP) (Tzagoloff 1982; 
Scheffler 2008). Reduced metabolites such as FADH 2 
and NADH are generated in mitochondria by oxidative 
metabolism and are then oxidized aerobically by the 


mitochondrial electron transport (also called) respira¬ 
tory chain. The respiratory chain transduces the energy 
released by these redox reactions into chemical energy 
by vectorial translocation of protons to establish an 
electrochemical gradient for subsequent ATP synthe¬ 
sis, as well as other processes such as the accumulation 
of calcium ions (Hatefi 1985; Saraste 1999; Rich and 
Marechal 2010). 

The Mitochondrion 

Mitochondria have two membranes, are often ellipsoid 
in shape, and are approximately one micron in length 
(Frey and Mannella 2000) (Fig. 1). The outer mem¬ 
brane is semi-permeable to substrates and molecules 
used in various metabolic functions. It also contains 
a voltage dependent anion channel (VDAC) and 
a portion of the protein import machinery (TOM com¬ 
plex) (Nicholls and Ferguson 2002). The highly imper¬ 
meable inner membrane, depicted in Fig. 1 as an 
extensively baffled structure, is composed of inner 
membrane boundary and cristae membrane regions. 
The inner membrane boundary contains proteins 
involved in protein import, translocation, and mito¬ 
chondrial fusion. The cristae membrane houses the 
electron transport chain and ► ATP synthase, as well 
as metabolite carrier proteins such as the adenine 
nucleotide exchange and substrate transporters. The 
inner membrane encloses the matrix with an inter¬ 
membrane space located between the inner and outer 
membranes. The matrix is the physical location of 
enzymes involved in the citric acid cycle, (l-oxidation 
of fatty acids, and amino acid oxidation. These meta¬ 
bolic pathways generate large amounts of reducing 
equivalents in the form of NADH and FADH 2 which 
need to be oxidized in order for cell metabolism to 
continue. The matrix also contains multiple copies of 
closed circular DNA of about 16.6 kb in size which 
encodes 37 genes; 13 for polypeptides that are subunits 
in the protein complexes of the mitochondrial electron 
transport chain and the ATP synthase (Wallace 2005). 

Historically, the major thrust of mitochondrial 
research has focused on its role in metabolism, 
although the organelle now is known to be involved 
in a diverse array of functions. The electrochemical 
potential across the mitochondrial inner membrane 
which arises from electron transfer has implications 
for additional mitochondrial functions, including cal¬ 
cium signaling and apoptosis (Wallace 2005; Orrenius 
et al. 2007). 
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Mitochondrial Electron Transport, Fig. 1 Schematic dia¬ 
gram of the mitochondrion 


The Mitochondrial Electron Transfer Chain 

Electrons released from NADH and FADH 2 during 
oxidation by the mitochondrial respiratory chain 
follow their electrochemical potential through multiple 
oxidation-reduction reactions contained in large multi¬ 
subunit integral membrane protein complexes and in the 
terminal reaction, and reduce molecular oxygen to 
water (Fig. 2) (Hatefi 1985; Saraste 1999; Rich and 
Marechal 2010). These protein complexes contain pros¬ 
thetic groups such as non-heme iron sulfur centers, 
► flavin nucleotides, hemes, and copper ions which act 
as redox centers. The energy released during the redox 
reactions is conserved by the vectorial translocation of 
protons which establishes an electrochemical gradient 
across the mitochondrial inner membrane. The ATP 
synthase utilizes the pH gradient generated by the 
redox reactions to produce ATP. Aerobic bacteria gen¬ 
erate their electrochemical gradient by a similar mech¬ 
anism, except that the cell membrane is the locus of the 
respiratory chain, and the electron transfer complexes 
generally have a simpler polypeptide composition 
(Trumpower and Gennis 1994). 

► Complex 1 and ► Complex 11 oxidize NADH and 
FADH 2 respectively and transfer the electrons to the 
membrane-bound, mobile electron carrier ubiquinone 
(Q). Complex I also conserves the energy of the oxi¬ 
dation reaction by pumping protons across the inner 
membrane (Brandt 2006). The reduced ubiquinol is 
oxidized by ► Complex III using the Q-cycle mecha¬ 
nism resulting in the net reduction of cytochrome c and 
proton translocation into the inter membrane space 
(Tmmpower and Gennis 1994; Crofts 2004). Cyto¬ 
chrome c, a small water soluble heme protein, shuttles 
electrons from Complex III (where it is reduced) to 


► Complex IV (where it is oxidized). Complex IV uses 
the electrons from reduced cytochrome c molecules 
and consumes protons from the matrix to reduce oxy¬ 
gen to water while simultaneously pumping protons 
across the mitochondrial inner membrane (Hosier et al. 
2006). The electrochemical gradient resulting from 
proton pumping activities of Complexes 1, III, and IV 
is used by Complex V to drive the synthesis of ATP 
from ADP and inorganic phosphate (von Ballmoos 
et al. 2009). 

Mitochondrial Respiratory Chain Complexes 

Complexes I-V comprise about 50% of the membrane 
protein within the inner membrane of the mitochon¬ 
dria. The remaining proteins are substrate transporters 
and other metabolic enzymes which interact with the 
respiratory chain (e.g., fatty acyl CoA dehydrogenase 
and a-glycerol phosphate dehydrogenase). Hatefi 
(1976) was the first to solubilize the mitochondrial 
inner membrane using a non-denaturing detergent, 
deoxycholate, and salt precipitation to purify all five 
complexes involved with electron transfer and oxida¬ 
tive phosphorylation. Subsequent work in the field has 
focused on understanding the molecular mechanism of 
electron transport and proton-pumping in isolated 
respiratory chain complexes reconstituted into lipo¬ 
somes. The three dimensional structures of all of respi¬ 
ratory chain complexes from either mitochondria or 
bacteria are all known from X-ray crystallographic 
studies. (See protein database: http://www.rcsb.org/ 
pdb/home/home. do) . 

Complex I, or NADH: Ubiquinone oxidoreductase 
consists of 14 core subunits (7 synthesized from mito¬ 
chondrial DNA) and up to 31 accessory subunits 
arranged into an F-shape, with a soluble domain 
extending into the mitochondrial matrix (Brandt 
2006; Hirst 2009). The soluble domain contains the 
catalytic site for NADH oxidation, as well as a flavin 
mononucleotide (FMN) and a series of seven iron- 
sulfur centers for transferring the electrons from 
NADH to the integral membrane protein domain 
where ubiquinone (coenzyme Q) reduction and proton 
translocation occur. The net result is that for every two 
electrons extracted from NADH and donated to ubi¬ 
quinone, four protons are translocated from the matrix 
to the intermembrane space. 

Succinate: Ubiquinone oxidoreductase, or 

► Complex II, contains four subunits, and catalyzes 
the oxidation of succinate to fumurate in the citric acid 
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Mitochondrial Electron 
Transport, Fig. 2 Schematic 
diagram of the mitochondrial 
electron transport chain 
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cycle (Cecchini 2003). Upon oxidation of succinate, 
► Complex 11 abstracts two electrons and two protons 
from succinate and transfers them to covalently 
bound FAD, forming FADH 2 . The electrons are then 
transferred to ubiquinone via the enzyme’s iron-sulfur 
clusters, while the protons are released back into the 
matrix. There is also a non-redox-active cytochrome h 
in ► Complex II. 

► Complex III or ► cytochrome hci or ubiquinol: 
Cytochrome c reductase catalyzes both the oxidation 
of ubiquinol and the reduction of cytochrome c, cou¬ 
pling the reaction to proton translocation across the 
inner membrane (Trumpower and Gennis 1994; Crofts 
2004). ► Complex III contains 13 subunits and 
a dihemic h cytochrome molecule (one high potential 
heme h, the other a low potential form), a cytochrome 
cI molecule, and a Rieske iron-sulfur protein, which all 
act as redox centers. Electrons from ubiquinol are 
transferred using a bifurcated electron transfer path¬ 
way with multiple catalytic sites and separate proton 
and electron transfer pathways (Trumpower and 
Gennis 1994; Crofts 2004). The reaction results in 
two individual single-electron transfers to two cyto¬ 
chrome Cl molecules through the Rieske iron sulfur 
protein, the uptake of two protons from the matrix, and 
the release of four protons into the inner membrane 
space using the Q-cycle mechanism (Crofts 2004). 
Cytochrome h is synthesized from mitochondrial 
DNA (Wallace 2005). 


Reduced cytochrome c is oxidized by ► Complex 
IV, ► cytochrome c oxidase also called the cyto¬ 
chrome c: oxygen oxidoreductase in a reaction 
that reduces oxygen to two molecules of water 
(Trumpower and Gennis 1994; Hosier et al. 2006). 
Electrons are transferred from cytochrome c to 
a diatomic copper, Cua, and then to a non-covalently 
attached heme a. The electron then is transferred to the 
binuclear center (oxygen-binding site) which contains 
heme <33 in close proximity to another copper atom 
(Cub). Oxygen binds at an open coordination site in 
heme < 33 ; Cub and heme as act cooperatively to use four 
electrons from cytochrome c to reduce oxygen into two 
molecules of water. One proton/electron is taken up 
from the matrix during the chemical reduction of water 
and an additional proton/electron is pumped across the 
inner membrane. The complex has 13 subunits of 
which three are encoded by mitochondrial DNA. 
The net result of the electron transport activity in 
mitochondria is that for every two electrons transferred 
from NADH to V 2 O 2 , there are ten protons pumped 
across the inner mitochondrial membrane, in contrast 
to six pumped protons from succinate to V 2 O 2 
(Saraste 1999; Rich and Marechal 2010). 

► Complex V is ATP synthase, which utilizes the 
electrochemical proton gradient generated by Com¬ 
plexes 1, III, and IV to drive the phosphorylation of 
ADP. ATP synthase is composed of a water soluble Fj 
protein complex, which contains the ADP and ATP 
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binding sites, and a membrane-bound Fq protein com¬ 
plex, joined together by a central and peripheral stalk 
(von Ballmoos et al. 2009). The majority of a cell’s 
ATP is synthesized by this complex. Thus, the mito¬ 
chondrial electron transport chain is of central impor¬ 
tance to virtually every biochemical pathway and 
process of life. 

Determination of the Oxidation-Reduction 
Components of the Mitochondrial Electron 
Transfer Chain 

All of the respiratory chain electron transfer carriers in 
isolated mitochondria were found to meet the follow¬ 
ing criteria; (1) they were readily oxidized or reduced 
by mitochondrial substrates such as oxygen, NADH, or 
succinate; (2) their kinetics of oxidation-reduction fit 
within the timeframe of mitochondrial electron trans¬ 
fer kinetics using substrates such as NADH or oxygen; 
(3) their content in the mitochondrial membrane was in 
close stoichiometry with other known carriers in the 
respiratory chain (although ubiquinone is in excess) 
(Nicholls and Ferguson 2002; Rich and Marechal 
2010 ). 

As most of the ► electron transfer proteins have 
strong chromophores which shift their absorbance 
peaks when oxidized or reduced, ultraviolet (ubiqui¬ 
none) and visible absorbance (► flavins and ► cyto¬ 
chromes) spectroscopy was used to identify and 
quantitate each electron carrier in the mitochondrial 
inner membrane. Dual wavelength absorbance spec¬ 
troscopy, developed by Chance (1991), monitored 
small absorbance changes in these chromophores in 
suspensions of mitochondria with high background 
absorbance (Nicholls and Ferguson 2002). This tech¬ 
nique was used to determine the kinetics and stoichi¬ 
ometry of the carriers in the respiratory chain. Two 
different experimental approaches were used: 

(1) NADH-linked substrates or succinate were added 
to mitochondria in the presence of oxygen and the 
oxidation-reduction kinetics of the carriers measured; 

(2) Limited oxygenated buffer was added to anaerobic 
mitochondria and the oxidation-reduction kinetics of 
the same carrier were measured. The rate constants 
allowed the electron transfer carriers to be placed in 
order from the reducing side (NADH or succinate) to 
the oxidizing side (oxygen) of the respiratory chain. 
Later, this was verified by rapid kinefics using 
a stopped flow, rapid mixing absorbance spectroscopy. 
Electron spin resonance spectroscopy was used to 


determine the location of the non-heme iron-sulfur 
proteins in the electron transfer chain. 

Also, equilibrium ► redox potential titrations of 
each carrier’s position in the chain were performed 
using potentiometric methods (Dutton and Wilson 
1974). Mid-point reduction potentials were assigned 
to each carrier from low potential (NADH) to high 
potential (oxygen) and the midpoint potentials corre¬ 
lated well with the proposed pathway of electrons in 
the chain. Electron transfer inhibitors, which blocked 
electron flow at specific sifes in the respiratory chain, 
were also used to monitor carrier position in the respi¬ 
ratory chain. Rotenone and piericidin for ► Complex I, 
malonate for ► Complex II, antimycin, stigmatellin, 
and myxothiazol for ► Complex III, and cyanide and 
carbon monoxide for ► Complex IV were all used to 
inhibit electron transfer at specific sites and allowed 
investigators to hnalize the route of electrons in the 
respiratory chain (Nicholls and Eerguson 2002). 

Interactions of Cellular Metabolic Pathways with 
the Mitochondrial Electron Transfer Chain 

FADH 2 and NADH are sources of electrons for the 
mitochondrial electron transfer chain. NADH is gen¬ 
erated in the citric acid cycle by isocitrate, 
a-ketoglutarate, and malate dehydrogenases in the 
mitochondrial matrix. Other enzymes which generate 
NADH in the matrix are pyruvate dehydrogenase, 
L-3-hydroxy lacy 1 Co A dehydrogenase in fatty acid 
oxidation, and glutamate dehydrogenase in amino 
acid oxidation. It is relevant to note that NAD(H) is 
impermeable to the mitochondrial inner membrane and 
that cytosolic dehydrogenase enzymes, for example, 
glyceraldehyde 3-phosphate and lactate dehydroge¬ 
nases must use indirect mechanisms such as the 
a-glycerol phosphate dehydrogenase and malate- 
aspartate shuttles to move reducing equivalents into 
the mitochondrial matrix for oxidation (Rich and 
Marechal 2010). In addition, EADH 2 generated by 

► succinate dehydrogenase in the citric acid cycle, by 
a-glycerol phosphate dehydrogenase and by the 

► electron transfer flavoprotein which interacts with 
acyl CoA dehydrogenase in fatty acid oxidation, is 
oxidized by the respiratory chain with each enzyme 
having its own ubiquinone-binding site. In addition, 
dihydroorotate dehydrogenase in pyrimidine biosyn¬ 
thesis produces EMNH 2 which also donates electrons 
to ubiquinone. The oxidation of NADH and EADH 2 
by the mitochondrial respiratory chain yields about 
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ten-fold more ATP than metabolism with in the 
absence of oxygen or low oxygen. 

Mitochondrial Electron Transport as a Source of 
Reactive Oxygen Species 

► Reactive oxygen species can be linked to atheroscle¬ 
rosis, chronic inflammation, ischemia/reperfusion 
injury, and neurodegenerative disorders. The mitochon¬ 
drial electron transport chain can produce superoxide 
radical (O 2 ’ ^ from oxygen in a single electron reduc¬ 
tion. Superoxide as a free radical can react with lipids to 
destroy membrane integrity, with proteins to denature 
them, or with mitochondrial DNA to create mutations in 
respiratory chain proteins (Wallace 2005; Orrenius et al. 
2007; Murphy 2009). Hydrogen peroxide can be pro¬ 
duced from superoxide either chemically or enzymati¬ 
cally using ► superoxide dismutase and in the presence 
of metal ions; toxic hydroxyl radicals can be formed. 
Non-heme iron centers and mitochondrial DNA are 
both labile to hydroxyl radicals (Wallace 2005). 

The primary sites for superoxide formation in the 
respiratory chain are Complexes I and III (Brandt 
2006; Orrenius et al. 2007; Murphy 2009). In Complex 
III, the amount of superoxide produced in normal 
respiring mitochondrial varies from 0.1^.0% of the 
electron transfer rate, with increases in superoxide 
production correlating with an increase in membrane 
potential across the inner membrane. In ► Complex I, 
superoxide production is stimulated in the presence of 
succinate and in the absence of ► Complex I sub¬ 
strates, suggesting that reverse electron flow creates 
superoxide. The locus of generation in ► Complex I is 
at an iron-sulfur protein near the ubiquinone-binding 
site (Murphy 2009). There is also indication that when 
electron transfer is inhibited in >- Complex I, the ► fla¬ 
vin in FMN can produce superoxide. Thus, reactive 
oxygen species will increase ► oxidative stress in cells 
and the net result is necrotic or apoptotic cell death and 
tissue degeneration. 

Summary 

The mitochondrial respiratory chain oxidizes reduced 
substrates generated by catabolic pathways in eukary¬ 
otic cells using molecular oxygen as the terminal elec¬ 
tron acceptor. The electron carriers sequentially 
catalyze a series of redox reactions through multiple 
electron transfer centers in multi-subunit protein 


complexes. In Complexes I, III, and IV, the energy 
released by the electron transfer reactions is conserved 
by unidirectional pumping of protons across the mito¬ 
chondrial inner membrane. This electrochemical gra¬ 
dient is utilized to synthesize most of cell’s ATP. As 
the mitochondrial matrix is the center for cell’s oxida¬ 
tive metabolism, mitochondrial electron transfer is 
central to optimal cell functioning. 
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Mitochondrial Transport Protein Family 

Edmund R. S. Kunji 

Mitochondrial Biology Unit, The Medical Research 
Council, Cambridge, UK 


Synonyms 

AACl, AAC2, AAC3, AAC4 mitochondrial ADP/ 
ATP carriers; AGCl, AGC2 mitochondrial aspartate/ 
glutamate carriers; APCl, APC2, APC3 mitochondrial 
Mg-ATP/Pi carriers; CoAT mitochondrial CoA trans¬ 
porter; CRC mitochondrial carnitine-acylcamitine 
carrier; DIC mitochondrial dicarboxylate carrier; 
GCl, GC2 mitochondrial glutamate carriers; ODC 
mitochondrial oxodicarboxylate carrier; OGC 


mitochondrial oxoglutarate carrier; ORCl, ORC2 
mitochondrial ornithine carriers; PIC mitochondrial 
phosphate carrier; SAMT mitochondrial 5-adenosyl- 
methionine transporter; TPC mitochondrial thiamine 
carrier; UCPl, UCP2, UCP3 uncoupling proteins 

Definition 

The vast majority of mitochondrial transport proteins, 
also called transporters or carriers, reside in the inner 
membrane of mitochondria (Fig. 1). They link the 
biochemical pathways of the cytosol and the mitochon¬ 
drial matrix by transporting metabolites, nucleotides, 
inorganic ions, and vitamins in and out of the mito¬ 
chondrion. Members of the mitochondrial transporter 
family have been assigned to human solute carrier 
family SLC25 (Saier Transporter Classification Data¬ 
base), PROSITE entry PDOC00189, and PFAM entry 
PF00153. The transporters are encoded by the nuclear 
genome and have been found in most eukaryotes with 
the exception of giardia and microsporidia. The mito¬ 
chondrial transport proteins are strictly eukaryotic in 
origin, but a few genes have been found on bacterial 
and viral genomes, which most likely occurred through 
horizontal gene transfer. Mitochondrial transport pro¬ 
teins are also found in organelles that are mitochon¬ 
drial remnants, such as hydrogenosomes and 
mitosomes, but a few have been found in unrelated 
organelles, such as peroxisomes, amyloplasts, and 
thylakoids. 

Introduction 

Mitochondria are involved in a large number of critical 
functions for the eukaryotic cell. They are best known 
for their involvement in the synthesis of ATP from the 
oxidation of breakdown products of sugars and fats via 
a series of energy conversion steps. However, mito¬ 
chondria are also central in the degradation, deamina¬ 
tion and conversion of amino acids, and the synthesis 
of iron-sulfur clusters and heme. Mitochondria also 
carry out macromolecular synthesis of mitochondrial 
DNA, RNA, and proteins. All of these pathways 
require the transport of small molecules across the 
impermeable mitochondrial inner membrane, includ¬ 
ing keto acids, amino acids, fatty acids, nucleotides, 
cofactors (vitamins), and inorganic ions. Members of 
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Mitochondrial Transport Protein Family, Fig. 1 Schematic 
representation of mitochondrial transporters in inner membrane 
of the mammalian mitochondrion. The outer and inner mem¬ 
branes are indicated with dashed and continuous lines, respec¬ 
tively. Identified mitochondrial carriers (Agrimi et al. 2004; 
Fiermonte et al. 2004; Palmieri 2004; Floyd et al. 2007) are 
shown in yellow. Not shown are other isoforms of the ADP/ 
ATP carrier (AAC2, AAC3, AAC4), Mg-ATP/Pi carrier (APC2, 
APC3), glutamine earner (GCl), aspartate glutamate carrier 
(AGC2), ornithine carrier (ORC2), and uncoupling proteins 
(UCP2, UCP3, UCP4, UCP5), although they might have 


a different function altogether. Not shown are mitoferrin (Shaw 
et al. 2006), which might transport iron ions, and the disputed 
FAD (Spaan et al. 2005) or folate carrier (Titus and Moran 
2000), which have not been characterized with isolated proteins. 
Also missing is MTCH2, which is involved in apoptosis and is 
likely to be in the outer membrane rather than the inner mem¬ 
brane (Grinberg et al. 2005). The light and dark blue arrows 
indicate the import and export steps over the inner mitochondrial 
membrane, respectively. Some transport steps occur in symport 
with protons (PIC, AGC1, ODC, ORC1, GC1, CIC) or as uniport 
with protons (UCPl) 


the mitochondrial transporter family carry out the vast 
majority of these transport steps. 

Role of Mitochondrial Transport Proteins in 
Metabolism 

By using the typical sequence features of mitochon¬ 
drial transport proteins (► Mitochondrial Transport 


Protein Family; Structure), it is possible to identify 
the genes that encode them in sequenced nuclear 
genomes. There are approximately 50 different trans¬ 
porters in man, 35 in yeast, and 70 in plants, but the 
total number of active transporters cannot be assessed 
fully at this stage, as there are many splicing variants, 
pseudo-genes, isoforms, and carriers that express in a 
tissue-specific way. About half of the functions of 
mitochondrial carriers are known in man and they 
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will be described in the following entry (Fig. 1) 
(Agrimi et al. 2004; Fiermonte et al. 2004, 2009; 
Palmieri 2004; Floyd et al. 2007). 

Mitochondrial ADP/ATP carriers AACl 
(SLC25A4), AAC2 (SLC25A5), AAC3 (SLC25A6), 
and AAC4 (SLC25A31) exchange cytosolic ADP for 
ATP that is synthesized in the mitochondrial matrix by 
ATP synthase, and thereby they replenish the cell with 
metabolic energy (Klingenberg 2008). 

Mitochondrial ATP-Mg^'^/Pi transporters APCl 
(SLC25A4), APC2 (SLC25A23), and APC3 
(SLC25A25) regulate the adenine nucleotide pools in 
the mitochondrial matrix upon fission of the mitochon¬ 
drion and supply adenine nucleotides for mitochon¬ 
drial DNA and RNA synthesis (Fiermonte et al. 2004). 

The pyrimidine nucleotide transporter PNCl 
(SLC25A33) translocates UTP primarily, but also 
pyrimidine (deoxy)nucleoside tri- and diphosphates 
and, to a lesser extent, pyrimidine (deoxy)nucleoside 
monophosphates (Floyd et al. 2007). 

The main function of the mitochondrial 
dicarboxylate carrier DIC (SLC25A10) is the transport 
of malonate, malate, succinate, sulfate, thiosulfate, and 
phosphate by electroneutral exchange for the homeo¬ 
stasis of TCA cycle intermediates and of the sulfate 
and phosphate pools (Fiermonte et al. 1998). 

The mitochondrial oxoglutarate carrier OGC 
(SLC25A11) exchanges cytosolic malate for 2- 
oxoglutarate from the mitochondrial matrix and plays 
an important role in the malate-aspartate shuttle, the 
oxoglutarate-isocitrate shuttle and gluconeogenesis 
(Runswick et al. 1990). 

The mitochondrial oxodicarboxylate carrier ODC 
(SLC25A21) transports mainly 2-oxoadipate, 
2 -aminoadipate, and 2-oxoglutarate for the biosynthe¬ 
sis of lysine and glutamate or the catabolism of lysine 
and tryptophan in the matrix (Fiermonte et al. 2001). 

The mitochondrial citrate transporter CIC 
(SLC25A1) translocates citrate and isocitrate, but not 
dicarboxylates, for fatty acid and cholesterol synthesis 
(Kaplan and Mayor 1993). 

Mitochondrial carnitine/acylcarnitine carriers CAC 
(SLC25A20) exchange cytosolic acyl-carnitine for 
carnitine in the matrix in an electroneutral fashion for 
the oxidation of fatty acids in the mitochondrial matrix 
(Huizing et al. 1997). 

The mitochondrial aspartate/glutamate carriers 
AGCl (SLC25A12) and AGC2 (SLC25A13) exchange 
aspartate from the matrix for cytosolic glutamate plus 


a proton and play a key role in the malate-aspartate 
NADH shuttle that transfers electrons from cytosolic 
NADH to matrix NADH (Palmieri et al. 2001). 

The mitochondrial ornithine transporters ORCl 
(SLC25A15) and ORC2 (SLC25A2) exchange citrul- 
line and ornithine, which are the start and end products 
of the urea cycle needed to remove ammonia produced 
in the mitochondrial matrix by deamination (Indiveri 
et al. 1997). 

The main physiological role of the mitochondrial 
glutamate carriers GCl (SLC25A22) and GC2 
(SLC25A18) is to import L-glutamate, produced by 
transamination in the cytosol, into the mitochondrial 
matrix, where glutamate dehydrogenase is located 
(Fiermonte et al. 2002). 

The mitochondrial phosphate transporter PTP 
(SLC25A3) supplies the mitochondrion with inorganic 
phosphate for the synthesis of the ATP (Runswick 
et al. 1987). 

The human SLC25A28 (Mfrn2) and SLC25A37 
(Mfml) might have roles in the utilization and/or 
transport of iron for heme or iron-sulfur cluster assem¬ 
bly (Muhlenhoff et al. 2003). 

Uncoupling protein UCPl (SLC25A7) is found pre¬ 
dominantly in brown adipose tissue of neonatal mam¬ 
mals to cope with low ambient temperatures after birth 
by dissipating the proton electrochemical gradient to 
generate heat (Aquila et al. 1985). The closely related 
UCP2 (SLC25A8), UCP3 (SLC25A9) and the more 
distantly related UCP4 (SLC25A27) and UCP5 
(SLC25A14) might have other roles. The mechanism 
of uncoupling has received a great deal of debate 
(Klingenberg and Echtay 2001; Cannon et al. 2006; 
Azzu and Brand 2010; Jezek et al. 2010). 

The mitochondrial CoA transporter CoAT 
(SLC25A42) exchanges CoA from the cytosol for 
adenosine nucleotides from the mitochondrial matrix 
for the conversion of pyruvate to acetyl-CoA, the tri¬ 
carboxylic acid cycle, fatty acid p-oxidation, and the 
metabolism of amino acids, such as leucine (Fiermonte 
et al. 2009). 

In man the folate transporter MFTC (SLC25A32) 
has been implicated in folate transport (Titus and 
Moran 2000), but was also found to complement 
a flxl deletion in yeast, indicating the gene codes for 
a flavin transporter instead (Spaan et al. 2005) in 
agreement with the close phylogenetic relation to the 
yeast carrier and the presence of an adenine-binding 
site (Robinson and Kunji 2006). 



Mitochondrial Transport Protein Family 


1547 


M 


The main function of the mitochondrial S-adenosyl- 
methionine transporter SAMT (SLC25A26) is to 
exchange cytosolic 5-adenosylmethionine (SAM) for 
matrix S-adenosylhomocysteine, but it also transports 
adenosylomithine and S-adenosylcysteine (Agrimi 
et al. 2004). 

The human thiamine-pyrophosphate transporter 
TPC (SLC25A19) imports thiamine-pyrophosphate 
into the mitochondrion for pyruvate and oxoglutarate 
dehydrogenase reactions (Lindhurst et al. 2006). 

Another 18 mitochondrial carriers have been iden¬ 
tified on the human genome that currently have no 
assigned substrate, although in some cases genetic 
analysis has indicated a possible role in a specific 
aspect of physiology, such as MTCH2 in apoptosis 
(Grinberg et al. 2005). 

There are a large number of transport steps across 
the inner mitochondrial membrane that currently have 
no gene assigned to them, but it is possible that mem¬ 
bers of the transporter family carry out these steps. 
There are several essential amino acids that are essen¬ 
tial in man and thus need to be transported into 
mitochondria for protein synthesis; phenylalanine, 
valine, threonine, tryptophan, isoleucine, methionine, 
and leucine. Iron-sulfur cluster assembly takes place in 
mitochondria, which requires the import of sulfur com¬ 
ponents possibly derived from cysteine. Another 
unidentified amino acid transporter is the mitochon¬ 
drial glutamine transporter, which is likely to be an 
electroneutral uniporter (Shapiro and Curthoys 1978). 
It is likely that nicotinamide mononucleotide is 
transported as a precursor in human mitochondria, 
which is subsequently converted to NAD"^ in the mito¬ 
chondrial matrix, requiring ATP (Berger et al. 2004). 
Aminolevulinic acid needs to enter the mitochondrion 
to start the synthesis of porphyrins. Guanine nucleo¬ 
tides are required for the synthesis of DNA and RNA in 
the matrix. Pyruvate needs to enter the mitochondrion 
for oxidation in the matrix and at the moment there is 
no candidate in man. Cofactor transporters for biotin, 
lipoic acid, and pyridoxal phosphate or their precursors 
have not been identified either. 

Summary 

Mitochondrial transport proteins, also called mito¬ 
chondrial carriers or transporters, link the biochemical 
pathways of the cytosol and mitochondrial matrix by 


transporting metabolites, nucleotides, inorganic ions, 
and cofactors across the mitochondrial inner mem¬ 
brane, which is impermeable for most organic com¬ 
pounds and ions. The transport steps are involved in 
the generation of metabolic energy from the oxidation 
of sugars and fats, the synthesis of heme and iron- 
sulfur clusters, the production of heat, the synthesis 
of macromolecules, and the synthesis, breakdown, and 
interconversion of amino acids. 

Cross-References 
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Synonyms 

Aac3p yeast ADP/ATP carrier isoform 3; ORCl 
human ornithine carrier isoform 1; ORC2 human 
ornithine carrier isoform 2 

Definition 

Members of the mitochondrial transporter or carrier 
family transport keto acids, fatty acids, amino acids, 
nucleotides, and cofactors across the mitochondrial 
inner membrane. They can be identified by a number 
of sequence and structural features that are unique for 
members of this protein family. First, they comprise 
three homologous sequence repeats, which form 
a threefold pseudosymmetrical structure. Second, 
each repeat consists of two transmembrane a-helices 
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linked to together by a linker and matrix a-helix. Third, 
there is a single substrate binding site in the central 
cavity. Fourth, substrate-proton symporters have neg¬ 
atively charged residues in the substrate binding site 
that might couple proton and substrate binding. Fifth, 
there are salt bridge networks on the matrix and cyto¬ 
plasmic side of the cavity that form or disrupt to 
regulate alternating access to the central substrate 
binding site. 

Introduction 

On the basis of many conserved sequence features 
mitochondrial transport proteins (known as carriers) 
are likely to have similar structures and transport 
mechanisms. A combination of sequence and struc¬ 
tural analysis has highlighted the basic functional ele¬ 
ments of mitochondrial carriers. 


Conserved Sequence and Structural 
Features of Mitochondrial Transport 
Proteins 

Mitochondrial carriers have a characteristic tripartite 
structure (Saraste and Walker 1982), consisting of 
three homologous amino acid sequences of about 100 
amino acids each (Fig. 1). Hydropathy profiling and 
secondary structure predictions indicated that each 
repeat consists of two transmembrane tx-helices and 
a short tx-helix in the matrix loop (Kunji 2004; Walker 
1992). In addition well-conserved sequence motifs are 
present in all three repeats of mitochondrial carriers 
(Fig. 2). Currently there is only one atomic structure; 
the bovine ADP/ATP carrier in complex with the 
inhibitor carboxy-atractyloside (Pebay-Peyroula et al. 
2003) (Fig. 3). The structure was a major step forward 
in understanding the basic structure of mitochondrial 
carriers in the cytoplasmic-open state, in which the 
substrate binding site is accessible from the lumen of 
the crista and intermembrane space, which are contig¬ 
uous with the cytosol. 

The odd- and even-numbered a-helices form 
a barrel with a large central water-filled cavity. The 
structure is threefold pseudosymmetrical in agreement 
with the sequence repeats (Kunji and Harding 2003; 
Pebay-Peyroula et al. 2003). There are three a-helices 
in the matrix loops that are embedded in the surface of 


the membrane on the matrix side of the protein. There 
are also short linker a-helices between the matrix and 
even-numbered a-helices (Figs. 2 and 3). 

On the matrix side of the transporter on the odd- 
numbered a-helices conserved PX[DE]XX[RK]X 
[RKjXQ motifs are found (Figs. 2 and 3). The proline 
residues of the motif are present at the kinks in the odd- 
numbered a-helices (Pebay-Peyroula et al. 2003), 
whereas the negatively and positively charged residues 
form the matrix salt bridge network in the cytoplasmic- 
open state (Nelson et al. 1998; Pebay-Peyroula et al. 
2003), when the substrate binding site is open to the 
lumen of the crista (Fig. 4). The glutamine residues 
might interact with the network from the matrix side 
(Miniero et al. 2011). In addition, there is a GXXXG 
motif, which may allow close helical approaches for 
movements. 

The matrix a-helices are the least conserved struc¬ 
tural elements, but there is a conserved negatively 
charged residue on the N-terminal side of the matrix 
a-helix and a [ED]G motif on the C-terminal side. The 
glycine of the latter motif causes a break between the 
matrix and linker a-helices and the conserved acidic 
residue putatively interacts with the positively charged 
residues that follow the residues of the matrix salt 
bridge network (see above). Between the linker a- 
helix and the even-numbered a-helices is a conserved 
motif [FWY][KR][G]. 

On the cytoplasmic side of the protein on the even- 
numbered a-helices are conserved [EW]XX[YF][DE] 
XX[RK] motifs (Eigs. 2 and 3). The charged residues 
could form the cytoplasmic salt bridge network when 
the carrier is in the matrix state with the substrate 
binding site accessible from the mitochondrial matrix 
(Robinson et al. 2008) (Fig. 4). The preceding aromatic 
residues are bulky and could form another layer of the 
cytoplasmic gate beneath the cytoplasmic salt bridge 
network and/or help to close gaps (Miniero et al. 2011). 

The Substrate Binding Site of Mitochondrial 
Carriers 

The structure did not reveal the location of the 
substrate binding site, as the inhibitor carboxy- 
atractyloside is not a chemical analogue of the sub¬ 
strates. Three independent approaches have been 
applied to locate the substrate binding site of mito¬ 
chondrial carriers. The first approach used comparative 
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Mitochondrial Transport Protein Family: Structure, 
Fig. 1 The topology model of the yeast ADP/ATP carrier 3 
(Aac3p). Shown are the three homologous sequence repeats 
that are typical for mitochondrial carriers. To illustrate the 
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sequence conservation the residues that are strictly conserved 
in all three repeats are colored red, whereas those present in two 
out of three repeats are colored in orange 


Structural models combined with chemical and broadly 
defined distance constraints to identify conserved sub¬ 
strate binding sites that were capable of discriminating 
between different substrates in amino and keto acid 
transporters (Kunji and Robinson 2006; Robinson and 
Kunji 2006). The same site was found to have the 
propensity to bind nucleotides and cofactors in nucle¬ 
otide and vitamin transporters. The properties of the 
binding site are in agreement with the notion that 
transporters need to recognize the substrate but not 


bind them tightly otherwise they are not released on 
the other side of the membrane. 

The main contact points involved in substrate bind¬ 
ing, indicated by roman numerals I, II, and III (Fig. 5), 
were located on the even-numbered a-helices and were 
related by pseudosymmetry. The contact point may 
involve several residues on the same face of the 
a-helix, but they are located around residues 79, 179, 
and 279 of the bovine ADP/ATP carrier. Contact 
point II on H4 determines the class of the substrate. 
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odd-numbered a-helix 

GXXXG PX[DE]XX[RK]X[RK]XQ 

G-motif matrix salt bridge network 


matrix a-helix linker a-helix even-numbered a-helix 

[DE]G ^ [YF][RK]G [YF]XX[YF][DE]XX[RK] ■ 

YRG-motif substrate binding site cytoplasmic salt bridge network 


Mitochondrial Transport Protein Family: Structure, 
Fig. 2 Sequence elements conserved in the three sequence 
repeats of mitochondrial transport proteins. The odd-numbered, 
matrix, linker, and even-numbered a-helices are shown in green, 
blue, orange, and red, respectively. One-letter amino acid code 


is used to define the signature motifs present in the sequence 
(black line) at the approximate position relative to the structural 
elements. The brackets indicate different amino acid residues 
found at that position in the sequence 


Mitochondrial Transport 
Protein Family: Structure, 
Fig. 3 Structural fold of the 
mitochondrial ADPIATP 
carrier. The structure of the 
bovine mitochondrial ADP/ 
ATP carrier (Pebay-Peyroula 
et al. 2003). The odd- 
numbered, matrix, linker, and 
even-numbered a-helices are 
shown in green, blue, orange, 
and red, respectively, as in 
Fig. 2 



i.e., amino acid, keto acid, or adenine nucleotide, as 
[KR][DE] binds an amino group, [RK]Q a keto group, 
and G[ILVM] the adenine moiety. Contact point I on 
H2 determines the type within the class, e.g., which 
side chain of an amino acid is bound. Contact point III 
on H6 is quite generic and comprises often a positively 
charged amino acid, as most substrates have negatively 
charged functional groups, such as phosphates or car- 
boxylates. The substrate binding site has evolved to 
bind different lengths of substrates to the same location 
and to provide residues with counter charges to 
neutralize negatively charged substrates and thus to 
facilitate their movement against the membrane 
potential. Recently, the contact points of the human 


ornithine carrier were mutated and shown to be critical 
for function (Monne et al. 2012). The two isoforms of 
the ornithine carrier differ in specificity and transport 
rate; ORCl can transport L-ornithine, L-arginine, and 
L-lysine at a high rate, whereas ORC2 can also trans¬ 
port L-histidine, L-homoarginine and the D-forms of 
the amino acids, but more slowly. They differ in con¬ 
tact point II residues, where ORC has RE and ORC2 
RQ. When Glu in ORCl was mutated to Gin and Gin in 
ORC2 was mutated to Glu, effectively swapping 
contact point II, the substrate specificity and the 
transport rate were exchanged too, providing the best 
experimental data yet that it is involved in determining 
the substrate specificity (Monne et al. 2012). 
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Mitochondrial Transport Protein Family: Structure, 
Fig. 4 Basic functional elements of the mitochondrial carrier. 
The backbone structure of the ADP/ATP carrier is shown in 
yellow. The matrix and cytoplasmic salt bridge network are 
shown in green and blue, respectively. The asymmetric residues 
of the central substrate binding site are color-coded according to 
the level of asymmetry; from red, highly asymmetric, to white, 
neutral (Robinson et al. 2008). The approximate boundary of the 
substrate binding site is indicated by a red dashed line 


The second approach used molecular dynamics 
simulations to identify residues of the hovine mito¬ 
chondrial ADP/ATP carrier that are involved in the 
trajectory and binding of ADP in the cavity (Dehez 
et al. 2008; Wang and Tajkhorshid 2008). It is not 
clear whether it is appropriate to take the carboxy- 
atractyloside inhibited state to represent the substrate 
binding state, as the carrier is locked in an aborted state 
that is not part of the transport cycle. However, the 
study demonstrated that ADP binds to the binding site 
with an aromatic stacking arrangement of the adenine 
moiety with Y186. After the initial binding event, the 
beta-phosphate of ADP binds to R235, which comes 
through the matrix salt bridge network formed by the 
charged residues of the signature motif. 

The third approach exploited the principle that 
mitochondrial carriers have a high degree of threefold 
pseudosymmetry in sequence (Saraste and Walker 
1982) and structure (Kunji and Harding 2003; Pebay- 
Peyroula et al. 2003), whereas the transported 
substrates are asymmetric in structure and chemistry 
(Robinson et al. 2008). Thus the residues involved in 
substrate binding must differ in properties from the other 
symmetry-related residues to couple the binding of the 
asymmetric substrate to a symmetric transport 




Mitochondrial Transport Protein Family: Structure, 
Fig. 5 Putative substrate and proton binding site of the mito¬ 
chondrial oxodicarboxylate carrier. A comparative homology 
model of the yeast oxodicarboxylate carrier (Kunji and Robinson 
2006; Palmieri et al. 2001) with bound substrate oxoglutarate 
shown in orange. Residues K94, R95, and K98 bind the 
carboxylate groups of oxodicarboxylates of different lengths. 
In between these groups are two glutamates E136 and E91, 
which are involved in proton binding, but only if a substrate is 
present. Contact point II residues HI87 and R186 bind the keto 
group of oxoglutarate, whereas K94 and E91 plus a proton hind 
the carhoxylate group (See text for further explanation) 


mechanism. For all known subfamilies of mitochondrial 
carriers conserved asymmetric residues were found to 
cluster at a site that overlapped with the substrate bind¬ 
ing site as defined by the contact points (Kunji and 
Robinson 2006; Robinson and Kunji 2006; Robinson 
et al. 2008). Importantly, no other clusters of asymmet¬ 
ric residues were found on the water-accessible surfaces 
of the carriers, indicating that mitochondrial carriers 
have most likely only one substrate binding site. This 
observation supported the “single binding center gated 
pore” mechanism of transport (Klingenberg 1976), 
a version of the alternating access mechanism of 
transport. In the case of transporters with known speci¬ 
ficity, the functional groups of the conserved asymmet¬ 
ric residues had the propensity to bind to the functional 
groups of the identified substrates (Robinson et al. 
2008). As the symmetry analysis centered only on quan¬ 
tifying the differences in properties of symmetry-related 
residues, the location of the clusters and complementary 
binding properties were an independent outcome of the 
analysis. 
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The location of the substrate binding site corre¬ 
sponds to the middle of the membrane, as observed 
for other transporters unrelated by structure or mecha¬ 
nism, such as the lactose transporter (Abramson et al. 
2003), maltose transporter (Oldham et al. 2007), leu¬ 
cine transporter (Yamashita et al. 2005; ► IT^-Lactose 
Membrane Transport Protein, LacY), and hydantoin 
transporter (Shimamura et al. 2010; ► Mhpl 

(Microhacterium hydantoin Permease)). This common 
location might indicate that there are similar underly¬ 
ing biophysical principles at work. Firstly, water-filled 
channels bring the substrates to the center of the mem¬ 
brane, which is energetically the most unfavorable 
place for a polar and charged substrate to be. Secondly, 
transporters must have gates on either side of the 
membrane, because the two compartments must not 
be brought into direct contact with each other during 
the transport cycle, otherwise the proton electrochem¬ 
ical gradient would collapse. Structurally, this require¬ 
ment is best fulfilled when the substrate binding site is 
in the middle of the membrane leaving space for sub¬ 
stantial gates on either side of the membrane. 

Proton Binding Sites of Symporters 

Some members of the mitochondrial transporter family 
transport substrates in symport with protons into the 
mitochondrion. Here, the proton electrochemical gradi¬ 
ent is used to drive the uptake of the substrates against 
their chemical gradients, as the concentration of the 
substrate might be higher on the inside of the mitochon¬ 
drion than on the outside. There are several basic ques¬ 
tions that need to be answered. First, how are protons 
bound by mitochondrial carriers? Second, how does pro¬ 
ton binding relate to the binding of substrates? Third, 
how can substrate and proton binding explain the bioen- 
ergetic principles that govern the transport processes? 

It Is technically challenging to define proton bind¬ 
ing sites structurally, as it is difficult to trap the sub¬ 
strate-proton binding states and to detect protons. Thus 
the analysis of putative proton binding sites here is 
based solely on structural and chemical considerations 
and the analysis of sequence information. Since pro¬ 
tons can only bind to the carrier asymmetrically, sym¬ 
metry analysis should be able to detect proton or other 
ion binding sites (Robinson et al. 2008). Conserved 
and negatively charged residues were found to be 
present in the substrate binding sites of mitochondrial 


carriers that carry out substrate-proton symport (Kunji 
and Robinson 2006, 2010; Robinson and Kunji 2006; 
Robinson et al. 2008). The most common negatively 
charged residue is glutamate, but aspartate also occurs, 
which have a pKa of ^4, depending on the local 
environment. For example, the oxodicarboxylate car¬ 
rier has a keto acid binding site in contact point II and 
three positively charged binding sites in contact point 
I for the binding of carboxylate groups of different 
lengths of oxodicarboxylates, such as 2-oxoglutarate, 
2-oxoadipate, and 2-oxopimelate (Kunji and Robinson 
2006; Palmieri et al. 2001) (Fig. 5). In between these 
positively charged receptors for the carboxylate groups 
are two negatively charged glutamates roughly equi¬ 
distant. A similar arrangement can be found in the 
aspartate/glutamate carrier, phosphate carrier, and 
GTP/GDP carrier, which are also known proton- 
substrate symporters (Kunji and Robinson 2010). 

At first glance this is counterintuitive, as the sub¬ 
strates often have negatively charged functional groups, 
which would be repelled by the negatively charged 
residues in the binding site. However, the binding of an 
additional proton could lead to proton-mediated carbox- 
ylate-carboxylate or phosphate-carboxylate interactions, 
which could contribute to the overall interaction energy 
of the substrate with the protein. Thus, the substrate 
cannot bind to the site unless a proton is bound, whereas 
a proton cannot be bound to the site unless a substrate is 
bound, as the local pH is unlikely to be below 4. This 
notion may explain the principle of strict proton¬ 
coupling as their binding is interdependent, but it may 
also explain why the substrate accumulates in the matrix 
as a consequence of proton-coupling. The mitochondrial 
matrix is more alkaline than the intermembrane space 
and thus it is less likely that a proton can be recruited for 
substrate binding. By coupling substrate to proton bind¬ 
ing, export of the substrate is blocked and thus it can be 
accumulated against its chemical gradient. 

Summary 

Mitochondrial transport proteins are encoded by three 
homologous sequence repeats and their structures are 
threefold pseudosymmetrical. The basic structure 
comprises six transmembrane a-helices, surrounding 
a central cavity, and three linker and matrix a-helices. 
There is a single substrate binding site in the central 
cavity, which binds the substrate at a location that 
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corresponds approximately to the middle of the mem¬ 
brane. The substrate binding sites of transporters that 
use the proton motive force to drive the transport of 
substrates contain also negatively charged residues 
that might couple proton and substrate binding. Alter¬ 
nating access to the substrate binding site from either 
side of the membrane is regulated by two salt bridge 
networks that form part of the access gates. 

Cross-References 

► Membrane Transport, Energetics and Overview 

► Mitochondrial Transport Protein Family 

► Mitochondrial Transport Protein Family: Structure 


References 

Abramson J, Smirnova I, Kasho V, Vemer G, Kaback HR, 
Iwata S. Structure and mechanism of the lactose permease 
oiEscherichia coli. Science. 2003;301:610-5. 

Dehez F, Pebay-Peyroula E, Chipot C. Binding of ADP in the 
mitochondrial ADP/ATP earner is driven by an electrostatic 
funnel. J Am Chem Soc. 2008;130:12725-33. 

Klingenberg M. The ADP-ATP carrier in mitochondrial 
membranes. In: Martonosi AN, editor. The enzymes of bio¬ 
logical membranes, vol. 3. New York: Plenum Press; 1976. 
p. 383M38. 

Kunji ERS. The role and structure of mitochondrial carriers. 
FEBS Lett. 2004;564:239M4. 

Kunji ERS, Harding M. Projection structure of the atractyloside- 
inhibited mitochondrial ADP/ATP carrier of Saccharomyces 
cerevisiae. J Biol Chem. 2003;278;36985-8. 

Kunji ERS, Robinson AJ. The conserved substrate binding 
site of mitochondrial carriers. Biochim Biophys Acta. 
2006; 1757:1237M8. 

Kunji ERS, Robinson AJ. Coupling of proton and substrate 
translocation in the transport cycle of mitochondrial carriers. 
Curr Opin Struct Biol. 2010;20:440-7. 

Miniero DV, Cappello AR, Curcio R, Ludovico A, Daddabbo L, 
Stipani I, Robinson AJ, Kunji ER, Palmieri F. Functional and 
structural role of amino acid residues in the matrix 
alpha-helices, termini and cytosolic loops of the bovine 
mitochondrial oxoglutarate carrier. Biochim Biophys Acta. 
2011;1807:302-10. 

Monne M, Miniero DV, Daddabbo L, Robinson AJ, Kunji ERS, 
Palmieri F. The substrate specificity of the two mitochondrial 
ornithine carriers can be swapped by a single mutation in the 
substrate binding site. J Biol Chem. 2012;287:7925-34. 

Nelson DR, Felix CM, Swanson JM. Highly conserved charge- 
pair networks in the mitochondrial carrier family. J Mol Biol. 
J998;277:285-308. 

Oldham ML, Khare D, Quiocho FA, Davidson AL, Chen J. 
Crystal structure of a catalytic intermediate of the maltose 
transporter. Nature. 2007;450:515-21. 


Palmieri L, Agrimi G, Runswick MJ, Fearnley JM, Palmieri F, 
Walker JE. Identification in Saccharomyces cerevisiae of 
two isoforms of a novel mitochondrial transporter for 2- 
oxoadipate and 2-oxoglutarate. J Biol Chem. 2001 ;276: 
1916-22. 

Pebay-Peyroula E, Dahout-Gonzalez C, Kahn R, Trezeguet V, 
Lauquin GJ, Brandolin G. Structure of mitochondrial ADP/ 
ATP carrier in complex with carboxyatractyloside. Nature. 
2003;426:39M4. 

Robinson AJ, Kunji ERS. Mitochondrial carriers in the cytoplas¬ 
mic state have a common substrate binding site. Proc Natl 
Acad Sci USA. 2006; 103:2617-22. 

Robinson AJ, Overy C, Kunji ERS. The mechanism of transport 
by mitochondrial carriers based on analysis of symmetry. 
Proc Natl Acad Sci USA. 2008; 105:17766-71. 

Saraste M, Walker JE. Internal sequence repeats and the path of 
polypeptide in mitochondrial ADP/ATP translocase. FEBS 
Lett. 1982;144:250-4. 

Shimamura T, Weyand S, Beckstein O, Rutherford NG, 
Hadden JM, Sharpies D, Sansom MS, Iwata S, Henderson PJ, 
Cameron AD. Molecular basis of alternating access membrane 
transport by the sodium-hydantoin transporter Mhpl. Science. 
2010;328:470-3. 

Walker JE. The mitochondrial transporter family. Curr Opin 
Struct Biol. 1992;2:519-26. 

Wang Y, Tajkhorshid E. Electrostatic funneling of substrate in 
mitochondrial inner membrane carriers. Proc Natl Acad Sci 
USA. 2008;105:9598-603. 

Yamashita A, Singh SK, Kawate T, Jin Y, Gouaux E. Crys¬ 
tal structure of a bacterial homologue of NaVCP- 
dependent neurotransmitter transporters. Nature. 2005; 
437:215-23. 


Mitochondrial Transporters: Molecular 
Mechanism 

Edmund R. S. Kunji 

Mitochondrial Biology Unit, The Medical Research 
Council, Cambridge, UK 

Definition 

The overall transport reactions catalyzed by the mito¬ 
chondrial transport proteins, also called mitochondrial 
transporters or carriers, can be represented as follows: 

Uniport 

Substrate l(out) ± proton (out) ^ substrate l(in) ± 
proton (in) 

OR 

Antiport (also known as exchange) 

Substrate l(out) ± proton (out) -i- substrate 2(in) ^ 
substrate 1 (in) ± proton (in) + substrate 2(out) 




Mitochondrial Transporters: Molecular Mechanism 


1555 


Introduction 

Members of the mitochondrial transporter family 
transport a large variety of substrates differing in 
size and chemical and physical properties (► Mem¬ 
brane Transport, Energetics and Overview), but they 
mostly exchange structurally and chemically 
related compounds that are linked by a biochemical 
pathway. For instance, ADP and ATP are related 
by phosphorylation, 5-adenosine-methionine and 
S-adenosine-homocysteine by demethylation, and 
acylcamitine and carnitine by fatty acid P-oxidation. 
Some transporters are involved in the net import of 
substrates. For instance, nucleotides and amino acids 
are imported into mitochondria for the incorporation 
into DNA/RNA and proteins, respectively. 

Driving Forces for Transport 

Like secondary active transport proteins of other trans¬ 
porter families (► Membrane Transport, Energetics 
and Overview), mitochondrial transport proteins 
require (electro)chemical gradients of substrates and/ 
or a proton electrochemical gradient to drive transport. 
These principles are best appreciated by giving specific 
examples (Fig. 1), but first a brief overview of mito¬ 
chondrial bioenergetics is required. 

Mammalian mitochondria play a central role in 
the oxidation of keto acids derived from the degra¬ 
dation of sugars by glycolysis and fatty acids derived 
from lipid breakdown (Fig. 1). Fatty acids are 
degraded by fatty acid p-oxidation, which is the 
cyclical breakdown of the hydrocarbon chain by 
removing two carbon atoms in each cycle. Electrons 
released from the bonds are passed by acyl-CoA 
dehydrogenases via the electron transfer flavopro- 
tein (ETF) and ETF-dehydrogenase to ubiquinone, 
and by p-hydroxyacyl CoA dehydrogenases to 
NAD"^ to form NADH, which is oxidized by complex 
I (see below). Glycolysis of sugars in the cytosol 
generates pyruvate, which is transported into the 
mitochondria by the pyruvate transporter (Papa 
et al. 1971; Thomas and Halestrap 1981). In the 
mitochondrial matrix pyruvate is converted into ace- 
tyl-CoA by pyruvate dehydrogenase and acetyl-CoA 
enters the citric acid cycle, where at three different 
points, electrons are extracted to reduce NAD"^ to 
NADH. FMN in complex I takes the electrons of 


NADH and passes them via a chain of iron-sulfur 
clusters to ubiquinone (Fig. 1) (Sazanov and 
Hinchliffe 2006; Efremov et al. 2010). In addition, 
succinate dehydrogenase, also called complex II, 
carries out the conversion of succinate to fumarate 
in the citric acid cycle, extracting electrons that pass 
via FAD and iron-sulfur clusters to ubiquinone (Sun 
et al. 2005) (Fig. 1). By accepting electrons and 
protons ubiquinone is converted into ubiquinol, 
which diffuses laterally in the membrane to complex 
III, also called bcl-complex (Iwata et al. 1998; 
Zhang et al. 1998; Hunte et al. 2000). In complex 
III, the electrons of ubiquinol are transferred to cyto¬ 
chrome c, which diffuses to complex IV, also called 
cytochrome c oxidase (Tsukihara et al. 1996), where 
electrons and protons are used for the conversion of 
oxygen to water (respiration). The redox reactions in 
complex I, complex III, and complex IV are coupled 
to the extrusion of protons from the mitochondrial 
matrix. The translocation of the positively charged 
protons generates a charge separation (membrane 
potential, AT) and a difference in proton concentra¬ 
tion (pH difference, ApH) across the inner mitochon¬ 
drial membrane. Together they form the proton 
electrochemical gradient (AiiH"^ = AT—2.3(RT/F)A 
pH) or proton motive force (Ap = —ApnVF), which 
is used by F-type ATP synthase to synthesize ATP 
from ADP and inorganic phosphate by a rotary 
mechanism (Abrahams et al. 1994; Stock et al. 
2000) (Fig. 1) (► Membrane Transport, Energetics 
and Overview). 

The ADP/ATP carrier facilitates the electrogenic 
exchange of ADP from the cytosol for ATP from the 
mitochondrial matrix (Klingenberg 2008). The exchange 
is driven by the chemical gradient of the substrates, as 
the concentration of ADP is relatively high on the cyto¬ 
solic side and the concentration of ATP is relatively high 
in the mitochondrial matrix. The exchange is electro¬ 
genic and driven by the membrane potential, as a net 
negative charge is moved toward the positive outside 
potential by the import of ADP^“ against the export of 
Atp'^“ (Klingenberg and Rottenberg 1977). 

The phosphate carrier carries out electroneutral 
uniport, in which a phosphate molecule is imported 
together with a proton (Kunji and Robinson 2010). 
The transport cycle is electroneutral, because no net 
charge movement is occurring. Previously it was 
assumed that phosphate was exchanged for 
a hydroxyl ion (OH~) (Stappen and Kramer 1994), 
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Mitochondrial Transporters: Molecular Mechanism, 
Fig. 1 Driving forces for transport by the mitochondrial trans¬ 
port proteins. The outer and inner membranes are indicated with 
dashed and continuous lines, respectively. Also shown is 
a mitochondrial crista, which is an invagination of the mitochon¬ 
drial inner membrane. Depicted in green and blue are the 


complexes of the respiratory chain and ATP synthase, respec¬ 
tively, and the transporters are shown in yellow. Four examples 
of different driving forces are illustrated; the phosphate carrier 
(PiC), the ADP/ATP carrier (AACl), aspartate/glutamate carrier 
(AGCl), and the oxoglutarate carrier (OGC) 


which is thermodynamically equivalent, but the pres¬ 
ence of a negatively charged residue in the substrate 
binding site argues against this proposal (Kunji and 
Robinson 2010). Although the phosphate carrier is 
fully able to carry out exchange, the coupling of pro¬ 
ton and phosphate binding in effect means that the 
transport is unidirectional, allowing the accumulation 
of phosphate in the mitochondrial matrix (Kunji and 
Robinson 2010). 

The aspartate/glutamate carrier imports glutamate 
plus a proton and exports aspartate. The transport 


cycle is electrogenic, as a net positive charge is 
moved in the direction of the negative inside mem¬ 
brane potential. The exchange is also driven by the 
chemical gradients of the two substrates (differences 
in concentration), which are determined by their links 
to biochemical pathways in the cytosol and mitochon¬ 
drial matrix. 

The dicarboxylate carrier carries out the 
electroneutral exchange of dicarboxylates or phos¬ 
phate (all ^“), meaning that the transport is only driven 
by the chemical gradients of substrates. 
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Mitochondrial Transporters: Molecular Mechanism, 
Fig. 2 ADP binding and the basic structural features of the 
yeast ADP/ATP carrier, (a) The backbone structure of the 
yeast ADP/ATP carrier is shown in yellow. The matrix and 
cytoplasmic salt bridge networks are shown in dai'k and light 
green, respectively. The asymmetric residues of the central 
substrate binding site are color-coded according to the level of 
asymmetry; from red, highly asymmetric, to white, neutral (Rob¬ 
inson et al. 2008). The contact points of the substrate binding site 
are shown in black and are numbered with roman numerals. The 


approximate boundary of the substrate binding site is indicated 
by a hexagon, (b) The residues of the cytoplasmic (top) and 
matrix network (bottom) in yeast ADP/ATP carrier Aac2p, 
respectively. The positively charged, negatively charged, and 
polar residues of the salt bridges are shown in blue, red, and 
green, respectively. In the cytoplasmic state, the matrix salt 
bridge network (bottom) is formed (dashed line), whereas in 
the matrix state, the cytoplasmic salt bridge network (top) is 
formed (dotted line). The salt bridge interactions and hydrogen 
bonds are shown in black and gray, respectively 


Dimer Versus Monomer Model 

fteviously it was thought that the mitochondrial trans¬ 
port proteins were dimeric based on a large number of 
studies (Kunji and Crichton 2010). The carriers had only 
six transmembrane ot-helices, whereas most transporters 
characterized at the time had 12, such as ABC trans¬ 
porters and members of the major facilitator superfam¬ 
ily (► Membrane Transport, Energetics and Overview). 

From a functional point of view, two models were 
proposed. On the basis of inhibitor analysis, a single 
binding center gated pore mechanism was proposed in 
which two gates regulated access to a single substrate 
binding site (Klingenberg 2008), which is a version of 
the alternating access model (► Membrane Transport, 
Energetics and Overview). It was suggested that the 


transport pathway was between the two domains of the 
dimer (Huang et al. 2001), as is the case for ABC trans¬ 
porters. On the basis of kinetic analysis, indicating the 
formation of a ternary complex, a dimer was proposed 
that consisted of two domains in which one binding site 
was accessible to the cytoplasm and one accessible to 
the matrix (Palmieri et al. 1993). Binding of substrates 
to both sites simultaneously would lead to conforma¬ 
tional changes opening the binding sites to the other side 
of the membrane, explaining well why most mitochon¬ 
drial transport proteins carry out equimolar exchange. 

However, the dimer models were no longer tenable 
when structural information for the mitochondrial 
ADP/ATP carrier became available. The projection 
structure of the yeast ADP/ATP carrier in the mem¬ 
brane (Kunji and Harding 2003) and the atomic 
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Mitochondrial Transporters: Molecular Mechanism, 
Fig. 3 Transport cycles of mitochondrial transport proteins. 
The charged residues of the matrix and cytoplasmic salt bridge 
networks are shown in dark and light green, respectively. The 
imported and exported substrates are orange and purple, respec¬ 
tively. All transport steps are fully reversible, but the direction of 
transport (red arrows) is determined by the chemical gradients 
of the substrates and proton motive force, if applicable. During 
substrate import, the matrix network on the odd-numbered 
a-helices is released while the cytoplasmic network on the 
even-numbered a-helices is formed, and the reverse during 
substrate export. The structural changes might occur with 


Structure of the bovine ADP/ATP carrier in detergent 
(Pebay-Peyroula et al. 2003) showed that the 
structural scaffold was a ring of six transmembrane 
a-helices with three-fold pseudo-symmetry in agree¬ 
ment with the three amino acid sequence repeats. The 
translocation path for nucleotides was clearly through 
the center of a single protomer, indicating that they 
could function as monomers (Kunji and Harding 
2003). Thus, the structure was neither compatible 
with a dimer consisting of protomers in opposing 
states nor with a dimer with the translocation 
path between two protomers. Subsequently, new 


three-fold pseudo-symmetry via a rotary twist (orange arrows). 
When the matrix network has three salt bridges, the likelihood 
that these are broken simultaneously is very low, meaning that 
a conversion from the empty m-state to the empty c-state does 
not occur and the exchange of substrates will be strictly equi¬ 
molar. However, when the matrix network is weak due to the 
absence of salt bridge interactions through polar, hydrophobic, 
or aromatic substitutions, the likelihood that the carriers convert 
from the empty m-state to the empty c-state is much higher. In 
this case, there might be more transport of the substrate from the 
outside to the inside of the mitochondrion, i.e., net import via 
a uniport step 


experimental evidence became available and old 
data were reinterpreted to indicate that mitochondrial 
carriers are monomeric in structure and function 
(Kunji and Crichton 2010). As will be argued below, 
the monomer has all of the required structural features 
for a functional transporter. 

Alternating Access Mechanism 

Mitochondrial carriers have a well-conserved three¬ 
fold pseudo-symmetry in their structures and 
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sequences (Saraste and Walker 1982; Kunji and 
Harding 2003; Pebay-Peyroula et al. 2003; Robinson 
et al. 2008). However, a strict conservation of the three 
repeats is not required to maintain the three-fold sym¬ 
metrical structural fold, as in general, the same fold can 
be achieved with different sequences of amino acids. 
Thus, if symmetry-related residues are identical, they 
are likely to be important for the function of the mito¬ 
chondrial transport proteins. Highly conserved sym¬ 
metrical residues were found at the water-membrane 
interfaces of the carriers, flanking the substrate binding 
site, which is asymmetrical. One set consisted of the 
three PX[DE]XX[RK] signature motifs, which form 
a salt bridge network on the matrix side of the cavity, 
when the substrate binding site is open to the mito¬ 
chondrial intermembrane space, called the cytoplas¬ 
mic state (Nelson et al. 1998; Pebay-Peyroula et al. 
2003) (Fig. 2). The second set consisted of three sym¬ 
metrical [FY][DE]XX[RK] motifs and is located on 
the cytoplasmic side of the cavity, which could poten¬ 
tially form a salt bridge network when the substrate 
binding site is accessible from the mitochondrial 
matrix, called the matrix state (Fig. 2) (Robinson 
et al. 2008). The opening and closing of the carrier 
could be coupled to the disruption and formation of 
these two salt bridge networks via a three-fold rotary 
twist induced by substrate binding (Fig. 3). A central 
substrate binding site and gates on either side of the 
membrane are consistent with the “single binding cen¬ 
ter gated pore” mechanism of transport and are fea¬ 
tures shared with structurally and mechanistically 
unrelated transporters (►Mhpl (Microbacterium 
hydantoin Permease), ► H'^-Factose Membrane Trans¬ 
port Protein, LacY, ► FucP). 

On the basis of the interaction energies of the 
networks, members of the mitochondrial carrier family 
can be divided into strict exchangers or net importers 
(Robinson et al. 2008). When both networks are strong, 
interconversion between the conformational states in 
the absence of substrate may be prevented, as the 
networks will pose a considerable energy barrier with 
a low probability of disrupting spontaneously (red 
circle in Fig. 3). Consequently, the exchange of the 
substrates will be equimolar and dependent on the 
presence of substrate on either side of the membrane 
providing the driving force (Fig. 3). If the cytoplasmic 
network is weak and the matrix network is strong, the 
protein could convert from the matrix to the cytoplas¬ 
mic state in the absence of substrate, as a weak network 


has a higher probability of breaking spontaneously 
(blue arrow in Fig. 3). In this case, the carriers could 
function as net importers into mitochondria if the sub¬ 
strate is present on the cytoplasmic side. Exchange will 
always be faster than uniport, as both substrates pro¬ 
vide energy input during exchange whereas there is 
a slow step in uniport when the carrier converts from 
the matrix to the cytoplasmic state in the absence of 
substrate. 

Summary 

Mitochondrial transport proteins carry out exchange and 
uniport transport steps across the inner membrane of 
mitochondria. These transport cycles are driven by the 
chemical gradients of the substrates across the mem¬ 
brane and in some cases by the proton motive force. The 
monomer is the functional unit, as it has all of the 
required structural and sequence features to carry out 
the transport cycle. The transport proteins have a single 
substrate binding site flanked by two salt bridge net¬ 
works that are components of the gates that regulate 
access of substrate(s) to the binding site in an alternating 
fashion. The interaction energy of the cytoplasmic salt 
bridge network may provide an energy barrier that pre¬ 
vents the conversion of the carrier from the matrix state 
to the cytoplasmic state in the absence of substrate. If 
the energy barrier is high, because all three salt bridge 
interactions are present, then this step does not occur 
and exchange of substrates is equimolar. If the energy 
barrier is low through polar, aromatic, or aliphatic sub¬ 
stitutions, the empty state conversion has a higher prob¬ 
ability of occurring and thus overall more substrate is 
imported than exported. 

Cross-References 

► Membrane Transport, Energetics and Overview 

► Mitochondrial Transport Protein Family 
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Introduction 

Since their discovery in the first half of the twentieth 
century, antibodies (ab) have occupied a prominent 
role in several research areas as well as in a large 
number of biotechnological and clinical applications. 
Not surprisingly, a vast number of studies have 
provided deep insight into their ontogenesis, sequence, 
function and three dimensional structure. This research 
effort has allowed, among other results, the possibility 
of generating accurate structural models of immuno¬ 
globulins in a relatively automatic fashion. 

This entry will survey computational methods for 
antibody structure prediction and will present the most 
useful tools and resources. 

Antibody Sequence and Structure 

Porter (1959) and Edelman (1959) elucidated the basic 
primary structure of the immunoglobulin (Ig) molecule 
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consisting of two heavy chains and two light chains. 
Each chain in turn consists of a variable domain 
(Vh and Vl) and one, three, or four constant domains 
(Ch and Cl) according to the heavy chain isotype 
(y, §, a, p, and s). The reader is referred to the many 
existing reviews (Alzari et al. 1988; Padlan 1994; 
Elgert 1996). Wu and Kabat (1970) by analyzing Fv 
fragment (i.e., the dimer of Vh and Vl domains. Fig. 1) 
identified three sequence portions on each chain, the 
so-called hypervariable regions, with an extremely 
variable amino acid composition in comparison with 
the other less variable parts. They correctly predicted 
such regions to assume a loop conformation arising 
from a relatively conserved framework and to be 
responsible for the selective binding of the antigen 
and, therefore, named them “complementary deter¬ 
mining regions” (CDRs) in contrast to the surrounding 
framework regions (FRs). 

The work of Chothia and Lesk (1987) extended 
such predictions and pointed out that five out of six 
CDRs, although presenting a very large sequence 
repertoire, would usually adopt a limited set of back¬ 
bone conformations, termed “canonical structures” 
(CSs). Such conformations can be predicted, as 
shown in the following section, by considering the 
loop length and some specific signature residue posi¬ 
tions, thus allowing an efficient prediction of the 
three-dimensional structure of Ig molecules. 

The architecture of antibodies, in which variable 
regions are interspersed between regions that are 
extremely conserved both in sequence and in structure, 
allowed the development of several unified numbering 
schemes for their sequences (Kabat and Wu 
1991; Al-Lazikani et al. 1997; Lefranc 1999; 
Abhinandan and Martin 2008). Such numberings 
unambiguously define portions or specific residues of 
immunoglobulins that have a similar position in the 
three-dimensional structure. For sake of brevity only 
the Kabat-Chothia (KC) numbering scheme that is 
commonly adopted in antibody modeling is described. 
In the KC scheme, each residue of the variable domain 
of both heavy and light chain is identified by a different 
sequential number with the only exception of 
insertions that may occur in the CDRs or in some 
positions in the FRs, which are identified by the 
number of the position in which the insertion occurs 
followed by consecutive letters. Figure 2 summarizes 
this scheme. It is worth mentioning that some 
conserved FR residues are present in almost all the 



Modeling of Antibody Structures, Fig. 1 Two views of the 
Ig variable domain. Light chain is depicted in red {light red: 
frame work,/«// red: hypervariable loops), heavy chain in green 
(light green: framework,/ k// green: hypervariable loops). Panel 
A reports a lateral view (antigen should be located on the topmost 
part of the Ig molecule), panel B depicts the antigen view of the 
same structure 

immunoglobulins. Such residues are extremely useful 
in identifying the various regions of the immunoglob¬ 
ulins and are reported in red in the figure. 


Canonical Structures 

In their definition of the “hypervariable loops” Chothia 
and Fesk (Chothia et al. 1989) carried out an analysis 
to determine the relationship between the amino acid 
sequences and the three-dimensional structures of the 
antigen-binding sites. First of all, they discovered that 
five among six hypervariable regions usually adopt just 
a small number of discrete main-chain conformations. 
Moreover, they identified relatively few residues that, 
through their packing, hydrogen bonding or ability to 
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Modeling of Antibody Structures, Fig. 2 Kabat-Chothia numbering of VK, VL, and insertions. Kabat definition of FRs and CDRs are depicted in yellow and gray, respec- 

VH. The numbers above the sequences represent the KC numbering of specific residues, tively; Chothia and Lesk definition of hypervariable loops is indicated by arrows. 
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assume unusual tp, v|/, or co conformations, are primar¬ 
ily responsible for the main-chain conformations of the 
hypervariable loops. These residues are found within 
the hypervariable regions as well as in the conserved 
P-sheet framework. These classes of commonly occur¬ 
ring main-chain conformations of the hypervariable 
regions, identified by the length of the loop and by 
specific key residues, are named “canonical 
structures.” 

The H3 loop is difficult: it is the most variable loop 
both in terms of sequence and structure, and in the 
majority of solved antibody-antigen complexes, 
several of its residues are in contact with the antigen. 
A sequence-structure relationship can only be found 
for the main-chain conformation of the 10 residues 
(four residues from the N terminus and six residues 
from the C terminus) of the loop proximal to the 
framework (the “torso” of the loop). 

Several investigations have been carried out to 
extend the library of canonical structures. To date, it 
has been estimated (Al-Lazikani et al. 1997; Morea 
et al. 1998; Chailyan et al. 2011a) that approximately 
80% of LI, L2, L3, HI, and H2 loops adopt some type 
of canonical structure. The expected RMSD between 
the main-chains of a target loop and a template loop 
with the same type of canonical structure is about 
0.7 A, and typically not exceeding 1.0-1.2 A RMSD. 

Using the current definitions (Table 1) when a target 
loop matches a known canonical structure and the loop 
template is extracted from a high-resolution, 
well-refined structure, one can expect the backbone 
of the framework to be correct within 1.0 A root 
mean square deviation; the backbone of the predicted 
loops to differ by about 0.7 A RMSD on the average, 
and no more than 1.0-1.2 A in all cases. In the H3 case, 
specific database search procedures can be used to 
predict the main-chain conformation of the head 
regions in many cases (Morea et al. 1998). Such 
methods usually give good results for short and 
medium size loop (up to 15 residue long H3), but for 
longer loops the deviation is usually exceeding the 2 A 
RMSD for the backbone. 

Automated Antibody Modeling Protocols 

Antibody modeling, design, engineering, and humani¬ 
zation partially or completely rely on the definition and 
ability to predict canonical structures. Given an 


Modeling of Antibody Structures, Table 1 Canonical 
structure definitions 


Loop 

CS 

Length 

Constraints 

kLI 

1 

6 

29 VIL 


2 

7 

29 VIL 


3 

13 

29 VIL 


4 

12 

29 VIL 


5 

11 

29 VIL 


6 

8 

29 VIL 

kL2 

1 

3 


kL3 

1 

6 

90 QNH 95 P 


2 

6 

90 Q 94 P 


3A 

5 

90 Q 91/92 G 96 P 


3B 

5 

90 Q 91/92 ST 96 P 


4 

4 

90 Q 


5 

7 

90 Q 95A P 


6 

5 

90 Q 94 L 


7 

5 

94 P 


8 

8 

90 Q 95A P 

kLI 

1 

10 

25 G 


2 

11 

25 G 31 FHY 66 K 90 S 


3 

11 

66 L 90 L 


4 

8 

28 VIL 66 ST 


5 

10 

25 R 28 G 


6 

11 

25G31 ND 66 K 90S 


7 

8 

28 VIL 66 N 


8 

9 


kL2 

1 

3 



2 

7 


kL3 

1 form A 

6 



1 form B 

6 



1 form C 

6 



1 form D 

6 



2 

8 



3 

7 

92 D 95 ST 


4 

7 

95 not ST 

HI 

1 

7 



2 

8 



3 

9 


H2 

1 

3 



2 

4 

71 AVL 


3 

4 

71 RK 


4 

6 


H3 

Bulged 

>10 

94 RK lOlD 


Non-bulged 

>10 

94 not RK 


Short 

10 



For each hypervariable loop {leftmost column), the known 
canonical structures {CS column) are defined according to 
the length of the loop {Length column) and specific key resi¬ 
dues {Constraints column). Loop lengths and key residues 
numbering follows the Kabat-Chothia scheme (see Fig. 2 for 
details) 
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unknown antibody sequence, there is a high probabil¬ 
ity that the hypervariable loops LI, L2, L3, HI, and H2 
have a main-chain conformation that corresponds to 
a known canonical structure, and that the torso region 
of H3 has a conformation that corresponds to one of the 
conformations observed in known structures. The 
framework regions are generally very highly 
conserved in sequence and structure and can be 
modeled accurately with relatively little difficulty. 
Thus, a reasonable model can be built for the frame¬ 
work and hypervariable loops using homology 
modeling technique. 

Besides the canonical structures approach 
described above, there are knowledge-based 
methods which use other antibody-hypervariable 
loops as templates, selecting them on the basis of 
their length and maximum sequence identity (rather 
than on the presence of key residues) or loops taken 
from either antibodies or other proteins selected on 
the basis of their length and of the structural similar¬ 
ity with the regions flanking the loop (database 
screening methods). Ab initio methods do not rely 
on the presence of a template loop in the database, 
and therefore, they are of general use. Their major 
limitation is that, due to limited comprehension of 
the physicochemical principles governing protein 
structures, the energy functions used to evaluate the 
different conformations often do not distinguish 
a correct prediction from an incorrect one, even 
when additional criteria are used to filter the results. 
Approaches combining knowledge-based and ab 
initio methods have also been described (Martin 
et al. 1989). 

Historically an early “hybrid” approach has been 
applied in modeling known antibody structures. 
Eigenbrot’s technique involves the initial derivation 
of an energy-minimized average framework region 
from selected homologue counterparts. Then loops 
are modeled on matching CSs solved CDRs (that 
might differ from the framework templates) and 
matching length CDR-H3 with the highest percent¬ 
age of sequence identity is grafted. Insertions and 
deletions are handled by acting on surrounding resi¬ 
dues (p/\|/ angles so as to preserve the molecular 
geometry. Finally the model is subjected to energy 
minimization. This process has been automated in the 
ABGEN algorithm (Mandal et al. 1996). AbM soft¬ 
ware is based on the algorithm CAMAL (Martin et al. 
1989) and adopts an alternative strategy in CDR-H3 


modeling. AbM combines a knowledge-based 
approach with an ab initio approach (CONGEN algo¬ 
rithm). An updated version of AbM was launched in 
2000 as an online service called WAM (Web Anti¬ 
body Modeling) (Whitelegg and Rees 2000). WAM 
CDR-H3 modeling protocol plays on the recognition 
of two possible conformational classes of the torso 
section (four residues from the N terminus and six 
residues from the C terminus): bulged (or kinked) and 
non-buldged (or extended), respectively depending 
on whether or not residues Asp-101H and Lys/Arg- 
94H interact through hydrogen bonds (Morea et al. 
1998). 

In general, knowledge-based methods and 
threading algorithms, e.g., THREADER (Miller 
et al. 1996), allow a screening step to be set up to 
validate the suitability of a sequence for each 
constructed conformation. This screening may be car¬ 
ried out also using some sort of energy function, such 
as that of RosettaAntibody (Sircar et al. 2009). 
RosettaAntibody is a modeling tool that partially relies 
on the Rosetta software suite, a well-known bioinfor¬ 
matics platform for protein structural prediction and 
design. The method generates an initial homology 
model after selecting different templates for CDRs 
(including H3) and framework. Then the model is 
subjected to Monte Carlo energy minimization cycles 
through the so-called full-atom Rosetta energy func¬ 
tion, affecting chemo-physical parameters such as H3 
conformation, side chain torsion angles and the mutual 
orientation of VL and VH. 

RosettaAntibody does not allow the user to select 
specific templates nor modeling strategies but, given 
a target sequence, returns 10 different models resulting 
from the minimization step of the initial model. In 
contrast, the PIGS (Prediction of Immunoglobulin 
Structure) (Marcatili et al. 2008) web-server, which 
predicts antibody structure using the canonical 
structure method, besides being a fully automatic 
procedure that does not require any user intervention, 
allows the selection of templates as well as of several 
other modeling parameters. PIGS first aligns the 
sequence of each variable domain (VL and VH) of 
the antibody of unknown structure (target) with the 
corresponding variable domain sequences of all the 
immunoglobulins of known structure. The user can 
then select several different combinations of templates 
according to the target characteristics to obtain 
a reasonably good model for the framework and each 
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hypervariable loop. If more than one template is used, 
all the regions copied from the different templates are 
arranged together in order to produce the hnal model. 

Caveats and Open Problems in Antibody 
Modeling 

Even though a large-scale assessment of antibody 
modeling accuracy has not been performed yet, 
a recent blinded study on nine benchmark structure 
sequences (Almagro et al. 2011) showed that antibody 
models produced using the most recent tools, 
RosettaAntibody and PIGS, have a similar expected 
accuracy, that is on average below 1 A RMSD for the 
framework regions and for the hypervariable loops LI, 
L2, L3, HI, and H2. H3 loop prediction yielded RMSD 
values between 1.5 and 5 A. The major sources of error 
in the process are the modeling of the H3 loop, the 
possible changes of conformation occurring upon anti¬ 
gen binding and the reconstruction of the correct 
packing between the VL and the VH domains. 

The first of the aforementioned problems, involving 
the modeling of the H3 loop, has been tackled in many 
different ways, but still its accuracy is far below that of 
the rest of the models. It is worth mentioning that this 
problem is crucial in modeling camelid single-domain 
antibodies, that lack light chains and usually display 
extremely long H3 loops with a characteristic confor¬ 
mational repertoire (usually H3 bends over to occupy 
the VH region that is normally in touch with the VL 
domain). Another open issue is the effect of antigen 
binding on the conformation of the CDR loops. Even 
though most of the loops display just a minor distortion 
upon antigen binding, H3 loops (especially longer 
ones) can assume quite different conformations in the 
apo and holo forms. It is therefore important to include 
this kind of information in the modeling process, and 
currently it cannot be achieved using the automatic 
procedures described above. 

The last point regards the correct packing between 
VL and VH domains. A recent paper (Chailyan 
et al. 2011b) describes the existence of at least two 
different packing modality that have a strong impact 
on the shape of the antigen-binding site and on the 
antigen specificity. This becomes relevant if two 
different solved structures are used as template for 
the framework of the two chains; in this case the 
regions copied from the two templates need to be 


packed together in order to obtain the final model. 
This process may introduce some small deviations 
between the model and the real structure. In order to 
avoid this, the user can choose whether to use, where 
possible, the same template (usually with a lower 
sequence similarity) for several loops and/or frame¬ 
works, thus minimizing the number of superpositions 
needed to build the final model. Such a choice is not 
trivial and depends on the existence of a suitable tem¬ 
plate with a good sequence identity. In a typical 
scenario, if the templates are selected using the highest 
similarity criteria, they may come from different 
antibodies; therefore, the method has to pack together 
all the regions copied from different templates and this 
may introduce errors. At the same time, if the same 
template is used for different regions it may have a low 
sequence identity to the target and, therefore, a lower 
expected similarity to the target. 

Conclusions 

Antibody modeling techniques have proved to be reli¬ 
able enough to be used in a very large number of 
applications, such as protein engineering (Morea 
et al. 1997) and docking (Pedotti et al. 2011). Using 
the many available resources (Marcatili et al. 2008; 
Sircar et al. 2009; Chailyan et al. 2012) such models 
can usually be obtained in a completely automatic 
fashion even by non-expert users. Nevertheless, one 
should remember that a number of choices can affect 
the final quality of the models: the modeling of side 
chains, of VL/VH packing, and of H3 loop are still 
open issues. 

Cross-References 

► Homology Modeling of Protein Structures 
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Synonyms 

RNA 

Definition 

Modeling of Ribo-Nucleic Acid (RNA) molecules is 
the determination/prediction of secondary and tertiary 
RNA structures by using experimental or theoretical 
approaches. 

Introduction 

Until recently, RNA has been considered a transfer 
molecule in the genetic information flow process. 
However, the myriad of noncoding RNAs recently 
discovered has shown a plethora of new functions 
that are now attributed to it. These functions include 
among others biological catalysis (e.g., ribozymes), 
gene regulation (e.g., micro-RNAs), or host defense 
(e.g., small interfering RNAs). Knowledge of the sec¬ 
ondary and tertiary structures of such RNA molecules 
will provide insights on the molecular mechanisms 
of their diverse functions. X-Ray crystallography, 
nuclear magnetic resonance (NMR) spectroscopy, and 
single-particle electron microscopy are the main 
biophysical techniques used for determining RNA sec¬ 
ondary and/or tertiary structures. Moreover, theoretical 
approaches are now widely used for predicting the 
folding of new RNA sequences. Next, we briefly intro¬ 
duce the molecular structure of RNA as well as how 
computational approaches can be used for modeling its 
secondary and tertiary structures. 

RNA Structure 

RNA is composed of riboses, phosphates, and aromatic 
bases (Neidle et al. 2009). Riboses and phosphates are 
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connected through the phosphodiester bond, making 
a backbone from which the aromatic bases are attached 
in a regular fashion (Fig. la). RNA backbone confor¬ 
mation can be described in terms of the torsion angles 
a, P, y, 5, £, and ^ (Fig. lb) while the ribose confor¬ 
mation is determined by the endocyclic torsion angles 
Vo to V 4 (Fig. Ic). As it is being synthesized, RNA 
molecules fold mainly by the driving forces of 
hydrogen bonding and stacking interactions. Long 
stretches of canonical Watson-Crick base pairs result 
in an antiparallel double helix. Flowever, each base 
has three sides (i.e., Watson-Crick, Hoogsteen, and 
sugar sides) that can pair resulting in 28 different 
base pairing possibilities between two nucleotides 
(Leontis and Westhof 2001). Nevertheless, canonical 
helices are maintained by Watson-Crick base pairing 
(Fig. Id). Additionally, nucleotide bases may also 
interact with the ribose or phosphate atoms, which 
may result in special RNA geometries. In contrast to 
DNA, this plethora of possible pairwise interactions 
between bases results in RNA adopting complex 
three-dimensional (3D) structures. A base pair in 
RNA is maintained by a minimum of two hydrogen 
bonds between the paired nucleotide bases. The com¬ 
bination of base pairs in an RNA molecule conforms 
the so-called secondary structure, which is composed 
by stems (double helices), loops, bulges, stem junc¬ 
tions, and pseudoknots. The final 3D RNA structure 
is maintained by tertiary interactions, including 
loop-loop interactions, stem-loop interactions, coax¬ 
ial stacking, and triple and quadruple helices (Moore 
1999). In recent years, the number of deposited RNA 
structures in the protein data bank (PDB) has exponen¬ 
tially increased (Fig. 2). Therefore, a plethora of new 
tools and databases for classifying the structures of 
RNA have flourished. For example, the nucleic acid 
database (NDB) stores nucleic acid structures using 
the same dictionaries and formats proposed by the 
Worldwide PDB, which includes information such as 
backbone torsion angles, base pair parameters, and 
hydrogen bond classification. The structural classifica¬ 
tion of RNA (SCOR) database organizes and classifies 
the RNA structures using a directed acyclic graph. 


RNA Alignment 

As RNA structure is more conserved than sequence 
(Capriotti and Marti-Renom 2010), structure/sequence 



Modeling of RNA Molecules, Fig. 1 The nucleotide unit, 
(a) Atomic resolution structure of a pair of RNA nucleotides 
showing the standard atom numbering, (b) Backbone torsion 
angles marked by gray arrow across a linear backbone represen¬ 
tation. A torsion angle is the angle between two planes formed by 
four consecutive atoms, (c) Endocyclic torsion angles in sugar 
rings. Same representation as in panel b. (d) A canonical 
base pair in RNA showing the different nucleotide edges 
(i.e., Hoogsteen, Watson-Crick, and Sugar edges) 


alignments can provide insights into RNA function. 
Moreover, RNA sequence alignments can be used for 
detecting homology. Standard methods such as 
BLAST, PSI-BLAST, CLUSTAL-W, MUSCLE, or 
T-Coffee can be used for aligning a pair of RNA 
sequences. However, sequence-based alignment 
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Year 


Modeling of RNA Molecules, Fig. 2 Yearly growth of RNA 
structure deposition in the PDB database between 1978 and 
2010. The red curve shows all entries in the PDB with at least 
one RNA structure, the orange cun’e shows only entries that 
were determined by X-Ray crystallography and yellow cun’e 
shows those determined by NMR spectroscopy 

programs that take into consideration RNA base 
pairing are likely to produce more accurate alignments. 
Such methods can broadly be divided into two groups: 
programs that predefine or predict secondary structure 
from an RNA sequence and then use this information 
for searching against a database of sequences, and 
programs that simultaneously predict and align sec¬ 
ondary structures or RNA sequences. The searching 
algorithms used by those methods are varied and 
include, among others, hidden markov models 
(HMM) and stochastic content free grammar (SCFG). 
For example, the RSEARCH program uses SCFG to 
build a tree-like structure encoding features from the 
query sequence that are later aligned against a RNA 
sequence database using a dynamic programming 
algorithm. The INFERNAL program uses a special 
type of SCFG approach, called covariance models, 
that combines sequence consensus and RNA second¬ 
ary structure consensus to search for homologous RNA 
sequences. The INFERNAL program has previously 
been used to build the Rfam database. 

RNA Secondary Structure Prediction 

RNA folding is a hierarchical process where the ter¬ 
tiary structure is mainly determined by the secondary 
structure, which is in turn determined by the 


nucleotides in the RNA primary sequence. Therefore, 
predicting the secondary structure of an RNA 
sequence can prove very useful for gaining insight 
into its tertiary structure and its function (Mathews 
et al. 2010). Algorithms for modeling the secondary 
structure of an RNA sequence were initially based on 
the so-called minimum free-energy (MFE) calcula¬ 
tions, which assumed a unique and stable native 
RNA conformation (Mathews and Turner 2006). Eor 
example, the RNAfold and UNAfold programs make 
use of the MFE approach. However, MFE does not 
guarantee finding the RNA “native structure” and 
suboptimal secondary structures need to be deter¬ 
mined based on RNA shape analysis. To solve this 
limitation, more recent approaches use genetic algo¬ 
rithms or statistical samplings of known RNA second¬ 
ary structures. Unfortunately, these algorithms cannot 
predict the so-called RNA pseudoknots, a nucleic acid 
secondary structure that contains at least two stem- 
loop structures in which the loop of a stem-loop base 
pairs with another part of the same RNA molecule, 
therefore forming a stem inside the original loop. Such 
limitation has partially been addressed by the devel¬ 
opment of methods that use multiple sequence infor¬ 
mation. Broadly, multiple sequence-based methods 
rely on the fact that when a single nucleotide is 
mutated in an RNA stem, a compensatory mutation 
on the interstrand corresponding nucleotide has to 
occur to maintain the overall secondary structure. 
Different algorithms align multiple sequences to 
detect such sequence co-variation and to predict the 
secondary structure of an RNA query sequence. Two 
examples are the RNAalifold program that combines 
MFE, partition function and co-variation, and the 
Pfold program that folds alignments using a SCEG 
approach. 

RNA Tertiary Structure Prediction 

Recently, there have been significant advances in auto¬ 
matic RNA 3D structure prediction (Shapiro et al. 
2007). For example, the MANIP program assembles 
separate structural parts into a complete 3D model by 
using RNA structural motifs from a pre-ensemble data¬ 
base. MANIP automatically recognizes and displays 
allowed and non-allowed hydrogen bonds between 
nucleotides in the RNA sequence. Then the refine¬ 
ment protocol incorporates canonical as well as 
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noncanonical base pairing constraints together with 
restraints imposed by covalent geometry, stereochem¬ 
istry, and van der Waals contacts. The Mc-Sym pro¬ 
gram uses coordinates and relations between residues 
extracted from the PDB and NDB to construct RNA 
3D models. Mc-Sym can apply user-derived restraints 
as well as molecular dynamics simulations to optimize 
the obtained 3D structures. The iFoldRNA program 
uses coarse-grained structural models to perform 
molecular dynamic simulations of RNA structures. 
iFoldRNA has been used to fold RNA molecules 
smaller than 50 nucleotides to near-atomic resolution 
(i.e., 2-5 A root mean square deviations to its native 
structure). It also provides other RNA folding param¬ 
eters such as specific heat, contact maps, simulation 
trajectories, gyration radii, RMSDs from native state, 
fraction of native-like contacts. Finally, the NAST 
program uses knowledge-based coarse-grained 
dynamics engine for modeling RNA structures. 
NAST allows the end-user to provide secondary or 
tertiary experimentally derived restraints to filter the 
predicted 3D models. 

For further information, the URL http://marciuslab. 
org/rna compiles an exhaustive list of databases and 
programs to model RNA secondary and tertiary 
structures. 

Cross-References 

► Protein Secondary Structure Prediction in 2012 

► Structural Genomics 

► Protein Structure Comparison Methods 
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Synonyms 

Action potentials; Cardiac physiome 

Definition 

Quantitative prediction over multiple space and 
time scales using computer models of the electrical 
activity in the mammalian heart, based on membrane 
and intracellular ion transport and binding dynamics, 
digital histology, and three-dimensional cardiac 
anatomy and architecture. 

Introduction 

The normal rhythmic beating of the heart is triggered 
by repetitive waves of depolarization (propagating 
action potentials) that spread from the pacemaking 
sinoatrial node, through the atria and conducting 
system (atrioventricular node. Bundle of His and 
Purkinje hbers) to the ventricular tissue, producing 
the synchronized atrial, followed by ventricular, 
contractions that form one cycle of the heart beat. 
Calcium entry during depolarization triggers calcium 
induced calcium release from the intracellular 
sarcoplasmic reticulum, and the rise in intracellular 
calcium triggers contraction. The essential features 
are autorhythmicity, the isolated heart continues 
to beat, and the synchronization produced by fast 
(up to 1 m/s) propagation. The rhythmic pumping 
drives the circulation, and loss of synchronous cardiac 
contractions, as in cardiac arrhythmias, can 
rapidly lead to death. Computational modeling of the 
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interactions among a few dozen membrane proteins - 
ligand- or voltage-gated ion channels that are 
characterized by their ionic selectivities, kinetics 
and pharmacology, ion pumps and exchangers, and 
connexins forming intercellular gap junctions; 
and intracellular ion binding, sequestration and 
release, within the tissue architecture and geometry 
of the heart can account for these processes, and 
predict the cell, tissue and whole heart level effects of 
differences in specific channel kinetics or densities that 
result from gender, remodeling, mutations, drugs, or 
disease (Panfilov and Holden 1997; Clayton and 
Panfilov 2008). 

Cardiac Membrane and Cell Excitation 

A single cardiac myocyte, isolated hy enzymic diges¬ 
tion, or a small spheroid of cells growth in gyratory cell 
culture, or a small ball of myocardial tissue are all too 
small for membrane potential V to change with 
distance, and so are effectively isopotential, with 
dV/dt — —/io„(V)/C, where (V) is a function 
representing how the total ionic current depends on 
voltage, and C the capacitance, of either a cell or 
a unit area of membrane. This can be obtained by the 
analysis of macroscopic currents obtained from whole 
cell microelectrode recording techniques. For 
autorhythmicity or excitability,/(F) is nonlinear, with 
the characteristics of a cubic. If the equilibrium solu¬ 
tion at which dV/dt = 0 is stable (all the eigenvalues 
have negative real parts) the system is excitable, and 
the threshold is at the negative slope intercept of the 
zero current axis. If the equilibrium solution is unstable 
and a complex conjugate pair of eigenvalues have 
positive real parts then the solution is oscillatory and 
the system is autorhythmic (pacemaking). Whether 
a myocyte is pacemaking or excitable depends on the 
stability of its equilibrium solution. The stability 
depends on the parameters that control current 
magnitudes (maximal channel conductances/channel 
densities), and parameters that determine the 
channel gating kinetics. 

The first model of a cardiac action potential in 
terms of ionic mechanisms (Noble 1962) was 
a modification the Hodgkin-Huxley equations to 
reproduce the prolonged Purkinje fiber action 
potential, and its automaticity. Cardiac action potential 
models are reconstructed from a number of 


membrane ionic currents, through channels, pumps, 
and exchangers. A current is characterized by its 
ion-selectivity, pharmacology, kinetics, and voltage- 
or ligand-dependence, recorded under voltage clamp, 
in macroscopic in vitro experiments on small 
isopotential preparations, or in patch-clamp experi¬ 
ments on channels expressed in amphibian eggs or 
mammalian cell lines. The cardiac action potential is 
generated by inward {Na'^ and outward (A"^), 

pacemaking, background and stretch-activated 
currents through ion channels (Grant 2009) 
Single channel currents are of the order of a pA, 
i.e., ^10® ions/s, and current flow through an open 
channel is by electrodiffusion, by binding to multiple 
binding sites within the channel. 

The instantaneous current voltage relation can be 
linear, and given by the product of a conductance G 
and a driving force, as background currents, for the 
Na'^ background current: 

^hNa — ^Na) 

and the fast No*' current 

Inu = gNaiy — Vno) = GNatn^hj{V — F]Va). 

The conductance g is the product of the maximal 
conductance G and the fraction that is conducting; this 
can be controlled by separate activation and inactiva¬ 
tion gates, (for m is activation, h inactivation, and j 
slow inactivation; 0 < m,h,j < 1 that can be interpreted 
as the fraction of the gates in a state that allows ion 
flow, or the probability that a gate is “open”). 

When there is a difference in ion concentration 
across the membrane the Goldman-Hodgkin-Katz 
constant field theory equations predict a nonlinear 
rectifying current, (the magnitude of the current for 
a given driving force depends on the direction of ion 
flow current), and this is marked for currents, 

where intra- and extracellular concentrations [Ca^^] 
differ by four orders of magnitude from 

100 nM diastolic to 1 pM systolic, [Ca'^'^Jo ^2 mM). 
For the L-type Ca current Ica.L 


ICa,L = Gca,Ldjfca{WF^/RT) 

x{{[Ca++lexp{2VF/RT) - 0.341 [Cfl++]J } 
/{{exp{2VT/RT)) - 1}, 
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where t/ is a voltage-dependent activation gate, 
/ a voltage-dependent inactivation gate, and fca 
a dependent inactivation gate; F and R are 

Faraday’s and the gas constant, and T the absolute 
temperature. The 0.341 comes form the ratio of the 
partition coefficients for Ca"^"^. 

The time dependence of ionic permeabilities and 
conductances with linear or nonlinear instantaneous 
current-voltage relationship is described by the kinet¬ 
ics of the activation and inactivation gating processes 
{m, h,j, d,f, fca in the two examples above), that have 
voltage-dependent macroscopic rate coefficients of 
opening a.(V) and closing P(V) that are described by 
empirical expressions based on curve fitting, e.g., for 
the fast activation of I^a 

an,{V) = !/({! +exp([-56.86-y)/9.03])}^ 

with analogous expressions for fi(V). An alternative 
and equivalent description, in place of the rate coeffi¬ 
cients a(V) and P(V), is by the time coefficient 
x(V) = l/(a.(V) + P(V)), and steady-state value, e.g., 
(V) + P„, (Vj). 

Single file transfer across multiple binding sites 
provides one mechanism for channel rectification. 
The strong inward rectification of channels is due 
to block of the channel by intracellular polyvalent 
cations (Hille 2001), and the inward rectifier K'^current 
Ix] is represented by: 

Iki ^ GkiV{[KI/5A)xkiUV - Ek), 

where x^ioo is a time independent rectifier factor that 
depends on voltage, and ^l([K]J5.4) describes the 
dependence on [KJo- 

Each current is described by terms that give its 
magnitude (maximal conductance G, or product of 
channel density and single channel conductance), gat¬ 
ing dynamics by activation and inactivation variables, 
and instantaneous current voltage relation that have 
been obtained in extensive series of voltage clamp 
experiments under different conditions. The channel 
density (which can act as a bifurcation parameter) 
fluctuates, due to changes in channel regulation, and 
trafficking and insertion into the membrane, and so can 
be modeled by a stochastic variable. The analysis of 
patch-clamp recordings of single channel activity in 
terms of opening, closing, and open time probability 
distributions gives a microscopic interpretation of a(V) 


and P(V) as probabilities, that are related to the transi¬ 
tion probabilities between multiple states in Markov 
model within kinetic state diagrams of a single chan¬ 
nel. Markov models allow the kinetics of drug-channel 
interactions to be modeled in detail. These equations 
for microscopic, multistate single channel currents and 
macroscopic membrane currents are semiempirical, 
and contain experimentally estimated parameters and 
dynamic variables, on time scales from 10 ps to s. 

There are also semiempirical expressions for pump 
and exchanger currents, and for intracellular ion con¬ 
centration dynamics, and for binding and 

sequestration. The equations for the individual currents 
are constructed from currents recorded under voltage 
clamp, and so will reconstruct these currents. If all the 
membrane currents are summed as lioJV) then 

dV/dt = -Iion{V)IC 

should reconstruct V(t) for the cell. However the model 
will require some informed manipulation, to ensure 
electroneutrality at rest (so at the resting membrane 
potential dV/dt— 0) and electrochemical neutrality (so 
the individual ionic fluxes are all individually equal 
to zero, and there are no slow drifts in intracellular 
concentrations and hence potential). Cardiac cell 
models are in a state of continuing development, and 
as a result of interactions with experiment have under¬ 
gone several paradigm shifts, with new mechanisms 
being incorporated, data being reinterpreted and some 
old mechanisms replaced (Noble and Rudy 2001). 

Families of ordinary differential equation cardiac 
cell models have been developed to describe the volt¬ 
age and time-dependent current/jon that reconstruct the 
action potentials V(t) of different types of cardiac 
myocytes cells of a number of species: mouse, rat, 
rabbit, dog, pig, and man. Cardiac cell electrophysiol¬ 
ogy models, although simple in principle, are compli¬ 
cated because of the large number of dynamic 
variables, with kinetics over several orders of magni¬ 
tude, and large number of parameters, and empirical 
expressions for gating variables, and so need to 
be specified by computer programs if they are to be 
used. A printed description probably contains typo¬ 
graphical errors, or a mismatch between the equations 
described and the revised equations used to generate 
the final figures, and models are generally published 
with the computer code as an electronic supplement, 
and are available in a standard markup language such 
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as CellML (Fink et al. 2010). These stiff (have time 
scales from 100 p-s to 100 ms), high order (have a large 
number of dynamic variables) cardiac cell electrophys¬ 
iology models are solved by numerical integration for 
time spans of seconds (during repetitive activity after 
more than a few seconds there are nonphysiological 
drifts in ionic concentration) to compute V(t), with 
a time step of ^10-100 ps. Computing technology is 
now fast enough for cell V(t) to be computed in close to 
real time on a PC, especially if efficient programming 
methods (adaptive time steps, use of look up tables 
rather than evaluation for the a(V) and P(V)) are used. 

The qualitative nature of solutions ofV(t) need to be 
physiologically appropriate: models for isolated cells 
that are pacemaking (sinoatrial and atrioventricular 
node, Purkinje fibers) need to be autorhythmic, while 
models of atrial and ventricular myocytes need to have 
a stable equilibrium that corresponds to the resting 
membrane potential, and be excitable. Small changes 
in the value of a parameter (say a conductance) can 
change the stability of the equilibrium, producing 
a qualitative change in behavior, and allowing periodic 
solutions to occur. This is a bifurcation, and periodic 
solutions can emerge at Hopf of homoclinic bifurca¬ 
tion points (Ermentrout and Rinzel 2009). Changes in a 
number of the cell model parameters can produce 
periodic solutions, and so pacemaking is an emergent, 
cell level system property, and is not due to a specific 
pacemaking mechanism, such as the hyperpolarization 
activated funny current If, or intracellular Ca"^"^ oscil¬ 
lators, or multiple redundant mechanisms, where 
removal of one mechanism exposes another. 

Figure 1 illustrates numerical solutions for a normal 
human ventricular cell model, and when the parame¬ 
ters have been modified to simulate a mutation that 
produces long-QT syndrome, and the effects of a drug 
whose actions on the channel kinetics are known. The 
action potential solutions for an excitable myocyte can 
be quantified by measures of its threshold (strength 
duration curve) and shape - resting potential, peak 
potential, maximum rates and rise and fall, times at 
which they occur, and action potential duration APD 
(measured as APDgo, time to 90% depolarization), 
and for an autorhythmic cell by its maximum and 
minimum potentials, rate and its response to brief 
perturbations, by a phase resetting curve (Winfree 
2001). These can be compared to experiments, and 
further model validation can be by reproducing the 
effects of drugs, if their actions on membrane channels 


have been described quantitatively. When validated, 
cell models can be used to predict the cell level effects 
of drugs whose channel level actions are known. 
Although antiarrhythmic drugs can be classified 
by what channels they act on (e.g., type la antiarrhyth- 
mics, such as flecainide reduce excitability by blocking 
Ino), many antiarrhythmics act on several ion 
channels (e.g., dronedarone) and can have multiple 
actions (e.g., the /^va blocker flecainide also acts on 
intracellular Ca’^'^ release kinetics) (Li et al. 2010). 

Virtual cells can also quantitatively test if the 
effects of inherited channelopathies (disease caused 
by mutations in genes coding membrane channels) 
account for their electrophysiological symptoms 
(e.g., long or short QT syndromes). Modeling patho¬ 
logical states, e.g., acute ischemia is complicated by 
actions on intracellular metabolism as well as directly 
on the electrophysiological processes being modeled, 
and chronic disease, e.g., atrial fibrillation can cause 
remodeling of the cell ion channels (Zhang et al. 2009). 

If the APD restitution curve has a maximal slope 
>1, then periodic action potentials at rates where 
the slope is >1 are unstable and a period-doubling 
bifurcation into alternans (a periodic sequence of 
long-short APDs) is expected. Alternans in itself is 
benign, but prognostically is a bad sign. The maximal 
slope of the APD restitution curve is used as an 
arrhythmogenic index, and can be computed, or 
estimated experimentally. 

Cardiac Heterogeneity 

Action potentials recorded from myocytes isolated 
from different points in the heart differ qualitatively 
(autorhythmic or excitable) and have quantitative 
differences in shape. Figure 2 illustrates propagating 
action potentials computed at different points through 
the cardiac pacemaking, conduction system and myo¬ 
cardium during normal sinus rhythm, and illustrates 
the wave nature of cardiac propagation. These spatial 
differences in action potential shape can be explained 
by spatial differences in the expression of different ion 
channel proteins. The mRNA abundance and the 
expressed protein density are not semiquantitative 
indices of the density of functionally active channels, 
but provide a guide for segmenting cardiac tissue 
and identifying different types of myocyte. Ten types 
of cardiac myocyte have been described, and 
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Modeling the Heart, Fig. 1 Human mid-myocardial ventric¬ 
ular action potentials computed from ten Tusscher and Panfilov 
2006 model (WT), modified for effect of a mutation in 
the human ether-a-go-go-related gene (hERG), which encodes 
the Kvl 1.1 potassium channel, reduces Ik,, and is responsible for 
LQT2 syndrome (R56Q), and the effects of the hERG channel 
opener NS1643. (a) solitary action potentials, produced by 
a brief depolarizing current pulse; (b) Ik, during these action 


potentials; (c) repetitive periodic activity in normal; (d) alter¬ 
nating periodic activity in mutant model, both produced by 
repetitive stimulation; (e) APD90 restitution curves obtained by 
repetitive stimulation at 1/s for 50 s, then changing the diastolic 
interval and measuring the APD90 of the next action potential; 
and (f) restitution curves obtained by repetitive stimulation 
different fixed rates, and plotting the APD90 of the periodic 
action potentials 


Modeling the Heart, 

Fig. 2 Space-time plot of 
color-coded computed 
membrane potential along 
rabbit cardiac conduction axis 
during (a) normal sinus 
rhythm (b) sinus rhythm 
slowed by incoiporating the 
actions of an acetylcholine 
analogue. The action potential 
is generated in the SAN and 
propagates through the rest of 
the heart toward the 
ventricular epicardium as 
a series of repetitive waves 
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quantitatively modeled for the rabbit heart - cells from 
the center and periphery of the sinoatrial node, atrial 
muscle fast and slow pathways of the atrioventricular 
node and the His bundle), Purkinje hbers and endo-, 
mid-, and epicardial layers of the left ventricle, where 
each cell type has a characteristic electrophysiology 
that is determined by the relative densities of the 
different ion channels. Cardiac tissue action potentials 
reflect the electrotonically smoothed local cell action 
potentials, and so spatial changes in tissue electrophys¬ 
iology could be due to changes in the ratio for mixtures 
of two or more cell types (a mosaic), or to a smooth 
tissue gradient in the expressed channel densities. 
Myocardial tissue also contains inexcitable fibroblasts 
that may be electrically coupled to the myocytes 
(Camelliti et al. 2006). 

Cardiac Propagation 

The 20-100 pm myocardial cells are coupled via 
low resistance gap junctions, and at the mm scale 
propagation appears smooth and can be modeled 
by a continuous, excitable medium represented by 
a reaction-diffusion equation: 

dV/dX = W{DVV)-Iion{V). 

Vis a spatial gradient operator, D is the electrical 
diffusion coefficient tensor (mm^ms~*) that gives the 
spatial scale. Spatial differences in cell electrophysiol¬ 
ogy and protein expression can be mapped as spatially 
varying parameters in the equations for Iion(V). The 
electrical diffusion tensor D is defined within the 
geometry of the heart and changes with location: it is 
determined principally by the average orientation of 
myocytes at any given location. Numerical solutions 
requires a time step determined by the kinetics 
of lioJV) and a space step small enough to ensure 
numerical stability, and convergence of the traveling 
wave solution. These integration steps can be selected 
empirically, by tracking the convergence of the 
solution as the steps are reduced, and are 
typically ^100 ps, ^100 pm 

The conduction velocity of a cardiac action 
potential depends on the membrane conductances, 
and intercellular coupling: via connexins, the proteins 
that form the gap junctions coupling myocytes, and 
perhaps also fibroblasts. Electrical coupling is modeled 


via the diffusion coefficient D: a value of D is 
selected to give the appropriate conduction velocity 
for the excitation model. Cardiac tissue possesses 
a complex fiber-sheet structure, and propagation is 
anisotropic, up to five times faster along the local 
myocyte orientation, and is orthotropic, faster within 
sheets than across sheets. 

One-dimensional models can support solitary wave 
and wave train solutions. At higher rates, the APD is 
shortened, velocity is reduced, and the wavelength 
is shortened. Colliding traveling wave solutions 
annihilate each other; this destructive interference 
results from the refractory period of the action 
potential. Figure 3 illustrates the responses of a 
one-dimensional model to stimulation at different 
times in the wake of an action potential. The vulnera¬ 
ble window is the period after a preceding action 
potential during which a unidirectional wave can be 
initiated, i.e., there is a unidirectional conduction 
block. The initiation of a single solitary wave in 
a one-dimensional ring provides a computationally 
simple model for reentry; such unidirectional 
propagation can only be produced in a homogeneous 
one-dimensional medium if the symmetry is broken, 
say by a preceding action potential. 

Stimulation during the vulnerable period in the 
wake of a plane wave in a two-dimensional medium 
would initiate a pair of spiral waves. The width of the 
vulnerable window and the transmural dispersion in 
APD are indices of pro-arrhythmogenicity, i.e., the 
likelihood of the occurrence of a reentrant arrhythmia. 
Figure 4 illustrates the onset and breakdown of 
ventricular tachycardia into ventricular fibrillation. 

In a two-dimensional medium, the propagation 
velocity depends on the curvature of the wavefront, 
as well as rate of activity, and the wave front can be 
broken at regions of unidirectional conduction block. 
The broken wave may evolve into a high frequency 
circulation of excitation, in the form of an Archime¬ 
dean spiral, around the core. The core acts as an 
organizing center that imposes its rhythm on the rest 
of the tissue as the tip of a propagating wave that 
reenters locations far from the core once every 
rotation. Even in ideal, homogeneous, isotropic 
media, spiral waves need not rotate rigidly around 
a circular core, but their tip can meander in 
a complicated biperiodic trajectory, and in heteroge¬ 
neous media will drift (Mikhailov et al. 1994). 
Meandering, drifting spiral waves provide 
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Modeling the Heart, Fig. 3 Computed response of human 
virtual ventricular wall to localized stimulation, (a-c) space 
time plots, with a SI stimulation at t = 0 on the endocardial 
border producing an action potential that propagates through the 
wall from left to right at a constant velocity. A second stimulus 
S2 at the * produces (a) no response: the stimulus is in the 


refractory period (b) bidirectional propagation and (c) unidirec¬ 
tional propagation, i.e., the stimulus is in the vulnerable window, 
(d-f) the transmural distribution of action potential duration 
and width of vulnerable window, for normal (WT), LQT2 
mutant (R56Q), and hERG channel opener (NS1663) modified 
tissue 


a conceptual model for self-limiting episodes of 
reentrant arrhythmias, as the the entry will terminate 
when the tip reaches inexcitable boundaries - the heart 
surfaces, or great veins and arteries. A spiral wave can 
be pinned onto a compact heterogeneity, such as 
a coronary blood vessel, if it is sufficiently large. Spiral 
waves in homogenous media can be stable, or, for 
some sets of ion channel excitation parameters, break 
down into spatiotemporal irregularity. Heterogeneity 
enhances spiral wave instabilities. 

The three-dimensional generalization of a spiral is 
a scroll, in which the reentry propagates around an 
extended, curved, meandering, and drifting filament. 


There are intrinsic 3D instabilities, and even in homoge¬ 
nous media, some sets of ion channel parameters produce 
stable spiral waves in 2D and unstable scroll waves in 3D. 

The breakdown of spiral or scroll waves into 
irregularity provides mechanisms for the development 
of fibrillation. 

Cardiac Tissue Architecture 

The voltage diffusion tensor D can be estimated from 
diffusion tensor magnetic resonance imaging 
(DTMRI), in which a voxel averaged diffusion tensor 
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Modeling the Heart, 

Fig. 4 Computational 
modeling of human 
ventricular fibrillation. 

(a) clinical electrocardiogram 
of spontaneous onset of an 
episode of ventricular 
fibrillation (VF) from normal 
sinus rhythm (NSR) through 
ventricular tachycardia (VT) 

(b) 2D cartoon and 
representative time series of 
snapshots of repetitive plane 
waves of NSR, a rotating 
spiral and its core for VT, and 
spatiotemporal irregularity of 
VF (c) analogous snapshots 
isolated slab of human left 
ventricular free wall 

(d) snapshots of color-coded 
potential, the V=—40 mV 
isosurface and filaments 
during VF in the anisotropic 
geometry of a rabbit heart 
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VT 
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is calculated from the signal attenuation and the 
intensity of magnetic gradient applied during 
a diffusion weighted spin-echo experiment. The tensor 
is symmetric, and so the 3x3 matrix can be 
represented by the 3 real eigenvalues and the 3 orthog¬ 
onal eigenvectors. The tensor for each voxel can be 
imagined as an ellipsoid whose axes are orientated 
along the eigenvectors, extended ellipsoids correspond 
to linearly extended anisotropic diffusion, flat ellip¬ 
soids to planar, orthotropic diffusion and spherical 
ellipsoids to isotropic diffusion. The largest (primary) 
eigenvector has been validated provides a measure of 
ventricular local average myocyte orientation within 
the ventricular wall of several species, and secondary 


and tertiary eigenvectors provide an index of any sheet 
orientation (Scollan et al. 2000). Figure 5 presents fiber 
and sheet angles reconstructed from a DTMRI data set 
and illustrates the transmural change in hber angle 
through the ventricles, and the more irregular sheet 
structure. The fractional anisotropy provides 
a measure of the tissue architecture, and can aid 
segmentation of the data set. These data sets provide 
the vector components of the diffusion tensor D for 
virtual cardiac tissues. Cardiac DTMRI data sets take 
^10 h to acquire, and so have been obtained for post¬ 
mortem hearts, and a spatial resolution ^100 pm can 
be achieved on a 9.4 T research instrument, fine 
enough for computing propagation. 
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Modeling the Heart, Fig. 5 Visualization of fiber and sheet 
orientation, and a measure of anisotropy, for a 200 pm cubic 
voxel digital data set reconstruction of a rabbit heart obtained by 


DTMRI (a) long axis, (b-d) short axis slices. All angles are 
defined relative to a line through the centroids of the left 
ventricle 






Clinical cardiac magnetic resonance imaging 
provides two-dimensional slices in any selected 
plane, or three-dimensional data sets of cardiac 
geometry, at a spatial resolution of 2-3 mm (Locki 
et al. 2009), at selected times in the cardiac cycle. 
Patient specific ventricular geometry reconstructed 
from clinical imaging does not include any informa¬ 
tion about anisotropy or orthotropy: For patient 
specihc modeling this could be introduced by rule 
based methods, as the smooth, tranmural change 
of ^120° in fiber helix angle is found in all normal 
mammalian hearts. 

Computational Systems Biology of the Heart 

Myocyte excitation and action potentials are cell level 
properties that emerge from the interactions between 
membrane and intracellular ion transport processes, 
and the computed effects of drugs on cell behavior 
could be tested by experiments on cells. Synchroniza¬ 
tion and propagation are tissue level effects, and 
models allow the quantitative effects of different 
mechanisms to be dissected separately in space and 


time. The effects of changed channel properties (due to 
mutations or drugs) rate, curvature, anisotropy, and 
spiral wave meander patterns can be computed in 
isolation for homogenous virtual tissues. These quan¬ 
titative spatiotemporal predictions of virtual behaviors 
cannot be tested experimentally, as all real cardiac 
tissue has an architecture and is heterogeneous. 
Figure 6 illustrates the visualization, using voltage 
sensitive dyes, of surface and subsurface electrical 
activity of an isolated perfused heart, where contrac¬ 
tion has been blocked by an excitation-contraction 
decoupler (Efimov et al. 2004). Such optical 
mapping experiments can provide a partial quantitative 
validation of predicted three-dimensional spatiotem¬ 
poral activity during normal sinus rhythm, or solitary 
or periodic propagating action potentials produced by 
stimulation. They can only provide a semiquantitative, 
statistical validation of the spatiotemporal irregulari¬ 
ties of computed activity during fibrillation (Biktashev 
and Holden 2001). 

The myocardium is not just composed of myocytes 
there are also fibroblasts, connective tissue and 
intruding blood vessels, and perhaps scar tissue 
resulting from earlier damage. All these can contribute 
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Modeling the Heart, Fig. 6 Activation time and action 
potential duration obtained from optical mapping of surface 
spatiotemporal activity in response to a brief suprathreshold 
stimulation in isolated, perfused rat heart, contraction blocked 
by and voltage imaged using Di-4-ANEPPS (c) computed 
activation time and (d) action potential duration 


to non-excitable granularities, with length scales 
from ^100 |J,m to cm, which can interfere with 
propagation, either by producing a site for wave break¬ 
ing or for pinning the free ends of reentrant waves. 
Further functional heterogeneities emerge if excitation 
is coupled to contraction in electromechanical models 
(Hunter et al. 1998) with mechano-electric feedback 
(Kohl and Ravens 2003). 


with the excitation described by membrane ion transport 
and intracellular sequestration and binding. Physiologi¬ 
cal and pathological patterns can be explained in terms 
of nonlinear wave properties - the dependence of veloc¬ 
ity on rate by nonlinear dispersion, and breakdown from 
spatiotemporal patterned activity into irregularity by 
interactions between waves and by changes in wave 
stability. The anatomy of the heart - its size, shape, 
and organization into chambers and conducting path¬ 
ways - provide the physical structure within which 
propagation occurs, and can be incorporated as a data 
set obtained from DTMRI. A biophysically and anatom¬ 
ically detailed computational model of mammalian 
heart electrophysiology can describe the electrical activ¬ 
ity of the heart, how it is altered by drugs, or mutations 
in ion charmels. Such quantitative predictions can be 
partially tested by optical recordings of activity in iso¬ 
lated, perfused hearts. The virtual human heart can be 
used to link noninvasive clinical recordings with models 
of the underlying cellular electrophysiology. Normal 
sinus rhythm, and arrhythmias, are cm length scale 
and 1-1,000 s time scale phenomena that emerge, at 
a tissue level, from the spatially heterogeneous cellular 
electrophysiology and tissue architecture, and the 
effects of changes in the ms-100 ms kinetics of ion 
transport processes on the initiation and persistence of 
arrhythmias can be computed. 


Cross-References 

► Analysis of Macroscopic Currents 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 

► Fluorescence Techniques for Studying Ion Channel 
Gating; VCF, FRET, and LRET 

► Gap Junction Proteins (Connexins, Pannexins, and 
Innexins) 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Voltage-Gated Sodium and Calcium Channels 


Summary 

The normal sinus rhythm of the heart, reentrant arrhyth¬ 
mias and hhrillation can all he computed in terms of the 
propagation of nonlinear waves in an excitable medium. 
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Definition 

Molecular dynamics simulation is a computational 
method to simulate the motion of a collection of 
atoms numerically. From the resulting trajectory of 
all atoms, thermodynamic and dynamic properties 
can be calculated. This is a very powerful and 
universal methods for models at the level of atoms 
that is been widely used to study lipids and membrane 
proteins. 

Introduction 

Computer simulations of lipids have become a powerful 
method to study in atomistic detail a broad range of 
problems involving lipids and membrane proteins. The 
first simulations 30 years ago studied highly idealized 
model systems over picosecond timescales and nano¬ 
meter length scales and required dedicated access to 
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Molecular Dynamics 
Simulations of Lipids, 

Fig. 1 Left: Membrane model 
of pure POPC. The lipid 
chains are shown in yellow, 
the terminal methyl groups as 
gray spheres, the 
phosphodiester groups in 
orange, and the choline groups 
in red. Water is shown in blue. 
The white rectangle indicates 
the simulation cell, while 
atoms outside this cell are 
periodic images of the atoms 
inside the cell. Right: Model of 
an entire vesicle in solution 
(Courtesy of Dr. Jelger 
Risselada) 



very rare supercomputers. As computers have become 
faster and simulation software both more sophisticated 
and easier to use, simulations can now be fruitfully 
incorporated in a rapidly growing number of research 
projects in areas of biochemistry, biophysics, and cell 
biology involving lipids. In this entry, a brief overview 
is given of the history of membrane simulations, the 
molecular dynamics method and some of its choices, 
and some examples of applications. 

History 

Biomolecular simulation goes back to the 1970s with 
the development of the first empirical force fields to 
describe the conformations and vibrations of simple 
molecules (Leach 2001). The field rapidly grew, 
including protein simulations, the first surfactants, 
polynucleotides, and, in 1988, the first bilayer 
membrane: a model system of decanol/decanoate/ 
sodium/water that exhibits much of the general behav¬ 
ior of more familiar biological membranes (Egberts 
and Berendsen 1988). From these early studies, it 
became clear that molecular dynamics (MD), subject 
to certain limitations, can give detailed insights into 
the motions of lipids and other biomolecules 
(van Gunsteren et al. 2006). 

In the 1990s, generally the focus was on bilayer struc¬ 
ture, although the first simulations studying interactions 
of peptides and membrane proteins with lipids as well as 
small molecules with membrane also appeared (Tieleman 
et al. 1997). Toward the end of the 1990s, membrane 


protein simulations became common, driven in part by 
major progress in crystallography of membrane proteins 
and in part by more powerful computers and better soft¬ 
ware (Forrest and Sansom 2000). Over the past 10 years, 
a truly impressive breadth of systems has been studied by 
molecular dynamics simulations, ranging from ever more 
detailed comparison to experimental stractural studies of 
pure lipids to simulations of very complex membrane 
protein systems. Although in 1994 (Pastor 1994) and in 
1997 (Tieleman et al. 1997) it was still possible to write 
comprehensive review articles on membrane simulations, 
this is no longer the case. Some examples of recent 
reviews include overviews of simulations of membrane 
proteins (Lindahl and Sansom 2008), large-scale lipid 
simulations (Marrink et al. 2009), and the effects of 
cholesterol on membrane structure (Berkowitz 2009). 
These represent only a fraction of the available reviews, 
however. Figure 1 shows an example of a simulation of 
a small patch of a l-palmitoyl-2-oleoyl-sn-glycero- 
3-phosphatidylcholine (POPC) bilayer, together with 
a simulation of an entire vesicle. 

Methods 

Potential Function 

In molecular dynamics simulations, all atoms in the 
system are treated classically, using empirical functions 
that approximate the exact quantum mechanical inter¬ 
actions but are easily calculated in a computer for thou¬ 
sands to millions of atoms. Interactions between atoms 
are divided in nonbonded interactions between any pair 
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of atoms except those closely connected by bonds and 
bonded interactions between atoms connected by chem¬ 
ical bonds. For the nonbonded electrostatic and van der 
Waals interactions, a partial charge and parameters for 
repulsion and attraction are assigned to each atom. The 
bonded interactions consist of bond, angle, and dihedral 
terms. Bonds and angles are usually described as har¬ 
monic oscillators and dihedral angles are usually 

described by some form of cosine expansion. A typical 
potential function is of the form: 

bonds angles 

+ y(1 +cos(«<^- <^o)) 

dihedrals 

impropers 

+ X! 

non—bonded pairs {ij) 

non—bonded pairs [iJ) *-1 

Here r,y is the distance between atoms i and 7 ; qj is 
the partial charge on atom ;; £ and cr are Lennard-Jones 
parameters related to the well depth and the size of the 
atom, respectively; k^, kg, and are force constants 

for bonds, angles, dihedral angles, and improper dihedral 
angles (a term to enforce certain geometries, such as 
planarity of peptide bonds); n is the dihedral multiplicity 
(the number of minima in the dihedral potential); and d^, 
00 , cpo, and v|/o are equilibrium values for the bond lengths, 
angles, dihedral angles, and improper dihedral angles. 

The precise form of this potential function is 
a choice, and many variations exist. However, the 
form given here is reasonably general and shows 
the most important assumptions that are made: only 
pair-additive interactions are taken into account, atoms 
are represented as point charges (electronic polarizabil¬ 
ity is neglected), and simple analytical forms are used 
for computational efficiency. The precise consequences 
of these assumptions are beyond the scope of the current 
paper, but extensive discussions are available in the 
literature (Leach 2001; Ponder and Case 2003). 

The potential function is very flexible. Although in 
most molecular dynamics simulations every atom is 
explicitly represented, a substantial fraction of lipid 


simulations uses “united atoms,” pseudoatoms 
that incorporate hydrogen atoms attached to aliphatic 
carbons in the carbon atom, so that a CH2 or CH3 group 
is essentially one atom. The same potential function is 
also used in some much coarser models. The MARTINI 
model, for instance, groups on average four 
nonhydrogen atoms together to form one interaction 
site (Marrink et al. 2007). It uses modified parameters, 
but the same overall approach as atomistic simulations. 

Simulations and Boundary Conditions 

Using this potential function, the equations of motion for 
all atoms in the system can be solved by calculating the 
forces on all atoms and integrating in time. In principle, it 
is possible to calculate the complete dynamics of any 
system that can be described in terms of a simple inter¬ 
action potential. The main result of such a calculation is 
a trajectory of all atoms in time: the coordinates and 
velocities of all atoms at any of the integration steps. 
This result can be animated in a movie or used to extract 
average thermodynamic and kinetic properties using sta¬ 
tistical mechanics. If the simulation is sufficiently exten¬ 
sive to sample most important states of the system in the 
simulation, almost all properties of the system can be 
calculated and compared to a wide variety of experi¬ 
ments (see below). 

A simulation is determined by the specific form of 
the potential function plus its corresponding parameters 
(including the exact version number of a set of param¬ 
eters), the simulation algorithms used, and by macro¬ 
scopic boundary conditions. The simulation algorithms 
involve choices regarding the type of integrator, the 
time step used, methods to increase the efficiency of 
simulations by making certain approximations regard¬ 
ing the relative importance of different interactions, and 
algorithms to control the macroscopic boundary condi¬ 
tions such as temperature and pressure. All of these 
should be specified in detail in the “Methods” section 
of simulation studies. There is a very large body of 
literature on technical aspects of membrane simulations, 
for example (Anezo et al. 2003). 

In addition to technical choices, the macroscopic 
boundary conditions are important in comparing simula¬ 
tions to experiment. These boundary conditions include 
the number of molecules, which is fixed in molecular 
dynamics simulations but can be variable in Monte 
Carlo simulations, which are based on a different method 
to sample conformations of the system of interest; the 
temperature; and the pressure. In bulk solution, this 
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pressure is normally 1 bar unless a comparison to exper¬ 
iments under other pressure is of interest, but membranes 
are defined by both a bulk pressure and a surface tension 
or lateral pressure. In monolayer simulations, the surface 
tension is a key thermodynamic property that can be 
varied experimentally and also in simulations (Baoukina 
et al. 2008). 

The scope of MD simulations has rapidly increased 
over the past 15 years, due to both increasing computer 
power and more efficient software. As of 2011, many 
recent studies consider model bilayers including 
bilayers with membrane proteins on a scale of ca. 
10-nm X 10-nm membrane areas with of the order of 
100,000 atoms on timescales of hundreds of nanosec¬ 
onds, while the state of the art is at least one or two 
orders of magnitude more extensive. For comparison, 
in 1995, typical studies might include 128 lipids 
(20,000 atoms) on a timescale well below 1 ns. 

Applications 

Molecular dynamics simulations have contributed to 
our understanding of biological membranes in several 
different areas. In the following, a number of examples 
that demonstrate the wide applicability of modern 
simulations is given. 

Membrane Structure and Dynamics 

Due to the intrinsic flexibility of lipid chains, it is difficult 
to obtain experimental high-resolution structural infor¬ 
mation on the structure of lipid bilayer membranes in 
their physiologically relevant, fluid state. Molecular sim¬ 
ulations have provided a great deal of insight into lipid 
membrane structure and dynamics and particularly into 
the effect of chemical composition on membrane struc¬ 
tural properties. For example, MD simulation is an excel¬ 
lent tool to study the structure and dynamics of 
polyunsaturated lipids, via comparison with experimen¬ 
tal NMR data (Eldho et al. 2003). Another example is the 
numerous simulations studies on the ordering effect of 
cholesterol in mixed cholesterol/phospholipid mem¬ 
branes (Berkowitz 2009). Simulation results are also 
used to develop structural models for the interpretation 
of electron density profiles and X-ray diffraction data, for 
example, Klauda et al. (2006). The mechanism of lipid 
diffusion in bilayers has been the subject of some con¬ 
troversy in the literature, both experimental and theoret¬ 
ical. Molecular dynamics simulations were used to 


interpret the differences between different experimental 
results and provided a clear view of the ballistic, 
subdiffusive, and diffusive regimes (Flenner et al. 2009). 

Membrane Defects 

The formation of transient water pores in lipid mem¬ 
branes can take place upon application of electric fields 
(electroporation) or tension, as well as upon adsorption of 
various surface-active molecules like peptides, surfac¬ 
tants, polymers, and oxidative reagents (Marrink et al. 
2009; Gurtovenko et al. 2010). Water defects facilitate 
the transport of ions and other polar and charged species 
across the membrane but also the translocation of lipids 
from one leaflet to the other - a process known as lipid 
flip-flop. In the presence of transient spontaneous water 
pores, flip-flop prevents membranes from maintaining an 
asymmetric distribution of lipids between inner and outer 
leaflet. Asymmetry in lipid composition therefore 
requires active transport via specialized membrane pro¬ 
teins. Because the defects involved in these processes are 
transient, they are difficult to study experimentally except 
by functional means such as leakage of a fluorophore or 
electrical current. Simulations have made major contri¬ 
butions in this area that also includes important medical 
problems such as the mechanism of antimicrobial pep¬ 
tides and methods of targeted drug delivery. 

Interaction of Lipids with Small Molecules 

Membranes are the boundary separating cells from each 
other and from the environment and regulate transport of 
various substances in and out of cells. Transport across 
membranes includes a variety of mechanisms, both 
active and passive. Passive transport occurs simply via 
diffusion, without the need for coupling to chemical 
reactions, and its rate depends largely on free energy 
changes upon partitioning of the solute into the mem¬ 
brane. Simulation studies of passive transport made very 
significant progress over the past decade, especially 
thanks to much increased computer power, allowing 
widespread use of free energy calculations (MacCallum 
and Tieleman 2008). Permeation of small molecules 
(water, oxygen, ammonia, alkanes, alcohols, anesthetics) 
has been studied in great detail using free energy calcu¬ 
lation techniques and in particular the umbrella sampling 
methodology. This method allows calculating the free 
energy change in a membrane system upon translocation 
into the membrane interior, providing insight into the 
thermodynamic driving forces and the kinetics of these 
processes. Studies on the permeation of drugs and larger 



Molecular Dynamics Simulations of Lipids 


1583 


M 


molecules have also appeared. An example is the recent 
simulation of translocation of fullerene clusters across 
lipid membranes (Wong-Ekkabut et al. 2008). 

Lipid-Protein Interaction and Membrane Proteins 

A large fraction of molecular dynamics simulations of 
lipid membranes includes the membrane simply as 
a solvent for membrane proteins (Lindahl and Sansom 
2008). Modeling and simulations of membrane- 
spanning proteins are particularly important and 
widespread because of the scarcity of structural infor¬ 
mation from experiments. Significant progress has been 
achieved over the past decade in the field of membrane 
protein structure determination, both by crystallography 
and by NMR techniques; nevertheless, the fraction of 
membrane proteins in the Protein Data Bank (PDB) 
remains small. Other studies have focused more specif¬ 
ically on lipid-protein interactions, and particularly on 
hydrophobic matching, specific lipid-protein interac¬ 
tions, and on the induction of curvature in membranes. 
Hydrophobic mismatch indicates a difference between 
the hydrophobic length of a protein or peptide and the 
hydrophobic thickness of the surrounding membrane. 
Hydrophobic matching is important for membrane pro¬ 
tein partitioning, sorting, orientation, and functioning. 
The effects of mismatch on membrane properties and 
protein orientation have been explored using molecular 
dynamics simulations. Changes in membrane geomet¬ 
rical shape occur during endocytosis and vesiculation 
and are often aided by proteins that sense and create 
curvature. Such changes occur on length scales that only 
recently have become accessible to simulations 
(Reynwar et al. 2007). This no doubt will continue to 
be an interesting area of study (Marrink et al. 2009). 

Outlook 

Molecular dynamics and computer simulations in gen¬ 
eral have become a major and widely used method in 
studying lipids and membrane proteins. Its high intrinsic 
resolution in time and space makes it an ideal method to 
study the details of lipid and membrane protein structure 
and dynamics. Over the past decade, software has been 
greatly improved technically as well as in user-friendli¬ 
ness. Many interesting problems can already be simu¬ 
lated on recent desktop machines and small allocations 
on general-use clusters, while state-of-the-art studies 
will continue to require access to large-scale super 


computers as well as development in areas such as 
efficient use of graphics processing unit machines. As 
advanced as current models are, significant progress can 
be expected in the development of accurate parameters 
and potential functions at all levels from advanced 
polarizable force fields to classical atomistic force fields 
and improved coarse-grained methods. Membrane sim¬ 
ulations are no longer the realm of a few highly special¬ 
ized groups but are now an integral part of progress in 
lipid and membrane protein research. 

Summary 

Computer simulations of lipids are a powerful and 
established method to study in detail a broad range of 
problems involving lipids and membrane proteins at 
the level of atoms, using atomistic or united atom 
models, or at the level of molecular fragments, using 
coarse-grained models. In this entry, we briefly 
describe the history of lipid simulations, the basic 
molecular dynamics method to study lipid systems 
together with a number of key methodological issues, 
and give examples of a number of recent applications. 
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Definition 

In the most general sense, molecular recognition corre¬ 
sponds to the mechanism hy which two or more mole¬ 
cules come together to form a specific complex. These 
types of specific molecular interactions span biology 
and include processes as diverse as enzyme catalysis, 
antibody-antigen recognition, protein synthesis, and 
transcriptional regulation to name but a few. As 
a consequence of the universal importance of molecular 
recognition in biological function, understanding how 
molecules specifically recognize and interact with one 
another is of fundamental importance. 

Introduction 

Early studies on the kinetics and specificity of 
enzyme-catalyzed reactions led to the conclusion 
that substrates combine with the enzyme at 
a specific active site location on the enzyme surface. 
This understanding generated new questions as to 
how enzymes could carry out chemical transforma¬ 
tions efficiently with great regio- and stereo¬ 
specificity (Dixon and Webb 1979). Initially that 
question was satisfied by Emil Eischer’s lock-and- 
key hypothesis (Eischer 1894).This model of recog¬ 
nition satisfied the community for over 60 years until 
the hypothesis was modified by Dan Koshland’s 
induced fit hypothesis, which recognized the confor¬ 
mational plasticity of enzymes due to the observation 
that the active site conformation was different 
between the ligand-free and ligand-bound states 
(Koshland 1958). Again this model seemed 
satisfactory for another 50 years until the later part 
of the twentieth century when insight from the protein 
folding community and the recognition of the exis¬ 
tence of proteins as an ensemble of differentially 
populated structural states rather than a singular fold 
was applied to the field of molecular recognition. This 
led to the development of the conformational selec¬ 
tion hypothesis by Ruth Nussinov (Tsai et al. 1999b). 
As discussed below, now, almost 120 years after the 
initial proposal of the lock-and-key model, a blending 
of the three hypotheses allows us the best model to 
understand not only substrate recognition in enzymes, 
but also all molecular recognition events including 
protein-protein and protein-nucleic acid interactions. 


The Lock-and-Key Hypothesis 

Just as the field biochemistry grew out of the study of 
biological fermentation, much of the field of molecular 
recognition grew out of the study of enzyme selectivity 
and how enzymes are able to carry out catalysis both 
efficiently and with great stereo- and regio-selectivity 
(Voet and Voet 2004). In 1894, Emil Eisher, the 
famous organic chemist, discovered that glycolytic 
enzymes are able to distinguish between stereoisomers 
of sugars. Based upon that discovery he formulated the 
lock-and-key hypothesis (Eischer 1894). This model 
proposed that enzymes recognize their substrates as 
a lock receives a key. That is, only in the case of 
exact geometric complimentarity between the sub¬ 
strate (key) and enzyme (lock) is the substrate posi¬ 
tioned properly for the functional groups at the enzyme 
active site to carry out chemistry (Fig. la). In the case 
of a non-substrate, this complimentarity is inexact and 
like the wrong key in a lock, nothing happens. 

The Induced Fit Model Builds upon the 
Lock-and-Key Hypothesis 

This lock-and-key model served the biochemical com¬ 
munity well for over 50 years. However, while this 
model adequately explained how substrates that are too 
large to fit within the confines of the active site would 
fail to act as substrates, it did not explain how small 
substrates, for instance water, often acted as non¬ 
substrates in enzymatic reactions. The first revision to 
the lock-and-key model came in 1951 when, while 
attempting to understand the influence of pressure on 
enzymatic reactions, it was proposed by Keith Laidler 
that the combination of substrate with enzyme results 
in conformational changes in the enzyme (Laidler 
1951). However, it was studies on the enzyme hexoki- 
nase, a few years later, that brought Dan Koshland to 
the conclusion that Emil Fischer’s long established 
lock-and-key theory needed updating and he intro¬ 
duced a conceptual model that he called “induced-fit” 
(Koshland 1958). In this model, the interaction of an 
enzyme with its substrate resembles the fit of a hand in 
a glove, with a moderately flexible enzyme (the glove) 
fitting a moderately flexible substrate (the hand) 
(Fig. lb). Specifically, the model proposed that (a) as 
stipulated by the lock-and-key model, precise orienta¬ 
tion of catalytic groups is required for enzyme activity; 
(b) the substrate may cause a change in the three- 
dimensional relationship of the active site residues; 
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Crystallographic Evidence of the Encounter 
Complex 

One of the main arguments against the induced ht 
hypothesis is that the pathway requires the formation 
of a thermodynamically unfavorable encounter com¬ 
plex (Fig. lb). This is the complex in which the ligands 
are bound to the protein but the energetically favorable 
conformational change that is induced by ligand bind¬ 
ing has not yet occurred. However, recent crystallo¬ 
graphic studies of at least one enzyme (Sullivan and 
Holyoak 2008) provide the evidence for the existence 
of such a conformational state. While one could argue 
that the crystallographic lattice influences the energet¬ 
ics of this state, allowing for its stabilization and 
observation, these studies do support the feasibility of 
the induced fit pathway, as structural biology has now 
demonstrated the existence of all three states populated 
along an induced ht pathway. 


Molecular Recognition: Lock-and-Key, Induced Fit, and 
Conformational Selection, Fig. 1 Schematic diagram 
representing the (a) lock-and-key and (b) induced fit and con¬ 
formational selection models. In panel B, the induced fit pathway 
is defined by the equilibrium constants Ki and K 2 , while the 
conformational selection pathway is defined by equilibrium con¬ 
stants K 3 and K 4 . Along the induced fit pathway, ligand (L) binds 
to P forming the P-L encounter complex. This ligand-binding 
event subsequently induces a conformational change in P and the 
formation of the P*-L key-lock complex. Along the conforma¬ 
tional selection pathway, P is in a preexisting equilibrium with 
the active conformation P*. Binding of L to P* forms the final 
P*-L key-lock complex. In order to maintain the preexisting 
equilibrium between P and P* (represented by K 3 ), as ligand 
binds to P* a shift occurs in the population of P toward P* 


and (c) the changes will be induced by the binding of a 
substrate, while a non-substrate will be incapable of 
inducing these required changes (Koshland 1958). 
Although the initial response to his proposal was 
unenthusiastic (Koshland 2004), the model gained 
gradual acceptance, acceptance due in no small part to 
the growing field of macromolecular crystallography, 
whose protein structures revealed pictures of enzyme- 
substrate interactions. These high-resolution structures 
demonstrated in many instances that ligand-free and 
ligand-bound structures indeed had two different struc¬ 
tural conformations (Gerstein et al. 1994). As the 
amount of structural data grew, this new information 
demonstrated that the induced-ht mechanism of recog¬ 
nition prevailed throughout biology, spanning protein- 
protein and protein-nucleic acid interactions. 


Rise of Conformational Selection Model 

At the end of the hrst decade of the twenty-hrst cen¬ 
tury, just as Dan Koshland’s work built upon the earlier 
studies of Emil Fisher, studies investigating the true 
plasticity of protein structure suggested that 
a modihcation to the induced fit model might be appro¬ 
priate. This model, coined the conformational selec¬ 
tion model, was stimulated by folding funnel research 
demonstrating that the tenet of “one protein, one fold, 
one function” does not hold (Tsai et al. 1999a, b) 
(Fig. lb). This long-held tenet has been replaced by 
a new view which states that folded proteins exist as 
a cluster or ensemble of closely related structural states 
and each of these states have potentially different reac¬ 
tivities and/or binding affinities. Supporting the impor¬ 
tance of these ensembles in biological function are 
NMR studies demonstrating that conformations simi¬ 
lar to the bound state are already populated in the 
absence of ligand binding to the macromolecule 
(Beach et al. 2005). This suggests, in contrast to the 
induced ht hypothesis, it may not be required for the 
binding event to induce a conformational change in 
the protein, but via a mechanism similar to the MWC 
(Monod-Wyman-Changeux) model of cooperativity 
(Monod et al. 1965), binding of the ligand directly to 
the “lock” state can occur. Subsequent to binding, to 
maintain equilibrium, the population of molecules then 
shifts toward the active conformation facilitating fur¬ 
ther binding events (Fig. lb). As stated above, the 
conformational selection model is consistent with the 
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notion of protein structures existing as an ensemble of 
states and that the ligand-bound or lock state is present 
in the absence of ligand. However, just as Koshland 
found with Emil Fischer’s model, direct binding of the 
substrate to the lock state in a conformational selec¬ 
tion model does not explain why small substrates 
would be non-substrates for a reaction if the binding 
of substrate does not need to induce conformational 
changes to achieve the final key-lock complex. An 
additional aspect of molecular recognition not 
explained by the conformational selection model is 
binding to active sites of enzymes that, through con¬ 
formational change, are occluded in the final “key¬ 
lock” state. In these instances, the closing of the active 
site, while resulting in the active “lock” conformation 
and allowing for activity, does not sterically allow for 
binding directly to this conformation as predicted by 
the conformational selection model. Therefore, bind¬ 
ing in these situations must occur to a more open 
complex. Binding of ligand or substrate to this more 
open state would subsequently stabilize the closed 
(lock) conformation, resulting in the formation of the 
key-lock state more consistent with the induced ht 
hypothesis. In other systems that do not suffer from 
such a steric restriction, it is quite reasonable to pos¬ 
tulate direct binding to the key-lock state, as postu¬ 
lated by the conformational selection model, and 
depending on the system, little conformational change 
may be induced by ligand binding. It is, however, 
quite unlikely that ligand binding would have no 
impact on the formation of the final key-lock state as 
the free energy of ligand association is likely 
partitioned to some extent causing subtle and not so 
subtle conformational changes. 

The Extended Conformational Selection Model 

Recently, a model unifying the conformational selec¬ 
tion and induced ht models has been proposed 
(Csermely et al. 2010). This latest model, deemed the 
“extended conformational selection model,” includes 
elements of both conformational selection and induced 
ht to overcome the individual shortcomings of the two 
original models mentioned above. In this model, it is 
hypothesized that although the population of 
preexisting conformations may include a conforma¬ 
tional state that is very similar to the hnal key-lock 
state, some of the free energy available from the bind¬ 
ing event will be utilized to optimize protein-ligand 
interactions and the inducement of the hnal key-lock 


state in line with the induced ht hypothesis. 
The energetics of this partitioning of ligand-binding 
energy to conformational change was originally 
discussed by William Jencks is his Circe Effect paper 
(Jencks 1975). 

Summary 

The process of molecular recognition is an area of 
great importance for the understanding of biological 
regulation and function, as illustrated by the 120-year 
history of its study. Continuing studies in this held are 
sure to generate not only new insight into complex 
biological problems, but also provide the basis for the 
exploration of new avenues in modern drug design. 
Specihcally, the development of new molecules that 
function by skewing the population distribution of 
conformational states, resulting in either activation or 
inhibition of targets and thereby eliciting the desired 
biological response becomes an intriguing area of 
investigation. 

Cross-References 

► Computational Studies of Enzyme Motions 

► Macromolecular Crystallography: Overview 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► Protein Dynamics and Allostery 

► Protein Dynamics in Catalysis - Computational 
Studies 

► Protein NMR - Introduction 
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Synonyms 

Protein Structure 


Definition 

Molecular replacement (Rossmann 2001) is a method 
for solving the phase problem in crystallography. 
A molecule of known structure (the model structure), 
which must he similar to the structure to be determined 
(the target structure), is placed in the asymmetric unit 
so that the root-mean-square deviation between the 
model and the target is minimized. The phases calcu¬ 
lated from the model are then combined with the 
amplitudes measured from the target to generate an 
initial electron density map. In this sense, the model 
replaces the target for the calculation of the phases. 


hence the name of the method. If the root-mean-square 
deviation in atomic positions between the model and 
the target is sufficiently small, over a sufficiently high 
proportion of the atoms, then the electron density map 
given by molecular replacement will show novel fea¬ 
tures of the target structure, and the phase problem is 
considered solved. 

Utility 

Molecular replacement is used principally in protein 
crystallography, since the phase problem for asymmet¬ 
ric units with fewer than 200 non-hydrogen atoms can 
be solved using de novo methods. At its best it is a fast, 
inexpensive, and highly automated way of solving the 
phase problem. 

Molecular replacement requires a template, or 
model, structure. For molecular replacement to be 
successful, the model structure must be sufficiently 
similar to the target structure. Whether or not the 
model and target are sufficiently similar cannot be 
known before the target structure is solved; they are 
sufficiently similar if and only if molecular replace¬ 
ment succeeds. However, there are good indicators that 
molecular replacement is likely to be successful. 
Molecular replacement is likely to be successful 
when the sequence identity between the model and 
the target is better than ^30%. This generally corre¬ 
sponds with a root-mean-square deviation in atomic 
positions of less than ^1.5 A (Chothia and Lesk 1986). 
Molecular replacement can also be successful when 
the target and model have between 20% and 30% 
sequence identity, after applying model-improvement 
techniques. It is difficult to obtain a molecular replace¬ 
ment solution when the sequence identity is less than 
20%. The sequence identity is only a good guide to 
success if there is no conformational change, such as 
relative movement of domains, between the model and 
the target structures. 

A full molecular replacement search is unnecessary 
when the model and the target structures are able to 
crystallize in the same space group with the same unit 
cell and almost identical arrangements of atoms in the 
asymmetric unit. This may happen when the model and 
the target differ by only a point mutation or by the 
addition or substitution of a small molecule. The model 
is used to phase the target, but the original orientation 
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and position of the model can be used, with only 
a small degree of optimization by rigid body 
refinement. Differences between the model and target 
can be identified with simple Fourier techniques. 
Molecular replacement would be the method of choice 
for solving the phase problem if the point mutation or 
small molecule addition/substitution induced a change 
in crystal form. 

Some pathologies of the crystal can make molecular 
replacement difficult even when a good template is 
available. These include poor diffraction, crystal 
twinning, and high numbers of molecules in the asym¬ 
metric unit. The latter is an issue when the fraction of 
the total scattering given by the model is much lower 
than it would be were there only to be one copy of the 
target in the asymmetric unit. 

Molecular replacement is currently used to solve 
the phase problem for more than two thirds of protein 
structures (Evans and McCoy 2008). As the database 
of known structures increases, the proportion of 
protein structures having good templates available 
increases, enabling an even higher proportion of 
structures to be solved by molecular replacement. 

Implementation 

Conceptually, molecular replacement involves 
orienting and translating the model structure through¬ 
out the unit cell on a finely sampled grid of angles and 
positions; calculating the structure factors for each 
orientation and position; comparing the calculated 
and observed structure factors; finding the orientation 
and position with the best match between calculated 
and observed structure factor amplitudes; and 
calculating an initial electron map using the observed 
structure factor amplitudes and the phases taken from 
the model structure at the best match. 

In practice, the implementation of molecular replace¬ 
ment uses computational techniques to speed up the 
search calculations which would be prohibitively slow 
were the method to be implemented naively. Much of 
the literature on molecular replacement addresses these 
computational issues. 

Software for performing molecular replacement 
differs in the target function used to score the match 
between the calculated and observed structure factors, 
and in the way of searching the six-dimensional space. 


It is important to have a target function that identifies 
the correct orientation and position among all the false 
orientations and positions with high signal to noise, 
and a search algorithm that explores the sample space 
adequately, so that the correct orientation and position 
are among those scored. Different target functions and 
search strategies may be more or less optimal for 
different molecular replacement problems, such as 
different compositions of the asymmetric unit. 

A given structure may require many attempts at 
molecular replacement solution utilizing a battery of 
different models. Automation of model generation, run¬ 
ning of the molecular replacement job for each model, 
and collation of the results enhances the number of jobs 
that can be run in a given time. Automation may be 
critical to finding a solution in challenging cases. 

Multiple models can be generated from a single 
template by altering the number of atoms included in 
the model and the temperature factors of the atoms. 
Atoms can also be added by theoretical modeling. 
Where the template has a domain structure, conforma¬ 
tional differences between the model and target may 
prevent structure solution. Conformation differences 
may be accounted for either by splitting the search 
model into domains or producing a series of models 
with differences in the orientation of the domains, with 
the aim that one should be close to the target confor¬ 
mation. Homology models have traditionally not been 
successful as models for molecular replacement, but 
the newer de novo modeling methods produce models 
that can be close enough to the target to provide 
a molecular replacement solution. 

Cross-References 

► Macromolecular Crystallography: Overview 

► Structural Genomics 
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Synonyms 

Nerve growth factor (NGF); Tropomyosin-related 
kinase A (TrkA) 

Definition 

The nerve growth factor (NGF) is a member of 
a family of neurotrophin proteins, essential for the 
development and maintenance of nervous systems. 
Deficit in NGF activity is related to the onset of 
Alzheimer’s disease (AD). Such activity derives, in 
part, from its binding to the tropomyosin- related 
kinase A (TrkA) receptor. Understanding the molecu¬ 
lar basis of NGF/TrkA molecular recognition might 
therefore provide important insights for drug design 
and protein inhibitors (Chao 2003). In this respect, the 
determination of structural determinants of the NGF/ 
TrkA complex (Wehrman et al. 2007) has been a real 
breakthrough. Molecular simulation studies based on 
the available structural information point out that the 
N-terminus of NGF is a critical region for TrkA bind¬ 
ing. Intriguingly, the conformational properties of such 
region are affected by Zn^'^ and Cu^"^ ion binding, 
which are known in turn to inhibit the interactions of 
the neurotrophin with TrkA. 

Introduction 

Neurotrophins are growth factors essential for the 
wiring regulation of the nervous system during devel¬ 
opment (Bothwell 1995). They ensure its maintenance 
in the adult organism and modulate synaptic transmis¬ 
sion (Chao 2003). The growth factor family comprises 


nerve growth factor (NGF), the brain-derived 
neurotrophic factor (BDNF), neurotrophin-3 (NT-3), 
and neurotrophin-4 (NT-4). They share a sequence 
homology close to 50% and a structural homology 
close to 60% (Wiesmann and de Vos 2001). 

Discovered by Rita Levi Montalcini and Stanley 
Cohen, for which they received the Nobel Prize in 
1986, NGF (Levi-Montalcini 1987) is so far deemed 
as a potential target for anti-Alzheimer’s disease (AD) 
therapies. Indeed, the content of NGF in the brain 
decreases with age, particularly in memory-related 
areas such as the hippocampus (Larkfors et al. 1987). 
Deficits in NGF release and its cognate tropomyosin- 
related kinase A receptor (TrkA) (Klein et al. 1991) 
phosphorylation signaling contribute to age-related 
decline in long-term potentiation (Bliss and 
Collingridge 1993), the aspect of neuronal plasticity 
widely believed to facilitate learning and memory 
(Bliss and Collingridge 1993). In addition, Zn^"^, 
Cu^"^, and Pd^"^ ions have been found to inhibit the 
interactions of NGF with TrkA receptor (Ross et al. 
1997). 

Structure and Dynamics of the NGF/TrkA 
Complex 

NGF exerts some of its biological activity as 
a non-covalent dimer of monomers (Wiesmann and 
de Vos 2001) which interacts with its TrkA receptor 
(Klein et al. 1991). Elucidating the structural biology 
of NGF and TrkA is very useful for drug design appli¬ 
cations (Chao 2003). In these regards, the publication 
of the X-ray structure of the complex between NGF 
and TrkA (Wehrman et al. 2007) has been a real break¬ 
through. The complex features a structure that resem¬ 
bles a crab with two extended pincers (see Fig. la), 
with a C 2 -fold symmetry. The crab is composed of the 
NGF homodimer as the body flanked by the pincers 
comprising the TrkA dl-d5 domains (Fig. la). 

In spite of its importance, only very few computa¬ 
tional studies have been focused on the structure and 
dynamics of NGF and its receptors (Settanni et al. 
2003; Berrera et al. 2006; Stanzione et al. 201 1). 

Through Amber94-based MD simulations (Cornell 
et al. 1995) of human NGF-TrkA domain 
5 (NGF/TrkA-d5) complex, Settanni et al. (2003) 
found that highly persistent, water-mediated H-bond 
interactions at the protein-protein interface are 
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Molecular Simulations of NGF Protein, Fig. 1 (a) 3D rep¬ 
resentation of the extracellular domain of the NGF/TrkA dimer, 
drawn from X-ray coordinates of Wehrman et al. (2007) (pdb 
code 2IFG). (b) Close-up: MD Snapshot of the complex consid¬ 
ered in the work of Settarmi et al. (2003). NGF is shown in 


orange and TrkA domains are shown in green. Interface regions 
of NGF/TrkA complex are highlighted by two opposite arrows. 
NGF N-terminus residues playing a key role for the stabilization 
of the complex, as emerging from MD simulations by Settanni 
et al., are shown by blue sticks 


relevant for binding (Fig. lb). The authors show that 
persistent hydrogen bonds and electrostatics interac¬ 
tions are mostly located at the NGF N-terminus (from 
residues Asn 349(349') and Gin 350(350') of TrkA 
(TrkA’) to residues Arg 103(103') and His 84(84')) 
on the NGF homodimer. Both direct and water- 
mediated protein-protein contacts stabilize the com¬ 
plex. NGF N-terminus is likely to play a role for the 
specihcity of the NGF/TrkA-d5 recognition. Indeed, it 
is not present in other neurotrophin-receptor systems 
(Wiesmann and de Vos 2001). Therefore, a peptide 
consisting of only the N-terminal part of NGF is 
expected to have high affinity for the receptor. 

Berrera et al. (2006) addressed this issue by 
Amber94-based MD simulations on the TrkA-d5/ 
NGF complex with or without the N-terminus. The 
complete complex is stable during the dynamics. How¬ 
ever, the two moieties detach when the N-terminus is 
absent. NGF’s His-4, Glu-11, Trp-21 residues play 
a particular role for the binding (Settanni et al. 2003; 
Berrera et al. 2006) (see Fig. 1). 


More recently, Amber03-based MD simulations 
(Duan et al. 2003) pointed to the plasticity of the 
NGF(1-16) and NT-4(1-16) N-terminus peptides 
(Stanzione et al. 2011). Most of the conformations do 
not present a regular secondary structure. The confor¬ 
mational versatility may be an important structural 
feature that allows these proteins to interact with diver- 
sihed receptors. Consistent with this proposal, NGF 
(1-16) (and to a lesser extent NT-4(1-16)) is proposed 
to bind TrkA and TrkB via conformational selection 
(Boehr et al. 2009; Lange et al. 2008). 

Zn^"^ and Cu^"^ ion binding (Milardi and Rizzarelli 
2011) to NGF has been investigated by means of 
a combined classical and ab initio approach 
(Shamovsky et al. 1999). Multiple CHARMM-based 
simulated annealings (Brooks et al. 1983) were run 
using the X-ray structure of McDonald et al. (1991). 
The conformations found with this coordination 
environment (His-84, Asp-105, His-4 and His-8) 
revealed a shortening of the distance between Ser-1 
and Glu-135 regions (Fig. 2). This result was 
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Molecular Simulations of NGF Protein, Fig. 2 Structure of 
NGF upon Cu^'^and binding emerging from Shamovsky 
et al. The rearrangement of residues Glu-35, His-4, His-8, 
His-84, and Asp-105 is marked with dashed arrows 

experimentally observed through the chemical cross- 
linking of the two NGF monomers (Ross et al. 1997). 

Summary 

Molecular simulations studies of the NGF/TrkA 
complex point to the role of the NGF N-terminal 
domain for the affinity of the complex. This region is 
highly flexible and poorly conserved across a large 
number of neurotrophin sequences, indicating 
a strong specificity of the N-terminal domain toward 
specific Trk receptors. The binding of Zn and Cu^"^ to 
NGF, which inhibits NGF/TrkA interaction, does 
affect this region. 

Cross-References 

► Domain Structure Classifications 

► Dynamic Docking 

► Fluxes Through Channels and Pores 

► Molecular Dynamics Simulations of Lipids 

► Molecular Recognition: Lock-and-Key, Induced Fit, 
and Conformational Selection 


► Protein Dynamics and Allostery 

► Protein Folding 

► Protein Structural Models - Evaluating Quality 
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Synonyms 

Fundamentals of infrared spectroscopy; Fundamentals 
of Raman spectroscopy; Fundamentals of vibrational 
spectroscopy; Normal modes of molecules 

Definition 

The physical principles of infrared and Raman spec¬ 
troscopy are described, that is, molecular vibrations, 
vibrational transitions, effects of anharmonicity, and 
the interaction of vibrations with electromagnetic 
radiation. 

Basic Characteristics 
Vibrations 

The nuclei in a molecule vibrate around their equilib¬ 
rium positions. These often complicated, collective 
movements have much in common with the simple 


two-atomic oscillator, consisting of two nuclei with 
masses niy and m 2 which are connected by a bond 
with force constant /. Its vibrational frequency v is 
given by 

v = - 

2n V WL 

where/is the force constant (units: N/m or dyn/A, with 
100 N/m = 1 mdyn/A) between the two atoms, and 
the reduced mass (l/» 2 j. = l/wti - 1 - l/m 2 )- The frequency 
increases with increasing force constant, that is, with 
increasing bond strength. It is therefore affected by 
inter- or intramolecular factors that alter the bond 
strength, for example by interactions between an 
enzyme and a substrate which prepare the substrate 
for the catalytic reaction. This changes the substrate’s 
bond strengths and thus its vibrational frequencies. 

The frequency is also dependent on the masses of 
the vibrating atoms, the smaller they are, the faster is 
the oscillation. Because of this, vibrations involving 
hydrogen atoms have the highest frequencies. The 
mass effect is often used for the interpretation of spec¬ 
tra: vibration(s) to which particular atoms contribute 
are identified by isotopic labeling, which shifts the 
frequencies of these vibrations (► Isotope Edited 
Raman and Infrared Difference Spectroscopy Studies 
of Protein Structure). 

Vibrations distort the equilibrium structure of 
a molecule as illustrated in Fig. 1 for some examples. 
In stretching vibrations (abbreviated v) the bond length 
elongates and contracts periodically. In valence angle 
bending vibrations (5), the bond angle of a three- 
atomic fragment of the molecule changes, in out of 
plane bending vibrations (y), the angle between 
a bond and a molecular plane defined by two other 
bonds. In torsional vibrations, the dihedral angle of 
a four-atomic fragment of the molecule oscillates (t). 
It is easily imagined that shortening or elongation of 
a bond meets stronger resistance than a movement 
perpendicular to the bond. Therefore, the force con¬ 
stant of stretching vibrations is typically a factor of 10 
larger than that of bending vibrations, and stretching 
vibrations have higher frequencies (Colthup et al. 
1990). 

Bond lengths and angles are called internal coordi¬ 
nates of a molecule. Usually, several of them are 
coupled: They oscillate together with the same fre¬ 
quency and pass through their equilibrium position at 
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Molecular Vibrations and Their Interaction with Electro¬ 
magnetic Radiation, Fig. 1 Examples for vibrations of inter¬ 
nal coordinates. From left to right stretching, valence bond angle 
bending (scissoring), wagging (out of plane bending), and tor¬ 
sional vibration. The red symbols illustrate the movements at 
a particular moment, with @ indicating movement out of the 
paper plane 


the same time. A motion like this is called a normal 
mode of vibration. Approximatively, a normal mode 
vibrates independently from all other normal modes. 
Some normal modes are localized on a small part of 
the molecule, for example on a C=0 bond. They are 
called group vibrations (Colthup et al. 1990). Others 
involve many atoms of a molecule and their frequen¬ 
cies characterize the chemical structure and conforma¬ 
tion of the entire molecule like a fingerprint. 
A nonlinear molecule with N atoms has 3 N-6 normal 
modes (3 N-5 for a linear molecule). This gives 20 000 
normal modes for an average Escherichia coli protein 
and to nearly 10® normal modes for E. coli DNA. 

Vibrational Transitions and Anharmonicity 

Many vibrational properties are satisfactorily 
described by the harmonic oscillator approximation, 
which implies that the restoring force is proportional to 
the deviation from the equilibrium structure and the 
potential energy depends only on quadratic terms in 
the displacements. The harmonic oscillator has dis¬ 
crete energy levels E = {n + Yi) hv^j^,, as shown in 
Fig. 2a, with n = 0, 1, 2,... being the vibrational 
quantum number and Vvib the vibrational frequency. 
Increasing n increases the vibrational amplitude with¬ 
out changing the frequency. Electromagnetic radiation 
can induce transitions between these energy levels. 
According to the selection rule for the harmonic oscil¬ 
lator, the only allowed transitions are between adjacent 
vibrational levels (An = ±1), which are separated in 
energy by AE = hv^^j. Most transitions proceed from 
the ground state level to the first excited level because 
most oscillators are in the ground state at room tem¬ 
perature. These are called fundamental transitions 
(blue arrow in Fig. 2a). Transitions from excited vibra¬ 
tional states are referred to as hot transitions (red arrow 


a be 



Molecular Vibrations and Their Interaction with Electro¬ 
magnetic Radiation, Fig. 2 Vibrational transitions, (a) fun¬ 
damental (blue) and hot transition {red) of an harmonic oscillator 
with vibrational quantum numbers indicated, (b) overtone tran¬ 
sition of an anharmonic oscillator, (c) combination transition of 
two anharmonic oscillators 

in Fig. 2a) because excited levels are more populated at 
high temperatures (Boltzmann distribution), which 
facilitates their observation. 

All real oscillators deviate to some degree from the 
ideal harmonic behavior and are anharmonic because 
the potential energy depends on cubic and higher 
orders in the displacements. Anharmonicity affects 
the vibrational energy levels, softens the selection 
rule for the harmonic oscillator and may couple normal 
modes (► Infrared Spectroscopy of Protein Dynamics; 
Ultrafast Kinetics and ► Nonlinear IR Spectroscopy of 
Proteins). In consequence: 

1. The vibrational energy levels are no longer equally 
spaced, instead the spacing decreases with increas¬ 
ing energy (Fig. 2b). Accordingly, hot transitions 
have lower energy than the corresponding funda¬ 
mental transitions. 

2. Transitions over several levels (overtones. 
An = ±2, 3,...) as well as simultaneous transitions 
of several normal modes (combination transitions) 
occur, but usually have a low transition probability 
(Colthup et al. 1990; Fig. 2b and c). 

3. Transfer of excitation energy may occur from one 
excited normal mode to another. 

Absorption of Electromagnetic Radiation: Infrared 
Spectroscopy 

Vibrational spectroscopy observes transitions between 
vibrational levels. The main techniques are infrared 
and Raman spectroscopy which differ regarding the 
fate of the incoming photons. In infrared spectroscopy, 
photons are absorbed and this induces vibrational 
transitions. For fundamental transitions, the energy 
difference AE = hv^j^, has to match the energy of 
the absorbed photon (Bohr frequency rule). 
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This implies that light can be absorbed when 
the frequencies of light and vibration coincide 
(i^photon = i^vib)- This Condition is met in the infrared 
spectral region which extends from 0.78 pm to about 
1,000 pm. A further requirement (selection rule) needs 
to be fulfilled for the absorption of photons: The 
molecular dipole moment has to change during an 
oscillation of the normal mode, which implies that 
normal vibrations involving polar bonds are often 
strong absorbers. 

Vibrational spectra are plotted against the quantity 
wavenumber v, which is the inverse of wavelength 1: 
V = Wavenumber is proportional to the transition 
energy and has the unit cm“*. Wavenumber should be 
written in one word (i.e., not wave number), used in the 
singular form (i.e., not wavenumbers) and its unit dis¬ 
tinguished from the quantity itself (i.e., it is wrong to say 
1,650 wavenumbers). It is common to use frequency 
and wavenumber as synonyms because they are propor¬ 
tional. However, they are not identical. Adding to the 
confusion, spectra are plotted in different directions 
regarding the wavenumber axis, depending partly on 
different traditions within subsections of vibrational 
spectroscopy. The International Union of Pure and 
Applied Chemistry (lUPAC) recommends to plot vibra¬ 
tional spectra from high to low wavenumbers (Lamola 
and Wrighton 1984). This gives a consistent appearance 
of the spectra regardless of whether they are plotted 
against wavenumber or wavelength. 

The infrared spectral range can be subdivided into 
the ► near-infrared region (12,800 - 4,000 cm“* or 
0.78 - 2.5 pm) where overtone and combination 
bands are observed, the mid-infrared region (4,000 - 
200 cm~* or 2.5 - 50 pm) which is dominated by 
fundamental transitions, and the >■ far-infrared or 
terahertz region (200 - 1 cm~* or 50 pm - 1 cm, 6 - 
0.03 THz, 1 THz = 33 cm“^) where collective motions 
and hydrogen bonding networks can be studied. Group 
vibrations absorb above ^1,500 cm“* and the finger¬ 
print region occurs below ^1,300 cm”'. 

Inelastic Scattering of Electromagnetic Radiation: 
Raman Spectroscopy 

In Raman spectroscopy, photons are inelastically 
scattered and gain (anti-Stokes scattering) or loose 
energy (Stokes scattering) that corresponds to the 
energy difference between vibrational states 
(► Raman Spectroscopic Instrumentation, Experimental 
Considerations). This energy difference between incident 


and scattered photons is plotted in Raman spectra. Stokes 
scattering is more probable, but the large majority of 
photons are scattered elastically (i.e., without energy 
gain or loss). Raman scattering occurs in only 1 of 
^10“' scattering processes and for only 1 of ^10® inci¬ 
dent photons. The selection rule for Raman scattering 
requires that the polarizability of the molecule changes 
during the vibration. The different selection rules for 
Raman and infrared spectroscopy imply that both 
methods give complementary information: Strong infra¬ 
red absorbing vibrations are often weak Raman scatterers 
and vice versa. 

Information That Can Be Derived from the 
Vibrational Spectrum 

The approximate position of a band in a vibrational 
spectrum is determined by the vibrating masses and 
the strengths of the vibrating bonds, the exact position 
by electron density shifts due to the intra- and 
intermolecular environment and by coupling between 
internal coordinates, which often depends on confor¬ 
mation. The strength of the signal depends on the 
amount of material and via the selection rules on the 
symmetry of the molecule. Structural heterogeneity 
affects the width of vibrational bands: Each structure 
produces a slightly different spectrum because vibra¬ 
tional spectroscopy generates a snapshot of the sample 
conformer population unless ultrafast dynamics are 
important. Thus, flexible structures will give broader 
bands than rigid structures. Therefore, vibrational 
spectra contain the information listed below about the 
observed molecules and often there is a quantitative 
correlation between spectral and molecular properties 
(Palafox 1998; Deng and Callender 2001; Barth 2007): 

• Amount/concentration 

• Chemical structure 

• Conformation 

• Environment (e.g., hydrogen bonding and electric 
held) 

• Bond properties (e.g., length, strength) 

• Structural heterogeneity 

Cross-References 

► Far-Infrared Spectroscopy of Proteins 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Infrared Spectroscopy of Protein Reactions 
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► Infrared Spectroscopy of Protein Structure 

► Isotope Edited Raman and Infrared Difference 
Spectroscopy Studies of Protein Structure 

► Near-Infrared Spectroscopy in Biological 
Molecules and Tissues 

► Nonlinear IR Spectroscopy of Proteins 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Raman Spectroscopic Instrumentation, 
Experimental Considerations 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 

► Terahertz Spectroscopy: Solvation of Biomolecules 
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Molecularly Imprinted Polymers 

Sergey A. Piletsky, Iva Chianella and 
Michael J. Whitcombe 

Cranheld Health, Cranfield University, Cranfield, 
Bedfordshire, UK 

Synonyms 

Imprinted polymers; Molecular imprinting; Template- 
mediated polymerization 

Definition 

Molecular imprinting is the process of template- 
induced formation of specihc recognition sites 


(binding or catalytic) in a material where the template 
directs the positioning and orientation of the material’s 
structural components by a self-assembly mechanism. 
This involves the formation either of a complex 
(non-covalent imprinting) or reversible covalent 
bonds (covalent imprinting) between a given target 
analyte (template) and polymerizable functional 
monomers dissolved in an appropriate solvent. The 
template-monomer interactions are then hxed by rad¬ 
ical polymerization into the growing polymer network. 
Subsequent removal of the template leaves binding 
sites within the polymer that possess both complemen¬ 
tary shape and orientation of functional groups to allow 
selective recognition of the target species. Eigure 1 
depicts a schematic representation of molecularly 
imprinted polymer formation showing some of the 
possible mechanisms of interaction between the 
template and polymerizable functional monomers. 
A diverse range of species, including metal ions, phar¬ 
maceuticals and their metabolites, drugs of abuse, 
hormones, pollutants, phytochemicals, biomarkers, 
peptides, proteins, viruses, bacteria, yeast cells, and 
crystals have been used as templates in molecular 
imprinting and the list continues to grow. 

Molecularly imprinted polymers (MIPs) can be 
prepared as acrylic or methacrylic polymers, polyure¬ 
thanes, conjugated polymers such as polyaniline or 
polypyrrole, or as inorganic materials such as Si 02 or 
Ti02 (Alexander et al. 2006; Dickert et al. 2006; 
Malitesta et al. 1999). Acrylates and methacrylates 
remain the most popular monomers used for MIP prep¬ 
aration and usually produce polymers with superior 
binding and recognition abilities. This is mainly due 
to the availability of a huge range of potential func¬ 
tional monomers (more than 4,000 polymerizable 
compounds are available commercially and more 
may be prepared synthetically for specific purposes); 
despite this, the majority of non-covalent MIPs have 
been prepared using just a handful of monomers, the 
most important being methacrylic acid, acrylamide, 
and vinylpyridine. In addition, the versatility of vinyl 
polymerization and its initiation methods has allowed 
the formation of (meth)acrylate-based MIPs using bulk 
polymerization, as thin films and grafts, membranes, 
micro- and nanoparticles (Poma et al. 2010). Meth 
(acrylate)-based materials also lend themselves readily 
to optimization and design strategies, including com¬ 
binatorial, bioinformatics, and computational methods 
(Lanza and Sellergren 2004; Chianella et al. 2002). 
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Molecularly Imprinted Polymers, Fig. 1 Schematic repre¬ 
sentation of molecularly imprinted polymer formation showing 
some of the possible mechanisms of interaction between the 
template (blue) and the polymerizable functional monomers 
and the rebinding of the target species {red) within the imprinted 
cavity. Interactions used in molecular imprinting can involve 


one or more of the following mechanisms: (a) reversible cova¬ 
lent bonds; {h) covalently attached polymerizable binding 
groups that are activated for non-covalent interaction by tem¬ 
plate cleavage; (c) electrostatic interactions; {d) hydrophobic or 
van der Waals interactions and {e) metal ion-ligand interactions 
(Adapted with permission from Alexander et al. (2006)) 


Molecularly Imprinted Polymers, Table 1 Comparison of biomolecules and MIPs 


Biomolecules 

MIPS 

Poor stability to variations in temperature, pH etc. 

Stable at low/high pHs, pressure and 
temperature (<140°C) 

High price of enzymes and receptors 

Inexpensive and easy to prepare 

Poor performance in nonaqueous media 

Can work in both aqueous and organic 
solvents 

Individual biomolecules have specific operational requirements (pH, ionic strength, 
temperature, substrate) 

Polymers for different targets could operate 
in the same specific environment 

For some important analytes, natural receptors and enzymes do not exist and 
antibodies cannot be prepared 

Polymers could be prepared for practically 
any compound 

Poor compatibility with micromachining technology and miniaturization 

Polymers are fully compatible with 
micromachining technology 


Basic Characteristics 

Molecular imprinting is one of the few generic tech¬ 
nologies available for the design of synthetic receptors 
and has therefore become the focus of interest for 
scientists and engineers looking for replacements for 


unstable biological molecules such as antibodies and 
enzymes. The advantages of MIPs over biomolecules 
such as proteins are listed in Table 1. 

The main advantages of MIPs over their biological 
counterparts include superior stability to elevated 
temperatures and pressures and improved inertness to 
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strong acids, bases, and organic solvents; low cost; 
long shelf-life; reproducibility; and ease of prepara¬ 
tion. Consequently, MIPs can be utilized in many 
applications where specihc molecular recognition is 
a necessary property. MIPs have therefore been used 
as recognition elements in sensors for chemical and 
biological entities (Fig. 2), as substitutes for antibodies 
in immunoassays, for catalysis, drug delivery, as drugs 
and in affinity separations (e.g., as the stationary phase 



Molecularly Imprinted Polymers, Fig. 2 Polyurethane sur¬ 
face imprinted with yeast cells to be used in a gravimetric sensor 
(Dickert et al. 2006) 


in high performance liquid chromatography (HPLC), 
as sorbents for preconcentration and purihcation of 
analytes by solid-phase extraction (SPE), and as 
selective matrices for capillary electrophoresis), 
(Tse Sum Bui and Haupt 2010). 

Synthesis of Molecularly Imprinted Particles 

The traditional method for producing MIPs is as porous 
bulk monoliths (by bulk polymerization in the presence 
of a porogenic solvent), which are ground and sieved to 
obtain irregularly shaped particles of appropriate size 
(typically ^5-50 pm). Although straightforward, this 
procedure is time consuming, wasteful, and unsuitable 
for industrial production. In addition, it consumes con¬ 
siderable amounts of the template species during syn¬ 
thesis and the resulting polymer often has 
a heterogeneous distribution of imprinted sites. In 
order to overcome these limitations, alternative strate¬ 
gies aimed at the production of MIPs as micro- and 
nanoparticles have been investigated in recent years. 
These include precipitation polymerization (Fig. 3a), 
mini- and microemulsion polymerization (Fig. 3b), 
core-shell approaches (Fig. 3c). Similarly, controlled 
polymerization methods such as living 
free-radical polymerization using iniferters, atom trans¬ 
fer radical polymerization (ATRP), and reversible 
addition-fragmentation chain transfer polymerization 
(RAFT) are increasingly being used in MIP synthesis. 



Molecularly Imprinted Polymers, Fig. 3 (a) SEM image of polymerization (Reproduced with permission from Curcio 
MIP nanoparticles imprinted for (R, 5)-propranolol prepared by et al. (2009)); (c) SEM image of silica-MIP particles imprinted 
precipitation polymerization (Reproduced with permission from for TNT prepared by the core-shell approach (Adapted with 
Yoshimatsu et al. (2007)); (b) TEM image of MIP nanoparticles permission from Gao et al. (2007). Copyright (2007) American 
imprinted for glucopyranoside produced by miniemulsion Chemical Society) 
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Areas for Commercialization of MIPs 

Despite the advantages of MIPs over biological 
receptors (Table 1), commercialization of the technol¬ 
ogy remains challenging. Opportunities include 
separations technology (SPE) and sensors. MIP-based 
SPE cartridges for selective extraction and cleanup 
of trace analytes in complex matrixes are currently 
sold by Sigma-Aldrich (http://www.sigmaaldrich. 
com/analytical-chromatography/sample-preparation/ 
spe/supelmip.html) and Polyintell (http://www. 
polyintell.com/products/affinimip-spe-cartridges/). 
These are available for several target analytes such as 
beta-agonists, beta-blockers, pesticides, and some 
drugs of abuse (e.g., amphetamine). 

Regarding MIP-based sensors, the most promising 
areas of application are in medical diagnostics and envi¬ 
ronmental analysis. One important niche area is the rapid 
detection of chemical and biological warfare agents for 
civil and military use. Optical sensors for the detection of 
fungal toxins have been developed by Toximet Ltd 
(http://www.toximet.com/index.htm), which detects 
fluorescent toxins absorbed onto MIP cartridges. 
MIP-coated wipes which change color in response 
to explosives are produced by Raptor Detection Tech¬ 
nologies (http://www.raptordetectiontech.com/?q=about- 
us). Several other companies are developing MIP-based 
sensors, including Sphere Medical (http://www. 
spheremedical.com/) and Semorex (http://www.semorex. 
com/index.html). This exciting science will undoubtedly 
lead to many new applications and products as the com¬ 
mercial potential of MIPs in realized. 
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► Pyranopterins 


Cross-References 

► Amperometric Biosensors 

► Molecular Recognition: Lock-and-Key, Induced Fit, 
and Conformational Selection 

► Potentiometric Biosensors 
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MRI and PET Imaging: Clinical 
Applications 

William Andrew Hallett*, Giulio Gambarota*, 
Rexford Newbould* and Paul M. Matthews*’^ 
'GlaxoSmithKline Clinical Imaging Centre, 
Hammersmith Hospital, London, UK 
^Division of Brain Sciences, Department of Medicine, 
Imperial College, London, UK 

Synonyms 

Magnetic Resonance Imaging (MRI); MRI 

Definitions 

Magnetic resonance imaging (MRI) is a noninvasive 
medical imaging technique used to characterize soft 
tissue structure, physiology, or function based on stim¬ 
ulated emission of radiofrequency signals after posi¬ 
tioning a subject in a high magnetic held. MRI is one of 
the most widely applied of all medical imaging tech¬ 
niques. Clinical applications are recognized in almost 
every medical specialty. 

Positron emission tomography (PET) is a nuclear 
imaging technique able to measure the three- 
dimensional distribution and concentration over time 
of molecules (radiotracers or radioligands) into which 
a positron-emitting isotope has been incorporated. 
Clinical PET radiotracers or radioligands are most 
typically administered intravenously and use isotopes 
that have relatively short half-lives (minutes to hours) 
of radioactive decay. PET is used in clinical medicine 
for diagnosis and treatment monitoring, particularly in 
oncology, but with potential for wider application. 

Introduction 

MRI relies on the weak interaction between a nuclear 
magnetic dipole (the proton in the case of all com¬ 
monly used medical applications) and an applied mag¬ 
netic held. The energy of the interaction is expressed as 
a resonance frequency and the amplitude of the 
radiofrequency signal is related to the concentration 
of the nuclear dipole. A magnetic held gradient that 


varies regularly over a subject can encode spatial infor¬ 
mation. Because the interaction is weak, useful signal- 
to-noise ratio is achieved within practical imaging 
times only with the use of high held magnets 
(1-3 T for most applications). In usual practice, instru¬ 
mentation relies on detecting signals from protons of 
water or fat in body tissues. Tissue contrast arises from 
tissue-specihc differences in water (or fat) content and 
differences in the proton nuclear spin-spin (T2) or 
spin-lattice (Tl) relaxation times. Investigations are 
believed to be without risk of long-term adverse con¬ 
sequences even after repeated studies. 

In clinical PET, a positron-emitting radionuclide is 
incorporated into a pharmaceutical-grade compound 
formulation. The isotope emissions with radioactive 
decay are related directly to concentration. This allows 
the spatial and temporal mapping of the compound or 
its products. An advantage of PET is that it is possible 
to label a molecule that is similar to (or - in less 
common applications - even chemically identical to) 
a molecule naturally found in the body and monitor it 
with such sensitivity that only microdoses need to be 
administered, so that physiological processes are not 
perturbed. 

Magnetic Resonance Imaging 
History 

MRI is based on principles of nuclear magnetic reso¬ 
nance described by Eelix Bloch of Stanford and 
Edward Purcell of Harvard and recognized with the 
joint award of the Nobel Prize in Physics to them in 
1952. The hrst description of MRI was in Paul 
Lauterbur’s (1973) report of a general approach for 
image generation using nuclear magnetic resonance 
(Budinger and Lauterbur 1984). Potential for clinical 
applications developed in several centers through the 
mid-1970s. Based on preclinical investigations, this 
broadened rapidly to include applications to the heart 
and brain. Eollowing initial investigations of hepatic 
pathologies at Aberdeen University, reports from 
London’s Hammersmith Hospital in 1981 (Doyle 
et al. 1981) illustrated the practical applications to 
neurological disease including the demonstration that 
inflammatory lesions of multiple sclerosis could be 
identified, which arguably marked the first clinical 
application for which MRI provided diagnostic infor¬ 
mation that could not be obtained by any other means. 
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Clinical applications subsequently grew rapidly with 
availability of commercial instruments from multiple 
manufacturers; by 2007, 27.5 million MRI examina¬ 
tions were being performed annually on over 11,000 
installed systems in the United States alone. 

Major Areas of Clinical Application 
MRI in Oncology 

The quest for a technique that would allow for nonin- 
vasive discrimination between cancer and healthy tis¬ 
sue was one of the major driving forces for the 
development of MRI. Early work of Damadian showed 
that water protons of malignant tissue had different T1 
and T2 relaxation times than healthy tissue (Damadian 
1971). Currently, in addition to signal contrast based 
on differences in T1 and T2 relaxation times, an array 
of techniques, ranging from diffusion-weighted imag¬ 
ing to exogenous contrast agent-enhanced MRI, are 
employed to discriminate cancer from healthy tissue. 
MRI is important as a diagnostic tool in oncology and 
is routinely used in clinical setting for detection and 
therapy monitoring of brain, liver, breast, pelvic, 
head and neck, and prostate tumors. 

Tumor detection with gadolinium- 
diethylenetriaminepentaacetic acid (Gd-DTPA) 
dynamic contrast-enhanced MRI (DCE-MRI) relies 
on Gd-DTPA leakage from hyperpermeable tumor 
vessels and subsequent accumulation in the tumor 
interstitium. As Gd-DTPA shortens the T1 relaxation 
time of water protons, the signal intensity of tumors is 
highly enhanced on T1-weighted images. Contrast- 
enhanced MRI not only improves tumor detection but 
also provides a means to characterize the tumor angio¬ 
genesis. Emerging hyperpolarization techniques for 
*^C enable a 5-6 order of magnitude enhancement of 
the MR signal (Golman et al. 2006). Real-time moni¬ 
toring of the conversion of [l-^^C]pyruvate into lactate 
promises the potential both for detection of certain 
cancers (e.g., prostate) and investigation of the meta¬ 
bolic processes within cancer tissue. 

MRI of the Central and Peripheral Nervous System 

Constrained by a rigid skull, large movements do not 
confound even more prolonged MRI brain acquisi¬ 
tions. This consideration, as well as the relative inac¬ 
cessibility of the brain for more direct clinical 
examination, makes MRI unparalleled in its ability to 


map brain soft tissue contrast due to its sensitivity to 
local chemical environments and flexibility in gener¬ 
ating contrast mechanisms. As a consequence, MRI of 
the central nervous system (CNS) has wide clinical 
application; the brain is the most common focus of 
clinical MRI studies in general medical settings. 

Structural MR of the CNS achieves tissue contrast 
by a combined weighting of relaxation parameters 
(T1 and T2) as well as the spin density to accentuate 
discrimination of different tissues, for example, white 
and gray matter. This can be achieved by modifying 
the timing of the image acquisition or by preparatory 
radiofrequency pulses (such as pulses that invert or 
destroy the effective magnetization) before stimulation 
for the final, observed signal. For example, signal from 
cerebrospinal fluid (CSF) can be removed from the 
image by inverting all the magnetization, and then 
waiting a specific time period for the CSF spins to 
approach their null point to create a Fluid-Attenu- 
ated-Inversion-Recovery (FLAIR) image. 

Physiological and microstructural measures are also 
possible with MRI. Bolus injection of an MR contrast 
agent such as Gd-DTPA can give measures of cerebral 
blood flow (CBF) and volume (CBV). If blood-brain 
barrier integrity is impaired, Gd-DTPA will accumu¬ 
late and increase signal on T1-weighted MRI. Strong 
motion dependence can be introduced to allow contrast 
based on the microscopic diffusion of water spins 
(DWI). The diffusion of water spins in neurons is 
restricted by cell membranes and by the myelin sheaths 
surrounding axons. DWI is very sensitive to cell swell¬ 
ing, as in ischemia. The preferential diffusion along 
white matter tracts can be tracked with diffusion tensor 
imaging (DTI), which allows the tracks of larger clus¬ 
ters of nerve fibers to be mapped. Magnetization trans¬ 
fer (MT) maps the MRl-invisible spins associated with 
larger molecules, such as proteins, based on informa¬ 
tion related to the chemical exchange of spins between 
the fluid and the macromolecular water pool. 

MR angiography (MRA) highlights blood in vessels 
at the expense of the surrounding tissue to provide 
a detailed image of vascular anatomy. MRA can rely 
on the fresh inflow of unsaturated blood spins, using an 
approach known as time-of-flight, imaging that does 
not rely on injection of a contrast agent or by mapping 
the very short T1 of blood after Gd-DTPA has been 
injected. MRA allows for noninvasive, non-ionizing 
evaluation of vascular malformations and the anatomy 
or the integrity of major vessels. 



1602 


MRI and PET Imaging: Clinical Applications 


Functional MRI (fMRI) enables an indirect measure 
of changes in brain neuronal activity based on associ¬ 
ated increases in blood volume, flow, and relative 
oxygenation. This “neurovascular coupling” is 
a consequence of presynaptic neurotransmitter release 
and thus reflects local neuronal signaling (Logothetis 
2003). Growth in research applications in cognitive 
neuroscience and related areas was rapid after its first 
demonstration. Routine clinical applications thus far 
have been limited to the localization of functionally 
eloquent brain to guide neurosurgical excisions of 
mass lesions or for language or memory lateralization 
in planning of temporal lobe surgery in the treatment of 
refractory epilepsies (Matthews et al. 2006). 

While most clinical use has been for study of the 
CNS, MRI also has applications to the peripheral ner¬ 
vous system (PNS). MRI can facilitate the noninvasive 
assessment of peripheral nerve pathologies, such as 
compressive and inflammatory neuropathies, and mon¬ 
itoring of peripheral nerve degeneration and regenera¬ 
tion processes. Longer nerve T2 relaxation times are 
a hallmark of nerve lesions and other PNS abnormal¬ 
ities. With recent improvements in both MR hardware 
and software, advanced imaging techniques, such as 
DTI also have been applied to PNS investigations 
(Yamashita et al. 2009). 

Cardiac MRI 

Clinical cardiac MRI (CMR) applications are growing 
on the basis of a now strong evidence base for their 
clinical utility and with the greater availability of MRI 
systems designed to meet the special demands of car¬ 
diac studies. CMR allows diagnostic mapping of the 
structure of the heart and the integrity of its major 
vessels. It also provides highly precise, quantitative 
measures of cardiac function (Bogaert et al. 2005). 

Structural CMR typically images the heart at 
a single timepoint in its cardiac cycle during a breath- 
hold by means of prospective gating of signal acquisi¬ 
tion to the electrocardiogram (ECG). Images are 
acquired at end-systole or end-diastole. A fast spin 
echo technique can image the chambers of the heart, 
the myocardium, and the larger vessels. When applied 
with a technique in which the blood signal is greatly 
suppressed (dark blood imaging), fine structures such 
as the heart valves can be visualized. State-of-the-art 
CMR can visualize vessels as small as the coronary 
arteries. Structural CMR makes use of contrast mech¬ 
anisms such as Tl- or T2- weighting and fat 


suppression (important because of the often substantial 
pericardial fat deposition) in order to characterize 
infarcts and functional impairments or for diagnosis 
of mass lesions. 

Functional CMR relies on a large variety of tech¬ 
niques. Many utilize time-resolved movies that are 
derived from multiple acquisitions using retrospec¬ 
tive gating, which acquires data throughout the car¬ 
diac cycle, and afterward partitions the data into 
a specified number of phases across one cycle. 
From these images, morphology can be defined. 
The maximal and minimal left ventricular volumes 
allow calculation of the ejection fraction. Sophisti¬ 
cated local chamber wall deformation analyses are 
possible with superimposed magnetic pulse “grid 
tags.” The injection of an MR contrast agent can 
further elucidate myocardial perfusion from the first 
pass kinetics of transit of an injected contrast bolus 
through the vasculature, as well as highlight areas of 
nonviable tissue from any delayed uptake of the 
contrast agent. 

Liver MRI 

Although a less major application area, applications to 
liver pathology have been important since the 
pioneering applications at the inception of clinical 
MRI. Masses such as cysts, hemangiomas, and tumors 
are readily visualized and discriminated based on the 
different MRI signal characteristics. Recent interest is 
extending to quantitative studies of hepatic fat content 
using magnetic resonance spectroscopy (MRS), which 
can measure relative fat and water concentrations. 
Hepatic water T2 changes provide an index of iron 
content. 

Lung MRI 

MRI was long viewed as unsuitable for clinical lung 
imaging because of the low water content and con¬ 
founds arising from field gradients at lung air-water 
interfaces. While CT remains the gold standard for 
structural diagnosis of lung disease, clinical lung 
MRI has been driven largely by interest in better char¬ 
acterization of pathological fluid accumulation, mass 
lesions, or perfusion defects. Modem MRI scanners 
can acquire structural images with fast sequences 
such as “single-shot fast spin echo,” as well as dynamic 
images with “rapid gradient echo imaging” enabling 
imaging during a short breath-hold or during normal 
respiration. 
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Contrast-enhanced techniques can be used to study 
the blood perfusion of the lungs, as well as to perform 
high-resolution angiograms of the lungs, such as to 
rule out pulmonary embolism or visualize the vessels 
and pleura. 

Certain stable gasses such as ^He and *^®Xe may 
have their MR signal greatly increased through spin 
hyperpolarization. MR imaging of the lungs during 
inhalation of such hyperpolarized gasses allows exam¬ 
ination of lung ventilation, perfusion, and microstruc¬ 
ture (Fain et al. 2007). This has led to studies of 
diseases such as asthma, COPD, and cystic fibrosis. 
Further development of hyperpolarized gas MRI could 
transform the role of MRI in the stratification and 
monitoring of respiratory disease (Kauczor 2009). 

Musculoskeletal MRI 

One of the earliest published MR images acquired in 
humans was the image of a wrist (Hinshaw et al. 1977). 
With excellent soft tissue contrast and potentially sub¬ 
millimeter spatial resolution, MRI provides detailed 
anatomical delineation of joints and is hnding major 
applications in sports medicine because of its advan¬ 
tages over other noninvasive imaging methods for 
assessment of cartilage integrity, as well as muscle 
and ligament tears. MRI is also of great value for 
clinical examinations of the spine for evaluation of 
suspected herniation, infection, or tumor infiltration. 
The potential to discriminate fibro-fatty replacement of 
muscle is useful for monitoring the progression of 
muscular dystrophies. 

PET 

History 

The use of a cyclotron to produce positron-emitting 
isotopes for medical use was first suggested in 1946, 
little more than a decade after their first development 
(lones 2002). Coincidence detection was used to 
image brain tumors in the early 1950s and the subse¬ 
quent development of PET as a research tool was rapid. 
From the mid-1990s, the use of PET became wide¬ 
spread clinically, largely based on the applications of 
a single radiotracer, the glucose analogue E] 
Eluorodeoxyglucose (EDG). Although EDG-PET is 
by far the most commonly used technique, the range 
of clinical questions that PET can address is extensive 
and new applications continue to emerge. 


F]Fluorodeoxyglucose (FDG) and its Clinical 
Applications 

EDG-PET has been used most widely in oncology for 
staging of disease, although a role for monitoring 
responses to therapy is recognized. FDG reflects glu¬ 
cose transport and phosphorylation, which are greatly 
increased in many cancers, providing a sensitive 
marker for primary or metastatic tumors (Czemin 
2004). In oncology, FDG-PET conveniently lends 
itself to a single time point acquisition (typically one 
hour post injection) and semiquantitative analysis, 
whereby tissue uptake is expressed relative to the 
injected activity per gram of tissue (SUV or standard¬ 
ized uptake value). However, increased FDG uptake is 
not specific to any single disease process and careful 
interpretation is required. FDG-PET has been applied 
in cardiology as an index of potentially reversible 
injury after myocardial infarction (Schelbert 2004), in 
neurology for the diagnosis and monitoring of reduced 
brain glucose metabolism in neurodegenerative disor¬ 
ders and as a marker of inflammation. 

Future Opportunities for Clinical PET 

PET radiotracers other than EDG have clinical appli¬ 
cations (Table 1). Of these, ['** E]Eluoro-L-thymidine 
(FLT), a biomarker for tumor proliferation, is probably 
the most widely used radiotracer after FDG, and is 
available commercially. [** FJFluoride bone imaging 
is used for detection of skeletal pathology. A number 
of molecules such as FJFlorbetapir are being devel¬ 
oped as PET markers for brain amyloid plaques to aid 
the early diagnosis of Alzheimer’s disease. Several 
major suppliers are working to provide commercial 
sources for an even broader range of clinical PET 
agents. 

A number of factors limit more rapid or widespread 
adoption of PET techniques in the clinic. Eirst is the 
need to demonstrate diagnostic value and cost effec¬ 
tiveness, given the considerable cost of a PET exami¬ 
nation. Access to PET radiopharmaceuticals also 
is restricted because of the short half-life of PET iso¬ 
topes and the cost and complexity of setting up 
a facility equipped with a cyclotron to produce 
radiopharmaceuticals. 

Changes in the technology could facilitate diffusion 
of applications across a wider user base. For example, 
both ^^Rb and ®*Ga generators are also now commer¬ 
cially available for PET applications. Use of these 
would be similar in concept, although costlier and 
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MRI and PET Imaging: Clinical Applications, Table 1 Some 
radiopharmaceuticals and their clinical applications (Adapted 
from Kitson et al. 2009, but see also http://www.ncbi.nlm.nih. 
gov/books/NBK5330/) 


PET radiotracer 

Clinical application 

N]Ammonia 

Cardiology: myocardial perfusion 

[18 F] 

Fluorodeoxyglucose 

FDG) 

Oncology, cardiology, neurology, 
psychiatry: glucose metabolism 

[** C]Methionine 

Oncology: protein synthesis 

['* C]Flumazenil 

Neurology: epilepsy - benzodiazepine 
receptor antagonist 

[18 p] 

Fluoromizonidazole 

Oncology: hypoxia as a response to 
radiotherapy 

[** C]Acetate 

Cardiology: oxidative metabolism in the 
heart 

['* C]Flumazenil 

Neurology, psychiatry: benzodiazepine 
receptors in the brain 

[“ C16-OH-BTA-1 
([" C]-PIB) 

Neurology, psychiatry: amyloid plaque 

[*** F]Flutemetamol 

Neurology, psychiatry: amyloid plaque 

[‘^ F]BAY94-9172 

Neurology, psychiatry: amyloid plaque 
and neurofibrillary tangles 

F]DOPA 

Neuropsychiatry: presynaptic 
dopaminergic function 

[*** F] Fluoride 

Oncology: bone scintigraphy 

1*^* Ga]DOTA 

Oncology: neuroendocrine tumor 
imaging 

[*^Rb]Rubidium 

Cardiology: myocardial perfusion 

[18 FjFluorotyrosine 

Oncology: amino acid uptake, protein 
synthesis 

[** C]Thymidine 

Oncology: DNA synthesis 

[18 p] 

Fluorothymidine 

(FLT) 

Oncology: tumor proliferation 

['’^Cu]ATSMor['**F] 

MISO 

Oncology: hypoxia 


less convenient, to the more common ^^Mo generator 
used in conventional nuclear medicine to generate 
Tc labeled radiotracers. They promise the potential 
for wider availability of a range of PET radiotracers 
based on commercial distribution of diagnostic mole¬ 
cules that then can be labeled with the positron emitter 
at the site of use (even with a small radiopharmacy). 

Recent Advances in Clinical PET Technology 

The most significant technological advance in PET 
over the last decade was the almost universal adoption 
of combined PET and CT scanners. With these, CT 
and PET scans are acquired sequentially. This 


development was driven both by the realization that 
the data needed to perform attenuation correction of 
PET data could be obtained by appropriate scaling of 
a conventional CT image rather than the rather low 
quality PET transmission scans that had been used 
before, and that the anatomical information provided 
by CT could aid localization of radiotracer 
information. 

Scanner manufacturers have more recently devel¬ 
oped a first generation of clinical PET-MR scanners 
that enable true simultaneous acquisition of data from 
both modalities. It is too early for clinical applications 
to have emerged. An exciting promise is the potential 
for accurate correction of artifacts in the PET image 
data that arise from patient motion, and in particular 
from respiration, a particular confound of PET-CT, 
should be possible through simultaneous monitoring 
of the MR signal. 

PET detection technology itself has continued to 
evolve. Until recently most commercial PET scanners 
used only a single time window to generate coinci¬ 
dence events. However, with improvements in detec¬ 
tor materials and the associated electronics, 
discrimination of single event time differences can 
be used to apportion positron emission events to 
zones within the scanner field of view to improve the 
image quality by the rejection of events that are not 
needed for image reconstruction. Advances in recon¬ 
struction theory, coupled with increasing computa¬ 
tional power, also have enabled better use of count- 
rate limited data to increase image resolution and 
decrease image noise by more sophisticated modeling 
of the data acquisition process. These advances will 
make clinical PET examinations both faster and more 
effective as diagnostic tools. 

Summary 

MRI and PET have demonstrated how even sophisti¬ 
cated biophysical measures can be translated into 
powerful clinical tools. The sensitivity and accuracy 
of diagnoses that they enable have contributed to more 
rapid and precise diagnostic evaluations. The methods 
have reduced the burden of invasive assessments sub¬ 
stantially for many areas of medicine. Both techniques 
also have added new quantitative and functional 
dimensions to the traditional focus of clinical imaging 
sciences on morphology. Advances in the range of 
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investigations and in their accessibility across the 

medical community continue to grow. 

Cross-References 

► Magnetic Resonance Imaging (MRI) Methodology 

► NMR 

► Positron Emission Tomography Methodology 
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mRNA Degradation in Prokaryotes 
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Synonyms 

RNA 

Definitions 

mRNA is a ribonucleic acid polymer used as a template 
for protein synthesis. 

mRNA degradation is the breakdown of the polymer 
as a consequence of hydrolysis or phosphorolysis of 
phosphodiester bonds. 

Basic Characteristics 

mRNA constitutes only a small fraction of total cellu¬ 
lar RNA, but it experiences the most intense metabolic 
activity. mRNA is rapidly turned over in comparison to 
other RNA species, such as tRNA or rRNA, and this 
instability is an important aspect of the regulation of 
gene expression. As mRNA serves as a template for 
protein synthesis, it is crucial to control the levels to 
match cellular requirements and to rapidly eliminate 
messengers which are faulty or no longer useful. Strik¬ 
ing the correct balance between synthesis and turnover 
requires controlled degradation of mRNA by ribonu- 
clease enzymes (RNases). 

Bacteria possess many RNases, which may over¬ 
lap or complement one another functionally 
(Table 1). Broadly, there are two types of enzymes 
responsible for RNA degradation: endoribo- 
nucleases, which cleave RNA internally, and 
exoribonucleases, which cleave from the 3' or 5' 
termini. These enzymes often exhibit processive 
behavior, meaning that they make several cleavages 
of the same substrate in rapid succession. 
Processivity depends on many RNA features, such 
as the presence of secondary structures and 
sequences favoring the cleavage; it is also influenced 
by ongoing translation and the presence of 
frani-acting molecules. Additionally, RNA can be 
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mRNA Degradation in Prokaryotes, Table 1 Main nucleases participating in mRNA decay in prokaryotes 


Name 

Substrate 

Characteristic 

Endonucleases 

RNase E 

ssRNA, A/U-rich sequence, preferably 
5' monophosphate 

mRNA inactivation and degradation, involved in sRNA-mediated 
mRNA decay 

RNase G 

ssRNA, A/U-rich sequence, preferably 
5' monophosphate 

RNase E paralogue, involved in the decay of transcripts encoding 
enzymes from central metabolism and mRNAs of heat shock and 
anaerobic growth proteins 

RNase III 

dsRNA 

mRNA degradation, involved in sRNA-mediated mRNA decay 

RNase P 

ssRNA, tRNA precursors 

Degradation with low efficiency of polycistronic transcripts; main 
role is processing of tRNA precursors 

RNase Y 

ssRNA 

mRNA inactivation 

RNase 

ssRNA, preferably 5' monophosphate 

mRNA inactivation and degradation 

Exonucleases 

PNPase 

ssRNA 

3' ^ 5' exoribonuclease, able to process through dsRNA structures 
when aided by RNA helicase, mRNA degradation, involved in 
degradosome assembly, addition of heteropolymeric tails to RNA 

RNase 11 

ssRNA 

3' —> 5' exoribonuclease, mRNA degradation, mRNA protection 
by poly(A) tails removal 

RNase R 

ssRNA 

3' —> 5' exoribonuclease, degradation of mRNA, also with strong 
secondary structures (able to process through dsRNA structures) 

Exosome 

ssRNA 

3' —> 5' exoribonuclease, mRNA degradation, addition of 
heteropolymeric tails to RNA 

RNase J1*,J2* 

ssRNA 

5' —> 3' exoribonuclease, mRNA degradation 


^Asterisks mark nucleases which possess both endo- and exonucleolytic activity 


marked for degradation through polyadenylation by 
poly(A) polymerase (PAP), which in Escherichia 
coli can add tails of 15-40 nucleotides. The targets 
of many ribonucleases have been identified by 
assessing the mRNA levels in specific RNase knock¬ 
out strains through microarray and deep sequencing 
analyses. 

Bacterial mRNAs often have a stem-loop structure 
on the 3' end that confers protection from 3' ^ 5' 
exoribonucleases. An internal cleavage, which pro¬ 
duces a free 3' end, can initiate mRNA decay. Access 
to internal cleavage sites may be blocked by translating 
ribosomes, but the RNA 5' end is vulnerable to 
endoribonuclease attack. Some RNases which are 
responsible for initiating mRNA degradation prefer 
a monophosphate on the 5' RNA end instead of the 
triphosphate present in primary RNA transcripts, and 
removing this cap may be the signal to initiate degra¬ 
dation in many cases (Condon 2009). 

Gram Negative Bacteria 

RNA degradation has been studied most extensively in 
the gram negative bacterium, Escherichia coli. The 
mRNAs in this organism each have a characteristic 


lifetime that can span from seconds up to 30 minutes. 
Their degradation starts usually with internal cleavage 
by RNase E, and less frequently by RNase III, which 
cleaves duplex regions. Mutagenesis studies showed 
that emerging RNA fragments are further degraded to 
nucleotides by 3' ^ 5' exonucleases, such as polynu¬ 
cleotide phosphorylase (PNPase), RNase II, and 
RNase R (Fig. 1). In the case of bacterial polycistronic 
transcripts, primary endonucleolytic cleavage may 
occur even before translation is completed on the 3' 
end of the mRNA. 

RNase E, the main endoribonuclease in E. coli is 
a large protein, and its C-terminal domain recruits 
other enzymes, including the ATP-dependent RNA 
helicase B (RhlB), PNPase mentioned above, and the 
glycolytic enzyme enolase. This multicomponent 
assembly, known as the RNA degradosome, functions 
in the turnover of the majority of E. coli mRNAs as 
well as processing of precursors for several structured 
RNA species (Fig. 2). In vivo evidence for the RNA 
degradosome assembly includes the observed 
colocalisation of its components by fluorescence 
microscopy, immunogold labelling, and freeze- 
fracture techniques (Carpousis 2007). 
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E. coli 


B. subtilis 






Archaea 



3'-5' exonuciease 
(Exosome) 


0 


3'-5' exonuciease 
(RNase R?) 

5'-3' exonuciease 
(RNase J?) 


(B 


Endonuclease 


5'-3' exonuciease RNA helicase 


3’-5' exonuciease flKyjj exosome 


Inorganic phosphate 


Poly(A) polymerase 

RNase J1/J2:5'-3' exonuciease, 
endonuclease 


mRNA Degradation in Prokaryotes, Fig. 1 Schematic comparison of mRNA degradation pathways in E. coli, B. subtilis, and 
Archaea. Question marks indicate stages of mRNA degradation for which the details are not currently clear 


For many substrates, RNase E internal cleavage is 
strongly dependent on the nature of the chemical group 
on the 5' end of the substrate. The presence of 
a monophosphate on the 5' end induces the cleavage, 
whereas triphosphate seems to protect the mRNA to 
a certain degree from RNase E attack. The crystal 
structure of RNase E catalytic domain provided insight 
in this mechanism and identified a phosphate binding 
pocket, which is responsible for allosteric activation of 
this enzyme. As nascent bacterial messages begin with 
triphosphate, it was proposed that removal of pyro¬ 
phosphate from the triphosphorylated transcript hy 
a pyrophosphohydrolase (such as RppH) triggers its 
degradation hy RNase E. Initial cleavage hy RNase 
E results in formation of a new 5' monophosphorylated 


RNA fragment, which in turn can trigger subsequent 
cleavage, and in this way ensures rapid and total decay. 
For some substrates, RNase E can bypass the 5' end 
requirement, and this is likely to involve recognition of 
structure in the RNA substrate outside of the cleavage 
site. RNase E prefers to cleave in A/U-rich single- 
stranded RNA regions that lack strong secondary 
structures. Such A/U-rich sequences are present in 
many E. coli mRNA ribosome binding sites; thus, the 
fate of a transcript may be governed by the rate of 
ribosome recruitment. Homologues of RNase E are 
found in many bacteria, as well as in plants, and 
degradosome-like complexes based on this enzyme 
seem to occur throughout the Enterobacteria (Condon 
2009; Carpousis 2007). 
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N-terminal 
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PNPase 

1021-1061 



segment B 
633-662 685-712 
(coiled-coil) 


segment C 
839-850 

C-terminal scaffold domain (CTD) 


segment D 
1021-1061 


intrinsically unstructured (except segments A-D) 


mRNA Degradation in Prokaryotes, Fig. 2 Diagram of 
RNA degradosome from E. colt. RNase E serves as a scaffold 
for other degradosome components, and their binding sites are 
indicated in the C-terminal domain. There are also two RNA 
binding sites (RBD, AR2) marked in this region. Segments A-D 


show parts of the RNase E C-terminal tail that have increased 
propensity for secondary structure formation. Segment 
A contains membrane attachment motif, segment B is a part of 
RhlB binding site and segments C and D are enolase and PNPase 
binding sites, respectively (Carpousis 2007) 


In E. coli and many other bacteria the double- 
stranded RNA endoribonuclease RNase III cleaves 
several mRNAs with duplex structure and favored 
nucleotide context. Furthermore, RNase III and 
RNase E were found to be responsible for mRNA 
degradation induced by base-pairing with small 
noncoding RNAs. It seems likely that RNase III is 
responsible for cleavage of duplex formed between 
cA-encoded sRNA and its target, which share long 
stretches of complementary sequence. Trawy-encoded 
sRNAs, which do not base-pair perfectly with their 
targets, are probably substrates for RNase E (Belasco 
2010 ). 

Inactivated by endoribonuclease cleavage, the 
mRNA fragments are further degraded by 
exoribonucleases. In E. coli three enzymes with 
3' —> 5' exoribonucleolytic activity important for tran¬ 
script degradation have been identified; RNase II, 
PNPase, and RNase R. 

RNase II is an abundant exonuclease without 
sequence specihcity that uses a hydrolytic cleavage 
mechanism yielding nucleoside monophosphate prod¬ 
ucts. The crystal structure of this enzyme has helped to 
understand its activity, which is strongly inhibited by 
double-stranded structures and is most efficient against 
poly(A) tails (Arraiano et al. 2010). RNase II not only 


participates in mRNA degradation, but may in fact 
protect some transcripts from degradation by removing 
their poly (A) tails. 

In E. coli the major exoribonuclease is PNPase, 
which uses a phosphorolytic mechanism to sequen¬ 
tially release nucleoside diphosphate products. PNPase 
is also inhibited by stem-loops on the substrate 3' end 
and requires a 3' single-stranded region to bind and 
efficiently cleave the RNA. However, this enzyme is 
able to degrade even double-stranded RNA molecules 
with the help of the RNA helicase RhlB. Another way 
for this enzyme to attain substrate access is through 
polyadenylation of the 3' end, which provides a recruit¬ 
ment site for PNPase. Under conditions of excess 
nucleoside diphosphate, limited phosphate and a little 
magnesium, PNPase itself is able to add tails to the 
transcripts. Insight into RNA recognition and catalytic 
mechanism of PNPase has been provided from crystal 
structures in apo- and RNA bound states. 

Structured RNAs are substrates for the exonuclease 
RNase R. The activity of this enzyme requires a 3' 
single-stranded overhang. RNase R was shown in 
in vitro cleavage assays and Northern blot analysis of 
E. coli wild type and mutants to be able to degrade 
RNAs containing repetitive extragenic palindromic 
sequences (REP) which form very stable stem-loop 
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structures in bacterial RNA. RNase R is widespread in 
bacteria, and in Pseudomonas syringae is part of 
degradosome-like complex, interacting with RNase 
E and RNA helicase. 

The hnal step of RNA degradation in many gram 
negative bacteria is decay of accumulated short single- 
stranded RNA fragments which cannot be processed 
by any of the endo- or exoribonucleases. These are 
degraded by oligoribonuclease, which cleaves the frag¬ 
ments with 3' ^ 5' polarity into mononucleosides 
(Condon 2009). 

Gram Positive Bacteria 

mRNA degradation in gram positive bacteria differs in 
many ways from the processes found in E. coli, includ¬ 
ing the use of a distinct group of ribonucleases (Fig. 1) 
(Condon 2009). 

A representative organism from this bacterial group 
is Bacillus subtilis. The first RNases described in this 
organism, which were considered as the main mRNA- 
degrading enzymes, were RNase J1 and RNase J2, 
functional homologues of E. coli RNase E, but belong¬ 
ing to a different structural family that bear no com¬ 
mon ancestry with the E. coli enzyme. Both enzymes, 
J1 and J2, possess endonucleolytic activity, but apart 
from that they are also 5' ^ 3' exoribonucleases. 
Moreover, it was shown that their activity is boosted 
by the presence of a monophosphate on the 5' end of 
the substrate, whereas 5' triphosphorylated RNAs were 
cleaved slower. The crystal structure of Thermus 
thermophilus RNase J has illuminated the catalytic 
mechanism and revealed the presence of a 5' binding 
pocket responsible for monophosphate preference. 
Secondary structures in B. subtilis mRNA occur on 
the 5' end of the transcript, in contrast to the 3' end as 
found in E. coli, and likely reflects the need for protec¬ 
tion of the mRNA from 5' exonucleases. RNase 
J homologues are widely distributed amongst the bac¬ 
teria and archaea. 

Initial cleavage of structured mRNAs in B. subtilis 
is performed by RNase Y. In vitro cleavage assays 
showed that this endoribonuclease is also sensitive to 
the status of the mRNA 5' end. Moreover, its activity is 
affected by the conformation of the transcript. Homo¬ 
logues of RNase Y have been identified in about 40% 
of sequenced genomes of bacteria. 

PNPase is also present in B. subtilis and may be the 
main exoribonuclease in this organism responsible for 
mRNA degradation. Analysis ofB. subtilis strains with 


mutated PNPase revealed effects on decay of many 
transcript breakdown products, but not their half- 
lives. This suggests that mRNA degradation in 
B. subtilis starts with endoribonucleolytic cleavage, 
inactivating the messenger. The decay in B. subtilis is 
probably completed by two enzymes which are func¬ 
tional analogues of oligoribonuclease from E. coli: 
NrnA and NmB, identified by complementation of an 
E. coli om (oligoribonuclease) mutant. Furthermore, it 
is possible that a degradosome-like assembly is formed 
between RNases Jl, J2, Y, and PNPase, as well as 
enolase and 6-phosphofructokinase inB. subtilis. 

Cyanobacteria 

Very little is known about RNA degradation in the 
Cyanobacteria, a group of bacteria that have capacity 
for photosynthesis. An RNase E/G-like enzyme was 
identified in Synechocystis sp. and was shown by 
in vitro RNA cleavage assays as well as analysis of 
mutants to be involved in the light response by cleav¬ 
age in an AU-box upstream of the ribosome binding 
sequence for specific transcripts. The Synechocystis 
genome encodes proteins which are homologous to 
PNPase, RNases R/II, and PAP, and transcript half- 
life in this bacteria is similar to E. coli. Also PNPase 
was discovered to be responsible for addition of 
heterogeneous tails found on RNAs. However, no 
degradosome-like assembly has been identified in 
Cyanobacteria (Arraiano et al. 2010). 

Archaea 

Many archaea were predicted to have an “exosome,” 
a multienzyme protein complex responsible for RNA 
degradation and processing (Fig. 3). The existence of 
such a complex in Archaea was first demonstrated in 
Sulfolobus solfataricus. The exosome complex is 
responsible for exonucleolytic 3' ^ 5' degradation of 
single-stranded RNA and addition of heteropolymeric 
tails to RNA. Crystal structures of the exosome core 
have revealed a ring-shaped structure with a central 
pore which resembles bacterial PNPase (Lorentzen 
et al. 2005). 

Not all members of Archaea have the exosome, and 
species lacking it also usually lack polyadenylation. 
However, a homologue of RNase R has been found 
among the members of this group, and it was proposed 
that this enzyme might substitute functionally for the 
missing exosome. Exosome assemblies are also found 
in the Eukaryota (Arraiano et al. 2010; Belasco 2010). 
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Toxin-Antitoxin Pairs 

It has been suggested that prokaryotic RNases do not 
distinguish between stable and unstable RNA, and that 
they degrade all RNA with permissive, unprotected 



mRNA Degradation in Prokaryotes, Fig. 3 Structure of 
exosome from Sulfolohus solfataricus. The view is along the 
axis of the channel that is the likely route for single-stranded 
RNA (3L7Z PDB file) 


structures. It is possible that some RNAs are degraded 
by different RNases depending on conditions or 
growth stage. Indeed, encoded in the prokaryotic 
genomes are toxins that are ribonucleases and which 
are responsible for global RNA decay under stress 
conditions. They are present in the cell continually, 
but are sequestered by an antitoxin that neutralizes 
their toxicity. However, under certain conditions, the 
antitoxin is degraded, and the released toxin may influ¬ 
ence many cellular processes. An example of such 
a toxin is RelE, which is found in gram negative and 
gram positive bacteria, as well as in Archaea. Northern 
analysis of E. coli RNA have shown that RelE cleaves 
mRNA within the ribosome at the coding region during 
amino acid starvation. Another well characterized 
toxin is MazE in E. coli. Toxin-antitoxin pairs are 
widely distributed throughout all prokaryotic lineages, 
reflecting an important role in selective fitness for 
response to environmental stress (Gerdes et al. 2005; 
Blower et al. 201 1). 

Perspectives 

Aspects of mRNA degradation continue to be discov¬ 
ered and new mechanisms revealed that regulate and 
coordinate this process (The main techniques used to 
investigate these processes are summarised in Table 2). 


mRNA Degradation in Prokaryotes, Table 2 Main biophysical techniques used in identification and characterisation of 
endo- and exonucleases participating in mRNA degradation 


Method 

Purpose 

Examples 

X-ray crystallography, 
NMR 

Determination of protein structure 

RNase E, PNPase, Enolase, exosome, RNase J, 
helicase, Poly(A) polymerase 

BIAcore, SPR 

Analysis of interactions between proteins or 
substrates 

Interaction of degradosome components with 
C-terminus of RNase E 

Microarrays and deep 
sequencing 

Analysis of gene expression changes 

Identification of genes influenced by activity of RNase 
E, RNase III 

Northern blot analysis 

Analysis of specific RNA levels in vivo 

Different RNases influence on specific transcripts 

Genomic mutants 

Analysis of effects of specific protein in vivo 

Identification of different RNases targets by 
determination of changes in mRNA levels in wild type 
and mutant 

Co-immunoprecipitation 

Identification of protein partners and 
associated RNA; complements deep 
sequencing to identify targets of RNA binding 
proteins 

Identification of degradosome components 

Protein engineering and 
purification 

Biochemical in vitro studies 

Purification and in vitro activity analysis of all RNases 

Microscopy methods: 
electron microscopy, 
fluorescence microscopy 

Protein cellular localization, identification of 
protein-protein interactions 

Confirmation of degradosome components 
colocalization in vivo, membrane localization of 
degradosome 

In vitro cleavage assays 

Analysis of protein activity 

Confirmation and identification of mRNA substrates 
of different RNases 
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Studies are most advanced in E. coli; however, 
despite knowledge about main pathways of RNA 
decay in this bacteria, novel pathways continue to 
be uncovered. There is growing awareness of the 
importance of noncoding RNAs for posttranscrip- 
tional regulation of gene expression involving 
mRNA degradation in prokaryotes. Deep sequencing 
methods have allowed the identification of many 
small noncoding RNAs (sRNAs), a great number of 
which still have unidentified function and mecha¬ 
nism of action. Furthermore, the activity of ribonu- 
cleases is in many cases modulated by other proteins, 
thus searches of protein partners, by bioinformatics 
studies as well as high-throughput in vivo methods, 
may allow identification of new networks and regu¬ 
latory pathways. Advances in studies of diverse 
microorganisms are providing further insight into 
mRNA degradation pathways and their evolutionary 
history. New protein expression and purification 
techniques provide a way of gaining biochemical 
and structural knowledge about proteins that were 
not previously amenable to investigation. Together 
with developments in X-ray techniques which 
enable structure determinations of challenging mac- 
romolecular complexes, detailed insights are likely 
to continue into the mechanisms and function of 
mRNA degradation, as well as many other related 
regulatory processes both in prokaryotes and 
eukaryotes. 

Cross-References 

► RNA Polymerases and Transcription 

► RNA-Binding Proteins - Catalytic Domains 
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Synonyms 

Directional mRNA transport; RNA 


Definition 

mRNA localization describes the asymmetric accumu¬ 
lation of mRNAs at subcellular sites and their local 
translation. 

Basic Characteristics 
Introduction 

It is often suggested that cytoplasmic proteins are 
generally translated throughout the cytoplasm. To 
date, however, numerous examples from fungi, inver¬ 
tebrates, and vertebrates are available where mRNAs 
are translated at subcellular sites (Fig. 1) (St Johnston 
2005). A genome-wide assessment of mRNAs during 






1612 


mRNA Localization 


mRNA Localization, 

Fig. 1 Examples of mRNA 
localization. Image of 
Xenopus oocytes: courtesy of 
Maike ClauBen and Tomas 
Pieler. The neuron image has 
been reproduced with 
permission from Tubing et al. 
( 2010 ) 



Bud-tip localization Localization of bicoid mRNA 

of ASH/mRNA at the anterior pole of 

in S. cerevisiae D. melanogaster embryos 



DEADSouth mRNA 
localization to the vegetal 
pole of X, laevis oocytes 


Localization of 
CaMKIla mRNA in 
mature hippocampal neurons 


the embryogenesis of the fruit fly Drosophila 
melanogaster showed that about 70% of all expressed 
transcripts exhibit distinct intracellular localization 
(Lecuyer et al. 2007). Although these numbers are 
likely to be lower for other organisms, this example 
provides strong evidence for the prevalence of mRNA 
localization. 

Local mRNA translation can be achieved by dis¬ 
tinct mechanisms. Transcripts can be actively 
transported to subcellular sites as part of large messen¬ 
ger ribonucleoprotein particles (mRNPs). Alterna¬ 
tively, mRNAs are floated through a cell by 
cytoplasmic streaming and localized by passive bind¬ 
ing to distinct subcellular sites. A third possibility is 
the selective degradation of transcripts in parts of the 
cytoplasm and asymmetric translation of the remaining 
mRNA. Since to date most examples involve active 
transport, this section will concentrate on motor- 
dependent mRNA localization. 

Visualizing mRNA Transport 

In 1983, in situ hybridization with radioactively 
labeled antisense DNA provided the first example for 
an asymmetrically distributed mRNA (Jeffery et al. 
1983). Since then, in situ hybridization has been 


dramatically improved and is still widely used (Weil 
et al. 2010). It has been adapted to fluorescence 
methods (Fig. 2), which allows for multiplexing dif¬ 
ferent RNA species, studying colocalization with pro¬ 
teins, and image quantification. However, a main 
disadvantage of in situ hybridization is the need for 
fixation of the specimen. Thus, it can only offer a static 
view of mRNA localization. 

One approach to directly observe mRNA transport 
is microinjecting fluorescently labeled RNAs into liv¬ 
ing cells and to trace their distribution by fluorescence 
microscopy (Fig. 2) (Weil et al. 2010). Although used 
successfully, microinjection also has disadvantages: it 
is limited to large specimen like Drosophila embryos 
and Xenopus oocytes, and bears some risks of artifacts 
by damaging cell walls during injection or overloading 
the cell with the synthetic RNA. 

A drastically different approach is the so-called 
MS2 system (Weil et al. 2010). Here the mRNA of 
interest is modified by inserting MS2-stem loops into 
its 3' untranslated region (Fig. 2). In parallel, the 
bacteriophage MS2-coat protein is expressed in 
fusion with a fluorescent protein (FP). This fusion 
protein binds to the MS2-stem loops in the modified 
mRNA. When both constructs are co-expressed in 
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mRNA Localization, 

Fig. 2 Commonly used 
techniques to visualize mRNA 
localization 


In situ hybridization: 

Cells are fixed and 
hybridized with labelled 
anti-sense probe 


RNA micro-injection: 

Cells are injected 
with labelled, in vitro- 
synthezised RNAs 


Observation 


antisense ^ 
probe 


Fixed samples (static): 
visualization of endogenous, 
localized mRNAs 



MS2 system: 

Cells are modified to 
simultaneously express 
(I) an mRNA fused to 
MS2 stem loops and 
(ii) a fusion of MS2-coat 
protein with a fluorescent 
protein {MS2-FP) 



Live imaging (dynamic): 
direct observation of the 
localization of MS2-FP 
decorated mRNAs 

Only tracing of modified 
endogenous mRNAs possible 


a cell, the modified mRNA is decorated with MS2- 
FP and can be traced by fluorescence microscopy. 
One great advantage of this system is that it can be 
used in tissues that are not well suited for microin¬ 
jection or in situ hybridization. A major drawback is 
that expression of both constructs has to be facili¬ 
tated in the same cell, which can be technically 
challenging. 

Examples of mRNA Localization 

Since mRNA localization is a widespread phenome¬ 
non, the examples provided below can only give 
a rather superficial overview. 

One of the best-studied localization events is the 
directional mRNA transport during mitosis of the yeast 
Saccharomyes cerevisiae (Fig. 1) (Zarnack and 
Feldbriigge 2010). Here, the ASHl mRNA is 
transported into the daughter cell, where it is locally 
translated. The protein product Ashlp ensures 
a distinct cell fate of the daughter cell. mRNA transport 


has also been reported in other fungal species like the 
pathogens Ustilago maedes or Candida albicans. 

Early Drosophila development also heavily relies 
on active mRNA transport and local translation 
(Fig. 1). During oogenesis, maternally produced 
mRNAs are transported into the maturing oocyte and 
anchored at particular sites of the egg (St Johnston 
2005). After fertilization, these mRNAs are locally 
translated. The resulting protein gradients provide 
positional information for the establishment of the 
embryonic body axes. Later, zygotic transcripts are 
also localized to provide asymmetric information on, 
for instance, apical-basal polarity. 

The oocyte of the frog Xenopus laevis is a favored 
model to study directional mRNA transport in verte¬ 
brates. During maturation, a set of mRNAs is localized 
to the vegetal hemisphere of these huge eggs (Fig. 1). 
One of the best-studied examples is Fgl mRNA. Its 
localization requires association with a RBPIVera 
protein-containing complex. 
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Fig. 3 Cartoon showing the 
major steps of mRNA 
localization 


Transcription 


Nuclear 

priming 


Nuclear 

export 


Cytoplasmic Membrane 

transport association & 

) translation 



Directional mRNA transport also occurs in somatic 
tissues of adult organisms. In neurons, mRNAs are 
localized to pre-and postsynaptic areas of axons and 
dendrites, respectively (Fig. 1). Several of these tran¬ 
scripts encode proteins required for neurogenesis or 
synaptic activity, such as CaMKIl-alpha, MAP2, and 
heta-Actin. For instance, in primary neurons heta- 
Actin mRNAs is transported to growth cones (Martin 
and Ephrussi 2009). Its stimulus-induced translation is 
required for growth-cone navigation. Consistently, 
interference with neuronal mRNA transport in mice 
impairs neuronal plasticity and long-term potentiation. 

The Inventory of Localizing mRNPs 

Affinity purification followed by mass spectrometry 
was used to obtain an inventory of localizing mRNPs. 
Besides the frequent copurihcation of molecular 
motors, a number of RNA-binding proteins, transla¬ 
tional repressors, RNA-helicases, and components 
of the exon-junction complex have been found. 
It has even been suggested that ribosomal subunits 
are incorporated into localizing mRNPs. 

Mechanisms of mRNA Localization 

mRNAs deemed for localization contain cA-acting 
regulatory elements, termed zip-code elements. 
They are bound by dedicated RNA-binding proteins 
and incorporated into large mRNPs (Fig. 3). Recent 
work from yeast suggests that localizing mRNAs are 
already bound co-transcriptionally by transport 


factors. A different study in Drosophila found that 
splicing of the Oskar mRNA at the first intron is 
required for its cytoplasmic localization (Martin and 
Ephrussi 2009). Although it remains to be shown 
weather other instances include similar nuclear prim¬ 
ing events, these examples indicate an importance of 
the nuclear history for the cytoplasmic fate of local¬ 
izing mRNAs. 

In the cytoplasm, these mRNPs are joined by 
motor proteins that mediate the motile activity for 
mRNA localization (Pig. 3). All three classes of 
motor, myosin, kinesin, and dynein are found in 
localizing mRNPs. Their specific roles during trans¬ 
port can be probed by the use of inhibitors and 
cytoskeleton-depolymerizing drugs, or by muta¬ 
tional analyses. Another approach is to compare the 
motilities and processivities of different motors 
measured in vitro with particle movements in vivo. 
As long as only a single motor is involved in mRNA 
transport, its identity can be anticipated from these 
parameters. However, often combinations of motors 
are found. 

Once these mRNPs reach their cytoplasmic desti¬ 
nation, they associate with membrane-proximal sites 
and translation is activated (Pig. 3). In particular, the 
attachment of localizing mRNPs at membrane sites is 
only poorly understood. Two studies in yeast and in 
chicken fibroblasts showed that translational activation 
after anchoring requires kinase activity (Martin and 
Ephrussi 2009; Paquin and Chartrand 2008). Local 
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phosphorylation of RNA-binding proteins reduces 
their affinity for target mRNAs and release transcripts 
for translation. 

Future Directions 

In only few examples like yeast, a small number of 
factors is sufficient to assemble into complete mRNA- 
transport complexes. Most mRNPs are more complex 
and involve many more factors. A main goal in future 
will therefore be to understand how these factors inter¬ 
act to form distinct mRNPs and what the compositions 
and divergent functions these different subtypes of 
mRNPs are. In vivo data suggest high mRNA specific¬ 
ity for most transport events. An interesting question 
will therefore be to understand how specific such 
mRNA complexes really are and how specificity is 
achieved on a molecular level. 

In future, advancements in imaging techniques will 
make it easier to trace individual molecules (see also 
Section on ► Single-molecule methods) and to study 
their behavior in a cell. These approaches will have to 
be used hand in hand with biochemical and biophysical 
studies to understand the roles of individual factors in 
the process of mRNA localization. Thus, a better 
understanding of mRNA localization, including its 
nuclear priming, the regulation of complex assembly, 
and anchoring after transport, will ultimately depend 
on our ability to combine such diverging but 
complementing experimental approaches. 

Cross-References 

► Fluorescence Labeling of Nucleic Acids 

► Single-Molecule Methods 

► Single-Molecule RNA FISH 
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Synonyms 

Polysaccharides 


Definition 

The use of physical and chemical methods to study 
mucins and mucin systems at different levels of orga¬ 
nization: molecular shape, aggregation, polymeriza¬ 
tion, gellation, and viscosity, all of which might 
underlie physiological functions. 
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Basic Characteristics 

The peptide cores of these ► glycoproteins undergo 
posttranslational glycosylation and commonly contain 
60-80% of their dry weight as carbohydrate. The gly- 
can chains are attached through serine or threonine 
residues in the variable number tandem repeat 
(VNTR) domains, a structural “fingerprint” of the 
mucins (Linden et al. 2008). General architecture is 
conserved in mucins: they are flexible linear polymers 
fitting a freely jointed chain model. Mature mucins 
divide into two groups: secreted and apical membrane 
associated (transmembrane, membrane-bound 
mucins) (Carlstedt et al. 1985). Complex networks 
are formed by multiple short- and long-range interac¬ 
tions, including cross-links through disulphide bridges 
between Cysteine-rich or Cysteine-knot sequences and 
von Willebrand C and D domains located at N- and 
C-terminal peptide domains and flanking the glycan 
carrying VNTR domains. Multiangle laser light scat¬ 
tering and ► circular dichroism indicate molecular 
weights of 20-100 x 10®, and 100-300 nm radius of 
gyration in dilute solutions; lengths up to 10 pm 
emphasize the need for biophysical methods in mucin 
study (Kesimer and Sheehan 2008; Kesimer et al. 
2010). As polyanions, mucin charge is largely dictated 
by their glycans, with sialic acids and glycosulphate 
esters contributing a dominant negative charge. 

Granule Structure and Secretion 

Storage granules of secreted mucins are spheroidal and 
not cylindrical as those of von Willebrand Factor, with 
which mucins share structural similarities. Because of 
the observed speed of unfolding and hydration on 
secretion (from milliseconds in slugs, to a few seconds 
in mammals), it is argued that mucin polymers are non- 
randomly folded in the granule. Polarized microscopy 
images of partially hydrated mucus granules revealed 
thread-like (nematic) topography interpreted as 
a reflection of supramolecular organization of the poly¬ 
mers. The molecular weights of newly secreted 
MUC5B granules, assessed by light scattering, indi¬ 
cate they contain 25-50 dimers, organized around 
10-15 nodes, as seen with >■ electron microscopy. 
These nodes are clusters of interacting NH 2 and 
COOH-domains (they bind colloidal gold, which is 
excluded from the glycosylated sequences). As nega¬ 
tive charges are borne by the glycan chains of the 
mucin molecules, on secretion, the exchange of Ca^"^ 


(the dominant intracellular ions) for two Na"^ ions 
occurs as the latter penetrate the granule. This 
increases osmolarity, causing swelling and ingress of 
water molecules that swell the gel and give access 
to the clustered terminal domains of mucins. Mucins 
then solvate and adopt linear configurations by a mech¬ 
anism that is not yet understood. 

pH Dependence 

The ionic composition and pH of the mucosal 
environment - the latter can vary between pH 1-8 - 
influence mucin conformation and aggregation from 
globular to extended, in a mucosa-specific manner. 
Through pH-sensitive domains of mucins, cervical 
mucus changes mesh size (500 ± 250 nm, ► atomic 
force microscopy (AFM) measurements) and appear¬ 
ance (fiber diameters 10-500 nm), to globular aggre¬ 
gates (600 ± 200 nm) in the periovulatory relatively 
alkaline conditions, permitting sperm penetration. 
Globules reorganize into a network at pH 6. 
A further example is the gastric mucus that displays 
a pH-dependent sol-gel transition from a viscous poly¬ 
mer solution to a soft gel as pH is lowered below pH 4, 
and vice versa as the pH is raised. This gel is visco¬ 
elastic, with the elastic component of the frequency- 
dependent viscoelastic shear modulus, G’((n) higher 
than the viscous component, G”{(o) over a wide range 
of oscillatory shear frequencies (co) (Boukari et al. 
2009; Vigil et al. 2009). 

Variations to the common architectural pattern 
might underlie distinct functions or pathological 
unfolding of the polymer. Additional and complemen¬ 
tary to electron microscopy, ► AFM resolves hydrated 
polymers to nanometer resolution: in equilibrium con¬ 
figurations mature, ocular mucins form flexible poly¬ 
mers with persistence length shorter than DNA of the 
similar chain lengths (Round et al. 2002) (see also 

► Polysaccharides: Biophysical Properties, 

► Glycoconjugates in Cell Function and Therapeutics, 
and ► Worm-Like Chain (WLC) Model). These 
mucins are decorated by short glycans, and regions 
tens of nanometers long are relatively naked, unlike 
the largest salivary mucins whose glycosylation pre¬ 
vents even small molecules from reaching the peptide 
core. Classic confocal microscopy, two photon tech¬ 
niques, and ► fluorescence correlation spectroscopy 
reveal the movement of (nano)particles and microbiota 
through mucin gels. Mobility reflects the local gel 
conformation, brought to light by HIV virions variable 
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confined movements in cervical gel, or shed light on 
the mechanism of translocation: increased pH caused 
by Helicobacter pylori metabolism literally liquefies 
the gel around itself (Celli et al. 2009). 

Adhesion and Surface Activity 

Adhesion to surfaces is possible through N and 
C termini and other poorly glycosylated domains 
(visualized by EM, colloidal gold) and also through 
oligosaccharides (shown by AFM), resulting in com¬ 
plex patterns of adhesion being dominated by their 
mucin component. Quartz crystal mass balance with 
dissipation (QCM-D) methods indicate that bovine 
submaxillary mucin adsorbs on gold as a flexible 
layer with extended loops and tails, while on hydro- 
phobic surfaces it forms a flat, compact layer. Early 
stages of ocular mucin adherence to hydrophobic or 
hydrophilic substrates, observed by AEM, fit random 
and cooperative sequential adsorption. Optical wave¬ 
guide lightmode spectroscopy (OWLS) revealed an 
anisotropic adsorption of this mucin, with signihcant 
adsorption-desorption hysteresis. Surface saturation 
concentrations are in the range of 2-5 mg/m^ for all 
mucins tested: the adsorbed mass is typically higher on 
hydrophobic than on hydrophilic substrates. Mucin 
layers repulse each other due to steric and electrostatic 
forces, attributes that likely contribute to 
biolubrication. This operates in boundary, mixed, or 
hydrodynamic regimes, depending on shear, viscosity, 
and layer thickness, with the overall effect being 
a substantial reduction of friction, and probably also 
of wear. This function of mucins is essential to the 
physiology of oral and ocular surfaces, and at bony 
interfaces in joints. In mucositis, where lubrication is 
deficient, the shear viscosity of whole saliva superna¬ 
tant is severely altered as is its birefringence in exten- 
sional flow. These results point to radical changes in 
mucin composition, concentration, or glycosylation. 

Aggregation at mucosal surfaces depends on 
mucin species and tissue-specihc glycosylation: self¬ 
aggregation is the rule; especially in the case of 
MUC5B, this results in macromolecules that require 
reduction for solubilization in chaotropic solvents 
(Hattrup and Gendler 2008). A range of other secreted 
molecules, thought to have functions in mucosal pro¬ 
tection, participate in macromolecular complexes with 
mucins. Important examples are the trefoil factor fam¬ 
ily peptides (TEE 1-3) and gastrokines, which are 
thought to have cross-linking properties that stabilize 


mucus gels. Companion molecules may shield mucin 
epitopes from external ligands, as shown by AFM and 
► atomic force spectroscopy mapping of sialic acids 
and their linkage - receptors of bacteria and viruses - 
in pure mucins and preocular mucin gels. At the sur¬ 
face of the preocular gel, one linkage is fragmented 
while the other is present in numerous large clusters. 
The fragmented linkage is dominant in purified mucins 
and, therefore, the discrepancy between gels and pure 
mucins suggests shielding by non-mucin components. 

Receptor-Mediated Interactions 

Pro-adhesive epitopes abundance and patterns may 
affect the affinity of bonds and facilitate or forbid 
cooperative binding, receptor cross-linking, and signal 
transduction. Through their highly-glycosylated 
domains (and vicinal to such domains in a network), 
mucins have structural motifs that might underlie 
entropic effects and microaffinity constant gradients 
in their interactions with galectins and selectins 
(see ► Lectins), as well as carbohydrate-binding pro¬ 
teins of microbiota. Large (summation of free energies 
of binding) affinity enhancements occur if the lectin 
itself is multimeric, with more modest enhancements if 
two lectins bind to the same glycosylated molecule. 
Isothermal titration microcalorimetry is a technique to 
directly measure the binding affinity, enthalpy 
changes, and number of partners in a forming bond 
(see also ► Macromolecular Crowding: Effects on 
Association Equilibria and Kinetics) (Dam and Brewer 
2010). It has revealed the effect of the density of 
glycosylation and glycan composition on the ► ther¬ 
modynamics of binding of the lectin soy bean aggluti¬ 
nin to pig gastric mucin analogues, where relative 
enhancements of affinity reached 10^. Binding of 
a lectin to a multivalent carbohydrate might involve 
jumps of the lectin from one sugar receptor to another 
before dissociation. Repeated bonds with receptor gly- 
cans on the same molecule and multivalent bonds with 
receptors on different molecules can, for membrane- 
associated mucins, lead to signal transduction. This 
would occur without mucins reorganizing on the cell 
surface, as occurs after loss of intercellular contact. 

Negative binding cooperativity, where binding to 
one receptor decreases affinity for another receptor on 
the same molecule, has been observed for the binding 
of galectins to asialofetuin and opens the possibility of 
decreasing microaffinity constants of microbial or 
immune cell lectins to mucins and mucosal surfaces. 
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Cross-References 

► Atomic Force Microscopy 

► Atomic Force Spectroscopy 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 

► Dynamic Light Scattering 

► Electron Microscopy 

► Fluorescence Correlation Spectroscopy 

► Glycoconjugates in Cell Function and Therapeutics 

► Glycoproteins 

► Lectins 

► Macromolecular Crowding: Effects on Association 
Equilibria and Kinetics 

► Mass Spectrometry of A-Linked Carbohydrates and 
Glycoproteins 

► Thermodynamics of Biomolecular Interactions 

► Worm-like Chain (WLC) Model 
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Multiangle Light Scattering from 
Separated Samples (MALS with SEC 
or FFF) 

Philip J. Wyatt 
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Synonyms 

Asymmetric flow Eield Elow Eractionation (A4F); 
Dynamic light scattering; Eield flow fractionation 
(EEE); Gel permeation chromatography (GPC); 
Quasi-elastic light scattering (QELS) Size exclusion 
chromatography (SEC) 

Introduction 

During the past few decades, analytical light-scattering 
techniques have grown dramatically in popularity and 
importance for virtually all fields where the measure¬ 
ment of molar mass and size is sought. This has been 
particularly evident for proteins and other hiopoly- 
mers. Playing a key role in this transition from tech¬ 
niques such as analytical ultracentrifugation to provide 
absolute mass and size were laser based instruments 
and tremendous advances in computers and their asso¬ 
ciated software. The relation between light-scattering 
measurements and the determination of molar mass 
and size from such measurements is reviewed and 
then extended to the determination of their associated 
distributions by fractionation methods followed 
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by light-scattering measurements. Two of the most 
popular fractionation techniques, size exclusion chro¬ 
matography (SEC) and field-flow fractionation (FFF), 
and its most popular variation, asymmetric flow FFF 
(A4F), are discussed together with the role they play 
when combined with light-scattering measurements. 
Several important exemplars are presented including 
the application of MALS plus separation to protein 
conjugates and the measurement of protein oligomer 
size using quasi-elastic light scattering (QELS) [often 
referred to as dynamic light scattering (DLS)]. 

A Brief Summary of Some Light-Scattering 
Theory Elements 

The interpretation of scattered light measurements 
from proteins, biopolymers, and other small particles 
in solution has long been understood and is described 
in a variety of classical papers and books (Zimm 
1948a, b; Wyatt 1993; Kerker 1969; Bohren and 
Huffman 1983; Huglin 1972; van de Hulst 1957). In 
general, it is assumed that these molecules are small 
compared with the wavelength of the incident light and 
may be well described using the so-called Rayleigh- 
Gans approximation (Bohren and Huffman 1983). 
Most commercial light-scattering photometers use ver¬ 
tically polarized incident light produced by a laser 
source. Thus, the electric field of the emitted plane 
polarized radiation is perpendicular to the plane of 
measurement as shown schematically in Fig. 1 . 

There are two general principles by which means 
the light scattered from a solution of molecules is 
interpreted. First, the intensity of the scattered light is 
directly proportional to the product of the scattering 
molecules’ weight average molar mass. Secondly, the 
variation of the scattered intensity with angle is 
a measure of the particles’ size. These principles are 
spelt out more specifically in terms of the so-called 
excess Rayleigh ratios to be defined presently. 

The light scattered by molecules in a solution is 
generally expressed in terms of the excess Rayleigh 
ratios. The Rayleigh ratio is best understood by reference 
to Fig. 2, showing the basic elements and structure of an 
individual detector. The sample is shown lying within 
and illuminated by a fine light beam of intensity /q. An 
encapsulated detector 1 (usually a transimpedance 



Multiangle Light Scattering from Separated Samples 
(MALS with SEC or FFF), Fig. 1 Schematic MALS measmement 
in a plane 



Multiangle Light Scattering from Separated Samples 
(MALS with SEC or FFF), Fig. 2 Basic elements and structure 
of an individual detector 


photodiode) lies in the plane of measurement (p = 0 
centered at the scattering angle 6 with respect to the 
incident light direction /q. The detector’s field of view 
has been restricted to view only the small volume Av 
indicated and collects light scattered from it into the 
solid angle dQ. This solid angle is defined by the colli¬ 
mator 2 that defines the acceptance angle 5 of the detec¬ 
tor, its light sensitive area A, and the distance d from this 
area to the scattering volume. The Rayleigh ratio R(0, (^) 
is defined as the ratio of the scattered light collected by 
the detector in the direction (0, (p), per unit solid angle, 
per unit volume of the scattering molecules, and per unit 
incident intensity thereon. As discussed above, measure¬ 
ments of macromolecules in solution are usually made in 
the single plane corresponding to (j) — 0, so we may 
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replace R{0, (j)) simply by R{9) [Note that very small 
contributions of scattered light are collected outside of 
the scattering that he within the small solid angle col¬ 
lected by the detector.]. For the case of molecules in 
solution, /?(6) refers to the excess Rayleigh ratio, i.e., the 
scattering of the solute molecules in excess of the scat¬ 
tering by the pure solvent: 

m = ^solution (^) ^solvent (0) ( 1 ) 


found that the calculations could be extended to 
a larger range of molar masses and concentrations if 
the reciprocal excess Rayleigh ratios were chosen. 
Forming the reciprocal of Eq. 2 yields: 


K*c 

m 


1 


MP{e)[l - 2A2McP{e)] 

1 


MP{6) 


2A2C-... 


(4) 


Zimm (1948a) developed a graphical method to 
extract the key molecular properties from measure¬ 
ments of unfractionated samples based upon two 
assumptions: (1) the molecules interact with other 
molecules in solution at a single point and (2) the 
concentration of the interacting molecules is “vanish¬ 
ingly small.” Under these conditions, the scattering of 
monochromatic light of vacuum wavelength io from 
solute molecules at vanishing low concentration c is 
given to order by the equation: 

^ = MF(0) [1 - 2A2McP{e)] (2) 

where K* = An^n^^{dn/tic)^ /and M is the 
weight averaged molar mass. P{d) is the form factor 
or “scattering function.” At very small angles, P{Q) 
relates the angular variation in scattered intensity to the 
root mean square radius Vg of the particle where: 

P{e) = 1 - 2i^{rf^/V. + ■■■ (3) 


h=\j ) > where ^ and 

= (47r//l)^sin^(6/2). A molecule is assumed to be 
comprised of n segments of mass m,, 1=1,2,..., n. 

Historically, light-scattering measurements were 
made over a range of angles 9j and concentrations c^. 
Once extrapolated to zero angle and zero concentra¬ 
tion, the graphical results yielded values of the weight 
average molar mass M, the second virial coefficient A 2 , 
and the mean square radius In recent years, this 
method has been replaced by means of a global ht to 
the data, eliminating the need to extrapolate while 
making use of all data collected to produce more accu¬ 
rate results. Nevertheless, because the graphical 
extrapolation methods were used exclusively during 
the early years of applying Zimm’s methods, Zimm 


Substituting Eq. 3 into Eq. 4 for small molecules, 

K*c 1 , ^ 

R{9) ~ MP{9) ^ 

Equation 5 forms the basis for the Zimm plot tech¬ 
nique (Zimm 1948a; Wyatt 1993) described in many 
texts. Suffice to remark that the technique involves 
plotting a linear combination of the two independent 
variables c, the concentration, and sin^|, against 
K*c/R{9). 

Figure 3 presents a simple 3-angle Zimm plot for 
a bovine serum albumin (BSA) sample. Note that the 
root mean square radius (derived from the slope at 

0 = 0 of the extrapolated c = 0 ht), 

an uncertainty comparable to its value suggesting that 
the measurement by MALS cannot resolve such small 
sizes. Indeed, at visible wavelengths conventionally 
used in MALS instrumentation, the minimum Cg 
values measureable are of the order of 10 nm. Most 
proteins, such as BSA, have radii of the order of a few 
nm. The second virial coefficient is derived from the 
slope of the line ht to the extrapolated set of 0 = 0 
values at the K*c/R{9) axis. Note also that the mass 
derived (from the intersection of the plot at the 
K*c/R{9) axis), about 77,000 D, is signihcantly larger 
than the known BSA mass of about 67,000 D. The 
difference between the measured and actual value 
conhrms the presence of aggregates in the sample. 
Indeed, once separated by the SEC or FFF processes 
to be described presently, the presence of 17% dimer 
in this sample is conhrmed. The Zimm plot procedure 
measures the weight average molar mass and, 
thereby, integrates the aggregate molar masses when 
it produces its hnal result. 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 

Fig. 3 Zimm plot of a protein 
with three angle MALS 
detector 


Zimm Plot (Kc/R) 



Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 4 DLS measurement of 
BSA dimer and monomer 
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A brief comment should be made concerning the 
sizing of such small molecules as proteins. Although 
they cannot be sized by MALS, they can be measured 
by dynamic light-scattering (DLS) methods (often 
called quasi-elastic light scattering or QELS). Such 
measurements yield a slightly different size character¬ 
ization, the hydrodynamic radius, Thus, following 
fractionation by FFF or SEC means, the sample flows 
through a MALS unit, one of whose detectors is 


replaced by a detector and corresponding processor 
commonly used for DLS measurements. The DLS 
measurements are done “on the fly” while the fraction¬ 
ated sample moves by. The DLS measurement 
measures the diffusion coefficients of the sample, 
slice by slice, from which the hydrodynamic radii are 
calculated from the Einstein-Stokes equation. Figure 4 
shows such measurement results from a sample of 
BSA. Note that DLS measurements were made over 
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very short time intervals as the peaks passed through 
the detector. Note that only the monomer and dimer 
components were sized owing to the insufficient 
amount of trimer present. Such on-line DLS measure¬ 
ments require considerably greater sample concentra¬ 
tions. Accordingly, for certain biomolecules for which 
very limited sample is available, such measurements of 
I'l, cannot be made. 

Although such Zimm methods are useful to deter¬ 
mine average values of M, A 2 , and far greater 

attention has been focused on the determination of the 
distributions of such quantities within a particular sam¬ 
ple. Thus, the samples have to be separated or frac¬ 
tionated into their constituents before light-scattering 
measurements are made. Not only would such prior 
separations permit detection of contaminants, but the 
presence of aggregates, such as discussed for the BSA 
sample just shown, may be readily detected as well. 
Since, furthermore, such separated fractions would be 
at extremely low concentrations, the interpretation of 
the measured values is simplified. 

Consider now a separation method that fractionates 
a sample into its constituent components. As men¬ 
tioned earlier, we shall focus on two specific separation 
mechanisms in this section: size exclusion chromatog¬ 
raphy (SEC) and field-flow fractionation (FFF) with 
particular emphasis on asymmetric flow FFF (A4F). 
There are certainly other types of separation mecha¬ 
nisms, such as reversed-phase chromatography, hydro- 
dynamic chromatography, and ion chromatography, 
but for biological molecules and particles, use of the 
aforementioned two methods has been growing most 
rapidly. Basically, these separation methods, SEC 
using columns packed with a variety of materials and 
A4F using selective diffusion properties balanced by 
opposing flows, are based on processes that separate by 
hydrodynamic size. Details of these processes will be 
described presently in sections “Size Exclusion Chro¬ 
matography (SEC) Separation” and “FFF Separation.” 

Quite similar to the Zimm analysis, the measurement 
of a small fraction (or “slice”) of the sample undergoing 
fractionation requires both a determination of the 
fraction’s concentration as well as measurement of its 
absolute scattered light properties as a function of scat¬ 
tering angle. Each such fraction is assumed to be mono- 
disperse. Naturally, this “purity” depends critically 
upon the degree of separation achieved which, 
for certain sample types and/or processes, may not be 
possible. For example, in the case of SEC, each 


separation column set used has some corresponding 
exclusion limit whereby molecules greater than 
a particular size cannot be resolved; they all elute over 
the same period of time. Other classes of molecules/ 
particles, whose compositions are different yet whose 
effective sizes are the same, may co-elute. These are 
very special cases that require further preparation for 
separation and will not be discussed further. 

From light-scattering and concentration measure¬ 
ments of each separated fraction, the corresponding 
molar mass and mean square radius may be derived. 
Once these properties have been measured for each frac¬ 
tion of the sample introduced, characteristic features of 
the sample may be extracted and a variety of deductions 
made. Rather than examine and process a two- 
dimensional Zimm plot, the extraction of molecular 
properties involves a simple plot of the reciprocal excess 
Rayleigh ratio, 1 /R{0), as a function of sin^(6/2). 

The importance of making measurements over 
a broad range of scattering angles cannot be over 
emphasized. For very small molecules, of course, 
where their mean size is very small compared with 
the wavelength of the incident radiation, one would 
not expect any variation with angle of scattered inten¬ 
sity; the particles scatter isotropically, i.e., for verti¬ 
cally polarized incident light, the scattered intensity at 
each angle is the same. Suitably weighted by their 
individual statistical fluctuations, the ensemble of 
such identical values yields a more precise average 
value than obtainable from measurement at just one 
angle. Indeed, a few commercial instruments, used to 
measure small biological particles only, detect 
scattered intensities at a single angle only, such as 90°. 

Further details of the on-line analyses will be 
discussed in conjunction with the data presented in 
the sections that follow. 

Experimental Preliminaries 

Figure 5 illustrates the means by which fractionated 
samples are produced and measured. The solvent, 
often phosphate buffered saline for proteins and simi¬ 
lar biological particles, is degassed and then pumped 
continuously by a commercial pump or similar constit¬ 
uent of a liquid chromatograph. The solvent then 
passes through a filter to remove dust particles and 
other debris that may have contaminated it during 
preparation. A sample aliquot is injected into the 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 5 Liquid 
chromatography schematic 
setup with MALS 



Columns 
(or other 
separator) 


solvent stream prior to passing through a separation 
device such as a set of SEC columns or other fraction¬ 
ation means. Once separated, the samples may enter 
a UV detector by which means its corresponding con¬ 
centration may be determined if its corresponding 
extinction coefficients are known. For species without 
UV absorption, there would be no UV signal. 

The separated sample then passes through a light¬ 
scattering photometer where scattered light intensities 
are measured at a plurality of angles, i.e., measure¬ 
ments of multiangle light scattering (MALS) are made 
(cf. Fig. 1). Each detector has its own corresponding 
collimated detector (cf. Fig. 2). 

Upon leaving the photometer, the sample then 
passes through a differential refractive index (dRI) 
detector where the concentration of each eluting frac¬ 
tion is determined by measurement of the incremental 
refractive index difference. An. Previously, the differ¬ 
ential refractive index increment dn/dc for the sample 
with respect to the solvent has been measured. Thus, 
the incremental concentration difference Ac between 
the fraction being measured and the solvent is calcu¬ 
lated from Ac = An/{dn/dc). Interestingly, for the 
majority of proteins, dn/dc « 0.185 to within about 
±5%. Thus, even if the extinction coefficient of 
a protein sample is not known, a value of its molar 
mass may be determined to within about 5% with this 
simple estimate of dn/dc. If dn/dc has been measured 
previously, then for pure proteinaceous samples, the 


UV detector is unnecessary. Alternatively, if the 
extinction coefficient of the protein being measured is 
known, then the dRI detector is unnecessary. For many 
types of biological materials that are not proteins, the 
dRI represents the method of choice for determining 
the concentration of such samples. 

Size Exclusion Chromatography (SEC) 
Separation 

The revision (Striegel et al. 2009) under the leadership 
of Andre Striegel of the classic text Modern Size- 
Exclusion Liquid Chromatography (by Yau, Kirkland, 
and Ely) marked a further confirmation that the sepa¬ 
ration technique would continue its significance when¬ 
ever sample separation was required prior to 
measurement. Although the original text was written 
almost entirely with the objective of measuring 
organic polymers, once the power of MALS measure¬ 
ments became evident, a great change occurred as 
molecular and protein chemists moved to SEC from 
earlier reversed-phase chromatography. The revised 
text confirms this. The basis for separation of molecu¬ 
lar species by SEC may be understood by reference to 
Figs. 6 and 7. The columns are packed with porous 
particles of selected sizes and porosity. As shown in 
Fig. 6, the larger the molecule, the less able it is to 
penetrate the porous particles. Thus, larger molecules 
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Multiangle Light Scattering from Separated Samples 
(MALS with SEC or FFF), Fig. 6 Passage of molecules through 
a column packed with porous particles 

pass through the columns more rapidly as they are 
increasingly excluded from the particle material. 
Once they have reached a great enough size, they are 
excluded completely. This size is referred to as the 
exclusion limit which depends on the type of columns 
or column set selected. The separation of molecular 
species depends upon their size. Figure 7 shows the 
progression of a sample comprised of two components 
of different size as it passes through a column packed 
with powder particles. The injected sample at a is 
highly localized; at b, the larger molecules in the 
sample begin to move ahead of the smaller compan¬ 
ions. By c, the larger molecules have reached the end 
of the column and begin eluting. By d, the smaller 
particles have begun to leave the column. Note that 
the initial injection band has now broadened and sep¬ 
arated. The Striegel text should be consulted for further 
details concerning the mechanisms by which SEC 


(often referred to as GPC, gel permeation chromatog¬ 
raphy) separates molecules, the bases for selecting 
columns, and means for interpreting the consequent 
separations of a so-fractionated sample. Often found 
nomenclature such as “plate count,” “plate height,” 
and “pore size” is explained in detail in that text 
together with a variety of measurements and their 
interpretations. 

At the time the original text was published, the use 
of multiangle light-scattering detectors to derive molar 
mass distributions was not possible as such photome¬ 
ters had not been introduced until about 1986. Never¬ 
theless, it seemed obvious that a natural extension of 
the Zimm plot technique to separated fractions would 
require only a single extrapolation to zero angle since, 
following fractionation, the concentrations at each 
slice would be vanishingly small and an extrapolation 
to zero concentration unneeded. However, because the 
beam diameter produced by typical laser sources of the 
period was very narrow, the measurement of scattered 
light at an angle very close to 0° became a practical 
objective. If the angle measured was small enough, 
then no extrapolation would be required and the mea¬ 
sured value could be used directly with negligible error 
for a range of molecular masses and sizes. 

Beckman Instruments introduced their low-angle 
device (Kaye 1973) in 1972 with a lowest angle attain¬ 
able of about 3.5°. Such low-angle scattering devices 
(later extended by Chromatix Corporation to about 
2.5°) were called low-angle laser-light-scattering 
(FALLS) photometers. The word “laser” was added 
to assuage the user who, from familiarity with previous 
classical measurements, was aware that conventional 
light sources produced beams far too large to be able to 
make measurements below about 20°. The combina¬ 
tion of SEC with low-angle light scattering had been 
achieved by Ouano and Kaye (1974) a few years before 
the original text and was referenced therein. 

The sensitivity of light-scattering measurements 
presents a problem for SEC, however, as many types 
of SEC columns have a tendency to shed, especially 
after changing pump rates, when the column set is new 
requiring many hours to equilibrate, responding to the 
pulse introduced with a sample as well as following 
power failures resulting in the need to restart the 
systems. Selection of suitable columns may well 
require additional efforts to locate columns that are 
free of such defects. Eigure 8 shows the excess Ray¬ 
leigh ratios as 12 angles following a SEC separation. 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 7 Separation of two 
different samples by SEC 
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FFF Separation 

Equivalent to columns in many respects are field-flow 
fractionation devices (Giddings 1993). The associated 
methods are among the most versatile families of 
separation techniques known. Though not commer¬ 
cially available until about 1991, they have become 
extremely important devices especially when used in 
combination with MALS detection. There was consid¬ 
erable early resistance to the use of MALS detectors 
with FFF separation because of the belief that the 
separation processes by FFF were so well accounted 
for by the theory that the eluting particle/molecule size 
could be determined solely by the time of appearance 
of a particular species at the outlet of the FFF channel 
following initiation of the separation process. Though 
shown not to be true, certain classes of particles were 
well described qualitatively by such an application of 
the theory. Most importantly, for both SEC and FFF 
separations, the insertion of a monodisperse sample of 
molecules or particles produces a broad band due to 


dilution and mixing effects spanning a range of elution 
times. Thus, even though each member of the sample 
may be identical, the actual times of elution will vary 
over a broadened sample peak. A theory that predicts 
the exact time of elution corresponding to each distinct 
species cannot describe well the different arrival times 
of identical particles. The phenomenon of such “band 
broadening” is discussed in the next section. 

A good introduction both to theory as well as the 
many types of FFF separation may be found in the 
Field Flow Fractionation Handbook (Schimpf et al. 
2000) containing a broad selection of topics by many 
of the early leaders in the held. A few comments and 
illustrations are appropriate as the FFF techniques are 
not generally as well known as SEC, and some rarely 
mentioned caveats will be discussed below. 

Figure 9 shows the schematic structure of the most 
commonly used FFF device, the asymmetric flow field- 
flow fractionator (A4F). The A4F channel is comprised 
of the following elements together with means to hold 
them together: 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 8 Excess Rayleigh 
scattering data shown for 12 
scattering angles 





1. A bottom assembly structure 1 holding a liquid- 
permeable frit 2 surrounded by a sealing 0-ring 3. 

2. A permeable membrane 4 that lies on the frit 2. 

3. A spacer 5 of thickness from about 75 to 800 pm 
into which has been cut a cavity 6. 

4. A top assembly structure 7 generally holding 
a transparent plate 8 of material such as Lexan® or 
glass. The plate need not be transparent for some 
implementations. 

The resulting sandwich is held together with holts 
12 or other means. The generally coffin-shaped or 
tapered cavity 6 in the spacer 5 will serve as the 
channel in which separation will occur. The top assem¬ 
bly structure 7 usually contains three holes, called 
ports, that pass through the top plate 8 and are centered 
above the charmel permitting the attachment of fittings 
thereto. These ports are: 

1. A mobile phase inlet port 9 located near the begin¬ 
ning of the channel and through which is pumped 
the carrier liquid, the so-called mobile phase 

2. A sample port 10 , downstream of the inlet port, into 
which an aliquot of the sample to be separated is 
introduced to the channel and focused thereunder 

3. An exit port 11 through which the fractionated 
aliquot leaves the channel near the end of the cavity 
A single pump, as used in the A4F-preferred 

embodiment, provides the mobile phase at the inlet 
port 9. The mobile phase is the source of two distinct 
flows: (1) flow through the frit-supported membrane 
producing a cross flow transverse to the injected sam¬ 
ple and (2) longitudinal flow parallel to the membrane 
and leaving the channel with the fractionated sample 



Multiangle Light Scattering from Separated Samples 
(MALS with SEC or FFF), Fig. 9 Main components of an 
A4F separator 

through the exit port 11 . Because of the small diameter 
of the outlet tubing as well as back pressure caused by 
detectors downstream of the channel, the impedance to 
the sample-containing channel flow is generally much 
greater than the impedance produced by the 
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frit-supported membrane and the cross flow 
therethrough. This cross flow is controlled by 
a needle valve or similar computer interfaced means 
housed in a remote control unit. The total flow that 
passes through the membrane 4 and, there from, 
through the supporting frit 2 is controlled and regulated 
by means of a remote valve attached underneath that 
controls the outflow through fitting 13 . 

Review of the channel structure shown in Fig. 9 will 
emphasize another major distinction between FFF and 
SEC. The relatively small surface area of the FFF 
device with which the sample may interact (the mem¬ 
brane of Fig. 9), compared with the many internal 
regions of a packed SEC column, can reduce signifi¬ 
cantly the possibility that some fractions of the sample 
are actually removed. An important element of any 
separation strategy (SEC or EEE) is the need to confirm 
after separation that all of the sample has been recov¬ 
ered. The total mass of the injected sample must agree 
with the total mass of the fractionated sample 
recovered. 

The A4E fractionator is considered a variation of 
the earlier developed symmetrical flow-field fraction¬ 
ator, EIFFE, often referred to at the time as the 
“universal” separator. It had a very steep learning 
curve and, compared to A4E, was extremely difficult 
to operate. In this earlier device, a cross flow is pro¬ 
vided to the channel hy a separate pump. Thus, each 
flow is produced by a separate pump providing, 
thereby, cross flow symmetry. Eor A4E, on the other 
hand, an effective cross flow is established by 
restricting the channel out flow relative to the input 
flow. The difference between the two flows becomes 
the effective cross flow. Because the A4E fractionator 
produces the two basic flow fields of the traditional 
ElEEE device with a single pump, many of the results 
characteristic of symmetrical cross flow EEE have been 
assumed operative for A4E as well. One of these, for 
example, is the so-called fractionating power F that is 
proportional to the product of cross flow times the 
square root of the ratio of cross flow to channel flow,/^, 
i.e., F ocfx X Y^. Since the source of the cross flow 
for the symmetrical fractionator is independent of the 
channel flow, both may be varied and would be con¬ 
stant over the channel length. Eor the A4E device, on 
the other hand, the channel flow always varies along 
the length of the channel and reaches a minimum just 
before the sample leaves the channel. The tapered 
coffin-like structure of the spacer 5 is intended to 


maintain the channel flow by compensating for contri¬ 
bution to the cross flow. However, a given profile 
compensates for only one particular pair of inflow 
and outflow rates. 

Because of the ability to program EEE devices, in 
many respects, each can serve the function of almost 
limitless sets of columns. Whereas SEC column sets 
elute the largest molecular species first with the 
smallest eluting last, the traditional operation of A4E 
and CFIEEF channels produce elution of the smallest 
species first. By varying the cross flow to channel flow 
ratio during the elution, larger species may be further 
separated from their small companions. Of course, 
such separations may well dilute the species present, 
and this may result in decreased signals as well as 
increased band broadening (see next section). Once 
a separation has been considered complete, the cross 
flow is generally reduced to zero so that any remaining 
fractions may be flushed out by the channel flow. 

Eigure 10 is a schematic of the flow and normal 
separation within an A4E channel. The smaller mole¬ 
cules diffuse deeper into the parabolic channel flow 
and elute earlier. 

Eigure 11 compares SEC and A4E separations for 
a vaccine comprised of a very broad protein conjugate 
complex. The range of molar masses present is huge 
with the A4E separation shown spanning a range from 
about 5 X 10"* to almost 2 x 10^. Although the set of 
columns used for the SEC separation of the same 
sample showed components as low as 5 x 10^, the 
so-called exclusion limit of the SEC columns could 
not resolve any components of mass above 8 x 10^. 
Thus, the ability of EFF separations to resolve sample 
fractions from both particles and molecules within the 
same sample Is another extremely important result. 

A bibliography of every known paper written 
concerned with every form of field-flow fractionation 
and their applications was initiated and developed by 
Dr. Mark Schure of Rohm and Haas. In 2008, 
he turned the project over to Wyatt Technology 
Corporation where it continues to be maintained and 
expanded at http://www.wyatt.com/component/option, 
comjumi/Itemid,! 16/flleid,7/. 

Band Broadening 

The concept of band broadening concerns the longitu¬ 
dinal spreading of a small pure sample introduced as 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 10 Schematic cross 
section of a A4F channel 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 11 Protein- 
polysaccharide conjugate 
vaccines 



a narrow bolus into a liquid chromatography system. 
Most of this spreading occurs within the column or 
A4F channel and arises primarily from dilution and, to 
a lesser extent, diffusion of the holus constituents. 
Quantitative measurement of such spreading is 
performed in order to obtain a measure of the effects 
of the separating structure upon a homogeneous sam¬ 
ple. Assuming that the separation of a more complex, 
heterogeneous sample is successful and the sample 
constituents are resolved thereby, the type of hand 
broadening of particular importance for the subsequent 
determination of the mass and size distributions 


present in the sample will be that caused by the various 
detector components themselves. 

An essential requirement for the successful sepa¬ 
ration and subsequent analysis of a separated sample 
is the ability of each detector required for this process 
to yield its expected measurements with minimal 
error. Thus, in order to calculate the molar mass 
distributions present, for example, in a separated pro¬ 
tein sample, the accurate measurement of the concen¬ 
tration corresponding to each separated element 
whose excess Rayleigh ratio has been measured is 
essential. 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 12 90° excess Rayleigh 
ratio and differential 
refractometer measurements 
of an eluted sample 
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Figure 12 shows a 90° excess Rayleigh ratio mea¬ 
surement of an eluted monodisperse sample 
superimposed upon the response of differential refrac¬ 
tometer concentration detector. The concentration pro¬ 
file has been normalized and displaced in elution 
volume; so the peaks occur at the same volume. Note 
that the full width of the peak at half maximum, nor¬ 
malized to the peak height, is broader for the down¬ 
stream dRI detector. For monodisperse samples, they 
should be equal since the dRI signal is proportional to 
the concentration c while the excess Rayleigh ratio is 
proportional to me and m is constant across the peak 
[This will not be the case when the sample measured is 
polydisperse since, having compensated suitably for 
the delay between the two detectors, the peak of the 
excess Rayleigh ratio generally will occur before the 
peak of the concentration signal]. Note that because of 
band-broadening effects, the concentration peak is 
broader than the light-scattering peak. 

Consider a separation that produces resolved 
peaks corresponding to separated fractions present 
in the sample. An example is shown in Fig. 13 with 
the excess Rayleigh ratio measured at the 90° 
scattered light detector as a function of elution 
volume for bovine serum albumin sample (BSA). 
Also shown is the signal produced by the down¬ 
stream differential refractive index detector (dRI) 


with this signal directly proportional to the sample 
concentration at that slice. Referring back to Fig. 5, 
the sample has been separated by the SEC columns 
which are followed by the light-scattering detector 
and the dRI detector. A UV detector is not present. 
The signals from these two detectors have been 
combined by aligning the peaks of each signal. 
Because of the downstream sample dilution and 
diffusion, the dRI signal shows a peak broader 
than the MALS peak. Combining the MALS and 
dRI signals to produce the associated molar mass 
as a function of elution volume yields the molar 
mass results shown graphically in Fig. 13, a clear 
distortion of the expected monodisperse nature of 
the separated protein species. 

The “grimace” appearance of the molar mass dis¬ 
tribution shown in Fig. 13 arises because the dilution/ 
diffusion effects have resulted in an apparent concen¬ 
tration near the peak that is too small, whereas in the 
wing regions, the concentration appears to be too large. 
For the case of the BSA protein shown, rather than the 
flat line, characteristic of a monodisperse sample, an 
artifactual distribution of masses is produced. Integrat¬ 
ing the signals over the complete peaks, however, will 
yield the correct weight average molar mass of the 
monodisperse protein (see section “Mass Recovery 
and Distributions”). 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 13 Excess Rayleigh 
ratio at 90° and calculated 
molar masses from scattering 
data and band-broadened 
concentration 
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For the BSA sample analyzed above, it had been 
assumed that the scattered intensity was isotropic, i.e., 

« land the scattered intensity at all angles is 

the same. Thus, only a single angle measurement was 
required to complete the analysis. For larger biological 
molecules and particles, when is no longer very 
small and scattering is not isotropic, a global fit to the 
collected excess Rayleigh ratios Rj{Q°) must be made 
for each slice j from which the molar mass and mean 
square radius may be extracted. For very small mole¬ 
cules, however, the measured data at all angles may be 
used to increase the precision of the extracted molar 
mass values. Since measurement of R{0j) at each angle 
9j should be the same except for experimental errors, 
the weighted average of all such data for such 
isotropically scattering molecules should yield 
a more precise value. 

Returning to Fig. 13, the BSA sample, separated by 
SEC columns, shows the presence of both dimer and 
trimer aggregates as well as unresolved higher aggre¬ 
gate states. It cannot be overemphasized that band¬ 
broadening artifacts such as shown in Fig. 13 are 
unacceptable artifacts of an appropriate separation 
process. They must be corrected. Despite a variety 
of methods (Huglin 1972) directed to correcting such 
artifacts, it was not until Trainoff (2008) developed 


appropriate analytical means in 2003 that the problem 
was considered solved. It is essential that the analyt¬ 
ical processing of MALS data collected from protein 
and related biological samples includes such correc¬ 
tion for band broadening. Figure 14 shows the same 
data of Fig. 13 corrected for band broadening which is 
in complete agreement with the expected monodis- 
perse nature of the BSA monomer, dimer, and trimer 
species. 

In the event that a protein sample is fractionated and 
only the UV detector of Fig. 5 is used to measure 
concentration, the band-broadening effects may not 
even need correction since the dilution of the sample 
within such a detector is generally very small due to the 
very small volume of the detector cell itself. For many 
such cases, the correction may be ignored. However, 
once a downstream dRI detector is employed (or other 
downstream devices such as differential viscometers), 
such correction is essential. The need to correct for 
band broadening should be immediately evident upon 
examination of the fractograms generated. 

Another most interesting result, once band broad¬ 
ening has been suitably corrected, is shown in Fig. 15. 
Shown there is an A4F separation of native keyhole 
limpet hemocyanin (KLH), an extremely large, hetero¬ 
geneous glycosylated protein consisting of subunits 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 14 Excess Rayleigh 
ratio at 90° and calculated 
molar masses from light¬ 
scattering data and 
concentration after correction 
for band broadening 


molar mass vs volume 


^^cattehn^ngl^^egrei^^^^^^moia^ais^ia^umbe^^ 



Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 

Fig. 15 Keyhole limpet 
hemocyanin separated by A4F 



with a molecular weight of 350,000 and 390,000 in 
aggregates with molecular weights of 4,500,000- 
13,000,000. KLH is extensively used as a hapten car¬ 
rier to produce antibodies against them. Of particular 
interest in this sample is the predominant fraction 
whose molar mass is about 8.3 million Da. Without 
the appropriate correction for band broadening, this 
important result would not be so clearly evident. 


Mass Recovery and Distributions 

Because of the possibility that small fractions of bio¬ 
logical samples may adhere to various column packing 
materials and/or A4F membranes due to certain sol- 
vent/mobile phase conditions, it is very important to 
account for any mass discrepancies that may exist. The 
easiest way to check for such discrepancies is to 
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compare the total eluted mass (over all species) with 
the mass of the injected sample. At each slice j of 
volume Avj, the concentration (band broadening 
corrected) is Cj [Although it is standard practice to 
express the liquid chromatography (LC) output chro¬ 
matogram in terms of the eluted volume per slice, Avj, 
it is often expressed in units of time. Given the LC flow 
rate, /, and the slice collection time. A/), the two are 
related simply by Avy =/Afy]. If the volume of each 
slice is the same, i.e., Av, then the total mass eluted. 
Me, is just the sum of the masses eluted at each slice: 

N N 

i j 

where N is the total number of slices collected. This easily 
determined quantity is then compared with the mass 
injected M/ = V/C/, where V/ is the sample volume 
injected at concentration Cj. Surprisingly, this simple 
calculation is often forgotten, sometimes with extraordi¬ 
nary consequences. As mentioned earlier, the greater 
surface areas of columns relative to the membrane surface 
of A4F devices can result in significantly changed results 
for the same sample fractioned by the two processes. 

Following fractionation and measurement, the mass 
distributions of the species separated may be obtained 
directly. This type of information is particularly useful 
for the quantitative study of aggregates often present in 
protein and other biological samples. The weight 
average mass of molecules in a particular peak or 
otherwise defined region of the chromatogram is 
given simply by: 


P = 

_ 

Eg 


^ E cm j Ycj 
Eg/EgM 
EgM 

Eg 


(9) 


Note that if /7j, is constant over the region measured, 
the polydispersity, P —\. Thus, the peaks corrected in 
Fig. 14 confirm the monodispersity of the monomer 
and its two aggregates shown. 

Consider now the elution of an arbitrary sample 
comprised of a range of molar masses. Of considerable 
interest is the so-called cumulative weight fraction C{m) 
that corresponds to the weight fraction of the sample 
whose molar mass includes all samples up to m. The 
sample referenced, of course, may correspond to all the 
slices collected or just to an explicit region within the 
total collected. From the measured (»?, , c,) pairs at each 
slice i, the cumulative weight fraction is simply: 


E Eim 

- (10) 

E Em 

m 

where the summation over slices is indicated by the 
indicia shown. Thus, the collection is over N slices 
with the mass at the first slice indicated by m\ and the 
last by mN. Note that the differential weight fraction at 
each slice is just: 


AC(m,) 


cm 

~N 


E Eimi 


( 11 ) 


E Em 


Eg 


(7) 


where the summations are performed over the slices in 
the region or peak selected. Similarly, the number 
average molar mass may be calculated for the same 
region/peak from: 


E njmj Eg 
x;«/ E Eilnii 


( 8 ) 


since wt/n,- = C/. 

The polydispersity of molecules in a selected region 
is just the ratio of Eqs. 7 and 8 summed over the 
collected slices in that region, namely. 


i.e., the weight fraction of a sample whose mass is m. 
Note that during a collection over a region divided into 
discrete slices, the contributions at a particular mass 
may come from several slices that may not even be 
contiguous. A traditional means to display such results 
is based on “binning” into size intervals and then 
plotting the results. The resultant appearance may 
appear quite irregular because of the statistical fluctu¬ 
ations associated with each slice measurement. 
To obtain a smoother and more realistic differential 
distribution, the cumulative distribution is first 
obtained with the resultant smoother function being 
differentiated to yield the differential distribution. 
Numerous other analytical techniques are employed 
to obtain such results. 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 16 Cumulative 
distribution of three proteins 



The cumulative number distribution, whereby the 
distribution reflects to relative number of molecules at 
a particular mass, is calculated similarly as: 

m 

Y, ninii 

CnH = - ( 12 ) 

Y "/"b- 

nil 

Referring to Fig. 14, the corresponding weight frac¬ 
tions for the monomer, dimer, and trimer fractions are 
easily shown to be 92.4%, 6.65%, and 0.95%, 
respectively. 

Figure 16 compares the cumulative weight fraction 
of three different proteins collected over the entire 
eluting samples. Differences in their relative aggregate 
components are obvious. 

Protein Conjugates (Complexes) 

An important class of biological molecules Is comprised 
of complex, or conjugated, structures such as the so- 
called protein conjugates. Such molecules share many 
of the characteristics of classical copolymers, i.e., mol¬ 
ecules comprised of two distinct and different struc¬ 
tures/materials. As a first example, many proteins 
function properly only when they are glycosylated, i.e., 
subject to an enzymatic process that attaches glycans to 
them. Such glycosylated molecules may be formed also 
from lipids and other organic molecules. Some proteins, 
for example, do not fold correctly unless they are 


glycosylated first. Though not a strict requirement for 
folding, unglycosylated proteins often degrade quickly. 

Another form of such biopolymers is a pegylated 
complex. Here, a polyethylene glycol (PEG) molecule 
is attached covalently to another molecule such as 
a drug or therapeutic protein. Such complexed mole¬ 
cules play many important roles in the treatment of 
disease by imparting significant pharmacological 
advantages over the unmodified form. Conjugation of 
a drug with PEG can impart extended life of the drug in 
the circulatory system, improved drug solubility, 
increased drug stability, and reduced dosage frequency. 

Making MALS measurements on fractionated 
complexed molecules allows one to determine the 
distribution of the conjugate with respect to targeted 
protein or other biomolecule. Controlling the range of 
such PEG adjuncts certainly affects the therapeutic 
properties of the conjugated complex, and controlling 
and monitoring the amount of PEG attached becomes 
an essential element for the production and quality 
control of the manufactured drug complex. Similarly, 
the basis for understanding how glycan attachment can 
affect protein function requires an understanding of the 
range of glycan mass attached and how such conjuga¬ 
tion affects size as well as function. 

Eractionation of a conjugated biopolymer by SEC 
or EEE means depends generally on the molecule’s 
effective hydrodynamic size. As long as the core pro¬ 
tein or biomolecule to which the conjugate will adhere 
is the same for the entire fractionated sample, the 
interpretation of the measurements is quite simple 
since the effective hydrodynamic size of the complex 
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depends only on the amount of material added. On the 
other hand, complex particles may themselves form 
aggregates whose different sizes should permit their 
detection and characterization following fractionation. 

Both a differential refractive index (dRI) detector and 
a UV detector are required to characterize fractionated 
hiopolymer complexes per the setup shown in Fig. 5. As 
an example of, say, a protein to which is attached a glycan, 
the differential refractive index increment of both the 
protein and the glycan, (dn/dc)^ and (dn/dc)^, respec¬ 
tively, must he known a priori. In addition, the 
corresponding extinction coefficients Sp and Sc must also 
he known. All of these quantities should be determined at 
the wavelength of the MALS instruments being used. 
Note that most proteins have a {dn/dc)p value close to 
0.185. Without having to measure such a value for each, 
this value will generally suffice to within an error of ±5%. 

For a slice j containing a fraction of the complex, 
the effective differential refractive index increment 
will be: 




^Pj) 


(13) 


where Wpj is the weight fraction of the protein compo¬ 
nent and the concentration c, = Cpj + Ccj is the sum of 
the constituent protein and conjugate concentrations. 
Similarly, the corresponding extinction coefficient for 
the protein and the attached conjugate is: 


(14) 


At a fractionated slice j, the Zimm formalism using 
a dRI detector to measure concentration becomes for 
each angle 0, : 



4ti^ 

m) 

N^4Rj{e^) [ 

1 


^pj + 


{dc)p^PJ + 


Mpi+Mcj 


l+^(r2)sin20,./2 


'2A2C 

(15) 


since the concentrations are vanishingly small after 
fractionation and, therefore, 2 A 2 C << ^ | ^ . If the 
conjugates are greater than a few nm in size, one may 


extract their mean square radius, (I'g), from the slope 
of ijRj^Qi) versus sin^(0//2). 

The concentration measured by the dRI is just: 


^ — ^pj T ^cj — ^dRlJ — 


SdRIj<^dRI 



(16) 


where its signal at slice / is SdRij and oCdRi is its calibra¬ 
tion constant. The concentration measured by the UV 
detector at slice j is: 


_IuVj<^UV _ luvj^uv 

eL {^pCpj + ScCcj)L 

where its signal at slice j is luyj, the protein concentra¬ 
tion is Cpj, the conjugate concentration is Ccj (only 
measureable by the UV detector if the conjugate is 
detectable by it with a corresponding extinction coef¬ 
ficient Ec), and L is the cell path length. 

The concentrations of the two components 
Cpj and Ccj are determined by combining Eqs. 16 and 
17. From Eq. 15, the molar mass of the species in the 
slice Mpj + Mcj is determined. The individual contrib¬ 
uting masses in the slice j are then extracted by noting 
that the number of protein molecules rip present in 
a slice is equal to the number of conjugate groups 
that will form the protein conjugate. Therefore, since 
Cp = ripMp and Cc = HcMc and Up = He, 

Mp/Mc = Cp/Cc. (18) 

The foregoing analysis is certainly one of the most 
important results of the combination of light scattering 
with fractionation. It cannot be overemphasized that 
any band-broadening effects present must be corrected 
if meaningful results are to be derived. 

Consider now a few examples. Eigure 17 shows the 
UV trace of an SEC-separated protein complex com¬ 
mon to both a mammal and an insect. The degree of 
glycosylation differs markedly, however, as evident 
from Eig. 18 that shows the dRI trace that is then 
combined with the data of Eig. 17 and the MALS 
measurement (not shown). The combination of these 
three results (UV, dRI, and MALS) is then used to 
calculate the corresponding molar mass of the common 
protein and the two different associated glycan distri¬ 
butions. The MALS results at each slice j of Mp, + Md 
were obtained from the intercept at 0 = 0 of the global 
fit over all angles i. The protein masses derived are 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 17 UV trace of SEC- 
separated protein complex 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 18 Molar mass 
distribution of protein 
complex 


Insect & Mammalian 




time (min) 


virtually identical, while the degree and amount of 
glycosylation present in mammal and insect differ 
significantly. 

Figure 19 shows similar results for a protein of 
mass 18.2 kD pegylated with a branched PEG of 
mass around 18 kD. The resultant conjugate has 
a mass around 34 kD, suggesting a degree of 
pegylation of about 50%. 

An example of a protein-detergent complex pro¬ 
duced during the isolation of a specific membrane 
protein is shown in Fig. 20. Such membrane proteins 
are only soluble in the presence of detergent micelles 


that form the resultant nearly homogeneous complex. 
A BSA monomer is shown also for comparison. 

Future Applications/Opportunities and 
Comments 

There are many other types of molecules and small 
biologically conjugated particles whose separation and 
subsequent classification may be performed with 
MALS combined with fractionation processes. These 
include emulsions, liposomes, dendrimers, DNA 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 19 Pegylated protein 
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Multiangle Light Scattering 
from Separated Samples 
(MALS with SEC or FFF), 
Fig. 20 Protein-detergent 
complex 



time or volume 


fragments, and protein complexes with Au and Ag 
particles, to name but a few. As they become more 
important, for example, as carriers for drug delivery, 
the need to separate and measure the complex con¬ 
stituents will become of even greater importance. FFF 
separations techniques remain in a state of significant 
development in which their separation mechanisms 
are continuously being refined and improved. Analyt¬ 
ical software for the processing of data collected from 
scattered light measurements of fractionated samples 


continues to be enhanced with added attention to 
precision and reducing the effects of experimental 
errors. 

Concomitant with the development of this chapter, 
an important new book by Podzimek (201 1) has been 
published that touches on many of the same topics, 
though with different emphasis and more focused on 
synthetic polymers. It includes a good discussion of 
topics rarely discussed in the biomolecular context 
such as viscometry and polymer branching. 
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Nevertheless, it is an excellent source of more detailed 
information about the two separation mechanisms 
(SEC and A4F) discussed here. It presents an excellent 
background of often used column calibration tech¬ 
niques and their shortcomings. 

Acknowledgement Many of the results of this section are 
based on samples prepared and analyzed by Dr. Michele Chen 
using the Wyatt Technology ASTRA® software. 
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Synonyms 

Fourier transform NMR spectroscopy 


Definition 

In multidimensional nuclear magnetic resonance 
(NMR) spectroscopy nuclear spins, polarized in a 
static magnetic field, are brought in a non-equilibrium 
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state during a preparation period and undergo one or 
more cycles of frequency labeling and mixing before 
they are detected. Signals in most multidimensional 
NMR spectra have coordinates that are the chemical 
shifts of nuclear spins which have been correlated 
during the different mixing periods, either by 
through-bond or by through-space magnetic interac¬ 
tions between them. Multidimensional NMR makes 
it possible to determine the chemical and spatial 
structures of molecules. 

Introduction: From 1D to 2D Fourier 
Transform NMR Spectroscopy 

About 50 years ago, transitions between nuclear spin 
states were monitored by slowly sweeping the 
magnetic field B through the respective resonance 
conditions of the spins, while keeping the frequency 
of the photons used for irradiation constant. This 
procedure was very time-consuming as the signals 
were brought into resonance and detected one by 
one. The introduction of pulse Fourier Transform 
(FT) NMR by Richard Ernst in the 1960s (Ernst and 
Anderson 1966) was a decisive moment in the history 
of NMR spectroscopy: The NMR sample is polarized 
in a strong magnetic field and exposed to 
a short (microseconds) radiofrequency (RE) pulse. 
The nuclear spin magnetization’s response to this 
RE pulse (the free-induction decay or FID) is subse¬ 
quently measured as an induced current in a receiver 
coil (Fig. 1). This FID has been aptly likened to the 
response of a bronze bell to the blow of a hammer 
(Derome 1987): The sound produced, while fading 
away during the following seconds, can be recorded 
by a microphone, and the resulting damped signals 
can be analyzed spectrally (Fourier transformed) to 
reveal the various frequencies present, thus charac¬ 
terizing the bell (the NMR sample). The hammer 
blow (RF pulse) marks the start of the acquisition 
period. FT-NMR allows the rapid repetition of the 
NMR experiment and therewith affords a significant 
increase in the signal-to-noise ratio by coaddition of 
the FIDs. 

ID pulse-acquire NMR spectroscopy is especially 
useful in the chemical characterization of molecules 
from their typical chemical shift patterns: The 
spectrum provides a unique molecular “fingerprint.” 
To obtain information on the relative topology or 


spatial arrangement of the nuclei, it was necessary to 
measure the FID upon irradiation with an additional 
RF pulse (double resonance). This would reveal which 
resonances are connected by some pairwise magnetic 
interaction. Two important interactions exist between 
nuclear spins, and these are either transmitted through 
the bonding electrons (scalar or J coupling) or 
directly through space (dipolar or D coupling). Double 
resonance could thereby reveal which nuclei are part of 
the same covalent structure, or are in close spatial 
proximity, respectively. Unfortunately, the repetition 
of ID FT experiments after or during irradiation of 
individual spin transitions is often an inefficient use 
of time. 

Pondering the possible effects of two RF pulses on 
the magnetization due to nuclear spins under their 
mutual spin-spin interaction, Jean Jeener realized that 
the NMR signals recorded during the acquisition 
period can also be recorded as a function of a second, 
independent time variable. At a summer school in 
1971, Jeener proposed that ID NMR signals could be 
recorded directly during an acquisition or detection 
period (t 2 ) following two 90° RF pulses and that the 
time between these pulses (ti) could be incremented 
systematically (Fig. 2). In this two-dimensional (2D) 
experiment, the precession frequencies in the indirect 
or evolution time-domain are also encoded in the 
FIDs and can be resolved by 2D Fourier transforma¬ 
tion. Importantly, the second RF pulse serves to 
transfer or mix some of the magnetization from one 
spin to another by their mutual coupling, indicated by 
a transfer matrix R in the bottom panel of Fig. 2. A two- 
dimensional FT then correlates the precession frequen¬ 
cies during the indirect evolution period with those 
during acquisition. Jeener’s insightful suggestion led 
to the conception of the first two-dimensional NMR 
experiment by the Ernst group (Aue et al. 1976a) and, 
eventually, to a whole toolbox of multidimensional 
NMR experiments. 

Because of the centrality of multidimensional ET 
NMR in contemporary spectroscopy, this procedure 
will now be illustrated graphically in some detail. In 
this example, a 2D NMR dataset is recorded by a 
basic two-pulse sequence consisting of the general 
structure described above: preparation-evolution- 
mixing-detection. In this example, nuclear spins of 
one kind are excited by the preparation step and their 
chemical shifts encoded during the (first) evolution 
period. The basic experiment is repeated several 
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FT CW 


Multidimensional NMR Spectroscopy, Fig. 1 Schematic 
representation of Fourier transform NMR spectroscopy by the 
example of the 60-MHz proton resonance of 7-ethoxy-4-methyl- 
coumarin. An excitation pulse, represented mathematically by 
the superoperator P in the bottom of the graph, transforms the 
equilibrium polarization - denoted here by the quantum mechan¬ 
ical density operator Oq - into observable magnetization, ct( 0). 
During free precession under the evolution superoperator E, 


which includes chemical shift and coupling terms, the density 
operator evolves further into o'(t). Sampling of the signal (free 
induction decay, FID) is represented by the detection operator D. 
Fourier transformation of the FID produces the spectrum marked 
“FT.” For comparison, a spectrum obtained in the equivalent 
amount of measurement time by slow-passage CW irradiation is 
also shown (Further details can be found in Ernst (1992). © The 
Nobel Foundation. Image courtesy of Richard R. Ernst) 


times, while incrementing the period tj with each 
new FID, as shown in Fig. 3, and the results are 
stored separately. 

The resulting data matrix is a two-dimensional 
time-domain signal s(ti, t 2 ), with the directly observed 
FIDs as rows that contain an additional frequency 
modulation in the indirect time domain. An example 
2D time-domain signal is shown in Fig. 4a. Fourier 
transformation of each of the FIDs leads to a series of 
ID NMR spectra, where each of the peaks shows 
a modulation pattern in the indirect time dimension 
(ti) that reflects the precession frequency before 
mixing. The resulting 2D dataset s(ti, Q 2 ) - sometimes 
also referred to as a collection of ti-interferograms - is 
shown in Fig. 4b. A second Fourier transformation, this 
time in the indirect time domain, results in the desired 
2D correlation spectrum s(Q 1 , Q 2 ) (Fig. 4c). Since the 
preparation and mixing pulses generate precession of 
different nuclear magnetic moments in this example. 


there are no diagonal peaks in the resulting spectrum, 
but only transferred magnetization. 

The first practical form of 2D NMR was Correlated 
SpectroscopY (► Fluorescence Correlation Spectros¬ 
copy), where the mixing pulse transfers magnetization 
among spins that experience scalar or ► J-coupling. 
2D COSY NMR identifies all existing pairwise mag¬ 
netic interactions between the nuclei in a molecule. In 
most cases, J-couplings between nuclei that are more 
than three bonds apart are negligibly small, and there¬ 
fore 2D proton-proton COSY spectra only connect 
signals for protons attached either to the same or adja¬ 
cent heavy atoms. As an example, consider the cova¬ 
lent chemical structure of the small alkaloid molecule 
quinine, shown in Fig. 5a. Several protons are indi¬ 
cated by a number in the structure and in the 
corresponding ID NMR spectrum in Fig. 5b. Magne¬ 
tization of a spin that was not transferred by the mixing 
period would lead to signals with the same frequency 
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Multidimensional NMR Spectroscopy, Fig. 2 Schematic 
representation of the 2D experiment proposed by Jeener. The 
first pulse excites coherences that precess during tj and are 
transferred by the second pulse to different nuclear spin transi¬ 
tions where the coherences precess with a new frequency. 


Measuring the response a(ti, ^ 2 ) and Fourier transformation 
with respect to both time variables produces the 2D spectrum 
that represents the transfer matrix R (Further details can be found 
in Ernst (1992). © The Nobel Foundation. Image courtesy of 
Richard R. Ernst) 


Multidimensional NMR 
Spectroscopy, 

Fig. 3 General 2D NMR 
experiment, consisting of 
a preparation, evolution, 
mixing, and detection period. 
The duration t, is varied 
systematically from 
experiment to experiment and 
yields a 2D time-domain 
signal s(ti, 12 ) (© The Nobel 
Foundation. Image courtesy of 
Richard R. Ernst) 


I PREPA- I EVOLUTION I MIXING I DETECTION 
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on both axes (i.e., a diagonal peak), whereas transfer 
between spins would lead to a correlation or cross peak 
in the 2D spectrum (provided the spins display unequal 
precession frequencies). For example, the absence of 
appreciable scalar coupling to other protons in the 
molecule for the methoxy protons (25) results in an 
intense peak at the diagonal, whereas cross peaks are 


too faint to observe. In contrast, signals of protons 
adjacent on the aromatic ring (15)/(16) and (19)/(20) 
experience significant scalar coupling and give rise 
to strong off-diagonal correlations. The chemical 
structure (panel a) can be directly mapped to the 
spectral pattern (panel b) and vice versa. 2D COSY 
NMR spectroscopy forms the basis for the de 
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Multidimensional NMR Spectroscopy, Fig. 4 (a) 2D time- 
domain signal s(ti, 12 ) consisting of four nuclear spin precession 
frequencies present during tj, coupled with four different pre¬ 
cession frequencies during t 2 . (b) 2D t|-interferograms s(t|, Q. 2 ) 


produced by FT along the t 2 acquisition dimension, (c) 2D 
Fourier transform spectrum s(Q|, Q 2 ) visualizing the 

four-frequency correlations present in the signal s(t|, 12 ) 
(Image produced with Mathematic a™ (Wolfram Research)) 


novo determination of chemical molecular structure. 
2D Total Correlation SpectroscopY (TOCSY) 
(Braunschweiler and Ernst 1983) forms a very useful 
complement. In 2D TOCSY, the magnetization is 
exchanged within a network of J-coupled spins, and 
correlations between nuclei that are not scalar 
coupled can still he observed if they share coupling 
to a common partner. This allows structural fragments 
to be assembled. 

In the case of ► nuclear overhauser effect (NOE) 
spectroscopy or NOESY (Macura and Ernst 1980), 
cross peaks connect resonances of protons that are 
less than about 0.5 nm apart. In Fig. 5d, correlations 
between the aromatic ring protons (15)/(16) and 
(19)/(20) are also observed because these spin pairs 
are located within a short, hxed distance. However, the 
schematic chemical structure in Fig. 5a does not 
represent the actual conhguration in solution well: 
For example, cross peaks from (22) are observed 


to (7), (12), and (25), which appear remote in Fig. 5a. 
The 3D crystal structure of quinine, shown in Fig. 5c, 
agrees very well with the short distances derived from 
NOESY spectra, indicating that the crystal and 
solution structures of quinine are very similar. The 
many pair correlations from 2D NOESY spectra can 
be input to calculation algorithms for the de novo 
determination of spatial molecular structure. The com¬ 
bined use of 2D COSY and NOESY spectroscopy 
has led to the first successful assignment of proton 
resonances for a small protein and to the determination 
of its 3D structure in solution by the group of Kurt 
Wiithrich (see ► Protein NMR - Introduction). 

Heteronuclear NMR 

In the late 1970s, a further breakthrough was forged for 
the sensitive detection of low-y nuclei-like and *^N 
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Multidimensional NMR 
Spectroscopy, Fig. 5 (a) 

Chemical structure of quinine 

(b) 2D *H-'H cosy spectrum 
of quinine in methanol. 

(c) Ball-and-stick model of the 
quinine molecule, as found in 
the crystalline form by X-ray 
diffraction (Data from 

B. Pniewska and A. Suszko- 
Purzycka Acta 
Ciystallographica Section C 
45: 638-642). (d) 2D 'H-‘H 
NOESY spectrum of quinine 
in methanol (Images of 
quinine (C) Ben Mills, 
Wikimedia Commons. NMR 
Spectra courtesy of Pieter van 
der Meulen, University of 
Groningen) 


25 

|3 20/22 



F2 (ppm) 
2D NOESY 


by utilization of the polarization of the attached 
protons (► INEPT). Combining proton excitation and 
detection with 2D data acquisition increased the 
achievable sensitivity by a factor (Yh/Yx)^^^, (equal to 
31.5 and 306 for X = and *^N, respectively), 
relative to direct detection ID spectroscopy 
(neglecting relaxation effects). 2D Heteronuclear 
Single or Multiple Quantum Coherence (HSQC or 
HMQC) spectroscopy became especially popular in 
biomolecular NMR after techniques had been 
developed to enrich proteins and other biomolecules 
in the spin-i4 isotopes ^^C and *^N. Equally important 
to the gain in sensitivity, heteronuclear 2D NMR 
greatly alleviated the overlap problem in ID *H 
NMR. Like in homonuclear J-correlation spectros¬ 
copies (such as ► COSY), 2D resonances in a HSQC 
spectrum have coordinates that are the frequencies of 
the J-coupling partners, in this case the chemical 
shifts of the proton and of the attached heteronucleus 


('^C or '^N). Only when both chemical shifts are the 
same as those of another spin pair, 2D resonances 
will still show overlap. 


2D + 2D = 3D 

Conceptually, 3D-NMR spectroscopy is a straightfor¬ 
ward extension of the 2D acquisition strategy and gener¬ 
ally combines two existing 2D-NMR experiments 
(Griesinger et al. 1989), as shown in Eig. 6a. Eor example, 
a 3D experiment may start with a first NOESY scheme, 
but the detection pulse is replaced by a further HSQC 
experiment. The 3D experiment thus takes the form prep- 
aration-evolution( tj)-mixing(Rj)-evolution( t 2 )-mixing 
(R 2 )-detection(ts). After 3D FT, cross signals are dis¬ 
persed in a cube. 3D spectroscopy resolves spectral over¬ 
lap, but at the expense of longer recording times. 
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Multidimensional NMR Spectroscopy, Fig. 6 (a) Extension 
of the 2D NMR concept of Fig. 2 to include a third frequency¬ 
encoding period. In this experiment, three nuclear spin 
transitions are correlated through two pair interactions, with 
their respective transfer matrices Ri and R 2 , respectively 


(© The Nobel Foundation. Image courtesy of Richard R. 
Ernst), (b) Cross-eyed stereo representation of the 3D-FT 
HNCO spectrum for the 125-residue protein photoactive yellow 
protein from H. halophila (Image produced with Mathematica^M 
(Wolfram Research)) 


Triple-Resonance NMR Spectroscopy 

The spectacular development of biomolecular solution 
state NMR spectroscopy in the 1990s was propelled by 
the group of Ad Bax, who extended the idea of 
heteronuclear 2D NMR to 3D ► triple-resonance 
NMR and applied it to the ► resonance assignment of 
proteins uniformly enriched in both and *^N 
(Kay et al. 1989; Ikura et al. 1990; Bax and Grzesiek 
1993). In an illustrative example, 3D HNCO spectros¬ 
copy, triples of ^H, *^N, and carbonyl nuclear 
spins, are made visible in a 3D frequency space, as 
shown in Fig. 6b. Each resonance now has coordinates 


that are the chemical shifts of the three nuclei in such 
a triple. A triple is detected in HNCO whenever these 
three nuclei are chemically linked and hence 
J-coupled, like in the peptide planes between amino 
acids in a protein: 'H bound to *^N and *^N bound to 
carbonyl '^C. Needless to say that overlap is rare in 
such 3D spectra (see Fig. 6b). 

A few other versions of triple-resonance NMR 
spectroscopy were proposed that together allowed for 
the complete sequential assignment of all backbone 
resonances of calmodulin, a 16.7-kD protein, in 
a very elegant, robust, and straightforward fashion. 
A host of 3D, and even 4D NMR pulse sequences. 
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has been developed in various laboratories over 
the years, correlating the chemical shifts of various 
J-coupled nuclear spins. Their names usually reflect 
the types of nuclei that are correlated (like in HNCO), 
allowing their information content and possible 
application to be guessed from the name alone. 
These experiments are further discussed in the section 
on triple-resonance NMR. Although homonuclear 3D 
NMR is not employed much, the impact of 3D triple 
resonance NMR spectroscopy has been enormous. 

Some Limitations and Extensions 

There are several reasons why adding new dimensions 
to existing low-dimensional NMR techniques cannot go 
on indehnitely. Most importantly, the pulse sequences 
become longer as the time between the first pulse and 
the start of the acquisition is used for yet another evo¬ 
lution period followed by yet another mixing period. 
Relaxation processes will weaken the signals, and many 
of the expected resonances will be lost in the noise of the 
multidimensional spectrum. Clever adjustments in the 
pulse sequences (e.g., selecting efficient coherence 
transfer pathways between slowly relaxing coherences, 
see ► cross-correlated relaxation and ► TROSY), mod¬ 
ifications of the sample (e.g., uniform or selective deu- 
teration of the protein to mitigate relaxation by dipolar 
relaxation with protons, for example, using the ► SAIL 
(stereo array isotope labeling) methodology), or invest¬ 
ments in the hardware (the use of a ► cryogenic probe or 
a stronger polarizing magnetic field) will all help. 

Another problem is associated with the time it takes 
to record multidimensional spectra. The classical 
recording strategy requires that each additional evolu¬ 
tion domain must be sampled in a certain minimal 
number of time steps, dictated by the required spectral 
resolution. Assume, for the sake of argument, that 
60 time steps are needed in each indirect evolution 
domain to obtain enough spectral resolution. Assume 
further that each evolution domain has to be sampled 
twice (to discriminate positive and negative frequen¬ 
cies with respect to the relevant carrier frequency) and 
finally that one pass through the pulse sequence takes 
Is, including the acquisition period and the recovery 
delay between scans. Then, it is straightforward to 
calculate the minimal time it takes to record a ID 
experiment (Is), a 2D experiment (2*60*Is = 2 min), 
a 3D experiment (2*2*60*60*Is = 4 h), a 4D 


experiment (2*2*2*60*60*60*Is = 20 days), and 
finally a 5D experiment (more than 6 years!). Not 
surprisingly, already in 4D spectroscopy, one tends to 
sacrifice resolution in one or more of the indirect 
domains to avoid conflicts with other users of the 
spectrometer. Several strategies have been proposed 
to solve this problem. The Filter Diagonalization 
Method (FDM) has been shown to produce well- 
resolved high-resolution 2D, 3D, and 4D spectra, 
even when very few increments are available in the 
indirect time domains. FDM data processing in the 
presence of noise remains challenging, especially in 
high-dimensional applications. Alternatively, one 
could use nonuniform or ► sparse sampling, where 
one spreads the time points more efficiently over the 
available evolution time without compromising the 
resolution. This generally means that algorithms other 
than Fourier transformation are more suitable to pro¬ 
cess the data, such as ► maximum entropy reconstruc¬ 
tion. In another innovative approach, a number of 
projections of the high-dimensional spectrum are 
recorded by simultaneously incrementing the evolu¬ 
tion times in two or more indirect domains. This gen¬ 
erates high-resolution datasets of reduced 
dimensionality in which the spectral frequencies are 
linear combinations of the offsets of the resonances in 
the orthogonal frequency domains. Optionally, the 
high-dimensional target spectrum, as well as any ID, 
2D, or 3D projection or cross section of it, can be 
obtained by the use of projection-reconstruction 
algorithms. 

Other Types of Multidimensional NMR 
Spectroscopy 

In most multidimensional NMR experiments, all 
domains are frequency domains, and the multi¬ 
dimensional coordinates of peaks in such spectra are 
usually the chemical shifts of the nuclei that have been 
correlated in that particular experiment via mutual mag¬ 
netic interactions. 2D EXchange SpectroscopY (EXSY) 
(Jeener et al. 1979) is special because mixing occurs via 
slow chemical reactions and exchange of nuclei between 
nonequivalent electronic surroundings. It allows the 
characterization of the kinetics of processes like intra¬ 
molecular rotations, bond shifts, valence isomerization, 
chemical exchange, intermolecular association, and 
chemical reactions, without disturbing the equilibrium. 
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A few exceptions are worth mentioning here, as 
they are of particular practical interest. J-resolved 
spectroscopy is one of the earliest 2D NMR experi¬ 
ments. In a 2D J-resolved spectrum (Aue et al. 1976b), 
chemical shifts and homonuclear J-couplings are 
visible in one dimension, and only the J-couplings are 
visible in the indirect domain. Clever manipulation of 
the dataset results in a spectrum that can be regarded 
as a regular ID spectrum in which all multiplets due 
to homonuclear couplings have been rotated 90° 
along the indirect frequency axis, thus avoiding lots 
of overlap and allowing a clearer view of the individual 
multiplets. It has regained much of its popularity due to 
its applicability in »• metabolomics. 

Diffusion-ordered spectroscopy (DOSY) (Morris 
and Johnson 1992) separates the resonances in the ID 
spectrum of a complex mixture in the indirect domain 
by the different rates of translational diffusion of 
the molecules in the mixture, thus avoiding the need 
for a physical separation like HPLC. 

There is no space here to review the many 
additional forms of (pseudo)multidimensional NMR 
experiments that have appeared in the literature, but 
this section would not be complete without briefly 
mentioning one well-known and important early 
example of 2D NMR spectroscopy: the inversion- 
recovery experiment to measure nuclear spin-lattice 
relaxation rates for the study of intramolecular 
dynamics. 

The subject of multidimensional NMR spectros¬ 
copy and its application to studies of molecular 
structure and dynamics has been covered in many 
excellent books. Here is a small subset (Ernst et al. 
1990; Ernst 1997; Hoch and Stem 1996; Keeler 2010; 
Levitt 2008; Wiithrich 1986) that may be used as 
starting points for the interested reader to leam more 
about the theory and details. 

Summary 

Multidimensional NMR spectroscopy correlates the 
nuclear precession frequencies (chemical shifts) of 
individual nuclei, which are revealed by Eourier trans¬ 
formation. Multidimensional NMR experiments make 
use of pairwise correlations between the atomic nuclei, 
such as electron-mediated (through-bond) and direct 
(through space) magnetic coupling, so that the reso¬ 
nances of atoms that are adjacent in chemical stmcture 


or are spatially proximate can be identified and 
assigned. Popular NMR techniques, known under 
acronyms such as COSY, TOCSY, NOESY, and triple 
resonance NMR spectroscopy, generate restraints 
from which chemical and molecular structures can 
be inferred. Multidimensional NMR spectroscopy is 
utilized for the de novo determination of molecular 
structure and dynamics in solution of a wide range of 
molecules, including proteins and nucleic acids. 

Cross-References 

► Cross-Correlated Relaxation 

► Cryogenic NMR Probes 

► Fluorescence Correlation Spectroscopy 

► INEPT 

► J Coupling 

► Maximum Entropy Reconstruction 

► Metabolomics 

► Nuclear Overhauser Effect 

► Protein NMR - Introduction 

► Protein NMR Resonance Assignment 

► Sparse Sampling in NMR 

► Stereo-Array Isotope Labeling (SAIL) Method 

► Total Correlation Spectroscopy (TOCSY) in NMR 
Protein Structure Determination 

► Triple Resonance NMR 

► TROSY 
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Synonyms 

Protein dynamics 


Introduction 

Interference effects between two different spin- 
relaxation mechanisms that are quantified by measure¬ 
ments of the so-called “cross-correlated” relaxation 
have generated a lot of interest in the past several 
years (Frueh 2002). These studies provide information 
on protein dynamics at atomic resolution, including 
detailed insight into local motional anisotropy on the 
fast, ps-ns timescale complementing data obtained 
from traditional “auto-correlated” Ri, R 2 and NOE 
measurements (Cavanagh et al. 2007). In addition, 
valuable structural information that is very difficult, 
often impossible, to obtain from conventional mea¬ 
surements can be extracted from cross-correlated 
relaxation effects. These include backbone dihedral 
angles in proteins and sugar puckers in nucleic acids 
which can be incorporated as additional constraints 
into standard structure calculation and refinement 
protocols increasing the precision and accuracy of 
biomolecular structures determined in solution 
(Schwalbe et al. 2001). Additionally, cross-correlated 
relaxation may be utilized to determine the chemical 
shift tensors for several nuclei in solution (Schwalbe 
et al. 2001). The most significant contribution, how¬ 
ever, of cross-correlated-relaxation-based experiments 
on the field of biomolecular NMR has been the devel¬ 
opment of the so-called transverse relaxation opti¬ 
mized relaxation spectroscopy (TROSY) and related 
techniques. These approaches rely on selective obser¬ 
vation of spectral components that have significantly 
increased lifetimes due to (1) the interference between 
the anisotropic chemical shifts of ^^N or aromatic 
nuclei and the corresponding dipolar interactions with 
attached protons or (2) intra-methyl *H-^H dipolar 
interactions in highly deuterated proteins that are 
selectively ^^C,*H-labeled at methyl positions of He, 
Leu, and Val residues. TROSY-based experiments 
have allowed NMR studies of very large proteins far 
beyond the size range of traditional NMR techniques 
(Pervushin 2000; Tugarinov and Kay 2005). 

The correlation time most relevant to NMR relaxa¬ 
tion in solution, including cross-correlated relaxation, 
is the molecular rotational correlation time, tc, with 
values in the tens of nanoseconds for biomolecules of 
interest for solution NMR. However, it has been dem¬ 
onstrated that motions occurring on a timescale slower 
than Tc can lead to effects similar to cross-correlated 
relaxation in multiple-quantum (MQ) coherences. 
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These motional processes representing chemical 
exchange on the ps-ms scale are only capable of mod¬ 
ulating the so-called “isotropic” spin-interactions (e.g., 
J-couplings or isotropic chemical shifts) whose aniso¬ 
tropic components have already been “motionally 
averaged” by dynamics occurring on a timescale of 
the order of Tc or faster (Bruschweiler and Ernst 
1992). Abnormally large differences in the relaxation 
rates of zero-quantum (ZQ) and double-quantum (DQ) 
coherences generated between two nuclei were shown 
to result from the slow, correlated modulation of iso¬ 
tropic chemical shifts (CSM - correlated shift modu¬ 
lation) (Kloiber and Konrat 2000; Frueh et al. 2001). 
That these effects did not result from relaxation in the 
conventional sense, due to processes faster than tc, 
was conclusively proven by the fact that they could 
be altered by the application of CPMG-type sequences, 
where n pulses were repeated on a millisecond 
timescale (Dittmer and Bodenhausen 2004; Orekhov 
et al. 2004). 

Multiple effects, including the formation or disrup¬ 
tion of hydrogen bonds, changes in local geometry by 
alteration of dihedral angles, or changes in local chem¬ 
ical environment by repositioning of neighboring aro¬ 
matic rings may lead to CSM effects. A complicating 
factor is that the modulation, i.e., changes in chemical 
shift, is not uniform for a given set of nuclei affected by 
the same motion. Indeed, these may be of either sign 
and, in some cases, oppose the effects of the other 
contributions to the measured relaxation rates (see 
(18) below). The presence of anisotropic motion may 
affect the chemical environment of some nuclei leav¬ 
ing others unaffected and blind to the particular 
motional mode. Thus, it is necessary to make measure¬ 
ments of multiple relaxation rates involving several 
nuclei of the protein backbone and side chains to 
characterize the motion more accurately. It has also 
been shown that the measurement of CSM effects in 
MQ relaxation can be more sensitive than conventional 
CPMG-based measurement of single-quantum relaxa¬ 
tion (R 2 ) in certain motional regimes (Wang and 
Palmer 2002). Further, cross-correlated relaxation 
measurements directly probe correlated dynamics 
over several bonds on slow timescales (Majumdar 
and Ghose 2004). This information can only be 
inferred from, but not directly measured, using 
CPMG-based R 2 - or Rip-type experiments (Cavanagh 
et al. 2007). A clear understanding of correlated 
motion on all timescales is crucial to properly interpret 


changes in dynamics terms of the energetics character¬ 
izing protein interactions and stability. Here, we 
review the theoretical framework for the design of 
experimental strategies to detect CSM effects in 
solution. 


Theoretical Principles 


Consider a system consisting of two weakly coupled 
spins (/, S) undergoing exchange between multiple con¬ 
formations, each with a different chemical environment. 
If the exchange is treated as a stochastic process, the 
time-dependent isotropic chemical shifts of the two spins 
may be represented by the stochastic variables cr/(r) and 
(jg (f). In the interaction frame of the static Hamiltonian 
(neglecting scalar couplings), the time-evolution of the 
density operator is given by (in units of h) 


dt 


H{t),p{t) 


( 1 ) 


where the stochastic Hamiltonian H{t) is 


H{t) = Boexp(///o0{y/f^/(04 + 
X exp(—i/Zo?) 


( 2 ) 


Hq contains the secular parts of the static Hamiltonian 
(the Zeeman interaction and scalar couplings where 
relevant) and Bq is the static magnetic field. Note that 

H{t) = H{t) since the stochastic chemical shift 
Hamiltonian is secular. Nevertheless, we continue to 
use the notation representing the interaction frame 
Hamiltonian for consistency with the descriptions of 
spin relaxation due to other interactions, e.g., dipolar 
couplings (Ernst et al. 1987; Cavanagh et al. 2007). 
Equation 2 may be solved using a perturbation expan¬ 
sion, which truncated to second order yields 


dp{t) 

dt 


+00 


H{t), \H{t-x),p{t)-P^^ 
(p(0 - Pe^ 


dt 


( 3 ) 
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is the density operator at thermal equilibrium and the 
overbar signifies an ensemble average. When the den¬ 
sity operator is expressed in a complete set of orthonor¬ 
mal operators {B,, fij} in a Cartesian basis, 
the relaxation superoperator, F, can be expressed 
in matrix form with elements (r„) given by 


A ^ *• R 

Multiple-Quantum NMR Spectroscopy: Detection of Slow 
Dynamics, Fig. 1 Schematic representation of an exchange 
process between two discrete sites 


r r-- I Bl^_^ Trace{B*[LAh.B;\\] 

" '■" 2 Trace{B*B,] 

+00 

/ ^2 _^ 

a,{t)as{t-T)dx + ^ ^ 

^ K,L=I,S 

-OO K^L 

r -V +00 

Trace{B*[h,[S,,B,]]} f ^^ 
Trace {B;B,\ J 

— OO 

(4) 


coherences. The magnitude of = r,j^,may be esti¬ 
mated from the difference in the relaxation rates of ZQ 
{Fzq) and DQ (Fqq) coherences as 


F 


CSM 


= F[,, = F% = 


Fdq — r. 


ZQ 


( 6 ) 


To evaluate (5) further in terms of experimental 
measurables, we start with the correlation function 
Cjs{t) given by 


The first term corresponds to the auto-correlation term 
(r"j) and represents the Rex term encountered in 
expressions of R 2 in systems with chemical exchange. 
The second term corresponds to a cross¬ 

correlation term that is the major focus of this review. 

is potentially nonzero in the specific case when 
Bs = 21aSh and B, = 2IcSd with a,h,c,d = x,y; 
a ^ C] b ^ d. Thus, a nonzero term leads to the 
partial conversion 2IaSh 2FSd (e.g., 2I^-Sx —> 2IySy 
or 21xSy —> 2IySx) during a fixed relaxation delay, and 
its magnitude may be measured by quantifying the 
buildup of 2IcSd from 2IcSh- 

Alternatively, in the spherical basis (or irreducible 
tensor basis) {Bf ,Bf}, the effects of correlated CSM 
contribute only to the diagonal terms of the Redfield 
matrix. Thus, 



Ki^L 


Trace{Bf*[F[S^^,Bf]\} 

Trace{Bf*Bf} 


4-00 



—00 


(5) 

The positive sign holds when Bf = /+S+//_S_ (i.e., 
double-quantum, DQ, coherences) and the negative 
sign holds when Bf = /+S_//_S+(i.e., zero-quantum, 
ZQ, coherences). Thus, the presence of CSM leads to 
a difference in the relaxation rates of ZQ and DQ 


4-00 

Cis{t)= j Oi{t)as{t-x) dx (7) 

—00 


For exchange between two states A (the “ground” or 
major state) and B (the “excited” or minor state), 
(Fig. 1), (7) reduces to a simple form given by: 


C/5(t) = 2Pa{ 1- PA)A(TiA(JsXex (8) 

where Act/ = ctj{A) — (Xs{B) and Acts = (Xs{B) 

represent the differences in isotropic chemical shifts 
for spins I and S, respectively, between the two states; 
kA^B and are the forward and reverse rate con¬ 
stants; Xex = ^/kA^B + ^/kA^B IS the corresponding 
exchange time and pA and Pb — ^ — Pa are the relative 
populations of the two states. Thus, combining (4)-(8), 
we obtain a simple expression for Fqsm in a two-spin 
system undergoing exchange between two discrete 
states 


Fcsm = 2yjy^A(JiAasBlpA{ \ - Pa)t: ex 
= 2AcojA(aspA{l - Pa)t: ex 


(9) 


with A(Ojs = yj gA(j{sBo. 

Note that the derivation of (4)-(9) is based on the 
general perturbation framework of Redfield theory 
and is thus valid for Xc << << x^, i.e., in the fast 

exchange regime, where x„ = 1/x/AcojAcos is an 
effective chemical shift timescale. In practice, this 
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puts Tct in the ps-ms regime (Majumdar and Ghose 
2004). Expressions for Fqsm may also be derived in the 
slow exchange regime when In particular, 

when the populations of the two states are greatly 
skewed so that 1, the minor state does not make 
a measurable contribution to the observed signal, the 
value of Fcsm may be estimated from the difference 
between the free-precession rates of the DQ and ZQ 
coherences in the two states and is given by (Wang and 
Palmer 2002). 


_ 2 pa{\ - PA)^m^(Os-Cex 

^ CSM . . r-s \ 0 ^ ^ 

(1 -f - 4Affl2Acu2^4^ 

( 10 ) 

Equation 10 reduces to (9) when Acu/T^ << 1 and 
Am^tex « 1, i-e., the fast exchange condition is sat- 
ished for both I and S nuclei. This is a more stringent 
requirement over << 1 / ^/AcojAcos, described 
above, and is not strictly necessary for fast exchange 
conditions to hold for MQ coherences provided that 
Affl/ and Am^ are both small. 

As is evident from (9) for fast exchange on the 
chemical shift timescale (i.e., ^ t^) the rate Fcsm 

depends linearly on the exchange time and on each of 
Affl/ or Affl 5 . This linear dependence on also occurs 
in the exchange contributions to single-quantum relax¬ 
ation, i.e., to Rex terms for fast exchange under free 
precession. However, the excellent sensitivity of MQ 
relaxation (compared to SQ relaxation) to exchange 
arises from the fact that DQ and ZQ coherences 
precess at rates given by the sum (cu/ + ffl^) and 
difference(ffl/ — ffl^), respectively, of their resonance 
frequencies in each state. This leads to exchange con¬ 
tributions to ZQjDQ auto-relaxation rates proportional 
to (Affl ± Affl5)2. Depending on the signs of Affl/ and 
Affl 5 , an enhanced sensitivity to chemical exchange is 
obtained for either the ZQ or DQ relaxation even for 
smaller individual values of Acoj or Acos. Therefore, as 
alluded to by Wang et al. (Wang and Palmer 2002), use 
of MQ measurements may be more sensitive to the 
effects of slow dynamics in the fast exchange regime. 
However, in the slow exchange regime, the magnitude 
of FCSM has a strong dependence on the relative sizes 
of Affl/ and Am^. When Affl/ and Affl^ are very different 
from each other, Fqsm can be very small, though Rgx is 
not. In such cases, probing slow dynamics using SQ 
experiments (e.g., R 2 ) may be more useful to charac¬ 
terize the exchange process. Another characteristic of 


MQ coherences is that, at least theoretically, 
depending on the magnitudes and signs of Aco/ and 
Acos, exchange for ZQ and DQ coherences may lie in 
different regimes, e.g., one in the slow/intermediate 
and the other in the fast exchange regime. 


Strategies to Measure/"cs/w 

Based on the discussion above, there are two strategies 
of obtaining Fcsm- either by detecting the buildup of 
2Ii.Sd from 2IaSh (Scheme I) or separately detecting the 
decays rates of ZQ and DQ coherences (Scheme II). 
Each strategy provides distinct advantages and have 
both been utilized to characterize CSM effects. 


Scheme I 

For an initial density operator p{0) = 2IaSh, partial 
conversion to 2 IcSd (where a,h,c,d = x,y; a ^ c; 
h ^ d) occurs due to CSM effects. In order to measure 
rCSM, two experiments have to be performed, one that 
monitors 2 IcSh{T) and another that monitors 2 IcSd{T) 
as a function of a fixed relaxation delay, T (Fig. 2). In 
the absence of any other relaxation interference 
effects, such as coherent chemical shift and scalar 
coupling evolution, the detected signal as a function 
of T is given by (Kloiber and Konrat 2000; Majumdar 
and Ghose 2004). 


S,{T) = (21A{T)) = cosh[FcsMT] (11a) 

82 ( 1 ) = { 2 IMT)) = smh[FcsMT] (1 lb) 


A is a constant (designed to be the same in both 
experiments), Fa is the auto-correlated relaxation 
rate. Fcsm can then be obtained directly from the 
relationship 



(mn)= 


rcsM = -tanh ' 


= — tanh 
T 


( {21MT)) \ 

\{21aSd{T))) 

{ Slim 

Wt)) 


(12b) 


Equation 12a consists of a single unknown parameter 
(Fcsm), making its inversion, represented by (12b), 
extremely stable in the presence of noise. 
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Multiple-Quantum NMR Spectroscopy: Detection of Slow 
Dynamics, Fig. 2 Pulse sequence for the measurement of 
CSM effects for MQ coherences involving successive *^0“ 
nuclei in the protein hackbone. This sequence is an example 
of Scheme I. High power njl and 7 t pulses are shown as narrow 
and wide dark lines, respectively, and 7 r /4 pulses by dashed 
lines. Shaped pulses are shown as sine-bells (thin\ nil, broad: 
7 i), with on-resonance pulses in black and off-resonance pulses 
with hatched sine-hells. All pulses were applied along the x- 
axis unless explicitly specified. High power pulses were 
applied with pulse-widths of 5.5 ps (’H), 13.5 ps f'^C), and 
38 ps ('^N). ^H WALTZ16 decoupling during the relaxation 
delay T was performed using a 1.0 kHz rf field. RF carriers 
were placed at 4.75 ppm ('H), 55 ppm (*'^C“), 175 ppm 
118 ppm (*^N), and 3 ppm (^H). Shaped pulses used were: a: 


500 ps for J(NC“) or J(NC') evolution; /5: 1.0 ms for 
selective C' refocusing; y*": 500 ps pulses, phase modu¬ 
lated at —120 and -tl20 ppm, for off-resonance inversion at C'. 
Delays t = 2.75 ms, Incol ~ 17.0 ms and = 26.0 ms. 

= ly.-yl; 4>2 = y\ ^3 = ly,y>-y,-y); <l>4= {4(y),4(-y)i; 
(/)g = y, (j)i, (^8 = —y and (/)r = |x,x,—x,—x,—x,—x,x,x). For 
selection of cross-correlation peaks, (^2 and 03 were simulta¬ 
neously incremented by 90 . The duration and strengths of the 
gradients were gl, g2 = (0.5 ms, 15 G/cm); g3 = (1.0 ms, 16 G/ 
cm); g4 = (0.1 ms, 15.9 G/cm). Quadrature detection along m, 
was achieved by inverting the phases 05 , 07 , and 08 , and the 
sign of the gradient g3 (k = ± 1) on alternate FIDs and 
processing using the Ranee—Kay technique (Modified from 
Majumdar and Ghose 2004 with permission) 


However, for certain side-chain relaxation measure¬ 
ments, scalar couplings active during the relaxation 
delay T lead to far more complicated functions with 
trigonometric pre-multipliers (Del Rio et al. 2006), mak¬ 
ing it far more unstable to inversion. In such cases, it is 
more convenient to perform two experiments to sepa¬ 
rately detect ZQ and DQ coherences, fit the two curves 
separately, and obtain the value of Fcsm from the differ¬ 
ence in their individual decay rates as described below. 

Scheme II 

The real part of the ZQ (Pzq) and DQ (Pqq) compo¬ 
nents of the density operator, also referred to as the 


x-component of ZQ^ or DQ^ terms (corresponding 
equations may be written for the imaginary parts, i.e., 
the DQy and ZQy terms) in an IS spin system, may be 
expressed in the following way (Ernst et al. 1987): 


PZQ ~ 2 + '2.IySy\ 

= i[/+5_+/_S+] 

(13a) 

PDQ ~ 2 ~ '2.lySy\ 

= ^M++/_5_] 

(13b) 


where p^Q contains the ±2 quantum terms. The evo¬ 
lution of P 2 Q and puQ over a relaxation delay T (in the 
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laboratory frame, neglecting chemical shift evolution 
and including only relevant terms of the density oper¬ 
ator) is given by (Del Rio et al. 2006) 

m 

Pzq{T) = ]Jcos( 7 t 4 e 7 ’)e~^^e^Pzg( 0 ) -h ... (14a) 

/-I 

m 

Pdq{T) ^X{cos{nf Poq{Q) + ... (14b) 

/=! 

rzQ (r dq) is (as above) the relaxation rate of the ZQ 
(DQ) term and the corresponding effective scalar cou¬ 
pling constant with the /* passive nucleus (a ) is 
represented by 4 q (4ie)' have assumed that the I 
and 5 are coupled with m passive nuclei, /^pand /[j^are 
given by 


fzQ/DQ=‘'Jii,X')T^J{S,X') (15) 

where the ” sign holds for the ZQ coherence and the 
“-I-” sign forDG coherence. The V{1,X‘) and '’J{S,X') 
represent scalar couplings {a,h = 1,2,3) with a passive 
nucleus (Fig. 3). When/^p/^iQ is dominated hy large 
one-bond passive couplings with the I nucleus (a = 1, 
h = 2), (16) can be approximated as 

Pzq{T) = cos {nJzQTY’e-^^'^PzQiQ) + ... (16a) 

Poq{T) = cos i7iJDQTre-^‘>o^PoQiO) + ... (16b) 

where. Jzq/dq = T 5' 1 1*1^0 ■ Thus, in order to 

measure JzQjoQ and ^zQjDQ, two sets of experiments 
are required, with each measuring the time-evolution 
of the ZQ or the DQ coherence (Fig. 4). The signal 
intensities in the two experiments as a function of the 
relaxation delay T are given by 



Multiple-Quantum NMR Spectroscopy: Detection of Slow 
Dynamics, Fig. 3 Schematic representation of a three-spin 
system showing active and passive couplings. The two nuclei 
participating in the MQ coherence are shown in red. The '/(/, S) 
coupling (red) does not affect the MQ evolution; however, the 
passive, 1-bond (*/(/, X*)) and 2-bond (^J(S, X*)) passive cou¬ 
plings do affect MQ evolution 


Measurement of Fcs/w 


In real systems, however, several effects lead to the 
buildup of llcSd from 2 IaSh or equivalently result in 
a difference in the relaxation rates of ZQ and DQ 
coherences. This rate, Fis, consists of multiple compo¬ 
nents that are modulated both by fast (ps-ns) motion 
(processes faster than the rotational correlation time 
Tc) and indeed slow motional processes in the ps-ms 
timescale that lead to a nonzero Fqsm- T is is given by 
(Frueh 2002): 


- 9^0 Wii - ' 

',y=i,2 


33 


4 - ^^33 





.3 -3 

'ihPsh>= 


4(0) 


+ g + <^s) -J (®/ - ®s)] 

+ 2y,fsBlXaIXasPAi} - PAj-^ex 


(18) 


Szq{T) = Azq cos {nJzQTf'e-^^^ (17a) 
Sdq{T) = Aoq cos {nJoQTTe-^^^aT ( 17 b) 

where Azq and Aoq are constants. F cs^can be deter¬ 
mined from Fzq and Fqq using ( 6 ). Equations 17a,b 
can be fitted most accurately if at least one of the nulls 
in (17a,b) are sampled. This implies that T^ax > 

_ ! _ > _ ! _ 

Vdq/zq ^ 2 ( i /(/, X ')) • 


The first term in (18) (X) results from 

Q' 

the cross-correlation between the chemical shift 
anisotropy (CSA) tensors (the symmetric, rank-2 part 
of the chemical shift tensor) of the two spins ( 1 , 8 ), with 

,/,y( 0 ) = |tc ^ 3 cos^^ 9 J-i j ^ goth tensors are assumed to 
be fully asymmetric with principal components 
(j'lf {i = 1, 2, 3} and ^ the projec¬ 

tion of the component of the CSA tensor of / on 
the y* component of the CSA tensor of S is given by the 
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M 


angle 6 '^. The second term (^0) results 


from the cross-correlation between the dipolar interac¬ 
tions of each spin I and S with the same proton at 
a distance from spin I and r^/^tfrom spin S. The 
spectral density function for dipole-dipole cross¬ 


^ 3cos^fl‘-l ^ 


where 




is 


correlation Jik{(o) = § 

the angle between the IW and SH^ internuclear vec¬ 
tors. The third term (Fxr) is the result of the cross¬ 
relaxation between I and S with the corresponding 
spectral density functions given by J{(o) = | 

The contribution of this term is usually very small and 
may be neglected under most circumstances. The 
effects of cross-correlation between heteronuclear 
dipolar interactions are also small enough to be 
ignored. The first three terms of (18) result from relax¬ 
ation in the true sense (Ernst et al. 1987; Cavanagh 
et al. 2007) and depend on the overall correlation time, 
Tc (overall isotropic diffusion assumed). The fourth 
term (T csm) arises due to two-site exchange character¬ 
ized by an exchange time (as discussed above). 

In deuterated proteins, the first, third, and the fourth 
terms in (18) are unaffected by the level of deuteration 
to first order. However, each T?® in the second term 


should be replaced by 


(1 -fk) +fk{yD/yHY 


pDD 
^ k 

where /jt is the fractional deuteration for the site 
(Majumdar and Ghose 2004). It can be shown that 
this scaling is accurate when Fi^T <C I (the linear 
regime). 

It should also be noted that each of the three major 
contributions to (18) (excluding the cross-relaxation 
term) may be of any sign and potentially interfere 
with each other. However, as pointed out above, in the 
presence of slow conformational exchange in the fast 
exchange regime, Fcsm has a linear dependence on 
(which is in the ps-ms range) and may greatly outweigh 
all other contributions to Fjs and, in certain circum¬ 
stances, lead to very large absolute values of Fis. 

The first and fourth terms in (18) depend on the 
strength of the static field (Bq). The contribution of 
the second and the third terms to F/s can be extracted 
from the intercept of a plot of Fis values obtained at 
different static field strengths against (Lundstrom 
and Akke 2004). However, separation of the effects of 
chemical exchange from those due to CSA/CSA cross¬ 
correlations, simply from the Bo-dependence of Fis, is 
not possible. Therefore, it is prudent to chose the I and 


S nuclei such that the CSAs of one, preferably both 
nuclei are small, minimizing the influence of the first 
term in (18). An example of such an optimal scenario is 
when the / and S nuclei correspond to successive 
positions on the protein backbone (Majumdar and 
Ghose 2004; Lundstrom et al. 2005). 


Applications 

Several elegant applications of the use of CSM effects 
have appeared in print and several more should be 
forthcoming as these approaches permeate through 
the biological NMR community. A few examples are 
provided below. 

Lundstrom et al. (2005) have used the measurement 
of CSM effects in MQ relaxation to demonstrate the 
transient and cooperative unfolding of a helical seg¬ 
ment in the C-terminal domain of calmodulin. Massi 
et al. (2005) have used these measurements to illustrate 
the dynamic disruption of the hydrogen-bonding net¬ 
work at the N-terminus of the a-helix in ubiquitin. 
A study by Mori et al. (2010) using several proteins 
found that Fcsm values were generally positive in 
/1-sheets and negative in a-helices, indicating corre¬ 
lated (Act/ and Acr 5 in (18) have the same sign) and 
anti-correlated (Acy and Acr 5 have opposite signs) 
CSM effects, respectively. This was attributed to dif¬ 
ferences in the motional modes dictated by the local 
geometries - local unwinding in helices and in-plane 
twisting or bending motions in sheets. 

Among the most elegant applications of the use of 
MQ spectroscopy to probe slow dynamics has come 
from the Kay and coworkers who have presented 
a series of multi-pulse dispersion experiments to mea¬ 
sure CSM effects quantitatively for both ZQ and DQ 
coherences (Orekhov et al. 2004). These experiments, 
in contrast to the free-precession experiments 
described above, allow the quantitative measurement 
of values, and, in principle, the magnitudes and the 
relative signs of Act/ and Acr^. They are more informa¬ 
tive than standard SQ experiments and more sensitive 
to the dynamic processes than SQ experiments in cer¬ 
tain motional regimes as mentioned above. One 
caveat, however, is that given the more complicated 
dependence on relevant variables in expressions of the 
dispersion curves compared to standard SQ relaxation 
dispersion, data of far higher precision are required to 
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extract accurately all motional parameters, somewhat 
limiting their range of applicability. However, certain 
MQ dispersion experiments applied to side-chain 
methyl groups are expected to find applicability in 
large, highly deuterated systems where SQ dispersion 
experiments are extremely insensitive (Korzhnev et al. 
2004a). Given the completeness of the original papers 
(Korzhnev et al. 2004a, b) and limited space in the 
present context, these multi-pulse approaches are not 
reviewed here. 

Summary 

Measurements of the influence correlated modulation 
of isotropic chemical shifts on the relaxation of multi¬ 
ple-quantum coherences in proteins has recently 
emerged as an efficient probe of slow, correlated 
motion on the ps-ms timescale. Here we attempted to 
establish a theoretical basis for the description of 
these effects and for the design of experiments to 
detect them. 
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Synonyms 

Alanine scanning; Binding hotspots; Chimeric scan¬ 
ning; Homologue scanning; Interaction interfaces; 
Mutagenic scanning to define binding hotspots; 
Mutation 

Definition 

Site-directed mutagenesis, originally developed by 
Hutchinson and colleagues in the 1970s, is an invalu¬ 
able tool for identifying residues that are important 
for protein structure and function (Hutchison et al. 
1978). For example, it is a very effective approach for 
investigating the role of individual residues in the 
active sites of enzymes, as pioneered by Winter, 
Fersht, and colleagues (Winter et al. 1982). Muta¬ 
genic scanning, in which large numbers of residues 
are mutated in a systematic manner (Fig. 1) and the 
mutants tested for binding to a target, has been used 
extensively to define protein-protein or protein-ligand 
interaction interfaces and to reveal mechanisms of 
protein folding. Note that it is important to test that 
a mutation has not affected the fold of the protein 
before assuming that a mutated residue is implicated 
in binding, and that most mutagenic scanning tech¬ 
niques do not provide measurements of the free 
energy contributions of sidechains to an interaction; 
measurements of free energy require ► double mutant 
cycle analysis. 

Basic Characteristics 

Probably the most common form of mutagenic 
scanning is alanine-scanning mutagenesis, in which 
residues are mutated to alanine (Fig. la). The use of 
alanine eliminates the side chain beyond the p-carbon 
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Mutagenic Scanning to 
Define Binding Hotspots, 
Fig. 1 Examples of 
mutagenic scanning. 

(a) Alanine-scanning 
mutagenesis in which residues 
are systematically mutated to 
alanine. Residues thought 
to be important for 
maintaining function (marked 
with an asterisk) are not 
mutated. The example is for 
the first zinc finger in friend of 
GAT A 1 (FOGl FI) binding 
to GATAl. (b) Homologue- 
scanning mutagenesis. In this 
example, residues in GATAl 
N-finger (GATA-NF), which 
binds to FOG 1, were 
systematically mutated to the 
equivalent residue in the 
GATAl C-finger(GATA-CF), 
which does not bind to FOGl 
(Fox et al. 1998). (c) Chimeric 
scanning in which whole 
elements of structure are 
swapped between like 
proteins. In this example, the 
helix-loop-helix domains 
(HI L H2) from Tall/SCL 
(red) were exchanged for 
equivalent domains in five 
other family members 
(Schlaeger et al. 2004) 


a 


* * * 


★ 


* * 


FOG FI KDVFPCKDCGIWYRSERNLQAHLLYYCASRQ + + 


A- + + 

-A- - 

--A- + + 

-A- + + 

-A- + + 

-A- + 

-A- - 

-A- + + 

-A- + 

-A- + + 

-A- + 

-A- + + 

-A- - 

-A- + + 

-A- + 

-A- + + 

-A- + 

-A- - 

-A-- + + 

-A- + + 


b 


GATA-NF 


GATA-CF 


EARECVNCGATATPLWRRDRTGHYLCNACGLY 

AG- 

-L- 

--TQ- 

-V- 

-T- 

-QT- 

-T- 

-NA- 

-DP- 

-V- 

AGTQCTNCQTTTTTLWRRNASGDPVCNACGLY 


Tall/SCL 
Swap of H1LH2 
Swap of HI 
Swap of H2 
Swap of HI H2 
Swap of L 



but does not alter the main-chain conformation, nor 
does it introduce functional groups such as charge or 
hydrogen-bonding potential. Alanine is also well tol¬ 
erated in buried and exposed positions and in all types 
of secondary structure. Thus, mutation to alanine is 
a relatively benign approach that should not signih- 
cantly affect the structure of the protein being 
mutated, but will hopefully perturb binding. The 
term “alanine scanning” was introduced by Cunning¬ 
ham and Wells in a study of growth hormone binding 
to the growth hormone receptor (Cunningham and 
Wells 1989). A comparison of the effects of alanine 
mutation with a structure of the growth hormone/ 
growth hormone receptor complex revealed that 
although numerous residues make contacts between 
the two proteins, a much smaller number (2/30 resi¬ 
dues) account for the majority (75%) of the binding 


free energy (Clackson and Wells 1995). The key 
residues were found in the center of the binding site, 
referred to as a “hot spot” of binding, and are 
a common phenomenon in protein-protein interac¬ 
tions (DeLano 2002). A potential disadvantage of 
alanine scanning is that mutation to alanine may not 
sufficiently perturb an interaction, in which case multiple 
(e.g., double or triple) mutations can be made (which 
also has the advantage of more rapidly scanning through 
the sequence) and/or alternative mutations can be 
generated. 

Homologue- or chimeric-scanning mutagenesis is 
an alternative approach that can be used if the protein 
under study has one (or more) homologues that do not 
bind a partner. In theory, the similar proteins should 
have similar structures, and therefore, the protein 
under study can tolerate mutation to different residues 
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found in equivalent positions in the homologous 
proteins. For example, human placental lactogen 
(hPL), porcine growth hormone (pGH), and human 
prolactin (hPRL) are homologues of human growth 
hormone that have reduced binding affinities for the 
human liver growth hormone receptor. Chimeras 
were generated in which different regions of the 
homologues were substituted for human growth hor¬ 
mone and used to map receptor binding and mono¬ 
clonal antibody binding sites (Clackson et al. 1998). 
The chimeric approach can be carried out by making 
large numbers of simultaneous mutations (Fig. lb), or 
by systematically mutating one residue at a time 
(Fig. Ic). It has the advantage that very radical muta¬ 
tions can be made, which are sometimes required to 
sufficiently perturb an interaction, but may cause 
important residues to be missed, because they are 
not different in the homologues. A variant of this 
approach in which all possible permutations of chi¬ 
meric mutations are generated has been termed “tra¬ 
jectory scanning” (Capra et al. 2010), which is 
feasible if only a few residues vary between 
homologues. 

A related mutagenic-scanning approach is cyste¬ 
ine scanning, in which mutation to cysteine is used to 
introduce a chemical probe. In one variation of this 
approach, the mutants of membrane proteins in 
intact E. coli were treated with membrane permeable 
and impermeable maleimide derivatives that react 
with free sulfhydryl groups (Tamura et al. 2003). 
The inside/outside localization of introduced cyste¬ 
ine residues is judged from competitive binding of 
these two compounds. Effective labeling by the imper¬ 
meable compounds defines the elements of structure 
that exist on the outside of the membrane, whereas 
effective labeling by the membrane permeable com¬ 
pound marks in regions inside the cell. As the com¬ 
pounds are poorly reactive with sulfydryl groups in 
hydrophobic regions, the membrane-spanning regions 
are not labeled. Electron paramagnetic resonance (EPR) 
measurements of paramagnetic spin-labeled cysteine 
mutants can provide information about mobility and 
accessibility (Altenbach et al. 1989). Site-directed 
spin-labeling EPR often uses a variant form of cysteine 
scanning to measure distances in molecules, providing 
low-resolution information on protein folds and confor¬ 
mational changes. Here, pairs of nonadjacent cysteine 
residues are introduced and labeled with paramagnetic 
spin labels and the distances measured using measured 


using EPR methods (Cooke and Brown 201 1). In many 
cases, cysteine scanning requires that existing free cys¬ 
teine residues are mutated prior to cysteine-scanning 
studies. 

Cross-References 

► Double-Mutant Cycle Analysis 

► EPR Spectroscopy: General Principles 

► Protein-Protein Interactions - Inhibition 
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Synonyms 

Cross-bridges; Molecular motors; Structural features 
of myosin classes 

Definition 

Myosin classification is based on phylogenetic analy¬ 
sis of their motor domain. 

Introduction 

Myosins belong to a superfamily of proteins that con¬ 
tain three functional domains (see ► Myosin: Funda¬ 
mental Properties and Structure and >• Myosin Work 
and Motility: Mechanism): (1) a motor domain, with 
a highly conserved core structure comprising the ATP 
and actin-hinding sites, (2) a neck domain (also called 
lever arm), consisting of an a-helical strand stabilized 
by calmodulin and/or light chains forming a sequenced 
IQ motif, and (3) a C-terminal domain, called the tail 
region, which is variable in amino-acid sequence and 
length. While the motor domain defines most of the 
mechanical characteristics of myosins, the tail domain 
contributes to position the motor region while 
interacting with actin, and allows myosin heavy chains 
to bind to their specific cargo. 

The classification of myosins is based on phyloge¬ 
netic analysis of their motor domains. Although myo¬ 
sin classification has been the focus of many studies, 
and there is a general agreement on most myosin 
classes, the detailed nomenclatures and subclassifica¬ 
tions may vary according to the methods used hy 
different groups. An analysis performed with 67 spe¬ 
cies based on 267 myosins grouped them in 24 classes 
(Foth et al. 2006), while an analyses performed with 20 


species using 150 myosins grouped them in 37 classes 
(Richards and Cavalier-Smith 2005). A comprehen¬ 
sive study presented a comparative genomic analysis 
of 2,269 myosins in 328 organisms (Odronitz and 
Kollmar 2007). Based on the positions of myosins in 
the phylogenetic tree of the motor domains in each 
group analyzed, the authors reconstructed the tree of 
eukaryotic life and identified 35 myosin classes 
(Odronitz and Kollmar 2007) (Fig. 1). 

Myosins from different classes are also divided into 
subclasses, making the general classification complex. 
Furthermore, although a large number of myosins have 
been identified, the roles of several of them are not well 
understood yet. In this entry, the 18 major classes of 
myosin with their respective subclasses that have been 
proposed by several authors and have had their func¬ 
tions investigated are described. All required genes for 
coding the 18 classes are spread in several different 
eukaryotic organisms; mammals contain the largest 
number of myosin genes (at least 28 myosin genes). 
Figure 2 shows a phylogenetic tree of myosin classes 
based on the molecule’s motor domains including the 
18 classes of myosin described in this entry, and also 
additional myosins found in chordate (class XIX), 
insects (class XX), kinetoplastids (class XXI), 
apicomplexans, and diatom algae (classes XXII, 
XXIII and XXIV) (Foth et al. 2006). 

The first myosin reported in the literature was myo¬ 
sin II, which has heavy chains that self-associate to 
form dimmers and filaments, commonly called “con¬ 
ventional” myosins. The description of myosin II was 
followed by the discovery of myosin I, the first “uncon¬ 
ventional” class of myosin described (Pollard and 
Kom 1973). The subsequent classes were named 
according to the order of discovery. Most of our 
knowledge on molecular motors structure and func¬ 
tioning is based on studies performed with myosin II, 
responsible for muscle contraction. There is also a 
substantial amount of information on myosin I and 
myosin V, but less Is known about other classes 
(Berg et al. 2001). 

Myosins perform their work while cyclically 
attaching/detaching from actin. The duty ratio of myo¬ 
sin corresponds to the fraction of time it remains 
strongly bound to actin during an attachment/detach¬ 
ment cycle. The duty ratios of different myosins vary 
according to their functions; myosins with a high-duty 
ratio spend most of their cycles strongly attached to 
actin, while myosins with a low-duty ratio motor 
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3 Myo20 
■ Myo17 


Myosin Family Classification, Fig. 1 Phylogenetic tree of myosin sequences of class-VI and their distribution. The bar 
myosins constructed from multiple sequence alignment of corresponds to estimated amino-acid substitutions per site (Dia- 
1984 myosin motor domains. The expanded view shows the gram from Odronitz and Kollmar (2007), with permission) 
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i; XVII 


Myosin Family Classification, Fig. 2 Phylogenetic tree of 
myosin classes based on the molecule’s motor domains. Myosin 
class XVII was omitted from the general tree, but it is shown in 


the right inset figure, which depicts the respective part of the tree 
where these myosins were included (Adapted from Foth, et al. 
(2006), with permission) 
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spends most of their cycles detached from actin. Myo¬ 
sins II, which work cooperatively in thick filaments 
within muscle cells, have a low-duty ratio; multiple 
myosin motors rapidly attach and detach from actin 
during a contraction. Myosins V, on the other hand, 
have a high-duty ratio, as two motor domains of the 
molecule work processively to carry cargos inside 
the cells, and do not detach from actin at the same 
time for several nanometers of motility. 

There are several scientific reviews that explain in 
details the structures and major roles of different myo¬ 
sins (Berg et al. 2001; Mooseker and Cheney 1995; 
Sellers 2000; De La Cruz and Ostap 2004). Some 
information gathered in this entry was taken from 
such reviews - the reader is strongly encouraged to 
look into these reviews for in-depth information of the 
different myosins. Details about the mechanism behind 


the mechanical work performed by myosins resulting 
from ATP hydrolysis and the interactions with actin 
will not be discussed, as they are the subject of other 
entries of the Encyclopedia of Biophysics (► ATP- 
Driven Mechanical Work Performed by Molecular 
Motors, ► Myosin: Fundamental Properties and Struc¬ 
ture, ► Myosin Work and Motility: Mechanism). 

Myosin I 

In 1973, Pollard and Korn (1973) reported the discov¬ 
ery of an “unconventional,” single-headed myosin, 
unable to associate into bipolar filaments. Genetic 
analysis showed variability among the motor domains 
of this myosin class, which led to further subclassifi¬ 
cations. Myosins I configure the largest single class of 
myosin that have been characterized in the literature. 
Myosins I are responsible for cell crawling and 
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division, phagocytosis, and membrane ruffling (Pol¬ 
lard et al. 1991). 

Although many subclasses of myosin I have been 
reported, the names differ according to the method of 
classification - there are reviews that cover the differ¬ 
ent myosins I in details (e.g., Mooseker and Cheney 
1995; De La Cruz and Ostap 2004; Gillespie et al. 
2001). Myosin I nomenclature may be confusing, 
with different systems in use and different genes 
given the same names, but they can be generally sep¬ 
arated in two large subclasses: the long-tailed myosins 
and the short-tailed myosins, including myola, myolb, 
and myolc (De La Cruz and Ostap 2004). 

Myosins I are characterized by a single-motor 
domain attached to a lever arm that contains two or 
three IQ motifs able to bind to calmodulin. Their tail 
domain can be found in three forms, believed to have 
the same protein ancestor, called Tail Homologues : 
THl, TH2, and TH3 (Pollard et al. 1991). Myosins 
I can be found in protozoa (i.e., Acanthamoeha and 
Dictyostelium), Drosophila Melanogaster, fungus 
(i.e., Aspergillus), and several vertebrates (chicken, 
rat, and human) (Mooseker and Cheney 1995). 

Myosin II 

Myosins II are the most studied among all myosins; 
virtually all kinetic models of myosins have been 
based on this “conventional” class. Myosins II are 
responsible for skeletal, cardiac, and smooth muscle 
contraction. Nonetheless they can be found in various 
eukaryotic organisms, and are commonly divided into 
several subclasses, based on the sequence analysis of 
their motor domain, including skeletal and cardiac 
sarcomeric myosins, vertebrate smooth muscle and 
non-muscle myosins, myosins from Acanthamoeha, 
Dictyostelium, and Physarum, and myosins from 
Saccharomycotina (Berg et al. 2001; Sellers 2000). 

All myosins from class II are doubled headed, 
containing two heavy chains weighing between 171 
and 244 kDa, and two pairs of light chains (essential 
light chains - ELC, and regulatory light chains - RCL). 
Phosphorylation of the RLC induced by Ca^"^ binding 
to calmodulin and activation of the enzyme myosin 
light chain kinase (MLCK) primarily regulates smooth 
muscle contraction, while it has a secondary role in 
skeletal muscle contraction (Sweeney and Stull 1990). 
ELC stabilizes the lever arm of myosins while 
the molecule performs mechanical work and the 
so-called power-stroke while attached to actin 


(see ► ATP-Driven Mechanical Work Performed by 
Molecular Motors and Myosin Work and Motility: 
Mechanism). During muscle contraction, ATP hydro¬ 
lysis in the myosin II motor domain causes conforma¬ 
tional changes on the converter domain, leading to 
a rotation in relation to the lever arm. During this 
“power-stroke,” myosin II causes sliding of actin. 
Myosins of class II are also important for chemotaxis, 
cytoskeleton assembly, and disassembly (Sellers 
2000 ). 

Myosin III 

Myosins III are single-headed molecules found in Dro¬ 
sophila Melanogaster, Octopus, and several verte¬ 
brate. Myosin III, sometimes called nina-C, diverges 
from the other classes by having a kinase-like domain 
in the N-terminal, which links the cytoskeleton to the 
signaling complex during photo-transduction (B abler 
2000 ). 

Myosin IV 

Myosins IV lack the coiled-coil domain, and thus it is 
mostly considered a single-headed molecule. How¬ 
ever, two myosins IV join for cargo transport. Myosins 
IV have the motor domain linked to a single IQ motif, 
followed by (TH4). This myosin class has been found 
only in Acanthamoeha (Berg et al. 2001). 

Myosin V 

Myosins V are cargo transport proteins. They are dou¬ 
ble-headed; each motor domain is followed by a lever 
arm attached to six calmodulins and light chains. The 
two lever arms converge into a rod containing 
a a-helical coiled-coil domain, which is linked to 
a globular tail specialized to attach to specific cargos. 
Structural changes induced by ATP hydrolysis allow 
the power-stroke produced by myosin V to be ampli¬ 
fied by the lever arm, resulting in a displacement of 
^36 nm/step (Trybus 2008). Myosins V are found in 
almost all eukaryotes, except plants. Two subclasses of 
myosin V (Myo2p and Myo4p) are found in inverte¬ 
brates, and three subclasses of myosin V (Myosin Va, 
Vb, and Vc) are found in vertebrates (Berg et al. 2001). 

Myosin VI 

Myosins VI are found in Drosophila and in several 
vertebrate (chickens, mice, pigs, and humans), and are 
responsible for cargo transport. Myosins VI are dou¬ 
ble-headed and have their motor domains linked to 
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a single IQ motif, followed by two separate lever arms 
that end in two linked a-helical coiled-coil structures. 
These helical structures are followed by two globular 
tails that will hx to the cargos to be transported (Sellers 
2000). Close to the converter site, between the head 
and the neck domains, there is a 53-amino-acid motif 
peculiar to this class that allows myosins VI to move 
on the opposite direction than other myosins, toward 
the negative pole of actin filaments (Wells et al. 1999). 

Myosin VII 

Myosins from class VII have two heads. The molecule 
motor domains are followed by five IQ motifs. The tail 
domain contains a short coiled-coil structure and two 
formerly termed talin homology domains (PERM), 
two TPI4, and one third sulphydryl domain (SH3). 
Two subclasses of myosin VII have been described: 
myosin VIIA and myosin VIIB, both identified in 
Drosophila, Caenorhabditis elegans, and several 
mammals (humans, pigs, and mice). Little is known 
about myosin VII functioning, but alterations in genes 
from subclass A have been related to deafness and 
blindness in mice and humans (Sellers 2000). 

Myosin VIII 

Myosins VIII have two motor domains followed by 
a 90-residue amino-terminal extension, four IQ motifs, 
a coiled-coil domain, and a globular tail. This class of 
myosins is found only in plants (Arahidopsis, Micella, 
and Chara) (Sellers 2000). 

Myosin IX 

Myosins IX have only one head, followed by six IQ 
motifs, a tail domain containing a motif, followed 
by a specific domain that shares identity with GPTase 
activating proteins (GAPs) from the Rho subfamily of 
G-proteins. This highly complex structure is believed 
to be involved in signaling pathways for cytoskeleton 
organization (Bahler 2000). At least four subclasses 
from myosin IX have been described: myr-5 and myr-7 
from rats, IXB from humans, and hum-7 from 
Caenorhabditis elegans (Bahler 2000). 

Myosin X 

Myosins X are double-headed and is found in frogs and 
bovines (Sole et al. 1994). The motor domain is 
followed by three IQ motifs and a tail with a coiled- 
coil structure and one Pleckstrin Homology (PH) 
domain - an amino-acid sequence associated with 


different intracellular signaling processes (Mayer 
et al. 1993) - that is found in various GAPs and kinases 
involved in signal transduction (Sellers 2000). 

Myosin XI 

Myosins XI have two motor domains, six IQ motifs, 
and tail coiled-coil elements separate by non-helical 
elements, similar to myosin V. Myosins XI are found 
in plants {Arabidopsis and Chara), and are responsible 
for fast cargo transport (Mooseker and Cheney 1995; 
Sellers 2000). 

Myosin XII 

Myosins XII have a motor domain followed by two IQ 
motifs, followed by a short coiled-coil domain. Myosin 
XII has been identified in Caenorhabditis elegans 
(Sellers 2000). 

Myosin XIII 

Myosins XIII have one motor domain followed by 
a number of IQ motifs that may vary from three to 
five in the subclass that contains MYOl-genes, and 
from five to seven in another subclass that contains 
MY03-genes. Myosin XIII has been identified in 
Acetabularia cliftonii (Sellers 2000). 

Myosin XIV 

Myosins XIV have one motor domain virtually lacking 
a neck and IQ motifs, followed by a short tail region. 
Myosins XIV are found mostly in parasites - 
Toxoplasma gondii and Plasmodium falciparum — and 
their role is unclear (Sellers 2000). 

Myosin XV 

Myosins XV are a single-headed myosin with the 
heaviest myosin heavy chain among all classes: 
a molecular weight of 395 kDa. The motor domain is 
followed by two IQ motifs and a tail that lacks a coiled- 
coil structure. Myosins XV are found in the pituitary 
gland and in the hair cells from rodents and humans, 
and thus are likely related to pituitary and hearing 
functioning (Sellers 2000). 

Myosin XVI 

Myosins from class XVI have one head, followed by 
one IQ motif and a tail with no coiled-coil structure. 
Myosins XVI are found in glial and neuronal cells from 
rats and humans, and participate in cell migration 
during brain development (Patel et al. 2001). 
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Myosin XVII 

Myosins XVII are uniquely associated with 
a metabolic enzyme (the chitin synthase from fungi) 
(Fujiwara et al. 1997). The structure of this myosin is 
simple: a sole motor domain followed by the chitin 
synthase domain, lacking IQ motifs, tail or coiled-coil 
structure. 

Myosin XVIII 

Myosins XVIII have two motor domains and one IQ 
motif followed by a long tail with a coiled-coil struc¬ 
ture (Furusawa et al. 2000). The N-terminal of myo¬ 
sin XVIII is linked to a PDZ domain - a specific 
polypeptide found in signaling proteins of different 
species - followed by a KE-rich region - another 
amino-acid sequence believed to participate in 
nuclear targeting and microtubule binding (Sheng 
and Sala 2001). Different from all other classes, 
myosin XVIII seems to lack the actin-binding site - 
the KE-rich region may be associated with microtu¬ 
bules instead of actin. Myosin XVIII is found in bone 
morrow stromal cells from mice, suggesting their 
involvement in structural organization and mainte¬ 
nance of cell architecture (Berg et al. 2001; 
Furusawa et al. 2000). 

Summary 

Myosins belong to a family of proteins containing 
a motor domain, which has ATP and actin-binding 
sites, a neck domain, and a tail region. The motor 
domain defines most of the mechanical characteristics 
of myosins, and serves as the basis for myosin classi¬ 
fication. There are different classifications of myosin, 
but there is a general agreement on at least 18 classes, 
presented in this entry. Some of the myosins are well 
characterized (myosins I, II, and V), while other myo¬ 
sins are still under investigation. 

Cross-References 

► ATP-Driven Mechanical Work Performed by 
Molecular Motors 

► Dynein Family Classification 

► Electron Microscopy of Motor Structure and 
Possible Mechanisms 

► Kinesin Superfamily Classification 


► Myosin: Fundamental Properties and Structure 

► Myosin Work and Motility: Mechanism 
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Synonyms 

Molecular motors; Structural basis of myosin motility; 
Structural features common to all myosin classes 

Definition 

Myosins are a superfamily of molecular motors that 
convert the chemical energy of ATP hydrolysis into 
directed motion along the filamentous protein actin. 

Introduction 

Within the myosin superfamily, there are numerous 
classes that are responsible for a wide range of cellular 
processes requiring generation of force and motion, 
including organelle trafficking, cellular motion, cyto¬ 
kinesis, and muscle contraction (reviewed in Foth et al. 
2006). The goal of this entry is to discuss the common 
myosin structural features that allow these motors to 
convert the chemical energy associated with ATP 
hydrolysis into force and motion. 

Myosin Nomenclature 

A detailed description of myosin nomenclature can be 
found in > myosin family classification; a brief over¬ 
view is presented here. The term “myosin” was origi¬ 
nally used to describe the protein responsible for force 
generation in the thick filaments of striated and smooth 
muscle cells. Eventually, non-muscle myosins with 
similar properties (including the ability to form thick 
filaments) were purified from other eukaryotic cells. In 
1973, a myosin purified from Acanthamoeha was 
shown to have properties that distinguished it from 
other known myosins, including a much lower molec¬ 
ular weight and the inability to form thick filaments 
(Pollard and Kom 1973). This was the first discovery 
of a novel myosin class. 


According to current nomenclature, the original 
muscle and non-muscle myosins that are capable 
of forming thick filaments are referred to as conven¬ 
tional myosins, and all other classes are unconventional 
myosins. Because the heavy chains of conventional 
myosins self-associate as dimers, they are also referred 
to as class II myosins. Because the Acanthamoeha myo¬ 
sin described above is monomeric, it is called a class I 
myosin. All other myosin classes are numbered 
according to the order in which they were discovered. 

Domain Structure of Myosins 

All myosin heavy chains exhibit a three-domain struc¬ 
ture (Fig. la). At the N-terminus is the motor domain, 
also known as the catalytic head, which contains the 
motor’s ATP- and actin-binding sites. Following the 
head is the neck domain, consisting of an a-helical 
strand stabilized by the binding of one or more cal¬ 
modulin or calmodulin-like light chains to a sequence 
termed the IQ motif (IQXXXRGXXXR) (Cheney and 
Mooseker 1992). The most C-terminal domain is the 
tail domain. The tail can serve a variety of roles, 
including allowing myosin heavy chains to self-asso- 
ciate, binding a myosin to its cargo, and anchoring the 
myosin to structures in the cell. 

Based on phylogenetic analysis of the motor 
domain, there are at least 35 distinct myosin classes 
found in eukaryotes, 13 of which are expressed in 
humans (Ordonitz and Kollmar 2007). Myosins within 
each class exhibit similar tail domain organization, 
suggesting that the heads and tails have evolved 
together. This likely indicates that motor domains 
have specific kinetic and biochemical features that, 
when coupled to particular tail domains, allow them 
to perform distinct functions (Cope et al. 1996). 

From Sliding Filaments to Lever Arms: Seeking 
Out the Mechanism of Myosin Motion 

In 1864, Wilhelm Kiihne extracted a viscous protein 
from muscle that he named “myosin” and considered 
responsible for the rigor state of muscle (Kiihne 1864). 
A history of the experiments that helped elucidate how 
this protein generates force and motion can be found in 
the articles (Szent-Gyorgyi 2004; Cooke 2004). Some 
of the key experiments that have led to our current 
understanding are described below. 

Early myosin studies focused on the role of the 
protein in striated muscle contraction. The basic con¬ 
tractile unit of striated muscle, the sarcomere, consists 
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1 . ATPase cycle 

2. Actin-binding cycle 

3. Mechanical cycle 


Myosin: Fundamental Properties and Structure, Fig. 1 (a) 

Three-domain structure of myosin. Myosins are comprised of an 
N-terminal motor domain (green), which contains the actin- and 
nucleotide-binding sites, a neck domain (purple), which consists 
of an Qt-helix stabilized by the binding of one or more light 
chains, and a C-terminal tail domain (brown), which is the 
most diverse region among the myosin classes, (b) A myosin 
must coordinate three cycles in order to convert the chemical 
energy of ATP hydrolysis into motion along actin 


of interdigitated arrays of thick and thin filaments, 
which are primarily comprised of myosin and actin, 
respectively (Fig. 2a). To describe the mechanism of 
sarcomere contraction, the sliding filament theory was 
put forth, which hypothesized that during contraction, 
the length of the filaments remains constant, but the 
overlapping arrays of filaments slide past each other 
(Huxley and Hanson 1954; Huxley and Niedergerke 
1954) (Fig. 2c). This theory is still widely accepted, 
though it has been refined, as described below. 

Electron microscope observations of muscle 
revealed linkages between the actin and myosin fila¬ 
ments, referred to as cross-bridges, which extend from 
the myosin filaments (Huxley 1957) (Fig. 2b). In 1969, 
Hugh Huxley proposed the swinging cross-bridge 
model to explain filament sliding within a sarcomere 
(Huxley 1969). According to this model, the cross¬ 
bridges interact cyclically with an actin filament, cre¬ 
ating a rowing motion that is coupled to ATP hydroly¬ 
sis. This requires that the myosin cross-bridge bind to 
an actin filament in one conformation and then 
undergo a structural swinging motion, referred to as 
the powerstroke, while bound to the filament. 

Spectroscopic and structural techniques such as 
EPR and fluorescence spectroscopy. X-ray and neutron 


scattering, and electric birefringence were utilized to 
search for signs of cross-bridge motion. Surprisingly, 
these studies revealed that the majority of the myosin 
protein does not move significantly during force pro¬ 
duction (reviewed in Cooke 1986). This led to the 
hypothesis that the bulk of the myosin was attached 
to actin in a rigid fashion, and that a distal part of the 
myosin was envisaged to swing as a rigid lever (Pig. 3). 
Thus, the swinging cross-bridge model became the 
lever arm model. 

Coupling Myosin Motion to ATP Hydrolysis 

Steady state kinetics experiments revealed two seem¬ 
ingly contradictory aspects of the actomyosin ATPase 
cycle: (1) actin accelerates myosin’s ATPase activity 
and (2) ATP dissociates myosin from actin (Eisenberg 
and Moos 1968, 1970; Eisenberg et al. 1968). Through 
their transient state experiments, Lymn and Taylor 
were able to reconcile these observations and proposed 
a model for how the ATPase cycle is linked to motion 
of the cross-bridge. Their key observations were that 
ATP hydrolysis occurs when the motor is not bound to 
actin, and that binding of actin accelerates release of 
the hydrolysis products (Lymn and Taylor 1971). 

Their model is presented in Fig. 4. Starting at the top 
right state, the motor is bound to actin in the absence of 
nucleotide (AM). Binding of ATP causes the motor to 
dissociate from actin (M-T). This is followed by 
hydrolysis of the ATP and a conformational change 
of the myosin that primes the motor for its powerstroke 
(referred to as the recovery stroke) (M-D-Pj). The 
motor then binds to actin (AM-D-Pi), which results in 
the dissociation of the hydrolysis products and the 
powerstroke. The motor thus returns to its original 
state (AM). 

This model is still believed to be correct, although it 
has been greatly expanded upon. In particular, it is now 
better understood how the motor couples its ATPase 
cycle with binding and release of actin and to the 
conformational changes associated with the recovery 
stroke and powerstroke. For a detailed description of 
the myosin chemical and mechanical cycle, see 
► Myosin Work and Motility: Mechanism. 

High-Resolution Structures Provide a Structural 
Basis for the Lever Arm Model 

The first high-resolution X-ray structure of myosin 
revealed a likely candidate for the lever arm. 
Payment et al. solved the structure of the SI 
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Properties and Structure, 
Fig. 2 Sliding filament theory 
of muscle contraction, (a) The 
structure of a single sarcomere 
consists of interdigitated 
arrays of thick (myosin- 
containing) and thin (actin- 
containing) filaments 
mage from Szent-Gyorgyi 
2004). (b) A highly magnified 
(600,000x) electron 
micrograph of the central 
region of a sarcomere from 
cross-striated muscle reveals 
the cross-bridges linking the 
thin and thick filaments 
(Image from Huxley 1957). 

(c) A sarcomere shortens when 
the myosin cross-bridges 
interact with the thin 
filaments, propelling the thin 
filaments toward the center of 
the sarcomere (Image from 
Szent-Gyorgyi 2004) 


a c 

thick filament 


thin filament 
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cross bridge 
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proteolytic fragment of myosin II from chicken mus¬ 
cle, which consists of the first ^850 residues of the 
myosin heavy chain along with the two associated 
light chains (Rayment et al. 1993). This structure, 
which contains no bound nucleotide but has a sulfate 
ion in the active site, revealed that the C-terminus of 
the SI fragment consists of an extended a-helix to 
which the light chains are bound (i.e., the neck 
domain described above). Rayment et al. suggested 
that the purpose of this domain is to “amplify the 
conformational changes associated with the active 
site” (Rayment et al. 1993). In other words, they 
hypothesized that the neck functions as the lever 
arm, amplifying small movements within the motor 
domain. 

This conformation of the motor is now known to be 
the post-rigor state of myosin, which is the state of the 
motor bound to ATP following dissociation from actin 
(M-ATP in Fig. 4). A similar conformation has since 
been observed for various myosins in the presence of 
ADP, ATP, MgADP-BeFx (a nucleotide analog that 
mimics ATP), and in the absence of nucleotide 
(Bauer et al. 2000; Fisher et al. 1995; Risal et al. 
2004; Coureux et al. 2004). An example of a post¬ 
rigor structure is shown in Fig. 5. 


A second myosin conformation was revealed from 
crystal structures of myosins bound to nucleotide ana¬ 
logs that mimic either the ATPase transition state or 
ADP Pi (Smith and Rayment 1996; Dominguez et al. 
1998; Houdusse et al. 2000; Kollmar et al. 2002). This 
conformation is referred to as the pre-powerstroke 
state, and corresponds to the state of the myosin just 
prior to actin binding (M-ADP-Pj in Fig. 4). In support 
of the lever arm model, the post-rigor and pre- 
powerstroke states exhibit subtle differences in the 
motor domain, but show very different lever arm posi¬ 
tions (Fig. 5). Taken together, these two conformations 
allowed for visualization of the recovery stroke of the 
myosin, and predicted a ^70° rotation of the lever arm 
resulting in a motion of ^10 nm at the end of the neck 
domain. 

Support for the Lever Arm Model 

Support for the lever arm model has come from 
a variety of experiments, discussed in (Spudich and 
Sivaramakrishnan 2010; Sun and Goldman 2011). One 
of the most significant pieces of evidence has come from 
single-molecule experiments (including optical trap and 
fluorescence experiments) capable of directly detecting 
the powerstroke of individual motors. These experiments 
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Myosin: Fundamental Properties and Structure, 
Fig. 3 Lever arm model of the myosin powerstroke. Myosin 
binds to actin and undergoes a conformational change in which 
the distal part of the motor swings as a rigid lever, while the bulk 
of the motor does not move (Image from Geeves and Holmes 
1999) 


have measured step sizes of various myosin motors, and 
these values are consistent with the conformational 
changes predicted by the pre-powerstroke and post¬ 
rigor crystal structures. In addition, numerous experi¬ 
ments have verihed an important prediction of the lever 
arm model: that the velocity and step size of myosin 
motion is proportional to the length of the neck domain. 
Thus, the lever arm model is commonly accepted as the 
mechanism that describes the conformational changes 
associated with myosin force generation. 

Structure of the Motor Domain 

Comparison of the various myosin crystal structures 
reveals a shared motor domain architecture, described 
below. 
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Myosin: Fundamental Properties and Structure, 
Fig. 4 The myosin chemomechanical cycle. The actin sites 
to which myosin is able to bind are represented as circles. The 
cycle is described in the text (Adapted from Lymn and 
Taylor 1971) 
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Myosin: Fundamental Properties and Structure, 
Fig. 5 Two conformations of the myosin lever arm for 
Agropecten uradians (bay scallop) myosin II. The motor domain 
is grey, the neck domain is red, and the two light chains are 
orange and yellow. Consistent with the lever arm model, the pre- 
powerstroke (pdb: IQVI) and post-rigor (pdb: 1SR6) states of 
myosin show small conformational changes in the motor domain 
coupled to a large change in the position of the lever arm. The 
motor domains of the two structures (excluding the SH3 domain, 
which is not shown) were aligned in PyMol 
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Myosin: Fundamental Properties and Structure, 
Fig. 6 Structure of the motor domain of chicken myosin V. 
(a-e) are structures of myosin V bound to MgADP.BeFx in the 
post-rigor state (pdb: 1W7J). (a) The motor domain is gray, the 
neck domain is red, and the myosin V essential light chain (ELC) 
is blue, (b) The four subdomains of the motor domain are the 
N-terminal subdomain (red), the upper 50-kDa subdomain 
(U50, orange), the lower 50-kDa subdomain (L50, green), and 
the converter subdomain (blue), (c) The four joints are switch II 
(blue), the strut (orange), the relay (red), and the SHI helix 


(green). Also shown are the P-loop (purple). Loop 1 (red), and 
switch I (leal), (d) The color scheme is the same as (a), except 
the central seven-stranded fi-sheet which is shown in purple, (e) 
Alignment of the seven-stranded P-sheet for myosin V bound to 
MgADP.BeFx in the post-rigor state (pdb: 1W7J; green) and 
myosin V without bound nucleotide in the rigor state (pdb: 
10E9; orange). The ADP.BeFj molecule is also shown, as are 
switch I and switch II. As the figure shows, a change in the twist 
of the P-sheet results in a change in the positions of switch I and 
switch II. A similar alignment is depicted in Elinas et al. 2012 


For many myosin classes, the first 50-80 residues of 
the motor domain contain a protruding six-stranded, 
antiparallel P-barrel with similarities to the Src 
Homology 3 (SH3) domain. In the early myosin II 
crystal structure, this appeared as an independently 
folding domain making little contact with the rest of 
the domain (Rayment et al. 1993). This is one of the 
least conserved regions within the myosin motor 
domain. Although the function of this domain is 
unknown, it has been shown that it is important for 
normal functioning of a myosin II, affecting both 
in vivo function and kinetic properties of the motor 
(Fujita-Becker et al. 2006). The domain is conforma- 
tionally sensitive to the nucleotide state of the motor 
(van Duffelen et al. 2005), and it has been suggested 


that it may be implicated in the structural integrity of 
the motor domain (Nikolaeva et al. 2002). In addition, 
this domain may enable communication between the 
essential light chain and actin (Lowey et al. 2007). 

The bulk of the motor domain consists of a-helices 
surrounding a core seven-stranded P-sheet (Fig. 6). 
The a-helices form a deep cleft in the motor extending 
from the nucleotide-binding site to the actin-binding 
interface. To generate force and motion, the motor 
domain must coordinate three cycles: (1) an ATPase 
cycle (consisting of ATP binding, ATP hydrolysis, and 
release of hydrolysis products), (2) an actin-binding 
cycle (consisting of binding and release of actin), and 
(3) a mechanical cycle (consisting of the powerstroke 
and recovery stroke) (Fig. lb). To achieve this 
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coordination, communication within the motor domain 
must occur between the nucleotide-binding site, the 
actin-binding interface, and the lever arm. 

The motor consists of four subdomains linked by 
highly conserved deformable connectors (also referred 
to as joints). The four subdomains, which move as 
single units, are the N-terminal subdomain, the upper 
and lower 50-kDa subdomains (U50 and L50), and the 
converter subdomain (Fig. 6b). The joints include 
switch II, the strut, the relay, and the SHI helix 
(Fig. 6c). The cleft in the motor domain separates the 
U50 and L50 domains, and is thus called the 50-kDa 
cleft. Movements of the four domains are coupled as 
a result of these connectors, allowing for direct com¬ 
munication between the various parts of the motor 
domain. A detailed description of the allosteric mech¬ 
anisms that allow for communication between the var¬ 
ious parts of the motor domain can be found in 
Sweeney and Houdusse (2010) and Yang et al. 
(2007). Key aspects are described below. 

1. The lever arm: Because the converter domain is 
connected directly to the neck, it is essentially part 
of the lever arm. The converter is the most mobile 
domain and is connected to the L50 and N-terminal 
domains by the relay and the SHI helix, respectively. 
Rotation of the converter and lever arm is responsible 
for amplifying smaller conformational changes within 
the rest of the motor domain, especially changes in the 
relative positions of the N-terminal and L50 domains. 
In addition, the position of switch II, which interacts 
with bound nucleotide, affects lever arm position by 
inducing conformational changes at the interface 
between the relay and the SHI helix. 

2. The nucleotide-binding site: The nucleotide¬ 
binding site is of the P-loop variety found in some 
other ATPases and GTP-binding proteins, and is 
located near the interface between the N-terminal 
and U50 domains. Three elements contribute to 
nucleotide binding: the P-loop, switch I, and 
switch II (Fig. 6c). Switch II also connects the 
U50 and L50 domains at the bottom of the 50-kDa 
cleft that separates these two domains. Switch I and 
switch 11 move in response to the nucleotide state of 
the motor, and are especially sensitive to the pres¬ 
ence or absence of y-phosphate in the active site. 

3. The actin-binding interface: The actin-binding 
interface is comprised of portions from both the 
U50 and L50 domains. Closure of the 50-kDa 
cleft, which is largely dictated by the conformations 


of the strut and switch II joints, results in an 
increased affinity of the motor for actin. In general, 
strong binding of myosin to actin results in weak 
binding of nucleotide, and vice versa. This is 
because closure of the 50-kDa cleft causes elements 
of the nucleotide-binding pocket to move to posi¬ 
tions that can no longer strongly coordinate bound 
nucleotide. 

A picture thus begins to arise of how the various 
parts of the motor can communicate with one another. 
For example, strong binding of myosin to actin is 
associated with closure of the 50-kDa cleft. This, in 
turn, results in reduced affinity for nucleotide as well as 
a change in the position of the lever arm due to 
a change in the conformation of switch II. Such 
communication results in tight coupling of the 
ATPase, actin-binding, and mechanical cycles, and 
the resultant cycle is referred to as the myosin 
chemomechanical cycle. The Lymn and Taylor 
model (Fig. 4) presents a simplified description of 
this cycle. For an expanded model, see ► Myosin 
Work and Motility; Mechanism. 

The Rigor Structure Reveals a Transducer 

This simple description of the motor domain as 
consisting of four rigid bodies connected by deform¬ 
able linkers is a powerful model for describing how 
myosin’s chemical and mechanical processes are 
coupled. However, the model was modified when crys¬ 
tal structures of nucleotide-free myosins (Coureux 
et al. 2004; Menetrey et al. 2005; Yang et al. 2007) 
revealed a rigor-like conformation of the motor (AM 
in Fig. 4). These structures showed that the (I-sheet at 
the core of the myosin undergoes various degrees of 
twisting in response to relative movements of the 
N-terminal and U50 domains. This, in turn, impacts 
elements of the nucleotide-binding site (see Fig. 6e). It 
has been suggested that this twisting controls the recip¬ 
rocal relation between actin- and nucleotide-binding 
(Llinas et al. 2012). The p-sheet and the stmctural 
elements that accommodate this twisting (including 
loop 1) are thus referred to as the transducer element. 

Summary 

Although the number of distinct myosin classes has 
grown rapidly since 1973, so has our understanding of 
the properties that unite these various classes. 
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A picture thus begins to arise of a universal myosin 
engine. This engine can be utilized to achieve disparate 
functions in different cellular environments, and it can 
even be kinetically tuned. Nevertheless, all myosin 
motors appear to share a set of well-defined structural 
features that allow for coupling of chemical and 
mechanical events, resulting in generation of force 
and motion. 
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Synonyms 

Chemomechanical cycle of myosins; Molecular 
motors; Structural and kinetic tuning of myosins 

Definition 

Myosins are a superfamily of molecular motors that 
convert the chemical energy of ATP hydrolysis into 
directed motion along the filamentous protein actin. 

Introduction 

Myosins function by converting the chemical energy 
associated with ATP hydrolysis into force and directed 
motion. The goal of this review is to discuss the 
coupled chemical and mechanical cycles that allow 
this energy transduction to occur and to examine how 


this kinetic cycle is “tuned” by the various myosin 
classes to achieve distinct functions. 

Myosin Structural Features 

Although there are at least 35 distinct myosin classes 
implicated in a variety of cellular processes (Ordonitz 
and Kollmar 2007), all myosins share certain structural 
features, which are discussed in detail in “► Myosin: 
Fundamental Properties and Structure.” A brief over¬ 
view is provided here. 

Myosins exhibit a three-domain structure (Fig. la). 
The most N-terminal domain is the motor domain, 
which contains the actin-binding interface and the 
nucleotide-binding site. This is followed by the neck 
domain, which consists of an a-helix that is stabilized 
by the binding of one or more calmodulin or 
calmodulin-like light chains. The most C-terminal 
domain is the tail domain, which can serve a variety 
of roles, such as allowing myosin heavy chains to self¬ 
associate, binding a myosin to its cargo, and anchoring 
the myosin to structures in the cell. 

According to the lever arm model, the rigid neck 
domain acts as a lever that amplifies small movements 
within the motor domain. The force-generating swing 
of the neck domain is referred to as the powerstroke. 

The motor domain consists primarily of tx-helices 
surrounding a core seven-stranded P-sheet (Fig. lb). 
The a-helices form a deep cleft (referred to as the 
50-kDa cleft) in the motor extending from the nucleo¬ 
tide-binding site to the actin-binding interface. The 
motor domain is comprised of four subdomains (the 
N-terminal, upper 50-kDa (U50), lower 50-kDa (L50), 
and converter subdomains) that are connected by four 
linkers (the strut, switch II, relay, and SHI helix) 
(Fig. Ic, d). Movements of the four domains are 
coupled as a result of these linkers, allowing for direct 
communication between the various parts of the motor 
domain. 

The Common Myosin Kinetic Cycle 

A myosin motor generates force and motion by cou¬ 
pling three cycles: (1) an ATPase cycle (consisting of 
ATP binding, ATP hydrolysis, and release of hydroly¬ 
sis products), (2) an actin-binding cycle (consisting of 
binding and release of actin), and (3) a mechanical 
cycle (consisting of motions of the neck domain/lever 
arm). The resultant kinetic cycle is referred to as the 
actomyosin chemomechanical cycle. During their 
kinetic cycles, all studied myosin classes appear to 
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Myosin Work and Motility: Mechanism, Fig. 1 Structural 
features of myosins, (a) Myosins are comprised of an N-terminal 
motor domain (green), which contains the actin- and nucleotide¬ 
binding sites; a neck domain (purple), which consists of an a- 
helix stabilized by the binding of one or more light chains; and 
a C-terminal tail domain (brown), (b-d) are structures of myosin 
V bound to MgADP.BeFx in the post-rigor state (pdb: 1W7J). (b) 
The motor domain is gray, the neck domain is red, and the 



myosin V essential light chain (ELC) is blue, (c) The four 
subdomains of the motor domain are the N-terminal subdomain 
(red), the upper 50-kDa subdomain (U50, orange), the lower 
50-kDa subdomain (L50, green), and the converter subdomain 
(blue), (d) The four joints are switch 11 (blue), the strut (orange), 
the relay (red), and the SHI helix (green). Also shown are the P- 
loop (purple), loop 1 (red), and switch I (teal) 


transition through the same states, though not neces¬ 
sarily at the same rates. Detailed discussions of our 
current understanding of the actomyosin 
chemomechanical cycle can be found in (Sweeney 
and Houdusse 2010; Llinas et al. 2012). An overview 
is presented below. 

In general, myosins rapidly hydrolyze ATP in the 
absence of actin. However, rapid release of the hydro¬ 
lysis products requires the motor’s interaction with 
actin. The predominant states that a motor passes 
through during the actomyosin chemomechanical 
cycle are detailed in Fig. 2a. States whose names 
begin with M represent the weak actin-binding states. 
For these states, the motor attaches and detaches from 
the actin on submicrosecond time scales, has a low 
actin-affinity, and does not bind stereospecifically to 
actin (Geeves and Holmes 1999). States whose names 


begin with AM represent strong-binding states of the 
motor, which involve stereospecific binding interac¬ 
tions and high actin-affinities (K^ 1 pM). These are 

also referred to as “force-bearing” states. 

States for Which Crystal Structures Have Been Solved 
Crystal structures have been solved for the first three 
states (labeled as rigor, post-rigor, and pre- 
powerstroke in Fig. 2a). Figure 3 shows representative 
structures for each of these states. Comparing them, 
variability is seen in the degree of twisting of the 
central seven-stranded P-sheet. This twisting is 
thought to facilitate communication between the 
actin-binding interface and the nucleotide-binding 
site, resulting in the observed reciprocal relation 
between actin- and nucleotide-binding (Llinas et al. 
2012). Because of this, the P-sheet and the structural 
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MT 

Post-rigor 


MDP, 
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A-MDP, 
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Moderate affinity Moderate affinity 
for actin for actin 



Myosin Work and Motility: Mechanism, Fig. 2 Actomyosin 
chemomechanical cycle. The myosin motor domain is shown in 
green, the neck domain is purple, and the actin is gray. The 
complete cycle is shown in (a), and is described in the text. The 
transition surrounded by blue brackets involves phosphate 


release, the powerstroke, and strong binding to actin. This tran¬ 
sition is shown in greater detail in (b), where the top and bottom 
models correspond to different proposed mechanisms. In the top 
model, phosphate release precedes the powerstroke, and in the 
bottom model, phosphate release follows the powerstroke 


elements that accommodate this twisting are referred 
to as the transducer element. Yang et al. noted that the 
lowest energy state of a p-sheet is a right-handed twist, 
and thus “twisted” and “untwisted” states of the 
P-sheet refer to lower and higher energy states, respec¬ 
tively (Yang et al. 2007). 

Starting from the left side of Fig. 2a, the motor begins 
in a state in which it is associated with actin and has no 
bound-nucleotide (AM), the so-called rigor state. The 
50-kDa cleft is completely closed (both at the actrn- 
binding interface and near the nucleotide-binding 
pocket) (Fig. 3, middle). In the absence of nucleotide, 
the P-sheet is highly twisted, positioning switch I away 
from the P-loop and close to switch II (Fig. 3, bottom). 


Upon binding ATP, the post-rigor state (M T) is 
formed where nucleotide-binding elements are 
repositioned to coordinate the nucleotide. In doing so, 
the cleft completely opens (Fig. 3, middle) causing the 
motor to dissociate from actin. In this state, the P-sheet 
partially untwists, positioning switch I near the nucle¬ 
otide and relatively far from switch II (Fig. 3, bottom). 
The lever arm position changes only slightly between 
the rigor and post-rigor states. 

Hydrolysis of ATP (M D-Pj) is associated with 
partial closure of the 50-kDa cleft, involving only the 
inner part of the cleft near the nucleotide-binding 
pocket (Fig. 3, middle). This results in interactions 
between switch II and the y-phosphate that trap the 
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Rigor Post-rigor 




Pre-powerstroke 



Myosin Work and Motility: Mechanism, Fig. 3 Representa¬ 
tive crystal structures of the rigor, post-rigor, and pre- 
powerstroke states of myosin. The rigor structure is myosin II 
from Doryteuthis pealeii (squid) in the absence of nucleotide, 
and the post-rigor and pre-powerstroke structures are of myosin 
II from Argopecten irradians (bay scallop) in the presence of 
sulfate and MgADP-vanadate, respectively. The motor domains 
(with the exception of the N-terminal SH3 domain) have been 
aligned using PyMol. For the upper images, the neck domain is 


shown in red and the two light chains are shown in orange 
and yellow. For the middle images, the light chains are not 
shown, and the converter domain is shown in blue, the SHI 
helix is shown in red, and the relay is shown in green. For the 
bottom images, the P-loop (purple), switch I (teal), and switch II 
(blue) are shown. In addition, the sulfate ion is shown for the 
post-rigor structure and ADP-vanadate is shown for the pre- 
powerstroke structure 


phosphate within the motor (Fig. 3, bottom). In addi¬ 
tion, this conformation of switch II repositions the 
relay and SHI helix such that the lever arm undergoes 
a conformational change that primes it for its next 
powerstroke, a motion referred to as the recovery 
stroke. The distortion of the P-sheet is fully untwisted 
in this state. 

Phosphate Release and Generation of Force 
Because crystal structures have not yet been solved 
for the remaining states, there is still debate as to 
how the chemomechanical cycle proceeds. It is 
likely that the motor binds weakly to actin 


(A*M D Pj) through an interaction between surface 
loops on the myosin (especially loop 2) and actin 
(Geeves and Conibear 1995; Furch et al. 1998). 
This is followed by three events: release of phos¬ 
phate from the motor, strong rebinding of the 
motor to the actin, and the force-generating 
powerstroke. As a result of these transitions, the 
motor arrives at a state where it is strongly bound 
to actin and ADP, and its lever-arm has changed 
position (AM D). While it is widely accepted that 
phosphate release and force generation are linked, 
the order of these three events has not been 
unequivocally determined. 
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Phosphate release from myosin precedes ADP, 
which means that the phosphate requires 
a “back door” from which it can exit the 
nucleotide-binding pocket since ADP is blocking its 
exit (Payment et al. 1993). In addition, in the pre- 
powerstroke state, switch I and switch II prevent the 
phosphate from dissociating. Thus, formation of an 
escape route for phosphate requires a structural 
rearrangement of switch I or switch II. This would 
either require a rearrangement of switch I that allows 
phosphate to dissociate while maintaining coordina¬ 
tion of the ADP, or a rearrangement of switch II that 
does not alter the lever arm position (see (Sweeney and 
Houdusse 2010) for a discussion of these two possible 
mechanisms). 

Two mechanisms that have been proposed for how 
phosphate release is coupled to the powerstroke are 
presented in Fig. 2b. In the top model, the motor 
undergoes a transition to a state with moderate actin- 
affinity in which the back door is open (A**M D Pi). 
Phosphate is then able to dissociate from the motor 
(A**M-D), resulting in a rearrangement of switch II 
that leads to lever arm movement as well as strong¬ 
binding to actin (AM D). Arguments supporting this 
model are presented in Sweeney and Houdusse 2010. 

According to the second model (Fig. 2b, bottom), 
the weakly bound motor transitions to a strong actin- 
binding state (AM D-Pj) and then undergoes its force¬ 
generating powerstroke (AM’ D Pj). This is followed 
by phosphate release (AM D). Arguments supporting 
this model are presented in Takagi et al. 2004. It is 
important to note, however, that Takagi et al. argue 
that force-generation must precede phosphate release. 
As they point out, if generation of force occurs when 
the motor goes from its weak to strong actin-binding 
state (A*M-D-Pi to AM-D-Pj), perhaps through 
a rotation of the myosin head, then it is possible for 
phosphate release to occur either before or after the 
powerstroke associated with switch II motion. 
According to this alternate model (not shown in 
Fig. 2b), force generation would occur in two steps. 

A Second Conformational Change is Associated 
with ADP Release 

Before releasing ADP, the motor must undergo an 
additional lever arm motion that is associated with 
a transition to a state that binds ADP weakly 
(AMDw). Evidence supporting this conformational 
change is reviewed in Nyitrai and Geeves 2004. 


Though it is uncertain whether this conformational 
change can contribute to work production, it is thought 
that the force sensitivity of this step may be important 
to the force-dependent function of some myosins (see 
the discussion below). 

At this point, the lever arm has returned to its post- 
powerstroke conformation. The (3-sheet returns to its 
twisted state, again separating switch I and the P-loop, 
resulting in loss of coordination of the ADP. The motor 
releases ADP and returns to its rigor state (AM). 

Thermal Ratchets and Powerstrokes 

In the description of the chemomechanical cycle 
above, deterministic language has been primarily 
used to describe how myosin generates motion. The 
lever arm is envisioned as an elastic element, and the 
powerstroke consists of a relaxation of a strained con¬ 
formation of the lever arm (similar to the contraction 
and relaxation of a stretched rubber band). In other 
words, the molecular motor is viewed as a “miniature 
version of a macroscopic device, employing springs, 
cogs, levers, and the like to effect motion” (Astumian 
and Derenyi 1998). 

However, there is an important difference between 
macroscopic and microscopic motors. Microscopic 
motors operate in an environment where viscosity 
dominates inertia and where random thermal fluctua¬ 
tions are significant. According to the powerstroke 
mechanism described above, thermal noise is viewed 
as something that must be overcome to create motion, 
and in the absence of this noise, the motor’s 
powerstroke could still occur. 

An alternate mechanism envisions the motor as 
a thermal ratchet (or Brownian ratchet). A thermal 
ratchet harnesses (rather than fights) thermal fluctua¬ 
tions to generate its motion. According to such 
a model, the ATPase activity serves the purpose of 
rectifying this random motion. Thus, in the absence 
of thermal noise, the motor would be unable to gener¬ 
ate motion. 

Andrew Huxley presented the first thermal ratchet 
model of myosin (Huxley 1957). More recently, 
a possible thermal ratchet mechanism was described 
in which the motion of the lever arm is envisioned as 
a means of capturing Brownian motion rather than 
a means of directly generating motion (Houdusse and 
Sweeney 2001). Although this review will not go into 
the details of these models, discussions of the relation¬ 
ship between powerstroke and thermal ratchet models 
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can be found in (Wang and Oster 2002; Howard 2009; 
Astumian 2010). It is also worth noting that the current 
model of myosin V motility, which is discussed in 
detail below, contains elements of both a powerstroke 
and thermal ratchet model. 

Differences Between the Myosin Classes 

Although all myosins undergo the same 
chemomechanical cycle, different myosins must be 
able to perform disparate cellular functions. Variability 
in the structure and kinetics of the various classes allows 
different myosins to use the common chemomechanical 
cycle to perform different tasks. Some of the ways in 
which different myosins vary and how this variability 
impacts their function are highlighted below. 

Structural Diversity Among the Myosin Classes 

Conserved Domains within the Tail 
Because the tail domain is the region of the motor that 
varies the most among the myosin classes, many of the 
differences in motor function can be attributed to dif¬ 
ferences in this region. Tails typically contain 
a number of conserved domains that allow for 
protein-protein and protein-lipid interactions, as well 
as additional functions such as kinase activity. Tail 
domain diversity, and its relation to myosin function, 
is reviewed in Krendel and Mooseker 2005. 

Dimerization Domains 

For a myosin to move cargo along an actin filament, it 
must undergo numerous steps along the actin without 
dissociating, a capability known as processivity. Because 
it is not expected that a single motor can processively 
move along actin (with the notable exception of myosin 
IX (Inoue et al. 2002; Post et al. 2002; Liao et al. 2010)), 
the tail domains of many myosins contain self-assembly 
domains (typically ► coiled coil forming domains) that 
bring together two myosin motors. Dimeric motors have 
been observed to move processively along an actin fila¬ 
ment in a hand-over-hand fashion (e.g., see Churchmann 
et al. 2005). Dimerization may also be regulated for some 
myosins. For example, myosin VI is normally mono¬ 
meric, and yet it forms a dimer when bound to its cargo 
(Llinas et al. 2012). Thus, the motor is activated to move 
processively only when it is needed. 

Lever Arm Length 

According to the lever arm model, the length of the 
neck domain of a myosin is proportional to its step size 


and to the rate at which it moves along actin, 
a prediction that has been verified by a variety of 
experiments (reviewed in Sun and Goldman 2011). 
Lever arm length plays an important role in the 
processive stepping of the two-headed processive 
motor myosin V. Because of the relatively long neck 
domain of this motor, which binds six light chains, the 
two motor domains are able to span the 36 nm pseudo¬ 
repeat of an actin hlament. Thus, the motor can trans¬ 
locate along a single face of actin without rotating 
around the filament (Sun and Goldman 2011). 

Direction of Motion 

Actin is a polarized filament, and most characterized 
myosins move along the filament toward the so-called 
(-I-) end of the filament. Myosin VI is the only motor 
known to move toward the (—) end, which allows it to 
achieve unique functions from other myosins. How 
reverse directionality is achieved is described in Llinas 
et al. 2012; some of the details are discussed here. Inter¬ 
estingly, much of the motor domain of myosin VI is 
similar to other (-I-) end directed myosins. Reverse direc¬ 
tionality primarily arises through an insert (insert-2) that 
is located between the converter and the lever arm. Insert- 
2 forms a novel calmodulin-binding motif, and the bound 
light chain interacts with the converter, stabilizing a new 
position of the lever arm. In addition, in its pre- 
powerstroke state, the converter adopts a unique fold 
that allows the lever arm to rotate by 180° during its 
powerstroke (instead of the ^70° rotation seen with myo¬ 
sin II). Thus, myosin VI still exhibits the same 
chemomechanical cycle as other myosins and conforms 
to the lever arm model, though it exhibits a unique swing¬ 
ing of its lever arm due to these structural modifications. 

Kinetic Tuning Among the Myosin Classes 

The rates of transition between the states can vary 
greatly for different myosins, thus affecting the life¬ 
times of the various states. In this way, myosins are 
“kinetically tuned” to achieve their disparate cellular 
functions. A detailed discussion of how kinetic tuning 
dictates myosin function can be found in Bloemink and 
Geeves 2011. Some of the important properties of 
a myosin’s chemomechanical cycle that are tuned in 
this way are described below: 

Average Cycle Time (L) 

Myosins vary in how fast they pass through their 
chemomechanical cycles, which affects the rate at 



Myosin Work and Motility: Mechanism 


1677 


which an individual motor can move (or move along) 
actin. If the average time for a motor to complete its 
cycle is then the average speed that a single motor 
can move along actin is d/tc, where d is the step size of 
the motor. 

Average Strongly Bound Time (tj) 

When motors work in ensembles, the rate at which they 
can move actin is determined not by t^., but by the 
average time that the motor remains strongly bound 
to the actin during a single chemomechanical cycle, 

It is thus not surprising that the fastest known myosin, 
Chara corallina myosin XI, exhibits an extremely 
small value of 4 (Ito et al. 2007). As another example, 
ts of smooth muscle myosin II is much larger than for 
skeletal myosin forms, resulting in a slower rate of 
contraction and increased force-generation for smooth 
muscle (Guilford et al. 1997). 

Duty Ratio 

A motor’s duty ratio is dehned as the fraction of time 
that the myosin spends strongly bound to actin during 
a single cycle (tjtc). In general, a low duty ratio motor 
spends most of its cycle occupying the low actin- 
affinity states, and a high duty ratio motor spends 
most of its cycle occupying the high actin-affinity 
states. For a population of motors, the duty ratio is 
also the steady-state fraction of actin-bound 
molecules. 

For skeletal muscle myosin II, which works in large 
ensembles within a sarcomere, the slowest step in its 
chemomechanical cycle is actin-activated phosphate 
release. Thus, the motor has a low-duty ratio, which 
minimizes the drag resulting from multiple motors 
remaining bound to a filament during contraction. For 
the two-headed myosin V, on the other hand, the 
slowest step is ADP release from actomyosin. Thus, 
the motor has a high duty ratio, enabling the motor’s 
processivity since it is unlikely that both of the motor 
domains will detach from actin at the same time. 

Force-Dependent Kinetic Tuning of Myosins 

Force applied to a molecular motor has the effect of 
specihcally perturbing mechanical transitions within 
the chemomechanical cycle. Thus, force affects the 
kinetic rates of those transitions involving conforma¬ 
tional changes, potentially altering 4 , t^, and the duty 
ratio. A striking example of force-dependent kinetics is 
seen with a mammalian myosin lb. A backward force 


on this motor slows the apparent rate of ADP release. 
This is thought to arise because the applied force 
affects the conformational change between the strong 
ADP-binding (AM-D) and weak ADP-binding states 
(AM-Dw)- As a result of this force-sensitivity, the duty 
ratio of this motor changes from <0.2 in the absence of 
force to >0.9 in the presence of 2 pN of backwards 
force (Laakso et al. 2008). This allows the motor to 
move its cargo in the absence of force. In the presence 
of force, however, the motor’s kinetic cycle is more 
appropriate for maintaining tension and anchoring its 
bound cargo to actin. 

Another interesting example of force-dependent 
kinetic tuning comes from the processive motion of 
myosin V. As discussed earlier, both the dimerization 
domain and the high duty ratio of myosin V are key to 
the motor’s processivity. However, the average num¬ 
ber of consecutive steps observed for a myosin 

V moving along actin is much greater than would be 
expected given its duty ratio (Sellers and Veigel 2006). 
This discrepancy has led to a model of kinetic “gating,” 
in which the two motor domains coordinate their 
chemomechanical cycles to ensure that the trailing 
head of a walking motor passes through its ATPase 
cycle before the lead head releases from the filament. 

A likely source for this gating is the intramolecular 
strain between the two motor domains, which results in 
a (-I-) end directed (forward) force on the trailing head 
and a (—) end directed (backward) force on the lead 
head (Fig. 4a). As evidence for this, two studies 
observed that a single-headed myosin V motor con¬ 
struct exhibits different kinetic cycles when it experi¬ 
ences a forward or backward force (Purcell et al. 2005; 
Veigel et al. 2005). Both studies observed that the 
average actin-bound lifetime of a myosin V motor 
increases in the presence of a backward force. One of 
the studies (Veigel et al. 2005) also detected a slight 
decrease in the actin-bound lifetime of a myosin 

V motor in the presence of a forward force. Both 
effects are expected to increase the processivity of 
a two-headed motor. 

In Fig. 4b, a model for the myosin V kinetic path¬ 
way is presented that includes a gating mechanism. 
This model is presented and discussed in Dunn and 
Spudich 2007. The motor starts in a state with both 
heads bound to actin and ADP. The trailing head is in 
a post-powerstroke conformation. The lead head, how¬ 
ever, is prevented from undergoing its powerstroke due 
to intramolecular forces, and is thus shown in 
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Myosin Work and Motility: 
Mechanism, Fig. 4 Gated 
mechanism of the processive 
stepping of dimeric myosin V. 
(a) The lead and trailing heads 
of a myosin V dimer are shown 
in green and orange, 
respectively. The green arrow 
represents (—) end directed 
intramolecular forces exerted 
on the lead head, and the 
orange arrow represents (+) 
end directed forces exerted on 
the trailing head, (b) Kinetic 
cycle describing a single 
~36 nm step of the motor. The 
cycle starts at the top with the 
green motor as the leading 
head and ends with the orange 
motor as the leading head. 

A curved lever arm indicates 
a strained state. The cycle is 
described in the text 



a strained, curved state. As a result of this backward 
force, the chemomechanical cycle of the lead head is 
slowed, preventing it from releasing from actin before 
the trailing head. 

The rear head then releases its ADP and binds ATP, 
causing it to dissociate from actin. This removes the 
intramolecular strain, allowing the lead head to undergo 
its powerstroke, which moves the motor’s center of mass 
forward by ^20 nm. The unbound motor then hydro¬ 
lyzes ATP and undergoes a diffusional search until it 
finds its next actin-binding site, moving the center of 
mass an additional ^15 nm. It then binds to actin, 
releases phosphate, and undergoes its powerstroke, 
returning to the initial strained state after having com¬ 
pleted a single ^35-nm step. 


Alternative models have been proposed for the 
myosin V stepping mechanism. In particular, the state 
of the gated lead head of the motor has not been 
unequivocally determined. For example, a model has 
been proposed in which the lead head is kinetically 
stalled in a pre-powerstroke conformation with bound 
ADP and phosphate (De La Cruz and Ostap 2004). 

Summary 

To generate motion and force, myosins must coordi¬ 
nate chemical and mechanical events. Although differ¬ 
ent myosin classes may perform vastly different 
cellular functions, all appear to pass through the same 
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states during their chemomechanical cycles. Variabil¬ 
ity among the different classes results from structural 
and kinetic differences that allow these motors to uti¬ 
lize the same kinetic cycle to perform distinct roles. 

Cross-References 

► ATP-Driven Mechanical Work Performed by 
Molecular Motors 

► Electron Microscopy of Motor Structure and 
Possible Mechanisms 

► Myosin Family Classification 
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NADH-Ubiquinone Oxidoreductase 
(Complex I) 

Ulrich Brandt and Volker Zickermann 
Molecular Bioenergetics Group, Medical School, 
Cluster of Excellence Frankfurt “Macromolecular 
Complexes,” Center for Membrane Proteomics, 
Goethe-University, Frankfurt am Main, Germany 

Synonyms 

NADH dehydrogenase; NDH-1; Respiratory complex 1 

Definition 

Proton-pumping NADH-ubiquinone oxidoreductase 
(complex I) is a very large membrane protein complex 
that is found in many bacteria and almost all eukary¬ 
otes (Brandt 2006). In the mitochondrial respiratory 
chain it couples the transfer of two electrons from 
NADH to ubiquinone to the vectorial translocation of 


four protons across the inner membrane of the organ¬ 
elle and thereby generates up to 40% of the driving 
force for ATP synthesis by ATP synthase. Complex I is 
a source of reactive oxygen species and is involved in 
the pathogenesis of encephalomyopathies and neuro- 
degenerative diseases. 

Basic Characteristics 

Structure and Functional Modules 

The minimal form of complex I represented by the 
enzyme from prokaryotes typically comprises 14 
central subunits that are conserved throughout species. 
In eukaryotic complex I up to 31 additional accessory 
subunits have been identified (Carroll et al. 2006). The 
central subunits can be assigned to three functional 
modules (Table 1, Fig. 1) that harbor the bioenergetic 
core functions of reduced ► nicotinamide adenine 
dinucleotide (NADH) oxidation (N-module), quinone 
reduction (Q-module), and proton pumping (P-module) 
(Brandt 2006). The N-module and the Q-module 
contain all redox prosthetic groups, one ► flavin mono¬ 
nucleotide (FMN) molecule, and eight canonical iron- 
sulfur clusters. In most eukaryotes the hydrophobic 
central subunits ND1-ND6 and ND4L of the P-module 
are encoded by mitochondrial DNA. 

A common L-shaped overall structure with a hydro¬ 
philic peripheral arm and a hydrophobic membrane 
arm has been determined for prokaryotic and eukary¬ 
otic complex I by electron microscopy. X-ray crystal¬ 
lographic analysis provided low resolution structural 
models for the complete enzyme complexes of the 
eubacterium Thermus therniophilus and the aerobic 
yeast Yarrowia lipolytica and for the major part of 
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NADH-Ubiquinone Oxidoreductase (Complex I), Table 1 Complex I comprising 14 central subunits arranged in three 
functional modules 


Subunit symbol 


E. coli 

T. thermophilus 

Y. lipolytica 

Bovine 

Human 

Redox centers'* 

Module 

NuoG 

Nqo3 

NUAM 

75-kDa 

NDUFSl 

Nib, 2 X [Fe4S4]'’ 

N 

NuoF 

Nqol 

NUBM 

51-kDa 

NDUFVl 

FMN, N3 

N“ 

NuoE 

Nqo2 

NUHM 

24-kDa 

NDUFV2 

Nla 

N 

NuoD'' 

Nqo4 

NUCM 

49-kDa 

NDUFS2 


Q“ 

NuoC‘‘ 

Nqo5 

NUGM 

30-kDa 

NDUFS3 


Q 

Nuol 

Nqo9 

NUIM 

TYKY 

NDUFS8 

2 X [Fe4S4] 

Q 

NuoB 

Nqo6 

NUKM 

PSST 

NDUFS7 

N2 

Q“ 

NuoH 

Nqo8 

NDl 

NDl 

NDl 


Pp 

NuoN 

Nqol 4 

ND2 

ND2 

ND2 


Pp 

NuoA 

Nqo7 

ND3 

ND3 

ND3 


Pp 

NuoK 

Nqoll 

ND4L 

ND4L 

ND4L 


Pp 

NuoJ 

NqolO 

ND6 

ND6 

ND6 


Pp 

NuoM 

Nqol 3 

ND4 

ND4 

ND4 


Pd 

NuoL 

Nqol 2 

ND5 

ND5 

ND5 


Pd 


“The nomenclature and assignment of several iron-sulfur clusters is discussed controversially 
'’Some bacterial complexes contain an additional iron-sulfur cluster (N7) in this subunit 
“NADH oxidation site 

‘'Subunits NuoC and NuoD are fused to a single polypeptide in E. coli 
“Ubiquinone binding and reduction 


the membrane arm of complex I from Escherichia coli 
(Efremov et al. 2010; Hunte et al. 2010). The X-ray 
structure of the peripheral arm of complex I from 
T. thermophilus was solved at up to 3.1 A resolution. 

The N-module resides in the distal and the 
Q-module in the proximal part of the peripheral arm. 
The primary electron acceptor FMN is connected via 
a linear chain of iron-sulfur centers with the ubiqui¬ 
none reduction site (Tocilescu et al. 2010). The proton 
translocation function is associated with the membrane 
arm that is divided into a proximal (Pp) and a distal 
(Pd) module. 

Coupling of Redox Chemistry and Proton 
Translocation 

The function of complex I is to convert the redox 
energy of the electron transfer from NADH to ubiqui¬ 
none into a chemiosmotic membrane potential. In oxi¬ 
dative phosphorylation this proton motive force drives 
ATP synthesis. The ubiquinone reductase activity 
is inhibited by a large number of chemically diverse 
compounds like Rotenone, Piericidine, and DQA 
(2-decyl-4-quinazolinyl amine). Note that some 
bacteria use menaquinone instead of ubiquinone as 
a substrate. 


The primary acceptor for electrons donated by 
NADH is FMN. Electron transfer to ubiquinone is 
mediated by a linear chain of seven iron-sulfur clus¬ 
ters. One additional cluster is situated near the FMN 
cofactor and may serve a special function in protection 
against ► reactive oxygen species (ROS) formation at 
the flavin site. ► EPR spectroscopy resolves the signa¬ 
tures of two binuclear and four tetranuclear clusters 
while two tetranuclear clusters are EPR silent. The 
assignment of several EPR signatures to individual 
clusters in the structure of complex I is a matter of 
debate. N2, the terminal cluster of the electron transfer 
pathway, is characterized by a pH-dependent midpoint 
► redox potential (E^v = —150 mV). The EPR visible 
clusters in the electron transfer pathway upstream of 
N2 are nearly isopotential with E^y values around 
—250 mV. Cluster N2 is the immediate electron 
donor for ubiquinone and several lines of evidence 
indicate that the two electron reduction of ubiquinone 
releases the energy that drives proton translocation by 
complex I. Remarkably, recent structural data located 
cluster N2 ^30 A above the membrane plane while the 
proton-pumping sites have been allocated to the prox¬ 
imal and distal membrane arm domain. In conclusion 
the spatial separation of redox chemistry and proton 
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NADH-Ubiquinone Oxidoreductase (Complex I), 
Fig. 1 Structural model for mitochondrial complex I from Y. 
lipolytica and coupling of redox chemistry and proton translo¬ 
cation. The peripheral arm contains the N-module (yellow) and 
the Q-module (orange). A linear chain of iron-sulfur clusters 
(red spheres) connects the NADH oxidation site with the ubi¬ 
quinone reduction site. Cluster N2 is the immediate electron 
donor for ubiquinone and is situated 30 A above the membrane 
plane. The membrane arm is subdivided (black dashed line) into 
a distal and proximal P-module (Pp and Pp,, cyan). The redox 
chemistry is the driving force for proton translocation but is 
spatially separated from the putative proton translocation sites. 
A conformational coupling mechanism involving long range 
energy transfer via the long helical transmission element 
shown in magenta is discussed. The approximate contour line 
of the electron density map tentatively assigned to a complex 
I monomer is indicated by a dashed blue line 

translocation implies that long range energy transfer 
plays a central role in the coupling mechanism of 
complex I (Zickermann et al. 2009). A long helical 
structure laterally associated with the membrane arm 
connects the Pp and Po domain in prokaryotic and 
eukaryotic complex I and has been suggested to medi¬ 
ate energy transmission in a conformational coupling 
mechanism (Fig. 1) (Efremov et al. 2010; Hunte et al. 
2010 ). 

Evolution and Variants of Complex I 

Complex I is a modular assembly of three preexisting 
structural and functional entities (Friedrich and 
Scheide 2000). Subunits of the N-module are homolo¬ 
gous to the NAD"^ reducing [NiFe] hydrogenase of 
Ralstonia eutropha. In the Q-module the two subunits 
that construct the ubiquinone binding site correspond 
to subunits of soluble [NiFe] hydrogenases. In the 
P-module four polypeptides are related to subunits of 


bacterial monovalent cation/proton antiporters 
encoded by the mrp operon (Mathiessen and Hagerhall 
2003). 

Enzyme complexes related to complex I but with 
different electron input modules are found in archaea, 
in a few bacterial species, e.g., Helicobacter pylori, 
and in cyanobacteria and chloroplasts. 

Complex I in Disease 

Mutations in the mitochondrial genome are known to 
cause a number of encephalomyopathies. Leber’s 
Flereditary Optic Neuropathy (LHON) is a prominent 
example that is caused by specific point mutations 
mainly in the genes for subunits NDl, ND4, and 
ND6. The typical clinical picture of nuclear mutations 
in complex I genes or assembly factors is Leigh syn¬ 
drome, a severe, early onset neuromuscular disorder 
(Janssen et al. 2006). 

Poisoning with the neurotoxin 1-methy 1-4-phenyl 
1,2,3,4 tetrahydropyridine (MPTP) was shown to repro¬ 
duce Parkinson’s disease in humans. The compound is 
converted into 1-methy 1-4-phenylpyridinium (MPP"^) 
by monoamine oxidase. MPP"^ is selectively taken up 
into dopaminergic neurons by the dopamine transporter 
and triggers apoptotic cell death by inhibiting complex 1. 

Cross-References 

► Electron Transfer Through Proteins 

► Electron Transport Chains in Bacteria 
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► Single-Molecule High-Resolution Imaging with 
Photobleaching (SHRImP) 


Nanodroplet Confinement 

Lori S. Goldner 

Department of Physics, University of Massachusetts, 
Amherst, MA, USA 

Synonyms 

Single-molecule methods 


Definition 

A method for isolating, conhning, and manipulating 
individual molecules for study. 


Basic Characteristics 

Aqueous droplets in oil, with diameters from 100 nm to 
100 pm and volumes from attoliters to nanoliters, are 
used as nanoreactors for small quantities of reagent, 
down to the level of single molecules. Above 
a femtoliter, droplet reactors can be easily formed 
and manipulated in microfluidic devices and have 
found wide use in industry. Below a femtoliter, drop¬ 
lets provide a convenient environment for single¬ 
molecule sensitive chemistry and biophysics. 

Single-molecule sensitive measurements of struc¬ 
tural transformations, reaction dynamics and binding 
all require that a molecule under study be measured for 
some amount of time. This requires a method for locat¬ 
ing or distinguishing individual molecules, and for 
conhning them in a detection region. Depending on 
the measurement scheme, various methods have been 
devised for distinguishing or localizing biomolecules, 
including surface binding, surface adsorption, or con- 
hnement in a porous material. Additionally, molecular 
conhnement in surface-tethered liposomes (Boukobza 
et al. 2001) offers a less perturbative and more repro¬ 
ducible environment for the conhned molecule than 
either direct molecular tethering or adsorption. 

Conhnement of single molecules to aqueous 
nanodroplets in oil is a simple and sometimes conve¬ 
nient alternative to other methods for conhning or 
localizing molecules (Goldner et al. 2010; Reiner 
et al. 2006; Tang et al. 2008). Droplets are easily 
identihed and tracked in an optical microscope, and 
can be manipulated optically or physically to remain 
in a detection volume. Droplets offer an advantage 
that they can be made to coalesce on contact, without 
loss of hydrophilic contents, thereby providing 
a convenient means for fast mixing. In the absence of 
surfactant, droplet coalescence is intrinsically fast, and 
the timing of mixing will be limited by diffusion in the 
droplets. Diffusional mixing can occur in less than 
1 ms for subfemtoliter droplets. 

More generally, nanodroplets offer convenient 
compartmentalization and localization useful in many 
chemical applications. Unlike nature’s own cellular 
compartments, droplets can be engineered to be stable 
under conditions not commensurate with living sys¬ 
tems. The use of emulsions makes massively parallel 
high-throughput measurement of bioreactions possi¬ 
ble; a single microliter of sample can be aliquoted 
into 10^-10® separate bioreactors. Conhnement to 
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a small reaction volume leads to higher reaction rates 
than in the bulk, both because reactant concentrations 
can be made arbitrarily high as droplets become arbi¬ 
trarily small, and because diffusion-limited reaction 
rates increase as volume decreases. 

The utility of single aqueous droplets in oil for the 
study of single molecule reaction kinetics has long 
been recognized. Almost 50 years ago Rotman (1961) 
demonstrated that the activity of single (3-D galactosi- 
dase molecules confined to aqueous droplets in sili¬ 
cone oil could be measured by monitoring the increase 
in fluorescence of a fluorogenic substrate over time. 
While Rotman did not detect individual turnovers, his 
technique represents perhaps the first time that single 
enzyme kinetics could be measured using fluores¬ 
cence. More recently, chymotrypsin kinetics have 
been studied using a similar technique (Lee and 
Brody 2005). In both these works, droplets were used 
to confine individual molecules and indirectly study 
their kinetics using fluorogenic substrates. Most 
recently, fluorescence and fluorescence resonance 
energy transfer has been detected directly from indi¬ 
vidual biomolecules confined to single nanodroplets 
(Reiner et al. 2006; Tang et al. 2008). 

Many applications have been found in recent years 
for droplet compartmentalization of biological 
reagents, typically in picoliter to nanoliter droplets. 
One example includes use of single DNA templates 
in applications of directed evolution, where droplet 
confinement is a convenient method of linking geno¬ 
type and phenotype. Reviews can be found in Refs. 
(Griffiths and Tawfik 2006; Kelly et al. 2007; Leamon 
et al. 2006; Song et al. 2006; Taly et al. 2007). Droplet 
confinement has also proven useful in the amplification 
of single DNA templates; a recent review of 
microfluidic DNA amplification can be found in 
Zhang and Ozdemir (2009). 

In recent years, the development of droplet fluidic 
devices is facilitating many more applications of 
droplet-confined chemistry. Microfluidics have the 
advantage of creating very monodisperse droplets, 
with rates of droplet formation up to 3,000/s now 
typical. Furthermore, by confining reactants to 
a droplet, dispersion that occurs by virtue of 
a parabolic flow field in continuous flow devices is 
eliminated. Many of the applications of droplet fluidic 
devices are discussed in Refs. (Griffiths and Tawfik 
2006; Kelly et al. 2007; Leamon et al. 2006; Song 
et al. 2006; Taly et al. 2007). 


Cross-References 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Single-Molecule Methods 

► Surface-Tethered Lipid Vesicles 
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Nanoparticles for Drug and 
Gene Delivery 
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London, London, UK 

Synonyms 

Nanomaterials; Nanomedicine 

Definition 

Particles with one or more dimensions of the order of 
100 nm or less designed for the targeted delivery of 
drug and genetic therapies. 

Nanoparticles for Therapeutic Applications 

Today, there is a strong focus of nanotechnology appli¬ 
cation on certain challenging diseases. Cancer nano¬ 
technology is a new field of interdisciplinary research 
cutting across biology, chemistry, engineering, and 
medicine aiming to lead major advances in cancer 
detection diagnosis and treatment (Ferrari 2005; Nie 
et al. 2007). The opportunity lies in the fact that for the 
first time, we are able to tackle cancer management 
needs and individualize therapies by developing per¬ 
sonalized treatments. Ideally, clinical scientists 
should detect disease markers and at the same time 
formulate the nanoparticles targeting such markers to 
deliver specifically the drug or gene (Wang and 
Thanou 2010). For the first time, scientists can design 


and prepare multifunctional nanoparticles able to (a) 
detect disease, (b) image disease, (c) treat, and (d) 
monitor treatment progression. This can be done by 
assembling functional molecules in one nanosized par¬ 
ticle. The application and efficiency of these 
nanoparticles in vivo will help enormously in the treat¬ 
ment of diseases. 

Nanoparticles have been widely studied for drug 
and gene delivery in tumors. Most nanoparticles are 
expected to accumulate in tumors due to the enhanced 
permeation and retention effect (ERP) identified by 
Maeda as a means to target anticancer agents 
(Matsumura et al. 1986). Tumors are characterized by 
highly permeable blood vessels due to rapid and 
defected angiogenesis. In addition, tumors are charac¬ 
terized by dysfunctional lymphatic drainage that helps 
the retention of nanoparticles long enough to allow 
local nanoparticle disintegration and release of the 
drug in the vicinity of tumor cells. The phenomenon 
has been used widely to explain the efficiency of nano¬ 
particle and macromolecular drug accumulation in 
tumors. 

The first nanoparticles used to deliver cancer che¬ 
motherapy were the liposomes. Liposomes were dis¬ 
covered 40 years ago by Bangham (Cohen and 
Bangham 1972). Liposomes are vesicles with an aque¬ 
ous interior surrounded by one or more concentric 
bilayers of phospholipids with a diameter ranging 
from 30 nm to several microns. Liposomes may vary 
in size, lipid composition, method of preparation, and 
particularly surface chemistry. Liposomes have been 
evolved through the years to a versatile carrier adapted 
each time to have a different functionality and serve 
a certain drug delivery purpose. 

Liposomes, sized at the nanoscale, consist of a lipid 
bilayer surrounding a water core hosting the drug. The 
first studies to report the efficiency of liposomes as 
nanoparticles focused on the improvement of pharma¬ 
cokinetics and biodistribution of the anthracycline 
drug doxorubicin. Doxil, the PEGylated liposomal 
doxorubicin, shows high efficiency due to improved 
pharmacokinetics as it has been shown to escape the 
reticuloendothelial system (RES), an important barrier 
in nanoparticle systemic circulation (Gabizon and 
Martin 1997). Polyethylene glycol is used with 
a number of nanoparticles as it improves colloidal 
stability and prevents uptake by the RES. PEG is 
usually added on the surface of nanoparticles to create 
the so-called steric stabilization effect where the PEG 
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molecules form a protective hydrophilic layer on the 
surface of nanoparticles that prevents interaction with 
each other (aggregation) and with blood components. 
As a result, grafting of PEG on the surface of 
nanoparticles reduces uptake by the macrophages 
of the mononuclear phagocyte system (MPS) and pro¬ 
longs the blood circulation times (Papahadjopoulos 
et al. 1991). 

Nanoparticles have been suggested for the treat¬ 
ment of diseases of the central nervous system (CNS) 
such as stroke Alzheimer’s and Parkinson’s disease. 
CNS diseases represent the largest and fastest growing 
area of unmet medical need. Several major types of 
nanoparticles have been widely used for drug delivery 
to the brain, such as liposomes, dendrimers, and poly¬ 
meric micelles (Yang 2010). 

Nanoparticles with functions such as imaging and 
drug delivery have been suggested for atherosclerosis 
and cardiovascular diseases (Lobatto et al. 201 1). 

The design of nanoparticles for drug and gene deliv¬ 
ery is not trivial as several factors have to be taken into 
consideration. Primarily, the chemistry of the core 
component that will carry the drug or gene and the 
biocompatibility layers need to be designed consider¬ 
ing the structural integrity and stability in biological 
fluids. Further similar to the process of product drug 
development, nanoparticle development needs to con¬ 
sider physicochemical issues related to the properties 
of the nanoparticles, biopharmaceutical issues related 
to the properties of the “bio-barriers,” and pharmaco¬ 
logical issues related to the site, time, and duration 
of nanoparticle’s action. These nanoparticles have to 
be considered different to small and large molecular 
drugs introducing novel parameters for their design. 
FDA states that ADME (Administration, Distribution, 
Metabolism, and Excretion) studies need to be 
redesigned in the case of nanoparticles to take under 
consideration their aggregation and surface character¬ 
istics (Zolnik and Sadrieh 2009). 


Nanoparticle Surface Constituents; 
the Stealth Layer 

PEG layer (corona) is added onto any nanoparticle that 
is aimed to be administered intravenously. Either this 
is liposome, dendrimer, or inorganic nanoparticles, 
such a layer provides long circulation as it inhibits 
the accumulation of opsonins and dysopsonins and 


their uptake by the macrophages. There are currently 
no rules about the organization and the type of PEG on 
the surface of nanoparticles. However, there are some 
trends indicating that the PEG molecular weight and 
density on the nanoparticles are key features that 
will control PEG conformation on the surface of the 
nanoparticle and the circulation half-life (ti/ 2 ). Such 
PEG layer can interfere with targeting ligands and 
inhibit them from interaction with the corresponding 
receptors. 

Most nanoparticles require colloidal stability 
in vitro (in aqueous buffers, i.e., for storage) and 
in vivo (biological stability). Their protective layer is 
formed by the incorporation of a hydrophilic polymer 
layer. In this case, PEG is the mostly used material. 
PEG molecules are commonly adopted for the stealth 
function, due to their enhanced hydrophilicity and 
flexibility (Bergstrom et al. 1994). The amphiphilic 
nature of the PEG-lipids renders them into micellar 
structures, and their insertion into liposomal surfaces 
is perceived as a method of relieving micellar strain. 

In liposomes, the incorporated PEG-lipid conju¬ 
gates need to be able to render steric stability of the 
nanoparticles and be able to prevent surface adsorption 
of blood proteins. When presented on a liposomal sur¬ 
face, individual PEG chains exhibit a Flory dimension, 
Rf, which represents the volume that each flexible PEG 
cloud occupies (Fig. 1). Longer polymer chains have 
larger Rf, although studies have shown that the PEG 
density on a liposome surface is more important than 
the size of the polymer (Dos Santos et al. 2007). The Rf 
of a PEG 2000 chain is approximately 5.6 nm and, in 
conjunction with grafting density, affects the resultant 
conformation of the PEG chains. Increasing the 
amount of PEG-lipids within a liposomal formulation 
increases the PEG grafting density, which ultimately 
reduces the distance, D, between each PEG molecule 
on the nanoparticle surface. When D > Rf, the PEG 
chains will self-assemble into a random-coil-like 
mushroom cloud. When D < Rf, the lateral pressure 
between the overcrowded PEG mushroom clouds will 
force the extension of the PEG chains into a brush 
conformation. The brush regime results from the 
increased lateral pressure between the PEG chains, 
which forces the extension of the polymers away 
from the surface, into more linear conformations. 

Inorganic nanoparticles require different methods 
of coating or introducing the biocompatibility and 
colloidal stability layers. Generally, this is performed 
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Nanoparticles for Drug and Gene Delivery, 
Fig. 1 Representation of different PEG conformations, formed 
through their incorporation onto surfaces at different densities. 
When the distance between the PEG-lipids, D, is greater than the 


Flory diameter of the PEG (D > Rf), the polymer will collapse 
into a mushroom configuration. When D < Rf, the PEG chains 
will be extended from the surface to form a brush-like 
conformation 


by the co-precipitation of the particles with various 
types of polymers or cross-linked polymers, 
a technique that can improve the particle 
monodispersity, an important parameter regarding 
their biological applications. Dextran, albumin, and 
PEG (Mw 5000) are all types of molecules that have 
been studies as coating materials for providing bio¬ 
compatibility to iron oxide nanoparticles whereas 
gold nanoparticles provide a more suitable surface for 
PEG grafting. Thiol-PEG or bifunctional PEG can be 
added on the surface to provide colloidally stable and 
biocompatible gold nanoparticles (Shenoy et al. 2006). 

Nanoparticle Surface Constituents; 
Targeting Ligands 

The addition of a targeting moiety onto the surface 
of nanoparticles (Eig. 2) aims to increase selective 
cellular binding and internalization through receptor- 
mediated endocytosis. Without the incorporation of 
targeting ligands, nanoparticles rely on nonspecihc 
interactions with cell membranes, which can be low 
when covered in a layer of PEG polymers. The targeting 
effect of the surface ligands is dependent not only on 
the nature of the ligand itself, but on a variety of factors 
that require cooperative optimization. 

One challenge of targeting diseased tissue 
(i.e., tumors) is that the defective cells are often very 
similar in characteristics to its surrounding healthy tis¬ 
sue. To differentiate such cells, the ligands can be 
designed to have specihcity for receptors that are 


overexpressed on cancerous cells, but are normally or 
minimally expressed on normal, healthy cells. These 
molecules should have high affinity to their cognate 
receptors, plus have innate abilities to induce receptor- 
mediated endocytosis. The targeting layer poses as the 
outmost exterior of the nanoparticle delivery system, 
where targeting ligands are generally presented on top 
of the stealth layer (Wang et al. 2008). Structures such as 
antibodies, antibody fragments, proteins, small mole¬ 
cules, aptamers, and peptides have all demonstrated 
abilities to induce nanoparticle targeting to cancer cells. 

Antibodies against the HER-2 receptor, the trans¬ 
ferrin receptor (TfR) and the prostate-specific antigen 
receptor, are both common examples of receptor 
targets, due to their overexpression of such receptors 
on cancer cells (Wagner et al. 1994). These antibodies 
generally exhibit strong interactions with 
corresponding receptors, with dissociation constants 
in the nanomolar range. Antibody fragments, 
consisting of only the Eab binding regions, have also 
been studied as targeting ligands (Tang et al. 2005). 
The advantages of such structures are that they are 
smaller and do not contain the Ec region of the anti¬ 
body which can induce immunogenicity and antigenic¬ 
ity when present on liposome surfaces. Like antibodies 
and antibody fragments, the use of whole proteins is 
also commonly considered as targeting ligands for 
their increased affinity for target receptors. Eor exam¬ 
ple, the natural ligand for TfR, transferrin, binds to its 
receptor with a dissociation constant of around 
= 40 nM. On overcoming systemic barriers to 
arrive in tumor vicinities, targeted liposomes have 
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Nanoparticles for Drug and Gene Delivery, 
Fig. 2 Representation of targeted and nontargeted liposo¬ 
mal nanoparticle systems. The targeted nanoparticles are 


functionalized with an outer layer of receptor-specific ligands, 
which can be of an antibody (full or fragment), small molecule, 
peptide, or aptamer 


greater opportunities for binding to cancer cells if the 
ligand has naturally high affinities for the 
corresponding receptor. 

Peptide sequences pose as even shorter and smaller 
versions of antibody and protein fragments. 
A prominent example of peptide ligands is the RGD 
peptide, identified through phage display to have high 
affinity to a 5 P 3 integrin receptors overexpressed on 
angiogenic vasculatures (Ruoslahti 1996). Other 
types of shorter ligands include small molecules folic 
acid specific for the folate receptors on ovarian cells. 

Ligands chosen for receptor targeting should have 
the function of inducing receptor-mediated endocyto- 
sis. Depending on the type of ligand-receptor interac¬ 
tion, the rate of cellular internalization would differ. 
This is an important factor as rates of internalization 
could affect the accumulation of nanoparticle in 
targeted sites. Some ligand species, such as folate, 
have been shown to have fast internalization as well 
as cell surface recycling rates in cancer cells (Paulos 
et al. 2004). The ligands also need to he conjugated 
onto nanoparticles in an optimal fashion, as to maintain 
their affinities for their corresponding receptors. For 
example, the measured dissociation constants for opti¬ 
mally conjugated HER-2 antibodies were similar to 
free antibodies, around 12-15 nM, depending on the 
surface PEG density. The targeting layer is presented 
as the ultimate exterior of the nanoparticle, the surface 
that interacts primarily with cell membranes. 
Irrespective of their nature, the ligand must have the 
right conformation, high affinities for corresponding 
receptors, and be able to exhibit high rates of cellular 
internalization. Furthermore, the loading and presen¬ 
tation of the ligand on nanoparticle surfaces must be 
cooperative with the stealth layer. 


It is important to incorporate PEG chains at densi¬ 
ties that allow optimal coverage, which means induc¬ 
ing brush conformations on nanoparticle surfaces, as 
discussed earlier. As the brush conformation allows the 
extension of the PEG chains away from the nanoparti¬ 
cle surface, presenting targeting ligands on such 
a platform should allow increased interactions with 
corresponding receptors. Although the presentation of 
ligands on a PEG-brush surface enhances their binding 
to receptors, their improved exposure can decrease the 
nanoparticle lifetimes in circulation. Easter clearance 
from circulation generally corresponds to a higher and 
faster accumulation into the liver and the spleen (RES). 
In this example, the total amount of PEG 2000 content 
needs to be included at optimal brush densities, at 
6.5 mol%. Presentation of the ligand in a PEG-brush 
fashion reveals the targeting molecules increasingly to 
plasma proteins; hence, their faster clearance from 
circulation could be a result of Ec-mediated RES 
uptake into liver and spleen-associated macrophages 
(Elbayoumi and Torchillin 2006). 

Concentration of surface ligands is another param¬ 
eter that affects the ligands’ targeting effect (Gantert 
et al. 2009). Higher ligand densities are envisioned as 
a method of increasing the probability of nanoparticle 
interactions with cell receptors (multivalency). However, 
the presence of increased non-PEG-like material on 
nanoparticle surfaces can be more detrimental than 
advantageous to delivery. A study using aptamers as 
a targeting ligand for prostate cancer-specific antigens 
(PSMA) demonstrated that higher densities of surface 
ligands resulted in greater accumulation of nanoparticles 
into the liver and the spleen (Gu et al. 2008). In that study 
Gu et al. used poly (D,L-lactide-coglycolyde) [PLGA] 
and PEG triblock copolymer-based nanoparticles. 
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The nanoparticles were composed by PLGA-b-PEG and 
PLGA-b-PEG-b-Apt (aptamer for PSMA) (Gu et al. 
2008). Localization of such nanoparticles in tumors was 
also at lower concentrations compared to nanoparticles 
functionalized with lower densities of aptamers. This is 
suggesting that higher coverage of the PEGylated 
nanoparticles’ surfaces with targeting ligands further 
shields the effect of the PEG layer, hence resulting in 
greater recognition by tumor and spleen-associated 
macrophages. 

Examples of targeted nanoparticle drug delivery 
systems have shown faster and higher concentrations 
accumulation in tumors, compared to nontargeted sys¬ 
tems. Generally, it is believed that the introduction of 
targeting ligands does not enhance nanoparticle accu¬ 
mulation into tumors, but shows higher efficacy by 
enhancing internalization into tumor cells. 

Summary 

Nanotechnology applications in medicine, termed 
nanomedicine, have introduced a number of 
nanoparticles of variable chemistry and architecture. 
Nanotechnology involves engineering multifunctional 
devices with dimensions at the nanoscale, similar 
dimensions as those of large biological vesicles or 
molecules in our body. These devices typically have 
features just tens to hundred nanometers across and 
they can carry one or two detection signals and/or 
therapeutic cargo(s). Being inspired by physiologically 
existing nanomachines, nanoparticles are designed to 
safely reach the target tissue and specifically release 
their therapeutic cargo at the site of the disease, 
thereby increasing the bioavailability of the drug at 
the site of action. Nanoparticles have the advantage 
of targeting certain diseased tissues (i.e., inflammation 
or tumors) by simply being accumulated preferentially 
in these tissues (passive targeting). The phenomenon is 
called the enhanced permeation and retention (EPR) 
effect, and has been proposed to explain accumulation 
of synthetic colloids in tumors. It is caused by leaky 
angiogenic vessels and poor lymphatic drainage and 
has been used to explain why macromolecules and 
nanoparticles are found at higher ratios in tumors com¬ 
pared to normal tissues. Nanoparticles have been 
suggested to treat atherosclerosis and diseases of the 
CNS as they can permeate membranes such as the 
blood-brain barrier. Several materials are suggested 


for the assembly of therapeutic nanoparticles. Poly¬ 
mers, linear and dendrimers, are associated with the 
drug in a covalent or noncovalent way and have been 
used with or without a targeting ligand (e.g., to direct 
the nanoparticle to a disease-related cell surface 
expressed receptor). Stealth liposomes are suggested 
to carry the drug in their aqueous core, and they are 
usually decorated by recognition molecules, being 
widely studied in the bench and applied in clinic as 
well. Inorganic nanoparticles such as gold and iron 
oxide are usually coupled chemically to the drug, 
PEG, and the targeting ligand. It appears that the 
PEG coating and ligand decoration are common con¬ 
stituents in most types of nanoparticles. There are 
several examples of successful drug delivery 
nanoparticles and many of them have rapidly moved 
to clinical trials. Nevertheless, there is still room for 
optimization in the area of the nanoparticle kinetics 
such as improving their plasma circulation and bio¬ 
availability and understanding the effect of targeting 
ligands on their efficiency to attach on diseased tissues. 
The need to develop novel and efficient ligands has 
never been greater, and the use of proper conjugation 
chemistry is mandatory. 

Cross-References 

► Carbon Nanotubes As Biomaterials 

► Chemical Diversity of Lipids 

► Dendrimers for Drug Delivery 

► Glycoproteins 

► Molecular Recognition: Lock-and-Key, Induced Fit, 
and Conformational Selection 
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Synonyms 

Bionanowire sensors; Nanofiber biosensors 


Definition 

Nanowire biosensors: A class of sensors or measuring 
devices, of which the major sensing components are 
made of wires or fibers in nanometer scale formed 
by biological molecules, such as DNA molecules, 
polypeptides, fibrin proteins, and filamentous bacterio¬ 
phages, etc. 


Basic Characteristics 


Nanowire biosensors are a class of nanosensors, of 
which the major sensing components are made of 
nanowires formed by biological molecules, referred 
to as bionanowires, such as DNA molecules, polypep¬ 
tides, fibrin proteins, and filamentous bacteriophages, 
etc. A bionanowire is a one-dimensional fibril-like 
nanostructure, with the diameter constrained to tens 
of nanometers or less (1 ^ 100 x 10 ® m) and 
unconstrained length. Bionanowires offer a few signif¬ 
icant advantages. First, biological molecules them¬ 
selves are important parts of biosensors, they are 
capable of recognizing the targets or generating sig¬ 
nals; second, the bionanowires can be easily modified 
by gene manipulation, thus providing versatile 
functions; third, by rational design, the self-assembly 
characters of bionanowires can display functional 
ligands on the nanowire’s surface in designed orders; 
fourth, they possess more reaction area per volume to 
improve detection sensitivity and response time due to 
their high surface to volume ratio. 

For these reasons, bionanowires are excellent 
candidates for sensing applications. Usually, 
bionanowires play a role of a mediator to integrate 
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Nanowire Biosensors, Fig. 1 Modular designed protein 
nanowire biosensor, (a) The construction of self-assembly unit 
in protein nanowires: gene fusion protocol, (b) The structure of 
fusion protein. There are three modules in fusion protein: self- 
assembly domain drives the fusion protein assembling into 
nanowire; the functional ligands generate hiosensing signals; 
the linker peptide links the self-assembly domain and functional 
ligand and provide space to ensure both fusion partners remain in 
native activities, (c) The example of self-assembly bifunctional 
protein nanowire in biosensing. Bifunctional protein nanowires 


were generated by seeding-induced self-assembling of yeast 
amyloid protein Sup35p that genetically fused with protein 
G and an enzyme (methyl-parathion hydrolase, MPH), respec¬ 
tively. The protein nanowires possessed a high ratio of enzyme 
molecules to protein G, allowing a dramatic increase of the 
enzymatic signal when protein G was bound to an antibody 
target. As a result, a 100-fold enhancement of the sensitivity 
was obtained when applied in the detection of the Yersinia pestis 
FI antigen (Reprinted with permission from Men et al. 2009. 
© 2009 American Chemical Society) 


recognition (or binding) element and transduction (or 
signal) element in nanowire biosensors. The special 
dimensions of bionanowires could be explored to ori¬ 
ent monofunctional or multifunctional molecules in 
high density along the longitudinal or vertical direction 
of the nanowires. This will help improve the detection 
sensitivity of a biosensor or develop multipurpose bio¬ 
sensors. In the process of sensing, functionalized 
bionanowires recognize or capture the targets, and 
then convert the recognition into chemical or physical 
signals, which are measurable. 

DNA (► Infrared Spectroscopy of Protein Dynam¬ 
ics: Ultrafast Kinetics) is an ideal nanowire template, 
its diameter is at nanoscale (2 nm), it can be targeted 
uniquely at both ends, and has controllable length 
through its production by DNA synthesizers or polymer¬ 
ases. DNA nanowire biosensors are normally based on 
a reaction that produces electronic or optical signals by 


ordered DNA-protein conjugates on the DNA nanowire. 
Organizing functional molecules sequentially on the 
DNA nanowrres could effectively enhance multien¬ 
zymes catalysis systems and signal transduction (Carsten 
and Itamar 2010). One example is a sensor based on 
a nucleic acid-tethered ferrocene relay unit and GOx 
modified with a nucleic acid tether hybridized with 
a DNA scaffold that is covalently linked to an electrode 
(Piperberg et al. 2009). The programmed functionalized 
DNA nanowire has found efficient electrical contact 
between the biocatalysis center and the electrode, thus 
allowing bioelectrocatalytic activation of the enzyme. 

Some peptides, such as the diphenylalanine peptide 
of Alzheimer’s P-amyloid and the asparagine and 
glutamine-rich polypeptide of yeast prion, have the intrin¬ 
sic ability to self-assemble into elongated solid nanofibrils 
or nanotube (an elongated nanowire with a definite inner 
hole). Because of their well-defined nanostructures 
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formed by self-assembly, peptide nanowires offer an 
attractive alternative for biosensor fabrication. In one 
example, a nonmediated amperometric biosensor was 
constructed based on immobilization of modified peptide 
nanotubes on electrode surfaces and the sensitivity for 
hydrogen peroxide and nicotinamide adenine dinucleo¬ 
tide (NADH) detection was improved (Yemini et al. 
2005a, b). In another example, a label-free pathogen 
ensors chips (► Peptide Micro-array) is developed by 
self-assembling peptide nanowires. The antibodies- 
coated peptide nanowire is as a bridge between a pair of 
electrodes. The binding of the virus to the peptide nano¬ 
tube was detected by measuring capacitance change 
between the electrodes (de la Rica et al. 2008). 

Protein nanowires are fibril-like nanostructure formed 
by protein self-assembly, such as amyloid protein 
(► Amyloid Protein Biomaterials), collagen, silk 
protein, filamentous bacteriophage, and bacterial flagella 
(► Bacterial Hagellar Motor: Overview), etc. The surface 
of protein nanowires can be functionalized through gene 
manipulation. In a protein nanowire-based biosensor, 
both biological signal molecules and binding molecules 
are fused genetically with a self-assembly domain, 
respectively (► Protein Design for Biosensors). They 
are then integrated into protein nanowire through self- 
assembly. The resultant bifunctional protein nanowires 
possess a high ratio of signal molecules to biological 
sensing molecules, allowing a dramatic increase of the 
response signal when sensing to a target. Figure 1 shows 
a general protocol for construction of protein nanowire 
biosensors, which has been successfully applied to build 
ultrasensitive biosensors for immuneassay and detection 
of methyl parathion, etc. (Men et al. 2009, 2010; Leng 
et al. 2010). 

Another kind of nanowire biosensors is biomolecule- 
inorganic hybrid nanowire biosensors. Hybrid nanowire 
biosensors are mainly made of biological sensing mol¬ 
ecules with modified inorganic nanowires, such as sili¬ 
con nanowires (► Carbon Nanotubes as Biomaterials), 
and semiconductor nanowires that have large surface 
areas, and show excellent field effect or electrical prop¬ 
erties (Patolsky et al. 2006; Chris Le et al. 2007). These 
biosensors significantly lower the detection limit. 

Cross-References 

► Amyloid Protein Biomaterials 

► Bacterial Flagellar Motor: Overview 


► Carbon Nanotubes as Biomaterials 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Protein and Peptide Arrays 

► Protein Design for Biosensors 
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Near UV Protein CD 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

Protein aromatic circular dichroism; Protein side chain 
circular dichroism 

Definition 

Protein circular dichroism (CD) spectroscopy is 
usually separated into (1) far UV or backbone CD 
with data collected from ^190 to 250 nm and 
(2) near UV or aromatic CD with data collected from 
250 to 300 nm. The aromatic chromophores of protein 
side chains (tryptophans, tyrosines, and phenyl 
alanines) are planar and so have no intrinsic CD. 
However, their location as part of an amino acid results 
in an induced CD signal in their transitions. This is 
further enhanced when they are located in the chiral 
environment of a peptide or protein. The signs and 
intensities of the CD signals induced into the achiral 
aromatic side chain transitions are dependent on their 
environment. It is a useful fingerprint of protein iden¬ 
tity and conformation, but seldom leads to direct 


structural analysis. Disulfide bonds are often counted 
as honorary aromatic groups since their CD may be 
apparent between 250 and 270 nm. Wavelengths of 
relevant residues are given in Table 1. 

When peptides with aromatic residues assemble (such 
as in a membrane), bands in the 230-240 nm region of 
the CD spectrum appear. They arise as a result of n- n* 
exciton coupling of aromatic transitions, where one of 
the components occur at about 235 nm and the shorter 
wavelength component is masked by the backbone tran¬ 
sitions (Woody 2007). 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Circular Dichroism Spectroscopy: Units 

► DNA-Ligand Circular Dichroism 

► Far UV Protein Circular Dichroism 

► Near UV Protein CD 

► Oriented Circular Dichroism Spectroscopy 

► Protein Circular Dichroism Analysis 
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Near UV Protein CD, 

Table 1 Spectroscopic 
parameters for Phe, Tyr, and 
Trp side chains. The 
assignment is using the 
nomenelature of Platt. The 
oscillator strengths for 
degenerate or nearly 
degenerate pairs (the B bands 
of Phe and Tyr) are given as 
half the observed value for the 
entire band (Woody 2007) 



Assignment^ 


Smax/(mor‘ cm“' dm^) 

Oscillator 

strength 

Apparent 
vibronie structure 

Phe 

Lb 

260 

200 

0.001 

-500 cm"‘ 


La 

210 

10,000 

0.09 

- 


Bb 

180 

30,000 

0.45 

- 


Ba 

180 

30,000 

0.45 

- 

Tyr 

Lb 

275 

1,400 

0.02 

—800 cm^' 


La 

230 

8,800 

0.13 

- 


Ba 

190 

24,000 

0.55 

- 


Bb 

190 

24,000 

0.55 

- 

Trp 

Lb 

280 

3,200 

0.03 

—850 cm”' 


La 

270 

4,500 

0.13 

- 


Bb 

220 

35,000 

0.6 

- 


Ba 

195 

20,000 

0.5 

- 
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Near-Edge X-Ray Absorption Fine 
Structure (NEXAFS) 

► X-Ray Absorption Spectroscopy of Metals in Biology 


Nearest-Neighbor Spin-Spin Correlations 

► Oriented-Sample NMR of Membrane Proteins; Sen¬ 
sitivity Enhancement and Spectroscopic Assignment 


Near-Infrared Spectroscopy in Biological 
Molecules and Tissues 

Yukihiro Ozaki', Akifumi Ikehata^ and Hideyuki 
Shinzawa^ 

'Department of Chemistry, School of Science and 
Technology, Kwansei Gakuin University Sanda, 
Sanda, Hyogo, Japan 

^National Food Research Institute, National 
Agriculture and Food Research Organization, 
Tsukuba, Japan 

^Advanced Institute of Science and Technology, 
Nagoya, Aichi, Japan 

Synonyms 

Chemometrics in NIR spectroscopy; NIR imaging; 
NIR spectroscopy 

Definition 

This entry describes principle, instrumentation, analysis 
method (chemometrics), and biological applications of 
near-infrared (NIR) spectroscopy. Applications discussed 
include the studies of protein structure, denaturation and 
hydration, quantitative in vivo analysis of blood glucose, 
mapping brain function, and imaging. 

Introduction 

Near-infrared (NIR) spectroscopy is spectroscopy in 
the region of 800-2,500 nm (12,500^,000 cm”'), 
being mainly concerned with absorption spectroscopy 


and diffuse reflectance spectroscopy (Siesler et al. 
2002; Ozaki et al. 2007; Burns and Ciurczak 2007; 
Workman and Weyer 2007). NIR spectroscopy is rel¬ 
atively new in biophysics, but its application to bio¬ 
logical materials stretched back more than 40 years. In 
the 1960s, Norris et al. (Ben-Gera and Norris 1968) 
tried to employ NIR spectroscopy to monitor water 
content in grain. They paid much attention on the 
potential of NIR spectroscopy as a nondestructive ana¬ 
lytical method. It was the idea of using a statistical 
method for extracting useful information from compli¬ 
cated NIR spectra that woke the “sleeping Giant” up. 
The first trial of biomedical application of NIR spec¬ 
troscopy was in vivo monitoring of the redox behavior 
of cytochrome c oxidase by Jobsis in 1977 (Jobsis 
1977). After these two pioneering studies, NIR 
spectroscopy has been spreading to various helds, 
including biological and biomedical sciences, food 
and agricultural engineering, chemical engineering 
and online monitoring, polymer science and engineer¬ 
ing, and environmental sciences (Siesler et al. 2002; 
Ozaki et al. 2007; Burns and Ciurczak 2007; Workman 
and Weyer 2007). 

NIR spectroscopy contains both electronic spec¬ 
troscopy and vibrational spectroscopy (Siesler et al. 
2002; Ozaki et al. 2007; Burns and Ciurczak 2007; 
Workman and Weyer 2007). As vibrational spectros¬ 
copy, it deals with overtones and combination modes. 
Since both the overtones and combination modes are 
forbidden transitions according to harmonic oscillator 
approximation, the intensities bands due to them are 
very weak. The intensities of bands due to electronic 
transitions in the NIR region are also weak. Thus, the 
NIR region is, in general, very superior in permeability 
compared to the visible (VIS) and mid-IR region, 
enabling nondestructive and noninvasive analyses. 

The NIR region may be divided into three regions, 
the 800-1,100 nm (12,500-9,900 cm”') region, the 
1,100-2,000 nm (9,900-5,000 cm”') region, and the 
2,000-2,500 nm (5,000^,000 cm”') region. The hrst 
region, named short wavelength NIR (SWNIR) region 
or near NIR region, is concerned mainly with elec¬ 
tronic transitions, higher-order overtones, and combi¬ 
nation transitions, and thus it shows particularly high 
permeability. This region provides so-called “a win¬ 
dow for biological bodies” and is used practically for 
medical applications and agricultural applications. The 
second region contains many bands arising from the 
first and second overtones and combination modes. 
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This region is very useful for general quantitative and 
qualitative analyses. The third region mostly deals 
with combination modes. The permeability of this 
region is relatively low. 

NIR spectroscopy has the following advantages for 
studies in biophysics: 

1. It is a nondestructive and noninvasive method, and 
thus very suitable for in vivo studies of biological 
materials and tissues. Neuroimaging is a good 
example of noninvasive analysis of NIR 
spectroscopy. 

2. It allows for noncontact analysis and analysis using 
an optical fiber. 

3. It is much easier to study and analyze an aqueous 
solution, than with IR spectroscopy. 

4. It can be applied to samples in various physical 
states, shapes, and thickness. 

An NIR spectrum consists of a number of bands 
arising from overtones and combination modes that 
overlap heavily with each other, and thus so-called 
multicollinearity is very strong in the NIR region 
(Siesler et al. 2002; Ozaki et al. 2007; Bums and 
Ciurczak 2007; Workman and Weyer 2007). There¬ 
fore, to extract useful information from the NIR spec¬ 
trum is not always easy. Besides conventional spectral 
analysis methods, chemometrics has most often been 
used to extract information from NIR spectra. 

Instruments 

The most important elements for NIR instruments are 
light sources, spectrometers, and detectors. 

Light Sources 

To date, tungsten halogen lamps are unsurpassed as 
economical sources of radiation for NIR spectroscopy. 
The lamps show broad irradiation from 350 to 2,600 nm 
with a maximum at around 900 nm by the cleaning 
reaction of tungsten. The tungsten halogen lamps are 
ideally stable and incoherent light source; however, the 
heat generation may cause damage to the sample. In that 
case, an IR cut filter (a cold filter) is effective to elimi¬ 
nate the heat. Recent cold light sources, LEDs, will 
significantly grow over the next decades. NIR LEDs 
have relatively broad emission properties (EWHMs are 
about 50 nm). A long-life continuum source can be 
attained with a combination of several LEDs having 
different emission properties. High-power and narrow- 
band laser diodes (LDs) are also useful if the specific 
wavelength is decided. Very broad continuum coherent 


sources, in other words, “white lasers,” based on 
supercontinuum generation from a nonlinear device, 
such as an optical fiber, are now commercially available. 
The emission band ranges from 500 to 2,400 nm 
depending on the pump laser. The supercontinuum 
laser is a pulse source but as small as a laptop PC. 

If one can use a tunable Ti: Sapphire laser, spectral 
scans between 700 and 1,000 nm can be performed 
without a monochromator. Although TiiSapphire laser 
has been known as a q-switched pulse laser, nowadays, 
continuous wave (CW) TiiSapphire laser is also avail¬ 
able. In case one needs coherent light in a longer wave¬ 
length region (up to 2,600 nm), an optical parametric 
oscillator (OPO) with an Nd:YAG laser may be avail¬ 
able. Digital Micromirror Device (DMD), which can 
be used instead of a monochromator, realizes a high¬ 
speed programmable spectral illumination. Since 
a DMD chip is a dispersive element having several 
hundred thousand microscopic mirrors on its surface, 
a DMD source can generate not only a single wave¬ 
length but also multiwavelengths (a spectral pattern) at 
once. Although a classic tungsten-halogen lamp is 
used as the original source, the ability of rapid wave¬ 
length change and special resolution will open a new 
avenue of NIR illumination. 

Spectrometers 

Each NIR spectrometer is not different from those used 
in the visible (VIS) or IR region. However, due to the 
wide NIR wavelength range, dispersion type and Fou¬ 
rier transform (FT) type spectrometers coexist in the 
NIR region (Siesler et al. 2002; Ozaki et al. 2007; 
Bums and Ciurczak 2007; Workman and Weyer 
2007). The dispersion spectrometers equipped with 
optical gratings cover a whole range of an NIR spec¬ 
trum with good stability of the light intensity. On the 
other hand, FT spectrometers are well suited for mid- 
and long-wavelength regions of the NIR region and 
excellent at wavenumber (wavelength) accuracy. 

Detectors and Others 

PbS, InAs, and InGaAs detectors are frequently used 
for NIR spectrometers (Siesler et al. 2002; Ozaki et al. 
2007; Bums and Ciurczak 2007; Workman and Weyer 
2007). For the SWNIR region. Si photodiodes (gener¬ 
ally used for VIS) are also available (Siesler et al. 
2002). Moreover, the recent development of array 
detectors is driving the expansion of NIR imaging 
instruments as will be discussed later. 
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detection incidence 



Near-Infrared Spectroscopy in Biological Molecules and 
Tissues, Fig. 1 Diffuse reflected light of interactance 
measurement 

The use of fiber optics is a unique characteristic of 
NIR spectroscopy. Contact fiber probes, called 
interactance probes, realize nondestructive measure¬ 
ments of opaque samples. As shown in Fig. 1, incident 
light has to diffuse into a sample with minimal surface 
reflectance, and a part of the light arrives at a detection 
probe. An interactance method is not applicable to 
a transparent sample because it is based on diffuse 
reflectance of particles. The total path length of an 
interactance method is about a few cm in the SW 
NIR region. Although the penetration depth of the 
light and the distance between the incidence and 
detection probes are roughly related, for more precise 
experiments, time-resolved measurements must be 
performed. 

Chemometrics 

Chemometrics is a chemical discipline that uses math¬ 
ematical and statistical methods to design or select 
optimal measurement procedures and experiments, 
and to provide maximum chemical information by 
analyzing chemical systems (Siesler et al. 2002; 
Ozaki et al. 2007; Bums and Ciurczak 2007; Workman 
and Weyer 2007), (Ntes et al. 2002; Jiang and Ozaki 
2002). It has been used extensively in NIR spectros¬ 
copy as a powerful tool to yield a mechanistic inter¬ 
pretation of complex physicochemical phenomena 
represented by a spectral data set. Since an NIR spec¬ 
trum is often obtained as a highly convoluted form of 
overtone or combination bands arising from multiple 
chemical components in the system, key information 
in the NIR spectum is distributed broadly throughout 
data. It is possible to condense the information into 


a very compact form by chemometrics. Chemometrics 
mostly used in NIR spectroscopy are those for (I) clas¬ 
sification, (2) multivariate calibration, and (3) self¬ 
modeling curve resolution (SMCR) (Siesler et al. 
2002; Ozaki et al. 2007; Burns and Ciurczak 2007; 
Workman and Weyer 2007), (Naes et al. 2002; Jiang 
and Ozaki 2002). 

In the practice of chemometrics for NIR spectra, 
a factor analytic technique called Principal Component 
Analysis (PCA) often plays an important role to eluci¬ 
date distinct and useful information for detailed anal¬ 
ysis of complex alternation of NIR spectra. PCA 
enables to condense the essence of the information 
present in the NIR spectral data into a very compact 
matrix representation referred to as scores and load¬ 
ings. The basic hypothesis of such factor analysis tech¬ 
niques is that the improved proxy of the original data 
matrix can be reconstructed from only a limited num¬ 
ber of significant factors. Thus, while the score and 
loading matrices contain only a small number of fac¬ 
tors, it effectively carries all the necessary information 
about spectral features and leading to sorting out the 
convoluted information content of highly complex 
chemical systems. 

The spectra can be represented as a matrix X of 
m by n dimension. 


Xii 

Xl2 ■ 

^In 

X2\ 

X22 ■ 

^2n 

Xml 

^ml 

^mn 


where m is the number of spectra traces in this data set 
and n is the number of data points per spectrum along 
the wavenumber (or any other spectral variable) axis. 
PCA may be viewed as mathematical decomposition 
of spectral data in terms of the orthogonal set of dom¬ 
inant factors, i.e., eigenvectors. PCA results in yielding 
two matrices called scores and loadings which com- 
plementarily represent the entire features broadly dis¬ 
tributed in NIR spectral data. 

In short, it becomes 

X = TP' + E (2) 

where T is an w? by r PCA score matrix and P is an n 
by r PCA loading matrix, respectively. The rank r 
corresponds to the number of principal components 
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representing the significant portion of the information 
contained within the data matrix X. The selection of r 
is somewhat arbitrary. It is usually set to be a number, 
as small as possible but sufficiently large enough such 
that there are no obvious spectral features found in the 
residual matrix E. The residual matrix E is the portion 
of the original data, which is not accounted for by 
the first r principal components used for the data rep¬ 
resentation. Importantly, the two matrices T and P 
complementally represent the entire features broadly 
distributed in X. Namely, T holds abstract information 
concerning the relationship among the samples and 
P contains summary of variable, e.g., wave number 
which provides chemically or physically meaningful 
interpretation to the pattern observed in T. The detailed 
analysis of such matrices brings useful insight into 
building a mechanistic model for understanding com¬ 
plex phenomena studied by NIR spectroscopy. 

Multivariate calibration is also a major 
chemometrics technique widely used in NIR spectro¬ 
scopic study (Siesler et al. 2002; Ozaki et al. 2007; 
Burns and Ciurczak 2007; Workman and Weyer 2007), 
(Naes et al. 2002; Jiang and Ozaki 2002). The main 
purpose of calibration in NIR spectroscopy is to esti¬ 
mate the actual quantity of specific chemical compo¬ 
nents in the object by using NIR light. Because of the 
well-known collinearity problem of NIR spectra with 
so many data points in the wavenumber direction being 
highly correlated, the latent relationship among spec¬ 
tral variables cannot be obtained from straightforward 
general inverse technique like Multiple Linear Regres¬ 
sion (MLR). Instead, a chemometrics technique called 
Partial Lest Squares (PLS) regression often provides 
a solid solution for the collinearity problem of NIR 
spectra. In PLS, one of the most important points lies in 
the fact that spectral data matrix X is decomposed into 
a linear combination of scores T and loadings P matri¬ 
ces. For example, the decomposition of X is carried out 
by repeatedly regressing the score along the given 
concentration vector Y containing the known concen¬ 
trations of a specific analyte for a series of spectra. PLS 
modeling essentially ends up with a regression vector 
b which is a linear combination of the absorbance 
values at selected wavenumbers. The regression vector 
or matrix carries the information mostly relevant to the 
determination of the concentration of the analyte and 
less sensitiveness to the overall effect of interference. 
Once one obtained the regression vector b, it can later 
be used to estimate the concentration of the analytes 


even in unknown samples as long as their NIR spectra 
are measured. 

Another important chemometric method, SMCR 
will be outlined in NIR Imaging section. It is important 
to point out that the application of chemometrics is not 
limited to NIR spectroscopy. It is generally applicable 
to various kinds of spectroscopy, e.g., IR and Raman. 
Similar techniques are also extensively used in Quan¬ 
titative Structure-Activity Relationship (QSAR) stud¬ 
ies which are often referred to as Chemoinformatics. 

Applications 

Biological Molecules 

NIR spectroscopy has been extensively used to inves¬ 
tigate structures and functions of a variety of biological 
molecules from water molecules, carbohydrates, to 
proteins (Siesler et al. 2002; Ozaki et al. 2007; Bums 
and Ciurczak 2007; Workman and Weyer 2007). One 
of unique points of NIR spectroscopy in the studies of 
biological molecules is that it is powerful to explore 
hydrogen bondings or inter- and intramolecular inter¬ 
actions. This characteristic occurs because of the fol¬ 
lowing two reasons. One is that the NIR region is 
dominated by bands ascribed to functional groups 
that contain a hydrogen atom (e.g., 0-H, C-H, N-H). 
Another is that a band shift due to the formation of 
a hydrogen bonding is much larger in the NIR region 
compared with the IR region. 

NIR spectroscopy has several advantages over IR 
spectroscopy in studying biological molecules. First, 
one can use a cell having thickness of 0.1-1 mm, and 
thus, the exact concentration of a biomolecule solution 
can be estimated. In the case of IR spectroscopy, one 
must use a very thin cell (on the order of micrometers) 
or an attenuated total reflection (ATR) prism for the IR 
measurement of the biomolecule solution. Therefore, 
there always remains a problem of adsorption of bio¬ 
logical molecules on the cell. Another advantage of 
NIR spectroscopy is that the effects of bands due to 
water vapor are much weaker in the NIR region. Yet 
another advantage is that one can probe hydration of 
biomolecules. 

In NIR studies of biological molecules, generalized 
two-dimensional (2D) correlation spectroscopy has 
often been used to unravel NIR spectra (Noda and 
Ozaki 2004). Wu et al. (2000) used 2D correlation 
NIR spectroscopy to explore temperature-induced 
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Spectroscopy in Biological 
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Fig. 2 FT-NIR spectra of 
HSA in aqueous solutions with 
concentration of 1.0, 2.0, 3.0, 
4.0, and 5.0 wt.% over 
a temperature range of 45-80 
°C (Reproduced from ref. Wu 
et al. 2000. With permission. 
Copyright (2000) American 
Chemical Society) 



variation in the secondary structure and hydration of 
human serum albumin (HSA). Figure 2 shows FT-NIR 
spectra of HSA in aqueous solutions with concentra¬ 
tions of 1.0, 2.0, 3.0, 4.0, and 5.0 wt.% over 
a temperature range of 45-80° C (Wu et al. 2000). 
Concentration-perturbed 2D correlation spectra were 
calculated for the spectra in the 7,500-5,500 cm^^ and 
4,900-4,200 cm * regions at different temperatures. 
To investigate temperature-induced changes in the 
secondary structure and hydration, power spectra 
and slice spectra (Noda and Ozaki 2004) were calcu¬ 
lated from the synchronous and asynchronous spectra, 
respectively. Figure 3a, b depicts them, respectively 
(Wu et al. 2000). In the power spectra, a band 
near 4,600 cm” due to the combination mode of 
amide B and amide II (amide B/II) of HSA shows an 
abrupt shift by 5 cm”* between 58°C and 60°C. Both 
the power and slice spectra in the 7,500-5,500 cm”* 
region provide an explicit evidence that the hydration 
(water-protein interaction) changes markedly near 
60°C. A comparison of temperature-dependent fre¬ 
quency shifts between the band near 4,600 cm”* due 
to amide B/II of HSA and that near 7,000 cm”* due to 
the combination mode of water reveals that the protein 
unfolding occurs almost in parallel with the change 
in the hydration. This study has demonstrated that 
the overtone and combination modes are very useful 
in monitoring subtle changes in protein dynamics 
(Wu et al. 2000). 

Another interesting example of NIR studies of 
biological molecules is an investigation of 


C-H.. .0 = C hydrogen bonding in a bacteria- 
synthesized biopolymer, poly (3-hydroxybutyrate) 
(PHB). PHB is one of the most extensively studied 
biodegradable thermoplastic polymers. 

NIR spectroscopy has certain advantages in study¬ 
ing polymers and biopolymers (Siesler et al. 2002; 
Ozaki et al. 2007; Burns and Ciurczak 2007; Workman 
and Weyer 2007). In general, sample preparation and 
handling are much easier for NIR spectroscopy than 
for IR spectroscopy. It can be applied to thick films, 
sheets, granulates, powders, fibers, and textiles. Fur¬ 
thermore, in conjunction with light fiber technique, it 
enables online monitoring of polymer reactions and 
productions. NIR spectroscopy is also powerful in 
studying hydrogen bondings, hydration, and inter- 
and intramolecular interactions in polymers and bio¬ 
polymers. By combining IR and NIR data, one can 
explore the anharmonicity of a band which is very 
sensitive to the formation and strength of a hydrogen 
bond. 

Hu et al. (2006) investigated the 
C-H.. .0 = C hydrogen bonding of PHB by using 
NIR spectroscopy. An NIR spectrum of a PHB film 
changes remarkably with the evolution of 
melt-crystallization process. Accordingly, NIR bands 
due to the amorphous and crystalline states can easily 
be assigned. Figure 4a shows temporal changes of NIR 
spectra in the range of 6,050-5,650 cm”* of the PHB 
film during the melt-crystallization process at 125°C. 
Figure 4b depicts the second derivatives of the spectra 
measured at 0 and 180 min. Particular attention was paid 
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Wavenumber/cm 


Near-Infrared Spectroscopy in Biological Molecules and 
Tissues, Fig. 3 (a) Power and (b) slice spectra of the synchro¬ 
nous and asynchronous 2D correlation spectra calculated from 

to an NIR band at 5,973 cm”* due to the first overtone of 
the C-H stretching mode and an IR band at 3,435 cm” 
due to the first overtone of the C = O stretching mode 
of the C-H.. .O = C hydrogen bonding. The 
anharmonicities of the C-H (5,973 cm”*) and C = 
O (3,435 cm”*) bands arising from the crystalline parts 
and those of the amorphous C-H (5,954 cm”*) and C = 
O (3,457 cm”*) bands were estimated by using both IR 
and NIR data. The anharmonicities of the crystalline 
C-H and C = O bands are significantly different from 
those of the amorphous C-H and C = O bands, supporting 
the idea that both C-H and C = O bonds in the crystalline 
state are involved in the C-H.. .0 = C hydrogen bonding. 

Cells and Tissues 

Recently, noninvasive NIR measurements have 
achieved as time-domain and frequency-domain spec¬ 
troscopy methods (Tromberg et al. 2000; Hawrysz and 
Sevick-Muraca 2000). The measurements of time- 
domain spectroscopy are performed by repetitive 



Wavenumber/cm 

the spectra shown in Fig. 2 (Reproduced from ref. Wu et al. 
2000. With permission. Copyright (2000) American Chemical 
Society) 

pulse and recording the phase delay at a fixed distance 
from the incidence site. For frequency domain spec¬ 
troscopy, intensity-modulated NIR lights in MHz-GHz 
frequency range are used. (The continuous wave tech¬ 
niques may give the average distribution of path length 
in turbid samples from the spatial properties of 
scattered light shown in Fig. 1, but have no means 
of detecting the real path lengths of each photon.) 
These methods are extensively applied to detection 
of cancer cells as imaging and depth profiling tech¬ 
niques (Tromberg et al. 2000; Hawrysz and Sevick- 
Muraca 2000). 

Metal nanoparticles are receiving particular atten¬ 
tion as new molecular probes. Gold nanorods, whose 
size is about 10 nm in diameter and over 50 nm in 
length, show corresponding transversal surface plasma 
oscillation peak at 520 nm and longitudinal one at 
around 800 nm. The longer the length of rods grows, 
the longer the peak wavelength of the longitudinal 
mode shifts (Kim et al. 2002). The strongly absorbed 
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Near-Infrared Spectroscopy in Biological Molecules and 
Tissues, Fig. 4 (a) Temporal changes of NIR spectra in 
the range of 6050-5650 cm“* of the PHB film during the 
melt-crystallization process at 125 °C. (b) Second derivatives 
of the spectra measured at 0 and 180 min (Reproduced from ref. 
Hu et al. 2006. With permission. Copyright (2006) American 
Chemical Society) 

NIR light is converted efficiently into heat on 
a picosecond time domain due to electron-phonon 
and phonon-phonon processes. This heat generation 
is used for effective photothermal cancer therapy with 
low-power NIR laser irradiation (Huang et al. 2006). 
The excitation of surface plasma oscillations and the 
generation of strongly enhanced electric held are also 
used for surface-enhanced Raman scattering (SERS) 
(Guo et al. 2007). 

Nondestructive and Noninvasive Analysis 

Since NIR spectroscopy is a nondestructive analytical 
technique with no or little pretreatment, applications of 
NIR spectroscopy to in vivo biomedical analysis and 


diagnosis have received keen interest (Siesler et al. 
2002; Ozaki et al. 2007; Burns and Ciurczak 2007; 
Workman and Weyer 2007), (Heise 2002; Du et al. 
2007; Ozaki et al. 2009). Particularly, noninvasive 
brain function analysis (Heise 2002; Du et al. 2007) 
and noninvasive blood glucose measurement (Ozaki 
et al. 2009) have been matters of extensive NIR 
studies. 

The in vivo blood glucose measurement with NIR 
spectroscopy is a very challenging project because it 
deals with very weak signals of glucose directly from 
human skin, and the physiological conditions of skin 
tissue such as body temperature change easily with 
time (Ozaki et al. 2009). One critical difficulty associ¬ 
ated with in vivo blood assay is an extremely low 
signal-to-noise ratio of glucose existing in human 
issues. When NIR spectra are measured in vivo, 
background noise easily hides the glucose signal. To 
overcome this problem, Maruo et al. (2003, 2006) 
developed a new NIR spectrometer system with a set 
of two optical hbers to obtain the dermis spectra selec¬ 
tively. One set of optical hbers is attached to the skin 
surface vertically. The skin surface is illuminated by 
the measuring light through the inlet optical hber, and 
the scattered light is collected by the detecting optical 
hber (Fig. 1). The hber distance controls the penetra¬ 
tion depth of the measuring light as conhrmed by 
computer simulations based on a Monte Carlo method. 
This property has been used to restrict the measure 
absorption to the dermis layer of skin. Figure 5 
displays 50 NIR spectra of the forearm human skin 
of one subject measured during one oral glucose intake 
experiment (Maruo et al. 2006). A strong band at 
1,450 nm is assigned mainly to the combination 
of the antisymmeric and symmetric 0-H stretching 
modes of water. The sharp absorbance peak at 
1,450 nm suggests that our system using the novel 
hber probe can reduce interference from absorption 
by the stratum corneum. 

An in vitro study using bovine serum by Maruo 
et al. (Maruo and Oka 1999) showed the important 
role of the glucose absorption peak at 1,580 nm for 
a glucose assay. The characteristics of the peak 
observed in the in vivo study described above are in 
accord with that of the in vitro study. This result means 
that the small absorbance change of glucose in vivo 
can be detected by the newly developed NIR instru¬ 
ment for blood glucose measurement (Maruo et al. 
2003). 
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Near-Infrared Spectroscopy in Biological Molecules and 
Tissues, Fig. 5 Fifty NIR spectra of the forearm human skin 
of one subject (Subject 6) measured during one oral glucose 
intake experiment (Reproduced from ref. Mamo et al. 2006. 
With permission. Copyright (2006) Society for Applied 
Spech'oscopy) 

Six subjects were subjected to oral glucose intake 
experiment (Maruo et al. 2006). About 50 paired data 
sets, i.e., the reference blood glucose contents and the 
NIR spectra, were obtained for each experiment. 
Maruo et al. (2006) obtained the correlation coeffi¬ 
cient, R; 0.934 and standard error of prediction 
(SEP); 23.7 mg/dl for the six subjects using individual 
calibration models. Figure 6 presents the result for 
Clarke error grid analysis obtained using the same 
data sets of the six subjects. The data plots included 
are: for the A zone, 71.3%; B zone, 21.3%; C zone, 0%; 
D zone, 7.4%; and E zone, 0%. Almost all prediction 
plots exist in zones A and B. However, it is noted that 
there are some plots in zone D. The plots in zones 
A and B are clinically acceptable, while those in 
zones C, D, and E are potentially dangerous, and, 
therefore, they may have the clinically significant 
errors. In this study, the plots in D zone have an 
overestimated value below 70 mg/dl of blood glucose 
contents. To improve these prediction errors below 
100 mg/dl, not only the reduction of measurement 
noise but also the preparation of analytical data sets 
with adequate blood glucose contents is important. 

Mapping brain function of humans with diffuse opti¬ 
cal tomography (DOT) based on NIR spectropscopy has 
recently received great interest because it has led to 


a revolution in brainscience and medicine, enabling the 
functional mapping of the human brain on a scale that 
was previously accessible only through invasive studies 
of animals (Dehghani et al. 2009; White et al. 2009). 
DOT is uniquely suited to noninvasive brain and neuro¬ 
imaging as it is a mobile system utilizing a small, flexible 
imaging cap. DOT can measure absolute changes 
in oxygenated (AHb02), deoxygenated (AHbR), and 
total hemoglobin (HbT), yielding more comprehensive 
images of the brain’s hemodynamics. DOT methods are 
advancement beyond previous optical imaging tech¬ 
niques performed in topographic method, often referred 
to as diffuse optical imaging (DOI). DOT encapsulates 
a variety of technological improvements to resolution 
and depth sectioning. Recently, White et al. (White et al. 
2009) revealed resting-state functional connectivity in 
the human brain with DOT. Figure 7 illustrates repeat¬ 
ability of functional connectivity of DOT over multiple 
imaging sessions (AHbR) (White et al. 2009). Correla¬ 
tion analysis from three sessions showed that the patterns 
were qualitatively similar over multiple days (Fig. 7). 
The results obtained by White et al. (2009) demonstrated 
the successful application of functional cormectivity 
methods to DOT of adult human subjects. 

NIR Imaging 

NIR imaging is a technique to measure a spectrum for 
every pixel dividing an object into many spatial parts 
(Sasic and Ozaki 2010). A unique feature of such 
spectroscopic imaging technique is that it can take 
full advantage of spatial and molecular structural 
information about chemical moieties. Not only spec¬ 
tral variable but also spatial information is available in 
spectroscopic imaging. A graphical representation of 
NIR imaging is shown in Fig. 8. Each pixel point of an 
NIR image consists of a full NIR spectrum, and each 
microimage is a collection of spectral intensity mea¬ 
sured at a fixed wavenumber. 

Since a set of NIR imaging spectra contains infor¬ 
mation about the spatial dimension (X- and Y-axis) 
and spectral dimension (wave number or wavelength), 
the interpretation of complex variations of NIR imag¬ 
ing spectra becomes a challenging task. In the actual 
application of NIR imaging, it is often accompanied 
with some chemometrics techniques such as PCA and 
SMCR so that spatial as well as molecular structure 
information can be effectively extracted from the data. 
For example, scores represent physical or chemical 
distribution of main contributing sources among 
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Spectroscopy in Biological 
Molecules and Tissues, 

Fig. 6 Correlation between 
reference and predicted blood 
glucose contents. The lettered 
zones represent clinically 
relevant regions of 
performance; (A) clinically 
accurate readings; (B) results 
that would lead to benign 
action or inaction by the user; 
(C) results that would lead to 
unnecessary corrections; (D) 
results that would lead to 
inaction when action is 
necessary; (E) results that 
would lead to treatment that is 
opposite to what clinical 
accuracy would call for 
(Reproduced from ref. Maruo 
et al. 2006. With permission. 
Copyright (2006) Society for 
Applied Spectroscopy) 



R = 0.934, SEP = 23.7 mg/dl 

A zone: 71.3% B zone: 21.3% C zone: 0% D zone 7.4% E zone 0% 


samples and loadings indicate corresponding chemical 
compositions. Thus, NIR imaging is useful to examine 
distribution of chemical components within a sample 
or to prohe morphological feature of materials at the 
molecular level. 

In PCA, a data matrix is decomposed in terms of the 
orthogonal set of dominant factors, i.e., eigenvectors. 
That is to say, PCA does not reveal the “true” under¬ 
lying physical sources of data but provides only their 
linear combination fulfilling the orthogonal constrains. 
Thus, scores and loadings can take both positive and 
negative values, such that they usually do not have any 
direct connection to physical meaning, even though 
these solutions are based on solid mathematical foun¬ 
dation. An alternative factor analytic technique, which 
can provide NIR imaging data a more hrm link with 
a physical model, is SMCR (Jiang and Ozaki 2002). In 
SMCR analysis, one tries to decompose the data matrix 
X into more physically interpretable forms. In a simple 
bilinear model, one has 


X = CS' + E (3) 


where the m by r matrix C contains normalized con¬ 
centration prohles of r individual chemical species, 
and the n by r matrix S contains pure component 
spectra of the same species. The basic principle of 
SMCR is to seek a bilinear model that gives the best 
fit, in the sense of least squares or weighted least 
squares. In SMCR analysis, C and S are usually 
obtained by alternating least-square (ALS) problems 
unlike PCA. The two matrices, C and S, complementally 
represent the entire features broadly distributed in X. 
Namely, concentration profile C holds quantitative infor¬ 
mation chemical components distributed over the sam¬ 
ples, and pure component spectra S contains actual pure 
spectra of the corresponding chemical components. 

Figure 9 represents an illustrative example of 
SMCR analysis based on NIR imaging of cellulose 
tablet (Shinzawa et al. 2009). A set of NIR spectra 
collected at different positions of a cellulose tablet is 
decomposed into concentration profiles C and pure 
component spectra S by SMCR. Concentration profiles 
offer quantitative information of each chemical com¬ 
ponent within the cellulose tablets. Namely, distribu¬ 
tion of crystalline and amorphous components of the 


N 
































1704 


Near-Infrared Spectroscopy in Biological Molecules and Tissues 
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Near-Infrared Spectroscopy in Biological Molecules and 
Tissues, Fig. 7 Repeatability of DOT over multiple imaging 
sessions (subject 1, AHbR). Seed boxes are shown in gray. 
Images from different sessions are not co-registered. (a-c) Cor¬ 
relation maps within the visual cortex from the left visual cortex 


Right motor cortex seed 

d 



0 1 


seed, (d-f) Correlation maps within the motor cortex from the 
right motor cortex seed. Note the similarity in the patterns in 
each session (with slight linear translations between days) 
(Reproduced from ref. White et al. 2009. With permission. 
Copyright (2009) Elsevier) 


Near-Infrared 
Spectroscopy in Biological 
Molecules and Tissues, 

Fig. 8 Graphical 
representation of NIR imaging 
data 


NIR spectra are measured 
at each pixel 



cellulose. Corresponding pure component spectra rep¬ 
resent pure NIR spectra of crystalline and amorphous 
components of cellulose. Thus, it becomes possible to 
derive key information concerning spatial distribution 


as well as molecular structure information of the 
cellulose tablet by SMCR. 

The molar absorptivity of water in NIR region 
becomes relatively small, and NIR light can typically 
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Near-Infrared 
Spectroscopy in Biological 
Molecules and Tissues, 
Fig. 9 An illustrative 
example of SMCR study 


Set of spectra collected 
at each pixel 




X = CS^ 
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penetrate much farther into a sample than IR radiation. 
It, therefore, can be very useful in probing the system 
containing some level of water content. The ability of 
NIR imaging offers useful information especially in 
the field of pharmaceuticals as well as food science 
since the distribution of specific chemical component 
within the system can often be of major interest in 
many product manufacturing processes. 

Cross-References 

► Infrared Spectroscopy of Cells, Tissues, and 
Biofluids 

► Infrared Spectroscopy of Protein Structure 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Protein Dynamics Studied with ID and 2D Infrared 
Spectroscopy 

► Raman Spectroscopic Instrumentation, 
Experimental Considerations 

► Raman Spectroscopy and Microscopy of Cells and 
Tissues 

► Surface-Enhanced Raman Spectroscopy for 
Bioanalytics 
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Definition 

We discuss the use of neutron scattering as a tool for 
revealing the internal structure of biological membranes. 

Basic Characteristics 

Clifford Shull and Bertram N. Brockhouse were 
awarded the 1994 Nobel Prize in Physics “for 
pioneering contributions to the development of neutron 
scattering techniques.” Today, neutron scattering offers 
a powerful suite of techniques for determining atomic 
and spin structures, and dynamics (i.e., local and collec¬ 
tive motions). These techniques are widely used in 
condensed matter physics, materials science, materials 
chemistry, polymer science, the biological sciences, and 
engineering. Because of their intrinsic properties, ther¬ 
mal and cold neutrons are ideally suited for studying 
a wide range of systems, including biological materials. 
For example, the interaction with matter of neutrons 
with energies less than 1 eV reveals atomic resolution 
detail, as their wavelengths (A, > 0.03 nm) are of the 
order of interatomic distances. Similarly, because their 
energies are comparable to chemical bond energies, and 
thermal motions, in crystals and liquids, neutrons are 
extensively used in dynamical studies. Neutron scatter¬ 
ing techniques are also complementary to those of 
X-rays and electrons, especially in studies of biological 
materials inherently rich in hydrogen and low atomic 
number elements. In fact, neutron scattering takes 
advantage of such materials, where contrast variation 
methods - the selective substitution of hydrogens for 
deuterons - can either amplify or nullify the scattering 
from selected parts of a biomolecule (Fitter et al. 2006). 

The technique of elastic neutron scattering has been 
successfully applied to four general areas of biophysics: 
high-resolution single crystal diffraction; a) low- 
resolution diffraction from samples with long-range 
order in one or two dimensions; b) reflectometry from 
thin films; c) and d) small-angle scattering from ran¬ 
domly oriented structures. The detailed analysis of pro¬ 
tein interactions mediated by hydrogen atoms is usually 
non trivial to determine by X-ray crystallographic stud¬ 
ies, as it is difficult to locate hydrogen atoms even when 
high-resolution X-ray data are available - i.e.. X-ray 
scattering increases with increasing atomic number. 
On the other hand, high-resolution neutron diffraction 
studies have accurately revealed the positions of 


hydrogen atoms, and distinguished nitrogen atoms 
from similar atomic number elements, such as carbon 
and oxygen (Schoenbom and Knott 1996). 

Neutron Diffraction 

With regard to structural biology, the various scattering 
techniques complement crystallographic studies that, 
in many cases, require hard-to-obtain, high-quality 
crystals of macromolecules. Due to the intrinsic disor¬ 
der present in biomimetic systems - disorder is consid¬ 
ered important for the proper function of biological 
systems - the vast majority of membrane samples do 
not form perfect, or even near perfect crystals, that are 
needed to solve structure to atomic resolution. The 
limited amount of attainable data from such samples is 
then best described by broad statistical distributions and 
a membrane’s overall characteristics. For example, in 
the case of model membrane systems (i.e., positionally 
correlated structures), the position and amplitude of 
Bragg reflections reveal the membrane’s lamellar peri¬ 
odicity and one-dimensional scattering density profile. 
Further, by changing the systems “contrast” through the 
exchange of H 2 O for D 2 O, it is possible to determine the 
effect of membrane additives, such as cholesterol, on 
the lipid bilayer and its associated water layer, and the 
extent that water penetrates into the bilayer. For 
example, bilayers made of short- and long-chain phos¬ 
pholipids thickened in the presence of cholesterol. This 
manifestation by cholesterol on bilayer thickness was 
interpreted by Kucerka et al. as a disorder-order transi¬ 
tion induced by cholesterol on the lipid’s acyl chains. 
This effect dominates over the other option of rectifying 
the hydrophobic mismatch (Kucerka et al. 2009). 

Another set of experiments that take advantage of 
the H/D substitution is selective labeling. The use 
of bulky labeling probes is ubiquitous in fluorescence 
light microscopy - and other techniques (e.g., electron 
spin resonance) - but they can affect a membrane’s 
structural and dynamical properties. For the most part, 
substituting H for D does not alter the system’s 
chemicophysical properties but does profoundly 
change the system’s contrast. Importantly, neutron 
data from a system with and without the label differ 
only in the region of the label. A Fourier difference 
profile can then directly localize the labeled portion 
of a molecule (Schoenbom and Knott 1996). Over 
the years, this technique of isotopic substitution 
has been extensively used to study the structure of 
model membranes, proteins, DNA/RNA, and viruses 
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Neutron Scattering of Membranes, Fig. 1 Schematic of cholesterol’s orientation in (a) commonly studied lipid bilayers and (b) 
PUFA lipid bilayers (Harroun et al. 2006) 


(Pabst et al. 2010). Using this technique, the location of 
cholesterol in lipids with polyunsaturated fatty acid 
(PUFA) chains was determined. 

Cholesterol is understood to orient parallel to the 
lipid hydrocarbon chains. In 2006 using deuterated 
cholesterol, Harroun et al. demonstrated this to be the 
case in a number of different lipid bilayers. In contrast, 
however, cholesterol was found to sequester in the 
center of PUFA lipid bilayers (Fig. 1). Follow-up 
studies demonstrated the modulated preference of cho¬ 
lesterol for different lipids, clearly highlighting the 
need for the great diversity of lipids found in biology 
(Kucerka et al. 2010). For example, in plasma mem¬ 
branes sphingolipids are primarily located in the outer 
monolayer, whereas unsaturated phospholipids are 
more abundant in the inner leaflet. It is thus possible 
that the presence of PUFA in the inner bilayer leaflet 
may serve to enhance the transfer of cholesterol to the 
outer layer, potentially modifying membrane function. 
This mechanism then invokes the necessity for rafts - 
(i.e., functionalized domains) that could facilitate the 
biosynthetic pathways of cholesterol and its transport 
to and from cells. 

In order to reconstruct the real-space structure of 
a membrane, both amplitude and phase are needed. 
However, in the majority of scattering experiments. 


only the amplitude of the structure factor can be 
obtained by experiment - the phase information is 
lost. In order to recover the phase component of the 
structure factor, various methods have been developed. 
The cholesterol example presented above used contrast 
variation to obtain the phase component of each indi¬ 
vidual Bragg reflection. In reflectometry and small- 
angle scattering experiments, real-space solutions 
rely on fitting the data to a realistic model. In doing 
so, one must keep in mind the possibility that the data 
can be fitted equally well by more than one model. In 
the case of neutron reflectometry studies, Majkrzak 
et al. (2000) have made good use of polarized neutrons 
and reference substrates made of Cu, Ni, and Mo to 
obtain both the amplitude (real part) and phase 
(imaginary) components of the complex reflection, 
allowing for the data (reciprocal space) to be directly 
inverted into a unique scattering density profile 
(real space). 

Small-Angle Neutron Scattering 

Small-angle neutron scattering (SANS) is probably the 
technique most commonly applied to biological 
materials as it can probe length scales ranging from 
nanometers to fractions of a micron when ultra SANS 
is used. In the case of unilamellar vesicles (ULVs) 
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Neutron Scattering of Membranes, Fig. 2 (a) Predicted 
scattering from polydisperse ULV containing multiple domains 
(N = 1-32). The number of domains increases on going from the 
bottom to the top curves. Curves are shifted on the vertical axis 


to facilitate viewing, (b) Fits to a 1:1:1 DOPC to DPPC to 
cholesterol ULV system corresponding to the superposition of 
signals from ULVs with single and multiple domains (Adapted 
from Pencer et al. 2005) 


(hollow spherical structures consisting of single lipid 
bilayer), SANS can evaluate both the ULV’s size, and 
the thickness of the bilayer that forms the ULV. This 
technique thus allows for the accurate structural deter¬ 
mination of ULVs that have been fabricated for deliver¬ 
ing dmgs and imaging diseased tissue, introducing 
genetic material to living cells, and enhancing the effi¬ 
cacy of various medical imaging techniques (Nejat 
2009). Although ULVs exist in solution, and their iso¬ 
tropic structural dimensions are typically characterized 
by low-resolution data, much insight into a system can 
be achieved through the previously discussed contrast 
variation techniques. For example, Pencer et al. (2005), 
using coarse-grained models of heterogeneous ULVs, 
demonstrated SANS’ potential to detect and distinguish 
between lateral segregation in membranes. This was 
done by exploiting the unique sensitivity of neutron 
scattering to differences between FI and D atoms 
(Fig. 2), as was discussed previously. Most recently, 
a new and innovative advance in scattering theory and 
experiment enables neutron data to be analyzed in 
such a manner as to provide not only structural informa¬ 
tion in the direction perpendicular to the plane of the 
bilayer, but also information regarding a membrane’s 
in-plane organization, e.g., lateral lipid segregation 
(Iglic 2010). 
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Nicotinamide Adenine Dinucleotide 
(NAD) 

Mathias Ziegler 

Department of Molecular Biology, University of 
Bergen, Bergen, Norway 

Synonym 

Diphosphopyridine nucleotide (DPN) 


Definition 

NAD is a dinucleotide composed of two mononucleo¬ 
tides (adenosine 5'-monophosphate, AMP, and nicotin¬ 
amide mononucleotide, NMN), which are joined through 


their 5' phosphate groups (Fig. 1). The molecule is pre¬ 
sent in all living cells and reversibly converted between 
its oxidized (NAD"^) and reduced (NADH) forms acting 
as a coenzyme in many metabolic hydride transfer 
(redox) reactions (Fig. 2). NAD"^ serves also important 
functions in signal transduction pathways. A phosphory- 
lated derivative of NAD, NADP, is also present in all 
organisms, exhibits similar physicochemical properties, 
but apparently does not have a prominent role 
in signaling. 

Physicochemical Characteristics of NAD 

In its naturally occurring, biologically active form, the 
nicotinamide moiety of NAD is attached to the anomeric 
carbon of the ADP-ribosyl moiety in the P-conformation 
(Fig. 1). Both NAD"^ and NADH exhibit an absorption 
maximum at 259 nm owing to the presence of the ade¬ 
nine base (e = ^16,200 M~*cm^* for NAD"^). Due to the 
reduced nicotinamide ring, NADH has an additional 
absorption maximum at 340 nm (s = ^6,200 IS/T'cm”*). 
This property is widely used to measure enzymatic 
reactions which require NAD as coenzyme. NADH is 
a strong reducing agent, the midpoint ► redox potential 
of the NAD-f/NADH redox pair being -0.32 V. 

Unlike NAD"^, NADH is fluorescent with an emission 
maximum at ^460 nm (excitation at 340 nm). The 
fluorescence lifetime (^0.4 ns in aqueous solution) 
increases signiflcantly, when the molecule binds to pro¬ 
teins. This property has been exploited to estimate con¬ 
centrations of free vs. protein-bound NADH or the redox 
state (NADH/NAD"^ ratio) in living cells by means of 
fluorescence microscopy, but can also be used to deter¬ 
mine binding constants for isolated proteins. 

Biological Functions of NAD 

NAD plays two distinct biological roles. Similar to 
ATP, it is a key molecule in both energy transduction 
and signaling pathways. In bioenergetics, NAD is 
a coenzyme which serves as reversible electron carrier 
in a multitude of redox reactions. In signaling processes, 
however, only NAD"^ is involved. NAD is regarded as an 
intracellular molecule. However, in higher organisms 
low amounts are present in extracellular fluids (e.g., 
blood plasma), thereby providing substrate for extracel¬ 
lular NAD'^-mediated signaling events. 
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Fig. 1 Chemical structure of NAD"^ 



Ribose-ADP Ribose-ADP 

H 

R-OH + NAD+ « — R = 0 + NADH + H+ 

Nicotinamide Adenine Dinucleotide (NAD), Fig. 2 Function 
of NAD as redox carrier. Top: In metabolic reactions, NAD"^ can 
take up two electrons (and a proton), which are added to the 
nicotinamide moiety. Bottom: The reversible oxidation of an 
alcohol to an aldehyde illustrates the typical redox function of 
NAD in metabolism 

Redox Functions of NAD 

Most enzymes that use NAD as a coenzyme are 
oxidoreductases. That is, they oxidize a substrate 
(extract electrons) and at the same time reduce NAD"^ 
to NADH (Fig. 2; note that the designations oxidation 
and reduction in the Figure refer to the changes in 
NAD, not the substrate). Generally, these reactions 
are reversible. In the major catabolic routes, electrons 
from nutrients (such as glucose and fatty acids) are 
transferred onto NAD"^ to yield NADH. Eventually, 
the electrons are donated to the respiratory chain (to 
drive ATP synthesis) and thereby NADH is reoxidized 
to NAD"^ via ► NADHiubiquinone oxidoreductase 
(complex I). The redox functions of NAD and the met¬ 
abolic enzymes catalyzing the corresponding reactions 
are comprehensively covered in all standard textbooks of 
biochemistry. 


Signaling Functions of NAD'^ 


Over many years, the biological role of NAD had been 
thought to be limited to metabolic redox reactions. It has 
now become clear that NAD"^ is also a key element of 
various regulatory pathways (Berger et al. 2004). Three 
distinct types of reactions are known in which NAD"^ is 
cleaved to yield nicotinamide, while the remaining ADP- 
ribosyl moiety is converted into signaling molecules 
or attached to proteins. First, NAD"^ glycohydrolases 
(the best characterized representative in humans being 
CD38) produce ADP-ribose and cyclic ADP-ribose 
(cADPR). These molecules activate calcium channels in 
the plasma membrane or the endoplasmic reticulum, 
respectively, thereby leading to an increase of the cyto¬ 
solic calcium concentration (Malavasi et al. 2008). Sec¬ 
ond, ADP-ribosyl transferases catalyze the modification 
of Ultra- and extracellular proteins causing changes in the 
biological activities of these proteins. Cell-surface ADP- 
ribosyl transferases are predominantly involved in immu¬ 
nological functions (Seman et al. 2004). The intracellular 
enzymes have a broad range of functions including regu¬ 
lation of metabolic enzymes and control of nuclear pro¬ 
cesses. Poly-ADP-ribose polymerases (PARPs) generate 
polymers of ADP-ribose and attach them to target pro¬ 
teins. This modification has a critical impact on important 
nuclear events and has been demonstrated only in higher 
eukaryotes. The major mammalian PARP enzyme, 
PARP-1, is catalytically activated by DNA strand breaks. 
Therefore, extensive DNA damage can cause depletion of 
cellular NAD and cell death owing to rapid degradation of 
NAD"^ by PARP-1 in response to genotoxic assaults 
(Krishnakumar and Kraus 2010). Third, a class of protein 
deacetylases, the sirtuins, require NAD"^ as a co-substrate. 
The acetyl group of the acetylated protein is transferred 
onto ADP-ribose, thereby yielding unmodified protein 
and O-acetyl ADP-ribose, another potential signaling 
derivative of NAD"^. This mechanism has been identified 
as a major link between metabolism (by sensing the NAD 
concentration and redox state) and cellular/organismal 
fate in higher eukaryotes. The deacetylation of nuclear 
proteins (such as histones and transcription factors) by 
sirtuins is highly selective and regulates, for example, key 
metabolic processes, the biological clock and life span 
(Imai and Guarente 2010). Generally, NAD'^-dependent 
protein deacetylation is ancient and also found in bacteria. 
In higher eukaryotes, it also controls the activity of key 
enzymes of metabolic pathways compartmentalized in 
mitochondria. 
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NIR Imaging 


Subcellular Compartmentation and 
Biosynthesis of NAD 

In eukaryotes, the majority of NAD resides in the 
mitochondria and the nucleo-cytoplasm (the nucleus 
and cytosol are thought to be freely exchangeable), as 
well as in chloroplasts in plants. This is in agreement 
with the known metabolic and signaling pathways that 
require NAD. 

Cellular NAD synthesis (Berger et al. 2004; Magni 
et al. 2008) relies on different precursors. Most cells 
produce their NAD from the vitamin B3 precursors 
nicotinamide and nicotinic acid, collectively termed 
“niacin.” In mammals, vitamin B3 deficiency causes 
the disease pellagra. In some bacteria, NAD can be 
generated from aspartate, whereas in higher organisms 
a pathway of tryptophan degradation generates an NAD 
precursor, quinolinic acid. The ribosides of nicotinamide 
and nicotinic acid can also be utilized. Once they 
have entered cells, the precursors are converted to the 
corresponding mononucleotides (e.g., NMN from nico¬ 
tinamide). The mononucleotide is then converted to 
the dinucleotide by adding the AMP moiety from ATP. 
The corresponding enzymes, NMN/NAMN adenylyl- 
transferases are essential as they are required in all routes 
of NAD synthesis, regardless of the extracellular precur¬ 
sor used. When NAD is synthesized via a nicotinic acid 
derivative, the nicotinic acid dinucleotide is eventually 
amidated by NAD synthetase. 

Cross-References 

► Electron Transfer Cofactors 

► NADH-Ubiquinone Oxidoreductase (Complex I) 

► Redox Potential 
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Nitric Oxide Synthase - Computational 
Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

NOS 


Definition 

A heme-containing enzyme that catalyzes the production 
of nitric oxide from L-arginine. 

Basic Characteristics 

Nitric oxide (NO) is a cellular signaling molecule that 
has an important role in many biological processes in 
a wide range of organisms. Production of NO by nitric 
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oxide synthases (NOSs) is achieved by catalysis of the 
two-step oxidation of L-arginine to L-citrulline. As in 
► cytochrome P450 enzymes, the sulfur of a cysteine 
residue coordinates to the heme iron. In addition, 
NOSs feature a conserved tryptophan that forms 
a hydrogen bond with the sulfur atom. It has been 
proposed that the tryptophan modulates the electronic 
properties of the heme and the heme-bound ligands. 
DFT QM/MM calculations have been used to investi¬ 
gate the role of the tryptophan in the product state, i.e., 
NO bound to the heme iron (Fernandez et al. 2005). 
Several studies on the reaction mechanism of NOS 
have also been performed, with small “cluster” models 
with quantum mechanical (QM) electronic structure 
calculations and with ► QM/MM methods on larger 
molecular models incorporating the whole enzyme 
environment (de Visser 2009). These studies focus on 
the initial reaction step, the oxidation of L-arginine to 
A-hydroxo-arginine. It was suggested that due to the 
unique and highly polar L-arginine substrate, the 
catalytic cycle for the first step in NOS diverges 
from cytochrome P450 enzymes after formation of 
Compound I, and Compound I is not the active oxidant, 
in contrast to cytochrome P450 enzymes. Subsequent 
QM/MM calculations considered three different 
possible reaction mechanisms for the first step in 
NOS. Only the pathway that featured a singly proton- 
ated ferric-peroxy complex, combined with formation 
of a cation-radical species shared by the tetrahydro- 
biopterin cofactor and the deprotonated arginine 
substrate, was deemed energetically reasonable and 
could account for experimental data on the reaction 
(Cho et al. 2009). The detailed catalytic mechanism of 
this enzyme is not yet entirely resolved, but it is clear 
that computational molecular simulation and modeling 
provides a crucial means of proposing and testing pro¬ 
posed reaction pathways, because unambiguous experi¬ 
mental observation of the species involved is nigh 
impossible. A study into the nature of the tetrahydro- 
biopterin cofactor bound to NOS is an example of how 
experimental (e.g., EPR) and computational studies can 
be combined to help elucidate further details (Stoll et al. 
2010 ). 
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Definition 

Processes by which drugs are discovered and designed 
with analytical techniques such as NMR. 

Introduction 

Drug discovery is an ongoing and challenging process 
that is fraught with failure, but the successes have had 
profound impacts on human health. Fundamentally, drug 
discovery is a multidisciplinary endeavor requiring, 
among others, bioinformatics and computational chem¬ 
istry, cell biology, medicinal chemistry, enzymology, 
high-throughput screening (HTS), molecular biology, 
protein chemistry, genomics and other “omics” technol¬ 
ogies, and structural biology. Drug discovery is also an 
iterative procedure comprising four stages: target iden¬ 
tification and validation, lead discovery, lead optimiza¬ 
tion, and clinical trials (Betz et al. 2006). In general, the 
drug discovery process starts by identifying a protein 
target associated with drug resistance, the progression 
or pathology of a disease, or the virulence of an organ¬ 
ism. Usually, the protein of interest is chosen because it 
is perturbed when the system is in a disease state, or the 
target is a critical or essential protein, for instance, 
required for microbial or tumor survivability. Part of 
the target identification process also requires understand¬ 
ing the protein’s biological or cellular function and 
potentially obtaining its three-dimensional (3D) struc¬ 
ture. The process of finding an initial set of antagonists 
or agonists is typically accomplished using HTS and 
a library composed of hundreds of thousands to millions 
of compounds. Commonly, three to five chemical clas¬ 
ses from the HTS results are selected as chemical leads 
for further optimization. The selection of chemical leads 
is based on activity, chemical novelty and patentability, 
drug-like characteristics, synthetic accessibility, and 
diversity. Different analogs of the chemical leads are 
synthesized to increase affinity, selectivity, and potency, 
while reducing toxicity issues. Effectively, the goal is to 
evolve the chemical leads into dmg candidates. Overall, 
the dmg discovery process is extremely time consuming 
and expensive, requiring approximately 12 years and 
costing >$800 million dollars. This is due, in part, to 
the complexity of the biological system, limited knowl¬ 
edge about the protein targets, and the inherent difficulty 
of converting an inhibitor to a drag, where only 1 in 


5,000 chemical leads becomes a drag (Light and 
Warburton 2011). Therefore, technological advance¬ 
ments and new methodologies that improve the success 
rate are desperately needed. 

Nuclear magnetic resonance (NMR) is a versatile 
analytical tool that is used in all phases of the 
drag discovery process prior to initiating clinical trials 
(Betz et al. 2006). NMR is routinely used to characterize 
the structures of both small molecules and large bio¬ 
molecules. More importantly, NMR is often used to 
address an initial and important question in the drag 
discovery process: does the ligand selectively and spe¬ 
cifically bind to the protein target in a biologically rele¬ 
vant manner? This is an essential step in evaluating the 
results obtained from HTS, in silico screens, or frag¬ 
ment-based screens, and for evolving chemical leads 
into drag candidates. NMR is uniquely suited for 
detecting protein-ligand interactions, for identifying 
the ligand-binding site, for calculating dissociation con¬ 
stants {Kq), for determining a 3D structure of the protein- 
ligand complex, and for monitoring in vivo activity. 
Furthermore, this information can be obtained using 
HTS-NMR, where hundreds to thousands of potential 
drags can be screened. In addition, cryoprobe technology, 
high-held magnets (>800 MHz), and enhancement 
in NMR pulse sequences and HTS technology have 
substantially increased throughput, signal-to-noise, and 
the detection of low levels of proteins or weak 
binding ligands. Thus, HTS-NMR has been applied to 
metabolomics, and extended to larger chemical libraries 
and larger protein targets (>100 kDa), while simulta¬ 
neously minimizing resources (experimental time, mate¬ 
rial) (Pellecchia et al. 2008). 

An inherent value of HTS-NMR, with respect to 
drag discovery, is its nearly universal application 
(Widmer and Jahnke 2004). HTS-NMR does not require 
any target-specihc setup because the screen is indepen¬ 
dent of the protein’s function. Thus, proteins recently 
identihed by genomics can be screened immediately, 
as long as an adequate amount of labeled or unlabeled 
material is available; and the protein falls within 
the allowable molecular weight (MW) range for the 
specihc NMR experiment. In general, the protein is 
overexpressed and isotopically labeled with *^N and/or 
for structure determination or HTS affinity screens. 
Escherichia coli and Bacillus suhtillis are the most com¬ 
monly used expression systems because their robustness 
permit high cell density and expression rates (Heller and 
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Kessler 2001). The 3D protein structure is critical to the 
drug discovery process because it enables the evaluation 
of the biological relevance of the potential drug candi¬ 
dates. This is accomplished by determining if the ligand¬ 
binding site corresponds to the protein’s active site or 
functional epitope. Similarly, ligand selectivity and spec¬ 
ificity is determined based on the ligand binding to other 
binding sites or proteins. Also, the structure is routinely 
used to annotate functionally uncharacterized proteins 
based on structural homology to proteins with an 
assigned function. The NMR assignments and 3D pro¬ 
tein stractures are typically determined using a standard 
set of 3D triple resonance and NOESY experiments 
(Kanelis et al. 2001). 

HTS-NMR is used to identify chemical leads, and 
validate HTS and in silico screening results. SAR by 
NMR (structure-activity relationship by NMR) was the 
first illustration of HTS-NMR. Since then, there have 
been numerous NMR experiments designed for the sole 
purpose of being used in an NMR ligand affinity screen 
(Pellecchia et al. 2002). Some popular one-dimensional 
NMR experiments (or ligand-focused screens), include 
saturation transfer difference (STD), WaterLOGSY 
(water ligand observation by gradient spectroscopy), 
NOE pumping, and diffusion edited NMR (Widmer 
and Jahnke 2004). The primary goal of these experi¬ 
ments is to use an observable change in the NMR 
spectrum of the chemical lead (peak intensity, peak 
width, chemical shift, etc.) to identify a productive bind¬ 
ing interaction. Advantages of these techniques include 
rapid data collection (<5 min.), minimal sample 
requirements (<10 pM), and no need for isotopically 
labeled protein. But in general, these techniques do not 
provide any information on the ligand-binding site. 
Alternatively, protein-focused screens, SAR by NMR, 
along with Multi-Step NMR, MS/NMR, NMR-SOLVE, 
and SHAPES, among others, are used to identify 
ligand-binding sites in addition to identify binders. 
Importantly, these methods are typically used as part 
of a fragment-based screen that requires determining the 
binding proximity of two or more ligands. The goal 
is to chemically link multiple fragments to achieve 
a corresponding enhancement in binding affinity. 
But, these screens do require isotopically labeled 
proteins and significantly longer data acquisition 
times. HTS-NMR can also be used to evaluate 
in vivo drug toxicity and efficacy by using NMR 
metabolomics protocols. NMR is used to compare 


metabolic profiles before and after drug treatment 
to determine the biological impact of the drug 
candidate. 

Ligand-Focused HTS-NMR 

The relatively low sensitivity of NMR is an important 
issue in its application to HTS. Correspondingly, HTS- 
NMR requires longer data acquisition times and sample 
requirements compared to standard high-throughput 
screens. Importantly, screening hundreds of thousands 
to millions of compounds, while routine for HTS, is 
completely impractical for NMR. Thus, designing 
NMR experiments to minimize both instrument time 
and sample requirements is critical to improve through¬ 
put; and the primary goal of ligand-focused ID NMR 
screens. Simply, a ligand-focused screen monitors 
changes in the ID 'H NMR spectram of the ligand 
resulting from the addition of the protein target. The 
ligand is typically in large excess (> 10-fold) of the 
protein. In general, the ID *H-NMR methods exploit 
large differences in physical properties between small 
molecules and large biomolecules; and correspondingly 
between the free and bound states of the ligand. Eor 
example, a schematic representation in Eig. 1 illustrates 
the effects of differing T 2 relaxation times on ligand 
binding (Eejzo et al. 1999). Specifically, small molecules 
undergo rapid Brownian motion and have correspond¬ 
ingly slow T 2 relaxation times. Conversely, proteins 
undergo slow Brownian motion and have fast T 2 relaxa¬ 
tion times. Therefore, the NMR spectrum of a small 
molecule consists of narrower peaks compared to 
a protein NMR spectmm. As a small molecule binds 
a protein, it will inherit the physical properties of the 
protein. As a result, the NMR peaks of the small mole¬ 
cule will broaden proportionally to the difference in 
line widths and fraction of bound ligand. Ligand-focused 
ID NMR screens have some distinct advantages. The 
NMR experiments are relatively fast (<5 min). Only 
small quantities, ^1-10 pM of unlabeled protein are 
required per NMR experiment. Additionally, the ligands 
can be screened as mixtures of ^4 to >20 different 
compounds with a concentration of ^ 100 pM per ligand. 
Assembling large mixtures can be challenging since it 
may be difficult to find compatible sets of compounds 
that maintain solubility and stability while avoiding 
cross-reactivity. 
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NMR in Drug Discovery - Introduction, Fig. 1 ID ‘H spec¬ 
tra of 1 mM free ligand {bottom) in the presence of 100 pM IMP 
dehydrogenase (top). The significant line broadening indicates 
a ligand-protein interaction. (Reprinted with permission from 
(Fejzo et al. 1999). ©1999 by Elsevier) 


The appearance of the ID NMR spectrum also 
reflects on how rapidly the ligand exchanges between 
the bound and free state. Additionally, the appearance of 
the ID NMR spectrum is influenced by the ligand’s 
affinity to the protein target (Pellecchia et al. 2002): 


_ ^off _ [-^]f [-P]f 
“ Kon ~ [PL] 


( 1 ) 


where is the dissociation constant, K„jf is the rate 
constant for dissociation, Kan is the rate constant for 
formation, [L\p is the concentration of free ligand, [P^p 
is the concentration of the free target protein, [PL] is 
the concentration of the protein-ligand complex. Tight 
binders usually have slow exchange rates and a Kp, of 
less than 10 nM. In this case, a separate signal will be 
observed for both the bound and free ligands, where 
the bound signal is usually broadened beyond detec¬ 
tions. Correspondingly, a binding event is indicated by 
a decrease in the intensity of the ligand NMR spec¬ 
trum. If the ligand binds weakly {Kp, > 10 pM), the 


exchange between the bound and free state is fast and 
the observed spectrum consists of a single signal, 
whose chemical shift and line width is the weighted 
average between the bound and free state. For inter¬ 
mediate exchange, the ligand peak broadening is typ¬ 
ically enhanced. Thus, in the case of fast exchange, 
binding affinities can be estimated from HTS-NMR 
using ID NMR spectra (Shortridge et al. 2008). 
The Kp, can be calculated by measuring the peak height 
and line width using the following equations: 


Kn = 


c[P]j 


B 


single 


-c[P]p]-[L] 


where c = — — 1 and fisinde =1 —- 

Dp ® Ip 


( 2 ) 


where [P]p and [L\p are the total protein and ligand 
concentrations, respectively, and Vp are the line 
width for the bound and free state, respectively, and Ip 
and Ip are the peak height for the bound and free state, 
respectively. In addition to fast exchange, the approach 
also assumes that there is only a single binding site. 
These assumptions are reasonable in the early stages 
of dmg discovery because initial chemical leads will 
generally have weak binding affinities. Obtaining bind¬ 
ing affinities from HTS-NMR have also been described 
for STD experiments, for the displacement of known 
low-affinity inhibitors, or for chemical shift changes 
using NMR with ^^F-containing compounds, 

among other NMR experiments. 


Target-Focused HTS-NMR 

One important advantage of HTS-NMR is the ability to 
detect weak binding ligands, which is a critical factor in 
fragment-based screens. Most target-focused approaches 
to HTS-NMR rely on chemical shift perturbations in the 
protein NMR spectrum, where the protein is either *^N or 
labeled. The ligand is unlabeled and undetected 
by the target-based NMR experiments. Therefore, the 
ligand is still typically in excess (>5-fold) to maximize 
the observation of weak binding ligands {Kp, > 1 mM). 
Two-dimensional (2D) *H-^^N-heteronuclear single 
quantum coherence ('H-*^N-HSQC) experiments are 
commonly used for HTS-NMR. Other common HTS- 
NMR experiments include the 2D *H-'^N-TROSY, 2D 
'H-*^C HSQC, and recent derivatives that enable very 
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NMR in Drug Discovery - Introduction, Fig. 2 (a) Overlay 
of the 2D 'H-*^N-HSQC spectra of B. suhtilis protein YndB with 
an increasing concentration of chalcone ranging from 0 pM 
{blue) to 160 pM {cyan), (b) NMR titration data for trans- 
chalcone (blue), flavanone (green), flavone (purple), and flavo- 
nol (orange). The normalized chemical shift perturbations (CSP) 
for the nine most perturbed residues are plotted versus the 


protein-ligand concentration ratios. The theoretical curve 
displayed for fra/ti-chalcone corresponds to a A'd of 1 pM and 
represents the upper limit for the Ko. The measure Ko values are 
<1 pM (frani-chalcone), 32 it 3 pM (flavanone), 62 ± 9 pM 
(flavone), and 86 ± 16 pM (flavonol). (Reprinted with 
permission from (Stark et al. 2010). © 2010 by John Wiley 
and Sons) 


rapid data collection (e.g., SOFAST). A peak in the 2D 
*H-^^N-HSQC spectrum is correlated to each amino acid 
residue (except proline) present in the protein sequence. 
Each peak is also associated with both a *H and 
chemical shift resulting from the bonded NH (amide) 
group. The relative location (chemical shifts) of each 
peak in the 2D ^H-'^N-HSQC spectrum is strongly 
dependent on the local environment for each amino 
acid residue in the protein structure. Thus, the addition 
of a ligand known to bind the protein results in local 
environmental changes for amino acids in direct contact 
with the bound ligand, and for residues that undergo an 
induced structural change. Correspondingly, chemical 
shift changes are observed for a select number of peaks 
that can be mapped onto the protein structure to identify 
the ligand-binding site. A unique spatial clustering of 
residues that incur a chemical shift perturbation (CSP) 
indicates a specific binding interaction and the location 
of the ligand-binding site. Conversely, a random distri¬ 
bution of residues with CSPs on the proteins surface 
indicates a nonspecific interaction. Alternatively, an 
observation that a majority of residues incur a CSP 
or experience a decrease in peak intensity probably 
infers a ligand-induced aggregation, precipitation, or 
denaturation, which typically eliminates the compound’s 
consideration as a chemical lead. Importantly, the obser¬ 
vation that two or more ligands (fragments) share prox¬ 
imal, but not identical binding sites, based on CSP 


mapping, presents an opportunity to chemically link the 
fragments. The linked fragments are expected to have an 
enhanced affinity and represent a starting point for fur¬ 
ther optimization. 

The addition of an increasing concentration of 
unlabeled ligand will cause a progressive change in 
the protein’s chemical shifts if the ligand-binding 
affinity is weak and in fast exchange (Roberts 2000). 
Thus, a binding affinity can be determined by follow¬ 
ing CSPs as a function of ligand concentration: 


N 


CSP ohs — 


{Ko + [L] + [/>])- ^iKo + [L] + [P]f-m[P]) 


2[P] 


(3) 


where CSP ohs is the 2D HSQC chemical shift 

perturbations, [P] is the protein concentrations, [L] is 
the ligand concentration, and Kq is the dissociation 
constant. Conversely, a tightly bound ligand in slow 
exchange will result in two sets of NMR peaks in the 
HSQC spectrum for the bound and free state. The ratio 
of intensities for the bound and free states can be used 
to determine the fraction bound and measure a Ko from 
a standard binding isotherm. Figure 2 illustrates the 
application of a 2D HSQC titration experiment 

to measure chemical shift perturbations and calculate 
a iCo (Stark et al. 2010). 
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HTS-NMR and Lead Optimization 

It has been estimated that the number of possible chem¬ 
ical compounds is on the order of 10“'^ (Bohacek et al. 
1996). Clearly, obtaining and experimentally screening 
such an immense compound library is impossible. 
Instead, a screening library for HTS-NMR usually con¬ 
sists of only a few hundred to several thousand com¬ 
pounds. Commonly, the HTS-NMR screening library is 
a set of chemical leads from HTS. Obviously, the quality 
of the chemical leads that emerge from HTS is dependent 
on the quality of the original HTS library (Lipinski 
2004). A major source of populating a screening library 
is from prior HTS lead optimization. Unfortunately, this 
often leads to higher MW and more lipophilic com¬ 
pounds due to exploiting hydrophobic interactions that 
increase potency. The result is a compound library with 
reduced pharmacokinetics or pharmacodynamics prop¬ 
erties. In essence, HTS-NMR provides an invaluable 
function by further validating the HTS chemical leads 
and experimentally eliminating compounds with poor 
drug-like properties. HTS-NMR can experimentally ver¬ 
ify acceptable physical behavior, such as good solubility, 
stability, purity, and a lack of aggregation or micelle-like 
behavior. HTS-NMR can also confirm that the chemical 
lead specifically binds the protein target in a biologically 
relevant manner. Additionally, selection of compounds 
can also abide by the Lipinski’s rule of 5, which is based 
on predicted molecular properties important for drug¬ 
like behavior. Of course, strictly following Lipinski’s 
rule of 5 may eliminate potentially interesting and 
novel leads. Instead, a combination of HTS-NMR and 
Lipinski’s rule of 5 provides the most flexibility. Addi¬ 
tionally, ligand efficiency (LE) provides a better 
approach to prioritize chemical leads instead of tradi¬ 
tional measures of relative activity (/C50, or K,). LE is 
deflned as: 


AG -RT\r\Ki 

LE = -=- 

N N 


(4) 


where AG is the Gibbs free energy, Kj is the dissocia¬ 
tion constant for inhibitor binding, and N is the number 
of non-hydrogen atoms. 

Based on LE, lower MW (<300 Da) chemical leads 
provide a better opportunity to successfully evolve 
a compound into a drug (Zartler and Shapiro 2005). 
Similarly, a library of low MW compounds provides 
a more efficient coverage of structural space because 


the number of low MW compounds is signiflcantly 
reduced. The efficiency of low MW compounds for 
drug discovery is the basis of fragment-based screen¬ 
ing. Correspondingly, HTS-NMR using a fragment- 
based library has become an integral part of the drug 
discovery process. A fragment-based screen is com¬ 
plementary to standard HTS because the two chemical 
libraries typically cover different regions of structural 
space. As a result, chemical leads from a fragment- 
based screen tend to be novel compounds and 
distinct from HTS leads. Also, chemical leads from 
a fragment-based screen tend to be more drug-like, 
where the lower MW provides more opportunities for 
optimization and maintaining high ligand efficiency. 
Again, the primary goal behind a fragment-based 
screen is to identify two or more fragments that bind 
in proximal sites on the target protein. Chemically 
linking these fragments in a manner that maintains 
the original binding orientations will result in a new 
lead molecule with enhanced affinity. The process is 
illustrated in Eig. 3. HTS-NMR is well-suited to 
screening a fragment-based library because the low 
MW drug-like fragments are expected to have weak 
binding affinities (Kd > pM - mM) that are readily 
detectable by NMR. Patentability may be a potential 
challenge with fragment-based libraries. Multiple 
pharmaceutical and biotechnology companies will 
use very similar screening libraries since chemical 
fragments of known drugs are a well-defined set of 
compounds. Thus, it will not be surprising to obtain 
similar chemical leads when fragment-based libraries 
are screened against an identical protein target. 


NMR Functional Annotation 

Eunctional annotation of unknown protein targets plays 
an important role in the drug discovery process because it 
allows for identification of novel targets that may be 
involved in human disease. Functional annotation is pri¬ 
marily obtained by sequence or structure homology. The 
majority of the ^19 million proteins identified to date 
(UniPortKB; http://www.uniprot.org) are functionally 
annotated through sequence or structure homology. 
The transfer of functional information is vastly more 
common than the direct acquisition of experimental 
data. But at least 40% of prokaryotic and eukaryotic 
genomes are explicitly annotated as “hypothetical” 
or “uncharacterized” proteins because of a lack of 
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NMR in Drug Discovery - 
Introduction, 

Fig. 3 Applications of SAR 
by NMR method for fragment 
design showing ligand binding 
for (a) matrix 

metalloproteinase 3 (b) Bcl- 
xL. In each case at the top, the 
identified fragment leads are 
shown with cyan carbons, 
whereas the linked compounds 
are denoted with green carbon 
atoms. (Reprinted with 
permission from (Hajduk and 
Greer 2007). © 2007 by 
Nature Publishing Group) 



homology to functionally annotated proteins. HTS-NMR 
can be used to assist in the functional classification 
of unannotated proteins when sequence and structure 
homology fails. 

A fundamental component to understanding a pro¬ 
tein’s function is derived from its interaction partners. 
The identity of ligands and the location of binding sites 
can be leveraged to infer a function in a manner similar to 
sequence and structure homology. This is based on the 
observation that amino acids in an active site or func¬ 
tional epitope are evolutionarily more stable than the rest 
of the protein. Effectively, these residues are required to 
maintain function, where proteins that share similar 
ligand-binding sites are predicted to be functional homo¬ 
logs. FAST-NMR (Functional Annotation Screening 
Technology using NMR) uses HTS-NMR to identify 
ligands and binding sites for functionally uncharacterized 


proteins. FAST-NMR then exploits a stmcture and 
sequence similarity to a ligand-binding site from an anno¬ 
tated protein to infer a function for an uncharacterized 
protein (Powers et al. 2008). A schematic representation 
of the FAST-NMR protocol is illustrated in Fig. 4. 
Unlike traditional HTS-NMR, the screening library con¬ 
tains only biologically active molecules that are active 
against a defined protein target. In essence, the library is 
used as chemical probes to identify structural homologs 
of the natural ligand and the location of ligand-binding 
sites. FAST-NMR uses a tiered approach to HTS-NMR 
that includes a ID *H NMR ligand-focused screen 
followed by a 2D *H-'^N-HSQC target-focused CSP 
screen. The ligand-focused screen is used to quickly 
identify binders, where only the hits are further screened 
in the target-focused CSP screen. The 2D *H-*^N-HSQC 
CSP screen is used to verify binders, identify the 
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NMR in Drug Discovery - Introduction, Fig. 4 Schematic diagram for FAST-NMR. (Reprinted with permission from (Powers 
et al. 2008). © 2008 by Elsevier) 


ligand-binding site, and generate a protein-ligand co¬ 
structure. This structure is then used as an input for 
CPASS (Comparison of Protein Active-Site Stmcture) 
to identify homologous ligand-binding sites from func¬ 
tionally annotated proteins. The CPASS database con¬ 
tains all the unique ligand-binding sites present in the 
RCSB PBD. 

NMR Metabolomics and In Vivo Drug Activity 

One means of analyzing the state of a biological 
system is achieved by monitoring the metabolome - 
all the metabolites present in a cell, tissue, organ, or 
organism. Correspondingly, metabolomics is the study 
of the changes in the concentration and identity of 
these metabolites that results from environmental or 
genetic stress from a disease state or drug treatment. In 
essence, the metabolome provides a chemical finger¬ 
print or signature that uniquely defines the state of the 
system. Metabolomics has an intrinsic advantage over 
genomics and proteomics analysis since observed 
changes in the metabolome are directly coupled with 


changes in protein activity and cell function. Metabo¬ 
lites are the end product of all cellular processes, and 
are a direct result of enzymatic and protein activity. 
NMR metabolomics can assist drug discovery by 
characterizing drug efficacy, selectivity, and toxicity 
(Powers 2009). Importantly, NMR metabolomics may 
identify and eliminate troublesome compounds prior to 
initiating a clinical trial. Thus, HTS-NMR can also be 
used to follow the impact of chemical leads on metab¬ 
olites from cellular extracts (bacteria, human cell lines, 
tumors, etc.) or biofluids (urine, serum, saliva, etc.). 
Common NMR experiments used for metabolomics 
includes: ID 'H NMR, 2D 'H-'H TOCSY, and 
2D HSQC. 

Metabolites are typically detected using ID *H-NMR, 
where peak heights relative to an internal standard 
are used to determine metabolite concentrations. Because 
of the low sensitivity of NMR, only the most abundant 
(^ >1 pM) metabolites are observed. Also, 

a metabolomic sample is a complex heterogeneous mix¬ 
ture containing numerous metabolites. The limited 
spectral resolution and severe peak overlap in a ID 
'H-NMR spectram makes unambiguous metabolite 
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NMR in Drug Discovery - Introduction, Fig. 5 (a) One¬ 
dimensional *H-NMR spectrum of an equimolar mixture of the 
26 small-molecule standards, (b) Two-dimensional 
HSQC NMR spectra of the same synthetic mixture (red) overlaid 


I i I 

lH(ppm) 


onto a spectrum of aqueous whole-plant extract from 
Arahidopsis thaliana (blue). (Reprinted with permission from 
(Lewis et al. 2007). © 2007 by American Chemical Society) 


assignment particularly challenging. Furthermore, since 
the metabolome has not been completely characterized, 
erroneous assignments may result. Alternatively, 2D 


HSQC spectra are commonly used to identify 
metabolites. The higher spectral resolution, the larger 
chemical shift dispersion, and the observation of 
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NMR in Drug Discovery - Introduction, Fig. 6 (a) 2D 

OPLS-DA scores plot demonstrating the clustering pattern for 
12 antibiotics with known biological targets and three com¬ 
pounds of unknown in vivo activity: untreated Mycobacterium 
smegmatis cells, chloramphenicol, ciprofloxacin, gentamicin, 
kanamycin, rifampicin, streptomycin, ethambutoi, ethionamide, 
isoniazid, ampicillin, D-cycloserine, vancomycin, amiodorone, 
chiorprothixene, and clofazimine treated M. smegmatis cells. 
The symbols correspond with the coloring scheme and labeled 
symbols indicated on the tree diagram in (b). The ellipses cor¬ 
respond to the 95% confidence limits from a normal distribution 
for each cluster. The untreated M. smegmatis cells (black 


square) was designated the control class, and the remainder of 
the cells were designated as treated. The OPLS-DA used one 
predictive component and six orthogonal components to yield 
a R^X of 0.715, R^Y of 0.803, and of 0.671. (b) 
Metabolomics tree diagram determined from the OPLS-DA 
scores plot. The coloring scheme and associated symbol for 
each compound in the tree diagram correlates with colored 
symbols in the OPLS-DA scores plot. The bootstrap numbers 
for each node are indicated on the tree diagram. (Reprinted with 
permission from (Halouska et al. 2012). © 2011 by American 
Chemical Society) 


chemical shifts for each C-H pair significantly improve 
the accuracy of metabolite assignment. The metabolomic 
sample does need to be supplemented with a *^C-labeled 
metabolite to avoid long acquisition times because of the 
low natural abundance of (1.1%). Correspondingly, 
the only metabolites observed in the 2D HSQC 

spectmm are derived from the ^^C-labeled metabolite. 
Also, because of variations in J-coupling constants, and 
Ti and T 2 relaxation times, direct quantitation of 
a standard HSQC spectrum to calculate metabolite con¬ 
centrations is unreliable. Nevertheless, modification to the 
HSQC NMR pulse sequence has produced quantitative 
HSQC experiments (e.g., HSQCq, QQ-HSQC, etc.). 
Metabolites can also be quantified by using the fast quan¬ 
tification by NMR method (FMQ), where a series of 
standard metabolites over a range of concentrations are 
screened (Fig. 5). A biological extract can then be 


compared against the standards to determine the metabo¬ 
lite concentrations (Lewis et al. 2007). 

Several databases have been developed that enable 
metabolite identification from experimental chemical 
shift data. The databases contain NMR and mass spec¬ 
troscopy data for thousands of metabolites from 
a broad range of organisms. Therefore, metabolite 
identification needs to be verified with KEGG and 
MetaCyc metabolic pathway databases to verify the 
metabolite is actually present in a specific organism 
(Powers 2009). The NMR metabolomic databases 
include: Human Metabalome Database (HMDB), 
Madison Metabolomics Database (MMCD), and Plat¬ 
form for Riken Metabolomics (PRIMe). 

A common application of NMR metabolomics data 
is to monitor global differences between metabolic 
samples instead of following changes in specific 
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metabolites. This approach involves the use of multi¬ 
variate statistical analysis and pattern recognition. 
Multivariate statistical techniques such as principal 
component analysis (PCA) and orthogonal partial 
least-squares discriminant analysis (OPLS-DA) are 
routinely employed to capture global perturbations in 
the metabolome (Stoyanova and Brown 2002; Bylesjo 
et al. 2006). PCA is an unsupervised method that 
reduces multivariate dataset to a single point and pro¬ 
jects the major variations in the dataset into a few axes 
called principal components. In this way, spectral 
variations are captured in a model that can he easily 
visualized. As a result, similar metabolomic data 
will cluster closely together and separately from 
other distinct metabolomes. Conversely, OPLS-DA is 
a supervised method used to determine the variations 
within the data set that is correlated to the classification 
label (treatment, control, disease state). If the varia¬ 
tions within the data set (noise) are not correlated with 
the classification label, then the variations are filtered 
out, resulting in a single latent vector. This is analo¬ 
gous to PCA, where PCA captures all the intrinsic 
variations within the data set. Applying PCA and 
OPLS-DA to NMR metabolomics data sets is 
extremely valuable for classifying drug activity and 
toxicity in a living system based on relative clustering 
patterns (Fig. 6). This also allows for determining the 
in vivo mechanism of action for a new chemical lead or 
drug (Halouska et al. 2012). 

Summary 

Nuclear magnetic resonance (NMR) is a versatile ana¬ 
lytical tool with a wide range of applications that may 
improve the success rate of drug discovery. NMR uses 
ligand-focused or target-focused screening techniques 
to discover chemical leads with a large range of 
affinities. Correspondingly, NMR is ideally suited 
for fragment-based screening, which complements 
standard high-throughput screens and expands the 
coverage of compound structural space. NMR can 
also be employed to identify the location of 
ligand-binding sites and determine a protein-ligand 
co-structure. This is essential information for linking 
fragments to enhance binding affinity and to evolve 
chemical leads into drug candidates. NMR ligand 
affinity screens can also be applied to assist in the 
functional annotation of uncharacterized proteins to 


identify new therapeutic targets. Finally, NMR 
metaholomics strategies can identify the in vivo 
mechanism of action, and determine the in vivo efficacy 
and toxicity for chemical leads and help alleviate the 
many failures encountered in clinical trials. 
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NMR Methods for Kinetic Analysis 
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University of Liverpool, Liverpool, UK 

Definition 

NMR kinetic analysis aims at deriving reaction rates 
from the characteristics of the NMR spectra. 

Basic Characteristics 

The unique ability of NMR to resolve and monitor 
signals of individual nuclei in different chemical 


environment makes it a powerful method for kinetic 
analysis. NMR spectra are highly sensitive to the envi¬ 
ronment, and no modification of the molecules is nor¬ 
mally required to detect the changes due to the 
interaction and kinetic effects. Its application, how¬ 
ever, is limited by the low sensitivity and relatively 
long time of data acquisition. These restrictions are 
particularly severe for monitoring nonequilibrium pro¬ 
cesses where reaction course is followed after system 
perturbation or mixing, making these experiments gen¬ 
erally suitable only for reaction times longer than sec¬ 
onds. Faster rates can be obtained by the NMR analysis 
of dynamic equilibrium where no net changes occur, 
although the reaction is proceeding at a detectable rate. 

Nonequilibrium Analysis 

The fastest way to monitor a reaction time course is 
through a sequential collection of ID NMR spectra. 
A single scan acquisition takes a fraction of a second to 
complete and, even on high-held spectrometers, suffi¬ 
cient signal-to-noise ratio requires at least tens of scans 
at sub-millimolar concentrations. This restricts the 
delay between each point of the time course to seconds 
or longer. Additionally, there is a dead time before the 
experiment caused by sample insertion, temperature 
equilibration, and magnetic held adjustment. In 
a normal NMR setup, the dead time is measured in 
minutes, although can be dramatically reduced with 
the use of how probes. Due to these considerations, 
even in an optimized setup at high concentrations, the 
measurable rate constants are less than 0.1 s“*. For 
enzymatic reactions, the rate of substrate conversion 
can be easily adjusted by reducing enzyme concentra¬ 
tion, making these reactions amenable to the 
nonequilibrium NMR analysis. 

With a slow enough reaction rates, NMR provides 
a wealth of information for characterizing reaction 
pathways and intermediates. Signals of small mole¬ 
cules are usually well resolved in a ID spectrum, and 
individual reaction products can be identihed and 
followed selectively by monitoring corresponding res¬ 
onances. If ID resolution is not sufficient, a time series 
of 2D COSY or TOCSY spectra can be collected to 
improve resonance separation and help with product 
identihcation. Each of the 2D spectra normally takes 
tens of minutes to acquire, reducing the reaction rates 
available to the analysis. For or *^N enriched mol¬ 
ecules, corresponding HSQC spectra are used to enhance 
selectivity and resolution. Routinely, a sequential series 
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of [*H,'^N]-HSQC spectra are used to determine hydro¬ 
gen exchange rates in proteins (Cavanagh et al. 2007). 

Reactions that can be stopped by changing reaction 
conditions make a special case where faster rates can 
be monitored by NMR. The reaction timescale is then 
determined by the characteristics of the quenched flow 
rather than NMR limitations. Once the reaction is 
stopped, the intermediate concentrations and compo¬ 
sitions of the products can be analyzed by ID or 2D 
NMR spectroscopy, providing detailed characteriza¬ 
tion at each time point. Such analysis is often used 
for study of hydrogen exchange in protein folding, 
where exchange is quenched by a pH jump. 

Chemical Exchange 

In a dynamic equilibrium, the rate of spin exchange 
between different environments has a strong effect on 
the NMR spectra. The exchange process is modeled 
using McConnell modifications of the Bloch equa¬ 
tions, described in details for two spins in (Rule and 
Hitchens 2006; Levitt 2001). Matrix-based multi-spin 
exchanged equations are presented in (Cavanagh et al. 
2007). A general case of multi-spin high-order reaction 
is analyzed in (Ernst et al. 1987). Briefly, in a two site 
exchange A »-► B with the rate constants ki and k_i for 
the forward and reverse reactions, the spectral changes 
caused by the exchange depend on the relationship 
between the exchange rate kex = ki -i- k_i and the 
frequency difference Aro = a)A-®B for the spin in the 
two states. The frequency difference can vary for dif¬ 
ferent spins of the same molecule, resulting in different 
exchange contributions into specific resonances. Five 
exchange regimes are normally identified that lead to 
specific effects on the NMR spectra (Rule and 
Hitchens 2006; Levitt 2001) illustrated in Fig. 1: 

• Very slow, kex two resonances are observed 

for a single spin with negligible effect on the line 
width 

• Slow, kex < two resonances are observed 
broadened by the exchange 

• Intermediate, kex Aco, single very broad reso¬ 
nance is observed with a complicated lineshape 

• Fast, kex > a single broadened resonance is 
observed 

• Very fast, kex ^ Aco, a single resonance is observed 
For two separately observed resonances in the slow 

exchange regime, their relative amplitudes are propor¬ 
tional to the populations of the two states, while for 
a single resonance of the fast exchange regime, the 



« Aco 
< Aco 

~ Aco 


> Aco 


» Aco 


-50 


50 co=27tv (rad/sec) 


NMR Methods for Kinetic Analysis, Fig. 1 Effect of chemical 
exchange on NMR spectra simulated for two states of equal 
population. The frequency difference Aco = 27iAv = 100 rad 
s^', exchange rate constants kex from top to bottom are A: 5, 
B: 20, C: 60, D: 100, E: 150, F: 500, and G: 1,000 s^f Three 
curves are shown for the intermediate exchange to illustrate 
resonance coalescence and maximum broadening. Note that the 
exchange regime is determined by the relationship between k^x 
and Ata (rad s“*) rather than Av (Hz) normally used for reso¬ 
nance frequency measurements 


N 


chemical shift is linearly dependent on the relative 
population. These can be used to evaluate equilibrium 
constant. 


Keq =ki/k_i =Pa/Pbc 

with Pa and pa corresponding to the respective 
populations on the two states. Note that in the fast 
exchange regime, chemical shifts of the individual 
states are required to determine the populations. The 
exchange broadening is maximal when the populations 
are equal. Ligand binding makes a special case of 
the chemical exchange because populations of the 
states can be easily controlled through the total con¬ 
centrations of the components. This allows evaluation 
of the binding constant and chemical shifts of the 
free and bound form for the fast exchange regime by 
analyzing concentration dependence of the NMR 
spectra. 
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Dynamic Equilibrium Methods 

Depending on the exchange regime, different methods are 
used to evaluate the exchange rate. In the very slow range, 
the exchange contribution into the linewidth is negligible. 
However, if the exchange rate is faster than spin-lattice 
relaxation rate Ri, net magnetization can be transferred 
between the exchanging sites. This transfer is detected 
using exchange spectroscopy in experiments identical to 
those used for the detection of Nuclear Overhauser Effect 
(NOE) (Ernst et al. 1987; Rule and Hitchens 2006). In 
a ID version, the signal of the spin in one of the states is 
selectively perturbed, and the change in the intensity of 
the resonance corresponding to the exchanged state is 
monitored. The exchange rate is determined from the 
dependence of intensity change on the transfer time. In 
a 2D version of the experiment, cross-peaks are detected 
between the resonances corresponding to the exchanging 
states and the exchange rate is determined from their 
intensities. Care has to be taken to avoid contributions 
from other magnetization transfer mechanisms. The Ri 
values normally range between 1 s“ * for proteins to <0.1 
s“ * for small molecules, placing the rate constants mea¬ 
surable by the exchange spectroscopy at fastest end of the 
rates accessible by nonequilibrium measurements. 

In the slow exchange regime, exchange contribu¬ 
tion into the linewidth becomes significant and can 
be used to evaluate the exchange rate. If two 
well-separated lines are observed, the increase in the 
linewidths of the two sites is (Rule and Hitchens 2006): 

7lAv'/^A=R2A+ki; 7tAv'/^B=R2B+k-l 

where R 2 A and R 2 B are the intrinsic relaxation rates in 
the absence of exchange. If the intrinsic relaxation 
rates are known or negligible relative to the exchange 
contribution, the exchange rate constants can be 
calculated directly from the linewidths. When lines 
start coalescing, no analytical approximation for the 
lineshapes can be derived and the exchange rate has to 
be evaluated through numerical data fitting. The max¬ 
imum line broadening and complete coalescence of the 
resonances occurs when kex is close to Aco (Fig. 1). As 
the exchange rate increases further, the linewidth 
decreases and the lineshape simplifies with the 
exchange broadening in the fast exchange regime 
equal to (Rule and Hitchens 2006): 

R 2 =PaR2A+PbR2B +PaPb /kex • 


The exchange rate can only be evaluated if the 
populations, chemical shift differences, and the relax¬ 
ation parameters of the individual states are known, 
which is usually possible only for ligand binding. 

Under fast to very fast conditions, the exchange con¬ 
tribution can be derived independently from other 
parameters using relaxation dispersion experiments that 
measure the dependence of the transverse relaxation rate 
on delay between refocusing pulses in CPMG-based 
experiment or on the intensity of the effective magnetic 
field strength in the spin-lock variant (Lipchock and 
Loria 2009; Rule and Hitchens 2006). The exchange 
contribution into transverse relaxation during CPMG 
sequence with the delay between refocusing pulses i 
depends on the multiple T*kex and disappears completely 
if ^ kex- The dispersion curve is obtained by mea¬ 
suring relaxation rates at different i values, and the 
exchange rate constant is derived by fitting the experi¬ 
mental curve into the model equation. Fast transverse 
relaxation in proteins limits the maximum i value to 
^10 ms and the sample heating prevents using values 
significantly shorter than 1 ms, limiting the accessible 
exchange rate constants to ^10-10“^ s“/ Similar rate 
constants can be obtained in the on-resonance spin-lock 
relaxation dispersion experiment where the relaxation 
rate is measured as a function of the spin-lock field 
applied on-resonance, with the fastest rate constant lim¬ 
ited by the intensity of the radio-frequency field. Faster 
rate constants up to ^10^ s“^ are available from the off- 
resonance spin-lock experiments, where the strength of 
the effective magnetic field is increased through the 
contribution of the main magnetic field. 

Overall, NMR offers methods for detailed kinetic 
analysis of processes on the time scale of days to 
fractions of milliseconds. Simultaneous monitoring of 
multiple sites and identification of intermediate states 
and interaction sites provide means to correlate struc¬ 
tural and kinetic information. 

Cross-References 

► Quenched-Flow Methods 
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NMR of Carbohydrates 

► Carbohydrate NMR Spectroscopy 


NMR of Lipids 

Anthony Watts 

Biomembrane Structure Unit, Department of 
Biochemistry, University of Oxford, Oxford, UK 

Synonyms 

Membrane dynamics; Membrane lipids; Membrane 
order; Membrane structure; NMR of membranes 

Definition 

NMR provides a way of probing the molecular 
arrangements that lipids can adopt, their molecular 
order and dynamics, usually in hydrated solution and 
either with or without other components, such as mem¬ 
brane proteins or ions, under a range of conditions of 
temperature, pressure, and pH. The information can 
usually be obtained with minimal perturbation chemi¬ 
cally, although the inherent low sensitivity of NMR 
can present practical limitations. 


deuterium (^H), fluorine (*^F), oxygen (^^O), or 
enriched with carbon (^^C), to enable detection with 
more or less perturbation of the lipid properties 
depending on the nuclear substitution. 

Chemical Analysis of Lipids by NMR 

High-resolution, solution-state (*H, *^C, or ^*P) NMR 
is a routine method for the chemical analysis of lipids, 
either after synthesis or of a naturally occurring, and 
usually purified, lipid. 

The solvent used for NMR study will depend 
on the lipid solubility, but chloroform/methanol and 
dichloromethane/methanol are universally used since 
in these, lipids are often monomeric and readily 
soluble. Carbon tetrachloride is suitable for nonpolar 
lipids. For lipids extracted from natural membranes, the 
solvent mix and temperatures used for extraction can 
change the efficiency of extraction. Around 0.4—0.7 mL 
of sample is required in a 5 mm standard solution-state 
NMR tube, at a concentration of 1-20 mM, but this can 
vary depending on the lipid and solvent. 

For natural abundance proton (*H) NMR, the 
solvents are deuterated to reduce the solvent proton 
signal and provide a lock-signal, but for NMR, the 
solvent natural abundance is not so much of a problem 
and can be reduced digitally, or the sample volume 
decreased thereby increasing the lipid concentration. 
Spectral dispersion is also much larger in NMR 
(^200 ppm) than in *H NMR (^10 ppm), so spectral 
overlap is significantly reduced. 

The usual assignment strategies used in small mole¬ 
cule NMR analysis also apply to lipids, using a routine 
standard for reference, such as tetramethylsilane 
(TMS). Because of the complex electronic distribution 
induced by chain unsaturation, the introduction of one 
or more double bonds gives rise to several extra reso¬ 
nances in the spectrum compared to the saturated chain 
homologues. An extensive range of proton and 
NMR assignments is available from the AOCS Uipid 
Library http://lipidlibrary.aocs.org/index.html. 


N 


Introduction 

Lipids may indigenously contain NMR visible nuclei, 
such as protons (^H), nitrogen (*"*N), carbon (*^C at its 
natural abundances of 1.1%), or phosphorous (^^P), but 
can also be labeled with other nuclei, such as 


Proton NMR of Lipids 

In proton NMR spectra of lipids in isotropic solvents 
(Fig. 1), from lower to high chemical shift (ppm with 
respect to TMS), it is the lipid chain terminal methyl 
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NMR of Lipids, 

Fig. 1 Proton NMR spectra 
of lipids. Methyl linolenate in 
deuterochloroform solution at 
300 MHz (tetramethylsilane as 
internal standard) shows 
features in the spectrum 
corresponding to the protons 
in the fatty acid labeled “a to 
h” (Christie 1982) 


CH 30 -C 0 -CH 2 -CH 2 -(CH 2 ) 4 -CH 2 -CH=CH-CH 2 -CH=CH-CH 2 -(CH 2 ) 3 -CH 3 



protons that are often the narrowest lines (^0.8- 
1.0 ppm), with an envelope (1.6-1.3 ppm) of reso¬ 
nances for those protons at carbons in the center of 
the hydrocarbon chain (^3 to ^8), followed by protons 
at methylene groups either side of the C=C group 
(2.0-2.2 ppm). Protons adjacent to the C=0 group 
are observed next (2.2-2.3 ppm), and methylene 
group protons between the C=C are at ^2.7- 
2.8 ppm. Characteristic resonances are observed for 
olefinic or other unsaturated groups (^5-6 ppm), and if 


the acid is esterified, (as here in Fig. 1), methoxy pro¬ 
tons are usually sharp and at ^3.5-3.7 ppm. Similar 
strategies can be used for phospholipids and sterols. 

Quantification of resonances to aid molecular anal¬ 
ysis is more straightforward in *FI NMR spectra than 
spectra, since the relaxation (and hence integrated 
signal intensity) of is complicated by the proximity 
of protons. 

Although much of the NMR for chemical analysis 
of lipids is now superseded by mass spectrometry, for 
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NMR of Lipids, 

Fig. 2 500 MHz proton ID 
Fourier-transform MAS 
(2.7 kHz) NMR spectrum of 
1 , 2 -dimyristoyl-i) 7 -glycero- 
3-phosphocholine 
(DMPC)-^H 20 (50 wt%) 
at 36°C. Assignments are as 
shown for the glycerol 
(1, 2-CH, 3), head group 
(a, b, g), terminal methyl 
(CH 3 ), and acyl chain 
((CH 2 )io) protons (Adapted 
from Forbes et al. 1988) 



more complex lipid bilayer and membrane NMR, 
especially solid state, magic-angle spinning NMR of 
hydrated lipid bilayers (Fig. 2), these assignments 
in solution are then vital in the interpretation of the 
membrane or bilayer spectra to give molecular details. 

Phosphorous-31 NMR of Phospholipids 

Phosphorous-31 has highly anisotropic, axially 
symmetric magnetic interactions (maximum anisot¬ 
ropy ^ Acj = cJn,CT 33 — <5-12 70 ppm; ^13 kHz at 

500 MHz; given that CTh ^ a 33 (^85 ppm) CJ 22 
(14 ppm) for the phosphoesters of phospholipids). This 
anisotropy is averaged to ^40 ppm (^6 kHz at 
500 MHz) by the fast motion of the phospholipid 
head group around the molecular long axis in a lipid 
bilayer or other polymorphic form, with an order 
parameter (Smoi) of ~0.1-0.2 and within a cone of 
angle ^10-20° (angle 0, where S^oi = iA(<3 cos^ 0 
- 1 >), and the resultant NMR spectra can be diagnostic 
of the lipid polymorphic form. 

Phosphorous-31 in phospholipids shows chemical 
shift sensitivity (1-3 ppm) to the ester moiety at the 
head group, and can therefore be used for chemical 
identification of phospholipid classes in isotropic sol¬ 
vent lipid extracts from natural membranes (Fig. 3) to 
give high-resolution-like spectra. The different phos¬ 
pholipid types in mixed bilayers, however, are not 
usually distinguished in a statistically recorded. 



NMR of Lipids, Fig. 3 Typical NMR profile of 
solvent-extracted phospholipids from bovine white matter. The 
assignments of the resonances are PC, phosphatidylcholine; 
PCplas, PC-plasmalogen; PCa, alkylacyl-PC; PS, phosphati- 
dylserine; SPH, sphingomyelin; PE, phosphatidylethanol- 
amine; PEplas, PE-plasmalogen; PI, phosphatidylinositol; 
P1P2, Pl-bisphosphate; PEx, PCx, unidentified PE and PC 
derivatives; Um, unidentified phospholipids (Adapted from 
Edzes et al. 1992) 

wide-line NMR powder pattern (1-3 ppm is not 
resolved over ^40 ppm wide powder spectra) (Fig. 4). 

Magic-angle spinning (MAS) solid-state NMR can 
be used to average out the chemical shifts spectral 
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NMR of Lipids, Fig. 4 Comparison of the L,, (on the left) and 
hexagonal Hn (on the right) phases of phospholipid water dis¬ 
persions. A schematic representation of their geometry is shown 
at the top. Due to the rapid diffusion around the cylinder axes in 
the Hn phase, both the ^'P NMR (middle) and the NMR 
(bottom) spectra are scaled by a factor of —1/2. The NMR 
spectra can yield both the absolute value of S and its sign. 
Because of their symmetry, the NMR spectra do not provide 
the information about the sign of S. For illustrative purposes, the 
simulated powder patterns are shown on arbitrary horizontal and 
vertical scales, with cOq indicating the Larmor frequency for each 
nucleus 


anisotropy (spinning rate, co,. > chemical shift 
anisotropy, ^6 kHz for phospholipids in bilayers) 
and therefore different lipids types in bilayers can be 
resolved, quantified and any specific interactions with 
ions or proteins quantified. 


Deuterium NMR of Labeled Lipids 

In deuterium NMR, specific proton-deuterium substi¬ 
tutions can be made at various positions in the lipid, or 
alternatively, uniform labeling, usually in the acyl 
chains, is used. Deuterons in the lipid head groups are 


particularly sensitive to surface interactions due to 
ions, pH, and protein interactions. 

Quadrupole splittings from the spin mi = 1 spectra 
are measured for assigned -CD, -CD 2 , or -CD 3 groups, 
or groups of these labeled positions. More usually 
a multiply, or per-deuterated chain or head group 
is used to form bilayers. Because of the low sensitivity 
of ^H as an NMR nucleus, and its wide spectral 
lines (up to 127 kHz; spin = 1; y = 6.54 ^1/7 Hz 
sensitivity of *H), most of the systems studied 
composed of the ^H-labeled lipid, or in mixture with 
other non-labeled lipids where the ^H-lipid was 
still a major component. The quadrupole splittings 
are sensitive to lipid order and lipid dynamics 
(see: ► Lipid Organization, Aggregation, and Self- 
assembly). 


Lipid Polymorphism by NMR 

Large, extended hilayers: Proton NMR spectra of 
lipids in large extended bilayers are usually featureless 
because of the strong proton dipolar broadening 
(^100 kHz) which is not averaged through molecular 
tumbling (x^ (extended bilayers) Z^^lOO kHz). 
However, magic-angle spinning (MAS) and 
decoupling can relieve this broadening to give narrow 
spectral lines (Fig. 2), although very fast (>^15 kHz) 
spinning can centrifuge dispersions in the NMR 
sample tube. 

The anisotropy of phosphorous-31 and deuterium 
nuclei has been found to be most useful in wide-line 
(non-spinning) NMR for characterizing and studying 
large extended phospholipid bilayers, which need to 
have a rotational correlation time (t^) ^the maximum 
anisotropy of the nuclear interactions; ~6 kHz for ^*P; 
^127 kHz for ^H. These static bilayers, on the NMR 
anisotropy averaging timescale, give rise to spherically 
averaged powder patterns (Fig. 4), and characterize 
fast long axis motion of the lipids in bilayers around 
the bilayer normal, and reduced amplitude of motion of 
the head groups and acyl chains within the bilayer. The 
amplitude of motion is characterized by order param¬ 
eters, and the rate of motion by dynamic measurements 
(see: ► Lipid Organization, Aggregation, and Self- 
assembly). 

Small hilayer vesicles: If phospholipid bUayers are 
rendered into small bilayer vesicles by, for example, son- 
ication or extrusion through a filter, the phosphorous-31 
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NMR of Lipids, Fig. 5 The size (radius, R) of a small vesicle 
can be estimated from the line width of the NMR spectrum, 
assuming vesicle tumbling is the major contribution to the 
spectral broadening, which is a good assumption for most 
liquid-crystalline bilayers. Simulated ^*P-NMR spectra of 
dioleoylphosphatidylcholine vesicles of different sizes (radius, 
R) at 30°C using a ^*P anisotropy, (cth — a^) = 3,550 Hz, solvent 
viscosity, p = 0.008P, orientationally independent Lorentzian 
full line width at half height = 60 Hz, Djiff = 6-2 x 10^^ cm^/s, 
and R as indicated (From Burnell et al. 1980) 


acyl chains that occupy a large bilayer volume, can 
form inverted lipid polymorphic phases termed hexag¬ 
onal, Hu phases. The hexagonal packing of lipid tubes 
is confirmed by the long spacing ratios (1/V3; 1/2; 
1/V7...) in x-ray scattering of Hu phase lipids. Phos- 
phorous-31 and deuterium NMR of deuterated lipids 
give rise to inverted powder patterns of half spectral 
anisotropy (^20-25 ppm) compared to the same or 
similar lipids in a bilayer (^40-50 ppm) (Fig. 4) due to 
the averaging of the spectral anisotropy by lateral diffu¬ 
sion down the tubes and around the tube circumference. 

Phospholipids can convert from a bilayer phase at 
lower temperatures to a hexagonal Hu phase at higher 
temperatures as the chain volume increases with 
increasing thermal motions. 

Phosphorous-31 NMR studies have been useful in 
describing polymorphic phase diagrams for phospho¬ 
lipids and helped in the concept of developing area/ 
volume models for understanding lipid polymorphism. 

Cubic, micellar, and rhombic phases: Phosphorous- 
31 NMR spectra of cubic, micellar, and rhombic lipid 
phases show an isotropic spectrum (Fig. 6) due to the fast 
lateral diffusion (Djiff) of phospholipids around the var¬ 
ious curved structures at a rate (l/Ddiff) fast enough to 
average the ^'P CSA (^3-6 kHz; ^20-40 ppm). This 
averaging occurs for cubic phases or bilayer vesicles 
(radius < ^30-50 nm), as well as for micelles, and the 
mechanisms are the same as for small bilayer vesicles. 


Lipid Domains and Phase Separation 
Detected by NMR 


N 


NMR spectmm obtained is for an isotropically averaged 
system. The NMR line width can give an indication of the 
size of the vesicle for ^50-500 nm as a result of the 
averaging of the ^*P chemical shift anisotropy (Fig. 5). 

The averaging arises from both lateral lipid diffu¬ 
sion around the vesicle and vesicle tumbling, due 
to Brownian rotational diffusion, described by 
Tc~* = 6/r^ (Dt + Ddiff) where Dt = kT/STtrtj, tj is the 
viscosity of the medium, and Djiff is the rate of lateral 
lipid diffusion in the bilayer (Dduf lO^^-lO”*^ s). 
However, other factors, such as the lipid thermotropic 
phase, lipid-protein interactions, and ionic interac¬ 
tions, can also reduce the ^*P anisotropy of lipids in 
a polymorphic phase. 

Hexagonal phases: Lipids, notably phospholipids, 
with head groups that occupy a small surface area, and 


Multicomponent lipid bilayers can give rise to NMR 
spectra that can be used to distinguish lipid domains or 
lateral phase separation. For this to be evident, lipid 
exchange between the various separated regions must 
be slow on the timescale of the NMR property being 
exploited. Most usually, phosphorous-31 and deute¬ 
rium NMR are used since the rate of averaging of the 
chemical shift or quadrupolar anisotropies, respec¬ 
tively, falls in the 10^-10“ s range. 

A significant number of phospholipids and lipids have 
been shown to induce lateral phase separation, including 
cholesterol, ceramides, and sphingomyelin, in mixtures 
with phospholipids, usually phosphatidylcholines or 
phosphatidylethanolamines. The size, differential chain 
order, flexibility, and life times of domains and their 
lipids can be described by NMR, which distinguishes 
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NMR of Lipids, 

Fig. 6 Isotropic NMR 
spectra are recorded for cubic, 
micellar, rhombic phases, and 
small vesicles comprised of 
phospholipids 




J_L 
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25 


ppm 


liquid-ordered and liquid-crystalline components as 
a result of the slow, two (or more) site exchange of 
the lipids on the NMR anisotropy averaging timescale 
(^4-6 kHz for ^*P; ^127 kHz for ^H) between the 
different phases (liquid-ordered, liquid-disordered, etc). 

Lipid Order Parameters by NMR, S^oi 

Often lipid bilayers are described as “fluid” or as having 
“flexible” chains - such definitions lack physical units or 
definition, and so order (in ° as an amplitude, or an order 
parameter, Smoi) and dynamics (as a correlation time, Tc, 
or rate of motion, ic”*) are required to describe fully 
lipid behavior (Fig. 7). 

Lipid order is determined by the amplitude of acyl 
chain motion within a polymorphic lipid phase, most 
usually bilayers. Order is defined by an order parameter 
Smol- 

S^oi = 1/2(3 <cos20 - 1>) 

where 0 is the angle of a specific moiety (often a reporter 
group) with a given reference such as the bilayer normal. 
Since 0° > 0 > 90°, the order parameter S is defined as 


0>ISI>l;S=lis defined for all-tmns, ordered chains 
and S = 0 for isotropic, disordered chains. 

NMR order parameters are measured from anisotropy 
averaging of an NMR property, such as quadmpolar 
interactions (AVq) in deuterium NMR or chemical shift 
(A) in phosphorous-31 NMR. For ^^F-labeled lipids, the 
NMR spectral anisotropy arises from residual chemical 
shift anisotropy (cn — a^); the proton-fluorine 
intrachain interactions (Ai) are modulated by motional 
amplitude to give ^^F NMR spectra from which Smoi can 
be determined. 

Since the value of Smoi is defined as a time-averaged 
(< >) property over the amplitude (0) of acyl chain 
motion, the rate of motion (or as a correlation time, 
Tc~*) of the reporter needs to be fast with respect 
to the NMR property (for ^H in lipid chains, AVq(n,ax) 
^127 kHz, and so Tc~' < ^10”"^ s) being measured. 

The way in which S^oi is measured experimentally 
depends on the nucleus being detected. For specifically 
^H-labeled lipids, the individual quadrupolar splitting 
(Avq in Hz) can be determined directly (Fig. 8) and Smoi 
calculated from: 

|Smol| — AVq^Qbs^/AVq^ max) 









NMR of Lipids 


1733 


N 



Segmental 

Fluctuations 


Molecular 

Diffusion 

^MD 


Membrane 

Deformation 


10-'2 


10-9 


10-6 


10-3 


NMR Quadrupolar / Chemical Shift Anisotropy / Magnetic Dipolar Lineshapes 
Infrared / Raman Spectroscopy Inelastic X-ray / Neutron Scattering 

NMR Spin-Lattice Relaxation NMR Spin-Spin / Rotating-Frame Relaxation 
Nuclear Spin Relaxation Dispersion (Spin-Lattice / Spin-Spin / Rotating-Frame) 
Atomistic MD Simulation / Coarse Grained MD Simulation Continuum Elastic Modelling 


NMR of Lipids, Fig. 7 NMR spectroscopy reveals phospho¬ 
lipid membrane dynamics and structure over a range of time- 
scales. Phospholipid mobility is characterized by segmental 
fluctuations, molecular diffusion, and viscoelastic membrane 
deformation. Orientational fluctuations correspond to geometry 
of interactions via Euler angles O and by correlation times of 
the motions, (a) Principal axis system of *^C-*H or C-^H bonds 
fluctuates due to motions of internal segmental frame (/) with 
respect to the membrane director axis (D). (b) Diffusive 


phospholipid motions are described by anisotropic reorientation 
of molecule-fixed frame (M) with respect to the membrane 
director axis (D). (c) Liquid-crystalline bilayer lends itself to 
propagation of thermally excited quasi-periodic fluctuations in 
membrane curvature expressed by motion of the local membrane 
normal {N) relative to membrane director axis (D). The appro¬ 
priate range of timescales of various complementary biophysical 
methods is indicated (From Leftin and Brown 2011) 


N 


where Avq(n,ax) = 3/2(e^qQ/h) ^127 kHz for hydro¬ 
carbon chains. 

Here, molecular averaging around the long molec¬ 
ular axis is assumed, which is a good approximation 
for acyl chains in phospholipid bilayers. However, if 
the axis about which averaging occurs does not coin¬ 
cide with the molecular axis, for example, in -C=C- 
kinks in chains, or in the head groups of phospholipids, 
then Smoi 7 ^ ScD> the order parameter of the -CD group. 
Now: 


^(obs) 


= s 


bilayer ’ *^00 


Sbilayer = 1/2(3 <COS^0 - 1 >) 
ScD = 1 /2(3 < cos^a — 1 >) 
Scoii = 1/2(3 < cos^4) - 1 >) 
8^01= 1/2(3 <cos2p-l>) 


ScD — Scoll ■ Smol Sintra 


Sintra = 1/2(3 <COS^Y “ 1 >) 


where (Scoii) represents the fluctuations of the bilayer 
normal, (Smoi) the reorientations of the molecular long 
axis around the bilayer normal, and (Sintra) the 
reorientations of the C-D bond vector with respect to 
the molecular axis (Fig. 8), giving: 


The deuterium NMR spectra for per-deuterated lipid 
chains comprise an overlap of all individual labeled 
positions and two approaches are used to extract order 
information. Firstly, individually assigned resonances 
can be identified either directly (e.g., for the lower part 
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NMR of Lipids, Fig. 8 Typical deuterium NMR spectrum (left) 
for a single -CD2 on an acyl chain of a phospholipid in hydrated 
bilayers. The quadrupole splitting (AVq in Hz) is measured as 
shown and can be used to give information about local order of 
the lipid chain, as an order parameter, Scd (Adapted from Leftin 
and Brown 201 1). When the axis of motional averaging does not 
coincide with the long molecular axis (D* ), an d is at an angle (0) 
with respect to the applied magnetic field ( Bo ), the observed CD 
vector is now tilted at an angle a with respect to the bilayer 


normal. If the rotation of lipids about different axes are indepen¬ 
dent, then the order parameter may be represented as the product 
of all individual contributions Scd = Scoii • Smoi • Sintra- The 
bilayer normal is now subject to fluctuations around the vector 

Tf within a cone (angle (p), whereas the molecular director d is 
at an angle b to the bilayer normal, and the observed CD vector is 
at an angle y with respect to the molecular director (Adapted 
from Vermeer et al. 2007) 


NMR of Lipids, 

Fig. 9 A typical deuterium 
NMR spectrum of a per- 
deuterated acyl chain 
phospholipid (here dsq, 

1 ,2-dimyristoylphosphatidyl 
choline) in bilayers (light red 
line) showing multiple 
overlapping quadrupole 
splittings that can be 
deconvoluted (black lines) 
into the individual spectral 
components for each C-atom 
(Adapted from Leftin and 
Brown 2011) 



DMPC 
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Lipid conformation in membranes 



NMR of Lipids, Fig. 10 Normalized order profiles of 
different bilayers. Variation of the molecular order 
parameter, Smoi, with the segment position. O, 1,2- 
dipalmitoyl-i;j-glycero-3-phosphocholine. A, l-palmitoyl-2- 
oleoyl-OT-glycero-3-phosphocholine. □, 1,2-dipalmitoyl-OT- 
glycero-3-phosphoserine. x, Acholeplasma laidlawii (Adapted 
from Seelig and Browning 1978) 

of the chain and terminal -CD3) or de-Paking routines 
that can be used to deconvolute individual values of AVq 
(Fig. 9). 

Secondly, the first moment (Mi) of a ^H-NMR 
spectral envelope from per-deuterated lipids can give 
an averaged order change for a bilayer perturbation, 
induced, for example, by other lipids, peptides, or pro¬ 
teins. M is defined as: 


Ml = l/ASIcOilil 


the head groups of lipids, is smaller than in similar 
saturated lipids or than expected, and to determine 
Smob some assumptions about the axis of motional 
averaging need to be made to estimate Scoii and Sintra. 


Lipid Dynamics by NMR 


Often lipid bilayers are described as “fluid” or have 
“flexible” chains - such definitions lack physical units 
or definition, and so order (in ° as an amplitude, or an 
order parameter, Smoi) and dynamics (as a correlation 
time, Tc, or rate of motion, ic *) are required to 
describe fully lipid behavior. 

The timescale sensitivity (ps-ms) and versatility of 
nuclei (^H, '^C, ^'P, ^H) that can be used in NMR of 
lipids allows a detailed description of lipid dynamics to 
be constructed (Fig. 7) using NMR relaxation param¬ 
eters (Ti; T 2 ) and chemical shift or quadrupolar anisot¬ 
ropy averaging. 

NMR spin-lattice relaxation rates measured for 
lipids in bilayers show a profile (Fig. 11) with very fast 
chain motion at the center of the bilayer, and a gradient 
of motional rates from the bilayer surface to the center, 
with complex dynamics in the glycerol region. The 
frequency selection of the NMR experiment permits 
a more detailed quantification of dynamic timescales 
for individual positions. 

Double bonds in lipid chains induce a complex 
dynamic behavior in which ally lie groups show increased 
and vinylic groups reduced motional rates. 


N 


where A = S Ij, the total area under the spectrum and I, 
is the intensity of the ^H-NMR spectra at frequency cd;. 
The origin of the frequency axis is usually defined as 
the center of symmetry of the spectrum, and the limits 
for the Mj integration enclose the observed spectrum. 

A characteristic deuterium NMR derived order 
parameter profile across a bilayer shows a similar 
order parameter (ISmoil 0.4) for acyl chain methylene 
groups in the upper half (glycerol - ^C8/10) of the 
bilayer and then a decreasing order parameter from the 
center of the acyl chains (^C8/10) to very large ampli¬ 
tude of motion for chains in the center of the bilayer 
(Fig. 10). Similar order profiles are also observed in 
natural membranes (Fig. 10). 

Since Sen = Scoii ■ S^oi ■ Sintra (see above), the 
measured quadrupole splitting for deuterated segments 
at the -C=C- bonds in unsaturated lipid chains, or in 


Lipid Hydration and Ionization State 

Lipid hydration increases lipid mobility as a result 
of the hydration-induced surface area, and polar 
head groups undergo conformational adjustments. 
Deuterium and phosphorus-31 NMR are suitable 
methods for sensing dynamics of lipids in response 
to hydration and ionization state of lipids or charged 
lipophiles. The ^H-NMR spectrum is sensitive to both 
order and rate of surface motion, through changes in 
the quadrupole splitting (Fig. 12) and relaxation rates 
(Fig. 13). 

Ionization states for lipid head groups and for 
charged lipophiles solvated in bilayers can also be 
measured from ^H-NMR spectra of head-group deu¬ 
terated lipids. The bilayer surface environment is very 
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NMR of Lipids, 

Fig. 11 High -resolution 
'^C NMR spin-lattice 
relaxation rate profiles for 
liquid-crystalline DPPC show 
a significant NMR detection 
frequency dependence of 
motional rates for lipids in 
bilayers. Spin-lattice 
relaxation rates are 
presented for various carbon 
positions (i) of DPPC at 50°C 
at NMR Larmor frequencies of 
(X) 15.04 MHz, (M) 

20.00 MHz, (♦) 25.15 MHz, 
(T) 45.29 MHz, (A) 

90.80 MHz, (•) 125.76 MHz, 
and (■) 150.84 MHz. (a) The 
glycerol backbone and choline 
head group are resolved in 
natural abundance NMR 
in the 50-80 ppm region and 
exhibit a pronounced 
dispersion of the relaxation 
rates, (b) Acyl chain segments 
are observed in the 0-40 ppm 
flngeiprint region of the high- 
resolution ''’C NMR spectrum. 
The molecular structure of 
DPPC and assignments 
corresponding to the carbon 
index (i) are shown 


\ /_Cy 



DPPC 



different from a homogeneous aqueous environment 
and consequently pKa and electrostatic behavior is 
significantly different for lipid-solvated lipophiles. 
For example, fatty acids and charged anesthetics have 
shifted pKa behavior when in bilayers compared with 
when in aqueous solutions. 

Lipid Partitioning by NMR 

NMR is a useful tool for determining the partition 
behavior of lipophiles into lipid bilayers and mem¬ 
branes. If the lipophile can be NMR labeled, for exam¬ 
ple, with deuterium, then NMR resonances from the 
lipophile in the aqueous phase are resolved from the 
lipophile dissolved in the bilayer. If two populations 
are observed, then exchange of the partitioning 


component is slow on the NMR timescale of the 
nucleus and property being observed. The advantage 
of this approach is that equilibrium measurements can 
be made without the need for filtration or centrifuga¬ 
tion used in conventional partition coefficient assays. 

Lipid-Protein Interactions by NMR 

NMR of lipids can be used to report on protein-lipid 
interactions in bilayers since proteins perturb the lipid 
order and/or dynamics. Proteins may induce lipids 
to phase-separate within the bilayer, in which case 
two or more NMR spectral components are observed 
(see: Lipid domains and phase separation detected by 
NMR), or proteins may induce an averaged perturba¬ 
tion on the lipid order and/or dynamics giving rise to 
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NMR of Lipids, Fig. 12 ^H-NMR quadrupole splitting Avq 
recorded as a function of the water/lipid ratio n„ of DOPC 
multibilayers at SO^C. The data points for the a (O), (5 (A), and 
Y (□) segments of the choline head group were curve-fitted using 
(1) {dotted lines). The inset shows the linear correlation between 
the a and (3 splittings, fitted by the straight line which gives 
a gradient of m ~ — 1 




NMR of Lipids, Fig. 13 ^H-NMR spin-lattice relaxation rates, 
l/Ti, recorded as a function of the water/lipid ratio n„ of DOPC 
multibilayers at 30°C, for the a (circle), (3 (triangle), and 
Y (square) segments of the choline head group. The isothermally 
recorded data points were curve-fitted (dotted lines) using (1). 
The dashed line for the y segment represents the hypothetical 
results calculated for a uniformly reduced temperature 
(T^ed = 0.18), to take into account the effect of the phase 
transition on the molecular dynamics in the bilayer 



NMR of Lipids, Fig. 14 The deuterium NMR spectra 
(upper) from deuterated lipid head group (phosphatidylcholine, 
-N'^(CD 3 ) 3 ) are responsive to the presence of an integral protein 
(from pure bilayers to 65:1, lipids: protein, molar basis) in the 
bilayers. The collapse of the quadrupole splitting (lower) 
with increased protein content indicates fast lipid-protein inter¬ 
actions on the deuterium NMR anisotropy averaging timescale 
(rate < ^10"* s) 


single component averaged spectra (Fig. 14). Both 
situations have been used to study lipid-protein inter¬ 
actions by wide-line and magic-angle spinning NMR 
of lipids. 

As an approach to study the specificity of peptide 
and protein interactions at the structural level, solid- 
state NMR recoupling methods (REDOR and rota¬ 
tional resonance) can be used to obtain information 
about the distances between nuclei in lipids and 


membrane-associated peptides at high resolution. The 
specificity of lipid-peptide interactions is therefore 
defined and can give biological insights into antimi¬ 
crobial or toxins interactions with membranes. 

NMR spectra of lipid-protein interaction can give 
information about the specificity of intermolecular inter¬ 
actions, the degree of perturbation of lipid order and 
dynamics by proteins, and structure for a membrane- 
embedded protein. 
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Summary 

NMR can be used as a tool to reveal molecular details 
about lipids in many different polymorphic and sample 
forms, and help gain an understanding of thermody¬ 
namic behavior, molecular interactions with other 
membrane components, ions, and pH effects on lipid 
properties. The intrinsic insensitivity of the method 
compared with some other biophysical methods is 
partly outweighed by the lack of chemical perturbation 
in observing either naturally NMR visible isotopes, or 
through non-perturbing labeling. 

Cross-References 

► '■’F NMR 

► Lipid Domains 

► Membrane Fluidity 

► Membrane Lipid Electrostatics 

► NMR 

► Protein NMR Resonance Assignment 

► Residual Dipolar Coupling 
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Interactions - Introduction 
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Synonyms 

Ligand-binding interactions; NMR 


Introduction 

Entire DNA sequences of hundreds of organisms 
(http;//www.ebi.ac.uk/genomes/) and nearly 80,000 
coordinate sets of macromolecules (http://www.rcsb. 
org/) are now available at the click of a mouse. The next 
step in trying to understand life is to determine how the 
molecules in a cell interact with each other, temporally 
and spatially, in coordinated networks. Work has begun 
to define this “interactome” and impressive and bewil¬ 
dering maps have been constracted using genetics, 
affinity purification, and bioinformatics protocols 
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NMR Studies of Macromolecular Interactions - Introduc¬ 
tion, Fig. 1 An illustration of the two main types of protein- 
protein interaction observed, (a) shows a complex formed by two 
structured domains, (b) shows a disordered region of a protein 
binding to a structured domain. In (b) there is an entropic cost in 
forming the complex which tends to reduce the affinity without 
reducing the specificity 


regions and these regions are often involved in protein 
interaction networks; indeed they often act as “hubs” 
for promiscuous interactions with several different 
partners (Tompa et al. 2009). 

Interactions between two proteins are often 
mediated by other factors. One example is the 
reduction of dimensionality of a search (Adam and 
Delbriick 1968), for example, binding of two proteins 
to a membrane (Fig. 2a) or DNA will increase the 
effective on-rate kon- Local concentrations can also 
be increased by mutual binding to a scaffold protein 
or by tethering two domains together (Fig. 2b) 
(Kuriyan and Eisenberg 2007). There has also been 
increased interest recently in the role of “encounter” 
complexes that form in a rapid binding step, followed 
by formation of the final complex (see below and 
Fig. 2c). 

NMR Methods for Detecting Interactions 


(Bader et al. 2008; http;//thebiogrid.org/). In addition to 
these, still error-prone, network maps, detailed kinetic 
and structural information about the complexes formed 
are needed. Solution state NMR is increasingly recog¬ 
nized as a unique and powerful tool in this area. A search 
of the PDB database reveals that over 1,000 of 9,300 
NMR structures are of protein-ligand complexes. 
NMR, however, gives not only structure but also valuable 
kinetic, dynamic, and thermodynamic information about 
complexes. It is especially useful for analyzing the 
numerous weak but specific interactions that occur in 
the cell. In this brief entry I will focus on protein-protein 
interaction studies. Other reviews on this topic 
(Zuiderweg 2002; Bonvin et al. 2005; O’Connell et al. 
2009) and related topics, such as protein-drug and pro¬ 
tein-nucleic acid (Bonvin et al. 2005) interactions are 
available. 

Protein-Protein Interactions 

At the structural level, protein-protein interactions can 
be considered in two main classes (Bader et al. 2008), 
one where two structured domains form a complex 
(Fig. la) and the other where a complex is formed 
from a domain and a disordered peptide segment of 
another protein (Fig. lb). A large number of proteins 
(^30%) in the various genomes contain disordered 


NMR description of the P + L ^ PL equilibrium: 
The NMR parameters describing a protein in free (P) 
and bound (PL) states will in general be different. Key 
parameters defining the two states include the 
chemical shifts ((5p and (5 pl) and the relaxation times 
(Ti, 2 p and Ti 2 pl). The NMR signals that are observed 
from free and bound states depend on kon and koff in 
Fig. 1. The exchange regime can be “fast,” where 
one averaged signal is observed for free and bound 
states, with NMR parameters that are a weighted 
mean of each state. The exchange regime is said to be 
“slow” when two non-averaged signals are observed 
for the free and bound states. “Intermediate” exchange 
is when the lines are broadened by the exchange 
process; koff and ((5p — (5 pl) are key parameters in 
determining the observed exchange regime, with fast 
exchange being observed when koff ^ ((5p — (5 pl). 
Intermediate exchange often causes detection 
problems because of broad lines (careful design of 
experiment can help; Reibarkh et al. 2006) but this 
regime has the advantage that the exchange parame¬ 
ters, koff etc., can be determined by line-shape analysis 
(see below). Since protein-protein exchange reactions 
usually have a relatively small range of diffusion- 
limited kon values, the observed exchange regime is 
correlated with affinity; if = koff/kon > 10“^ M, the 
exchange regime is often “fast” while if < 10~® M, 
the exchange rate is often “slow.” 


N 
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NMR Studies of 
Macromolecular 
Interactions - Introduction, 
Fig. 2 Illustration of protein- 
protein interactions mediated 
by (a) a membrane; 

(b) another protein and 
a tethered domain; (c) by the 
initial formation of an 
encounter complex 






NMR Studies of Macromolecular Interactions - Introduc¬ 
tion, Fig. 3 Illustration of the CSP experiment. HSQC 

spectra of the light gray protein give single peaks for each amide 


proton. On addition of the black protein, peaks, a-d, are 
perturbed more than the others. These are likely to occur at the 
protein-protein interface 


Chemical shift perturbation (CSP): There are 
powerful two-dimensional NMR experiments that 
can be used to monitor the shifts of specific groups 
in a protein. These include the HSQC (heteronuclear 
single quantum coherence) and TROSY (transverse 
relaxation optimized spectroscopy) experiments 
(reviewed in Cavanagh et al. 2007). In a '^N-labeled 
protein all amino acids, except proline, can give rise to 
single peaks in *H-'^NHSQC or TROSY spectra. 
These individual cross peaks can be assigned 

to specific backbone resonances in the protein with 
standard experiments on proteins labeled with 
and When a ligand is added to a sample, some 


resonances shift (or broaden) selectively as a result of 
complex formation. These perturbations can be used to 
locate the binding sites. A weighted shift plot is often 
used [e.g., V j(A(5N/5)^ -i- A(5 h }] that takes account of 
the different shift ranges of different nuclei (e.g., A(5 n 
the induced shift in the dimension is ^5 x the shift 
induced in the *H dimension, A^h)- Assigned reso¬ 
nances that experience significant shift perturbation 
can be mapped on a known structure (Fig. 3). 

CSP is used extensively in NMR studies of protein- 
ligand interactions. In the fast exchange regime, it has 
the advantage that, as well as the interaction surface, the 
affinity of the interaction can be obtained from a plot of 
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NMR Studies of Macromolecular Interactions - Introduc¬ 
tion, Fig. 4 Illustration of differential isotope labeling of two 
proteins. Isotope-filtered pulse sequences can then be used to 
detect intermolecular NOE selectively, e.g., between groups 
containing and ‘H-‘^C or and '^N 

and have relatively low natural abundance but can be readily 
incorporated in recombinant proteins expressed in bacteria or 
yeast. Deuterium (^H) labeling is also often used, especially for 
larger systems because it can reduce relaxation mechanisms and 
simplify the spectra 

induced shift against concentration of added ligand. 
A limitation is that the mechanism of perturbation is 
not generally well understood and induced conforma¬ 
tional changes cannot readily be distinguished from 
surface contacts. CSP is, however, relatively easy to 
apply and is very informative if assignments are avail¬ 
able. The method can be applied to very large systems; 
examples include the proteasome (Sprangers et al. 
2008) and GroEL (Fiaux et al. 2002) and even the 
formation of protein complexes in a cellular environ¬ 
ment (Inomata et al. 2009). CSP also provides useful 
input in docking procedures (see below). 

Nuclear Overhauser effect: A major goal in inter¬ 
action studies is to define the structure of the complex 
formed. Experimental restraints can be provided by 
paramagnetic probes and residual dipolar coupling 
(see below) but observation of pairwise dipolar 
interactions between assigned nuclei via the nuclear 
Overhauser effect (NOE) remains the most potent and 
precise way to define the structure of a complex. 
An influential early example of the application of 
NOEs in structure determination was a study of 
a calmodulin peptide complex (Ikura et al. 1992). 
The detection of intermolecular NOEs is greatly 
facilitated by differential isotope labeling (Fig. 4a). 
Elegant NMR isotope Altering methods have been 
developed to differentiate the spectra of labeled and 
unlabeled protein (Cavanagh et al. 2007) and to detect 
intermolecular NOEs. 



NMR Studies of Macromolecular Interactions - Introduc¬ 
tion, Fig. 5 Illustration of cross saturation. An unlabeled larger 
system is irradiated with radio frequency to saturate the 'H 
signals. This saturation is transferred to the smaller isotope 
labeled protein in the complex and residues in the interface 
region (shown as white) have reduced intensity; this reduction 
is observed in the “free” form of the ’ labeled protein which is 
present in excess 


There are now hundreds of examples of structures 
of protein-protein complexes solved by solution state 
NMR. NMR has been particularly effective in studies 
of the disorder/order type of interaction shown in 
Fig. lb. An example from our own work is the 
determination of the structure of a complex between 
a beta integrin tail and its intracellular talin ligand 
(Wegener et al. 2007). A notable example of the 
domain-domain class of interaction is the complex 
formed between the N-terminal domain of enzyme 
I and the histidine phosphocarrier protein (Clore and 
Iwahara 2009); this structure was defined using a wide 
range of NMR restraints including NOEs, residual 
dipolar coupling, and paramagnetic centers (see 
below). 

Cross saturation: Another important mapping 
method is “cross saturation” where a relatively large 
unlabeled protein is selectively irradiated with radio 
frequency so that all the *H resonances are “saturated” 
(their net magnetization is reduced to zero). Association 
of an isotope-labeled smaller protein with the irradiated 
protein can be detected because 'H groups in the inter¬ 
face region also become saturated (or “bleached”) by 
dipolar interactions with the saturated resonances of the 
larger protein; this results in intensity reductions in 
specific resonances of the smaller protein (Shimada 
et al. 2009; Fig. 5). Variations of this approach have 
also been applied; for example by altering the isotopic 
labeling patterns in the two proteins (Ferrage et al. 2010). 

Kinetics. NMR can also contribute significantly to 
an understanding of the kinetics of complex formation. 


N 
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NMR Studies of Macromolecular Interactions - Introduc¬ 
tion, Fig. 6 Illustration of residual dipolar coupling effects. 
Weak alignment of a protein is induced by the introduction of 
space restrictions within the sample; information about the size 
of the dipolar coupling, e.g., between '^N and *H in an N-H 
group, can then be extracted. This information can be related 
to the orientation of the N-H bond vector with respect to the 
direction of the applied field, Bq 

The lineshapes of peaks are sensitive to the exchange 
rates, especially kon, and simulation programs are 
available to fit the observed shapes to kinetic parame¬ 
ters. An example is a study of the binding of different 
peptides to an SH2 domain in intermediate exchange 
(Ladbury et al. 1996). Transfer of magnetization 
between free and bound states can also be observed in 
the slow exchange regime using saturation 
transfer difference spectra and transferred NOEs (Pons 
et al. 2008). Relaxation dispersion measurements have 
also become popular (Sugase et al. 2007). In this 
experiment, T 2 relaxation is measured by a train of 
180° pulses; measurement of T 2 as a function of pulse 
separation gives information about the ligand exchange 
rates on a timescale that complements other methods. 
The method can also be used to investigate ‘invisible 
states’ in exchanging systems (Hansen et al. 2008). 

Mapping with amide-proton exchange. Protein 
interfaces can be mapped using the protection of 
amide-proton exchange that arises when protein- 
protein interactions shield the interface area from 
solvent (Zuiderweg 2002). However, amide-proton 
exchange rates are generally slower in a ligand 
complex than in the free protein and interpretation in 
terms of a precise interface has to be done with caution. 

Residual dipolar coupling: Dipolar interactions 
between a pair of nuclei lead to relaxation and NOEs, 


but they also cause a splitting of NMR signals that 
depends on the intemuclear separation and relative ori¬ 
entation of the pair in the applied magnetic field. In 
solution these dipolar splittings are normally averaged 
out by rapid tumbling but they can be partially 
reintroduced by inducing fractional alignment of the 
molecules in the NMR sample. Weak alignment can be 
introduced in various ways, including compressed gels, 
bicelles, and filamentous phage (see e.g., Bax 2003; Liu 
and Prestegard 2010). The small residual dipolar cou¬ 
plings (RDCs) seen using these weak alignment media 
(^10 Hz) give useful information about how some 
groups, e.g., NH, CH, are aligned relative to the external 
magnetic field. This is very useful in obtaining relative 
domain orientation using docking programs (see below). 

Paramagnetic relaxation and encounter complexes. 
Paramagnetic probes are very useful tools for stmctural 
studies of macromolecules and their complexes 
(reviewed in Otting 2010). A free electron has 
a magnetic moment 658 x larger than a *H, so large 
induced relaxation and chemical shifts can be observed 
for resonances of groups in the vicinity of a probe. 
Nitroxide spin labels can be used to explore exposed 
surfaces (Deschamps et al. 2005), or they can be 
covalently attached to a single, surface-exposed cysteine 
introduced into the protein. Paramagnetic metal ions are 
also very useful, especially with new tags designed to add 
specific binding sites for lanthanide ions or Mn^"^. 
A notable recent use of paramagnetic probes has been 
in the study of encounter complexes (Fig. 2c). The Clore 
group studied interactions between the N-terminal 
domain of enzyme I and HPr. This 40 kDa complex is 
involved in the transfer of phosphoryl groups between 
histidines. EDTA-based Mn^"^ binding sites were intro¬ 
duced at three separate sites on HPr, and intra- 
and intermolecular relaxation effects measured (Clore 
and Iwahara 2009). The intramolecular relaxation data 
observed for HPr were consistent with the known struc¬ 
ture but the intermolecular features indicated the forma¬ 
tion of nonspecific encounter complexes. The Ubbink 
group observed large encounter areas in the electron 
transfer complex formed between yeast cytochrome 
c and cytochrome c peroxidase (Bashir et al. 2010). 
The significance of these observed encounter complexes 
is not yet entirely clear but they illustrate, along with the 
other factors shown in Fig. 2, how biological systems can 
fine-tune the kinetics and affinity of complex formation. 

Docking. In many cases, NMR alone does not pro¬ 
vide sufficient information to produce a well-defined 
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structure of a protein-protein complex; docking 
calculations can then be very powerful. The backbone 
of the protein structure is usually assumed to remain 
unchanged in the complex, but conformational 
flexibility of the protein side chains is allowed. 
Docking algorithms contain two main components: 
a search of all the possible geometries to create a 
large family of possible protein complex structures 
followed by an evaluation of these geometries based 
on energy terms. Several powerful docking programs 
are now available (e.g., HADDOCK; de Vries et al. 
2010, Bonvin this volume); these can incorporate data 
from CSP, mutagenesis, and other known properties 
such as symmetry and the proximity of catalytic site 
residues. Docking protocols are also available that 
incorporate RDC data (Berlin et al. 2010). 

Summary 

NMR studies of interactions between proteins and 
various ligands give not only structural information 
about the complexes formed but also kinetic, dynam¬ 
ics, and thermodynamic information. A number of 
powerful NMR methods that can be applied to study 
complexes are briefly described. 

Cross-References 

► Cross Saturation/Transferred Cross Saturation 
(TCS) 

► HADDOCK 

► Protein Dynamics and Allostery 

► SAR by NMR 
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Weontae Lee 

Department of Biochemistry, Yonsei University, 
Seoul, South Korea 

Definition 

Unlike conventional proteomics, structural proteomics 
deduces biological function of unknown gene products 
from the three-dimensional structure of a protein by 
structural biology, bioinformatics, molecular biology, 
and biochemistry. The success of structural proteomics 
relies on that protein functions are related to their 
three-dimensional structures, providing crucial infor¬ 
mation about its biological functions. In this regard, 
structural proteomics approach is one of the powerful 
research areas in the post-genomic era, promising 
putative biochemical and cellular functions of a num¬ 
ber of uncharacterized gene products. Recently, a num¬ 
ber of pioneering groups have successfully predicted 
their biological functions based on three-dimensional 
structures of hypothetical proteins. The tools of 
structural proteomics are a number of structure deter¬ 
mination techniques, including NMR spectroscopy. 
Especially, advances in hardware design, data acquisi¬ 
tion methods, sample preparation, and automation 
of data analysis in NMR spectroscopy enable high- 
throughput structure determination of proteins in phys¬ 
iological condition. These efforts ensure that NMR 
spectroscopy will become an important methodology 
in structural genomics. Therefore, NMR-based struc¬ 
tural proteomics will deposit a comprehensive struc¬ 
tural database to lead the biological functions of 
hypothetical proteins identihed by the human genome 
project. 


Cross-References 

► Macromolecular Crystallography: Overview 

► NMR 

► NMR-based Structural Proteomics 

► Protein NMR - Introduction 

► Protein Structure Comparison Methods 

► Structural Genomics 


Noncanonical Hydrogen Bonds 

► XH/t: Interactions 


Nonlinear IR Spectroscopy of Proteins 

► Ultrafast Two-Dimensional Infrared Spectroscopy 
of Proteins 


Nonlinear Raman Spectroscopy: 
Coherent Anti-Stokes Raman 
Scattering (CARS) 
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Synonyms 

Raman spectroscopy 

Definition 

Nonlinear Raman spectroscopy is a spectroscopy 
based on nonlinear effects and involving Raman-active 
molecular transitions. 
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Coherent anti-Stokes Raman scattering is generated 
in samples due to third-order nonlinear susceptibility 
and requires three laser beams to produce a signal 
beam. 


The energy diagrams of CARS are shown in Fig. 1. 
The CARS signal arises from the nonlinear 
polarization induced along a Cartesian coordinate 
(Boyd 1992) 


Introduction 

Coherent anti-Stokes Raman scattering (CARS) pro¬ 
vides analytical and spectroscopic information 
pertaining to Raman-active resonances in gases, liq¬ 
uids, and solids. Unlike second-order processes, CARS 
is a third-order nonlinear optical effect and, therefore, 
is applicable to isotropic, as well as anisotropic media. 
The signal due to CARS processes is considerably 
higher than spontaneous Raman scattering. Moreover, 
the CARS signal is spatially and temporally coherent 
and propagates like a laser beam. Collectively, these 
properties make CARS a superior tool for obtaining 
spectra of luminescent samples including fluorescent 
materials, impurities, combustion systems, and dis¬ 
charge systems. 

In spite of many advantages, CARS has its draw¬ 
backs. One of the most serious disadvantages of CARS 
as an analytical tool is radiation due to nonresonant 
part of the third-order susceptibility which limits 
detection sensitivity of the technique. Strong perturba¬ 
tion of CARS spectrum due to interference between 
neighboring resonances, background and electronic 
transitions is another drawback which complicates 
spectral analysis. 

Over the past years, the CARS technique has found 
two important applications, namely, combustion diag¬ 
nostics and CARS microscopy. 

The CARS Phenomenon and Theoretical 
Description of CARS Spectroscopy 

The phenomenon of coherent Raman scattering was 
first reported by scientists at the Ford Motor Company 
in 1965 (Maker and Terhune 1965). Since its introduc¬ 
tion in the early 1970s, CARS spectroscopy has been 
widely used as a spectroscopic tool (Levenson and 
Kano 1988). CARS is a four-wave mixing process in 
which a pump field £((Ui), a Stokes field £(co 2 ), and 
a probe field £(< 1 ) 3 ) interact with a sample and generate 
an anti-Stokes field £(< 7 ) 4 ) at the frequency of 

ffl4 = ffll — CO 2 + ®3- 


£p)(cB4) = £)^xp«(-®4;ffli;-ra2;c»3) 

Jkl ( 1 ) 

Ej{coi)El{(j}2)Ei{Q)i), 

(3) 

where Xijki is the third-order susceptibility, which in an 
isotropic material can be expressed in the form: 

~ zi 122^ij^kl + + 1^221 ^H^jk (2) 

A general expression for the third-order susceptibility 
can be written as: 

y(3) = _ ^ _ , y(3) 

Q-{cOp- CO,) - ir„ 


cOf — 2c0p — iE, 

where Q is the vibrational frequency; Fr and F, are the 
half width at half maximum of the Raman line and that 
of the two-photon transition, respectively; Ar and A, 
are constants representing the Raman scattering and 
the two-photon absorption cross sections. The first 
term in (3) is a vibrationally resonant contribution 
(Fig. la). The second term is a nonresonant contribu¬ 
tion that is independent of Raman shift (Fig. lb). The 
third term is an enhanced nonresonant contribution due 
to two-photon electronic resonance (Fig. Ic). In most 
experiments, the pump and probe fields are derived 
from the same laser beam, £((Ui) = £( 013 ). Due to 
momentum conservation, CARS signal generation 
needs to fulfill the phase-matching condition, 
|k 4 — (2ki — k 2 )|/ << 71, where / is the interaction 
length and ki, k 2 , and k 4 are wave vectors of the 
pump, Stokes, and CARS fields, respectively 
(|k| = conjc, where c is a speed of light and n is 
a refractive index) (Shen 1984; Levenson and Kano 
1988; Mukamel 1995). Due to dispersion, especially in 
the condense phase samples, the collinear beam propa¬ 
gation results in small interaction length and the incident 
beams must be properly aligned to satisfy phase¬ 
matching conditions. Such spatial alignment also helps 
to spatially separate CARS signal beam from powerful 
input laser beams, which otherwise might leak into 
detector. 


N 
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Nonlinear Raman 
Spectroscopy: Coherent 
Anti-Stokes Raman 
Scattering (CARS), 

Fig. 1 Energy diagram of 
CARS, (a) Resonant CARS 
characterized by 

— (ct>i — C02) — irR]. 
(b) Nonresonant CARS from 
an electronic contribution 
(Xm ), where the dotted lines 
indicate virtual states, (c) 
Electronic contribution 
(A,/[o], - 2coi - /r,]) 
enhanced by a two-photon 
resonance of the pump beam 
associated with an excited 
electronic state 


Phase matching condition 


C Electronically 
excited state 



®t 


Resonant 

contribution 


Nonresonant 

contribution 


Two-photon enhanced 
nonresonant contribution 


Nonlinear Raman 
Spectroscopy: Coherent 
Anti-Stokes Raman 
Scattering (CARS), 

Fig. 2 Multiplexed detection 
and BOXCARS beam 
geometry for combustion 
research 
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Application of CARS Spectroscopy to 
Combustion Studies 

The application of CARS spectroscopy to study com¬ 
bustion was first demonstrated in 1973 by Taran and 
coworkers (Regnier and Taran 1973). In the following 
years, three major breakthroughs made the technique 
applicable to real systems: (1) development of multiplex 
CARS with broadband dye lasers which allowed single 
pulse measurements; (2) introduction of the BOXCARS 
beam geometry for better spatial resolution; and (3) adap¬ 
tation of a technique for elimination of nonresonant 
background. The initial implementations of CARS 
were based on nanosecond (ns) laser systems and used 
a narrowband (^0.001 cm”') pump laser and a broad¬ 
band 150 cm”') Stokes laser to excite entire manifold 
of rovibrational transitions (see Fig. 2a). 


For temperature measurements, nitrogen is a mole¬ 
cule of choice due to its abundance in air-fed reacting 
flows. To excite N 2 , which has vibrational band origin 
at 2,330 cm”*, the narrowband second harmonic from 
a Nd:Yag laser (at 532 nm), and the broadband emis¬ 
sion from dye laser at around 607 nm can be used as 
a pump and Stokes beams. The pump beam is usually 
split in two in order to arrange beams in BOXCARS 
configuration (see Fig. 2b). Detailed review of the 
application of ns-CARS spectroscopy to combustion 
diagnostics is provided in Ref. Eckbreth (1996). The 
later development of ns-CARS has been focused on 
simultaneous detection of multiple species along with 
temperature and its applications to harsh environ¬ 
ments. In order to excite rovibrational transitions 
in pairs of molecules such as [N 2 , CO 2 ], [N 2 , CHJ, 
[O 2 , CO], [N 2 , 02 ], and [N 2 , H 2 ], the dual-pump CARS 
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was developed. Later this technique was extended 
to triple-pump CARS, dual-pump dual-broadband 
CARS, and dual-broadband rotational CARS. For 
detailed review, see Ref. Roy et al. (2010). 

With the introduction of commercially available 
compact picosecond (ps) and femtosecond (fs) lasers 
systems, ps- and fs-CARS measurements within time 
scales free from collisional effects became possible. 
The reduction of nonresonant background in ns-CARS 
is employed via various polarization schemes; how¬ 
ever, these methods are ineffective in high pressure, 
high temperature environments of real combustors due 
to window-induced birefringence. Contrarily, ultrafast 
pulses allow to use time domain to eliminate 
nonresonant background due to different time dynam¬ 
ics of resonant and nonresonant signals. In addition, 
extremely high peak power of fs lasers leads to high 
signal levels that allow a single shot detection of 
CARS signal at kHz rates compared to ^10 Hz rates 
achievable with ns-CARS. Detailed review of applica¬ 
tion of fs-CARS spectroscopy to combustion can be 
found in Refs. Gord et al. (2008) and Roy et al. (2010). 

CARS Microscopy 

The application of the CARS process to microscopy 
opens up a new method for chemical imaging. In 
CARS microscopy, the temporally and spatially 
overlapped pump and Stokes laser pulses are tightly 
focused into a sample to generate a signal in a small 
excitation volume (<1 pm^). A CARS image is 
acquired by raster scanning the sample or the laser 
beams. The vibrational contrast in CARS microscopy 
arises from the signal enhancement when cOp — (o^ is 
tuned to a Raman-active vibrational band. CARS 
microscopy offers the following advantages over tra¬ 
ditional techniques: (1) Chemical bond selectivity. 
Because the contrast in CARS microscopy arises 
from intrinsic molecular vibrations, it is possible to 
produce images with chemically selective contrasts 
when the laser frequency difference {cOp — cu^) is 
tuned to different Raman bands. (2) Large signal 
level. As a coherent process, the CARS fields from 
different molecules have a well-defined phase relation¬ 
ship. The coherent addition results in a quadratic signal 
increase with respect to the density of molecular oscil¬ 
lators, versus the linear increase of spontaneous 
Raman signal. For a bulk liquid, the CARS signal can 


be larger than the spontaneous Raman signal by 9 
orders of magnitude (Levenson and Kano 1988). 
(3) High imaging speed. The large signal level in 
CARS microscopy enables high-speed imaging which 
is important for live cell and tissue studies. In laser¬ 
scanning microscopy with high repetition rate (in MHz 
range) lasers, an image acquisition speed at one frame 
per second has been achieved (Cheng et al. 2002a) and 
has been recently increased to 20 frames per second 
(Evans et al. 2005). The high imaging speed not only 
avoids respiration-induced image distortion during 
in vivo imaging, but also allows real-time inspection of 
highly dynamic systems. (4) Suhmicron 3D resolution. 
The nonlinear optical excitation ensures that the signal is 
only generated at the center of the focus, offering CARS 
microscopy an inherent 3D spatial resolution. The lateral 
and axial resolutions with a 60X water immersion objec¬ 
tive (numerical aperture, NA = 1.2) were measured to 
be 0.23 and 0.75 pm,(Cheng et al. 2002a) which allows 
detection of subcellular stractures in a tissue environ¬ 
ment. (5) Multimodal imaging. A unique advantage of 
CARS is that other NLO imaging modalities, including 
two-photon excited fluorescence (TPEF) and second 
harmonic generation (SHG), can be naturally incorpo¬ 
rated into the same platform, enabling real-time visuali¬ 
zation of various components in a tissue. 

A modem high-speed, multifunctional CARS 
microscope (Eig. 3) for biological applications origi¬ 
nated from multistep technical advances (Cheng and 
Xie 2004; Evans and Xie 2008) and theoretical study 
(Cheng et al. 2002b) of CARS microscopy. 

Important features of the microscope are: 

Collinear beam geometiy. Noncollinear beam 
geometry is used in CARS spectroscopy to fulfill the 
phase-matching condition (Levenson and Kano 1988). 
When the interaction length becomes very small 
(^1 pm) under the tight focusing condition in micros¬ 
copy, the phase-matching condition is fulfilled with 
a collinear beam geometry for forward-detected 
CARS (E-CARS) (Zumbusch et al. 1999). The collin¬ 
ear beam geometry greatly simplifies the optical align¬ 
ment and is a key step in producing high-quality CARS 
images. 

Laser-scanning on a confocal microscope platform. 
The most significant advantage of CARS over sponta¬ 
neous Raman is large signal level of CARS allowing 
for high-speed vibrational imaging. This advantage has 
been realized by scanning pulsed lasers of high repeti¬ 
tion rates (in MHz), which resulted in an image 
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Spectrometer 


Nonlinear Raman Spectroscopy: Coherent Anti-Stokes 
Raman Scattering (CARS), Fig. 3 A high-speed CARS micro¬ 
scope. Pump and Stokes laser beams are collinearly combined, 
directed into a laser-scanning box, and focused into the samples. 
Emitted photons are collected by a condenser for forward- 
detected CARS (F-CARS) or by the focusing objective for epi- 
detected CARS (E-CARS). PMTl and PMT2 are detectors for 
F-CARS and E-CARS, respectively. The pump laser is also used 


for generating spontaneous Raman signals from an object visu¬ 
alized by CARS. A spectrometer with a pinhole attached to the 
side port of the microscope is used for confocal recording of 
spontaneous Raman spectra. Although not shown, sum- 
frequency generation (SFG) and two-photon excited fluores¬ 
cence (TPEE) imaging can be performed on the same platform. 
PH pinhole, DM dichroic mirror, L lense, PMT photomultiplier 
tube 


acquisition speed of a frame per second (Cheng et al. 
2002a). Video rate imaging can be realized by scan¬ 
ning the laser with a polygon mirror or a resonant 
scanner. 

Near IR laser excitation. Using the near IR lasers 
allows to avoid two-photon electronic enhancement of 
the nonresonant background, reduces the photodamage 
induced by multiphoton absorption, and diminishes 
tissue scattering leading to increased optical penetra¬ 
tion depth. 

Forward-detection versus epi- (i.e., backward-) 
detection. For objects that are comparable or larger 
than the wavelengths of incident lasers, the CARS 
signal goes forward as a result of constructive addition 
of electric fields. Importantly, because the CARS sig¬ 
nal Is highly directional in the forward direction, an air 
condenser is sufficient for collection of the F-CARS 
signal. For example, F-CARS is suitable for imaging 
lipid bodies or myelin sheath in cultured cells or sliced 
tissues. For objects with an axial length much smaller 
than the excitation wavelengths, the phase-matching 
condition is fulfilled in both forward and backward 


directions. This backward propagating CARS signal 
from small objects constitutes the first mechanism for 
epi-detected CARS (E-CARS). E-CARS also arises 
from the discontinuity of at the interface, and 
backscattering of forward CARS photons. E-CARS is 
important for live animal imaging where the epi- 
detected signal arises from backscattering of the for¬ 
ward propagating CARS photons by the tissue. 

Multimodal NLO imaging. An important advantage 
of CARS microscope is that other NLO modalities 
including two-photon excited fluorescence (TPEE) and 
sum-frequency generation (SEG) can be implemented on 
the same platform. Multimodality is important because 
different NLO imaging methods have their distinctive 
advantages: TPEE can be used to visualize proteins, ions 
with fluorescent labeling, or specific autofluorescent 
structures; SFG is selective to non-centrosymmetric 
molecular assemblies such as collagen fibrils or crystals; 
and CARS is naturally sensitive to lipid-enriched struc¬ 
tures such as adipocytes. The pulsed laser beams for 
CARS could be simultaneously used for TPEF and 
electronic SFG imaging. As a two-beam modality. 
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Nonlinear Raman 
Spectroscopy: Coherent 
Anti-Stokes Raman 
Scattering (CARS), 

Fig. 4 CARS images of (a) 
cytoplasmic lipid droplets in 
the small intestine cells and 
(b) axonal myelin sheath in 
spinal white matter. The 
CARS signal arises from the 
symmetric CH 2 stretch 
vibration 



Small intestine White matter 


CARS microscopy is mostly operated with picosecond 
(ps) pulses, either from two synchronized Tiisapphire 
lasers or from a synchronously pumped OPO system. 
Picosecond pulse excitation not only provides sufficient 
spectral resolution, but also increases the ratio of reso¬ 
nant signal to nonresonant background. Although tun¬ 
able ps laser systems operating in the NIR range are 
widely accepted for high-speed CARS imaging (for 
reviews see Cheng and Xie 2004; Cheng 2007; Evans 
and Xie 2008), the reduced efficiency of nonlinear opti¬ 
cal (NLO) process caused by longer pulse duration hin¬ 
ders wide application of ps lasers to TPEF and SHG 
imaging. To overcome above limitation of ps-CARS, 
high-speed CARS imaging of lipids with fs laser pulses 
has been demonstrated. Commercial CARS microscopes 
with either ps or fs laser sources are now available 
through Leica and Olympus, respectively. 

Coupling CARS imaging with spontaneous Raman 
spectroscopy. It has been realized that CARS and spon¬ 
taneous Raman have their distinctive advantages: CARS 
permits high-speed imaging by focusing the energy on 
a single Raman band while spontaneous Raman permits 
complete fingerprint analysis at a pixel of interest. Based 
on this understanding, a compound Raman microscope 
in which the same ps laser source is used for high-speed 
coherent Raman imaging and confocal Raman spectral 
analysis has been developed and applied for analysis 
of lipid-rich structures such as sebaceous gland in 
mice (Slipchenko et al. 2009) and lipid species in 
Caenorhahditis elegans (Le et al. 2010a). 

CARS microscopy has been demonstrated to be 
highly sensitive and selective to lipid-rich structures 
that are abundant of CH 2 groups. In particular, CARS 


microscopy permits chemically selective imaging of 
two important lipid-rich structures, cytoplasmic lipid 
droplets, and axonal myelin sheath (Eig. 4). With such 
capability, CARS microscopy has found broad appli¬ 
cations in the field of lipid biology in recent years (Le 
and Cheng 2009; Wang et al. 2009; Le et al. 2010b). 


Summary 

CARS spectroscopy is a mature analytical tool. The 
applications of ns-, ps-, and fs-CARS spectroscopy to 
gas-phase reacting flows enabled noninvasive measure¬ 
ments of temperature and concentrations of major species 
present in the reacting flows with accuracy on the order of 
few percent. The CARS microscopy is based on chemical 
selectivity of CARS and offers a high-speed imaging and 
inherent 3D resolution as well as ability to be coupled 
with other imaging modalities on the same platform. 
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Cross-References 

► Sum Frequency Generation Vibrational 
Spectroscopy 

► Two-Photon Excitation Fluorescence STED 
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N-Type Inactivation in 
Voltage-Gated Potassium 
Channels, Fig. 1 N-type 
inactivation of voltage-gated 
channels conferred by 
either the N-terminus of the 
pore-forming a-subunit (top 
panel) or the N-terminus of 
P-subunits (lower panel). 

In each case the first 20 amino 
on the N-terminus form a 
ball-peptide that binds within 
the pore once the channel has 
opened - blocking conduction 
and inactivating the channel 


N-type Inactivation (pore-forming subunit) 
Closed Open 



Closed 


N-type Inactivation (induced by p-subunit) 
Open 



H 


Inactivated 



Inactivation in Kv Potassium Channels 

Voltage-gated channels (Kv) close rapidly by 
a dynamic process known as inactivation, which is 
a key component of channel function. Inactivating 
channels modulate membrane excitability. N-type inacti¬ 
vation involving the N (amino)-terminus of the channel 
(hence the name “N-type inactivation”) was initially 
found in Shaker-related channels, the inactivation of 
which can be removed by intracellular enzymes or the 
N-terminal deletion of the a-subunit channel protein 
(Hoshi et al. 1990; Zagotta et al. 1990). It was later 
confirmed that rapid inactivation could also be conferred 
by a synthetic peptide having the same sequence as the 
proximal N-terminus of the a-subunit or the N-terminus 
of the p-subunit (Fig. 1). These experimental studies were 
unified with the development of the “ball-and-chain” 
model in which a positively charged inactivation particle 
(the ball) on a tether (the chain) binds and blocks 
the channel on the intracellular side of the membrane 
(Armstrong and Bezanilla 1977). 

The inactivation ball, also known as the inactivation 
gate, responsible for closing of the channel, is formed by 
the first 20 amino acids on the cytoplasmic N-terminus of 
the channel and the contiguous 40 or more residues 
constitute the chain. The inactivation ball possesses 
two essential chemical characteristics. The first 10 
residues are hydrophobic, and the remaining 10 are 
hydrophilic. The chain segment, which is rich in positive 
charge, pushes the ball toward its receptor site when the 
membrane is depolarized (Zhou et al. 2001). It has been 
shown that this N-type inactivation occurs through 


a sequential reaction of a dynamic nature in which the 
gate initially binds to the T1 domain surface by electro¬ 
static interactions (Gulbis et al. 2000) and then enters 
through the lateral portals to the inner pore and occludes 
ion movement (Zhou et al. 2001). influx from extra¬ 
cellular elevation of extracellular potassium expels the 
inactivation gate from its binding site, which results in 
accelerated recovery form inactivation. The T1 domain 
which directly precedes the first transmembrane domain 
(SI) of Kv a-subunit is a highly conserved cytoplasmic 
portion consisting of ^130 amino acids responsible for 
driving and regulating the tetramerization of the pore¬ 
forming a-subunit. In Shaker-type K"^ channels, rapid N- 
type inactivation is followed by a slower C-type inacti¬ 
vation which involves a constriction of the extracellular 
mouth of the channel pore (Yellen et al. 1994, see also 
► Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis). 

Kv4 channels inactivate rapidly and recover quickly 
from inactivation. The T1 domain of the Kv4 N-terminus 
features intersubunit Zn^"^ binding sites (Scannevin et al. 
2004). The structure of the T1 domain includes four zinc 
ions bound to the T1 tetramer, and each Zn located at 
the interface between two adjacent monomers is 
involved in intersubunit contacts. The Kv4 N-terminus 
is involved in a fast inactivation with features that man¬ 
ifest N-type inactivation. However, N-terminal deletions 
of Kv4 channels only partially eliminate or slow the fast 
phase of inactivation, rather than resulting in a complete 
loss of inactivation as observed in the Shaker channel, 
suggesting that Kv4 inactivation may differ from classi¬ 
cal Shaker N-type inactivation. 
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N-Type Inactivation in 
Voltage-Gated Potassium 
Channels, 

Fig. 2 The co-crystal 
structure of the 
Kv4.3 N-terminus/ 
KChlPlcomplex, revealing 
the clamping action formed by 
the KChIPs and how the 
KChIPs interact with the 
N-terminus of Kv4.3 subunit 
to suppress channel 
inactivation 



Cytosolic Kv channel-interacting proteins (KChIPs, 
216 ^ 256 amino acids) that belong to the neuronal 
calcium sensor (NCS) family of calcium binding EF-hand 
proteins co-assemble with Kv4 (Shal) subunits to form 
a native complex that encodes somatodendritic A-type 
current, Isa, in neurons and transient outward current, 
Ito, in cardiac myocytes (An et al. 2000). The specific 
binding of KChIPs to the Kv4 N-terminus enhances 
surface expression, facilitates subunit assembly and reg¬ 
ulates gating properties of Kv4 channels. The co-crystal 
structure of the Kv4.3 N-terminus/KChIPlcomplex 
reveals there is a clamping mode in which a single 
KChIPl molecule acting as a monomer laterally clamps 
two neighboring Kv4.3 N-termini in a 4;4 manner, with 
two contact interfaces being involved in the interaction 
(Pioletti et al. 2006; Wang et al. 2007). Within the first 
interface, the proximal N-terminal hydrophobic peptides 
of Kv4.3 reach out into a deep elongated groove and are 
sequestered by binding to the deep hydrophobic pocket 
on the surface of KChIPl, resulting in physical suppres¬ 
sion of Kv4 inactivation by KChIPl (Fig. 2). 


Cross-References 

► Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 
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Synonyms 

NMR 

Introduction 

The nuclear Overhauser effect (NOE) is the transfer of 
nuclear spin polarization from one nuclear spin popu¬ 
lation to another via dipolar cross-relaxation. The phe¬ 
nomenon has been known since the early days of NMR 
and is a generalization of Albert Overhauser’s proposal 
that nuclear spins could be polarized by irradiation of 
electron spins (Overhauser 1953), which was verified 
experimentally the same year (Carver and Slichter 
1953). The theoretical framework for also describing 
transfer of polarization to one nuclear spin population 
from another such population was laid down two years 
later (Solomon 1955). 


dt 


= -(Iks + W, + W2)P.a + WsPaP 
+ W,Pp, + W2Ppp + K 


( 1 ) 


where is a constant that ensures that the population 
Paid returns to its equilibrium value. At equilibrium, all 
derivatives with respect to time are zero and therefore 
the value of K can be calculated as 

K={Ws + W, + W2)Pl - WsP% - WjPl, - W2 PIp 

( 2 ) 


The differential equation can be modified to 
instead concern deviations from the equilibrium 
population by 


dAPa 

dt 


= -{Ws + Wj + W2)AP.a + WsAPaft 
+ WiPpa + W2APPP 
For the other states one similarly gets 
dAPap 


(3a) 


dt 


■ = -(Ws + W, + Wo)AP,p + WsAP, 
W,Ppp + WoAPp, 


dAP 




dt 


dAP 




dt 


= -{Ws + Wi + Wo)APp, + WsAPpp 
+ WjP OLO. + WqAP ap 

= -{Ws + W, + W 2 )APpp + WsAPp, 
+ W,P,p + W 2 AP,, 


(3b) 


(3c) 


(3d) 


Theory 

Spin-lattice relaxation for interacting spins can be 
treated theoretically by considering the rates of transi¬ 
tions between energy levels, Ik, (Bloembergen et al. 
1948). If the eigenstates for a two-spin system are 
labeled \mims), the energy level diagram of Fig. 1 
can be drawn. 

The different processes are denoted as flip transi¬ 
tions if Amj = ±1 or Am^ = ±1, flip-flip transitions if 
Ani! -f Ams = ± 2 , and flip-flop transitions if 
Ami + Ams = 0, A»i/ 7 ^ 0. The differential equation 
that governs the population of state |aa), P^^i, is 


Since the expectation values for the operators 
corresponding to longitudinal magnetization, I^if) 
and S,{t), are equal to h{t) = P^a + Pap - Pp^ - Ppp 
and S-(f) = Pxx — Pap + Ppa. — Ppp, the equations can 
be reformulated as 


= - (iTo + 21 k/ + lk2)A/,(z) 
dt 

- {W2 - lko)AS,(r) 

— ^ ^ = — (Vko + 2lk5 + lk 2 )A 5 z(z) 

dt 

- (Ik2 - lko)A/z(t) 


(4a) 


(4b) 
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Nuclear Overhauser Effect, Fig. 1 Energy level diagram for 
the two-spin system IS with all possible transition rates indicated 





Usually the rate constants are written as 


p, = Wo + 2Wi + W 2 

(5a) 

Ps = Wo + 2Ws + W 2 

(5b) 

(Tjs = W 2 — VFo 

(5c) 


where pi and pg are the longitudinal auto-relaxation 
rate constants and tt/j is referred to as the longitudinal 
cross-relaxation rate constant. The auto-relaxation 
rates, pj and p^, may or may not be equal and the 
cross-relaxation rate constant ajs is positive if 
W 2 > Wo, negative if W 2 > Wq, and vanishes if they 
are equal (cf.). With these definitions, the expressions 
governing the time evolution of AI^ and AS- become 

= -p,M-Xt) - (6a) 

- (J,sM^{t) (6b) 

These equations are called the Solomon 
equations (Solomon 1955) and can be used to 
describe longitudinal relaxation of dipole-dipole 
coupled spins. 

In the presence of cross-relaxation, the return of 
I, to thermal equilibrium after a perturbation is 
typically not monoexponential with rate constant 
Pi but will depend on the spin state of S as exem- 
plihed with the following experiments (Cavanagh 
et al. 2007). For the selective inversion recovery 
experiment, where a 180° pulse is applied to I but 
not S, followed by a relaxation delay, t, and a read 
pulse one gets 



- 


saturation 

NucI 

ear Overhauser Effect, Fig. 2 

Pulse sequences for (a) the 


transient NOE experiment and (b) the steady state NOE differ¬ 
ence experiment. Narrow (wide) filled rectangles denote 90° 
(180°) pulses while the open rectangle in (b) represents satura¬ 
tion by closely spaced pulses or continuous wave irradiation. The 
experiment is repeated without this element 


^ = 1 - exp[-(p/ - (7is)t\ 

X {1-f exp(-2(7/sf)} (7) 

whereas for the non-selective inversion recovery 

experiment, where 180° pulses are applied to I as 
well as to S one gets 

^ =1-2exp[-(p; -f fT/s)r] (8) 

Although relaxation is monoexponential in 

this case, this experiment cannot be used by 
itself to determine pj. In the transient NOE 
experiment (Fig. 2a) S is selectively inverted 
by a 180° pulse followed by delay and a read 
pulse, /z as a function of this delay is then 
governed by 
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^ = 1 - exp[-(p/ - a,s)t] 

^ Z 

X {1 - exp(-2(TKf)} (9) 


This experiment can be used to measure the 
cross-relaxation rate since the initial rate of change is 
equal to 2 ( 7 / 5 . The steady state NOE enhancement can 
be measured in the steady state NOE dijference 
experiment (Fig. 2b). In this experiment, the S spin is 
selectively irradiated for a period that should be long 
compared with pj and ajs so that Sz{t) = 0 , after which 
a read pulse is applied to I. The signal intensity will 
then be proportional to the steady state magnetization. 
If. The experiment is then repeated without saturation 
of the S spin so that the signal intensity is proportional 
to equilibrium magnetization, if The steady state NOE 
enhancement, pjg, is then calculated as 


Vis — 




( 10 ) 



Wo^x 


Nuclear Overhauser Effect, Fig. 3 Homonuclear NOE 
enhancement for a symmetric top as a function of coqTx , where 
Tx is defined as the correlation time for rotation about an axis 
orthogonal to the symmetry axis and Ty is the correlation time for 
rotation about the symmetry axis. For a spherical top 
Tx = ry = Tc. The calculations have been performed for 
;^ = 0.1 {dashed line), ^=1 {solid line) and :^ = 10 
{dashed-dotted line) at a static magnetic field of 11.7 T and an 
angle between the intemuclear vector and the symmetry axis of 
90“ 


From semiclassical relaxation theory (Redfield 
1957; Wangsness and Bloch 1953) it is possible to 
calculate relaxation rate constants as functions of the 
dipole-dipole coupling strength and spectral density 
functions sampled at appropriate frequencies as 

d^ 

P] = -^(3/(cu/) -F 6J(o}j CU 5 ) -l-,/((U/ — ®s)) 

( 11 a) 


/(cu) 


2 

5 


1 -F 


( 12 ) 


where is the correlation time for molecular tum¬ 
bling. For the homonuclear case the equations can be 
further simplified to 


N 


Pi = Ps = ^ (-^(0) + 3/(fflo) + 6/(2mo)) (13a) 


P 5 = — {3J{a>s) + 6J{a>! + cos) -F /(«/ - m^)) 


(11b) 


( 7/5 = — {6J{(Oi -F ms) - J{mi - ms)) 


Vis — 


yi Pi 


( 11 c) 

(lid) 


where d = PoUjiys < I'Js^ > /47t, Po is the permeabil¬ 
ity of free space, Ti is the Planck constant divided by 
271, 7 / and 75 are the magnetogyric ratios for spins 
I and S, rjs is the intemuclear distance, and J{m) is 
the spectral density sampled at frequency m. For 
a rigid spherical top the spectral density function is 
modeled as 


(7/5 = ^(6.7(2fflo)-/(0)) (13b) 

where coq = 7 /Bo and Bq is the static magnetic field. It 
can be shown that if the spins I and S are of the same 
species, enhancement is equal to 0.5 in the extreme 
narrowing limit, |(X)oTc| <?; 1 , whereas in the spin 
diffusion limit, Imoxf ^ 1 , enhancement is equal 
to —1. For a spherical top, enhancement disappears 
when mote =112 while the zero-crossing occurs at 
different values if molecular tumbling is anisotropic as 
shown in Fig. 3. Similar effects are found for the NOE 
buildup rate (Neuhaus and Williamson 2000). 

A noteworthy feature of the above expressions is 
that although ( 7/5 is proportional to the inverse sixth 
power of the distance between spins / and S, (Eq. lid) 
predicts that NOE enhancement is independent of 
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internuclear distance. This is however only true in the 
case of an isolated two-spin system where the sole 
relaxation mechanism is the dipole-dipole interaction 
between the two spins. In practice, the expression for 
rjjg must be modified to 


ys ^is 


Ji Pi + P*i 


(14) 


where pj' is longitudinal relaxation of I due to other 
mechanisms than the dipolar coupling to S. These may 
be contributions from chemical shift anisotropy, dipole- 
dipole interactions with external nuclear spins and with 
unpaired electrons from dissolved oxygen or paramag¬ 
netic ions. Since pj' is independent of the internuclear 
distance between I and S, it follows that the enhance¬ 
ment decreases as the internuclear distance increases. 
There is thus no straightforward answer to which is the 
maximum allowed distance between I and S for the 
observation of the NOE. However, the concept is seldom 
meaningful for internuclear distances of more than 5 A. 


to 2.8 • 10*^ and 3.4 • 10^ s^', respectively. The steady 
state heteronuclear NOE enhancement is thus sensitive 
to motions that are faster than and is commonly used 
to probe such motions. Typically, it is measured by 
recording a two-dimensional version of the experiment 
in Eig. 2b with saturation of the S spin performed by 
application of closely spaced 120 ° pulses for a period 
of several seconds. In order to measure the steady state 
NOE accurately, it is imperative that relaxation to 
thermal equilibrium and steady state, respectively, is 
complete. Recycle delays in excess of 10 s which 
should be compared to one or a few seconds for most 
protein NMR experiments should thus be employed. 
This experiment is combined with measurements of 
the longitudinal and transverse auto-relaxation rates 
and the data is usually interpreted according to the 
model-free formalism, in which the spectral density 
function is modeled with parameters describing molec¬ 
ular tumbling as well as correlation times and order 
parameters of internal motions on time scales that are 
faster than molecular tumbling (Clore et al. 1990; 
Lipari and Szabo 1982). 


The Heteronuclear NOE 

The above equations hold equally well if the nuclei I 
and S are of different species but the fact that the 
magnetogyric ratios yj and y^ are different must be 
taken into account. Specifically, this means that higher 
enhancement will be realized for the nucleus with the 
lower magnetogyric ratio. The NOE can thus be used 
to increase the sensitivity in experiments probing low 
magnetogyric ratio nuclei although coherent polariza¬ 
tion transfer methods such as INEPT usually are more 
efficient (► INEPT). It is also noteworthy that if the 
relative signs of yj and y^ differ, enhancement will be 
negative in the extreme narrowing limit. A subtlety in 
this case is that the precession frequencies also are of 
opposite sign, implying that |ffl/ ± 7 ^ |ct)/| ± |ffls|. 

This must be taken into account when evaluating the 
spectral density functions and indicates that NOE 
enhancement as a function of cooTc for y = —K is not 

simply the mirror image of the case when ^ = —K 

yj 

(Neuhaus and Williamson 2000). 

Eor a heteronuclear spin-pair in a macromolecule 
the frequencies co/ ± cos are typically significantly 
larger than the inverse of the correlation time for 
molecular tumbling. Eor example, for and *’N at 
a held of 11.7 T, Icon -f ffl/vl and \a)H — C0]v| correspond 


The Solomon Equations for a Multi-spin 
System 

It is straightforward to generalize the energy level 
diagram of Eig. 1 and the Solomon equations to more 
than two spins. It is noteworthy that the dipole-dipole 
interaction between two spins can never induce transi¬ 
tions for a third spin, that is, only pairwise Wq, Wi, and 
W 2 processes have to be considered. Eor a multi-spin 
system, the equation describing A/^ is modihed to 

( 15 ) 

X 

where the sum runs over all external spins, X and the 
dehnition of pj is modihed to also include couplings to 
these spins. In the initial time regime, NOE buildup is 
no different than for the two-spin case. However, at 
later time-points indirect mechanisms must be consid¬ 
ered (Cavanagh et al. 2007). Eor instance the mecha¬ 
nism dubbed spin dijfusion (► Spin Diffusion) is the 
result of cross-relaxation being relayed by a third spin, 
that is., S ^ X ^ I and may erroneously be interpreted 
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Nuclear Overhauser Effect, Fig. 4 Basic pulse sequences for 
NOESY and ROESY experiments. Narrow (wide) rectangles 
correspond to pulses with flip angles of 90° (180°). (a) In this 
NOESY sequence, which includes a Hahn echo, the basic eight 
steps of the phase cycle is ((ti = x,—x,—x,x, (\>2 = x,x,—x—x, 
(|)3 = x, 4)4 = y,y,y,y,-y,-y,-y,-y, (j^rec = x,-x. (b) For the 
ROESY experiment the basic phase cycle is 4)i = x,—x, 4)2 = x,x. 


4'rec = X,—X. For both experiments the phase cycle is extended by 
CYCLOPS on all pulses and the receiver. Frequency discrimina¬ 
tion in FI can be achieved by TPPI or States-TPPI. Bipolar 
gradients can be employed during tj to prevent radiation damping 
and for the NOESY experiment an additional gradient may be 
applied at the beginning of t„„- 


as an NOE between I and S. The effect is however 
useful for rapidly transferring saturation throughout 
a molecule and to a ligand as a way of detecting 
transient interactions in cross saturation experiments 
(Takahashi et al. 2000). Another indirect effect is 
called hack transfer and is the result of magnetization 
being relayed back to the original spin, that is, 
I ^ S ^ I. This process makes the NOE weaker than 
for an isolated IS two-spin system. 

The NOESY Experiment 

The nuclear Overhauser effect spectroscopy (NOESY) 
experiment is a two-dimensional experiment used to 
measure cross-relaxation rates (Macura et al. 1981). 
The basic pulse sequence is shown in Eig. 4a. After an 
excitation pulse, the spins are frequency labeled during 
ti and subsequently returned to the z-axis. Magnetiza¬ 
tion transfer, S ^ I, due to cross-relaxation occurs for 
period after which a read pulse followed by an 
optional Hahn echo is applied. 


Since buildup of cross-peak intensities is linear with 
proportionality constant 2(T{s cx in the initial time 
regime, cross-peak intensities are often used as 
a measure of intemuclear distance (► Interspin Distance 
Determination by EPR). This must however be done 
with caution. For instance, cross-peaks that only appear 
at long mixing times may be due to spin diffusion and 
should not be considered. Furthermore, the identical 
dependence on rf^ for all pairs of spins only holds 
under the assumption that the internal motions of all 
spins are equal. Usually, cross-peak intensities are 
converted into distances conservatively by dividing 
them into three classes corresponding to strong, medium 
and weak based on their intensities and assigning them 
distance ranges of 1.8-2.7, 1.8-3.3 A, and 1.8-5.0 A, 
respectively. By always using the shortest possible dis¬ 
tance as the lower bound, internal motions that reduce 
cross-peak intensities are taken into account. The rfg^ 
dependence on cross-peak intensity must also be con¬ 
sidered when calculating distances of single averaged 
signals. Averaging is in fact slightly involved and 
a detailed analysis shows that conformational changes 
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on a time scales slower than molecular tumbling, such 
as flips of aromatic rings, should be averaged as (I'jf) '' 
whereas motions occurring on faster time scales, such as 
rotations of methyl groups, should be averaged as 

’ (Neuhaus and Williamson 2000). 

Since the homonuclear NOE vanishes for 
oJoTc = 1.12 the NOESY experiment can in practice 
be realized for small or large molecules. Especially in 
the spin diffusion limit, NOESY is the method of 
choice for measurement of cross-relaxation rates 
since (1) two or more dimensions are necessary to 
avoid signal overlap, (2) it is not feasible to perform 
selective inversion of all nuclei for large molecules, 

(3) the time needed to complete the phase cycle is not 
a limiting factor because of low sensitivity, and 

(4) buildup of NOE enhancements is rapid in the spin 
diffusion limit. Whether the one-dimensional transient 
NOE experiment or NOESY is best suited for small 
molecules is often not so obvious and depends on how 
many resonances must be selectively inverted among 
other things (Neuhaus and Williamson 2000). 


The ROESY Experiment 

In the rotating frame Overhauser effect spectroscopy 
(ROESY) experiment (Bothner-By et al. 1984) 
(Eig. 4b) the spins are subject to continuous wave 
irradiation during . The amplitude of this so-called 
spin-lock held, oji, is usually on the order of 10 kHz. 
The spins will then experience an effective 
fleld cOeff = (coj -f ’ that makes the angle 
9 — arctan(g) with the static magnetic fleld. If the 
spin-lock fleld is applied on resonance, Q — 0, the 
dipolar contributions to auto- and cross-relaxation 
for a homonuclear IS two-spin system are given by 
(Neuhaus and Williamson 2000) 

^ 1 2 -^(2® 1 ) +2-^(0) + 9/(coo) -f 6,/ (2coo) | 

2 ^ (16a) 

^^{5,/(0) + 9/(coo) + 6,/(2coo)} 

O 

<4^^ = y |^•^(2®l) - ^/(O) -f 6/(fflo)| 

«y{4/(0) + 6./(mo)} (16b) 


where the superscript ROE is used to distinguish the 
rotating frame rate constants from their laboratory 
frame counterparts. The equations make it clear that 
ROESY experiment is an excellent choice for molecules 
of intermediate size such that mote = 1-12 since the 
cross-relaxation constant does not vanish. Because of 
fast auto-relaxation, it is less well suited for very large 
molecules. Other complications are due to side effects of 
spin-locking. Eor instance, if it is not applied on reso¬ 
nance, the magnetization will be locked in a tilted frame 
and the auto- and cross-relaxation rates will be averages 
of pf and Pi and 4^^ ^*1 weighted by the tilt 
angle the effective fleld makes with the static field. 
Another complication arises since the strong J coupling 
(► J Coupling) is active in the presence of a spin-lock 
fleld and may lead to magnetization transfer not due to 
cross-relaxation. However, continuous wave irradiation 
is not particularly effective for promoting transfer of 
magnetization by this mechanism except for spins reso¬ 
nating with very similar frequencies making this prob¬ 
lem less important. 

Summary 

The nuclear Overhauser effect is one of the most useful 
features of NMR spectroscopy and is used to transfer 
nuclear spin polarization from one population to 
another. The NOE is sensitive to molecular motions 
and is often used to characterize these. Since it is 
a through-space rather than a through-bond effect, it 
reports on proximity of nuclei and not on how 
many covalent bonds that separate them. One of the 
most important applications is therefore structure 
calculations of proteins and other molecules (► NMR- 
based Structural Proteomics). 

Cross-References 

► INEPT 

► J Coupling 

► NMR in Drug Discovery - Introduction 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► NMR-based Structural Proteomics 

► Nucleic Acid NMR - Introduction 

► Protein NMR - Introduction 

► Spin Diffusion 
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Synonyms 

DNA 

Definition 

Deoxyribonucleic acid, DNA, is composed of planar 
aromatic nucleic acid bases linked to a sugar and 


a phosphate as illustrated in Fig. 1 . Ribonucleic acid, 
RNA, differs from DNA by an extra oxygen on the 
sugar and also thymine is replaced by uracil which 
does not have the methyl substituent. Most of the 
DNA and RNA transitions we measure with UV spec¬ 
troscopy is due to the n-n* transitions of the bases. 
These transitions have no intrinsic CD signal since the 
bases are planar and hence achiral. However, the iso¬ 
lated bases acquire asymmetry in their electronic tran¬ 
sitions by coupling with the chiral ribose sugar units of 
the backbone. The magnitude of Asmax (the molar 
difference in absorbance of left and right circularly 
polarized light) for each base is of the order of 
2 moU* dm^ cm~^ at 270 nm; the purine bases have 
a negative signal whereas the pyrimidine ones have 
a positive CD in that region. When the bases are linked 
by phosphodiester bonds to form DNA or RNA, an 
additional (and usually larger) source of chirality 
results from the coupling between bases that are 
stacked in a helix (Berova et al in press; Norden et al 
2010 ). 


Basic Characteristics 


The shape of the CD spectrum measured for a DNA or 
RNA sample contains information not only about its 
base pair content but also about the geometry of the 
structure. In fact, the orientation of the bases relative to 
one another (i.e., the DNA polymorph) is the most 
signiheant factor determining the spectral shape. 
Since CD spectra are easy to measure, CD is often 
the simplest technique to use to probe DNA conforma¬ 
tional changes as a function of ionic strength, solvent, 
ligand concentration, etc. in solution. 

The simplest application of CD to DNA (or RNA) 
structure determination is to identify which polymorph 
is present in a sample. Since the DNA CD from 190 to 
300 nm is due to the skewed orientation of the bases, if 
the DNA is untwisted or the bases are tilted, a change 
would be expected in the CD spectrum. Some exam¬ 
ples are given in Fig. 2. Z-DNA is negative to a first 
approximation where the CD of B-DNA is positive. 
The CD of a typical RNA or A-form DNA has a sharp 
negative band at 210 nm. 

A CD spectrometer is an ideal tool for probing 
condensed DNA structures, although what is measured 
then is not strictly CD, but rather a form of light 
scattering. The results of intramolecular condensation 
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Nucleic Acid Circular 
Dichroism, Fig. 1 DNA 

nucleotide bases indicating 
backbone parameter labels 
used to describe the 
orientation of DNA bases with 
respect to one another 




Wavelength (nm) 

Nucleic Acid Circular Dichroism, Fig. 2 (a) CD spectrum of 
DNAs as a function of GC content (pH = 6.8, 5 mM NaCl). 
Clostridium perfringens (27% GC), calf thymus DNA 
(42% GC), Micrococcus luteus (72% GC), poly[d(G-C )]2 

are particles of size comparable with the wavelength of 
the light. The particles interact differently with left and 
right circularly polarized light and, at the wavelengths 
where the chromophores absorb light, give a large 
“CD” signal. The sign of the CD indicates the helical 
handedness of the condensed DNA particle, with 
a negative signal above 250 nm corresponding to 
a left-handed helix (Arscott et al 1995). CD is also 



Wavelength (nm) 

(100% GC). Also shown is Z-form poly[d(G-C )]2 induced 
by 50 pM [Co(NH 3 ) 6 ]^’^. (b) CD of a 372 nucleotide mRNA. 
^260 nm = 1 at 25°C for this sample 


useful for probing structural changes such as those 
from quadruplex to non-quadruplex structures. How¬ 
ever, it should be noted that the accepted spectral 
signatures for such transitions need to be validated by 
considering, for example, NMR data from samples at 
the same concentrations and conditions as the struc¬ 
tures exist as equilibria between different arrange¬ 
ments of the bases. 
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Wavelength (nm) 

Nucleic Acid Linear Dichroism, Fig. 1 Absorbance, LD and 
reduced LD, LD' = LD/A spectra of calf thymus DNA (100 pM) 
in a 1 cm path length cuvette (absorbance) and a 1 mm path 
length Couette flow cell (LD) 
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Nucleic Acid Linear Dichroism 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

Anisotropic spectroscopy; DNA 


Definition 

A linear dichroism (LD) spectrum shows the differ¬ 
ence in absorbance of light polarized parallel and per¬ 
pendicular to an orientation direction (Rodger and 


Norden 1997; Norden et al 2010). The idealized struc¬ 
ture of B-DNA has the base pairs oriented perpendic¬ 
ular to the DNA helix axis to a first approximation. 
This means the ti-ti* transitions take place in the plane 
perpendicular to the helix axis, and so these are 
expected to give a negative LD signal as illustrated in 
Fig. 1 . LD can be used to probe changes in orientation 
of the DNA bases. 

The LD spectrum of B-DNA shown in Fig. 1 
is similar to but not exactly the same shape as the 
negative of its absorbance spectrum, -A. The same 
information is summarized in the reduced LD spec¬ 
trum which is fairly constant across the 260 nm band, 
but not completely so. This tells us that, in solution, the 
bases are not rigidly perpendicular to the helix axis as 
implied by the idealized B-DNA structure. Careful 
analysis of solution LD studies has led to the conclu¬ 
sion that the bases of B-DNA in solution lie at an 
average angle of ~80° (or even less) from the helix 
axis (Chou and Johnson 1993) (though we typically 
assume 86° in a calculation (Matsuoka and Norden 
1982)). 

The magnitude of the LD signal of any sample 
is also affected by the degree of orientation as 
summarized by the orientation parameter, S. 

LD' =-=-^(Scos^a — l) (1) 

^iso 2 
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a 



Nucleic Acid Linear Dichroism, Fig. 2 (a) The LD of calf 
thymus B-DNA (500 |tM, 20 mM NaCl, 1 mM sodium 
cacodylate buffer pH = 7) upon addition of a tetracationic di¬ 
iron triple helicate [Fe(LL) 3 ]‘*'^ (LL = C 25 H 20 N 4 ) at DNAdigand 



Wavelength (nm) 

Nucleic Acid Linear Dichroism, Fig. 3 LD spectra of 
Micrococcus luteus DNA (200 pM in water) as a function of 
temperature normalized to 1 at 245 nm. Data were collected in 
a 0.5-mm path length Couette flow cell spinning at ~3,000 rpm 
at the Astrid synchrotron. Quartz cutoff for sample capillary is at 
182 nm in these spectra (Rittman et al. 2012) 

The magnitude of the orientation parameter, S, can 
be used to provide information about the length, flex¬ 
ibility, or bendability of DNA or about the introduc¬ 
tion of kinks into DNA. If S increases as the result of 


b 



Wavelength (nm) 

ratios shown on the figure (Hannon et al. 2001) (b) LD spectra of 
poly[d(G-C )]2 (50 pM, 5 mM NaCl, 1 mM cacodylate) as 
B-DNA and Z-DNA after the addition [Co(NH 3 )]^'^ (50 pM) 

some change imposed on the system (such as the 
addition of a DNA-binding ligand), then the LD 
increases because the DNA has become more ori¬ 
ented. This generally means it has lengthened, or 
stiffened (or both), or some bend or kink has been 
removed. Similarly, molecules that bind to DNA and 
bend it reduce S. Figure 2 illustrates both a bending 
effect and a stiffening effect (in this case converting 
to B-DNA to Z-DNA). 

Subtle changes in DNA LD spectral shapes can 
be used to probe changes in structure such as that 
induced by increases or decreases in single stranded 
regions as illustrated by the salt and temperature 
dependence of the DNA spectra of Fig. 3 (Rittman 
et al. 2012). 

Cross-References 

► DNA-Ligand Circular Dichroism 

► DNA-Ligand Flow Linear Dichroism 

► Linear Dichroism 

► Linear Dichroism Spectra - Measurement 

► Linear Dichroism Spectroscopy: Theory 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 
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Nucleic Acid NMR - Introduction 

Masato Katahira and Tsukasa Mashima 
Kyoto University, Institute of Advanced Energy, 
Kyoto, Japan 

Synonyms 

DNA 


Definition 

Nucleic acid NMR is the study of the DNA/RNA 
structure in solution with the NMR method. 


Introduction 

► NMR is a powerful tool to study the ► DNA/^ RNA 
structure in solution. It should be reminded that there 
are many examples in which the structure of a certain 
DNA/RNA in solution is quite different from that in 
crystal. This entry describes how the DNA/RNA 
structure can be determined by the NMR method. In 
addition to the established ways of the analysis, recent 
progresses and tips are also documented. 

Firstly, three different ways of the DNA/RNA prep¬ 
aration are described. Secondly, the method of the 


resonance assignment, that is the crucial step for the 
NMR analysis, is explained. Thirdly, the way to iden¬ 
tify the mode of the base pair, that is the key informa¬ 
tion to construct the structure, is described. Fourthly, 
the methods to derive various structural restraints are 
explained. Fifthly, the way to determine the structure 
by calculation by means of structural restraints is 
described. Finally, the recent examples of the DNA/ 
RNA structures determined by NMR are shown. The 
example of real-time monitoring of the cytosine to 
uracil conversion of DNA by a human anti-HIV 
enzyme is also shown. 

Sample Preparation 

DNA and RNA can be made with a DNA/RNA synthe¬ 
sizer. By using the *^C, ^^N-labeled phosphoramidite, 
either uniform or residue-specific '^N-labeling can 
be achieved. The yield of the overall synthesis becomes 
very low, when the length of DNA/RNA is long. 
Thus, this method is suitable for making relatively short 
DNA/RNA. 

DNA can also be made with enzymatic synthesis with 
a DNA polymerase, a Klenow fragment, using a proper 
template (Fig. la) (Zimmer and Crothers 1995). 
By locating a ribonucleotide at the end of the template, 
the synthesized target DNA can be separated from 
the template by alkaline hydrolysis. The PCR method 
can also be applied for the synthesis (Werner et al. 2001). 
By using *^N-labeled dNTPs as starting materials, 
either uniform or nucleotide-type-specific *^C, *^N - 
labeling can be achieved. DNA is purified with denatured 
polyacrylamide gel electrophoresis. 

RNA can also be made with enzymatic synthesis 
with an RNA polymerase of T7 phage using a proper 
template (Fig. lb) (Varani et al. 1996; Furtig et al. 
2003). Efficiency of the synthesis of short RNA is 
usually very low; thus this method is suitable for 
making relatively long RNA. 

It is recommended to put an NMR tube in a 15% H 2 O 2 
solution overnight to inactivate a trace of RNase before 
use and to extensively wash the tube with water 
afterward. 

Resonance Assignments 

A repeating unit of DNA/RNA, a nucleotide, and 
five bases are shown in Fig. 2a and b, respectively. 
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a^gatccSP 

, I I I I I 

^ „ CTAGGTCCCGAAAACCCG„ 


template DNA, S RNA 


Klenow fragment 


I3r 


^N-labeled dNTPs 


GATCCgSGGCTTTTGGGC^' 
I I I I I 

, CTAGGTCCCGAAAACCCG,- 


alkallne hydrolysis 


a^GATCCi 
, I I I I I 


“gggcttttgggC 


, CTAGGTCCCGAAAACCCG,- 


T7 promoter 


transcription start point 


The sequential phosphoras H3'/H4'/H5'/H5" connec¬ 
tivities comprising of intraresidue P(;)-H5'(;)/H5"(0/ 
H4'(0 and interresidue P(0-H3'(; — 1) cross peaks in 
Het-COR can further confirm the sugar proton assign¬ 
ments, with the assignments of phosphorus being 
obtained at the same time. The sequential phosphorus 
C37C47C5'-H3'/H4'/H5'/H5" connectivities observed 
in HCP can also confirm the assignments. For relatively 
short DNA/RNA, these two methods using the phospho¬ 
rus can give the assignments on the basis of totally 
through-bond connectivities without relying on through- 
space connectivities obtained from NOESY. For longer 
DNA/RNA, the methods using the phosphorus become 
less sensitive because they rely on relatively small 2-, 3-, 
or 4-bond coupling constants. Intraresidue correlation 
between HI' and H6/H8 can also be obtained purely 
relying on through-bond connectivities for relatively 
small DNA/RNA. 

H5 of pyrimidine is assigned from an H5-H6 


■< - 

-►iGCTTGATC,. 

correlation peak in TOCSY, and H2 of adenine is from 


template DNA 

;CGAACTAGk, 

: DMA 

an H2-H8 correlation peak in HCCH-TOCSY. HMBC 


'’’gcuugauc®' + 


^^C-, ■'^N-labeled rNTPs 


Nucleic Acid NMR - Introduction, Fig. 1 Sample prepara¬ 
tion. (a) Enzymatic synthesis of DNA with a DNA polymerase, 
a Klenow fragment. The 3'-end residue of the template, boxed, is 
a ribonucleotide, (b) Enzymatic synthesis of RNA with an RNA 
polymerase 

The most useful tool for the assignments of DNA/RNA 
is the sequential base proton (either H6 of pyrimidine 
or H8 of purine)-Hl' connectivities comprising of 
intraresidue H6(/)/H8(i)-Hl'(/) and interresidue 
H6(/)/H8(0-Hl'(/ - 1) cross peaks in NOESY (Varani 
et al. 1996; Cromsigt et al. 2001; Furtig et al. 2003). An 
example of the trace of the connectivities is shown in 
Fig. 3a (Mashima et al. 2009). On the basis of the HI' 
assignment, the other sugar protons, H2', H2", H3', 
H4', H5', and H5", can be assigned using DQF-COSY 
and TOCSY. The obtained assignments can be con¬ 
firmed by the sequential base proton-H2'/H2" and base 
proton-H3' connectivities. 

When a sugar takes on C3'-endo puckering as often 
found in RNA, Vhi'h 2' is almost 0 Hz, thus the Hl'- 
H2' correlation is hard to observe in either DQF-COSY 
or ► TOCSY. Even in this case, HCCH-COSY and 
HCCH-TOCSY can give the Hl'-H2' correlation and 
the H2' assignment as a result. These spectra are also 
powerful for the assignments of the other sugar 
protons. 


is also useful to assign H2 for non-labeled DNA/RNA 
through H8-C4 and H2-C4 correlation peaks. 

*^N ► HSQC is useful to discriminate exchangeable 
imino protons originating from guanine and thymine 
(uracil) bases. The imino nitrogen resonance of 
a guanine appears at 146-149 ppm, while that of a thy¬ 
mine (uracil) does at 156-162 ppm. The residue- 
specific assignment of the imino proton can be 
obtained from HCCNH-TOCSY on the basis of 
already assigned H6/H8 through the magnetization 
transfer of either H8 ^ C8 ^ C4 ^ C6 ^ N1 ^ HI 
(imino proton) for a guanine or H6 ^ C6 ^ N1 ^ C2 
^ N3 ^ H3 (imino proton) for a thymine (uracil). The 
imino proton of a guanine can also be assigned with 
HNN-COSY in which both H8 and HI (imino) give the 
correlation peak to the common N3. The imino proton 
of a thymine (uracil) can also be assigned with *^N 
HSQC in which both H5 and H3 (imino) give the 
correlation peak to the common N3. The imino proton 
of a guanine of non-labeled DNA/RNA can be 
assigned with JR-HMBC in which both H8 and HI 
give the correlation peak to the common C5. When 
a certain secondary structure can be assumed, imino 
protons can be assigned on the basis of imino-imino 
connectivities in NOESY between neighboring base 
pairs (Fig. 3b) (Nakano et al. 2011). 

Discrimination of exchangeable amino protons 
originating from cytosine, adenine, and guanine bases 
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Nucleic Acid NMR - 
Introduction, Fig. 2 A 

nucleotide and bases, (a) 
Structure of a nucleotide, 
a repeating unit of DNA/RNA. 
(b) Structures of five bases 
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Nucleic Acid NMR - 
Introduction, 

Fig. 3 Assignments. 

(a) Sequential base proton 
(either H6 of pyrimidine or 
H8 of purine)-Hl' 
connectivities in NOESY for 
r(GGAGGAGGAGGA), the 
intraresidue base-Hl' cross 
peak being labeled with the 
residue number (Mashima 
et al. 2009). (b) Imino-imino 
connectivities between 
neighboring base pairs for the 
stem region of an RNP 
aptamer against ATP (Nakano 
et al. 2011) 
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can also be obtained from HSQC. The amino 
nitrogen resonance of cytosine, adenine, and guanine 
appears at 94-98, 82-84, and 72-76 ppm, respectively. 
The residue-specific assignment of the amino protons 
of a cytosine is obtained from the strong intrabase 
NOESY cross peak with already assigned H5. Adenine 
amino protons are assigned from HCCNH-TOCSY on 
the basis of already assigned H2 and H8. Guanine 
amino protons are assigned from the strong intrabase 
NOESY cross peak with already assigned HI (imino). 

C2, C5, C6, and C8 are assigned from HSQC. 
C2 appears in a low-field region (152-156 ppm) com¬ 
pared to C6 and C8 (134-144 ppm), which is utilized to 
discriminate H2 from H6/H8 in an early stage of the 
assignment. Cl', C2', C3', C4', and C5' are assigned 
from '^C HSQC, HCCH-COSY, and HCCH-TOCSY. 
'^N HSQC gives H6-N1 (pyrimidine), H8-N7/N9 


(purine), and H2-N1/N3 (adenine) correlations, 
which is utilized to discriminate H2, H6, and H8 in 
an early stage of the assignments. 

Einally, site-specific incorporation of 1-2% or 
'^N-labeled nucleotide at each position of DNA/RNA 
with a synthesizer is also a powerful and cost-effective 
tool for the assignment (Phan and Patel 2002). 

Identification of Base Pairs 

Eor an A:T base pair (Pig. 4a), a strong AH2-T(U)H3 
(imino) NOESY cross peak is observed. For a G:C base 
pair (Fig. 4b), strong CH41/CH42 (amino)-GHl 
(imino) NOESY cross peaks are observed. These 
cross peaks indicate the formation of the either A:T 
or G:C base pair. 
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Nucleic Acid NMR - Introduction, Fig. 4 Base pairs, (a) A:T base pair, (b) G:C base pair, (c) GiAbase pair, (d) U:A:Ubase triple, 
and (e) G(:A):G(:A):G(:A):G heptad 
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A base pair can be identified more directly and 
solidly with ► HNN-COSY by observing a spin-spin 
coupling across a ► hydrogen bond involved in the 
formation of the base pair (Grzesiek et al. 2001). In 
the case of the A:T base pair, the T(U)H3 (imino)-AN 1 
correlation peak appears due to the existence of the 
► spin-spin coupling, ^Vnnj between T(U)N3 and 
ANl. In the case of the G:C base pair, the GHl 
(imino)-CN3 correlation peak appears due to the exis¬ 
tence of the coupling between GNl and CN3. 
Even if exchangeable T(U)H3 (imino) is not observed 
for an unstable A:T(U) base pair, the base pair forma¬ 
tion can still be deduced from the observation of the 
AH2 (non-exchangeable)-T(U)N3 correlation peak 
originating from the existence of the Jnn coupling 
between ANl and T(U)N3. 

Nonstandard base pairs can also be identified with 
HNN-COSY. For a G;A base pair (Fig. 4c), the GHl 
(imino)-AN 1 correlation peak appears due to the 
coupling between GNl and ANl. In the case of a 
U:A:U base triple (Fig. 4d), the UH3 (imino)-ANl 
correlation peak appears due to the UN3 (imino)- 
ANl ,/nn coupling for a Watson-Crick base pair, 
while the UH3 (imino)-AN7 correlation peak appears 
due to the UN3 (imino)-AN7 ^Vnn coupling for 
a Hoogsteen base pair. More complex architecture 
such as a heptad (Fig. 4e) (Matsugami et al. 2001) 
can also be identified with HNN-COSY. Observation 
of the G1H21/H22 (amino)-A3N7 correlation peaks 
originating from the G1N2-A3N7 coupling and 
the A3H61/H62 (amino)-GlN3 correlation peaks orig¬ 
inating from the A3N6-G1N3 Jnn coupling indicates 
the formation of the G1:A3 base pair. Presence of the 
G1H21/H22 (amino)-G4N7 correlation peaks originat¬ 
ing from the G1N2-G4N7 ^Vnn coupling indicates 
the formation of the G1;G4 base pair. In this way, the 
formation of the heptad was solidly proved (Sotoya 
et al. 2004). 

As an application of HNN-COSY, a method 
for the direct unambiguous discrimination between 
intra- and intermolecular hydrogen bonds in symmet¬ 
ric multimers has been developed (Sotoya et al. 2004). 
This method relies on the extent of the reduction 
of the HNN-COSY cross peak intensity between 
fully labeled, and an equimolar mixture of labeled 
and non-labeled samples. Irrespective of the level 
of multimerization, this method can clearly dis¬ 
criminate the intra- and intermolecular hydrogen 
bonds. 


Structural Restraints 

Distance restraints are the most powerful source for the 
structure determination of DNA/RNA (Varani et al. 
1996; Cromsigt et al. 2001; Furtig et al. 2003). Under 
the two-spin approximation, distance restraints are 
calculated from 

fij = r„f(NOE„f/NOEij)‘/'’ 

where / jj is the distance between proton i and j, is 
a reference distance, NOE^f is the intensity of a cross 
peak of NOESY recorded with a short mixing time 
(^50 ms) between a reference proton pair, and NOEy 
is that between protons i and j. As the reference proton 
pair, H5 and H6 of a cytosine (reference distance of 
2.47 A) or Hl'-H2" (2.20 A) is used. The calculated 
distance contains errors due to spectral noise, spin 
diffusion, and local dynamics. Therefore, a certain dis¬ 
tance is added to the calculated one to set an upper 
limit distance. For example, 0.8 A is added for the 
calculated distance of 0-2.7 A, 1.5 A is added for the 
2.7-3.2 A distance, 2.0 A is added for the 3.2^.0 A 
distance, and 2.5 A for 4.0-6.0 A. For a cross peak that 
can be observed only in NOESY with a longer mixing 
time (^200 ms), 0.5 A is further added to the upper 
limit distance. The lower limit distance is set to 1.8 A 
(Matsugami et al. 2001). For exchangeable protons, 
WATERGATE NOESY or JRNOESY is used to 
obtain the distance (Varani et al. 1996). 

The structure of each nucleotide unit can be 
described with the seven dihedral angles, a-^ for 
a backbone and x around the glycoside bond 
(Fig. 2a). 5 is related to sugar puckering. Because of 
the experimental difficulty to obtain reliable dihedral 
angle restraints, inclusion of these restraints into the 
structure determination is limited to relatively small 
DNA/RNA. Qualitative analysis of Vhs'p and Vh 5 "p 
based on Het-COR gives restraints for (3 for a certain 
case (Varani et al. 1996). Qualitative analysis of 
^•/^H 4 'H 5 ' and ^7 h4'h 5" based on DQF-COSY gives 
restraints for j (Varani et al. 1996). The sugar pucker 
can be determined from the analysis of Vhi'h 2 'j 
Vh 2 "h 3 '> and Vh 3 'h 4 ' derived from DQF-COSY and 
E. COSY (Varani et al. 1996). It is also useful to know 
that C3' and C4' resonances appear in the low-field 
region for C2'-endo in comparison with the case 
of Cy-endo. Thus, restrains for 5 related to the 
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sugar puckering are obtained. Qualitative analysis 
of UH 3 '(i)P(i+i) based on Het-COR gives restraints for 
£ (Varani et al. 1996; Cromsigt et al. 2001; Furtig et al. 
2003; da Silva 2007). 

When the mode of a base pair is identified as 
described above, distance restraints between donor 
and acceptor atoms of the hydrogen bonds are 
included. 

Planarity restraint that forces all atoms involved in 
a base pair to locate onto one plane could also be 
included. 

When a molecule is weakly aligned in solution, the 
dipolar-dipolar interaction does not average to zero 
anymore, hut the ► residual dipolar coupling (RDC) 
can be detected. The RDC between i and / spins, Dy, is 
given by 

Dy(0,.A) =^«{(3cos20- 1) 

+ 2>I2 Rsin^0cos2i/^} (1) 

where is the magnitude of the dipolar coupling 
tensor, R is the rhombicity, and 6 and ip are polar 
coordinates that describe the orientation of the ij vector 
in the principal axis system of the molecular alignment 
tensor (Bax et al. 2001). When five parameters. Da, R, 
and three Euler angles that define the orientations of 
principal axes of the molecular alignment tensor are 
determined, the experimentally obtained RDC value 
for the ij vector such as either a directly bonded N-H 
vector or a C-H vector restrains the orientation of the 
corresponding vector according to the Eq. 1. The 
restraint for the orientation of each vector is imposed 
against the common frame (the molecular alignment 
tensor); therefore these restraints can be regarded as 
“global restraints” in contrast to the distance restraints 
derived from NOE which are “local restraints.” The 
filamentous bacteriophage Pf 1 aligns in solution due to 
its magnetic anisotropy. Then, when the Pfl phage 
(10-20 mg/mL) is added to DNA/RNA solution, the 
weak alignment of DNA/RNA is achieved due to the 
steric interaction of DNA/RNA with the aligned Pfl 
phage. The negatively charged Pfl is useful to align 
DNA/RNA that is also negatively charged, because the 
unfavorable attachment of DNA/RNA to the alignment 
medium can be avoided by electrostatic repulsion. 
The RDC values for one-bond N-H and C-H can 
be derived as the difference in the coupling (scalar 
coupling plus RDC) between under the isotropic 


and aligned conditions. ID spectrum, HSQC without 
decoupling along the indirect dimension, IPAP 
(inphase/antiphase) HSQC and ► TROSY are used to 
obtain the RDC values (Bax et al. 2001; Lukavsky and 
Puglisi 2005). 

Cations play a critical role for the formation of 
certain DNA/RNA structures. For example, plays 
a critical role for the formation of the quadruplex. 
NH 4 ’^ can be used to probe monovalent ion binding 
to DNA/RNA. Observation of intermolecular NOE 
between the ammonium ion protons and those of 
DNA/RNA gives the information on how cations 
bind to and stabilize the certain DNA/RNA structure 
(Eeigon et al. 2001). 

Structure Determination 

Experimentally obtained structural restraints are 
transformed into pseudo-energy terms, which are com¬ 
bined with empirical energy terms to give the total 
energy. With this total energy, the restrained molecular 
dynamics is run. The structure determination is 
performed with simulated annealing protocol in 
which the system is heated to high temperature 
(2,000-3,000 K) and gradually cooled down to low 
temperature. Through this process, the molecule over¬ 
comes the energy barrier to escape from the local 
minimum and reaches the global minimum. The pro¬ 
grams such as XPLOR-NIH, ► CYANA, ► CNS, and 
AMBER are used for this calculation. Ten to twenty 
final structures are selected from 100 to 200 calcula¬ 
tions on the basis of the criterion of the smallest resid¬ 
ual total energy. The quality of the obtained structures 
is evaluated on the basis the root mean square devia¬ 
tion (r.m.s.d.) and violations of restraints. Reasonably 
good structures give the r.m.s.d. less than 0.5 A for all 
heavy atoms, no distance restraints violation greater 
than 0.5 A, and no dihedral angle restraints violation 
greater than 5° (Varani et al. 1996; Cromsigt et al. 
2001; Furtig et al. 2003). 

Because the proton density of DNA/RNA is lower 
than that of protein, there is less chance to obtain the 
distance restraints derived from NOE for DNA/RNA 
than for protein. Additionally, the shape of DNA/RNA 
tends to be rod-like rather than globular, and then the 
distance restraints are likely to be “local” rather than 
“global.” Therefore, the inclusion of the structural 
restraints derived from RDC could be more critical 
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Fig. 5 Examples of nucleic 
acid structures and 
monitoring of base 
conversion, (a) Structure of 
double-stranded 4'-thioDNA 
with the sequence of d 
(CGCGAATTCGCG) 
(Matsugami et al. 2008). 

(b) Dimeric structure of RNA 
aptamer, R12, against a prion 
protein with the sequence of r 
(GGAGGAGGAGGA) 
(Mashima et al. 2009). 

(c) Real-time monitoring of 
cytosine to uracil conversion 
in single-stranded DNA with 
the sequence of d 
(ATTCCCAATT) through 
deamination by anti-HIV 
enzyme, APOBEC3G 
(Furukawa et al. 2009) 



(Major groove) 


(Minor groove) 



1H (ppm) 


for the precise structure determination of DNA/RNA 
than for that of protein. In the case of a globular pro¬ 
tein, it can be assumed that the directions of NH and 
CH vectors are uniformly distributed to all orienta¬ 
tions. Under this assumption, and R can be deter¬ 
mined on the basis of the histogram of the observed 
RDC for many NH and CH pairs. Then just three Euler 


angles, that define the orientations of principal axes of 
the molecular alignment tensor, should be included 
into the structural calculation as newly added variables 
in order to incorporate structural restraints derived 
from RDC (Bax et al. 2001). In the case of DNA/ 
RNA that tends to exhibit a rod-like shape, however, 
the uniform orientation of NH and CH vectors cannot 
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be assumed and thus and R cannot be determined in 
the same way. Alternatively, the structure calculation 
is carried out without the RDC restraints in an early 
phase to obtain a tentative structure. Then, the opti¬ 
mum combination of Da and R that gives the best 
agreement between the experimentally obtained 
RDCs and the RDCs predicted for the tentative struc¬ 
ture is determined with a grid search. In a later phase, 
the RDC restraints are included into structure calcula¬ 
tion with the determined £)„ and R (Zweckstetter and 
Bax 2000; Bax et al. 2001; McCallum and Pardi 2003). 


Examples of DNA/RNA Structures and 
Real-Time Monitoring of Base Conversion 
by Anti-HIV Enzyme 

Figure 5 a shows the structure of double-stranded 
4'-thioDNA comprising 2'-deoxy-4'-thionucleotides 
with the sequence of d(CGCGAATTCGCG) 
(Matsugami et al. 2008). Most sugars take on the 
Ci'-endo conformation. The major groove is narrow 
and deep, while the minor groove is wide and shallow. 
Thus, 4'-thioDNA takes on A-form characteristic of 
RNA, both locally and globally, rather than conven¬ 
tional B-form. The unique biochemical properties of 
4'-thioDNA, including nuclease resistance that is 
a promising character as a functional oligonucleotide, 
are rationalized in the light of the elucidated structure. 

Figure 5b shows the dimeric structure of 12-mer 
RNA, r(GGAGGAGGAGGA) (R12), which functions 
as an aptamer against a prion protein (Mashima 
et al. 2009). R12 forms an intramolecular parallel 
quadruplex under physiological ionic conditions. 
The quadruplex contains G:G:G:G tetrad and G(:A): 
G:G(:A):G hexad planes. Two quadruplexes form 
a dimer through intermolecular hexad-hexad stacking. 
The structure has suggested how the RNA aptamer, 
R12, traps the prion protein with high affinity. The 
atomic coordinates of R12 would be useful for the 
development of R12 as a therapeutic agent against 
prion diseases and Alzheimer’s disease. 

Figure 5c shows time course of NMR spectra 
indicating deamination of two cytidine residues of sin¬ 
gle-stranded DNA, d(ATTCCCAATT), in a strict 3'^5' 
order by human ► anti-HIV enzyme, APOBEC3G, in an 
NMR tube (Furukawa et al. 2009). The enzymatic reac¬ 
tion is started by the addition of APOBEC3G and mon¬ 
itored in real time with NMR signals. This result 


indicates how APOBEC3G converts the cytidine base 
of the minus strand DNA of HIV-1 and destroys the 
genetic information in order to exert the anti-HIV 
activity. 


Summary 

The DNA/RNA structure in crystal does not always 
reflect that in solution. NMR is the only way to provide 
the DNA/RNA structure in solution at an atomic 
resolution. Therefore, this methodology is crucial to 
understand various biological events that need the 
knowledge on the DNA/RNA structure. The mode of 
a base pair, that is key information to construct the 
structure, can be solidly determined on the basis of the 
observation of a spin-spin coupling across a hydrogen 
bond with HNN-COSY. RDCs obtained for weakly 
aligned DNA/RNA in solution can provide valuable 
“global” structural information. In combination with 
these structural restraints, distance and dihedral angle 
restraints can provide a precise DNA/RNA structure in 
solution. 


Cross-References 


N 


► Cross-Correlated Relaxation 

► Cryogenic NMR Probes 
► CYANA 

► INEPT 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► J Coupling 

► Modeling of RNA Molecules 

► Molecular Dynamics Simulations of Lipids 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR in Drug Discovery - Introduction 

► NMR Methods for Kinetic Analysis 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► NMR-based Structural Proteomics 

► Nuclear Overhauser Effect 

► Protein NMR - Introduction 

► Total Correlation Spectroscopy (TOCSY) in NMR 
Protein Structure Determination 

► TROSY 




1772 


Nucleobase-Cation-Symport-2 Family and the Uracil: Proton Symporter, UraA 


References 

Bax A, Kontaxis G, et al. Dipolar couplings in macromolecular 
structure determination. Methods Enzymol. 2001;339:127-74. 

Cromsigt J, van Buuren B, et al. Resonance assignment and 
structure determination for RNA. Methods Enzymol. 
2001;338:371-99. 

da Silva MW. NMR methods for studying quadruplex nucleic 
acids. Methods. 2007;43:264-77. 

Feigon J, Butcher SE, et al. Solution nuclear magnetic resonance 
probing of cation binding sites on nucleic acids. Methods 
Enzymol. 2001;338:400-43. 

Furtig B, Richter C, et al. NMR spectroscopy of RNA. 
Chembiochem. 2003;4:936-62. 

Furukawa A, Nagata T, et al. Structure, interaction and real-time 
monitoring of the enzymatic reaction of wild-type 
APOBEC3G. EMBO J. 2009;28:440-51. 

Grzesiek S, Cordier F, et al. Scalar couplings across hydrogen 
bonds. Methods Enzymol. 2001;338:111-33. 

Lukavsky PJ, Puglisi JD. Structure determination of large 
biological RNAs. Methods Enzymol. 2005;394:399^16. 

Mashima T, Matsugami A, et al. Unique quadruplex structure 
and interaction of an RNA aptamer against bovine prion 
protein. Nucleic Acids Res. 2009;37:6249-58. 

Matsugami A, Ouhashi K, et al. An intramolecular quadruplex 
of (GGA)4 triplet repeat DNA with a G:G:G:G tetrad and a 
G(:A):G(:A):G(:A):G heptad, and its dimeric interaction. 
J Mol Biol. 2001;313:255-69. 

Matsugami A, Ohyama T, et al. Unexpected a-form formation of 
4'-thioDNA in solution, revealed by NMR, and the implica¬ 
tions as to the mechanism of nuclease resistance. Nucleic Acids 
Res. 2008;36:1805-12. 

McCallum SA, Pardi A. Rehned solution structure of the 
iron-responsive element RNA using residual dipolar couplings. 
J Mol Biol. 2003;326:1037-50. 

Nakano S, Mashima T, et al. Structural aspects for the recognition 
of ATP by ribonucleotide receptors. J Am Chem 
Soc. 2011;133:4567-79. 

Phan AT, Patel DJ. A site-specific low-enrichment '^N, '^C 
isotope-labelling approach to unambiguous NMR spectral 
assignments in nucleic acids. J Am Chem 
Soc. 2002;124:1160-1. 

Sotoya H, Matsugami A, et al. Method for direct discrimination 
of intra- and intermolecular hydrogen bonds, and character¬ 
ization of the G(:A):G(:A):G(:A):G heptad, with scalar 
couplings across hydrogen bonds. Nucleic Acids Res. 
2004;32:51113-18. 

Varani G, Aboul-ela F, et al. NMR investigation of RNA 
structure. Prog Nucl Magn Reson Spectrosc. 1996;29: 
51-127. 

Werner MH, Gupta V, et al. Uniform *^C/*^N-labeling of DNA 
by tandem repeat amplification. Methods Enzymol. 
2001;338:283-304. 

Zimmer DP, Crothers DM. NMR of enzymatically synthesized 
uniformly '^C, *^N-labeled DNA oligonucleotides. Proc Natl 
Acad Sci USA. 1995;92:3091-5. 

Zweckstetter M, Bax A. Prediction of sterically induced 
alignment in a dilute crystalline phase: aid to protein 
structure determination by NMR. J Am Chem 
Soc. 2000;122:3791-2. 


Nucleobase-Cation-Symport-2 Family 
and the Uracil: Proton Symporter, UraA 
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Synonyms 

NCS2 (nucleohase-cation-symporter-2) or NAT 
(nucleobases-ascorbate-transporter); UraA, uracil 
permease in Escherichia coli 

Definition: Based on Primary Sequence 
Similarity and Biological Function 

NCS2 or NAT family (TC # 2.A.40) (http;//www.tcdb. 
org/search/result.php?tc=2.A.40) contains over 100 
known members across all kingdoms of life 
(Saier 2000). In bacteria and fungi, NCS2 proteins 
mainly transport uracil, xanthine, and other derivatives 
of purine or pyrimidine, whereas in mammals, the NCS2 
members SVCTl and SVCT2 (sodium-dependent vita¬ 
min C transporter) transport vitamin C (L-ascorbate). 
Representative members of NCS2 family are 
(http://www.tcdb.org/search/result.php?tc=2.A.40.6): 
2.A.40.1.1: UraA - Uracil permease in Escherichia 
coli 

2.A.40.1.2: PyrP - High affinity uracil permease in 
Lactococcus lactis 

2.A.40.1.3: RutG- Putative pyrimidine permease in 
Escherichia coli 

2.A.40.2.1: YcpX- Purine permease in Clostridium 
perfringens 

2.A.40.3.1: PubX (XanP) - Xanthine permease in 
Bacillus suhtilis 

2.A.40.3.2: PucJ - Uric acid permease in Bacillus 
suhtilis 

2.A.40.4.1: UapA - High affinity uric acid-xanthine 
permease in Emericella nidulans 
2.A.40.4.2: YicE - The putative xanthine permease in 
Escherichia coli 

2.A.40.4.3: YgfO (XanQ) - Purine (xanthine) 
transporter in Escherichia coli 
2.A.40.5.1: UapC - General purine permease 
in Emericella nidulans 
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2.A.40.6.1: SVCTl - L-ascorbate: Na"^ symporter in 
Rattus norvegicus 

2.A.40.6.2: SVCT2 - L-ascorbate: Na"^ symporter 
in Homo sapiens 

2.A.40.6.3: LPEl - Leaf permease protein 1, high 
affinity uric acid-xanthine transporter in Zea mays 
UraA is the uracil transport protein in E. coli. 
Imported uracil is metaholized via the pyrimidine 
salvage pathways (Andersen et al. 1995). 5-Fluorouracil, 
an important anticancer drug, is also a substrate of UraA. 
UraA shares a sequence identity of 23.1%, 20.5%, and 
21.9%, with human SVCTl, SVCT2, and rat SNBTl, 
respectively. The transport reaction is: 

Uracil (out) -F H-" (out) ^ Uracil (in) -F H-" (in) 

Introduction 

The NAT Signature Motif 

Genetic, biochemical, and in silico studies of 
UapA and YgfO identified a NAT signature motif 
comprising 11 residues, [A/G/S][Q/E/P]-N-X-G-X- 
X-X-X-T- [R/K/G] (Gournas et al. 2008). 

Transport and Ligand Binding Assays 

Uracil binds to the recombinant UraA protein 
with a dissociation constant (Kj) of approximately 
0.41 ± 0.07 pM as measured by the scintillation 
proximity assay (SPA) (Fig. 1). In a cell-based uptake 
assay, UraA can transport ^^H-uracil with a K^, of 1.08 
± 0.2 pM (Fig. 2). 

X-ray Crystal Structure of UraA 

Crystals of recombinantly expressed and purified 
UraA were obtained in space group P 6422 in the pres¬ 
ence of 0.4% P-NG (n-nonyl-p-D-glucopyranoside) 
detergent and diffracted X-rays heyond 2.9 A. The 
structure of uracil-bound UraA was refined to 2.9 A 
resolution (Fig. 3). 

A Novel Transporter Fold 

UraA consists of 14 membrane helices, with both 
N- and C-termini located on the cytoplasmic side. 
The two C-terminal helices al3 and a 14 penetrate 
only halfway into the membrane and form a hairpin 
(Fig. 3). There is little apparent structural similarity 
between UraA and Mhpl, a representative member 
of the Nucleobase-Cation-Symport (NCSl) family 
(Weyand et al. 2008), despite the apparent similarity 



Nucleobase-Cation-Symport-2 Family and the Uracil: Pro¬ 
ton Symporter, UraA, Fig. 1 The binding affinity between 
uracil and WT UraA was measured by scintillation proximity 
assay (SPA). A competition experiment was performed in which 
the final concentration of ^H-uracil was kept constant at 0.17 pM 
(40 Ci mmoL'), and the concentration of non-labeled uracil was 
increased from 0 to 10 pM. Fitting of the data gave rise to 
a dissociation constant of approximately 0.41 ± 0.07 pM 


Cell-based uptake assay of ®H-uracll 



Concentration of uracil (pM) 

Nucleobase-Cation-Symport-2 Family and the Uracil: Pro¬ 
ton Symporter, UraA, Fig. 2 Uptake of^H-uracil by cells. E. 
coli ISuraA was transformed with WT-UraA in the pQLINK 
vector 


of substrates for NCS2 and NCSl family members. An 
exhaustive search of the PDB using DALI suggested 
that UraA has a novel fold compared to all known 
structures of secondary active transporters (Fig. 4). 
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periplasm 


Nucleobase-Cation-Symport-2 Family and the Uracil: Pro¬ 
ton Symporter, UraA, Fig. 3 The crystal structure of UraA 
was determined at 2.9 A. Two perpendicular views, one from 


periplasm and one from the lipid membrane, are shown. 
Corresponding TMs within the two inverted repeats (e.g., TMl 
and TMS) are colored identically. The bound uracil is shown 


Nucleobase-Cation- 
Symport-2 Family and the 
Uracil: Proton 
Symporter, UraA, 

Fig. 4 Topology diagram of 
UraA. The TMs are arranged 
into the core domain (in light 
purple hackground)and the 
gate domain (in light cyan 
background) 



The most prominent feature is placement of two short 
antiparallel (3-strands, on TM3 and TM10 respectively, 
at the center of the structure (Fig. 4). Inner membrane 
proteins are mostly a-helical. UraA is likely the only 
known transporter that contains (3-strands in the trans¬ 
membrane region. Each P-strand is preceded by an 
extended loop and followed by a short a-helix, a3 
and alO, respectively (Fig. 4). The 11-residue NAT 
motif, which was predicted to be a loop in the 
cytoplasm, constitutes alO in the structure of UraA. 
Helices a3 and otlO are aligned as if they were a single 
discontinuous transmembrane helix, with the substrate 
bound at the breakage point (Fig. 4). 

The 14 TMHs of UraA are spatially organized into 
two halves, which are named the core domain and the 
gate domain. The core domain comprises TMHs 1-4 
and 8-11, whereas the gate domain includes the other 6 
TMHs. Intriguingly, UraA also contains two internal, 
structural repeats: TMs 1-7 and TMs 8-14, which 
are related to each other by an approximately 180° 
rotation around an axis parallel to the membrane 


bilayer (Fig. 5). These two repeats can be 
superimposed with a root-mean-squared deviation 
(rmsd) of 2.9 A over 129 Ca atoms. 

Substrate Recognition 

Uracil is located in a concave pocket formed by 
the antiparallel P-strands and their connecting loops 
in TMH3 and TMHIO. The pyrimidine ring of uracil 
is approximately in parallel with the P-strands and 
surrounded by negative electrostatic potentials (Fig. 6). 
Recognition of the substrate is almost exclusively medi¬ 
ated by residues from the core domain, involving both 
polar and hydrophobic contacts from TMl, TM3, TMS, 
TMIO, and TMl2 (Fig. 7). Two negatively charged 
residues, Glu241 and Glu290, appear to anchor uracil 
by each making two hydrogen bonds to uracil. His245 is 
also located in the vicinity of uracil, likely contributing to 
its coordination through water-mediated hydrogen bonds 
or through interactions with Glu241. Furthermore, the 
two oxygen atoms of uracil are further hydrogen-bonded 
to the amide nitrogen atoms of Phe73 and Gly289. 
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Nucleobase-Cation- 
Symport-2 Family and the 
Uracil: Proton 
Symporter, UraA, 

Fig. 5 Superimposition of the 
two inverted repeats within 
UraA 



■ TM1-7 


1 


■ TM8-14 


Nucleobase-Cation- 
Symport-2 Family and the 
Uracil: Proton 
Symporter, UraA, 

Fig. 6 Uracil is located in 
a concave within the core 
domain and surrounded by 
negative electrostatic 
potential. The core domain of 
UraA is shown in both surface 
electrostatic potential (left) 
and rainbow cartoon (right) 



In addition to the polar contacts, uracil is surrounded by 
hydrophobic residues including Ala31 of TMHl, Phe73 
of TMH3, Tyr288 of TMHIO, and Tyr342 of TMH12. 
Phe73 insulates the substrate from the periplasm, 
whereas the phenyl ring of Tyr288 is roughly in parallel 
with the pyrimidine ring of uracil. Tyr342 is the only 
residue from the gate domain that contributes to uracil 
coordination (Fig. 7). 

Domain Organization 

The core and gate domains are approximately in 
parallel to each other with a planar interface (Fig. 8). 
Surprisingly, there are a large number of buried 
hydrogen bonds within the core domain, with the 
P-strands of TMH3 and TMHIO serving as the 
organizing center for these interactions. The side 
chain of Asn285 in TMHIO forms two hydrogen 
bonds with AsnSO in TMH2 and Ser273 in TMH9. 
The hydroxyl groups of Ser277 and Ser282 form an 
extra hydrogen bond between TMH9 and TMHIO. The 


hydroxyl group of Thr32 in TMHl makes two hydro¬ 
gen bonds with the side chains of Asp50 in TMH2 and 
Ser71 in TMH3. In addition, the imidazole nitrogens of 
His242 in TMH8 are hydrogen-bonded to Ser266 and 
Asn270 in TMH9. This network of interactions may 
help maintain the conformation of the extended loops 
in TMH3 and TMHIO and restrain the core domain 
TMHs from intra-domain movement. 

In contrast to the extensive polar interactions within 
the core domain, the inter-domain interface is 
populated mainly with hydrophobic residues from 
TMHs 1, 3, 8, 10 in the core domain and TMHs 5, 12 
in the gate domain (Fig. 8). The contacts are mostly 
located on the periplasmic side of uracil, suggesting an 
inward-facing conformation of the structure. Except 
the two uracil-binding Glu residues and His245, the 
substrate-facing surfaces of the core and gate domains 
are predominantly composed of hydrophobic residues, 
a feature that might be important for the transport 
mechanism of UraA. 
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Nucleobase-Cation- 
Symport-2 Family and the 
Uracil: Proton 
Symporter, UraA, 

Fig. 7 Uracil is coordinated 
by both polar (left) and van der 
Waals (right) contacts. 
Hydrogen bonds are shown in 
red dashes. Uracil is shown in 
yellow hall-and-sticks 
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Nucleobase-Cation-Symport-2 Family and the Uracii: Pro¬ 
ton Symporter, UraA, Fig. 8 Domain organization of UraA. 
Upper left panel. A periplasmic view is shown. The charged 
residues (Asp, Glu, Lys, and Arg) are shown in red, and polar 
residues (Ser, Thr, Asn, Gin, His, and Tyr) are shown in blue. 


Uracil is shown in white spheres. Upper right panel. The net¬ 
work of hydrogen bonds within the core domain. Lower panels: 
The substrate-facing surfaces of the core and the gate domains 
are shown. The corresponding residues that contact each other 
on the interface are labeled with the same colors 
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Nucleobase-Cation-Symport-2 Family and the Uracil: Pro¬ 
ton Symporter, UraA, Fig. 9 Sequence alignment of the E. 
coli UraA with NAT homologs from other species and organ¬ 
isms. Secondary structural elements of UraA are indicated above 
the sequence alignment. Invariant and highly conserved amino 


acids are shaded in red and orange, respectively. The residues 
that are hydrogen-bonded with uracil with their side chains and 
main chains are indicated by blue spheres and blue squares, 
respectively. The residues that bind to uracil with van der 
Waals contacts are indicated by orange squares 
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Nucleobase-Cation-Symport-2 Family and the Uracil: Proton Symporter, UraA 


conformational change of the gate domain 

^ 3 



gate gate 

domain domain 


substrate-free UraA 
outward-open 
{structure available) 


uracll-bound UraA 
Inward-facing 
(structure available) 


deprotonatlon 
uracil release 
2 



periplasm 


cytoplasm 


substrate-released UraA 
Inward-open 
(transient?) 


Nucleobase-Cation-Symport-2 Family and the Uracil: Pro- Glu245 are shown to emphasize their essential role in substrate 

ton Symporter, UraA, Fig. 10 A working model to illustrate recognition and proton translocation. For simplicity, only TM5 

the putative transport mechanism of UraA. Glu241, His245, and and TM12 are shown to represent the gate domain 


Putative Transport Mechanism 

UraA is a proton-coupled symporter, in which the trans¬ 
location of proton depends on the residues that are capa¬ 
ble of protonation and deprotonation. A remarkable 
revelation of the structure is the clustering of Glu241, 
His245, and Glu290 (Fig. 7), all of which are able to 
undergo cycles of protonation and deprotonation. In par¬ 
ticular, Glu241 and Glu290 are both directly committed 
to uracil recognition, whereas His245 may be involved in 
uracil binding through water-mediated hydrogen bond. 
This arrangement suggests that these residues may play 
a key role in both substrate recognition and proton trans¬ 
location. Only Glu241 is invariant among all NAT mem¬ 
bers. His245 is replaced by Asp, Thr, or Val in other NAT 
proteins, whereas Glu290 is substituted by Gin or Pro 
(Fig. 9). A closer examination, however, revealed func¬ 
tion-based conservation. Glu290 is conserved in the 
three known pyrimidine permeases RutG, PyrP, and 
SNBTl, suggesting a conserved mode of recognition 
for pyrimidine. In addition, His245 is conserved in 
RutG and PyrP, both of which are pyrimidine :proton 
symporters. By contrast, in the sodium symporters 
SNBTl, SCVTl, and SCVT2, His245 is replaced by 
a conserved Asp residue (Fig. 9). This analysis strongly 
suggests conserved mechanisms of transport, with 
Ftis245 likely playing a key role in proton symport and 
Asp playing a similar role in sodium symport. 

The structure of uracil-bound UraA represents an 
inward-open conformation. In order to take up uracil 
from the environment, UraA has to adopt at least one 
more conformation, that is, the one open to the 


periplasm. The distinct domain organization of UraA 
suggests a simple operation involving a rigid body 
rotation of the gate domain relative to the core domain 
centering the bound uracil in order to achieve the 
outward-open conformation (Fig. 10), although subtle 
conformational changes of certain residues such as 
Phe73 may be inevitable. 

It is noteworthy that the two Glu residues and His245 
are surrounded by hydrophobic residues (Fig. 9). There¬ 
fore, by analogy to LacY (Abramson et al. 2003), we 
speculate that these residues are likely to be neutral in the 
outward-open conformation of UraA, with Glu proton- 
ated. Based on this assumption, we propose a working 
model to explain the proton-coupling and symport mech¬ 
anism of uracil by UraA (Fig. 10). The default confor¬ 
mation of the substrate-free UraA may be outward-open 
and protonated. The absence of substrate binding makes 
closure of the gate domain onto the core domains 
energetically unfavorable, due to the two Glu residues. 
Upon binding of substrate, the gate domain undergoes 
a conformational change, achieving an inward-open con¬ 
formation as seen in the uracil-bound structure. A proton 
translocation is likely to occur among the Glu and His 
residues. Deprotonation may cause local conformational 
changes around Glu241, His245, and Glu290, and lead to 
the dissociation of uracil. An inward-open, deprotonated 
conformation is probably unstable, and thus only exists 
transiently. It quickly resumes the outward-open and 
protonated state. In the sodium symporters of the NAT 
family, the sodium ion, instead of proton, is required to 
neutralize the conserved Glu and Asp residues and for 
the binding of the substrate. 
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This model predicts that the core domain pro¬ 
vides the molecular basis for substrate selectivity 
and proton or sodium translocation, whereas con¬ 
formational changes of the gate domain allow 
transport of the substrate molecule. We recognize 
the speculative nature of this model, as many 
important questions remain unanswered. For exam¬ 
ple, what is the molar ratio between proton/sodium 
ion and the substrate molecule during each trans¬ 
port cycle? How do the conserved Glu and His 
residues trigger conformational changes during 
cycles of protonation and deprotonation? Many 
more biochemical experiments, as well as a high- 
resolution structure of substrate-free UraA, are 
required to address these questions. Nonetheless, 
the structural and biochemical characterizations of 
UraA reported here provide an important frame¬ 
work for the mechanistic understanding of the 
NAT family transporters. 
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Synonyms 

CA-imidazoline MDM2 antagonists 


Definition 


The tumor suppressor p53 plays a pivotal role in 
protecting us from cancer development (Vogelstein 
et al. 2000). It is a potent transcription factor which is 
activated following stress and regulates multiple genes 
implicated in cell cycle control, DNA repair, apoptosis, 
and senescence. Owing to its central role as a cellular 
gatekeeper, the p53 pathway is the most frequent target 
of genetic alterations in cancer. Approximately half of 
all human tumors express a dysfunctional p53 protein 
as a result of a mutation or deletion in its gene. 
In non-stressed cells, p53 levels are tightly regulated 
by another cellular protein, MDM2 (murine double 
minute 2). MDM2 controls p53 stability and activity 
through a feedback mechanism by which both proteins 
mutually regulate their cellular levels. p53 activates the 
transcription of mdm2 gene while the MDM2 protein 
binds to p53 and physically blocks its N-terminal 
transactivation domain. MDM2 is also a p53-specific 
E3 ligase which targets the tumor suppressor for 
ubiquitin-dependent degradation. This feedback loop is 
frequently dysregulated in cancer. Many tumors 
overproduce MDM2 protein and effectively impair p53 
function. Therefore, restoration of p53 activity by 
inhibiting the p53-MDM2 interaction represents an 
attractive new approach to cancer therapy. X-ray crys¬ 
tallography studies have shown that the transactivation 
domain of p53 forms an amphiphilic a-helix that projects 
Phe , Trp , and Leu residues into a relatively deep 
hydrophobic binding pocket on the MDM2 surface 
(Kussie et al. 1996). Nutlins are synthetic cis- 
imidazoline derivatives designed to bind tightly into the 
p53 pocket of MDM2 (Vassilev et al. 2004). They inhibit 
the formation of p53-MDM2 complexes in vitro with 
IC 50 in the 100-300 nM range. The crystal structure of 
MDM2-nutlin-2 complexes has revealed that nutlins 
project functional groups into the binding pocket that 
effectively mimic the interaction of the three p53 amino 
acids critical for the p53-MDM2 binding (Fig. 1). Only 
one of the two nutlin enantiomers can bind to MDM2 
with high affinity (e.g., nutlin-3a) while the other enan¬ 
tiomer (nutlin-3b) has 150-200-fold lower affinity and is 
thus practically inactive. Nutlins inhibit p53-MDM2 
binding in cancer cells, leading to stabilization of p53 
and activation of the p53 pathway, cell cycle arrest, and 
apoptosis. They have shown a remarkable selectivity for 
the p53-MDM2 interaction. Their effects have been 
observed only in cells expressing wild-type p53 but not 
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Nutlins, Fig. 1 X-ray structure of nutlm-2 bound to the p53 
pocket of MDM2 protein 


Cross-References 

► Ubiquitinylation 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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OCD 

► Oriented Circular Dichroism Spectroscopy 

ODC Mitochondrial Oxodicarboxylate 
Carrier 

► Mitochondrial Transport Protein Family 

OGC Mitochondrial Oxoglutarate Carrier 

► Mitochondrial Transport Protein Family 

OLID-FRET 

► Photochromic Forster Energy Transfer 

Oligomeric Proteins 

Richard C. Garratt, Napoleao Fonseca Valadares and 
Jose Fernando Ruggiero Bachega 
Institute of Physics of Sao Carlos, University of Sao 
Paulo, Sao Carlos-SP, Brazil 

Synonyms 

Protein oligomers; Protein-Protein interactions 


Definition 

Oligomeric proteins are those composed of more than 
one subunit (polypeptide chain). 

Introduction 

Oligomeric proteins, by definition, are composed of 
more than one subunit (polypeptide chain). As such, 
they possess a quaternary structure, generally consid¬ 
ered to be the highest level of organization within the 
protein structural hierarchy. This describes the way in 
which the various subunits are arranged one with 
respect to another. Oligomeric proteins may be com¬ 
posed either exclusively of several copies of identical 
polypeptide chains, in which case they are termed 
homo-oligomers, or alternatively by at least one copy 
of different polypeptide chains (hetero-oligomers). 

Oligomerization is the norm rather than the excep¬ 
tion and one estimate for proteins from Escherichia 
coli puts the percentage of monomeric proteins as low 
as 20% (Goodsell and Olsen 2000). Of the remaining 
80% of proteins which possess two or more subunits, 
homo-oligomers outnumber hetero-oligomers by approx¬ 
imately 4 to 1. However, historically this has not 
been reflected by depositions in the PDB where mono¬ 
meric proteins are overrepresented. This may be merely 
a distortion introduced by differences in the ease of 
crystallization in which case it is to be expected that it 
will be overcome with time (Jones and Thornton 1996). 
Caution should also be exercised in the interpretation of 
the PDB atomic coordinate files themselves since there is 
no direct correlation between the biologically active olig¬ 
omeric state and the contents of the asymmetric unit as 
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deposited in the databank (Berman et al. 2(X)0). The latter 
may contain less than the full oligomeric particle in which 
case the use of crystallographic symmetry will be neces¬ 
sary in order to generate the biologically active unit of 
interest. Alternatively, there may be more than one copy 
of the biologically relevant unit present and it is important 
to be able to distinguish between, for example, an asym¬ 
metric unit which contains two copies of a monomeric 
protein from one that contains a genuine dimer. The PISA 
server (Krissinel and Henrick 2007) is a generally reliable 
tool for predicting the correct oligomeric state of a protein 
from its atomic coordinates. 

More than one study has highlighted the prevalence 
of oligomers containing an even number of subunits 
(particularly dimers and tetramers) over those that 
contain an odd number (Klotz et al. 1975; Goodsell 
and Olsen 2000). This is a subtle point. It may be that 
since small odd numbers are also prime numbers these 
oligomers cannot be built up from others containing 
fewer subunits during evolution. Given that dimers and 
tetramers are particularly prevalent it may also be 
that isologous interfaces are favored over heterologous 
(see below). 

There Is no generally accepted classification or 
nomenclature for oligomers. However, proteins in gen¬ 
eral have been divided into two broad classes, globular 
and fibrous. This classification is well suited to the 
description of oligomers, as the vast majority is based 
on some form of symmetrical arrangement of subunits. 
If the symmetry is that of a closed point group then the 
oligomer will tend to be globular. If, on the other hand, 
it is based on helical (or screw axis) symmetry, then the 
result will be a filamentous structure which can extend, 
in principle, infinitely in both directions. Other types of 
filamentous oligomers are also possible, such as colla¬ 
gen which is based on three intertwining extended 
chains supercoiling around one another and septins 
which are based on nonpolar filaments possessing 
twofold and pseudo-twofold axes (see below) perpen¬ 
dicular to the filament axis. 

Types of Interfaces 

In globular homo-oligomers the subunits may interact 
with one another via two fundamentally different types 
of interface, isologous interfaces and heterologous 
interfaces (Monod et al. 1965). An isologous interface 
is the result of a twofold axis (a rotation of 180°) 
relating two identical subunits. Such a relationship 
implies that if a region A on subunit 1 interacts with 


region B on subunit 2, then necessarily region A on 
subunit 2 will interact with region B on subunit 1 
(Fig. 1). Only two possible arrangements based 
exclusively on twofold symmetry are possible. These 
correspond to homodimers which possess one twofold 
axis and homo-tetramers possessing three mutually per¬ 
pendicular twofold axes. 

Heterologous interfaces arise from higher-order sym¬ 
metry: threefold (120° rotation), fourfold (90° rotation), 
fivefold (72° rotation), etc. In this case, while region 
A on subunit 1 interacts with region B on subunit 2, 
the opposite is not so (Fig. 1). The result may be either 
a linear polymer or a closed circular structure based on 
cyclic point-group symmetry. If the subunits are approx¬ 
imately spherical then there is a tendency for the cavity at 
the center of the structure to increase in size with the 
number of subunits (Fig. Id). This may be used to form 
pores in membranes such as the protein secretory appa¬ 
ratus and the potassium channel which have six- and 
fourfold symmetry, respectively. 

Types of Point-Group Symmetry 

The vast majority of homo-oligomers can be described 
by one of three types of point-group symmetry: cyclic 
groups, dihedral groups, and higher-order symmetry 
groups (cubic or icosahedral). These can be described 
by use of either the International or Schoenflies point- 
group nomenclature (Hahn 1996). Cyclic groups, as 
described above, are characterized by a single rotation 
axis, which can be of any order, although very high 
orders tend to be rare. They form cyclic structures 
which present the same side of each subunit on the 
surfaces perpendicular to the rotation axis. They tend 
to form channels or pores and are suited to interacting 
with planar surfaces such as membranes as is the case 
of the fivefold related B subunits of heat-labile 
enterotoxin. 

Dihedral groups, besides having an «-fold principal 
rotation axis, also possess n twofold axes perpendicu¬ 
lar to it. This type of symmetry generates a greater 
variety of interfaces between subunits and these can be 
exploited for generating allosterism (see below). For 
example, it is far more common to encounter tetra- 
meric enzymes displaying 222 dihedral symmetry 
(also known as D2 in the Schoenflies nomenclature) 
than it is to belong to the cyclic point group 4 (or C4). 
The difference between the two is made apparent in 
Fig. 2. In the case of cyclic symmetry it is more 
difficult for diametrically opposite subunits to make 
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Fig. 1 Subunit interfaces. 

(a) Isologous interfaces are the 
result of twofold symmetry. 
Heterologous interfaces can 
lead to filaments (b) or to 
cyclic structures (c). The latter 
tend to form progressively 
larger central pores as the 
number of subunits 
increases (d) 



Isologous interface 


b 



Heterologous interface leading to 
a filamentous polymer 





Oligomeric Proteins, Fig. 2 Two alternative ways of being 
tetrameric. On the left 4 (or C4) cyclic symmetry and on the right 
222 (D2) dihedral symmetry 


direct contact with one another whereas it is common 
in the case of dihedral symmetry for each subunit to 
share interfaces with all others. Dihedral symmetry is 
often used to form hollow cylindrical cavities with 
entrances at both ends as observed in the proteasome 
and GroEL, both of which are based on 72 symmetry. 
In both cases layers of heptameric rings related by the 
perpendicular twofold axes are stacked on top of one 
another in a face-to-face fashion thus increasing the 


length of the internal cavity and even generating 
sub-cavities within (Voet and Voet 2010). 

Cubic symmetry is based on four threefold 
axes running in the direction of the body diagonals 
of a cube and icosahedral symmetry includes both 
threefold and fivefold axes. Both are ideal for the 
formation of approximately spherical shell-like struc¬ 
tures harboring an internal hollow cavity sheltered 
from the external milieu. 

Based on point-group symmetry Garratt and 
Orengo (2008) have produced a chart which, while 
not exhaustive, captures the diversity of arrangements 
adopted by globular oligomers (Fig. 3). Their classifi¬ 
cation is primarily based on the highest-order rotation 
axis of the oligomer, which increases on moving from 
left to right across the table. The background to each 
cell is used to distinguish between cyclic, dihedral, 
and higher-order point groups. One attractive feature 
of this table is that besides organizing structures in 
a rational manner, functional attributes of each of the 
selected proteins are also present. The octagon around 
each structure provides functional information which 
can be readily extracted by reference to the legend 
on the left. 
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Oligomeric Proteins, Fig. 3 A summary of oligomeric structures. The oligomeric table from the Protein Chart, reproduced with 
permission from Wiley-VCH 


Functional Attributes of Oligomers 

What are the advantages of oligomeric proteins from 
a functional point of view? Many have been identified 
and reviewed previously (Goodsell and Olsen 2000; 
Ali and Imperiali 2005). The Protein Chart uses 
a classification based on eight major functional classes. 

Size and Stability 

By accumulating a greater number of weakly stabiliz¬ 
ing interactions large proteins, and particularly 
oligomeric assemblies, gain stability. Furthermore, 
one of the major advantages of increased mass is the 
associated relative reduction in exposed surface area 
which diminishes the risk of degradation, including 
proteolytic attack. Large size also permits for building 
voluminous cavities such as those required by GroEL 
for engulfing unstructured polypeptide chains and 
accelerating their folding. On the other hand, for 


functions which require rapid diffusion and/or 
degradation such as extracellular hormones, small mono¬ 
meric proteins are ideal. 

Cooperativity and Allosterism 

By possessing more than one equivalent binding site, 
oligomeric enzymes have the potential to permit the 
communication between these, leading to a phenomenon 
known as cooperativity. Furthermore, the complexity of 
oligomers and their varied intersubunit interfaces allows 
for effector molecules to bind at sites distant from the 
active site and thereby induce conformational changes 
which affect activity. Flomotropic allosteric effectors 
influence the binding of the same molecule to a different 
subunit while heterotropic effectors affect the binding of 
a second ligand. The model proposed by Monod et al. 
(1965) suggests that the enzyme cycles between two 
extreme states (R and T, for relaxed and tense) both of 
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Oligomeric Proteins, Fig. 4 Aspects of oligomeric structures. 
(a) The conformational change from the R {red) to T {blue) state 
in glucosamine 6-phosphate deaminase involves alterations to 
the tertiary and quaternary structures including relative domain 
motions, (b) Aspartate transcarbamoylase is an example of 
a hetero-oligomer composed of six catalytic subunits {blue and 


green) and six regulatory subunits {cyan and red), (c) The trp 
repressor is a highly intertwined dimer and (d) cystatins are 
domain swapped, (e) Peanut lectin has one twofold axis {solid 
line), which applies to the tetramer, and two local twofold axes 
{dotted lines) relating the red subunit to the green and the blue 
subunit to the yellow, respectively 


which preserve the same symmetry but with different 
intersubunit interactions. The R and T forms of the 
cooperative enzyme glucosamine 6-phosphate deami¬ 
nase are shown in Fig. 4. In this case, as with most 
allosteric enzymes, there are alteration to both the ter¬ 
tiary and quaternary structure on cycling between the 
R and T-states. The principal advantage of cooperativity 
and allosterism is that they allow for fine control of 
enzyme activity as a function of the concentration of 
relevant intercellular components and thus the control 
of flux through metabolic pathways. 

Cavities, Channels, and Pores 

Cyclic point groups are particularly well suited to 
generate structures which have central pores or 
channels. Closed hollow cavities can be formed 
by higher-order point groups. Ferritin, for example, 
which is built from 24 subunits with cubic 432 
symmetry, is capable of sequestering over 4,000 
iron ions within its internal cavity, thus protecting 
the intracellular environment from redox damage. 
Icosahedral viral capsids, based on multiples of 60 
copies, form an effective way to protect the viral 
genetic material for transport and infection. 


Multiple Sites, Cross-Linking, and 
Membrane Association 

Immunoglobulins and many plant lectins are able to 
cross-link target molecules simply because they 
possess more than one binding site, a result of being 
oligomeric. For example, IgG type immunoglobulins 
possess two identical light chains and two identical 
heavy chains. One of each is necessary for the forma¬ 
tion of a single antigen-binding site and IgG molecules 
therefore possess two. If each site binds to a separate 
target molecule, such as membrane receptors located 
on different cells, this can lead to cross-linking and the 
formation of large aggregates. Alternatively, a series of 
binding sites present on different subunits of a cyclic 
oligomer could simultaneously bind to a series of 
receptors on the surface of the same cell leading to an 
increase in affinity. This is observed for cholera toxin 
and the heat-labile enterotoxin, which bind to specific 
glycolipids in the target cell membrane. 

Functional (Active) Site Formation 
Oligomerization provides the possibility of generating 
novel active site cavities at subunit interfaces and it has 
been commented that one sixth of oligomeric enzymes 
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present such features (Ali and Imperiali 2005). 
The HIV protease is dimeric with each monomer pro¬ 
viding one essential aspartic acid to the catalytic appa¬ 
ratus. In the case of allosteric enzymes, effector 
binding sites are often found to be similarly located 
at interfaces, as is the case in glucosamine 6-phosphate 
deaminase (Fig. 4). Sometimes effector binding can 
lead to a change in the oligomeric state itself, as is 
observed in phosphofructokinase 2 from E. coli. 

Rulers 

Goodsell and Olsen (2000) have pointed out that an 
ingenious application of oligomers is the ability to act 
as rulers by establishing a fixed distance between 
two active sites, something that would be impossible 
with independent monomers. A classic example is that 
observed in dimeric DNA-binding proteins (such as the 
regulators, repressor, and Cro, from the bacteriophage 
lambda), which recognize palindromic nucleotide 
sequences. This region of the double helix, therefore, 
possesses a local pseudo-twofold axis perpendicular to 
the main helical axis. This aligns with the twofold axis of 
the regulator protein on binding such that each monomer 
interacts with one half of the palindrome. For this to 
happen, the two halves of the palindrome must lie on 
the same side of the double helix and thus the separation 
between them corresponds to one turn of B-DNA 
(10 base pairs or 34 A). Furthermore, the separation 
between the two binding sites on the dimeric regulator 
must also correspond to this distance, as is observed. 
Thus the dimeric nature of the regulator serves to accu¬ 
rately establish an appropriate distance between equiva¬ 
lent binding sites. The trp repressor is a further example 
and appears in Fig. 4. 

The proteasome is responsible for cleaving 
substrate proteins into octapeptides for binding to the 
central groove of HLA type 1 molecules. Therefore, 
unlike most proteolytic enzymes, its specificity is not 
so much based on the recognition of a particular type of 
amino acid but rather on the size of its product. This is 
controlled by the spatial separation between active 
sites within the proteasome central cavity, which is 
a function of its oligomeric assembly (Fig. 3). It is an 
example of a proteolytic ruler. 

Multiple Functions 

It is common for the different types of subunit within 
a hetero-oligomer to play different roles. The enzyme 
aspartate transcarbamoylase possesses both catalytic 


and regulatory subunits (Fig. 4), the heat-labile 
enterotoxin from E. coli has both toxic- and receptor¬ 
binding subunits, and IgG molecules have both 
antigen-binding sites (formed at the interface between 
the heavy and light chains) and an Fc receptor-binding 
site (exclusive to the heavy chains). 

Economy of Genetic Material 

One of the principal advantages of oligomeric proteins is 
that large structures can be built from a limited amount of 
genetic material. Furthermore, it becomes easier to per¬ 
form quality control since subunits containing errors can 
be excluded from incorporation into the final oligomer. 
The extreme example of coding efficiency is seen in 
spherical viruses. Their reduced dimensions mean that 
there would be insufficient internal volume for housing 
enough genetic material to encode the capsid if this were 
made of a single polypeptide chain. A large number of 
copies of identical subunits (multiples of 60) provide the 
solution (Branden and Tooze 1999). 

The Nature of Intersubunit Interfaces and 
Structural Motifs 

Oligomeric proteins employ a wide range of interactions 
at their interfaces in order to guarantee their stability and 
many have dissociation constants within the nanomolar 
range. A classical intersubunit interface is formed by 
a central contiguous hydrophobic patch surrounded by 
hydrophilic residues and water molecules at the periph¬ 
ery. However, approximately two thirds of intersubunit 
interfaces show a large number of hydrophilic interac¬ 
tions and water-hlled cavities spread over the entire 
surface. It has been suggested that these may be remnants 
of an ancestral monomer which have survived as evolu¬ 
tion drove the protein in the direction of oligomerization. 
Many interfaces are relatively planar with approximately 
circular interacting surfaces which tend to bury 
>1,400 A^, at least in the case of obligate homodimers. 
They tend to show greater shape complementarity than 
transient complexes such as those of antibodies with 
their cognate antigens. Some dimers, such as the trp 
repressor (Fig. 4) are built from heavily intertwined 
monomers which are probably unstable in isolation. 
Others, such as the cystatin dimer (Fig. 4) are best 
described as domain swapped. 

Coiled coil motifs, such as leucine zippers, and 
helix-loop-helix motifs are both structural assemblies 
of a-helices used for dimerization (Fig. 5). Coiled coils, 
however, may also be used to form oligomers of 
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Oligomeric Proteins, Fig. 5 Oligomerization motifs. The 
leucine zipper (a) and the helix-loop-helix (b) are dimerization 
motifs. P-sheet strands are often used to stabilize oligomers, 
particularly those with cyclic symmetry such as the SmAP pro¬ 
tein involved in the formation of small nuclear rihonucleoprotein 
particles (c) and the staphylococcal a-hemolysin (d) 


higher order. The unsatished hydrogen-bonding potential 
of edge strands of P-sheets is also often used for stabiliz¬ 
ing oligomers. Transthyretin, the heat-labile enterotoxin, 
the archeal SmAP protein, and a-hemolysin (Fig. 5) 
are examples. However, this hydrogen-bonding potential 
is potentially dangerous (Richardson and Richardson 
2002). If not used to close a cyclic structure, the result 
will be polymerization leading to the formation of amy¬ 
loid. For this reason proteins have inbuilt negative design 
(such as edge strand distortion and coverage) in order 
to minimize the risk of forming these undesirable 
aggregates. 

Filamentous Oligomers 

Most filamentous oligomers are based on helical sym¬ 
metry. Actin filaments, microtubules, and the tobacco 
mosaic virus are examples. Figure 6 shows the RecA 
hlament which has 6i helical symmetry and wraps binds 
to both single- and double-stranded DNA. However, 
some nonpolar filaments do not present helical symme¬ 
try, rather they are the result of a series of twofold axes 
perpendicular to the hlament axis. Septins (Fig. 6) form 
hetero-oligomeric complexes which include both true 
and pseudo-symmetric twofolds. 






Oligomeric Proteins, Fig. 6 Filamentous oligomers, (a) 
A RecA polar filament which presents 6i helical symmetry and 
(b) an apolar septin hetero-filament with twofold {solid lines) 


and pseudo-twofold axes (dotted lines) indicated perpendicular 
to the filament axis 
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Oligomeric Proteins, Fig. 7 Dijferent oligomeric arrange¬ 
ments of hemoglobins, (a) Hemoglobin from the innkeeper 
worm has four identical subunits with 222 symmetry while 
human hemoglobin (b) has only one true twofold axis (solid 
line) and two pseudo-twofolds (dotted lines) relating the 
a-chains to the P-chains. Tetrameric clam hemoglobin is 


shown in (c) and a tetrameric unit from the giant hemoglobin 
from the earthworm (e) is shown in (d). In this case all four 
chains are different. Three such tetramers are related hy a local 
threefold axis (g). However, due to differences between the 
additional linker chains (green, purple, and orange) in fact this 
is really only a pseudo-threefold (f) 


Pseudosymmetry, Quasi-Symmetry, and 
Asymmetric Oligomers 

Pseudosymmetry may arise when an oligomer is 
composed of similar, but not identical, subunits. The 
familiar tetrameric hemoglobin molecule is composed 
of two a and two (3 chains which share approximately 
45% sequence identity. The tetramer has point group 
symmetry 2, but approximate 222 symmetry due to the 
similarity between the a and (3 subunits (Fig. 7). A dif¬ 
ferent variant on the theme is shown by the bacterial 
DNA polymerase III (Fig. 3) which is formally only 
twofold symmetric but has a pseudo sixfold axis due to 
the presence of three homologous domains within each 
subunit. 

Quasi-symmetry is observed in spherical viruses. 
These take advantage of inter-domain flexibility of 
the capsid proteins in order that these may occupy 
approximately identical local environments (Branden 
and Tooze 1999). 

Asymmetric oligomers are rare. However, occa¬ 
sionally oligomeric proteins will present a lower 


symmetry than is theoretically possible. For example, 
peanut lectin is composed of four identical polypeptide 
chains and would be expected to belong to one of 
two possible point groups in order to exploit its full 
potential in terms of symmetry (point groups 4 or 222). 
In fact the tetramer has only one twofold axis (point 
group 2). However, it also possesses two local twofold 
axes which apply separately to the two dimers which 
comprise the tetramer (Fig. 4). These do not lie 
perpendicular to the true twofold and the three axes 
therefore do not form a point group. The advantages of 
this arrangement in this particular case are unclear. 

Homologous Families of Oligomers 

The number of subunits and their arrangement are not 
always well conserved among members of a protein 
family. 7a-c shows three different arrangements of the 
four subunits of different tetrameric hemoglobins. 
Radically different interfaces are used in all three 
cases. A more dramatic example is the giant hemoglo¬ 
bin found in annelids (Fig. 7). The 144 globin chains 
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are based on 36 tetrameric units, which themselves are 
different from the three described above. Three tetra- 
mers are arranged around a local threefold axis and sit 
upon a trimer of homologous hut nonidentical linker 
chains reducing this axis to a pseudo-threefold. Twelve 
such mushroom-shaped units arranged into two six- 
membered rings present 622 symmetry. This 3.6 MDa 
complex is an extracellular protein whose size is impor¬ 
tant for avoiding loss from the hemolymph of the worm. 
Its arrangement allows for significant allosteric commu¬ 
nication between its subunits and represents a spectacu¬ 
lar example of many of the concepts described in this 
entry. 

Summary 

Oligomeric proteins are common and possess a series 
of advantages in terms of functional attributes. 
The vast majority are based on symmetric arrange¬ 
ments which may be described by either closed point 
groups, leading to globular oligomers with a finite 
number of subunits, or by helical symmetry involving 
a translational component which generates filaments. 
Oligomeric proteins have evolved under selective 
pressure which, in many cases, favors larger structures 
due to their intrinsic stability. Other advantages 
include cooperativity and allosterism, the formation 
of interfacial active sites, multiple functionality, and 
the formation of cavities, pores, and rulers as well as an 
overall economy in terms of genetic material. 

Cross-References 

► 3D Domain Swapping 

► Amyloid Formation 

► Macromolecular Crystallography: Overview 


References 

Ali MH, Imperiali B. Protein oligomerization: how and why. 

Bioorg Med Chem. 2005;13:5013-20. 

Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, 
Weissig FI, Shindyalov IN, Bourne PE. The protein data 
bank. Nucl Acid Res. 2000;28:235^2. http://www.rcsb. 
org/pdb. 

Branden C, Tooze J. Introduction to protein structure. 
New York: Garland; 1999. 


Garratt RC, Orengo CA. The protein chart. Weinheim: Wiley- 
VCH; 2008. 

Goodsell DS, Olsen AJ. Structural symmetry and protein func¬ 
tion. Annu Rev Biophys Biomol Struct. 2000;29:105-53. 

Flahn T, editor. International tables for crystallography, Vol. A, 
4th edn (rev). Dordrecht: Kluwer, 1996. 

Jones S, Thornton JM. Principles of protein-protein interactions. 
Proc Natl Acad Sci USA. 1996;93:13-20. 

Klotz IM, Darnell DW, Langerman NR. Quartemary structure of 
proteins. New York: Academic; 1975. 

Krissinel E, Flenrick K. Inference of macromolecular assemblies 
from crystalline state. J Mol Biol. 2007;372:774-97. 

Monod J, Wyman J, Changeux J-P. On the nature of allosteric 
transitions: a plausible model. J Mol Biol. 1965;12:88-118. 

Richardson JS, Richardson DC. Natural P-sheet proteins use 
negative design to avoid edge-to-edge aggregation. Proc 
Natl Acad Sci USA. 2002;99:2754-9. 

Voet D, Voet JG. Biochemistry. New York: Wiley; 2010. 


Oligosaccharide Analysis and 
Conformation 

► Techniques Applied to Glycan Structure and 
Conformation 


Oligosaccharide Stereochemistry 

► Anomeric Effect in Sugars 


Oligosaccharide-Protein Interactions 

► Lectins 


Oligosaccharides 

► Glycoproteins 


One-Dimensional Protein Structure 
Prediction 

► Protein Secondary Structure Prediction in 2012 








o 


1790 


ONIOM 


ONIOM 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 


Definition 

ONIOM stands for “Our own N-layered Integrated 
molecular Orbital and molecular Mechanics method.” 
This method can treat a molecular system in several 
layers, e.g., ah initio QM, semiempirical QM, and 
MM. The higher the level (typically related to higher 
computational expense and accuracy), the smaller the 
subsystem that is treated with it. The total energy of 
a system is then calculated by the total energy at 
the lowest level plus the energy of the subsystems 
at the higher levels and minus the equivalent energy 
of these subsystems at the lower levels. It is therefore 
referred to as a “subtractive approach.” 

For enzyme reaction modeling, the ONIOM 
approach is often used as a QM/MM method, which 
involves a straightforward “mechanical” embedding 
of the QM region in the MM environment. Initially, 
the method treated interactions between the QM and 
MM regions purely classically, with point charges 
representing the QM system for its interaction with 
the MM environment. This means that polarization of 
the QM region is not included. Further development 
has meant that polarization of the QM region by the 
MM atomic charges is now possible, through “electro¬ 
static embedding.” 

Cross-References 

► QM/MM Methods 

► Semiempirical Quantum Mechanical Methods 


Open-Probability 

► Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis 
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► Circular Dichroism Signals: Qualitative Description 
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Optical Analysis of Surfaces, Interfaces, 
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Ellipsometry 


Optical Fluorescence Microscopy 
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'Department of Nanophysics, Italian Institute of 
Technology (IIT), Genoa, Italy 
^Department of Physics, University of Genoa, 

Genoa, Italy 

^Institute of Biophysics, National Research 
Council, Genoa, Italy 

Synonyms 

Wide-held optical huorescence microscopy 

Definition 

It is a wide-held optical microscopy method utilizing 
huorescence as contrast mechanism. 

Basic Characteristics 

The wide-held huorescence microscope is a modihca- 
tion of the compound optical microscope, arranged to 
admit light of a selected wavelength to huorescently 
labeled specimens and to image them through the emit¬ 
ted radiation. Since huorescence emission has a lifetime 
in the nanosecond scale, continuous excitation is needed 
to collect an image having huorescence as contrast 
mechanism. In a classical wide-held optical huorescence 
microscope, the light source of a huorescent microscope 
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is a gas-vapor arc lamp, using either a discrete (Mercury) 
or a continuous (Xenon) light-emission spectrum. Con¬ 
tinuous- or pulsed-light-emitting lasers are now used in 
modem fluorescence microscopy beam-scanning tech¬ 
niques, as confocal laser-scanning microscopy and 
multi-photon excitation microscopy. In the most com¬ 
mon configuration, i.e., epifluorescence microscope, the 
excitation wavelength, selected by an excitation Alter, is 
reflected by a suitable dichroic mirror (dichromatic beam 
splitter), reaching the specimen through a focalizing 
objective. A careful regulation of excitation intensity is 
required to minimize the extinction of the fluorophore 
emission (photobleaching). The fluorescence radiation 
emitted by the fluorescent dyes is usually collected 
back by the very same objective used for excitation, 
transmitted through a dichroic mirror, spectrally selected 
from undesired wavelengths by an emission Alter (stop 
Alter), and finally focalized at the eyepiece image plane 
or sent to the sensitive target of a camera. In most 
epifluorescence microscopes, different combinations of 
excitation filters, dichroic mirrors, and emission filters 
are preassembled in an epifluorescence box. Resolution 
in a fluorescence optical microscope is limited by dif¬ 
fraction and can be estimated approximately 250 nm in 
the image plane (x,y) and 500 nm along the z-axis. 
Thanks to the very fast growth of the number and type 
of fluorescent molecules, fluorescence microscopy is 
currently the most popular microscopic technique for 
biomedical and biophysical applications. Huorescent 
dyes can be divided into two main families: morpholog¬ 
ical dyes, mainly addressed to evidence structural details 
of biological structures, and functional dyes, mainly 
addressed to reveal or measure physiological processes 
in tissues and cells in vivo. Modem fluorescence micro¬ 
scopes allow one to label cells with different types of 
fluorescent dyes simultaneously. In recent decades, some 
advanced techniques in optical fluorescence microscopy 
have been demonstrated. These techniques have been fur¬ 
ther improved with the advent of confocal laser-scanning 
microscopy and two-photon excitation microscopy. 

Among the advanced variations on the fluorescence 
microscopy methods the most utilized for probing molec¬ 
ular interactions are Fluorescence Resonance Energy 
Transfer (FRET) microscopy. Fluorescence Lifetime 
Imaging microscopy (FLIM), and Fluorescence Recov¬ 
ery After Photobleaching (FRAP). For several decades, 
detection of microscopic images has been committed to 
ordinary photographic cameras using emulsion-based 
films as storage medium. Thanks to the rapid 


improvement of photographic technology, in the last 20 
years electronic imaging has gradually replaced conven¬ 
tional photomicrography. Initially, cameras were video 
rate cameras. This kind of camera makes use of light 
detectors based on vacuum-tube technology and gener¬ 
ates an analog output signal that conforms to the industry 
standards RS-170, PAL, RGB, NTSC, etc. The electrical 
signal is usually recorded into analog signal storage 
devices as video recorders; however, to be sent to 
a computer for image analysis, it must be digitized by 
a frame grabber device. Actually, vacuum-tube-based 
detectors (photomultipliers) are used only in beam-scan¬ 
ning microscopy techniques. In recent years, video rate 
cameras have been replaced by cameras using a solid- 
state detector whose output is already a digital signal. 
Solid-state detectors are two-dimensional arrays of pho¬ 
todiodes mainly manufactured according to CCD 
(charge-coupled device) or CMOS (complementary 
metal-oxide semiconductor) technology. High-level per¬ 
formance is expected from scientific grade digital cam¬ 
eras. According to the application of interest, some main 
features have to be taken into account like the following; 
resolution, which mainly depends on the cross-sectional 
illuminated area of every photodiode (pixel); sensitivity 
mainly affected by the exposure time, the pixel area, and 
the quantum efficiency of the CCD; signal-to-noise ratio 
influenced by the readout noise, dependent on the CCD 
manufacturing, the electronics of the device, the readout 
speed, and thermal noise that can be reduced by cooling 
the sensitive chip of the camera; dynamic range being the 
ratio between the maximum light intensity observable 
without saturation of the CCD and the noise level. 

Cross-References 

► Confocal Laser Scanning Fluorescence Microscopy 

► Fluorescence and FRET in Membranes 

► Fluorescence Three-Dimensional Optical Imaging 

► Fluorescence; General Aspects 

► FRAP 

► Two-Photon Excitation Fluorescence Microscopy 
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Optical Spectroscopy: Future Fourier 
Transform Spectroscopy 
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Synonyms 

Infrared Spectroscopy; Femtosecond laser 


Definition 

Fourier transform spectroscopy with laser frequency 
combs is an emerging spectrometric technique without 
moving parts that allows to record broad spectral band¬ 
width spectra within very short acquisition times. 


Introduction 

Molecular spectroscopy proves an efficient tool for 
the determination of the structure of molecules, the 
quantitative analysis of complex mixtures, the investi¬ 
gation of dynamic systems, biomedical spectroscopy, 
micro-spectroscopy and hyperspectral imaging, and 
the study of many types of interfacial phenomena. To 
date, the overriding broad-spectral-bandwidth analytic 
spectroscopic instrument in the optical region of the 
electromagnetic spectrum with applications through¬ 
out the physical, chemical, and biological sciences has 
been the Michelson-based Fourier transform spectrom¬ 
eter (Griffiths and De Haseth 2007). Starting in the 
early seventies, Fourier transform spectroscopy has 
dramatically improved the quality of optical spectra 
and minimized the time required to obtain data. In 
addition, with constant improvements to computers, 
molecular spectroscopy has made further great strides. 
An optical spectrometer can be deployed in a large 
number of roles, such as minimally invasive medical 
diagnostics, environmental and workplace monitoring, 
industrial real-time process control, and even security 
applications. However, the design of scanning Michel- 
son-based Fourier transform interferometers has 
hardly evolved since and the instrument no longer 
meets some of the demanding capabilities of modem 
physics and sensing. In particular, the use of incoherent 
light sources limits the sensitivity, the resolution is set 
by the excursion of a mechanical moving mirror, and 
the moving-mirror displacement speed determines the 
recording time, which ranges between seconds and 
hours depending on the desired resolution. In recent 
years, a new approach to Fourier transform spectros¬ 
copy has been proposed and implemented. The 
resulting interferometers without moving parts take 
advantage of laser frequency combs and do not involve 
anymore the scanning Michelson interferometer. They 
have recently demonstrated an intriguing potential for 
dramatic advances in molecular spectroscopy. 

Laser Frequency Combs 

An optical frequency comb (Hansch 2006; Udem 
et al. 2002; Cundiff and Ye 2003; Hansch and 
Picque 2012) produces an optical spectrum, which 
consists of several millions of perfectly evenly spaced 
spectral lines. Most of the time, the periodic pulse train 
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Optical Spectroscopy: Future Fourier Transform Spectros¬ 
copy, Fig. 1 Time- and frequency-domain representation of the 
output of a laser frequency comb. Upper trace In the time 
domain, the output of a laser frequency comb consists of 
a periodic train of pulses. As the carrier wave moves with the 
phase velocity while the envelope moves with a different group 
velocity, the carrier wave shifts by Acp after each round trip with 
respect to the pulse envelope. Lower trace In the frequency 
domain, the spectrum (blue lines) of a frequency comb is 


broad, as the spectral span is inversely proportional to the pulse 
duration. The spectrum is centered at the earner frequency of the 
ultrashort pulses and consists of several hundred thousand per¬ 
fectly evenly spaced spectral lines. The comb lines in the radio¬ 
frequency region (red lines) show that the continuous shift of the 
carrier wave results in a frequency offset fo = Atp/x^, which 
prevents the comb from being comprised of exact harmonics of 
the pulse repetition frequency f. 


generated by femtosecond mode-locked lasers is used 
for producing very broadband frequency combs. In 
the time domain (Fig. 1), the output of such lasers is 
a sequence of pulses that are essentially copies of the 
same pulse separated by the laser cavity round-trip 
time. In the frequency domain (Fig. 1), the separation 
between two modes or comb lines is just equal to the 
repetition frequency f^. This remains true even if the 
pulses are not identical replicas but if a reproducible 
slip of the phase of the electromagnetic carrier wave 
relative to the pulse envelope from pulse to pulse is 
allowed for. Such phase slips occur in a laser because 
of dispersion in the cavity. The entire comb will then 
be shifted relative to the integer harmonics of the 
repetition frequency fr by a carrier-envelope offset 
frequency fo, that equals the net phase slip modulo 2 % 
per pulse interval. The frequency of a comb line with 
integer mode number n is then given by 4 = n f^ -F fo. 
The frequency of any comb line can be calculated from 
the two radio frequencies f^ and fo together with the 
integer mode number n. Such a comb acts like a ruler in 
frequency space that can, for instance, be used to 


measure a large separation between two different 
optical frequencies in terms of the pulse repetition 
frequency f^. 

The coherent nature of short pulses emitted by mode- 
locked lasers has hence led to a remarkable union of the 
two previously distinct fields of ultrafast optics and pre¬ 
cision spectroscopy. Frequency combs can conveniently 
link optical and microwave frequencies, and they pro¬ 
vide the long missing clockwork for optical atomic 
clocks. So far no fundamental limit of the accuracy of 
frequency combs has been discovered. By extending the 
limits of time and frequency metrology, they enable new 
tests of fundamental physics laws. Precise comparisons 
of optical resonance frequencies of atoms, ions, and 
molecules with the microwave frequency of a cesium 
atomic clock are establishing sensitive limits for possible 
slow variations of fundamental constants. Extensions of 
frequency comb techniques to new spectral regions from 
THz frequencies to the extreme ultraviolet are now under 
exploration and might open new spectral territories for 
precision spectroscopy. Emerging novel techniques for 
frequency comb generation include four wave mixing of 


































o 


1794 


Optical Spectroscopy: Future Fourier Transform Spectroscopy 



Optical Spectroscopy: Future Fourier Transform Spectros¬ 
copy, Fig. 2 Two nearly identical frequency combs, 1 and 2, 
are used: They have the same spectral and temporal properties, 
except that their line spacings (i.e., pulse repetition frequency) 
slightly differ. One of these combs, 1, is transmitted through the 


cell and heterodyned against the second comb, yielding a down- 
converted radio-frequency comb containing information on the 
absorption and dispersion experienced by each line of the comb 
1. Other implementations allow the two combs to interrogate the 
sample 


whispering gallery modes in microscopic toroidal optical 
resonators (Kippenberg et al. 2011). It has also been 
quickly realized that important applications in many 
areas of science and technology could benefit from 
these exquisite laser sources. In particular, frequency 
comb techniques are providing a key to attosecond sci¬ 
ence by permitting control of the phase of the electric 
field of ultrashort laser pulses. The calibration of astro¬ 
nomical spectrographs with laser frequency combs will 
permit sensitive searches for earth-like planets, and pre¬ 
cise interferometric distance measurements will allow 
new space missions with highly controlled formations 
of space vehicles. Here, we discuss the growing impor¬ 
tance of laser frequency combs in broadband molecular 
spectroscopy. Several approaches to molecular spectros¬ 
copy with frequency combs are being explored (Thorpe 
et al. 2006; Adler et al. 2010; Foltynowicz et al. 2011; 
Diddams et al. 2007; Mandon et al. 2009); this entry 
focuses on the technique of dual-comb spectroscopy. 

Dual-Comb Fourier Transform Spectroscopy 

The broad spectral bandwidth and the high-resolution 
structure of the frequency comb make it an attractive 
tool for broadband direct absorption molecular spectros¬ 
copy and fingerprinting. Dramatic advances in molecular 
science are foreseen for new spectroscopic methods 
based on frequency combs. Recent experiments of 
multi-heterodyne frequency comb Fourier transform 
spectroscopy (also called dual-comb spectroscopy) 


have demonstrated that the precisely spaced spectral 
lines of a laser frequency comb can be harnessed for 
the rapid and sensitive acquisition of highly multiplexed 
spectra of molecules and have therefore the potential to 
vastly enhance the range and capabilities of molecular 
laser spectroscopy. 

In an implementation of dual-comb spectroscopy 
(Fig. 2), an absorbing sample is interrogated by a fre¬ 
quency comb laser source. The information encoded by 
this interrogating comb then needs to be retrieved by 
a spectrometer. This is achieved by heterodyning the 
interrogating comb with a second comb, which serves 
as a reference: The two lasers beat against each other and 
the resulting interference signal is monitored as 
a function of time. It provides simultaneous and accurate 
access to a broad spectral bandwidth within a short mea¬ 
surement time and can physically be equally understood 
in terms of time-domain interferences, multi-heterodyne 
detection, optical free induction decay, linear optical 
sampling, or cross-correlation between two electric 
fields. In practice, the light transmitted by the sample is 
superimposed on a second frequency comb with slightly 
different repetition frequency. A single fast photodetec¬ 
tor then produces an output signal with a comb of radio 
frequencies due to interference between pairs of optical 
comb lines. In the frequency domain (Fig. 3a), the optical 
spectram is thus effectively mapped into the radio¬ 
frequency regime, where it becomes accessible to fast 
digital signal processing. In the time domain (Fig. 3b), 
the pulse train of the interrogating comb excites the 
absorbing sample at regular time intervals. A second 
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Optical Spectroscopy: 
Future Fourier Transform 
Spectroscopy, 

Fig. 3 Physical principle of 
frequency comb Fourier 
transform spectroscopy. The 
repetition frequency of the two 
lasers is respectively f,i and fr2 
and they differ by Af,« f,. 
The technique requires to keep 
constant the differences 
f,, - fr 2 and foi - fo 2 during 
the measurement or to monitor 
their variations to synchronize 
the data acquisition or to make 
a posteriori corrections, (a) In 
the frequency domain, the 
reference comb 2 with fr2 
line spacing acts as a highly 
multiplexed heterodyne 
receiver to generate a 
radio-frequency comb. The 
radio-frequency comb can be 
straightforwardly digitally 
processed, whereas in the 
optical domain, there are no 
means to directly count 
frequencies. As the comb 
parameters fri,fr 2 .foi.fo 2 ntay 
be also directly counted, 
measuring the radio-frequency 
comb makes it possible to 
a posteriori calibrate the 
optical frequency scale, (b) In 
the time domain, the reference 
comb FC2 pulse train slowly 
walks through the 
interrogating pulses from FCl 
to generate a measurement I(t) 
of the interrogating electric 
field. Each point of l(t) results 
from the interference between 
a pair of pulses. Only a few 
pairs of such pulses are 
represented for clarity 
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pulse train of different repetition frequency interferomet- 
rically samples the transient response or “free induction 
decay” of the medium, akin to an optical sampling oscil¬ 
loscope. The detector time-domain signal therefore 
carries, in addition to the signal from each laser at 
the pulse repetition frequency (which can easily be fil¬ 
tered out), the cross-correlation signal resulting from 


interference between pulse pairs from the two lasers. 
Here the phase correlations between successive laser 
pulses are crucial for reproducible sampling, even if the 
free induction decay happens on a time scale that is short 
compared to the time interval between two laser pulses. 

The first low-resolution proof-of-principle experi¬ 
ment has been performed in 2004 with unstabilized 
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mode-locked lasers (Keilmann et al. 2004), and a few 
groups have been contributing since in the THz (Yasui 
et al. 2005) or infrared region (Schliesser et al. 2005; 
Bernhardt et al. 2010a, 2010b; Coddington et al. 2008, 
2010; Ideguchi 2012; Giaccari et al. 2008; Baumann 
et al. 2011; Jacquet et al. 2009). 

These first implementations have demonstrated 
a very exciting potential of dual-comb spectroscopy 
without moving parts for ultrarapid and ultrasensitive 
recording of complex molecular spectra. Compared to 
conventional Michelson-based Fourier transform spec¬ 
troscopy, recording times could be shortened from 
seconds to microseconds, with intriguing prospects 
for spectroscopy of short-lived transient species or 
for hyperspectral imaging. The resolution improves 
proportionally to the measurement time. Therefore, 
longer recordings allow high-resolution spectroscopy 
of molecules with extreme precision, since the abso¬ 
lute frequency of each laser comb line can be known 
with the accuracy of an atomic clock. Selected exper¬ 
imental examples illustrate these features in the next 
paragraph. 

Experimental Illustrations of Dual-Comb 
Fourier Transform Spectroscopy 

An experiment (Jacquet et al. 2009) carried out in the 
telecommunication region may first be used to high¬ 
light an important benefit of dual-comb spectroscopy. 
Two 1,550-nm Er-doped fiber lasers emit ^90 fs 
pulses at a repetition frequency of the order of 
100 MHz and 20 mW average output power. The 
difference between the repetition rates of the two 
combs is set to a value ranging between 10 Hz and 
20 kHz. For Doppler-limited resolution, the combs can 
be let free running, while higher resolutions could only 
be reached with an active stabilization. The available 
spectral domain is limited by the Er-doped fiber oscil¬ 
lators to ^500 cm“*. A single comb interrogates the 
cell, which is filled with acetylene. The two comb 
beams are recombined with a 50-50 beam-mixer. 
They beat on a fast InGaAs photodiode and the electric 
signal is amplified and digitized. 

The upper part of Eig. 4 shows an experimental 
interferogram. Due to the slight mismatch between 
the constant repetition frequencies of the pulses of 
the two combs, the interferogram repeats itself at 
a period which is the inverse of the difference in the 


repetition frequencies of the two lasers. Strong bursts 
occur when pulses from the two lasers overlap. On one 
side of these bursts, the modulation of the interfero¬ 
gram, zoomed on in the second row of Eig. 4, is due to 
the molecular signatures. When a single comb interro¬ 
gates the sample, the resulting interferometer can 
indeed be viewed as the equivalent of a dispersive 
Fourier transform Michelson interferometer, in which 
the sample is placed in one arm of the spectrometer. 

The Fourier transform of a small part of the inter¬ 
ferogram time sequence reveals the spectrum. 
A spectrum of acetylene in the region of the Vi -F V 3 
overtone band spanning 470 cm”' is measured within 
a single recording of 42 ps, which brings a unapodized 
resolution of 0.18 cm” . For comparison, recording 
such a spectrum with a conventional Fourier transform 
spectrometer requires more than 10 s. The method 
indeed demonstrates, when compared to Michelson- 
based FT spectroscopy, a million-fold improvement in 
recording times at identical signal-to-noise ratio. 

Furthermore, due to the periodic structure of the 
interferogram, one can analyze spectral information 
at rates above 200 Hz for spectral resolutions at best 
equal to the comb line spacing. Techniques to increase 
the refresh rate by changing the difference in repetition 
frequencies of the two lasers have been proposed, 
resulting in an intriguing potential for time-resolved 
spectroscopy of single events. 

The sensitivity of molecular fingerprinting with 
dual-comb spectroscopy is dramatically improved 
when the absorbing sample is placed in a high-finesse 
optical cavity, because the effective path length is 
increased. When the equidistant lines from a laser 
frequency comb are simultaneously injected over 
a large spectral range into the cavity holding the gas 
sample, multiple trace gases may be identified within 
a few tens of microseconds. The cavity finesse deter¬ 
mines the enhancement factor for the intracavity 
absorption signal. Using femtosecond Yb-doped fem¬ 
tosecond fiber lasers that emit around 1.0 pm, weak 
overtone molecular transitions could be probed with 
high sensitivity. 

As an experimental demonstration, the rovibrational 
spectrum of ammonia, a molecule of astrophysical and 
environmental interests, is recorded in the region of the 
3vi band. The cavity finesse F > 1,200 enhances the 
effective absorption length to 880 m. In Fig. 5, the cavity 
transmission spans about 20 nm and the spectrum 
consisting of 1,500 spectral elements, with 4.5 GHz 
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Optical Spectroscopy: Future Fourier Transform Spectros¬ 
copy, Fig. 4 The upper part of the figure displays a typical 
interferogram. It reproduces periodically, with a period equal to 
1/Afr. Depending on the desired optical resolution, the Fourier 
transform (FT) of only a portion of this sequence is calculated. 
For instance, the FT of an interferogram of 42 ps duration is 


enough to resolve the Doppler-broadened profiles of the molec¬ 
ular sample, as shown on the lower part of the figure. The entire 
emission domain of the Erbium-fiber lasers is represented. The 
last row of the figure shows a zoom on the vl -t v3 band of 
acetylene, at 3 GFlz unapodized resolution, resulting from 
a 42 ps measurement without averaging 



Optical Spectroscopy: Future Fourier Transform Spectros¬ 
copy, Fig. 5 Cavity-enhanced spectrum of the crowded region 
of the 3vi overtone band of ammonia. These weak transitions are 


observed at high resolution for the first time to our knowledge 
and are of interest for the modeling of radiative transfer in the 
atmospheres of the giant Jovian planets 


resolution and a signal-to-noise ratio of 380, is measured 
within 18 ps. The weak overtone transitions of the 3vi 
band are rotationally resolved for the first time, to our 
knowledge. The minimum-detectable-absorption coeffi¬ 
cient a^in and the noise-equivalent absorption coeffi¬ 
cient at 1 s time averaging are 3 10“* cm“* and 
1 10~*° cm“' respectively. This proof-of- 

principle experiment (Bernhardt et al. 2010a) already 
demonstrates, with a 100-fold shorter measurement 


time, a amin coefficient, which is 20-fold better than the 
one reported in (Thorpe et al. 2006). The spectral band¬ 
width is presently limited by the Ytterbium amplifier. 
However, special mirror design managing dispersion 
has demonstrated to overlap the cavity modes and the 
comb components across 100 nm simultaneously and 
the multiplex spectrometer principle allows for the mea¬ 
surement of multi-octave spanning spectra. Therefore, 
a bandwidth of 100 nm should easily be achievable by 
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Optical Spectroscopy: 
Future Fourier Transform 
Spectroscopy, 

Fig. 6 Portion of a spectrum 
of acetylene in the region of 
the Pe(27) line of the vl + v3 
band with two different 
resolutions. In the upper trace, 
the resolution of 100 MHz is 
suitable for the Doppler- 
broadened profiles. In the 
lower trace, the comb lines are 
resolved (2.3 kHz linewidth, 
100 MHz spacing) and their 
intensities are shaped by the 
acetylene profiles 




the spectral broadening of the combs with highly 
nonlinear optical hbers. 

Frequency combs proved a revolutionary tool for 
frequency metrology. In dual-comb spectroscopy, 
increasing the measurement time and including in the 
Fourier transform calculation a sequence which com¬ 
prises more than two bursts (upper part of Fig. 4) 
enables to resolve the individual comb lines (Fig. 6). 
The resulting spectra (Jacquet et al. 2009) are sampled 
by combs of 100-MFIz line spacing that may cover 
domains of tens of nm with comb lines exhibiting kHz 
optical linewidth. A dense grid of accurate frequency 
markers therefore allows precise self-calibration of the 
wavenumber scale. However, the molecular spectra are 
sampled by the comb line spacing, with a step equal to 
the repetition frequency. As fr most often lies between 
50 MFlz and 200 MHz, most of molecular prohles at 
room temperature in the gas phase are appropriately 
sampled by the line spacing of the comb. In case better 
resolutions would be needed, interleaving techniques 
may sample the spectra down to the ultimate optical 
resolution imposed by the width of the comb lines. 

The demanding conditions on the stability of the 
pulse repetition frequency and carrier-envelope pulse- 
to-pulse phase slip presently impede the realization of 
the full capabilities of dual-comb spectroscopy for 
real-time applications. To record a distortion-free 
real-time interferogram, the timing jitter between 
subsequent pulses must indeed be kept lower than 
10 as. As a consequence, the interferograms may 
have to be averaged (Coddington et al. 2008; Baumann 
et al. 2011) over several seconds with combs stabilized 


against state-of-the-art cavity-stabilized continuous- 
wave lasers with a hertz-level line-width or a posteriori 
corrections (Giaccari et al. 2008) requiring multiple data 
acquisition channels and additional computational time 
may be performed, thus preventing fast acquisition rates. 
A new concept of real-time dual-comb spectroscopy 
that enables to record high quality spectra with free- 
running femtosecond lasers has been recently introduced 
(Ideguchi 2012). It does not require phase-lock electron¬ 
ics or a posteriori computer processing. The quality of 
the results far exceeds that of frequency comb systems 
stabilized against radio-frequency references and this 
new scheme, called adaptive dual-comb spectroscopy, 
might represent an important step toward the dissemina¬ 
tion of dual-comb spectroscopy to communities unfamil¬ 
iar with the sophisticated tools of frequency metrology. 

Conclusion and Outlook 

Dual-comb spectroscopy holds much promise for 
establishing the basis of a revolutionary spectroscopic 
tool: As with any Fourier transform spectrometer, the 
spectral span is only limited by the source and single 
detector spectral bandwidth. Overall consistency of the 
simultaneously measured spectral elements that may 
prove cmcial when, e.g., investigating unstable molecu¬ 
lar species is granted by the multiplex advantage. Much 
improved signal-to-noise ratios and reduced measure¬ 
ment times are provided by the high spectral radiance 
of frequency comb sources. The absence of moving parts 
overcomes the speed and resolution limitations of the 
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Michelson-based approach. The comb structure of the 
spectrum dramatically improves the detection sensitivity 
via optimal use of cavity-enhanced techniques. Self¬ 
calibration of the frequency scale of the spectra is achieved 
when the comb lines are resolved. Furthermore, compact 
high-resolution instruments may be designed and one may 
even envision a chip-size dual-comb spectrometer based 
on microresonators (Wang 2011) for real-time spectros¬ 
copy in the liquid phase. Dual-comb Fourier transform 
spectroscopy may be combined with, e.g., hyperspectral 
imaging, microscopy, or temporal resolution, while keep¬ 
ing their already demonstrated capabilities. Since laser 
frequency combs involve intense ultrashort laser pulses, 
nonlinear interactions can additionally be harnessed and 
the combination of coherent control and high-sensitivity 
broadband frequency comb spectroscopy might be 
envisioned. Frequency comb Fourier transform spectros¬ 
copy might therefore open up new insights in the under¬ 
standing of the structure of matter as well as new horizons 
in advanced nonintrusive instruments, for instance, in 
chemistry or biomedicine. However, dual-comb spectros¬ 
copy is still in its infancy and requires considerable efforts 
to express its full potential. Several technically challeng¬ 
ing issues, like the development of suitable comb sources 
in the “molecular fingerprint” mid-infrared and ultraviolet 
regions, have to be overcome. Signihcant insights and 
improvements are also needed before the technique can 
spread throughout scientific communities unfamiliar with 
advanced ultrashort laser and frequency metrology 
instruments. 

Summary 

The advent of laser frequency combs a decade ago has 
already revolutionized optical frequency metrology and 
precision spectroscopy. Extensions of laser combs from 
the THz region to the extreme ultraviolet and soft x-ray 
frequencies are now under exploration. Such laser 
combs have become enabling tools for a growing tree 
of applications, from optical atomic clocks to attosecond 
science. Recently, the millions of precisely controlled 
laser comb lines that can be produced with a train of 
ultrashort laser pulses have been harnessed for highly 
multiplexed molecular spectroscopy. Fourier spectros¬ 
copy, multi-heterodyne spectroscopy, and asynchro¬ 
nous optical sampling spectroscopy with frequency 
combs are emerging as powerful new spectroscopic 
tools. The first proof-of-principle experiments have 


demonstrated a very exciting potential of dual-comb 
spectroscopy for ultrarapid and ultrasensitive recording 
of complex molecular spectra. Compared to conven¬ 
tional Fourier transform spectroscopy, recording times 
could be shortened from seconds to microseconds, with 
intriguing prospects for spectroscopy of short-lived tran¬ 
sient species or for hyperspectral imaging. Longer 
recording times allow high-resolution spectroscopy of 
molecules with extreme precision, since the absolute 
frequency of each laser comb line can be known with 
the accuracy of an atomic clock. 

Cross-References 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 

References 

Adler F, et al. Cavity-enhanced direct frequency comb spectros¬ 
copy: technology and applications. Annu Rev Anal Chem. 
2010;3:175-205. 

Baumann E, et al. Spectroscopy of the methane V 3 band with an 
accurate mid-infrared coherent dual-comb spectrometer. 
Phys Rev A. 2011;84:062513. 

Bernhardt B, Ozawa A, et al. Cavity-enhanced dual-comb spec¬ 
troscopy. Nat Photonics. 2010a;4:55-7. 

Bernhardt B, Sorokin E, et al. Mid-infrared dual-comb spectros¬ 
copy with 2.4 pm Cr^'^:ZnSe femtosecond lasers. Appl Phys 
B-Lasers Opt. 2010b;100:3-8. 

Coddington I, Swann WC, Newbury NR. Coherent multiheterodyne 
spectroscopy using stabilized optical frequency combs. Phys 
Rev Lett. 2008;100:013902. 

Coddington I, Swann WC, Newbury NR. Coherent dual-comb 
spectroscopy at high signal-to-noise ratio. Phys Rev A. 
2010;82:043817. 

Cundiff ST, Ye J. Colloquium: femtosecond optical frequency 
combs. Rev Mod Phys. 2003;75(1):325^2. 

Diddams SA, Flollberg L, Mbele V. Molecular fingerprinting 
with the resolved modes of a femtosecond laser frequency 
comb. Nature. 2007;445(627-630):445. 

Foltynowicz A, Ban T, Maslowski P, Adler F, Ye J. Quantum- 
noise-limited optical frequency comb spectroscopy. Phys 
Rev Lett. 2011;107:233002. 

Giaccari P, Deschenes J-D, Saucier P, Genest J, Tremblay P. 
Active Fourier-transform spectroscopy combining the direct 
RF beating of two fiber-based mode-locked lasers with 
a novel referencing method. Opt Express. 2008;16:4347-65. 
Griffiths PR, De Flaseth JA. Fourier transform infrared spectros¬ 
copy. Hoboken: Wiley; 2007. 

Hansch TW. Nobel lecture: passion for precision. Rev Mod 
Phys. 2006;78:1297-309. 



o 


1800 


Optical Trap 


Hansch TW, Picque N. Frequency combs. In: Trager F, editor. 
Handbook of lasers and optics. 2nd ed. Springer Berlin: 
Heidelberg; (2012). 

Ideguchi T et al. Adaptive real-time dual comb spectroscopy. 
arXiv:1201.4177 (2012). 

Jacquet, P., et al. Frequency comb Fourier transform spectros¬ 
copy with kHz optical resolution. In: Fourier transform 
spectroscopy. Washington, DC: Optical Society of America; 
2009. paper FMB2, 3 p. 

Keilmann F, Gohle C, Holzwarth R. Time-domain mid-infrared 
frequency-comb spectrometer. Opt Lett. 2004;29:1542^. 
Kippenberg TJ, Holzwarth R, Diddams SA. Microresonator-based 
optical frequency combs. Science. 2011;332(6029):555-9. 
Mandon J, Guelachvili G, Picque N. Fourier transform spectros¬ 
copy with a laser frequency comb. Nat Photonics. 2009; 
3:99-102. 

Schliesser A, et al. Frequency-comb infrared spectrometer 
for rapid, remote chemical sensing. Opt Express. 2005; 
13(22):9029-38. 

Thorpe Ml, et al. Broadband cavity ringdown spectroscopy 
for sensitive and rapid molecular detection. Science. 
2006;311:1595-9. 

Udem T, Holzwarth R, Hansch TW. Optical frequency metrol¬ 
ogy. Nature. 2002;416:233-7. 

Wang CY et al. Mid-infrared frequency combs based on 
microresonators. arXiv:1109.2716 (2011). 

Yasui T, Saneyoshi E, Araki T. Asynchronous optical sampling 
terahertz time-domain spectroscopy for ultrahigh spectral 
resolution and rapid data acquisition. Appl Phys Lett. 
2005;87:06110L 


Optical Trap 

► Angular Optical Trapping 

► Bacterial Flagellar Motor: Biophysical Studies 

► Dual-Beam Optical Tweezers 

► Optical Tweezers 

► Single-Molecule Methods 


Optical Tweezers 

Ximiao Hou and Wei Cheng 

Department of Pharmaceutical Sciences, University of 
Michigan, Ann Arbor, MI, USA 

Synonyms 

Bacterial flagellar motor: biophysical studies; Laser 
trap; Optical trap; Single-molecule methods 


Definition 

Optical tweezers are focused laser beams that can trap 
and manipulate microscopic objects contact-free in 
three dimensions (3D). 

Introduction 

Photons carry momentum and can therefore exert 
forces on matter. Although the typical force generated 
by photons is very small, the range of force (10~*^ N, 
pN) is significant for microscopic objects and macro¬ 
molecules at the nanometer scale. Optical tweezers 
exploit this fundamental property to trap micron¬ 
sized objects in a potential well formed by light and 
allow manipulation of these objects in 3D. More 
importantly, optical tweezers can be used as quantita¬ 
tive tools to directly measure force and displacement 
once they are coupled with sensitive measurements of 
light. Since the first demonstration of stable optical 
trapping of micron-sized dielectric objects by Ashkin 
and coworkers (Ashkin et al. 1986), optical tweezers 
have found increasing use in biophysical investiga¬ 
tions. The continued development of optical tweezers 
over the years has made them a highly sensitive and 
quantitative tool in biophysics to manipulate and mea¬ 
sure macromolecular interactions, one molecule at 
a time. 

Principles of Optical Trapping 

An optical trap is formed by focusing a laser beam 
to a diffraction-limited spot with a high Numerical 
Aperture (NA) objective. As photons carry momen¬ 
tum, a dielectric object situated in the light path will 
experience a force that pushes the object in the direc¬ 
tion of light propagation due to scattering of photons 
by the object. At the same time, the electric field of the 
light wave induces transient dipoles in the dielectric 
object. The interactions between these transient 
dipoles and the electric field give rise to a force that 
aligns in the direction of the field gradient. When the 
laser beam is focused through an objective, the direc¬ 
tion of the field gradient points toward the focus of the 
laser beam where the laser beam has the highest inten¬ 
sity. This gradient force is proportional to both the 
electric field gradient and the polarizability of the 
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dielectric object, and counteracts the scattering force 
so that the particle can be stably trapped in 3D very 
close to the laser focus. 

To understand the interaction between light and 
a dielectric particle quantitatively, microscopic objects 
have been classified into three regimes based on their size; 

Particle Diameter >> Laser Wavelength X 

In this regime, ray optics can well describe the origin of 
trapping forces. As shown in Fig. 1, a laser beam of 
Gaussian intensity profile is focused through a positive 
lens, and a bead is located downstream of the focused 
beam slightly off the optical axis. Consider the bright ray 
on the right that propagates through the head, it will 
change its direction upon exiting the bead due to refrac¬ 
tion. Because the refractive index of the bead is higher 
than that of the surrounding media, the exiting ray will 
point away from the optical axis. This net change in 
photon momentum suggests that the bead exerts a force 
on the laser beam in the direction of the momentum 
change. According to Newton’s third Law, the bead 
will experience a reacting force of the same magnitude 
but in opposite direction, exerted by the photons (indi¬ 
cated by the red thick arrow). The same analysis applies 
to the dim ray on the left. As a result, there is a net force 
acting on the bead that points toward the focus of 
the laser beam, which gives rise to the trapping force. 
Conversely, if the bead has a lower refractive index than 
the surrounding media, it will be pushed away from the 
laser focus instead of being stably trapped. 


Laser in 


Dim ray 


Force 



Change of Photons 


Laser out 

Optical Tweezers, Fig. 1 Basic principle for optical tweezers 
to trap a particle in the ray optics regime (particle diameter d > > 
laser wavelength k). Incident laser beam is assumed to have 
a Gaussian intensity profile, as indicated by the gradient color 
shaded along the positive lens. Light rays of higher intensity are 
noted using thicker blue lines 


Rayleigh Particle: Diameter << \ 

In this regime, ray optics no longer applies. The inter¬ 
action of the particle with photons can be approxi¬ 
mated by the interaction of a transient dipole with 
the electromagnetic wave of the light. The scattering 
force, as described above, is proportional to the scat¬ 
tering cross section of the particle and can be calcu¬ 
lated as follows (Harada and Asakura 1996): 


zn2 87i(ka)"^a^ /m^ — 1 


p — 

^ sral — 


( 1 ) 


The gradient force can be calculated from the time 
average of the Lorentz force exerted on the transient 
dipole by the electromagnetic field as follows: 




( 2 ) 


where z is the unit vector in the beam-propagation 
direction, n 2 is the refraction index of the surrounding 
medium, m is the effective index ni/n 2 , where ni is the 
refraction index of the particle, a is the particle diam¬ 
eter, k is the wave number InjI, where L is the laser 
wavelength, and I is the beam intensity. 


where P is the dipole moment, E is the instantaneous 
electric field, and c is the speed of light in vacuum. 
As shown in (2), the direction of the gradient force 
aligns with the direction of the beam intensity 
gradient. 

The majority of microscopic objects used in optical 
trapping have a size range that is comparable to the 
wavelength of the trapping light. The exact theoretical 
treatment is much more complex, and simple analyti¬ 
cal solutions for scattering and gradient forces are not 
yet available. 
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Optical Tweezers, Fig. 2 A schematic of an optical tweezers 
setup. A laser beam is expanded by a telescope (Tl). It is then 
deflected by a mirror (Ml) and sent through a second telescope 
(T2) to further expand the beam. After this final beam expansion 
and deflection by a dichroic mirror (M2), the objective 01 
focuses the beam to a diffraction-limited spot to form an optical 
trap at the center of a microfluidic chamber (MC). A second 


Optical Tweezers Setup 

Figure 2 is a schematic diagram of a single-beam gradi¬ 
ent optical tweezers setup. It contains the following 
essential components: a laser to provide the light source 
for forming the optical trap, a telescope to expand the 
laser beam to overfill the back aperture of an objective 
(01) to make the trap strong, and the front objective (01) 
to focus the laser beam to form the optical trap. For 
quantitative detection and measurement of optical twee¬ 
zers, a back objective (02) is typically used to collect 
light exiting from the optical trap and a photodetector 
(PD) is used for quantitative measurement of the light. 
Building on this basic concept of optical trapping, 
more complex optical tweezers can be designed and 
constmcted. For example, a dual-trap optical tweezers 
can be constructed by splitting the single beam using 
a polarizing beam splitter to produce two orthogonally 
polarized beams. Alternatively, multiple optical traps 
can be created by either time-sharing of a single laser 
beam through the use of acousto-optic deflectors (AOD) 
(Dame et al. 2006) or splitting of a single beam by 
diffraction optical elements (DOE) to form arrays of 
optical tweezers, named “holographic optical tweezers 
(HOT)” (Dufresne and Grier 1998). In addition, the 
wavelength of the laser should be carefully selected 


objective (02) is used to collect the light exiting from the optical 
trap. The exiting beam is deflected by a third miiTor M3 
and projected onto a photodetector (PD) through a relay lens 
(RLl). A light-emitting diode (LED) is installed to provide the 
bright field illumination (blue lines) for the MC. The image of 
the MC is projected onto a CCD camera (CM) through a tube 
lens (TL2) 


because photochemical effects have been demonstrated 
for certain wavelengths, which is especially important 
for biological applications. 


Displacement Detection 

Lateral motion of a trapped object at the sample 
plane will produce a laser intensity shift at the back 
focal plane of the objective 02, which arises from 
the interference between the forward scattered light 
and the unscattered light that differ by 90° in their 
phases. When the movement of the object is small 
compared to the radius of the beam focus, this 
spatial shift in laser intensity is proportional to the 
displacement of the trapped object away from its 
equilibrium position and, therefore, can be used 
to measure the lateral displacement of the object. 
In practice, a position-sensitive photodetector is 
placed at a location that is conjugate to the back 
focal plane of the objective 02. By monitoring the 
laser intensity shift on the photodetector, lateral 
position of the trapped particle can be determined 
with nanometer precision (Gittes and Schmidt 
1998a). 
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Optical Tweezers, Fig. 3 Optical tweezers behave as a 
Hookean spring 


Force Measurement 

A particle in an optical trap experiences a restoring 
force when there is a displacement. The value of the 
force is proportional to the displacement, i.e., an opti¬ 
cal trap behaves as a Hookean spring to the trapped 
particle with a spring constant k (Fig. 3): 

F=-kx (3) 

where x is the vectorial displacement of the particle. 
When k is known, the force can be calculated accord¬ 
ingly. The spring constant of the trap (also called 
stiffness) is dependent on a number of parameters that 
include laser power, numerical aperture of the objec¬ 
tive, particle size, and its refractive index (Simmons 
et al. 1996). Stiffness values of 0.005-1 pN/nm have 
been reported throughout the literature. 


Measurement of Trap Stiffness 

Numerous methods have been developed in the liter¬ 
ature to measure the stiffness of an optical trap. 
Among these, two methods have been frequently 
used. 

1. Calibration using viscous drag of a fluid. Trap 
stiffness can be derived by applying an external 
viscous drag force on the bead and measuring its dis¬ 
placement. This is typically achieved through applying 



Bead displacement (nm) 

Optical Tweezers, Fig. 4 The Stokes force experienced by 
a bead in an optical trap was determined using Stokes’ Law. 
The displacement of the bead in the optical trap was measured 
using video microscopy. The slope of the curve defines the 
stiffness of the optical trap 


a laminar flow to the solution, where the particle is 
being trapped. For a solution of viscosity p that flows 
at a rate of v, the viscous drag experienced by a trapped 
spherical particle of radius r can be determined by 
Stokes’ Law: 


Fstokes = 67tprv (4) 

By adjusting the flow rate, a series of forces Fstokes 
can be applied and the resulting displacement x of the 
bead be measured. Trap stiffness can be derived simply 
from the slope of the Fstokes-T curve (Fig. 4). 

When the viscous drag force exceeds the maximum 
force that the trap can exert on the bead, the bead will 
no longer stay in the trap but move together with the 
flow. The force at which this occurs is termed trap 
“escape force”, which can be measured by increasing 
the flow speed until the bead is carried away by the 
flow. Typically, the trap escape force increases linearly 
with laser power (Simmons et al. 1996). 

2. Calibration by Brownian motion. The stiffness of 
an optical trap can also be obtained by measuring the 
Brownian motion of a trapped bead. A trapped bead 
undergoes damped Brownian motions, and its move¬ 
ment in solution is governed by three forces: the ran¬ 
dom thermal force F(t), the trap restoring force kx, and 
the viscous drag force ydx/dt, where y is the drag 
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Frequency (Hz) 

Optical Tweezers, Fig. 5 Power spectrum of a trapped bead. 
The comer frequency can be estimated by fitting the power 
spectrum with (6) 


coefficient and dx/dt is the instantaneous velocity. The 
Langevin equation for such a system can be written as 
follows: 


7 


df 

dt 


-\-kx = F(t) 


(5) 


The power spectrum of the bead position Sx(/) can 
be derived by applying a Fourier transformation to 
both sides of (5) (Gittes and Schmidt 1998b) and is 
defined as follows: 


Sx(/) 


kflT 

+P) 


( 6 ) 


where Icb is the Boltzmann constant, T Is the absolute 
temperature, and fc defines the corner frequency of the 
trap as/.. = kl2nj. 

Figure 5 shows a typical power spectrum of 
a trapped bead, which can be divided into two regimes: 
when/<</c, the power spectrum is almost constant; 
when f»fc, the power spectrum instead decreases 
with a rate of 1//^. Once the value of /c is found, the 
stiffness can be determined with k — Injfc, where j can 
be calculated as y = 67ir|r for a bead with known radius 
r in a medium with viscosity T) . A more complex equa¬ 
tion for the bead Brownian motion power spectrum has 
been derived to correct for nonideal conditions, for 
example, finite sampling and friction in an incompress¬ 
ible fluid (Berg-Sorensen and Flyvbjerg 2004). 


Applications of Optical Tweezers 

Since the invention of optical tweezers by Ashkin and 
coworkers, this contact-free manipulation technique 
has found broad applications in physics, chemistry, 
and biology. Optical tweezers can be used to manipu¬ 
late and position small objects such as viruses, bacte¬ 
ria, eukaryotic cells, as well as cellular organelles. 
With optical tweezers, single cells or micron-sized 
particles can be positioned, transported, and even 
sorted (MacDonald et al. 2003). 

The remarkable features of optical tweezers to 
directly apply and measure mechanical force in the 
piconewton range have been employed to study the 
mechanical properties of cell membranes. For exam¬ 
ple, a red blood cell was stretched through two attached 
beads by optical tweezers, and large deformations 
were captured under various applied forces. The mem¬ 
brane shear modulus can thus be measured using this 
technique. 

Coupled with sensitive measurement of light, opti¬ 
cal tweezers have become an important tool to study 
the biochemical and biophysical properties of motor 
proteins and biopolymers at the single-molecule 
level. To manipulate these single molecules using 
optical tweezers, micron-sized beads are usually used 
as mechanical handles: molecules are attached to 
a bead through complementary biochemistry, and 
then manipulated and measured through the bead. 
Three major types of single-molecule experiments 
have been developed. Type one is a surface-based 
experiment. As shown in Fig. 6a, one end of a single¬ 
molecule tether is attached to a bead that is held in an 
optical trap, while the other end of the molecule 
attaches to a substrate surface. This scheme has been 
used to follow cytoskeleton motor proteins and nucleic 
acids motor proteins. The disadvantage of this experi¬ 
mental geometry is stage drift that is independent of 
the laser trap, which has to be subtracted to reach high 
spatial resolution (Carter et al. 2009). Type two is 
a pipette-based experiment. As shown in Fig. 6b, one 
end of a single-molecule tether is attached to a bead 
held by an optical trap while the other end is attached 
to a bead held on top of a micropipette. This type of 
experiment can be carried out deep in solution, and 
thus potential concerns from being close to a surface 
are eliminated. Still, noise arises from the drift of the 
micropipette that is independent of the optical trap. 
Type three is a dual-beam optical tweezers. As shown 
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Laser beam h Laser beam q Laser beam 



Optical Tweezers, Fig. 6 Schematics of single-molecule optical tweezers experiments 


in Fig. 6c, two traps are formed from a single laser beam 
split by polarization. Because the entire molecule is 
optically levitated in solution, stage drift becomes 
a minimal concern (Abbondanzieri et al. 2005). In addi¬ 
tion, as these two traps are formed using the same beam 
source, noise of common origins, for example, beam¬ 
pointing instability can be effectively canceled out 
by conducting measurements along a difference coordi¬ 
nate (Moffitt et al. 2006). These developments have 
pushed the spatial resolution of optical tweezers to the 
subnanometer regime. 

Different single-molecule systems have been stud¬ 
ied using the experimental schemes described above. 
These systems range from molecular motors to single 
biopolymers. 

Molecular motors. Optical tweezers have been 
widely used to track the movement of motor proteins, 
including cytoskeleton motor protein families and 
nucleic acid translocases. One of the best studied cyto¬ 
skeleton motor proteins is kinesin. Kinesins move on 
microtubules and are important for cargo transport 
inside the cell. One experimental scheme is to attach 
the motor protein to a micron-sized bead that is trapped 
by optical tweezers with high precision (Fig. 6a). 
Block and co workers measured the step size of Kinesin 
to be 8 nm (Svoboda et al. 1993), which offered one of 
the first examples that biological motors move with 
regular steps. This result agrees well with the 8 nm 
spacing of each tubulin dimer. A unique feature of 
optical tweezers is the ability to measure the opposing 
force at which the motor will stop its movement. The 
stall force of a single kinesin was measured to be 7 pN 
(Visscher et al. 1999). Myosin, on the other hand, has 
an average step size of 11 nm and stall forces of 3-5 pN 
(Finer et al. 1994). 

The broad families of nucleic acid translocases have 
been studied using optical tweezers (Bustamante et al. 


2011). They include DNA and RNA polymerases, 
helicases, DNA packaging motors, ribosomes, and 
many other nucleic acids translocases. Although these 
proteins carry out different biological functions inside 
the cell, the common property they share is the ability 
to move on DNA or RNA molecules. High-resolution 
optical tweezers enable real-time observations of single 
motor-protein activity. The important questions that can 
be addressed include the velocity of movement, trans¬ 
location step size, processivity, conformational dynam¬ 
ics, and physical mechanisms that underlie the basis of 
movement. For example, Bustamante and coworkers 
carried out systematic studies on the DNA packaging 
motor of bacteriophage 4>29 (Bustamante et al. 201 1). 
The velocity, stall force, step size of the motor, 
and potential mechanisms of force generation by the 
motor, and its interaction with DNA have all been quan- 
tihed using optical tweezers. Block and coworkers used 
dual-beam optical tweezers with subnanometer spatial 
resolution to measure the step size of RNA polymerase 
(RNAP), using the experimental design schematically 
shown in Fig. 6c (Abbondanzieri et al. 2005). RNAP 
transcription changes the extension of the DNA in 
between the two beads, and the step size of RNAP was 
measured to be one base pair at a time. Many other 
nucleic acids motors such as DNA polymerases, DNA 
or RNA helicases have also been studied by optical 
tweezers. With the development of optical tweezers 
down to the subnanometer resolution regime, more 
exciting discoveries on motor movement are expected. 

Elasticity of nucleic acids. Elasticity is an important 
property of biopolymers. The elasticity of double- 
stranded DNA (dsDNA) directly affects the physiologi¬ 
cal states of DNA such as wrapping around histones, 
loop formation, and DNA packaging. As a quantitative 
tool for single-molecule manipulation, optical tweezers 
have been used to measure the force-extension curves of 
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double- as well as single-stranded DNA (ssDNA), from 
which important parameters such as persistence length 
can be extracted. One of the pioneering works in this area 
was carried out hy Bustamante and coworkers (Smith 
et al. 1996) using the design shown in Fig. 6 b. Single 
DNA molecule was stretched by moving the pipette 
away from the laser focus. The end-to-end extension of 
the DNA and the force on the molecule were measured. 
The persistence length of dsDNA was determined to be 
193 bp in 150 mM NaCl solution. Interestingly, when the 
force is increased to 65 pN, the molecule undergoes 
a sudden cooperative transition in which the extension 
of the DNA is lengthened to 1.7 times that of a standard 
B-DNA form. The exact nature of this transition is 
a matter of ongoing research. Binding of DNA by pro¬ 
teins or intercalating ligands changes the interphosphate 
distance or the persistence length of the molecule and, 
thus, changes the force-extension curve of dsDNA. Opti¬ 
cal tweezers have become a powerful tool to study DNA- 
binding proteins and intercalating ligands, shedding light 
on the mechanism of protein or ligand action. 

Single-molecule folding and unfolding. Optical 
tweezers are a facile tool to study folding and unfolding 
of proteins or nucleic acids. The reaction coordinate of 
a folding/unfolding transition is well defined when 
a single protein molecule or nucleic acid is held on its 
two extremities and force is applied to induce the struc¬ 
tural transition. Extensive studies of this kind have been 
performed on RNA structures (Tinoco et al. 2006) and 
model proteins (Cecconi et al. 2005). The force at which 
these molecules unfold, the pathway of folding and 
unfolding, and the distance to the transition state can 
all be mapped or measured using optical tweezers. In 
addition, a characteristic hopping phenomenon unique 
to single-molecule observations was revealed when the 
molecule was held at a tension close to the mechanical 
unfolding force. 

Summary 

The invention of optical tweezers together with the 
development of this technique over the years have 
brought about many interesting and exciting discover¬ 
ies in biophysics that were not possible to achieve 
before. Optical tweezers are a unique tool that allows 
biophysicist to study force, displacement generated by 
single biomolecules, and probe the inner working of 
a molecule at unprecedented scale and resolution. 


Compared to ► magnetic tweezers, optical tweezers 
can provide higher spatial resolution (0.1-2 nm) and 
higher force (200 pN). In addition, clever use of 
nonspherical particles or nanofabricated quartz cylin¬ 
ders allows one to apply ► torque with optical twee¬ 
zers (Deufel et al. 2007). Compared to the ► atomic 
force microscope (AFM), which can supply higher 
forces (10"* pN), optical tweezers offer higher spatial 
resolution with lower stiffness. The unique capability 
of optical tweezers to manipulate microscopic objects, 
apply and measure mechanical force, and quantify 
displacement at subnanometer resolution will continue 
to offer much excitement in biophysics. Notably, 
optical tweezers also expand their ability through 
a combination with ► single-molecule fluorescence 
technique. Using this hybrid technique, one can not 
only manipulate and measure molecules as described 
above, but can also image the molecules simulta¬ 
neously through a fluorescence signal from dye labels. 
Combined with microfluidic techniques, researchers 
can control and trigger biochemical reactions while 
manipulating single molecules. The development of 
these hybrid techniques along with laser trapping will 
continue to make optical tweezers a powerful and 
precise tool in biophysical research. 

Cross-References 

► Angular Optical Trapping 

► Atomic Force Spectroscopy 

► Bacterial Flagellar Motor: Biophysical Studies 

► Dual-Beam Optical Tweezers 

► Force-Fluorescence Spectroscopy 

► Laser Processing of Biomaterials and Cells 

► Magnetic Tweezers 

► Microfluidics for Single Molecule Detection 
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► Fourier Transform Infrared Photoacoustic Spectros¬ 
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ORC1 Human Ornithine Carrier Isoform 1 

► Mitochondrial Transport Protein Family: Structure 


ORC1, ORC2 Mitochondrial Ornithine 
Carriers 

► Mitochondrial Transport Protein Family 


ORC2 Human Ornithine Carrier Isoform 2 

► Mitochondrial Transport Protein Family: Structure 
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Definition 

The Order of Reaction refers to the power dependence of 
the rate on the concentration of each reactant. Thus, for 
a first-order reaction, the rate is dependent on the concen¬ 
tration of a single species. A second-order reaction refers 
to one whose rate is dependent on the square of the 
concentration of a single reactant (e.g., in a homo¬ 
dimerization reaction, A -i- A ^ A 2 ) or the combined 
first-order dependence on the concentrations of two dif¬ 
ferent reactants (A -i- B ^ C). The order of reaction is an 
experimentally determined parameter and can take on 
a fractional value. This is distinct from the molecularity 
(or stoichiometry) of the reaction which is the theoretical 
integer value of the number of molecules involved in the 
reaction. For simple one-step reactions, the order and 
molecularity should have the same value. A classic exam¬ 
ple of fractional order is the simple enzyme scheme 
under steady-state conditions; E-i-S«ES—>E-i-P. 
Here, the rate of production of product P is first order with 
respect to E and first order with respect to S at low 
concentrations of S. At high concentrations of S the 
reaction becomes limited by the second step which 
shows first-order dependence on E but zero order with 
respect to S. At intermediate concentrations of S the 
reaction will show between zero- and first-order depen¬ 
dence on [S]. 
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The words “order of reaction” may also be used to 
refer to the mechanism of a reaction. Thus for an 
enzyme operating with two substrates, A and B, 
a binary complex may form with either substrate 
(to form EA and EB respectively) which will react 
with the other substrate to form the ternary complex 
(E.A.B) in a so-called random-order reaction. Alterna¬ 
tively, one substrate may bind first (to form EA) 
and the second substrate will only bind to this binary 
complex to form E.A.B in a so-called ordered sequen¬ 
tial reaction. In this context the “order of reaction” 
could be summarized as A binds before B. 


Cross-References 

► Kinetics: Overview 

► Steady-State Enzyme Kinetics 


Ordering 

► Infrared Spectroscopy of Membrane Lipids 


Organophosphorus Hydrolase 

► Phosphotriesterase - Computational Studies 


Orientation of Protein Segments 
Determined by Polarized Infrared 
Spectroscopy 

► Polarized Infrared Spectroscopy 


Oriented Circular Dichroism 
Spectroscopy 

Alison Rodger 
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Coventry, UK 

Synonyms 

OCD 


Definition 

Oriented circular dichroism (OCD) spectroscopy, as 
the name implies, is the circular dichroism spectros¬ 
copy of oriented samples. It can be a very useful 
technique; however, it is prone to artifacts resulting 
from linear dichroism contributions to the observed 
spectrum (Norden et al. 2010). Thus care must be 
taken in collecting and analyzing the data. A sample 
for OCD should be symmetric about a unique orienta¬ 
tion axis. Data must then be collected with the light 
propagating along that unique axis and spectra 
measured at different rotations about the unique axis 
compared. If the rotated spectra are the same, then 
these data can be believed. If the sample is perfectly 
oriented (as it would be in a perfectly oriented crystal 
but probably in no other method), the observed spec¬ 
trum is then equivalent to three times (no rotational 
averaging) the CD spectrum of all the transitions in an 
isotropic sample except those polarized along the 
unique axis (which are invisible). 

To understand OCD, consider the CD spectrum of 
an a-helical peptide inserted, perpendicular to the sur¬ 
face, into a lipid bilayer that has been dried onto 
a quartz plate. Each helix is oriented as illustrated in 
Fig. la. The membrane normal is the unique axis. 
Since the 208-nm component of the lowest energy 
7i-7t transition is polarized along the helix axis, if the 
sample orientation is perfect, the 210-nm region of the 
spectrum will have no OCD intensity as the light is 
propagating along the direction of its polarization. So 
an a-helix inserted into a membrane will have 
a negative 222-nm CD band and a positive ^192-nm 
band. To illustrate this, the spectrum of the a-helical 
peptide melittin (a bee venom peptide whose spelling 
should be mellitin from the Latin word mellitus mean¬ 
ing “honey”) is shown in Fig. lb. In our experiment, 
the negative 210-nm band is attenuated but not absent 
indicating that the orientation is not perfect. We also 
note the usual positive band of solution phase a-helices 
that occurs at ^192 nm has shifted to longer 
wavelength due to less cancellation from the 
attenuated 210-nm negative band. 

If instead of being inserted into the membrane, a helix 
lies parallel to the surface of the membrane, then the 
transitions polarized along the helix axis (208 nm in 
this case) will have full OCD intensity and those perpen¬ 
dicular to the helix axis (e.g., 222 nm and 192 nm) 
will have half the maximum possible OCD intensity 
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Oriented Circular Dichroism Spectroscopy, Fig. 1 

(a) Transition polarizations of an a-helix. (b) Oriented circular 
dichroism spectrum of melittin in DMPC (dimyristoylphospha- 
tidylcholine) at 1-100 peptide to lipid molar ratio. A sample of 
melittin and DMPC co-dissolved in chloroform was added to 


a quartz window in a drop-wise fashion, dried in a flow of 
nitrogen, and left overnight at room temperature to slightly 
hydrate the sample. The final spectrum is the average of the 
four similar spectra obtained at 0°, 90°, 180°, and 270° 
(Damianoglou et al. 2010) 


(assuming they are randomly oriented on the surface). 
Thus, the 208-nm band will be comparatively enhanced 
for surface helices rather than diminished as is the case 
for inserted peptides. In using OCD it is important, as 
with linear dichroism, to recall that orientation is 
unlikely to be perfect. 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Circular Dichroism Spectroscopy: Units 

► DNA-Ligand Circular Dichroism 

► DNA-Ligand Flow Linear Dichroism 

► Electro-optics and Macromolecular Hydrodynamics 

► Far UV Protein Circular Dichroism 

► Linear Dichroism 

► Linear Dichroism Spectra - Measurement 

► Linear Dichroism Spectroscopy: Theory 

► Near UV Protein CD 

► Nucleic Acid Linear Dichroism 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 

► Protein Circular Dichroism Analysis 
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Oriented-Sample NMR of Membrane Proteins: Sensitivity 
Enhancement and Spectroscopic Assignment, Fig. 1 (a) 

REP-CP pulse sequence and its comparison with the conven¬ 
tional CP and CP-MOIST magnetization-enhancement tech¬ 
niques. (b) Magnetically aligned Pfl phage (parallel 


orientation with respect to the main magnetic field), (c) Pfl 
coat protein reconstituted in magnetically aligned bicelles 
{perpendicular alignment). A 25% gain is obtained using REP- 
CP for the Pfl phage, whereas a 45% gain is obtained for the 
more dynamic membrane-bound form of Pfl coat protein 


Definition 

Sensitivity enhancement increases the signal-to-noise 
ratio for the nuclear magnetic resonance (NMR) 
signals of the low-gamma spin-hearing nuclei. 

Spectroscopic assignment is the mapping of the 
locations of the spin-hearing nuclei within a molecule 
onto the corresponding resonances within its NMR 
spectrum. 

Oriented-sample NMR is the area of solid-state 
NMR where the spin-bearing molecules are aligned 
uniaxially with respect to an ordering axis. 

Membrane proteins are imbedded into biological 
membranes, and perform a variety of functions includ¬ 
ing: transport, energy transduction, signaling, pathway 
activation, and energy conversion in the living cells. 

Basic Characteristics 

Solid-state NMR of macroscopically aligned samples 
(or oriented-sample, OS NMR) has demonstrated its 


capability of providing remarkable details about 
the conformations of membrane proteins in their 
native-like, fully hydrated lipid environment. Notable 
examples include gramicidin, M2 domain of the 
acetylcholine receptor, M2 domain of the influenza 
A virus, fd and Pf phage coat proteins, phospholamban, 
Vpu (from HIV-1), and a mercury transporter protein 
MerF. When the protein molecules are aligned 
uniaxially, the proton resonances cannot be directly 
resolved since the high-gamma proton (or /) spins are 
strongly coupled to each other via the dipolar spin-spin 
interactions, thus considerably broadening their reso¬ 
nances. Therefore, only dilute low-gamma (or S) spins 
can be detected, which is inherently connected with the 
problem of low sensitivity. Usually, the magnetization 
enhancement for the dilute spins is achieved by trans¬ 
ferring the magnetization from the protons to the nitro¬ 
gen or carbon spins using the cross-polarization (CP) 
method (Ernst et al. 1987; Stejskal and Memory 1994). 
Further improvements of the method include, but 
are not limited to, ramped CP (Metz et al. 1994), 
CP-MOIST (Levitt et al. 1986), frequency-modulated 
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Oriented-Sample NMR of Membrane Proteins: Sensitivity Enhancement and distinguish the cross-peaks from the main peaks. Red boxes show the (r,; -i- 1) connec- 
Spectroscopic Assignment, Fig. 2 Spectroscopic assignment of the magnetically tivities between the resonances, (b) spin exchange experiment allows one to 

aligned Pf 1 phage. Two different pulse sequences can be used to obtain complementary cross-validate the assignments obtained in part A, especially for the crowded and/or low- 
assignment information, (a) Spin-exchanged variant of PISEMA (blue contours) over- intensity spectral regions. Spectroscopic assignment for residues R44-A36 is shown 
laid with the conventional PISEMA experiment (magenta contours) allows one to here. Others can be established in a similar manner 
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CP (Kim et al. 2004), and CP-COMPOZER (Fukuchi 
et al. 2009). An additional roadblock to atomic-resolution 
stmcture determination is obtaining sequence-specific 
assignments for the resolved NMR resonances. While 
the “shot-gun” approach (Marassi and Opella 2002) can 
be used to assign OS NMR spectra and obtain struc¬ 
tural constraints, it relies on the laborious preparation 
of multiple selectively isotopically labeled samples, 
and is generally restricted to residues in a-helical 
conformations. Proton-driven spin diffusion can 
be used to establish nearest-neighbor correlations 
between the resonances, but this technique generally 
requires long mixing times (up to several seconds), 
which may considerably increase the overall time of 
the experiment. 

In this entry, we describe two recent methods, 
which address the above two issues of critical impor¬ 
tance for OS NMR of membrane proteins - sensitivity 
enhancement and spectroscopic assignment. 

Sensitivity Enhancement 

Only a part of the overall magnetization can be trans¬ 
ferred from the abundant proton bath to the dilute spins 
in a single cross-polarization contact. This is due to 
a combination of the thermodynamic bounds (finite 
ratio of the number densities for the high- and low- 
gamma spins in proteins) and the quantum mechanical 
bounds on spin dynamics. Furthermore, Hartmann- 
Hahn mismatches due to inhomogeneity of the 
radiofrequency (rf) fields, profein and bilayer 
dynamics, rf heating, and Tip relaxation can further 
reduce the amount of the transferred magnetization 
from the high-gamma to the low-gamma spin reservoir 
in fully hydrated biological samples. Instead of trans¬ 
ferring the proton magnetization over a long (several 
milliseconds) contact, short (several hundred of micro¬ 
seconds) repetitive CP (REP-CP) contacts during the 
preparation period can be employed (Tang and 
Nevzorov 2011). Thermal re-equilibration intervals 
(Z-filters) lasting for a fraction of a second {<Tiz) are 
inserted between each CP contact (cf. Fig. la) to 
reestablish the proton magnetization. The REP-CP 
scheme is found to bring the two spin reservoirs 
close to the spin temperature corresponding to the 
intact proton bath, thereby increasing the final amounf 
of the transferred magnetization in lossy biological 
samples. Figure 1 shows a comparison of REP-CP 
with the conventional CP and CP-MOIST schemes 
for the case of a parallel orientation (Pfl phage. 


part B) and perpendicular orientation (Pfl coat pro¬ 
tein reconstituted in magnetically aligned bicelles, 
part C). 

Spectroscopic Assignment 

Assignment of the OS NMR peaks is achieved by 
establishing spin-spin correlations between the amide 
*^N nuclei corresponding to the adjacent, i.e. (/, i + 1), 
amino-acid residues within the backbone of 
a membrane protein. The establishment of the spin- 
spin correlations is obtained by transferring the Zee- 
man order of the dilute spins into the dipolar order of 
the proton bath and then back to the Zeeman order 
using the inverse cross-polarization under grossly 
mismatched Hartmann-Hahn (MMHH) conditions 
(Fevitt et al. 1986; Knox et al. 2010). The cross¬ 
peaks are usually established within several millisec¬ 
onds (e.g., 5 ms). During the MMHH exchange, the 
proton magnetization is kept along the z-axis, which 
does not interfere with the dilute-spin exchange in the 
doubly tilted rotating frame. It has been found that for 
membrane proteins reconstituted in magnetically 
aligned bicelles (perpendicular orientation of the align¬ 
ment axis with respect to the main magnetic held, Bq), 
the optimal mismatch for the rf B i held amplitudes of 
the '^N and 'H channels is approximately 5 kHz, 
whereas when the alignment axis is parallel to the Bq 
held, the (/, ; -i- 1) correlations are established at the 
10 kHz Hartmann-Hahn mismatch. While long-range 
cross-peaks, e.g. (f, i + 2), may arise, they are relatively 
uncommon. Figure 2 shows two-dimensional correla¬ 
tion spectra of Pfl phage for the cases of the PISEMA 
(Wu et al. 1994) experiment, (part A) and N-N dilute- 
spin exchange (part B). Spectroscopic assignment of 
the backbone amide resonances (Knox et al. 2010) can 
be established as indicated. Note that the above REP- 
CP sequence (Fig. la) can be used instead of the CP 
sequence to enhance the signal-to-noise ratios for both 
the main peaks and cross-peaks. 

Cross-References 

► INEPT 

► Membrane Proteins: Structure and Organization 

► Multidimensional NMR Spectroscopy 

► NMR 

► Protein NMR - Introduction 

► Protein NMR Resonance Assignment 

► Spin Diffusion 
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Synonyms 

Free radical damage; Oxidative damage; Redox 
imbalance 

Definition 

Oxidative stress (OS) is a general term used to describe 
an imbalance in the relative levels of prooxidants and 
antioxidants in which the levels of prooxidants are in 
excess. 

General Overview 

Under conditions of normal physiologic homeostasis, the 
production of prooxidants is kept in balance by equal or 
excess amounts of antioxidants. A state of oxidative 
stress occurs when this balance is perturbed in favor of 
excess prooxidants. An antioxidant is any molecule that 
delays, prevents, or removes oxidative damage to a target 
molecule. They are usually present in low concentrations 
compared to the amount of oxidizable substrates. 
Prooxidants are substances that generate oxidants or 
inhibit antioxidant systems. Oxidants are molecules 
that gain electrons in redox chemical reactions and 
promote the oxidation of target molecules. Biological 
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oxidants relevant to oxidative stress are ► reactive oxy¬ 
gen species (ROS), reactive nitrogen species (RNS), and 
transition metals. Since the production of cellular oxi¬ 
dants is a natural part of aerobic metabolism, organisms 
have evolved mechanisms to deal with these byproducts 
through specialized proteins, enzyme systems, and die¬ 
tary antioxidants. Oxidative stress not only refers to 
conditions of excess oxidant production, but it may also 
indicate diminished levels of antioxidants or antioxidant 
defense systems. Oxidative stress ensues when the 
steady-state levels of oxidants exceed the cellular capac¬ 
ity to either remove these toxic substances or rapidly 
repair the damage caused by their production (Halliwell 
and Gutteridge 2007). 

Prooxidants 

Formation of Reactive Oxygen Species (ROS): Molec¬ 
ular oxygen (O 2 ) is necessary for the survival of many 
life forms yet it can also be a toxic molecule that is 
directly oxidizing. Oxygen-mediated damage comes 
from the ability of O 2 to accept electrons with the 
subsequent formation of ROS. During normal respira¬ 
tion, O 2 acts as the terminal electron acceptor in the 
► mitochondrial electron transport chain. This four- 
electron reduction of O 2 by the enzyme ► cytochrome 
c oxidase generates two molecules of water. It has been 
estimated, however, that 1 - 2 % of all oxygen consumed 
by the mitochondria is partially reduced to ROS. This 
occurs under certain pathological (and normal) condi¬ 
tions at various sites along the electron transport chain. 
The reduction of O 2 by one to three electrons results in 
the formation of several reactive species including super¬ 
oxide ( 02 ~), hydrogen peroxide (H 2 O 2 ), and hydroxyl 
radical (‘OH), respectively (Eq. 1). 

02^02^ H 2 O 2 ^ -OH ^ H 2 O (1) 

These ROS have varying degrees of reactivity. Super¬ 
oxide can either act as a reductant or an oxidant, both of 
which may contribute to oxidative stress. Superoxide 
production is not only an unintentional by-product of 
mitochondrial respiration, it is also synthesized deliber¬ 
ately. Regulated 02 ~ production is accomplished by 
specialized membrane-bound enzymes such as those in 
the NAPDH oxidase family (Brown and Griendling 
2009). These enzymes are designed to produce 02 ~ by 
transferring electrons from NADPH to O 2 . 


Hydrogen peroxide, the two-electron reduction 
product of O 2 , can be generated enzymatically or 
via the dismutation of superoxide. Hydrogen peroxide 
is not particularly oxidizing but in the presence of 
reduced transitions metals, such as copper (Cu"^*) and 
iron (Fe"^^), extremely oxidizing hydroxyl radical or 
hypervalent metal complexes can be formed via the 
Fenton reaction (Eq. 2). 

Fe^+ + H 2 O 2 ^ Fe^+ -f •OH -f OH^ or Fe^+ = O 

( 2 ) 

ROS can also be formed through a process called 
redox cycling which can occur during xenobiotic 
metabolism. This is when a substrate becomes enzy¬ 
matically reduced to a free radical. This unstable free 
radical then transfers an electron to O 2 forming 02 ~ 
and generating the initial oxidized enzyme substrate. 
Because the enzyme substrate is continually reduced 
and oxidized it is not consumed during the cycle. The 
net result is a futile cycle that generates 02 ~ at the 
expense of oxidized coenzymes. The enzymes involved 
in redox cycling are, in general, flavoenzymes that use 
NAD(P)H as the electron source. 

Hypochlorous acid (HOCl, produced by the enzyme 
myeloperoxidase), singlet oxygen (* 02 ), and ozone 
(O 3 ) are other potential sources of ROS-induced oxi¬ 
dative stress. 

Formation of Reactive Nitrogen Species (RNS): 
Under biological conditions the most common RNS 
involved in oxidative stress are nitric oxide (‘NO), nitro¬ 
gen dioxide (•NO 2 ), nitrous anhydride (N 2 O 3 ), nitroxyl 
(HNO), peroxynitrite (ONOO“), peroxynitrous acid 
(ONOOH), and in some cases, nitrite (N 02 ~). Each of 
these compounds has distinctive properties in terms of 
reactivity, half-life, solubility, and biologic activity. The 
central link that relates these RNS to one another is their 
common precursor which is the short-lived free radical 
signaling molecule •NO. Nitric oxide is generated bio¬ 
logically by the enzyme ► nitric oxide synthase (NOS) 
which uses O 2 and arginine as substrates. Under normal 
conditions •NO serves a variety of physiologic functions 
in the cardiovascular, neurological, and immune systems 
(Thomas et al. 2008). Nitric oxide is not a good oxidant. 
However, the intermediates formed from the reactions of 
•NO with ROS generate strongly oxidizing RNS. For 
example, the reaction of •NO with 02 ~ is extremely 
rapid and forms peroxynitrite (ONOO“). This non¬ 
radical has a pKa of 6.8 and becomes easily protonated 
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at physiologic pH to form peroxynitrous acid (ONOOH). 
Peroxynitrous acid has two routes of decomposition. It 
can isomerize to form the unreactive nitrate (NOs”) or it 
can homolyze into two oxidizing radicals: •NO 2 and 
•OH (Eq. 3). 

N03^ + H+^ ONOOH ^•N02 +-OH (3) 

Furthermore, ONOO” decomposition is catalyzed 
by carbon dioxide (CO 2 ) which is present in high 
concentrations under biological conditions. The reac¬ 
tion of CO 2 with ONOO~ leads to NOs”, •NO 2 , and 
the formation of another potent one-electron oxidizing 
radical, carbonate anion (•COs”) (Eq. 4). 

NOs” -f CO 2 ^ ONOO“ -f CO 2 ^ "NOz -f * 003 “ 

(4) 

Since •NO and Oz” are often generated simulta¬ 
neously at sites of local inflammation, both ONOO” 
and ONOOH will be formed at these locations as well. 
This is important in terms of oxidative stress because 
the rapid reaction of •NO with Oz” to form ONOO”/ 
ONOOH results in the net conversion of two relatively 
mild oxidants (•NO and Oz”) into three strongly oxi¬ 
dizing ones (•OH, •NOz, and •COz”) (Eqs. 3 and 4) 
(Forman et al. 2003; Ignarro 2010; Lancaster 1996). 

Nitrite is a relatively stable end-product of •NO 
metabolism, yet under certain circumstances it can be 
oxidized by one electron to form the reactive free 
radical •NOz. This can occur by several mechanisms, 
such as when NOz” is acted upon by ► peroxidases or 
when it is oxidized by metals. Although NOz” is not 
the normal substrate for these enzymes, under specific 
circumstances they can be effective generators of 
•NOz in the presence of NOz” nnd HzOz. Free and 
heme-bound iron, which may be elevated in certain 
disease states, can catalyze •NOz formation from 
NOz” and HzOz (Thomas et al. 2002). 

N 2 O 3 is not a free radical, but it is a reactive nitro¬ 
gen species formed either as an intermediate during the 
reaction of •NO with O2 (autooxidation) or in the 
reaction of •NO with ONOO”. It can also be produced 
by the dimerization and dehydration of nitrous acid 
(HNOz) which is the protonated form of NOz”. This 
latter reaction can occur in specialized cellular organ¬ 
elles or organ compartments where there is a reduction 
in pH. N 2 O 3 is known as a nitrosating species because 
of its ability to transfer a nitrosonium equivalent (NO"^) 


to a nucleophile, most commonly a thiol or an amine. 
Nitrosated proteins can result in either a gain or loss of 
function which can exacerbate oxidative stress. 

Transition Metals: Due to the unique reactivity of 
metals and by virtue of their ability to participate in 
oxidation reduction reactions, metal ions are found in 
many biological systems. This reactivity makes them 
important components of the active site of proteins, yet 
it also indicates they can take part in potentially dele¬ 
terious chemical reactions. For this reason metals are 
usually tightly sequestered in storage proteins or cova¬ 
lently attached to metal-binding proteins in various 
forms. Protein-bound iron is present mostly in the 
form of ► iron-sulfur clusters or as a component of 
heme. The two most important metal ions associated 
with oxidative stress are iron and copper. Only a small 
fraction of total cellular iron is free or loosely bound. 
This labile iron pool, as it is called, is redox active and 
can freely participate in oxidant-generating reactions 
such as the Fenton reaction (Eq. 2). In these cases, iron 
would be considered a prooxidant since it is acting as 
a reductant and inducing the subsequent formation of 
oxidants. This is why certain pathological conditions 
are caused or exacerbated by increases in free iron and 
copper. In fact one of the consequences of excess free 
radical production, such as •NO and O 2 ”, is the sub¬ 
sequent release of metal ions which further perpetuates 
oxidative damage. 

Other sources of oxidative stress include radiation, 
altered metabolism, ► drug metabolism, mitochon¬ 
dria, smoke inhalation, viruses, bacteria, inflammation, 
and exercise. 

Antioxidants 

Antioxidants are substances that delay, prevent, or 
remove oxidative damage to a target molecule. In 
many cases they protect substances from oxidation by 
virtue of their ability to be more easily oxidized. Anti¬ 
oxidants come in various forms from small molecules 
like ascorbate (vitamin C) to large proteins such as 
► catalase and ► superoxide dismutase. Antioxidants 
can either be endogenously synthesized or arise from 
dietary sources (Halliwell and Gutteridge 2007). 

Examples of dietary antioxidants are ascorbate, 
tocopherols, carotenoids, and perhaps flavonoids. These 
molecules function through their ability to donate (or 
accept) electrons during their reactions with free 
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radicals. The result of these reactions is often the forma¬ 
tion of an antioxidant radical. Antioxidant radicals are 
generally stable and can easily he regenerated back to 
their active antioxidant form through various nontoxic 
mechanisms. For example, the lipid-soluble antioxidant 
tocopherol (vitamin E) is important for maintaining cell 
membrane integrity by protecting against lipid peroxida¬ 
tion chain reactions. When a membrane lipid radical is 
formed, tocopherol can donate an electron to this species, 
forming a less reactive lipid peroxide. The oxidized 
tocopherol is then reduced back to its active form at the 
membrane surface by ascorbate. The resultant ascorbate 
radical is stable and it is eliminated by its interaction with 
reducing enzymes to regenerate the active form. Since 
tocopherol is recycled and not consumed by this mech¬ 
anism, its concentration in cell membranes can be many 
times less than the amount of potentially oxidizable sub¬ 
strates (Ignarro 2010). 

Major antioxidant proteins include catalase, super¬ 
oxide dismutase, peroxiredoxins (Hall et al. 2010), 
thioredoxin, and the ► glutathione systems. Although 
metals have been discussed as being involved in the 
generation of free radicals, both zinc and copper are 
intimately involved in protecting the body against oxidant 
stress as important components of superoxide dismutase. 
Superoxide dismutase is present in almost all cells in 
relatively high amounts. It functions by rapidly catalyzing 
the breakdown of 02~ into H 2 O 2 and O 2 . Catalase con¬ 
verts H 2 O 2 into H 2 O and O 2 . Peroxiredoxins can also 
break down H 2 O 2 as well as organic hydroperoxides 
(ROOH). The ► thioredoxin system has a crucial role in 
regulating many cell functions such as viability and pro¬ 
liferation by acting as an electron donor for a number of 
enzymes including ► ribonucleotide reductase, methio¬ 
nine sulfoxide reductase, and peroxiredoxin. The gluta¬ 
thione system comprises a group of enzymes that provide 
the means for removal of peroxides along with many 
xenobiotic compounds. High cellular glutathione concen¬ 
trations make it extremely important in maintaining the 
redox balance of a cell. 

Although many antioxidants are capable of directly 
scavenging various free radicals, some species are 
inaccessible. For instance, •OH radical is so reactive 
that it will immediately and indiscriminately oxidize 
any substrate at its site of formation. Antioxidants are 
not present in high enough concentrations or do not 
have fast enough reaction kinetics to be protective 
against this species. Furthermore, antioxidants can 
become prooxidants under certain circumstances. 


For example, ascorbate can reduce ferric iron by one 
electron to its ferrous form. Reduced metals are much 
better Fenton reagents so this ascorbate-mediated iron 
reduction will facilitate iron-catalyzed production of 
•OH when H 2 O 2 is present. 

Types of Oxidative Damage 

Most molecules within a cell are prone to some degree 
of oxidation, including proteins, lipids, and nucleic 
acids. Each one of these classes of molecules can 
undergo numerous types of oxidative modifications. 

Nucleic Acids: DNA and RNA are susceptible to 
oxidative damage in the form of strand breaks or 
chemical modifications to bases and sugars. Although 
H 2 O 2 is not particularly reactive toward DNA, •OH is. 
Iron and copper ions readily bind to DNA and may 
be responsible for H 202 -mediated damage under 
conditions of stress. The interaction of H 2 O 2 with 
DNA-associated metal ions can form •OH which will 
react immediately at its site of production to produce 
numerous oxidized products such as 8 -hydroxy- 2 '- 
deoxyguanosine. Peroxynitrite, •NO 2 , and •COa” are 
also capable of inducing oxidized DNA modification. 
Nitric oxide is not directly DNA-damaging, but the 
formation of N 2 O 3 through its reaction with O 2 or 
02 ~ can nitrosate primary amines, leading to base 
deamination. For example, if the base 5-methylcytosine 
is deaminated to an alcohol, it assumes the identical 
structure of thymine. 

This C to T transition results in a DNA point muta¬ 
tion (Burney et al. 1999). RNA can just as easily be 
damaged by oxidation or nitrosation. However, its rate 
of cellular turnover is usually rapid, so it is unclear how 
this might contribute to oxidative stress. 

Lipids: Fipids are the major structural component of 
cellular membranes which surround cells and internal 
organelles. Many membrane lipids are polyunsaturated 
fatty acids (PUFAs) which mean they contain two 
or more carbon-carbon double bonds. It is these 
PUFAs that are most susceptible to oxidative damage 
(lipid peroxidation). Again, •OH seems to be the main 
instigator of lipid peroxidation and is most likely 
derived from reactions of metals, although there is 
some evidence that HO 2 (protonated superoxide) may 
be involved. Bis-allylic hydrogens present at the site 
of carbon-carbon double bonds are most susceptible 
to hydrogen atom abstraction by •OH, leading to 
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a carbon-centered lipid radical. This lipid radical can 
go on to react with O2 and further oxidize adjacent 
lipid molecules. Lipid peroxidation is a chain reaction 
whereby one initiating oxidative event is propagated 
into multiple lipid modifications. The net result is the 
accumulation of lipid hydroperoxides (LOOH). These 
hydroperoxides are relatively stable but they themselves 
can react with metals to generate •OH. Since •NO and 
•NO2 are lipid-soluble uncharged radicals, they can 
diffuse into membranes and combine with lipid radicals. 
These chain-terminating reactions lead to a variety of 
stable and unstable products. Conversely, •NO2 by itself 
is a strong oxidant capable of initiating lipid peroxida¬ 
tion (Halliwell and Gutteridge 2007 ; Ignarro 2010 ). 

Proteins'. Each of the 20 or more amino acids mak¬ 
ing up proteins is susceptible to oxidative damage. 
This damage is manifested as numerous and diverse 
adducts, along with peptide bond breakage and protein 
fragmentation. •NO2 can oxidize proteins or it can 
nitrate certain amino acid residues, namely, tyrosine, 
phenylalanine, and tryptophan. H2O2 is not directly 
oxidizing to proteins. Since proteins are often associ¬ 
ated with metal ions, H2O2 can be damaging due to the 
generation of •OH. Thiol groups are particularly sus¬ 
ceptible to oxidation which leads to the formation of 
thiyl radicals or disulfides. N2O3 can nitrosate thiol 
residues leading to the formation of nitrosothiol. The 
oxidation or nitrosation of key thiol residues on certain 
proteins can enhance or inhibit their catalytic function. 
Destruction of iron-sulfur clusters from 02~ or •NO 
exposure can liberate iron. This free iron can be 
reduced by 02~, making it more reactive toward H2O2. 

Consequences of Oxidative Stress 

Oxidative damage to protein, lipid, or nucleic acid 
molecules will have various short- and/or long-term 
consequences on cell function and viability. This will 
depend on the magnitude and duration of exposure to 
an oxidizing environment as well as the ability of a cell 
to repair the resultant damage. 

Damage to Macromolecules: Oxidation, nitration, 
nitrosation, hydroxylation, thiolation, and acetylation 
are all potential protein modifications that can result 
from oxidative stress. The end result of protein oxida¬ 
tions is varied. It depends on the extent of the damage 
(i.e., how many molecules of a specific protein are 
damaged) as well as the location of the damage on 


the protein. Oxidation of amino acid residues not key 
to a protein’s catalytic function will be less consequen¬ 
tial. Many proteins are turned over rapidly and there¬ 
fore acute bouts of oxidative stress may have little 
overall effect on the inhibition of their function. 
Long-term exposure to oxidative stress, however, can 
lead to steady-state levels of protein modifications that 
may elicit an immune response and contribute to auto¬ 
immunity. Not only is direct oxidative damage to 
a protein important, but so are the downstream signal¬ 
ing events brought on by these modifications. For 
example, oxidation of a critical cysteine residue on 
a signaling molecule usually results in a protein disul¬ 
fide formation. Disulfides can alter the conformation of 
a protein by either activating it as a transcription factor 
or by initiating events that result in the regulation of 
transcription. Therefore, oxidative stress can lead to 
either a gain or loss of protein function via thiol mod¬ 
ifications. This may induce downstream changes in 
cell-signaling events. 

Oxidized lipids play a role in oxidative stress. Lipid 
peroxides, cholesterol and LDL oxidation products, 
nitrated lipids, and isoprostanes are all examples of 
types of lipid damage that can result from oxidative 
stress. Hydroperoxides (LOOH) are the primary 
products of free radical-mediated lipid peroxidation. 
The oxidation of membrane lipids can alter the chem¬ 
ical characteristics and physical organization of the 
membrane leading to leakage, rupture, and changes in 
function. Some of the products of lipid peroxidation 
are chemically reactive and readily react with proteins, 
sugars, and DNA. Oxidized plasma lipids are differen¬ 
tially metabolized and can have cell-signaling effects. 
Some are known to be cytotoxic while others can 
increase the antioxidant capacity of a cell. Nitrated 
derivatives of unsaturated fatty acids are formed 
under both normal and pathological conditions and 
have the ability to act as signaling molecules through 
enzyme activation or modulation of inflammation. 

DNA damage is the hallmark of genetic mutations. 
Oxidative stress in the form of damage to the bases or 
sugars of DNA can lead to mutations. Types of DNA 
lesions include oxidation, nitration, deamination, halo- 
genation, and cross-linking. The major consequences 
of oxidative stress-mediated DNA damage are genetic 
mutations, altered gene products, and gene silencing. 

Disease States Associated with Oxidative Stress: 
Oxidative stress is linked to the etiology of numerous 
acute or chronic pathological conditions. Because DNA 
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damage is a consequence of oxidative stress, chronic 
exposure to prooxidants can lead to permanent genetic 
mutations that lead to cancer. Oxidative stress has been 
associated with every stage of cancer: initiation, pro¬ 
motion, and progression. Once cancer has formed, 
increased levels of oxidants are known to enhance 
cancer cell proliferation and contribute to tumor 
aggressiveness. In the cardiovascular system, athero¬ 
sclerosis, cardiac hypertrophy, and cardiomyopathy 
have all been associated with oxidative stress. Neuronal 
conditions associated with oxidative stress include 
stroke, Alzheimer’s, Parkinson’s, muscular dystrophy, 
amyotrophic lateral sclerosis, and others. Premature 
and natural aging, psoriasis, and UV skin damage 
have an oxidative stress component. In the gastrointes¬ 
tinal tract, pancreatitis, hepatitis, gastritis, Crohn’s dis¬ 
ease, ulcerative colitis, and liver cancer have all been 
suggested to involve reactive oxygen species. 

Summary 

Oxidative stress arises as a result of an excess produc¬ 
tion of prooxidants relative to antioxidants. Reasons 
for this include increases in the rate of oxidant produc¬ 
tion or decreases in the amounts of antioxidants. 
Another contributing factor is a diminished ability of 
cellular repair processes to remove or reverse damage 
which leads to an accumulation of lesions. Proteins, 
DNA, and lipids are the major cellular components that 
are susceptible to oxidative damage. The damage can 
manifest as structural damage, altered enzyme func¬ 
tion, or modifications in cell-signaling pathways. Oxi¬ 
dative stress is a causative factor for the development 
and progression of many diseases. 

Cross-References 

► Catalases and Catalase-Peroxidases 

► Cytochrome c Oxidase (Complex IV) 

► Drug Metabolism - Computational Studies of 
Mechanisms 

► Glutathione 

► Hemes 


► Iron-Sulfur Clusters 

► Mitochondrial Electron Transport 

► Nitric Oxide Synthase - Computational Studies 

► Peroxidases 

► Reactive Oxygen Species 

► Superoxide Dismutases 
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P2X Receptors for ATP: Molecular 
Properties and Functional Roles 
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Synonyms 

Purinergic receptors 

Definition 

P2X receptors for ATP comprise a family of 
ligand-gated cation channels that are distinct from the 
cys-loop and glutamate channel families. They are 
expressed in one form or another by almost all cell 
types and play physiological roles ranging from pain 
sensation to blood clotting. This entry gives an over¬ 
view of purinergic transmission, the basic properties of 
P2X receptors, and finally summarizes the main fea¬ 
tures and roles of molecularly defined receptor 
subtypes. 

Introduction to Purinergic Transmission 

Adenosine triphosphate (ATP) is widely thought of as 
an important intracellular energy carrier, but in addi¬ 
tion it also acts as an extracellular signaling molecule. 
This concept of purinergic signaling was initially 
championed by Prof Burnstock in the 1970s 
(for a historical review see (Burnstock 2006)). There 


are several ways in which ATP levels in the extracel¬ 
lular space can increase. Lor example, ATP is co¬ 
stored and co-released with classical neurotransmitters 
like noradrenaline, and it is released following shear 
stress in blood vessels as well as from damaged cells. 
The extracellular ATP acts at cell surface purinergic 
P2 receptors to mediate physiological responses rang¬ 
ing from muscle contraction to sensory function 
(Khakh and North 2006). The P2 receptors can be 
divided into G-protein-coupled P2Y receptors and 
ligand-gated P2X receptor ion channels (Burnstock 
2006). 

The P2X Receptor Family 

The P2X receptors were initially identified at the 
molecular levels by expression cloning in 1994, and 
subsequently a total of seven mammalian P2X receptor 
subunits (P2X1-7) have been described (Roberts et al. 
2006; Stojilkovic et al. 2005). P2X receptor genes have 
been isolated from a range of other organisms, includ¬ 
ing algae, indicating a long evolutionary history. How¬ 
ever, interestingly no homologues have been found in 
Caenorhahditis elegans or Drosophila (Lountain and 
Burnstock 2009). The crystal structure of a zebrafish 
P2X4 (zP2X4) receptor in an ATP-free closed state 
(Kawate et al. 2009) shows the receptor forms as 
a trimer from individual subunits that have two trans¬ 
membrane domains, intracellular amino and carboxy 
termini, and a large extracellular ligand-binding loop 
(Pig. 1). This membrane topology and trimeric assem¬ 
bly is similar to that of the acid-sensing ion channels 
but the receptors share no homology at the amino acid 
level. Thus, the stoichiometry and topology of these 


G.C.K. Roberts (ed.). Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
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P2X Receptors for ATP: Molecular Properties and Func¬ 
tional Roles, Fig. 1 P2X receptor structure shows the trimeric 
nature of the protein and sites associated with ATP and antagonist 
binding. Homology models of the P2X1 receptor based on the 
zP2X4 receptor structure (Kawate et al. 2009) are shown. Left 
pane! shows the three P2X receptor subunits, two are shown as 
surface representations (white and gray) and the third subunit 


(blue) is shown in cartoon format. The subunits each have two 
transmembrane (TM) spanning helices with the second (TM2) 
lining the pore region of the channel. Right panel shows a surface 
rendering of the P2X receptor mapping residues that when mutated 
had an effect on ATP potency (red) or the sensitivity to the 
antagonist PPADS (magenta). The proposed ATP-binding site 
(red) forms at the interface between two adjacent subunits 


receptors are distinct from the pentameric nicotinic/cys 
loop, and tetrameric glutamate families of ionotropic 
receptors. 

Molecular Properties/Features of 
P2X Receptors 

ATP binding to P2X receptors induces a conforma¬ 
tional change in the protein that leads to the opening/ 
gating of the ionic pore. Mutagenesis-based studies 
and the zP2X4 receptor structure have provided signif¬ 
icant insight into the molecular basis of the receptor’s 
properties (Evans 2010). P2X receptors do not have 
commonly found consensus sequences for ATP recog¬ 
nition like the Walker motif, and so mutagenesis-based 
approaches have been used to identify amino acids 
involved in ATP action at the receptor. These studies 
identified conserved amino acids that are likely to be 
involved in agonist binding. Mapping of these residues 
onto the zP2X4 receptor structure shows that they form 
as a cluster at the interface between two adjacent P2X 
receptor subunits (Fig. 1). This would give rise to three 


ATP-binding sites within the receptor. This is consis¬ 
tent with modeling and analysis of the Hill slope of the 
initial part of the ATP concentration response curves 
indicating that three molecules of ATP bind to activate 
the receptor. It has been suggested that conserved 
positively charged lysine residues (K68, K70 from 
one subunit and K309 from the adjacent subunit, 
P2X1 receptor numbering) coordinate the binding of 
the negatively charged phosphate tail of ATP 
and a conserved aromatic phenylalanine residue 
(within the conserved N290F291R292 motif, P2X1 
receptor numbering) may coordinate the binding 
of the adenine ring. Mutations around the proposed 
ATP-binding site have also been shown to 
regulate the binding of the P2 receptor antagonists 
suramin and PPADS (Fig. 1) (Evans 2010). The 
crystal structure provides a snapshot of the receptor 
and it remains to be determined the extent of confor¬ 
mational changes in the extracellular domain that are 
associated with agonist binding and gating of the 
channel. 

The pore region of the P2X receptors is formed by 
the second transmembrane domains (TM2) from each 
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of the three constituent subunits. The crystal structure 
of the zP2X4 receptor in the agonist-free closed con¬ 
formation shows the narrowest region of the channel 
forms where the TM2 domains cross each other about 
half way through the channel (Kawate et al. 2009). 
Mutagenesis studies have shown that this region of 
overlap plays a key role in the regulation of ionic 
permeation and is likely to form the channel gate 
(Surprenant and North 2009). 

P2X receptors are cation-selective, and under nor¬ 
mal physiological conditions have a reversal potential 
of ^0 mV and their activation leads to membrane 
depolarization. The channels also show appreciable 
calcium permeability. For example, calcium influx 
accounts for ^ 10% of the current flow through smooth 
muscle P2X1 receptors on arteries, and is sufficient to 
mediate contraction of arterial smooth muscle, and in 
neurons presynaptic P2X receptors can regulate trans¬ 
mitter release. P2X receptors can therefore have direct 
signaling roles independent of the activation of 
voltage-dependent calcium channels. Interestingly, 
the P2X2,4,7 also show a second form of permeability 
following prolonged activation (tens of seconds to 
minutes) leading to an increase in permeability to 
larger cations. However, there is still controversy 
over whether this results directly from dilation of the 
P2X receptor channel pore or results in the recruitment 
of additional channels, for example, pannexins. 

One of the key distinguishing features between 
different P2X receptors is the time-course of current 
responses to applied agonist. P2X2 and P2X7 recep¬ 
tors, for example, show robust sustained currents in 
response to 5 s applications of ATP. In contrast, P2X1 
and P2X3 receptor currents decay monoexponentially 
with time constants of < I s. This gives rise to a range 
of temporal signaling through P2X receptors. Molecu¬ 
lar studies have identified that both the TM-spanning 
domains as well as the intracellular amino and carboxy 
termini make substantial contributions to determining 
the time-course of responses (Surprenant and North 
2009; Roberts et al. 2006). In order to crystallize the 
zP2X4 receptor the amino and carboxy termini were 
truncated so no structural information is available on 
these regions; however, it seems likely that they make 
significant interactions with the TM domains to control 
the gating and time-course of P2X receptor responses. 

A conserved feature throughout the whole family of 
P2X receptors is a protein kinase C consensus 
sequence (TXK/R) in the amino terminus of the 


receptor (Roberts et al. 2006; Surprenant and North 
2009). The threonine residue can be phosphorylated, and 
mutation leads to a speeding in the time-course of the 
response and reduction in current amplitude suggesting 
that the receptor is normally basally phosphorylated. 
Stimulation of Gq-G-protein-coupled receptors and 
phorbol ester stimulation can also potentiate P2X recep¬ 
tor-mediated responses; however, additional mechanisms 
other than a direct effect on the conserved PKC site are 
thought to underlie the modulatory effects. 

An intracellular carboxy terminal trafficking motif 
YXXXK is conserved throughout the P2X receptor 
family and contributes to the control of surface expres¬ 
sion of the receptors (Chaumont et al. 2004; Murrell- 
Lagnado and Qureshi 2008). It is also emerging that 
the P2X receptors form complexes with other receptors 
and proteins to mediate efficient signaling and regula¬ 
tion. For example, P2X receptor subunits show differ¬ 
ential association and regulation by lipid rafts 
(Allsopp et al. 2010), and co-immunoprecipitation 
followed by mass spectroscopy studies are identifying 
the nature of P2X receptor signaling complexes and 
their contribution to receptor regulation (Kim et al. 
2001; Chaumont et al. 2008). In addition, it is 
clear that P2X receptors may interact with other 
ligand-gated ion channels to regulate their activity. 
For example, P2X2 receptors have been shown to be 
co-expressed with nicotinic acetylcholine receptors in 
neurons and activation of the P2X receptor leads to 
inhibition/occlusion of co-activated nicotinic receptors 
(Khakh et al. 2000). 

Functional Assembly and Defined Functional 
Roles of P2X Receptors 

The seven P2X receptor subunits can assemble to form 
a range of homo- and heterotrimeric channels. Studies 
where recombinant P2X receptor subunits have been 
co-expressed have shown that many different combi¬ 
nations of P2X receptor subunits are possible (Torres 
et al. 1999) and several of these have been shown to 
have distinct functional properties derived from their 
constituent subunits (North and Surprenant 2000). 
Three main characteristics have been used to 
distinguish between the properties of the receptors 

(1) the degree of desensitization of the current in 
response to continued/sustained ATP application, 

(2) the sensitivity to the ATP analogue a,P-methylene 
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ATP, and (3) the sensitivity to antagonists. 
A combination of pharmacological characterization, 
immunohistochemistry, molecular, and genetic/ 
knockout studies have identified functional correlates 
of several dehned P2X receptors (for reviews of work 
in this section see (Roberts et al. 2006; Surprenant and 
North 2009; Jarvis and Khakh 2009)). 

Homomeric Receptors 

P2X1 receptors. ATP-evoked currents at P2X1 recep¬ 
tors show rapid desensitization (<1 s time to 50% 
decay of current) during the continued presence of 
agonist. An ATP-free washout period of ^5 min is 
required for full recovery from this desensitization. 
The receptors are also activated by a,P -methylene 
ATP with similar sensitivity to ATP and are sensitive 
to the antagonists suramin and PPADs. The suramin 
derivative NF449 is a selective and effective P2X1 
receptor antagonist. The P2X1 receptor was initially 
expression cloned from vas deferens smooth muscle 
and the properties of the native P2X receptor in 
this tissue and other smooth muscles, for example, 
from arteries and the bladder, correspond to those 
of homomeric P2X1 receptors. P2X1 receptor knock¬ 
out mice have shown that this receptor is essential 
for ATP-evoked currents in vas deferens, artery, and 
bladder smooth muscle. The knockout mice 
showed male infertility due to reduced contraction of 
the vas deferens associated with ejaculation. The 
absence of P2X1 receptors also had a cardiovascular 
phenotype with reduced arterial constriction in 
response to sympathetic nerve stimulation and reduced 
autoregulatory responses in the kidney. In addition, the 
P2X1 receptor is expressed on platelets and can con¬ 
tribute to their activation and thrombus formation. 

P2X2 receptors. These receptors show little desen¬ 
sitization in response to ATP application and are essen¬ 
tially insensitive to a,P-methylene ATP. P2X2 
receptor currents are sensitive to the antagonists 
suramin and PPADS. The receptors are expressed 
throughout the nervous system. Knockout mice studies 
have shown that these receptors are involved in neuro¬ 
nal control of gut motility, regulatory responses to 
hypoxia, presynaptic regulation of transmitter release 
in the hippocampus, and taste sensation. 

P2X3 receptors. Like P2X1 receptors, these show 
rapid desensitization to agonist application and are 


sensitive to a,P -methylene ATP. They can be distin¬ 
guished using the selective antagonist A-317491. They 
are found predominantly in sensory neurons, and 
knockout studies have shown they play a role in pain 
sensation, visceral mechanotransduction, and gut 
peristalsis. 

P2X4 receptors. These receptors show an interme¬ 
diate level of desensitization during agonist applica¬ 
tion and are insensitive to a,P -methylene ATP and the 
antagonists suramin and PPADS. They are however 
uniquely sensitive among P2X receptor subunits in that 
ATP-evoked responses can be potentiated by the 
antiparasitic agent ivermectin, and this can be used as 
a hallmark for their presence in native tissues. The two 
main physiological roles that have been identified for 
these receptors are in the regulation of blood pressure 
and neuropathic pain. P2X4 receptors are expressed on 
endothelial cells lining blood vessels, and mice lacking 
the receptor have more constricted arteries and 
increased blood pressure. At the sites of neuronal 
injury there is recruitment of microglial cells and 
upregulation within these cells of P2X4 receptor 
expression. Downregulation or blockade of P2X4 
receptors with the antagonist TNP-ATP reduced the 
pain associated with injury models. 

P2X5 receptors. ATP-evoked currents at recombi¬ 
nant P2X5 receptors have small peak current ampli¬ 
tudes compared to the other P2X receptor family 
members suggesting that homomeric channels do not 
make significant functional contributions. In addition, 
the majority of humans have a splice variant of the 
P2X5 receptor that is nonfunctional. 

P2X6 receptors. These receptors are found located 
predominantly in the nervous system. However, 
homomeric P2X6 receptors do not form efficiently in 
recombinant systems due to incomplete glycosylation 
of the receptors and interestingly they cannot form 
functional homotrimeric receptors but can form func¬ 
tional heteromers with other subunits. 

P2X7 receptors. P2X7 receptors show some unique 
properties in the P2X receptor family. Firstly, they are 
considerably less sensitive to ATP, requiring millimo- 
lar levels to significantly activate the receptor (in con¬ 
trast to micromolar levels for the other subtypes) and 
also are particularly sensitive to the divalent cation 
concentration (these have an inhibitory effect on ATP 
potency). Secondly, P2X7 receptors show use depen¬ 
dence with significant changes in the currents follow¬ 
ing prolonged or repeated applications of ATP. 
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In many cell types, this leads to an increase in perme¬ 
ability, for example, to dyes like YO-PRO, and can be 
associated with membrane blebbing and cell death. 
The P2X7 receptor has a considerable longer 
C-terminal tail than the other receptor subtypes that 
contributes to the large pore-forming ability of the 
receptor. Functional studies have demonstrated that 
the P2X7 receptor is involved in immune cell functions 
including regulation of the release of pre-inflammatory 
cytokines and pain. 

Heteromeric receptors. Recombinant studies have 
indicated that a wide range of combinations of P2X 
receptor subunits are possible. To date, native functional 
correlates of two of these have been described. How¬ 
ever, it is likely that other subunit combinations are 
functional in native systems as there are still phenotypes 
in cells that have been described that do not correlate 
with those of recombinant receptors. The possibility 
exists, but that has not been tested, that a P2X receptor 
could form from three distinct P2X receptor subunits. 

P2X1I5 receptors. These heteromeric receptors 
have a biphasic time-course with an initial rapidly 
desensitizing component and then a secondary 
sustained current of ^10% of the peak current ampli¬ 
tude. This does not just result from the addition of the 
desensitizing P2X1 current and sustained small P2X5 
receptor current, as the desensitizing current has 
a much faster time to recovery and the sustained cur¬ 
rent is sensitive to a,p -methylene ATP (P2X5 
homomers are not). This P2X1/5 heteromeric receptor 
is likely to mediate P2X currents in glial cells isolated 
from the cortex (Lalo et al. 2008). 

P2X2/3 receptors. This was the hrst described 
heteromeric combination of P2X receptors and shows 
a composite phenotype with the non-desensitizing 
time-course of the P2X2 receptor and the a,P- 
methylene ATP sensitivity of the P2X3 receptor. Dou¬ 
ble knockout mice that do not express either P2X2 or 
P2X3 receptors suggest that the P2X2/3 receptor plays 
an important role in sensory transduction, responsive¬ 
ness to hypoxia, and underlies the P2X receptor pheno¬ 
type in a subpopulation of myenteric neurons in the gut. 

Summary 

P2X receptors for ATP comprise a distinct family of 
ligand-gated cation channels. Seven mammalian P2X 
receptor genes (P2X1-7) have been isolated. These 


encode subunits with two transmembrane domains, intra¬ 
cellular amino and carboxy termini, and a large extracel¬ 
lular ligand-binding loop. The homo- or heterotrimeric 
assembly of these subunits gives rise to P2X receptors 
with a range of functional properties. P2X receptors are 
expressed, and play physiological roles, in a wide variety 
of cell types including neurons, glia, smooth muscle, 
platelets, and immune cells. 
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(FI), Italy 

Definition 

Paramagnetic metalloproteins are proteins containing 
one or more paramagnetic metal ions, which are charac¬ 
terized by the presence of at least one unpaired electron. 
As in every molecule, unpaired electrons sizably perturb 
the NMR parameters relative to the protein nuclei, 
changing their relaxation, chemical shift, and motional 
averaging so that paramagnetic enhancements in relaxa¬ 
tion rates, hyperfine shifts, and residual dipolar couplings 
arise. These effects can limit the quality of the NMR 
spectra with respect to those of diamagnetic analogs of 
the protein, but they also represent a source of precious 
information for the structural and dynamic investigation 
of the system. The large magnetic moment of unpaired 
electrons actually interacts dipolarly with the nuclear 
magnetic moments, thus increasing the longitudinal 
and transverse relaxation rates of protein nuclei even 
far from the metal ion in terms of chemical bonds. The 
increase in the transverse relaxation rate is responsible 
for the increase in nuclear signal line width, which 
lowers the quality of the spectrum, sometimes at the 
level of broadening of the signals beyond detection. 
The relaxation paramagnetic enhancements, both trans¬ 
verse and longitudinal, are due to this dipole-dipole 
interaction, when the nucleus under investigation is sev¬ 
eral bonds away from the metal ion. Under these circum¬ 
stances, these enhancements can be used to obtain 
information on the nuclear distance, r, from the para¬ 
magnetic metals, as they depend on In addition, the 
dipolar interaction between the average magnetic 
moment induced by the external magnetic field and the 
nuclear magnetic moments contributes to nuclear relax¬ 
ation, with a term called Curie relaxation, which 
increases concurrently with the reorientation time of 


the protein. Curie relaxation also depends on ;-“®. Fur¬ 
thermore, contact contributions to nuclear relaxation 
may arise due to the unpaired electron spin density 
delocalization and/or spin polarization on metal ligands, 
which also determine changes in chemical shifts, called 
contact shifts. When observed such shifts are used to 
identify the metal ligands. A paramagnetic susceptibility 
tensor is defined proportionally to the average induced 
magnetic moment; the anisotropy of this tensor deter¬ 
mines chemical shift changes, called pseudocontact 
shifts, proportional to the size of the anisotropy values, 
to the distance of the nuclei from the metal ion (as r~ ) 
and to their position with respect to the paramagnetic 
susceptibility tensor. These are precious structural 
restraints as the shifts can be accurately measured. In 
the presence of magnetic susceptibility anisotropy, the 
energy of the system becomes orientation dependent, so 
that residual dipolar couplings arise due to the 
nonisotropical average of the dipolar interactions 
between protein nuclei. Such paramagnetism-based 
residual dipolar couplings depend on the square of the 
applied magnetic field, on the anisotropy values of the 
magnetic susceptibility tensor and on the orientation of 
the nucleus-nucleus vectors in the frame of the tensor. 
The structural information is of the same type as that 
obtained using external orientation devices. In order to 
obtain the above-mentioned restraints (paramagnetic 
enhancements to the relaxation rates, pseudocontact 
shifts, residual dipolar couplings), sometimes it 
may prove useful to introduce a paramagnetic ion at 
the site of a naturally occurring diamagnetic one, or a 
paramagnetic tag can be added to a protein not binding 
any metal ion. 

Cross-References 

► Cytochromes 

► Hemes 

► Hyperfine Shifts 

► Iron-Sulfur Clusters 

► Residual Dipolar Coupling 

► Superoxide Dismutases 


Paraoxon Hydrolase 
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Synonyms 

Ion channels 

Introduction 

Ion channels exist in the membrane of most cells and 
their activity is regulated hy a variety of factors such as 
► membrane potential, extracellular, and intracellular 
ligands. It is possible to investigate many of 
these processes by examining macroscopic (whole-cell) 
currents. However, measuring the activity of single chan¬ 
nels enables the open probability {Po) and single-channel 
(unitary) current (;) to be measured independently, and 
much more detailed kinetic information about processes 
such as activation and drag-induced block can be 
obtained by examining open and closed durations. The 
goal of single-channel recording, therefore, is to measure 
parameters such as unitary current amplitude and 
distributions of open and closed times under various 
conditions to give mechanistic information that may be 
impossible to obtain from macroscopic currents. 

Current only flows through an ion channel when it 
is open and the magnitude and direction of the current, 
i, is given by: 

i = yiy-EREv) (1) 

where V is membrane potential, y the conductance, and 
Erev the reversal potential of the channel. The con¬ 
ductance of a channel is very dependent on the record¬ 
ing conditions and generally increases (up to a point) 
with increasing concentrations of the permeant ions 
(Hille 2001). Single-channel records characteristically 
alternate between closed (non-conducting) and open 
(conducting) current levels, a process termed gating, 
which represent different conformational states of the 
channel. The transition between these states is abrupt 


and once open (or closed) the channel remains in that 
state for a random duration described by a ► Markov 
process. There are two important principles to note 
about this behavior; (1) the dwell time in any state 
will be a random variable distributed exponentially 
where the time constant (x) gives the mean dwell 
time in that state; (2) the mean dwell time in any 
state is given by l/(sum of rates for leaving that 
state). Thus, for a simple case of a channel that can 
be closed, open, or blocked, according to the scheme 
shown in Fig. la, the mean dwell times in each state, 
tSTATE, are given by: 


II 

(2a) 

to = l/{oi + kh[B]) 

(2b) 

II 

1 

(2c) 


where the denominators are the transition rates which 
have units of s“*. In this scheme, the only conducting 
state is O, hence when no current is detected it is not 
possible to distinguish whether this is because the 
channel is in state C or state B. The relation between 
state occupancy and channel current is illustrated 
in Fig. lb. Notice that openings tend to occur in 
bursts because in this case the transitions between 
states O and B, delineated by the dashed box, are 
more rapid than those between O and C. 

Extracting kinetic information. Deciphering mech¬ 
anisms from single-channel records can be a complex 
procedure, though some information, such as the 
blocking rate constant {ki,) in Fig. 1, can be relatively 
simple to extract. A straightforward rearrangement of 
Eq. 2b gives: 

1/U = h[B] + a (3) 

Equation 3 predicts a linear relation between the 
blocker concentration [5] and !/?„, the slope of which 
gives the blocking rate constant kf, and the y intercept 
a, as shown in Fig. Ic. Usually, some preliminary data 
is needed to formulate a plausible model; if, for exam¬ 
ple, the mean open time remained unaltered in the 
presence of the blocker that would imply the open 
state was not blocked and the scheme in Fig. la 
would not describe the data. Several other parameters, 
such as the distribution of burst lengths, are also useful 
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Patch-Clamp Recording of 
Single Channel Activity: 
Acquisition and Analysis, 
Fig. 1 (a). A simple three 
state kinetic scheme 
illustrating a chamrel that can 
exist in the closed (C), open 
(O), or blocked (B) state. The 
rates (above and below the 
arrows) represent the 
frequency of transitions. 

(b) . Relation between the 
occupancy of a particular state 
(upper trace) and the 
corresponding current that 
would be observed (i denotes 
the single channel current). 
The bursting behavior results 
from rapid transitions between 
the open and blocked state 
(within the dashed box). 

(c) . Plotting the reciprocal of 
the mean open time against 
blocker concentration, 

[B], enables the blocking rate 
constant (k/,) to be calculated 
from the slope of the line 
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burst 


burst 
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burst 
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in developing a kinetic scheme and for details of 
these the reader is referred to Colquhoun and Hawkes 
(1994, 1995). 

Acquisition of Single Channel Currents 

Patch-clamp configurations. To measure the current 
flowing through individual ion channels it is necessary 
to isolate a section of membrane small enough to 
possess only a few of the channels under investigation. 
This is done using the patch-clamp technique devel¬ 
oped by Hamill et al. (1981) where a polished glass 
micropipette makes a seal with the cell membrane 
allowing recording of current through the portion of 
membrane underneath the tip of the pipette. Various 


configurations of the patch-clamp technique, illus¬ 
trated in Fig. 2, enable recording of currents in differ¬ 
ent situations. The cell-attached configuration may be 
used when it is important that the cytoplasm remains 
intact, for example, when investigating a receptor- 
induced modulation of channel activity via a cell¬ 
signaling cascade. A disadvantage of this technique is 
that the membrane potential of the cell is not known. 
The inside-out technique enables the conditions at the 
intracellular side of a channel to be altered, allowing 
procedures such as increasing internal [Ca^"^] to open 
Ca^'^-activated channels or application of catalytic pro¬ 
tein kinases to investigate the effect of phosphorylation 
on channel activity. Outside-out is the configuration of 
choice to use when investigating ► ligand-gated ion 
channels. 
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Patch-Clamp Recording of 
Single Channel Activity: 
Acquisition and Analysis, 
Fig. 2 Configurations of the 
patch-clamp technique. The 
initial step is forming a high 
resistance seal between the 
cell membrane and the glass 
rim of the pipette tip. This 
resistance, which should 
eventually be > 1 GQ, is 
monitored (using Ohm’s law) 
by measuring the current 
amplitude induced by 
a voltage pulse 


Cell-attached patch 



Increasing signal to noise ratio. Because the current 
flowing through an individual ion channel is so small 
(1-10 pA) it is necessary to have as high a signal to 
noise ratio as possible. While much of this aspect is 
addressed by the electronics of patch-clamp amplifiers 
(see Ogden and Stanfield 1994; Sigworth 1995), using 
patch pipettes made from thick instead of thin walled 
glass will reduce noise levels considerably, as will 
coating the tip of the pipette with Sylgard or wax and 
positioning the pipette tip just below the surface of the 
solution if convenient. 

Filtering. One way of decreasing noise is to use low 
pass filtering, but this also reduces the information that 
can be extracted from the signal because it reduces the 
frequency response. A common method of measuring 
open and closed durations is to use regression to esti¬ 
mate times when the current crosses a threshold set at 
50% of the unitary current amplitude. The effect of 
both noise and filtering on this process can be demon¬ 
strated by comparing the results of threshold crossing 
in a trace before and after filtering. In Fig. 3a, the 
section of idealized channel activity comprises four 
openings with two short closings and one long closing 


separating them. Addition of typical recording noise to 
this idealized record results in two threshold crossings 
that are not attributable to channel closing. After fil¬ 
tering (low pass at 3 kHz cutoff frequency) the noise is 
clearly reduced and the two false events are no longer 
detected; however, the shortest closing is now 
missed because of the reduced frequency response. 
Additionally, for short events, filtering leads to 
an underestimate of their true durations. This is 
illustrated in Fig. 3b, where after filtering, the shortest 
event fails to reach threshold and the next 
shortest event, while reaching the threshold, has its 
duration underestimated. These, and other practical 
aspects of acquiring single-channel currents, are 
discussed in more detail by Colquhoun and Sigworth 
(1995). 

Digitizing the data. An analog to digital converter is 
used to change the signal into numerical values at 
discrete time intervals, referred to as the digitization 
rate, for storage onto the hard disk of a computer 
for analysis. It is important that the digitization 
rate is sufficiently high and a general rule is to 
digitize at five times the filter cutoff frequency to 
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Patch-Clamp Recording of 
Single Channel Activity: 
Acquisition and Analysis, 
Fig. 3 Filtering and digitizing 
the current signal. 

(a). Idealized channel activity 
(top trace) is shown with 
typical recording noise helow. 
A 50% threshold detects two 
false events. Filtering the trace 
results in reduced noise and 
the false events are no longer 
detected but a true event is 
now missed, (b). Filtering the 
record also reduces the 
apparent duration of short 
events when using the 50% 
threshold method, (c). The top 
trace (black) shows a section 
of recording digitized at 
10 kHz, the bottom trace 
shows the same section 
digitized at 2 kHz (black dots 
represent each datum point). 
The red trace in each case is 
that obtained at 2 kHz. Note 
that the threshold crossing 
events indicated by * are 
missed when digitizing at 
2 kHz 


a 


Idealized channel activity 


Low pass digital filter 
(Gaussian) 3 kHz 

missed 



5pA 


1 ms 


50% 






> 

0.2 ms 




5 pA 


10 ms 


avoid errors (see Fig. 3c). So if the signal has been 
filtered at 2 kHz the digitization rate should be 10 kHz 
(a data point every 100 ps). 

Duration of recordings. Deciding on the recording 
duration under a given condition will depend on the 
activity of the channel under investigation. If data is 
required to analyze open and closed time distributions 
the aim is to record a large number of events, several 
thousand would be ideal if it is practical. Clearly if the 
channel has a low Po, opening only occasionally, then 


a longer time will be necessary to accumulate a 
sufficient number of events to analyze than for 
a channel that opens more frequently. 

Analysis of Single Channel Currents 

Amplitude measurement and fitting. Single channel 
current amplitudes can be obtained by measuring indi¬ 
vidual openings directly; however, a more pragmatic 
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Patch-Clamp Recording of 
Single Channel Activity: 
Acquisition and Analysis, 
Fig. 4 Amplitude histogram 
of a Ca^'^-activated 
channel, (a). Section of 
recording showing the 
opening and closing of one 
channel. The dashed line 
indicates zero current and 
C represents the closed level 
and O the open level, (b). An 
amplitude histogram of the 
trace in part a. For display 
purposes this is plotted with 
the current on the y-axis on the 
same scale as the trace. 

(c). Several sections were used 
to generate the final histogram. 
The bin width was 0.05 pA and 
the red lines show the fits of 
equation (4) to each peak 


a b 

10 -1 



0 40 80 120 

points / 0.05 pA 



current (pA) 


method is to generate amplitude histograms of the data. 
One way of doing this is take appropriate sections of 
the recording and plot the distribution of all the data 
points as a histogram as shown in Fig. 4. If the section 
of recording analyzed contains openings to one 
level only, the histogram will comprise two peaks, 
one for the closed level and one for the open level. 
Each peak can be fitted with a Gaussian distribution of 
the form: 


/w 


a 


exp 


2rT2 j 


(4) 


where a is the area, fi the mean and the variance of 
the digitized current data (x). The difference between 
the means of both peaks gives the single-channel 
current. Before constructing an amplitude histogram, 
it is important to ensure that any drift in the baseline 
(closed level) be eliminated. It is possible to use ampli¬ 
tude histograms to analyze fast block of ion channels 
by fitting the open peak with a beta distribution 
(Yellen 1984; Davies et al. 1996). 


Dwell Time Measurement and Fitting 

Apart from allowing direct measurement of unitary 
current, single-channel recording enables durations of 
individual events to be measured and their distribu¬ 
tions to be fitted. This is an important aspect as it 
allows detailed mechanistic information to be 
obtained. 

Obtaining event durations. The initial step is to 
measure the duration of each opening and closing 
event, either by estimating the time of crossing a 50% 
threshold, see Figs. 3 and 5a, or by using a more 
sophisticated but time-consuming approach of time- 
course fitting. Detailed discussions of both methods 
are given by Colquhoun (1994) and Colquhoun and 
Sigworth (1995). As the events are detected their dura¬ 
tions are written to file to examine their distributions 
later. Once these durations have been measured it is 
usual to impose a limited time resolution to the data so 
that detected events shorter than a specified time (e.g., 
100 ps) are ignored. Thus, if two openings are sepa¬ 
rated by a brief closing, whose duration is shorter than 
the minimum resolution, the two openings will be 
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Patch-Clamp Recording of Single Channel Activity: Acqui¬ 
sition and Analysis, Fig. 5 Distribution of open and closed 
times, (a). A threshold at 50% of the full amplitude is set and the 
duration of each opening and closing is obtained from the times 
when the current trace crosses this threshold, (b). Sorting the open 
(or closed) times into a histogram illustrates the exponential nature 
of the distributions. The histogram, plotted in a conventional way, 
shows a simulation of 1,000 events fitted by a mixture of two 


exponential components with time constants of 3 and 30 ms. (c). 
Open and closed time distributions obtained from a Ca^'^-activated 
channel in 10 pM Ca^"^ at 0 mV after imposing a minimum 
resolution of 110 ps. These distributions are plotted using the log - 
square root method described in the text. The open times required 
two components and the closed times four components to fit the 
data. In parts b and c, the red lines show the fits and the dashed 
lines represent the individual components 


treated as one. Imposing a minimum resolution ensures 
a constant resolution over the data and is necessary if 
correction of missed events is to be applied. When 
analyzing event durations, it is important to consider 
whether the patch contains more than one channel. If it 
does, it is still possible to obtain information about 
open times by limiting the analysis to sections where 
openings to a single level only occur. However, when 
there are two or more channels in a patch it is more 
difficult to analyze closed times. Measuring brief 


closures can be of some value as they are likely to 
result from the activity of an individual channel, but 
a long closing could be terminated either because the 
same channel opens again or because a different chan¬ 
nel opens, leading to an underestimate of long closed 
durations. 

Fitting the distributions. The data file of event dura¬ 
tions can be used to form histograms of open and 
closed times. These histograms reveal that the dura¬ 
tions are exponentially distributed as shown by the 
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simulated plot of Fig. 5b. It is common, particularly for 
closed times, that the distribution will be a sum of 
several exponential components as given by the 
equation below: 

n 

/(o = (5) 

>=i 

where a, and x, are the relative area and time constant, 
respectively, of component j, and n is the total number 
of components. The mean of this distribution is given 
by Theoretically, the number of components 

required to fit the open or closed time distributions 
should be the same as the number of open or closed 
states, though in reality, especially for closed states, 
the number of components giving a good ht to the data 
will usually be less than this. Maximum-likelihood is 
the method of choice for fitting these distributions and 
can be applied either directly to the data set of individ¬ 
ual event durations or to binned data. 

Generally, the distributions comprise wide ranges 
of durations which are difficult to convey in a conven¬ 
tional histogram. To overcome this, a popular method, 
developed by Sigworth and Sine (1987), is to use log 
transformation of time such that bin widths are con¬ 
stant on the log scale; this simplifies fitting binned data 
containing several exponential components. Figure 5c 
shows examples of open and closed time distributions 
of a Ca^’^-activated channel binned at 25 bins per 
log unit according to this method (a minimum resolu¬ 
tion of 110 ps was imposed on the data) together with 
the fitted distributions with two and four components 
for the open and closed times, respectively. 

Correcting for missed events. Examinations of sin¬ 
gle channel currents reveal that openings separated by 
brief closings occur much more frequently than clos¬ 
ings separated by brief openings. Thus, especially after 
setting a minimum resolution, many of these brief 
closings will be missed leading to the open times 
being artificially long. When measuring mean open 
times, as might be done when examining a channel 
blocking mechanism (see above), for example, it is 
necessary to correct the mean open time for these 
undetected closings. This can be done by htting both 
open and closed time distributions and then multiply¬ 
ing the measured mean open time by the proportion of 
detected closed times (integral of the htted closed time 
distribution between the minimum resolution and 
inhnity) to give the corrected mean open time. 


Summary 

The patch-clamp technique enables the current flowing 
through a very small area of membrane to be recorded 
in isolation. This makes it possible for the current 
flowing through individual ion channels to be recorded 
and parameters such as unitary current amplitude, open 
probability, and open and closed time distributions to 
be measured. Such data enable more detailed mecha¬ 
nistic information to be obtained than is possible with 
whole-cell (macroscopic) recording. 

Cross-References 

► Analysis of Macroscopic Currents 

► Automated Patch Clamp: Advantages and 
Limitations 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 

► Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET 

► Kinetics: Overview 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Voltage-Gated Sodium and Calcium Channels 
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Synonyms 

Gelling agent; Pectin biophysics 


Definition 

“Pectic polysaccharides” are a family of extremely 
complex polysaccharides, isolated from plant cell 
walls under various extraction conditions (O’Neill 
and York 2003). The term pectin is generally used to 
describe simpler structures extracted, under fairly 
harsh conditions, from the waste plant tissue (pulp) 
from the citrus fruit, cider, or sugar beet industries. 

Introduction 

Pectin extracts consist primarily of a partially esteri- 
fied (usually methyl-esterihed) homopolymer of 
galacturonic acid (smooth regions) plus complex 
branched (mainly rhamnogalacturonan 1) regions 
(hairy regions) containing high levels of the neutral 
sugars such as rhamnose, galactose, and arabinose 
(Fig. 1). Pectin structures are normally shown as 
smooth regions interspersed with hairy regions. How¬ 
ever, it should be realized that this is a simplified 
structure based on the average chemical composition: 
recent studies suggest that the extracts are actually 
heterogeneous mixtures containing smooth regions 
and more complex aggregates composed of both 
smooth and hairy regions (Round et al. 2010). The 
extracts from the citrus fruit and cider industries are 
used commercially as gelling agents and stabilizers for 
acid-based drinks. 

Gelation 

There have been extensive studies on the mechanism 
of gelation of pectin (Morris 1998). The smooth 
regions are responsible for gelation. The mechanism 
of gelation depends on the level and distribution of the 
(methyl) esters along the galacturonic acid backbone. 
High methoxyl (HM) pectins (>60% esterihed) form 
transparent, thermo-irreversible gels when hot solu¬ 
tions are cooled, usually in the presence of sugar and 
acid. This reaction is familiar as the basis for the 
“setting” of jams and fruit jellies. The presence of 
sugar (usually sucrose) and the acidic conditions (usu¬ 
ally citric acid) contrive to make the polysaccharides 
increasingly insoluble on cooling. It is considered that 
cooling causes the polysaccharide chains to form 
threefold helices that associate to form a fibrous 
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Pectin Biophysics, Fig. 1 Major structural components of 
pectin extracts. The hairy (rhamnogaiacturonan 1) regions are 
branched and the major types of side chains are linear or 
branched galactans, arabinans, or arabinogalactans 


network as the basis of the gel structure. Low methoxyl 
(LM) pectins (typically 30-35% esterihed) gel in the 
presence of certain divalent cations (usually calcium) 
and/or under acidic conditions. Most investigations 
have been made on calcium-induced gels. The gels 
are transparent, thermo-irreversible, and are formed 
at room temperature when calcium ions are diffused 
into or released within pectin solutions. Gelation is 
considered to depend on the presence of several blocks 
of acid residues along the pectin chain. When the chain 
length in a block exceeds a certain length (15-20 
residues) cooperative binding of calcium occurs, 
forming junction zones between chains that stick the 
chains together. The junction zones have been called 
“egg-box” junctions (Fig. 2a) because the calcium ions 
(eggs) are contained within the buckled ribbon struc¬ 
tures of the pectin chains (egg-box). 

LM pectins are prepared by de-esterification of FIM 
pectins either enzymatically or under alkaline condi¬ 
tions. The type and duration of the treatment results in 
different ester distributions along the pectin chain 
which, in turn, determines the optimum calcium 
concentration at which gelation occurs: a factor that 
can compromise commercial applications where the 
hardness (calcium content) of the water will affect 
gelation. Alkaline treatment in the presence of ammo¬ 
nia introduces amide groups onto the pectin chain and 
this leads to thermoreversibility and broader calcium 
sensitivity on gelation. In the food industry, LM and 
amidated pectins can be used to produce a range of low 
sugar jams and jellies. Under acidic conditions, when 
the charge on the acid regions is neutralized, the chains 
can also be induced to associate and form gels. In this 


a 




Pectin Biophysics, Fig. 2 LM pectin gels, (a) Schematic 
picture of an “egg-box” junction zone, (b) Atomic force micro¬ 
scopic image of part of a LM pectin gel showing the fibrous 
network 

case, association may not be entirely dependent on the 
association of the acid blocks. Gels formed under 
acidic conditions can be thermoreversible and the 
properties of gels can be manipulated if calcium-set 
gels are prepared under acidic conditions. Although 
there have been extensive studies of the formation 
and nature of the junction zones, less is known about 
the long-range structure of the gels. Electron and 
atomic force microscopy (Morris 2007) images of gel 
precursors, gels, and gel fragments suggest that the 
junction zones act as sticky patches gluing the pectin 
chains together into thicker fibers forming a branched 
fibrous network (Fig. 2b). 


Pectin in Plant Cell Walls 

The gel structure and gelation of isolated pectin is 
considered to be a good model for aspects of the self- 
assembly and structure of the pectin-based networks in 
plant cell walls. Plant cell walls are complex composite 
structures (O’Neill and York 2003) based on two 
interpenetrating networks. The load-bearing structure 
is at present considered to consist of planes of cellulose 
fibers interconnected by molecular tethers called 
xyloglucans. Dynamic spectroscopic studies on 
stretched cell walls suggest that the pectin-based net¬ 
work is independent and not interconnected to the 
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cellulose-xyloglucan network (Morris et al. 2003). The 
pectin-based network is considered to hold water and 
contribute to the swelling and de-swelling of the plant 
cell wall. For most fruits and vegetables the removal of 
calcium ions leads to cell separation and the break¬ 
down of the tissue structure. Isolation of pectin from 
the intercellular (middle lamellae) region of the cell 
walls yields a HM pectin, which contains sufficient 
suitable length acid blocks, and gels in the presence 
of calcium. Enzymatic de-esterification of pectin in the 
presence of calcium leads to gelation, and this has been 
suggested as a possible early stage in the self-assembly 
of plant cell wall structure (O’Brien et al. 2009). The 
residual acid residues on the pectin chains, that are not 
complexed with calcium, are considered responsible 
for the swelling of the cell wall, in response to changed 
ionic conditions. In the much more highly concen¬ 
trated environment of the plant cell wall, it is possible 
that other divalent cations, monovalent cations, or 
even complexation with basic proteins may contribute 
to the association of pectin chains (Morris et al. 2003). 

Molecular Structure and Function 

Labeling of cell wall structures with antibodies to 
methyl and non-methyl-esterified smooth regions 
(Verhertbruggen et al. 2009) and to hairy regions has 
shown the selective distribution of different types of 
pectin structure within the plant cell wall. Because of 
the importance of the ester distribution on smooth 
regions of the pectin chains in controlling the func¬ 
tional properties of pectin extracts, and within the plant 
cell during growth and development, there is consid¬ 
erable interest in developing chemical and physical 
methods for characterizing this heterogeneity of ester 
substitution within populations of pectin molecules 
(Winnig et al. 2009). These types of approaches largely 
involve chemical or physical characterization of com¬ 
plex extracts, their fractionation or fragmentation, and 
the simulation or modeling of the data to predict the 
composition. Direct visualization of individual pectin 
molecules has revealed unusual features such as lim¬ 
ited branching of the polygalacturonic acid backbone 
and, in pectin extracts from sugar beet pulp, the pres¬ 
ence of a high proportion of pectin-protein complexes 
(Morris et al. 2003; Morris 2007). The latter observa¬ 
tion, and direct studies of the interfacial properties of 
sugar beet pectin extracts, has demonstrated the 


importance of the complexes in explaining the unique 
behavior of sugar beet pectin extracts as emulsifying 
agents. Proteins are known to associate and form elas¬ 
tic interfacial networks that stabilize emulsions, and 
the presence of the pectin chains forms a steric barrier 
around the oil droplets, which inhibits droplet-droplet 
collisions and flocculation (Gromer et al. 2010). 

Bioactivity 

Consumption of fruit and vegetables is considered to 
be an important aspect of a healthy diet. These foods 
contain many compounds that are considered to pro¬ 
mote health and prevent the incidence and progression 
of chronic disease. An important aspect is the role for 
pectin as a source of dietary hber. Dietary fiber is 
fermented by colonic microflora contributing to gut 
health. Pectin is also one of a number of dietary poly¬ 
saccharides that induce immunomodulatory effects 
following oral consumption (Ramberg et al. 2010). 
There is growing evidence from cellular, animal, and 
some clinical studies that modihed forms of pectin 
may have anticancer properties (Glinsky and Raz 
2009). It has been proposed that modification leads to 
the release of fragments that can bind to the important 
regulatory protein galectin 3, which controls a series of 
stages in the initiation, development, and spread 
of cancers. There is direct evidence that linear galactan 
chains, present in the hairy regions of the pectin 
extract, will bind specifically to galectin 3 (Gunning 
et al. 2009) in a fashion that should inhibit its role as 
a regulator, suggesting that these fragments may be 
responsible for the anticancer bioactivity. 

Cross-References 

► Polysaccharides: Biophysical Properties 
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Synonyms 

Ribosomal peptide synthesis 

Definition 

The ribosome acts as a catalyst to form peptide bonds 
in protein synthesis. 

Basic Characteristics 

The translation of genetic code into proteins through 
the specific catalysis of peptide bond formation by the 
ribosome is one of the most important achievements of 
evolution. The ribosome is a very large molecular 
structure which contains RNA molecules as well as 
proteins. Different from normal protein enzymes. 
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however, the catalytic part is based on RNA. With the 
emergence of structural data on the large subunit of the 
(bacterial) ribosome responsible for peptide bond syn¬ 
thesis, computational studies into the mechanism 
could be conducted. Trobro and Aqvist hrst used 
molecular dynamics and free energy perturbation sim¬ 
ulations with ► empirical valence bond methods to 
examine possible catalytic mechanisms (Trobro and 
Aqvist 2005). They concluded that the most favorable 
mechanism does not involve general acid-base cataly¬ 
sis by ribosomal groups but rather proton shuttling 
between reactants. As in protein enzymes, pre¬ 
organization of the active site appeared to be crucial 
for catalysis, even though catalysis is apparently lim¬ 
ited to a reduction in activation entropy (and not 
enthalpy). The same authors later used similar methods 
to propose a way in which ribosomal release factors 
can induce the peptidyl-tRNA cleavage in the termi¬ 
nation of protein synthesis (Trobro and Aqvist 2007, 
2009). Other studies, including QM/MM simulations, 
have verified the importance of the electrostatic envi¬ 
ronment provided by the ribosome for catalysis 
(e.g., Kastner and Sherwood (2010)). New structural 
data made possible further detailed computational sim¬ 
ulations of the interactions of release factors with the 
ribosome, which revealed how specificity of release 
factors is achieved (Sund et al. 2010). Reading stop 
codons was shown to involve interactions and switches 
that are distinctly different from normal tRNA inter¬ 
actions during peptide synthesis. Overall, computer 
simulations have been crucial in turning structural 
information on the ribosome into a detailed mechanis¬ 
tic picture, as also has been the case with ribozymes 
(enzymes composed of RNA) (Banas et al. 2009). 

Cross-References 

► Empirical Valence Bond Methods 

► Molecular Dynamics Simulations of Lipids 

► QM/MM Methods 
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Phase Contrast Electron Microscopy 

Kuniaki Nagayama 

National Institute for Physiological Sciences, 5-1 
Higashiyama, Myodaiji-cho, Okazaki, Aichi, Japan 

Synonyms 

Defocus Phase Contrast (DPC); Electron microscopy; 
Electron cryo-microscopy; Hilbert Phase Contrast 
(HPC); Phase plate; Zernike Phase Plate (ZPC) 

Definitions 

Phase object: An object that is transparent like glass or 
water but capable of changing the phase of transmit¬ 
ting electron waves. When the phase change is much 
smaller than 90°(7t/2), the object is called a weak phase 
object. 

Phase contrast: A contrast representing phase infor¬ 
mation carried in imaged objects using interference 
between an unscattered incident wave representing a 
zeroth order diffraction and scattered incident waves 
representing higher-order diffractions. 

Phase plate: An optical device to visualize 
transparent objects by converting a phase retardation 
in the incidence wave induced by objects to an 
amplitude. 

Zernike phase plate: A phase plate to generate a 
Zernike phase contrast. 

Hilbert phase plate: A phase plate to generate 
a Hilbert phase contrast. 


Cut-on frequency: Lowest bound of spatial fre¬ 
quency until which Zernike phase contrast can be 
recovered, which is determined by the size of the 
central hole in a Zernike phase plate. 

Defocus phase contrast: A phase contrast using an 
interference between an unscattered wave and 
defocus-modulated scattered waves. 

Introduction 

Phase plate electron microscopy is an emerging bio¬ 
physics technique for imaging ultrastructures in 
unstained biological samples with a contrast compara¬ 
ble to that of stained samples (Danev and Nagayama 
2001; Nagayama 2005; Danev et al. 2009). Advan¬ 
tages of the phase plate electron microscopic imaging 
technique are that it can visualize biological systems 
with a high contrast without invoking heavy metal 
stains and can be naturally combined with electron 
cryo-microscopy, which is advantageous in preserving 
intact structures in biosystems in the physiological 
condition (Eukuda et al. 2009). All the lengthy treat¬ 
ments imposed to the traditional sample preparation, 
chemical fixation, dehydration, resin embedding, and 
staining, can be omitted. A tiny phase plate, which is 
made of a thin film made of amorphous carbon with 
a width of about 100 pm and thickness of 20-40 nm, is 
installed into EM at the portion immediately below the 
objective lens, where usually an aperture is settled. 
Practically, two kinds of phase plate of thin film type 
are proposed; Zernike phase plate (Danev et al. 2001) 
and Hilbert phase plate (Danev et al. 2002). The former 
is used for generating a Zernike phase contrast (ZPC) 
and the latter for generating a Hilbert phase contrast 
(HPC). Both methods bring about an improved con¬ 
trast for unstained biological specimens compared 
with that obtained with a conventional defocus phase 
contrast (DPC method). 

Phase Plates 

A phase plate is a kind of spatial filter that functions to 
modify the optical information transferred through 
a microscopic tool by modulating the phase of an 
incident wave. Lenses are one of the most typical 
spatial filter and function to converge or diverge the 
incident wave in a manner depending on the lens shape 
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Phase Contrast Electron 
Microscopy, 

Fig. 1 Schematics of phase 
contrast microscopy, (a) 

A spatial filter schematic for 
defocus phase contrast (DPC). 
(b) A schematic for Zemike 
phase contrast (ZPC). (c) 

A schematic for Hilbert phase 
contrast (HPC). The bottom 
traces represents 
corresponding phase plates 



with no change in the wave amplitude. Phase plates 
work similarly as like lenses by positionally altering 
the wave phase but in a much simpler manner. Its 
function is illustrated in Fig. 1 together with 
a schematical view of the image formation process. 
Two types of phase plates, Zemike phase plate 
and Hilbert phase plate, both made of thin-film 
amorphous carbon, are shown. The first idea of phase 
plate electron microscopy was given by Boersch 
more than 60 years ago (Boersch 1947) and the thin- 
film fype phase plafe is one of the three proposed by 
him and unique by its simple design to be 
manufactured. 

Nevertheless, phase plates of thin-film fype had 
never been put to practical use before the report 
published in 2001 (Danev and Nagayama 2001). The 
major hurdle of making workable phase plates has 
been a hardship inherent in electron microscopy, the 
charging of phase plates. 

This difficult issue was partially solved in 2005 by 
adopting a treatment of conductive coating to 
wrap charges in the hnish stage of the phase plate 
fabrication (Nagayama 2005). The complete settle¬ 
ment, however, has to be waited until 2012 (Nagayama 
2012 ). 

Various types of phase plates including the Einzel 
lens using electrostatic potential generated with a spe¬ 
cific design of stacked electrodes, named as Boersch 
phase plate, have been proposed and are under devel¬ 
opment (summarized in a review (Danev and 
Nagayama 2010a)). 


Biological Applications of Phase Plate EM 

The phase plate prepared with a combination of several 
techniques to neutralize electron-irradiation-induced 
charges enables us to obtain qualihed images for var¬ 
ious kinds of ice-embedded biological specimens. 
Examples of applications of functional phase plates 
to biological systems are shown. 

Zemike phase contrast cryo-EM: High-resolution 
applications to smaller biological systems such 
as protein molecules, protein molecular 
machines, and viruses have been obtained by using 
anti-charging phase plates. In Eig. 2, examples of 
ZPC images for a protein molecule and two viruses 
are shown. 

The first example is a single particle analysis (SPA) 
for an EM-standard protein, GroEL (Danev and 
Nagayama 2008). The advantage of ZPC cryo-EM is 
illustrated as a comparison of the DPC and ZPC cryo- 
EM image of GroEL. With the conventional approach, 
the most challenging aspect of SPA is the first step: 
selecting the images that correspond to protein mole¬ 
cules. This is due to the fact that individual protein 
molecules must be identified for noisy raw images, as 
shown in Fig. 2al. The handling of ZPC images with 
higher contrast, as shown in Fig. 2aII, becomes less 
prone to human bias. Once a sufficient number of 
particles have been sampled and particle selection is 
completed, analysis becomes more computer-based 
and straightforward. A 13 A resolution 3D map 
(Fig. 2aIII) was reconstructed from approximately 
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Phase Contrast Electron Microscopy, Fig. 2 Examples of 
biological applications of thin-film phase contrast microscopy, 
(a) A protein, GroEL (/ DPC image (300 kV), 11 ZPC image 
(300 kV), HI 3D model obtained from single particle analysis 
with ZPC images) (Danev and Nagayama 2008). (b) A virus 
capsid, herpes simplex virus type I (/ DPC image (200 kV), 11 
ZPC image (200 kV), HI 3D model) (Rochat et al. 2011). 
(c) Projection images of T4 phage at near zero tilt (/ DPC 
projection image (200 kV), II ZPC projection image (200 kV)). 


Tomographic images sliced from a 3D reconstructed T4 phage 
(III DPC slice image, IV ZPC slice image) (Danev and 
Nagayama 2010b). (d) Comparison of TEM images of 
cyanobacterial cells (Kaneko et al. 2005). I A 300 kV DPC- 
TEM image of an ice-embedded unstained whole cell (~15 pm 
defocus). II A 300 kV HPC-TEM image of the same ice- 
embedded unstained whole cell as shown in (a), (near-focus). 
Ill A 100 kV DPC-TEM image of a resin-embedded, sectioned, 
and stained cell 
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1,500 raw particles that were picked directly from raw 
ZPC images shown in the photograph of Fig. 2aII. 

The second example is a SPA for a virus capsid, 
herpes simplex virus type 1 B-capsid (HSV-IB) 
(Rochat et al. 2011). The ZPC cryo-EM image 
(Fig. 2hII) was targeted in a near defocus condition to 
obtain maximum contrast enhancement. The contrast 
of the resulting images is substantially higher than that 
of the DPC image (Fig. 2bl). The 2,308 single particle 
images were used for reconstructing the hnal asym¬ 
metric map (Fig. 2bIII). What is most remarkable in 
the asymmetric 3D model is the exact positioning of 
the genome packaging apparatus, which has been 
a long-standing research target in virology. It has 
been finally fixed inside the capsid as shown in 
Fig. 2bIII. 

Zernike phase contrast cryo-tomography. Phase 
plate application to Zernike phase contrast cryo- 
tomography has been technically more challenging 
than that to SPA for several reasons followed: (1) a 
tedious procedure of realignment of the phase plate in 
association with the sample tilting, (2) a need to write a 
software to cope semiautomatically with the realign¬ 
ment, and (3) occasional phase plate alternation to a 
fresh one to manage the charging issue which is 
unavoidable during the lengthy data acquisition in 
tomography. Overcoming these difficulties, the first 
cryo-tomographic data recorded with ZPC-TEM was 
published for the case of a virus, phage T4 (Danev and 
Nagayama 2010b). 

An example of ZPC cryo-ET for the case of T4 
phage is shown in Eig. 2c. Eigures 2cl and II show 
T4 phage images sampled from DPC and ZPC tilt 
series of ice-embedded T4 phage (close to zero-tilt). 
It is easy to see the overall contrast improvement 
produced by the phase plate. In the ZPC image, 
disturbing fringes are appearing around strong-contrast 
edge; the heavy dark fringe just inside the viral capsid, 
the thin light fringe outside the capsid, and the 
light fringe close to the edge of the carbon him in the 
upper-right comer. All these features arise from the 
hnite size of the phase plate central hole, which corre¬ 
sponds to a hnite cut-on frequency. Comparison of 
tomograms of bacteriophage T4, acquired under iden¬ 
tical experimental conditions, except for defocus 
and the presence of a phase plate, are shown in 
Pigs. 2cIII and IV. The signihcant advantage with 
ZPC in contrast and overall details over the DPC 
image has to be noted. 


Hilbert phase contrast cryo-EM: An example of 
HPC image is shown for a kind of cyanobacteria, 
which is one of the well-studied bacteria having 
a cylindrical geometry of about 3 pm (length) x 
1 pm (diameter). Pigure 2d shows a comparison of 
images obtained for an unstained ice-embedded 
whole cell and a stained sectioned cell (Kaneko et al. 
2005). A characteristic feature of a HPC cryo-EM 
image, together with the high contrast, is demonstrated 
in Pig. 2dII for an ice-embedded whole cell. Counter¬ 
examples obtained with DPC or a scattering contrast 
using staining is shown in Pigs. 2dl or III. An obscure 
structureless image is observed in the DPC cryo-EM 
image, which was taken under the same experimental 
conditions as for the HPC image in Fig. 2dl, except for 
the phase plate and the defocus setting. Comparing the 
pair of images, the ice-embedded whole cell (Fig. 2dII) 
and the resin-embedded sectioned cell (Fig. 2dIII), 
a large difference in the image appearance can be 
observed, which may be attributable to the difference 
in specimen treatments. In the sectioned cell the ragged 
cell wall implies some shrinkage of the cell. Many 
aggregates and associated voids are also recognized, 
which are inevitably induced by chemical treatment, 
such as dehydration and selective staining of cellular 
organelles. On the other hand, the images of the ice- 
embedded cell are smoothly round and recognizably 
space-filled everywhere. The preserved roundness of 
cyanobacterial cells allows us to estimate the specimen 
thickness to be about 1 pm. 

Summary 

The first proposal of the idea of phase contrast electron 
microscopy using phase plates goes back to 1947 but 
its practical utilization belongs to a recent topic. The 
first successful realization using thin carbon film for 
the phase plate idea was reported in 2001 and various 
proposals followed to improve the performance and 
overcome the weakness intrinsic to the technology, the 
phase plate charging. Among them the thin-film type 
of phase plates, which is yet a device unique to bring 
about informative biological results, is focused in this 
entry. Three phase contrast methods, defocus, Zernike, 
and Hilbert phase contrast, are compared in their 
design and in the image characteristics, particularly 
the image contrast revealed in the micrograms of bio¬ 
logical specimens such as proteins, viruses, and 
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bacteria. The largely enhanced phase contrast realized 
by using Zemike phase plates has been proven to be 
beneficial in 3D-EM such as the single particle analysis 
and cryo-tomography. 

Cross-References 

► 3D Electron Microscopy Based on Cryo-Electron 
Tomography 

► Electron Crystallography 

► Electron Microscopy 

► Protein Structure Comparison Methods 
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► X-Ray Diffraction and Crystallography of 
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Synonyms 

Lateral phase separation; Lipid phase equilibria; Lipid 
self-organization; Lipids; Polymorphic and mesomor¬ 
phic behavior of lipids 

Definition 

Lipid polymorphism is the ability of lipids to form 
more than one solid structure (phase). Mesomorphic 
state (phase) is a state of matter intermediate between 
liquid and crystal. Lipid phase transitions are intercon¬ 
versions between various polymorphic and mesomor¬ 
phic lipid phases. 

Introduction 

Lipids constitute a varied and important group of 
biomolecules. Most lipids are amphiphilic and can 
behave as lyotropic liquid crystals. In the presence of 
water, they self-assemble in a large variety of phases 
with different structure and morphology. The lipid 
polymorphic and mesomorphic behavior, i.e., their 
ability to form various ordered, crystalline, gel, or 
liquid-crystalline phases as a function of water content. 
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Phase Transitions and 
Phase Behavior of Lipids, 
Fig.1 Structures of 
lipid phases. I. Lamellar 
phases: (a) subgel, L^; {h) gel, 
untilted chains, Lp; (c) gel, 
tilted chains, Lp<; (if) 
rippled gel, Pp'; (e) 
fully interdigitated gel, Lp‘“; 
(/) partially interdigitated gel; 

(g) mixed interdigitated gel; 

(h) liquid crystalline, L^. II. 
Lipid micellar aggregates: (;) 
spherical micelles, Mp (J) 
cylindrical micelles (tubules); 
(A) disks; (/) inverted micelles, 
Mii; (m) liposome. III. 
Nonlamellar mesomorphic 
(liquid crystalline) phases of 
various topology; (;i) 
hexagonal phase, Hi; (o) 
inverted hexagonal phase, Hn; 
(p) inverted micellar cubic 
phase, Qn^; (<?) bilayer cubic 
(Qii) phase, Im3m; (r) bilayer 
cubic phase, Pn3m; (s) bilayer 
cubic phase, Ia3d 




Im3m Pn3m Ia3d 


temperature and composition, as well as the mutual 
transformations between these phases, is the subject of 
this entry. 

Lipid Phase Nomenclature 

Lipid self-assembly in a variety of different phases is 
a function of their chemical structure as well as of 
external variables such as water content, temperature, 
pressure, and aqueous phase compositions. These 
phases are made of aggregates of different architecture 
(Fig. 1), with the aggregation process being driven by 


the hydrophobic effect (see ► Lipid Organization, 
Aggregation, and Self-assembly). 

Lipid polymorphic and mesomorphic phases are 
characterized by their (1) symmetry in one, two, or 
three dimensions; (2) hydrocarbon chain arrangement 
in the ordered gel and crystalline phases; and (3) type 
(normal or inverted) for the curved mesomorphic 
phases. For more than four decades, the nomenclature 
introduced by Luzzati has been used to designate 
lipid phases (Luzzati 1968). In this nomenclature, 
lattice periodicity is characterized by upper-case 
Latin letters: L for one-dimensional lamellar lattice, 
H for two-dimensional hexagonal lattice, P for 
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two-dimensional oblique or rectangular lattice, and T, 
R, and Q for the three-dimensional rectangular, 
rhombohedral, and cubic lattices, with space groups 
specihed according to the International 
Tables (Kasper and Lonsdale 1985). A Greek or 
Latin subscript is used as a descriptor of the 
chain conformation: a for disordered (liquid crystal¬ 
line), P for ordered (gel), P' for ordered tilted, 
and C for crystalline (subgel). Roman numerals 
are used to designate the aggregate type: I for 
oil-in-water (normal) and II for water-in-oil 
(inverted) type. 

Phase Transition Types 

Temperature and water content are primary variables 
in lipid-water systems, responsible for their thermo¬ 
tropic and lyotropic phase behaviors. Best known is the 
phase behavior of diluted lipid-water systems with 
water contents sufficient not only for full hydration of 
the lipid molecules (so-called excess water limit) but 
also for the transitions into mesomorphic phases 
with complicated spatial geometry and large internal 
aqueous volumes such as the inverted bicontinuous 
cubic phases (Fig. Iq-s), which require water contents 
well above the excess water limit for their 
development. 

A generalized phase sequence of thermotropic 
phase transitions for membrane lipids (phospholipids 
and glycolipids) may be written as (Tenchov 1991): 

Lc <-> L^ <-> Lj 2 <-> Q®n Hn <-> Q^n Mq. (1) 

On heating, a lamellar crystalline (subgel) Lc phase 
transforms into lamellar gel Lp phase; the latter phase 
undergoes a melting transition into the lamellar liquid- 
crystalline Lq, phase. Upon further increase of temper¬ 
ature, a series of mesomorphic phase transitions follow 
the sequence bilayer cubic Qn® - inverted hexagonal 
Hn - inverted micellar cubic Qn^ - inverted micellar 
Mil. Some lipids can form two or more modifications 
of a given phase, for example, gel phases of different 
structures (interdigitated, noninterdigitated, tilted, etc., 
see Fig. IT), mesomorphic cubic phases of different 
topology (ImSm, Pn3m, etc., see Fig. l.IIIq-s). Inter¬ 
mediate lipid phases have been reported as well, for 
example, the liquid-ordered phase has attracted atten¬ 
tion in the recent years because of its relevance to the 


functional lipid rafts in membranes (Simons and 
Ikonen 1997). 

With increase of the water content at constant tem¬ 
perature, the mesomorphic lipid phases arrange in the 
following sequence (Seddon 1990): 

inverted phases (Mu, Qn"^, Hn 

normal phases(Q®i,Hj, Qi”^) 

^ micellar solution monomers. (2) 

Typically, double-chain lipids only form L„ and 
inverted phases, while single-chain lipids can also 
form normal phases and micellar solutions. The 
phase sequence (2) is rationalized by the effect of 
water content on the effective shape of the lipid mol¬ 
ecules. Low hydration levels lead to tighter packing 
and smaller surface molecular areas of the lipid polar 
head groups, resulting in negative interfacial curvature 
and a tendency to formation of inverted phases. With 
increase of the water content, the surface molecular 
area increases; consequently, the concave interfaces of 
the inverted phases sequentially transform into the flat 
interface of the L^, phase and into the convex interfaces 
of the normal mesomorphic phases. Further dilution 
results in dissipation of the periodic lipid structures and 
formation of micellar solutions (and eventually mono¬ 
mer solutions at very high dilutions which bring the 
lipid concentration below the critical micellar concen¬ 
tration, CMC). 

Gel-Liquid-Crystalline Phase Transition 

From a biological viewpoint, of greatest interest are 
the transitions involving the physiologically 
important lamellar liquid-crystalline phase, namely, 
the gel-liquid-crystalline (melting) transition, and the 
lamellar-nonlamellar mesomorphic transitions. 

The lamellar gel-lamellar liquid-crystalline 
(Lp-L^;) phase transition, also referred to as (chain-) 
melting, order-disorder, solid-fluid, or main transi¬ 
tion, is the major energetic event in the lipid bilayers, 
taking place with a large enthalpy change. It is associ¬ 
ated with rotameric disordering of the hydrocarbon 
chains, increased head group hydration and increased 
intermolecular entropy (Nagle 1980). The energy 
required to expand the hydrocarbon chain region 
against attractive van der Waals interactions (volume 
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expansion) and to increase bilayer area (increased 
hydrophobic exposure at the polar-apolar interface) 
contributes to the large transition enthalpy change. 

The melting gel-liquid-crystalline transitions in 
fully hydrated lipids are accompanied by large 
increases in lipid surface area (^25%) and specific 
volume (^4%). In calorimetric measurements, they 
manifest as sharp, narrow heat capacity peaks 
with enthalpy of ^20-40 kJ/mol (Marsh 1990) 
(see ► Differential Scanning Calorimetry (DSC), Pres¬ 
sure Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers). Also, 
large volume fluctuations give rise to a strong increase 
of the isothermal bilayer compressibility at the melting 
transition temperature (Schrader et al. 2002). As a 
result of a dramatic increase of the bending elasticity, 
large bilayer undulations (anomalous swelling) have 
been observed at the melting transition (Monger et al. 
1994; Chu et al. 2005; Pabst et al. 2004). 

The temperature of the chain-melting transition is 
determined largely by the hydrocarbon chains - the 
longer and more saturated they are, the higher the 
transition temperature (Fig. 2a). For lipids with unsat¬ 
urated chains, the position and type of the double bond 
substantially modulate the melting temperature 
(Fig. 2b). Additionally, the melting temperature is 
affected also by chain branching and by the chemical 
linkage between the chains and the polar head group. 
Anhydrous lipids with identical hydrocarbon chains 
exhibit melting phase transitions at nearly identical 
temperatures. In aqueous dispersions, however, the 
head group interactions and the lipid-water interac¬ 
tions largely modify the lipid phase behavior. 
A summary of the phase transition temperatures of 
the major membrane lipid classes with different chain 
lengths is given in Table 1. 

Most of the membrane lipids have two different 
hydrocarbon chains, usually one saturated and one 
unsaturated; most common are the glyceropho- 
spholipids with saturated sn-1 chain typically 16-18 
carbon atoms long, and unsaturated sn-2 chain typi¬ 
cally 18-20 carbon atoms long. The gel-liquid- 
crystalline (Lp ^ Lot) transition temperatures of 
mixed-chain phosphatidylcholines are summarized in 
Table 2. It is evident from these data that altering the 
lipid chain length and unsaturation modulates the lipid 
phase state in very broad limits, thus providing the 
basis of a mechanism for membrane adaptation to 
large fluctuations in the environmental temperatures. 



Phase Transitions and Phase Behavior of Lipids, 
Fig. 2 Dependence of the phase transitions temperature on 
lipid chemical structure: (a) hydrocarbon chain length depen¬ 
dence of the Lp-Lo, {black squares) and L^i-Hn {open circles) 
phase transition temperatures in saturated diacyl phosphatidyl- 
ethanolamines (Seddon et al. 1983; Koynova and Caffrey 1994) 
and (b) dependence of the Lp-L^ phase transition temperature on 
the double-bond position for dioctadecenoyl phosphatidylcho¬ 
line bilayers (Koynova and Caffrey 1998) 

Formation of Nonlamellar Phases in 
Membrane Lipids 

Dispersions of double-chain nonlamellar membrane 
lipids most frequently display a lamellar-inverted- 
hexagonal, Lc-Hii, phase transition. In some instances, 
they can also form inverted phases of cubic symmetry. 
Important role in the lamellar-nonlamellar transforma¬ 
tions plays the membrane elastic energy (Siegel 2005; 
Gruner 1994). 

The Lo(-Hii transition may be considered as a result 
of competition between the spontaneous tendency of 
the lipid layers to bend and the resulting hydrocarbon 
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Phase Transitions and Phase Behavior of Lipids, Table 1 Gel-liquid-crystalline and lamellar-nonlamellar phase transition 
temperatures (°C) of fully hydrated lipids as a function of the lipid polar head group and hydrocarbon chain length 

Head group 
Chains sn-l/sn-2 

PC 1 

PE 

PG 

PS 

PA PI 

CL 

Glc 

Gal 

Mai 

N-Sph PC 

N-Sph Gal 

Gel-liquid-crystalline transition?, (Lp 











10:0/10:0 

-5.7 

2.0 






-6.9 




11:0/11:0 

-13.9 

16.9 





1.9 





12:0/12:0 

-1.9 

31.3 

-2 

14.2 

32 

25.4 

19.5 

26.8 




13:0/13:0 

13.5 

42.1 

10.7 




32.9 





14:0/14:0 

23.4 

50.4 

24 

35.4 

54 20 

40 

40.5 

48.7 

40.9 

25 


15:0/15:0 

33.7 

58.4 

33 


58 


50.7 



42 


16:0/16:0 

41.7 

64.4 

41.5 

51.4 

65 

58 

57.2 

61.6 

56.6 

44.5 

82 

17:0/17:0 

48.6 

70.5 





63.4 



46.5 


18:0/18:0 

54.5 

74.2 

54 

63.7 

75.4 

70 

68.4 

73.5 

66.7 

44.5 

84 

19:0/19:0 

60.2 

79.2 





73.7“ 



47.5 


20:0/20:0 

65.3 

83.4 





76.8“ 

80.0“ 



83 

21:0/21:0 

70.7 











22:0/22:0 

73.6 

90.0“ 










23:0/23:0 

77.9 











24:0/24:0 

80.3 











16:lc9/16:lc9 

-4.0 

-33.5 










18:lc9/18:lc9 

-18.0 

-7.3 

-18.3 

-11.0 

-11.0 







18:2c9,12/18:2c9,12 

-55.1 











18:3c9,12,15/18:3c9,12,15 

-63.0 











Lamellar-nonlamellar transition (L^ ■ 


unless otherwise indicated) 







14:0/14:0 







105'“ 

80.6” 




15:0/15:0 







82'“ 





16:0/16:0 


118.5 





79^ 

119“ 

80.7”; 

82.3'* 




17:0/17:0 


107.6 





76.6 





18:0/18:0 


101 





74.5; 

73.9"' 

76.0; 

85.0“ 




19:0/19:0 


97.8 





73.7“ 





20:0/20:0 


94.2 





76.8“; 

78.9"' 

80.0“; 

89.0“ 




22:0/22:0 


90.0“ 










16:0/18: lc9 


70.8 










16:lc9/16:lc9 


43.4 










18:lc9/18:lc9 


8.5 










18:1 t9/18:l t9 


62.2 










18:lc9/20:4c5,8,ll,14 


32.0 











PC, diacylphosphatidylcholines, PE diacylphosphatidylethanolamines, PG diacylphosphatidylglycerols, PS diacylphosphati- 
dylserines, PA diacylphosphatidic acids, PI diacylphosphatidylinositols, CL cardiolipins, Glc diacylglucosylglycerols, Gal diacylga- 
lactosylglycerols, Mai dialkylmaltosylglycerols, N-Sph PC sphingomyelins, N-Sph Gal galactocerebrosides (for the sphingolipids, 
chain length refers to the single fatty acid chain) 

"Lp ^ Hn 

"L, ^ Qn 

“^Qii ^ Hn 


• Hn transition 


chain packing strain - thus, membranes exist in a state 
of frustrated curvature stress (Gruner 1985). Respec¬ 
tively, the Lo(-Hn transition is believed to be driven by 


the relaxation of the curvature of the lipid monolayers 
toward their spontaneous curvature. Oppositely to the 
Lp-L„ transition, the Lo,-Hn transition temperature 
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Phase Transitions and Phase Behavior of Lipids, 
Table 2 Decrease of the gel-liquid-crystalline (Lp ^ L„) tran¬ 
sition temperatures of fully hydrated acyl-chain phosphatidyl¬ 
cholines with increasing sn-2 chain unsaturation. The first cis 
double bond causes the biggest transition temperature drop to 
occur, while further increases of chain unsaturation have much 
smaller effects 


Chains sn-ljsn-l 

Temperature (“C) 

16:0/16:0 

41.7 

16:0/16: lc9 

30.0 

16:0/18:0 

49.0 

16:0/18: lc9 

-2.5 

16:0/18:2c9,12 

-19.6 

16:0/20:0 

51.3 

16:0/20:4c5,8,11,14 

-22.5 

16:0/22:0 

52.8 

16:0/22: lcl3 

11.5 

16:0/22:6c4,7,10,13,16,19 

-3.0 

18:0/18:0 

54.5 

18:0/18:lc9 

6.9 

18:0/18:2c9,12 

-14.4 

18:0/18:3c9,12,15 

-12.3 

18:0/20:0 

60.4 

18:0/20: Icll 

13.2 

18:0/20:2cll,14 

-5.4 

18:0/20:3c8,ll,14 

-9.3 

18:0/20:4c5,8,11,14 

-12.9 

18:0/20:5c5,8,11,14,17 

-10.4 

18:0/22:0 

61.9 

18:0/22:lcl3 

19.6 

18:0/22:4c7,10,13,16 

-8.5 

18:0/22:5c4,7,10,13,16 

-6.4 

18:0/22:6c4,7,10,13,16,19 

-3.8 

18:0/24:0 

62.7 

18:0/24:lcl5 

31.8 

20:0/18:0 

57.5 

20:0/18:lc9 

11.5 

20:0/20:0 

65.3 

20:0/20: Icll 

20.5 

20:0/20:2cll,14 

5.4 

20:0/20:3cll,14,17 

1.8 

20:0/20:4c5,8,11,14 

-7.5 

20:0/22:0 

69.6 

20:0/22: lcl3 

29.2 

20:0/24:0 

70.6 

20:0/24: lcl5 

36.6 

22:0/18:0 

58.6 

22:0/18: lc9 

15.1 

22:0/20:0 

67.7 

22:0/20: Icll 

22.9 

22:0/22:0 

73.6 


(continued) 


Phase Transitions and Phase Behavior of Lipids, Table 2 

(continued) 


Chains sn-l/sn-2 

Temperature (°C) 

22:0/22: lcl3 

32.8 

22:0/24:0 

77.1 

22:0/24: lcl5 

41.7 

24:0/18:0 

58.9 

24:0/18:lc9 

20.7 

24:0/20:0 

68.4 

24:0/20:lcll 

24.5 


decreases with the hydrocarbon chain length increase 
(Fig. 2a). At sufficiently long chains, the Lo, phase is 
completely eliminated, and direct Lp-Hn transitions 
take place on heating. Such direct transitions have 
been observed for diacyl PEs of 22-carbon chains and 
monoglycosyldiacylglycerols of 19-20-carbon chains 
(Table 1). With long-chain glycolipids, a direct Lc-Hn 
transition is even observed, where both the Lp and 
phases are eliminated from the phase sequence. Inter¬ 
estingly, intermediate phases missing on heating may 
intervene in the cooling phase sequence. 

Among the seven cubic phases so far identified in 
lipids, of significant interest are the inverted 
bicontinuous or bilayer cubic phases with space groups 
(Im3m), (Pn3m), and (Ia3d) (Lindblom 

and Rilfors 1992; Luzzati et al. 1997) (Fig. l.IIIq, r, s). 
Whenever present, the bilayer cubic phases are located 
in a temperature range between the Lo, and the Hn 
phases. However, direct Lo,-Qii® transitions are rare 
in membrane lipid dispersions and mainly observed for 
short-chain PEs and monoglycosyldiacylglycerols. In 
many cases, a Qn® phase can be induced by means of 
temperature cycling through the La-Hn transition or 
by cooling of the Hn phase (Shyamsunder et al. 1988; 
Tenchov et al. 1998). A transformation from lamellar 
into bilayer cubic phase may be considered as cooper¬ 
ative act of multiple fusion events, whereby a set of 
initially separate, parallel bilayers fuse into a single 
bilayer of specific topology. The lamellar-cubic tran¬ 
sitions have very small, if any, latent heats. Although 
energetically inexpensive, these transitions are typi¬ 
cally rather slow. The slow formation, hysteretic 
behavior, and extended metastability ranges of the 
cubic phases create significant difficulties in their 
study and applications. 

Inverted micellar cubic phases have been observed 
mainly in mixtures of double-chain polar lipids 
with fatty acids or diacylglycerols, and also in some 
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Phase Transitions and Phase Behavior of Lipids, 
Fig. 3 Transition pathways in fully hydrated lipids. Schemes 
(a) and (b) are representative for PEs of intermediate chain 
length and for protonated DPPG; scheme (c), for short-chain 
PEs and PGs; and scheme (d), for the PCs. Solid horizontal lines 
indicate stable phases and dashed lines indicate metastable 
phases. The wavy lines represent isothermal relaxation into the 


Lc phase. LRl and SGII are metastable ordered gel phases with 
orthorhombic hydrocarbon chain packing. Schemes of the 
hydrocarbon chain packing: (e) hexagonal packing, (f) ortho¬ 
rhombic packing with four nearest neighbors, and (g) ortho¬ 
rhombic packing with two nearest neighbors (Reproduced from 
Tenchov et al. (2001). With permission of the Biophysical 
Society) 


single-component dispersions of glycolipids (Seddon 
et al. 2000). The most frequently observed inverted 
micellar cubic phase in lipids is of space group 
(Fd3m). For medium-chain lipids (>16 C atoms), it 
typically forms via an Hu-Qu'^ transition. 

Polymorphic Transitions Between Solid 
Lipid Phases 

At temperatures below the main transition, a basic 
equilibrium structure is the subgel (crystalline) L;, 
phase. Its formation usually requires prolonged low- 
temperature incubation. In addition to the L;, phase, 
a large number of intermediate stable, metastable, and 
transient lamellar gel structures are adopted by 


different lipid classes - with perpendicular or tilted 
chains with respect to the bilayer plane, with fully 
interdigitated, partially interdigitated, or noninter- 
digitated chains, rippled bilayers with various ripple 
periods, etc. (Fig. 1). A number of polymorphic phase 
transitions between these structures have been 
reported. Examples of such polymorphic transitions 
are the subtransition (Lc-LpO and the pretransition 
(Lp'-PpO in phosphatidylcholines. A polymorphic 
transition including rapid, reversible transformation 
of the usual gel phase into a metastable, more ordered 
gel phase with orthorhombic hydrocarbon chain pack¬ 
ing (so-called Y-transition) was reported to represent 
a common pathway of the bilayer transformation into 
subgel (crystalline) phase (Fig. 3) (Tenchov et al. 
2001 ). 
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Phase Transitions and Phase Behavior of Lipids, Table 3 Examples of heating, cooling, and isothermal phase sequences in 
lipid dispersions (Tenchov 1991) 

Lipid 

Scan direction 

Phase sequence 


DPPC 

Heating 

Lci 

13“C J,p, 35”C,Pp, 41,5"C 

La 


Cooling 

L. 

-^L5!c; ^ Lp. ^ Le2 



Isothermal equilibration 





<8°C 

Lc2 

^ Lci 


DLPE 

1 St heating 

Lci- 

43°C T 



Cooling 

Lo,- 

-.^Lp 



Isothermal equilibration 2°C, 9 days 

Lp- 

—> Lci 



Isothermal equilibration 26°C, 15 h 

Lp- 

-> Le2 



Isothermal equilibration 32°C, 15 h 

Lo,- 

Lci 


DMPE 

1 St heating 

Lc- 

56° C T 
-> 



Cooling 

Lo,- 

A2!C^I,p 



Isothermal equilibration 2°C, 9 days 

Lp- 

-Le 


DOPE 

Heating 

Lp. 




Cycling (n > 100) 

(L. 

"Hli)n ^ Qll 



Deep cooling (<—20°C) 

Qit 

-^Lp 


DOPE-Me 

Heating (>l°C/h) 

Lp- 

<0°C T 67°C 



Heating (<l°C/h) 

La- 

62“C 72~79“Ch,i 



Isothermal equilibration 55°C, 20 h 

La- 

disordered state ^ Qu 



Deep cooling (<—20°C) 

Qit 

Lp 


14-Gal 

1 St heating 

Lci 

^Le2-®!C^H„ 



Cooling 

Hu- 

60°C T 47°C T o 38°C 

- > ^ -*->3- 

-►Lc2 


2nd heating 

Lc2 

-^^H„ 


18-Gal 

1 St heating 

Lci 




Cooling 

Hii- 

T.p 



2nd heating 

Lp- 

exotherm, r-^ASoC t 78° C 

->■ ^c2 -s 

■ Hr, 


Isothermal equilibration 60-70°C, 1 h 

Lp- 

-> Lc2 



DPPC dipalmitoyl PC, DLPE dilauroyl PE, DMPE dimyristoyl PE, DOPE dioleoyl PE, DOPE-Me dioleoyl-W-methylethanolamine, 
14-Gal ditetradecylgalactosylglycerol, 18-Gal dioctadecylgalactosylglycerol 


Reversibility of the Phase Transitions: 
Transition Hysteresis and Formation of 
Metastable Phases 

Due to long relaxation times, especially in the transi¬ 
tion vicinity, lipid phase transitions often display pro¬ 
nounced hysteresis and end up with formation of 
metastable phases, which replace the equilibrium 
phases in cooling scans. The metastable phases can 
be long-lived and display no spontaneous conversion 
to the ground state on reasonable time scales. 

A large number of rate-limiting factors have been 
suggested as physical reasons leading to hysteretic 
behavior and formation of metastable phases: long 
hydration/dehydration times, slow reformation of 
hydrogen-bond networks, restricted molecular motion 
in the low-temperature solid-solid transformations. 


relative stability of the interfaces between solid and 
fluid domains, large spatial rearrangements in lamel- 
lar-nonlamellar transitions, low rate of appearance of 
critical-size nuclei of the nascent phase, and arrestment 
in local free energy minima. A comparison between 
heating and cooling phase sequences observed in aque¬ 
ous dispersions of lipids shows the frequent occurrence 
of additional, metastable phases, which only form in 
the cooling direction (Table 3). 

Phase Transitions in Lipid Mixtures: Phase 
Diagrams 

Composition is another important variable that 
strongly modulates the lipid phase behavior. The 
molecular interactions in a lipid mixture are reflected 
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in its phase behavior and are best presented by means 
of a temperature-composition phase diagram (see 
► Thermodynamics of Lipid Interactions). Various 
types of lipid phase diagrams reported in the literature 
are shown in Fig. 4. The lens-like diagram in Fig. 4a is 
characteristic for lipid mixtures which are completely 
miscible in both gel and liquid-crystalline phases. In 


order to display such complete miscibility, the two 
components must be structurally very similar. This 
kind of diagram is typical for lipid species with the 
same head group, differing by not more than two 
methylene groups in their hydrocarbon chains, such 
as the DMPC/DPPC binary (Fig. 4a). A usual compli¬ 
cation of the lens-type diagram is the frequently 


a 



d 




g 





h 




DOPC DSPC 


Phase Transitions and Phase Behavior of Lipids, Fig. 4 (continued) 
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occurring solid-state miscibility gap, where the mix¬ 
ture separates into two solid phases of different com¬ 
position. In mixtures of lipids with sufficiently 
different structures, the miscibility gap may overlap 
with the region of the solid-liquid-crystalline phase 
coexistence, giving rise to eutectic (Fig. 4b, c) or 
peritectic phase diagrams (Fig. 4d), in which single 
three-phase points exist. Horizontal solidus lines, 
reporting for such kind of behavior, have been 
observed for numerous lipid mixtures. Miscibility 
gaps may also occur in the liquid-crystalline phase of 
certain lipid mixtures. A phase diagram with a liquid- 
liquid immiscibility region is the monotectic phase 
diagram shown in Fig. 4e, with a monotectic triple 
point of coexistence of one solid and two liquid phases. 

Deviations from ideal mixing may occur not only 
with tendency for clustering of the like molecules, 
eventually leading to phase separation, but also when 
contacts between unlike molecules are preferred, i.e., 
when the nearest-neighbor pairs tend to be made up 
of unlike molecules (“chessboard” arrangement) 
(see ► Thermodynamics of Lipid Interactions). Such 
mixtures often display phase diagrams with upper 
isoconcentration (azeotropic) point - such as the 
DPPC/palmitic acid diagram shown in Fig. 4g. Except 
for the phosphatidylcholine/fatty acid mixtures, phase 
diagrams with upper isoconcentration point are typical 
for mixtures containing charged lipid. An example of 
a phase diagram with lower isoconcentration point is 
shown in Fig. 4f. 

The DPPC/cholesterol phase diagram in Fig. 4h 
contains a critical point. It is related to the existence 


of a peculiar, liquid-ordered (lo) phase in the mixtures, 
which is believed to be the prototype of the lipid rafts. 

Ternary phase diagrams are another important tool 
for characterization of the phase properties of complex 
lipid mixtures. These kinds of diagrams have proven 
especially useful recently, in the analysis of domains in 
model systems (see ► Lipid Domains). Ternary mix¬ 
tures of one phospholipid having a relatively high 
melting temperature and another phospholipid having 
a relatively low melting temperature together with 
cholesterol are viewed as useful models for the outer 
leaflet of animal-cell plasma membranes. An example 
of a ternary phase diagram is shown in Fig. 4i. It 
illustrates the rich phase behavior displayed by ternary 
lipid mixtures represented in this particular case by 
four regions of two-phase coexistence and one region 
of three-phase coexistence. 

Role of the Aqueous Phase Composition 

The Hofmeister Effect. The interactions of the lipid 
polar groups with water have an important contribution 
to the energy balance of a given phase. Many of the 
lipid phase transitions take place with large changes in 
the lipid surface area and consequently in the amount 
of bound water. The lipid hydration is determined by 
the chemical structure of the polar groups, but in fully 
hydrated systems, with water in excess, the extent of 
the polar group hydration depends also on the state 
of the bulk water. On the other side, various 
low-molecular solutes are known for their ability 


< - 

Phase Transitions and Phase Behavior of Lipids, 
Fig. 4 Types of lipid phase diagrams: (a) close to ideal phase 
diagram of the DMPC/DPPC mixture (Reproduced from Mabrey 
and Sturtevant (1976) with author’s permission); (b) eutectic 
phase diagram of the l-stearoyl-2-caprylphosphatidylcholine/ 
DMPC mixture (Reproduced from Lin and Huang (1988). With 
permission from Elsevier); (c) phase diagram of the DPPC/ 
C12C24PC mixture exhibiting eutectic and peritectic points 
(Reproduced from (Gardam and Silvius 1989) with permission 
from Elsevier); (d) peritectic phase diagram of the DMPE/DSPE 
mixture (Reproduced from (Dorfler 2000) with permission of 
Springer Science and Business Media); (e) monotectic phase 
diagram of the DEPC/DPPE mixture (Reproduced with permis¬ 
sion from (Wu and McConnell 1975); copyright (1975) American 
Chemical Society); (f) phase diagram with a lower 
isoconcentration (azeotropic) point) of the DEPC/DMPC mixture 


(Reproduced from (VanDijck et al. 1977) with permission from 
Elsevier); (g) phase diagram of the DPPC/palmitic acid mixture, 
combining upper isoconcentration (azeotropic) point with eutectic 
and peritectic points (Reproduced from (Inoue et al. 2001) with 
permission from Elsevier); (h) phase diagram with a critical point 
of the DPPC/cholesterol mixture (Reproduced from (Ipsen et al. 
1990) with permission of the Biophysical Society); (i) phase 
diagram of the ternary mixture distearoylphosphatidylcholine 
(DSPC)/dioleoylphosphatidylcholine (DOPC)/cholesterol at 
23° C, showing four regions of two-phase coexistence: liquid 
crystalline and gel (L^ + Lp), liquid ordered and gel (Lq + Lp), 
liquid crystalline and liquid ordered (Lo, + Lo), and liquid ordered 
and crystals of cholesterol monohydrate; there is also one region 
of three-phase coexistence, liquid ciystalline, gel, and liquid 
ordered (Lo, + Lp + Lo) (Reproduced from (Feigenson 2006) 
with permission) 
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Phase Transitions and Phase Behavior of Lipids, 
Tabie 4 Effect of solutes on the phase transition temperatures 
of hydrated PE (Koynova et al. 1997) 


Lipid 

Solute 

Slope (dTtr/dc) [K/M] 

Lp-Lc 

Lc-Hn 

Salts 

DMPE 

NaCl 

1.5 (1-4 M) 

-8.0 (3-6 M) 

DPPE 

NaCl 

2.0 

-7.4 


CaCL 

3.4 


POPE 

Na 2 S 04 


-16.0 


NaCl 


-7.3 


NaBr 


-3.5 


Nat 


6.0 


GuHCl 


8.0 


NaSCN 


27.0 

DSPE 

NaCl 

1.3 

-4.3 

DEPE 

Na 2 S 04 

1.7 

-9.9 


NaOAc 

1.5 

-9.1 


NaCl 

1.3 

-5.6 


NaSCN 

-3.3 

20.0 


GuHCl 

-4.6 

5.0 


GuHSCN 

-6.5 

50.0 

DTPE 

NaCl 

1.2 

-3.8 

DHPE 

NaCl 

1.3 

-2.5 

Monosaccharides 

DEPE 

Glucose 


-0.2 


Galactose 


-0.6 


Fructose 


-2.5 

Sugar alcohols 

POPE 

Sorbitol 


-9.0 

DEPE 

Sorbitol 


-5.4 


Myo-inositol 


-5.5 

Other polyols 

POPE 

Glycerol 


-3.3 

DSPE 

Glycerol 

0.9 

-1.0 

DEPE 

Glycerol 


-2.8 


PEG 


0.2 

Disaccharides 

POPE 

Sucrose 


-16.5 


Raffinose 


-30.0 

DEPE 

Sucrose 


-12.6 


Trehalose 


-10.8 


Lactose 


-10.0 


Maltose 


-9.0 

DMPE 

Sucrose 

0.3 


DSPE 

Sucrose 

1.4 

-8.4 


Trehalose 

1.1 

-8.7 

Amino acids 

DHPE 

Proline 

0.9 

-1.8 


{continued) 


Phase Transitions and Phase Behavior of Lipids, Tabie 4 

(continued) 


Lipid 

Solute 

Slope (dTtr/dc) [K/M] 


Lp-Lj, L^-Hn 

Others 

POPE 

Urea 

1.3 


DPPE dipalmitoyl PE, POPE palmitoyl-oleoyl PE, DSPE 
distearoyl PE, DEPE dielaidoyl PE, DTPE ditetradecyl PE, 
DHPE dihexadecyl PE 


to Strongly modulate the bulk water structure - “water- 
structure makers” (kosmotropes) and “water-structure 
breakers” (chaotropes). It thus turns possible that, even 
without direct interaction with the lipid polar heads, 
solutes can largely modulate the properties of the 
lipid-water interface and hence the lipid phase behav¬ 
ior. Changes in the aqueous phase composition are able 
to shift substantially the temperature regions of stabil¬ 
ity of the different lipid phases as well as to induce or 
suppress the formation of certain phases. Indirect sol¬ 
ute effects of such kind on the interfacial properties, 
generally termed the Hofmeister effect, have been 
found in a large number of lipid-water phases 
(Koynova et al. 1997). A great number of studies on 
the nature of the Hofmeister effect indicate that it 
results from an interplay of electrostatics, dispersion 
forces, thermal motion, fluctuations, hydration, polar¬ 
izability, ion size effects, and the impact of interfacial 
water (Collins and Washabaugh 1985). 

According to their effect on the lipid phase transi¬ 
tions, the Hofmeister solutes fall into two categories: 
(1) chaotropic solutes favoring formation of the lamel¬ 
lar liquid-crystalline phase Lq, at the expense of the 
neighboring Hu and Lp phases and (2) kosmotropic 
solutes favoring formation of Hu and Lp phase at the 
expense of the Lq, phase. Their effects are correctly 
described by an equation of Clapeyron-Clausius type 
between phase transition temperature and solute con¬ 
centration. The sign and magnitude of the transition 
shifts induced by the different solutes depend on the 
solute ability to unevenly distribute between 
interlamellar and free water. Kosmotropic solutes 
tend to minimize the area of the lipid-water contact. 
They suppress the Lot phase, as it has the largest surface 
area in contact with water. At high enough concentra¬ 
tion of kosmotropic solutes, the latter phase may 
completely disappear from the phase diagram. This is 
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precisely what is seen with sucrose, trehalose, proline, 
and some salts and is consistent with the opposite 
effect caused by chaotropic solutes (Table 4) (Tenchov 
and Koynova 1996). Addition of chaotropic solutes 
can also induce the appearance of missing liquid- 
crystalline phases from the general lipid phase 
sequence (1). 

Effect of pH. Changes in pH modulate the lipid 
phase behavior as a consequence of protonation/ 
deprotonation of the lipid head groups, resulting in 
a change of the surface charge of the membrane 
(Trauble et al. 1976). They also modify the 
surface polarity and hydration. Typically, protonation 
decreases lipid hydration and increases the main 
transition temperature (Cevc and Marsh 1987). The 
effects of pH titration on the chain-melting transition 
temperature Tn, of dimyristoyl phospholipids is illus¬ 
trated in Fig. 5a, showing that single protonation 
increases the melting transition temperature by about 
5-15°C. 

The shifts of the lamellar-hexagonal transition 
upon titration are greater and in the opposite direction 
relative to changes of T^. Thus, for didodecyl PE, the 
lamellar-hexagonal transition decreases by 41°C upon 
phosphate protonation (pK^l.9) and by 50°C upon 
amine protonation (pK^9.3), while for T^, these shifts 
are 6°C and 15°C, respectively, in the opposite direc¬ 
tion (Fig. 5b) (Seddon et al. 1983). 

Biological Relevance of Lipid Phase Behavior 

Biological roles of lipids are varied - in energy storage 
and fat digestion, as enzyme cofactors, electron car¬ 
riers, light-absorbing pigments, intracellular messen¬ 
gers, hormones, as constituents of the pulmonary 
surfactant and the skin stratum comeum (see 
► Skin Lipids), etc. The issue of their phase behavior 
in water is particularly relevant to their major struc¬ 
tural function - as building blocks of the biological 
membranes (see ► Functional Roles of Lipids in 
Membranes). 

The major discovery in the field of biological mem¬ 
branes is undoubtedly the finding that the 
biomembrane is a liquid-crystalline lipid bilayer with 
embedded proteins (Singer and Nicolson 1972; 
Shimshick and McConnel 1973; Trauble and 
Sackmann 1972). This so-called fluid-mosaic model 
has been a central paradigm in membrane biophysics 




Phase Transitions and Phase Behavior of Lipids, 
Fig. 5 pH. TIP. Dependence of the phase transitions tempera¬ 
ture on pH: (a) chain-melting transition temperature of 
dimyristoyl glycerophospholipid bilayers (superscripts give the 
lipid charge; abbreviations as in Table 1; MPA, methylpho- 
sphatidic acid) (Cevc and Marsh 1987); (b) gel-fluid (circles) 
and lamellar-hexagonal (squares) phase transition temperatures 
of didodecylphosphatidylethanolamine; the dashed line indi¬ 
cates the appearance of additional lines in the region of the 
lamellar-hexagonal transition (Reproduced with permission 
from (Seddon et al. 1983); copyright (1983) American Chemical 
Society) 

for more than three decades, and it has been very 
successful in rationalizing a large body of experimen¬ 
tal observations. The model includes two basic refer¬ 
ences to the lipid phase state - liquid crystalline and 
hilayer - both of which are of vital importance for the 
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proper functioning of membranes. The liquid- 
crystalline bilayers can arrange in stacks with 
interbilayer aqueous spaces, thus building up the 
lamellar liquid-crystalline phase Lo(. However, in addi¬ 
tion to the La phase, the membrane lipids are also able 
to form a large variety of other phases (subgel, gel, 
cubic, hexagonal, micellar). All these phases are inter¬ 
related and transform into each other via different 
kinds of phase transitions. On the temperature scale, 
the existence range of the phase is typically limited 
from above by lamellar-nonlamellar transitions into 
cubic, hexagonal, and micellar phases, and limited 
from below by fluid-solid transitions into gel and 
subgel phases. A remarkable from biological view¬ 
point property of the lipid dispersions is that the tran¬ 
sition temperatures limiting the stability ranges of the 
different phases can be altered by tens of degrees by 
varying the composition of the lipid-water system. 
The possibility to modulate the lipid phase behavior 
in very broad limits by varying the lipid composition 
appears to represent the basis of important regulatory 
mechanisms involved in the biomembrane responses 
to external stimuli such as changes in the environmen¬ 
tal conditions, as well as for the regulation of various 
membrane-associated processes. 

Summary 

Lipids constitute a varied group of biological mole¬ 
cules of diverse biological roles. Their major function 
is as building blocks of the biological membranes. 
According to the fluid-mosaic model, biomembranes 
are liquid-crystalline lipid bilayers with embedded 
proteins. This model includes two references to the 
lipid phase state - liquid crystalline and bilayer - 
both of which are of vital importance for the proper 
membrane operation. Except for the lamellar liquid- 
crystalline “membrane” phase, the amphiphilic mem¬ 
brane lipids are able to form also a large variety of 
other phases, which transform into each other via dif¬ 
ferent kinds of phase transitions. The impressive vari¬ 
ety of lipids in the biomembranes includes a large 
fraction of species that, in isolation, prefer to adopt 
curved, hexagonal, cubic, or micellar phases, rather 
than the lamellar phase. The physiological importance 
of lipid diversity and mesomorphism stems from the 
possibility to finely tune and optimize the collective 
properties of the biomembranes. 


Cross-References 

► Atomic Force Microscopy of Lipid Membranes 

► Critical Fluctuations in Lipid Mixtures 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 

► Functional Roles of Lipids in Membranes 

► Hierarchically Structured Lipid Systems 

► Infrared Spectroscopy of Membrane Lipids 

► Lipid Domains 

► Lipid Organization, Aggregation, and Self- 
assembly 

► Micropipette Manipulation of Lipid Bilayer 
Membranes 

► Skin Lipids 

► TheiTnodynamics and Thermodynamic Nonideality 

► Thermodynamics of Lipid Interactions 
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Definition 

Enzymes that catalyze the hydrolysis of phosphate 
esters from a variety of phosphate-containing substrates. 

Basic Characteristics 

Phosphatases are a large group of diverse enzymes. 
One important role is their involvement in the 
most common form of reversible posttranslational 
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modification of proteins, phosphorylation. Whereas 
protein kinases catalyze transfer of a phosphate group 
from ATP to specific amino acids, phosphatases 
can undo this phosphorylation. The presence or 
absence of a phosphate group can he used to activate 
or deactivate enzymes or enable or disable formation 
of protein-protein interactions. Therefore, phospha¬ 
tases are crucial for many signal transduction 
pathways. 

Some examples of computer simulation of reactions 
in a variety of phosphatases are outlined below. Parks 
et al. (2009) used a QM/MM minimum free energy 
path method to investigate the mechanism of the 
Cdc25B phosphatase, a dual-specificity phosphatase 
that catalyzes the dephosphorylation of tyrosine 
in the Cdk2/CycA protein complex and is 
a potential anticancer target. Their calculations, at the 
B3LYP/6-31-HG(d)/CHARMM22 QM/MM level, on 
the first step of the reaction indicated that, in the case 
of the small model substrate p-nitrophenyl phosphate, 
the monoanionic form of the substrate is preferred (in 
contrast to reactions catalyzed by other protein tyro¬ 
sine phosphatases). Re et al. (2011) studied the revers¬ 
ible phosphorylation of phosphoserine phosphatase by 
calculating two-dimensional potential energy surfaces 
at the B3LYP/6-3lH-G(d)/CHARMM27 QM/MM 
level. They concluded that the transition state in this 
process could be described as “geometrically associa¬ 
tive yet electronically dissociative” in character. 

Calculations with ► QM/MM methods (employing 
an extended version of the semiempirical AMI method 
in the QM region, to account for the cf-orbitals of 
phosphorus) have also been used to obtain free energy 
profiles of alkaline hydrolysis of methyl p-nitrophenyl 
phosphate in nucleotide pyrophosphatase/phosphodi¬ 
esterase (which is closely related to the bacterial alka¬ 
line phosphatase that contains two zinc ions and 
a magnesium ion) and water. The results indicated 
that the mechanism was dissociative in the enzyme 
but associative in water. The bond breaking between 
the phosphorus and the leaving group was found to 
be the rate-limiting step in the enzymatic reaction 
(Lopez-Canut et al. 2010). More recently, hydrolysis 
of methyl p-nitrophenyl phosphate in both alkaline 
phosphatase and nucleotide pyrophosphatase/ 
phosphodiesterase was also studied using QM/MM 
simulations that used a version of the semiempirical 
QM method SCC-DFTB specifically parameterized 
for phosphate hydrolysis (Hou and Cui 2012). 


Their findings indicate that the nature of the transition 
state for the same substrate is similar in the AP family 
enzymes and in solution, in disagreement with 
a previous QM/MM study (Lopez-Canut et al. 2010). 
They attribute this disagreement to a structural distor¬ 
tion of the zinc ion binding site in the previous study 
and further stress the importance of a carefully 
benchmarked QM/MM model when investigating 
phosphoryl transfer reactions. 

Apart from dephosphorylation of protein residues, 
some phosphatases can also catalyze the hydrolysis of 
lipids. One example is the secondary activity found for 
soluble epoxide hydrolase, an enzyme which is also 
important for drug metabolism. The mechanism of 
both steps in the reaction, phosphoenzyme intermedi¬ 
ate formation and subsequent hydrolysis, have been 
modeled using Car-Parrinello QM/MM molecular 
dynamics simulations (De Vivo et al. 2007). This 
work suggested metaphosphate-like transition states 
for both phosphoryl transfers and the potential impor¬ 
tance of a water bridge between the active site Mg 
ion and the ligand in the shuttling of protons. 

Cross-References 

► Car-Parrinello Molecular Dynamics 

► QM/MM Methods 

► Semiempirical Quantum Mechanical Methods 
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Phospholipid Bilayer Nanodiscs: 
Application to Single-Molecule 
Measurements 
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Synonyms 

Single-molecule methods 

Definition 

Phospholipid hilayer Nanodiscs, also called 
nanolipoprotein particles (NLPs) or reconstituted 
high-density lipoprotein particles (rHDLs), are a class 
of model membranes that consist of a segment of 
phospholipid bilayer constrained in a roughly discoidal 
shape by a discrete number of membrane scaffold 
proteins. 

Membrane scaffold proteins (MSPs) are proteins 
derived from apolipoproteins (apoA-I, apoE4, or 
apolipophorin) that form an amphipathic helical belt 
around the discoidal membrane segment of 
a Nanodisc. 

Introduction 

Both integral and peripheral membrane proteins are 
essential to a number of basic biochemical processes, 
including metabolism, transport, and signal transduc¬ 
tion. These biological roles often make membrane 



Phospholipid Bilayer Nanodiscs: Application to Single- 
Molecule Measurements, Fig. 2 (a) Nanodiscs can be used 
to study protein conformations and dynamics, as well as substrate 
binding by diffusion-based approaches such as single-molecule 
FRET and FCS. (b) Nanodisc-incorporated proteins can be 
immobilized for surface-based experiments such as TIRFM 

proteins attractive pharmacological targets, and a large 
fraction of drugs on the market target G protein-coupled 
receptors (GPCRs) and other membrane proteins. 
Biophysical investigations of membrane protein 
structure, function, and dynamics are facilitated by the 
maintenance of target proteins in well-defined, physio¬ 
logically relevant model membranes. 

A variety of model membranes have been devel¬ 
oped, including liposomes, detergent micelles, and 
bicelles, and have been used in insightful and ground¬ 
breaking studies of membrane proteins (Seddon et al. 
2004). However, Nanodiscs present important advan¬ 
tages over each of these systems in specific applications. 
Nanodiscs are smaller and more monodisperse than 
liposomes and essentially act as self-contained soluble 
particles; unlike detergent micelles, Nanodiscs present 
a true phospholipid bilayer environment; compared to 
bicelles. Nanodiscs are stable over a much broader 
concentration range, including very low concentrations. 

Several classes of integral membrane proteins 
have been incorporated into Nanodiscs (Fig. 1). 
The resulting membrane protein/Nanodise complexes 
are generally stable, well-defined, and monodisperse, 
making them amenable to biophysical experiments 
that can be challenging in the presence of traditional 
model membranes. In particular. Nanodiscs have 
enabled the successful characterization of integral 
membrane proteins by heteronuclear NMR in solution, 
small-angle X-ray scattering (SAXS), surface plasmon 
resonance, stopped-flow kinetics, and single-molecule 
fluorescence. 

Preparation of Nanodiscs 

The development of Nanodiscs was inspired by the 
large body of work describing the structure and assem¬ 
bly of lipoproteins, and especially our understanding 
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of nascent high-density lipoprotein particles. In the 
biological context of reverse cholesterol transport, 
these discoidal particles self-assemble from apolipo- 
proteins (such as apoA-I) and serum phospholipids at 
relatively low lipid:protein ratios and are then 
converted to mature, spheroidal HDL particles by the 
addition of additional lipids and cholesteryl esters. 
Sligar and coworkers (Bayburt et al. 2002; Ritchie 
et al. 2009; Grinkova et al. 2010) generated a library 
of MSP constructs of different lengths by removing 
the N-terminal regulatory domain of apoA-I 
and inserting or deleting amphipathic helical segments 
of different lengths. Nanodiscs are assembled by 
mixing MSP and detergent-solubilized lipid in appro¬ 
priate molar ratios (governed by MSP length and 
optimal lipid surface area in a bilayer), followed by 
detergent removal using dialysis and/or hydrophobic 
adsorbents. 

The choice of MSP determines the size of the 
resulting Nanodisc and, hence, the size of the protein 
or protein complex that can be incorporated (Fig. la). 
For example, the commonly used construct MSPIDI 
(apoA-I[Al-54], with an N-terminal His-tag) forms 
Nanodiscs 9.7 nm in diameter, large enough to incor¬ 
porate a single 7-transmembrane (7TM) protein such 
as bacteriorhodopsin. MSP1E3, which contains three 
addition 22-m amphipathic helices, forms Nanodiscs 
large enough (12.9 nm in diameter) to incorporate 
a trimer of 7TM proteins. Different constructs derived 
from apoA-I can form Nanodiscs from <10 to ^17 nm 
in diameter (Grinkova et al. 2010), while scaffold pro¬ 
teins based on apoE4 and apolipophorin can form 
particles ^20 nm in diameter (Baker et al. 2009). 
Nanodisc dimensions have been characterized exten¬ 
sively by hydrodynamic techniques, including size- 
exclusion chromatography, native polyacrylamide gel 
electrophoresis, and fluorescence correlation spectros¬ 
copy (ECS). SAXS has also been used to determine 
Nanodisc diameter and height (rather than a single 
radius of gyration) by fitting scattering data to analyt¬ 
ical expressions for nonspherical particles. 

Applications to Single-Molecule Biophysics 

Nanodiscs are well suited to single-molecule experi¬ 
ments for three reasons: (1) their narrow, monodis- 
perse particle size distribution, (2) their stability at 
concentrations in the pM-nM range, and (3) the range 
of functionalized lipids that can be incorporated into 
Nanodiscs, allowing for diverse surface-attachment 




nm 


Phospholipid Bilayer Nanodiscs: Application to Single- 
Molecule Measurements, Fig. 1 Model of a Nanodisc with 
an intrinsic membrane protein incorporated, showing the 
approximate relative dimensions of a 7-transmembrane domain 
protein (cyan) in the context of the Nanodisc membrane (light 
grey surface). The two helical scaffold proteins that constrain 
and support the membrane are shown in darker shades of grey. 
Figure is based on the crystal structure of bovine rhodopsin 
(PDB ID 1F88) (Palczewski et al. 2000) 

strategies that do not depend on covalent modification 
of the protein of interest. 

Diffusion-Based Measurements 

In diffusion-based single-molecule fluorescence mea¬ 
surements, fluorescently labeled molecules are 
observed as they pass through an optically defined 
focal volume (Fig. 2a); signals such as fluorescence 
intensity or fluorescence resonance energy transfer 
(FRET) then provide information about biomolecular 
interactions, conformations, and conformational 
changes of interest. These approaches require that the 
solution of molecules of interest is very dilute (^50 
pM) so that only a single molecule is in the focal 
volume at any given time. Nanodiscs are an ideal 
platform for single-molecule diffusion-based 
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experiments on membrane proteins because they pre¬ 
sent a planar lipid bilayer environment that is stable at 
these very low concentrations. 

In FCS, the temporal autocorrelation of spontane¬ 
ous fluctuations in fluorescence intensity results in 
a decay curve that can be fit with a variety of models 
to report on diffusion, chemical or photophysical 
kinetics, and conformational dynamics of biological 
molecules. Quantitative analysis of the autocorrelation 
curve in terms of measuring free and bound concen¬ 
trations of a fluorescently labeled species based on 
a difference in diffusive properties is well established, 
and FCS is therefore widely used to study equilibrium 
binding interactions between biomolecules. Nanodiscs 
make an excellent model system for the study of 
membrane-binding affinities of proteins and peptides 
by FCS for two primary reasons: (1) the number of 
lipids comprising the bilayer can be calculated exactly, 
which allows for precise determination of the stoichi¬ 
ometry of binding; and (2) their monodispersity and 
consistent size leads to a well-defined diffusion-related 
decay time (Nath et al. 2010). 

Immobilization of Nanodiscs 

Nanodiscs are also versatile tools to capture and immo¬ 
bilize membrane proteins in a bilayer for surface-based 
assays. Nanodiscs can be attached to surfaces either 
using affinity tags on the MSP or by including a small 
mole fraction of functionalized lipids in the assembly 
(Fig. 2b). Incorporated proteins can thereby be 
immobilized without covalent modification and with 
minimal perturbation to the membrane environment. 
A wide variety of headgroup-functionalized lipids are 
commercially available, including biotinylated, thiol¬ 
bearing, and thiol-reactive versions, as well as biolog¬ 
ically important lipids such as gangliosides and 
phosphoinositides. Biotinylated lipids and an avidin- 
functionalized glass surface have been used to immo¬ 
bilize Nanodisc-incorporated membrane proteins for 
characterization of substrate binding by single¬ 
molecule total internal reflection microscopy 
(TIRFM) (Nath et al. 2008). Biotin-avidin chemistry 
is orthogonal to the functional groups commonly used 
for covalent modification of proteins (i.e., cysteines 
and primary amines) and therefore minimizes the 
probability that surface attachment interferes with 
the functional properties of the protein of interest. 
The ability of Nanodiscs to maintain a target protein 
monomerically and stably both decreases the potential 


for heterogeneity due to oligomerization and enables 
longer observations of single molecules than would 
otherwise be possible. 

Finally, Nanodiscs have been used to characterize 
incorporated membrane proteins by atomic force 
microscopy (Tark et al. 2010), and in low-resolution 
structure determination of membrane protein com¬ 
plexes by cryo-electron microscopy (Katayama et al. 
2010). In these applications as well. Nanodiscs provide 
a degree of control over the oligomerization stoichi¬ 
ometry that serves to distinguish complexes of interest 
from ill-dehned aggregates. 

Cross-References 

► Atomic Force Microscopy 

► Cryo-Electron Microscopy 

► Fluorescence Correlation Spectroscopy 

► Membrane Proteins; Structure and Organization 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Single-Molecule Methods 

► Surface-Tethered Lipid Vesicles 

► Total Internal Reflection Eluorescence Microscopy 
for Single-Molecule Studies 
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► Phase Transitions and Phase Behavior of Lipids 


Phosphotriesterase - Computational 
Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

Aryldialkylphosphatase; Organophosphorus hydro¬ 
lase; Paraoxon hydrolase 

Definition 

A zinc-dependent bacterial enzyme that catalyzes the 
hydrolysis of organophosphates. 

Basic Characteristics 

Several organophosphate compounds, such as the nerve 
agents sarin and tabun and the insecticide paraoxon, are 
highly neurotoxic substances due to their irreversible 
inhibition of acetylcholinesterase. Bacteria such as 
Pseudomonas diminuta can make phosphotriesterase, 


an enzyme that has the potential to degrade these dan¬ 
gerous organophosphates. A natural occurring substrate 
does not seem to exist, thus the enzyme may have 
evolved purely for utilizing manufactured organophos¬ 
phorus compounds (which are used as insecticides). 

A range of computational studies, using molecular 
dynamics simulations and quantum mechanical (QM) 
calculations, have been performed to help understand 
the structure and reactivity of phosphotriesterase. For 
example, QM studies up to the MP2/6-31G(d) ab initio 
QM electronic structure level (Zhan et al. 1999) indi¬ 
cated that the oxygen found to bridge the zinc ions in 
X-ray crystallography belongs to a hydroxide ion 
(instead of a water molecule, as had been initially 
assumed). Further examples are a molecular dynamics 
study (Koca et al. 2001) that found that the phosphoryl 
oxygen of the substrate becomes strongly coordinated 
with one of the zinc ions if the substrate is paraoxon or 
sarin; and a quantum mechanical study of a small 
“cluster” model of the active site (Chen et al. 2007) 
that concluded that an associative mechanism for 
paraoxon hydrolysis was most likely, with the hydrox¬ 
ide that bridges the zinc ions sufficiently nucleophilic 
to be used in the hydrolysis reaction (as found in 
a number of other di-zinc enzymes, such as Class B 
► jS-lactamases). 

Wong and Gao first investigated the reaction mech¬ 
anism of the hydrolysis of paraoxon in 
phosphotriesterase using ► QM/MM methods (Wong 
and Gao 2007). They used semiempirical AMI/ 
CHARMM22 QM/MM molecular dynamics simula¬ 
tions and energy corrections at the B3LYP/6-31-i-G(d) 
density functional theory level to obtain a free energy 
prohle for the reaction and propose a detailed reaction 
mechanism. The simulations indicated that during the 
reaction, the distance between two zinc ions fluctuates 
significantly. They further proposed that the resting 
state of the enzyme with a hydroxide ion bound 
between the zinc ions was restored by a water 
molecule entering the active site and proton abstraction 
by an active site histidine (or proton release to solvent at 
high pFl). QM/MM methods were also used by Zhang 
et al. (2009), who used PM3/CHARMM22 molecular 
dynamics simulations to investigate substrate binding, 
the mechanism of reaction, and product departure in the 
phosphotriesterase-catalyzed hydrolysis of paraoxon. 
The phosphorus parameters, known to have limitations 
in the standard PM3 semiempirical QM method, were 
specifically optimized for the reaction. Selected 
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structural configurations were optimized at the higher 
B3LYP/6-31G(d)/CHARMM22 QM/MM level for 
comparison to confirm the semiempirical results. Their 
study suggested that the rate-limiting step in the overall 
process is the distortion of paraoxon bound to the 
enzyme needed to approach the zinc ion bridging 
hydroxide. The subsequent attack of the hydroxide ion 
on the phosphorous center to replace the diethyl phos¬ 
phate was found to have a low barrier. This indicates that 
phosphotriesterase activity on paraoxon and related 
organophosphorus compounds might be improved by 
mutations that stabilize the possibly distorted enzyme- 
bound conformation of the substrates. 

Cross-References 

► Beta-Lactamases: Computational Studies 

► Molecular Dynamics Simulations of Lipids 

► QM/MM Methods 

► Quantum Chemical Cluster Approach to Modeling 
Enzyme-Catalyzed Reactions 

► Semiempirical Quantum Mechanical Methods 

► Zinc-Dependent Metalloenzymes - Computational 
Studies 
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Photoactivated Localization 
Microscopy (PALM) 

Julie S. Biteen 

Department of Chemistry 2533 Chemistry, University 
of Michigan, Ann Arbor, MI, USA 

Synonyms 

Fluorescence photoactivation localization microscopy 
(FPALM) 

Definition 

Photoactivated localization microscopy (PALM) is 
a method of sequentially localizing sparse subsets of 
photoactivatable fluorescent proteins in order to obtain 
a sub-diffraction-limited fluorescent image (Betzig 
et al. 2006). 

Basic Characteristics 

The ability to detect the emission from a single fluo¬ 
rescent molecule (Moemer and Kador 1989) enables 
the measurement of the optical properties of molecules 
one at a time, without ensemble averaging. An impor¬ 
tant consequence of single-molecule fluorescence 
(SMF) detection was early on recognized to be the 
ability to measure the coordinates of spatially isolated 
fluorescent emitters with a certainty far better than 
the standard diffraction limit of light. Such sub- 
diffraction-limited localization opened the door to 
SMF imaging and tracking, a means to attain nanome¬ 
ter-scale information with a standard optical 
microscope. 

The position of a single emitter is found by consid¬ 
ering its detected emission pattern (Fig. la). In many 
cases, the center of mass of the emission coincides with 
the position of the emitter itself. To improve accuracy, 
precise knowledge of the point-spread function (PSF) 
of the microscope is used: the emission pattern is fit to 
the instrument PSF and the peak location indicates the 
emitter position. PSF fitting has the further benefit of 
providing fit statistics that can lead to an estimate of 
goodness of fit; the localization accuracy here can be as 
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Photoactivated Localization Microscopy (PALM), 
Fig. 1 (a) Single-molecule localization. A single, isolated 
fluorophore (red dot) is observed as a diffraction-limited point- 
spread function (PSF) (green circle), and the emitter position is 
extracted with high accuracy by finding the center of the PSF. 
(b) The high-density challenge. Two fluorophores spaced closer 
than the standard diffraction limit of light (red dots) give rise to 
overlapping PSFs (green circles)', the emitter positions cannot be 


determined from the emission pattern, (c) PALM Imaging. 
Using photoactivation or photoswitching of fluorescent proteins, 
only a sparse subset of emitters is in the “on” state at any one 
time. The emission from these isolated single molecules (red 
dots) again gives rise to nonoverlapping PSFs (green circles). 
A different sparse subset is localized in each imaging frame, 
and the localizations are summed to give rise to the final 
supeiTesolution image 


good as 1-2 nm for a bright emitter (► Fluorescence 
Imaging with One Nanometer Accuracy) (Thompson 
et al. 2002). This process of localizing single fluores¬ 
cent molecules is applied to determine the position and 
to characterize the motion of single molecules of inter¬ 
est (► single-particle tracking). 

Unfortunately, single-molecule localization is good 
only in the regime of sparse fluorescent labels; emitters 
that are closer together than the diffraction limit of 
light will have overlapping PSFs and localization 
becomes impossible. This sparse-emitter requirement 
is illustrated in Fig. lb: multiple fluorophores that are 
closely spaced relative to the standard diffraction limit 
of light give rise to overlapping emission patterns. The 
“trick” of fitting the PSFs to localize the emitters can 
no longer be achieved. 

In 2006, several techniques were introduced to 
overcome the sparse emitter requirement in the context 
of biological imaging. These techniques were coined 
photoactivated localization microscopy (PALM) 
(Betzig et al. 2006), ► Stochastic Optical Reconstruc¬ 
tion Microscopy (STORM) (Rust et al. 2006), and 
fluorescence photoactivation localization microscopy 
(FPALM) (Ftess et al. 2006). In general, dense collec¬ 
tions of fluorescent emitters that cannot be separated in 
space must be separated instead by some second vari¬ 
able. In PALM, STORM, and FPALM, this second 
variable is time, and a dense collection of 


photoswitchable or photoactivatable emitters are 
imaged. At any given time (in any given imaging 
frame), the majority of the molecules are in the “off’ 
(non-emissive/non-detectable) state, and a second 
excitation source is used to turn a sparse subset of the 
molecules “on,” as described below. 

In PALM, proteins of interest are imaged via 
fusions to photoactivatable fluorescent proteins or 
photoswitchable fluorescent proteins (FPs) (see 
► Protein Fluorescent Dye Labeling). Originally, 
these protein labels included photoactivatable green 
fluorescent protein (PA-GFP), Dronpa, Kaede, Kikume 
Green-Red (KikGR), and Eos Fluorescent Protein 
(EosFP); this list has since been expanded to include 
enhanced yellow fluorescent protein (EYEP), mDendra, 
and others. The EPs are excited via laser excitation with 
a visible laser (green, yellow or red depending on the 
EP), and all have two states: an “off’ state and an “on” 
state. In the case of the photoactivatable fluorophores 
(e.g., PA-GEP), the “off’ state is a dark (non-emissive) 
state and the “on” state is a bright (emissive) state, for 
the photoswitchable fluorophores (e.g., mDendra), the 
“off’ state is emissive, but absorbs at a wavelength 
shorter than the excitation laser wavelength, while the 
“on” state is emissive under the excitation conditions. 
A second laser (generally UV or near-UV) is used to 
switch subsets of FP-labeled proteins from the “off’ 
to the “on” state. 


























p 


1862 


Photoadduct 


Figure Ic illustrates the PALM methodology. First, 
a sparse subset of activated FPs is imaged in a 
widefield fluorescence microscope with a high numer¬ 
ical aperture (NA) objective until the fluorophores 
photobleach; the positions of these emitters, as deter¬ 
mined by center-of-mass determination or PSF fitting, 
are recorded. The cycle of activation and imaging is 
then repeated such that a unique subset of fluorophores 
is localized in each imaging cycle. The positions of 
the emitters are summed to produce a final, high- 
resolution image. The resolution of this image, defined 
as the smallest separation distance between two mole¬ 
cules that are still resolvable, can be as good as a few 
nanometers. 

PALM has been applied to many different biologi¬ 
cal systems, and has proven its versatility by being 
combined with ► live-cell single-molecule imaging, 

► single-particle tracking, multicolor imaging, and 
three-dimensional imaging. Current areas of active 
research include increasing capabilities for live-cell 
imaging, accelerating the speed of PALM imaging 
(though the initial PALM imaging experiments took 
hours, more recent ones have been accomplished on 
the scale of a few minutes), and increasing the power 
and speed of the post-processing (peak-fitting and 
image-rendering) steps. 

Cross-References 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Labeling of Nucleic Acids 

► Live-Cell Single-Molecule Imaging 

► Protein Fluorescent Dye Labeling 

► Single-Molecule High-Resolution Imaging with 
Photobleaching (SHRImP) 

► Single-Particle Tracking 

► Stochastic Optical Reconstruction Microscopy 
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Photochromic Forster Energy Transfer 
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Synonyms 

OLID-FRET; pcFRET; Photochromic ERET 

Definition 

Photochromic Forster Resonance Energy Transfer 
(pcERET) is a spectroscopic technique for measuring 
molecular interactions between two molecules, one 
labeled with a fluorophore and one with a photochro¬ 
mic chromophore (or fluorophore). 

Introduction 

Eorster Resonance Energy Transfer (ERET) is 
a spectroscopic technique, easily implementable in 
fluorescence microscopy, which is ideally suited to 
study biomolecular interactions in solution, live cells. 
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and tissues (Gadella 2008). FRET is based on the 
nonradiative transfer of energy from an excited molec¬ 
ular fluorophore (donor, D) to a chromophore (acceptor, 
A) via long-range dipole-dipole coupling (Parson 2007). 
In most applications also the acceptor is fluorescent, 
although it need not be so for FRET to occur. According 
to Forster’s theory (Valeur 2001; Parson 2007), the 
efficiency E of FRET is inversely proportional to the 
sixth power of the donor acceptor separation (r): 


E = 


Ro 




( 1 ) 


where Rq is a parameter, named Forster’s radius, which 
accounts for all the other contributions to the energy 
transfer process and represents also the D-A distance at 
which E = 0.5. Among these contributions stands the 
so-called overlap integral, i.e., the spectral integral 
of the product of donor’s emission and acceptor’s 
absorption weighted by the fourth power of wave¬ 
length (Parson 2007): 
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In common practice good D/A couples character¬ 
ized by large overlap integral in the visible range of 
spectrum may lead to Rq up to 5-7 nm, thus providing 
a tool for resolving molecular interactions (i.e., prox¬ 
imities) up to 10-12 nm, i.e., by far exceeding the 
optical diffraction limit (Lakowicz 2006). The detec¬ 
tion of FRET (i.e., of E > 0) in a solution or in some 
pixels of a microscopy image is a clear indication of 
binding between donor and acceptor-labeled mole¬ 
cules (DA complex) (Gadella 2008). Quantitative 
determination of actual E value, combined with knowl¬ 
edge of Rq for the D/A couple, would provide relevant 
structural information on DA, i.e., the distance 
between the binding partners (Eq. 1). Note, however, 
that the experimentally calculated FRET efficiency 
from intensity measurements is always a product of 
E and the molar fraction of global A molecules 
engaged in the DA complex (/X) or, symmetrically, 
the fraction of global D molecules engaged in the DA 
complex (/b) (Wlodarczyk et al. 2008). The latter 
condition discloses an interesting possibility: The 
knowledge of actual E for the pure DA complex 
could be used to obtain the molar fractions of bound 


partners and, from the knowledge of overall D or 
A concentrations, the dissociation constant of the com¬ 
plex that may represent a remarkable information from 
a biochemical viewpoint (Cardarelli et al. 2009). 

Over the years, several methods have been pro¬ 
posed to determine quantitatively FRET efficiency 
(lares-Erijman and lovin 2003). Sensitized emission 
is a generally applicable technique, but it is particularly 
cumbersome for microscopy studies of living speci¬ 
mens since it requires pre-calibration by donor and 
acceptor alone. Alternatively, if the D fluorescence 
can be individually observed, E is easily recovered by 
determining the extent of emission quenching due to A. 
This can be done quite straightforwardly from the deter¬ 
mination of donor fluorescence lifetime (Fluorescence 
Lifetime Imaging, FLIM, in microscopy) in the absence 
(to) or presence (tqa) of the binding partner labeled 
with A (Eq. 3) (Suhling et al. 2005; Digman et al. 2008; 
Gadella 2008); yet, detection of fluorescence lifetime 
requires a specialized instrument setup. 



A simpler strategy is based on the photobleaching 
of the acceptor (pbFRET): The donor signal (in each 
pixel if fluorescence microscopy is into play) is mea¬ 
sured prior (Fda) and after (Eq) photodestruction of the 
acceptor; E is then computed from the signal ratio 
according to Eq. 3 (Karpova and McNally 2006). 
pbFRET is particularly tailored for fast and accurate 
quantitative detection of FRET but the use of irrevers¬ 
ible photobleaching of the acceptor precludes 
reexamination of the sample, or of a given region of 
it, at later time. This in turn excludes monitoring bind¬ 
ing processes that vary with time, as most often occurs 
in biochemical processes. Photochromic FRET 
(pcFRET) overcomes this limitation by taking 
advantage of a reversible optical switching of the 
acceptor, i.e., by using a photochromic acceptor. 
A photochromic compound has the ability to switch 
reversibly between two different structural forms char¬ 
acterized by different optical properties (absorption 
and, in some cases, fluorescence) in response to illu¬ 
mination to appropriate wavelengths. In pcFRET, the 
absorption state of A is optically controlled in order to 
alternate high and low FRET with a properly selected 
D molecule (Fig. 1). Cycling FRET affords a fast and 
accurate way to recover/oE or/AE at different times. 
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photochromic autofluorescent protein acceptor 
(Bizzarri et al. 2010; Subach et al. 2010). In both 
cases, fluorescence of the sample is observed through 
a confocal microscope equipped with excitation 
sources (lamp, laser) that allow for both the on/off 
photoswitching of the photochromic acceptor and the 
excitation of the donor. Depending on the optical prop¬ 
erties of D and A, the source accounting for the on/off 
switching of A can coincide with that of D excitation. 
The sample is repeatedly scanned while recording the 
fluorescence of D and, in some cases, also that of A. 
One implementation of pcFRET involves monitoring 
the intensity changes in two wavelength intervals 
where D and A are both emitting although with differ¬ 
ent intensity contribution (Bizzarri et al. 2010). 
Eventually, changes in D intensity are correlated 
with those relevant to A optical state to recover 
FRET efficiency in each pixel (Eig. 2); this approach 
shares similarities with lock-in detection and it has 
been named Optical Lock-In Detection of ERET 
(OLID-FRET) by some authors (Yan et al. 2011). By 
pcERET, the interaction between epidermal growth 
factor receptor and growth factor receptor-binding 
protein 2 was demonstrated in live cells (Subach 
et al. 2010). 


Photochromic Forster Energy Transfer, Fig. 1 Scheme of 
pcFRET between a green fluorescent protein (GFP) donor 
and a photochromic yellow fluorescent protein (YFP) acceptor, 
following Bizzarri et al. (2010). When the acceptor is “on,” 
its absorption spectrum overlaps with the emission of 
donor GFP and FRET takes place. Photoconversion of 
the acceptor to the “off’ state is associated with a blueshift 
of its absorption spectrum and FRET with donor GFP is 
interrupted 


Implementation 

pcERET was proposed in 2002 by the Jovin’s group 
(Giordano et al. 2002), and later applied by different 
groups to solution experiments. Yet, discussion will be 
here restricted to implementation of pcFRET in fluo¬ 
rescence microscopy, as solution experiments can be 
considered a “single-pixel” case of imaging. In fluo¬ 
rescence microscopy studies, pcERET has been 
implemented according to two strategies: (a) use of 
a synthetic photochromic acceptor (Mao et al. 2008; 
Petchprayoon et al. 2011) and (b) use of a 


Data Analysis 

The following treatment has been developed by the 
author and takes into account all the features of 
pcERET in a formally general manner. The key player 
is a photochromic molecule A that exists in two states 
with different absorption characteristics (Ai and A 2 ), 
and a nonphotochromic fluorescent molecule D. The 
emission of D overlaps effectively with the absorption 
of Al, and negligibly with that of A 2 . Thus, ERET from 
D to Al {direct FRET) takes place when they are bound 
together in a complex DAi (in the most general case, 
D and A are labels of two binding compounds), 
whereas no FRET occurs for the complex DA 2 . 
While Al may display some fluorescence, it will be 
assumed that A 2 is nonfluorescent if excited at the 
excitation wavelength of D: These conditions allow 
for possible emission through ERET but avoid the 
rather unusual case of cross talk emission from 
the nonfretting acceptor state. Given the possible fluo¬ 
rescence of Al, for the sake’s of generality it will be 
also assumed that some ERET {reverse FRET) 
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Photochromic Forster Energy Transfer, Fig. 2 (a, b) 

Pseudocolor confocal images of the EGFP-L 20 -EYQI fusion 
protein expressed in CHO cells (Bizzarri et al. 2010). (a) Cell 
fluorescence collected in the F*** (485-500 nm) emission 
range at acceptor bleaching scan (BS) 0, 5, and 25; (b) cell 
fluorescence collected in the F'^^* (509-525 nm) emission 
range at acceptor bleaching scan (BS) 0, 5, and 25. Note that 
the color scales are different. EYQl undergoes three photo¬ 
chromic cycles that affect FRET between the two protein. 
Acceptor bleaching (conversion to off state) and donor 

between Aj, assumed as donor, and D, acting as accep¬ 
tor, may take place. 

Next, a mixture of interacting D and A (hereafter 
D and A denote the fluorophores as well as the fusion 
constructs between the fluorophores and some 
interacting molecules) is to be considered. The forma¬ 
tion of the DA complex is totally independent of A’s 
optical state, i.e., the binding constant K is the same for 
Al and A 2 (Fig. 3). Also, A can be considered optically 
switchable in the DA complexes with the same effi¬ 
ciency as the free acceptor. 

Following the treatment and the nomenclature of 
Wlodarczyk et al. (2008), the fluorescence signal 
obtained in each image pixel at kg upon excitation 
at kx is: 


excitation are carried out by illuminating at 476 nm 
(15 kW/cm^) for 30 s. Acceptor reactivation (conversion to 
“on” state) is obtained by excitation at 403 nm (1 kW/cm^) for 
20-30 s. (c) Plot of (blue) and (red) whole-cell 
emissions for one photochromic FRET cycle. Note that F*** 
is mainly due to donor and shows a time-dependent saturable 
increase that parallels the decrease of F*^' (where acceptor 
emission is prevalent), (d) F^** versus F*^' plot for the photo¬ 
chromic FRET cycle displayed in (c). The black line represent 
the linear fit, which yield E = 32 ± 4% 

F(4,le) = r?elx • {eD3>D([D] + [DAj] + [DA 2 ]) 

+ eA®A([Ai] -F [DAi]) 

— (sd'FdEda + eD‘FDEAD)[DAi] 

+ (sd'FaEda + eA‘FDEAD)[DAi]} 

where rj^. is the detection efficiency at Xf, 

Ix is the illumination intensity at 
Ed is the molar absorbance of D at kx, regardless of its 
complexation state; 

£a is the molar absorbance of Ag at kx, regardless of its 
complexation state; 

(bo is the fraction of photons emitted at kg by D, 
regardless of its complexation state; 
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K 

A2 + D — DA2 


hv 


hv 


O K 

A]^ + D — DA]^ 


Photochromic Forster Energy Transfer, Fig. 3 Reaction 
scheme for a photochromic A molecule capable to bind an 
optically complementary D molecule. The Ai"A 2 and 
DAi"DA 2 transformations are optically driven. The optical 
state of A does not affect the binding to D, DA, and DA 2 have 
the same dissociation constant K. Only Ai is fluorescent, and 
FRET takes place only in the DAi complex 


(Da is the fraction of photons emitted at by Ai, 
regardless of its complexation state; 

Eoa is the FRET efficiency assuming D as donor and 
Ai as acceptor {direct FRET)\ 

Bad is the FRET efficiency assuming A; as donor and 
D as acceptor (reverse FRET)', 

[D], [Ai], [DAi], and [DAi] are the concentrations of 
the molecular species in the spatial location related 
to the observation pixel. 

Note that the first two terms under brackets in the right 
side of Eq. 4 correspond to the overall emissions from D 
and A (alone or bound together) in absence of FRET; the 
third term describes the loss of donor emissions owing to 
FRET (direct and reverse)', the fourth term accounts for 
the actual FRET emissions (direct and reverse). 

Next, the following definitions are introduced: 
Ca = ([Ai] + [A 2 ]), Cda = ([DAi] + [DAJ), 
Cd = ([D] + Cda), Ca° = Ca + Cda- As stated 
previously, /a = Cda/Ca°- Let ^ be the molar fraction 
of A molecules in the Ai state (^ = [Ai]/Ca = [DAi]/ 
Cda), i-S-, the parameter that directly measures the 
“photochromic” balance between the two states Ai 
and A 2 . Finally, two different optical setups 1 and 2, 


characterized by the excitation and emission wave¬ 
lengths ()^xi, )^ei) and ()^x 2 , )^e 2 ), respectively, are intro¬ 
duced. Taking into account these definitions, Eq. 4 
becomes for setup 1 and 2: 


F(') =F(ixi,^i) 


FP) = F(1x2,^2) 


^ellxl • {e| 5 ‘’ 4 ‘'CD + ^ c° 



— £a tiAD 



(5a) 


'?.21.2{«d’®d’c„ + c-C“ 



— £a tiAD 



(5b) 


where the two superscripts (1) and (2) refer to the two 
spectral setups. 

The system 5a-b shows that F^* *^ and F*^* have the 
same linear dependence from the “photochromic” 
balance parameter This allows for concomitant 
de-parametrization of both equations to give: 

( 6 ) 


where: 


^,1x1 +/a ■ (4’Eda ^ 4‘^Ead 


’?e2lx2 ef<l)P4/A(<I>f-<I>; 


.( 2 ) 


Era -£ 


(2)l 


( 7 ) 


q = ^ellxl • 








+/a(<‘^ - 4*0 ■ (^d^Eda - 4‘>Ead)' 

• (4 '^Eda - £i'’EAD)_ 


( 8 ) 


Equations 6, 7, 8 show that F*^^' is always linearly 
related to F*^*, regardless of the fraction of emissive 
A molecules (Fig. 2): Such linear relationship is at 
basis of pcFRET data analysis. Indeed, linear fitting 


of F^^^ versus F^^' data collected over many switching 
cycles for a given pixel or pixel groups affords a slope 
and intercept that are directly related to photophysical 
features of the A/B couple including the FRET 






















Photochromic Forster Energy Transfer 


1867 


P 


efficiency (Fig. 2). Note that the complex form of 
Eqs. 7 and 8 can be strongly simplified in two cases 
relevant for real experiments: 

(a) Donor and acceptor are fully distinguishable 
from an optical point of view, i.e., D and Ai are 
singularly absorbing and emitting in optical setup 1 
and 2, respectively. This means: = 0, 

£ 0 ^ 2 ) = 0, = 0, = 0, Bad = 0 and 

Eq. 7 becomes: 


^ D . ^ 1) 

After incorporation of Eq. 12 in Eq. 11 and some 
rearrangements, Eq. 13 ensues: 

( 13 ) 


,»7e2lx2 





■/aEda 


( 9 ) 


Note that the quantity in parentheses corresponds 
to the ratio of fluorescent intensities of pure D and 
Ai (at the same concentration) collected in setup 1 
and 2, respectively. Thus, the product /aEqa is 
straightforwardly recovered from m after a simple 
calibration based on pure donor and acceptor 
forms. 

(b) Donor and acceptor are not distinguishable from 
an optical point of view, but setup 1 and 2 share the 
same excitation wavelength, i.e., = '^x 2 i 

Ixi = 1x2, = £d^\ and Ea^*’ = e/^^. 

In such a case, Eq. 7 becomes: 


e,( 1)(9 +/AE(a>9) - 1) 

■ £,(l)p) +/AE(a)(2) _ 1) 


( 10 ) 


where Fd*** and are the fluorescence emissions of 
D in the two spectral setups in absence of A; 
(Fd*!) = j _ j 2). Also, in Eq. 10 the 

relative extinction coefficients £,. = Ea^'V^d^**, the rel¬ 
ative brightness 4>,.9i = Oa^V^d*!^ (j = 1,2), and the 
apparent FRET efficiency E = (EoA-firEAo) are 
introduced. 

Equation 10 becomes even simpler if the second 
emission range is selected to have =1 


Next step is to consider two different FRET systems 
a and P entailing the same couple of fluorophores but 
associated with different apparent efficiencies E*“^ and 
E^*^* and molar fractions and /da*°'^- (“ and p 

could represent two protein complexes where the two 
fluorophores are separated by a different distance or 
assume a different mutual geometry with respect to 
each other.) If in both cases excitation is at the same 
wavelength and the second spectral range is chosen to 
have = 1, then the slope of the E^^' versus F*^^ plot 
for each system will be given by an expression equiv¬ 
alent to Eq. 13, and it is easy to show that: 




— m'wFRET 
miP) - yyilwFRET 


(14) 


Equation 14 has a remarkable form, as it suggests 
a fully experimental method to determine the 
E^“*'/da^'*^ product for a generic ERET system a 
involving a photochromic acceptor. Indeed, after 
selecting a wavelength range for which = 1 it is 
sufficient to adopt a reference ERET system P whose 
E^^'/da^^' is known and which entails the same couple 
of fluorophores found in a. The best candidate for the p 
FRET reference system is a covalently bound D-A 
couple for which it can be assumed/ da* = 1- 


Summary 


m = 


p(i) 

F(2) 


, /d»e(»!'* - ■) 


( 11 ) 


The direct proportionality between m and E leads to 
some relevant results. At first, it is worth noting that in 
the absence of ERET, i.e., for a noninteracting mixture 
of D and A or A alone, Eq. 11 becomes: 


Photochromic ERET (pcERET) combines the super¬ 
resolution capability of ERET technique with the 
photomodulation of the optical characteristics of the 
acceptor chromophore, in view of obtaining a fast and 
reliable assessment of molecular binding in solution 
spectroscopy and fluorescence microscopy. This entry 
summarizes the main photophysical features of 
pcERET and describes a model to analyze obtainable 
data. 
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Cross-References 

► Fluorescence and FRET in Membranes 

► Fluorescence: General Aspects 

► Live-Cell Single-Molecule Imaging 

► Optical Fluorescence Microscopy 
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Synonyms 

Light-induced electron transport 

Definition 

Photosynthetic electron transport describes the process 
of light-induced electron transport for generating 
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chemical energy and reducing equivalents in 
phototrophic organisms. 

Introduction 

Photosynthesis is the ultimate source of all of our food 
and most energy sources on Earth, and photosynthetic 
organisms use solar energy to drive the synthesis of 
biomass and biofuels. Photosynthetic electron trans¬ 
port is the first stage of photosynthesis that produces 
chemically stored energy and uses solar photons to 
drive electron transport against a thermodynamic gra¬ 
dient. The electron transport pathway and the coupled 
chemiosmotic gradient generates high-energy 
chemicals, (adenosine triphosphate, ATP) and reduc¬ 
ing equivalents (reduced ► nicotinamide adenine 
dicnucleotide phosphate, NADPH) (Blankenship 
2002). Both oxygen-evolving phototrophs, including 
cyanobacteria, algae, and plants, and anoxygenic 
(non oxygen-evolving) phototrophic bacteria are 
known. All anoxygenic organisms contain a single 
photosystem. While the details of the organization of 
photosynthetic complexes and reactions are different 
in these two broad groups of organisms, the basic 
mechanism of light-driven electron transfer across 
a membrane mediated by a pigment-containing protein 
complex called a reaction center (RC) is universal. 

Photosynthetic Electron Transport in 
Chloroplasts and Oxygenic Phototrophs 

The overall reaction for photosynthetic electron trans¬ 
port in chloroplasts of higher plants, photosynthetic 
algae, and oxygenic photosynthetic bacteria known as 
cyanobacteria is given in Eq. 1 . The precise stoichiom¬ 
etry (n) of ADP, Pi (inorganic phosphate), and ATP 
depends on the reaction conditions. 

2H20-f 2NADP+-t-nADP-|-nPiO 2 
-1- 2 NADPH + 2n+ +n ATP 

The production of NADPH and ATP through pho¬ 
tosynthetic electron transport is required for carbon 
fixation and numerous cellular functions. Further, the 
release of O 2 into the atmosphere caused the transition 
from the anaerobic world in the early Earth to the 
aerobic world of today (Blankenship 2010). 


Solar energy is either directly absorbed by chloro¬ 
phylls in the reaction center (RC), where the charge 
separation and electron transfer process take place, or 
first absorbed by chlorophyll or other accessory pigments 
such as carotenoids in the light-harvesting antenna com¬ 
plexes and then transferred to the RC. Electron transport 
is initiated in ► Photosystem II (PSII, or H20:plastoqui- 
none oxidoreductase) and > Photosystem I (PSI, or plas- 
tocyanmiferredoxin oxidoreductase). PSI and PSII, each 
containing several proteins and small molecules func¬ 
tioning as electron carriers for ► proton-coupled electron 
transfer, have been characterized in detail in oxygenic 
phototrophs (Blankenship 2002). 

In the noncyclic electron transport pathway in oxy¬ 
genic organisms, photosynthetic electron transport is 
initiated from oxidation of H 2 O in the oxygen- 
evolving complex (OEC) of PSII to NADP"^ in PSI 
(the so called Z-scheme). As illustrated in Fig. la, the 
noncyclic electron transport pathway can be divided 
into the electron donor side and the electron acceptor 
side of PSI and PSII. Three integral membrane protein 
complexes, the RC of PSII (including a special 
pair of chlorophyll a (Chi a) that absorbs photons at 
680 nm (P680)), the cytochrome h^f complex (or 
plastoquinokplastocyanin oxidoreductase), and the 
RC of PSI (with a special pair of Chi a and its C13^- 
epimer absorbs photons at 700 nm (P700)) operate in 
order for noncyclic photosynthetic electron transport to 
take place. Although most of the photosynthetic com¬ 
plexes are associated with the thylakoid membrane, 
several electron carriers are water-soluble proteins, 
including the cupredoxin plastocyanin (PC, a water- 
soluble copper-containing protein), ferredoxin (Ed, 
a small iron-sulfur protein), and ferredoxiniNADP"^ 
oxidoreductase (FNR), which is known to be the last 
electron transporter in photosynthetic electron trans¬ 
port, drives electron from the reduced ferredoxin to 
NADP"^ to generate NADPH. 

The cytochrome h^f complex transfers electrons 
from plastoquinol (PQH 2 , a reduced form of plastoqui- 
none (PQ) and a mobile electron carrier) in PSII to PC 
in PSI and functions as a proton pump for synthesizing 
ATP. The structure and function of the cytochrome h^f 
complex are similar to that of the cytochrome hcj 
complex (also known as complex III) in chemiosmotic 
electron transport in mitochondria and some 
anoxygenic phototrophs, although the cytochrome Cj 
and / subunits have very different structures and evo¬ 
lutionary histories (Baniulis et al. 2008). 
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Photosynthetic Electron 
Transport, Fig. 1 Schematic 
representation of 
photosynthetic electron 
transport in oxygenic 
phototrophs. Pane! a is an 
energetic diagram of the 
photosynthetic electron 
transport pathways. The cyclic 
electron flow around 
Photosystem I is shown in red 
line. Panel h shows the 
structures of four major 
transmembrane protein 
complexes, Photosystem II, 
cytochrome h^f, 

Photosystem I, and ATPase, 
and the diffusible electron 
carrier, plastohydroquinone 
{PQH 2 ), in the thylakoid 
membrane of higher plants. 
Three water-soluble proteins, 
plastocyanin, feiTedoxin, and 
ferredoxin: NADP’^ 
oxidoreductase are also shown 
(Nelson and Ben-Shem 2004) 
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Alternatively, cyclic photosynthetic electron trans¬ 
port pathway also operates during phototrophic growth 
(Fig. la). Different from the noncyclic electron trans¬ 
port, electrons carried by the reduced ferredoxin or/and 
NADPH during cyclic electron transport are trans¬ 
ferred to PQ or to the cytochrome hg/ complex, 
which forms a supercomplex with PSI and FNR during 
cyclic electron transport (Iwai et al. 2010). Thus, dur¬ 
ing cyclic electron flow, NADPH is not synthesized or/ 
and not used for assimilating CO 2 and carbon metab¬ 
olisms, and O 2 is not produced during cyclic electron 
transport because the PSII is bypassed. Nevertheless, 
cyclic electron transport is essential to fulfill the 
demands of ATP, which is generated through electrons 
transferred from PQH 2 to the cytochrome hg/complex. 
While the detailed mechanism(s) of cyclic electron 
transport is(are) not completely understood, it has 
been suggested that cyclic electron transport is essen¬ 
tial for synthesizing ATP in chloroplasts of bundle- 


sheath cells, which are found in certain C4 plants and 
have much less PSII than PSI, protecting the photo¬ 
synthetic apparatus from photodamage (Shikanai 
2007), and balancing ATP and NADPH in chloroplasts 
through regulating the noncyclic electron transport as 
cyclic and noncyclic electron transports share several 
electron carriers (Fig. la). 

Figure lb depicts the structural data of four trans- 
thylakoid membrane protein complexes (PS II (Guskov 
et al. 2009), cytochrome complex (Kurisu et al. 

2003) , PS I (Ben-Shem et al. 2003), and ATPase) and 
of mobile protein electron carriers (plastocyanin (Xue 
et al. 1998), ferredoxin, and FNR (Kurisu et al. 2001)) 
in the photosynthetic electron transport chain during 
oxygenic photosynthesis (Nelson and Ben-Shem 

2004) . The structures have been adjusted to the relative 
size of plant photosystem I, and the principal electron 
and proton pathways are illustrated. The cyclic electron 
transport pathway is indicated by a dashed line. 
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Photosynthetic Electron 
Transport, Fig. 2 Pathway 
of photosynthetic electron 
transport in anoxygenic 
phototrophs, including purple 
photosynthetic bacteria {left. 
Panel a), green sulfur bacteria 
(right. Panel a), filamentous 
anoxygenic phototrophs 
(Panel h), and heliobacteria 
(Panel c) 
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Photosynthetic Electron Transport 
Employed by Anoxygenic Phototrophic 
Prokaryotes 

The photosynthetic electron transport in photosyn¬ 
thetic prokaryotes is suggested to be the predecessor 
of photosynthetic electron transport in chloroplast of 
higher plants and plays an essential role in the evolu¬ 
tionary perspective of photosynthesis (Blankenship 
2010). It is evident that the oxygenic photosynthetic 
bacteria (cyanobacteria) carry out similar noncyclic 
and cyclic electron transport pathways as in 
chloroplasts, although in many cases cyanobacteria 
(and some algae) use cytochrome Cg instead of PC for 


transferring electrons from cytochrome h^f to the RC 
ofPSI (Fig. la). 

Electron transport pathways have been identified in 
various types of anoxygenic photosynthetic prokary¬ 
otes, including purple photosynthetic bacteria, green 
sulfur bacteria (GSBs), filamentous anoxygenic 
phototrophic bacteria (FAPs, or green non-sulfur/glid- 
ing bacteria), and heliobacteria (Blankenship 2002). 
Recently, another group of anoxygenic phototrophic 
prokaryotes have been discovered, the Chloroacido- 
bacteria (Bryant et al. 2007). Studies on purple bacteria 
and GSBs have been reported since eight decades ago 
(van Niel 1931). Depending on the electron sources, 
purple bacteria can be classified as purple sulfur 
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bacteria (PSBs) and purple non-sulfur bacteria. As indi¬ 
cated by the given name, purple non-sulfur bacteria 
prefer to acquire electrons from non-sulfide electron 
sources, such as organic carbon sources or H 2 . PSBs, 
along with GSBs, acquire electrons from sulfide 
oxidation, instead of water splitting. These organisms 
use several different enzyme systems, including sulfide- 
quinone oxidoreductase (SQR) to oxidize sulfide and 
trigger the electron transport (Frigaard and Dahl 2009). 

A special pair of bacteriochlorophyll a (BChl a) in 
the RC of purple bacteria and GSBs absorbs the photon 
at 870 nm (P870) in heterodimeric type II RC of purple 
bacteria or 840 nm (P840) in a homodimeric type I RC 
of GSBs. Purple bacteria have been shown to operate 
cyclic electron transport for producing ATP, and GSBs 
have been suggested to carry out noncyclic electron 
transport to reduce NADP"^ (Fig. 2a, right) and cyclic 
electron transport to generate ATP. The noncyclic 
pathway in GSBs has been generally accepted, 
whereas the cyclic electron transport has not yet been 
experimentally verified. Further, different from the 
cytochrome h^f complex employed by oxygenic 
phototrophs, the cytochrome hcj complex is utilized 
by purple bacteria and GSBs for photosynthetic elec¬ 
tron transport. Also, a water-soluble mobile protein 
► cytochrome C 2 (or cytochrome C 554 ), rather than 
plastocyanin, is used to transport electrons from the 
cytochrome he / complex to the type I RC (or the type II 
RC) in GSBs (or purple bacteria). Moreover, different 
quinones (menaquinones or ubiquinones) are used to 
transport electrons to the cytochrome hci complex in 
GSBs and purple bacteria (Fig. 2a). 

Like purple bacteria, FAPs also employ cyclic elec¬ 
tron transport in the type II RC (P870), whereas an 
alternative complex III (ACIII), containing seven dif¬ 
ferent protein subunits (Gao et al. 2010; Yanyushin 
et al. 2005), and auracyanln (a soluble blue-copper 
protein) (Bond et al. 2001; Lee et al. 2009) are utilized 
in place of the cytochrome ficy/fig/complex and cyto¬ 
chrome C 2 /plastocyanin, respectively, for photosyn¬ 
thetic electron transport (Fig. 2b). Finally, the 
electron transport pathway has not been completely 
established in the anaerobic anoxygenic heliobacteria 
that are obligatory photoheterotrophs and have the type 
I homodimeric RCs with a special pair of bacteriochlo¬ 
rophyll g absorbing at 798 nm (P798). P798 functions 
as the primary electron donor, and a unique isomer of 
Chi a, 8 -OH-Chl a, which absorbs the photon at 
670 nm (P670) and has only been found in 


heliobacteria, serves as the primary electron acceptor 
(Heinnickel and Golbeck 2007). A cyclic electron 
transport chain has been proposed with the electron 
contributed from the oxidation of NADH by the 
NADH-quinone oxidoreductase (i.e., the ► complex I 
in chemiosmotic electron transport) (Kramer et al. 
1997) and the cytochrome hci complex in purple 
bacteria and GSBs is replaced by a cytochrome he 
complex (a hybrid of cytochrome hei and fi^/complex) 
in heliobacteria (Ducluzeau et al. 2008) (Fig. 2c). 

Summary 

Photosynthetic electron transport converts free and 
abundant solar energy into reducing power and chem¬ 
ical energy for producing biomass and biofuels 
through transferring electrons sequentially from H 2 O 
through Photosystem II and then Photosystem I to 
NADP"^ in chloroplasts of higher plants and photosyn¬ 
thetic algae or in cyanobacteria. While oxygenic 
phototrophs operate both noncyclic and cyclic electron 
transport pathways, anoxygenic photosynthetic pro¬ 
karyotes carry out alternative electron transport 
pathways to generate ATP and/or NAD(P)H. 

Cross-References 

► Chlorophylls and Light-Harvesting Complexes 

► Cytochromes 

► Photosystem I 

► Photosystem II 

► Ubiquinol-Cytochrome c Oxidoreductase 
(Complex III) 
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Photosystem 2 

► Photosystem II 


Photosystem I 

Ingo Grotjohann and Petra Fromme 
Department of Chemistry and Biochemistry, 

Arizona State University, Tempe, AZ, USA 

Synonyms 

PS I; PSl 

Definition 

Photosystem I is a large membrane protein complex 
involved in the process of oxygenic photosynthesis 
catalyzing the light-induced transmembrane electron 
transfer from plastocyanin or cytochrome c6 to ferre¬ 
doxin or flavodoxin. 

Introduction 

Oxygenic photosynthesis, catalyzed by plants, algae, 
and cyanobacteria, is the major process on earth that 
converts light energy into chemical energy. It provides 
the major energy source for all higher life on earth and 
uses water as an electron source, leading to the evolu¬ 
tion of oxygen, which 2.5 billion years ago changed 
the atmosphere of our planet Earth. The reactions of 
photosynthesis are catalyzed by two large membrane 
protein complexes. Photosystem I (PSI) and > Photo¬ 
system II (PSII), which act in series and are coupled 
by the ► cytochrome bgf complex. PSI catalyzes the 
light-induced transmembrane charge separation from 
plastocyanin or ► cytochrome Cg, located at the inside 
of the photosynthetic membrane (lumen of the thyla- 
koids), to the soluble electron carrier protein ferre¬ 
doxin, located at the outside (stroma or cytosolic 
side) of the thylakoids. The electrochemical gradient 
formed across the photosynthetic membrane drives 
synthesis of ATP by the ATP synthase. The electrons 
are hnally used for reduction of hydrogen, stored in 
form of NADPH, and ATP which are used for CO 2 
fixation in the Calvin Cycle. 

PSI is a large protein complex. In cyanobacteria, it 
is a trimer with a molecular weight of 1 million Dalton. 
One monomer of PSI consists of 12 proteins and 127 
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cofactors. The structure of cyanobacterial PSI (Fig. 1) 
was determined at 2.5 A resolution (Jordan et al. 2001). 

The Cofactors of PSI 

One functional unit of Photosystem I (PSI) contains 96 
chlorophylls, the green pigments that give plants their 
green color. The chlorophylls play vital roles in light 
absorption, energy transfer, and catalysis of the primary 
charge separation and electron transfer reactions. The 
second most prominent cofactors are 22 carotenoids, 
orange molecules which serve as additional antenna 
pigments and protect PSI from photodamage under 
high light conditions. PSI also contains three [4Fe4S] 
► iron-sulfur clusters and two phylloquinones, which 
are involved in electron transfer, as well as four lipids 
and one Ca^"^ ion, which play major stractural roles. 

The Proteins of PSI 

The most important proteins in PSI are the two 
large protein subunits PsaA and PsaB. They form the 
functional and structural core of PSI and harbor all 
membrane intrinsic cofactors of the electron transport 
chain as well as 79 of the 90 antenna chlorophylls. The 
core of PSI consists of ten transmembrane helices of 
PsaA and PsaB that surround the electron transfer 
chain like a fence. This core is represented by the 
5 C-terminal transmembrane helices of PsaA/B. This 
part of PSI shows some structural but no sequence 
homologies to the core subunits of PSII and the purple 
bacterial reaction center, which suggests that the pho¬ 
toreaction centers may all have evolved from 
a common ancestor. This “heart” of PSI is flanked on 
both sides by six transmembrane helices that corre¬ 
spond to the N-terminal antenna domain of PsaA/B 
(Fig. Ic). This antenna domain of PSI shows homolo¬ 
gies to the major antenna proteins of PSII, fueling the 
suggestion that the large subunits of PSI, PsaA, and 
PsaB may have evolved by gene fusion of an ancient 
reaction center protein with an antenna protein. 
Seven small membrane intrinsic subunits surround the 
PsaA/B core of PSI. These small subunits, which con¬ 
tain only one to three transmembrane helices each, 
stabilize the antenna system and mediate trimerization 
of PSI. While the PsaA/B core of PSI is highly con¬ 
served between plants, algae, and cyanobacteria, the 


small membrane-extrinsic subunits show variations 
between different photosynthetic kingdoms. In 
cyanobacteria, trimerization is mediated by subunits 
PsaL, I and M, while one of the additional plant-specific 
subunits, PsaH, binds to PsaL and maintains PSI in a 
monomeric state in higher plants and green algae, 
where the PSI monomer is flanked on one site by four 
peripheral ► light-harvesting complexes I (LHCI), 
Lhcal-4. The structure of PSI from pea has been solved 
as a supercomplex (Fig. 2) with four LHCI proteins 
bound to a monomeric PSI (Amunts et al. 2010). 

The soluble electron carrier proteins, plastocyanin 
and cytochrome Cg, dock to PSI at its lumenal site. 
Cytochrome Cg is very likely the evolutionary older 
electron donor to PSI and can be found in most 
cyanobacteria and algae, some of which also contain 
plastocyanin, while in plants plastocyanin serves as the 
sole electron donor to PSI. The docking site for cyto¬ 
chrome Cg/plastocyanin is located at a relatively shal¬ 
low, 20 A deep lumenal indentation of PSI, which 
guides the electron donor proteins to the docking site 
in close proximity to P700, which in turn catalyzes the 
primary charge separation. This docking site only 
involves PsaA and PsaB and is mainly hydrophobic 
in cyanobacteria. However, an extension of a lumenal 
helix of the subunit PsaF provides an additional inter¬ 
action site in plants and algae that allows for a more 
stable complex formation by electrostatic interactions. 

While the lumenal side of PSI is relatively “flat,” 
PSI extends the membrane by 90 A at the stromal side, 
a structural feature which is often referred to as the 
stromal hump. Three membrane extrinsic subunits, 
PsaC, PsaD and PsaE, form the stromal hump which 
provides the docking site for the stromal electron car¬ 
rier ferredoxin or flavodoxin. The latter replaces ferre- 
doxin under iron deficiency. PsaC, which is located in 
the center of the hump, is the most important of the 
three extrinsic subunits, as it carries the terminal FeS 
clusters, named Fa and Fb, which transfer the electron 
from PSI to ferredoxin/flavodoxin. PsaE and PsaD 
stabilize the stromal hump and mediate docking of 
ferredoxin/flavodoxin. The stromal hump with the 
putative docking site is shown in Pig. 3. 

The Electron Transfer Chain 

The electron transport chain of PSI is well conserved 
between plants and cyanobacteria and consists of six 
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Photosystem I, Fig. 1 Photosystem I (PSI) from Thermosyne- 
chococcus elongatus. (a) View of a PS I trimer as seen from the 
cytosolic side. The protein subunits of one PSI monomer are 
marked. Nine of twelve subunits of a monomer are membrane 
intrinsic, with the largest subunits PsaA and PsaB among them. 
The three membrane-extrinsic subunits PsaC, PsaD, and PsaE 
form a stromal hump, and black circles mark their main bodies 
in the image. Note that PsaD forms a clamp across PsaC, which 

chlorophylls, two phylloquinone molecules, and three 
[4Fe4S] clusters. The cofactor arrangement is shown in 
Fig. 4. The organic cofactors are arranged as two 


has not been marked, (b) The core antenna network of a PSI 
trimer. Chlorophylls are shown in green, except those in the 
electron transfer chain, which are depicted in blue. Carotene 
molecules are orange, (c) The transmembrane helices of the 
nine membrane-intrinsic subunits of one PSI monomer as seen 
from the cytosol. The transmembrane helices of PsaA and PsaB 
have been numbered from N-terminus to C-terminus for 
orientation 

branches, which show a twofold axis running through 
the first FeS cluster Fx. They are named A- and 
B-branch, which refers to the preferential coordination 




p 


1876 


Photosystem I 


Photosystem I, Fig. 2 PSI 

from Pisum sativum as seen 
from the stroma. The core 
subunits (PsaA, PsaB, PsaC, 
PsaD, PsaE, PsaF, Psal, PsaJ, 
PsaK, and PsaL) closely 
resemble their cyanobacterial 
counterparts, shown in Fig. la. 
The plant-specific subunits, 
PsaG and PsaFI, play roles in 
binding LFICI and LFICII, 
respectively. The subunit PsaR 
has been putatively assigned 
and awaits further 
characterization 
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Lhca4 
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of the cofactors by the two large subunits, PsaA and 
PsaB. Electron transfer starts with the primary charge 
separation. Excitation energy transfer from the antenna 
chlorophylls leads to excitation of P700 to the excited 
state P700*, which catalyzes the primary charge sepa¬ 
ration. P700 is very likely represented by the first pair 
of chlorophylls, which consist of one chlorophyll a 
molecule in the B-branch and the Cl3 epimer of chlo¬ 
rophyll a in the A-branch. However, there is also 
spectroscopic evidence that P700* may be delocalized 
over more than just two chlorophylls and that charge 
separation may be also able to start from one of the 
accessory chlorophylls (Muller et al. 2010). The elec¬ 
tron is transferred either along the A- or B-branch over 
a chain of electron carriers that include the second and 
third symmetry related pairs of chlorophylls, the 
phylloquinones, and the FeS cluster Ex, until it finally 
reaches the terminal FeS clusters Fa and Fb, which are 
coordinated by PsaC. From Fb, the electron is finally 
transferred to ferredoxin. 

The nature and distribution of electrons along the 
two branches is an open and vividly discussed topic of 
current research. Results of site-directed mutagenesis 
in combination with optical and ► EPR spectroscopy 
show that both electron transfer branches are active 
and that the ratio of A- versus B-branch electron trans¬ 
fer varies between organisms (Dashdorj et al. 2005). 
There is also evidence that the electron transfer step 


from the phylloquinones to Fx are rate limiting for 
electron transfer in PSI, with the B-branch electron 
transfer step from the phylloquinone to Fx being ten 
times faster than the electron transfer along the 
A-branch (Li et al. 2006). The reason for this differ¬ 
ence in electron transfer rates is under investigation. 
One explanation is the presence of two different lipids 
in close proximity to the electron transport chain. In the 
faster B-branch, this lipid is a neutral galactolipid, 
while at the A-branch a negatively charged phospho¬ 
lipid is located in proximity to the phylloquione, which 
could increase the activation energy barrier for the 
electron transfer between the phylloquinone and Fx 
at the A branch. 

The first FeS cluster Fx is a rare example of a 4Fe4S 
cluster coordinated by two different proteins. Two 
cysteine residues of PsaA and PsaB coordinate the 
four Fe atoms in the [4Fe4S] cluster Fx. Fx is located 
at the twofold axis that relates PsaA and PsaB. In 
addition to its role in electron transfer, Fx also plays 
a very important role in the assembly of PSI. From Fx, 
the electron is transferred subsequently to the FeS 
clusters Fa and Fb, located in the extrinsic subunit 
PsaC. PsaC shows strong structural homology to solu¬ 
ble ferredoxins that contain two 4Fe4S clusters and it 
has therefore been proposed that it may share the same 
evolutionary roots with these soluble electron carriers, 
where a previously soluble protein became part of the 
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Photosystem I, Fig. 3 The cytosolic subunits PsaC, PsaD, and 
PsaE from T. elongatus. PsaC encloses two iron-sulfur clusters, 
Fa and Fg. PsaD forms a clamp across PsaC. The iron-sulfur 
cluster Fx, which is situated between the large membrane- 
intrinsic subunits PsaA and PsaB, is shown for orientation 


larger multi-protein complex of PSI. Further evidence 
for this hypothesis is provided by the comparison of 
PSI with primitive anaerobic homodimeric photoreac¬ 
tion centers found in heliobacteria and green sulfur 
bacteria. The photoreaction centers in these organisms 
consist of a homodimeric RC core, where an extrinsic 
protein containing two 4Fe4S clusters is only weakly 
attached (Romberger and Golbeck 2010). 




Photosystem I, Fig. 4 The electron transfer chain (ETC) of 
PSI. After charge separation at Pvooi art electron moves along one 
of the branches of chlorophylls and phylloquinones and further 
along the iron-sulfur clusters, until the iron-sulfur cluster Eg 
reduces the soluble electron acceptor, ferredoxin 


The Antenna System 

The core antenna system of PSI in cyanobacteria con¬ 
sists of 90 chlorophyll a molecules and 22 carotenoids 
per monomer. The antenna systems of the three mono¬ 
mers in the PSI trimer (see Fig. lb) are tightly 
interconnected so that excitation energy transfer also 
involves transfer of excitation energy from one mono¬ 
mer to the next. The chlorophylls form a clustered 
network, where nearly all chlorophylls are located at 
distances <15 A to the next neighbor, which ensures 
high efficiency of excitation energy transfer. The 
carotenoids are distributed all over the antenna system. 
They serve two functions. They serve as additional 
antenna pigments and are very important for photo¬ 
protection, as they quench dangerous triplet states of 
chlorophylls and dissipate the excess energy as heat, 
thereby preventing the formation of ► reactive oxygen 
species. 

The quantum efficiency of the excitation energy 
transfer from the antenna to P700 is >99.9% at room 
temperature. This core antenna system has been highly 


conserved over 1.5 billion years of evolution 
as more than 90% of the core antenna chlorophylls 
have the same orientation and location in plant and 
cyanobacteria (Jordan et al. 2001; Amunts et al. 2010). 

Most chlorophylls of the core antenna system are 
coordinated by the large subunits PsaA and PsaB and it 
has been proposed that these subunits have evolved by 
a gene fusion between an antenna protein with the 
protein that coordinated the electron transfer chain. 
The fusion of the two proteins allowed for a develop¬ 
ment of a very efficient antenna system in PSI, which 
contains a joint reaction center and antenna system. 
The whole core antenna system is, structurally and 
functionally, tightly coupled to the reaction center 
core. The chlorophylls of the core antenna can be 
divided nearly half and half into (1) antenna chloro¬ 
phylls that are coordinated by the N-terminal 6 trans¬ 
membrane helices of PsaA/PsaB and the peripheral 
subunits, and (2) chlorophylls that are located close 
to the RC core which are coordinated by the C-terminal 
5 transmembrane helices of PsaA and PsaB. These 
include the two so-called “connecting” antenna 
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chlorophylls, which are coordinated by helix 11 and 
are located inside the “fence” of the 10 C-terminal 
transmembrane helices that surround the reaction cen¬ 
ter core of PSI. 

The major function of the small subunits is to sta¬ 
bilize the core antenna system and provide a connec¬ 
tion to interact with peripheral antenna systems. The 
subunits that coordinate core antenna chlorophylls are 
Psal, PsaJ, PsaK, PsaL, and PsaX. It should also be 
noted that one of the phospholipids in PSI coordinates 
an antenna chlorophyll. 

In plants, PS I is a monomer and forms a complex 
with four tightly bound peripheral antenna proteins: 
the light harvesting complexes Lhcal, Lhca2, Lhca3, 
and Lhca4. The structure of the PSI-LHCI complex of 
pea has been determined at 3.4 A and reveals the 
architecture of the complex (Amunts et al. 2010) (see 
Fig. 2). The Lhca proteins flank the PSI core at the side 
of the complex where subunits PsaF and PsaJ are 
located. The PSI-LHCI complex contains a total of 
173 chlorophylls, out of which 109 chlorophylls are 
assigned to the core of PSI, 55 chlorophylls are coor¬ 
dinated by the LHCI proteins, while 9 chlorophylls are 
located at the interface between the LHCI antenna 
proteins and the core antenna system of PSI. 
Plants undergo state transitions, during which 
the light-harvesting complex II (LHCII) proteins 
can move from PSII to PSI. There is evidence from 
electron microscopic studies that the LHCII 
trimer may dock to PSI at the opposite side to the 
LHCI “belt” to the plant-specific subunit PsaH, 
which is located in close vicinity to PsaL (Jensen 
et al. 2007). 

Most cyanobacteria contain genes for two dif¬ 
ferent peripheral antenna systems. Under nutrient 
replete conditions, phycobilisomes, which are large 
membrane-extrinsic antenna complexes, serve as 
peripheral antenna systems for both PSI and PSII. 
They can move very fast between the photosystems 
in response to changing light conditions. In addition, 
many cyanobacteria also contain a membrane-intrinsic 
antenna system consisting of rings of the IsiA protein 
that surround the PSI trimer (Bibby et al. 2001; 
Boekema et al. 2001). These IsiA rings dramatically 
increase the size of the antenna system in PSI under 
conditions of iron deficiency, which is the dominant 
environmental limitation in most aquatic environ¬ 
ments. The system contains up to two rings of IsiA 
proteins surrounding the PSI core, with the inner ring 


consisting of 18 and the outer ring of 25 IsiA subunits 
(Chauhan et al. 201 1). These 43 extra antenna proteins 
lead to a size of the PSI-IsiA double-ring complex 
of 3.2 million Dalton, which includes 855 chlorophyll 
molecules. The chlorophyll network in this giant 
antenna system is functionally even more tightly 
coupled to the core of PSI than the chlorophyll network 
in the plant PSI-LHCI complex (Chauhan et al. 201 1). 

Summary 

PSI is a large membrane protein complex, present in 
plants, algae, and cyanobacteria. In cyanobacteria, 
it forms a trimer, where each monomer consists of 
12 proteins and 127 cofactors. PSI catalyzes the light- 
induced electron transfer across the photosynthetic 
membrane from plastocyanin or cytochrome c6 to 
ferredoxin or flavodoxin by a chain of electron carriers, 
consisting of six chlorophylls, two phylloquinones, 
and three 4Fe4S clusters. PSI contains a tightly 
coupled core antenna system of 90 chlorophylls and 
22 carotenoids, which is highly conserved between 
plants and cyanobacteria and transfers the excitation 
energy to the core of PSI. In higher plants, a monomer 
of PSI forms a complex with four light-harvesting 
proteins, while in cyanobacteria either membrane- 
extrinsic phycobilisomes or (under iron deficiency) 
rings of the IsiA protein further increase the absorption 
cross section of PSI. 

Cross-References 

► Chlorophylls and Light-Harvesting Complexes 

► Cytochrome h^f Complex 

► Cytochromes 

► Iron-Sulfur Clusters 

► Photosystem II 

► Reactive Oxygen Species 
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Synonyms 

Photosystem 2; PS 2; PS II; Water:plastoquinone 
oxidoreductase 

Definition 

Photosystem II (PSII) is a 750 kDa, multi-subunit 
(>20), transmembrane protein complex which func¬ 
tions as a water:plastoquinone oxidoreductase. 


Basic Characteristics 

A PSII monomer binds 59 cofactors but uses just 14 for 
primary light-driven electron transport (Fig. la) 
(Guskov et al. 2009). PSII is found in all oxygenic 
photosynthetic species. The range of oxygenic photo¬ 
synthetic species spans from the prokaryotic 
cyanobacteria to eukaryotic algae and higher plants. 
There are some differences between species in the 
protein subunits that make up PSII, but the redox- 
active cofactors involved in primary electron transport 
and their relationships are almost identical. It appears 
that the water oxidation mechanism evolved once, 
2.5-3.0 billion years ago, and has not changed substan¬ 
tially since then, indicating the difficulty of evolving 
such a tightly controlled and energetically demanding 
redox system. 

The net effect of PSII turnover is to use the energy 
from four visible photons (imax = 680 nm) to oxidize 
two water molecules into molecular oxygen, protons, 
and electrons and then to reduce two plastoquinones 
(PQ) to two plastoquinols. It is a light-driven water: 
plastoquinone oxidoreductase. The plastoquinones are 
reduced by the four electrons derived from the oxida¬ 
tion of the two water molecules (Blankenship 2002). 
The overall process is summarized in Eq. 1, where 
stroma and lumen denote the opposite sides of the 
photosynthetic thylakoid membrane: 

2PQ + 2 H 2 O -F + Ahv ^ 2 PQH 2 + O 2 

+ 4H+,u™en 

The cofactors and the path of this electron transfer 
are indicated in Fig. 1 . 

Electrons migrate from water in the lumen of 
a photosynthetic membrane to the quinone pool in 
that same membrane. The OEC (oxygen-evolving 
complex) is made up of the Mn 4 Ca cluster plus ligands 
from the D1 and CP43 proteins. The Mn 4 Ca cluster is 
the water oxidation catalyst. Yz is a redox-active tyro¬ 
sine (Dl:Tyrl61) that mediates electron transfer from 
the OEC to P680. P680 is the reaction center pigment 
that must be excited for charge separation. P680 refers 
to four chlorophylls: Chloi, ChlD 2 , Pon Pd 2 - The sub¬ 
scripts indicate to which protein subunit of PSII they 
are ligated. Pheooi is a pheophytin, Qa is a bound 
quinone electron acceptor, and Qb is the dissociable 
quinone that can be doubly reduced and protonated 
forming a quinol. These quinols transport reducing 
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Photosystem II, Fig. 1 

(a) The overall structure of 
a PSII monomer at 2.9 A 
resolution (PDB ID:3BZ1 
(Guskov et al. 2009) with 
some cofactor tails removed 
for clarity) - protein subunits 
are transparent grey, D1 and 
D2 are in darker grey. The 
approximate plane of the 
membrane is indicated with 
stromal and lumenal sides 
marked. The redox-active 
cofactors are highlighted in 
color, corresponding to the 
schematic in panel (b). 

(b) A schematic of the redox- 
active cofactors and the 
distances in A between them. 
The primary electron-transfer 
pathway is indicated with the 
heavy line; secondary 
pathways are represented 
with a thin line 
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equivalents to the ► cytochrome bgf complex and are 
eventually used hy the organism to produce carbohy¬ 
drates from fixed CO 2 . While the electron-transfer 
reactions are essentially sequential, the reactions take 
place on very different time scales. The initial charge 
separation occurs in picoseconds, stabilization of the 
separated charges by electron transfer to Qa is finished 
in hundreds of picoseconds (Rappaport and Diner 
2008; Renger and Holzwarth 2005), electron transfer 
from Qa to Qb is on the order of tens of milliseconds 
(Petrouleas and Crofts 2005), and electron abstraction 
from the OEC varies from tens to hundreds of micro¬ 
seconds to around 1 ms (McEvoy and Brudvig 2006). 
The following is a description of that process from 


light-driven charge separation to quinone reduction, 
and water oxidation as outlined in Eig. lb. 

A dimer of PSII monomers (a monomer is shown in 
Fig. la) is thought to be the functional unit in vivo 
(McEvoy and Brudvig 2006). The D1 and D2 subunits 
form the PSII reaction center and bind the electron- 
transfer cofactors. The D1/D2 pair in the reaction cen¬ 
ter possesses a pseudo C2-axis symmetry around the 
nonheme iron, but the asymmetry of the environment 
around the cofactors means electron transfer is almost 
always through the “A branch” of cofactors as indicated 
in Fig. lb. Electron transfer through the “B branch” 
P680 ^ Pheoo 2 ^ Qb is probably prevented by the 
energetics of formation of the primary radical pair 
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which are expected to be much less favorable in the 
B branch than the A branch (Rappaport and Diner 
2008). Initial radical pair formation in the A branch 
begins with the arrival of an exciton at P680 from a 
light-harvesting antenna complex adjacent to PSII (not 
shown). The excitation energy in P680* is transiently 
delocalized over the four chlorophylls (Chloi, Chlo 2 , 
Pdi, Pd2)> and P680* rapidly converts to the charge 
separated state. 

The ► redox potential (midpoint potential) of the 
excited species P680* is a highly reducing, —0.58 V, 
and it is poised to reduce Pheooi which has a midpoint 
potential of —0.499 V. The initial charge separation 
occurs in 0.6-3 ps after the arrival of the exciton at 
P680. The actual separation is between Pheoni and 
Chloi forming (Pheooi“/Chloi''^). The cation radical 
on Chloi quickly becomes distributed between Poi and 
Pd 2 , and this can be referred to as the P680 cation 
radical, i.e., (Pheooi~/P680‘'^). The cation radical is 
predominantly (~80%) localized on Pqi and the 
remaining 20% resides on P 02 (Rappaport and Diner 
2008). The P680‘'^ species has the highest reduction 
potential known in biology at -1-1.25 V. This potential 
is necessary to drive water oxidation in the OEC and is 
achieved through the use of Chi a in P680 and “tuning” 
from the protein environment (Rappaport and Diner 
2008). 

Following the initial charge separation event, the 
electron migrates further to Qa, at distance of 27 A 
from Pol, forming (Qa~/P 680‘'^) in 300 ps. The mid¬ 
point potential of Qa/Qa~ is —30 mV. This distance 
and energetic gradient rapidly stabilizes the initial 
charge separation (Rappaport and Diner 2008). 

Electrons from Qa” are subsequently transferred to 
Qb (Qb/Qb” midpoint potential -1-50 mV) with a half 
time of 150-200 ps (Petrouleas and Crofts 2005). Then 
protonation follows in 760 ps producing a semiquinone 
in the Qb site. A second charge separation at P680 leads 
to a second electron transfer to Qb”. This takes longer 
than the first at 620 ps, then a second protonation event 
(300 ps) results in a quinol that diffuses out of the Qb 
site, thus the term “two electron gate.” Another qui- 
none is now able to bind from the quinone pool. This 
process is potentially via a third quinone binding site 
Qc (Guskov et al. 2009). The kinetics of the Qa” to Qb 
transfer obviously can be influenced by the rate of 
quinone exchange at the Qb site (Petrouleas and Crofts 
2005), and this exchange is the overall rate limiting 
step for O 2 evolution by PSII. 


A nonheme iron (NHI) is bound by four histidines 
between the two quinone binding sites; its two open 
ligand binding sites are usually occupied by bicarbon¬ 
ate. Electron transfer between Qa and Qb is influenced 
by the NHI oxidation state. The NHI is normally Fe^% 
with a midpoint potential at pH 7 of 400 mV, which 
excludes it from participation in the quinone 
redox reaction directly. However, the NHI can be 
oxidized to Fe*^^ by reducing O 2 to O 2 ’”, for example. 
When this Fe**^ arises, it is reduced to Fe^^ in around 
7 ps by Qa” at pH 6.5. Anion ligation to the 
NHI instead of bicarbonate also influences electron 
transfer between Qa” and Qb. The binding of small 
carboxylic acids, NO or CN” to the NHI slows electron 
transfer. These rate changes may well be due to 
structural rearrangements, and this may represent 
some form of regulation in vivo (Petrouleas and 
Crofts 2005). 

One the same time scale that the electron is being 
transferred to Qa/Qb> POflO’"^ oxidizes the redox-active 
tyrosine Yz. Electron transfer at Yz is coupled to 
a deprotonation reaction along a hydrogen bond with 
Dl:Hisl90. This produces a neutral radical (P680/Yz’) 
on the order of 20-200 ns (Rappaport and Diner 2008). 
The (Yz/Yz") couple has a midpoint potential of 
around 1.2 V that allows it to subsequently oxidize 
the oxygen-evolving complex (OEC) (Rappaport and 
Diner 2008). The rate of OEC oxidation is dependent 
on the oxidation state of the OEC and can take ps to ms 
(McEvoy and Brudvig 2006). 

The OEC consists of a bioinorganic metal cluster 
Mn 4 Ca and its protein ligands. The OEC has flve 
separate oxidation states S„ where n = 0-4. The 
speed of Yz reduction depends on the S-state of the 
OEC ranging from 50 ps in Sq to 1.3 ms in S 3 (McEvoy 
and Brudvig 2006). On reaching S4, the OEC oxidizes 
two water molecules, producing O 2 (in addition to 4H'^ 
and 4e”) and returns to the Sq state. Thus, water is the 
terminal electron donor in oxygenic photosynthesis 
(McEvoy and Brudvig 2006). The midpoint potentials 
for the (S 2 /S 1 ), (S 3 /S 2 ), and (Si/So) states are estimated 
to be 1.12, 1.14, and 1.08 V, respectively. The (S 4 /S 0 ) 
couple is around 1.1 V. ►Proton-coupled electron 
transfer is very important in this process allowing for 
“redox leveling” which prevents the redox potential of 
the cluster raising so high that it cannot be advanced to 
the next state by Yz. The exact mechanism of water 
oxidation is unknown; however, it is theorized to involve 
either a nucleophilic attack of a Ca-bound water on 
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a terminal Mn-bound oxo species or an analogous reac¬ 
tion involving a Mn-bound oxyl radical. 

When the primary electron transport system is 
perturbed or delayed and electron donation to PhSO*"^ 
is impaired, it is necessary to protect the system from 
a long-lived species with such a high and potentially 
damaging redox potential. A persistent oxidant of 
-1-1.25 V is capable of directly oxidizing (and damag¬ 
ing) almost everything in its proximity, and ► reactive 
oxygen species such as singlet oxygen via triplet 
chlorophyll states are also damaging. Carotenes are 
capable of quenching singlet oxygen directly. In addi¬ 
tion, a secondary electron-transfer pathway from 
/l-caroteneD2 quickly reduces the otherwise persistent 
PhSO*"^. /?-caroteneD2 is then reduced in turn by electron 
transfer from Chlzoi and Cyt the hole on the cyto¬ 
chrome can be hlled by donation from the Qb site (Frank 
and Brudvig 2004). Another secondary transfer involves 
a tyrosine residue analogous to Yz on the D2 protein: Yq. 
The Yo residue can donate an electron to the OEC on 
a time scale of tens of seconds and is responsible for the 
dark stability of the OEC Si state (Ealler et al. 2005). 

Current research is focused on improving the struc¬ 
tural understanding of the system. Crystallographic 
work is advancing rapidly with 1.9 A the highest 
resolution achieved to date (Umena et al. 2011). 
However, the high-intensity X-rays necessary for crys¬ 
tallography reduce the Mn 4 Ca cluster altering its struc¬ 
ture. Such reduction poses a serious challenge as it 
limits the resolution that can be achieved at the cluster. 
Quantum mechanical/molecular mechanics simulation 
is a promising, complementary approach that produces 
spectroscopically verifiable, chemically rationalized 
structures (Sproviero et al. 2009). 

Cross-References 

► Chlorophylls and Light-Harvesting Complexes 

► Cytochrome h(f Complex 

► Photosynthetic Electron Transport 

► Photosystem I 

► Proton-Coupled Electron Transfer 

► Reactive Oxygen Species 

► Redox Potential 
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Phytochrome Structure 
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Definition 

Phytochromes are red-light photoreceptors that per¬ 
ceive light signals via reversible photoconversion 
between a red-light-absorbing (Pr) state and a far-red- 
absorbing (Pfr) state. They play important roles in 
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Phytochrome Structure, Fig. 1 The PAS, GAP, and PHY domains share a common PAS core topology 


mediating light responses in plants, bacteria, and fungi. 
Phytochromes utilize linear tetrapyrroles (also known 
as bilins) as chromophores to capture photons in the 
red region (wavelength range 600-800 nm) of the solar 
spectrum. 

Basic Characteristics 
Introduction 

Phytochromes are signaling proteins with a modular 
domain architecture, in which three N-terminal 
domains (denoted PAS, Per-ARNT-Sim; GAP, 
cGMP phosphodiesterase/adenyl cylase/FhlA; and 
PPIY, phytochrome) are responsible for light percep¬ 
tion and the C-terminal effector domains convert light- 
induced structural or chemical signals into biological 
signals (Rockwell et al. 2006). In plant phytochromes, 
the C-terminal domains include two tandem PAS 
domains followed by a histidine-kinase-related 


domain. In a typical bacteriophytochrome (BphP), 
the C-terminal output domain is a histidine-kinase 
(HK) domain that undergoes light-dependent auto¬ 
phosphorylation of a histidine residue. The phosphate 
group is then relayed to an aspartate residue of the 
cognate response regulator in a two-component signal¬ 
ing pathway. The photosensory core module (PCM) 
consisting of the PAS, GAP, and PHY domains may 
also be covalently coupled to other quite different 
effector modules such as GGDEP and EAL domains 
that are involved in synthesis or degradation of second 
messengers such as cyclic di-GMP. 

Each phytochrome incorporates its own preferred 
chromophore among biliverdin (BV), phycocyanobilin 
(PCB), or phycoerythrobilin (POB). The bilin chromo¬ 
phore is invariably embedded in the GAP domain and 
is covalently attached to a cysteine residue via 
a thioether bond. BphPs typically adopt BV with 
a cysteine anchor at the extended N-terminus of the 
PAS domain, while cyanobacterial and plant 
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Phytochrome Structure, 
Fig. 2 Ribbon diagrams of 
PaBphP-PCM as a parallel 
dimer (a) and Cphl-PCM as 
an antiparallel dimer (b) 



phytochromes prefer PCB or POB for which the 
anchor cysteine is located within the GAP domain. 

Three-Dimensional Structures 

Structures of the photosensory domains have been 
extensively investigated by X-ray crystallography in 
BphPs and cyanobacterial phytochromes. Plowever, no 
crystal structure is yet available for a full-length phy¬ 
tochrome of any kind. The first crystal structure deter¬ 
mined for any member of the phytochrome 
superfamily was the chromophore-binding domain 
from Deinococcus radiodurans BphP (DrBphP) 
consisting of only the PAS and GAP domains (Wagner 
et al. 2005). This tmncated DrBphP stmcture revealed 
an unusual 4 1 -knot (also called “figure-of-eight” knot) 
between the PAS and GAP domains, in which the 
N-terminus of the PAS domain passes through 
a large, twisted loop of the GAP domain. 

The first structural information for an intact 
photosensory core module consisting of the PAS, 
GAP, and PHY domains came from crystal structures 
of cyanobacterial Cphl from Synechocystis sp. 6803 
(Essen et al. 2008) and Pseudomonas aeruginosa BphP 
(PaBphP) (Yang et al. 2008). Both structures revealed 
that the PAS, GAP, and PHY domains share a common 
core topology, in which the central antiparallel 
P-strands are arranged in the order of 2-1-5-4-3 that 
defines the classical PAS fold (Pig. 1). In other words, 
the modular PCM structure can be described as 
“beans-on-a-stalk,” where three tandem PAS domains 


His260 



Phytochrome Structure, Fig. 3 Chromophore-binding 
pocket in DrBphP-CBD 


(the “beans”) are attached to a central helical spine 
(the “stalk”) at the dimer interface. Although the 
PAS, GAP, and PHY domains are linearly arranged 
in three-dimensional space, they are fully integrated to 
achieve photoconversion by converging near the chro¬ 
mophore-binding site via the hgure-of-eight knot and 
a long extending arm (or tongue) from the PHY 
domain (Pig. 2). 

In the absence of the C-terminal HK domain, BphPs 
are found to form either parallel or antiparallel dimers 
in the crystal lattices (Pig. 2). The 3D reconstruction of 
images from cryo-electron microscopy showed that the 
full-length DrBphP forms a head-to-head parallel 
dimer, in which two monomers wrap around each 
other in a left-handed manner (Li et al. 2010). 
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Phytochrome Structure, Fig. 4 Conformational differences in the DrBphP-CBD (Pr; light gray) and PaBphP-PCM (Pfr; green! 
cyan) structures 


Chromophore Conformations in the Pr and Pfr 
States 

Crystal structures determined in the dark-adapted Pr or 
Pfr states confirmed that the bilin chromophore adopts 
the ZZZssa configuration in the Pr state and ZZEssa in 
the Pfr state, respectively. This is consistent with 
a long-standing notion that the primary photo-event 
in phytochromes involves ZIE isomerization around 
the C15=C16 double bond in the chromophore. 

The bilin chromophore is similarly positioned in the 
GAP domain in all known BphP structures (Wagner 
et al. 2005; Yang et al. 2008; Essen et al. 2008). The 
largely planar chromophore is horizontally sandwiched 
between a highly conserved histidine and the signature 
PxSDlP motif. As illustrated in Fig. 3, the side chain of 
His270 in DrBphP indirectly interacts with the pyrrole 
nitrogens of rings A, B, and C via a pyrrole water; while 
the main chain carbonyl group of the highly conserved 
Asp207 coordinates with the same three pyrrole nitro¬ 
gens at the opposite side of the chromophore (Fig. 3). 

However, the chromophores in the Pr and Pfr states 
differ in their orientations relative to the protein 
matrix. These differences can be summarized as 
“flip-and-rotate” motions as the chromophore 
undergoes Pr/Pfr photoconversion (Yang et al. 2009). 
As ring D flips, its interactions with its surrounding 


residues are rearranged. As the chromophore rotates, 
the propionate groups of rings B and C switch their 
hydrogen bonding partners between a pair of distinct 
arginine and serine residues (Fig. 4a). In addition, 
a pair of conserved aromatic residues around ring 
D also undergoes concerted rotamer changes to accom¬ 
modate the flipping of ring D (Fig. 4b). 

Mechanisms of Photoconversion and Long-Range 
Signaling 

Reversible Pr/Pfr photoconversion exhibits distinct 
reaction intermediates with Lumi-R/Meta-R(s) on the 
Pr/Pfr pathway and Lumi-F/Meta-F on the Pfr^Pr 
pathway (Fig. 5). These transient intermediates have 
been studied by ultrafast time-resolved spectroscopic 
techniques (Dasgupta et al. 2009) or by cryo-trapping 
methods (Rohmer et al. 2010), and recently by temper¬ 
ature-scan cryo-crystallography (Yang et al. 2011). 
Although reaction intermediates in various phyto¬ 
chromes may differ in structural details, a general sig¬ 
naling principle is emerging. In phytochromes 
containing the PAS, GAF, and PHY domains, the pri¬ 
mary photoreaction originates in ring D. Upon absorb¬ 
ing a photon, ring D flips following 15Za/15£a 
isomerization, resulting in a strained chromophore in 
a protein cavity that lags in prompt adjustment. 
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Phytochrome Structure, Fig. 5 Reaction scheme for revers¬ 
ible Pr/Pfr photoconversion in BphPs 

Subsequent relaxation of the strained chromophore 
drives further conformational changes across the linear 
tetrapyrrole ring system from D to C, then to B and A. 
These events eventually lead to rearrangement in the 
chromophore-binding pocket thereby converting a light 
signal into structural signal(s) in the protein moiety. 

How the structural signal originated in the 
photosensory domains is transmitted to the C-terminal 
effector domain is not entirely clear. A plausible mech¬ 
anism that may be common to all sensor HKs including 
BphPs is that long “signaling” helices connecting adja¬ 
cent modular domains at the dimer interface are 
responsible for channeling structural signals over 
long distance via mechanisms similar to a torque 
(Moglich et al. 2010) and/or tweezers. 

Cross-References 

► 3D Electron Microscopy Based on Cryo-Electron 
Tomography 

► CNS (Crystallography and NMR System) 

► Ultrafast Raman Spectroscopy 
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Synonyms 

Induced fit binding; Protein-Protein Interactions 


Definition 

Many proteins are promiscuous protein binders in that 
they can bind multiple partners. Such proteins often 
have important biological roles and form nodes in 
protein interaction networks. There are numerous 
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Plasticity at Protein- 
Protein Interaction 
Interfaces, Fig. 1 Plasticity 
in the BH3-hinding interface 
ofBcl-X[_. (a) Structure of 
isolated Bcl-xL (hlue\ PDB 
accession code: 11x1). 

(b) Bc11-Xl (blue mesh) 
overlaid with the structure of 
a Bc11-Xl (orange ribbon)! 
BAD (cyan sticks) complex 
(PDB accession code: 2BZW). 

(c) Bc11-Xl (blue mesh) 
overlaid with the structure of 
a Bc11-Xl (orange ribbon)! 
PUMA (cyan sticks) complex 
(PDB accession code: 2YJ1). 

(d) Bc11-Xl (blue mesh) 
overlaid with the structure of 
a Bc11-Xl (orange ribbon)! 
ABT-737 (cyan sticks) 
complex (PDB accession 
code: 2YXJ) 



examples where promiscuous proteins bind their tar¬ 
gets using the same interaction interface. For example, 
two growth hormone receptor molecules bind a single 
growth hormone, protein growth hormone, using 
largely the same residues, but with each receptor 
targeting a different site on growth hormone (de Vos 
et al. 1992); pro-survival Bcl-2-like proteins bind 
BH3-only family proteins with varying degrees of 
selectivity, but although the same binding surfaces 
are used (a hydrophobic pocket on the Bcl-2-like 
proteins and the helical BH3 domains) (reviewed in 
Lessene et al. 2008), structural studies show small 
variations in binding (Fig. 1); and, the hinge region 
on the Fc fragment of human immunoglobulin G uses 
a common site to interact with at least four protein 
partners (DeLano et al. 2000). Although most well- 
defined protein-protein interfaces are relatively large 
(but can vary considerably in size depending on the 
affinity of binding and the nature of the interaction). 


mutational studies show that a small subset of the 
residues involved contributes most of the free energy 
of binding (DeLano 2002). These residues form bind¬ 
ing “hot spots”, which comprise usually less than half 
the contact area and are found at the center of the 
contact interface. Promiscuous binders can use the 
same hot spots for their different targets, and structural 
studies reveal that these promiscuous contact surfaces 
are adaptable, or structurally plastic, allowing a single 
protein to recruit a range of structurally diverse part¬ 
ners (DeLano et al. 2000). 

Basic Characteristics 

Structural plasticity in binding surfaces is always evi¬ 
dent to some extent, in that the surfaces of proteins 
usually show some differences in the bound and 
unbound states. However, the phenomenon reaches 
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extremes when considering intrinsically disordered 
proteins or protein domains. In the absence of 
a binding partner, such proteins are disordered, but 
often become folded on forming a complex with 
a binding partner (reviewed in Wright and Dyson 
2009). For example, an intrinsically disordered domain 
near the C-terminus of p53 takes up helical, (3-strand, 
or extended irregular structures when binding to dif¬ 
ferent partners (Oldheld et al. 2008). 

Although the term “hot spot” originally described 
a pseudo-energetic term (i.e., which residues appear to 
have the most effect upon binding when mutated, usu¬ 
ally to alanine), a picture is emerging of binding hot 
spots being general hot spots of binding activity. The 
selection of binding partners from naive peptide librar¬ 
ies (e.g., using phage display) frequently identify pep¬ 
tides that bind to established hot spots, as demonstrated 
by structural and biophysical studies or more indirectly 
by competing with the natural partner for binding 
(reviewed in Wells and McClendon 2007). Further, 
small molecule inhibitors of several protein-protein 
interactions tend to also target binding hot spots 
(reviewed in Wells and McClendon 2007). In these 
cases, the small molecules appear to bind more deeply 
into the contact surface of the target protein than does 
the natural binding partner, but may bind to cavities that 
are not seen in structures of either the free protein or the 
protein-protein complex, further emphasizing flexibil¬ 
ity or plasticity of protein binding surfaces (Fig. 1). 

Although small binding pockets do not seem to be 
available to larger proteins, computer simulations of 
molecular dynamics of a few unbound proteins in the 
absence of their small molecule binders indicate that 
transient pockets open rapidly (Brown and Hajduk 
2006; Eyrisch and Helms 2007). These data suggest 
that binders might select the appropriate binding states 
from an ensemble of conformations, rather than induc¬ 
ing different conformations upon or during binding, but 
it is difficult to experimentally distinguish these possible 
modes of binding. As demonstrated in small molecule- 
protein complexes, most of this plasticity comes from 
changes in conformation of side-chains and/or loops 
(Wells and McClendon 2007). Similarly, loop regions 
that appear to be flexible (access multiple conforma¬ 
tions) are frequently implicated in protein-protein inter¬ 
actions or interactions with other biomolecules. 

Structural plasticity appears to be a general property of 
proteins; the ability to make structural rearrangements 
makes them tolerant to mutation. It is likely that structural 


plasticity at protein-protein interfaces also allows 

adaptation to mutations as proteins coevolve (Atwell 

et al. 1997). 

Cross-References 

► Coupled Folding and Binding 

► Linear Motifs in Protein-Protein Interactions 

► Protein Dynamics and Allostery 

► Structurally Disordered Proteins 

References 

Atwell S, Ultsch M, De Vos AM, Wells JA. Structural plasticity 
in a remodeled protein-protein interface. Science. 1997;278: 
1125-8. 

Brown SP, Hajduk PJ. Effects of conformational dynamics on 
predicted protein druggability. Chem Med Chem. 2006; 1: 
70-2. 

de Vos AM, Ultsch M, Kossiakoff AA. Human growth hormone 
and extracellular domain of its receptor: crystal structure of 
the complex. Science. 1992;255:306-12. 

DeLano WL. Unraveling hot spots in binding interfaces: pro¬ 
gress and challenges. Curr Opin Struct Biol. 2002;12:14-20. 

DeLano WL, Ultsch MH, de Vos AM, Wells JA. Convergent 
solutions to binding at a protein-protein interface. Science. 
2000;287:1279-83. 

Eyrisch S, Helms V. Transient pockets on protein surfaces 
involved in protein-protein interaction. J Med Chem. 
2007;50:3457-64. 

Lessene G, Czabotar PE, Colman PM. BCL-2 family antagonists 
for cancer therapy. Nat Rev Drug Discov. 2008;7:989-1000. 

Oldfield CJ, Meng J, Yang JY, Yang MQ, Uversky VN, Dunker 
AK. Flexible nets: disorder and induced fit in the associations 
of p53 and 14-3-3 with their partners. BMC Genomics. 
2008;9(Suppl 1):S1. 

Wells JA, McClendon CL. Reaching for high-hanging fruit in drug 
discovery at protein-protein interfaces. Nature. 2007;450: 
1001-9. 

Wright PE, Dyson HJ. Linking folding and binding. Curr Opin 
Struct Biol. 2009;19:31-8. 


Polarized Infrared Spectroscopy 

Derek Marsh 

Max-Planck-Institut ftir biophysikalische Chemie, 
Gottingen, Germany 

Synonyms 

Fourier transform infrared spectroscopy; FTIR spec¬ 
troscopy; Infrared spectroscopy of oriented samples; 
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Orientation of protein segments determined by polar¬ 
ized infrared spectroscopy 


Definition 

Infrared spectroscopy of aligned samples, usually on 
a flat substrate or a stretched fiber, with the infrared 
beam at nonzero angle of incidence and polarized 
either parallel or perpendicular to the plane of 
incidence. Used to determine angular orientations of 
hiomolecules, particularly in membranes and fibrous 
proteins, mostly from the dichroism of the protein 
amide hands or lipid chain methylene hands. 


Basic Characteristics 


Dichroism of Aligned Samples 



The dichroic ratio R is defined from the absorbances, 
AII and Aj^, with radiation polarized parallel and per¬ 
pendicular, respectively to the plane of incidence: 

/?=A||/Ax (1) 

The value of R depends on the orientation of the 
infrared (IR) transition moment, relative to the plane of 
the substrate that aligns the sample, and on the ratios of 
intensities (e.g., E^/E^) of the electric field compo¬ 
nents of the radiation in the sample. The latter depend 
on the experimental setup, the most common of which 
is for attenuated total internal reflection (ATR) spec¬ 
troscopy with a Ge or ZnS internal reflection element 
(Table 1). The aligned samples have axial symmetry 
about the director, which is usually (e.g., for mem¬ 
branes) the normal to the plane of the aligning 
substrate. 


Polarized Infrared Spectroscopy, Table 1 Ratios of electric- 
field component intensities for 45°-cut Ge or ZnS ATR crystals 
(Marsh 1999a) 


Sample 

thickness 

(pm) 

El/El. 


El/El 



Ge 

ZnS 

Ge 

ZnS 

Ge 

ZnS 

Thick 

))0.4“ 

))0.9“ 

0.853 

0.450 

1.147 

1.550 

Thin 

((0.4“ 

((0.9“ 

0.877 

0.562 

0.828 

1.060 


“Penetration depth is directly proportional to wavelength. Values 
for 1,650 cm“' band. 


For transmission spectroscopy with non-zero angle 
of incidence, i: E^/E^ = I — sin^i/n^ and 
E^/E^ = sin^i/« 2 , where n 2 ^ 1.4-1.5. 

(/.-helix: For the amide I band of a-helices, the 
V||(A,0) and Vx(Ei,±27r//7) modes with resultant tran¬ 
sition moment directed parallel and perpendicular, 
respectively, to the helix axis are practically degener¬ 
ate and the component bands (which are comparable in 
intensity) superimpose (Marsh 1997; Marsh et al. 
2000). Consequently, R is determined by the orienta¬ 
tion 0 (relative to the helix axis) of the transition 
moment for an individual amide. The order parameter 
for the helix is then given by the following equation 
(Marsh 1997): 


(F2(cos7)) 


El/E]+El/E]-R 

/Mcos0)[£2/£f-^/£^-R] 


( 2 ) 


where y is the angle that the helix axis makes with the 
director, and P 2 (cosy) = *A(3cos^y — 1) (Marsh 2009; 
Kota et al. 2008). Angular brackets () indicate an 
average over all helix orientations. The orientations 
of the transition moment, relative to the a-helix axis, 
are: 0i = 38° and 0 a = 29° for the amide I 
(ca. 1,655 cm“*) and amide A (ca. 3,290 cm”*) 
bands, respectively (Marsh et al. 2000; see also 
Marsh and Pali 2001). 

Lipid chain: For the CH 2 -stretching modes of lipid 
chains (2,920 cm” antisymmetric stretch and 
2,850 cm”* symmetric stretch), the transition moments 
lie in the plane perpendicular to the chain axis 
(0ch = 90°). Therefore, the order parameter of the 
lipid chain axis is given by Eq. (2) with 
P2(cos0ch) = -0.5 (Marsh 2009; Kota et al. 2008). 

P-sheet: Eor the amide I and amide II bands of 
antiparallel p-sheets, the different modes are 
nondegenerate. That with resultant transition moment 
perpendicular to the p-strand axis, Vj^(7i,0), dominates 
for the amide I band (ca. 1,635 cm”*), and that with 
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resultant transition moment parallel to the P-strand 
axis, V||(0,7t), dominates for the amide II hand 
(ca. 1,540 cm”*). The P-sheets are distributed with 
axial symmetry and inclined at an angle a to the direc¬ 
tor, but the P-strands do not have axial symmetry 
because they are tilted at an angle P within the sheets 
(Marsh 1997). Measurement of the dichroic ratios of 
both amide I and amide II bands is therefore needed to 
determine the tilt, a, of the p-sheet. The dichroic ratio 
of the major amide I band is given by (Marsh 1997): 

El 2 (cos"«)(sin"/l) £2 

^ El 1 — (cos2a)(sin^/l) El 

and that of the major amide II band is given by 
(Marsh 1997): 

E^ 2 (cos^a)(cos^/l) E^ 

Rn = -^+ .\ (4) 

El 1 — (cos^a) (cos^p) El 


Quantifying Multicomponent Bands from 
Aligned Samples 

Quantifying band intensities from aligned samples 
(e.g., to determine secondary structural content of 
proteins) requires combination of measurements 
with parallel and perpendicular polarized radiation 
(Marsh 1999b). Generally it is not possible to quantify 
band intensities from aligned samples correctly when 
one uses unpolarized radiation. Use of a single polar¬ 
ization, or unpolarized radiation, is insufficient. For 
instance, if the transition moment is aligned parallel 
to one polarization, it would not contribute to the 
spectrum in the other polarization, and with the incor¬ 
rect weighting for unpolarized radiation. To reflect the 
total intensity, absorbances for parallel and perpendic¬ 
ularly polarized radiation are combined linearly with 
a weighting factor, G: 

A = A|| -h GAi (7) 


Combination of Ri and therefore allows deter¬ 
mination of < cos^a > and < cos^P > separately. The 
experimental protocol for aligned membranes is first to 
determine the strand tilt, p, within the sheet from 
measurements of Ri and Rn with samples dried from 
H 2 O. Then this value of p is used, together with Ri 
from samples in D 2 O, to determine the tilt of the sheet 
in hydrated membranes (Ramakrishnan et al. 2005; 
Anbazhagan et al. 2008a, b). The orientation of 
P-sheets is sometimes alternatively characterized by 
the net tilt, y, of the P-strands from the director and the 
azimuthal orientation, 9, of the sheets about the strand 
axis. The net strand tilt y is given by (Marsh 1997): 


where G depends on the experimental geometry. 
For the conventional situation in spectroscopy of mem¬ 
branes (director perpendicular to the aligning sub¬ 
strate) (Marsh 1999b): 

G = 2El/E]-EllE] ( 8 ) 

For a multicomponent band, the relative intensity of 
a particular component j is therefore given by: 

A|| ,■ + (lEljEl - EI/EDA^j 
fj = " X (9) 

j:(Ay + {2El/El-El/El)A^j) 


cos 7 = cos a cos/I (5) 

and the azimuthal sheet orientation, 9, about the strand 
axis is given by (Marsh 1997): 

cos r? = tan /I/ tan y ( 6 ) 

The origin for the azimuthal 9 angle is the normal to 
the P-sheet. Equations (4) and (5) show that y is 
obtained directly from Rn, but not readily for hydrated 
samples. Extension of this analysis to P-sandwiches is 
given in Ausili et al. (201 1). 

These methods are also applicable to IR dichroism 
measurements on isotope-edited a-helices and P- 
sheets. Methods of analysis are given in Marsh (2004). 


where the summation is over all components i in the 
spectrum. Generalization of Eq. (9) for nonequal 
extinction coefficients is given in Marsh (1999b). 


Dichroic Ratios of Multicomponent Spectra 

A common error is to calculate the dichroic ratio of a 
multicomponent IR band by simply adding absorbances 
with either parallel or perpendicular polarization. When 
the correctly weighted linear combinations of absor¬ 
bances are taken for the two polarizations, the dichroic 
ratio for the whole population is (Marsh 1999b): 


1 

Efj/iRj + G) 


-G 


( 10 ) 
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where Rj is the dichroic ratio of component j and 
J2.fj = 1 ■ For the particular case of a band consisting 
of just two components, the dichroic ratio is given by 
(Marsh 1999b): 

_ R2 + G _ 

l-A{Ri-R2)/{Ri+G) ^ ^ 

where/i +/2 = 1. Equation (11) is useful for correcting 
dichroic ratios where one part of the sample is 
unaligned, that is, R 2 = Riso^ appropriate to an isotro¬ 
pic distribution or for two titrating species where fi 
is given by the law of mass action, and for, e.g., 
P-sandwiches where /i « 0.5. Generalizations of 
Eqs. (10) and (11) for nonequal extinction coefficients 
are given in Marsh (1999b). 

Eurther details of all methods can be found in Marsh 
(1999a). 

Cross-References 

► Eourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► Infrared Spectroscopy of Protein Structure 

► Oriented Circular Dichroism Spectroscopy 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 

► Vibrational Spectroscopy with Neutrons 
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Polarized Light, Linear Dichroism, and 
Circular Dichroism 

Alison Rodger 

Department of Chemistry, University of Warwick, 
Coventry, UK 

Synonyms 

Anisotropic spectroscopy; Spectroscopy of chiral 
molecules; Spectroscopy of oriented systems 

Definition 

Spectroscopic techniques that use polarized light can 
provide more information about the samples being 
studied than those that use unpolarized light. Two 
techniques of particular utility for biomolecular sam¬ 
ples are linear and circular dichroism. Linear dichro¬ 
ism (LD) is the difference in absorption, A, of light 
linearly polarized parallel (//) and perpendicular (_L) to 
an orientation axis (Norden et al. 2010; Rodger and 
Norden 1997; Michl and Thulstrup 1986): 

LD=A//-A^ (1) 

LD is used with systems that are either intrinsically 
oriented or are oriented during the experiment. 
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a Linearly polarized 



b Right circularly polarized 




Polarized Light, Linear Dichroism, and Circular Dichroism, indicates the relative orientations of E, B, and k with the 
Fig. 1 (a) Linearly and (b) circularly polarized electromagnetic so-called right-hand rule (Norden et al. 2010 printed with 
radiation. Arrows denote direction of E. The hand on the right permission) 


Circular dichroism (CD) is similarly the difference 
in absorption of left and right circularly polarized 
light (Cantor and Schimmel 1980; Craig and 
Thirunamachandran 1984; Michl and Thulstrup 1986; 
Rodger and Norden 1997; Berova et al. 2000; Barron 
2004; Norden et al. 2010): 

CD = A| - A, (2) 

CD is particularly useful for studying solutions of 
chiral molecules, by which we mean ones that cannot 
be superposed on their mirror images. The word 
“dichroism” is derived from the Greek hi-xpropa 
meaning “two colors” (Soanes and Stevenson 2005) 
since for some samples two colors are seen when they 
are viewed from two different directions. 

In a linearly polarized light beam, all photons have 
their electric field, E, oscillating in the same plane, 
whereas in a circularly polarized light beam, the elec¬ 
tric field vector retains constant magnitude in time but 
traces out a helix about the propagation direction. 

Basic Characteristics 

A light beam may be characterized at each point in 
space or time by its electric field vector, E, its magnetic 
field vector, B, which is perpendicular to E, and its 
direction of propagation k. k, E, and B form a right- 
handed system as illustrated in Fig. 1. Linearly polar¬ 
ized light (often referred to as plane polarized light) is 
characterized by the electric field vector, E, of the 
photons oscillating in a plane. The magnitude of E 
varies sinusoidally in space as illustrated in Fig. 1 . 


Circularly polarized light is a form of polarization 
in which the electric field vector describes a 
cork-screw motion, completing one turn of a helix 
after having traveled a distance equal to the wave¬ 
length of light. Following the optics convention, we 
take the end of E for right circularly polarized light to 
form a right-handed helix in space at any instant of 
time as illustrated in Fig. 1. If we view right circularly 
polarized light sitting at a fixed point in space looking 
toward the light source, down the direction of propa¬ 
gation k, then the electric field vector traces out 
a clockwise circle. 

The two forms of light polarizations have different 
shapes: one being planar, the other looking asymmet¬ 
ric, like a screw. This textural difference illustrates the 
basis for quite different applications of the two forms 
of dichroism in the study of molecular structure: linear 
dichroism using linearly polarized light probes macro- 
scopically oriented molecules, whereas circular 
dichroism using circularly polarized light probes “heli¬ 
cal” molecules. Thus, while macroscopic orientation 
is a prerequisite for nonzero LD, net chirality of the 
molecules in the sample is a prerequisite for nonzero 
CD. In a randomly oriented solution phase sample, this 
means the molecules that have CD signals must not be 
superposable on their mirror images, that is, they must 
be chiral. If one is working with anisotropic samples of 
achiral molecules, such as crystals or fibers, one may 
obtain nonzero differences in absorbance of left and 
right circularly polarized light. However, the sign and 
magnitude of this signal will depend on the orientation 
of the sample with respect to k, so we usually restrict 
our attention to CD spectra of solutions of chiral 
molecules. 
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► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Circular Dichroism Spectroscopy: Units 

► DNA-Ligand Circular Dichroism 

► DNA-Ligand Flow Linear Dichroism 

► Far UV Protein Circular Dichroism 

► Linear Dichroism 

► Linear Dichroism Spectra - Measurement 

► Linear Dichroism Spectroscopy: Theory 

► Near UV Protein CD 

► Nucleic Acid Linear Dichroism 
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Polymeric Heart Valves 
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^Department of Mechanical Engineering, 
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Introduction 

Cardiac valve replacement (CVR) represents the 
second most common cardiac operation after coronary 
artery bypass grafting (Roberts and Ko 2005). A CVR 
is usually needed when a severely damaged heart valve 
fails to maintain its normal physiologic function, 
which is providing unidirectional smooth blood flow 
through the heart chambers and the main vessels by 
regular opening and closing throughout each cardiac 
cycle. 

The heart valves are unique elements of the cardio¬ 
vascular system and play a crucial role in regulating 
the blood flow by continuous cyclic opening and clos¬ 
ing under the extremely demanding mechanical envi¬ 
ronment. Depending on their anatomy, the heart valves 
could be subject to different dynamic workload and 
mechanical conditions. The aortic and mitral valves 
perform in a highly stressed region on the left side of 
the heart, regulating the high-pressurized oxygenated 
blood flow to the arterial system. In the right side of the 
heart, the tricuspid and pulmonary valves are assigned 
to a less stressful but still demanding state. The heart 
valves are predisposed to disease because of function¬ 
ing under such extreme cyclic loading conditions 
where hemodynamics plays a critical role in valve 
pathology. The vital function of the heart valves can 
be affected by congenital or acquired degenerative 
valvular diseases (e.g., calciflcation), resulting 
in obstruction of the blood flow (stenosis) and/or 
backward leakage flow of the blood (regurgitation) 
(Otto and Bonow 2012; Roger et al. 201 1). 

CVR was first introduced at early 1960s, and 
became a routine treatment for patients with severely 
damaged heart valves. Currently, the CVR procedure 
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Polymeric Heart Valves, 
Fig. 1 (a) A bileaflet 
mechanical heart valve (SJ 
Medical); (b) A stented 
bioprosthetic valve 
(HANCOCK II ULTRATM) 



can be performed via an open heart surgery using sur¬ 
gical prostheses or less invasive transcatheter valve 
implantation. The surgical valve prostheses are avail¬ 
able in forms of either mechanical or bioprosthetic 
valves. Mechanical heart valves (MHVs) are usually 
made from pyrolytic carbon, medical grade metal 
alloys and polymeric materials, and consist of one or 
more moving components (occluders) encompassed 
within a frame (housing). They evolved from early 
design of caged ball valves to titling disc and more 
hemodynamically efficient bileaflet valves (Fig. la). 
Bioprosthetic heart valves (BHVs) are formed by glu- 
taraldehyde-treated animal-derived tissues (mainly 
porcine valve and bovine pericardium) reinforced 
with a rigid plastic frame (Fig. lb). The BHVs can 
also be available in the form of stentless prosthesis 
with improved hemodynamics. The transcatheter 
implantable prostheses are animal-derived tissue valves 
reinforced with metal mesh frames (Nitinol or stainless 
steel), which can be crimped into a catheter and percu- 
taneously delivered to the anatomical position. 

Despite the huge benefits of replacement valvular 
prostheses and their significant improvement over the 
past five decades, the currently available prostheses are 
not ideal, still unable to meet the needs for more 
durable tissue valves, and less thrombogenic mechan¬ 
ical valves (Lapeyre et al. 2010). The introduction of 
transcatheter valve implantation technology in the 
early 2000s was a major breakthrough which improved 
the outcome of CVR for high-risk patients. Nonethe¬ 
less, the major problems related to the thrombogenicity 
and limited durability still remain unresolved. 

The synthetic leaflet heart valves, including poly¬ 
meric heart valves (PHVs), represent an attractive 
alternative to the existing replacement valve 


prostheses, owing to their favorable physical proper¬ 
ties, reproducible and affordable manufacturing, and 
flow dynamics similar to human heart valves. Flexible 
PHVs are believed to be able to combine the superior 
durability of MHVs and the enhanced hemodynamics 
of BHVs, while overcoming their major drawbacks. 
After the development of biomaterials with syntheti¬ 
cally enhanced properties and advanced manufactur¬ 
ing techniques, PHVs have started to draw more 
attention. These materials have the advantage of 
being easily engineered and being able to reproduce 
the geometry and hemodynamics of native valves. 
However, they have not yet become a valid heart 
valve replacement option, since they suffer premature 
failure, mainly due to their suboptimal design and lack 
of a durable, biostable, and hemocompatible polymers. 
The current clinical application of PHVs is limited to 
use for artificial hearts and ventricular assist devices 
(VADs). 

Biopolymers for the Heart Valve Application 

The choice of material is a crucial element in develop¬ 
ing PHVs, since it is the major factor determining the 
durability and biocompatibility of the valve. Several 
synthetic materials such as silicon and polyolefin rub¬ 
ber (Kiraly et al. 1982), polytetrafluoroethylene 
(PTFE) (Nistal et al. 1990), poly(styrene-block-isobu- 
tylene-block-styrene) (SIBS) (Gallocher et al. 2006), 
EPDM rubber (Cacciola et al. 2000), polyvinyl alcohol 
(PVA) cryogel (Jiang et al. 2004), and segmented 
polyurethane elastomers (Bemacca et al. 2002; 
Wheatley et al. 2000) have been tested as heart valve 
leaflets (Table 1). Polyvinyl alcohol (PVA) has been 
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Polymeric Heart Valves, Table 1 Summary of materials that have been investigated for use in polymeric heart valves (Taken from 
Ghanbari et al. (2009)) 

Material 

Advantages 

Drawbacks 

Outcomes 

Silicone 

Good flexibility 

Short durability 

Suffered from structural failure and impaired 


Adequate biocompatibility 

Leaflet tearing 
Thrombus formation 

hemodynamics 

PTFE 

Good hemodynamics 

Low resistance to 
thromboembolism 
and calcification 

PTEE was found to be unsuitable for developing 
heart valves because of its major complications 



Lee edge inversion 
and stiffening of the 
leaflet 


PUs 

Good viscoelasticity 

Susceptible to 
hydrolysis 

Biodegradation and calcification have been the 
main problems in several in vitro and in vivo 

Polyester 

urethane 

Good resistance to hydrolysis 

Low resistance to 
oxidation 

studies 

PEU 

Good resistance to hydrolysis and 

Prone to calcification 


PCU 

oxidation 



PVA 

Good mechanical properties 

Not suitable for dip¬ 
casting 

Satisfactory mechanical properties, but PVA 
needs to be assessed further for other 
characteristics 

SIBS 

Enhanced resistance to hydrolysis and 
oxidation 

Causes platelet 
activation and 
thrombogenicity 

Having resistance to oxidation and acid 
hydrolysis, biostability and minimized thrombus 

PDMS- 
PHMO PU 

Good biostability and mechanical 
properties; good resistance to oxidation 
and calcification 

Difficult to 
manufacture 

Showed better long-term stability than PDMS- 
based PU but needs more studies to show the real 
advantage of this structural modification 

POSS-PCU 

nanocomposite 

Excellent biocompatibility; resistance 
to oxidation, hydrolysis, calcification, 
and thrombogenicity 

Not detected yet 

Based on potentially improved biocompatibility 
and biostability, the development of POSS-PCU 
heart valve is currently under investigation. 

investigated 

as potential material for artificial heart segments. 

and soft elastomeric segments. The hard 


valves, demonstrating good mechanical properties, 
but unknown biostability and calcification resistance 
properties. Polyolefin (polystyrene-polyisobutylene- 
poly-styrene), also known as “QuatromerTM,” was 
developed in the late 1990s and shown to have ade¬ 
quate resistance to in vitro oxidation and hydrolysis 
and thrombogenicity. 

Polyurethane (PU) elastomers have been the 
most commonly used polymer for cardiovascular 
applications such as heart valves, mainly owing to 
their superior bio- and hemocompatibility, and physi¬ 
cal strength. Nevertheless, the long-term application of 
PU is limited by its low biostability, mainly caused by 
its susceptibility to degradation and calcification. The 
PU degradation results from oxidation, acid hydroly¬ 
sis, or enzymatic pathways, which eventually leads to 
the loss of mechanical strength and leaflet tearing. PU 
chemical microstructure consists of hard crystalline 


segments are formed from the reaction of a 
diisocyanate with a short chain diol or diamine 
(“chain extenders”), typically 1,4 butane diol or ethyl¬ 
ene diamine. The soft segments are formed from the 
reaction of the diisocyanate with higher molecular 
weight polyols typically in the range of 1,000- 
2,000 Da, such as polyether, polyester, or polycarbon¬ 
ate. The functional characteristic (e.g., mechanical 
properties) of PU is determined by the ratio of these 
two phases, and can be changed by varying the 
percentage of its soft segment. 

The three main types of PUs, polyester urethane, 
polyether urethane (PEU), and polycarbonate urethane 
(PCU), were developed with different soft segments in 
an effort to improve the properties of PU polymers for 
biomedical application. The first generation of PUs, 
polyester urethanes, was susceptible to rapid hydroly¬ 
sis of the polyester soft segment, and therefore not 
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suitable for long-term implants. To address this issue, 
PEUs were developed benefiting from excellent 
hydrolytic stability; yet, they demonstrate oxidative 
degeneration under in vivo conditions. The PCU was 
the third generation of PU with significantly higher 
resistance against biodegradation. 

The biostability of PUs can also be improved by 
incorporating special molecules which are resistant to 
biodegradation to its chemical structure. For instance, 
polydimethylsiloxane (PDMS), has been linked to the 
backbone of the PU in the presence of polyhexa- 
methylene oxide (PHMO), providing good thermal 
and oxidative stability. Moreover, the calcification 
resistance of the PUs has also been improved by cova¬ 
lent binding of anti-calcification agents such as 
bisphosphonate (2-hydroxyethane bisphosphonic acid 
[HEBP]) and/or heparin into the structure of the 
polymer. 

Mechanical Properties 

The native heart valve leaflets are made of complex 
tissue structures with nonlinear mechanical behavior. 
The heart valve leaflets can achieve very large strains 
with low stress in the radial direction and simulta¬ 
neously withstand significant pressure from the blood 
to prevent retrograde flow. For an artificial heart valve, 
it is therefore essential to mimic the behavior of the 
native valve and be able to withstand large deforma¬ 
tion demanded during the cardiac cycle (David 2010). 

The mechanical properties, particularly the stiff¬ 
ness, of the leaflet material, have a huge impact on 
the hemodynamic function of the PHVs. The leaflets 
made from a low modulus material are relatively 
flexible (elastic) and can achieve faster opening and 
closing. However, such material Is more susceptible to 
undergo high localized strain accumulation in high- 
stress regions, resulting in poor durability of the leaf¬ 
lets and eventual failure of the valve. In contrast, 
a higher modulus material may improve durability of 
the valve, but their stiffer structure leads to 
a suboptimal hemodynamics. A stiffer leaflet is likely 
to be more difficult to open and may be less likely to 
close tightly, hence producing a valve with greater 
regurgitation compared to a valve made of more flex¬ 
ible leaflets. Therefore, a material with a balanced 
stiffness is required to provide an optimum biome¬ 
chanical and hemodynamic function for the PHVs. 


The practical limit on material modulus for a viable 
heart valve leaflet material is reported to be close to 
32 MPa, and less than 63 MPa (Bemacca et al. 2002). 

Emergence of Nanotechnology 

Emerging technologies such as nanotechnology and 
the recent advance of biopolymer synthesis and surface 
modification methods have provided potential solu¬ 
tions to the problems of material fatigue and sustain¬ 
able functional characteristics. A novel family of 
nanocomposite polymers, based on the nanocage poly¬ 
hedral oligomeric silsesquioxane (POSS) integrated as 
pendant chain functional groups to a backbone of poly 
(carbonate-urea)urethane (PCU), has been recently 
introduced for biomedical applications (Fig. 2). 
POSS-PCU is a type II, nanocomposite polymer 
consisting of a hard crystalline segment and soft elas¬ 
tomeric segments which can be chemically engineered 
to have a range of physiochemical properties. 

The POSS-PCU nanocomposite polymer is 
implanted in human as lower limb bypass graft, lacri¬ 
mal duct conduit, and recently as a tracheal replace¬ 
ment (Jungebluth et al. 2011). POSS-PCU exhibits 
a range of beneficial properties making it a very desir¬ 
able material for cardiovascular applications, in par¬ 
ticular, developing heart valve leaflets (Fig. 3). 
Through a series of in vitro and in vivo studies, the 
POSS -PCU has shown excellent cytocompatibility 
(Kannan et al. 2006a), hemocompatibility, and biolog¬ 
ical stability (with significant reduction of inflamma¬ 
tory response) (Kannan et al. 2007), as well as 
resistance to thrombogenicity (Kannan et al. 2006b), 
calcification (Ghanbari et al. 2010), biodegradation 
(physiological, oxidative, and hydrolytic) (Kannan 
et al. 2006c), and also improved mechanical and vis¬ 
coelastic properties compared to PU-based materials 
(Kidane et al. 2009). 

Surface Modification and 
Biofunctionalization 

The biocompatibility of the materials is determined by 
the interactions of their surfaces with the biological 
system. The physiochemical characteristics of the bio¬ 
material surface can influence the biological reactions 
at the interface. Chemical properties such as the 
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Polymeric Heart Valves, Fig. 2 The POSS nanocage, with 
approximate size of 1.5 nm consists of a symmetrical inner 
inorganic framework of silicon and oxygen atoms, externally 
covered by organic functional groups, which are covalently 


bonded to PCU. The incorporation of POSS into PCU has 
resulted in an advanced nanocomposite polymer with unique 
characteristics for heart valve leaflet applications. I-Bu=R, 
Isobutyl tertiary groups 


Polymeric Heart Valves, 

Fig. 3 Cardiovascular 
application of POSS-PCU 
nanocomposite polymer, (a) 
a small diameter bypass graft 
made of POSS-PCU (internal 
diameter 5 mm), (b) the 
implantation of this bypass 
graft in a sheep; (c) the same 
bypass graft after 24 months of 
implantation with fully 
endothelialized surface, (d) 

An aortic valve fabricated 
from POSS-PCU 
nanocomposite polymer with 
a Nylon stent 



Carotid bypass surgery 
in an ovine model 


Endothelialised, patent graft 
after 24 months 


surface energy, hydrophobicity, and hydrophilicity, 
together with the morphology and topography of the 
surface, have a crucial effect on the biocompatibility of 
a material. 


Biofunctionalization is a surface modification pro¬ 
cess, which can be applied in an attempt to improve 
selective surface properties (such as biocompatibility) 
of the polymeric materials by compromising its 
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Polymeric Heart Valves, Table 2 Representative surface and structural modifications for improving biomaterial properties (taken 
from Ghanbari et al. 2009) 

Modification 

Advantages 

Comments 

HEBP-bounded 

PU 

Improves resistance to calcification 

Anti-calcification agents like HEBP have been shown to be 
effective in reducing the calcification rate of PU 

Cholesterol- 
modified PU 

Increases endothelial cell adhesion 

Improves self-endothelialization by enhancing cell affinity and 
decreases thrombosis formation and enzymatic degradation 

Fiber- reinforced 
SIBS 

Improves hemocompatibility 

Enhances the mechanical properties, durability, and 
hemocompatibility of the polymer 

RGD 

incorporation 

Promotes endothelial cell adhesion 

Improves cell adhesion and endothelialization of polymers, which 
can increase hemocompatibility 

PHI 

Modifies the hydrophobicity of surfaces 
to enhance biocompatibility 

Frequently used to convert hydrophobic polymers to hydrophilic 
polymers and might improve biocompatibility 

N anotopographic 
surface 

Improves cell adhesion, selective 
proliferation, and differentiation 

Might improve the adhesion and proliferation of specific types of 
cells due to the topography of the surface at the nanoscale 

Incoiporation of 
nanomaterials 

Improves the mechanical properties, 
biostability, and biocompatibility of 
polymers 

Incorporation of nanostructured materials into the backbone of 
polymers improves biocompatibility, resistance to calcification, and 
durability 


HEBP 2-hydroxyethane bisphosphonic acid, Pill plasma immersion ion implantation, PU polyurethane, RGD arginine-glycine- 
aspartic acid, SIBS poly(styrene-block-isobutylene-block-styrene) 



Polymeric Heart Valves, Fig. 4 Schematic illustration of 
surface modification of synthetic materials by biofunctional 
peptides. The surface of a synthetic biomaterial can be 
biofunctionalized by providing cell-specific adhesion motifs 
(green arrows) on the surface to facilitate the attachment of 
endothelial progenitor cells (EPCs) (shown as white cells) in 

interaction with the biological systems without altering 
their bulk properties (Table 2). For example, the cell 
attachment and proliferation on a synthetic surface can 


the circulatory blood and therefore promote in situ endothelia- 
lization. EPCs can proliferate and differentiate to mature endo¬ 
thelial cells and have an important role in neovascularization and 
angiogenesis. Potentially, such surface modification and 
biofunctionalization techniques can be used to endothelialize 
a heart valve biomaterial and enhance its hemocompatibility 

be improved by utilizing different techniques such as 
plasma immersion ion implantation (PHI), cholesterol, 
and peptide modihcation. These modifications 
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Polymeric Heart Valves, Fig. 5 Schematic illustration of 
surface modification of synthetic materials by nanotopographic 
features, (a) The surface nanotopographical features can result 
in a more cell friendly surface compared to standard surfaces, 
and promote cell attachment, growth, and differentiation. Sur¬ 
face topography can also cause selective cell proliferation by 
increasing the affinity of the surface to a specific cell type. The 
bone marrow-derived stem cells (BMSCs) are multipotential 
cell source that includes EPCs. EPCs preferentially interact 
with the surface that has a specific nanotopography (left), 
thereby selectively growing on this surface. By contrast. 


several different cell types will attach to and grow on 
a standard polymer surface, leading to a mixed cell population 
(right). This method can potentially be used in heart valve 
biomaterial to increase their affinity to EPCs to promote 
endothelialization. (b) Atomic force microscopy (AFM) 
images of surface topography of PCU polymer and POSS- 
PCU nanocomposite polymer. The modification of PCU 
with POSS nanoparticles results in the formation of 
nanotopographic features (i.e., shape, surface area, and charge 
of a compound’s external surface) on its surface and can pro¬ 
mote attachment and proliferation of EPCs 


enhance the affinity of the surface to specific cell types, 
such as endothelial cells, resulting in the formation of 
an endothelialized layer upon the synthetic surface 
(Fig. 4). The endothelialization can protect the valve 
from being treated as a foreign body by the immune 
system and can also increase the blood compatibility of 
the surface (De Mel et al. 2010). 

The morphology and topography of the biomaterial 
is also known to affect several cell behaviors, includ¬ 
ing cell adhesion, proliferation, differentiation, and 


apoptosis (Fig. 5). The cells can sense and respond to 
the topographical features of a biomaterial surface by 
so-called “contact guidance,” which is a directed loco- 
motory response of cells to an anisotropy of the envi¬ 
ronment. For instance, the fibroblasts are known to tend 
to align along ridges or in parallel to the alignment of 
collagen fibers in a stretched gel. Therefore, modifying 
a surface by creating topographic features at micro- and 
nanometer scale can create a cell-responsive surface on 
the heart valve biomaterials (Ghanbari et al. 2009). 
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Summary 

Despite more than half a century of extensive scientific 
and clinical investigations, the clinical application of 
synthetic leaflet heart valves is challenged by their 
susceptibility to thrombogenicity, material degrada¬ 
tion, and limited durability due to oxidation, hydrolysis 
and calcification, leading to leaflet stiffening and tear¬ 
ing. However, significant advances in novel technolo¬ 
gies, such as nanotechnology, together with the 
remarkable progress in polymer synthesis methods, 
has resulted in the development of several biostable 
polymers with improved physical and biological prop¬ 
erties. These enhanced materials, with the advantages 
of being easily engineered, being able to reproduce the 
geometry and hemodynamic characteristics of natural 
valve, have evoked new hopes for the development of 
advanced polymeric heart valves for CVR. 
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Synonyms 

Carbohydrates; Carbohydrate polymers; Glycans 

Definition 

Polysaccharide biomaterials are polymers and oligo¬ 
mers formed from carbohydrate monomers that are 
used as materials in biomedical applications. Glycans 
are polysaccharides of monosaccharide repeat units 
covalently linked by glycosidic bonds. Some authors 
use the term “polysaccharide” for other polymers that 
have carbohydrate pendant groups on non-polysaccha- 
ride backbones or carbohydrate-derived monomers 
that are not connected by glycosidic bonds. These 
might be more broadly classified as “glycopolymers.” 

Introduction 

Polysaccharides include the most abundant polymers 
in nature (cellulose and chitin). They are fundamental 
molecules of life with a diverse and unique set of 
biochemical and biophysical functions. Their natural 


abundance and many biological functions have made 
them attractive as biomaterials. Their use as biomate¬ 
rials has been facilitated by successful development of 
techniques to chemically modify them, process them 
(e.g., into gels, films, fibers, micro- and nanoparticles, 
and composites), and establishment of their biocom¬ 
patibility with respect to a wide range of applications 
in medicine and biology. 

Among the major classes of biological polymers 
(proteins, nucleic acids, and polysaccharides), the 
structure and function of polysaccharides is uniquely 
divergent. The wide range of physicochemical and 
biological properties arises from variations in the gly¬ 
cosidic bonds, molecular weight, and chemical sub¬ 
stituents. While nucleic acids and proteins are 
synthesized sequentially, according to a biological 
blueprint, many biological polysaccharides do not 
have a prescribed sequence. Heparin and heparan sul¬ 
fate, two glycosaminoglycans, are examples of poly¬ 
saccharides with divergent sequences produced from 
multiple modifications that occur during and after 
polymerization. These stochastic modifications 
include epimerization, A-deacetylation, A-sulfonation, 
and (9-sulfation, and they are controlled largely by the 
substrate specificity of the responsible enzymes 
(Roden et al. 1992; Lindahl 1999). Hence, differential 
expression and activity of these enzymes can result in 
polysaccharides with different properties depending 
upon the source species, age, and tissue. Another 
example, poly(D-glucopyranose)s, called glucans are 
all formed from single monomer (o-glucopyranose), 
but have different glycosidic bonds and degrees of 
branching. This bonding and branching gives a wide 
range of properties such as water solubility (ranging 
from soluble amylose to insoluble cellulose) and struc¬ 
ture (Kajiwara and Miyamoto 1998). Sequence and 
structure diversity imparts some polysaccharides with 
broad repertoire of biological functions. This diversity 
presents challenges for characterization, and vast 
opportunities for incorporating the wide range of 
polysaccharide functions into biomaterials (Boddohi 
and Kipper 2010). 

Polysaccharide Chemistry and Nomenclature 

The array of names used for polysaccharides can be 
dizzying. The reader is referred to the entry on “► Car¬ 
bohydrate Nomenclature” and to the lUPAC-IUBMB 
recommendations (Mcnaught 1996) for an overview of 
the naming conventions used. Here, we briefly 
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Polysaccharide Biomaterials, Fig. 1 Three polysaccharides used in biomaterials. {Left) ► Cellulose, a neutral polysaccharide; 
(Center) hyaluronan, a negatively charged polysaccharide; (Right) chitosan, a positively charged polysaccharide 


introduce polysaccharide nomenclature so that we can 
discuss polysaccharides used in biomaterials. 

Natural polysaccharides have a trivial name that is 
usually related to their natural source. Trivial names 
often end in “-an,” “-in,” “-ose,” and “-ate” (used for 
polysaccharides containing hexuronic acids). Exam¬ 
ples include hyaluronan, chitin, ► cellulose, and algi¬ 
nate. The “-ate” suffix is also sometimes used in 
reference to a salt (e.g., “sodium hyaluronate”). The 
suffix “-ic acid” is also sometimes used synonymously 
for polysaccharides containing hexuronic acids, (e.g., 
alginic acid and hyaluronic acid are synonyms for 
alginate and hyaluronan, respectively). The lUPAC- 
lUBMB recommendations suggest retaining these tra¬ 
ditionally used names, but using only the “-an” suffix 
for newly named polysaccharides (Mcnaught 1996). 

Systematic names for homopolysaccharides are 
formed by replacing the “-ose” or “-ic acid” ending of 
the pyranose, furanose, or uronic acid monomer 
trivial name with “-an” (e.g., glucan, xylan, and 
galacturonan). The glycosidic bond and anomeric 
form (a or P) can also be included in systematic 
names (e.g., (1 ^ 4)-P-D-glucan) (Mcnaught 1996). 
In specifying the glycosidic bond, the ^ points away 
from the number indicating the anomeric carbon and 
toward the number indicating the carbon to which it is 
bonded. It is sometimes replaced with either a dash or 
a comma. Saccharide monomers can be abbreviated 
when specifying a sequence. The most common abbre¬ 
viations are Fru (fructose). Gal (galactose), Glc 
(glucose), Ido (Idose), Man (mannose), and Xyl 
(xylose). Pyranose and furanose forms are distinguished 
by italicized p and / (e.g., Glcp and Fm/ for 
glucopyranose and fmctofuranose, respectively), 
but these are often omitted (Mcnaught 1996). Amine, 
A-acetyl, carboxylic acid, and a few other moieties are 
common enough to warrant their own abbreviations. 
Abbreviations for common monosaccharide deriva¬ 
tives include GlcN (2-amino-2-deoxy-glucopyranose), 
GlcNAc (2-deoxy-2-A-acetyl-glucopyranose), GlcA 
(glucuronic acid), IdoA (iduronic acid), ManA 


(mannuronic acid), and GulA (guluronic acid). Natu¬ 
rally derived and synthetically modified polysaccha¬ 
rides can have a myriad of other functional groups. 
However, many such substituted monomers do not 
have systematic abbreviations, so some authors may 
make up their own conventions. For example, 
anGal is a common abbreviation used for 3,6- 
anhdyro-galactopyranose. Sulfate substituents are also 
commonly indicated, as in GlcA2S for 2-G-sulfo- 
glucuronic acid. Any of these abbreviations can also 
contain the d- or l- prefix to indicate the isomer. If 
omitted, it may usually be assumed that the d- isomer 
is intended. (Fucose and iduronic acid are notable 
exceptions that usually appear as the l- isomers in 
natural polysaccharides.) (Mcnaught 1996). Examples 
using these conventions to specify polysaccharide 
sequences are provided below as we introduce poly¬ 
saccharides used in biomaterials. 

Polysaccharides used in biomaterials may be 
broadly classified according to their chemical 
functionality as neutral polymers, polyanions, and 
polycations. Figure 1 shows an example of each of 
these types of biomaterials. 

Glucans are neutral polysaccharides formed from the 
D-glucopyranose monomer (o-Glc/r). Glucans used in 
biomaterials include dextran ([6)-a-D-Glc-(l^]n, with 
1^3 branches), pullulan ([6)-a-D-Glc-(l ^ 4)-a-D- 
Glc-(1 ^ 4)-a-D-Glc-(l^]n), ► cellulose ([4)-P-d-G1c 
( l^]n) (Fig. 1 left), and the oligomers maltodextrin 
([4)-a-D-Glc-(l—!■]„) and cyclodextrins (cyclic oligosac¬ 
charides with the same stmcture as maltodextrin). 
Agarose is a neutral polysaccharide based on 
a disaccharide ([4)-a-L-anGal-(l ^ 3)-P-D-Gal-(l^]n). 
Galactomannans are polysaccharides with a backbone of 
[4)-P-D-Man-(l—>]n with some mannose residues modi¬ 
fied with a a-D-Gal linked via 1^6 glycosidic bonds. 
Galactomarmans are found as the polysaccharide com¬ 
ponents of gums such as guar gum and carob gum. 
Fructofuranose-based polysaccharides (levans and inu- 
lins) are less studied, but might also have biomedical 
applications (Combie 2006). 
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Polyanionic polysaccharides include those with 
hexuronic acids and sulfate substituents. Alginate is 
a block copolymer of [4)-P-D-ManA-(l^]n and [4)-a- 
l-Gu 1A-(1—Sulfated polysaccharides behave as 
strong polyanions and include ihtfucans, carrageenans, 
and most ► glycosaminoglycans. Fucans are a diverse 
group of algal sulfated polysaccharides based on l- 
fucose (Chaubet et al. 2000). Carrageenans also come 
from algae. They are structurally similar to agarose with 
a substituted disaccharide repeat unit based on [4)-a-D- 
Gal-(1 ^ 3)-P-D-Gal-(l^](Rabaneletal. 2006). Some 
carrageenans contain o-anGal residues, and many 
contain sulfate substituents. Pectins are plant-derived 
copolymers of [4)-a-D-GalA-(l^] and [4)-6-0- 
methyl-a-D-GalA-(l^]. Hyaluronan is a non-sulfated 
► glycosaminoglycan. The structure of hyaluronan is 
[4 )-P-d-G1cA-(1 ^ 3)-P-d-G1cNAc-( 1^] (Fig. 1 cen¬ 
ter). The sulfated ► glycosaminoglycans include chon- 
droitin sulfate, keratan sulfate, dermatan sulfate, 
heparin, and heparan sulfate. These are found in 
many animal tissues, usually as the polysaccharide 
substituents of ► proteoglycans. They can have very 
complex structures based on a disaccharide repeat 
unit of a hexuronic acid (or hexose) and a hexosamine 
and contain probably the most sequence diversity 
among natural polysaccharides. For example, chon- 
droitin sulfate is based on the [4)-p-D-GlcA-(l ^ 3)- 
P-D-GalNAc-(l^] disaccharide with sulfates on the 4- 
or 6-carbon of GalNAc or the 2-carbon of GlcA. Some 
GlcA are epimerized to L-IdoA in chondroitin sulfate. 
Fleparin is based on [4 )-P-d-G1cA-( 1 ^ 4)-a-D- 
GlcNAc-(l^]. Fleparin also contains some tx-L-IdoA 
residues, and is highly sulfated (at the 2-carbon of 
IdoA and GlcA, and the 6- and 3-carbons of GlcNAc); 
many of the GlcNAc residues are also A-sulfonated 
(GlcNS). Heparan sulfate has a similar structure but 
contains fewer sulfates. This variety leads to as many 
as 17 different disaccharide monomers for heparin and 
heparan sulfate (Lindahl and Kjellen 1999). This 
structural diversity coupled with the wide range of 
their biological activity has led some to conjecture 
that ► glycosaminoglycans could be very informa¬ 
tion-rich polymers (Biilow and Hobert 2006). 

Polycationic polysaccharides include chitin 
and chitosan. Chitin is produced by arthropods 
and mollusks, and is supplied commercially as a 
by-product of the shellfish industry. Chitin is 
a copolymer of P-(l ^ 4)-linked d-G1cNAc and 
d-G1cN, with a high ratio of GlcNAc to GlcN. Chitosan 


(Fig. 1 right) is formed by A-deacetylation of chitin. It 
is typically characterized by the degree of acetylation 
(DA), dehned as the percentage of GlcNAc residues. 
Some authors refer alternatively to the degree of 
deacetylation (DDA = 100% — DA). Chitosan is 
formally defined as having a nitrogen content greater 
than 7% and DA < 40% (Muzzarelli and Muzzarelli 

2005) . Unlike most anionic polysaccharides, which 
readily dissolve in water, chitosan is only soluble in 
weak organic acid solutions below the pKa of 
the amine (^6.0-6.5). Its dissolution is facilitated by 
the formation of an ammonium salt of the acid and the 
amine. Chitosan can also be dissolved in hydrochloric 
acid, but is not very soluble in other mineral acids. 
This enables chitosan-based materials to be stable at 
physiological conditions (pH > pKa). 

In Vitro Synthesis and Synthetic Modifications of 
Polysaccharide Biomaterials 

While most polysaccharides used in biomedical applica¬ 
tions are from natural sources, significant effort has been 
invested to develop in vitro synthesis techniques. The 
polyvalent nature of monosaccharides necessitates using 
protected monomers to control glycosidic bond forma¬ 
tion. The control of stereochemistry is also challenging. 
The most successful attempts at in vitro polysaccharide 
synthesis yielding high polymers are those that employ 
natural enzymatic mechanisms. Work in this area using 
glycosyl transferases and engineered glycosidases has 
been reviewed by Faijes and Planas (2007). Polysaccha¬ 
rides synthesized in vitro have not yet attained 
widespread use in the field of biomaterials. 

Natural polysaccharides are commonly synthetically 
modified to introduce additional chemical functionali¬ 
ties. Chitosan and the glucans dextrose and ► cellulose 
in particular have been modified extensively. In glucans, 
the pendant hydroxyl groups have been derivatized to 
sulfates, phosphates, and a wide variety of ethers, esters, 
thioethers, and thioesters. Such derivatization has also 
been used to introduce many other functional groups, 
including other polymers (e.g., poly(ethylene glycol)), 
and complex bioactive prodmgs which release 
a therapeutic moiety upon ester cleavage (Heinze et al. 

2006) . Chitosan is phosphorylated at the hydroxyl 
groups to improve water solubility. More often, chitosan 
is derivatized via the pendant amine. This can be 
converted to a quaternary ammonium group by strong 
methylating agents, which improves water solubility and 
antimicrobial activity. Alkylation or acylation of the 
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amine group improves solubility in organic solvents 
(Prabaharan 2008). Acrylic and vinyl polymers, 
polysaccharides, and other biomolecules have also 
been grafted to the pendant amine of chitosan to form 
a variety of functional chitosan derivatives as biomate¬ 
rials (Prabaharan 2008). 

Biological Properties and Functions of 
Polysaccharides 

Most membrane proteins and secreted proteins are 
glycosylated, and disruptions of glycosylation often 
have deleterious effects on protein folding and func¬ 
tion. Hence, it is not surprising that carbohydrate 
chemistries in general have a tendency to stabilize 
protein structures. The general mechanisms whereby 
carbohydrates can stabilize proteins are not completely 
understood, but may be due in part to the formation of 
hydrogen bonds near the protein surface and preven¬ 
tion of nonspecific protein aggregation. Materials 
based on polysaccharides also tend to be generally 
compatible with proteins and mammalian cells. Most 
polysaccharides are biodegradable, via enzymatic 
hydrolysis of the glycosidic bonds, yielding biocom¬ 
patible monosaccharide and oligosaccharide degrada¬ 
tion products. This biodegradability can be altered by 
synthetic modifications that alter enzyme-substrate 
specificity, or by covalently cross-linking polysaccha¬ 
ride-based materials. 

Polysaccharides perform a variety of important bio¬ 
physical functions ranging from providing structural 
rigidity to regulating the biomechanics of mammalian 
tissues (Boddohi and Kipper 2010; Kipper and 
Almodovar 2011). The ► glycosaminoglycans in par¬ 
ticular also have many important biochemical func¬ 
tions. They potentiate enzyme activity and regulate 
growth factor signaling in wound healing, cell differ¬ 
entiation, and morphogenesis. The ability of sulfated 
polysaccharides to inhibit blood coagulation by inter¬ 
action with coagulation enzymes is an important 
function of the endothelial lumen of blood vessels. 
This activity allows heparin to be used as an anticoag¬ 
ulant, and is the reason that surface coatings based on 
sulfated polysaccharides have been used for blood¬ 
contacting biomaterials. ► Glycosaminoglycans also 
bind and stabilize many growth factors and bind 
some growth factor receptors. This enables them to 
regulate signaling pathways involving growth factors. 
Hence, they have been proposed as materials for 
growth factor delivery (Boddohi and Kipper 2010). 


Chitosan and chitin have antimicrobial and antifungal 
activity. Chitosan and chitin may also promote healthy 
wound healing by influencing cytokine profiles during 
different stages of the wound healing process 
(Muzzarelli 2009; Boddohi and Kipper 2010). 
Chitosan has also been shown to facilitate transport 
across epithelia by opening tight junctions. Chitin, 
glucans, and glycuronans have been shown to be 
potent immunomodulators, demonstrating both anti¬ 
inflammatory and pro-inflammatory properties in dif¬ 
ferent studies (Wagner and Kraus 2000; Catalli and 
Kulka 2010). These contradictory results warrant fur¬ 
ther investigation into the multiple possible modes of 
action of these polymers in immunity and 
inflammation. 

Processing of Polysaccharides to Make 
Biomaterials 

The aqueous solubility of most polysaccharides per¬ 
mits easy processing, and many polysaccharides can 
be readily processed into gels, films, fibers, and other 
structures. The neutral polysaccharide agarose forms 
a thermoreversible gel at low temperatures (below 
about 30°C), which readily dissolves in water at higher 
temperatures (Barbotoin and Nava Saucedo 1998). 
Negatively charged polysaccharides, such as alginate, 
pectins, and K-carrageenan can be ionically 
cross-linked with divalent or trivalent cations 
(usually Ca^'^)to form gels (Barbotoin and 
Nava Saucedo 1998). Chitosan can also form gels 
by pH-induced gelation or by cros-slinking either 
covalently (e.g., by genipin or glyceraldehyde) or 
ionically (e.g., by tripolyphosphate or alginate). Cells 
and proteins immobilized in gels or microgel particles 
have found applications ranging from microbial 
bioreactors, to tissue engineering. These gels permit 
transport of small molecules, but might be used to 
protect their payload from enzymatic attack and 
immune surveillance, or they might be used to control 
the micromechanical environment and thereby 
influence cell phenotype. 

Polysaccharide films can be formed by a number of 
methods. Solvent evaporation can be used to form both 
surface coatings and free-standing films. Nanoscale 
thin films of polysaccharides can also be formed 
by the layer-by-layer deposition of polyelectrolyte 
multilayers (Boddohi and Kipper 2010). These 
polyelectrolyte multilayers permit very precise control 
over surface chemistry and micro-/nanomechanical 
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Polysaccharide Biomaterials, Fig. 2 Scanning electron 
micrographs of electrospun chitosan fibers used as a tissue scaf¬ 
fold for growing adult mesenchymal stem cells (MSCs). Scale 


bars on both images are 20 pm. Image on the right is a higher 
magnification of the area in the box on the left 


properties of biomaterials surfaces. Hence, this 
technique has been used to introduce the biological 
function of polysaccharides to a variety of surfaces, 
including glasses, polymers, metals, and hydrogels. 

In the held of tissue engineering, polysaccharides 
have been used to form both porous scaffolds, and 
woven and nonwoven fabrics. Porous sponge-like scaf¬ 
folds are formed by freezing and lyophilizing 
a polysaccharide solution. The formation of ice crystals 
during freezing generates a phase-separated structure. 
Lyophilization to remove the ice crystals leaves behind 
a porous polysaccharide sponge. Pore size, and other 
parameters, such as pore orientation and connectivity, 
can be modulated by controlling the thermal gradients 
during the freezing process. Another strategy for 
forming highly porous scaffolds that has gained recent 
attention is the electrospinning of nanohber and micro- 
hher mats (Fig. 2). Successful electrospinning requires 
appropriate solution and solvent properties, including 
solution conductivity, viscosity, surface tension, poly¬ 
mer entanglements, and solvent volatility. Chitosan, 
cellulose, and hyaluronan are polysaccharides that 
have been successfully electrospun. Fibers can also be 
formed using other polysaccharides blended with syn¬ 
thetic polymers that are more readily electrospun. 
Electrospun polysaccharide fibers have also 
been formed using blends of other biological polymers 
(e.g., silk hbroin) and composites with nanoparticles 
(e.g., carbon nanotubes). The composition of these 
blends and composites can be used to tune biochemical 
and biomechanical properties of the resultant hbers. 


Summary 

Polysaccharides have enjoyed widespread use in bioma¬ 
terials due to their relative biocompatibility, ease of 
synthetic modihcation, and many valuable biochemical 
functions. Furthermore, many polysaccharides can be 
readily processed into useful structures, including 
films, gels, composites, porous scaffolds, microparticles, 
and nanoparticles. Many polysaccharides are excellent 
materials for stabilizing proteins and are therefore ideal 
candidates as carriers for therapeutic proteins. The abil¬ 
ities to modulate immune responses, target specific cell 
surface receptors, and facilitate transport across tight 
junctions have led to a number of applications in the 
oral and mucosal delivery of biopharmaceuticals and 
vaccines. Wound dressings and tissue engineering scaf¬ 
folds based on chitosan and chitin gels, hbers, and hlms 
have been very successful at promoting healing and 
tissue growth in organs including skin, nerve, bone, 
cartilage, and many others (Muzzarelli 2009). Polysac¬ 
charides also appear as coatings for many biomedical 
devices. These coatings introduce anticoagulant, antimi¬ 
crobial, and antifungal activity. Depending upon their 
chemistry and micromechanical properties, these coat¬ 
ings might be used to either promote or inhibit cell 
adhesion, or they might be used to stabilize and deliver 
therapeutic small molecules or proteins. 

New applications of polysaccharides in biomate¬ 
rials will be developed as our ability to tune their 
chemistry and consequent biochemical function 
improves. More importantly, advances will also be 
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dependent on our understanding of their sequence and 
function diversity, and how that sequence diversity can 
be exploited to introduce multiple biological functions 
into biomaterials. Natural polysaccharides from organ¬ 
isms from bacterial, fungal, plant, and animal sources 
all have valuable properties. Controlling and 
exploiting these properties also requires creative 
development of techniques to tune the structure and 
assembly of polysaccharides, in particular at the 
microscale and nanoscale, where cellular interactions 
are governed by feature size and shape. 
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Synonyms 

Carbohydrates 


Introduction 

Polysaccharides in common with other large carbohy¬ 
drate polymers - including mucin glycoproteins 
► mucin biophysics and nucleic acids - present 
a different challenge to proteins when it comes to the 
evaluation of their biophysical properties. This is not 
only because of their generally larger size (reaching to 
molar masses > 50 x 10® g/mol) but because of their 
far greater non-ideality (through molecular co-exclu¬ 
sion and charge effects - as represented by virial coef¬ 
ficients) and greater diversity of shapes and flexibilities. 
The primary structure of polysaccharides is not coded 
by a genetic template so they are also generally highly 
polydisperse (as represented by their molar mass distri¬ 
bution). Their primary structure is also very different 
compared with proteins, with usually no more than three 
different types of monosaccharide residue present, 
although the glycosidic linkages are much more diverse 
compared with the peptide bond, for example, 1^1, 
1 ^2,1^3,1^4,1^6, 2^4 links (where the numbers 
refer to positions of carbon atoms) with the residues in 
either a or P forms and either l or d enantiomers. 


Types of Primary Structure 

Those polysaccharides which are polymers of the same 
building block or monosaccharide residue are known 
as homopolysaccharides, those which are made up of 
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Polysaccharides: Biophysical Properties, Fig. 1 Polysac¬ 
charide primary structure types (Adapted from Shipton 2004) 


more than one are referred to as heteropolysaccharides 
(Shipton 2004). Ligure 1 shows the variety of primary 
structure types - or sequence of monosaccharide resi¬ 
dues - that can be found. Obtaining such sequence 
information - and the links between the residues - 
involves a combination of techniques such as gas- 
liquid chromatography (GLC) and nuclear magnetic 
resonance (NMR) (Penman and Sanderson 1972; 
Salomonsen et al. 2009) together with possibly 
Raman spectroscopy and infrared spectroscopy 
(Salomonsen et al. 2008) reinforced by mass spectros¬ 
copy measurements (Deery et al. 2001; Hsu et al. 
2007). It is this diversity in the sequence and linkage 
which leads to the great diversity in conformational 




p 


1908 


Polysaccharides: Biophysical Properties 


and functional properties of polysaccharides (Shipton 
2004; Tombs and Harding 1998). 

Molar Mass and Molar Mass Distribution 

In terms of functionality, besides the chemical compo¬ 
sition and primary structure, the molar mass M and 
molar mass distribution /(M) versus M are regarded 
as two of the most important biophysical properties of 
a polysaccharide. An o/;gosaccharide is generally 
regarded as having 2-10 residues linked by 
glycosidic bonds, whereas >10 residues a structure is 
regarded as a po/jsaccharide, i.e., with a molar mass 
> 2,000 g/mol. Polysaccharides usually have a quasi- 
continuous type of distribution of molar masses which 
can be described either directly by an fiM) versus M 
plot - where/(M) is defined as the population (weight 
fraction) of species with a molecular weight between 
M and dM - or by the ratio of two average molar 
masses or “molecular weights” (either MJM^, the 
ratio of the weight to number average, or MJM^, the 
ratio of the z-average to the weight average molar 
mass, with > Mw > M„). These ratios can be 
related to the width of the distribution using relations 
originally given by Herdan (1949). Distributions are 
often of the log-normal type and for these M^/M^ = 
MJM^ (Fujita 1962). 

Polysaccharides also show a wide diversity of molar 
masses ranging, for example, from ^ 2,000- 
20,000 for heparins to >50 x 10® for amylopectin. 
They often show a broad molecular weight distribution 
(► Glycoconjugates in Cell Function and Therapeu¬ 
tics), although for commercial applications - especially 
for pharmaceutical use - narrower distributions can be 
produced by preparative column chromatography. 

A popular method of characterizing weight average 
molecular weight and molecular weight distribution is 
SEC-MALs (size-exclusion chromatography coupled 
online to multi angle light scattering) and the related 
SEC-FFF (.... field flow fractionation) (► Multiangle 
Light Scattering from Separated Samples (MATS with 
SEC or FFE)). The column (or EEE membrane) facil¬ 
itates separation and the light scattering photometer 
measures molecular weight absolutely without the 
need for comparison with standards of known molec¬ 
ular weight. These combined MALs methods work 
well for many polysaccharides provided they are 
within the separation range and there are no associative 


interactions with the column (or membrane): this 
can be problematic particularly for some cationic 
(e.g., chitosans, amino celluloses) and some neutral 
(e.g., glucomannans) polysaccharides. An alternative - 
without the need for columns or membranes - is 
analytical ultracentrifugation (AUC). Sedimentation 
equilibrium AUC is an absolute method where the 
final steady state solute distribution (after the forces of 
sedimentation and diffusion have come to equilibrium) 
can provide an estimate of and using routines 
such as MSTAR and MEIT specially designed for the 
analysis of heterogeneous systems (► Glycoconjugates 
in Cell Function and Therapeutics, and >■ Sedimenta¬ 
tion Equilibrium Analytical Ultracentrifugation). 
Although MJM^ will define the width of the distribu¬ 
tion, a more effective way of characterizing polydisper- 
sity is using sedimentation velocity AUC - at much 
higher speeds compared to sedimentation equilibrium. 
Modem analysis methods such as the SEDEIT program 
provide a distribution of sedimentation coefficients 
(Eig. 2a) which by themselves give a good measure of 
polydispersity. These can be transformed into 
a molecular weight distribution, if the approximate 
conformation is known. 

Conformation 1: Overall Shape, Flexibility, 
and Hydration 

Polysaccharides have a wide range of shapes and flex¬ 
ibilities ranging from compact highly branched struc¬ 
tures (glycogen, amylopectin) to rigid rod shapes 
(schizophyllan, scleroglucan, xanthan) and very flexi¬ 
ble random coils (dextrans, pullulans) (Tombs and 
Harding 1998). A common measure of flexibility is 
the persistence length Lp, and these can range from 
^2 nm for pullulans to ^200 nm for schizophyllan 
(► Glycoconjugates in Cell Eunction and Therapeu¬ 
tics). These can be estimated using combinations of 
viscosity, ultracentrifuge, and light scattering mea¬ 
surements using global methods such as HYDEIT 
(Ortega and Garcia de la Torre 2007) and these “solu¬ 
tion” biophysics approaches reinforce imaging 
methods such as electron microscopy (Stokke and 
Elgsaeter 1991) and atomic force microscopy 
(Inngjerdingen et al. 2008). 

A more general representation of the conformation 
type is the use of the power law or “Mark-Houwink- 
Kuhn-Sakurada” (MHKS) coefficients from the 
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M (g/mol) 

Polysaccharides: Biophysical Properties, Fig. 2 Size 
distributions of polysaccharides from sedimentation velocity in 
the analytical ultracentrifuge (a) sedimentation coefficient 
distribution of starch showing the separated amylose (12.5 S) 
and amylopectin (79 S) components (Adapted from Tester et al. 
2006); (b) molecular weight distribution of an alginate (Adapted 
from Harding et al. 201 lb) 

characteristic way hydrodynamic parameters vary with 
molecular weight (Harding et al. 1991): 

[p] = s = kM'" Rg = KgM^D = KdM‘‘ (1) 

where [ 77 ], s, Rg, and D are the intrinsic viscosity 
(► Intrinsic Viscosity), sedimentation coefficient 
(► Sedimentation Velocity Analytical Ultracentrifu¬ 
gation), radius of gyration (► Multiangle Light Scat¬ 
tering from Separated Samples (MALS with SEC or 
FEE), ► Complement Factor H and Its Ligands: A 
Multidisciplinary Approach to Interactions and 
Modeling, ► Neutron Scattering of Membranes), and 
translational diffusion coefficient (► Dynamic Light 
Scattering), respectively, k^, k^. Kg, and Kq are the 
corresponding pre-exponent factors and a, h, c, d the 
MHKS coefficients, with characteristic values for 


different conformations (► Glycoconjugates in Cell 
Function and Therapeutics). Further, if k^ and h are 
known or can be approximated it is possible to 
transform a sedimentation coefficient distribution g(j’) 
versus s into a molecular weight distribution /(M) 
versus M (► Glycoconjugates in Cell Function and 
Therapeutics) (Fig. 2b). 

Polysaccharides generally have a much larger 
capacity than proteins to interact with water or aqueous 
solvent, and intrinsic viscosity measurement, for 
example, provides an overall idea of the extent of 
hydration or interaction of a polysaccharide with 
water, although interpretation can be complicated by 
the contribution to the intrinsic viscosity from confor¬ 
mation (Harding 1997). NMR has also been used to 
probe polysaccharide hydration (Corzana et al. 2004) 
and polysaccharide hydrogels have been studied by 
neutron scattering and fluorescence recovery after 
photobleaching (FRAP) (Cavalieri et al. 2006), 
supplementing more traditional rheological 
approaches. 

Conformation 2: Ordered Structures 

In some cases regions of chains (on the same chain, 
called intrachain attraction - or an adjacent 
chain, called interchain attraction) can attract each 
other via hydrophobic, hydrogen bond or ionic inter¬ 
action, or aided by ligands such as calcium ions to form 
an ordered or regular structure. Classical examples are 
the formation of intrachain helices in amylose, 
polymannuronic, and polyguluronic acids, the 
interchain calcium ion linked “egg-box” structures 
from pectins (or alginates) linking buckled ribbon 
polyanionic chains together leading to gel formation 
and the formation of interchain double helices 
in carrageenan and xanthan, and triple helices in 
schizophyllan, leading to considerable chain stiffness 
and high persistence lengths (^200 nm) (Shipton 
2004; Tombs and Harding 1998). 

Interactions 1: Self-association 

Some polysaccharides can form double or triple heli¬ 
ces - these are examples of a self-association stabilized 
by, for example, hydrogen bond formation. Calcium 
ion induced gelation through the egg-box mechanism 
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(pectins, alginates) and extended ribbons aggregating 
to form microcrystalline structures again stabilized by 
hydrogen bonds (cellulose and chitin) are also exam¬ 
ples of very strong self-associative phenomena 
(Shipton 2004; Tomhs and Harding 1998). 

Some polysaccharides show evidence for weak and 
reversible self-association interactions with molar 
dissociation constants A'j > 10 pM (► Sedimentation 
Equilibrium Analytical Ultracentrifugation). Good 
examples are arabinoxylans which appear, on the 
basis of sedimentation velocity experiments (concen¬ 
tration dependence of the sedimentation coefficient), 
to self-associate very weakly with a dissociation con¬ 
stant in excess of 100 pM (Patel et al. 2007). These 
arabinoxylans also show “bioactivity” or immunosti- 
mulatory behavior and intriguingly such weak interac¬ 
tion strengths are comparable with the weak 
interactions between cell surface proteins and glyco¬ 
proteins involved in molecular recognition interactions 
(van der Merwe and Davis 2003). 

Interactions 2: Complex Formation 

Polysaccharides can also interact with other polymeric 
materials - including other polysaccharides - to give 
rise to complex formation. These can also range from 
weak reversible interactions (Fig. 3a) to very strong 
and irreversible complexes. An example of the former 
includes weak interactions between galactomannans 
and wheat protein gliadin digests - an interaction of 
potential use for the alleviation of wheat protein intol¬ 
erance problems in coeliac disease (Seifert et al. 1995), 
as probed by hydrodynamic methods - and an example 
of the latter includes the large irreversible micron size 
complexes formed in mucoadhesive interactions 
between polycationic chitosans and polyanionic 
mucins, where hydrodynamics, electron microscopy, 
and atomic force microscopy have provided powerful 
insights (► Mucin Biophysics) (Deacon et al. 2000) 
(Fig. 3b). Another powerful technique that can be 
used to probe the strength and kinetics of complex 
formation is Surface Plasmon Resonance (SPR) 
Haseley et al. 2002) and has, for example, been used 
to study the interactions of heparan sulfate and heparin 
with lipoproteins (Lookene et al. 1997). 

Other important interactions include the synergistic 
interactions between mixtures of konjac glucomannan, 
xanthan, and alginate (Harding et al. 2011a) as probed 



Polysaccharides: Biophysical Properties, Fig. 3 Polysac¬ 
charides interactions range from very weak - (a) as in weak 
self-association of arabinoxylans (Patel et al. 2007) - as found 
by analytical ultracentrifugation and (b) to very strong irrevers¬ 
ible complexes as in mucoadhesive chitosan interactions with 
mucus glycoproteins, as revealed by atomic force microscopy 
(Deacon et al. 2000) showing particle of size consistent with 
analytical ultracentrifuge measurements 


by analytical ultracentrifugation, NMR, and rheologi¬ 
cal methods. These interactions, which are ionic 
strength dependent, appear to underpin the use of 
appropriate combinations of these materials in sati¬ 
ety/dietary products. 

Many polysaccharides are able to form gels: often 
the gelation process is reinforced by the presence of 
ligands, these can be nonionic such as sucrose in the 
gelation of high-methoxy pectin, or ionic in the case of 
the calcium ion induced gelation of low methoxy pec¬ 
tins (Shipton 2004) and the gels can be reversible or 
irreversible (Ross-Murphy 1998). The ligands can also 
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be biomolecular, as in the use of mixed polysaccharide 
gels (e.g., locust bean gum-xanthan) and polysaccha¬ 
ride-protein gels (Harding et al. 1995). 

Low molecular weight and low-polydispersity 
heparins/heparan sulfates - as small as 
decasaccharides - have been extensively studied with 
regard to their interactions with proteins and the role 
these interactions play in multicomponent signaling 
systems, including their role in growth factor complex- 
ation and interaction with agonists such as NKl. 
A whole range of biophysical techniques have been 
used to study these interactions from mass spectros¬ 
copy and analytical ultracentrifugation, solution x-ray 
scattering, surface plasmon resonance, and x-ray 
crystallography (Harmer et al. 2004; Youles et al. 
2008; Lietha et al. 2001; Gherardi et al. 2006). These 
techniques provide stoichiometry, interaction strength/ 
kinetic data, and conformations/structural data, and 
these studies - centered around a polysaccharide sys¬ 
tem - provide an outstanding example of how powerful 
a multidisciplinary approach can be used in our bid to 
understand the biophysical basis behind human 
disease. 

Polysaccharides require a somewhat different bio¬ 
physical approach compared with proteins, largely 
because of their physical heterogeneity - their struc¬ 
tures are not coded directly by genes - and because of 
their far wider diversity of conformations and proper¬ 
ties. Advances in both hydrodynamicals and scatter¬ 
ing, together with advances in imaging techniques are 
providing great insights in terms of their polydisper- 
sities, their conformation and flexibilities, their 
volume and hydration, and their interactions, proper¬ 
ties which underpin their function in nature and in the 
application/potential application of these substances in 
biotechnology. 

Cross-References 

► Complement Factor H and Its Ligands: 

A Multidisciplinary Approach to Interactions and 
Modeling 

► Dynamic Light Scattering 

► Glycoconjugates in Cell Function and Therapeutics 

► Intrinsic Viscosity 

► Mucin Biophysics 

► Multiangle Light Scattering from Separated 
Samples (MALS with SEC or FFF) 


► Neutron Scattering of Membranes 

► Sedimentation Equilibrium Analytical 
Ultracentrifugation 

► Sedimentation Velocity Analytical 
Ultracentrifugation 
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Positron Emission Tomography 
Methodology 

Terry Jones 

The PET Research Advisory Company, Wilmslow, 
Cheshire, UK 

Introduction 

Positron emission tomography (PET) procedures and 
applications are outlined and the concept of the 
imaging science of PET is defined. Within this science, 
the role of PET methodology is identified, its compo¬ 
nents are explained, and the contribution made to the 
development of the use of PET is illustrated. 

What is Positron Emission Tomography? 

Positron emission tomography (PET) involves the 
production, usually with a cyclotron, of a positron- 
emitting radionuclide, which is used to radiolabel a 
“tracer” molecule. The tracer may be a specific imag¬ 
ing agent of a given biological pathway or molecular 
interaction, or may be a pharmaceutical. Upon admin¬ 
istering the radiolabeled tracer molecule, its distribu¬ 
tion within the body can be recorded in three 
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Positron Emission 
Tomography Methodology, 
Fig. 1 The spectrum of 
medical imaging illustrating 
how pet’s high sensitivity 
and specificity makes it 
unrivaled for imaging specific 
molecular interactions and 
pathways (Reproduced with 
permission from the European 
Journal of Nuclear Medicine 
and Molecular Imaging) 



dimensions and over time using a positron emission 
tomograph. Within the spectrum of the different 
modalities used for human imaging, as defined by the 
amount of material that is required for imaging the 
human body, the sensitivity of PET is unrivaled. Its 
specificity for molecular imaging in humans is also 
unrivaled, since PET allows the ability to image 
a specific molecule because it has been selectively 
labeled (Eig. 1: Jones 1996a). 

Data within PET scans can be analyzed to generate 
quantitative uptake of the tracer in tissues and its rates 
of exchange between tissue compartments. These trans¬ 
late into quantitative parameters of tissue function, such 
as metabolic rates, concentrations of molecular binding 
sites, and drug uptake. The uptake scans and distribu¬ 
tions of rate constants can be presented as cross- 
sectional or longitudinal sections through the body. To 
aid in the clinical interpretation of tissue function, PET 
scans can be superimposed on anatomic images such as 
those generated by computed tomography (CT) or mag¬ 
netic resonance imaging (MRl). 

In experimental medicine, PET procedures are used 
as investigative tools to research regional tissue 
disease, and to evaluate novel drug mechanisms of 
action and treatment efficacy (Saleem and Price 
2007; Jones and Price 2012). In health care, PET pro¬ 
cedures are used for diagnosis and treatment 


monitoring (Valk et al. 2003; Facey et al. 2007). Oper¬ 
ationally, PET is a multidisciplinary specialty, requir¬ 
ing the expertise of specialized chemists, physicists, 
biologists and clinicians in its methodologies. The 
composite activity and the interdependence of these 
multidisciplines have been critical to the research and 
development of PET methodology in the “imaging 
science of PET” (Fig. 2; Jones 1996b). By asking 
appropriate clinical research and health care questions, 
this collaboration has helped to stimulate methodolog¬ 
ical and technical advances (Jones and Price 2012) that 
have improved the overall accuracy of the resulting 
data. Here, the explanation of PET methodology is 
confined to the physical recording, processing, analy¬ 
sis, and presentation of PET scans. 

Recording of PET Scans 

Coincidence Detection 

Examples of positron-emitting radionuclides used for 
PET are Oxygen-15 (2.1-min half-life), Nitrogen-13 
(10-min half-life), Carbon-11 (20.1-min half-life), 
Fluorine-18 (108.8-min half-life), Rubidium-82 (76-s 
half-life), and Gallium 68 (68-min half-life). Longer- 
lived radionuclides such as Iodine-124 (4.1-day 
half-life) or Zirconium-89 (78.4-h half-life) may also 
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R&D Components of PET Imaging 


Positron Emission Tomography Methodology, Fig. 2 The research and developments that contribute to the accuracy of the 
derived PET image which serves to illustrate the Imaging Science of PET 


be used. Following the administration of a tracer 
molecule labeled with one of these positron emitters, 
the measurement of its uptake within body tissues 
exploits the physics of positron decay. When 
a positively charged positron is emitted following the 
radionuclide’s radioactive decay, it is “captured” by an 
electron. As a result, both particles annihilate. Their joint 
mass energies are transferred into two gamma ray pho¬ 
tons (each of 511 keV in energy) which are emitted at 
approximately 180° to each other. By placing radiation 
detectors on either side of the body, a simultaneous coin¬ 
cident recording provides the discrete information that 
a positron was captured between the two detectors. Sur¬ 
rounding the body with a mosaic of small detectors that 
can record coincidences (Fig. 3), means that many lines 
of response can he recorded. This provides the basic data 
which can be used to reconstmct the 3-dimensional (3D) 
tomographic distribution of the positron-emitting radio¬ 
nuclide within the body. 

Since the PET scanner is electronically collimated 
by using coincidence detection, it has a high solid angle 
of detection and hence high intrinsic sensitivity. This is 


unlike the case of single photon detection, where lead 
or tungsten multi-holed collimators are needed to 
achieve spatial resolution and as a result screen out 
a substantial amount of potential signal. Scintillating 
crystals are used to detect the incident gamma ray 
photons by absorbing and transferring their energy to 
light emissions, which in turn are detected hy posi¬ 
tional sensitive photon detection systems. Because of 
the high energy of the 511 keV photons, their detection 
requires the scintillating crystals to be dense and deep. 
The nature of the light emitted from this scintillation 
process has a critical bearing on the usefulness of 
a given scintillating crystal. The amount of light emit¬ 
ted per photon interaction in the crystal needs to he as 
high as possible for accurate positional location and to 
enable the crystal element to be as small as possible to 
maximize spatial resolution. The energy spectrum of 
the emitted scintillating light needs to be as narrow as 
possible to minimize the recording of scattered photons 
with lower energy. The rise time of the emitted scintil¬ 
lating light, upon which the coincident event is based, 
determines the coincident time resolution. This needs 
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Positron Emission 
Tomography Methodology, 
Fig. 3 The coincidence 
detection principle is 
illustrated as is the block- 
based structure of the arrays of 
detector elements that 
surround the subject being 
imaged. This depicts the first 
generation of block-based 
detectors which was 
composed of two rings of 
blocks. In modem PET 
scanners at least four rings are 
used 



to be as short as possible to minimize the recording of 
random coincidences and to enable individual photon 
“time of flight (TOF)” information which improves 
image reconstruction. The decay time of light from 
the scintillation event needs to be as fast as possible 
in order to clear the event from the crystal and thus 
minimize detector pileup, dead time, and hence losses 
of effective sensitivity. 

A practical and cost-effective means for constructing 
the mosaic of coincident detectors for positron emission 
tomographs has been key to the development of PET 
scanners. The most successful approach has been the 
construction of a block of 169 detector crystal elements 
contained within an 80 mm x 80 mm exposed surface 
and 20-mm deep whose light output is viewed by four 
photomultipliers. By sharing the light between the four 
photon sensors, it is possible to position the event on the 
crystal block to an individual crystal element (Casey 
and Nutt 1986). The current scintillating crystal material 
of choice for PET is lutetium oxy orthosilicate (LSO) 
(Melcher and Schweitzer 1992). This is the principal 
material that is used today in high performance PET 
cameras for the whole body, brain, breast, and for small 
animal PET imaging. 

Correcting for Unwanted Coincidences 

There are two sources of unwanted coincidence data in 
PET which need to be corrected for: random and 


scattered coincidences. Random coincidence occurs 
when a coincidence is registered within the imager’s 
time coincidence window, but did not arise from 
a common nuclear disintegration. This usually occurs 
when there is a high photon flux within, or projecting 
into, the scanner’s fields of view. Over a given time 
period, the random coincidence background can be 
separately monitored by delaying one of the coinci¬ 
dence arms by a longer period compared with the true 
coincidence time window of typically 5-10 ns. 
Coincidence events registered after the delay are 
taken to be the constant background of random coinci¬ 
dences. This is then registered and simply subtracted 
from the true plus random coincidences recorded in the 
coincidence time window to give the net true coinci¬ 
dence rate. Scattered coincidences occur when one of 
the photons arising from a common nuclear disintegra¬ 
tion is scattered within the body. When the scattered 
photon emerges from the body and is detected, the line 
of coincidence registered is not through the original 
point of the positron’s electron capture. This leads 
to a background of scattered events superimposed 
on the true coincidence profiles. Correction may be 
achieved by recorded coincidences that occur 
outside the body’s profile, and therefore must be 
false. These are used to model the scatter profile 
through the body which is then subtracted from the 
total number of registered coincidences. Although 
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corrections for both random and scattered coinci¬ 
dences are robust, their subtraction does statistically 
deteriorate the true coincidence signal (Strother et al. 
1990). For this reason, every effort is made to reduce 
their occurrence to a minimum. This is achieved by 
operating with the narrowest of coincidence time win¬ 
dows to screen out random coincidences, and the 
highest possible energy threshold for triggering 
a coincidence event to screen out scattered events 
which have reduced energy. 

Quantitative Imaging 

During their trajectory from the disintegrating 
radionuclide to the PET detectors, some of the emitted 
photons will be scattered or absorbed in the body. 
A correction needs to be made for this “tissue 
attenuation effect” to ensure image quality, and that 
quantitative values of tissue tracer concentrations can 
be measured from the resulting tomogram. For both of 
the pair of emitted photons to emerge unattenuated, 
together they have to traverse the sum of the total 
body’s thickness. Thus, the probability tissue length 
for losing a coincidence event is the total thickness of 
the body between the two coincident detector 
elements. To capitalize on this, a ring, of a diameter 
greater than a patient’s body, containing a positron- 
emitting radionuclide is inserted into the tomograph. 
Coincidence data sets are then recorded with and 
without the patient being located in its held of view. 
The ratio of the coincidences recorded between no 
patient present and when the patient is positioned in 
the tomograph provides attenuation factors for each 
line of response. This produces a transmission scan, 
which results from the total body thicknesses 
and hence provides correction factors for each of the 
lines of response within the subsequently recorded 
emission data. As a result of this universal correction, 
emission scans can be corrected for attenuation as if 
the body were transparent to the emitted photons and 
the picture element response within the reconstructed 
tomogram can be calibrated against a laboratory stan¬ 
dard, e.g., a well counter. This involves imaging 
a phantom filled with a solution of radioactivity, an 
aliquot of which is withdrawn and its concentration 
measured in a well counter. This cross-calibration 
means that tissue concentrations can be measured in 
the same units as blood samples, or from the absolute 


amount of activity administered to the subject 
(Woodard et al. 1975). 

Using quantitative data derived from tissue and 
blood samples measured in the same units, it is possi¬ 
ble to model the blood and PET time course data to 
derive rate constants of exchange between the blood 
and regional tissues. Where high temporal resolution 
of the blood concentrations is required, blood 
withdrawn from an arterial catheter is continuously 
withdrawn over an online detector, which in turn is 
calibrated in the same units as that of the scanner’s 
picture elements response. One important qualification 
when making quantitative measurements of tissue 
tracer concentrations is that the physical size of the 
tissue of interest needs to be at least half the spatial 
resolution of the PET scanner - namely half the Full 
Width at Half the Maximum Height (FWHM) of the 
line spread function (Hoffman et al. 1979). Where this 
condition is not met, e.g., for some of the smaller 
structures in the brain, a correction for this partial 
volume effect is made from an independent measure¬ 
ment of the structure’s size using magnetic resonance 
imaging (MRI). 

PET Image Reconstructions 

Initially the lines of coincidence data recorded were 
converted into trans-axial tomographic images by 
filtering and back projecting the projection data. In 
the interests of reducing the noise in reconstructed 
images, iterative reconstructions and in particular 
those based on maximum-likelihood expectation max¬ 
imization (ML-EM) (and its accelerated version of 
ordered subset expectation maximization (OSEM)) 
have been adopted (Shepp and Vardi 1982). 
Reconstructing the data recorded from all possible 
coincidence lines of response has required extensive 
development to create what is known as 3D PET 
(Bendriem and Townsend 1998). A major restriction 
to realizing high sensitivity and spatial resolution in 
PET cameras has been that when a photon enters 
a mosaic of detector elements, it may enter through 
one element, deposit its energy at an unknown depth 
within that element, or it may exit to deposit energy in 
an adjacent element. This problem is exacerbated as 
the ring diameter of the tomograph is reduced and its 
axial length extended in the interest of increasing 
sensitivity. This parallax problem has been addressed 
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Tomography Methodology, 
Fig. 4 A kinetic illustration 
of the steps that a tracer/ligand 
traverses to reach a specific 
tissue-binding site. From 
measurements of the plasma 
and tissue time courses, it is 
possible to model the data to 
derive the rate constants of 
exchange shown from which 
quantitative measures of the 
density of the receptor to 
which the ligand binds can be 
determined 



by developing the means to provide information on the 
depth of interaction in the detector that registers an 
event. A more recent and pragmatic approach is to 
measure, for each possible pair of coincident detectors, 
the spatially dependant point spread functions which 
characterize this parallax effect. This library of a priori 
information can then he used in the reconstruction 
process. The result is known as High Definition PET 
(HD-PET), where all possible coincidences may be 
recorded in a tomograph without loss of spatial reso¬ 
lution. This achieves maximum detection efficiency, 
less statistical uncertainty (noise) within the 
reconstructed image, and increased spatial resolution 
(Sureau et al. 2008). Spatial resolutions as low as 2 mm 
have been reported for a body PET scanner and 1.3 mm 
EWHM measurements are reported for a dedicated 
brain scanner using this correction. LSO-based PET 
cameras have now been refined so that their electronic 
readouts are sufficiently fast and stable to capitalize on 
the differences in the arrival times of the two emerging 
gamma rays at the two coincident detectors. The differ¬ 
ence, registered using TOE recording, provides addi¬ 
tional information which translates into statistical gain 
within the reconstructed images. Incorporating TOE 
into the reconstructions has recently demonstrated 
promising results (Kadrmas et al. 2008). On 
a practical level, advances have been made on 
correcting for patient motion, especially due to respira¬ 
tion where gated PET scans have significantly 
improved lesion detection within the lung. 


PET Image Presentation 

PET images were presented initially as trans-axial 
tomograms, but as body surveys were undertaken to 
detect cancer spread, the data were much more 
comprehensively viewed as whole body longitudinal 
images (Dahlbom et al. 1992). Combining PET with 
X-Ray CT has made a significant impact in interpreta¬ 
tion by superimposing anatomical information upon 
functional PET data (Townsend 2008). Eew whole 
body PET scanners are currently sold without 
being combined with X-Ray CT. Effort is currently 
focusing on combining PET and MRI. This is proving 
challenging as photomultipliers are affected hy the 
magnetic field. To this end, avalanche photodiodes 
are being used for block detector readouts, and more 
recently silicon-photo multipliers (SiPM) look partic¬ 
ularly promising for this application as they match 
standard photomultiplier performance and have less 
dead time. 

Quantitative PET Data Analysis 

The simplest form of quantitative PET data analysis is 
the use of the Standardized Uptake Value (Woodard 
et al. 1975). Eor ^**E-labeled Huorodeoxyglucose 
(18E-FDG) which is commonly used for detecting and 
staging cancer as well as following treatment response, 
consensus guidelines for its quantification have been 
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derived within the European Organization for the 
Research and Treatment of Cancer (Young et al. 1999). 
Using blood or plasma concentration time course in 
conjunction with kinetically recorded PET scans, ana¬ 
lyses have been developed based on compartmental 
models (Carson 2003). They have been implemented 
especially for quantifying brain neuroreceptors, for 
which rate constants of exchange translate into values 
of receptor density (Fig. 4). Where no clear single kinetic 
compartmental model exists, -e.g., in diseased 
tissue such as a tumor - a data-led approach has been 
adopted to define the kinetic exchange and principal 
components that are measurable in the data. Quantitative 
data are derived either on a region of interest (ROI) basis, 
or on a voxel element basis which provides parametric 
images. A significant advance was made when a method 
was introduced to identify regions in PET scans 
that revealed regional statistical differences between 
groups of studies or in correlations within external 
factors or stimuli. The technique, known as Statistical 
Parametric Imaging (SPM) (Friston et al. 1995), was 
initially developed to analyze PET brain scan data and 
is currently used extensively for functional MRI 
analysis. 

Conclusion 

As described, PET is a multidisciplinary modality. In 
addition to PET methodology, important contributions 
are needed in cyclotron technology; radiolabeling of 
tracer molecules; and in the discovery, development, 
and validation of imaging agents. With respect to the 
development and application of PET over the last 
25 years, by far the largest volume of PET studies has 
been in the use of *®FDG for cancer detection, staging, 
and treatment response. Much of this use of PET has 
been due in a large part to the major advances made in 
PET methodology, such as the high quality of quanti¬ 
tative whole body images of '“EDG uptake in cancer 
patients, superimposed on anatomical images. To fully 
capitalize on these advances, the field awaits the 
introduction of other methodological developments 
such as low cost, low radiation emitting, and simple- 
to-operate cyclotrons, together with simplihed low 
cost radiochemistry procedures operating under Good 
Manufacturing Practice (GMP). In addition, more 
specihc imaging agents need validating to extend the 
use of PET across internal medicine. 


Cross-References 

► Assignment of *'^F Resonances in Protein Solution 
State NMR Studies 

► Carbohydrate NMR Spectroscopy 

► MRI and PET Imaging: Clinical Applications 

► Glycoproteins 

► Protein-Protein Interactions: Kinetics 

► Live-Cell Single-Molecule Imaging 

► NMR Methods for Kinetic Analysis 

► Segmented Flow Microfluidics 
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Introduction 

Proteins are synthesized hy the ribosomes, which are 
located along the endoplasmic reticulum (ER). The 
process, by which proteins are synthesized, primarily 
based on an mRNA template, is referred to as transla¬ 
tion. The mRNA contains a trinucleotide genetic code 
which dictates the insertion of the specific amino acid 
and this code is translated by the ribosomes. The ribo¬ 
somes are formed by small and large units which 


surround the mRNA. The S unit is specialized in 
attaching an amino acid to the carboxy-terminal of 
the growing polypeptide and catalyzes the formation 
of the peptide bond. During peptide-bond formation, 
the alpha-amino group of the A-site aminoacyl-tRNA 
attacks the carbonyl carbon of the P-site peptidyl- 
tRNA to produce a new peptidyl-tRNA that is one 
amino acid longer at the A site and a deacylated 
tRNA at the P site. The second enzymatic activity 
associated with the peptidyl-transferase center is the 
hydrolytic cleavage of the ester bond in peptidyl-tRNA 
during termination of protein synthesis. A myriad of 
proteins exist which differ both in their primary 
sequence and posttranslational modifications. Ribo¬ 
somes solely present the activity to form the peptide 
bond, and therefore, proteins are directed elsewhere to 
suffer the posttranslational modifications. The activity 
of many proteins is dependent upon their posttransla¬ 
tional modifications and, therefore, posttranslational 
modifications are required for the activity and coordi¬ 
nation of vital processes. Alterations or loss in the 
regulation of posttranslational modihcations may lead 
to a myriad of diseases including cancer. This new 
concept that posttranslational modihcations are vital 
for maintaining physiological functions has recently 
earned attention, which previously was given solely 
to gene and protein expression levels. Moreover, new 
diagnostic venues are dedicated toward quantifying 
posttranslational modihcations (Kamath et al. 2011; 
Teeming et al. 2011; see also “► Mass Spectrometry 
of N-linked Carbohydrates and Glycoproteins”). 
Herein, the major posttranslational modihcations are 
discussed, as well as how they regulate vital cellular 
processes. 

Proteolysis 

Many proteins are synthesized as zymogens, an inac¬ 
tive enzyme precursor. Zymogens may contain inhib¬ 
itory sequences which maintain these proteins in an 
inactive form and require hydrolysis to be activated. 
Matrix metalloproteinases (MMPs) are synthesized in 
an inactive form, pro-matrix metalloproteinases, and 
are proteolytically activated by the sequential proteol¬ 
ysis of the pro-peptide blocking the active site cleft. 
The activation of metalloproteinases is the key level of 
regulation of these proteinases. The importance 
of proteolytic activation and inactivation of vital 
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physiological functions is evident in the proteolytic 
cascades of blood coagulation, the complement system 
activation, apoptosis where a signal is amplified by the 
quick on-off proteolytic switching and angiogenesis 
(Theopold et al. 2004; Jiravanichpaisal et al. 2006; 
Agarwal et al. 2010). Moreover, matrix 
metalloproteinase 1 cleaves many monocyte 
chemoattractant proteins converting these chemokines 
from agonists to antagonists (McQuibban et al. 2002). 

Proteins may also contain an N-terminal signal 
sequence which may direct the protein, for example, 
to the cell membrane or maintain the protein inactive, 
and the signal sequence is then proteolytically 
removed. Adrenomedullin, which can stimulate 
vasodilation, contains a sequence peptide that causes 
vasoconstriction; this domain is released by MMP-2 
cleavage (Martinez et al. 2004). The cleavage can 
release functional domains or may release ligands, 
such as cleavage of heparin affinity regulatory peptide 
by MMPs releasing VEGF (Dean et al. 2007). 

Glycosylation 

Glycosylation is the major posttranslational modifica¬ 
tion and greatly increases protein diversity. Glycosyl¬ 
ation plays innumerate roles in cellular and tissue 
events, from structural to its implication in different 
signaling pathways and recognition (see “► Carbohy¬ 
drate Antibiotics”; “► Carbohydrate Enzymology”; 
“► Glycoconjugates in Cell Function and Therapeu¬ 
tics”) (Hounsell). Two major types of glycosylation 
are N- and 0-linked glycosylation (see ► Glycopro¬ 
teins. In A-glycosylation, an A-acetylglucosamine res¬ 
idue is attached by an amide bond to a consensus 
sequence NX(S/T) at the asparagine residue. O- 
glycosylation may occur at any serine or threonine 
residue with no single common core structure or con¬ 
sensus protein sequence. The polymerization of the 
glycans uses several different glycosyl transferases 
with no template for glycosylation, which implies 
that the control of the process is related to the speci¬ 
ficity of the enzyme and its cellular location. 

N-glycosylation starts in the ER with the stepwise 
synthesis of a Glc3Man9GlcNAc2 structure, which 
is transferred to the NX(S/T) consensus motif 
of newly synthesized proteins. The first step is the 
transfer from A-acetylglucosamine (GlcNAc) 
phosphate from UDP-GlcNAc to membrane-bound 


dolichol monophosphate (Dol-P), forming 
GlcNAc-pyrophosphate dolichol (GlcNAc-PP-Dol) 
(Kelleher and Gilmore 2006). This step is followed 
by the addition of a GlcNAc and five mannose (Man) 
residues which are subsequently attached to this 
dolichol-linked monosaccharide in a stepwise manner. 
The lipid-linked heptasaccharide Man5GlcNAc2 is 
translocated via a flippase across the ER 
membrane and is elongated at the luminal side by the 
attachment of four further mannose residues and, sub¬ 
sequently, three glucose residues. The completed 
Glc3Man9GlcNAc2 oligosaccharide is then trans¬ 
ferred to selected asparagine residues of nascent pro¬ 
teins by the oligosaccharyl transferase complex 
(Sanyal and Menon 2009). Transfer of the glycan is 
followed by glycan-based quality control of the pro¬ 
tein, after which all three glucose residues and a single 
mannose residue are removed before export to the 
Golgi apparatus. 

Mucin-type of O-glycosylation is initiated by the 
addition of a GalNAc-residue to a serine or a threonine 
of a protein. Further elongation and termination by 
specific glycosyl transferases is highly regulated. 
Clustering of 0-glycans in Ser/Thr(/Pro)-rich domains 
allows glycan determinants such as sialyl Lewis X to 
be presented as multivalent ligands, essential for 
functional recognition (Hanisch 2001). 

Nuclear and cytoplasmic proteins show 0-linked 
attachment of A-acetyl-glucosamine (0-GlcNAc) on 
serine and threonine residues, which seem to play a key 
role in signal transduction pathways. Numerous pro¬ 
teins such as metabolic enzymes, transcription factors, 
phosphatases, kinases, cytoskeletal proteins, kinases, 
among others, display this type of glycosylation. 

Gamma-Carboxylation 

Additional carboxylation of glutamic acid by vitamin 
K-dependent gamma-carboxylase is a common 
posttranslational modification of many proteins, 
including some of the blood clotting factors. Vitamin 
K-antagonists, such as warfarin, are often included in 
the therapy of malignant disease, decreasing the blood 
coagulation potential (Cade et al. 1970; Kulik et al. 
2006). The gamma-carboxyglutamic acid-rich domain 
of these vitamin K-dependent proteins allows for 
calcium-induced conformational changes in the 
protein surface contributing to the calcium-mediated 
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ligand formation between the gamma- 
carboxyglutamic acid and anionic phospholipids in 
the membrane. 

Vitamin C-Dependent Modifications 

Proteins may suffer proline and lysine hydroxylations 
and carboxy-terminal amidation. Prolyl hydroxylase 
and lysyl hydroxylase are the enzymes, which catalyze 
the hydroxylation, and the donor of the amide for 
C-terminal amidation is a glycine residue. Vitamin 
C is a cofactor to these reactions, and therefore, they 
are vitamin C-dependent modifications. 

Collagen is the most abundant protein in mamma¬ 
lian tissues. All members of the collagen family are 
composed of three chains forming triple helix domains 
containing repeating GXY (X is often a proline (P) 
residue and Y a hydroxyproline (O) residue). A proline 
residue positioned at Y in the repeating domain is 
hydroxylated to hydroxyproline, and thus a posttrans- 
lational reaction is required to increase the stability of 
the collagen triple helix and for cross-linking pro¬ 
peptides in collagen (Ananthanarayanan 1983; Mid¬ 
dleton and Bulleid 1993; Pinkas et al. 2011). The 
presence of proline and hydroxyproline in the GXY 
domains are required for the sharp twisting of the triple 
helix. The importance of this posttranslational modifi¬ 
cation is evident in patients with vitamin C deficiency, 
like scurvy, which present defective collagen 
fihrillogenesis (Munday et al. 2005). The lack of lysine 
hydroxylation leads to impaired wound healing, defec¬ 
tive bone formation, and fragile capillaries, resulting in 
bleeding (Harwood et al. 1974). 

Sulfation 

Sulfation of tyrosine is a posttranslational modification 
thought to occur almost exclusively on secreted and 
transmembrane spanning proteins and controls 
extracellular protein interactions. Tyrosine sulfation 
is involved in several biological processes, promoting 
the binding of chemokine receptors (HIV-1 CC- 
chemokine receptor 5), inflammatory response mole¬ 
cules (P-selectin glycoprotein ligand-1), hemostasis 
proteins (Factor VIII), pathogen invasion (Duffy anti¬ 
gen/receptor of chemokines), as well as the activation 
of bioactive peptides. Also, specific C-terminal 


tyrosine sulfation of secreted fibronectin confers its 
binding affinity toward fibrin (Allen et al. 2007). 

Acylation 

In most cases, the initiator methionine is hydrolyzed 
and an acetyl group is added to the new N-terminal 
amino acid. Acetyl coenzyme A (acetyl-CoA) is the 
acetyl donor for these reactions. Some proteins 
have the 14 carbon myristoyl group added to their 
N-termini. The donor for this modification is 
myristoyl-CoA. This latter modification allows associ¬ 
ation of the modified protein with membranes. 
The catalytic subunit of cyclic AMP-dependent protein 
kinase (PKA) is myristoylated. Histones can also 
suffer acetylation on lysine residues in the N-terminal 
of the protein, which brings in a negative charge, 
inducing conformational changes in the protein 
facilitating gene transcription. 

Phosphorylation 

Phosphorylation is a vital tool in animal cells regulat¬ 
ing the biological activity of a myriad of proteins 
enabling them to switch from a transient state 
(see ‘Phosphorylation’). A phosphate may be rapidly 
added and later removed efficiently shifting a protein 
from an active to an inactive state. Glycogen synthase 
and glycogen phosphorylase in hepatocytes control the 
levels of glucose delivery to the blood in response to 
glucagon (Stingl and Roden 2004). The enzymes that 
phosphorylate proteins are termed kinases and those 
that remove phosphates are termed phosphatases. In 
animal cells, serine, threonine, and tyrosine are the 
amino acids subject to phosphorylation by enzymes 
termed serine/threonine kinases. Cellular signal 
transduction pathways represent coordinated 
phosphorylation cascades of a myriad of kinases in 
different subcellular compartments. 

Ubiquitination 

Ubiquitination (or ubiquitylation) is an ATP-dependent 
proteolytic system and is an enzymatic, protein 
posttranslational modification process in which the 
carboxylic acid of the terminal glycine from 



p 


1922 


Post-Translational Modifications 


the di-glycine motif in the activated ubiquitin forms an 
amide bond to the epsilon amine of the lysine in the 
modified protein see also ► Ubiquitinylation, (Walters 
and Chen). Ubiquitin directs protein recycling and has 
been found in almost all tissues (ubiquitously) of 
eukaryotic organisms. Ubiquitin binds to proteins cre¬ 
ating an ubiquitin tag which directs proteins to the 
proteasome for degrading. This system has also been 
shown to participate in modulating the activity of some 
proteins, as well as aiding in the regulation of several 
signaling pathways, such as the beta-catenin regulation 
in the wnt signaling pathway (Chairoungdua et al. 2010; 
Taelman et al. 2010). 

Glypiation 

Phosphatidyl inositol (PI) can be added to proteins 
posttranslation, in the endoplasmic reticulum, follow¬ 
ing cleavage of a hydrophobic C-terminal peptide. 
The PI moiety is covalently attached through 
a carbohydrate-containing linker (glucosamine and 
mannose glycosidically bound to the inositol residue) 
to a phosphoryl ethanolamine residue that is linked to 
the C-terminal amino acid of the protein by its amino 
group. Glypiated proteins are then processed through 
the secretory pathway to the extracellular space. 

The addition of the PI moiety allows these proteins, 
termed glycosyl-phosphatidylinositol (GPI)-anchored 
proteins, to attach to the cell membrane where they can 
play an important role in cell signaling. These proteins 
can be cleaved from the GPI anchor via phospholipase 
C resulting in the release of diacylglycerol, which is 
a second messenger in cell signaling pathways 
(Nazih-Sanderson et al. 1997). Furthermore, several 
GPTanchored proteins have enzymatic activity at the 
cell membrane, for example, acetylcholinesterase, 
which is attached to the postsynaptic membrane 
of cholinergic synapses and degrades the neurotrans¬ 
mitter acetylcholine terminating synaptic transmission 
(Coussen et al. 2001; Sharom and Lehto 2002; 
Zimmerman and Soreq 2006). 

Summary 

Posttranslational modifications regulate protein activity 
playing a vital role in cell biology, coordination of vital 
processes, and maintaining physiological functions. 


A myriad of physiological processes are governed by 
posttranslational modifications: cleavage of zymogens 
modulating proteolytic activity, glycosylation and phos¬ 
phorylation regulating the biological activity of pro¬ 
teins, gamma-carboxylation regulating blood clotting 
cascades, and vitamin C-dependent modifications 
enabling correct fibrillogenesis of collagen; thus, it is 
a testament to the importance of posttranslational mod¬ 
ifications. Alterations or loss in the regulation of post¬ 
translational modifications can trigger diseases, such as 
cancer, diabetes, and heart disease. Therefore, post¬ 
translational modifications have recently earned much 
attention in the search for novel biomarkers 
and development of new therapies (Krueger and 
Srivastava 2006). 

Cross-References 

► Glycoproteins 

► Glycosylphosphatidylinositol 

► Protein Phosphorylation 
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Synonyms 

Acetylation; ADP-ribosylation; Chromatin interac¬ 
tions; Citrullination; Methylation; Protein Phosphory¬ 
lation; Proline isomerization; Sumoylation; 
Ubiquitination 

Definition 

Histones, the proteins around which nucleosomes are 
built (Fig. 1), can interact with a range of proteins to 
regulate gene transcription and to maintain genome 
stability in processes such as replication, recombina¬ 
tion, cell cycle progression, and DNA repair. All 
histone proteins are capable of taking on a wide variety 
of reversible posttranslational modihcations, and these 
modifications regulate the interactions with protein 
partners and thereby, dictate the transcriptional status 
of chromatin, as proposed in the “histone code 
hypothesis” (Kouzarides 2007). 

Basic Characteristics 

Currently, covalent modifications to over 60 residues 
of histone proteins have been identified, including 
methylation, acetylation, ubiquitination, sumoylation. 
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Post-Translational Modification of Histone Proteins, 
Fig. 1 Ribbon representation of a crystal structure of the nucle- 
osome core particle (protein data bank code lAOI; Luger 
et al. 1997). Human a-satellite DNA (146 bp) is wrapped around 
a protein core composed of two copies of histones H2A (red), 
H2B (green), H3 (cyan), and H4 (blue). Unstructured N- and 
C-terminal ends (tails) of histone proteins protrude from the 
nucleosomes as indicated by red arrows 


phosphorylation, citrullination (also known as 
deimination), ADP-ribosylation, and proline isomeri¬ 
zation. Many residues can be subject to more than one 
type of modification; for example, residue K9 in 
histone H3 can be either methylated or acetylated 
(Fig. 2). The complexity is further increased by the 
different possible sub-states that are possible in the 
case of methylation. Thus, lysine residues are found 
as mono-, di-, or trimethylated (Kme^ wherein 
n = 1-3) and arginines can be mono- or dimethylated 
(Rme; wherein i = 1-2). Additionally the Rme 2 
modihcation can be symmetrical or asymmetrical, 
depending on which nitrogen(s) on the guanidino 
group is (are) methylated. Flistone modifications are 
generally dynamic, in that they can be “written” and 
“erased” by highly regulated enzymes. Most of the 
modihcations are concentrated on the unstructured 
N- and C-terminal tails (Kouzarides 2007; Tail 2007) 
that form approximately 30% of the histone mass and 
protrude out from the nucleosomes where they are 
accessible to other chromatin-associated proteins 
(Luger et al. 1997). Although fewer modifications are 
located in the histone cores, those that have been 


described, such as acetylation of K56 on H3 and K91 
on H4, have been proposed to play a significant role in 
the regulation of histone:DNA and histone:histone 
interactions (Shahbazian and Grunstein 2007). 

Acetyl marks are written by histone 
acetyltransferases (HATs), which catalyze the transfer 
of an acetyl group from acetyl-coenzyme A to the 
£-amine of a target lysine (Shahbazian and 
Grunstein 2007). This process can be counteracted by 
histone deacetylases (HDACs). Generally HATs and 
HDACs have relaxed substrate specificity and can 
modify several alternative lysine residues. HATs and 
HDACs are essential in the development of numerous 
organisms, and aberrant histone acetylation has been 
described in human diseases such as leukemia, epithe¬ 
lial cancers, and fragile X syndrome (Timmermann 
et al. 2001). Lysine acetylation is thought to relax the 
higher-order chromatin structure by interfering with 
histone:DNA interactions, whereas the removal of ace¬ 
tyl groups by HDACs leads to chromatin condensation. 
In line with these activities, HATs and HDACs are 
commonly found as a part of multi-protein transcrip¬ 
tional activator and repressor complexes, respectively, 
and the other members of these complexes presumably 
regulate the activity of the enzyme component. 
Histone acetylation also most likely promotes 
transcription by recruiting gene activating factors that 
specifically recognize the introduced acetyl groups 
(Shahbazian and Grunstein 2007). Histone acetylation 
levels can be artificially altered by HDAC inhibitors 
(HDACI), and several HDACIs, such as trichostatin 
A (TSA) and sodium phenylbutyrate, are currently 
under investigation as potential drugs in cancer 
treatment (Timmermann et al. 2001). 

Lysine and arginine methylation can be either a 
gene-activating or repressing mark depending on the 
residue-affected and the methylation status. Methyla¬ 
tion is carried out by methyltransferases, which modify 
lysines and arginines in a highly specific manner using 
S-adenosylmethionine as a donor substrate 
(Shilatifard 2006). Deregulation of the histone methyl¬ 
ation system has been connected to oncogenesis and 
neurological disorders, such as X-linked mental retar¬ 
dation (Shi 2007). For most of the lysine residues within 
histone tails that can be methylated, a number of alter¬ 
native methylases have been identified. For example, 
H3 can be methylated at K9 by different enzymes 
depending on whether it is located in transcriptional 
active or repressed chromatin (Shilatifard 2006). 
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Post-Translational Modification of Histone Proteins, 
Fig. 2 Posttranslational modifications commonly found on the 
core histone tails (H2A, H2B, H3, and H4) are indicated. Lysine 


can be mono-, di-, or trimethylated, while arginine is found as 
mono- or symmetrically/asymmetrically dimethylated. The start 
and end residues of the tails are numbered beside the sequences 


Methylated H3K4 and H3K36 are commonly found 
along actively transcribed genes. In yeast, some H3K4 
and H3K36 methyltransferases have been proposed to 
be co-recruited to euchromatin with the transcriptional 
machinery, and this arrangement is likely to be 
conserved in higher eukaryotes (Shilatifard 2006). 
H3K4me2 and H3K4me3 have been studied in detail, 
and trimethylated H3K4 can be recognized by a number 
of transcriptional activators, such as the chromatin 
remodeling factor CHDl (chromodomain ATPase/ 
helicase DNA binding protein 1; Shahbazian and 
Grunstein 2007; Shilatifard 2006). Several enzymes 
have been identified that can remove these methyl 
marks, many of which belong to the Jumonji C family 
(Shi 2007). For each methylated lysine, several 
demethylases have been identified, and they are often 
specific for a particular lysine methylation status. 

Arginine methylation, a relatively poorly character¬ 
ized histone modification, is carried out by protein 
arginine methyltransferases (PRMTs; Bedford and 
Clarke 2009). For example, FI3K4 trimethylation can 
be prevented by prior methylation of the neighboring 
H3R2 by PRMT6, suggesting that H3R2 methylation 
is a transcriptionally repressive histone mark (Bedford 
and Clarke 2009). Arginine demethylation seems to 
occur via a citrullination reaction catalyzed by 
peptidylarginine deiminases, and leads to arginine 
citrullination rather than restoration of the unmodified 
residue. Recently, the Jumonji family protein JMJD6 
was suggested to be an arginine demethylase that can 
regenerate the native arginine residue (Bedford and 
Clarke 2009). Further work is required to establish 
the biological signihcance of these different arginine 
demethylation pathways. 

Serine and threonine phosphorylation by kinases, 
and dephosphorylation by phosphatases, have mainly 


been studied in the context of mammalian cell-cycle 
regulation, DNA repair, and cell signaling 
(Kouzarides 2007). Among phosphorylation marks, 
the best characterized is chromatin hyperpho¬ 
sphorylation, which is associated with the nuclear 
DNA damage response that is triggered by DNA 
double strand breaks. Thus, the H2A variant H2AX in 
mammals becomes rapidly phosphorylated at serine 139 
by the kinase ATM (ataxia telangiectasia mutated) in 
nucleosomes surrounding the lesion (Ikura et al. 2007). 
Together with histone ubiquitination and acetylation, 
H2AX hyperphosphorylation leads to accumulation of 
DNA damage response proteins at the site of lesion. 

Ubiquitination, sumoylation, and ADP-ribosylation 
are all relatively bulky modifications when compared 
to acetylation, methylation, and phosphorylation, and 
have to date been studied in less detail. In addition to 
histone ubiquitination during DNA repair, several 
studies suggest that histone H2B monoubiquitination 
and deubiquitination act to regulate the histone meth¬ 
ylation that is associated with transcription initiation 
and elongation. Histone ubiquitination is carried out by 
E2 ubiquitin-conjugating enzymes in the presence of 
E3 ligases (Shilatifard 2006). Sumoylation and 
ADP-ribosylation are poorly characterized modifica¬ 
tions in histones. ADP-ribosylation is seemingly 
involved in DNA repair, and work carried out 
primarily in yeast indicates that sumoylation is gener¬ 
ally found in repressive chromatin (Kouzarides 2007). 
Proline cis-trans isomerization by proline isomerases 
is not a covalent modification, but conformational 
changes that result in the histone polypeptide backbone 
as a consequence of proline isomerization (e.g., of P38 
in H3) have been suggested to regulate other 
modifications taking place in the nearby residues 
(Kouzarides 2007). 
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Additional types of histone modifications, such as 
serine and threonine O-GlcNAcylation (0-linked P-N- 
acetylglucosamine), tyrosine hydroxylation (Yoh), and 
lysine crotonylation (Kcr) have been discovered very 
recently (Tan et al. 2011), and future studies will be 
required to elucidate the roles, if any, played by these 
modihcations in the histone code. 
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Potassium Channel Selectivity and 
Gating at the Selectivity Filter: Structural 
Basis 

Simon Berneche 

Biozentrum, University of Basel, Basel, Switzerland 

Synonyms 

C-type inactivation; Open-probability; Relative 
conductance 

Definition 

The selectivity of an ion channel refers to its 
propensity to favor the permeation of a specific ion 
fype. The process called gating refers to the intercon¬ 
version between conductive and nonconductive 
conformations of a channel. 

Introduction 

In excitable cells, the main gate of voltage-dependent 
channels opens following cell depolarization. For 
a given period of time their selectivity filter is the only 
barrier between the intra- and extracellular solutions, 
which hold respectively a high concentration of and 
Na"^ ions. The role of the selectivity filter is to catalyze 
and control the diffusion of down its electrochemical 
gradient while preventing the diffusion of Na"^ in opposite 
direction, thus allowing for the repolarization of the cell. 

How can channels distinguish between two 
kinds of ion that are so similar? and Na"^ both 
hold a charge of -f 1 and their radii differ by only a 
fraction of an Angstrom (r^a = 0.9 A; r^ = 1.33 A). 
The challenge is made even greater since the channel 
needs to select for the larger of the two ions, so 
a simple sieve mechanism would not work. 
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Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis, Fig. 1 Canonical struc¬ 
ture of the selectivity filter of the KcsA channel, (a) 
Detailed view of two opposite subunits with ions occupying 
sites SO, S2, and S4. The dashed lines correspond to impor¬ 
tant intra-subunit interactions, (b) The backbone atoms 


forming the permeation pore are shown for all four 
subunits. The octahedral coordination scheme is 
highlighted in site S3. In parentheses is given the selectiv¬ 
ity (AAG in kcal/mol) of each binding site calculated by 
molecular mechanics free energy perturbation (Noskov 
et al. 2004) 


While many hypotheses existed beforehand, the 
elucidation of the high-resolution X-ray structure of 
the prokaryotic KcsA channel allowed for 
a detailed understanding of the fine mechanisms under¬ 
lying selectivity (Doyle et al. 1998; Zhou et al. 2001). 
The structure of the KcsA selectivity filter reveals four 
K"^ binding sites that are each defined by eight oxygen 
atoms forming a quasi-cubic box (see Fig. 1). This 
unique structure, with all carbonyl oxygens pointing 
toward the pore, is made possible by the presence of 
two conserved glycine residues in the core of the 
selectivity filter (the GYG signature sequence). 
Because glycines have no sidechain, they can reside 
in the left handed a-helical region of the 
Ramachandran plot (generally forbidden to L-amino 
acid) and allow for the alignment of the different 
carbonyl groups (Valiyaveetil et al. 2006). Two K"^ 
ions are usually found in the filter, occupying sites 
S1/S3 or S2/S4, with a water molecule in between 
(note that a third ion can also transiently bind to SO 
when ions are in the S2/S4 state). The binding sites 
look like perfectly designed to hold a K"^ ion. Though, 
we will see that the structure is only one of many 
elements that need to come together to give rise to 
selectivity. 

High selectivity implies binding the substrate with 
high affinity to recognize its chemical properties, 
which should considerably slow down permeation. 
Paradoxically, despite such restraint, K"^ channels 
achieve high K"^ permeation rates approaching 
the limit of diffusion. This special combination of 


properties - high selectivity and permeation rate - is 
made possible by the principle of single file diffusion: 
in this arrangement, each single ion binds strongly to 
the channel protein but is, at the same time, 
destabilized by the repulsion of neighboring ions 
(Morais-Cabral et al. 2001; Berneche and Roux 
2001). When an ion enters the selectivity filter of 
a K"^ channel, its interactions with water molecules 
are gradually replaced by interactions with oxygen 
atoms from carbonyl or hydroxyl groups of the protein. 
The permeating K"^ ions can thus smoothly transit from 
water bulk to the selectivity filter without facing 
important free energy barriers (Berneche and Roux 
2001 ). 

The mobility of the glycine amide planes in the core 
of the selectivity filter provides the required flexibility 
to minimize the free energy barriers faced by the per¬ 
meating K"^ ions when moving from one binding site to 
the next - these barriers were shown to be on the order 
of 2 kcal/mol (Berneche and Roux 2001). However, 
the flexibility provided by the glycine residues also 
means that the selectivity filter is a malleable structure 
that can potentially adopt different conformations. 
This plasticity allows the selectivity filter to react to 
its environment, changing its conformation from 
a canonical conducting state to a nonconducting one. 
By doing so, the selectivity filter plays an important 
role in controlling the conductance of the channel. The 
conformational changes are controlled by a network of 
molecular interactions around the pore but also by the 
permeating ions. These properties, structurally 




p 


1928 


Potassium Channel Selectivity and Gating at the Selectivity Filter: Structural Basis 


characterized in the KcsA channel, are believed to be 
relevant to C-type inactivation in eukaryotic channels, 
a mechanism that notably plays a crucial role in the 
synchronicity of signal propagation in excitable cells 
(Yellen 1998). 

Permeation, selectivity, and gating constitute the 
core functions of the selectivity filter of channels 
and result from a fine equilibrium of forces. In the 
following pages, we will illustrate how the structure 
of the selectivity filter gives rise to these interdependent 
functions. While permeation and selectivity can 
now be described with robust models - although 
there are still ongoing debates (see Andersen 2011 and 
references therein) - the gating mechanisms in the 
selectivity filter of channels are not yet well 
understood. 

What is Selectivity? 

From a functional point of view the selectivity of a 
channel represents the probability of observing the 
complete transmembrane permeation of a ion ver¬ 
sus that of a Na"^ ion. However, quite often, the selec¬ 
tivity of channels is discussed in terms of 
thermodynamics binding affinity to the selectivity fil¬ 
ter. This simplification is appropriate because the rel¬ 
ative affinity of the ions for the different binding sites 
determines, for a large part, their relative permeation 
rate. It nevertheless remains an incomplete view that 
should be supplemented by an analysis of the kinetic 
factors underlying the actual ion diffusion (Thompson 
et al. 2009). 

Which of or Na"^ is actually more likely to bind 
to the channel’s narrow pore? The answer to this ques¬ 
tion does not simply depend on the interaction between 
the ions and the protein, it also depends on the inter¬ 
action with the water bulk. The ion free energy of 
solvation, i.e., the free energy of bringing an ion from 
vacuum into a water bulk, is about —80 and —100 kcal/ 
mol for and Na"^, respectively. These values indi¬ 
cate that the smaller Na"^ ion interacts more strongly 
with water than a ion does. It is actually the same in 
a channel: Na"^ does interact more strongly with the 
selectivity filter of a channel than itself. While 
this may sound shocking, it has little importance for 
selectivity, which arises from a detailed account of the 
free energy of the system. When a ion enters the 
selectivity filter, the interactions with the solvent are 


lost and close interactions with the protein are formed. 
The free energy change associated with this process is: 

AG/f = Gk.channel — Gk.water (1) 

The same can be written for a Na"^ ion: 

AGwa = Gno-.CMANNEL — Gno-WATER (2) 

Let us consider a Na"^ ion bound to the channel. The 
free energy for moving this ion back to the water bulk and 
replacing it in the selectivity filter by a ion would be: 

AAG = AGic — AGf^a 

= [Gk-.channel — Gk.water] 

— [GNa-.CHANNEL — Gno^WATEr] (3) 

= [Gk:CHANNEL — Gno-.cMANNEl] 

— [Gk-.WATER — GNa-.WATER] 

A given binding site will be selective for if AAG 
is negative. The last line of (3) says that the channel is 
selective for if the difference in binding free energy 
of the ions in the channel is less than their difference in 
free energy of solvation. 

Selectivity Arise from Protein Structure 
and Chemistry 

The most popular model to explain selectivity has 
historically been the snug-fit mechanism (Bezanilla 
and Armstrong 1972). According to this model, the 
binding sites of channels would be designed in 
such a way that only ions would be appropriately 
coordinated, while smaller ions, like Na"^, would 
loosely fit and not be as well coordinated as in bulk 
water. Initially, the high-resolution X-ray crystallogra¬ 
phy structure of the KcsA channel, with binding 
sites perfectly matching the geometry of the ion, 
seemed to confirm this model (Zhou et al. 2001). But 
things have afterward been shown to be more complex. 

The application of the snug-fit model to channels 
would require that their selectivity filter be held in 
a rigid conformation with a sub-Angstrom precision, 
preventing any adaptation to the smaller Na"^ ion (Yu 
et al. 2010). However, molecular dynamics simula¬ 
tions show that the fluctuations of the selectivity filter 
are actually larger than the size difference between the 
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Potassium Channel Selectivity and Gating at the 
Selectivity Fiiter: Structural Basis, Fig. 2 A 2D illustration 
of the snug-fit mechanism, (a) The ion perfectly fit in the 
binding site defined by carbonyl groups that are held in place by 
the overall structure of the protein, achieving a perfect coordi¬ 
nation for (b) The smaller Na"^ ion does not fit as well in the 


binding site, leading to a lack of coordination and a free energy 
increase, (c) In the case of the channel, the protein is not rigid 
enough to impose a fixed geometry of the binding sites. The 
protein adapts to the smaller Na"^ ion and provides appropriate 
coordination, at the expense of unfavorable electrostatic repul¬ 
sion between the carbonyl groups 


two ions. Radial distribution functions extracted from 
the simulations further illustrates that both and Na"^ 
are well coordinated in the binding sites of the KcsA 
channel, and thus selectivity must have another origin 
(Noskov et al. 2004). 

An alternative explanation is that selectivity arises 
from the number and intrinsic physical properties of 
the ligands coordinating the ions in the channel. This 
model suggests that, while Na"^ ions are appropriately 
coordinated in the selectivity of channels, this is 
achieved at a free energy cost arising from the repul¬ 
sion between the carbonyl groups forming the binding 
sites (Noskov et al. 2004). The channel structure is 
malleable enough to accommodate the smaller Na"^ 
ion, but the fact that Na"^ is smaller than implies 
that the carbonyl dipoles are closer to each other and 
thus their repulsion is higher (see Fig. 2c). These 
unfavorable interactions between the ligands are not 
compensated for by their favorable interaction with 
Na"^. Molecular mechanics calculations show that the 
carbonyl dipole of 3 Debye allows for a more favorable 
balance of the electrostatic forces (ion-ligands and 
ligands-ligands) for than any other ions, smaller 
or bigger (Noskov et al. 2004). 

While, at the first sight, the structure of the KcsA 
channel suggests that the four binding sites are struc¬ 
turally equivalent, they are not equally selective 
because they do not provide identical microscopic 
environments. The S2 site, in the middle of the selec¬ 
tivity filter, appears to be the most selective with 
a AAG of about 6 kcal/mol. Ions in S2 are dehydrated 
and coordinated by eight carbonyl oxygen atoms. In 


other sites, ions are coordinated by both carbonyl and 
water molecule oxygen atoms, in different ratio. In 
sites SI and S3, an average of six carbonyls and one 
or two water molecules coordinate a bound ion, 
bringing down the selectivity to a AAG of 2-3 kcal/ 
mol. Site S4 is even less selective with only about five 
oxygen atoms from the protein coordinating the ion. 
Thus, the reduction in the total number of coordinating 
ligands together with the increased participation of 
water molecules means that the binding site displays 
properties closer to those of bulk water and is thus 
less selective. The geometry and mechanical properties 
of the binding sites play an important role in defining 
the ratio of protein ligands and water molecules. The 
quasi-cubic coordination scheme that is well 
maintained in binding site S2 imposes the presence of 
eight carbonyl ligands. Such a large number of ligands 
is not necessary to achieve the proper coordination of 
ions (on average, about six ligands are found in the 
first coordination shell of ions in bulk water), but it 
does increase selectivity by rising the repulsive forces 
between the ligands when in presence of a smaller ion 
like Na"^ (Noskov et al. 2004; Bostick and Brooks 
2007; Varma and Rempe 2008). The favorable Na"^- 
ligand interactions increase linearly with the number of 
coordinating ligands N, but the unfavorable ligand- 
ligand interactions increase much faster as: f (N — 1). 
Note that Na"^ ions find stable positions in between 
binding sites, in the plane of the carbonyl rows. 
While these positions are favorable to Na"^, they do not 
seem to be particularly unfavorable to ions, as was 
illustrated by free energy calculation describing the 
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Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis, Fig. 3 Alternative 
nonconductive conformations of the KcsA selectivity filter, (a) 
An asymmetric structure that involves the rotation of the Val76/ 
Gly77 amide plane in only one of the four subunits. According to 
molecular dynamics simulations, this structure is stable on short 
time scales and might be linked to fast gating events, (b) 


A pinched conformation observed by X-ray crystallography in 
which binding site S2 is no longer accessible to ions or water 
molecules. This conformation has little affinity for ions, which 
transiently bind to binding sites SI or S4. (c) The structure 
presented in (b) is stabilized by a closed network of H-bonds 
involving the Val76/Gly77 amide planes and water molecules 
(top view) 


permeation of ions through the selectivity filter of 
KcsA (Berneche and Roux 2001). Diffusion through 
the S2 binding site, which presents the highest AAG, is 
likely to be a key step defining the selectivity of 
the channel. However, the fact that Na"^ ions occupy 
alternative equilibrium positions has direct impact 
on the multi-ion interactions underlying permeation 
and, potentially, on the selectivity of the channel. 
These questions are just starting to be addressed by 
the scientific community (Thompson et al. 2009). 

In summary, ion selectivity in the flexible binding 
sites of channels emerges from a subtle balance 
of ion-ligand and ligand-ligand interactions. The specific 
structure of the selectivity filter concentrate an excess of 
ligands (carbonyls) which have the proper chemical 
properties to, on one side, coordinate ions 
efficiently and, on the other side, create less favorable 
electrostatic interactions in presence of smaller or bigger 
ions (although these remain well coordinated). 

The Malleability Provided by the Glycine 
Residues Sustains Gating 

Electrophysiological single channel recordings show 
that gating in the selectivity filter of channels takes 
place on a wide range of time scales. Channels are 
sometimes seen flickering rapidly between conductive 
and nonconductive states on a time scale of millisec¬ 
onds (Chakrapani et al. 2011). They can also enter in 
stable nonconducting states that are thought to be asso¬ 
ciated with C-type inactivation, which consists in the 


time-dependent closure of the selectivity filter follow¬ 
ing the activation of the channel (Yellen 1998). 
Although the structural details underlying these gating 
mechanisms remain largely unknown, important 
progress have been made by elucidating some of 
the conformational changes of the selectivity filter 
that can block or greatly reduce the ionic current. 

The first structural evidence that the selectivity filter 
of channels could adopt alternative conformations 
came from molecular dynamics simulations of the 
KcsA channel. One possible conformational change 
involves the rotation of the Val76/Gly77 amide plane 
in typically only one of the four subunits, resulting in 
a carbonyl group pointing away from the pore 
(Fig. 3a). While quite small, this change of conforma¬ 
tion results in a lack of coordination for an ion moving 
from site S3 to S2 and increases considerably the free 
energy barrier to permeation, possibly reducing the 
ionic current by a factor 1/1,000 (Berneche and Roux 
2005). Such asymmetric conformation seems to be 
stable for only a short period of time and is believed 
to be involved in fast gating events (Chakrapani et al. 
2011). It remains however possible that further 
unknown conformational changes could stabilize the 
carbonyl in its outward conformation, making the latter 
relevant to slower gating events as well. 

Another nonconducting conformation of the KcsA 
channel was observed by crystallization in low 
concentration of K"^ and high concentration of Na"^ 
(Zhou et al. 2001). This structure shows a selectivity 
filter that is pinched in its middle: the four Val76/Gly77 
amide planes are reoriented by about 30° to become 
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tangent to the pore with the Gly77 CA atoms occluding 
the permeation pathway. This conformation is stabilized 
by water molecules inserted between the subunits, bind¬ 
ing the Val76 carbonyl of one subunit to the Gly77 
amine group of the next (Fig. 3b, c). This structure, 
which was also observed in high concentration 
under conditions maintaining the main gate of the chan¬ 
nel open, is likely to correspond to the inactivated state 
of the KcsA channel (Cuello et al. 2010a). X-ray elec¬ 
tron density measurements (using TG as a surrogate for 
K"^) and free energy calculations suggest that this struc¬ 
ture has low ion-binding affinity (Zhou and MacKirmon 
2003; Boiteux and Bemeche 2011). This implies that 
the selectivity filter of the KcsA channel loses both of its 
ions through the inactivation process. Whether this 
mechanism is representative of C-type inactivation in 
eukaryotic channels remains unclear. 

Gating in the Selectivity Filter is Controlled 
by Various Mechanisms 

The channel gating mechanisms are all 

interdependent. Following activation of the main 
gate, the open-probability of the selectivity filter 
remains stable or decreases, notably through C-type 
inactivation. As further discussed below, the rate and 
extent of this decrease depends on a variety of factors, 
including the specific channel structure, the permeat¬ 
ing ion kind and concentration, and the eventual pres¬ 
ence of blockers, like the N-type inactivation peptide. 
For the selectivity filter to retrieve its highest open- 
probability, the main gate of the channel needs to be 
brought back to its closed state for an extensive period 
of time before being activated again. The structural 
basis of this coupling mechanism was recently eluci¬ 
dated by X-ray crystallography of the KcsA channel 
(Cuello et al. 2010b). When the helices forming the 
intracellular gate undergo their outward movement that 
actually opens the pore, steric contacts between these 
helices (involving residue Phel03 in KcsA) and the 
bottom of the selectivity filter (residues Thr74/Thr75) 
perturb the structure of the selectivity filter and favor 
its inactivated state. Closing the main gate reverses the 
process and promotes the conductive state of the filter. 

While the sequence of the selectivity filter of K"^ 
channels is extremely well conserved, the amino acids 
that surround it vary from channel to channel and 
define the structural stability of the pore. These 


amino acids form hydrogen bonds and salt-bridges 
within and between subunits and influence the struc¬ 
ture of water molecules filling cavities around the 
selectivity filter. These water molecules, like the ion 
in the pore, are key components of the protein struc¬ 
ture. The resulting network of molecular interactions 
supports a complex architecture with finely balanced 
forces (see Fig. la). Breaking some of these interac¬ 
tions is likely to favor inactivation, though it is not 
always the case. For example, in the KcsA channel, 
removing a H-bond between residues Glu71 and 
Asp80 stabilizes the conducting state of the channel 
(Cordero-Morales et al. 2007). Also, while sometimes 
a malleable filter with less structural restraints might 
inactivate faster, it might also recover from inactivation 
more rapidly. Many of the key interactions that are 
thought to stabilize the selectivity filter are actually 
absent in the hERG channel, maybe explaining the 
unique combination of extremely fast inactivation and 
recovery from inactivation mechanisms underlying the 
channel’s inward rectification property (Smith et al. 
1996; Yellen 1998). 

Permeating ions have a direct impact on gating 
events in the selectivity filter. The presence of K"^ or 
Rb"^ ions generally favors the conductive state, while 
smaller ions like Na"^ and Li"^ promote inactivation. 
The KcsA low-[K'^] structure discussed above pro¬ 
vides a structural illustration of this phenomenon. 
Despite being available in high concentration, Na"^ 
ions do not allow the channel to maintain or retrieve 
its conductive state. From a free energy accounting 
perspective, it means that the system, as a whole, is 
considerably more stable when Na"^ ions are kept in the 
bulk and the channel maintained in its water-filled 
inactivated conformation. In presence of K"^ ions, the 
free energy difference between the ion-free inactivated 
state and the K'^-bound conductive state is smaller and 
the system undergoes transitions between the two. This 
is what one could call super-selectivity, i.e., that the 
channel can adopt a conductive conformation only in 
presence of a given type of ion. X-ray crystallography 
(Zhou and MacKinnon 2003) and NMR (Imai et al. 
2010) experiments have clearly illustrated the structural 
response of the KcsA selectivity filter to titration of 
various ion kinds. These NMR experiments also illus¬ 
trate how temperature influences the structural property 
of the channel and potentially gating. 

The transition from the conductive to the 
nonconductive state might also depend on the 
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dynamics and fine equilibrium of ions in the selectivity 
filter. It was proposed, on the basis of electrophysiol¬ 
ogy measurements and kinetic modeling, that occupa¬ 
tion of given binding sites by ions would prevent 
transitions to the inactivated state according to a foot- 
in-the-door mechanism (see Yellen 1998). Free energy 
calculations on the KcsA channel have later predicted 
that the presence of an ion in S2, which happens to be 
the most selective binding site, would stabilize the 
conductive state by preventing the rotation of 
the Val76 carbonyl groups illustrated in Fig. 3a 
(Bemeche and Roux 2005). Any factor that can favor 
the occupancy of site S2, like the variation of the trans¬ 
membrane potential or the interaction with ions in other 
binding sites, could indirectly regulate gating events in 
the selectivity filter. For example, the signature influence 
of extracellular concentration on C-type inactivation 
can potentially be explained by the fact that an increase 
of concentration would increase the ion-hinding proba¬ 
bility to site SO and favor the occupancy of site S2, thus 
reducing inactivation. While the relevance of these 
observations to C-type inactivation remains to be 
shown, it is interesting to note that in a series of X-ray 
stractures illustrating the development of inactivation in 
the KcsA channel, the occupancy state of S2 decreases 
throughout the process (Cuello et al. 2010a). 

Blocker molecules constitute another potential gat¬ 
ing determinant (Ader et al. 2008). Blockers with little 
binding affinity can actually prevent inactivation, nota¬ 
bly by binding to the extracellular mouth of the selec¬ 
tivity filter (e.g., like ions, tetraethyl ammonium 
(TEA) was shown to hind in SO with the same putative 
influence on inactivation). Blockers with higher affin¬ 
ity are likely to promote C-type inactivation. One 
notable example of such mechanism is the N-type 
inactivation peptide found at the N-terminus of many 
Kv channels which are known to block the ionic cur¬ 
rent by binding to the pore and, by doing so, force the 
selectivity filter into its C-type inactivated state (see 
Yellen 1998). This is a physiologically important 
mechanism since recovery form C-type inactivation 
is generally slow and thus determines the probability 
of a channel to conduct on a subsequent activation 
event. The evidences presented above on the KcsA 
channel suggest that by blocking the access of ions to 
the selectivity filter (actually decreasing the apparent 
concentration), the N-type peptide or other blockers 
with similar affinity would promote C-type inactiva¬ 
tion. Whether the dynamics and fine equilibrium of the 


ions within the selectivity filter also play a role in this 
mechanism remains an open question. 

Summary 

The permeation, selectivity, and gating properties of 
K"^ channels are intrinsically linked. While it is often 
tempting to discuss K"^ channels as if they were all alike, 
this is a drastic oversimplification. The more we learn 
about their fine stmcture and function, the more we 
understand the important subtleties that differentiate 
them. It is unclear how or if our findings mostly based 
on studies of the prokaryotic KcsA channel can be trans¬ 
ferred to eukaryotic K"^ channels. The mechanisms that 
we have described here can most probably apply to all 
K"^ channels but to different extent depending on their 
specific structure (sequence) and environment. The 
mechanisms underlying selectivity are universal, 
anchored in basic physical-chemistry principles. How¬ 
ever, all channels do not display the same flexibility and 
stability, involving that the properties of their ion-bind¬ 
ing sites can differ greatly. Adding or removing 
a hydrogen bond at a key position, or allowing a few 
water molecules to reach in, can alter the selectivity and 
gating properties of the selectivity filter. Clearly, the 
spectmm of the possible structural variability of K"^ 
channels remains to be explored. 
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Definition 

Potassium-selective channels are membrane proteins 
with a delicate pore structure efficiently fluxing potas¬ 
sium ion flow close to the diffusion limit while 
maintaining high selectivity for potassium. 

Introduction 

Potassium-selective ion channels are efficient and 
highly selective for shuttling potassium ions across 
membrane. Potassium channels have a sophisticated 
pore structure which efficiently fluxes potassium ions 
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close to the diffusion limit (10®-10 Vs) while 
maintaining high selectivity allowing the passage of 
only one sodium ion over thousands of events. Potas¬ 
sium channels form the largest and the most diverse 
ion channel family (Jan and Jan 1997). In animals, 
potassium channels regulate neurotransmitter release 
from nerve terminals, heart rate, neuronal excitability, 
hormone secretion, muscle contraction, electrolyte 
transport in the kidney tubules, and cell volume regu¬ 
lation. In addition to their well-characterized roles in 
animals, potassium channels regulate turgor pressure, 
osmolarity, and nutrition uptake in plants (Latorre 
et al. 2003). Molecular identihcation of the first potas¬ 
sium channel was achieved by positional cloning based 
on the phenotype of the Drosophila mutant Shaker. 
Since then, more than 200 genes encoding various 
potassium channels have been cloned. For many, but 
not all potassium channels, posttranscriptional alterna¬ 
tive splicing of their transcripts further increases the 
diversity of potassium channels in vivo. The channel 
itself is composed of four a-subunits, the main pore¬ 
forming protein, and several accessory proteins called 
P-subunits. The association of a-subunits with 
P-subunits may influence the channel gating and traf¬ 
ficking properties. The a-subunit of different types 
of potassium channels contains different numbers of 
hydrophobic segments spanning the membrane; 
according to the number of transmembrane segments, 
potassium channels can be classified into three groups: 
(1) those with six transmembrane segments (S0-S6) 
and one pore region (P) in an a-subunit (6TM/1P), 
including most voltage-gated channels; (2) those with 
two transmembrane segments (Ml and M2) and one 
pore region in an a-subunit (2TM/1P), including most 
of the inwardly rectifying potassium channels; 
(3) those with four transmembrane segments 
(M1-M4) with two pore regions in an a-subunit 
(4TM/2P), known as two-pore-domain potassium 
channels (Yu et al. 2005). The selectivity filter is 
formed by four P-loops in the S5-S6 linker from each 
a-subunit, including a tripeptide motif GYG known as 
the signature sequence of potassium channel. These 
three amino acids are highly conserved among all the 
potassium channels, except in some cases such as the 
KvlO and Kir6 channels, in which the tyrosine (Y) 
residue is substituted by a phenylalanine (F). Muta¬ 
tions of this signature sequence typically eliminate 
potassium selectivity. A long pore accommodating 
multiple ions has been proposed by early biophysical 


studies of potassium channels. Recent crystal struc¬ 
tures of potassium channels provide solid evidence to 
support this hypothesis (Gouaux and Mackinnon 
2005). The 2TM/1P potassium channel family has the 
simplest design of the transmembrane domain 
consisting of primarily the pore-forming structure; 
many other forms of ion channels likely evolved from 
this prototype by gene duplication and/or the acquisition 
of other functional domains such as the voltage-sensing 
domain. This entry introduces all four potassium chan¬ 
nel families. Readers are referred to other entries of this 
encyclopedia for discussions on the structural basis 
of potassium channel selectivity, potassium channel 
gating, inactivation and deactivation, and RCK domain 
of Kca channel. 

Voltage-Gated Potassium Channels 

Voltage-gated potassium channels (Kv) form one the 
most diverse potassium channel families, including 
more than 40 voltage-gated potassium channel genes 
cloned from the human genome (Fig. 1). Table 1 lists 
all Kv channels in the human genome. So far, 12 volt- 
age-gated potassium channel families (Kvl-Kvl2) 
have been identified; however, Kv5, Kv6, Kv8, and 
Kv9 do not form functional channels by themselves, 
and may coassemble with other channel subunits to 
silence certain Kv potassium channels (Gutman et al. 
2005). All of the voltage-gated potassium channels 
have similar 6TM/1P structural design (Fig. la). The 
Kv channel protein has the first four transmembrane 
(TM) segments forming the voltage-sensing domain 
that controls the gating of the pore formed by the pore 
loop and the flanking 5th and 6th TM domain (Fig. la). 
Kv channels are either homotetramer or heterotetramer, 
and this tetramerization is mediated by the T1 domain 
at the N-terminus (Sansom 2000). Kv channels undergo 
conformational changes in response to changes in trans¬ 
membrane potential, and subsequently open the chan¬ 
nel gate and cause potassium flux. Channels in this 
group show large variations in gating kinetics and 
they can be classified into two general groups: delayed 
rectifier potassium channels (DR-type) and transient 
potassium channels (A-type). DR-type potassium chan¬ 
nels open with a lag behind membrane depolarization 
and close only after repolarization of the membrane 
potential. These channels do not interfere with the 
speedy rising phase of the action potential but act in 
a delayed fashion to repolarize cells and prepare them 
for subsequent stimulation. On the other hand, A-type 



Potassium Channels: Their Physiological and Molecular Diversity 


1935 


P 




N 


e 



Kv1.3 

Kv1.6 

Kv1.7 

Kv1.4 

Kvl.1 

Kv1.2 

Kv1.5 

Kvl.8 

Kv3.2 

Kv3.4 

Kv3.1 

Kv3.3 

Kv4.2 

Kv4.3 

Kv4.1 

Kv9.1 

Kv9.2 

Kv9.3 

Kv8.2 

Kv2.1 

Kv2.2 

Kv6.2 

Kv6.4 

Kv6.3 

Kv8.1 

Kv5.1 

Kv6.1 

Kv7.2 

Kv7.4 

Kv7.5 

Kv7.3 

Kv7.1 

Kv11.1 

Kvi 1.3 

Kv11.2 

Kv12.1 

Kvi 2.3 

Kvi 2.2 

KvlO.1 

Kvi 0.2 

KCal.1 

KCaS.I 

KCa4.1 

KCa4.2 

KCa2.2 

KCa2.3 

KCa2.1 

KCa3.1 


Shaker 

Shaw 

Shal 


^ Shab 


erg 

elk 

_ eag 

— BK 

— IK 

SK 


Kv family 


Kca family 




Potassium Channels: Their Physiological and Molecular 
Diversity, Fig. 1 (a) Computer-simulated DR-type potassium 
currents in response to a family of voltage pulses from a holding 
potential of —70 mV to command voltages of —80 to -tSO mV in 
10 mV increment, (b) Computer-simulated A-type potassium 


currents in response to the same voltage protocol as in 
(a). Membrane topology of a representative voltage-gated potas¬ 
sium channel subunit (c) and a BK-type calcium-activated 
potassium channel subunit (d). (e) Phylogenetic tree of 
1P/6TM potassium channel family 
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Potassium Channels: Their Physiological and Molecular 
Diversity, Table 1 List of human voltage-gated potassium 
channels 


Channel name 

Gene locus 

Trivial name 

Kvl.l 

KCNAl 

Shaker-related family 

KvL2 

KCNA2 


KvL3 

KCNA3 


KvL4 

KCNA4 


KvL5 

KCNA5 


KvL6 

KCNA6 


KvL7 

KCNA7 


KvL8 

KCNA8 


Kv2.1 

KCNBl 

Shab-related family 

Kv2.2 

KCNB2 


Kv3.1 

KCNCl 

Shaw-related family 

Kv3.2 

KCNC2 


Kv3.3 

KCNC3 


Kv3.4 

KCNC4 


Kv4.1 

KCNDl 

Shal-related family 

Kv4.2 

KCND2 


Kv4.3 

KCND3 


Kv5.1 

KCNFl 


Kv6.1 

KCNGl 


Kv6.2 

KCNG2 


Kv6.3 

KCNG3 


Kv6.4 

KCNG4 


Kv7.1 

KCNQl 

KVLQTl 

Kv7.2 

KCNQ2 

KQT2 

Kv7.3 

KCNQ3 


Kv7.4 

KCNQ4 


Kv7.5 

KCNQ5 


Kv8.1 

KCNVl 


Kv8.2 

KCNV2 


Kv9.1 

KCNSl 


Kv9.2 

KCNS2 


Kv9.3 

KCNS3 


KvlO.l 

KCNHl 

eag family 

KvlO.2 

KCNH5 


Kvll.l 

KCNH2 

erg family 

Kvll.2 

KCNH6 


KvlL3 

KCNH7 


Kvl2.1 

KCNH8 

elk family 

Kvl2.2 

KCNH3 


Kvl2.3 

KCNH4 



potassium channels inactivate rapidly after opening. 
The major role of A-type potassium channels is to set 
the firing rate of the action potentials (Hille 2001). 

For voltage-gated potassium channels, the protein 
conformation changes from nonconducting state to 


Potassium Channels: Their Physioiogical and Molecular 
Diversity, Table 2 List of human calcium-activated potassium 
channels 


Channel name 

Gene locus 

Trivial name 

KCal.l 

KCNMAl 

BK; Slot 

KCa2.1 

KCNNl 

SK-1 

KCa2.2 

KCNN2 

SK-2 

KCa2.3 

KCNN3 

SK-3 

KCa3.1 

KCNN4 

IK-1 

KCa4.1 

KCNTl 

Slack 

KCa4.2 

KCNT2 

Slick 

KCaS.l 

KCNUl 

Slo3 


a conducting state upon membrane depolarization. 
Crystal structure studies have revealed that the channel 
gate locates at the bundle crossing of the S6 segments 
at the cytoplasmic side and is lined primarily by hydro- 
phobic residues. The hydrophobic gate primarily con¬ 
trols the water flow through it and indirectly affects the 
ion movement. Membrane depolarization reorients the 
position of the S4 segment, the voltage sensor, and 
broadens the diameter at the bundle crossing. The 
highly conserved S4 segment contains several posi¬ 
tively charged residues (lysine or arginine) spaced 
every three residues along the S4. This S4 segment is 
surrounded by other segments (SI, S2, S3, S5, and S6) 
and its charged residues are well protected from the 
hydrophobic lipid bilayer environment. The S4 moves 
upon membrane potential change to control the 
channel gate on the pore domain (Long et al. 2007; 
Tombola et al. 2005). 

Some voltage-gated potassium channels gradually 
reduce their ion flow after opening, while the mem¬ 
brane potential Is still depolarized. This phenomenon 
is called “inactivation.” A “ball-and-chain” model has 
been proposed to explain this phenomenon: the con¬ 
formational change of channel opening produces 
a receptor site for the inactivation ball, which can 
obstruct the conductive pathway. Site-directed muta¬ 
tion revealed that the 20 amino acids at N-terminus of 
Shaker potassium channels form the inactivation ball, 
and several positively charged residues contribute to 
the voltage-dependent inactivation phenomenon of 
Shaker potassium channels. The binding of the inacti¬ 
vation ball to the pore is accelerated at depolarized 
potential. Deletion of the N-terminus eliminates the 
inactivation; cytoplasmic application of peptides com¬ 
posed of the N-terminal amino acid sequence restores 
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Potassium Channels: Their Physiological and Molecular 
Diversity, Fig. 2 (a) Computer-simulation of strong inwardly 
rectifying potassium currents in response to a family of voltage 
pulses from a holding potential of —70 mV to command voltages 
of —80 to -t60 mV in 10 mV increment in symmetrical potassium 
solutions (high [K"^] on both sides), (b) Current-voltage 


relationship of steady-state inwardly rectifying potassium 
currents. The dash line represents an ideal Ohmic conductance, 
(c) Membrane topology of a representative inwardly rectifying 
potassium channel subunit, (d) Phylogenetic tree of 1P/2TM 
potassium channel family 


the inactivation process. The binding site of the inac¬ 
tivation ball is located at the channel pore accessible 
from the cytoplasmic side of the membrane, and some 
channel blockers such as tetraethylammonium (TEA) 
bind to this site (Yellen 2002). 


Calcium-Activated Potassium Channel 

Calcium-activated potassium channels (Kca) consti¬ 
tute a distinct potassium channel family within the 
Kv potassium channels. These Kca channels require 
intracellular calcium for activation and can be further 
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categorized according to their single channel conduc¬ 
tance: small conductance (2-25 pS) Kca channels 
(SK), intermediate conductance (25-100 pS) Kca 
channels, or large conductance (100-300 pS) Kca 
channels (BK) (Kaczorowski and Garcia 1999; Wei 
et al. 2005). Figure 1 illustrates the fact that these 
channels form two well-dehned hut only distantly 
related groups within the voltage-gated potassium 
channel family. One group includes the SK (Kca 2.1, 
2.2, and 2.3) and IK (Kca 3.1) (Table 2). These chan¬ 
nels are voltage-insensitive and are activated by 
submicromolar concentrations of intracellular cal¬ 
cium. In contrast, the second group BK (Kcal-l) is 
activated by both voltage and intracellular calcium, 
and its voltage dependence is also related to intracel¬ 
lular calcium (Flille 2001). BK Kca channels are also 
unique because there is an additional transmembrane 
domain (SO) preceding the SI transmembrane domain 
so that the BK Kca channel N-terminal domain is 
exposed to the extracellular side rather than the intra¬ 
cellular side (Fig. 1). Both SK and IK Kca channels 
play pivotal roles in many physiological processes 
involving calcium-dependent signaling in both 
excitable and nonexcitable cells. Intracellular calcium 
does not bind directly to the channel protein; the 
calcium sensitivity of these channels is regulated via 
calmodulin, which is constitutively bound to the 
C-terminal domain. Binding of calcium to this calmod¬ 
ulin causes conformational changes that open the chan¬ 
nel. On the other hand, calcium ions bind directly to the 
C-terminal region of the BK Kca channel protein. 
Those calcium-binding sites are rich in negatively 
charged amino acids and are known as the “calcium 
bowl.” The BK Kca channels are involved in diverse 
physiological function such as controlling the action 
potential bursting duration, spike-frequency adapta¬ 
tion, and some of them can act as pacemakers in 
spontaneously active cells (Hille 2001). 

Inwardly Rectifying Potassium Channels 

Potassium channels that preferentially pass inward 
rather than outward potassium ions flow are called 
inwardly rectifying potassium channels (Kir). They 
show steep voltage dependence. However, unlike Kv 
channels, with intrinsic voltage sensors, the voltage 
dependence of Kir channel arises from binding of 
magnesium or polyamines to the pore and occlusion 
of the conductive pathway. Although at resting mem¬ 
brane potential Kir channels are blocked by 


Potassium Channels: Their Physiological and Molecular 
Diversity, Table 3 List of human inwardly rectifying potas¬ 
sium channels 


Channel name 

Gene locus 

Trivial name 

Kirl.l 

KCNJl 

ROMK 

Kiri.2 

KCNJIO 


Kiri.3 

KCNJl 5 


Kiri.4 

KCNJl 3 


Kir2.1 

KCNJ2 

IRKl 

Kir2.2 

KCNJl 2 

IRK2 

Kir2.3 

KCNJ4 

IRK3 

Kir2.4 

KCNJl 4 

IRK4 

Kir2.6 

KCNJl 8 


Kir3.1 

KCNJ3 

GIRKl 

Kir3.2 

KCNJ6 

GIRK2 

Kir3.3 

KCNJ9 

GIRK3 

Kir3.4 

KCNJ5 

GIRK4 

Kir4.1 

KCNJIO 


Kir4.2 

KCNJl 5 


Kir5.1 

KCNJl 6 


Kir6.1 

KCNJ8 

KATP 

Kir6.2 

KCNJll 


KirV.l 

KCNJl 3 



intracellular magnesium and/or polyamines, those 
pore blockers are expelled from the pore when the 
electrochemical driving force favors inward move¬ 
ment of potassium ions (Hibino et al. 2010). Different 
inwardly rectifying potassium channels exhibit differ¬ 
ent degrees of inward rectification. Kir2 families show 
strong inward rectification, while Kiri, Kir3, and Kir6 
families show weak inward rectification. Kir channels 
have a 1P/2TM topology and the two transmembrane 
segments. Ml and M2, are equivalent to the S5 and S6 
transmembrane segments of a Kv channel (Fig. 2) 
(Kubo et al. 2005). Kir channels also form homo- or 
hetero- tetramers with other Kir proteins from the same 
family group (Table 3). In order to be targeted to the 
plasma membrane correctly and function normally, 
Kir6 family need to coassemble with additional mem¬ 
brane proteins called sulfonylurea receptors thereby 
shielding the ER-retention signals on their C-terminus 
(Bichet et al. 2003; Seino 1999). Kir channels can 
maintain the membrane potential near the potassium 
equilibrium potential (Ek) during resting condition but 
have little effect during action potential firing since 
they are mostly shut at depolarized potentials. In 
nonexcitable tissues such as kidney or intestine, Kir 
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Potassium Channels: Their Physiological and Molecular 
Diversity, Fig. 3 (a) Computer-simulation of typical 2-pore 
domain potassium currents in response to a family of voltage 
pulses from a holding potential of —70 mV to command voltages 
of —80 to -t60 mV in 10 mV increment in physiological solutions 


(high [K"^]! and low[K'^]o). (b) Current-voltage relationship of 
steady-state 2-pore domain potassium currents. The dash line 
represents an ideal Ohmic conductance, (c) Membrane topology 
of a representative 2-pore domain potassium channel subunit, 
(d) Phylogenetic tree of 2P/4TM potassium channel family 


facilitates ion transport across the cell membrane. The 
gating of Kir channels are also quite diverse: Kiri 
family is controlled by pH, Kir3 are controlled by 
intracellular G proteins, and Kir6 family is controlled 
by intracellular ATP (Hibino et al. 2010; Hille 2001). 


Two-Pore Domain Potassium Channels 

As mentioned previously, the selective filter region 
around the potassium channel signature sequence 
GYG in the pore domain is highly conserved in all 
potassium channels. Instead of traditional molecular 
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Potassium Channels: Their Physiological and Molecular 
Diversity, Table 4 List of human 2-pore domain potassium 
channels 


Channel name 

Gene locus 

Trivial name 

K2pLl 

KCNKl 

TWIK-1 

K2p2.1 

KCNK2 

TREK-1 

K2p3.1 

KCNK3 

TASK-1 

K2p4.1 

KCNK4 

TRAAK 

K2p5.1 

KCNK5 

TASK-2 

K2p6.1 

KCNK6 

TWIK-2 

K2p7.1 

KCNK7 

TWIK-3 

K2p9.1 

KCNK9 

TASK-3 

K2pl0.1 

KCNK 10 

TREK-2 

K2pl3.1 

KCNK13 

THIK-1 

K2pl5.1 

KCNK 15 

TASK-5 

K2pl6.1 

KCNK16 

TALK-1 

K2pl7.1 

KCNK 17 

TALK-2 

K2pl8.1 

KCNKl 8 

TRESK 


cloning, searching genome databases have also 
revealed novel genes with potassium channel signature 
sequences. A third family of potassium channels with 
a unique structure two-pore domain potassium chan¬ 
nels (K 2 p) is identified in this fashion (Fig. 3). Unlike 
Kv, Kca and Kir channels, which have only one pore 
region per subunit, a striking feature of K 2 P channels is 
their structural design: each subunit has two pore 
domains and four TM domains. This distinct 2P/4TM 
topology can be found in more than 70 predicted 
homologs in genome databases and 15 human genes 
in the family are designated as KCNK genes encoding 
the K 2 P channels (Fig. 3 and Table 4); they most 
readily reveal ion channel function upon expression. 
Presumably, these K 2 P channels are dimers of two K 2 P 
subunits. Biophysical characterizations reveal that K 2 P 
channels contribute to a background leak conductance, 
that is, a voltage-independent Ohmic conductance 
showing Goldman-Hodgkin-Katz rectification 
(Hille 2001). Those potassium leak currents stabilize 
membrane potential below the threshold for action 
potential bring and expedite repolarization in excitable 
cells. As expected for regulators of excitability, K 2 P 
channels are under tight control by a plethora of chem¬ 
ical and physical stimuli, including oxygen tension, 
pH, lipids, mechanical stretch, neurotransmitters, and 
G protein-coupled receptors; the channels are also the 
molecular targets for certain volatile and local 
anesthetics. 


Summary 

Potassium channels play pivotal roles in both excitable 
and nonexcitable cells such as setting the resting mem¬ 
brane potential, shaping action potentials, and regulat¬ 
ing cell volume. With over 80 genes encoding 
potassium charmels identibed in human, its molecular 
and biophysical properties, channel assembly and traf- 
bcking, and modulation by secondary messengers, 
transmitters, and hormones have been elucidated to 
a large extent. 

Cross-References 

► Calcium Activation of Channels: RCK 
Domains 

► Ion Channel Regulation by G-Protein-Coupled 
Receptors: Recent Advances with Optical 
Biosensors 

► Potassium Channel Selectivity and Gating at the 
Selectivity Filter: Structural Basis 

► Potassium Channels: Their Physiological and 
Molecular Diversity 

► Protein Structure 
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Synonyms 

Bioanalytical sensors; Biodetectors; Biodevices; 
Electrochemical sensors; ISE (ion-selective electrode) 

Definition 

A potentiometric biosensor can be defined as a device 
incorporating a biological sensing element connected 
to an electrochemical potential transducer. Potentio¬ 
metric biosensors usually rely on a biochemical 
reaction leading to a simpler chemical specie and 
its subsequent electrochemical detection (NH 4 OH, 
CO 2 , pH, H 2 O 2 , ...). The analytical signal generated 
by a potentiometric biosensor is an electrical potential. 


Introduction 

A biosensor is an analytical device comprising a bio¬ 
logical element capable to recognize an analyte, 
coupled to a transducer which generates a signal pro¬ 
portional to the concentration of the analyte. The gen¬ 
eral architecture of a potentiometric biosensor is given 
in Eig. 1 . In response to a target analyte, the biological 
recognition element generates a biochemical signal, 
which is converted by the transducer in a detectable 
potential that can be correlated with the analyte 
concentration. 

Consequently, the key points required for the devel¬ 
opment of biosensors are (Eggins 2002): 

- Recognition elements that impart the selectivity 
enabling the biosensor to respond specifically 
to a particular analyte, thus avoiding interferences 
from other species. Biorecognition elements 
include enzymes, antibodies, and microorganisms. 

- Physicochemical transducers that are able to con¬ 
vert the biological response into electrical potential, 
current, impedance, optical, piezoelectric, or calo¬ 
rimetric signals. Amongst these methods, electrical 
potential is especially interesting because potentio¬ 
metric sensors are easy to use and applicable 
for the detection of many different analytes and 
can be miniaturized using silicon or thick-film 
technologies. 

- Methods of immobilization of the recognition 

element that must be connected to the transducer. 
Various methods can be used including physical 
adsorption onto a surface, entrapment of the 
element in a polymer or gel matrix, formation 
of covalent chemical bonds between the 

element and the transducer, possibly via 

a crosslinker, or encapsulation of the element in 
a membrane. 

- Definition of measurable performance factors of the 
biosensors that include selectivity, probably the 
most important factor, sensitivity range, response 
time, lifetime, accuracy, and reversibility. How¬ 
ever, the importance of each of these performance 
parameters will strongly depend on the final 
application. 

Since 1962, when Clark and Lyon proposed the first 
glucose biosensor (Clark and Lyons 1962), biosensors 
have been developed and used for various applications, 
including public health, environmental monitoring, or 
process control (Eggins 2002). 
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Potentiometric Biosensors, 
Fig. 1 Schematic 
presentation of 
a potentiometric biosensor 
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Public health, and particularly biomedical analysis, 
is the main area of application of biosensors because 
dosing blood components, ions, or metabolites is 
strongly needed for diagnosis. Currently, bioanalytical 
sensors have already been developed for the detection 
of sodium, potassium, calcium ions, pH, oxygen, glu¬ 
cose, urea, lactate, creatinine, etc. 

There is also a need for environmental monitoring, 
particularly for pollution applications, farming, or gar¬ 
dening. Analyses in water concern acidity, salinity, 
phosphate, nitrate, nitrite, organic molecules, or 
heavy metals. Pesticides, herbicides, fertilizers, indus¬ 
trial, and domestic wastes also need biosensing. 
Concerning the control of industrial processes, sensors 
are used for the control of fermentation, either off-line 
in a laboratory or online in real time. Industries 
involved in pharmaceuticals and food processing 
need biosensors for monitoring processes such 
as microbial and cell cultures. Currently, 
biosensors exist which allow to monitor a range of 
compounds such as sugars, yeasts, malts, alcohols, or 
phenols. 


E, to the activities of the species responsible for the 
response (Eq. 1): 


£ = £o+^ln(^ 

ZiF \ < 


sample^ 


( 1 ) 


where Eg is the standard electrode potential of the 
sensor electrode, is the activity of the ion in 

the sample, a[‘^^ is the activity of the ion in the refer¬ 
ence of the sensor electrode, R Is the universal gas 
constant, T is the absolute temperature, F is the 
Faraday constant, and z, is the number of electrons 
implied in the elementary redox reaction. 

The main types of potentiometric sensors are 
membrane-based Ion-Selective Electrodes (ISE), 
Screen-Printed Electrodes, Ion-Selective Field Effect 
Transistors (ISFET), Solid-State devices, and Chemi¬ 
cally modified electrodes (using, e.g., metal oxides or 
electrodeposited polymers as sensitive layers). 


Enzyme Biosensors 


Principles of Potentiometric Sensors 

Potentiometry has proven to be an efficient and attrac¬ 
tive analytical technique for many practical applica¬ 
tions. In potentiometric sensors, the potential 
difference between a reference electrode and the indi¬ 
cator electrode Is measured, with a very high imped¬ 
ance voltmeter, under conditions of quasi null current 
flow (Yunus et al. 201 1). The reference electrode pro¬ 
vides a constant potential against which the potential 
of the indicator electrode varies depending on the 
activity of an ion in the sample to be tested. The 
principles of potentiometric measurements are found 
in the Nernst equation, which relates the potential 
difference between reference and indicator electrodes. 


Enzymatic biosensors constitute an important area of 
biomedical research. Enzymes interact specifically 
with some substrates, and can thus be used for the 
detection of these substrates as, for example, glucose 
by glucose oxidase, creatinine by creatinine 
amidohydrolase, or urea by urease. Among these 
analytes, urea and creatinine have received consider¬ 
able attention leading to the development of many 
potentiometric biosensors for use in clinical, food, 
and environmental areas. 

Urea Biosensors 

Urea is the main end product of protein metabolism. 
It has thus a great significance in medical diagnosis 
by being the main indicator of kidney dysfunction. 
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The physiological level of urea in serum ranges from 
3.0 to 7.5 mmol.L“\ High urea levels can be caused by 
severe renal dysfunction and are eventually fatal if not 
treated by dialysis and eventually kidney transplanta¬ 
tion. In dairy milk, urea levels can be used to track and 
restrain adulteration used for commercial benefits, or 
assess the nutritional program of cows and particularly 
lactating dairy cows. Finally, urea can cause environ¬ 
mental stress because it acts as a nitrogenous fertilizer. 
It decomposes to ammonia, which is very toxic, 
thereby polluting the streams and rivers into which it 
ends up. 

Urea biorecognition can be based on the highly 
effective and selective catalytic hydrolysis by the ure¬ 
ase enzyme. Products of the hydrolysis, NH 4 '^ and 
OH“, clearly indicate that biosensors based on pH 
detection or ammonium detection are applicable for 
urea quantification (Eq. 2). 

HjN - CO - NHz -f 3 H 2 O 2NH+ 

-1- HCOg^ -P OH- (2) 

A large number of potentiometric urea biosensors are 
reported in the literature because urease is a cheap and 
stable biocatalyst that is widely used as model enzyme 
for the development of new enzyme immobilization 
protocols, new urea-sensitive materials and new sens¬ 
ing schemes. 

pH-sensitive urea biosensors. Urea sensing through 
pH mediation is possible due to the enzymatic hydro¬ 
lysis and the formation of alkaline products (Eq. 2). 
The localization of the hydrolysis in the close proxim¬ 
ity of the surface of the pH-sensor ensures a fast and 
prompt potential decrease of its potential. Common 
calibration graphs of such pH-mediated potentiometric 
biosensors describe a sigmoidal shape as a function of 
pH. Potentiometric urea biosensors can be composed 
of various pH-sensitive layers, the main condition 
being the dependence of the sensitive material to pH 
changes. The most commonly used materials are oxide 
metals (Tymecki et al. 2005), electrodeposited poly¬ 
mer films (Rajesh et al. 2005), and charged polyelec¬ 
trolytes (Lakard et al. 2011) adsorbed on a substrate. 
For the two latter, the polymers contain amino or 
carboxylate functions that can easily be protonated or 
deprotonated. The enzyme is often covalently attached 
to the outer layer of the polymer films through the 
carbodilmlde coupling reaction: a carboxyl group on 
the support is reacted with a carbodiimide, then the 


product is coupled with an amine group on the bioma¬ 
terial to form an amide bond between the support and 
the enzyme (Magnin et al. 2008). This covalent immo¬ 
bilization technique greatly improves the sensor’s life¬ 
time but sometimes at the cost of sensitivity. It is also 
worth mentioning that a variety of such biosensors are 
mass-produced in miniature format by means of sev¬ 
eral thick and thin film technologies including ISFET 
technology (Sant et al. 2003) and screen-printing tech¬ 
nology (Tymecki et al. 2005). In addition, these bio¬ 
sensors are almost insensitive to alkaline cations existing 
in real samples of importance from a biomedical point of 
view. The main drawback of these biosensors is their 
sensitivity to the buffer capacity of the samples. In addi¬ 
tion, it is important to note that the enzyme activity and 
stmcture have an optimal pH working range. Conse¬ 
quently, a too large increase of buffer capacity will 
lower pH changes, while detection in a unbuffered solu¬ 
tion will intrinsically affect the enzyme activity itself. 

Ammonium-sensitive urea biosensors. Ammonium 
ion-selective electrodes are also amongst the most 
widely used potentiometric urea biosensors due to 
their fast and reproducible response and long lifetime. 
Contrary to pH-based urea sensors, they are not sensi¬ 
tive to the buffer capacity of the analyzed samples since 
they are not sensitive to protolytic species, and so can be 
used for accurate analysis in all kind of biomedical 
samples including serum and blood having high buffer 
capacity. On the contrary, the selectivity of ammonium- 
mediated biosensors can be poor since they also respond 
to endogenous ammonia and to other alkaline cations. 
This is an important drawback since ammonia, sodium, 
and potassium are present at high level in urines. Inter¬ 
ferences are usually circumvented by using buffers with 
highly controlled compositions. The presence of ammo¬ 
nium and/or interferents can significantly increase the 
baseline signal and decrease the sensitivity of the bio¬ 
sensors. Such biosensors have been successfully 
applied for the accurate determination of serum urea at 
physiological and pathological levels. Field effect tran¬ 
sistors (FET) combining an ammonium-selective elec¬ 
trode and a reference electrode are frequently composed 
of urease grafted through a carbodiimide reaction to a 
membrane matrix, such as carboxylated PVC (Senillou 
et al. 1999; Walcerz et al. 1998). 

Creatinine Biosensors 

Creatinine is found together with creatine in muscle 
tissue and in blood. It is an important bioanalyte of 
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clinical significance used to evaluate renal, muscular, 
and thyroid dysfunctions. In contrast to urea, the con¬ 
centration of creatinine is not influenced by the protein 
intake, and therefore its level is a more reliable indi¬ 
cator of renal function. Creatine and phosphocreatine 
are also excreted from the body. The amount of creat¬ 
inine in the urine is proportional to the amount of 
creatine and creatine phosphate present in the body 
and also in the muscular mass. Normal physiological 
levels of serum creatinine range from 100 pmol.L”* in 
normal cases up to 1,000 pmol.L”^ during severe 
disorder or disease. 

The potentiometric sensing of creatinine is mainly 
based on detection of ammonia, the by-product of the 
enzymatic conversion of creatinine (Eq. 3) (Koncki 
2007). 

creatinine + HjO iminohydrolase 

-^ (3) 

methylhydantoin + NH 35 

As previously described for urea biosensors, potentio¬ 
metric creatinine biosensors detection can be based on 
pH-sensitive or-ammonium sensitive sensors. The 
advantages and drawbacks of resulting sensors are 
consequently very similar: problems due to buffer 
capacity for pH sensors and problems due to lack of 
selectivity for ammonium sensors. An additional draw¬ 
back is the presence of creatinine in significantly lower 
levels than urea in humans (approximately one order of 
magnitude) rendering the detection more difficult and 
leading to stronger interferences in real samples. 
Potentiometric creatinine biosensors have more often 
been developed using pH-sensitive membrane elec¬ 
trodes (Radomska et al. 2004), pH-ISFETs (Soldatkin 
et al. 2002), or ammonium ion-selective electrodes 
(Rasmussen et al. 2007). 

Immunosensors 

When potentiometric sensors are based on antibodies 
used as molecular biorecognition elements through the 
formation of a stable complex, the device is called 
immunosensor. In essence, immunosensing can 
encompass a broader spectrum of target analytes in 
comparison to enzyme-based biosensing. 
Immunosensing is also selective because of the high 
specificity of immunoreactions. Consequently, 
immunosensors constitute an attractive method for 


clinical diagnosis, environmental monitoring, and 
food analysis. The strategy of potentiometric 
immunosensors is based on the change in the potenti¬ 
ometric response before and after antigen-antibody 
reaction. 

Label-Based Biosensors 

Electrochemical immunosensors generally rely on the 
immobilization of antibodies onto an electrode sur¬ 
face, which react with free antigens in competition 
with labeled antigens. After the antibody-antigen 
interaction, the recognition is revealed by the enzyme 
activity measurement involving the addition of 
a specific substrate. The sensing mode of 
immunosensors can be amperometric, potentiometric, 
or impedimetric. Most of the time immunosensors are 
amperometric since: (1) the most commonly used 
enzyme is horseradish peroxidase that produces com¬ 
pounds that are detected amperometric ally, 
(2) amperometry leads to low detection limits and 
high sensitivity. However, some potentiometric 
immunosensors have also been developed. An exam¬ 
ple of immunosensor consisting of polypyrrole-coated 
screen-printed gold electrodes combined with horse¬ 
radish peroxidase is reported in the literature, which 
converts o-phenylenediamine into 2,3- 
diaminophenazine (Purvis et al. 2003). This 
immunosensor was used to detect hepatitis B surface 
antigen, troponin I, digoxin, and tumor necrosis factor. 
Another type of potentiometric immunosensor is based 
on nanoparticle labeling and ion-selective electrodes 
(ISE). For example, a protein biosensor has been 
obtained using CdSe nanoparticles on a secondary 
antibody in a sandwich immunoassay. CdSe quantum 
dot labels were dissolved by oxidation with H 2 O 2 and 
released cations that were detected with cadmium ion- 
selective electrodes (Thurer et al. 2007). A similar 
sandwich immunoassay was developed using second¬ 
ary IgG conjugated with silver and gold nanoparticle 
labels (Chumbimuni-Torres et al. 2006). These works 
demonstrate that the combination of nanoparticle 
labels and ISE leads to improve detection limits and 
is attractive for bioassays. 

Label-Free Biosensors 

When the immune interaction between antibody and 
antigen is directly monitored, the device is named 
label-free immunosensor. Such immunosensors 
exhibit fast responses that rely on simple operation. 
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A first interesting label-free method has been devel¬ 
oped for ISFETs. In this example, protein molecules 
are bound onto the ISFET sensor, which detects, using 
a dynamic measurement, the ion-step response of the 
release or uptake of protons by proteins. The efficiency 
of this method was proved using an ISFET covered 
with a monolayer of latex beads coated with human 
serum albumin (Besselink et al. 2003). Other 
immunosensors were developed in which the receptor 
parts are based on respective antibodies adsorbed on 
colloidal gold nanoparticles. The biomodified 
nanoparticles were located into a matrix modifying 
the electrodes of the sensor (Tang et al. 2005). Another 
type of potentiometric labelless immunosensor uses 
Carbon NanoTubes (CNT). CNTs are synthesized by 
Chemical Vapor Deposition to form the channels of the 
CNT-FETs (Tans et al. 1998), which are highly sensi¬ 
tive labelless biosensors, thanks to a geometry charac¬ 
terized by its high surface-to-volume ratio. When 
a charged biomolecule adsorbs on the channel of the 
CNT-FET, a reduction in source-drain current is 
observed. Biomolecules have been detected using 
CNT-FETs. For example, an immunosensor was 
developed using CNT-FETs for a cancer marker, 
PSA (Fi et al. 2005). PSA-mAb was immobilized 
with a linker and then PSA was detected selectively. 

Microbial Sensors 

A microbial biosensor is a device which combines 
microorganism(s) and a physical transducer, capable 
of generating a signal proportional to the concentration 
of some analyte. Many microbial biosensors have been 
developed recently for applications in environment, 
food, or medicine. Electrochemical and optical micro¬ 
bial biosensors are the most widely studied. Optical 
microbial sensors use fluorescent, bioluminescent, and 
colorimetric sensing techniques. Electrochemical 
microbial sensors use amperometry, impedancemetry, 
or potentiometry. Potentiometric microbial biosensors 
are generally sensitive to pH changes or CO 2 release. 
They can consist of modified electrodes or field-effect 
transistors. Recently, a potentiometric biosensor based 
on a pH electrode modified with Pseudomonas 
aeruginosa was developed for the selective and rapid 
detection of the cephalosporin group of antibiotics. The 
microbial enzymatic hydrolysis of cephalosporin pro¬ 
duced a pH change near the pH electrode that generated 


a potentiometric response (Kumar et al. 2008). Another 
potentiometric biosensor was developed for the identi¬ 
fication of p-lactam residues in milk. This analytical 
device consisted of Bacillus stearothermophilus and 
a COi-sensitive electrode, which measured its release 
by the microorganisms. The presence of P-lactams 
inhibited the microbial growth and, consequently, the 
CO 2 production rate (Ferrini et al. 2008). In addition, 
a potentiometric whole-cell biosensor based on a pH 
field-effect transistor has been successfully applied to 
the detection of Escherichia coli K12 bacterial activity 
using a three-dimensional agarose as the immobiliza¬ 
tion matrix (Bettaieb et al. 2007). 

Summary 

Potentiometric biosensors are devices incorporating 
a biological sensing element connected to a physico¬ 
chemical transducer, providing an electrical potential as 
analytical signal. The main characteristics of such bio¬ 
sensors include a recognition element, a transducer, a 
method of immobilization of the sensing element, and 
performance parameters such as selectivity, sensitivity 
range, response time, lifetime, accuracy, and reversibil¬ 
ity. These biosensors can be useful for applications in 
numerous areas ranging from public health and environ¬ 
mental monitoring to the control of industrial processes. 
In this entry, enzymatic biosensors, immunosensors 
(based on antibody-antigen reactions), and microbial 
sensors have been successfully described. The most 
widely developed enzymatic sensors are urea biosensors 
(often used as a model case) and creatinine biosensors; 
these biosensors use pH or ammonium-sensitive mate¬ 
rials. Concerning immunosensors, two main types of 
biosensors are currently developed; label-based biosen¬ 
sors using either enzymes or nanoparticles for labeling, 
and labelless biosensors in which the immune interac¬ 
tion between antibody and antigen can be directly mon¬ 
itored. Finally, microbial biosensors are developed. The 
latter use microorganism(s) to generate an analytical 
signal. These biosensors are generally sensitive to pH 
changes or CO 2 release. 

Cross-References 

► Amperometric Biosensors 

► Carbon Nanotubes as Biomaterials 
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► Peroxidases 

► Protein Fluorescent Dye Labeling 

► Redox Potential 
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Hydrostatic pressure is introduced as a parameter for 
studying structural, thermodynamic, and kinetic 
properties of lipid membranes. 

Introduction 

Notably, the greatest portion of the biosphere on Earth 
is in the realm of environmental extremes 
(Daniel et al. 2006). Although hydrostatic pressure 
significantly influences the structural and dynamical 
properties and thus functional characteristics of cells, 
this has not prevented life from invading high pressure 
habitats. About 70% of the surface of the Earth is 
covered by oceans, and the average pressure on the 
ocean floor is about 380 bar (0.1 MPa = 1 bar, 
1 GPa = 10 kbar). Cold- and pressure-adapted species 
are even found on the deepest ocean floor 
(at ^ 11,000 m in the Mariana Trench) and in deep-sea 
sediments where pressures of about 1 kbar and more 
may be reached (Lauro and Bartlett 2007). 

Interest in pressure as a thermodynamic and kinetic 
variable has been growing also in physicochemical 
studies of biological materials in recent years 
(Gross and Jaenicke 1994; Heremans and Smeller 
1998; Akasaka 2006; Winter et al. 2007; Mishra and 
Winter 2008) due to the following reasons: 
(1) A change in temperature of a system produces 
a simultaneous change in thermal energy and volume. 
Hence, to separate thermal and volume (density) 
effects, also pressure-dependent experiments are 
required. (2) The use of pressure allows one to tune 
intermolecular interactions without major perturba¬ 
tions which are often produced by changes in temper¬ 
ature or by addition of cosolutes. (3) Pressure affects 
chemical equilibria and reaction rates. Le Chatelier’s 
principle predicts that the application of pressure shifts 
thermodynamic equilibria toward the direction that 
occupies a smaller volume, and accelerates processes 
for which the transition state has a smaller volume than 
the ground state. Knowledge of AV° and AV* values, 
the reaction and activation volume changes, respec¬ 
tively, discloses additional information about the 
nature of the reaction and its mechanism (e.g., on 
hydrational changes or the kind of interactions 
involved). 

The covalent structure of low molecular mass 
biomolecules (peptides, lipids, and saccharides), as 
well as the primary structure of biomacromolecules 


(proteins, nucleic acids, and polysaccharides), is 
generally not perturbed by pressures up to ^20 kbar. 
Pressure acts predominantly on the conformation and 
supramolecular structure of biomolecular systems 
such as lipid mesophases. By using a variety of 
biophysical techniques, the pressure-dependent struc¬ 
ture and phase behavior of biomolecular systems has 
been studied in recent years. Applying the 
pressure-jump relaxation technique in combination 
with time-resolved synchrotron X-ray diffraction and 
spectroscopic techniques, the kinetics of biomolecular 
reactions has been studied. Among the most pressure 
sensitive systems are lipid membranes. In the follow¬ 
ing sections, selected results will be discussed. Exper¬ 
imental aspects can be found elsewhere (Akasaka 
2006; Jonas 1991; Winter et al. 2007, and refs, therein). 

Pressure Effects on Lipid Membranes 

The basic structural element of cellular membranes 
consists of a lamellar ► phospholipid bilayer matrix. 
Saturated phospholipids often exhibit two thermo¬ 
tropic lamellar ► phase transitions, a gel-to-gel 
(Lp'-Pp') pretransition and a gel-to-liquid-crystalline 
(Pp'-Lc() main (chain melting) transition at a higher 
temperature, T^. In the physiologically more relevant 
fluid-like Lo, phase, the acyl-chains of the lipid bilayers 
are conformationally disordered (“melted”), whereas 
in the gel (solid-ordered) phases, the chains are 
extended and ordered. In addition, a variety of pres¬ 
sure-induced lipid phase transformations have been 
observed (Winter et al. 2007). A common slope of 
^22°C kbar * has been observed for the gel-fluid 
phase boundary of saturated phosphatidylcholines as 
shown in Fig. 1. Assuming the validity of the 
Clapeyron relation describing first-order phase transi¬ 
tions, dTjn/dp = T^AVm/^m, the positive slope can be 
explained by an exothermic enthalpy change, AH^, 
and a partial molar volume decrease, AV^, for the 
fluid-to-gel transition. Only the slopes of the 
di-cA-unsaturated lipids have been found to be mark¬ 
edly smaller. The two cA-double bonds of DOPC and 
DOPE lead to very low transition temperatures and 
slopes, as they impose kinks in the linear conforma¬ 
tions of the lipid acyl-chains, thus creating significant 
free volume fluctuations in the bilayer so that the 
ordering effect of high pressure is reduced 
(Winter 2001). Hence, in order to remain in 
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Pressure Effects on Lipid Membranes, Fig. 1 T, p-phase 
diagram for the chain-melting transition of different phospho¬ 
lipid bilayer systems (the lengths and degree of unsaturation of 
the acyl-chains are denoted on the right). The fluid-like Lo,-phase 
is observed in the low-pressure, high-temperature region of the 
phase diagram; the ordered gel phase regions appear at low 
temperatures and high pressures, respectively 


a physiologically relevant, fluid-like state at high pres¬ 
sures, more of such c/i-unsaturated lipids are incorpo¬ 
rated into cellular membranes of deep sea organisms, 
one example of the homeoviscous adaption of organ¬ 
isms (Lauro and Bartlett 2007). Applying high pres¬ 
sure can lead to the formation of additional ordered 
phases, which are not observed under ambient pressure 
conditions (e.g., a partially interdigitated high-pressure 
gel phase). 

Cholesterol, which constitutes up to ^50% of the 
lipid content of animal cell membranes, has a marked 
effect of the temperature and pressure-dependent 
phase behavior of lipid bilayers as well. Measurements 
of the acyl-chain orientational order of the lipid bilayer 
system by measuring NMR spectra or 

steady-state ► fluorescence anisotropy has demon¬ 
strated the ability of cholesterol to efficiently regulate 



Pressure Effects on Lipid Membranes, Fig. 2 p,r-phase 
diagram of the ternary lipid mixture DOPC/DPPC/Chol 
(1:2:1). The potentially physiological relevant Ij + k two-phase 
coexistence region is marked in gray and depicted schematically 
in the drawing at the top 


the structure, motional freedom, and hydrophobicity of 
lipid membranes. At concentrations above ^30 mol% 
cholesterol, the order parameter values of the lipid 
chains are found to be almost independent of pressure, 
thus rendering lipid membranes rather resistant to ter¬ 
restrial pressures. 

T, p-phase diagrams of binary lipid mixtures and 
ternary systems (model raft mixtures) have been deter¬ 
mined as well (Winter et al. 2007). The latter systems 
are supposed to mimic distinct liquid-ordered lipid 
regions, called “rafts,” which seem to be also present 
in natural membranes and are thought to be important 
for a variety of cellular functions. As expected, lipid 
domain formation is also markedly influenced by 
pressure. Figure 2 exhibits the liquid-disordered/ 
liquid-ordered (Id/lo) phase coexistence region of 
a canonical model raft mixture (Jeworrek et al. 2008). 
Upon pressurization at ambient temperatures, an 
essentially ordered state is reached at pressures of 
about 1 kbar. Interestingly, in this pressure range, 
ceasing of membrane protein function in natural mem¬ 
brane environments has been observed for a variety of 
systems, which might be related to the membrane 
matrix reaching a physiologically unacceptable overall 
ordered state at these pressures. 
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Pressure Effects on Lipid Membranes, Fig. 3 (a) T, y^-phase 
diagram of DPPC bilayers in excess water. Besides the Gel 1 
(PpO. Gel 2 (Lp'), and Gel 3 phase, an additional crystalline gel 
phase (Lc) can be induced in the low-temperature regime after 


prolonged cooling which is not shown here, (b) Phase diagram of 
DPPC-gramicidin D (GD) (5 mol%) in excess water as obtained 
from diffraction and spectroscopic data 


Membrane proteins can constitute about 30% of the 
entire protein content of a cell and so, reside to 
a varying extent in the lipid environment where they 
act as anchors, enzymes, or transporters. Membrane 
lipids and proteins influence each other directly as 
a result of their biochemical nature and in reaction to 
environmental changes. Pressure studies have been 
carried out on a few channels and membrane proteins, 
only. Figure 3 shows the effect of the incorporation of 
the model channel peptide gramicidin D (GD) on the 
phase behavior of DPPC phospholipid hilayers 
(Winter et al. 2007). Gramicidin is polymorphic, 
being able to adopt a range of structures with different 
topologies. Depending on the gramicidin concentra¬ 
tion, significant changes of the lipid bilayer structure 
and phase behavior were observed. These include 
disappearance of certain gel phases formed by the 
pure DPPC system, and the formation of broad 
two-phase coexistence regions at higher gramicidin 
concentrations (Fig. 3b). Hence, not only the lipid 
bilayer structure and T, /j-dependent phase behavior 
drastically depends on the polypeptide concentration, 
but also the peptide conformation (and hence function) 
can be significantly influenced by the lipid environ¬ 
ment. No pressure-induced unfolding of the polypep¬ 
tide is observed up to 10 kbar. For large integral and 
peripheral proteins, however, pressure-induced 
changes in the physical state of the membrane may 
lead to a weakening of protein-lipid interactions as 
well as to protein dissociation. For example, the 
influence of hydrostatic pressure on the activity of 


Na"^, K'^-ATPase was studied using a kinetic assay. 
The data show that the activity, k, of Na"^, K'^-ATPase 
is inhibited reversibly by pressures below 2 kbar. The 
plot of Ink versus p reveals an apparent activation 
volume of the pressure-induced inhibition reaction 
which amounts to AF^ = 47 mL mol“\ At higher 
pressures, exceeding 2 kbar, the enzyme is inactivated 
irreversibly due to protein unfolding and interface 
separation (Chong et al. 1985; Powalska et al. 2007). 

For a series of lipid molecules, also non-lamellar 
lyotropic phases are observed as thermodynamically 
stable or long-lived metastable phases. Fundamental 
cell processes, such as endo- and exocytosis, mem¬ 
brane budding, and fusion, involve a rearrangement 
of biological membranes where such non-lamellar 
lipid structures are probably involved. Pressure-jump 
relaxation studies in combination with Synchrotron 
► X-ray diffraction have been used to reveal details 
of such processes (Conn et al. 2006, 2008; 
Seddon et al. 2003; Winter et al. 2007; Winter and 
Jeworrek 2009). 


Concluding Remarks 

Pressure work on biomolecular systems has been 
shown to yield a wealth of enlightening new informa¬ 
tion on their structure, dynamics, and phase behavior, 
and on the transition kinetics between their various 
conformational and functional states. But it is also 
clear that the application of the pressure variable in 
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biomolecular science is still in an early stage, in 
particular with respect to biochemical reactions and 
processes. These pressure studies are also important 
for the development of new biotechnological and 
pharmaceutical applications, such as high-pressure 
food processing, the development of new antiviral 
vaccines and drugs, pressure separation of 
protein complexes (inclusion bodies), and supercritical 
biocatalysis (Winter et al. 2007; Mishra and 
Winter 2008). 
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Introduction 

A major part of our biosphere is in the realm of envi¬ 
ronmental extremes, and for several decades now, the 
limits of the existence of life have been pushed to 
unexpected extremes of pressure, temperature, pH, 
and salinity (Lauro and Bartlett 2007; Daniel et al. 
2006). For example, deep-sea sediments and hydrother¬ 
mal vents are densely crowded with barophilic- 
thermophilic (i.e., pressure- and heat-adapted) species. 
Psychrophilic-barophilic (cold- and pressure-adapted) 
species, which live at ^2°C, are found on the deepest 
ocean floor (^11,000 m) in the Mariana Trench and in 
deep-sea sediments. Close to hydrothermal vents, gener¬ 
ally several thousand meters under the sea surface, even 
organisms far more complex than bacteria can be found at 
conditions of high hydrostatic pressure (HHP) (^300 bar) 
and high temperature (up to ^140°C). The extent of deep 
microbial ecosystems, which by some estimates may rival 
the total surface biomass, has been hardly explored, and 
questions regarding life in Earth’s deep interior, how life 
adapts to deep environments, and what novel biochemical 
pathways sustain life at high pressures and extreme tem¬ 
peratures remain largely unanswered. 

The covalent structure of low-molecular-mass bio¬ 
molecules as well as the primary structure of 
biomacromolecules (proteins, nucleic acids, and poly¬ 
saccharides) is generally not perturbed by pressures up 
to ^20 kbar (1 kbar = 100 MPa). Pressure acts pre¬ 
dominantly on the conformation and supramolecular 
structure of biomolecular systems. By using a variety 
of biophysical techniques, the pressure-dependent 
structure and dynamics of biomolecular systems such 
as proteins has been studied in recent years. Applying 
the pressure-jump relaxation technique in combination 
with time-resolved synchrotron X-ray diffraction and 
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Pressure Effects on Proteins, Fig. 1 Schematic p, T-stability diagram of a monomeric protein {left). As an example (right), thep, 
T-diagram of SNase as obtained by ► SAXS, FT-IR spectroscopic, and DSC measurements is shown (Panick et al. 1998) 


spectroscopic techniques, the kinetics of protein fold¬ 
ing reactions has also been explored. Here, a few 
selected results will be discussed. Finally, recent 
advances in using pressure for studying misfolding 
and aggregation of proteins will be elucidated. Exper¬ 
imental aspects can be found elsewhere (Akasaka 
2006; Arnold et al. 2003; Jonas 1991; Winter et al. 
2007, and refs, therein). 

Pressure Effects on Protein Structure and 
Stability 

Since the discovery of pressure-induced protein 
unfolding and denaturation by Nobel laureate P.W. 
Bridgman in 1914, it has been shown in many studies 
that hydrostatic pressure may lead to disruption of the 
intermolecular interactions maintaining the native pro¬ 
tein folded state, which is accompanied by a decrease 
of the volume of the protein-water system, and simul¬ 
taneous unfolding (Gross and Jaenicke 1994; 
Heremans and Smeller 1998; Silva et al. 2001; Winter 
et al. 2007). The thermodynamic stability of a protein 
is a multidimensional function of temperature, pres¬ 
sure, and solution conditions, yielding an energy land¬ 
scape as a multidimensional surface. When the 
solution conditions are kept constant, the stability of 
the protein is a simultaneous function of temperature 
and pressure, only. The Gibbs free energy difference 
between the denatured (unfolded) and native state. 


AuG(r, p), can be calculated to a certain approximation 
(Akasaka 2006; Winter et al. 2007). The transition line, 
where the protein unfolds upon a temperature or pres¬ 
sure change, is given by A^G — 0. The physically 
relevant solution of the curve in the p, T-plane has an 
elliptic-like shape (Fig. 1). While the heat-induced 
denaturation, which is mainly driven by a gain in 
configurational entropy, is mostly a highly cooperative 
process leading to a largely unfolded conformation, 
cold- and pressure-induced denaturation is known to 
be a milder form of denaturation, leading to a partial 
unfolding of the protein, only. According to the stabil¬ 
ity diagram shown in Fig. 1, pressure can destabilize 
proteins at ambient temperatures, which is facilitated 
by a combination of factors as follows; (1) High pres¬ 
sures lead to a weakening of hydrophobic interactions 
(Hummer et al. 1998). (2) The presence of cavities 
within the folded protein (or in the interface of oligo¬ 
mers) favors the unfolding (or dissociation) of these 
structures. (3) Also the dissociation of electrostatic 
interactions leads to a marked reduction in the volume 
caused by electrostrictive effects of water molecules 
around the exposed unpaired charged groups. The sta¬ 
bility diagrams and the folding reactions of oligomers 
and protein complexes are more complex. Oligomeric 
proteins and multiprotein assemblies often dissociate 
into individual subunits already at pressures of 
0.5-2 kbar (Winter et al. 2007). 

Since the cytoplasm of the biological cell comprises 
a complex aqueous solution of a variety of different 
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salts and osmolytes, the effect of different salts and 
cosolvents on the high-pressure stability of proteins 
has been studied as well (Winter et al. 2007). For 
example, upon addition of sucrose, an increase in the 
denaturation pressure is observed. The volume 
change upon unfolding, AV°, is almost independent 
of the presence of the cosolvent. With regard to tem¬ 
perature-induced denaturation, a different behavior of 
Ay° as a function of osmolyte concentration is 
observed, which is largely due to the strong tempera¬ 
ture dependence of AV° (Winter et al. 2007, and refs, 
therein). 

Also with respect to the kinetics of the folding 
reaction, pressure studies are of particular use, as 
they allow to evaluate the volume profile during the 
folding process (Pappenberger et al. 2000; Winter et al. 
2007) and to characterize the nature of the barrier to 
folding or unfolding and the corresponding transition 
state. Moreover, pressure studies present an important 
advantage due to the generally observed positive acti¬ 
vation volume for folding, the result of which is to slow 
down the folding reaction substantially, in turn 
allowing for relatively straightforward measurements 
of structural order parameters characteristic for folding 
intermediate states (Panick et al. 1998). For the folding 
reaction of SNase, the activation volume for folding 
has been found to be large and positive 
(AVf = 57 mL mol“*) and that for unfolding is small 
and negative (AV^ = ~23 mL mol“*). The volume of 
the protein solvent system in the transition state is thus 
significantly larger than that in the unfolded state and 
somewhat smaller than that in the folded state, so that 
the transition state lies closer to the folded than to the 
unfolded state in terms of system volumes. The positive 
activation volume of folding indicates that the transition 
state is accompanied by dehydration and chain collapse. 

Owing to recent methodological advances, remark¬ 
able progress has recently been made in high-pressure 
biomacromolecular crystallography. The status and 
prospects of protein crystallography at high pressures 
is discussed by Fourme et al. (2009). 

Using Pressure for Studying Protein 
Aggregation Phenomena 

If some failure occurs in achieving the native confor¬ 
mation of proteins, this will render the protein inactive, 
or even worse, and it can produce a misfolded 



Pressure Effects on Proteins, Fig. 2 Characteristic features 
of the high-pressure treatment of 0.5 wt.% insulin in aqueous 
solution (0.1 M NaCl, T = 60°C) at different stages of the 
aggregation process. The decay of the normalized thioflavin 
T (ThT, a fluorescence marker of fibril formation) intensity upon 
a pressure increase to 1 kbar after progressively longer incubation 
times is shown. At 35 and 60 min, the oligomeric states prevail, 
whereas at 2 h, the mature fibrillar state of insulin is reached, 
which exhibits pressure resistance (Mishra and Winter 2008) 

molecule that can interfere or block components of 
the cellular machinery to the point of causing cell 
malfunction. A wide range of human diseases are asso¬ 
ciated with aberrations in the folding process, such as 
Alzheimer’s disease, Parkinsons’s disease, prion-pro- 
tein-related encephalopathies, and type II diabetes 
mellitus. In an attempt to probe the stability and ener¬ 
getics of protein aggregates and amyloid fibrils, 
pressure perturbation has been in focus, recently 
(Silva et al. 2004; Meersman et al. 2006; Mishra and 
Winter 2008). 

The susceptibility of protein aggregates to pressure 
largely depends on the degree of the structural order of 
the aggregate: Fresh, amorphous aggregates (e.g., from 
insulin. Fig. 2) are more sensitive to pressure and prone 
to refolding to the native state than mature amyloid 
fibrils. In the latter case, the effectiveness of pressure- 
induced dissociation depends on the particular mode of 
polypeptide backbone and side chain packing that 
allows reducing remaining void volumes. The pres¬ 
sure-sensitivity of fresh aggregates and the insensitiv¬ 
ity of most mature fibrils to high pressure allow one not 
only to differentiate between various stages of amy¬ 
loid-formation, but also to obtain reliable thermody¬ 
namic data on the transformation process. 
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Pressure Effects on 
Proteins, Fig. 3 Schematic 
(free) energy landscape for 
protein folding and 
aggregation/amyloid 
formation. While the protein 
attains the native 
conformation at its global 
energy minimum under 
normal physiological 
conditions, the amyloidogenic 
stacking of many protein 
molecules may lead to lower 
energies beyond this level in 
the aggregation funnel. At low 
temperature and high pressure, 
dissociation of less densely 
packed protein aggregates and 
fibrils is possible (Mishra and 
Winter 2008) 



intramolecular contacts 
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amyloid fibrils 


intermolecular contacts 


Some amyloid fibrils (in particular protofibrils) 
exhibit a larger partial specific volume than that of 
the composing proteins, because the assembly process, 
which involves various conformational transitions of 
native structures to nonnative (f-sheets, could create 
new water-excluded cavities and induce hydrophobic 
pockets. In these cases, HHP has the potential to dis¬ 
sociate also amyloid fibrils. For example, it has been 
shown that moderate pressures (1-3 kbar) can dissoci¬ 
ate amyloid fibrils of transthyretin, P 2 -microglobulin, 
and a-synuclein. One of the most thrilling prospects 
for application of high pressure in protein aggregation 
research is the idea of destroying prion infectivity 
through pressure treatment (Silva et al. 2004; 
Meersman et al. 2006; Mishra and Winter 2008, and 
refs, therein). 

Figure 3 shows a schematic (free) energy landscape 
for protein folding and aggregation phenomena. At the 
high-energy, high-entropy surface, a multitude of 
unfolded conformations are present. “Furmeling” on a 
rugged energy surface or “landscape” occurs toward the 
natively folded state via rapid intramolecular contact 
formation. By a - generally slow - nucleation process of 
partially unfolded structures (induced by mutations. 


particular cosolvents, membranes, etc.), formation of 
aggregates and amyloid fibrils via intermolecular con¬ 
tacts may occur, and the system enters the aggregation 
funnel. Within the aggregation funnel, loosely packed 
oligomeric or amorphous aggregates as well as - gen¬ 
erally several - amyloid fibrillar states (strains) with 
different packing properties and with deep energy min¬ 
ima may form, hence exhibiting conformational poly¬ 
morphism. Pressure-dependent studies have helped 
disentangle different aggregation pathways and have 
revealed the polymorphic nature of amyloid fibrils. 
Dissociation of protein aggregates and, under favorable 
conditions, also amyloid fibrils, is possible upon appli¬ 
cation of HHP as well. 

Concluding Remarks 

Despite significant progress in our understanding of 
protein folding and stability, important questions 
concerning the structural basis of folding cooperativity 
and the structural, thermodynamic, and dynamic ori¬ 
gins of barriers between conformational (sub)states 
remain unanswered. This has hindered our 
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understanding of the structural basis of important bio¬ 
chemical processes such as catalysis, energy transduc¬ 
tion, and signaling processes. Application of 
hydrostatic pressure constitutes a unique and gentle 
thermodynamic perturbation to proteins, as high- 
pressure selectively stabilizes conformations with 
smaller specific volumes, and subtle conformational 
fluctuations that go unnoticed at ambient pressure can 
be enhanced with high pressure to levels more easily 
detectable by spectroscopic means (Kapoor et al. 
2012). Moreover, pressure studies are also important 
for the development of novel technological applica¬ 
tions, such as in the areas of high-pressure food 
processing and pressure separation of protein com¬ 
plexes and aggregates (Crisman and Randolph 2009; 
Silva et al. 2004). 
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Prostaglandin Synthase 

► Cyclooxygenase - Computational Studies 


Protein Absorbance Spectroscopy 

► UV Absorbance Spectroscopy of Biological 
Macromolecules 
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Synonyms 

Peptide micro-array 

Definition 

The term “Peptide array” refers to a high-throughput 
screening technique used to rapidly and simulta¬ 
neously assess a large number of interactions between 
peptides or proteins and their binding partners. The 
method is commonly used for identifying binding 
sites within proteins. 

An array is a systematic arrangement of a large 
number of compounds, in rows and columns, on 
a solid support. Each compound located on the array 
serves as an individual reaction unit. Commonly 
used arrays range from arrays of small organic com¬ 
pounds to DNA, proteins, and peptide arrays. For 
protein arrays it is most common to use full 
length proteins, isolated protein domains, or peptides. 
The proteins/peptides are being attached to 
a solid support, and then incubated with the potential 
binding partner (e.g., protein or ligand). Excess 
and nonbinding ligands are removed by washing, 
and finally the binding complexes are detected by 
labeled probe methods such as fluorescence or 
chemiluminescence. 


Basic Characteristics 

The concept of parallel synthesis of multiple compo¬ 
nents on a solid support was introduced in the early 
1980s by both Ronald Frank and Mario Geysen (Frank 
et al. 1983; Geysen et al. 1984). Initially, the array 
technology was used mainly in genomics. RNA and 
DNA arrays are currently standard research tools for 
gene analysis (reviewed in Davison et al. 2006; Bier 
et al. 2008). Unlike nucleotides or small molecules, 
proteins are difficult to produce, more biochemically 
diverse, and their functionalities are extremely depen¬ 
dent on correct folding. However, the study of cellular 
pathways absolutely requires working with proteins 
since proteins are the actual mediators of most cellular 
processes. During the last decade, scientists have tried 
to overcome the technical challenges associated with 
these molecules and have generated several types of 
protein arrays (Wolf-Yadlin et al. 2009) (Fig. 1): 
(1) Protein arrays - proteins immobilized on 
a membrane support and screened for binding 
a partner such as another protein, domain, peptide, or 
antibody molecule. (2) Peptide arrays - peptides 
immobilized on a membrane support, then screened 
for binding another protein, domain, peptide, or anti¬ 
body molecule. (3) Antibody arrays - antibodies 
immobilized on a membrane support and incubated 
with candidate binding proteins. (4) Lysate micro¬ 
arrays - the entire lysate of a cell is immobilized, and 
its interactions detected using antibodies. This method 
is also known as the “reverse phase protein array” 
(Paweletz et al. 2001). Antibody arrays and lysate 
arrays are mostly used to detect proteins in biological 
samples, whereas peptide and protein arrays are gen¬ 
erally used for mapping protein-protein interaction 
sites and to identify binding motifs. This entry will 
focus on using the arrays for mapping binding sites or 
active regions that mediate interactions of proteins. 

Scientists have tried to overcome the relative com¬ 
plexity of the protein world by adopting the linear 
thinking of the DNA/RNA world and the simplicity 
of chemical synthesis from the small molecule world. 
This led to the evolvement of peptide arrays to explore 
protein-protein interactions (Katz et al. 2011). 
Although peptides are linear by nature they can serve 
as good models for binding studies of protein domains, 
because they often undergo an unfolded to folded 
transition upon binding to gain their native structure. 
Protein arrays are intended to reveal whether an 
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Incubation with a potential 
interactor (protein, peptide, 
antigen, serum antibody) 




Primary and 
Secondary antibodies 



Y 


Screening results 



Protein and Peptide Arrays, Fig. 1 A general scheme of 
peptide array screening, (a) Four common types of protein/ 
peptide arrays are shown. The optional capture agents are pro¬ 
teins, peptides, antibodies, or cell lysate. All agents are cova¬ 
lently immobilized to a solid support, (b) The basic screening 
assay includes incubation of the binding protein with a potential 
interactor (protein, peptide, antigen, or serum antibody), 
followed by a few washing steps to discard the excess of 
unbound interactors, (c) Detection of the binding interactors is 
performed mostly by labeled probe methods. It is possible to use 
either a primary antibody against the partner protein itself com¬ 
bined with a secondary labeled antibody, or a specific antibody 


against a fused tag such as His-, Biotin-, GST, etc. For antibody 
micro-arrays sandwich detection technique can be used. The 
immobilized agent antibody binds an antigen interactor that 
can be recognized by another labeled secondary antibody. To 
avoid unspecific binding and cross reactions, direct labeling of 
the interactor can be used. This can be achieved by using 
a luorescent dye, radioactive isotopes, or by adding 
a detectable enzyme such as peroxidase or phosphatase, (d) 
The screening results appear as dark spots that represent the 
binding of the interactor to a specific capture agent. The different 
intensities of the dark spots are in correlation to the diverse 
binding affinities 
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interaction between proteins exists, whereas peptide 
arrays enable researchers to “zoom in” and narrow 
down a potentially huge interaction interface to 
a specific binding site. 

Peptide arrays have several advantages in protein- 
protein interactions research: (1) It is usually easier to 
chemically synthesize peptides than to make recombi¬ 
nant proteins. (2) Peptide synthesis allows the incor¬ 
poration of non-natural and/or modified amino acids 
during synthesis, enabling, for example, studies of 
post-translational modifications. (3) Probes or linker 
groups such as biotin or fluorescent tags can be easily 
introduced into peptides at any required position and 
be used for protein immobilization or detection. 
(4) Peptides resulting from the screening can act as 
inhibitors or activators of the target proteins, and thus 
serve for diagnostic or therapeutic purposes. (5) The 
resulting binding peptides can be used in modeling 
protein-protein interactions (Katz et al. 2008). 

Currently, there are two common ways to prepare 
peptide arrays: (1) Synthesizing functionalized pep¬ 
tides in advance and then covalently attaching them 
to the support. Peptide immobilization can be 
performed using several methods (Falsey et al. 2001; 
Wegner et al. 2002; Takahashi et al. 2003; Kohn 2009) 
and (2) synthesizing the peptides directly on the mem¬ 
brane support, usually by SPOT synthesis. 

The first method is expensive, time consuming, and 
requires peptide purification in advance. Yet, the puri¬ 
fication of the peptide reduces binding artifacts that 
may result from unsuccessful and/or low yielding syn¬ 
theses. This method is more suitable for arrays with 
a relatively small number of peptides or when several 
copies of the same array are required. The second 
method is much more robust and also very economical 
in terms of the cost of reagents and solvents. It is more 
suited for rapid examination of many sequences, which 
is currently the most common use for peptide arrays. 

The SPOT synthesis technique is frequently used 
for sequentially synthesizing peptides directly on the 
support using Fmoc solid phase synthesis (Fields and 
Noble 1990). The basic principle of the SPOT synthe¬ 
sis is using the circle that forms when a droplet is 
dispensed on a planar surface as a reaction vessel. 
The circle of the droplet creates limited boundaries 
that can be addressed individually by manual or auto¬ 
mated delivery of the corresponding reagents. Multiple 
separate spots can be arranged to form an organized 
array. The reduction in solvent volume is enormous. 


because the spots formed by the droplets are small by 
nature. Most of the synthetic steps are similar for all the 
peptides in the array and thus can be performed simul¬ 
taneously by rinsing the whole solid support. The most 
obvious drawback of this process is that peptides do 
not undergo purification. For more details on SPOT 
synthesis technique, see reviews in refs. (Frank 2002; 
Hilpert et al. 2007; Volkmer 2009). 

Currently, most of the synthesis can be carried out 
automatically using peptide synthesizers and spotting 
robots. This leads to very small spot sizes and thus 
a smaller dimension for the entire array, along with 
a higher concentration of peptide per spot. These min¬ 
imized arrays are named micro-arrays, and are cur¬ 
rently widely used due to the improved results they 
give over larger arrays. 

Peptide Arrays Design 

Currently, it is very easy and relatively cheap to order 
the peptide arrays from a commercial company, and 
the important task for the user is the design of the 
required peptide array. 

There are two main approaches for designing pep¬ 
tide arrays (Katz et al. 2011): rational design and 
combinatorial design. Combinatorial design is based 
on the synthesis of a large number of peptides, usually 
derived from one specific motif (which is suspected to 
be the interactor), with small systematic modifications 
in the parent sequence. These arrays are screened for 
binding the target protein in order to find the optimal 
interaction sequence (Liu et al. 2003; Cardo-Vila et al. 
2010). In rational design, the array includes peptides 
derived from sequences in partner proteins that are 
known to bind the protein of interest, or are derived 
from other sequences of proteins that are homologous 
or part of the same biological pathway. This approach 
is applied both for identifying the precise binding sites 
in the partner proteins, and for identifying peptides that 
can serve as ligand-mimics for therapeutic applica¬ 
tions. In array design, the following rules should be 
followed: (1) The optimal length of the peptides should 
not exceed 15 residues. This is because the purity and 
identity of each individual peptide is not tested. The 
typical purity of an average 6-15 residue peptide is 
>70%, so it is generally reliable to use such arrays. In 
contrast, the purity of longer peptides cannot be 
guaranteed. (2) The designed peptides should be partly 
overlapping. When binding of the protein to partly 
overlapping peptides is observed, it strongly indicates 
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a specific binding region rather than one random bind¬ 
ing sequence. Moreover, it can help minimize the 
binding site to the shortest binding sequence common 
to all the observed overlapping peptides (Fig. 1). (3) It 
is important to take into account the charge and the 
hydrophobicity of the peptides. Hydrophobic peptides 
tend to make nonspecific interactions, and this is one of 
the main reasons for false positive results in peptide 
array screening. Cys, Met, and Trp residues can also 
cause problems with synthesis because they are sus¬ 
ceptible to oxidation and side reactions. If possible, it 
is advisable to choose sequences that contain 
a minimal number of these residues. (4) When 
performing rational design of peptides derived from 
the sequence of a known protein, it is highly 
recommended checking whether a three-dimensional 
structure is available. If so, the secondary structure of 
the protein should be considered during the design. For 
example, entire helices or beta strands should be used, 
and where possible the 2-3 flanking residues from each 
side included, to maintain the native secondary struc¬ 
ture of the peptide. (5) The N-terminus of the peptides 
should be acetylated to mimic the uncharged physio¬ 
logical state of the peptide in the context of the whole 
protein. Acetylated peptides are also more stable 
toward degradation. (6) Always include in the array 
a few control peptides of known binding affinity to the 
protein of interest to give a comparative basis for the 
binding affinity of unknown peptides. 

The Binding Assay 

Screening peptide arrays is technically quite simple, 
resulting in a large data set in a relatively short time. 
The basic protocol for peptide array screening includes 
several steps (Fig. 1): Incubation of the binding protein 
with the peptide array, followed by a few washing 
steps to remove the excess of unbound protein and 
finally detection of the protein-bound peptides (Katz 
et al. 201 1). Small modifications are usually performed 
in each step to optimize the assay for each particular 
biological system. In the case of micro-array slides, 
usually all assay steps are performed directly on the 
peptide array, whereas in the case of cellulose sheets 
(macro-arrays) it is very common to add another step 
after the incubation - electrostatic transfer of the bind¬ 
ing protein to a PVDF membrane. The PVDF mem¬ 
brane is replaced several times within one experiment 
to improve the signal to noise ratio. This process 
allows the reuse of the same array several times. 


More detailed information about peptide array transfer 
can be found at Rudiger et al. (1997) and Rotem et al. 
(2008). The incubation of the binding partner with the 
peptide array is the most important step for the success 
of the experiment. The main parameters that can be 
modified when optimizing the assay are: (1) choosing 
an optimal blocking agent (e.g., BSA, skimmed milk, 
sucrose etc.) and concentration of that blocker to pre¬ 
vent nonspecific binding (Volkmer 2009); (2) changing 
the ionic strength of the incubation solution depending 
on the nature of interaction. For example, if the inter¬ 
action has an electrostatic nature, lower ionic strength 
buffers will result in tighter binding, enabling the 
detection of weaker interactions; (3) adding reducing 
agents such as DTT or PME. The reducing conditions 
are essential in cases when either the formation of 
disulfide bonds between the protein and the peptides 
on the array may lead to false results, or when 
a reducing environment is essential for stability of the 
protein. 

The last step of the assay is detection of the peptide- 
protein binding. Detection is performed mostly by 
labeled probe methods such as fluorescence, chemilu¬ 
minescence, electrochemiluminescence, and radioac¬ 
tivity detection. It is possible to use either a primary 
antibody against the partner protein itself combined 
with a secondary labeled antibody, or a specific anti¬ 
body against a fused tag such as Hisg-, Biotin-, GST, 
etc. (Fig. 1). Antibodies against the tags are usually 
HRP conjugated. This is preferable for avoiding the 
need for a second incubation and more washing steps. 
The clear disadvantage of the antibody-based detection 
systems is that multiple steps are involved, and the 
possibility of nonspecific binding and/or cross¬ 
reactions of the peptide with the antibodies. One of 
the ways to overcome these problems is using 
a directly labeled partner protein. Direct labeling of 
the protein can be achieved by using a fluorescent 
dye, radioactive isotopes, or by addition of detectable 
enzyme such as peroxidase or phosphatase. The main 
drawback of the direct labeling method is that the 
detection relies on the assumption that the protein is 
uniformly labeled. This is difficult to ensure, due to the 
complexity and lack of specificity of labeling process. 

The Next Step - What to Do with the Screening 
Results 

The spot patterns obtained from peptide array screen¬ 
ing can be documented and quantitatively evaluated 
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using standard modem image analysis readers such as 
the ones commonly used in 2D analysis of gel electro¬ 
phoresis and blotting. It is recommended to evaluate 
the intensity of the observed peptides in each array 
only relative to the other peptides in the same array, 
and not to compare it with the results of other arrays 
screening or even with other copies of the same array. 
The method is only semi-quantitative, and the results 
must be validated by synthesizing the binding peptides 
and testing the interaction in solution using quantita¬ 
tive methods such as fluorescence anisotropy or 
SPR (Katz et al. 2008; Rotem et al. 2008). Such exper¬ 
iments allow the binding affinity to be determined. It is 
especially recommended to synthesize and purify the 
binding peptides that had high signal intensity on the 
array, and then measure their binding affinity. In sum¬ 
mary, peptide array screening is a good tool for rapid 
screening of many sequences, but it must be accompa¬ 
nied with other quantitative biophysical methods to 
validate binding and measure the precise binding 
affinities. 

Cross-References 

► Mass Spectrometry: Mapping Large Stable Protein 
Complexes 

► Protein Complexes 

► Protein Design for Biosensors 

► Protein Structure Comparison Methods 

► Protein Structure Prediction 
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Protein Circular Dichroism Analysis 

Lee Whitmore 

Birkbeck College, University of London, London, UK 

Synonyms 

Deconvolution; Secondary structure analyses 

Definition 

The far-ultraviolet circular dichroism spectra of proteins 
contain information about the electrostatic and magnetic 
environment of the chromophores and may be analyzed 
to give a close approximation of the content of second¬ 
ary structure types. Different protein secondary structure 
types produce characteristic spectral shapes and, due to 
the additive nature of the spectra, the shapes contribute 
to the total spectrum in the proportions that their sec¬ 
ondary structures are found within the sample. Typi¬ 
cally, radiation of wavelengths 240-190 nm and 
shorter are used for this type of analysis. More electronic 
transitions are included by collecting data at shorter 
wavelengths, so more information is available. 

Basic Characteristics 

Analysis Methods 

Many methods have been employed to analyze CD 
spectra with varying degrees of success. Although 
some early methods were fairly basic and are no longer 
used, there are presently a number of methods that 
produce results of similar accuracy. There is no one 
method that gives the best analysis results for all pro¬ 
teins in all situations. The methodologies used fall 
broadly into four types: 

1. Obtaining a “signature” spectrum for each kind of 
secondary structure to be analyzed and using 
a least-squares approach to determine the closeness 
of the experimental spectrum to combinations of 
fractions of the signatures 

2. Performing a variation of principal component 
analysis upon the reference data and using combi¬ 
nations of the resulting eigenvalues instead of sig¬ 
nature spectra to determine the closeness 


3. Training a neural network, support vector machine, 
or other supervised learning technique to identify 
the spectral shapes and magnitudes in order to inter¬ 
pret the experimental data 

4. Using a first-principles approach to pre-calculate an 
electronic transitions parameter set using quantum 
mechanical methods, then applying them via 
a matrix method to recreate the experimental data 

Reference Data 

Aside from first-principles methods, all methodologies 
are highly dependent upon having accurate and appro¬ 
priate reference data. The accuracy of the data requires 
high quality in both the recorded CD spectrum of the 
reference protein and in the determination of its atomic 
structure from, almost universally. X-ray crystallogra¬ 
phy. The appropriateness of the reference data refers to 
how alike it is to the experimental spectrum, in that it 
will never be possible to analyze for high helical con¬ 
tent if none of the reference spectra have high helical 
content. Similarly, it has been shown that reference 
data from soluble proteins are less effective in analyz¬ 
ing the spectra of membrane proteins. As modem anal¬ 
ysis software now does a very good job, the most 
signihcant variable in obtaining a “correct” result is 
the selection of quality reference data in terms of both 
accuracy and appropriateness. 

To provide a “general case” analysis tool, it is 
common to constmct a reference data set consisting 
of a selection of different distributions of protein sec¬ 
ondary structure types with which to either train 
a neural network or input as comparative data. The 
coverage of varying secondary structure types is 
more important than the absolute number of spectra 
within the reference set. 

Software and Suites 

Several stand-alone programs and packages have been 
produced to facilitate protein circular dichroism anal¬ 
ysis (for a comprehensive list, see Whitmore and Wal¬ 
lace 2009). Among the most well-known software are 
Selcon 3, Contin-LL, CDSSTR, VARSLC, K2d, 
K2d2, SOMCD, CDNN, and DichroCalc, the first 
three of which comprise the downloadable suite 
CDPro (Sreerama and Woody 2000) and the first five 
of which are available through the online interface 
DichroWeb (Whitmore and Wallace 2008). K2d2 and 
SOMCD are different modifications of K2d and 
together with CDNN are all neural network methods. 
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DichroCalc (Bulheller and Hirst) is currently the only 

method that uses a first-principles methodology. 

Cross-References 

► Circular Dichroism and Chirality 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► DNA-Ligand Circular Dichroism 

► Far UV Protein Circular Dichroism 

► Near UV Protein CD 

► Oriented Circular Dichroism Spectroscopy 
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Synonyms 

PCDDB 


Definition 

The Protein Circular Dichroism Data Bank (PCDDB) 
is a repository for Circular Dichroism (CD) and 
Synchrotron Radiation Circular Dichroism (SRCD) 
spectroscopic data together with associated metadata. 


Basic Characteristics 
A Background to the PCDDB 

Many experimental techniques that are used in today’s 
research generate an enormous quantity of results in 
the form of data. After their initial use in analyses, 
these data then have to be stored, often for long 
periods, but in such a way that they are always readily 
retrievable for any further analyses that might be 
performed on them. Individual research laboratories 
worldwide retain stored data from their experiments, 
often for many years. One problem with having data 
stored for a long period is the potential obsolescence of 
the storage medium itself during that time. Each new 
revolutionary form of storing data, from magnetic 
tapes, to floppy disks, to cassettes, to compact disks, 
supersedes the previous form. Such changes very 
swiftly consign to the rubbish the devices that enabled 
the stored data to be retrieved and vast amounts of data 
have actually been lost in this way. 

One way to circumvent this problem is to create 
central online repositories for data. A number of 
science funding bodies worldwide (e.g., the US 
Department of Energy, US National Institutes of 
Health, US National Science Foundation, the Research 
Councils of the UK, Commission of the European 
Communities, and the Australian Prime Minister’s 
Science Engineering and Innovation Council 
(PMSEIC) Working Group for Data Sharing) have 
deemed it a requirement that data collected in projects 
that they have supported should be made freely acces¬ 
sible to all. The Protein Circular Dichroism Data Bank 
is an example of a central online repository for data. It 
has been created to store CD and SRCD spectroscopic 
data and associated metadata (such as the experimental 
conditions, the machine parameters used, etc.) and 
make these publicly accessible. 

The Format of the PCDDB 

The PCDDB has been created as a repository of spec¬ 
troscopic data much in the same manner as the long- 
established Protein Data Bank (PDB) that was created 
to house the three-dimensional structure coordinates of 
proteins and their associated crystallographic, NMR, 
or cryo-EM data. 

The PCDDB is available as a searchable Data Bank 
of CD and SRCD spectra. It contains all the raw and 
processed spectral data together with details of 
the spectral parameters used to calibrate the 
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spectrophotometer or synchrotron beamline used to 
obtain the data. This is to ensure that good practice 
has been maintained when collecting the spectral data 
prior to their deposition into the PCDDB. In addition, 
there are associated tools and protocols for spectral 
matching, analysis, and back-calculations available as 
part of the overall resource. All spectral data included in 
the PCDDB have undergone checking of the quality of 
those data. This checking is via validation software 
available online and is used to verify the quality of the 
data deposited, in order to maintain a high standard of 
the data incorporated into the resource. This form of 
checking procedure mirrors the aims of that used by the 
PDB where packages such as PROCHECK, 
MolProbity, and WHAT_CHECK are routinely used. 
All entries receive a set of validation “flags” associated 
with particular data parameters, each of which must be 
within acceptable limits to gain full entry status. Entries 
in which a parameter falls short of these limits are 
retained within the PCDDB, in a fashion similar to 
that of the PDB. They can also be accessed, however, 
not as part of the main searchable entry set. Such entries 
may often contain an added comment from the depos¬ 
itor explaining the failure in the validation procedure. 

All entries in the PCDDB have links to other Data 
Banks with related material for that entry. These include 
links to the: Research Collaboratory for Structural Bio¬ 
informatics (RCSB) PDB (RSCBpdb); the PDB Japan 
(PDBj); the PDB Europe (PDBe); and the sequence 
Data Bank (UniProt) together with a relevant “medline” 
reference for the entry. In addition, should the protein be 
an enzyme, there is a further link to its particular 
Enzyme Commission (E.C.) entry. Where it is relevant, 
there is also a CATH (Class, Architecture, Topology, 
Homology) classification entry for the protein. 

At present the focus of the PCDDB is to house data 
from proteins, but again, in parallel with how the PDB 
developed, future plans are to incorporate CD and 
SRCD spectra from other macromolecules, such as 
DNA, RNA, and polysaccharides. 

The PCDDB Release 

The first release of the Protein Circular Dichroism Data 
Bank was made publicly available in December 2009. 
The first entries were of the 71 proteins that comprise 
the SP175 reference database (Lees et al. 2006). In the 
current release, users can download and access these 
data files. In the full beta version released late 2010 
(Whitmore et al. 2011), users can deposit their own 


spectra, which will be validated before being made 
openly available online. 

Accessibility 

The PCDDB can be accessed through the website 
http://pcddb.cryst.bbk.ac.uk. 
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1996). In the far ultraviolet, between 180 and 230 nm, 
bands in the circular dichroism spectra of proteins arise 
predominantly from the electronic excitations of the 
peptide chromophore, in particular the n — 7t and the 
71 — 7t transitions. An isolated peptide chromophore 
would not exhibit a circular dichroism spectrum. The 
circular dichroism of a protein arises from the asym¬ 
metric arrangement of the polypeptide chain and the 
coupling (or mixing) of electronic transitions in the 
chiral macromolecular environment. 

The n — 7t transition is an excitation of an electron 
from a lone pair orbital on the amide oxygen atom into 
the antibonding n molecular orbital on the amide 
group. The 7t — 7i transition is an electronic transition 
from the nonbonding n, molecular orbital into the 7i 
orbital. The n — tt transition occurs at about 222 nm 
and the 7t — 7t* transition at about 190 nm. For helical 
proteins, one observes a negative band in the circular 
dichroism spectrum around 208 nm. This band arises 
from coupling of the n — n transitions, which have 
relatively large electric transition dipole moments. 

Coupling and mixing of transitions influence not 
only the location of bands in the circular dichroism 
spectra, but also their intensity. The area under a band 
is directly proportional to a quantity known as the 
rotational strength, which is analogous to the oscillator 
strength in “conventional” electronic absorption 
spectroscopy. The rotational strength, R, is 
proportional to the product of the electric and magnetic 
transition dipole moments, p, and m, respectively. This 
relationship, in standard quantum chemical notation, is 
embodied in the Rosenfeld equation: 


Definition 

Circular dichroism is one of several phenomena 
collectively known as optical activity. Circular 
dichroism is the differential absorption of left-handed 
and right-handed circularly polarized light by a chiral 
substance. A chiral molecule is one whose mirror 
images cannot be superposed. 

Basic Characteristics 

Proteins are chiral (asymmetric) molecules and, thus, 
exhibit circular dichroism. The chirality arises, in part, 
from the helical elements of the structure. About 
^35% of amino acid residues in globular proteins 
occur in ot-helical conformations. Such conformations 
have distinctive backbone dihedral angles and are 
characterized by hydrogen bonding between residues 
located at a position ; and i + 4 along the polypeptide 
chain. About ^25% of residues adopt a P-strand 
conformation, which is a much more extended struc¬ 
ture, again with characteristic backbone dihedral angles 
and patterns of hydrogen bonding. P-strands actually 
are also helical in nature; however, the helical radius is 
much narrower. Hence, the majority of residues in pro¬ 
teins are in ordered, local (so-called secondary) struc¬ 
tures with a chirality at a macromolecular level, in 
addition to the local chirality associated with the asym¬ 
metric arrangement of substituents around individual a- 
carbon atoms. It was recognized many years ago 
(Moffltt 1956) that a helical arrangement of oscillators 
(which can serve as an approximate model of electronic 
excitations) leads to a CD signal. 

The underlying principles and theory of circular 
dichroism are well established (Schellman 1975; 
Rodger and Norden 1997). In the ultraviolet, one can 
detect transitions corresponding to electronic 
excitations. In the infrared, transitions corresponding 
to vibrations can be measured. The latter gives rise to 
vibrational circular dichroism (VCD) in the infrared. 
VCD spectra are rich in information, but the 
instrumentation has only been developed and 
commercialized relatively recently. Most day-to-day 
measurements of the circular dichroism of proteins are 
made in the ultraviolet and it is the theory of this form 
of circular dichroism that is described here. Many 
fuller and more technical accounts are given elsewhere 
(Bulheller et al. 2007; Berova et al. 2000; Fasman 


Roa = Im((i/7o|7?|i/7^) • 

In this equation, the subscripts 0 and A refer to the 
ground and electronic excited states; Im indicates the 
imaginary part of the quantity in parentheses, and the 
wavefunctions of the different states are denoted by ijj. 
In an achiral environment, one or other of the transition 
dipole moments would be zero, due to symmetry consid¬ 
erations. For example, in the simplest amide, formamide, 
the 71 — 71* transition is electric dipole allowed, but has 
magnetic transition dipole moment of zero; the converse 
is the case for the n — 7i transition. However, in a chiral 
molecule, electronic transitions with nonzero electric and 
magnetic transition dipole moments can occur. 

The interactions of electronic excitations can be 
modeled using exciton theory (Davydov 1962) and. 



p 


1964 


Protein Circular Dichroism: Theoretical Aspects 


in the context of protein circular dichroism, this is 
often referred to as the matrix method (Tinoco 1962; 
Besley and Hirst 1999; Berova et al. 2000; Fasman 
1996). The interactions can be calculated within 
a dipole-dipole approximation or more sophisticated 
representations of the transition charge densities can 
be used. The geometry of the protein chain controls 
the separation and relative orientation between peptide 
chromophores, which is, in turn, reflected in the sign and 
magnitude of the individual interactions. This approach 
leads to the constraction of protein-level wave functions 
as linear combinations of individual chromophore exci¬ 
tations. From these protein-level wave functions, one 
can compute the electric and magnetic transition dipole 
moments, and, thus, the rotational strengths associated 
with each electronic transition. 

Groups other than the peptide chromophore may 
also influence the circular dichroism spectrum of 
a protein. Aromatic moieties in the side chains of 
tryptophan, phenylalanine, histidine, and tyrosine 
have electronic transitions in the far and near 
ultraviolet. In the former region, the regular arrange¬ 
ments of peptide chromophores (due to the secondary 
structure) tend to dominate the signal, although 
sometimes there may be side-chain contributions, on 
the order of ^10%, to the intensity of bands. In the near 
ultraviolet, there are no peptide excitations and the 
bands in this region reflect interactions among 
aromatic groups, that may be packed together; bands 
in the near ultraviolet are about 100 times less intense 
than those in the far ultraviolet. 

The theory of protein circular dichroism, as outlined 
above, is rooted in a consideration of electronic 
transitions that are localized on individual 
chromophoric groups. In effect, these excitations are 
modeled using quantum mechanics, but interactions 
among the excitations are treated classically. Recently, 
the advent of synchrotron radiation circular dichroism 
experiments (Wallace and Janes 2009) has revived 
interest in higher energy transitions, occurring below 
180 nm, in the deep ultraviolet. Bands in this region, 
like their counterparts in the far ultraviolet, are 
dependent on the secondary structure of the protein, 
and provide valuable additional information. They 
arise from charge-transfer transitions, in which an 
electron is excited from one peptide chromophore 
into a 71* orbital located on an adjacent peptide group. 
To describe this theoretically requires the formalism 
discussed above to be extended, such that dipeptide 


entities are considered at a fully quantum mechanical 
level (Bulheller et al. 2008). 

Most researchers measuring the circular dichroism 
of proteins do so with the pragmatic aim of assessing 
the structural integrity of the protein under study and 
perhaps estimating its secondary structure content. 
Occasionally, however, it may be of interest to 
compute the circular dichroism of a given (perhaps 
hypothetical or model) protein stmcture or an 
ensemble of protein structures using the first 
principles’ approach summarized above. It is now 
possible to access an implementation of these 
theoretical methods through a web-interface - http:// 
comp.chem.nottingham.ac.uk/dichrocalc - where one 
simply uploads a protein structure in the standard 
protein data bank format and the results of the 
requested calculation are sent back via e-mail. 
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Definition 

Defining Protein-Protein Interactions 

Protein-protein interactions can be classed according 
to different physical properties. Although different 
classification systems could be used, the following 
definitions appear to be the most widely adopted in 
the literature (Jones 201 1). 

Homooligomers versus heterooligomers: 
Homooligomers comprise complexes formed by iden¬ 
tical protein chains. Isologous homoligomers involve 
contacts through the same surface on each protomer or 
subunit, whereas heterologous homoligomers make 
involve different interfaces on the different protomers. 
Heterooligomers are complexes formed between non¬ 
identical protein chains. 

Non-ohligate versus obligate complexes (Jones and 
Thornton 1996): The protomers that form non-obligate 
complexes can exist as independent structures. 


Obligate complexes comprise protomers that are not 
found as stable independent structures, and are usually 
functionally as well as structurally obligate. These 
obligate complexes are sometimes referred to as 
obligomers (Ofran and Rost 2003). 

Transient versus permanent complexes (Nooren and 
Thornton 2003): These properties relate to the lifetime of 
the complex. A permanent complex is usually very stable 
and the subunits exist only in complex. A transient com¬ 
plex has a short lifetime. Transient complexes are further 
broken down into weak transient interactions that undergo 
a dynamic equilibrium in solution, and strong transient 
interactions that require a molecular trigger such as post- 
translational modification to promote or reduce binding. 

It should be noted that non-obligate/obligate and 
transient/permanent complexes exist in a continuum 
in vivo, and the lifetime of a complex may vary 
according to different conditions. 
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Definition 

Although proteins have different percentages of differ¬ 
ent amino acids, each residue has an amide bond 
linking it to the next residue in the chain. A number 
of concentration determination methods have been 
developed that involve derivatizing the amides with 
a dye and spectroscopically determining the concen¬ 
tration of the derivatives (Noble and Bailey 2009). The 
three methods mentioned below all rely on a standard 
of known concentration to enable a calibration curve to 
be plotted. The calibration is not necessarily linear and 
ideally is the same protein of known purity since the 
dye binding efficiency varies from protein to protein. 
Other reagents in the sample such as buffer salts and 
detergents may also interfere with either the chemistry 
or the spectroscopy of the dyes. 

Basic Characteristics 

The methods for three commonly used assays are 
outlined below. 

Biuret Method 

The biuret method is simple and reasonably specific as 
it depends on the reaction of copper(II) with four 
N atoms in the peptide bonds of proteins. Its concen¬ 
tration range is from 0.5 to 2.5 mg protein per assay. 
Compounds containing peptide bonds give 
a characteristic purple color when treated in alkaline 
solution with copper sulfate. This is termed the 
“biuret” reaction because it is also given by the sub¬ 
stance biuret NH 2 CONHCONH 2 . For a wide variety of 
proteins, 1.0 mg of protein in 2 cm^ of solution results 
in an absorbance at 540 nm of 0.100. Proteins such as 
hemoproteins, however, give spurious results due to 
their intrinsic absorption at 540 nm. The protein con¬ 
tent of cell fractions such as nuclei and microsomes 
can be estimated by this method after solubilization by 
detergents such as deoxycholate or sodium dodecyl 
sulfate. 

The biuret reagent may be made by placing 
CUSO 4 . 5 H 2 O (1.5 g) and sodium potassium tartrate. 4 H 20 
( 6.0 g) into a dry 1 dm^ volumetric flask and adding 
about 500 cm^ of water. With constant swirling, 
NaOH solution (300 cm^, 10% w/v) is added and 


the solution made to volume (1 dm^). The reagent 
prepared in this manner is deep blue in color. It may 
be stored indefinitely if KI (1 g) is also added and 
the reagent is kept in a plastic container. The protein 
solution {x cm^, where jr < 1.5 cm^) is then mixed 
with water [(1.5 — x) cm^] to make it into a total 
volume of 1.5 cm^; 1.5 cm^ of biuret reagent is 
added. The purple color is developed by incubating 
for 15 min at 37°C. The tubes must then be cooled 
rapidly to room temperature and the absorbance at 
540 nm determined. The color of the solutions is 
stable for hours. 

Folin-Ciocalteau or Lowry Method 

The Lowry method is an enhancement of the biuret 
method that is one to two orders of magnitude more 
sensitive, being able to measure protein concentrations 
in the range of 5-150 pg. The main disadvantage of the 
Lowry method is that many substances interfere with it: 
ammonium sulfate, thiol reagents, sucrose, EDTA, 
Tris, and Triton X-100. 

The final color in the Lowry method is a result of two 
reactions. The first is a small contribution from 
the biuret reaction of protein with copper ion in basic 
solution. The second results from peptide-bound copper 
ions, facilitating the reduction of the phosphomolybdic- 
tungstic acid (the Folin reagent) which gives rise to 
a number of reduced species with a characteristic blue 
color. The amino acid residues which are involved in the 
reaction are tryptophan and tyrosine as well as cysteine, 
cystine, and histidine. The amount of color produced 
thus varies with amino acid content. 

Two solutions are required for the Lowry method. 
For the alkaline copper solution, mix Na 2 C 03 (50 cm^, 
2% w/v) in NaOH (0.1 M) with CUSO 4 . 5 H 2 O (1 cm^ 
0.5% w/v) and sodium potassium tartrate (1 cm^, 
1% w/v). This solution is only stable for 1 day. The 
Folin reagent (phosphomolybdic-tungstic acid) may be 
made by diluting the concentrated Folin reagent 
obtained from a commercial source (e.g.. Sigma) 
to be 1 M H"^. Then add x cm^ of sample (where 
X < 0.6 cm^) (volume determined to contain 
5-100 pg of protein) to (0.6 — x) cm^ of water and 
3 cm^ of the alkaline copper solution. After mixing and 
standing for 10 min at room temperature, 3.0 cm^ of 
Folin reagent is added and after 30 min, the absorbance 
at 600 nm is determined. 
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Coomassie Blue Dye Binding Assay 

The dye Coomassie Brilliant Blue G250 binds to pro¬ 
teins and changes color. This is the basis for an assay 
that is appropriate for 5-100 pg (so similar to Lowry) 
which is more commonly known as the Bradford assay 
(Bradford 1976). The protonated form of Coomassie 
Blue is a pale orange-red color whereas the 
unprotonated form is blue. When proteins bind 
Coomassie Blue in acid solution, their positive charges 
suppress the protonation and a blue color results. The 
binding of the dye to a protein thus causes a shift in 
the absorption maximum of the dye from ^465 to 
^595 nm. It is the increase in absorbance at 595 nm 
that is usually monitored. The assay is nonlinear and 
requires a standard curve. 

The reagent may be prepared as follows. Coomassie 
Brilliant Blue G-250 (100 mg) is dissolved in 95% 
ethanol (50 cm^). To this solution phosphoric acid 
(100 cm^, 85% w/v) is added and the solution diluted 
to 1 cm^. Dyes are notoriously impure as supplied by 
chemical companies, so it is usually advisable to 
buy the reagent in kit form. To perform the assay, x 
cm^ of the sample containing 5-100 pg of protein is 
placed in a clean, dry test tube. (0.5 -x) cm^ water and 
5.0 cm^ of diluted dye reagent are added and the 
solution mixed well. After a period of of 5-60 min, 
A 595 is determined. 

The Bradford assay is relatively robust. The only 
compounds found to give excess interfering color in 
the assay are relatively large amounts of detergents 
such as sodium dodecyl sulfate, Triton X-100, and 
commercial glassware detergents. 
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Introduction 

Protein dynamics is intimately connected to function. 
Many biochemical reactions are mediated by protein 
conformational transitions to high-energy states that 
are often populated at low levels undetectable by tra¬ 
ditional methods in structural biology. Nuclear mag¬ 
netic resonance (► NMR) relaxation dispersion 
methods (► Relaxation Dispersion) provide a unique 
means of characterizing this type of protein motion 
with rate constants ranging from approximately 
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1 -10^ s“' and populations down to approximately 1%. 
Relaxation dispersion experiments yield a wealth of 
information about the exchanging system, including 
the kinetic rate constants, the relative populations of 
the exchanging states, and the chemical shift differ¬ 
ences between these, which depend on the molecular 
structure. Temperature-dependent experiments enable 
mapping of the energy landscape in terms of the free 
energy barrier and the free energy difference between 
the exchanging states, broken down into enthalpy and 
entropy. Applications have addressed protein folding¬ 
unfolding (► Protein Folding), enzyme catalysis 
(► Protein Dynamics in Catalysis - Computational 
Studies), and ligand binding (► Protein-Ligand 
Dynamics) (Korzhnev and Kay 2008; Baldwin and 
Kay 2009). 

This chapter outlines the fundamental principles of 
relaxation dispersion experiments and their application 
to protein dynamics. The focus is on heteronuclear 
experiments performed on isotope-labeled proteins 
(► Protein NMR - Introduction), where the 
dynamics is typically probed by or '^C spin relax¬ 
ation, but experiments have also been developed 
for 'H. 


Theory 

The basic theory was first developed in the 
1950s-1960s (McConnell 1958; Allerhand and Thiele 
1966) and has since seen continuous refinement 
(Palmer et al. 2001; Palmer 2004; Palmer and Massi 
2006), enabling experimental characterization of an 
increasingly broad range of problems, including 
exchange between multiple states. 

These NMR methods measure the radio frequency 
(RF) field dependence of the excess transverse relaxa¬ 
tion rate {Rex) caused by the exchange of nuclei 
between alternative conformations/states with differ¬ 
ent resonance frequencies (Fig. 1). The stochastic 
exchange process dephases the coherent magnetiza¬ 
tion, which can be refocused by applying 
a refocusing RF field, either in the form of a train of 
180° pulses or a continuous-wave “spin-lock” 
(Figs. 2-3). 

Evolution of a spin system undergoing exchange 
can be generally described using the stochastic 
Liouville equation, which reduces to the 
Bloch-McConnell equation in the case of a single 


uncoupled spin (Abergel and Palmer 2003). 
Below we outline the general framework describing 
exchange contributions to the transverse relaxation 
rate, based on the Bloch-McConnell equation, 
and illustrate the central concepts for the simplest 
case of 2-state exchange; analytical expressions 
describing more complex exchange processes 
have been described (Palmer 2004; Palmer and 
Massi 2006). 

Exchange between different states (which 
could represent alternative conformations within 
a molecule or sites in different molecules) can be 
described by a set of coupled reactions (van Kampen 
1992): 

kx2 kij 

Fi ^ Yi; Vi^Ys;Y,- ^ Yj -...; Yn^i ^ Y^ 

A'zi k2\ kji k22jx-\ 

( 1 ) 

where the N different states T, undergo mutual 
exchange with rate constants kjj (from T, to Yj). Reac¬ 
tions involving oligomerization or binding of ligands 
(► Protein-Ligand Dynamics) (where, e.g., T, is the 
free state and T, is the ligand-bound state) can be 
accommodated within this general scheme provided 
that the relevant rate constants are recognized as pseudo 
first-order reaction rate constants (e.g., kjj = kij [L], 
where [L] is the ligand concentration). 

The equilibrium populations are determined by the 
detailed balance condition 


KP = 0 (2) 

where P is the column vector of equilibrium 


populations p,, and the matrix of kinetic rate constants 
Kis 

K = 

1 

1 

M 

kii 

-Xz ... 

k^l 

^Nl 

(3) 



kiN 

—Y^n_ 


with diagonal elements 





S,- = 



(4) 
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Protein Conformational Dynamics by Relaxation Disper¬ 
sion, Fig. 1 Conformational exchange and its effect on the 
NMR spectrum, (a) Schematic illustration of exchange between 
two conformational states of a protein (yellow shape) with one 
H-N amide group shown in hall and slick representation. The 
resonance frequency of the nucleus depends on the molec¬ 
ular structure and changes from £2] to Q. 2 . In addition, the H-N 
bond vector changes orientation in the molecular frame when the 
protein exchanges between states 1 and 2. (b) Energy landscape 


view of conformational exchange between two conformations 
with relative populations pi = 0.9 and p 2 = 0-1 (magenta) or 
Pi ~ P 2 ~ 0.5 (blue). The forward and backward rate constants 
are ki 2 and £ 21 , which yield the exchange rate k^,^ = ki 2 + k 2 i. 
(c-e) Effects of exchange on the NMR spectrum in the absence 
of RF fields, (c) No exchange, k^x = 0. (d) Slow exchange 
regime, k^x ~ 100 s^'. (e) Fast exchange regime, 
k^x = 10,000 s^*. In c-e, £2] = —2n 100 s^* and Q 2 = 27t 
100 s“*; populations color coded as in panel b 


Exchange processes lead to increased transverse 
relaxation rates of nuclear spin magnetizations if the 
different states have different magnetic environments 


that give rise to different resonance frequencies. The 
magnetization of a single uncoupled spin evolves in 
time according to the Bloch-McConnell equation 
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Protein Conformational Dynamics by Relaxation Disper¬ 
sion, Fig. 2 Conceptual illustration of exchange contributions 
to the transverse relaxation rate in the presence of refocusing RF 
pulses. The vector diagram at the top indicates the evolution of 
a magnetization vector processing with offset frequency Cl, 
resulting in a phase i/r = Clt. Time points are taken at r = 0, 
immediately before and after the first 180° pulse {t — t_ and t+, 
respectively), and at the first echo (t = T,.p). (a-d) Phase evolu¬ 
tion diagrams, i/r(f) plotted versus t. (a) In the absence of 
exchange, the phase evolution (blue line, slope = Q) of the 
magnetization vector is refocused perfectly by the CPMG 





pulse train. The linewidth of the resulting NMR signal (green) 
is governed by intrinsic relaxation, R® (b) In the presence of 
exchange, the magnetization vector stochastically changes offset 
frequency between Q, (blue line) and Q .2 (t'sd line), resulting 
in dephasing of the coherent magnetization arising from the 
ensemble of spins. The resulting signal is broadened by the 
exchange contribution, (c) Increasing the number of 

refocusing pulses in a given relaxation delay decreases the R„ 
contribution to yield a narrower linewidth. Panel d shows each of 
the hypothetical exchange jumps that are included in panels 
b and c 


dt 


(L -F r)M(f) 


(5) 


where M(t) is the column vector of magnetizations 
residing in states 7,: M,(t) = [Mj-,(t), My/l), M^_,(t)], 
which represents deviations from the Boltzmann equi¬ 
librium magnetization. L describes the evolution in the 
absence of exchange 


Li 


L = 



0 


(6) 


L,= 


-R 


0 

V 


Q, 


0 


-Q, 

pO 

^ 2 ,/ 

0 


0 

0 


(7) 


where Rn and T?®,/ are the intrinsic relaxation rate 
constants of M^ i and respectively, in the absence 

of exchange, and Qi is the offset of the resonance 
frequency from the RF carrier frequency. F describes 
the effect of exchange on M,(t), by expanding K to 
accommodate the three-dimensional spin 
magnetization. 


L 


N 


0 


r = K 0 1 m 


( 8 ) 
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Protein Conformational Dynamics by Relaxation Disper¬ 
sion, Fig. 3 Pulse sequence elements for selected '^N relaxa¬ 
tion dispersion experiments. Wide and narrow bars represent 90° 
and 180° RF pulses, respectively. All RF pulses have phase y, 
unless otherwise specified, (a) Relaxation-compensated CPMG 
experiment. The delay is set to l/(4/jv,//) and n is any even 
integer, (b) R^p experiment using chemical shift evolution to 


align the magnetization with the effective field and return it to 
the z-axis. The period x is adjusted to l/cO]. Magnetization is 
returned to the z-axis and 180° pulses are applied to proton at 
time points T/4 and 3 T/4. (c) R\p experiment using an adiabatic 
pulse to align the magnetization with the effective field and 
another with the reverse profile to return it to the z-axis. 180° 
pulses are applied to 'FI at time points T/4 and 3T/4 


which is the direct product between K and the 3x3 
identity matrix (1^)- Provided that the differences 
between the intrinsic relaxation rate constants for the 
N states are small compared to the kinetic rate con¬ 
stants, that is, I Ri j—Rxj \<^ kjj and I R^j—Rij it 

suffices to consider the population-weighted relaxation 
rate constants 

N 

Ra = (9) 

/-I 

where Rp indicates R^ or /? 2 - In many cases of interest, 
the population of the ground state dominates such that 
the population-weighted relaxation rate constants can 
be replaced by those of the ground state. In the fast 
exchange limit (defined below), measurement of /?i or 
7?2 directly yields the population-weighted average. 


Timescales of Motion 

The exchange rate of the molecular motion relative to 
the difference in resonance frequency between the 
exchanging states defines the exchange timescale, 
which can be divided into separate regimes. In the 
case of 2-state exchange, the timescales are defined 
as: fast 3> Am), intermediate {kp^ « Am), and slow 
{kpx ^ Affl), with kpx = ki 2 + ^21 and Acu = IQ 1 -Q 2 I. As 
illustrated in Fig. 1, the exchange timescale and relati¬ 
ve populations of the exchanging states drastically 
affect the appearance of the NMR spectrum. 
In the absence of refocusing RF fields, 
the solution to Eq. 5 gives the relaxation decay 
constants of the resonances (Cavanagh et al. 2007). 
Assuming for simplicity that = R^i = Ri, 
a general solution for the dominant resonance 
(Pi > Pi) is: 
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Ri.i =R2 


+ 


0 , 1 1^2 


2 78 I 




(7^. - Aco^)^ - 16/21/22Aaj^7 


-, 1/2 


1/2 


( 10 ) 


Assuming further that pi > P2, Eq. 10 reduces to 
(Palmer 2004): 


extends the results of Eq. 12 that is independent of 
Bq in the slow-exchange limit, and of Eq. 11 that 
depends quadratically on Bq in the fast exchange limit. 

Relaxation Dispersion Experiments 

The exchange contribution to the transverse relaxation 
rate can be modulated by the application of an RF field. 
The effect of the RF field is taken into account by 
incorporating it into Eq. 5: 


^2^1 “'^“+^2^“/224 + Aft22 

In the slow exchange limit, ^ A®, the spectrum 
comprises separate resonances that have the relaxation 
decay constants: 


^ 2,1 — ^ 2,1 + ^12 “ ^ 2.1 + Plkex (12a) 

R 22 = ^2,2 + ^21 = R 2.2 + Pl^ex (12b) 

Note that if the populations are skewed (e.g.. 
Pi ^ P 2 ), then only the major component might be 
visible, whereas two resonances are visible if the 
populations are similar. In the fast exchange limit, 
kgjc ^ Acu, a single, averaged resonance is observed 
with relaxation decay constant: 

Rl = Pl^2,l + ^2^2,2 + PlPl^OJ^/kex (13) 

In analyzing relaxation dispersion data, it is critical 
to have accurate information on the timescale, which 
cannot readily be determined from the appearance of 
the spectrum if the populations are skewed (Fig. Id-e). 
Instead, the exchange timescale can be determined 
from the dependence of the exchange broadening, 
= R 2 - R 2 , on the static magnetic field strength, 
Bq. This is so because Am scales linearly with Bq. The 
relative change in R^^ is related to the relative change 
in Bo (Palmer et al. 2001): 

6R,, _d{\nRex) m _ SBo 

- — —7-1 T - — ^ - ( -1-'^) 

Rex (f(lnAm) Bq Bq 

In the cases of particular interest, / 2 i > 0.7, a defines 
the exchange timescale as follows: 0 < a < 1 for slow 
exchange; a « 1 for intermediate exchange; and 1 < a 
< 2 for fast exchange. This dependence of a on Am 


^^=(L + r + B)M(t) (15) 

with 


B = liv 


0 

0 


—ffli sin (p 


0 tui sin (/) 

0 —(DiCOSCp 

coi cos (p 0 


(16) 


where Ijy is the A-dimensional identity matrix, cui is 
the RF-field amplitude, and (p is the phase of the RF 
field, defined such that <p = 0 corresponds to an RF 
field oriented along the x-axis. 

In the case of rotating-frame relaxation, the RF field 
is applied throughout the relaxation delay, t, and Eq. 15 
can be solved directly (following transformation to the 
tilted rotating frame; see below). By contrast, in labo¬ 
ratory-frame spin-echo experiments, the RF field is 
applied as a train of even-numbered 180° refocusing 
pulses, spaced at intervals of Xep, where t — InXep and n 
is an integer. In this case, Eq. 15 needs to be integrated 
in a piece-wise manner, taking into account the finite 
length of the 180° pulses. However, the treatment can 
be simplified by recognizing that the effect of each 
180° pulse essentially is to invert the phase of the 
transverse magnetization in between periods of free 
precession (see below). 

Due to the distinction between laboratory- and 
rotating-frame experiments, below we outline their 
salient features under separate headings. 

Laboratory-Frame Relaxation 

Laboratory-frame relaxation is measured using a Carr- 
Purcell-Meiboom-Gill (CPMG) train of refocusing 
pulses (Fig. 3a, ► CPMG). Considering that a 180° 
pulse is typically much shorter than the timescale of 
evolution under L -i- F, and that oti Q, the effect on 
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M(t) of a single 180° pulse with 0 = 0 can be described 
in compact notation: 


V± = 


^cp 

72 




ll/2 


M(r+) = UM(f_) (17) 




where t+ and t_ denote time points immediately 
following and preceding the 180° pulse, respectively, 
and 


U = 1a( 0 


1 

0 

0 


0 0 

-1 0 

0 1 


( 18 ) 


Taking the solution of Eq. 5 to be M(f) = exp[(L + 
r)f]M(0) = A(f)M(0), the effect of a single spin-echo 
is given by 

M(t,p) = A(t,p/2)UA(t,^/2)M(0) 

= A(t,^/2)UA(t,^/2)U-'UM(0) (19) 

Consequently, at the end of an even number of 
CPMG echoes, the magnetization is described by: 


M(2«t,^) = [A(t,^/2)UA(t,^)U-1A(t,^/2)]"M(0) 

( 20 ) 


Equation 20 shows that the sense of chemical shift 
evolution is inverted during one half of the total CPMG 
period, leading to perfect refocusing of the magnetiza¬ 
tion in the absence of exchange (Pig. 2a). Closed-form 
expressions for the transverse relaxation rate constants 
as a function of the spacing of the refocusing 
pulses have been derived for several limiting cases. 
Equation 21 defines a general expression for 2-state 
exchange, which is valid in all exchange regimes, as 
long as the evolution is dominated by a single expo¬ 
nential (Davis et al. 1994): 

= ^2 + 2 

(kex —^cosh“' [D+ cosh(r^_,_) — D_ cos(r 7 _)] | 


where 


D± 


±l + (0 + 2Am2)/(02 + ^2)'/2' 


c = -2Acokex{pi -Pl) 

Measuring R 2 as a function of x^.p enables extraction 
of kex. Pi, and Aoj. Combined analysis of relaxation 
data from multiple static magnetic fields aids in sepa¬ 
rating the influence of the different parameters and 
increases the reliability of curve htting. In the fast 
exchange limit, a simpler expression is adequate 
(Allerhand and Thiele 1966): 




1 - 


^ex'^cp 


tanh 


kex^cp 


( 22 ) 


In this case, the determinable exchange parameters 
are limited to k^^^ and the product pxP 2 A(j?. Typically, 
the decay is well approximated by a single exponential 
in the fast and intermediate exchange regimes, whereas 
this may not be the case in the slow exchange regime, 
where instead the following equation describes the 
relaxation of the major conformation (px > P 2 ) when 
1^2 1 “-^221 ^12 ^/'^cp (Gutowsky et al. 1965; 

Tollinger et al. 2001): 


7?2=4i 


, sin(AfflT^^)\ 

V Amxcp ) 


(23) 


To make these equations valid for cases where the 
exchange-free relaxation is multiexponential due 
to cross-correlated relaxation mechanisms (► Cross- 
Correlated Relaxation), experiments are often 
recorded in a constant time manner, where the number 
of the refocusing pulses is varied during a hxed relax¬ 
ation period. Another potential problem concerns sca¬ 
lar coupled spins, which evolve during free-precession 
relaxation periods between in-phase and anti-phase 
spin operators that have different relaxation rates, 
resulting in an effective relaxation rate that is modu¬ 
lated by the spacing of the refocusing pulses, unless 
these are applied at time points x^p = mlJis, where m is 
an integer and Jig is the scalar coupling constant 
(► J Coupling). Relaxation-compensated experiments 
solve this problem by dividing the relaxation period 
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Protein Conformational Dynamics by Relaxation Disper¬ 
sion, Fig. 4 Definitions of teiTns in rotating-frame relaxation 
experiments, (a) A nucleus experiencing two different magnetic 
environments, 1 and 2, will resonate at two different frequencies 
separated by Aca. Its offsets from the carrier frequency of an 
applied RF field, co^ /r, are £2, and Q .2 with a population-weighted 


into two parts that explicitly average the relaxation of 
in-phase and anti-phase magnetizations regardless 
of the number of refocusing pulses (Palmer et al. 
2001; Cavanagh et al. 2007). Alternatively, 
experiments can be designed to measure the relaxation 
of in-phase magnetization by decoupling the other 
spin, while carefully avoiding coherence transfer 
between spins due to matched RF fields (Hansen 
et al. 2008). 

Rotating-Frame Relaxation 

In the presence of a continuous RF field in the trans¬ 
verse plane, a spin will experience an effective field 
given by (see Fig. 4): 

+ (24) 

where Q is the offset from the RF carrier. Magnetiza¬ 
tions aligned with the effective field will be locked in 
a tilted rotating frame making an angle, 6, with the 
static magnetic field that depends on Q and coi 
according to: 


tan 0 = — (25) 

The relaxation rate of a magnetization oriented 
along the effective field is denoted by /?ip and is a 


offset of Cl. (b) During application of an RF field, the magneti¬ 
zation is locked in a frame rotating with the carrier frequency of 
the applied field and is tilted with respect to the static magnetic 
field by an angle 9. The spin will experience an effective field, 
co^ff, which is the vector sum of the offset from the RF carrier and 
the strength of the RF field 


weighted sum of longitudinal and transverse relaxation 
rates: 

Rip = Ricos^f) + + ReA-sin^0 (26) 

Rip is measured by first aligning the magnetization 
vector with the effective magnetic field, either by 
a combination of pulses and chemical shift evolution 
(Fig. 3b) or by an adiabatic pulse (Fig. 3c), then apply¬ 
ing an RF field for a relaxation period T, and subse¬ 
quently returning the magnetization vector to the 
z-axis (Cavanagh et al. 2007). 

Analytical expressions that approximate the numer¬ 
ical solutions to the Bloch-McConnell equations have 
been derived for several exchange scenarios (Abergel 
and Palmer 2003; Palmer 2004; Palmer and Massi 
2006). For 2-state case in the fast exchange limit. Rip 
is described by: 

Rip = RiCos^D + RSsin^f) + sin^0^^^^^-^ 

(27) 

where is taken to be the population-weighted 
value, ajgff = Wi + (/tiQi + P 2 ^ 2 ) ■ As is the case of 
CPMG relaxation dispersion, only and p\p 2 l^ijr can 
be determined in this regime. An expression applicable 
to 2-state exchange on arbitrary timescales is given by 
(Palmer and Massi 2006): 
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PxP2^m'^ke. 




lff.2l^lff + - sin^0/21/22Affl2 (^1 + ^ 




(28) 


where = oti^ + Q,^. If /2i 3> P 2 , then a simple 

relationship is obtained (Palmer 2004): 

Rip = RiCos^O + R^sin^O + sin^0^^^^^^^-(29) 

4 + ®?#,2 

This equation is congruent with Eq. 27, except that 
Rex now depends on the effective field of the less 
populated state, (Oeff, 2 , rather than on (Oejf. Thus, R^p 
relaxation is clearly different in the fast- and interme¬ 
diate/slow exchange regimes. Under fast exchange 
conditions, Rex is maximal when the RF carrier fre¬ 
quency is placed at the average resonance position 
(/ 2 iQi -I- P2^2)- By contrast, outside of the fast 
exchange regime, Rex reaches its largest value when 
the carrier is placed on resonance with the minor state 
(Q 2 ), see Fig. 5a. In the intermediate/slow regime, both 
the populations and absolute sign of Am can be deter¬ 
mined from the offset dependence of /?„■ Similar 
results are obtained also for more complex scenarios 
(Palmer and Massi 2006), as exemplified for linear 
3-state exchange in Fig. 5b. 

Since Rex depends on (Oeff that in turn depends on Q 
as well as (Oi, it is possible to record/?ip dispersions by 
varying Q or 021 , or both. In particular, it is possible to 
achieve very large effective fields by moving the spin- 
lock field far off resonance, albeit at the expense of 
reduced sensitivity to exchange through the depen¬ 
dence on sin^0. Rip experiments can thus probe signif¬ 
icantly faster exchange processes than those accessible 
by CPMG; this conclusion holds even if the RF field is 
applied on resonance, since approximately twice as 
large effective fields can be achieved in this case 
(Fig. 6). For intermediate to slow exchange, another 
advantage compared to CPMG is that all exchange 
parameters, including the sign of Am, can be retrieved 
without the need to record additional experiments. Rip 
dispersion experiments targeting intermediate 
exchange are typically recorded as a function of Q 
using constant cui, as exemplified in Fig. 5. 


Extensions and Practical Considerations 

It is helpful to determine separately in order to 
define the dispersion plateau and thereby improve the 
accuracy of exchange parameters extracted by curve 
fitting, especially in the case of fast exchange (Palmer 
et al. 2001; Cavanagh et al. 2007). As gauged from 
Eqs. 21-22, R 2 {^hcp) approaches /?2 when Ihcp > Kx- 
This requirement is difficult to meet if kex > 2,000 s”* 
due to NMR hardware limitations. However, provided 
that R\ is measured separately, accurate exchange 
parameters can be extracted even if I/t^.^ > QAkex', 
similar considerations apply to Rip experiments. R\ 
can be estimated from the transverse cross-correlated 
relaxation rate, which does not include contributions 
from exchange (Palmer et al. 2001; Cavanagh et al. 
2007). Alternatively, Z?® can be estimated from a linear 
combination of four relaxation rates determined for 
two-spin Cartesian operators, e.g., N^H^., N^Hj., 

(Hansen et al. 2007). In relaxation-compensated 
CPMG experiments, the relevant exchange-free relax¬ 
ation rate is an average of those for in-phase and anti¬ 
phase operators, which in the macromolecular limit 
can be calculated based on Z?® for the in-phase coher¬ 
ence and the longitudinal relaxation rate of the coupled 
nucleus (Palmer et al. 2001; Cavanagh et al. 2007). 

Extensions of the basic experimental design 
described above include methods to measure Rex con¬ 
tributions to the relaxation rates of multiple-quantum 
(MQ) coherences. Here, Am is replaced by the sum 
(Am/ -I- Amj) or difference (Am/ — Am5) for double¬ 
quantum (2Q) or zero-quantum (ZQ) coherence, 
respectively. Methods have been developed to measure 
the difference in exchange contributions to the 
relaxation rates of ZQ and 2Q coherences, AZ?^^ 
(Cavanagh et al. 2007). In the fast exchange regime, 
Azz^S is derived from Eq. 13 

= ^P\P2^(Ol^(Os/kex (30) 
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Protein Conformational 
Dynamics by Relaxation 
Dispersion, Fig. 5 Offset 
dependence of outside of 
the fast exchange regime. 
is plotted for (a) 2-state and 
(b) 3-state exchange. The lines 
represent exact solutions to the 
Bloch-McConnell equation. 
Parameters used in the 
calculations were: 


(a) Pi = 

0.95, 

ki2 + ^21 

= 250s-‘. 

II 

3 

<1 

2,400 s“'; 

(b)pi = 

0.95,P2 = 0.03, 

ki2 + ^21 

= 500 s“‘. 

^13 + ^31 

= 1,000 s“‘. 

^23 + ^32 

= 700 s“‘. 

Aoj2i ~ 

2,000 s“'. 

Am3i = 

-4,000 s^‘; 


ya aiiu uy ivi — i.J , 

/J2=ll s~* andcoi = 1,000s“* 



Offset (s '') 



Offset (s '') 


A more general expression, valid across a broader 
range of time scales as long as pi ^ p 2 , is obtained 
from Eq. 11 (Palmer 2004; Cavanagh et al. 2007): 

^l^MQ ^ _ 4pjp2Aco,Acoski _ 

P^l4x + ^ 

(31) 

Equations 30-31 reveal that MQ experiments yield 
the relative sign of Aojj and Acog, which is valuable for 
further interpretation of the chemical shifts in terms of 
the minor state structure. Relaxation dispersion exper¬ 
iments tailored for MQ coherences demand special 
consideration, because both x and y components 
of the magnetization need to be refocused since the 
ZQ and 2Q coherences involve linear combinations of 


and operators (Orekhov et al. 2004; 

Verde et al. 2009). 

Resonance frequencies depend not only on the 
chemical shifts, but also on interactions with neighbor¬ 
ing spins. Anisotropic interactions, such as dipolar 
couplings, average to zero in isotropic solutions but 
can be reintroduced by dissolving the molecule of 
interest into an anisotropic medium that generates 
weak alignment (Tjandra and Bax 1997; Cavanagh 
et al. 2007) (► Residual Dipolar Coupling). The 
resulting residual anisotropic interactions depend on 
the orientation of the interaction tensor in the molecu¬ 
lar frame of the protein, which contains valuable struc¬ 
tural information (cf. Fig. la). Residual dipolar 
couplings (RDCs) have been used extensively in struc¬ 
tural and dynamical studies of proteins. Other 
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Protein Conformational 
Dynamics by Relaxation 
Dispersion, 

Fig. 6 

Rcx{<^eff^ oo), plotted as 
a function for CPMG 

(filled circles, solid lines) and 
Rip (dashed lines) 

(a) = 8,000 s“', 

(b) kg^ = 2,000 s“* and (c) 
kcx ~ 250 s“*. In all cases 

Pi = 0.95, Act)2i = 2,000 s^', 
Ri = 1.5 s“*, andS” = 11 s^'. 
For CPMG (Opff is defined as 
Jt/x,.p. In the case of Rip, the 
RF carrier is placed at the 
population-weighted offset, Q, 
in (a) and (b) and at 12] in 

(c) , and ci/pffris altered by 
varying coi 
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anisotropic interactions can also be measured; for 
example, residual anisotropic chemical shifts with 
well-defined orientation and magnitude of the chemi¬ 
cal shift tensor can he interpreted to provide structural 
constraints. If anisotropic interactions result in differ¬ 
ent resonance frequencies of the exchanging states, 
they will contribute to because Am = AQ ± 
TtAD, where AQ is the difference in chemical shift 
and AD is the difference in coupling (expressed in 
Hz) (Igumenova et al. 2007; Vallurupalli et al. 2007). 
Thus, residual anisotropic interactions can give rise to 
significant Rg^, even if AQ = 0. Moreover, the resulting 
AD terms are of greatest interest, because they directly 
report on motions of the affected nuclei (rather than 
surrounding protein moieties) and provide additional 
information on molecular structure. Relaxation disper¬ 
sion experiments based on spin-state selective pulse 
sequences detect either of the two components of the 
doublet arising from nucleus S being in the up or down 
spin state (Cavanagh et al. 2007). Thus, spin-state 
selective experiments measure Am = AQ -i- rtAD 
or Am = AQ — rtAD, which enables separation of 
AQ and AD. 

Spin-state selective implementations of CPMG or 
Rip experiments that select the slowly relaxing 
(► TROSY) multiplet component also offer the benefit 
of higher sensitivity and greater sensitivity toward 
exchange for medium- and high-molecular-weight 
proteins (Igumenova and Palmer 2006; Cavanagh 
et al. 2007). 

Summary 

High-resolution relaxation dispersion experiments 
provide a wealth of information on protein dynamics 
on slower time scales that often are of prime biological 
interest. Laboratory- and rotating-frame experiments 
are in many ways complementary and offer particular 
advantages under certain experimental scenarios, but 
can also be used in conjunction. The maximum infor¬ 
mation content of these experiments comprises the 
exchange-free decay rates (Rj, and R\^, the kinetic 
rate constants {kj j), and the relative populations of the 
various states ip,), as well as the chemical shift differ¬ 
ences (AtUjj) between pairs of exchanging states. 
Together, these parameters provide a detailed view of 
protein dynamics. 


Cross-References 

► CPMG 

► Cross-Correlated Relaxation 

► Protein NMR - Introduction 

► J Coupling 

► NMR 

► Protein Dynamics in Catalysis - Computational 
Studies 

► Protein Folding 

► Protein-Ligand Dynamics 

► Relaxation Dispersion 

► Residual Dipolar Coupling 

► TROSY 
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Synonyms 

Mutation; Protein engineering; Protein immobilization 


Definition 

Protein engineering aims at the modification of the 
gene codifying for a target protein with the purpose 
of generating mutants with the desired properties. It 
allows to study the structure/function relationship of 
proteins and to generate molecules (enzymes, small 


peptides, biomaterials) for biotechnological, including 
sensing, applications. 

Introduction 

Recombinant DNA technology has revolutionized the 
way we can design proteins for biosensing. It allows 
complete control over the amino acid sequence of 
a protein by experimentally introducing mutations at 
DNA level. This enables to modify or introduce prop¬ 
erties such as binding, catalytic turnover, substrate 
specificity, and stability. It is also possible to fuse 
different domains or whole proteins allowing to com¬ 
bine functions such as catalysis and binding with also 
tagging for immobilization on surfaces. In some cases, 
proteins can be expressed from synthetic or modified 
genes, bringing protein engineering in the sphere of the 
new discipline of synthetic biology. 

Three branches of protein engineering can be iden¬ 
tified: rational design that uses site-directed mutagen¬ 
esis to modify existing proteins to achieve a particular 
aim; de novo design that involves the synthesis of new 
proteins from first principle by using established 
knowledge on protein folding and structure prediction; 
directed evolution that uses random mutagenesis on 
known gene sequences to generate new proteins or 
enzymes to achieve a new target function in a fortu¬ 
itous manner identified by screening or selection of 
a wide range of sequences. The methodologies under¬ 
lying these approaches and their use in biosensors have 
been reviewed in detail (Gilardi and Fantuzzi 2001; 
Gilardi 2004). 

Protein Engineering by Rational Design 

Rational protein engineering is based on site-directed 
mutagenesis and it relies on the existing information 
on the 3D structure of the target protein and the impli¬ 
cation of specific residues in its function. When a 3D 
structure of the studied protein is not available, rational 
protein engineering can still be applied by generating 
a 3D model of the target. The model can be validated 
by traditional biochemical studies on the native protein 
or by the lengthy process of establishing the role of 
different residues by alanine scanning mutagenesis. 
The final output is the generation of a structural and 
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Protein Design for Biosensors, Fig. 1 (a) Homology model 
of human flavin-containing monooxygenase 3 (hFM03) with its 
cysteine residues shown in orange and the FAD cofactor in 
yellow. The solvent-exposed cysteine 466 used for covalent 
linkage to electrode surfaces is indicated by an arrow. 

functional model that allows the identihcation of func¬ 
tionally important residues to be mutated by site- 
directed mutagenesis to achieve the desired property. 

Protein modeling is a discipline in its own right and 
it relies on the vast amount of structural and functional 
information stored in databases available. Once the 3D 
X-ray or NMR structure or a reliable 3D model for the 
target protein has been obtained, molecular graphics is 
used to locate the key residues to be targeted by muta¬ 
genesis, as well as to perform calculations to interpret 
or extrapolate principles for design of the mutants. For 
example, the Connolly algorithm allows the calcula¬ 
tion of the solvent-accessible surfaces of proteins and 
their specihc residues; this bears important implication 
for covalent linkage of the protein to solid surfaces 
(see, for example, Fig. 1, Sadeghi et al. 2010). 

Another useful operation that can be performed to 
guide the design of mutants is the calculation of a map 
of electrostatic potentials on a protein surface. Electro¬ 
static interactions play a key role in molecular interac¬ 
tions such as protein/protein, protein/ligand, enzyme/ 
substrate as well as protein/surface interactions in elec¬ 
trochemical or other immobilized systems. Therefore, 
the characterization of such interactions is often an 
important element in the rational protein design pro¬ 
cess. On the basis of the knowledge of the structure/ 
function relationship of the protein under investiga¬ 
tion, the rational design cycle may begin according to 
the scheme of Fig. 2. 


(b) Covalent linkage of cysteine 466 of hFM03 to a 
self-assembled monolayer of dithiobismaleimidoethane on 
gold, allowing the electrochemically driven enzymatic conver¬ 
sion of the drug tamoxifen (TAM) into its A-oxide product 
(TAM-NO) 

The prerequisite for site-directed mutagenesis is the 
availability of the cloned gene of the target protein and 
the expression at high levels, typically hundreds of mg 
per liter of culture, of the mutants in a host cell that does 
not produce the wild type protein. Site-directed muta¬ 
genesis can be used to replace, delete, or insert one or 
more amino acids. In all cases, a DNA polymerase is 
used to insert an oligonucleotide primer containing one 
or more mismatches in reference to the wild type DNA 
template. Several methods have been developed over 
the last 15 years to achieve these objectives; the pre¬ 
ferred approach available today is the method based on 
the polymerase chain reaction (PCR). This provides 
a quick and efficient strategy to generate mutants by 
amplification of a double stranded DNA (dsDNA) tem¬ 
plate using synthetic oligonucleotide primers that con¬ 
tain the desired mutation(s). The PCR methods use 
a heat-stable DNA polymerase, for example, the Taq 
polymerase from Thermus aquaticus or the Vent from 
Thermococcus litoralis or the Pfu from Pyrococcus 
furious. The choice of the best heat-stable DNA poly¬ 
merase depends on the needs of the specific experiment, 
as different enzymes have different optimal tempera¬ 
ture, level of fidelity, and proofreading activity. In all 
cases, the enzyme carries out the synthesis of 
a complementary DNA strand of DNA from the 5’- to 
the 3’-end using a ssDNA template, starting from 
a dsDNA region where a synthetic oligonucleotide 
primer is annealed. At least one of the primers must 
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and site for mutation 



Denature the plasmid and 
anneal with mutagenic primers 


Mutagenic 
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nicked circular strands 
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Mutated plasmid 
repaired by E.coli XLI-Blue 



Protein Design for Biosensors, Fig. 2 (a) Flowchart 
showing the experimental steps involved in the rational design 
of proteins, (b) Site-directed mutagenesis performed directly 

contain the sequence for the desired deletion(s), inser- 
tion(s), or replacement(s). Two primers are necessary, 
each complementary to opposite strands of DNA, initi¬ 
ating the synthesis of the region flanked by them. The 
elements necessary in the mix for a PCR reaction are an 
excess of the deoxynucleotides (dNTPs) that provide 
both the energy and the nucleosides required for the 
synthesis of the DNA, an excess of the oligonucleotide 
primers, and a very small amounts of the DNA tem¬ 
plate. The reaction is carried out by first heating the 
sample to separate the DNA strands (denaturation), 
followed by cooling to allow pairing of the primers on 
the DNA template (annealing) and setting to the tem¬ 
perature for optimal polymerase activity (extension). 


on the expression plasmid (gray) using complementary muta¬ 
genic primers (red) and the high-fidelity polymerase Pfu 
Turbo 

The heating/cooling process is repeated in cycles by 
using an automated thermocycler able to achieve the 
required temperature very rapidly. The strong advan¬ 
tage of the PCR method is that the new DNA strand 
containing the mutagenic primers will accumulate 
exponentially at each cycle, while the wild type DNA 
template will remain in trace amounts as initially intro¬ 
duced by the experimentalist. For this reason, it is 
crucial to use only minimal amounts of template 
DNA. The final result is the substitution, insertion, or 
deletion of specific amino acids in the protein sequence 
conferring desired functional properties. 

Protein engineering has also contributed to achieve 
the controlled immobilization of protein on surfaces. 













p 


1982 


Protein Design for Biosensors 



Screws 


Pogo-pin I 

Working eiectrode 

(WE) ^ 

With engineered redox 
enzyme 


WE hoider 
Outiet nanoport 


iniet nanoport 



Screen printed 
Counter eiectrode (CE) 
Reference electrode (RE) 


CE-RE holder 


Protein Design for Biosensors, Fig. 3 (a) Electrode array 
(green) on PET layer consisting of eight wells. Side view of 
a well is shown at the top. The gold working electrode 
(yellow) is used to covalently link the engineered protein 
(here a human cytochrome P450, red) via a self-assembled 
monolayer. Carbon ink is used as a counter electrode (black)', 
Ag/AgCl ink (gray) is used as a reference electrode and silver 

This is a very important achievement for the rational 
design of sensing surfaces. To this purpose, cysteine 
residues are very interesting targets owing to their low 
occurrence in natural proteins and to the specific reac¬ 
tivity of the thiol of the side chain, in particular toward 
fluorescence and redox labels as well as gold surfaces. 
Array-based electrodes as well as microfluidic amper- 
ometric devices have been constructed exploiting 
the ability of site-directed cysteines to covalently link 
self-assembled spacers bound to gold electrodes 
(Mak et al. 2010; Fantuzzi et al. 2010, 201 1) (Fig. 3). 

Protein Fusions 

Fusion of genes codifying for different proteins has 
also been used in biosensor development; two exam¬ 
ples are fusions with the green fluorescent protein 
to achieve optical signal and the fusion of electron 
transfer proteins with the catalytic domain of redox 
enzymes to achieve communication between the cata¬ 
lytic site and the electrode surface in amperometric 


ink is used for the connecting tracks, (b) Exploded view of 
a microfluidic cell made of micro-machined polymethylme- 
tacrylate containing an evaporated gold on silicon working 
electrode (WE, orange) with the immobilized engineered 
protein, and screen printed carbon and silver/silver chloride 
inks on polytethylene terephthalate counter (CE)/reference 
(RE) electrodes 

devices (molecular lego, Sadeghi et al. 2000). This 
approach is very close to the area of research that has 
been named recently as synthetic biology. 

Tagging and Chemical Modification of 
Proteins 

Following the construction and expression of mutants 
by molecular biology of the synthesis of new proteins 
from first principle, some level of chemical modifica¬ 
tion of the protein may be required. This is the case, for 
example, for the site-specific labeling of engineered 
unique surface cysteines with fluorophores or electro¬ 
active probes. Both approaches add new functionalities 
to the protein enabling it to respond optically or elec- 
trochemically upon binding or catalysis. Other amino 
acids used for chemical modification are those carrying 
amino- or carboxyl-groups in their side chains, or the 
C- and N-terminus of the protein itself. As the amino- 
and carboxyl-groups are much more common in the 
sequences of proteins than the thiol-groups of cysteine, 
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Protein Design for Biosensors, Fig. 4 (a) Flowchart showing 
the experimental steps involved in the de novo synthesis of 
proteins, (b) Repeats of alpha-helical structures (cylinders) 
used to generate synthetic four helix bundles ahle to incorporate 


heme {red) (c) De novo designed helix bundle with multi-redox 
centers, immobilized on an electrode surface for the develop¬ 
ment of bio-electronic devices 


they are less suitable for site-specific labeling of sens¬ 
ing proteins. 

Suitable plasmids are also commercially available 
to allow to clone the gene of the target protein next to 
strings of typically 4-6 histidines (His tags), or biotin- 
acceptor peptides designed to be either at the N- or C- 
terminus of the protein, or to binding proteins such as 
the maltose-binding protein. This has been used for 
a wide range of applications in purification by affinity 
chromatography or for immobilization on solid 
surfaces. 

Other examples of chemical modification of pro¬ 
teins may involve the incorporation of nonnatural 
amino acids with synthetic side chains that carry with 
them desired optical or electrochemical properties as 
described above. These nonnatural amino acids are 
inserted during the in vitro or chemical synthesis of 
the protein. 

Protein Engineering by De Novo Synthesis 

Proteins can also be created from first principle by 
solid state synthesis (Borgia and Fields 2000). This 
approach follows the scheme described in Fig. 4. 


Four helix bundles have been a popular target for 
the de novo design of model cytochrome and a range of 
binding proteins because of the relative simplicity of 
their intra-chain hydrogen bonding pattern and the 
well-understood hydrophobic effect that drives their 
folding. On the other hand, so far progress in creating 
6 -structures has been slower due to the complex long- 
range interchain patterns of hydrogen bonding that 
characterize this type of folding (DeGrado et al. 
1999). A variety of programs are now available in 
guiding the rational de novo design of proteins, 
allowing to predict the folding of a particular sequence 
(Ritcher et al. 2011). An alternative approach to the de 
novo rational design of proteins is the combinatorial 
synthesis. Here, libraries of peptides and proteins are 
generated either by chemical solid phase synthesis or 
by combinatorial synthesis of genes that are then 
expressed in E.coli. Libraries of synthetic peptides 
folding into helical bundles can be obtained with 
a binary pattern of polar and nonpolar amino acids, in 
which the character of the residue is specified but the 
exact side chain is varied randomly. This can be 
achieved by constructing a library of synthetic genes 
containing patterns of degenerate NAN codons (in 
which N denotes any base) where polar amino acids 
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Protein Design for Biosensors, Fig. 5 (a) Flowchait showing the experimental steps involved in the directed evolution of 
proteins, (b) Details of the steps necessary to obtain a library of random mutants 


are required (lysine, histidine, glutamic acid, gluta¬ 
mine, aspartic acid, asparagine), and NTN where 
apolar amino acids are specified (methionine, leucine, 
isoleucine, valine, phenylalanine). 

Protein Engineering by Directed Evolution 

Directed evolution is based on a number of cycles of 
random mutagenesis aiming at achieving new func¬ 
tions in existing proteins. It is applicable also when 
little information is known on the structure/function 
relationship of the target protein/enzyme. The 
resulting library of random mutants must be efficiently 
screened to confirm that the desired function has been 
successfully obtained (Fig. 5). This approach has been 
successfully applied to the creation of enzymes with 
high chemical and thermal stability, solubility in 
organic solvents, activity toward new substrates spec¬ 
ificity and enantio- or regio-selectivity in catalysis. All 
these are very attractive properties for the biosensors of 
the future (Valetti and Gilardi 2004). Essentially this 


approach mimics in vitro the process of natural molec¬ 
ular evolution that is able to generate a potentially 
infinite plethora of new proteins with new functions. 
Therefore, it becomes obvious that there is a great 
potential that directed evolution offers to the 
bioanalytical area, where new binding proteins, for 
example, antibodies, or enzymes active on specific 
target analytes can be generated. 

Directed evolution is based on the generation of 
genetic diversity by DNA recombination, also called 
DNA shuffling. This process mirrors the Darwinian 
evolution where random mutagenesis, recombination, 
and selection allow the addition of beneficial mutations 
and the removal of deleterious ones. Once a trend in the 
structure/function properties of the target enzyme has 
been rationalized, the directed evolution approach can 
be set to work in tandem with a rational design. 

Error-prone PCR is by far the most used approach to 
produce random mutants. It was developed by Leung 
in 1989 (Leung et al. 1989) who exploited the ability of 
thermostable DNA polymerases lacking proofreading 
activity, for example, Taq polymerase, to introduce 
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Protein Design for 
Biosensors, Fig. 6 Example 
of random positioning of 
mutations following error- 
prone PCR on cytochrome 
P450 BM3 from Bacillus 
megaterium, on the overall 
fold (a) and in detail (b) These 
mutations were found 
(Tsotsou et al. 2012) to confer 
to this fatty acid hydrosylase 
the ability to perform the 
reactions described in Fig. 7. 



Asp251 



random mistakes during the extension of the new DNA 
strand. Taq has the highest known error rate, in the 
range of 0.1 x 10“'^ to 2 x 10“"^ per nucleotide per pass 
of the polymerase. The rate of misincorporation 
depends on the specific reaction conditions; therefore, 
the standard PCR protocol needs to be modified to 
control the error rate. The process to perform directed 
evolution is described in Valetti and Gilardi (2004). 

Error-prone PCR is able to produce large libraries 
of random mutants in a sequential fashion where 
deleterious mutations are accumulated in conjunction 
with beneficial ones. This can seriously limit the 
evolutionary process because of the high probability 
that the effects of the few beneficial mutations are 
lost due to the many deleterious ones. From 
a biological point of view, sequential random muta¬ 
genesis is equivalent to an asexual evolutionary 
process. 

The breakthrough in directed evolution was made 
by Stemmer in 1994 with the introduction of DNA 
shuffling that allows the in vitro formation of recom¬ 
binant genes from a set of parental ones. Block changes 
are introduced with the fragmentation by DNase 1 of 
a pool of homologous genes that can be derived from 
a naturally occurring protein family or from a library of 
mutants produced by error-prone PCR. The fragments 
are purified by agarose gel electrophoresis and 
reassembled into a full length gene by repeated cycles 
of PCR without added primers. During this step, the 
fragments prime each other on the homologous 
regions, resulting in recombination when fragments 
derived from one parental gene prime on another one, 
causing a template switch. The recombinant genes 
obtained with this procedure can then be used for 


other rounds of mutation and recombination. Deleteri¬ 
ous mutations are removed by backcrossing with 
parental or wild type DNA. 

A key requirement to proceed in the iterative cycles 
leading to the best recombinant protein is the availabil¬ 
ity of a screen or selection procedure that is sensitive to 
the properties target in the in vitro evolution process. 
This step is not trivial, as the vast majority of the 
mutations are deleterious and only very few will lead 
to the improved target property. The number of possi¬ 
ble variants (V) of a protein that can be created by 
introducing M substitutions simultaneously over 
N amino acids is given by the formula: 

V= 19'^[N!/(N-M)!M!] 

This means that even for a small protein, when only 
10 positions are targeted for three simultaneous 
changes of amino acids, there are 823,080 possible 
variants. The ability to identify one possible active 
variant among this large library is close to the limit 
of most screen or selection methods. 

Traditionally the term selection is reserved for the 
methods of searching a library based on an enzymatic 
function that confers a growth or survival advantage to 
the host organism. An example of selection is given by 
Stemmer in the first application of DNA shuffling on fi- 
lactamase. Generally the use of a host organism as a 
means for biological selection is very efficient and it 
allows searches of libraries containing more than 10° 
variants. There is however a high risk that, under the 
selective pressure, the host organism itself develops 
alternative metabolic mechanism unrelated to the tar¬ 
get activity. 
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Protein Design for Biosensors, Fig. 7 Range of reactions 
performed by mutants A2 (Asp251Gly, Gln307His), M3 
(Asp251Gly, His266Ala, Glu267Arg, Thr269Asn, Ser270Glu, 
Gln307His), P2 (Asp208Arg, Ile209Tyr, Asp251Gly, 
Gln307His), and K4 (Val317Cys) of the fatty acid hydrosylase 


cytochrome P450 BM3 from Bacillus megaterium. Random 
mutants were found to recognize different drugs as well as 
polyaromatic hydrocarbons offering new applications both in 
biosensing and bio-catalysis (Tsotsou et al. 2012). 


On the other hand, a screening procedure involves 
the construction of an arrayed protein library and the 
application of a rapid and high throughput assay suffi¬ 
ciently sensitive and specific to enable the identifica¬ 
tion of positive variants. The development of the 
screening assay is usually the critical step in the 
directed evolution of a particular enzyme. Quantitative 
screening assays based on 96 (or more) well plates 
suitable for automation using robotic arms, liquid han¬ 
dling systems, and plate readers with automated data 
acquisition are favored. 

Once the presence of the desired mutation has been 
confirmed by DNA sequencing, the protein can be 
expressed and purified. The yields should reach levels 
of at least 100 mg/1 of bacterial culture to allow rea¬ 
sonable applications in the biosensor area. In the first 
analysis, the functional performance of the mutant 
should be checked with a number of complementary 


physical-chemical methods. The detailed functional 
characterization of the mutants aims at establishing 
what effect, direct or indirect, the mutation has on the 
function of the protein. Although X-ray crystallogra¬ 
phy and NMR spectroscopy are the methods of choice 
for a detailed analysis of the effect of the mutation on 
the structure and its consequences on function, the 
application of these techniques is not always quick 
and straightforward. Information on stability and fold¬ 
ing can be obtained from circular dichroism, fluores¬ 
cence spectroscopy, and calorimetry. Depending on 
the presence of cofactors or prosthetic groups, UV- 
vis and EPR spectroscopy can also provide useful 
information on the active site of enzymes. Stopped 
flow spectrofluorimetry can also provide a detailed 
description of the enzyme kinetics. As a whole these 
investigations should confirm that the desired function 
has been achieved without significant perturbation of 
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the other properties. This task is not trivial because the 
effect of a mutation on the overall 3D scaffold of the 
protein is not always obvious. 

Figure 6 shows an example of random mutants 
obtained for a cytochrome P450, the catalysis of 
which is expanded from a fatty acid monooxygenase 
to a drug and/or polyaromatic hydrocarbon oxidase 
(Fig. 7, Tsotsou et al. 2012). 

Summary 

Proteins and enzymes are undoubtedly an attractive 
target for the development of biosensors and novel 
biocatalysts. Their dimensions are intrinsically in the 
nanoscale, they are formed by precise scaffolds folded 
to perform specihc and often coupled functions such as 
binding, stereospecific catalysis, pumping, and self¬ 
assembling. All these functions can be detected by an 
array of different biophysical methods that have made 
much progress allowing to reach the level of single 
molecule detection. This entry has shown how the field 
of protein engineering has refined many different strat¬ 
egies to specifically tune or even create de novo the 
biological element to perform a wide range of desired 
functions imposed by the experimentalist. 

Cross-References 

► Amperometric Biosensors 

► Alignment of Protein Sequences 

► Atomic Force Microscopy 

► Flomology Modeling of Protein Structures 

► Molecular Motors 

► Protein Structure Prediction and Structural 
Annotation of Proteomes 
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Synonyms 
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Dynamics 

Introduction 

Allosteric regulation is a physicochemical mechanism 
that proteins employ for efficient use of metabolites and 
signaling pathways in all organisms (Fersht 1999). Allo¬ 
stery is generally considered to be the phenomenon of 
the binding of a ligand at one site altering activity (e.g., 
binding, reactivity) at a nonoverlapping site. The text¬ 
book example of protein allostery is O 2 binding to 
hemoglogin, during which oxygen binding becomes 
progressively tighter as the four subunits are saturated. 
Two classical theories of allostery were proposed in the 
1960s (Fersht 1999). Both have proven to be successful 
at describing specific cases of allostery, and both 
define allostery as arising from the interactions between 
discrete structural subunits. In the Monod-Wyman- 
Changeux (MWC) model, the allosteric change in 
activity results from concerted conformational 
change of subunits. In the Koshland-Nemethy-Filmer 
(KNF) model, allostery results from sequential confor¬ 
mational change of subunits. Although the models can 
quantitatively reproduce binding curves and kinetics, 
they do not provide an explanation of the underlying 
mechanism of allosteric communication. A full under¬ 
standing of allostery will require not just structural 
principles, but also the role played by molecular 
dynamics. 

Since the 1960s, evidence has gradually accumulated 
to reveal the dynamic nature of proteins. This has accel¬ 
erated in the last 10-15 years due primarily to advances 
in computer power for ► molecular dynamics simula¬ 
tions, and in ► NMR methods for characterizing protein 
dynamics (► NMR Studies of Macromolecular Interac¬ 
tions - Introduction). This entry focuses primarily on 
experimental support that has led to recent physical 


insights into the dynamics that underlie allostery. Previ¬ 
ous reviews on this topic provide additional detail (Kern 
and Zuiderweg 2003; Swain and Gierasch 2006; Cui and 
Karplus 2008; Tsai et al. 2008). Computational 
approaches (see, for example, ► Computational Studies 
of Fnzyme Motions) to studying dynamics in allostery 
(Cui and Karplus 2008) are invaluable and will likely 
represent the best approach for pinpointing the crucial 
aspects of allosteric transitions. Most of the examples 
cited here are from monomeric allosteric proteins, as 
these have afforded high quality NMR data. Farly pro¬ 
gress is nevertheless being made on classical allosteric 
and multidomain proteins. 

Models of Dynamics-Based Allostery 

Two major, generalized models of allostery have 
emerged, one which employs conformational changes 
and one that does not. It should be noted that these two 
models, which are now described, are not mutually 
exclusive. 

Shifting the dynamic equilibrium of conformations. 
A key concept that advanced understanding of allo¬ 
stery is that proteins in solution exist in multiple 
conformations in dynamic equilibrium. This is often 
referred to as “pre-existing populations” of conforma¬ 
tions, which implies that this equilibrium exists prior to 
binding ligand and that ligand binding will select and 
stabilize a single conformation out of the conforma¬ 
tional ensemble. Most generally, however, multiple 
conformations can exist in either fl-ee or ligand- 
hound states. While at face value this seems no differ¬ 
ent than simply stating that proteins are dynamically 
fluctuating, the concept of pre-existing populations is 
frequently reduced such that the focus is on only a few, 
or two, significantly populated conformations (Fig. 1). 
This is because it is difficult to experimentally detect 
more than a single, discrete, alternate conformation; 
furthermore, the simplicity of two states fits well with 
the notion of a “switch.” Hence, the predominant 
working concept of a protein with a pre-existing set 
of populations is one which has a major and minor 
form. These concepts were first reviewed by Nussinov 
and coworkers (Kumar et al. 2000). 

Although discrete multiple states in dynamic equi¬ 
librium had been detected early on in various ► NMR 
studies, the first detection of this in the context of 
allostery came through the use of *^N transverse 
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Protein Dynamics and Allostery, Fig. 1 Population shift- 
based allostery. The conformational energy landscape {black 
lines) is shown for the apo {left) and singly modified {right) 
states of an allosteric protein with two binding sites. Spontane¬ 
ous conformational switching between macrostates A {gray 
oval) and B {gray rectangle) occur on the microsecond-to- 


millisecond timescale, according to the size of the activation 
barriers. The allosteric protein is depicted here as monomeric, 
but could equally well be shown as oligomeric. Upon binding the 
first ligand {green circle), the B conformation is stabilized (rel¬ 
ative to the A conformation), which is binding competent for 
a “square” ligand 


► relaxation methods {R 2 = I/T 2 ) on the monomeric 
receiver domain of the signaling protein NtrC 
(Volkman et al. 2001). In this study, Kern and col¬ 
leagues demonstrated that for differentially active 
forms of NtrC, the “inactive” (i.e., unphosphorylated) 
state of NtrC is actually a spontaneously 
interconverting mixture of inactive and active confor¬ 
mations, with the inactive conformation more highly 
populated. Upon phosphorylation at Asp-54, NtrC 
switches almost completely to the active conformation, 
redistributing the populations of the two states. Because 
the active conformation includes a structural 
rearrangement at the p4-a4-p5 interface that is distal 
from Asp-54 and binds a downstream effector, NtrC is 
allosteric. The key finding from this work was that 
phosphorylation does not merely switch NtrC from 
“off’ to “on,” but rather readjusts the ps-ms dynamic 
equilibrium between the two conformational 
macrostates. A crucial detail is that this was observed 
directly in a single sample using NMR relaxation. It 
should be mentioned that the original MWC model also 
allows for the population of multiple states and indeed 
predicts that a redistribution of populations (between 
T and R) should occur upon ligand/effector binding. 
Thus, the NtrC study placed the MWC model of allo¬ 
stery in a modern light and identified the timescale of 
switching between the two states as ps-ms. It showed 
for the first time how an allosteric unit can be inherently 
dynamic such that it switches “rapidly” between two 


conformations. It was also highly significant to observe 
allostery in a monomeric protein, which suggested that 
solution NMR might play an important role in revealing 
the inner workings of allostery. 

Conformational entropy (dynamics) as an active 
driver of allostery. A completely different conception 
for the role of dynamics in allostery was introduced by 
Cooper and Dryden (Cooper and Dryden 1984). This 
theoretical study represents a true departure from 
either MWC or KNF models, in that it dispenses with 
the idea of an overall conformational change. Essen¬ 
tially ignored for over a decade, this model has gained 
traction during the last 10 years. In the Cooper-Dryden 
(CD) model of allostery (Fig. 2), binding of the first 
ligand alters activity at a nonoverlapping (distal) site 
just as in classical allostery. However, binding of the 
second ligand is not modulated by structural changes, 
but rather, by a difference in the free energy via 
entropy. Because entropy and enthalpy both contribute 
to free energy, any source of entropy can modulate 
binding free energy. It has been shown that side- 
chain dynamics on the ps-ns timescale can be 
a significant source of conformational entropy (Lee 
et al. 2000), and this strategy is used by the protein 
calmodulin (Frederick et al. 2007). Thus, if the binding 
of both ligands individually changes the conforma¬ 
tional entropy of the protein (or biomolecular recep¬ 
tor), hut not necessarily the protein structure, then the 
binding of the first ligand can alter the binding affinity 
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Protein Dynamics and Allostery, Fig. 2 Cooper-Dryden 
(CD) type allostery, with no conformational change. Protein 
states with enhanced internal dynamics and conformational 
entropy are shown in orange outlined with squiggles. In contrast 
to Fig. 1, the average conformations of the allosteric sites do not 
undergo changes, (a) This is one possible scheme for positive 
allostery. An alternate scheme would show enhanced conforma¬ 
tional entropy/dynamics in the doubly liganded state, either in 
place of or in addition to the dynamic apo-state. (b) This is one 
possible scheme for negative allostery. As with panel A, other 
schemes are possible. Large, negative values of AGbind corre¬ 
spond to tight binding, whereas small, negative values corre¬ 
spond to weaker binding. In these models, the dynamics affect 
AGbind through changes in conformational entropy terms (AAconf. 
bind); Af/bind remains constant upon binding each ligand 

of the second, regardless of the location of the two 
binding sites. Allostery can therefore he achieved 
through the thermodynamics of dynamic modulation, 
in the absence of conformational change (Tsai et al. 
2008). This type of dynamically driven allostery 
has been confirmed recently through NMR relaxation 
studies (see below). The timescale of dynamics 


that are responsible for the allosteric effects in these 
systems are fast, on the picosecond to low-nanosecond 
timescale. 

Broadened definition of allostery. From these ideas, 
and supported by work described below, the definition 
of allostery has broadened in recent years. Allostery 
once applied only to symmetric, oligomeric enzymes, 
and the mechanism was through changes in 
quarternary structure. Today, use of allostery can 
apply to both monomeric proteins and non-enzymes, 
and it has even been suggested that all dynamic pro¬ 
teins are allosteric (Clarkson and Lee 2004; 
Gunasekaran et al. 2004). It is most generally and 
fundamentally tied to the phenomenon of long-range 
communication - which can in principle exist in any 
protein - since this is the true essence that must exist in 
any allosteric molecule. 

Examples of Dynamics in Allostery 

With the basic concepts for generalized allostery 
discussed above, some specific examples are now 
described. These are ► NMR focused, since NMR 
can characterize dynamic regions of biomolecules 
with site-specific resolution. 

Allostery via shifting the dynamic equilibrium of 
conformations. The first type of allostery discussed 
was that of the conformational population shift 
(Fig. 1), which is essentially the MWC model with 
the additional stipulation that the two states (e.g., 
T and R) are in rapid (i.e., dynamic) equilibrium. 
This was first demonstrated with NtrC (as described 
above) as inferred from the standard *^N relaxation 
approach using Rj, R 2 , and the heteronuclear ► NOE 
(Volkman et al. 2001). The picture has recently been 
updated by a “CPMG ► relaxation dispersion” NMR 
approach, which reinforces the earlier model but 
identifies the exact timescales of interconversion 
between inactive and active conformational states, 
as well as the equilibrium populations for different 
forms of NtrC, including the phosphorylated form 
(Gardino et al. 2009). From this work, the average 
time spent in the active and inactive conformations of 
the unphosphorylated, wild-type protein is ~100 ps 
and -500 ps, respectively, due to conformational 
switching (Gardino et al. 2009). A second example 
of dynamic population shifting is found in the Runt 
domain from the Runxl core binding (transcription) 
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factor (Yan et al. 2004). Runxl heterodimerizes with 
core binding factor (3 (CBFp), which allosterically 
increases Runxl’s affinity for DNA. The Runxl 
Runt domain samples two conformations in the 
absence of CBpp, one low affinity and the other 
high affinity, and the interconversion gives rise to an 
exchange-based contribution to transverse spin relax¬ 
ation, Rex- Binding of CBpp shifts the population 
toward the high affinity state to the extent that Rgx 
largely disappears. A third example can be found 
in the ATP-binding cassette (ABC) transporter pro¬ 
tein MJ1267 from Methanococcus jannaschii, 
in which Binding of ADP-Mg causes a redistribution 
of Rex at proximal and distal regions of this protein 
(Wang et al. 2004). 

Relaxation dispersion experiments are ideal for 
monitoring population shift allostery. The KIX domain 
of the CREB-binding protein, CBP, has two separate 
binding sites that bind the activation domain from the 
mixed lineage leukemia (MLL) protein and the activa¬ 
tion domain of c-Myb, and these sites are allosterically 
coupled. NMR CPMG relaxation dispersion experi¬ 
ments were carried out on the binary KIX-MLL com¬ 
plex (Bruschweiler et al. 2009). These data indicated 
that all residues undergo dynamic switching between 
conformational states at the same rate. The overall 
populations (93:7) and differences in chemical shifts 
between the two states for each residue were deter¬ 
mined. These “dynamic” chemical shift differences 
correlate well with the known “static” chemical shift 
differences (in KIX) between KIX-MLL and the ter¬ 
nary KIX-MLL-cMyb complex (r = 0.79). A strong 
correlation of this type is the most rigorous evidence 
for two-state exchange. Interestingly, when residues 
experiencing exchange were mapped on the structure 
of KIX, these sites formed a contiguous pathway 
linking the MLL and c-Myb binding sites. 

Cooper-Dryden (CD) allostery. At present, there 
are a few examples of CD allostery. The catabolite 
activator protein (CAP) is a dimeric prokaryotic tran¬ 
scription factor that consists of a DNA-binding 
domain, and a regulatory domain that binds cAMP. 
The two cAMP binding sites are distinct from one 
another and binding is negatively cooperative. Nega¬ 
tive cooperativity offers the advantage of studying 
both singly and fully bound species and this feature 
was exploited for NMR characterization of a dimeric 
construct containing the cAMP binding domain (CBD) 
upon successive binding of cAMP (Popovych et al. 


2006). The order parameters, (a measure of bond 
vector rigidity on the ps-ns timescale, see ► Kinetics: 
Relaxation Methods), of NH bond vectors were deter¬ 
mined for CBD 2 , CBD 2 -CAMP 1 , and CBD 2 -CAMP 2 . 
Binding of the first cAMP had little effect, but binding 
of the second caused widespread rigidification (and 
loss of conformational entropy). This pattern of con¬ 
formational entropy change upon successive cAMP 
binding is consistent with the overall entropy changes 
as observed from isothermal titration ► calorimetry. 
Key to this study was the determination that binding 
of the first cAMP is accompanied by no conforma¬ 
tional change near the second cAMP binding site. 
Thus, the thermodynamic basis for binding 
anticooperativity derives from the fast fluctuation 
magnitudes and how they change as a function of 
ligand binding. 

A second example of CD allostery was found in the 
context of a protein-protein interaction. The third PDZ 
domain (PDZ3) from postsynaptic density-95 (PSD- 
95) binds a peptide from the C-terminus of the CRIPT 
protein with a Kj of 20 pM. In PDZ3, a helix distal to 
the binding site was removed from the domain and the 
binding thermodynamics and methyl side-chain 
dynamics (order parameters, from relaxation) 
were characterized by ITC and NMR (Petit et al. 
2009). Removal of the helix results in a 20-fold 
increase in Kj, and interestingly, this was reflected 
primarily in the entropic component of the free energy 
change. NMR chemical shifts indicated no significant 
change in the PDZ core structure upon removal of the 
helix, which renders conformational change an 
unlikely source of allosterically weakened peptide 
binding. Instead, the ps-ns side-chain dynamics in the 
truncated PDZ3 were found to be significantly 
enhanced relative to the full-length protein. Thus, in 
this case the source of allosteric energy derives from 
a dramatic change in side-chain conformational 
entropy that is distributed throughout the PDZ domain. 
A third example of CD allostery was reported in the 
tryptophan RNA-binding attenuation protein (TRAP) 
(McElroy et al. 2002). 

The Ensemble View 

A variety of terminologies for describing protein struc¬ 
ture and dynamics is found in the literature. One term 
used by many is “the ensemble view.” This term is 
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broadly accepted as a generally improved way to think 
about protein structure, namely, as a dynamic ensem¬ 
ble of interconverting protein conformations. The idea 
of a dynamic, fluctuating protein in the context of CD 
allostery easily lends itself to be envisioned as 
a structural ensemble. Consider, for example, the 
order parameter for a bond vector, S^. The order 
parameter is frequently compared to angular move¬ 
ment within a cone, which can be diffusive and favor¬ 
ing no particular orientation within that cone. Thus, 
there is a high multiplicity of states with all, or at least 
many, orientations within the cone viewed as contrib¬ 
uting roughly equally to the ensemble. As it turns out, 
advocates of the population shift (or related conforma¬ 
tional selection) view of proteins have also adopted use 
of “the ensemble view.” In this context (Fig. 1), the 
meaning of ensemble, at least in practice, can be some¬ 
what different. Namely, it refers to the major wells that 
are in principle distinguishable experimentally 
(Fig. 1). Some of these discrete states may be seen 
under one set of solution conditions, and others under 
different conditions. These “macrostates” that exist in 
the energy landscape form the ensemble, but typically 
their populations are highly skewed with some barely 
populated at all. The idea is that the lowly populated 
states have some nonzero population, and they may 
become the major population given the right set of 
conditions. Importantly, one of those lowly populated 
states may represent the active conformation. This 
macrostate ensemble view is characterized by 
a relatively low multiplicity of states (Fig. 1), which 
reflects experimental limitations. In the CD view, 
interconversion between members of the ensemble 
occurs on the ps-ns timescale, and in the population 
shift/conformational selection view, interconversion 
seems to occur on the ps-ms timescale. Both ensemble 
views are reasonable, but the difference in timescales 
is large. As a result, “the ensemble view” is a versatile 
term, but one which can be imprecise if not properly 
defined. 

One noteworthy framework for understanding allo¬ 
stery has been developed by Hilser, Freire, and col¬ 
leagues (Liu et al. 2006). This theoretical construct is 
truly an ensemble view, and it embraces the major 
ideas of both population shift and CD models of allo¬ 
stery. Like population shift, it recognizes (and utilizes) 
discrete conformations and ascribes different weights 
to those states, and the overall “average” conformation 
can change upon a perturbation. The specific 


enumeration of states allows for tracking the redistri¬ 
bution of states upon perturbation (e.g., ligand bind¬ 
ing). Like the CD model, it samples a very large 
number of states (hundreds of thousands), many of 
which are nearly indistinguishable in terms of energy 
and conformation, and hence provides a basis for sim¬ 
ulating conformational entropy. However, this 
achievement comes at the cost of loss of information 
about time. It is a purely thermodynamic/statistical 
model in which no one state is linked to another in 
time (i.e., it is not a trajectory), and in fact there is no 
timescale range associated with the range of confor¬ 
mations, which spans the native-like states all the way 
to the fully unfolded state. This “ensemble model” has 
been used to dissect the thermodynamic basis for long- 
range, allosteric-like coupling in dihydrofolate reduc¬ 
tase (Pan et al. 2000; Liu et al. 2006) and other 
systems. 

Mechanisms of Allosteric Transmission: How 
Fast is the Speed of Propagation? 

The “timeless” nature of the ensemble model discussed 
above raises the question of what actually happens, 
temporally, during allosteric transitions. Historically, 
this has been difficult to measure, although some mea¬ 
surements have been made. The allosteric quaternary 
change in hemoglobin has most recently been mea¬ 
sured using wide-angle X-ray scattering to reveal two 
stages in the conformational change, the first occurring 
over a 2 ps period, followed by a second transition 
requiring 20 ps (Cammarata et al. 2010). Given previ¬ 
ous estimates of sub-microsecond transitions in this 
system (reviewed in Kern and Zuiderweg 2003), 
there may be a complex hierarchy of dynamics under¬ 
lying the overall structural change. For an allosteric 
system, one would really like to know how quickly the 
allosteric change happens in response to effector bind¬ 
ing. For a concerted model of allosteric conformational 
change, one might envision the allosteric response 
having two kinetic phases. In the first, the initial effects 
of ligand binding are sensed by the protein, with allo¬ 
steric forces being transduced over distances. In the 
second, the conformational change transpires, aided by 
the allosteric forces. This would likely occur over 
a longer time period since the protein must solve the 
problem of breaking and repacking stabilizing interac¬ 
tions. Alternatively, conformational change might 



Protein Dynamics and Allostery 


1993 


proceed in a more step-wise, mechanical fashion, 
much like dominoes falling. In the case of CD-type 
allostery, with no conformational change, allosteric 
transmission might occur quite fast since it would 
consist of the “spread” of forces (phase 1), followed 
by a re-equilibration of dynamic motions that are fast, 
given that they would be entropically relevant. Thus, it 
is feasible that CD-type allostery might be advanta¬ 
geous when a very fast response is needed. 

Related to the timing of allosteric events, there is 
debate as to whether allosteric signaling processes 
should have “pathways.” While pathways appeal to 
our macroscopic intuitions (and our sense of time), 
molecular processes are governed by energetics. The 
ensemble view of Hilser’s does not require pathways, 
for example (Liu et al. 2006), and perturbations to 
proteins do not always lead to observable contiguous 
responses (Clarkson et al. 2006). NMR evidence sug¬ 
gestive of pathway behavior has been reported for both 
the cases of population shift/conformational change 
(Bruschweiler et al. 2009) and CD-type allostery 
(Puentes et al. 2004) and the pathway mindset can be 
found throughout the literature. Overall, at present, 
there remains significant questions about the generality 
of pathways and their relevance to either population 
shift or CD-type allostery. 

Cross-References 

► Protein Conformational Dynamics by Relaxation 
Dispersion 

► Protein NMR - Introduction 

► Protein-Ligand Dynamics 

► Thermodynamics of Biomolecular Interactions 
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Synonyms 

Electron paramagnetic resonance spectroscopy; MTSL 
and MTSSF; Protein dynamics; Protein structure 

Definitions 

Electron paramagnetic resonance (EPR) spectroscopy 
is the study of absorption of microwave radiation by 
a paramagnetic species in the presence of an external 
magnetic field. 

Spin labels are paramagnetic molecular tags used in 
conjunction with EPR to study the structure and 
dynamics of biomacromolecules. 

Site-directed spin labeling (SDSL) is the process of 
appending an EPR-active label to a specific site in 
a protein or other biomacromolecule. 

Introduction 

In addition to knowing the structure of a protein, 
details regarding dynamics and conformational 
changes are often required for a full understanding of 
protein function. Site-directed spin labeling (SDSL) 
with ► electron paramagnetic resonance (EPR) is a 
sensitive and powerful technique, well suited for gath¬ 
ering such information. An advantage of SDSL EPR in 
studies of biomacromolecules is that the EPR signal 
originates from unpaired electrons, which are relatively 
uncommon to biological systems except paramagnetic 
metalloproteins or systems where transient radicals are 
generated. With no real size limitations, SDSL EPR can 
be used to study systems ranging from small, soluble 
peptides to ► membrane proteins to large protein com¬ 
plexes, circumventing the major limitation of standard 
solution ► NMR techniques. Other advantages include 


small (sub-nanomole) sample size requirement and 
exquisite sensitivity of the EPR line shape to local 
environment and dynamics of the labeled site. 

Site-Directed Spin Labeling 

SDSL was pioneered in 1965 when spin labels were 
first synthesized by H. M. McConnell and colleagues 
(Stone et al. 1965). Shortly afterward, protected 
nitroxide radicals were shown to be persistent in the 
presence of both water and oxygen (Hoffmann and 
Henderson 1961). Most often, SDSL is carried out by 
engineering a cysteine residue into the protein of inter¬ 
est via manipulation of the DNA through a technique 
called site-directed mutagenesis (Braman 2002), and 
subsequently expressing the recombinant protein and 
modifying the engineered cysteine residue with a spin 
label (Hubbell et al. 1998; Fanucci and Cafiso 2006). 
Site-directed mutagenesis was developed by Michael 
Smith in 1978 (Hutchison et al. 1978) as a technique 
which is used to manipulate DNA bases at a specific 
site in the sequence using synthetic oligonucleotide 
primers; Smith was later awarded a Nobel Prize for 
this contribution. An alternative to SDSL in this classic 
sense is to synthesize peptides or small proteins with an 
amino acid derivative that incorporates a spin label 
such as TO AC (Hanson et al. 1996). It was not until 
1989 that Wayne L. Hubbell and associates used site- 
directed mutagenesis to produce variants of bacterio- 
rhodopsin, which they then reacted with two different 
spin labels to produce a series of proteins with para¬ 
magnetic tags, suitable for studies by EPR (Altenbach 
et al. 1989). In this landmark study, Hubbell et al. were 
able to determine the environment of specific residues 
in the protein and draw conclusions about the degree of 
order and relative lifetime, bridging the way for future 
studies on protein topology, structure, and dynamics 
using site-directed spin labeling and EPR. 

A spin label is an EPR-active molecular probe 
containing both an unpaired electron, usually in the 
form of a nitroxide radical, and a reactive moiety for 
binding a protein or other biomacromolecule. The 
nitroxide radical is typically involved in a five- or 
six-member heterocyclic ring, such as a pyrrolidine 
or piperidine, with bulky methyl groups as protection 
against destabilizing collisions. Protein thiol groups, 
most often found in cysteine residues, react specifi¬ 
cally with the functional groups such as 
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Protein Dynamics and Conformational Change from 
Site-Directed Spin Labeling and Electron Paramagnetic 
Resonance Spectroscopy, Fig. 1 Structure of common spin 
labels before and after reaction with cysteine residue; (a) MTSL: 


(l-oxyl-2,2,5,5-tetramethyl-A3-pyrroline-3-methyl)methanethio- 
sulfonate; (b) lAP: 3-(2-iodoacetamido)-PROXYL; (c) MSL: 
4-maleimido-TEMPO; and (d) lASL: 4-(2-iodoacetamido)- 
TEMPO 


methanethiosulfonate, maleimide, and iodoacetamide 
to create a covalent bond at neutral pH, thus these serve 
as common reactive moieties. The unpaired electron 
and the reactive moiety are usually separated by 
a flexible linker of four or five bonds. Figure 1 illus¬ 
trates the structures of four common nitroxide spin 
labels and the structures of the reporter-site cysteine 
residues after modification with each respective spin 
label. The choice of labeling sites is an important con¬ 
sideration that requires careful planning. Of primary 
concern is the preservation of structure and function of 
the biomacromolecule. For this reason, it is imperative 
to choose a site that easily tolerates an amino acid 
substitution. Most spin labels have a small molar vol¬ 
ume similar to that of a phenylalanine, and for this 
reason spin-label substitution is generally minimally 
structurally perturbing. In addition, spin labels have 
only weak hydrogen-bonding capacity, which is very 
important in order to study the dynamics of the label and 
to determine steric constraints imposed by the label’s 
environment. If, on the other hand, the motion of the 
label was limited by hydrogen bonding, little could be 
concluded. For more information on site-directed spin 
labeling, readers are referred to several excellent review 
articles on the topic (Hubbell et al. 1998; Columbus and 
Hubbell 2002; Fanucci and Cafiso 2006). 

Electron Paramagnetic Resonance 

EPR spectroscopy is the study of absorption of micro- 
wave radiation by a paramagnetic species in the pres¬ 
ence of an external magnetic field. This entry will offer 
only a very brief overview. For a detailed discussion on 
the background and theory of EPR, readers are referred 


to that entry in this series, or to comprehensive texts on 
the topic (Weil and Bolton 2007; Brustolon and 
Giamello 2009). The unpaired electron, found most 
often in a free radical or paramagnetic species, experi¬ 
ences a net dipole. This dipole is due to a magnetic 
moment that arises from spin angular momentum of 
quantum number m = ± i^. In the simplest case of the 
free electron in solution, the electron’s magnetic 
moment is degenerate, meaning that the energies of 
the two spin states, ms = ± Yi, are equal. When 
a magnetic field is applied, the electron can align itself 
either parallel or antiparallel to the field. This phenom¬ 
enon is referred to as the Zeeman effect, and is given in 
the Zeeman equation, 

AE = hv = (1) 

where g is the spectroscopic g-factor, is the Bohr 
magneton, and B is the strength of the applied magnetic 
field. Resonance occurs when the applied energy is 
equal to the difference in energy levels Ea and Ep. 
When EPR is performed in conjunction with SDSL, 
the free electron (ms = ± Vi) on the nitroxide spin label 
couples with the nuclear spin from nitrogen (mj =1) 
via the hyperfine interaction. As such, both levels 
and Ep split into three hyperfine energy levels, provid¬ 
ing the system three allowed energy transitions (Weil 
and Bolton 2007). 

Nitroxide Spectral Line Shapes and 
Correlation Time 

EPR nitroxide spectral line shapes are highly sensitive 
to local secondary structural elements, as well as local 
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Magnetic Field (Gauss) 

Protein Dynamics and Conformational Change from 
Site-Directed Spin Labeling and Electron Paramagnetic 
Resonance Spectroscopy, Fig. 2 Dependence of EPR 
nitroxide spectral line shape on motion 


dynamics and conformational changes in proteins, and 
thus change dramatically with changes in correlation 
time. For sites undergoing fast isotropic motion with 
a small rotational correlation time, the features of the 
line shape will be very sharp and narrow. As motion 
becomes more restricted or anisotropic and correlation 
time increases, the averaging of the anisotropy in the A 
and g tensors will be incomplete and the spectrum will 
be broadened, as shown in Fig. 2, where the extent of 
the broadening correlates with the extent of restriction 
in the motion. 


Correlation time is defined by the convolution of 
local mobility, and consists of Tr, Tr, and T;. The 
rotational correlation time (tr) is the rate at which 
the protein is tumbling in solution, and can be modu¬ 
lated by the local tumbling volume of the protein, as 
well as experimental factors such as temperature, vis¬ 
cosity, and the presence of solutes. Evaluating changes 
in Tr is useful for studying the binding of biomolecules 
such as in the case of protein complexes, membrane¬ 
binding proteins, or RNA/protein complexes; how¬ 
ever, nitroxide spectral line shapes that are dominated 
by Tr do not contain site-specific information about 
protein dynamics. The majority of dynamics informa¬ 
tion derivable from the CW EPR spectrum comes from 
spectral line shapes dominated by Ti and Tr. The inter¬ 
nal correlation time (t,) is driven by torsional oscilla¬ 
tions about the spin label, and can be determined to 
a certain extent by repeating experiments with 
nitroxide spin labels of varying characteristics. Einally, 
Tr is driven by local dynamics and backbone fluctua¬ 
tion, and can be modulated by changes in secondary 
and tertiary structure, as well as conformational 
changes within the protein (McFIaourab et al. 1996; 
Columbus and Hubbell 2002). 

Extracting Dynamics Information from the 
CW EPR Spectrum 

The nitroxide label, once incorporated into the macro¬ 
molecule, can be characterized according to two fun¬ 
damental aspects, namely the solvent accessibility and 
the motion of the spin-labeled side chain, the latter of 
which is dominated by tertiary interactions and back¬ 
bone dynamics. In an early work, Earahbakhsh and 
colleagues showed that side-chain dynamics of the 
incorporated spin label yield specific information 
about conformational changes on the millisecond time- 
scale (Farahbakhsh et al. 1993). Years later, 
McHarourab and colleagues carried out a thorough 
investigation on side-chain dynamics in T4 lysozyme 
using 30 single cysteine substituted variants 
(McHaourab et al. 1996). These works laid the ground¬ 
work for SDSL EPR as a method to study a protein in 
terms of its tertiary fold, equilibrium dynamics, and 
time-dependent conformational changes. 

EPR spectral line shape analysis can provide semi- 
empirical measures of structural and dynamic aspects 
of a system. Such a characterization generally involves 
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Protein Dynamics and Conformational Change from Site-Directed Spin Labeling and Electron Paramagnetic Resonance 
Spectroscopy, Fig. 3 Graphical illustration of parameters for quantifying the breadth of a CW EPR line shape 


measuring the ratio between the intensity of the center 
field transition to the low field transition (H( 0 )/H(+i)) 
(Belle et al. 2008), the peak-to-peak hrst derivative 
distance AHpp, which is a term driven by the degree 
of averaging of the anisotropic g tensor, and the spec¬ 
tral second moment < //^ > (2), a term driven by the 
degree of averaging of the anisotropic hyperfine tensor 
(Columbus and Hubbell 2002). AHpp can be normal¬ 
ized to give the scaled mobility (Ms) as shown in (3), 
where is the most mobile AHpp reported in the 
literature and dj is the immobile. The < > is pro¬ 

portional to the sum of distance squared between every 
point and the center of mass of the spectrum times the 
intensity at that point. These parameters are graphi¬ 
cally illustrated in Fig. 3. The second moment and 
spectral line width are functions of rate and order and 
may thus be used as semiempirical measures of mobil¬ 
ity. As nitroxide motion is reduced, the corresponding 
second moment and spectral line width increase (Eaton 
et al. 2010). 


(3) 

1=1 

McHarourab and colleagues used T4 lysozyme, 
a protein of well-known structure, labeled with 
MTSL at 30 different individual positions, to create 
a plot of the reciprocal second moment vs. the recip¬ 
rocal central line width for spectra representing helix 
surface, loop, tertiary interaction sites^ and buried 
sites. Points on this graph are clustered together by 
mobility, where surface-exposed loop sites are found 
on the high end of mobility, while buried sites are 
motionally restricted. The low mobility of the spin- 


label side chain at buried sites in the protein is under¬ 
stood in terms of steric interactions of the nitroxide 
within the core of the protein. The high level of 
isotropic mobility at solvent-exposed sites can be 
explained in terms of backbone dynamics (McHaourab 
et al. 1996). 

Basic Principles of Pulsed EPR 

The basic concepts of pulsed EPR are very similar to 
those of pulsed nuclear magnetic resonance (NMR), 
with only minor perturbations. Pulsed EPR was first 
described by W. B. Mims of Bell Laboratories in the 
1960s, when a modulation of the echo amplitude in 
a Ce^’^-doped CaW 04 crystal was observed for the 
first time. It was not until the 1980s that pulsed EPR 
started to undergo rapid development, and in 1982 
that X-band pulsed EPR spectrometers became 
commercially available. Pulsed EPR has since gained 
popularity as a valuable tool in a wide variety of 
applications in the fields of chemistry and physics, 
including the study of protein conformations and 
dynamics, and one such technique is called double 
electron-electron resonance (DEER) also referred to 
as pulsed electron double resonance (PELDOR). 
DEER experiments generally utilize two different 
microwave frequencies in order to extract information 
regarding the distance between two paramagnets, often 
two spin labels, by measuring the strength of the dipo¬ 
lar coupling. Data is collected in the form of 
a modulating echo intensity, which is then converted, 
generally via a process called Tikhonov regularization 
(Tikhonov 1943), into a distance profile. For a more in- 
depth discussion regarding the theory and principles of 
pulsed EPR, the reader is referred to a number of 
excellent sources (Berliner et al. 2000; Schweiger 
and Jeschke 2001). 

















p 


1998 


Protein Dynamics and Conformational Change 




t (|ts) Distance (A) 

Protein Dynamics and Conformational Change from Site-Directed Spin Labeling and Electron Paramagnetic Resonance 
Spectroscopy, Fig. 4 Examples of (a) dipolar-modulated echo curves with (b) corresponding distance profiles 


Monitoring Dynamic Shifts in the 
Conformational Ensemble 

The distance profile obtained provides information 
about the most probable distance(s), average distance 
(s), and the full-width at half-maximum (FWHM) 
intensity, where the most probable distance is deter¬ 
mined by the greatest P(r) intensity. The FWHM is the 
breadth at half of the maximum intensity and is indic¬ 
ative of relative flexibility of both the spin label and the 
protein. Two examples of dipolar-modulated echo 
curves and resulting distance profiles are illustrated in 
Fig. 4. The echo curve in black corresponds to a dis¬ 
tance profile with a greater FWHM, most probable 
distance, and average distance than does the echo 
curve in gray. This would tend to indicate that the 
distance between the two intra-protein spin labels in 
the “gray” sample are, on average, closer together and 
held in a more rigid dynamic position than the spin 
labels in the “black” sample. A more quantitatively 
descriptive method of analysis, called the Gaussian 
reconstruction process, can be performed in order to 
define the conformational ensemble of the system. For 
a more detailed description of this process, the reader is 
referred to recent publications (Blackburn et al. 2009; 
Kear et al. 2009). The conformational ensemble is the 
sum of each of the individual and distinct structural 
populations defined by the most probable distance 
between the paramagnets, the FWHM, and the relative 
percent of the total population for each individual 
Gaussian-shaped population. 

SDSL and DEER have been utilized to characterize 
flap dynamics and conformational change of the 
enzyme ► HIV-1 protease. Steve Kent and coworkers 
generated synthetic active and inactive HIV-1 protease 
constructs with MTSL labels at positions 55 and 55' on 


the flaps, where the distance between the labels is 
estimated to range between 25 and 42 A depending 
on the position of the flaps. Kent reported on interflap 
distances in the presence of three peptidomimetic 
inhibitors, each one representing a different stage of 
the enzyme-catalyzed peptide bond hydrolysis reac¬ 
tion. The results demonstrated that the flaps adopted 
different structural properties throughout the course of 
the catalytic reaction (Torbeev et al. 2008). Our group 
then compared distance distribution profiles of inactive 
HlV-1 protease in the apo form with those in the 
presence of nine separate EDA-approved protease 
inhibitors and a substrate mimic (Blackburn et al. 
2009). With work demonstrated that with sufficiently 
high signal to noise ratio (SNR) in the dipolar- 
modulated echo curve, information about the relative 
population distributions and ligand-induced conforma¬ 
tional shifts of the major flap conformations can be 
extracted to provide information regarding the energy 
landscape and conformational sampling of a system. 

Summary 

Understanding protein dynamics is a key step to under¬ 
standing protein function. Site-directed spin labeling 
with EPR offers an extremely sensitive methodology 
to uncovering those dynamics. Because the EPR signal 
comes solely from the rare unpaired electron, EPR is 
unsurpassed in sensitivity and site specificity. 
A relatively simple method with no size limitations, 
SDSL EPR can be used to study many types of sys¬ 
tems, including membrane proteins and large macro- 
molecular complexes. Changes in structure and 
backbone dynamics in the millisecond range can be 
easily monitored in real time, (Steinhoff et al. 1994) 
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making EPR an outstanding choice for uncovering the 
dynamic qualities of a protein. 


Cross-References 

► Chemistry of Spin Labeling 

► DEER of Metalloproteins 

► EPR Spectroscopy: General Principles 

► HIV Protease - Computational Studies 

► Interspin Distance Determination by EPR 

► Membrane Proteins: Structure and Organization 

► Protein NMR - Introduction 

► Pulsed EPR: Principles and Examples of 
Applications to Hemeproteins 

► Simulation of Spin-Label EPR Spectra 
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Definition 

The influence of protein dynamics on enzyme catalysis 
is a hotly debated topic. 


Basic Characteristics 

The origin of the remarkable rate enhancement of 
reactions achieved by enzymes is still actively debated. 
Although the predominant explanation is based on the 
specific stabilization of transition states of rate- 
limiting reaction steps by the (pre-organized) enzyme 
environment, other proposals have been put forward. 
One of these is the involvement of protein dynamics 
(reviewed in McGeagh et al. 201 1). When considering 
proposals to explain enzyme catalysis, it is important 
first to define them rigorously in physical terms, and 
subsequently design and perform experiments and/or 
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simulations that can test them. Computational molec¬ 
ular simulation is particularly useful in this respect, as 
it offers the possibility to strictly separate various 
effects in ways that are extremely difficult or impossi¬ 
ble to analyze clearly experimentally. 

Conformational fluctuations arising from protein 
dynamics are important for many aspects of the func¬ 
tioning of proteins and enzymes. For example, confor¬ 
mational changes are often crucial for enzyme 
turnover (e.g., allowing substrate binding and product 
release). In certain cases, protein dynamics may 
help the formation of rare enzyme-substrate conforma¬ 
tions that react with high efficiency (Lodola et al. 2007 ; 
Ma and Nussinov 2010); such minor conformations 
may form only a very small proportion of the total 
population of enzyme-substrate complexes, but may 
be primarily responsible for reaction. The possible 
direct involvement of protein dynamics in the rate 
enhancement of chemical steps, however (e.g., as so- 
called promoting vibrations), is distinct and particu¬ 
larly controversial. Computer simulations, typically 
using QM/MM or empirical valence bond methods, 
provide evidence that dynamical effects do generally 
not contribute significantly to rate enhancement of 
chemical steps (Olsson et al. 2006; Kamerlin and 
Warshel 2010; McGeagh et al. 201 1). Yet, “promoting 
vibrations” are still sometimes considered as a possible 
explanation for the temperature dependence of large 
kinetic-isotope effects related to tunneling in enzy¬ 
matic H-transfer reactions (e.g., Basran et al. 1999). 
However, kinetic modeling studies indicate that the 
existence of two (or more) alternative reactive 
enzyme-substrate conformations can account for the 
observed temperature dependence of kinetic isotope 
effects (Glowacki et al. 2012). QM/MM studies 
have indicated that such alternative enzyme-substrate 
conformations can indeed exist, for example in 
methylamine dehydrogenase (MADH) (Ranaghan 
et al. 2007). 

Cross-References 

► Empirical Valence Bond Methods 

► Methylamine Dehydrogenase (MADH): 

Studies with Variational Transition State Theory 

(VTST) 

► QM/MM Methods 

► Quantum Effects/Tunneling 
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Synonyms 

Protein structure 


Definition 

Studying protein dynamics with IR spectroscopy means 
that protein molecular motion is followed over time by 
probing molecular vibrations with infrared light. 

Introduction 

The purpose of this entry is to briefly describe the major 
approaches of using one-dimensional infrared (ID IR) 
and two-dimensional infrared (2D IR) spectroscopy for 
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studying protein equilibrium and non-equilibrium 
dynamics. Dynamics are an essential component in the 
functioning and assembly of many proteins, but protein 
motion can be extremely difficult to measure with high 
structural accuracy. NMR spectroscopy may be the most 
general technique for measuring site-specific dynamics, 
because the dynamics of individual atoms can be mon¬ 
itored on timescales down to milliseconds or shorter. 
However, the time-resolution of NMR spectroscopy is 
insufficient to resolve the initial steps in protein folding. 
Moreover, it is difficult to study membrane associated or 
aggregated protein dynamics on any timescale with high 
resolution NMR. In situations like these, researchers 
often turn to fluorescence, circular dichroism, FRET, 
EPR, or other spectroscopies that have various combi¬ 
nations of structure and time-resolution. 

As compared to these techniques, infrared spectros¬ 
copy is unique in that it can provide either bond-specific 
or secondary structure information on timescales ranging 
from picoseconds to days. Additionally, it can be applied 
to a wide range of samples and environments. As a result 
of these benefits, infrared spectroscopy has been used for 
decades to monitor protein structural dynamics. How¬ 
ever, traditional infrared spectroscopy can often only be 
interpreted qualitatively, because the lineshapes and 
vibrational couplings that determine the spectra cannot 
be rigorously determined. But with the invention of 2D 
IR spectroscopy, these quantities can now be indepen¬ 
dently measured by 2D lineshape analysis and cross peak 
patterns. There have been advances in methods for cal¬ 
culating spectra from protein stractures and molecular 
dynamics simulations as well (Kwac and Cho 2003; Lin 
et al. 2009). Together, 2D IR spectroscopy and predictive 
simulations are ushering in a new era of quantitative 
protein infrared spectroscopy. 

The methods for studying dynamics using 2D IR 
spectroscopy can be roughly broken into three classes 
according to the experimental techniques by which the 
spectra are collected. The three classes of experiments 
are summarized in Table 1. Each class is described 
below and examples are provided to illustrate the use 
of each technique. The examples were purposely cho¬ 
sen to demonstrate how isotope labeling has been used 
to spectroscopically isolate individual amino acids in 
peptides. Infrared spectroscopy can be used to monitor 
the secondary structure of proteins through the amide 
I band, which has characteristic peak patterns for 
Qc-helices, P-sheets, and other secondary structures. 
These peak patterns are inherently complicated 


Protein Dynamics Studied with ID and 2D Infrared Spec¬ 
troscopy, Table 1 Methods of 2D IR spectroscopy for protein 
dynamics studies separated into three classes. Time-resolution 
and structural changes monitored are dependent on experimental 
method 


Class of 
method 

Time-resolution 

Possible structural 
changes 

Equilibrium 

fluctuations 

• 0.1-5 ps 

• Equilibrium solvent 
dynamics 



• Intrinsic structural 
heterogeneity 

• Chemical exchange 

Triggered 

methods 

• pH jump: 0.1-10 ns“ 

• T-jump: 0.1-10 ns“ 

• Fast folding/unfolding 


• Photoswitch: 2 ps 


Rapid-scan 

• Fast mixing: 2 ms 

• Protein aggregation 


“Depending on laser source 


because they are usually not well-resolved and struc¬ 
tural disorder spreads their intensities over eigenstates 
that span a wide range of frequencies (Hamm and 
Zanni 2011). Empirical rules and computer simula¬ 
tions can help in their interpretation, but by using 
a isotope label, for example, a single 

amide I mode is shifted about 70 cm“* so that it 
appears in a spectral window between the = '“O 
amide I and II bands (Torres et al. 2001). Using this 
label, one can then measure local structural fluctua¬ 
tions through lineshape analysis (Kim et al. 2009; 
Woys et al. 2010) or probe its local structure using 
cross peaks (Strasfeld et al. 2009). Carbon-13 labeling 
alone also works (Decatur 2006), but the frequency 
shift is not as large so it is not as certain that it can be 
interpreted as an independent chromophore that is 
isolated from the couplings of the unlabeled residues. 
For this entry, the focus is on isotopically labeling the 
amide I band, but another approach is to incorporate 
small infrared probes into proteins such as nitriles 
(Getahun et al. 2003; Kim et al. 2008; Krummel and 
Zanni 2008; Hill et al. 2009). Regardless, the more 
specific the structural analysis, the clearer the interpre¬ 
tation and the broader the appeal to the biophysics and 
structural biology communities. 

Equilibrium Fluctuations 

The first experimental class measures the natural fre¬ 
quency fluctuations of the protein using 2D lineshape 
analysis. These experiments are most commonly 











p 


2002 


Protein Dynamics Studied with ID and 2D Infrared Spectroscopy 


Protein Dynamics Studied 
with 1D and 2D Infrared 
Spectroscopy, 

Fig. 1 Ovispirin in lipid 
bilayer and 2D IR spectra of 
13c _ '*0 labeled ovispirin. 
(a) Schematic drawing of 
ovispirin with a-helical fold 
bound to a lipid bilayer. 
Ovispirin is shown in purple, 
lipid hydrocarbon chains in 
gray, lipid headgroups in 
orange, and water molecules 
in cyan, (b) Region around 
rounded peaks that correspond 
to isotope labeled He-10 (from 
Ile-10(‘'’C = ‘**0) Ovispirin 
spectrum), (c) Region around 
elongated peaks that 
correspond to isotope labeled 
Lys-15 (from Lys- 
15('^C = ‘^O) Ovispirin 
spectrum) (From Woys et al. 
2010 with permission) 



performed on equilibrated proteins, although they 
could be applied to kinetically evolving systems as 
well. 2D IR lineshapes provide information on the 
frequency fluctuations of the vibrational mode. The 
frequency fluctuations are sensitive to the structural 
heterogeneity of the surrounding environment, mostly 
due to electrostatic forces (Mukherjee et al. 2006; Lin 
et al. 2009). Thus, heterogeneity of hydrogen bonding, 
solvation, ions, and peptide stracture all contribute to 
the lineshape. When structural inhomogeneity is very 
small or electrostatics are weak, then one measures a 
round lineshape. When inhomogeneity is large or elec¬ 
trostatics are strong, then one gets a peak that is elon¬ 
gated along the diagonal. The anti-diagonal width of 
a 2D IR peak provides the homogeneous lifetime, and 
the diagonal width gives the total linewidth. Thus, the 
peak elongation is a measure of the inhomogeneity of 
the system (Hamm and Zanni 2011). 

As an example of how 2D lineshapes give information 
on structure and dynamics, results from a study on the 
antibiotic peptide ovispirin measured when it is bound to 
a lipid bilayer are presented (Woys et al. 2010). Ovispirin 


is amphipathic which allows it to form an a-helix when 
bound to a bilayer since in this conformation its hydro- 
phobic residues all point into the bilayer interior and its 
hydrophilic residues point outward into the headgroup 
and water regions (see Fig. la). The 2D IR spectrum of 
a hydrophobic residue such as He-10 exhibits peaks that 
are quite round, as shown in Fig. lb. (In 2D IR spectra, 
peaks always appear in pairs. For amide I modes, the two 
peaks partially overlap, which must be taken into account 
when quantitatively interpreted.) Thus, Ile-10 is largely 
homogeneous. In contrast, Lys-15 is hydrophilic and 
exhibits peaks that are extremely elongated along the 
diagonal, as shown in Fig. Ic. Thus, it resides in a very 
heterogeneous environment. If one plots the diagonal 
elongation for all 15 of the residues that were measured, 
a cyclical pattern is observed with a periodicity of 
3.6 amino acids, which confirms that the peptide is 
indeed largely a-helical. A more thorough analysis and 
comparison to spectra calculated from molecular 
dynamics simulations led to the conclusion that the 
peptide lies just under the headgroup region in the 
bilayer and is kinked at residues 13-14. 
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What dynamical timescale does 2D lineshape anal¬ 
ysis provide information about? The timescale is set by 
the vibrational lifetime of the system. For the amide I 
mode, that is about 2 ps. For non-natural chromophores 
like nitriles, it is about 5 ps. During this time, only very 
small structural changes occur. The only truly dynamic 
species on a 1-5 ps timescale are water molecules and 
ions (a hydrogen bond in water has a lifetime of about 
1.5 ps) (Mukherjee et al. 2006). Thus, infrared spectros¬ 
copy provides a nearly instantaneous snapshot of the 
structural distribution. The time-resolution of infrared 
spectroscopy is advantageous for simulations, because 
one does not need to run long molecular dynamics 
calculations. Instead, one can generate an ensemble of 
molecules and then average their spectra together, 
whereas the transitions between conformations must 
be captured in order to properly simulate other tech¬ 
niques with slower time-resolution. 

Triggered Methods 

The intrinsic picosecond time-resolution of 2D IR 
spectroscopy can be obtained for non-equilibrium or 
kinetically evolving systems as well (Bredenbeck et al. 
2005; Chung et al. 2007). In these experiments, one 
quickly triggers a structural change and collects an 
infrared spectrum or one data point for a 2D IR spec¬ 
trum at some delay time later. The experiment is auto¬ 
mated so that it can be repeated many thousands of 
times to signal average and/or collect the full 2D IR 
spectrum. Once collected, the spectrum provides 
a snapshot of the ensemble at that particular delay 
time, and a series of spectra with varying delay times 
monitors the structural evolution. Many methods have 
been developed to trigger the structural change, includ¬ 
ing pH jumps, temperature jumps (T-jumps) (Kubelka 
2009), bond breakage, and photoswitches. Since the 
trigger method for proteins usually takes tens of pico¬ 
seconds to tens of nanoseconds, the time-resolution is 
usually set by the trigger rather than the intrinsic time- 
resolution of the spectroscopy. 

Two examples are provided to exemplify this sec¬ 
ond class of experiments. Both utilize transient IR 
spectroscopy and isotope labeling to study protein 
dynamics. The first study, which is summarized in 
Fig. 2, uses a T-jump to study the folding and unfolding 
kinetics of a single residue in the 36 residue villin 
headpiece subdomain (HP36) (Brewer et al. 2007). 


HP36 is a single domain protein consisting of a three- 
helix bundle (see Fig. 2a). Due to its small size and fast 
folding rate, it has become a model system for studying 
protein structure, stability, and kinetics. A difference 
infrared spectrum of HP36 with a label at 

residue Ala-57 is shown in Fig. 2b, which is produced 
by subtracting the spectrum of the labeled peptide from 
the spectrum of the unlabeled peptide. A well-resolved 
labeled peak appears at 1,572 cm” *, which is typical of 
13c = labels in the a-helices of soluble peptides. In 
transmembrane peptides, frequencies of 1,594 cm”* 
are more common, due to the displacement of water. 
Using this label as a structural maker, relaxation kinet¬ 
ics were measured following a T-jump of 4°C initiated 
with a 10 ns laser pulse from 10”* to 10”^ s (Fig. 2c). 
Biphasic relaxation kinetics were observed with 
a microsecond component that was assigned to the 
global folding/unfolding of the protein and a faster 
component due to relaxation in either the folded or 
unfolded free energy basin or both. The similarity 
between the kinetics of the labeled and unlabeled res¬ 
idues, led the authors to conclude that HP36 has one 
dominant activation barrier along the folding pathway. 

An example in which different residues exhibit 
different kinetics is the folding of an a-helical peptide 
probed at six different residues with *^C = **0 labeling 
(Ihalainen et al. 2008). In this case, the peptide folding 
is initiated with a femtosecond laser pulse that acts on 
a photoswitchable cross linker that tethers the two ends 
to each other (see Fig. 3a). The linker also adds an 
additional structural constraint similar to a disulphide 
bond. The kinetics are monitored from about 10 ps to 
40 ms (see Fig. 3b). When folding is carried out at low 
temperatures (6°C), the folding times from site to site 
differ by a factor of eight, whereas at higher tempera¬ 
tures (45°C) the range is much smaller. Without the 
cross linker, folding is faster even at low temperatures. 
From analysis of the data and comparison to molecular 
dynamics simulations, the authors conclude that the 
data is consistent with a folding mechanism in which 
the peptide is structurally constrained by the linker so 
that the rate-limiting step is escape from heteroge¬ 
neous traps. Such a mechanism is in contrast to the 
folding of an unconstrained a-helix, which is thought 
to fold through a nucleation mechanism in which the 
formation of one turn nucleates the rest of the helix. 

The two examples above describe systems that can be 
averaged many times because the system is replenish- 
able (e.g., the photo-cross linker can be switched back) 
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Protein Dynamics Studied with 1D and 2D Infrared Spec¬ 
troscopy, Fig. 2 Isotope labeled HP36 with difference IR 
spectrum and T-jump relaxation kinetics, (a) Structure of the 
villin headpiece subdomain (HP36) highlighting the alanine 
residue containing the amide = '*0 isotopic label, (b) The 
difference IR spectrum produced by subtracting the absorbance 
spectrum of Ala-57(*^C = **0) HP36 from the absorbance 
spectrum of wild-type HP36. The negatively valued feature 
reports the = ’*0 amide I mode frequency of Ala-57 


(1,572 cm^*). The positively valued feature gives the 
12c = *®0 amide I mode frequency of Ala-57 (1,647 cm^'). 
(c) Temperature-jump relaxation kinetics for Ala-57(*^C = **0) 
HP36. Kinetics monitored at 1,573 cm^* (short dashed) and 
1,644 cm^* (long dashed) correspond to the labeled residue 
and unlabeled residues, respectively. Solid curves show 
biexponential fits to the kinetic traces (Courtesy of Brian Dyer, 
Emory University with permission) 


and the trigger is very reproducible. These criteria do not 
apply for all systems. Once triggered, not all systems can 
be converted back to their original state, such as when 
photocleavage is used as a trigger. In this case, large 
quantities of sample are needed. Moreover, triggering 
reproducibly can be difficult, such as in nucleation 
events like protein aggregation. Reproducibility is 
necessary in this class of methods so that spectra are 
properly averaged. Of course the effect of synthetic 
incorporation of triggers must also be considered. 
But, with this method of data collection, it is possible 
to retain the intrinsic femtosecond time-resolution of ID 
and 2D IR spectroscopies over microseconds or longer 
timescales. 


Rapid-Scan 

Another way of studying protein kinetics is to collect 
2D IR spectra on-the-fly (Strasfeld et al. 2008; Shim 
et al. 2009). That is, to measure all of the necessary 
data from a single initiation event. This third class of 
data collection can be applied to irreversible systems, 
such as those initiated by chemical denaturation. For 
many years, rapid-scan IR spectroscopy has been used 
in this manner to study protein kinetics, but it has only 
been in the past few years that the method of collecting 
2D IR data has been implemented in this fashion. Since 
the time-resolution is set by how long one needs to 
average, the quicker a spectrum is collected, the better. 
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with 1D and 2D Infrared 
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Fig. 3 Photoswitchable 
peptide and site-specific 
folding signal, (a) Schematic 
drawing of a photoswitchable 
peptide in its unfolded and 
folded state. A cross linker’s 
cis conformation destabilizes 
the helix, while the trans 
conformation stabilizes it. 

(b) One (of six) site-specific 
folding signals measured after 
initiation with a femtosecond 
laser pulse. The signal {open 
circles) is for a '^C = 
labeled version of the peptide 
at the ninth residue from the 
N-terminus (L9). A single¬ 
exponential fit of the data is 
shown as a solid line (Courtesy 
of Peter Hamm, University of 
Zurich with permission) 




A few years ago, a mid-infrared pulse shaper was 
implemented for the hrst time into a 2D IR experimen¬ 
tal setup that allowed the delays in the 2D IR pulse 
train to be incremented from one laser shot to the next 
with no lost time (Shim et al. 2007; Shim and Zanni 
2009). As a result, an entire 2D data set was collected 
in < 1 s, whereas it previously took many minutes to 
collect a spectrum. When used in a rapid-scan mode, 
the protein conformation change is initiated and then 
2D IR spectra are collected at 1 s intervals over the 
course of the kinetics. At the end of the scan, a running 
average is performed to acquire the necessary signal- 
to-noise. Commonly, a 1-5 min window is used for the 
running average on 1 mM protein samples with a single 
isotope label. This class of data collection is unique 
from the others because the time-resolution of the 
experiment is dictated by the speed of data collection. 
The replenishable techniques described above can be 
averaged for many hours since the system is reversible 
on a shot-to-shot basis. This means that the absolute 
data collection time does not ultimately limit the time- 
resolution. In the rapid-scan mode, the signal-to-noise, 
method of data collection and repetition rate of the 
laser are intricately linked. Mid-IR laser sources that 
work at many tens of kilohertz are starting to be devel¬ 
oped, so the time-resolution of this approach is bound 
to improve. 


As an example of this third class of methodology, 
rapid-scan 2D IR spectra collected for the aggregation 
of the human islet amyloid polypeptide (hlAPP or 
amylin) that may be the cause of type 2 diabetes 
(Shim et al. 2009) is shown. hlAPP aggregates into 
parallel p-sheet fibers, the mechanism of which is 
currently attracting much attention because it may be 
an intermediate in the aggregation pathway that is the 
cytotoxic species. It can take between minutes to many 
hours for aggregation to occur, depending on the con¬ 
centration and conditions. It is not easily amenable to 
triggerable infrared methods, because once the insolu¬ 
ble fibrils are formed, they are difficult to dissociate 
back into the monomer polypeptides and because even 
single point mutations are known to alter the structure 
of the fibers and the kinetics of its fiber formation. 
Thus, rapid-scan 2D IR spectroscopy is an ideal way 
to study their aggregation kinetics. 

Shown in Fig. 4a is the structure of the polypeptide 
in its fibril form, along with the locations of the six 
residues whose kinetics have been measured. The 
fibers consist of polypeptides folded in half and 
stacked into two columns so that there are four parallel 
P-strands. The isotope labels are arranged along the 
inner and outer P-sheets. Because it is a structurally 
degenerate system, isotope labeling of one residue will 
create two columns of labeled amino acids when the 
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Protein Dynamics Studied with 1D and 2D Infrared Spec¬ 
troscopy, Fig. 4 Human islet amyloid polypeptide (hlAPP) 
structure, 2D IR spectra, and aggregation kinetics, (a) hlAPP 
in looped conformation as observed in amyloid fibrils. Kinetics 
have been measured for each of the highlighted residues by use 
of labeling, (b-d) Difference 2D IR spectra of Ala- 

25(*^C = '*0) hlAPP at t = 31, 66, and 205 min, calculated by 


subtracting the t = 5 min spectrum, (e) Summary of the kinetics 
for all six labeled versions of hlAPP. These curves depict the 
growth of cross peaks in the 2D IR spectrum between the isotope 
labeled residue and the (I-sheet feature of unlabeled residues. 
Thus, these curves give the rate of inclusion of the isotope 
labeled residue into amyloid fibrils, which varies residue-to- 
residue (From Shim et al. 2009 with permission) 


system reaches its final fibril structure. The kinetics are 
initiated by solvating a powder of denatured peptides, 
at which time 2D IR spectra are continuously recorded 
every 0.4 s (a four frame phase cycling scheme is used) 
(Shim and Zanni 2009) over the course of several hours 
that the fibers take to fold. Shown in Fig. 4b-d are three 
representative 2D IR spectra collected during the 
course of the aggregation kinetics. At 5 min, the spec¬ 
trum is very broad and centered at 1,645 cm“*, indi¬ 
cating that the peptides are largely random coils. After 
some time, diagonal peaks appear at 1,617 cm“\ 
which indicates that P-sheets are forming. Also 
appearing are diagonal peaks for the isotope label at 
1,574 cm“* and 1,585 cm”*, indicating that Ala-25 is 
being incorporated into a P-sheet structure. The scaled 
kinetics of the 6 measured residues are shown overlaid 
in Fig. 4e from which it is clear that the P-sheet does 
not form at the same time for all the residues. Of the 
residues that were measured, Val-17 forms first and 
Val-32 forms last. A comparison of the half-rise times 
(tso) for the six residues shows that the residues nearest 
the loop form P-sheets sooner than residues at the ends. 
Thus, an aggregation mechanism was proposed where 


aggregation begins somewhere near or at the loop 
region, and then the p-sheet propagates outward 
toward the N- and C-termini. These results provide 
some of the most structurally detailed information on 
hlAPP aggregation. The results are especially interest¬ 
ing, because nucleation begins near the loop, whereas 
one might think that the most likely place for nucle¬ 
ation to occur would be at residues 23-27 (FGAIL) 
since it is the most amyloidogenic region in the poly¬ 
peptide (Tenidis et al. 2000). 

Summary 

In this entry, methods of using infrared spectroscopy to 
study protein dynamics were classified into three cat¬ 
egories with the focus largely on advances and appli¬ 
cations of 2D IR spectroscopy. Infrared spectroscopy 
has many advantages over other techniques when it 
comes to time scales and sample environments. It has 
enjoyed many fruitful years of service to protein stud¬ 
ies, and with the new advent of 2D IR spectroscopy, 
13c = ***0 isotope labeling and simulations, it will 
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continue to be useful for addressing dynamical ques¬ 
tions with ever more precise structural accuracy. 

Cross-References 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Kinetics: Relaxation Methods 

► NMR Methods for Kinetic Analysis 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 

► Spin-Labeling EPR of Proteins: Dynamics and 
Water Accessibility of Spin-Label Side Chains 

► Transient-State Kinetic Methods 
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Definition 

The dynamics of proteins is the evolution of atomic 
and molecular structure as a function of time, centered 
on those motions that affect or bring about function. 

Introduction 

The subject of this review is the application of “time- 
resolved” approaches to characterize the dynamical 
nature of proteins, more specifically enzymatic 
catalysis (Callender and Dyer 2006). Because of the 
importance of dynamics to how proteins (and indeed 
other biological macromolecules and complex sys¬ 
tems) function, there is a substantial amount of work 
ongoing in many different communities to understand 
this problem. Some of this work is described in this 
chapter and in other chapters of this volume. 

Just what is meant by dynamics? In our context, 
“dynamics” is the evolution of atomic and mole¬ 
cular structure as a function of time, centered on 
those motions that affect or bring about function. 
While atoms in proteins undergo considerable motions 
of various types, an organizing paradigm describing the 
dynamics is required so that meaningful studies can be 
performed and the observed kinetics can be related to 
simulation/computational studies and, finally, function. 

Figure 1 is a schematic of the “energy landscape” 
picture of the reaction pathway that takes place in 
enzymatic catalysis, the catalyzed conversion of 
a substrate molecule to product. Although binding of 
substrate starts the catalytic event (and many other 
processes involving proteins), this physical event is 
very poorly understood. There is certainly substantial 
evidence that a protein experiences a range of confor¬ 
mations in solution, and some of the ensembles are 
more competent to bind the substrate than others. 
The ensemble of conformations are interconverting 
on various timescales, many faster than the catalytic 
event and others slower. There have been a number of 
studies on the nature of ligand binding to proteins 
showing, in virtually all cases, the formation of an 
“encounter complex” which relaxes to conformation(s) 
closer to those that would be observed in crystallo¬ 
graphic structures of protein-ligand complexes 
(Hammes and Schimmel 1970; Callender and Dyer 
2006). In principle, the system may reach the transition 
state of the reaction more easily from some of these 


“relaxed” conformations than others. In this view, the 
problem of protein dynamics is a characterization of the 
various protein substates structures, how they intercon¬ 
vert, and in what way are the interconversion thermo¬ 
dynamics and kinetics related to function. The picture 
emphasizes the energy landscape with regard to the 
entire protein-ligand complex. The reaction coordinate 
of the reacting groups is a specific coordinate subspace 
embedded within the protein-ligand energy landscape. 
Hence, it is possible, conceptually, that many substates 
of the protein-substrate complex are relatively well 
populated and still have just a small subset of these to 
be “competent” to produce product. 

In addition, unlike the solution chemistry of 
the specific reaction under study, the transition state 
coordinate of the enzyme-substrate system may involve 
protein residues directly. In a recent study of lactate 
dehydrogenase, a line of residues extending from one 
side of the protein through the active site was found to 
form a “compression” mode that transiently brought the 
two reacting substrates closer together, thereby likely 
enhancing the catalytic rate. This was found to be a tme 
vibrational mode, and has been called a “promoting 
vibration” (Quaytman and Schwartz 2007). 

It is clear from Fig. 1 why the study of the dynam¬ 
ical nature of proteins is difficult and why so little is 
understood. It is necessary to characterize atomic/ 
molecular motions on all timescales, from femtosec¬ 
onds to minutes despite the fact that most enzymatic 
catalysis occurs on the millisecond timescale (give or 
take an order of magnitude). There is, at least at pre¬ 
sent, no way of simplifying the problem to focus on 
a specific timescale or even simplifying the Michaelis 
complex to a system of “two-state” or “three-state” 
conformations. Moreover, the energy landscape is 
very complex and the number of substates huge so 
that, while particular studies may reveal specific 
motions, some way must be found that relates the 
observed motion to function. Most of the atomic and 
molecular motion will be just “background.” 

Summarized here are “time-resolved” studies 
(reviewed in, Callender and Dyer 2006). The approach 
is essentially the familiar “pump-probe” type. In 
common chemical studies, an example of such 
a study would be “stopped-flow” spectroscopy 
pioneered about 50 years ago. Two reactants are 
mixed, and the reaction is monitored as a function of 
time by some suitable spectroscopic probe (optical 
absorption, emission, etc.). The experimental 
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Protein Dynamics: Time-Resolved Spectroscopic Studies, 
Fig. 1 An energy landscape picture of enzymatic catalysis. 
Binding of substrate involves specific protein conformations of 
the un-ligated protein which are competent to bind. The archi¬ 
tecture of the enzyme-substrate Michaelis complex is viewed as 
an ensemble of interconverting states, some of which are far 
from being productive (in the sense of ease of going to product) 
and some closer to the transition state. Organized portions of the 
protein form a sub-ps compression mode, a “promoting 


vibration,” that carries the system over the transition state. The 
picture emphasizes the energy landscape with regards to 
the entire protein-ligand complex. The reaction coordinate of 
the reacting groups is a specific coordinate subspace embedded 
within the protein-ligand energy landscape. Shown in color 
{black, to blue, to green, to red), the approach to the transition 
state of the system is indicated. The putative pathways between 
conformations are highly simplified 


challenge is that this particular approach has 
a resolution of at best 1 ms. Since enzymatic catalysis 
is, very typically, over in that time, the time resolution 
must be extended considerably. 

Discussed here, briefly, is the state of the art in “fast 
mixers” (capable at present of a time resolution of 
around 5 ps or so), laser-induced temperature 
jump relaxation spectroscopy (capable of a resolution 
of around 10-50 ps), and other approaches for 
“pumping’Vinitiating a chemical reaction or structural 
rearrangement. When coupled to probes capable 
of following the reaction (like optical or IR absorption, 
fluorescence emission, and so forth), methods exqui¬ 
sitely sensitive to evolution of structure, equally 
sensitive on all timescales are available. 

Fast Mixers 

Continuous flow fast mixing methods are perhaps the 
most general approach to the study of reaction kinetics, 
although the time resolution of these techniques is 


limited. These techniques can be applied to any reaction 
by mixing separate protein and ligand streams. Since 
mixing methods rely on diffusion of the enzyme and 
substrate together, the time resolution is fundamentally 
limited by the length scale of the mixing process. Two 
different strategies have been pursued to minimize the 
mixing length scale and thus to maximize the time- 
resolution of these techniques. The first approach relies 
on turbulent flow to achieve mixing. In one implemen¬ 
tation, two sample streams are forced to flow around 
a sphere at a high flow rate, causing rapid turbulent 
mixing. The mixed stream is then passed through an 
observation cell, to allow spectroscopic detection of the 
reaction progress. The main disadvantage of this 
approach is the relatively large sample consumption 
because of the high flow rates needed to produce turbu¬ 
lent flow. The fastest time resolution reported for 
a turbulent flow mixer is ^50 ps, and the mixer con¬ 
sumes nearly 1 ml of sample per second. 

An alternative approach relies on hydrodynamic 
focusing of a sample stream by a surrounding (sheath) 
stream at low flow rates. The flow is laminar, and 
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Fig. 2 Microfluidics flow cell with UVA^is, fluorescence, and 
Raman detection capabilities 

the sample and sheath streams do not mix. Instead, the 
reaction is initiated by diffusion of the enzyme sub¬ 
strate into the central sample stream from the sheath 
flow. Figure 2 shows a calibrating run. The reaction 
progress is followed by obtaining the full UV-vis 
absorbance, fluorescence or Raman spectrum using 
an imaging spectrograph and CCD detector. The two- 
dimensional CCD detector records the full spectrum as 
a function of distance (time) downstream of the hydro- 
dynamic focusing region. Thus the full spectral and 
temporal response is recorded in a few seconds of 
integration time, requiring only a few microliters of 
dilute (100 pM) protein solution. The time resolution is 
ca. 10-50 ps. 


This can be combined with laser “flash” initiation, 
using laser photolysis to initiate the reaction (Toepke 
et al. 2006). Once the sample stream is equilibrated, 
say a few millimeters downstream from the focusing 
region, the sample is flowed through a sharply focused, 
continuous wave laser beam. The laser beam initiates 
the reaction, and measurements are performed down¬ 
stream from the “flash” point. Microfluidics flow-flash 
improves the time resolution of the flow experiment to 
a few microseconds, since it is not limited by the 
diffusion of reactants together, but rather by the spatial 
resolution of the excitation and detection schemes. 

Another approach to the study of fast enzyme 
dynamics, particularly for reactions involving proton 
transfer, is the laser-induced pH jump. The laser- 
induced pH jump and its application to proton transfer 
dynamics in proteins have been demonstrated by several 
groups (cf., Callender and Dyer 2006; Causgrove and 
Dyer 2006). Still another approach is to induce 
a reaction by pulsed laser light when experimental 
circumstances permit (reviewed in this volume; Zscherp 
and Barth 2001). 

Temperature Jump Relaxation Spectroscopy 

The laser-induced T-jump technique is the most 
general and simplest of the fast initiation approaches 
(Cantor and Schimmel 1980). The use of fast T-jump 
methods to displace chemical equilibrium and measure 
relaxation rates was pioneered by Eigen and DeMayer, 
using rapid capacitance discharge to produce Joule 
heating in conducting solutions (Eigen and De Maeyer 
1963). Faster T-jumps are possible using pulsed laser 
excitation, usually pumping the near-infrared solvent 
absorbance. Vibrational relaxation takes place on the 
picosecond timescale both in water and in proteins, 
therefore complete thermalization of solvent and 
solute can occur within 10-20 ps. By judicious choice 
of temperature and solution conditions, one can cause 
the equilibrium of an enzyme reaction to be poised 
at any point. A change in temperature then perturbs 
the equilibrium in a desired direction, and the 
relaxation kinetics of the system is then measured 
as it approaches the new equilibrium. The use of 
laser-induced T-jump relaxation techniques was 
developed to study fast events in protein folding and 
enzyme reactions (Callender et al. 1998; Dyer et al. 
1998). The ultimate time resolution of this approach is 
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Fig. 3 Laser-induced T-jump relaxation spectrometer using 
IR absorption as a probe. Time response: 20 ns-10 ms, and 
by varying the laser diodes, the total frequency range 
spanned by this is system is 900-3,200 cm“*. The approach 
to the impulsive heating of a protein solution is direct 
pumping of the near infrared absorbance of water (near 
1.54 pm for H 2 O or 1.91 pm for D 2 O). Specifically, the 
pump pulse is generated by Raman shifting the fundamental 
of a Q-switched Nd:YAG laser (one Stokes shift in methane 
or hydrogen, respectively) to yield a 10 ns heating pulse. 

about 10-50 ps. Different spectrometer systems must be 
employed to cover various timescales (e.g., 10 ps-10 ms, 
10 ms-10 ms, 1 ms-minutes). In this way, a range of 
timescale of over 13 orders of magnitude is covered 
(Callender and Dyer 2006). Figure 3 is a schematic of 
a spectrometer system for nanosecond-microsecond 
studies. 

Time-Resolved Spectroscopic Probes of 
Structure 

Structural characterization of protein dynamics 
requires spectroscopic tools with sufficient time 
resolution and structural sensitivity. Fluorescence, 
UVA^is, and IR absorption can follow kinetics down 
to the picosecond timescale, regardless of the reaction 
triggering approach, since the characteristic times of 
these spectroscopies is 10-100 fs. Structural speciheity 
of the optical spectroscopies arises from the probing of 


The wavelength of the heating pulse lies near the peak of 
either the weak H 2 O (1.54 pm) or D 2 O (1.91 pm) IR absorp¬ 
tion band (a fn 5-10 cm“*). These wavelengths were chosen 
because they are only moderately absorbed (about 10% typ¬ 
ically in our cells). This insures uniform heating of the 
sample within the interaction volume. Also, this frequency 
is ideal for our purpose because proteins do not absorb 
at 1.54 pm (about 6,500 cm^*) nor at 1.91 pm (about 
5,200 cm“*). The typical energy densities we employ are 
10-70 MJ per pulse focused to a 200-2,000 pm diameter 
spot size, which produces a 10-30°C temperature jump 

specific chromophores. For example, emission from 
the indole ring of a single Trp residue within sizable 
proteins is easily measured (Lakowicz 1999). The 
structural sensitivity of vibrational spectroscopy is 
very well established. Often, molecular groups yield 
vibrational spectra at specific frequencies and intensi¬ 
ties characteristic of the specific group (Herzberg 
1945, 1950). The accuracy of correlations between 
vibrational frequencies and key bond properties can 
be very high. For example, empirical correlations of 
bond vibrational frequencies have been used in some 
cases to determine bond lengths to within ±0.005 A, 
bond orders to within ±0.04 valence units, and 
hydrogen bonding enthalpies to within ±0.5 kcal/mol 
(e.g., Deng et al. 1998). Protein dynamics can involve 
changes in protein backbone conformation and 
H-bonding, in the orientation and packing of side 
chains, in the solvation of these structures and so 
forth, all of which will be reflected in changes in 
the vibrational spectrum. Two problems have tended 
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Fig. 4 The substrate binding site (shown in red) is fairly buried 
in LDH (10 A from the surface) and the surface loop that covers 
the binding pocket is “closed” (left structure). The “mobile 
surface loop” of LDH is partially open in the binding competent 
conformations (right structure), and multiple open conforma¬ 
tions, most likely yielding multiple binding pathways, appear to 
exist. In general, open versus closed conformations are due to 


subtle protein and water rearrangements. The channel to the 
active site in the closed (left) and open (right) forms. The 
coloring scheme is consistent to both pictures: white - Argl09; 
dark blue - loop residues except Argl09 (98:105 and 107:110); 
red - Asnl38, Argl69, Hisl93, Thr248; light blue - rest of the 
protein. Argl09 is displaced 5.64 A from its position in the 
active site out to the surface of the protein 


to limit the usefulness of IR spectroscopy in the 
characterization of proteins. One, water, the natural 
environment of proteins, absorbs strongly throughout 
the infrared and has intense absorbencies near 3,300, 
1,600, and <800 cm”'. This problem is usually 
mitigated by the use of D 2 O to access spectral “win¬ 
dows” which are intractable in H 2 O. T-jump relaxation 
spectrometers can pump either H 2 O or D 2 O. Second, 
the nearly 10"* vibrational modes of a typical small 
protein and the limited frequency dispersion compared 
to linewidth combined to produce extreme spectral 
congestion. Over the past decade, however, revolution¬ 
ary developments have occurred in several areas of 
vibrational technique including difference methods 
(e.g., Zscherp and Barth 2001; Kotting and Gerwert 
2005) and isotope editing (e.g., Callender and Deng 
1994; Brewer et al. 2007). These developments greatly 
enhance our ability to assign the vibrational observ¬ 
ables to specific structures. 


Kinetic Modeling 

Kinetic studies, of any type, require modeling the 
results with a kinetic pathway(s). For example, studies 


of enzymes might be modeled by (where S and P 
represent “substrate” and “product”): 

E + S< - >£«5 <-» £«£<- >E + P 


and so forth. Finding the correct pathway is performed 
by measurements as a function of the concentration of 
free enzyme and free substrate; this distinguishes 
bimolecular from unimolecular reactions, establishes 
the number of unimolecular interconversions, and 
yields a quantitative extraction of the microscopic 
rate constants. In addition, the use of different 
probes of structure (IR signature frequencies, 
NADH and Trp emission studies) can be useful, 
even necessary, to disentangle parallel from 
sequential kinetic models. The use of software that 
model kinetics of a reaction given kinetic parameters 
has been very helpful in accurate refinements 
of a specific kinetic model (e.g.. Cantor and 
Schimmel 1980). 

Relaxation studies are particularly appropriate as 
a complement to “mixing” studies. This is because, in 
a study of kinetics using relaxation approaches, fast 
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Protein Dynamics: Time- 
Resolved Spectroscopic 
Studies, Fig. 5 Transient IR 
isotope-edited absorbance of 
LDH/NADH/oxamate ternary 
complex (’^C 2 minus *^€ 2 ) 
after laser T-jump to 40° C at 
1,624 cm^* (upper trace) and 
1,607 cm“* (lower trace). 

A single exponential fit 
overlays the 1,624 cm“ ’ 
transient, whereas 
a biexponential fit is shown for 
the 1,607 cm“' transient. The 
temperature-dependent static 
isotope-edited IR spectra are 
shown in the inset 
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interconversions relax before slower relaxations. So, 
for example, in a system like: 


the fast step is not obscured by the rate-limiting slow 
step (as would be the case if the reaction had been 
initiated from left to right in a conventional mixing 
experiment); both steps are plainly observed. How¬ 
ever, the sequence of steps is sometimes difficult to 
disentangle within a single relaxation study. In general, 
the correct kinetic sequence is often resolved when 
several different probes of structure are available and 
taking into account the specific physics known about 
a particular protein. 


Putting It All Together: The Dynamics of 
Lactate Dehydrogenase (LDH) 

Ligand binding: an example. From systematic studies 
of the binding kinetics of ligands/substrates to the 
LDH/NADH complex (Qiu et al. 2007), it was inferred 
that the protein complex exists as an ensemble 
of conformations, some of which can bind substrate 
well (labeled “BC” below for binding competent) 
and some which are unable to bind ligand effectively 
(non-competent, “NC”)- It was found that the 
LDH®*-/NADH ensemble of species combines with 
oxamate to form a loose “encounter complex” (labeled 
ENC below) to yield a partial kinetic scheme 
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describing the binding event (not known is if there are 
many diverse competent or non-competent conforma¬ 
tions, if they freely interconvert, and timescales): 

LDH'^'^/NADH , -" LDH®‘^/NADH 

k„n 

+ oxamate ^ A LDH™^/NADH/oxamate- > 

2000 s^‘ 

From studies where the binding is modulated by tem¬ 
perature and a series of osmolytes and from parallel 
molecular dynamics calculations, a plausible (approx¬ 
imate) structure for the binding competent species of 
LDH was derived. Both the binding-enabled and - 
disabled structures are sketched in Fig. 4. It was spec¬ 
ulated that the strategy for binding, effectively 
a transient exposure of the binding pocket to solvent, 
may be necessary to get a ligand efficiently to a binding 
pocket that is located fairly deep within the protein’s 
interior. Such a process is a fairly old idea; some sort of 
transient dynamical motion(s) was postulated, from the 
days of the first crystallographic pictures, to be 
required for the binding of oxygen to myoglobin and 
hemoglobin since the binding pocket in these proteins 
is deeply buried indeed. 

The Michaelis Complex of LDH consists of an 
ensemble of interconverting structures. A series of 
studies were carried out characterizing the number of 
substates within the LDH/NADH/oxamate complex 
(oxamate is a substrate mimic of pyruvate) and the 
kinetics of their interconversions using T-jump relax¬ 
ation studies coupled with measurements of key sub¬ 
strate IR bands and fluorescence emission studies 
arising from the protein at 30 ns resolution (Deng 
et al. 2008; Zhadin et al. 2008). The studies suggest 
that the evolution of the reaction pathway between 
LDH/NADH and substrate (mimic; oxamate) collapses 
via a branched reaction pathway to form major and 
minor populated bound species. The substates 
and interconversion rates, at room temperature, are as 
follows: 

400 s“^ 

LDH“‘"“/NADH/oxamate , LDH™‘^/NADH 

200 s"' 

3000 s“^ 

/oxamate ^ _ LDH^^J°WADH/oxamate 

2000 s"' 

The Michaelis complex ensemble then consists of at 
least three conformations: (1) the “loosely bound” 
encounter complex (about 5% of the total; its 


structure remains to be elucidated); (2) a well-bound 
LDH^‘"°'^/NADH/oxamate complex (about 30% of the 
total, labeled as “Minor” in the scheme above; charac¬ 
terized by the oxamate C = O stretch at 1,624 cm“* in 
Fig. 5); and (3) a major population (about 70% of the 
total; characterized by the oxamate C = O stretch 
at 1,608 cm“* in Fig. 5). These species are 
interconverting on the sub-millisecond timescale. It 
can be deduced, from the strong correlation of the 
C = O stretch of substrate and catalytic activity, that 
the “minor” substate lies in an energy basin within the 
Michaelis complex ensemble far from the reaction 
pathway while the “major” species is highly reactive. 

Summary 

Just as organized molecular structure is a signature 
feature of proteins and this organized structure highly 
correlated with and responsible for function, so is 
organized internal molecular and atomic motion 
responsible for function. However, the physics of the 
dynamical nature of proteins is only understood in 
rather general terms, certainly not sufficiently well to 
design functional proteins (or conversely interrupt 
some key functional purpose with a small molecule). 
The kinetic properties of atomic motion within pro¬ 
teins are difficult to characterize and relate to function. 
However, new “time-resolved” spectroscopic 
approaches, sketched in this short review, are begin¬ 
ning to bear fruit. They are capable of reporting 
dynamic features on the vast timescale needed to 
understand protein dynamics (femtoseconds to 
minutes) and relate the observed dynamics directly to 
function. 
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Synonyms 

Protein dynamics; Protein folding 

Definition 

Proteins (biopolymers consisting of amino acids) play 
a key role in many biological processes and structures. 

Protein folding (“folding”) is the process by which 
proteins self-assemble into a well-dehned folded or 
“native” structure. This native structure is typically 
the structure relevant for function. 

Introduction 

Understanding how proteins fold is complicated by the 
subtleties of the many, relatively weak interatomic 
interactions which somehow conspire to stabilize 
a protein fold and direct folding via certain pathways. 
However, studying protein folding in all-atom detail 
on experimentally relevant timescales (milliseconds to 
seconds) is a major challenge for Molecular Dynamics 
(MD) simulation. Thus, due to the challenges of study¬ 
ing folding in atomic detail, how proteins fold remains 
an unsolved problem. 

In protein folding, chemical detail (involving 
the nature of the interatomic interactions) plays an 
important role. Unlike related problems, such as 
RNA secondary structure formation, the forces that 
drive protein folding are subtle, resulting from 
a complex combination of relatively weak interactions. 
Often, even small perturbations (e.g., a single point 
mutant) can signihcantly alter folding, even in some 
cases driving the protein to be stable in a different fold. 
Moreover, the final goal for understanding folding is 
how proteins fold in biological contexts. Thus, even if 
one could develop successful simplified models for 


folding in vitro, one would still need to ask how are 
these interactions modified in a cellular environment? 
The ability to understand folding at an atomic level 
opens the door to naturally carrying over this under¬ 
standing to novel environments, such as those found in 
the cell, since modeling physical interactions in atomic 
detail makes these models transferable, unlike 
simpler models. 

Thus, simulating protein folding kinetics remains 
extremely challenging. The most straightforward 
approach, molecular dynamics simulations using 
standard atomistic force fields, quickly mns into 
a significant sampling challenge for all but the most 
elementary of systems. While small proteins fold on the 
multiple microsecond to second timescale, detailed atom¬ 
istic simulations are currently limited to the nanosecond 
to microsecond regime. To overcome this barrier, one 
must typically choose between simplified models and 
alternate sampling methods, which both introduce new 
approximations. It must be emphasized here that the 
relevant question is not whether a given method is “cor¬ 
rect” in some absolute sense (as all models have limita¬ 
tions), but whether the model is predictive. 

How can one tell if simulations of folding are 
successful? Naturally, the most direct test is the 
quantitative prediction of experiment. Currently, 
the most accessible quantitative experimental observ¬ 
ables of two-state proteins are the folding rate, the 
unfolding rate, and the thermodynamic stability. 
Thus, it is important to validate any simulation method 
through quantitative comparison to experiment with 
proper statistics. As rates and free energies are the 
natural quantitative experimental measurements, 
relative or absolute prediction of these quantities is 
necessary for a direct connection to experiment and 
a true assessment of theoretical methodologies. 
Finally, it should be stressed that a quantitative predic¬ 
tion of rates is not sufficient to guarantee the validity of 
a model. The ability of fairly different models to 
quantitatively predict folding rates strongly suggests 
that additional experimental observables are needed to 
further validate simulation. 

Progress to Date 

An understanding of complex biophysical phenomena 
will likely require some degree of computer 
simulation, due to intrinsic experimental limitations. 
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Thus, it is naturally the combination of powerful 
simulations with quantitative experimental validation 
that will elucidate the nature of how proteins fold. 
In many ways, there has been great progress. The 
ability to quantitatively predict rates, free energies, 
and structure (e.g., see Bowman et al. 2009) from 
MD simulations based on physical force fields reflects 
significant progress made over the last 10 years. 

Approaches to Simulating Folding with MD 
Simulation 

There have been numerous other methods to simulate 
relatively long-timescale dynamics. While all methods 
clearly cannot be exhaustively described, some of the 
methods commonly used are summarized here. 

Tightly coupled molecular dynamics (TCMD). To 
simulate dynamics, typically one simulates molecular 
dynamics (MD) by numerically integrating Newton’s 
equations for the atoms in the system, with femtosec¬ 
ond timesteps to include the fast timescales of atomic 
motion. Thus, to reach the millisecond timescale, 
many (10*^) steps must be taken. While modem molec¬ 
ular dynamics codes are extremely well optimized and 
perform about 10° steps per CPU day, this clearly falls 
short of what is needed. Using multiple CPUs in 
a tightly coupled fashion to speed a single trajectory 
is appealing, but is an inefficient use of CPU power 
(i.e., one does not get a lOOx speed increase with 100 
CPUs) and thus has not been widely used to get beyond 
the nanosecond timescale, with the notable exceptions 
of Klaus Schulten and coworkers’ recent 10 ps 
simulations (Freddolino and Schulten 2009) and 
ground breaking work using specialized hardware 
from D.E. Shaw et al. (2010). 

Replica Exchange Molecular Dynamics (REMD) 
has become a powerful technique to explore the free 
energy landscapes of proteins (Sugita and Okamoto 
1999), with speed increases of lOx over traditional 
MD. Moreover, REMD efficiently parallelizes with 
only slightly coupled networking required. 
However, REMD achieves its speed increase by using 
a nonphysical form of kinetics (in temperature replica 
space). While this yields a Boltzmann weighted ensem¬ 
ble (after sufficient convergence), the trajectories can¬ 
not themselves be used to predict any direct kinetic 
properties, although aspects related to the kinetics 
(such as possibly kinetically relevant intermediates) 
can be potentially inferred from the resulting free 
energy landscapes (Garcia and Onuchic 2003). 


High temperature unfolding. While folding times 
are clearly long from a simulation point view, 
unfolding (especially under high denaturation 
conditions) can be very fast - on the nanosecond 
timescale. Under extreme denaturing conditions 
(e.g., ^400 K temperature), one would expect that 
the folded state would become only marginally 
metastable, with a low barrier to unfolding. 
Daggett and Levitt (1993) first took advantage of this 
scenario, and Daggett’s group has subsequently 
pioneered this method to examine a variety of proteins 
and compare their results to experiment. 

Coarse grained and minimalist models have played 
a large role in recent simulations of protein folding 
(Onuchic et al. 1997; Shea and Brooks 2000). These 
methods trade some degree of chemical complexity 
for computational tractability. However, in their sim¬ 
plicity, they also allow one to directly address a range 
of hypotheses relating to general properties of folding. 
However, in their generality, by construction they may 
lack the ability to access more detailed questions of 
folding (depending on the nature of the question of 
interest). 

Energy landscape theory has played a significant 
role in the development of how one conceptualizes 
protein folding (Bryngelson et al. 1995; Chan and 
Dill 1997). Driven by both analytic theory and 
minimalist models (e.g., lattice and off-lattice Go 
models), energy landscape approaches by Dill, 
Onuchic, Wolynes, and others have suggested 
a funnel picture for how proteins fold. 

Path sampling. Given an initial trajectory between 
the unfolded and folded regions, this method generates 
an ensemble of different pathways that join the 
unfolded and folded regions (Chandler 1997). For 
example, Bolhuis and coworkers determined the for¬ 
mation order of hydrogen bonds and the hydrophobic 
core in a P-hairpin (Bolhuis 2003). It is also possible to 
calculate folding rates from these ensembles. 

Markov State Models. Recently, discrete-state 
master equation or Markov state models have had 
success at modeling long-time statistical dynamics 
(Singhal et al. 2004; Swope et al. 2004; Chodera 
et al. 2007; Noe and Fischer 2008; Noe et al. 2009). 
In these models, metastable conformational states are 
identified such that the intrastate dynamics are much 
faster than interstate dynamics (so states visited over 
time form a discrete Markov chain). Transition rates 
between the states are estimated from Molecular 
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Dynamics simulations. If the model is shown to 
self-consistently recapitulate the statistical dynamics 
of the trajectories it was constructed from, and if it 
obeys the Markov property, it can be used to simulate 
the statistical evolution of a non-interacting ensemble 
of molecules over much longer times than the lengths 
of the individual trajectories from which it is 
constructed. Spectroscopic signals can be computed 
directly from linear combinations of the “spectro¬ 
scopic signatures” of each state, and direct comparison 
with experimental spectra can be computed. In partic¬ 
ular, the combination of GPUs and distributed 
computing is particularly powerful, enabling simula¬ 
tion of folding events on millisecond timescales and 
beyond. 

Case Study: MD Simulation of the Folding of 
NTL9 on the Millisecond Timescale Using 
Markov State Models 

To date, the slowest-folding proteins folded ah initio 
by all-atom molecular dynamics simulations have had 
folding times in the range of nanoseconds to microsec¬ 
onds. Recently, Voelz et al. studied the millisecond 
folding dynamics of the 39-residue protein NTL9 
using an Markov State Model (MSM) approach 
(Voelz et al. 2010). 

Simulation Details 

Tens of thousands of trajectories were simulated via 
the Folding@home distributed computing platform 
(Shirts and Pande 2000) from native, extended, and 
random-coil configurations using an accelerated 
version of GROMACS written for GPU processors 
(Friedrichs et al. 2009), for an aggregate time of 
1.52 ms. GPUs played a key role, allowing for dramat¬ 
ically longer trajectories than previously possible. 

Comparison to Experiment 

The native state is very stable in this forcefield, 
exhibiting decreasing stability with increasing temper¬ 
ature (the native structure is taken from the N-terminal 
domain of the 2.8 A-resolution crystal structure of 
ribosomal protein L9). RMSD-C^ distributions after 
10 ps show well-defined native and collapsed unfolded 
basins near 3 A and 5 A, respectively. 

Simulations reach a well-folded state. Of the 
^3,000 trajectories started from unfolded (extended 


and coil) states, two reach an RMSD-Cq, < 3.5 A and 
eight reach an RMSD-Cq, < 4 A. Near-native configu¬ 
rations show heterogeneity in hydrophobic packing, 
most notably in alternative side chain arrangements 
in the beta-sheet structure (Fig. 1). Most common of 
these is a nonnative hydrophobic core in which PHE5 
is exposed to solvent, while the natively exposed cross¬ 
strand packing of residues 14, 118 and 137 is instead 
used to form a stable core. A Markov Sate Model 
(which takes into account complex, non-exponential 
kinetics) built on the data set also recapitulates the 
experimental observation of folding on the 1 ms 
timescale. 

Insight into Folding Mechanisms 

MSM approaches, by identifying a set of kinetically 
metastable states and efficiently sampling transitions 
between these states, can model long-timescale 
kinetics from much shorter trajectories. Key to 
accurately identifying metastable states is the 
clustering of trajectory conformations into microstates 
fine-grained enough as to best be lumped into groups 
of maximally metastable macrostates. 100,000 
microstate clusters were calculated using an approxi¬ 
mate Ucenters algorithm, each with an average radius 
of ^3 A RMSD-backbone. The timescales predicted 
by these models (obtained by diagonalizing 
the macrostate rate matrix) show a clear spectral 
gap separating the slowest relaxation timescale from 
the rest at all lag times, indicative of single¬ 
exponential kinetics. The implied timescale of the 
model levels off beyond a lag time of ^10 ns to an 
implied timescale of ^1 ms, close to the experimental 
folding time. 

A key strength of MSMs is their ability to gain 
insight at coarser scales, by “lumping” the 100,000 
microstate model into a simpler model with fewer 
states. To gain a mesoscopic view of the folding free 
energy landscape, a 2,000-macrostate MSM was built 
from the simulation data. In this mesoscopic view, 
multiple folding pathways along metastable states are 
found, which are diffuse collections of on- and 
off-pathway states (see Fig. 1 for a lower resolution 
visualization). This indicates a vast heterogeneity of 
folding substates. At the same time, highly populated 
“native” and “unfolded” macrostates that dominate the 
observed relaxation rates can be identified. 

With an MSM, it is natural to examine the nature of 
protein folding in terms of studying the graph of 
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Protein Folding Studied 
with Molecular Dynamics 
Simulation, Fig. 1 A 2,000- 
state Markov State Model 
(MSM) built using a lag time 
of 12 ns, using Transition path 
theory (Noe et al. 2009) for 
visualizing transition paths. 
The top ten folding fluxes are 
shown, calculated by a greedy 
backtracking algorithm. These 
pathways account for only 
about 25% of the total flux, 
and transit only 15 of the 2,000 
macrostates. The size shown 
for each state is proportional to 
its free energy, and the arrow 
size is proportional to the 
interstate flux 



metastable states and how they are connected to each 
other, thus laying out how the protein folds. Recent 
MD simulations suggest that folding of proteins occurs 
via a hub-like mechanism (Rao and Caflisch 2004; 
Bowman and Pande 2010), where the native state is 
highly kinetically connected. This means that traversal 
between the huge number of unfolded states most often 
occurs via the folded state. 

Summary 

Due the limitations of both simulation and experiment, 
the ultimate understanding of protein folding 
will likely come from a coupled approach of detailed 
simulations extensively validated and tested by 
experiment. Moreover, with the ability to quantita¬ 
tively predict kinetics for folding in solution, new 
challenges emerge: understanding folding in biologi¬ 
cally relevant contexts, such aspects found in cellular 
biology, which would be an important next step in the 
understanding of protein folding and its connection to 
biology and biomedical questions. In addition, 
this work would also have an impact on the basic 
understanding of several protein-misfolding related 
diseases, such as Alzheimer’s and Huntington’s. 
Indeed, the ability to use detailed models to simulate 
long timescales would have very broad implications 
in forming connections between simulation and exper¬ 
iment to an extent not seen today. To summarize, it is 


expected that, in new frontiers like these, the study 
of folding will have the greatest biological and 
biomedical impact. 
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Synonyms 

Biomolecular modeling 


Definition 

Molecular dynamics simulations are iterative calcula¬ 
tions performed using Newton’s equations of motion to 
predict the movement of atoms or particles over time. 

Introduction 

In vivo protein folding is fast and efficient; to study this 
process experimentally at atomic detail presents 
a difficult task (Bartlett and Radford 2009). In contrast, 
molecular dynamics (MD) simulations can access spa¬ 
tial and temporal resolution beyond the capabilities 
of experimental methods but is hindered in its ability 
to acquire data over biological timescales 
(Daggett 2006). There were notable studies paving 
the way for atomistic MD studies of protein folding 
and unfolding. For example, the first folding simula¬ 
tion of a globular protein was reported in 1975 using 
a coarse-grained model of bovine pancreatic trypsin 
inhibitor (BPTI) (Levitt and Warshel 1975). Further 
advancement in computer hardware led to the first 
united-atom simulation of the same protein in 1977 
(McCammon et al. 1977). 

Protein folding typically occurs over the microsec¬ 
ond to second timescale, which consistently poses 
considerable challenges for the modeling of protein 
systems with computational resources or patience 
being the limiting factors. To counteract this limita¬ 
tion, some have resorted to improving the accuracy of 
coarse-grained methods to reduce the computational 
effort required to measure motion on a number of 
different timescales (Tozzini 2005), while others, 
aided by technological advances, have followed 
a trend to use MD simulations of increasing duration 
as well as advanced sampling methodologies. Notable 
milestones of long simulation times are the villin head- 
piece (Duan and Kollman 1998) that was simulated for 
1 ps in 1998 and, more recently, a 1 ms simulation of 
BPTI (Shaw et al. 2010). Although the ability to gen¬ 
erate longer simulations within a feasible time is 
predicted to continue increasing into the future 
(Vendruscolo and Dobson 201 1), the timescales acces¬ 
sible at present limits most research to the study of fast 
folders, where the folding time is typically less than 
1 ms (Pande et al. 2008; Daggett 2006). 

With such limits on accessible timescales, a choice 
has to be made between the accuracy achievable and 
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the degree of sampling attainable. It is important to find 
the appropriate simulation method that provides 
a balance between these factors while remaining suit¬ 
able to provide the desired information for a given type 
and size of system. As mentioned above, the simula¬ 
tion times accessible can be increased through reduc¬ 
tion in the detail of the protein models from all-atom 
models, to united-atom or coarse-grained models 
(Tozzini 2005). Solvation, although important in bio¬ 
logical processes and a contribution that one would 
always wish to model explicitly, is another area where 
reduced treatments can enable extended simulation 
times to be accessed. Simulation in a vacuum or using 
implicit solvation methods can significantly reduce the 
amount of computational time required; efficiency is 
increased through both the reduction in number of 
atoms and a decrease in friction within the simulated 
system (Ponder and Case 2003; Pande et al. 2008). 

Another limitation of MD methods is that only 
a single molecule is simulated; experimental measure¬ 
ments are an ensemble average generated by a bulk 
sample of protein molecules. In the past, the method of 
using a single simulation has been criticized for inad¬ 
equate sampling and not reflecting the dynamics of the 
experimental ensemble. While this is true, it has been 
shown that only a finite number of simulations are 
needed to capture the ensemble average both of the 
native state and of the more heterogeneous denatured 
state (Daggett 2006). It was demonstrated through 100 
independent simulations of chymotrypsin inhibitor 2 
(CI2) at 498 K that five to ten simulations are sufficient 
to capture the overall dynamic behavior of the ensem¬ 
ble (Day and Daggett 2005). Moreover, the ensemble 
averages obtained from a small number of individual 
simulations of the same system have been shown to be 
quite uniform, despite the underlying heterogeneity of 
the simulated ensemble. 

In the following sections, an overview is given of 
a number of methods useful in the study of protein 
folding. The general theory and concepts of MD sim¬ 
ulations are covered in the majority of introductory 
texts to MD and will not be repeated here (Schlick 
2010; Leach 2001). 

Protein Folding Methods 

The free energy landscape for the protein folding pro¬ 
cess is commonly envisaged as a funnel with a rugged 


surface and a global minimum that pertains to the 
native state (Wolynes 2008). Although conventional 
MD simulations are adequate for capturing native state 
dynamics, they are unable to surmount the energy 
barrier to escape from the native basin and as such 
are limited in their ability to characterize the energy 
landscape far away from this global minimum. The 
progression over such barriers under normal tempera¬ 
ture and pressures is a rare event; hence, the MD 
methods used in the study of protein folding attempt 
to circumvent this limitation in a number of ways that 
reduce the size of the barriers or increase the probabil¬ 
ity of barrier crossing. 

The methods discussed below were all developed to 
overcome the energetic barriers that typically prohibit 
the extensive sampling of the folding energy landscape 
in a conventional MD simulation. Elevated tempera¬ 
ture unfolding simulations increase the probability of 
barrier crossing by increasing the thermal fluctuations. 
Forced unfolding simulations simulate applied forces 
to pull mechanical protein structures apart. Biased MD 
simulations use a definition of a given conformational 
state, typically the denatured or transition state, to bias 
the force field toward that target state. The definition 
for the nonnative states is often provided through a set 
of experimental restraints. Umbrella sampling incor¬ 
porates a set of simulations run in parallel, each biased 
to a particular region of the energy landscape. Replica 
exchange MD simulations, rather than using a biasing 
potential, exchange trajectories between different sim¬ 
ulation temperatures in an attempt to promote barrier 
crossing. Finally, transition path sampling attempts to 
find pathways that link the denatured and native states 
with identified transition points on the energy 
landscape. 

Elevated Temperature Unfolding 
Simulations 

Elevated temperature simulations typically use high 
temperatures of up to 498 K to induce the rapid 
unfolding of a structure (Daggett and Levitt 1993). 
As with conventional MD simulations, a native state 
structure is used as the starting point and increased 
pressures are used to maintain the correct water density 
for a given temperature. This method rests on the 
principle of microscopic reversibility and that the path¬ 
way and states visited as a protein unfolds are the same 
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Protein Folding: Molecular 
Dynamics Simulations, 

Fig. 1 Representative 
structures from the 348 K and 
398 K simulations of CI2 
reproduced from Day and 
Daggett (2007). This figure 
demonstrates the reproduction 
of the folding pathway in both 
folding and unfolding 
directions. Structures are 
colored from blue to red 
according to their degree of 
native structure. Structures 
that are in the same column are 
judged similar by Ca RMSD 
and position along the reaction 
coordinate 



Refolding 


as those visited during the folding pathway. Moreover, 
it assumes that using elevated temperatures to increase 
the speed at which the protein unfolds does so without 
altering the folding pathway. Hence, two main criti¬ 
cisms of this method are that the principle of micro¬ 
scopic reversibility is invalid and that the energy 
landscape is altered as the temperature is increased. 
However, detailed assessment of the method has dem¬ 
onstrated confidence in its ability to generate the cor¬ 
rect folding pathway. There are some differences 
between trajectories generated at different tempera¬ 
tures; however, the overall unfolding pathway has 
been shown to be independent of the temperature 
(Day et al. 2002). The adherence to the principle of 
microscopic reversibility was demonstrated through 
simulations of CI2 at the melting temperature (T^) of 
348 K (Day and Daggett 2007). At the Tn,, a protein 
should be 50% unfolded and 50% folded giving a AG 
of zero between the two states. Under these conditions, 
the trajectory was observed to unfold and refold along 
the same pathway as highlighted in Fig. 1. Further¬ 
more, the trajectory refolded to a less compact version 
of the native state, a structure that concurred with 
experimental data acquired at elevated temperature 
(Day and Daggett 2007). 

For over a decade, this method has been used with 
great success and has repeatedly shown agreement 
with experimental data (Fersht and Daggett 2002). 
Many comparisons have been done of predicted 


transition states identified from the elevated tempera¬ 
ture trajectories against experimental (F-value analy¬ 
sis. Notable case studies are discussed in detail in 
Daggett 2006. 

Forced Unfolding 

An MD method that is compatible with ► atomic force 
microscopy (AFM) measurements, called mechanical 
or forced unfolding, simulates the application of 
a pulling force on one or both termini of a protein 
(Forman and Clarke 2007). Simulation of a protein 
forcibly unfolded by this method generates 
a characteristic “saw-tooth” or step profile under con¬ 
ditions of constant velocity or constant force, respec¬ 
tively, that can be compared directly with experimental 
data. This approach was initially used to study the 
unfolding of fibrous proteins, such as titin-127 found 
in muscle tissue; the simulation being akin to the 
mechanical force experienced in vivo. More recent 
application of this method to globular proteins, such 
as ubiquitin and bamase, demonstrated that non¬ 
mechanical proteins react differently under the appli¬ 
cation of force unless simulated as a concatenated 
chain of domains (Daggett 2006; Forman and 
Clarke 2007; Galera-Prat et al. 2010). Moreover, 
the results when using this method can be dependent 
upon the magnitude and acceleration of the force 
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applied, as well as the direction of pulling and the 
position at which the force is applied (Forman and 
Clarke 2007). 

Biased MD Simulations 

There are many types of biased simulations that allow 
the energy barriers on the folding landscape to be 
surmounted without the use of high temperatures or 
excessive force, such as those applied in mechanical 
unfolding simulations. The biasing is achieved by 
selecting a reaction coordinate that varies along the 
folding pathway from the folded to the unfolded state, 
such as increasing radius of gyration or loss of native 
contacts, or by dehning the denatured or transition 
state of interest with a set of experimental restraints. 
A perturbation, or energy bias, is then applied in the 
unfolding direction of the reaction coordinate, making 
movement back toward the native state energetically 
unfavorable. The outcome, however, can be very 
dependent upon the reaction coordinate chosen 
(Caflisch and Paci 2005). 

One biased MD (BMD) approach incorporates 
experimental data pertaining to detected transition 
states, such as (h-values, to generate possible transition 
state conformations. A reasonable assumption is that 
the more experimental data incorporated into the sim¬ 
ulation, the nearer the MD-generated ensemble will be 
to the true transition state ensemble (TSE). Such 
a method was pioneered using forced unfolding simu¬ 
lations and then using experimental O-values to 
restrain the simulation to the proximity of the TSE 
(Caflisch and Paci 2005). The simulations can be com¬ 
pared with the experimental dataset by evaluating 
a theoretical O-value from the fraction of native con¬ 
tacts between the time-dependent conformation, N,, at 
time t and the reference native state structure, Nre/- 
N, 

fl) =- (Li and Daggett 1994). The perturbation 

N,-ef 

applied to the potential energy function favors the 
higher energy transition state conformational ensem¬ 
ble and drives the calculation away from the native 
state to where the deviation between the calculated and 
experimental restraints, the <l)-values, is at a minimum. 
Further, the perturbation only has an effect when the 
system moves away from the target transition state 
conformation and reacts by selecting for those sponta¬ 
neous fluctuations that act in the correct direction to 


take the conformation toward the transition state. 
Hence, to surmount the barrier to the higher energy 
transition state, starting from the native state, the per¬ 
turbation will act to transform some of the kinetic 
energy that occurs from surmounting the energy sur¬ 
face into potential energy instead of resulting in an 
effective temperature increase. In this way, the BMD 
gradually propagates the atomic forces in the direction 
away from the native state toward the conformation 
desired without using excessive force. Most notably, 
this method has been successfully used in conjunction 
with elevated temperature simulations and well vali¬ 
dated against experimental data for the simulation of 
immunoglobulin and immunoglobulin-like domains 
(Caflisch and Paci 2005). 

Replica Exchange Molecular Dynamics 

Replica-exchange MD (REMD) was introduced in 
1999 and successfully characterized a greater degree 
of the potential energy landscape for Met-enkephalin 
than was obtained using conventional methods 
(Schlick 2009). Using numerous copies of the same 
system, called replicas, simulations are run in parallel 
over a range of temperatures as a generalized ensemble 
of non-interacting replicas. The range and distribution 
of temperatures chosen is important and needs to follow 
an exponential distribution in order to keep the 
exchange rate constant. A single trajectory does not 
remain at a constant temperature throughout the simu¬ 
lation; trajectories at neighboring temperatures are 
exchanged periodically allowing trajectories of similar 
potential energy to sample different temperatures. 
These exchanges increase the probability of 
surmounting energy barriers and enable greater sam¬ 
pling of the conformational space. However, the 
exchange between temperatures is not random and is 
dependent on satisfaction of an acceptance criterion 
that ensures a Boltzmann distribution is adhered to 
with overlap between the simulations. The decision to 
exchange between states n and m, with respective 
potential energies E„ and E„„ is done using Metropolis 
criterion based on the probability w,y = exp(—A), where 
A = [P„—(3„,](E„,—E„), and P = l/kT. The merit of this 
method is the ability to sample the energy landscape 
effectively because it removes the limitation of kinetic 
traps and the energetic barriers that exist at lower tem¬ 
peratures. However, this approach is unsuitable for the 
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determination of folding pathways and kinetics due to 
the discontinuous nature of moving between states. 
Further, the method is unable to reproduce experimen¬ 
tal thermodynamic results, providing little conhdence 
in the underlying sampling. An extension to the REMD 
method is the multiplexed approach where, instead of 
simulating a single replica at each temperature, multi¬ 
ple replicas are used (Caflisch and Paci 2005). 

Umbrella Sampling 

Umbrella sampling is an advanced biased MD 
approach that involves compartmentalizing the free 
energy landscape into “windows” defined by 
a chosen reaction coordinate and a set of starting struc¬ 
tures that pertain to each “window” (Schlick 2009; 
Caflisch and Paci 2005). One way of generating 
starting structures to correspond to the spread of 
energy landscape “windows” is to use elevated tem¬ 
perature simulations to unfold the protein. From these 
trajectories, conformations corresponding to points 
across a chosen reaction coordinate, e.g., by using the 
radius of gyration and native contacts as demonstrated 
on segment B1 of streptococcal protein G1 
(Sheinerman and Brooks 1998), can be selected. 
Once generated, the starting structures for each 
“window” are re-equilibrated to 298 K and MD simu¬ 
lations performed in parallel. Further, a bias is applied 
to each “window” to restrain the starting conformation 
to that region on the chosen reaction coordinate, 
allowing the probability of the system in the 
corresponding conformation to be determined from 
the MD simulations for each “window.” The energy 
landscape can then be essentially sutured back together 
from a potential of mean force of these probabilities 
using the weighted histogram analysis method 
(WHAM) (Kumar et al. 1992). The success with 
which the energy landscape is coherently assembled 
is determined by the size of the individual windows 
and the biasing strength. Windows that are distinctly 
separated with no overlap or insufficient populations at 
the joining edges will result in a disjointed energy 
landscape. However, fundamentally this approach 
yields pathways when the pathways are already 
known, i.e., one must supply a reasonable end state 
and, possibly, intermediate states obtained by other 
methods, such as MD, in which case, the path is 
already known, or by assuming a reaction coordinate. 


Transition Path Sampling 

The transition path sampling (TPS) approach is a more 
advanced sampling method that incorporates MC 
moves within an MD trajectory (Schlick 2009; Dellago 
et al. 2006). The method identifies transition states on 
an energy landscape and samples pathways from these 
points that link the native and denatured states. Tran¬ 
sition states are thought of as “bottlenecks” in the 
folding mechanism and correspond with high-energy 
saddle points on the energy landscape. The 
method requires an initial trajectory that provides 
a pathway between the native and denatured states; 
this can be a real or pseudotrajectory obtained using 
any method. TPS then uses Monte Carlo moves 
to identify saddle points; the move being accepted 
based on the relative probabilities of the old and 
new conformations. MD simulations can then be used 
to propagate the motion of the protein from these 
identified transition states to determine pathways 
across the energy landscape linking up the previously 
defined native and denatured states. Multiple sets of 
pathways, the transition path ensemble, are typically 
determined due to the heterogeneity of the energy 
landscape and collectively analyzed to infer a folding 
mechanism. 

Summary 

Computational power is still not sufficient to perform 
explicitly solvated folding simulations of large pro¬ 
teins that fold on the millisecond to second timescale. 
Consequently, the methods detailed here are still as 
necessary now in the investigation of protein folding 
as when they were first introduced. Until computa¬ 
tional hardware has advanced sufficiently that con¬ 
ventional MD simulations can capture the folding 
process, there will always be a choice between 
sacrificing speed or accuracy. Until experimental 
methods are capable of capturing atomic resolution 
structures on a temporal scale, molecular dynamics 
simulations will remain the most powerful technique 
with which to investigate the conformations and 
dynamics of transiently populated states that, 
although can be detected by experiment, are yet to 
be structurally characterized. This chapter has only 
given an overview of some of the most commonly 
used approaches involving MD simulations, yet it has 
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hopefully conveyed the versatility and the adaptabil¬ 
ity of molecular dynamics simulations. The held is 
still developing, and up until now, most research 
efforts have focused on individual or small subsets 
of proteins. However, there is much utility in the 
collation and data mining of the vast amounts of 
simulated data to further our comprehension of pro¬ 
tein folding. One such effort is Dynameomics (www. 
dynameomics.org), an initiative of ours that aims to 
elucidate protein dynamics and unfolding on a large 
scale using a set of proteins that reflect 95% of the 
known autonomous globular protein folds (van der 
Kamp et al. 2010; Schaeffer et al. 2011). Further 
meta-analyses should provide greater insight into the 
dynamic and folding behaviors of proteins. 
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Protein Lysine Methyltransferase - 
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Synonyms 

Histone-Lysine Wmethyltransferase 

Definition 

An enzyme that catalyzes the transfer of a methyl 
group from the S-adenosyl methionine cofactor to the 
terminal nitrogen of lysine residues. 

Basic Characteristics 

Lysine methylation of histones, the proteins around 
which DNA is wrapped in eukaryotic cells, is an 
important mechanism to regulate the structure of chro¬ 
matin and, thereby, the expression of genes (for which 
partial unwinding of chromatin is necessary). Enzymes 
that catalyze this methylation are protein lysine 


methyltransferases, which use the co-factor 5-adenosyl 
methionine (SAM) as the source of the methyl group. 
The specificity of these enzymes for single, double, 
or triple methylation of lysines is an active area 
of research. 

The first computer simulation study into the reac¬ 
tion mechanism and specificity of a lysine 
methyltransferase was performed on SET7/9, which 
specifically catalyzes the transfer of a single methyl 
group to an unmodified histone lysine residue (Hu and 
Zhang 2006). This study used ah initio QM/MM free 
energy perturbation simulations (using B3LYP/6-31G 
(d)/AMBER ff94 for geometry optimization). The free 
energy barrier calculated at the MP2/6-31-i-G(d) QM/ 
MM level was in excellent agreement with the exper¬ 
imental reaction rate. The calculations further indi¬ 
cated that the specificity for mono-methylation over 
di-methylation in SET7/9 is due to the disruption of 
reactive conformations for the di-methylation reaction. 
A subsequent QM/MM study (using slightly different 
methods) from the same group indicated that the cata¬ 
lytic effect of SET7/9 is due to a pre-organized elec¬ 
trostatic environment in the enzyme that facilitates 
methyl-transfer (Wang et al. 2007). 

Rubisco large subunit methyltransferase is a lysine 
methyltransferase that can at least transfer two methyl 
groups to lysine. It has been studied using molecular 
dynamics simulation and *■ QM/MM methods (Zhang 
and Bruice 2007a). ► Molecular dynamics simulations 
indicated that the formation of a water chain may be 
required to allow the escape of the proton to solvent in 
the initial lysine deprotonation step. Subsequently, this 
was also found for the SET7/9, SETS, and viral SET 
lysine methyltransferases (Zhang and Bruice 2007b, c, 
2008). Two-dimensional potential energy surfaces for 
methyl transfers in Rubisco large subunit 
methyltransferase (as well as for SET7/9 and SETS) 
were calculated at the semiempirical SCC-DETB/ 
CHARMM22 QM/MM level. After optimization 
of the transition states from these surfaces, 
MP2/6-31-FG(d,p) calculations were performed to esti¬ 
mate free energy barriers of the reactions. The barriers 
for the first and second methylation of a lysine sub¬ 
strate were in reasonably good agreement with exper¬ 
imental reaction rates, with the second methylation 
being slightly more favorable. A third methylation 
had a higher barrier. Their suggestion that the lack of 
support by the enzyme for the third methylation is due 
to the absence of water chain formation, however, not 
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consistent with the fact that methyl transfer and not 
proton abstraction is rate limiting. Hu et al. further 
studied the SET7/9 and Rubisco large subunit lysine 
methyltransferases with the same QM/MM molecular 
dynamics methods previously employed (Wang et al. 
2007) to investigate the difference in specihcity 
between these enzymes (Hu et al. 2008). The results, 
which conhrm the expected specificities, indicate that 
specificity is primarily controlled by the methyl- 
transfer reaction step. It was found that binding of the 
methylated lysine to SET7/9 opens up a channel that 
allows water molecules from solvent to access the 
active site. This in turn disrupts the active site arrange¬ 
ment, leading to a higher activation barrier for the 
second methylation. In the more spacious active site 
of the Rubisco large subunit methyltransferase, how¬ 
ever, the methylated lysine can be accommodated 
without affecting methyl transfer catalysis. 

Cross-References 

► Protein Folding: Molecular Dynamics Simulations 

► QM/MM Methods 

► Semiempirical Quantum Mechanical Methods 
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Introduction 

Among the many spectroscopic methods which can be 
used to study the structure and dynamics of proteins, 
NMR, while originating in the somewhat recondite 
phenomenon of nuclear spin angular momentum, has 
proved to be arguably the most powerful. The power of 
the method in the study of proteins results largely from 
the fact that nuclei of the same element in different 
chemical and magnetic environments give rise to dis¬ 
tinct spectral lines (the chemical shift), so that - even 
in as complex a molecule as a protein - information 
can be obtained at the level of individual atoms. Fur¬ 
thermore, NMR is sensitive to both the structure and 
dynamics of molecules as well as their interactions. 
This has led to an enormous growth in the application 
of NMR to the study of biological macromolecules 
since the tentative beginnings some 50 years ago. In 
the words of Emsiey and Feeney (1995): “NMR started 
as the plaything of the physicists, became the favourite 
toy of the chemists and finally went on to seduce the 
biochemists.” Summaries of current methods and 
applications of protein NMR are given by Cavanagh 
et al. (2007), Grzesiek and Sass (2009) and Lian and 
Roberts (201 1). A number of other entries give further 
details of topics touched on here (see Cross- 
References); in the interests of space, references are 
only given to reviews and to chapters in Lian and 
Roberts (201 1) as a way into the literature. 

Development of Protein NMR: Dependence on 
Technical Advances 

The first NMR spectrum of a protein was a 40 MHz 
spectrum of ribonuclease reported by Saunders, 
Wishnia, and Kirkwood in 1957, only 12 years after 
the first observation of the NMR phenomenon. This 
first spectrum was almost featureless, and it required 
the development of higher-field spectrometers over the 
next few years to allow the observation of signals from 
individual groups in proteins and the extraction of 
biologically interesting information. In the early 
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Protein NMR - 
Introduction, Fig. 1 Protein 
NMR spectra then and now. 

(a) 100 MHz 'H spectrum 
showing the histidine 
resonances of pancreatic 
ribonuclease (12.5 kDa). The 

* resonances of the four 
histidine residues are shown, 
together with the H®' 
resonance of histidine 105. The 
spectrum was obtained using 
a 6.5 mM solution of 
ribonuclease, averaging 
continuous-wave scans over 
a total time of 6 h. Reproduced 
with permission from Roberts 
GCK et al. (1969). 
Biochemistry 8:2053-2056. 

(b) 800 MHz 'H-‘'^N 
heteronuclear single quantum 
coherence spectrum of 
NADPH cytochrome P450 
reductase (67 kDa). The 
spectrum was obtained with 

a 0.5 mM solution of [^H, '^N]- 
labeled protein. Huang W-C, 
Roberts, GCK, unpublished; 
see also Ellis et al. (2009) 



years of protein NMR, the 1960s, the pioneers were 
particularly the groups of Mildred Cohn (Philadel¬ 
phia), Oleg Jardetzky (Harvard, then Merck Research 
Laboratories), and Robert Shulman (Bell Laborato¬ 
ries). From this and other work in these early years, it 
was clear that NMR did indeed have considerable 
potential to provide structural and dynamic informa¬ 
tion on proteins and their interactions. A full realiza¬ 
tion of this potential has taken some time (Fig. 1) and 
has been dependent on a very striking range of techni¬ 
cal advances in three essential areas: 

Sensitivity. Since NMR is intrinsically a very insen¬ 
sitive technique in terms of the amount of material 
required, improvements in this area have been essen¬ 
tial to the wide applicability of the method to proteins. 
The two key developments have been the introduction 
of pulse Fourier transform NMR by Ernst (1966; Nobel 
Prize, 1991) and the continuing development of 
higher-held spectrometers, currently up to 22.3 T 
(950 MHz for *H). More recently, the introduction of 
cryogenically cooled sample probes and of a range of 
fast data acquisition methods have led to further 


improvements. Together with the advances in molec¬ 
ular biology which have greatly simplihed the produc¬ 
tion of quantities of pure protein, these developments 
have transformed our ability to obtain high-quality 
information from NMR spectra of proteins at fairly 
reasonable concentrations. 

Resolution. To make use in practice of the ability of 
NMR to provide detailed residue-level structural and 
dynamic information requires one to resolve each of 
the many hundred ’H, *^C, and *^N resonances in the 
spectrum of even a small protein. Again the develop¬ 
ment of higher-held spectrometers has been very 
important, but the key development in improving res¬ 
olution has been the introduction of multidimensional 
NMR, in which the resonances are displayed on 
several frequency axes. This originally involved two 
or sometimes three ^H frequency axes, but the devel¬ 
opment of triple-resonance NMR in the 1980s by the 
groups of Bax, Markley, and others, in conjunction 
with the use of progressively more sophisticated 
methods of labeling with the stable isotopes ^H, 
and *^N (Takeda and Kainosho 201 1), led to a range of 
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experiments (Cavanagh et al. 2007) which dramati¬ 
cally improved the resolution and extended the size 
range of proteins which can be studied. 

Assignment. The ability to resolve resonances of 
individual amino acid residues in a protein is of only 
limited use unless they can be assigned to the specific 
residue from which they arise. The combination of 
multidimensional NMR and isotope labeling has pro¬ 
vided widely applicable approaches to resonance 
assignment (Cavanagh et al. 2007; Lian and Barsukov 
2011). For small proteins, assignments can be carried 
out using unlabeled proteins, making through-bond 
(scalar coupling) connections between protons within 
an amino acid, followed by sequential through-space 
(NOE-based) connections between protons in neigh¬ 
boring amino acids. However, there is a limit to the 
size of protein to which this approach can be applied, 
determined essentially by the attainable resolution in 
spectra of unlabeled proteins. A second approach, 
based on triple-resonance experiments, is applicable 
to larger proteins and is now almost universally used; 
this involves labeling recombinant proteins with *^C, 
*^N, and perhaps and using appropriate pulse 
sequences to make through-bond connections between 
the protons and nitrogen and carbon atoms along the 
entire polypeptide chain (Cavanagh et al. 2007; Lian 
and Barsukov 2011). 

Protein Structure 

The most widely recognized achievement of NMR in 
structural biology is its ability to determine the three- 
dimensional structures of proteins in solution. The first 
such structure was reported from Wiithrich’s labora¬ 
tory in 1985 (Wiithrich, Nobel Prize, 2002). Since that 
time, more than 8,900 protein structures solved by 
NMR have been deposited in the Protein Data Bank. 
This remarkable achievement has depended, first, on 
the recognition that a high-quality structure can be 
determined from a large number of rather imprecise 
restraints and, secondly, on the development of 
steadily improving methods for deriving these 
restraints from NMR data and for using them to calcu¬ 
late structures (Vuister et al. 2011; Guntert 2011). The 
first structures depended on short-range restraints on 
inter-proton distances (from NOEs) and on torsion 
angles (from three-bond scalar couplings). Since then 
additional sources of restraints have been introduced, 
including information from chemical shifts and the 
precise identification of hydrogen bonds from scalar 


couplings across the hydrogen-bond. Perhaps the most 
important new kinds of data are the longer-range 
restraints from paramagnetic effects and the orienta¬ 
tional restraints from residual dipolar couplings in 
partially oriented solutions; these have proved 
extremely valuable in improving the quality of struc¬ 
tures, notably for multi-domain proteins. 

The problems of resolution and assignment referred 
to above have so far generally limited full structure 
determination by NMR to proteins of less than about 
50-60 kDa. However, resonances can be resolved and 
assigned in much larger proteins, and valuable infor¬ 
mation can be obtained by a “probe” approach - for 
example, using methyl-TROSY experiments with 
['^C, 'H]-methyl groups in a [^^C, ^H] background 
(Kay 201 1) - in proteins as large as 1 MDa. Membrane 
proteins present a greater challenge to NMR, as they do 
to X-ray crystallography, but both solution-state and 
solid-state NMR methods have recently had significant 
success in this difficult area (McDermott 2009; Hiller 
and Wagner 2009; Renault et al. 2010). In addition to 
the three-dimensional structure, NMR can determine 
the charge state (and pKa value) of individual side 
chains; this was one of the earliest applications of 
NMR to proteins and remains very valuable in view 
of the widespread mechanistic importance of titratable 
side chains. 

The representation of protein structures derived 
from NMR data takes the form of ensembles of struc¬ 
tures which are consistent with the available data. One 
assumes that all the NMR-derived restraints should be 
satisfied simultaneously - that is, that the protein 
of interest exists in a single energy minimum, with 
fluctuations about this minimum represented by the 
distributions seen in the ensemble of structures. (Just 
occasionally one finds, for example, a side-chain atom 
with two mutually inconsistent NOEs, indicating that 
the side-chain adopts two different positions.) 

One of the most exciting developments in protein 
NMR over recent years has been the demonstration 
that it is possible, using relaxation dispersion and para¬ 
magnetic relaxation methods, to detect and character¬ 
ize other, higher energy (low population), states. This 
has been applied, for example, to the characterization 
of intermediates in protein folding and of the initial 
encounter complexes in protein-protein and protein- 
DNA complex formation - in both cases providing 
important mechanistic information (Clore and Iwahara 
2009; Korzhnev et al. 2010; Bashir et al. 2011). 
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Protein NMR - Introduction, Fig. 2 Protein structures deter¬ 
mined by NMR. Ensembles of structures of metallo-P-lactamase 
Bell from Bacillus cereus; (a) Enzyme alone, (b) Enzyme in 
complex with the inhibitor R-thiomandelic acid. The two zinc 
atoms are shown as spheres {cyan in a, orange in b), and the 
inhibitor is shown as a yellow stick model in b. The ensembles 

The structures seen in the NMR-derived structural 
ensemble, representing the most populated state of the 
protein, generally agree closely, but there are often 
regions such as loops or the N- and C-termini where 
there is greater divergence between members of the 
ensemble (Fig. 2). This illustrates one of the great 
strengths of NMR studies of protein structure in solu¬ 
tion, that it can characterize regions of the protein 
which are relatively flexible or disordered. Indeed, 
NMR is essentially the only method available which 
can provide atom-level information on unfolded pro¬ 
teins or on the class of intrinsically disordered proteins 
which is now increasingly recognized as biologically 
important (Jensen et al. 2011). 

Protein Dynamics 

An equally important attribute of NMR in the study of 
proteins is its ability to study protein dynamics in 
atomic detail and over an extraordinary range of time- 
scales. The detection of low-populated states men¬ 
tioned above is one important example of this. Not 
only can the existence of dynamic processes be dem¬ 
onstrated, but the residues involved can be identified 
and limits can be placed on the nature of the process. 
Different NMR methods are sensitive to dynamics on 
different timescales (Mittermaier and Kay 2009; 
Kleckner and Foster 2010): rapid local picosecond- 
nanosecond fluctuations affect spin relaxation rates, 
millisecond-second dynamics affect relaxation disper¬ 
sion profiles and lineshapes and can be also studied by 
zz-exchange experiments, while still slower dynamics 



show the existence of a well-defined structure but with a flexible 
N-terminus and a degree of flexibility in the surface loops, 
particularly around the active site (at the top of the structures). 
Karsisiotis AI, Damblon C, Roberts GCK, unpublished work; 
see also Karsisiotis (2008) 


can be studied in real time or by hydrogen exchange. 
Currently, only residual dipolar coupling experiments are 
sensitive to motions faster than the overall tumbling of 
the protein but slower than ^10 ps. The detailed charac¬ 
terization of protein dynamics by NMR has transformed 
our picture of protein structure. An early example of this 
was the observation, in the 1970s, that the symmetry- 
related ortho or meta protons of phenylalanine or tyrosine 
residues in a number of proteins were magnetically 
equivalent. This could be explained by 180° “flips” of 
the rings, the relatively rare frequencies of which 
(^10^ s“*) could be measured by NMR; simulations 
indicated that these required concerted displacements of 
numerous atoms in the structure. Since that time, detailed 
characterizations of protein dynamics on the picosecond- 
nanosecond timescale and the effects thereon of, e.g., 
ligand binding have become fairly routine, and more 
recently, these studies have been successfully extended 
into the (arguably more physiologically relevant) micro¬ 
second-millisecond timescale (Mittermaier and Kay 
2009; Kleckner and Foster 2010). 

Protein dynamics is not only important in filling out 
our picture of protein structure; it clearly has 
a considerable functional significance in both catalysis 
and allosteric regulation, and NMR has played a key 
role in demonstrating this. Because NMR can provide 
kinetic and structural information on systems at equi¬ 
librium, it has proved possible to study individual steps 
of enzyme catalysis. Not only can the rates of such 
steps be measured, but recent years have seen a number 
of elegant examples of NMR experiments linking 
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Protein NMR - 
Introduction, Fig. 3 Ligand 
binding by NMR. The binding 
of the inhibitor R- 
thiomandelic acid (RTM) to 
the metallo-P-lactamase Bell 
from Bacillus cereus is used as 
an example. {Top) Residues 
whose chemical shifts are 
affected by the binding of the 
inhibitor (chemical shifts 
changes: red > orange > 
yellow) locate the binding site. 
{Bottom) Intermolecular 
NOEs, indicated by red dotted 
lines, position the ligand 
precisely. The two zinc atoms 
are shown as spheres {cyan in 
Top, orange in Bottom). 
Karsisiotis AI, Damblon C, 
Roberts GCK, unpublished 
work; see also Karsisiotis 
( 2008 ) 



specific protein motions to enzyme catalysis (e.g., 
Boehr et al. 2006; Eisenmesser et al. 2007; Loria 
et al. 2008) showing the way toward a full structure- 
dynamics-function picture of enzymes. 

Protein Interactions 

The function of the vast majority of proteins in the cell 
depends upon their noncovalent interactions with other 
molecules, ranging from enzyme-substrate interac¬ 
tions to the array of protein-protein and protein- 
nucleic acid interactions involved in, for example, the 
regulation of transcription. NMR has proved very 
valuable in the study of the interaction of proteins 
with both small and large molecules, yielding results 
ranging from the simple detection of binding to a full 
three-dimensional structure of the complex (Roberts 
201 1; Driscoll 2011). Valuable information on protein- 


protein interactions can be obtained by studying the 
complex between one of the proteins and a relevant 
peptide from the binding partner (the first example of 
this was from Bax’s laboratory in 1992; Ikura et al. 
1992), but for complexes which are not too large a full 
structure determination is possible by making use of 
differential isotope labeling (Driscoll 2011). 

The detection of binding from effects on the reso¬ 
nances of small-molecule ligands has proved valuable 
in drug discovery, while effects on assigned reso¬ 
nances of the protein (“chemical shift mapping”) 
locates the binding site (Fig. 3) and has been success¬ 
fully used in exploring structure-activity relationships 
in drug design programs. Intermolecular NOEs, iden¬ 
tified using isotopically labeled proteins, can position 
the ligand precisely (Fig. 3), often as part of a full 
structure determination of the complex. 
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Complexes of a wide range of affinities and lifetimes 
can be studied and in many cases the lifetime of the 
complex can be determined by NMR. For high-affinity 
complexes, where the exchange between the free and 
complexed states is slow, the limitation on the size of 
complex which can be studied is essentially the same as 
that for protein structure determination, currently 
^50 kDa. However, when exchange between the free 
and complexed states is fast on the NMR timescale (in 
particular, fast compared to the difference in chemical 
shift or relaxation rate between them), valuable infor¬ 
mation about the position, conformation, and ionization 
state of a small-molecule ligand bound even to a very 
large protein can be obtained from effects on the aver¬ 
aged resonances of the ligand, easy to observe if the 
ligand is present in excess. Two of the most valuable 
methods in this context are paramagnetic relaxation 
effects arising from a paramagnetic ion or spin label 
bound to or attached to the protein (an approach first 
developed by Mildred Cohn in the 1960s) and the trans¬ 
ferred NOE, where NOE effects arising in the complex 
are observed on the averaged ligand resonances. As 
noted above, the observation of averaged resonances in 
NMR spectra means that a kinetic model — often simply 
an assumption that the complex exists in a single con¬ 
formation — is required to interpret the structural infor¬ 
mation obtained. 

Summary: Benefits and Limitations 

The clear benefit of NMR is that it allows one to 
determine the structure of proteins in the solution state 
under largely physiological conditions - indeed 
a number of recent papers have shown that it is possible 
to obtain NMR spectra from isotopically labeled pro¬ 
teins introduced into cells. There are of course limita¬ 
tions to the applicability of NMR; (1) There is an upper 
limit to the size of protein whose structure can be 
determined (this is currently ^50 kDa but continues to 
increase), although valuable information short of a full 
structure determination can be obtained on much larger 
proteins. (2) Relatively high concentrations of protein 
are required (though these continue to decrease), and 
the protein must be stable for a few days at room 
temperature. Both solution-state and solid-state NMR 
techniques are beginning to make important contribu¬ 
tions to the study of membrane proteins, though this 
remains a technically challenging area. 


NMR can also provide extensive information on 
protein kinetics and dynamics. This includes protein 
motions over a range of timescales from sub¬ 
nanosecond to seconds and longer, structural and 
dynamic information on disordered regions, and iden¬ 
tification and characterization of states of the protein 
present at low populations. The ability to simulta¬ 
neously obtain structural and dynamic information is 
a particular and important strength of NMR. 

A potential limitation to the interpretation of NMR 
data is the fact that they are time and ensemble aver¬ 
ages. They must therefore be interpreted in terms of 
a model, and the conclusions will be correct only 
insofar as the chosen model is correct. Thus, one nor¬ 
mally assumes that the protein or its complex exists in 
a single state, but this need not of course be the case. 
One should use a model which is as simple as possible 
but no simpler - for example, if a two-state model of 
complex formation is used to analyze a system which 
actually has three or more states, the kinetic and struc¬ 
tural information obtained will not relate to a real state. 
Fortunately, since it is now usually possible to obtain 
NMR data for a very high proportion of the resonances 
of the protein, and since powerful NMR experiments 
are available to measure the rates of interconversion 
between species, the applicability of the chosen model 
can be tested reasonably well. 

Cross-References 

► Assignment of *®F Resonances in Protein Solution 
State NMR Studies 

► CNS (Crystallography and NMR System) 

► CPMG 
► CYANA 

► Electron Paramagnetic Resonance Spectroscopy 

► Fluorescence Correlation Spectroscopy 

► Fragment Screen 

► In-Cell NMR 

► Multidimensional NMR Spectroscopy 

► NMR in Drug Discovery - Introduction 

► NMR Methods for Kinetic Analysis 

► NMR Studies of Macromolecular Interactions - 
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► Nuclear Overhauser Effect 

► Protein Dynamics and Allostery 

► Protein NMR Resonance Assignment 
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► Relaxation Dispersion 

► SAR by NMR 

► Stereo-Array Isotope Labeling (SAIL) Method 

► Structure Determination by NMR: Overview 

► Total Correlation Spectroscopy (TOCSY) in NMR 
Protein Structure Determination 

► Triple Resonance NMR 
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► ZZ-Exchange 
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Overview of Protein Resonance Assignment 

Until the introduction of sequential assignment proce¬ 
dure developed by Kurt Wiithrich and his coworkers in 
1980s (Wiithrich 1986), most of the protein assignment 
work was accomplished with reference to the crystal 
structure. Therefore, the establishment of the sequential 
assignment procedure was a mile stone for the protein 
NMR. Backbone amide proton (H^) and a proton (H®) 
signals were sequentially assigned based on the dis¬ 
tance information between and and were 

aligned on the amino acid sequence of the particular 
protein. This facilitates NMR to be independent from 
X-ray crystallography and the structure of proteins in 
solution could be determined by NMR using the assign¬ 
ment of proton signals and proton-proton distance 
information. However, due to limited resolution in *H 
2D-NMR spectra, the molecular weight of the target 
protein is restricted to be less than 8 kDa. 

In order to overcome the molecular weight limita¬ 
tion, stable isotope-aided NMR method was presented 
by Ad Bax and his coworkers. They developed a suite 
of ► triple resonance 3D NMR methods using *^C/*^N 
uniformly labeled proteins (Bax and Grzesiek 1993). 
Using much larger hetero nuclear ► J couplings 
compared to ^Jhh> efficient magnetization transfer 
between coupled nuclei could be achieved. A number 
of sophisticated pulse sequences were developed to cor¬ 
relate the chemical shifts of the pair of the backbone 
and *^N to those of H“, *^C', side chain carbons and 
protons in its own and preceding residues, thus facilitat¬ 
ing sequential assignment. This strategy was widely used 
and the proteins of ca 8-25 kDa could be assigned and the 
structure of proteins could be determined only by NMR 
(Bax and Grzesiek 1993; Cavanagh et al. 2007). 

For proteins larger than 25 kDa, perdeuteration of 
proteins (Kay 2001) and ► TROSY detection 
(Wiithrich and Wider 2003) are useful for the assign¬ 
ment purpose and the assignment more than 50 kDa is 
possible (Kay 2001). 

In the following section, the resonance assignment 
procedure for proteins of different molecular size is 
discussed. 

Sequential Resonance Assignment Strategy 

Here, the sequential assignment strategy based on the 
analysis of ^H 2D-NMR spectra of proteins will be 



Protein NMR Resonance Assignment, Fig. 1 Sequential res¬ 
onance assignment using COSY and NOESY spectra. 
is connected to H“i_i by dipole-dipole interaction. The sequen¬ 
tial walk starting from Hi“ and ending at is shown in 

solid line 

briefly outlined (Wiithrich 1986). This method is com¬ 
prised of two stages. The first involves the identifica¬ 
tion of the spin systems of amino acid residues. This is 
achieved by the analysis of the DQF-COSY or 
► TOCSY experiments which identify NMR reso¬ 
nances connected by ► spin-spin couplings (^Jhh is 
usually less than 8 Hz). Each amino acid has a specific 
spin-spin coupling pattern and is assigned to 
a particular amino acid types. The second is to link 
each amino acid residue using NOESY information. 
This is based on the evidence that the distance between 
the back-bone H^; and H“i_i is within 5 A and thus, 
dipole-dipole interaction between these protons can be 
detected in the NOESY spectrum. The assignment 
process using NOESY and DQF-COSY is schemati¬ 
cally shown in Fig. 1, where the COSY spectrum is 
shown in lower right half and the corresponding 
NOESY spectrum in the upper left half. Here, starting 
from the DQE-COSY diagonal peak (Hi“), the DQE- 
COSY cross peak (H;^, H;”) and the NOESY cross 
peak (H^i, H^i-i) are found through the diagonal 
peak H^^i. Thus, the walk arrives at the diagonal peak 
of H°'i_i. This process is repeated to link each preced¬ 
ing amino acid residues. Amino acid types have been 
already identified by DQE-COSY or TOCSY so that 
the sequential connectivity is aligned along the amino 
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acid sequence of the target protein. The spiral pattern 
linked by solid lines in Fig. 1 is remarkable when the 
sequence forms a P-strand. This spiral pattern is called 
a P-snail and is useful to identify the p-strand in the 
protein, while strong NOE correlation between amide 
protons (H^i, indicates the presence of the 

Qt-helix. Thus, NOESY pattern is characteristic to the 
► secondary structure of the target proteins and is 
often used to identify the secondary structure in the 
proteins (Wiithrich 1986). 

However, due to overlap of signals in the *H 2D- 
NMR spectra, the sequential resonance assignment 
strategy generally fails for proteins more than 8 kDa, 
but this strategy is not only of historical importance but 
also useful for the assignment of small proteins that are 
not isotopically labeled with nuclei. 


Assignment Strategy Using Triple Resonance 
3D-NMR 

uniformly labeled proteins are useful to reduce the 
resonance overlap of amide protons where each signal 
can be resolved according to the chemical shift of 
directly bonded nuclei in the ► HMQC/ 

HSQC spectra. The HSQC spectrum is useful 

as it provides the chemical shift correlation of the 
amide proton and directly connected amide nitrogen. 
Thus, each amino acid residue gives a single peak in 
the HSQC spectrum. The 'H-'^N HSQC spec¬ 

trum is a starting point for the sequential assignment of 
amino acid residues using the ► triple resonance 
3D-NMR (Bax and Grzesiek 1993). In the triple reso¬ 
nance 3D-NMR experiment, proteins are uniformly 
labeled with *^C and so that, correlation peaks in 
the *H-*^N HSQC spectmm can be resolved according to 
the chemical shift of ^^C in the third dimension, thus, the 
peaks in the triple resonance 3D-NMR spectmm gives 
a chemical shift correlation among *H, *^N, and '^C. 

The triple resonance 3D-NMR was proposed as 
a more general assignment procedure for the main 
chain amide protons and backbones (Bax and Grzesiek 
1993). Since the nitrogen and carbon atoms (C“, C' and 
side chain carbons) are uniformly labeled with and 
*^C, respectively, protons, carbons, and nitrogens are 
connected with each other via large ► spin-spin 
couplings (Eig. 2). Large spin-spin couplings enable 
efficient magnetization transfer between coupled 
spins. The basic scheme of sequential assignment by 


140Hz I 140Hz 
H-■'^CP-H 


-15N 

--95Hz| 

H 

Protein NMR Resonance Assignment, Fig. 2 Spin-spin cou¬ 
pling constants observed in the backbone of the '^N and 
uniformly labeled proteins 

the triple resonance 3D-NMR is straightforward. Eirst, 
the magnetization of the amide protons is transferred to 
the amide nitrogens (*^N) through 'Jh'^n ("^95 Hz), 
and subsequently transferred to in its own and the 
preceding residues through (H Hz) and 

(^^7 Hz), respectively. This pulse sequence, 
named HNCA (Cavanagh et al. 2007), is designed to 
encode and chemical shifts. Einally, the 
magnetization is transferred backward to the original 
amide proton and is detected. Thus, the cross peaks in 
HNCA are correlated with the chemical shifts of ^Hj, 
*^Ni, and in its own residue and in its 

preceding residue. When we take a strip at the specific 
*H and chemical shift plane, we observe two peaks 
corresponding to intra-residual ('H,, '^Nj, '^Ci“) and 
inter-residual (*Hi, '^Nj, '^C^i-i) cross peaks. Another 
pulse sequence named HN(CO)CA (Cavanagh et al. 
2007) was proposed to discriminate these two cross 
peaks, where in HN(CO)CA, the amide proton magne¬ 
tization is transferred to *^COi_i, thus 

restricting the magnetization transfer pathway only to 
the preceding residue and the single cross peak 
corresponding to the chemical shift of i_i can be 
observed. Main-chain backbone assignment procedure 
using HNCA and HN(CO)CA is schematically shown 
in Fig. 3, where the HN(CO)CA and HNCA strips 
dissected from each (*H, *^C) plane for ith residue is 
aligned according to the connectivity of a carbons 
in the sequence. The cross peaks corresponding to 
*^C“ i_i is assigned by HN(CO)CA experiment, thus 
discriminating the cross peaks of ^^C“ i_i (filled circle 
in Fig. 3) from ^^Ci“ (open circle in Fig. 3) in HNCA 
The cross peaks of C“ carbons are connected on the 
strip as shown in dotted lines (Fig. 3). In principle, the 
sequential assignment of the main chain protons, nitro¬ 
gens, and carbons can be possible when this process is 
repeated along the sequence. However, the chemical 
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Protein NMR Resonance 
Assignment, Fig. 3 Scheme 
of the sequential assignment 
by HNCA and HN(CO)CA 


Scheme of sequenciai assignment by HNCA and HN(CO)CA 

i-1 i i+1 

HNCA HNCA HNCA 

HN(CO)CA HN(CO)CA HN(CO)CA 



HNCA 


cp cp 



i-1 O i O 


HN (CO)CA 
Cp 


■N —C— N*- 
H ^ II H*^ 
1-1 O 


Cp 

Ca —C 
II 

i O 


shift dispersion is small in 0“ chemical shifts so that it 
is generally difficult to align residues on the sequence 
solely based on the chemical shifts of As the 
chemical shift of is more disperse, neighboring 
residues can be uniquely connected using the chemical 
shifts of both C“ and C^. It should be noted that Ala 
(ca 30 ppm) and Ser C^/Thr (ca 60-70 ppm) have 
unique chemical shifts, facilitating the unique assign¬ 
ment of Ala or Ser/Thr types. Starting from these 
residues, the amino acid residues can be connected on 
the primary sequence. HNCACB (Cavanagh et al. 
2007) and CBCA(CO)NH (Cavanagh et al. 2007) 
were developed to connect the amide protons to 
and in its own and the preceding residues. Unam¬ 
biguous backbone assignments can be possible using 
these measurements for proteins of 8-25 kDa. The side 
chain carbon and proton resonances can be assigned 
using CC(CO)NH TOCSY and HCC(CO)NH TOCSY 
(Cavanagh et al. 2007), respectively, where the side 
chain carbon and proton resonances in the preceding 
residue are correlated with and . Thus, the 
assignment of the backbone and side chain protons, 
carbons, and nitrogens are established. 

^H/*^C/^^N triple labeling is useful for the assign¬ 
ment of proteins of 25-100 kDa. The preparation of 
perdeuterated proteins and the assignment method 
were established by Lewis Kay and his coworkers 
(Kay 2001). Despite high molecular weight proteins, 
perdeuteration is quite efficient to reduce the relaxation 
rates, and the assignment of the larger proteins was 


successfully made by the triple resonance 3D-NMR spec¬ 
troscopy incorporating the ► TROSY pulse sequence 
(Wiithrich and Wider 2003). Resonance assignment is 
crucial for the study of protein-protein and protein-ligand 
interactions as well as the structure determination. Based 
on the chemical shift changes induced by binding of the 
effector proteins or ligands, the detailed molecular inter¬ 
action can be elucidated. Thus, the assignment of protein 
signals is indispensable for both functional and structural 
studies of the target proteins. 

From the chemical shifts of *^N, 

*H“, and in each residue, the secondary structure 
of the target proteins are predicted using the 
► Chemical Shift Index or the TALOS -i- program 
(Shen et al. 2009), which predicts torsion angles 
((jti, v|/i) using information derived from the backbone 
chemical shifts and amino acid types. 

The assignment process in the triple resonance 
3D-NMR is straightforward but tedious, so that several 
programs have been proposed to accomplish the sequen¬ 
tial assignment efficiently. Olivia can correlate several 
types of the triple resonance 3D-NMR spectra to find the 
most probable alignment to the sequence on the spectral 
basis (http;//fermi.pharm.hokudai.ac.jp/olivia/). 

Large coupling constants ensure efficient magneti¬ 
zation transfer. It should be noted that ^JcaN is around 
7 Hz compared to 11 Hz of ^JcaN> thus permitting 
magnetization transfer from to in its own 
and in its preceding residues. This enables sequential 
assignment of the backbone resonances. 
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HN(CO)CA and HNCA strips dissected from 
(*H, *^C) plane at the same chemical shift are 
aligned for (i—l)th, ith and (i + l)th residues. In the 
HNCA strip, two peaks corresponding to Ci“ and C“i_i 
are observed, where in the HN(CO)CA strip, single 
cross peak corresponding to C“i_i is observed. The 
sequential walk in the HNCA and HN(CO)CA is 
shown in dotted lines. 


Use of Amino Acid Selectively Labeled 
Proteins for Assignment Purpose 

Identihcation of amino acid types is essential for reso¬ 
nance assignment of larger proteins but because of the 
degeneracy of the chemical shifts, discrimination of 
amino acid types based on the chemical shift is gener¬ 
ally ambiguous. In order to remove this ambiguity, 
preparation of amino acid selectively labeled proteins 
is useful (McIntosh and Dahlquist 1990). 

Amino acid selectively labeled proteins are 
prepared using non-labeled amino acid medium 
supplemented by ^^N specifically labeled amino acid 
in bacteria, where only specific type of amino acid 
can be observed in *H-*^N HSQC spectrum, thus facil¬ 
itating to reduce spectral overlap and give the 
type-specific assignment of amino acid residues. 

Amino acid selective *^CO labeling under the 
uniform *^N background using M9 medium 
supplemented with *^NH 4 C 1 as a sole nitrogen sources 
can be prepared for the purpose of the sequence- 
specific resonance assignment. In combination with 
2D HNCO spectrum, a pair of residues connected by 
'^CO'^NH can be determined, as was originally 
proposed by Kainosho (Kainosho and Tsuji 1982). 
Thus, the sequence-specific resonance assignment can 
be accomplished and the sequential assignment based 
on the triple resonance 3D-NMR method can be 
confirmed. This strategy was successfully applied to 
obtain whole sequential assignment by using wheat 
germ cell-free system, which allows for production of 
all 20 amino acid selectively labeled proteins (Morita 
et al. 2004). 

Cross-References 

► Assignment of Resonances in Protein Solution 
State NMR Studies 


► NMR Studies of Macromolecular Interactions - 
Introduction 

► Protein NMR - Introduction 
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Definition 

Protein phosphorylation is a reversible posttranslational 
modification of proteins or other molecules, involving 
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the covalent addition of a phosphate (PO 4 ) group onto the 
molecule. The most common donor is the y-phosphoryl 
(P 03 ^~) group of adenosine-5'-triphosphate (ATP), and 
the acceptors, on proteins, are typically serine, threonine, 
or tyrosine residues (canonical phosphorylation); less 
often, other amino acids can he phosphorylated 
(noncanonical phosphorylation) (Berg et al. 2008). 

Basic Characteristics 
What Is Phosphorylation? 

Phosphorylation of proteins is an ancient mechanism; 
it was first identihed in eukaryotic organisms 
(Fischer and Krebs 1955), but is now known to be 
present in all three domains of life (Eukarya, Bacteria, 
and Archaea). The reaction is catalyzed by enzymes 
known as protein kinases (also known as 
phosphotransferases), and it is one of the most 
ubiquitous posttranslational modihcations known; it 
is estimated that up to 30% of all human proteins can 
be phosphorylated. The removal of the phosphoryl 
group - dephosphorylation - is catalyzed by enzymes 
collectively termed phosphatases. A range of ever- 
evolving approaches from mass spectrometry through 
labeling intact cells to computational methods are used 
to identify phosphorylation sites and the functional 
effects of the modification (Hardie 1999). 

Protein phosphorylation may, broadly, be divided 
into two groups: canonical and noncanonical phospho¬ 
rylation. Canonical phosphorylation is where the 
phosphorylated residue is either serine, threonine, or 
tyrosine, and this accounts for the majority of 
phosphorylation events in the cell. Of these, phospho- 
serine is most common and phospho-tyrosine is the least 
common. However, these are not the only residues that 
may be phosphorylated. The two-component signaling 
system, which is found in all domains of life, involves 
a histidine kinase, and a subsequent transfer of 
the phosphoryl group to an aspartate residue (Mascher 
et al. 2006). Lysine and arginine residues may also be 
phosphorylated. 

Protein Kinases 

Protein kinases are one of the largest protein families 
known (Hunter 1991) - the human genome encodes 
518 protein kinases (^2% of the genome). The protein 
kinases are classified into two broad groups based on the 
residue being phosphorylated: serine/threonine kinases 


(where the phosphorylated residue is either a serine or 
a threonine) and tyrosine kinases (where the phosphory¬ 
lated residue is a tyrosine). This grouping can be blurred 
as kinase specihcity can vary greatly, and there 
are known examples of dual-specificity protein kinases 
(a kinase that is able to act as both a serine/threonine and 
tyrosine kinase). Serine/threonine kinases are further 
sub-divided into eight major groups: AGC, CAMK, 
CKl, CMGC, STE, RGC, Atypical, and TKL (tyrosine 
kinase-like). The two major groups of tyrosine kinases 
are receptor tyrosine kinases and non-receptor tyrosine 
kinases. Proteins may contain more than one kinase 
domain, and each domain may be capable of phosphor- 
ylating multiple (i.e., different) substrates. To complicate 
matters further, a single residue in a protein may 
be phosphorylated by different protein kinases. This can 
result in very complicated signaling networks, but 
provides the cell with multiple levels of regulation that 
can be very hnely controlled. 

Despite the many families and subfamilies, and the 
fact that protein kinases can differ in their size and 
specificity, the catalytic core of the protein kinase 
domain is highly conserved both at the sequence and 
structure levels. The PROSITE database (http:// 
expasy.org/prosite) has created two signature patterns 
for the protein kinase family, both located in catalytic 
domain. The first is the N-terminal glycine rich motif 
with a conserved lysine that has been shown to be 
involved in ATP binding. The second is located in 
the central part of the catalytic domain, and has two 
forms: one for serine/threonine kinases and another 
one for tyrosine kinases. Both of these signatures con¬ 
tain a conserved aspartic acid that forms part of the 
active site. If both signatures are present in a sequence 
then the probability of it being a protein kinase is close 
to 100%. The structure of the catalytic domain of the 
protein kinases is also highly conserved (Taylor et al. 
1993); it consists of two lobes separated by a deep 
cleft. The smaller lobe is mainly comprised of 
the N-terminal sequence and is dominated by an 
anti-parallel P-sheet, whereas the larger lobe is mostly 
a-helical. The catalytic domain also contains two 
highly conserved loop regions: the glycine-rich loop, 
in the smaller lobe, has an important role in anchoring 
and positioning the phosphate moiety, while the 
catalytic loop, in the larger lobe, helps position the 
substrate protein and direct the catalytic event. There 
is a large amount of conformational flexibility within 
the catalytic domain with two extreme conformations: 
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“open” and “closed.” The apoenzyme is typically in 
the open conformation, and binding of ATP or the 
substrate causes the conformation to change to 
the closed form. ATP binds in the cleft formed between 
the two lobes, and in the closed conformation is 
almost completely buried. It is the smaller lobe that 
predominately forms the binding surface for ATP, and 
it forms a tight interaction with the atoms that are not 
involved in phosphotransfer. The larger lobe predom¬ 
inately forms the binding surface for the substrate 
protein. 

Although some protein kinases are capable of 
phosphorylating many different substrates, a protein 
kinase needs to have a sufficient level of specificity to 
perform a regulatory function in the cell. The two main 
factors that affect the specificity of a protein kinase 
are “peptide specificity” and “substrate recruitment” 
(Kobe et al. 2005). Peptide specificity is the molecular 
recognition of the peptide sequence (linear motif) 
surrounding the phosphorylated residue of a substrate, 
and substrate recruitment is any process that brings 
a kinase and substrate together. One of the most effec¬ 
tive forms of recmitment is autophosphorylation. 
Within the cell, the relative contributions between 
peptide specificity and recmitment vary with the 
overall level of phosphorylation determined by 
these factors and the by dephosphorylation (by phos¬ 
phatases). In general, tyrosine kinases have a lower 
specificity than serine/threonine kinases. 

The Role of Phosphorylation in the Cell 

Phosphorylation and dephosphorylation regulate 
proteins as a molecular switch. The addition of 
a phosphoryl-group can result in altered activity of 
the protein in three main ways (Johnson and Lewis 
2001). Firstly, the phosphoryl-group has two negative 
charges. This alters the electrostatics of the proteins 
surface, and can significantly alter the proteins binding 
and catalytic activities. Secondly, the phosphoryl-group 
has three hydrogen bond acceptors, so the phos¬ 
phorylated protein has the potential to create new 
bonds. Thirdly, the addition of the phosphoryl-group 
alters the size and shape of the protein surface, creating 
a new binding site, a mechanism very common in 
signaling (Pawson and Scott 1997). The process of 
phosphorylation/de-phosphorylation can take place in 
less than a second or hours; therefore, the cell is able to 
adjust the kinetics of phosphorylation to gain a very 
finely tuned control of regulation. One important form 


of regulation of many protein kinases is auto¬ 
phosphorylation, where a kinase is able to phosphory- 
late itself. An example of protein kinase is receptor 
tyrosine kinases, cell surface receptors that allow the 
cell to respond to external stimuli. Series of protein 
kinases can form kinase cascades, where each kinase 
phosphorylates and activates a downstream kinase; as 
each kinase can phosphorylate many substrate kinases, 
this leads to significant amplification. 

Phosphorylation and Disease 

As phosphorylation is used to control many aspects 
of cell growth, movement, and death, the breakdown of 
proper kinase activity is often associated with disease, 
particularly cancer. One of the best known examples of 
this is the proto-oncogene c-Src, a non-receptor protein 
tyrosine kinase, whose elevated expression and kinase 
activity have been linked to cancer (Lahiry et al. 2010). 
Src is a three-domain protein (SH2, SH3, and a tyrosine- 
kinase domains) itself controlled by phosphorylation; 
there are two major phosphorylation sites, with the first, 
located at the C terminus of the protein - a tyrosine that, 
when phosphorylated, deactivates the protein by asso¬ 
ciating with the SH2 domain. Src can be activated 
through the dephosphorylation of this residue. The sec¬ 
ond phosphorylated residue is also a tyrosine modified 
by autophosphorylation. In the cell, Src is in the 
deactivated state for most of the time, and mutations in 
Src itself (causing it to become mostly active) or the 
kinases that control its regulation (either through the 
failure to properly phosphorylate the C terminal site or 
failure to dephosphorylate it) can have very serious 
consequences; for example, the substrates Fak and Cas 
are involved in integrin signaling, and She and Stat3 in 
growth regulation. For these reasons, Src, as well as 
other protein kinases, has become important dmg tar¬ 
gets, and kinase inhibitors are important therapeutics, 
for example, for the treatment of cancer and inflamma¬ 
tion (Li and Stafford 2009). 

Cross-References 

► Linear Motifs in Protein-Protein Interactions 

► Phosphatases - Computational Studies 

► Post-Translational Modifications 

► Post-Translational Modification of Flistone Proteins 

► Regulation of Protein Surfaces Through 
Posttranslational Modification 
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Protein Secondary Structure Prediction 
in 2012 
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Synonyms 

Alpha-helix prediction; Beta-strand prediction; 
One-dimensional protein structure prediction; Protein 
structure prediction 

Definition 

Protein secondary structure prediction aims at the 
prediction of secondary structure on the residue level 


from sequence information alone. Predicted are 
commonly alpha-helices and beta-strands, i.e., the most 
prevalent regular secondary structure segments. On the 
opposite side of regular secondary stracture are irregular 
or disordered regions often also referred to as loops, 
random coils, or disorder. 

Introduction 

More than a decade ago, science leaped when putting 
up the almost entire blueprint for human life. Now that 
the parts are known, can this blueprint be used as 
a manual to understand how the machine works? 
“Like with every proper manual, usually we do not 
find the information we need and in the rare cases 
that we do, we do not understand the answer” jokes 
Anna Tramontano (La Sapienza, Rome). Moreover, 
every year since, new surprising findings gave 
glimpses at how incomplete the knowledge actually 
is. Despite immense advances in molecular biology 
over the last 10 years, substantial experimental infor¬ 
mation for most human proteins remains missing. 
Structural biology has also leaped over the last 
10 years: Over 80% of the experimental high- 
resolution structures known today have been deter¬ 
mined after 2000. Parallel to this increase, the number 
of proteins of known sequence has exploded. In 
December 2011, about 72 K experimental protein 
structures were in the Protein Data Bank (PDB: 
www.pdb.org; Berman et al. 2000) as opposed to 
about 19 M protein sequences in UniProt (2012). 

Secondary structure is arguably the simplest, mean¬ 
ingful aspect of protein stmcture. It is derived from the 
amino acid sequence (misleadingly dubbed “primary 
structure” in the past) and serves as something like the 
Lego bricks to form the full three-dimensional (3D) or 
tertiary structure. In terms of information content, 
secondary structure is essentially one dimensional (ID) 
as it can be mapped onto a string of letters that assign 
a secondary stmcture state to each residue. Linus Pauling 
pioneered the idea that protein stractures consist of 
regular substmctures, namely secondary stracture, before 
any 3D stracture had been experimentally observed. 
He proposed many types and shapes for such substruc¬ 
tures all stabilized by hydrogen bond formation. 

One of these substructures, alpha-helices (Fig. la), 
was observed a decade later when the first 
high-resolution protein structures were solved: 
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Fig. 1 Regularsecondary structure. The protein shown is 
rad60, which mimics Small Ubiquitin-like Modifier (SUMO) 
proteins. Its 3D structure has been experimentally determined 
at an extremely high level of detail (i.e., to the very high resolu¬ 
tion of <1 A, PDB identifier 3goe, Prudden et al. 2009). Alpha- 
helices are shown in orange (a) and beta-sheets are indicated as 


green arrows (b). Alpha-helices are stabilized by hydrogen 
bonding usually between residues i and i + 4 (a, hydrogen 
bonds marked by blue dashes), while beta-sheets are formed 
between beta-strands that connect residues more distant in 
terms of their sequence separation (b, hydrogen bonds marked 
by blue dashes). Beta-sheets typically involve hydrogen bonding 
of every other residue 


myoglobin and hemoglobin (Kendrew et al. 1960; 
Perutz et al. 1960). As both proteins seemed to differ 
substantially in terms of their sequence, it appeared for 
some time that helices were the only regular secondary 
structure realized in proteins. This turned out to be 
a premature expectation; the second major regular 
substructures are beta-strands that aggregate into 
beta-sheets (Fig. lb). 

Protein Structure Classification Based on 
Secondary Structure 

Michael Levitt proposed to classify proteins according 
to these main constituents into alpha, beta, and alpha -i- 
beta (Levitt and Greer 1977). He thereby introduced 
the first step that is still at the heart of two of today’s 
major protein structure classifications, namely of 
CATH (www.cathdb.info; Orengo et al. 1997) and 
SCOP (scop.mrc-lmb.cam.ac.uk/scop; Murzin et al. 
1995). A more recent addition to structure classifica¬ 
tion is COPS (topsearch.services.came.sbg.ac.at; 
Suhrer et al. 2009) that importantly deviates from this 
concept by not using derivatives of the original Levitt- 
classes for any major classification step. However, 
COPS still uses the concept of secondary structure. 

There are different types of helices and strands, and 
there are several methods that automatically assign 


those types from 3D coordinates as deposited in the 
PDB. Most often used is DSSP (Kabsch and Sander 
1983). DSSP identifies hydrogen bonds through 
a simple electrostatic energy, and then assembles 
regular patterns of hydrogen bonds into its eight 
classes. About 32% of residues for which experimental 
structures are available can be classified as alpha-helix 
and 18% as beta-strand. All other residues are in 
regions that appear less regular. Often they are all 
referred to as loops or even more misleadingly as to 
random coils. 

Application of Secondary Structure 
Prediction 

Secondary structure helps in many contexts, for 
example, when aligning proteins in the twilight and 
midnight zones of sequence comparisons (Rost 1999) 
in which evolutionary sequence relations are hardly 
recognizable. It can be used to verify comparative 
models, to predict protein structure, and to 
support guesses about protein function and evolution. 
Furthermore, it serves as maybe the most important 
input feature used by higher-level prediction methods 
that address questions beyond structure such as 
predictions of protein binding, functional classes, 
and subcellular localization. Secondary structure 
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predictions are also one of the most important input 
features used by methods that predict the effect of 
non-synonymous SNPs, i.e., single amino acid 
changes, or more generally of methods that explore 
dynamical features of proteins. Thus, starting 
with good predictions of secondary structure always 
is a good idea. 

Prediction Continuousfy Improved to 
over 80% Q3 

Performance Measure. The simplest way to measure 
the performance of secondary structure prediction is 
the three-state per-residue accuracy (Q3) that describes 
the fraction of all residues correctly predicted in either 
of the three states: helix, strand, and others (frequently 
referred to as “loop” or “coil”). Some proteins are 
easier to predict than others, and the prediction 
accuracy is compiled over a distribution (Fig. 2). 

Before the first protein structure was solved, A.G. 
Szent-Gydrgyi predicted secondary structure based on 
the intrinsic biophysical features of proline residues, 
which are known to break helices. Today it is known 
that while often true, this simple rule on its own makes 
a very poor prediction method. In fact, the following 
statement is overwhelmingly valid: Simple rules that 
may suggest a pseudo-understanding of protein 
structure formation do not suffice to predict structure 
from sequence. 

First-Generation Methods. A major step forward in 
secondary structure prediction condensed around the 
deep mind of Barry Robson. The first steps compiled 
simple statistics that measured the preference of 
a certain amino acid for particular types of secondary 
structure. Jean Gamier and David Osguthorpe 
complemented this and shaped the first “winner” 
among these methods, called GOR (Gamier et al. 
1978). This hrst type of method reached levels of Q3 
around 55% (Rost et al. 1993). 

Second-Generation Methods. The next big step 
forward expanded the simple single residue statistics, 
i.e., what are the odds of a proline residue to be in 
a helix, to the level of segments of several consecutive 
residues, i.e., what are the odds of a proline residue 
flanked by other residues on either side to be in a helix. 
GOR3 (Gamier et al. 1996) was one of the most 
successful representatives of these methods that 
reached Q3 levels around 62% (Rost et al. 1993). 
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Fig. 2 Distribution of Q3 scores. Q3 gives the percentage of 
residues correctly predicted in one of the three states: helix, 
strand, and other. Here, the Q3 score is calculated individually 
for each protein chain in a set of 2546 nonredundant protein 
chains. Plotted is the distribution of Q3 scores for this dataset. 
The distribution averages to a Q3 score around 82% with a range 
from 33% to 100%. This distribution originates from ReProfSec 
predictions; it is likely similar to that of today’s top-prediction 
method(s). Which side of the distribution is your protein likely to 
be on? The best way to answer that question is to explore the 
reliability index (Fig. 3): If many residues in your protein are 
predicted at RIs above average, your protein is more likely to sit 
on the right-hand side 

Secondary structure formation is determined by 
local interactions within an N-residue fragment, and 
global interactions involving partners from outside 
a fragment of N residues. Second-generation methods 
predicted the secondary stracture of the residue at 
the center of a fragment of N consecutive residues, 
with N ranging from 5 to 50 addressing only local 
interactions. Maybe only 62% of the interactions 
are determined locally, and prediction accuracy is, 
therefore, capped at 62%. 

One way to address this hypothesis would be to use 
larger fragments. The empirical hnding was that 
N = 11 was better than N = 9 was better than N = 8 
and so forth. However, there is a limit: N = 50 is not 
better than N = 11. This observation could mean that 
global interactions do not matter much. On the other 
hand, this finding can also be explained differently. It 
is probable that global interactions are important, and 
that longer fragments are dominated in terms of their 
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information content by noise, i.e., increasing 
N decreases the signal-to-noise ratio. How to access 
more global information without changing this ratio? 

Third-Generation Methods. This challenge was 
addressed by the third-generation methods that use evo¬ 
lutionary information to embed global information and to 
increase the information density (signal-to-noise ratio). 
To achieve this, users have to first build a multiple 
sequence alignment using proteins related to the query 
sequence for which they seek the prediction. When intro¬ 
duced, these methods used relatively simple pairwise 
alignments against databases that were small by today’s 
numbers. Nevertheless, performance was immediately 
boosted to levels above 72%. The first method that 
surpassed a sustained level of Q3 > 72% was PlTDsec 
(Rost 1996). Several other improvements specihcally 
addressed issues such as improved prediction for 
beta-strand and predictions of helices and strands 
that resemble observed regular secondary structure 
more than fragments thereof. PROFsec (Rost 1996) 
improved by extending the sequence profiles, 
combining predictions of solvent accessibility and sec¬ 
ondary structure. Better database search methods and 
larger databases brought the performance to levels of 
78% (Jones 1999; Przybylski and Rost 2002). Due to 
the ongoing search for improvements and the increase of 
available data today’s methods, e.g., ReftofSec, are 
approaching 82% Q3 (Hoenigschmid & Rost, 
unpublished data). 

Prediction Based on Known Structures. All the 
above numbers hold for proteins that have no significant 
similarity to proteins for which experimental structures 
are already available. However, the structural coverage, 

i.e., the percentage of proteins that can be modeled 
based on the known experimental structures is 
increasing steadily (Kiefer et al. 2009). Most proteins 
known today have, therefore, some local region for 
which some aspects of structure can be modeled. 
Are these models better than proper secondary structure 
prediction methods when Q3 is measured? 

The answer depends on the quality of the model, i.e., 
ultimately on the similarity between the protein under 
investigation and the protein with a similar sequence for 
which an experimental stracture is available. For high 
similarity, reading secondary structure predictions off the 
comparative models is better than using expert prediction 
methods. However, for low levels of similarity, the 
experts are better (Marti-Renom et al. 2002; Eyrich 
et al. 2003). Most models that can be built 


through comparative modeling based on low similarity, 
i.e., secondary structure prediction experts, are still 
usually the best bet. However, the increased information 
stored in data banks have given rise to new methods 
that rely more heavily on available stractural data 
(Bau et al. 2006; Zhang et al. 2011). 

Hints for Users 

1. Find the Right Method. Good and mediocre second¬ 
ary structure prediction methods differ substantially 
in their usefulness. Newer methods are sometimes 
better than older ones, but the newest methods are 
not necessarily the best. More readily available 
methods are also not necessarily better. Thus, the 
first advice is to find a few of the good methods. 

2. Compare methods hut rely more on reliability 
indices than on consensus. Assume you used the 
“top methods” for your protein. Which one of these 
should you use to continue? Many users tend to 
believe more in residues that are predicted the 
same way by different methods, i.e., that would 
exploit some consensus between the methods 
(Zhang et al. 2011). There are many good reasons 
for expecting that such consensus or averages help. 
However, in a field in which you have tools that are 
as diligently crafted as the top secondary structure 
prediction methods, what is usually advisable may 
become a mistake. The best methods provide esti¬ 
mates for the reliability of each prediction (Fig. 3) 
and if the developers have done this well, such 
estimates are much more relevant than any type of 
simple consensus (Eyrich et al. 2003). You do not 
need to run many methods, although some publica¬ 
tions demonstrate that consensus methods can fare 
better (Zhang et al. 2011). However, you will have 
to find a very good method and for the last 5 years, 
the advance in the field has not been established by 
objective assessment. 

3. There may or there may not he today’s best method. 
The previous two rules suggest that this entry would 
be most helpful if it suggested a list of top methods. 
There are many reasons why this entry does not 
provide such a list. The simplest is that the objective 
comparison methods that automatically assessed 
the state of the art in the field until a few years ago 
are no longer alive (Eyrich et al. 2003; Rychlewski 
and Eischer 2005). Conclusions about the state of 
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Percentage of residues 
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Fig. 3 Reliability index. Cumulative percentage of residues 
and Q3 scores are plotted at different reliability index (RI) 
values. The reliability index reflects the strength of a particular 
prediction for one particular residue. This index enables users to 
zoom into the most reliable prediction as the curve shown reveals 
that performance is proportional to the RI. For instance, roughly 
half of all the residues are predicted at RI > 8. For these most 
reliably predicted set, prediction accuracy (Q3) is almost 95%, 
i.e., about 13% points higher than the performance for all resi¬ 
dues. This plot was generated using ReProfSec for the same data 
set used for Fig. 2 

the art from the literature alone might be very mis¬ 
leading. Furthermore, any noncontinuous assess¬ 
ment method will at best provide a correct view up 
to a certain time. This encyclopedia will hopefully 
help you long after that time. The reality is that 
although secondary structure prediction methods 
remain at the heart of all important advanced 
sequence analysis and protein annotation, there are 
many other challenges to address. Simply put; suc¬ 
cessful groups in computational biology will shy 
away from publishing even significant 
improvements. 

4. Better alignments, better predictions. The major 
source of improvement of prediction methods over 
the last two decades originated from growing data¬ 
bases and from adequately integrating this 
increased information. The single most important 
way to improve predictions is by improving the 
information contained in the multiple sequence 
alignment used to get the prediction. For the 


particular case of secondary structure prediction 
for water-soluble proteins this largely boils down 
to: the more diverse family members you include in 
the alignment, the better. (Even at the price of 
including some remote nonmembers by mistake!) 

Typically, Hidden Markov Models (HMMs) are 
better at identifying distant relations than is 
PSI-BLAST and typically prohle-profile alignment 
methods are best. For instance, one of the recent 
methods (ReProfSec, http://www.predictprotein.org) 
increased secondary structure prediction by almost an 
entire percentage point simply by switching 
from PSI-BLAST to HHBlits (http://tooIkit.lmb.uni- 
muenchen.de/hhblits; Hildebrand et al. 2009). 
However, the simple rule “more diversity better 
prediction” does not hold for the prediction of 
transmembrane regions (strands and helices). Here, 
the opposite remains valid unless you know very well 
what you do. 

5. Predictions have 20% mistakes: find them! Levels 
of about 80% accuracy imply that 20% of the resi¬ 
dues are predicted incorrectly. Roughly a fourth of 
these are “bad” mistakes of the type “helix 
predicted where strand is observed” and vice versa 
(Rost 2005). Most mistakes tend to be on the protein 
surface. In contrast, few mistakes tend to character¬ 
ize active and binding sites. All of these numbers 
comprise averages; predictions are substantially 
worse for some proteins and much better for others 
(see Lig. 2). Thus, the error rate ranges from almost 
entirely correct in some proteins to levels close to 
random predictions for others. The reliability indi¬ 
ces can provide some clue whether your protein is 
more likely to be an average performer or to belong 
to either of those two extremes: the more residues 
are predicted at unusually high levels of reliability, 
the higher the accuracy for the protein. Taking 
a closer look at residues with low reliability often 
reveals an important story. 

After several decades of research in the held, it 
is still not possible to identify what types of proteins 
fall more often into which extreme of these classes. 
There are trivial correlations of the type: Since most 
methods predict strand less accurately than helix, 
proteins with more helix content tend to be 
predicted more accurately. However, no correlation 
claimed between the success in secondary structure 
prediction and functional traits has withstood the 
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winds of time. Extreme examples are orthologous 
enzymes, i.e., enzymes that perform an identical or 
similar function in different organisms. These may 
differ strongly in terms of prediction accuracy, 
although no such trends can reliably be established 
for the difference in performance between different 
organisms, in general. 

6. Secondary structure predictions help to predict pro¬ 
tein disorder. The study of the beauty of protein 
structures reveals again and again how the intricate 
details of structures determine function. However, 
many proteins, in particular in eukaryotes, have 
long regions that are often referred to as intrinsi¬ 
cally unstructured or disordered, i.e., regions that do 
not adopt regular secondary structures (Dunker and 
Obradovic 2001; Dunker et al. 2008). Estimates 
about how much disorder can be observed in 
human differ widely according to the choice of 
parameters and methods. One simplification is that 
15-30% of all human have at least one region with 
at least 50 disordered residues. These numbers 
increase almost twofold when reducing the criterion 
to a minimum of 30 consecutive residues 
(Schlessinger et al. 2011). Disorder occurs in 
regions that need flexibility to bind to many differ¬ 
ent substrates or to impose access to a large space to 
sense intruders into this region (like light sensors in 
airports) or to buffer intrusion (like filling material 
in packages). Some disorder is strongly enriched 
in non-regular secondary structure, and others in 
contact-deprived helices. Furthermore, other 
disorder has high propensity for secondary structure 
switching. Consequently, secondary structure 
prediction methods help in the identification of 
protein disorder. 

7. Important new information for understanding 
function and evolution. The pursuit of understand¬ 
ing protein function and evolution typically begins 
with the study of sequence data. Often, analyses 
also end there, wasting the wealth of detail available 
from protein structures that usually is needed to 
discriminate between alternative hypotheses. 
Since models based on experimental 3D structures 
are commonly available for fewer than 30-40% of 
all residues, important details are often missing. 
Fortunately in many cases, secondary structure 
already captures some of these details. Therefore, 
predictions of secondary structure often help more 


in the identification of evolutionary and functional 
similarities than comparisons of secondary struc¬ 
ture derived from experimental 3D structures 
(Przybylski and Rost 2004). 

Examples for Methods 

A collection of secondary structure prediction methods 
is chosen with respect to their quality and availability. 
As with all methods in computational biology, for 
many reasons the most readily available prediction 
methods are often not the best ones. In fact, some of 
the easiest-to-reach methods perform substantially 
worse than a method any advanced student can develop 
in a short summer. 

ReProfSeclPHD/PROFsec: PredictProtein. ReProf- 
Sec, the latest improvement over the PHD-PROF series, 
increases performance substantially by replacing 
PSTBLAST with HHblits. ReProfSec is available as 
web service, as package, and as web server through the 
first Internet server in the field, namely, PredictProtein 
(www.predictprotein.org) (Rost et al. 2004). 

PSIpred. Developed by the group of David Jones, 
PSIpred (Jones 1999) is clearly one of the top performers 
(Eyrich et al. 2003). The important step introduced by 
PSIpred was the move from the simple BLAST-like 
pairwise alignments to using further reaching 
PSTBLAST-based alignments as input to neural 
networks. The group has continuously updated the 
service. It is available as a standalone version and 
as a web server together with suite of other 
prediction methods (bioinf.cs.ucl.ac.uk/web_servers) 
(Mcguffin et al. 2000). 

SABLE. SABLE initially improved over PSIpred by 
combining PSI-BLAST profiles and neural networks, 
and predicted solvent accessibility to improve the 
prediction of secondary structure (Adamczak et al. 
2005). A web server and a standalone version are 
available. 

Porter. Distill. PORTER defines one important point 
in the ongoing struggle for advances (Pollastri and 
Mclysaght 2005). Improvements come through using 
more advanced machine learning devices than 
those exploited in ReProfSec and PSIpred, in addressing 
particular shortcomings such as finding an 
optimal path between comparative modeling and de 
novo prediction, and in the particular way in 
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which output from different structure prediction 
methods is combined to predict secondary structure. 
The web server Distill (distill.ucd.ie/distill) combines 
various structure prediction methods from the group 
(Bau et al. 2006). 

Summary 

This entry introduces the prediction of protein second¬ 
ary structure, namely, of alpha-helix, beta-strand, and 
others. Since secondary structure is the simplest, yet 
meaningful aspect of protein structure, it is utilized in 
a large variety of application: for example, in protein 
structure prediction, when aligning proteins with little 
sequence similarity and in the prediction of protein 
binding and subcellular localization. Put more gener¬ 
ally, predicting secondary structure is always a good 
idea to start any attempt at analyzing a protein 
sequence. 

The history of secondary structure predictions is 
described, i.e., important aspects and the ongoing 
improvement of the prediction methods. Furthermore, 
useful hints for users are provided and a number of 
today’s secondary structure prediction methods. 

Cross-References 

► Alignment of Protein Sequences 

► Homology Modeling of Protein Structures 

► Membrane Proteins: Structure and Organization 

► Neural Networks 

► Protein Structure 

► Structurally Disordered Proteins 
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Ertan Eryilmaz, Dongsheng Liu and David Cowburn 
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Synonyms 

Expressed protein ligation; Intein; Protein trans¬ 
splicing 

Definition 

Segmental labeling is a powerful labeling technique 
applicable to large proteins for study by NMR or 
neutron scattering. Region(s) of the protein is/are 
labeled with different stable isotopes so that the 
labeled region(s) can be studied in the context of full- 
length protein. 

Size limitation is a major drawback of macromo- 
lecular NMR: As the size increases, the tumbling rates 
become slower yielding faster transverse relaxation 
rates resulting in poor signal-to-noise (s/n) ratio, and 
more signals appear resulting in greater spectral com¬ 
plexity. In addition to reducing complexity of acquired 
spectra, segmental labeling can be applied to (1) assign 
large proteins, (2) determine structures, (3) form 


labeled proteins from producible fragments when com¬ 
plete product is impractical to express, (4) observe 
domain orientations and ligand perturbations using 
residual dipolar couplings, (5) introduce nonnatural/ 
modified amino acids, and (6) introduce NMR silent 
tags that can be used to enhance solubility and/or assist 
purification. 

Expressed protein ligation (EPL) is a modified 
native chemical ligation (NCL) where unprotected 
protein C-terminal a-thioester of a segment is 
chemoselectively trans-thioesterified with an N- 
terminal Cys of a second segment. Spontaneously, 
the formed thioester undergoes an S ^ A-acyl transfer 
and forms a native peptide bond. Protein trans-splicing 
is a posttranslational modification that relies on two 
intein portions, where affinity and catalytic activity of 
two portions of an intein are crucial in ligation of two 
extein sequences. Individual intein halves have no 
splicing activity, and a functional protein-splicing ele¬ 
ment is formed when these portions associate 
noncovalently. 

Inteins mediating protein splicing can be minimized 
by deletions of amino-acid residues to obtain shorter 
functional fragments, for example, shorter C-terminal 
fragment that can be chemically synthesized. This 
approach has the potential to widen biotechnological 
applications. In addition to minimization, inteins can 
be optimized by mutations. Often, mutations causing 
enhanced splicing activity are reflected as subtle 
changes in structure determined by X-ray crystallog¬ 
raphy but alter dynamics and affect efficiency of splic¬ 
ing. These observations make inteins subjects of 
many studies investigating protein folding, and protein 
structure-dynamics-function relationships. 

Cross-References 

► Core Splicing Machinery: Structural and 
Biophysical Studies 

► CPMG 

► Cross-Correlated Relaxation 

► Macromolecular Crystallography: Overview 

► Multidimensional NMR Spectroscopy 

► NMR 

► Post-Translational Modifications 

► Protein Folding 

► Protein NMR Resonance Assignment 

► Protein Splicing - Computational Studies 
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► Relaxation Dispersion 

► Residual Dipolar Coupling 

► Spin Labeling 

► Structure Determination by NMR: Overview 

► Total Correlation Spectroscopy (TOCSY) in NMR 
Protein Structure Determination 

► Triple Resonance NMR 

► TROSY 


Protein Side Chain Circular Dichroism 

► Near UV Protein CD 
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Jon I. Mujika and Xabier Lopez 
Kimika Fakultatea, Euskal Herriko Unibertsitatea and 
Donostia International Physics Center, Donostia-San 
Sebastian, Euskadi, Spain 

Synonyms 

Autoproteolytic enzyme: protein that catalyzes the 
hydrolysis of its own protein chain; Extein; External 
protein; Intein; Posttranslational process: process after 
translated process in gene expression; Protein trans¬ 
splicing; Self-consistent charge density function tight 
binding (SCC-DETB) 

Definitions 

Molecular dynamics (MD) simulation is 
a computational method to analyze the time-dependent 
behavior of a molecular system by solving Newton’s 
equations of motion numerically. 

QM/MM methods combine a quantum mechanical 
(QM) method to characterize the active site of 
a molecular system with a molecular mechanical 
(MM) method to treat the remaining part of the system. 

A “cluster model” is a truncated model of a large 
molecular system where only the most relevant atoms 
of the system under study (e.g., catalytic residues. 


substrate, and cofactors) are included in the 
calculation. 

Protein splicing is a posttranslational process that 
activates some biologically inactive proteins (Paulus 
1998). In this autocatalytic process, the central part of 
the protein, termed the intein, is excised from the rest 
of the protein, and the other two flanking domains, the 
C- and N-exteins join each other to form an active 
protein. Some in vitro experiments determined that 
the protein splicing mechanism is an intramolecular 
process that is catalyzed by amino acids located in the 
intein and the first residue of the C-extein, and that the 
process does not require coenzymes or external 
sources of metabolic energy. Therefore, protein splic¬ 
ing of the intein is autoproteolytic, that is, the “sub¬ 
strate” in which the catalytic reaction takes place is the 
protein itself. The overall mechanism for protein splic¬ 
ing is believed to be composed of the following four 
steps (see Fig. 1) (Xu and Perler 1996): a N-S or N-0 
acyl rearrangement, transesterification, cyclization of 
the last amino acid of the intein to form an aminosuc- 
cinimide, and a second acyl rearrangement. 

In spite of this knowledge of the overall chemical 
steps involved in protein splicing, the details at 
atomistic level are not fully understood. For instance, 
the N- and C-terminal cleavages (Steps 1 and 3 in 
Fig. 1) involve a nucleophilic attack of a residue side 
chain (a serine, cystein, or threonine in the first step 
and an asparagine in the third) onto the scissile peptide 
bond. However, the reaction is very slow per se and 
a previous activation is necessary, although how this 
activation takes place is still under debate. In the com¬ 
putational studies reviewed in this work, two activa¬ 
tion modes of the reaction have been considered: 
electrophilic and nucleophilic activation. 

The group of Hard studied the autoproteolytic intra¬ 
molecular reaction undergone by large multidomain 
proteins (similar to mucins) in their SEA domains 
(sea urchin sperm protein, enterokinase, and agrin) 
(Johansson et al. 2008, 2009). The peptide bond 
cleavage is begun by a N-0 acyl rearrangement, and 
therefore, their investigation is relevant to other self- 
catalyzed protein rearrangements, including protein 
splicing. Eirst, experimental techniques were 
employed to analyze their autoproteolytic activity, 
kinetic and structural features, and then molecular 
dynamics simulations were carried out to investigate 
the structure and dynamical behavior of the system 
(Johansson et al. 2008). The simulation suggested 
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Fig. 1 General mechanism of 
protein splicing 



C-extein 



C-extein 


Step 2: Transesterification 


HoN 



aminosuccinimide o 


Step 4: S-N or 0-N 
acyl shift 


HX 


N-extein ' 



C-extein 


O 
































p 


2050 


Protein Splicing - Computational Studies 


when the trans scissile peptide bond is planar, the 
protein secondary structure observed experimentally 
is disrupted. When this planarity is lost and the peptide 
bond was found to adopt a strained nonplanar confor¬ 
mation, the structure of the protein was found to be 
consistent with experiments. It was suggested that the 
strain of the peptide bond implies that the reactant is 
energetically destabilized, decreasing the activation 
free energy barrier of the reaction. In later work 
(Johansson et al. 2009), quantum chemical calcula¬ 
tions with density functional theory (DFT) and polar¬ 
izable continuum model (PCM) for solvent effects 
showed that strain of the peptide bond could decrease 
the activation energy of the reaction, but not suffi¬ 
ciently to initiate the N-O acyl shift. Based on exper¬ 
imental kinetic data, the authors argued that the 
reaction of N-0 acyl shift depends on an initial pro¬ 
tonation event, which was found to be rate limiting at 
neutral pH. Further calculations indicated that strain of 
the scissile peptide bond could facilitate the proton¬ 
ation of the peptide bond N atom. Once the peptide 
bond is protonated at the N atom, the nucleophilic 
attack of the serine side chain is energetically less 
demanding. However, it should be pointed out that 
this study lacks a plausible explanation for another 
critical point - how the serine hydroxyl side chain is 
ionized. The authors only considered the proton trans¬ 
fer to the peptide bond’s O atom, showing a high- 
energy barrier, which is consistent with their results 
where the protonation at the O atom is hindered by the 
peptide bond twisting. 

Du et al. investigated the cleavage at the N-terminal 
splice junction following the so-called pKa shift mech¬ 
anism (Du et al. 2009). In this proposed reaction mech¬ 
anism, a histidine located in the B-block, which is the 
most repeated residue in all intein sequences, acts as 
general base accepting a proton from the serine side 
chain. Therefore, in this mechanism, the reaction is 
activated by increasing the nucleophilicity of the ser¬ 
ine. The pKa of this residue was determined by NMR in 
the precursor and in the cleaved product. Interestingly, 
the pKa of this histidine drops from 7.3 ± 0.6 in the 
precursor to less than 3.5 in the product. Based on these 
results, the authors characterized a reaction pathway 
using a quantum mechanics/molecular mechanics 
(QM/MM) method. The mechanism they modeled 
involved the B-block histidine acting first as general 
base to deprotonate the serine side chain, which facil¬ 
itates the nucleophilic attack of the serine on the 


peptide bond. Since the distance between the histidine 
and the serine side chain is too large in the crystal 
structure (5.7 A), they proposed that the proton transfer 
can occur through a bridging water molecule. Then, in 
a second step, the histidine acts as general acid and 
protonates the scissile peptide bond N, causing the 
breakdown of the peptide bond. Only the relative ener¬ 
gies of the minima were presented in the work, not the 
energy barriers of each step. The tetrahedral interme¬ 
diate (reached after the nucleophilic attack of the ser¬ 
ine side chain) was calculated to be 31.2 kcal/mol 
above the reactant. In the same work (Du et al. 2009), 
the authors suggested that the pKa shift mechanism is 
also followed in the C-terminal cleavage. In that reac¬ 
tion, a histidine located in block F is proposed to be 
responsible for deprotonating the asparagine side 
chain. The authors did not characterize the reaction 
pathway, but the pK^ of the F-block histidine was 
determined by NMR as close to 8, both in the spliced 
precursor and in the product. This reaction mechanism 
was investigated by our group using DFT calculations 
on a cluster model (Mujika et al. 2009) and QM/MM 
molecular dynamics simulations of the GyrA intein 
(Mujika et al. 2012). The calculations on the cluster 
model show that deprotonation of the asparagine side 
chain is an effective mechanism to activate this residue 
for its cyclization. We calculated the free energy pro¬ 
file in solution for the mechanism in which the F-block 
histidine acts as the general base to deprotonate the 
asparagine side chain. These calculations confirm the 
importance of the penultimate histidine, in agreement 
with several experimental studies that demonstrate its 
relevance in the C-terminal peptide bond cleavage. 
The calculations indicate that this histidine is impor¬ 
tant first in stabilizing the negative charge borne by the 
carbonyl O atom in the tetrahedral intermediate, and 
then acting as a general acid to protonate the scissile 
peptide bond nitrogen atom. The study on the GyrA 
intein (Mujika et al. 2012) analyzed the protonation 
states of the F-block and penultimate histidines. The 
molecular dynamics simulations, employing the 
CHARMM27 force field, demonstrated that the proton¬ 
ation states of these residues have a direct influence on 
the orientation of the asparagine side chain. In the same 
work, we used the SCC-DFTB semiempirical QM 
method to model the active part of the intein, combined 
with the CHARMM27 force field to treat the remaining 
part of the system. With this QM/MM scheme, the free 
energy profile of the activation and subsequent 
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cyclization of the asparagine was calculated, including 
the effect of the entire intein. Moreover, the QM/MM 
molecular dynamics simulations highlighted the impor¬ 
tance of the Argl81-Vall82-Aspl83 segment in 
catalyzing the reaction. All of these results confirm 
that nucleophilic activation of the asparagine by the 
F-block histidine, without strain of the peptide bond, 
is a plausible mechanism for the cyclization of this 
residue observed at the C-terminal of the intein. 


Conclusions 

The computational work reviewed here has focused 
mainly on activation of peptide bond cleavage of the 
N- and C-terminals of an intein, for which two main 
activation modes have been considered: electrophilic 
activation of the scissile peptide bond by its proton¬ 
ation at the N atom and ionization of the nucleophilic 
side chain (serine or asparagine) by a histidine. There 
are still open issues around protein splicing, such as 
whether strain is caused in the peptide reactant and how 
the nucleophile is activated, that would require further 
studies. Recent results support the mechanism in which 
histidine acts as a base to deprotonate the asparagine 
side chain, and peptide strain is not involved. 

Cross-References 

► Computational Enzymology 

► Core Splicing Machinery: Structural and 
Biophysical Studies 

► Post-Translational Modifications 

► Protein Folding: Molecular Dynamics Simulations 

► QM/MM Methods 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Splicing 
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Introduction 

Different functions come to mind when thinking about 
proteins: food, enzymatic reactions, immune response, 
movement, cellular structure, regulation of gene 
expression, cell communication, and organization. In 
all but the first case, these functions all are based upon 
a fundamental property of these molecules, their abil¬ 
ity to assume a shape selected by evolution to allow 
them to perform their roles. 

Physics can help us understand the forces that drive 
the folding process, experimental techniques such as 
X-ray crystallography or nuclear magnetic resonance 
can provide us with examples of their spatial organi¬ 
zation and enable us to recognize regularities, and 
comparison of evolutionary-related proteins offers 
the possibility to investigate their functional and struc¬ 
tural constraints. Computational tools can build upon 
all this information to develop methods to infer their 
structure from their amino acid sequence. Their effec¬ 
tiveness strongly depends upon the amount and type of 
information available for the target protein under 
study, and this necessarily implies that the accuracy 
of the results is case-dependent. 
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It is obviously important to appreciate the relation¬ 
ship between how much is known about a protein and 
how well its structure can be inferred since the accu¬ 
racy of the hnal model dictates how appropriate it is as 
the basis for specihc applications. 

Estimating the Quality of a Protein Structural 
Model 

One aspect of the problem involved in the quality 
estimation of a protein model is to understand how 
well a structure prediction method performs on aver¬ 
age. This will be a distribution, and the result in each 
case will depend, as mentioned before, on the type and 
amount of available information about the target. For 
example, comparative modeling techniques are quite 
reliable in general, and it is known that the accuracy of 
modeling of the backbone of the core of a protein will 
directly depend upon the sequence similarity between 
the target protein and the protein(s) of known structure 
that is used as a template for building the model 
(Chothia and Lesk 1986), but regions outside the core 
as well as side chains might diverge to different extent 
in different cases and, in several cases, the exploitation 
of the availability of the structure of multiple homolo¬ 
gous proteins of known sequence and/or structure can 
modify the relationship. 

Even more complex is the quality estimation of 
a model obtained by other techniques, such as fold 
recognition and fragment assembly methods. It follows 
that methods to assess how accurate a given model is 
are needed. 

There can be two flavors to these latter methods. 
Scientists are obviously interested in some parameter 
related to the expected deviation of a model from the as 
yet unknown native structure of the protein, but there 
are instances where the relative ranking of different 
models is of extreme usefulness, for example, to 
select among different potential solutions provided 
by various methods. 

The Critical Assessment of Techniques for 
Protein Structure Prediction 

It is well accepted by the community that prediction 
methods have to be tested blindly (i.e., on proteins the 


structure of which is unknown at the time of predic¬ 
tion) and that, in order to effectively estimate their 
different accuracies, they should be compared on the 
same set of target proteins. This has been the role of 
the Critical Assessment of Techniques for Protein 
Structure Prediction (CASP) experiments that ran, 
at two yearly intervals from 1994 onward (Moult 
et al. 2011). 

The structure of the experiment consists of four 
main steps: 

1. Participants register for the experiment in one or 
both of two categories: as human teams, where 
a combination of computational methods and inves¬ 
tigator expertise may be used; and as servers, where 
methods are only computational and fully auto¬ 
mated, so that a target sequence is sent directly to 
a machine. 

It should be mentioned here that it is extremely rare 
that a model is genuinely manually produced, more 
often the predictor verihes the accuracy of an auto¬ 
matic model or selects one of the solutions proposed 
by an automatic method. In reality the real differ¬ 
ence between these two categories lies in the span of 
time between the availability of a target sequence 
and the expiration date before which the model has 
to be submitted. 

2. Information about “soon to be solved” structures is 
collected from the experimental community and 
passed on to the modeling community. The high 
throughput of structural genomics together with 
a well-integrated pipeline for target identification 
facilitated by the PDB now allow CASP to collect in 
excess of 100 diverse and interesting targets. 

3. Models are collected in accordance with predefined 
deadlines. Groups are limited to a maximum of five 
models per target, and instructed about the aspects 
of modeling where they should concentrate their 
efforts. 

4. The models are compared with experiment, using 
numerical evaluation techniques and expert assess¬ 
ment, and a meeting is held to discuss the signifi¬ 
cance of the results. 

Targets are divided into two primary categories: 
template-based modeling (TBM), where a relationship 
to one or more experimentally determined structures 
could be identified, providing at least one modeling 
template and often more; and template-free modeling 
where there are either no usefully related structures. 



Protein Structural Models - Evaluating Quality 


2053 


P 


or the relationship is so distant that it cannot be 
detected (FM). 

CASP has provided a wealth of data and informa¬ 
tion impossible to summarize in a few sentences. The 
results of the assessment, the description of the 
methods, and an analysis of the progress are reported 
in special issues of the journal PROTEINS (Moult 
et al. 2011) while all data are available at the website 
of the prediction center (www.predictioncenter.org). 
Inspection of these will convince the reader that there 
has been substantial progress in the held and, espe¬ 
cially, that the coverage by the different methods has 
increased substantially providing reasonable models 
for most target proteins. 

A few open issues are highlighted by the experi¬ 
ment, namely, that in TBM it still seems to be impos¬ 
sible to consistently improve a model with respect to 
the template used and, perhaps consequently, that 
regions that change substantially during evolution are 
still inaccurately modeled. It is also clear that, while 
FM approaches often provide a model sufficiently 
close to the native structure, it is still difficult to iden¬ 
tify this model among the various solutions provided 
by the methods. 

In addition to evaluating the overall accuracy of 
model of three-dimensional structure, CASP also 
examines other key aspects of structure modeling: 
intramolecular contact identification (Monastyrskyy 
et al. 2011a); prediction of the presence of structural 
disorder (Monastyrskyy et al. 2011b); the identifica¬ 
tion of ligand binding sites (Schmidt et al. 201 1); and 
the ability to determine the accuracy of a model 
(Kryshtafovych et al. 2011), critical to determine 
whether it is suitable for a specific purpose. 

Estimating the Quality of a Structural Model 

Interestingly, the first methods for quality assessment 
of a protein structural model were developed for 
solving different problems. 

For example, 3D-1D profiles were originally 
developed to detect distantly related proteins and 
subsequently used for quality prediction (Futhy et al. 
1992). They are based on the idea of assigning an 
environmental class to each residue and use the prob¬ 
ability of finding a specific residue in a given environ¬ 
ment to assign a “probability of correctness” to 


a model. At the same time, methods to assess the 
probability for two residues to be at a specific distance 
from each other, originally used in fold recognition 
technique, gave rise to somewhat effective tools for 
detecting errors in protein models (Sippl 1993). 
A similar idea, but at the atomic rather than residue 
level, has also been explored. Tools to assess the 
quality of an experimental structure have also been 
adapted to solve the quality assessment issue, with 
the idea that stereochemical incorrect regions of the 
model can help identifying problematic regions in the 
model, although a few rounds of energy optimization 
of a model is usually enough to render these tools 
rather insensitive. 

When the importance of the problem was more 
widely recognized, several methods explicitly developed 
to this purpose were implemented. They can be roughly 
classified into ranking method and single-model quality 
prediction tools. 

The first class of methods aims at predicting which 
models are closer to the best one, or to the native 
structure, among a list of candidates. They work very 
well (the correlation coefficient being higher than 0.9 
for most methods) (Kryshtafovych et al. 201 1). These 
usually employ consensus approaches to generate 
quality estimates, generally by clustering the models 
according to their structural similarity and selecting 
the best model as, for example, the centroid of the 
most populated cluster. 

The parameters used by single-model method are 
various and can include preferences for amino acid or 
atom pair contacts, consistency of the local amino acid 
sequence with their secondary structure and solvent 
accessibility in the model, etc. Often automatic learn¬ 
ing approaches are used to optimize the weight of the 
various parameters. 

Methods based on the analysis of individual models 
which aim to provide an estimate of their accuracy still 
lag far behind and this is probably the area that requires 
more effort from the community. The approaches used 
here are more diverse and are based on different 
combinations of parameters. 

As in other areas of structural bioinformatics, 
a number of methods that combine the output from 
several others have been developed as well. 

Table 1 summarizes the information used, alone or 
in combination, by the methods participating in the last 
CASP assessment, as deduced from reading the 
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Table 1 Both clustering and single-model methods are based 
on different types of information. A survey of the most widely 
used ones is given in the table 

Clustering 

methods Information 

Similarity of secondary structure of equivalent 
residues in the model and in the ensemble 
Similarity of solvent accessibility of equivalent 
residues in the model and in the ensemble 
Structural similarity of the model to other models 
in the ensemble 
Statistical potentials 

Information from evolutionary-related proteins 
Agreement of the model’s contact map with the 
contact maps of all models in the ensemble 
Beta-sheet topology 

Single-model 

methods 

Residue-residue pair potentials 

Atom-hased pair potentials 

Local backbone geometry 

Volume of the cavities surrounding a residue 

Agreement with residue-level distance 

constraints extracted from related protein 

structures 

Physics-hased potential 
Template-derived residue-residue distance 
potential 


abstracts submitted to the conference available at the 
prediction center website: www.predictioncenter.org/ 
casp9. In the case of clustering methods, the ensemble 
is usually the list of models provided by structure 
prediction servers, sometimes weighted by their 
predicted single-model quality values. 

Progress? 

Interestingly, the comparison of the last CASP results 
in this area with the results from previous experiments 
is encouraging and shows that the field is making 
progress. Consistent improvement is noticeable for 
both the best and moderately-well-performing groups, 
with a more pronounced improvement for the latter 
groups (Kryshtafovych et al. 2011). 

The comparison of the performance of the best 
groups that have also participated in the previous 
experiment shows that none of them performed 


worse, with many improving their results. It 
is important to mention that, as discussed and proven 
in (Kryshtafovych et al. 2011), the observed 
progress cannot be attributed to a decreased 
target difficulty, but it is likely to reflect methodolog¬ 
ical improvements over the last few CASP 
experiments. 

Summary 

The prediction of the structure of proteins is, on one 
hand, a useful exercise to gauge our understanding of 
the “rules of the game” of the folding process, but, 
perhaps more importantly, it is expected to be helpful 
for discovering relevant biological properties of the 
macromolecule. 

As has been mentioned on several occasions, the 
type of information that can be reliably derived from 
a protein model strictly depends upon its quality and, in 
many cases, on the quality of specific region of the 
model. It follows that strategies for correctly estimat¬ 
ing the model reliability need to be developed. This 
would also add value to the several model databases 
that are available and that undoubtedly will be devel¬ 
oped in the future. 

The community is lively and continuously growing 
and is effectively working both in the direction of 
improving the ranking of models as well as in estimat¬ 
ing the reliability of a single model and of its local 
regions. 

There is no doubt that the field will witness an ever 
growing interest in this aspect of computational biology 
and it is also rather obvious that every step in this 
direction will get computational biology closer to its 
customers, that is, to the molecular and cell biologists 
and perhaps in the near future to the biomedical 
community as well. 

Cross-References 

► Alignment of Protein Sequences 

► Comparative Protein Structure Modeling 

► Protein Structure Prediction and Structural 
Annotation of Proteomes 

► Proteins: Relationship Among Divergence of 
Sequence, Structure, and Function 

► Structural Genomics 
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Institute for Medical Research, London, UK 

Synonyms 

Protein structure; Protein structure alignment 

Definition 

Computational tools designed to detect regions of sig¬ 
nificant similarity between protein structures for 
a variety of purposes including classification, evolu¬ 
tionary inference, analysis of substructures, and 
assessing predicted structural models. 

Introduction 

The database of known protein structures is a rich 
source of insights into biochemical reaction mecha¬ 
nisms, possible evolutionary relationships, and 
physical constraints on proteins. Protein structure 
comparison methods are essential for realizing this 
potential. The development of these methods has 
helped us to refine our understanding of the important 
features of protein structure while their application has 
been instrumental in creating a global picture of pro¬ 
tein fold space, and suggesting unifying evolutionary 
hypotheses about protein families essential to the cre¬ 
ation of ► domain structure family classifications. 
Structure comparisons are also fundamental parts of 
methods for ► protein structure prediction and predic¬ 
tion of function from structure. 

Structure comparison methods have three main 
components: the score function, the alignment method, 
and the background distribution used for assessing 
significance of comparisons. Additional steps are 
added to construct more sophisticated methods for 
flexible alignment and fast database searching. 
Interpreting structure comparisons requires some con¬ 
sideration of the biological context of structural evo¬ 
lution and underlying regularities of unrelated 
structures. 
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Scoring Structural Similarity 

The most fundamental component of a structure com¬ 
parison is the score assigned to the similarity between 
the structures. For atomic comparisons (the most 
widely applicable situation) it is given that a set of 
equivalent positions (equivalences) is already known. 
In the case of assessing a predicted model this is 
straightforward since the alignment between the 
model and the target structure is given a priori. When 
aligning two structures the score function is used 
to assess the quality of a possible alignment. Although 
the same score function can be used for both 
purposes since the problem in both cases is essentially 
the same; in practice the two situations have slightly 
different requirements and different measures are often 
used. 

It is generally a requirement that the number of 
objects compared in the structures is the same. Since 
the number and character of atoms in the side chains of 
different amino acids is different this requires some 
degree of abstraction if sequence differences are toler¬ 
ated in atomic level comparisons. In general, this is 
addressed by comparing only alpha carbons. Again 
for more detailed comparisons other atoms can also 
be included and for high-accuracy modeling an all¬ 
atom score including side chain atom positions can 
be used. 

Once the exact comparison to be scored is specified, 
we need to put a number on the extent that the two sets 
of atoms are in the same place relative to the rest of the 
structure. This is the first point at which things get 
interesting as there are many ways to do this. Perhaps, 
the simplest case conceptually is the distance root- 
mean-square deviation (dRMSD) - in this, we compare 
distances between all pairs of atoms and find the root- 
mean-square deviation between equivalent 
distances. This is the basis of the scoring scheme 
used by the widely used DALI method for structure 
alignment. 

Although simple this scoring scheme is not the most 
commonly used: decomposition into sets of pairwise 
distances loses information about the spatial relation¬ 
ships between atoms which can lead to distortions. 
Instead most methods rely on rotating and translating 
one structure to fit as well as possible over the other - 
this is known as structural superposition. This leads to 
the more familiar RMSD measure, which is the natural 
metric for assessment of the goodness of fit - a least- 


squares deviation measure. To calculate the RMSD the 
structures are first superposed, following which the 
RMS of the distances between equivalent atoms is 
found. Several superposition algorithms exist for 
deriving the optimal rotation (translation is accounted 
for by centering both sets of coordinates at 0), these are 
principally applications of methods for finding eigen¬ 
values of matrices and the details are solely of techni¬ 
cal interest. For further information the reader is 
referred to Flower (1999) and Theobald (2005). 

The RMSD has a long history in structure compar¬ 
ison and it is still useful as perhaps the most widely 
understood measure of similarity. However significant 
difficulties with it have led to its being largely aban¬ 
doned in large-scale assessments of prediction accu¬ 
racy, most notably the biennial CASP experiment in 
protein structure prediction. There are two principal 
difficulties: first the RMSD increases with the radius of 
gyration of the protein (and therefore with chain 
length, since proteins are generally compact), meaning 
that an absolute value of RMSD is uninterpretable 
unless accompanied by a length. Second, the 
RMSD measure as stated here would take into account 
all the regions of the protein equally - a large differ¬ 
ence in just a small region (i.e., an incorrect loop 
conformation) would result in a very large RMSD 
despite substantial similarity elsewhere. This can lead 
to similar structures with a few large deviations receiv¬ 
ing worse scores than structures which are globally less 
similar. 

Alternative measures now widely used are the 
global distance test (GDT) score, which is the standard 
measure of prediction accuracy used in CASP, and 
variants of the Levitt-Gerstein (LG) score, in particular 
the TM-score. Essentially these invert the question and 
ask how much of the protein is within a given distance 
cutoff. In the case of the GDT score four cutoffs are 
used (with a separate optimal superposition derived for 
each case) and the mean of the coverage at each of the 
cutoffs (1, 2, 4, and 8 A are the standard) is taken to 
produce a score in the interval [0, 1]. Since residues 
which are close at a lower threshold are nearly always 
close at each of the other thresholds this has the effect 
of substantially upweighting the score for well- 
predicted residues. This does not, however, correct 
the issue of length dependence. 

The TM-score of Zhang and Skolnick (2004) is 
conceptually similar but uses only a single threshold, 
corrects for length dependence and simultaneously 



Protein Structure Comparison Methods 


2057 


P 


weights coverage and distance. The score is calculated 
as follows: 

y_^_ 

Y 1 + {dij/dof 

Where dij is the distance at each residue and d^, is 
a threshold distance, which varies according to the 
length of the shorter of the proteins compared. The 
form of this will be discussed in the section on back¬ 
ground score normalization. For any given residue 
position the score will be zero at a large distance, 1/2 
at the threshold distance, and 1 at distance 0. Thus the 
overall score will be 1 for a perfect model and 0 for 
a dreadful one. 

Many alternative approaches to scoring exist, the 
majority of which involve the use of a different repre¬ 
sentation - for example, vector scoring at the SSE level 
is one commonly used way to speedup database 
searching. If local quality is to be assessed several 
fragment-based measures have been developed and 
for assessing side chain conformations a variety of 
graph-theoretic approaches are commonly used with 
particular application in comparing structures to data¬ 
bases of functional sites. 

Structural Alignment 

In the relatively simple case discussed above the align¬ 
ment of two proteins is given in advance, as is the case 
when assessing a structure prediction. However, in the 
more general situation, we do not know the alignment 
and we must find both the best superposition of the two 
structures and the best set of equivalent residues 
simultaneously. 

This is a difficult problem since without assigned 
equivalences the score functions above cannot be 
applied but it is the equivalences which are of interest 
and so cannot be known in advance. The problem must 
be solved using a heuristic of some kind. Many 
approaches are possible: stochastic search is one 
widely used method - Monte Carlo, simulated 
annealing and genetic algorithm-based alignment 
methods have all been published. The advantage of 
such approaches for the developer is that it is generally 
simple to include arbitrary features into the objective 
function and these can be used as an excellent test bed 
for new ideas. However, from a user point of view 


these methods can often suffer from a lack of speed 
and reliability, since stochastic searches do not always 
find a good result and usually find different solutions 
on different runs. The preferred method is therefore to 
use an iterative approach which will reliably and 
repeatably find a very good solution. While these 
methods will in general only find locally optimal solu¬ 
tions they offer significant advantages in speed and 
reliability. 

Iterative methods start by finding an initial align¬ 
ment which is used to derive an approximately correct 
rotation then refining this to find the best alignment and 
rotation. The initial alignment can be found using 
sequence, aligning secondary structures, multifeature 
local scores, fragment similarities, or other means. 
From this alignment a rotation matrix is found and 
the quality of the superposition is assessed using the 
chosen score. New alignments can be tested with 
the score function using this rotation until a better 
alignment is found, following which the superposition 
can be recalculated and the process repeated. Typically 
the process converges after only a few iterations, 
providing a repeatable and generally accurate result. 
It has been found that the final alignment result is very 
sensitive to the quality of the initial rotation matrix. 
TM-align circumvents this problem by sampling three 
initial alignments while SAP uses a two-level dynamic 
programming method with randomization to increase 
sampling of possible alignments. Initial fragment- 
based alignments are another widely used method for 
avoiding these problems, an approach adopted by 
Fr-TM-Align and MAMMOTH. 

Assessing Significance 

Determining the degree to which the score for the 
similarity between two structures could be a chance 
occurrence is a difficult issue which has not yet been 
fully resolved. This is a very important issue for data¬ 
base searching purposes and in the absence of a full 
understanding of the problem the solution is found by 
simulation. 

The problem is essentially to determine how likely 
two sets of points are to have a particular structure 
comparison score at random. In the case of two random 
points picked from a sphere the RMSD, for example, 
will approximate the diameter of the sphere, hence 
growing with the radius of gyration of the points. 
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However the RMSD for real proteins will be consider¬ 
ably smaller for two reasons: firstly the positions of 
sequential amino acids are not independent of each 
other since they are at most about 3.8 A distant. This 
can be modeled using a self-avoiding random walk but 
this background model still results in very large 
RMSDs compared to real proteins. This is because 
there are substantial regularities within proteins 
which bias the distribution, particularly the existence 
of characteristic secondary structures, turns, and sec¬ 
ondary structure packings. Consequently, the usual 
approach is to use data from real proteins to generate 
a background model. 

To do this, the distribution of scores given certain 
parameters (e.g., query length) is modeled empirically 
using contiguous fragments from real structures. Alter¬ 
native schemes might involve analyzing scores for 
alignments between real proteins of different fold or 
generation of decoys, perhaps based on the query 
structure. One popular method for decoy generation 
uses mirror-image proteins as decoys (Maiorov and 
Crippen 1994), large numbers of such decoys can be 
made by recombining loops, chain reversal, and cyclic 
permutation (Taylor and Aszodi 2005). Naturally this 
approach will work better with some schemes than 
with others - dRMSD measures, for example, could 
not distinguish left- from right-handed structures. Sig¬ 
nificance tests for some other measures (notably GDT) 
do not exist since they are not used to search databases, 
while other methods (e.g., DALI, MAMMOTH) pro¬ 
duce significance tests as part of their default score, 
generally by estimating parameters for an extreme 
value distribution using low-scoring database matches. 

In the case of TM-Align, the reported TM-score 
incorporates a length correction, which allows 
a more-or-less direct interpretation of the significance 
of the resulting alignment. This is based on an approx¬ 
imation of the dependence of distance between equiv¬ 
alent residues derived using gapless comparison of 
fragments from the protein data bank. The dependence 
of expected distance on chain length is a function of the 
dependence of the expected radius of gyration for the 
protein on length and the relationship between radius 
of gyration and expected separation. The overall shape 
of the relationship approximates the cube root of the 
length. In general, the TM-score for very different 
protein structures is distributed from 0.2 to 0.3, with 
a score over 0.5 being highly significant and usually 
indicative of substantial global similarity. 


Flexible Alignment 

The assumption used above is that protein structures 
can be meaningfully treated as rigid bodies and aligned 
within a single rotational frame. Although this is 
broadly true for comparisons of domains there are 
several known cases in which this assumption breaks 
down, most importantly where domain motions have 
occurred within proteins, local repacking or high 
intrinsic flexibility are other examples of situations 
where modifications are needed. 

The simplest solution is to introduce “hinges” into 
the alignment, points at which we leave one rotational 
frame and enter another - in other words, where 
a different rotation matrix is derived. Clearly, the num¬ 
ber of such hinges must be kept as small as possible 
since if each atom is in its own rotational frame, we 
could always achieve a perfect score. Several 
implementations of this idea exist; the original and 
most well-known method is FATCAT, which is now 
used to produce alignments for the RCSB PDB website 
in order to identify structural neighbors. This method 
first finds similarities between fragments of the two 
proteins and generates an initial alignment. Following 
this the structural compatibility between aligned 
regions is assessed and if it exceeds a given threshold 
the two regions are joined. Otherwise a hinge is intro¬ 
duced between the two. The resulting overall align¬ 
ment is then refined to produce the final alignment. 

Generally flexible alignment is superior to rigid- 
body methods for many useful applications; however, 
it is naturally a slower method. This is not generally of 
much significance since speed is usually most impor¬ 
tant when large numbers of comparisons are required, 
for which none of the above approaches is likely to be 
fast enough. For such applications some kind of 
prefilter is generally used to allow interesting compar¬ 
isons to be identified quickly (a topological filter has 
been implemented in the case of FATCAT). We dis¬ 
cuss these methods briefly in the next section. 

Fast Comparisons for Database Searching 

One question which structural biologists typically ask 
when publishing a structure is whether it represents 
a novel fold. For practical purposes this can essentially 
be defined as the lack of a significant structural simi¬ 
larity to any known structure, although as noted above 
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Protein Structure Comparison Methods, Table 1 Selected methods for structural alignment and structure comparison 

Name 

Key references 

Comments/Key features 

DALI 

Holm and Sander (1996) 

Alignment of distance matrices; reports significance; used to create FSSP database 

FATCAT 

Ye and Godzik (2003) 

Flexible alignment; reports significance; used by RCSB PDB to find structural neighbors 

FrTM-Align 

Pandit and Skolnick (2008) 

Slower but more accurate version of TM-align; initial fragment-based alignment 

MAMMOTH 

Ortiz et al. (2002) 

Fragment-based alignment; reports significance; used in GASP assessments; 
multiple structure version available 

SAP 

Taylor (1999) 

Local-global alignment by double-dynamic programming; Related SSAP method 
used to generate initial CATH database 

SSM (PDBe 
Fold) 

Krissinel and Henrick 
(2004) 

Fast database search and alignment by secondary structure vector matching; reports 
significance; used by EBI MSD for structural neighbor searches 

THESEUS 

Theobald and Wuttke 
(2008) 

Superposition of multiple structures only (does not align); significantly improves 
results using maximum likelihood based method 

TM-Align 

Zhang and Skolnick (2005) 

Alignment based on TM-score 

VAST 

Gibrat et al. (2002) 

Fast structural alignment by vector search. Used for structural neighbor searches in 
NCBI MMDB 


the question of length dependence means that this 
depends on whether the whole structure is used to 
search with or first split into domains. In order to assess 
this, we must compare the new structure to the data¬ 
base of known structures. 

As we have seen the usual approach to structure 
comparison requires an iterative or heuristic approach, 
which slows down the already relatively slow process 
of aligning two structures. Although the number of 
structures is small compared with the number of 
sequences it is necessary to speed things up as much 
as we can for the purpose of interactive searching. 

To achieve this we must sacrifice some detail. The 
typical approach, implemented, for example, by tools 
such as VAST or the SSM tool used for structure com¬ 
parisons at the EBI website (recently renamed to PDBe 
Fold), is to represent the structure as an ordered set of 
vectors defining the secondary structural elements or 
some other conveniently straight regions. Fast 
searching can then be achieved by matching vectors to 
one another and using good matches to seed initial 
alignments. Graph theory is another means of rapidly 
comparing secondary structure elements between pro¬ 
teins and is implemented in the CATHFDRAF method 
used to compare structures for classification in CATH. 
Full alignments can then be generated in the usual way. 

Interpreting Structural Comparisons 

Despite a great deal of progress there are several 
remaining difficulties for structure comparison. The 
relationship between significant structural similarities 


and evolutionary relationships is extremely compli¬ 
cated and without a deeper understanding of physical 
constraints on protein structure, in addition, to those 
imposed by evolution we cannot reasonably say which 
similarities are likely to be convergent and which 
divergent. Other information is frequently used to 
argue in favor of homology, such as conservation of 
functional features and unusual elements of structure, 
but a more detailed knowledge of structural evolution 
is required if we are to make progress on this issue. In 
general, the problem of structural alignment is degen¬ 
erate - although an evolutionarily correct alignment 
might exist there is no general best structural align¬ 
ment under many useful score functions and the two 
will not always coincide. Assessment of homology 
based on structural alignment requires careful consid¬ 
eration of the proposed evolutionary scenario. 

Problems with score functions and degeneracies in 
comparison can have a large impact on the global 
picture of fold space. Recently, the debate on the 
existence of discrete structural fold groups has 
resurfaced and it appears that opinion presently favors 
a continuum of structures rather than a set of discrete 
groups. Generally, this is somewhat of a semantic issue 
and it is difficult to assess its significance in a wider 
sense, however, at least some of the results on which 
this is based can be understood as artifacts of the 
results of applying structural alignment on a large 
scale. The difficulty with structure alignment is that 
the distances which it produces are not true distances in 
a mathematical sense since they frequently fail to obey 
the triangle inequality (Pascual-Garcia et al. 2009). 
This produces irregularities in the results of clustering 
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since two very different objects can seem to be linked 
by a third which has properties of both. This creates the 
appearance of a continuum but it is purely an artifact of 
the same type as can he produced by multidomain 
proteins in sequence clustering. In practice, this sort 
of problem is essentially insurmountable since proteins 
are subject to insertions, deletions, and recombinations 
as they evolve (Sadowski and Taylor 2010); hence 
a different approach is needed if the question of conti¬ 
nuity is to be meaningfully settled. 

Summary 

The variety of methods for protein structure compari¬ 
son is increasing rapidly as newer, more specialized 
approaches are developed. Despite this the most widely 
used methods generally share several common features, 
which have been described in this entry. It has unfortu¬ 
nately been necessary to omit descriptions of a number 
of more innovative approaches of greater mathematical 
sophistication since these are at present not in general 
usage; however, it is hoped that these newer methods 
will he instramental in redefining our picture of protein 
structure space in the future and circumventing the 
many challenges which remain for structure compari¬ 
son. A list of the methods used as examples appears as 
Table 1. These methods are selected as those which are 
most widely used in publicly available services and 
represent significant advances. Preference is also 
given to methods which are freely available, especially 
those for which the source code is also provided. 

Cross-References 

► Comparative Protein Structure Modeling 
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► Protein Structure: Potential Folds 

► Protein Structure Prediction and Structural 
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Synonyms 

Fold recognition; Protein structure; Protein structure 
modeling; Protein structure prediction; Sequence 
alignments; Structural genomics; Template-based 
modeling; Template-free modeling 

Definition 

Protein structure prediction methods aim to predict the 
structures of proteins from their amino acid sequences, 
utilizing various computational algorithms. Structural 
genome annotation is the process of attaching biolog¬ 
ical information to every protein encoded within a 
genome via the production of three-dimensional pro¬ 
tein models. 

Introduction 

Proteins are essential molecules involved in both 
structural and functional roles of all living cells. 
Numerous diseases, notably Alzheimer’s, Parkinson’s, 
heart disease, and cancers, involve mutations in spe¬ 
cific proteins, which affect their function. Thus, deter¬ 
mining protein structure is essential to understanding 
functionality and is potentially helpful in developing 
treatments for the diseases or disorders. The ultimate 
goal is to determine the function of a protein from 
sequence computationally, but often sequence infor¬ 
mation alone is insufficient. During the course of evo¬ 
lution, protein structure has been more conserved than 
amino acid sequence, therefore the analysis of protein 
structures often leads to a greater understanding of 
protein function than can be obtained from just study¬ 
ing their sequences (McGuffin 2008a). 


Experimental methods which include ► X-ray crys¬ 
tallography and ► nuclear magnetic resonance (NMR) 
are commonly used for protein structural determination, 
but they have several limitations. The cloning, expres¬ 
sion, and purification of a protein and in the case of the 
X-ray crystallography, the subsequent production of 
diffraction quality crystals for protein structure determi¬ 
nation, are time consuming and costly. Conversely, 
computational methods for protein structure prediction 
are easily automated, fast, and cheap. Predicted struc¬ 
tures allow the inference of function, lead to a better 
understanding of protein evolution, and guide experi¬ 
mental work in dmg discovery, biopharmaceuticals, 
industrial enzymes, ligand-protein interactions, and 
cancer biology, to name a few applications. 

In this post-genomic era, the gap between protein 
sequence and structure is widening. At the time of 
writing, there are <67,000 protein structures in the 
Protein Data Bank (PDB) and ^140 million sequences 
in the GenBank database (Fig. 1). The rate at which 3D 
structures are being solved is evidently unable to com¬ 
pete with the speed of genome sequencing. However, 
the use of bioinformatics tools may be used to help 
close the gap between sequence and structure, help in 
proteome annotation, and speed up the elucidation of 
protein structures by the production of high quality 
homology models for molecular replacement. 

Sequence Alignment in Protein Structure 
Prediction 

► Sequence alignment algorithms are subdivided into 
global and local sequence alignment methods. Global 
sequence alignment algorithms seek to align two 
sequences over the whole length of the protein to 
produce a score, which determines the evolutionarily 
relatedness of the two proteins. Local sequence align¬ 
ment algorithms align evolutionarily related segments 
of proteins, which include binding sites, domains, and 
sequence repeats that are important to the protein, but 
may exist in other proteins, which are distantly 
evolutionarily related and have unrelated functions 
(Fig. 2) (Altschul et al. 1990, 1997; Lipman and 
Pearson 1985). 

The first global sequence alignment algorithm was 
developed by Needleman and Wunsch in 1970 
(Needleman and Wunsch 1970), which was the first 
application of ► dynamic programming for sequence 
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Protein Structure 
Prediction and Structural 
Annotation of Proteomes, 
Fig. 1 The number of 
sequences deposited in 
sequence databases including 
GenBank, EMBL, Swiss-Prot, 
and KEGG GENES is 
dwarfing the number of 
protein structures in the PDB 
by a factor of over 2000 (Data 
taken from the PDB and 
sequence databases), (a) The 
number of sequences in 
sequence databases and the 
number of protein structures in 
the PDB are plotted against the 
years of entry, (b) As in (a) but 
enlarged to focus on the PDB 
entries and highlight the rate 
of increase in comparison to 
the sequence databases. 

(c) The number of sequences 
in sequence databases and the 
number of protein structures in 
the PDB is plotted on a log 
scale against the years of entry 


a I •■PDB ♦GenBank EMBL Swiss-Prot KEGG GENES 



Year 


b I •PDB ♦GenBank » EMBL Swiss-Prot KEGG GENES 



C !• PDB GenBank ■» EMBL Swiss-Prot *- KEGG GENES 
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sequenc« mom WDFQOQS 


Protein Structure Prediction and Structural Annotation of 
Proteomes, Fig. 2 Multiple sequence alignment of the PKDl 
domain 1 (PDB 1B4R), showing highly conserved sequence 


motif WDFGDGS. The eight species were aligned using 
ClustalW2, and the HHpred color scheme was utilized to illus¬ 
trate amino acids with similar biochemical properties 


comparison. This was followed in 1981 by Smith and 
Waterman’s (Smith and Waterman 1981) development 
of an algorithm for local sequence alignment 
possessing a high degree of similarity, which included 
the addition of a weighting for gap penalties and the 
use of a matrix to identify sequence pairs. 

Lipman and Pearson developed PASTA (Lipman 
and Pearson 1985) in 1985 with an update to the 
algorithm in 1988. PASTA is a local sequence align¬ 
ment method, which is still widely used. PASTA was 
one of the first sequence alignment algorithms that 
could be run on a standard desktop PC of the era, 
through the introduction of the concept of “ktups.” 
Ktups are segments of an amino acid sequence, with 
ktup = 1 being one amino acid long, ktup = 2 is two 
amino acids long and so forth. This concept is used to 
increase the speed of the algorithm, as the number of 
searches is reduced. The higher the ktup value, the 
faster the speed. PASTA utilizes ktup = 2 as 
the default for amino acid sequence alignment and 
ktup = 6 for DNA sequence alignment (Lipman and 
Pearson 1985). 

BLAST - Basic Local Alignment Search Tool - 
(Altschul et al. 1990) is a rapid sequence alignment 
algorithm for homology searching of sequence librar¬ 
ies. BLAST, like PASTA, also utilizes the ktups 
method but refers to ktups as “words,” with the default 
“word” size set to three for amino acid sequences and 
11 for DNA sequences. Gapped BLAST and PSI- 
BLAST (Position Specific Iterafive-BLAST), intro¬ 
duced in 1997 (Altschul et al. 1997), further improved 
the sensitivity and speed of the BLAST algorithm. 
Gapped BLAST generates a single gapped alignment, 
which increases the speed for pairwise sequence 
alignment, whereas the original BLAST program 
often finds several alignments involving a single data¬ 
base sequence, which when considered together were 


statistically significant. Using the Gapped BLAST 
alignment algorithm, it then becomes necessary to 
find only one rather than all of the ungapped 
alignments significantly increasing the speed of the 
algorithm. PSI-BLAST is more sensitive for the detec¬ 
tion of weak, but biologically relevant sequence simi¬ 
larities. PSI-BLAST uses a sequence-profile alignment 
method, with a position-specific scoring matrix gener¬ 
ated from significant alignments in round i which are 
then used in round / + 1 to generate a matrix for the 
next round. This iterative searching and profile 
consfruction process significantly increases the 
sensitivity of searching the sequence and structure 
databases. PSI-BLAST is an integral part of most 
successful tertiary structure prediction pipelines. 

Critical Assessment of Techniques for 
Protein Structure Prediction (CASP) 

The continual development of more advanced protein 
structure prediction tools is driven by the Critical 
Assessment of Techniques for Protein Structure 
Prediction (CASP) competition. CASP is a biennial 
competition, with the aim of advancing the methods of 
predicting protein structures from sequence, by the pro¬ 
vision of objective testing of the methods via blind 
prediction. CASP is currently divided into six 
prediction categories: (1) tertiary structure prediction - 
template-based and free modeling, (2) disorder predic¬ 
tion, (3) domain prediction, (4) contact prediction, 
(5) quality assessment, and (6) binding site prediction 
(Moult et al. 2009). 

There have been major improvements seen in the 
structure prediction category in each successive CASP. 
The previous three CASP experiments showed that 
fully automated structure prediction servers can 
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Protein Structure Prediction and Structural Annotation of Proteomes, Table 1 Established techniques for the modeling of 
protein folds fall into three major categories, which is dependent on the level of information that is known about the protein sequence 
(McGuffin 2008b) 

Method category 

Requirements 

Relative computational 
difficulty 

Relative speed 

Theoretical 
sequence coverage 

Homology/comparative 
modeling 

Homologous (>30% 
sequence ID) to a template 
structure from the PDB 

Easy 

Fast 

Minimum 

Fold recognition/threading 

A template fold of known 
structure from the PDB 

Medium 

Medium 

Medium 

Ab intio/new fold/free 
modeling 

The target sequence and/or 
a fragment library 

Hard 

Slow 

Maximum 


produce models close in quality to those produced by 
the very best expert human modelers (Kryshtafovych 
et al. 2009). Server performance is extremely impor¬ 
tant, since they are the only choice for high throughput 
modeling. The number of servers involved in CASP 
has increased from 53 in CASP5 to 79 in CASP9, 
showing an increase in interest for protein structure 
prediction and increased competition in the field. 

Tertiary Structure Prediction 

Protein tertiary structure prediction methods are 
divided into template-based and template-free model¬ 
ing methods. If a structural template is available within 
the PDB, template-based modeling methods such as 
► homology modeling and fold recognition can be 
utilized, but if a structural template is unavailable 
free modeling will have to be utilized (Table 1). 

Template-Based Modeling 

The success of template-based modeling (TBM) 
methods is based on three key facts: (1) similar 
sequences fold into similar structures, (2) many 
unrelated sequences also fold into similar structures, 
and (3) there are only a relatively small number of 
unique folds when compared with the number of pro¬ 
teins found in nature; most of the fold space has been 
structurally annotated and few new folds are being 
solved (McGuffin 2008b). 

Traditionally, template-based modeling is divided 
into two subcategories: homology modeling and fold 
recognition. Homology modeling, also known as 
comparative modeling, is dependent on finding 
a sequence alignment between the target and the 
template structure with a sequence ID > 30%. Fold 
recognition methods go beyond simple sequence 
searching, when the sequence identity between the 
target and template sequence is within the twilight 


zone (20-35% sequence ID). However, it is becoming 
increasingly difficult to differentiate between homol¬ 
ogy modeling and fold recognition algorithms, as most 
successful methods now utilize profile-profile-based 
sequence searching algorithms to identify very distant 
relationships between targets and templates (McGuffin 
2008b). 

HHsearch is a popular, rapid, and accurate profile- 
profile-based fold recognition method (Soding 2005), 
which utilizes the PSTBLAST position-specific scor¬ 
ing matrices and PSIPRED (Jones 1999) secondary 
structure predictions, to build profile-Hidden Markov 
Models (HMMs) of the target sequences. The profile- 
HMMs are then compared to a fold library of profile- 
HMMs that have been built for proteins with known 
structure. Once target-template alignments have been 
constructed, then 3D models of the target structure can 
be built from the coordinates of the template structures 
(Fig. 3) (Table 2). 

A user-friendly template-based modeling predic¬ 
tion pipeline - IntFOLD-TS - is now available which 
combines profile-profile alignment outputs from sev¬ 
eral different methods to produce up to 40 alternative 
fold recognition models, which are subsequently 
ranked utilizing the ModFOLDclust2 (McGuffin and 
Roche 2010) model quality assessment method. The 
IntFOLD server (Roche et al. 2011) also integrates the 
ModFOLD 3.0 method for model quality assessment, 
the DISOclust 2.0 method for protein intrinsic disorder 
prediction, the DomFOLD 2.0 method for domain 
boundary prediction, and the FunFOLD 1.0 method 
for the prediction of ligand-binding site residues. 

Template-Free Modeling 

Free modeling has also been referred to as ah initio 
modeling, modeling from first principles, or de novo 
modeling. Template-free modeling is the prediction of 
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Protein Structure Prediction and Structural Annotation of Proteomes, Fig. 3 Template-based modeling pipeline, such as 
that used by IntFOLD-TS 





Protein Structure Prediction and Structural Annotation of 
Proteomes, Table 2 Some of the top publicly available tem¬ 
plate-based modeling servers in CASP9 


Server 

URL 

BioSerf 

http://bioinf.cs.ucLac.uk/bio serf/publicjob 

HFIpred 

http://toolkit.lmb.uni-muenchen.de/hhpred 

IntFOLD 

http://www.reading.ac.uk/bioinf/IntEOLD/ 

I-TASSER 

http://zhanglab.ccmb.med.umich.edu/I-TASSER/ 

LOMETS 

http://zhanglab.ccmb.med.umich.edu/LOMETS/ 

PCONS 

http://pcons.net/ 

Phyre2 

http://www.sbg.bio.ic.ac.uk/phyre2/html/page. 

cgi?id=index 

pro-sp3- 

TASSER 

http://cssb.biology.gatech.edu/skolnick/ 

webservice/pro-sp3-TASSER/index.html 


a protein’s tertiary structure from sequence without the 
use of a protein structure as a template. The use of free 
modeling is necessary when a template cannot be 
found to predict the structure of the protein. Free 


modeling usually carries out conformational searches 
under the assistance of a designed energy function, 
which generates several structural decoys based on 
possible conformations that the hnal model is selected 
from. Energy functions for free modeling are generally 
classified into physics-based energy functions and 
knowledge-based energy functions, which depend on 
the use of statistics from structurally elucidated pro¬ 
teins (Table 3) (Lee et al. 2009). 

Physics-Based Methods 

Physics-based methods are defined as methods that 
utilize interactions between atoms based on quantum 
mechanics and electrostatic interactions. Physics- 
based methods also utilize a small number of critical 
parameters, which include electron charge and 
Planck’s constant, with atoms additionally described 
by their atom type, where only the number of atoms is 
relevant. The use of quantum mechanics has not yet 
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Protein Structure Prediction and Structural Annotation of 
Proteomes, Table 3 Some of the top publicly available free 
modeling web servers in CASP9 


Server 

URL 

chunk-TASSER 

http://cssb.biology.gatech.edu/skolnick/ 

webservice/chunk-TASSER/index.html 

MULTICOM- 

NOVEL 

http://casp.met.missouri.edu/nncon. 

html 

QUARK 

http://zhanglab.ccmb.med.umich.edu/ 

QUARK 

RAPTORX-FM 

http://raptorx.uchicago.edu 

Roberta 

http://robetta.bakerlab.org/ 


been used to predict even small structures, due to the 
computational resources needed for such calculations. 
Without the use of quantum mechanics, the most prac¬ 
tical starting point for free modeling is to utilize 
a compromised force field, with a large number of 
selected atom types. Within each atom type the 
physicochemical properties are calculated from infor¬ 
mation on crystal packing or quantum mechanical 
theory. Examples of all-atom physics-based force 
fields include AMBER, CHARMM, and OPES, 
which also contain terms in relation to bond length, 
angles, torsion angles, van der Waals, and electrostatic 
interactions (Lee et al. 2009). 

Knowledge-Based Methods 

Knowledge-based methods are generally more suc¬ 
cessful and utilize empirical energy terms, derived 
from structurally elucidated proteins deposited in the 
PDB. These energy terms are further divided into two 
sub-classifications. The first energy term encompasses 
genetic and sequence-independent terms, which 
include hydrogen bonding and the local backbone 
stiffness of a polypeptide chain. The second energy 
term encompasses amino acid or protein sequence- 
dependent information, which include: pairwise 
residue contact potentials, distance dependent atomic 
contact potentials and secondary structure propensity. 
Despite most knowledge-based methods utilizing 
secondary structure propensities, local structures may 
be rather difficult to reproduce when modeling. One 
way in which this problem can be counteracted is the 
utilization of secondary structure fragments, acquired 
from sequence or profile alignments, for the initial 
model construction. This is also advantageous as 
the entropy of the conformational search is reduced 
(Lee et al. 2009). The fragment assembly method for 


knowledge-based free modeling was utilized in 
ERAGEOLD (Jones 2001), ROSETTA and the 
QUARK servers (Table 3). 

Model Quality Prediction 

Once a selection of models has been produced for a 
target sequence, the quality of each model must then be 
assessed. Being provided with details about the poten¬ 
tial errors in 3D models arguably makes them more 
useful in the context of guiding experimental work. 
Model quality assessment programs (MQAPs) are used 
for the prediction of 3D model quality of proteins 
(McGuffin and Roche 2010). MQAPs can be classified 
into two categories: single model-based methods, 
which are able to assess the quality of individual 
models, and the clustering-based methods, which com¬ 
pare multiple models against each other. According to 
recent GASP experiments, the clustering-based MQAP 
methods, such as ModPOLDclust2 (McGuffin and 
Roche 2010), are currently the most accurate methods 
if multiple alternative models can be obtained. How¬ 
ever, the single model methods, which produce abso¬ 
lute scores for individual models, are potentially more 
useful if few models are available. 

Structural Annotation of Genomes 

The structural annotation of genomes is extremely 
important for the functional determination of 
the encoded protein sequences. Traditionally, 
► functional annotation of protein sequences has 
been carried out using simple sequence alignment 
methods. With the rapid growth in the number of 
genome projects, the need for accurate annotation has 
also increased. However, sequence-based structural 
annotation is inadequate for protein sequences, which 
have a low pairwise sequence identity (<30%) to 
proteins with known structures. Thus, structural 
annotation methods, which attempt to carry out fold 
recognition on a genomic scale, can help to increase 
the level of annotation beyond the twilight zone of 
sequence identity. 

Structural Annotation Databases 

Several databases have been developed providing 
structural annotations of genomes, including Gene3D 
(Yeats et al. 2008), SUPERFAMILY (de Lima Morais 
et al. 2011), and the Genomic Threading Database 
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(McGuffin et al. 2004). These databases have been 
constructed via the use of sequence-based searching 
methods and fold recognition in order to structurally 
annotate entire proteomes. Gene3D provides an up-to- 
date comprehensive database for structural and func¬ 
tional annotations of the majority of available protein 
sequences, including UniProt, RefSeq, and IntegrS. 
Structural annotations of genomes, are generated via 
a detailed search of the GATH structural database 
profile-HMM library. Functional annotation is also 
carried out by the Gene3D database utilizing GO 
assignments, FunCat, KEGG, active site data, disor¬ 
dered predictions utilizing DISOPRED2, and data 
from microarray experiments (Yeats et al. 2008). 

Methods for Structural Annotation 

Both intensive fold recognition methods such as 
IntFOLD-TS and rapid methods such as HHsearch 
(Soding 2005) can be utilized for stmctural annotation 
of entire proteomes. McGuffin et al. stmcturally anno¬ 
tated the entire human proteome utilizing the 
mGenTHREADER method (McGuffin and Jones 2003) 
in just over 24 h by harnessing 515 CPUs. This study 
provided the proof of concept that intensive fold recog¬ 
nition can be carried out for rapid proteome annotation 
via the use of grid technology (McGuffin et al. 2006). 

Summary 

Computational methods for prediction of protein 
structure are essential in the post-genomic era as exper¬ 
imental-based methods are unable to keep pace with 
the speed of genome sequencing. The production of 3D 
protein models, along with the structural annotation of 
entire proteomes, allows for both the interpretation of 
the proteins general function and the prediction of the 
binding site residues. These predictions may be 
exploited subsequently in in silico studies for the 
design of novel proteins for both medical and industrial 
applications, along with the development of drugs that 
will act as agonists or inhibitors for these proteins in 
order to modify their activity in disease pathways. 

Cross-References 

► Alignment of Protein Sequences 

► Comparative Protein Structure Modeling 


► Domain Structure Classifications 

► Homology Modeling of Protein Structures 

► Macromolecular Crystallography: Overview 

► Protein NMR - Introduction 

► Protein Secondary Structure Prediction in 2012 

► Protein Structural Models - Evaluating Quality 

► Protein Structure Comparison Methods 

► Protein Structure Prediction 

► Proteins: Relationship Among Divergence of 
Sequence, Structure, and Eunction 

► Statistical Analysis and Prediction of Protein- 
Protein Interactions and Binding Sites 
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Protein Structure: Potential Folds 

Michael I. Sadowski 

Division of Mathematical Biology, MRC National 
Institute for Medical Research, London, UK 

Synonyms 

Dark matter folds; New protein folds; Possible topol¬ 
ogies of proteins 

Definition 

Protein folds that do not violate any known principles 
of protein structure but which have not been observed 
to exist in nature. 

Basic Characteristics 

The outcome of the ► structural genomics projects of 
the late 1990s and early 2000s has been that the obser¬ 
vation of natural proteins with significantly different 
structures to any previously known is now very rare. 
Contrastingly using computational sampling methods 
it is possible to generate sets of apparently reasonable 
protein structures which are not significantly similar to 
any that are already known. 

Why are new folds such a rare occurrence? The 
reason is apparently that evolution has reused many 


designs on a grand scale: a set of roughly ten basic 
designs account for about 40% of known domain struc¬ 
tures according to the major '>• domain structure family 
classifications. The reasons for this are still being 
investigated but several factors have been identified 
as likely to be causative. 

Two evolutionary reasons can be cited: mechanis¬ 
tically it seems that new structures and functions arise 
largely via processes of gene duplication, leading to 
large-scale copying of similar structures and many 
observations of internal duplications and similarities 
within structures (reviewed in Taylor and Sadowski 
2010). For example, there is a strong prevalence of 
alpha/beta units in protein structures and other structural 
motifs known as supersecondary structures are very 
highly represented (e.g., beta-beta-beta-alpha units, 
split beta-alpha-beta units, Greek keys, meanders); anal¬ 
ysis of novel folds suggests that they are less dominated 
by these features (Femandez-Fuentes et al. 2010) 

From the point of view of natural selection there is 
apparently very little pressure to explore structure 
space. Most of the superfold structures have shown 
a surprising ability to perform a diversity of functions, 
showing that in general novel functions do not require 
novel structures. It seems therefore that evolution is 
mechanistically biased toward reuse of known struc¬ 
tures and that these are frequently able to provide new 
functions, limiting the sampling of possible structures 
by evolution. 

Physical constraints on the exploration of stable 
structures have also been suggested as a reason for 
limited sampling of possible structures. Theoretical 
work on idealized lattice proteins has shown that 
a highly skewed distribution of sequences per structure 
will be found because some structures are more 
designahle - they will have more sequences which 
stably fold into them than others as a consequence of 
having a more nearly unique contact map (Li et al. 
1998). This suggests that many plausible structures 
may not have solutions in sequence space, or have 
only a limited set. This would make it more difficult 
to hnd sequences for these structures (both for evolu¬ 
tion and for humans), limiting their usefulness. More 
importantly for biology a limited sequence space 
means less robustness to evolutionary change and 
translational errors. Less designahle proteins would 
therefore be unable to perform as wide a range of 
functions and could not be expressed at high levels in 
the cell without producing many mutated copies. 
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Protein Structure: Potential Folds, Fig. 1 An illustration of 
protein topological representation. For certain types of protein 
(such as the three-layer alpha/beta/alpha structure depicted here) 
a topological representation can capture essential features of the 
layout and connectivity of the secondary structures in such a way 
as to allow enumeration of possible structures. Deriving 


topological descriptions for all known structures allows 
unknown but plausible topologies to be generated. The left 
panel shows a topological representation of the E. call chemo- 
taxis protein CHY with a schematic but more detailed picture of 
a three-dimensional representation in the right panel (generated 
with Pymol) 


potentially leading to accumulation of toxic aggre¬ 
gates. Thus less designable folds, even once found, 
could be displaced by those which are more designable 
unless they are uniquely able to perform a useful 
function. 

These arguments, in addition to experimental 
successes in designing proteins of novel fold, suggest 
that there may be many proteins with novel structures 
that have not been used by nature which could be 
accessible to ► protein design. What constitutes 
a truly novel fold depends on the fold definition 
which is not always agreed on by different authorities, 
but this can to some degree be circumvented by using 
a topological description. In this formalism the protein 
chain is abstracted to a series of elements of secondary 
structure and the connections between them, with 
a coarse-grained representation of the spatial relation¬ 
ships sometimes retained (Fig. 1). This provides a sim¬ 
ple way to enumerate possible structures. On this level 
there are known features which are rarely observed, in 
particular crossing loops and left-handed connections 
between elements of secondary structure, allowing cer¬ 
tain folds to be eliminated as physically unlikely. These 
rules reduce the space significantly; however, for small 
proteins, there remains roughly a tenfold excess of 
unseen folds (Taylor et al. 2009). Since the number of 
possible topologies increases combinatorially with the 
number of elements this increases rapidly once the 
number of elements increases to eight or more. 

Even without such enumeration we can generate 
plausible fold candidates by the mechanism of circular 


permutation, in which the termini are joined and 
each loop in turn can be cut to generate a new fold, 
a process which can be engineered and also occurs in 
real protein evolution. Since this operation does not 
convert a “good” topology into a bad one except in 
a few cases where only two members of the cyclic 
permutant group have no crossing loops these are 
all likely to be available and have related sequence 
spaces, suggesting that they offer a large variety 
of potential new folds since in general only 
a small number of circular permutants of even highly 
populated fold groups are found in nature (Fig. 2). 
Similarly the less natural operation of chain reversal 
will not in general violate any of the rules of protein 
topology and this provides another large set of poten¬ 
tial folds. 

Systematic attempts to generate possible folds and 
determine which are plausible have only recently 
begun. Enumerating possible folds combinatorially 
and comparing these to a large database of known 
topologies shows that while the number of plausible 
folds with a small number of elements is very close to 
the number observed the enormous number of possi¬ 
bilities for larger numbers of elements vastly out¬ 
weighs the number of observed folds. Comparison of 
decoy models based on these to the known protein 
databank using ► structure comparison methods 
confirms this (Taylor et al. 2009). An alternative 
method for generating folds by molecular dynamics 
sampling also found possible structures that are highly 
dissimilar to those already known; in this case the 
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Protein Structure: Potential Folds, Fig. 2 Circular 
permutants of the 3chy topology. Ten topologies are shown, 
beginning and ending with 3chy. Topologies with known struc¬ 
tures are depicted in red, unknown in gray. For this arrangement 
five of nine permutants have not so far been observed 


unknown structures were generally found to be 
protein-like but with higher mean contact-order, 
suggesting that selection for speed of folding might 
be part of the explanation for biased sampling of struc¬ 
ture space (Cossio et al. 2010). Attempting to make 
sequences which fold to these new structures is an 
interesting problem for ► protein design methods and 
should help to close the gap between known and 
theoretically possible structures. 


Cross-References 

► Domain Structure Classifications 

► Protein Design for Biosensors 

► Protein Structure Comparison Methods 

► Structural Genomics 
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Synonyms 

Accuracy of protein synthesis; Translation; Transla¬ 
tional Fidelity 

Definition 

Translation is the process by which the sequence of 
bases in an mRNA is decoded into the sequence of 
amino acids in a peptide or protein. 

Ribosomes are large ribonucleoprotein particles that 
synthesize proteins. 

tRNAs (transfer RNAs) are small RNAs (75-85 nucle¬ 
otides) that function as adapters between the codon of 
the mRNA and the amino acid specified by the codon. 
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Protein Synthesis: Translational Fidelity, Fig. 1 X-ray 
crystal structures of the 70S ribosome from Thermus 
thermophilus (a) and the SOS ribosome from Saccharomyces 
cerevisiae (Ben-Shem et al. 2010) (b) Proteins and rRNA in 
the small ribosomal subunit are colored dark and light blue. 



respectively, and dark and pale yellow, respectively, in the 
large ribosomal subunit. Expansion segments of the eukaryotic 
ribosome are in red (Figure courtesy of M. Yusupov and L. 
Jenner, IGBMC, Strasbourg) 


EF-Tu (Elongation Factor Tu) is a GTP-binding pro¬ 
tein that delivers aminoacylated tRNA to the rihosome. 


Basic Characteristics of Translation 

Translation of the nucleotide sequence of messenger 
RNA (mRNA) into protein takes place on the rihosome 
and requires aminoacylated tRNA (aa-tRNA) and 
a translation elongation factor Tu (EF-Tu). Different 
combinations of base triplets (codons) in mRNA code 
for each of the 20 standard amino acids that may be 
incorporated into the polypeptide chain. Each codon is 
recognized by a complementary anticodon on the 
tRNA to which the amino acid is attached. Attachment 
of the amino acid to the tRNA (aminoacylation) is 
catalyzed by aa-tRNA synthetases which are specific 
for their respective amino acid and tRNA (or group of 
isoaccepting tRNAs). Ribosomes are large ribonucleo- 
protein complexes consisting of three or four RNAs 
and 50-80 proteins, depending on organism and organ¬ 
elle. Ribosomes from bacteria and eukaryotes have 
similar architecture, although eukaryotic ribosomes 
are larger and have higher molecular mass because 
they have larger rRNAs and more proteins. Mitochon¬ 
drial ribosomes resemble bacterial ribosomes, but con¬ 
tain smaller rRNAs and many more proteins. 
Ribosomes consist of two subunits of different size, 
the small subunit (denoted as 30S in bacteria and 40S 
in eukaryotes) and the large subunit (508 in bacteria 


and 60S in eukaryotes) (Fig. 1). Ribosomes have three 
binding sites for tRNA: the A site, where aa-tRNA 
binds and mRNA is decoded; the P site where 
peptidyl-tRNA resides during elongation of peptide 
chain; and the E site which serves as an exit site for 
tRNA leaving the ribosome during ► tRNA transloca¬ 
tion. Recognition of the mRNA codon by the antico¬ 
don of aa-tRNA takes place in the decoding site on the 
small ribosomal subunit. The peptidyl transferase cen¬ 
ter where peptide bonds are made is localized on the 
large ribosomal subunit. There are a number of con¬ 
nections between the subunits, called intersubunit brid¬ 
ges. The bridges keep the subunits together and 
stabilize the tRNAs in their binding positions. 

Speed and fidelity of translation determine cell growth 
and the quality of newly synthesized proteins. The rate 
of protein elongation is 1-25 s“^ per codon at 37° C. 
Error frequencies in vivo range between 10“^ and 10“^ 
depending on the organism, concentrations, and nature of 
tRNAs that perform misreading, as well as the mRNA 
context (Drummond and Wilke 2009). The overall error 
rate of protein synthesis reflects the accumulated mis¬ 
takes from all of the steps involved in translation, of 
which tRNA aminoacylation by aa-tRNA synthetases 
and the decoding on the ribosome are the most error- 
prone. Incorporation of erroneous amino acids into 
a protein may affect the fitness of the organism by reduc¬ 
ing the amount of active proteins, producing proteins that 
are toxic for the cell, or increasing misfolding. To prevent 
the attachment of incorrect amino acids, aa-tRNA 
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Protein Synthesis: Translational Fidelity, Fig. 2 Kinetic 
mechanism of EF-Tu-dependent aa-tRNA binding to the 
A site. Kinetically resolved steps are indicated by rate constants 
ki-kv (forward reactions) and k_i-k _3 (backward reactions). 


The rate of codon reading (k^ead, k-readi likely a readily revers¬ 
ible step) could not be determined by rapid kinetics. Rate con¬ 
stants of the two chemical steps that are rate-limited by the 
preceding step are designated koxp and kpep 


synthetases and ribosomes rigorously control the 
aminoacylation and the decoding reactions, respectively, 
through numerous quality control mechanisms. 

Key Steps in Quality Control of Decoding 

Aa-tRNA is delivered to the decoding site of the ribo¬ 
some by an elongation factor which is called EF-Tu in 
bacteria and eEFlA in eukaryotes. EF-Tu (eEFlA) is 
a GTP-binding protein which in the GTP-bound form 
binds any aa-tRNA to form a tight ternary complex. 
EF-Tu-dependent A-site binding of aa-tRNA com¬ 
prises a number of steps, as revealed by biochemical 
analysis and '*■ kinetics (► stopped-flow, quench-flow, 
single-molecule methods; Rodnina 201 1) (Fig. 2). The 
first step is the codon-independent formation of an 
unstable initial binding complex. During initial com¬ 
plex formation, ternary complexes bind to the ribosome 
in a stochastic fashion and dissociate until one with the 
anticodon matching the A-site codon binds and 


recognizes the codon (“codon recognition”). This has 
two important consequences. One is the stabilization of 
the complex by the formation of base pairs between 
codon and anticodon as well as by interactions with the 
ribosome. The other is stimulation of the GTPase activ¬ 
ity of EF-Tu by more than five orders of magnitude 
(“GTPase activation”). GTP hydrolysis and Pi release 
lead to an extensive stractural change of EF-Tu from 
the GTP- to the GDP-bound form by which it loses the 
affinity for aa-tRNA. The aminoacyl end of aa-tRNA 
set free from EF-Tu-GDP moves into the peptidyl trans¬ 
ferase center (“accommodation”) and takes part in pep¬ 
tide bond formation. EF-Tu-GDP dissociates from the 
ribosome and is reactivated by elongation factor Ts 
(EF-Ts; eEFlB in eukaryotes) which acts as a guanine 
nucleotide exchange factor (GEF) and catalyzes the 
exchange of GDP for GTP. The functional cycle of 
eukaryotic eEFlA is probably similar. Because of the 
high cellular concentrations of EF-Tu or eEFlA, prac¬ 
tically all aa-tRNA is bound in the ternary complex and 
immediately available for entering the ribosome. 
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Protein Synthesis: 
Translational Fidelity, 
Fig. 3 Effect of codon¬ 


proofreading (dissociation ky, 
accommodation ks). Cognate 
codons are UUC and UUU 
(red bars). Near-cognate 
codons contain a single 
mismatch in the first (CUC, 
CUU, GUC, yellow), second 
(UCC, green), or third (UUA, 
UUG, blue) codon position 
(Gromadski et al. 2006) 
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Faithful translation of the mRNA by the ribosome 
depends on three selection stages (Fig. 2): the prefer¬ 
ential rejection of incorrect ternary complexes prior to 
GTP hydrolysis at the initial selection stage; the pref¬ 
erential rejection of near-cognate aa-tRNAs at the 
proofreading stage after GTP hydrolysis; and the pref¬ 
erential hydrolysis of near-cognate peptidyl-tRNAs in 
the P site by termination factors. During initial selec¬ 
tion most of the non-cognate ternary complexes (i.e., 
those whose anticodon does not fit the codon in more 
than one position) and a large majority of near-cognate 
ones (those with a single mismatch) are rejected; the 
error frequency for the near-cognate aa-tRNA during 
initial selection is about 10~ -10~ . The near-cognate 
aa-tRNAs that passed initial selection are rejected at 
the proofreading phase, which alone accounts for error 
frequency of 10“'-10“^ (Rodnina 2011). Finally, the 
hydrolysis of erroneous peptidyl-tRNA reduces the 
observed error frequency of synthesis of a full-length 
protein about tenfold (Ortiz-Meoz et al. 201 1). 

Kinetic Discrimination 

Kinetic studies have identihed four elemental reactions 
that have different rates for cognate and near-cognate 
aa-tRNAs (Fig. 2). The ribosome controls the differ¬ 
ences in the stabilities of the codon-anticodon com¬ 
plexes (k_ 2 , ky) and specifically accelerates the rates of 


GTPase activation (k 3 ) and accommodation (ks) of 
correct substrates, implicating codon-anticodon 
duplex stability and ► induced ht as sources of selec¬ 
tivity. Kinetic partitioning between GTPase activation 
and ternary complex dissociation strongly favors the 
acceptance of cognate and the rejection of near¬ 
cognate ternary complexes. Likewise, cognate 
aa-tRNA is preferentially accommodated during 
proofreading, while near-cognate tRNA is mostly 
rejected. Single mismatches at any position of the 
codon-anticodon complex result in slower forward 
reactions and a uniformly 1000-fold faster dissociation 
of ternary complexes from the ribosome (Fig. 3). Fligh- 
fidelity tRNA selection is achieved by a conformational 
switch of the decoding site between accepting and 
rejecting modes, independent of the thermodynamic 
stability of the respective codon-anticodon complexes 
or the position of mismatch (Rodnina 2011). 

Structural Basis of tRNA Selection 

► X-ray crystal structures revealed how the ribosome 
monitors the codon-anticodon duplex (Ramakrishnan 
201 1) (Fig. 4). Bases A1492, A1493, and G530 of 16S 
rRNA change their positions and form A-minor inter¬ 
actions with the minor grove of the first two base pairs 
of the codon-anticodon complex in a fashion which is 
specific for Watson-Crick base pair geometry, but 
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Protein Synthesis: Translational Fidelity, Fig. 4 Structural 
basis of aa-tRNA selection, (a) Interactions of the ribosome with 
the codon-anticodon duplex. Top panel, in the first codon posi¬ 
tion, A1493 hinds in the minor groove of the A36-U1 base pair. 
Middle panel, in the second position, G530 and A1492 monitor 
the A35-U2 base pair. Bottom panel, the third (wobhle) G34-U3 
base pair (Reproduced from Ogle et al. 2001, with permission), 
(b) Local rearrangements at the decoding site. Top left panel, the 
30S subunit in the absence of ligands. Top right, 30S suhunit 
with paromomycin (PAR) bound to the decoding center. Bottom 
left, 30S subunit in the complex with an mRNA-anticodon stem 
loop (ASL) complex. Bottom right, 30S subunit with 


paromomycin and the codon-anticodon fragment. (Reproduced 
from Ogle et al. 2001, with permission), (c) Global 
rearrangements of the 30S subunit in response to codon recog¬ 
nition. The 30S suhunit in complex with A-site tRNA anticodon 
stem loop. Red arrows indicate the movement of domains during 
the transition to the closed 30S conformation. H, rRNA helices, 
S4, S5, S12 are ribosomal proteins (Reproduced from Ogle et al. 
2003, with permission), (d) Close-up of selected 30S elements 
around the decoding center in the presence of antibiotics 
paromomycin (green) and streptomycin (pink) (Reproduced 
from Ogle et al. 2003, with permission) 


sequence independent. The local structural changes at 
the decoding site constitute a part of the induced fit 
rearrangement of the ribosome. Mismatches distort the 
Watson-Crick geometry of the duplex, thereby 
impairing the interactions with decoding-site residues. 


Local changes within the decoding site lead to a global 
change of the 30S subunit from an open to a closed 
conformation, which involves a movement of the 
shoulder and platform domains relative to the rest of 
the 30S subunit (Fig. 4). Near-cognate aa-tRNA 
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Protein Synthesis: Translational Fidelity, Fig. 5 Distortion 
of aa-tRNA during decoding, (a) Conformational changes dur¬ 
ing codon reading, (b) Schematic representation of structural 
rearrangement induced by codon-anticodon pairing. The match 
is sensed by decoding-center nucleotides 530, 1492, and 1493 (1). 
Codon recognition triggers domain closure of the 30S subunit (2), 


bringing the shoulder domain into contact with EF-Tu and shifting 
the P loop (residues 230-237) of domain 2 (3). This changes the 
conformation of the acceptor end of the tRNA (4), disrupting its 
contacts with switch I, which becomes disordered (5), opening the 
hydrophobic gate to allow His84 to catalyze GTP hydrolysis 
(Reproduced from Schmeing et al. 2009, with permission) 


induces some of these structural changes as well, hut 
the resulting global interaction pattern is different: the 
closed conformation of the 30S subunit does not form, 
the shoulder of the subunit does not move, and the head 
moves in a different direction (Ramakrishnan 2011). 

The local and global conformational changes at the 
decoding center are affected by antibiotics that induce 
misreading (Fig. 4). The antibiotic paromomycin 
induces the conformational flip at the decoding center 
even in the absence of correct codon-anticodon inter¬ 
action. Formation of the complex stabilizes the binding 
of near-cognate aa-tRNA and increases the rate of GTP 
hydrolysis in the near-cognate ternary complex almost 
to the level of the cognate complex. Restricting the 
flexibility of the 30 S subunit by the antibiotic strepto¬ 
mycin affects the GTPase activation step primarily by 
decreasing the rate of the reaction with the cognate, 
and increasing the rate with the near-cognate, ternary 
complex, resulting in an almost complete loss of selec¬ 
tivity (Ramakrishnan 2011; Rodnina 2011). 

Following codon recognition and prior to the 
release from EF-Tu, the aa-tRNA transiently assumes 
a conformation that is more open or distorted com¬ 
pared to the unbound tRNA. The distortion in the 
tRNA involves an untwisting accompanied by 
a widening of the groove in the anticodon-stem as 
well as a movement of the D loop of tRNA 
(Ramakrishnan 2011) (Fig. 5). The 3' end of tRNA is 
pulled away by almost 6 A compared to the structure 


of the ternary complex in the absence of the ribosome. 
The distortion at the tRNA is necessary (although not 
sufficient) for the activation of GTP hydrolysis on the 
ribosome. For rapid GTP hydrolysis to take place, 
a histidine residue of EF-Tu (His84 in EF-Tu from 
Escherichia coli) has to change its position to stabilize 
the hydrolytic water molecule for the attack on the 
gamma-phosphate of GTP. The tRNA distortion 
affects the relative orientation between tRNA and 
EF-Tu, which leads to a rearrangement in EF-Tu and 
the stabilization of the catalytically active orientation 
of Flis84 by A2662 of the sarcin-ricin loop of 23S 
rRNA on the large ribosomal subunit, thereby stabiliz¬ 
ing the GTPase transition state. Helices h8 and hl4 of 
the small ribosomal subunit are potential negative reg¬ 
ulators of the GTPase. 

During accommodation, the 3' end of aa-tRNA has 
to move a distance of almost 70 A within the ribosome 
from its binding site on EF-Tu into the peptidyl trans¬ 
ferase center. Computer simulation of accommodation 
suggested a stepwise movement of the tRNA through 
a corridor of conserved rRNA bases which engage in 
various interactions with the tRNA during this move¬ 
ment (Sanbonmatsu et al. 2005). The rigidity of the 
tRNA and the correct alignment of the tRNA with 
respect to ribosomal proteins S13, S19, L16, L25, 
L27, L31 (Jenner et al. 2010) are important for the 
preferential accommodation of the cognate, and the 
rejection of the near-cognate, aa-tRNA. 
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Cross-References 

► Aminoacyl-tRNA Synthetases 
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► Molecular Recognition: Lock-and-Key, Induced Fit, 
and Conformational Selection 

► Stopped-Flow Techniques 

► Translocation on the Ribosome 


References 

Ben-Shem A, Jenner L, Yusupova G, Yusupov M. Crystal struc¬ 
ture of the eukaryotic ribosome. Science. 2010;330:1203-9. 

Drummond DA, Wilke CO. The evolutionary consequences of 
erroneous protein synthesis. Nat Rev Genet. 2009; 10:715-24. 

Gromadski KB, Daviter T, Rodnina MV. A uniform response to 
mismatches in codon-anticodon complexes ensures ribo- 
somal fidelity. Mol Cell. 2006;21;369-77. 

Jenner L, Demeshkina N, Yusupova G, Yusupov M. Structural 
rearrangements of the ribosome at the tRNA proofreading 
step. Nat Struct Mol Biol. 2010;17:1072-8. 

Ogle JM, Brodersen DE, Clemons WM, Jr., Tarry MJ, Carter 
AP, Ramakrishnan V. Recognition of cognate transfer RNA 
by the 30S ribosomal subunit. Science 2001;292:897-902. 

Ogle JM, Carter AP, Ramakrishnan V. Insights into the decoding 
mechanism from recent ribosome structures. Trends 
Biochem Sci 2003;28:259-66. 

Ortiz-Meoz RF, He SL, Zaher HS, Green R. Sense and nonsense 
recognition by the ribosome. In: Rodnina MV, Green R, 
Wintermeyer W, editors. The ribosome. Structure, function 
and dynamics. Wien, New York: Springer; 2011. 

Ramakrishnan V. Structural studies on decoding, termination and 
translocation in the bacterial ribosome. In: Rodnina MV, 
Green R, Wintermeyer W, editors. The ribosome. Structure, 
function and dynamics. Wien, New York: Springer; 2011. 

Rodnina MV. Mechanisms of decoding and peptide bond for¬ 
mation. In: Rodnina MV, Green R, Wintermeyer W, editors. 
The ribosome. Structure, function and dynamics. Wien, New 
York: Springer; 2011. 

Sanbonmatsu KY, Joseph S, Tung CS. Simulating movement of 
tRNA into the ribosome during decoding. Proc Natl Acad Sci 
USA. 2005;102:15854-9. 

Schmeing TM, Voorhees RM, Kelley AC, Gao YG, Murphy F, 
Weir JR, Ramakrishnan V. The crystal structure of the ribo¬ 
some bound to EF-Tu and aminoacyl-tRNA. Science. 
2009;326:688-94. 


Protein Trans-splicing 

► Protein Segmental Labeling 

► Protein Splicing - Computational Studies 


Protein-Assisted Electron Transfer 

► Electron Transfer Through Proteins 


Protein-Carbohydrate Interactions 

► Complement Factor H and Its Ligands: A Multidis¬ 
ciplinary Approach to Interactions and Modeling 

► X-Ray Diffraction and Crystallography of Oligosac¬ 
charides and Polysaccharides 


Protein-Ligand Dynamics 

Jeffrey W. Peng 

Department of Chemistry and Biochemistry, 
University of Notre Dame, Notre Dame, IN, USA 

Synonyms 

Ligand motions relevant for protein binding 

Definition 

Protein-ligand dynamics studies, in the context of drug 
design, are an effort to learn how the internal motions 
of a flexible ligand affect its activity toward a macro- 
molecular target, typically a protein. 

Introduction 

Structure-activity-relationships (SAR) are a guiding 
force of drug design. These relationships correlate the 
structural variations within a ligand series with their 
biological activity toward a macromolecular target, 
typically a protein. Ideally, the correlations suggest 
structural modifications that lead to steady improve¬ 
ments in activity. 

The SAR concept is eminently reasonable when 
activity can relate to a single structure, as with rigid 
ligands. Yet, many bioactive ligands pertinent to drug 
design are flexible molecules. Examples are peptidic 
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ligands used to interfere with protein-protein interac¬ 
tions, and ligands derived from fragment-linking strat¬ 
egies (Hajduk and Greer 2007). These ligands sample 
multiple conformations and often reorganize upon pro¬ 
tein binding (Perola and Charifson 2004). These 
dynamics can blur the structure-activity link by com¬ 
plicating the prediction of bioactive conformations and 
the rank-ordering of binding affinities. It is therefore 
benehcial to complement traditional SAR with exper¬ 
imental studies aimed at ligand motions relevant for 
biological activity. Such studies would then furnish 
flexibility-activity relationships (FAR) (Detlefsen et 
al. 1999; Namanja et al. 2010). 

Toward this end, a powerful approach is the 
measurement of Nuclear Magnetic Resonance 
(NMR) spin relaxation rates. These rates measure the 
rapidity by which bulk magnetization relaxes back 
to equilibrium after a perturbation. Familiar 
examples include the longitudinal or “spin-lattice” 
relaxation rate Ri = 1/Ti and the transverse or “spin- 
spin” relaxation rate R 2 = I/T 2 . For ligand dynamics 
studies, *H, '®F, and relaxation measurements are 
common. 

measurements are particularly appealing 
(Detlefsen et al. 1999; LaPlante et al. 2000) because 
of the prevalence of carbons in drug-like ligands and 
because the relaxation rates are interpretable in 
terms of the dynamics of the corresponding sites. 
The latter benefit is a consequence of the relaxation 
being dominated by local mechanisms attached to the 
site. These mechanisms include: (1) the 
heteronuclear dipole-dipole (DD) interactions 
between the and the directly bonded proton(s); 
(2) the chemical shielding anisotropy (CSA) of the 
X nucleus; (3) chemical exchange processes that mod¬ 
ulate the chemical shift. 

Current methods for measuring NMR relaxa¬ 
tion rate constants enable both a comprehensive and 
site-specific description of molecular dynamics, and 
thus open the way for serious investigations of ligand 
dynamics. Below, we discuss relaxation experi¬ 
ments to study ligand motions relevant for protein 
binding. We focus on those motional timescales for 
which spin relaxation is most sensitive; the ps (10~^^ s) 
-ns (10“^ s) timescale, and the ps (10“® s)-ms (10“^ s) 
timescale. Motions on the bridging timescales are 
better explored by other NMR parameters, such as 
residual dipolar couplings. 


Ligand Motions on ps-ns Timescale 

At equilibrium, the*^C-*H bond vector orientations of 
a ligand fluctuate relative to the external static field, Bq. 
These fluctuations stem from overall molecular 
tumbling and internal conformational dynamics (flexi¬ 
bility), and they produce fluctuating magnetic fields 
that impinge on the spins and cause relaxation. 
Accordingly, the relaxation rates for a -nucleus 
depend on the orientational dynamics of corresponding 
'^C-*H bond vector. Measuring relaxation rates for 
a free and receptor-bound ligand thus provide a means to 
correlate ligand dynamics with the consequences of 
forming the protein-ligand complex. 

Measurements of the free-state relaxation rates are 
straightforward - one measures the isolated ligands. 
To access the complementary bound-state relaxation 
rates, one then adds the protein. The ligand then 
undergoes, in the simplest case, a two-state exchange 
process between the free [L] and bound [PL] states 

kov[P] 


koFF 

The forward and backward rate constants, koN [P] 
and koFFj are related to the equilibrium fractions of 
free and bound species, Pf and Pb, by the detailed 
balance condition PFkoN[P] = koFFPn- Perturbations 
to Pf and Pb relax with net exchange rate constant 
kex = koN[P] + koFF- The bound ligand fraction can 
be expressed in terms of the total protein concentration 
Px, total ligand concentration L^-, and the equilibrium 
dissociation constant Xq as 

Pb = + (Pt/Lt) + (Kd/Lt)} 

- ^ )/{l + (Pt/Lt) + (Kd/Lt)}' - 4(Pt/Lt) 

( 2 ) 

Smaller correspond to tighter binding, and 
Xd = 10 pM serves as a rough boundary between 
tight and weakly binding ligands. 

A ligand subject to the exchange of Eq. 1 will yield 
exchange-averaged rates, Ravg- Generally, Ravg can 
depend on (1) the intrinsic relaxation rates of the free 
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and bound states, Rp and Rp; (2) the chemical shift 
differences between the two states, Am = YBo(5p—§ b); 
(3) Pp and Pb; (4) and kgx- The functional dependence 
and subsequent extraction of bound-state information 
can be messy, even for the humble two-state exchange 
process. Fortunately, bound-state relaxation rates (and 
thus information concerning bound-state flexibility) 
are readily extracted for two practical scenarios: tight 
binders and weak binders. We discuss each in turn. 

We first consider the tight-binding scenario, in 
which A'd < 10 pM and Py > Lx>>A'd. Under these 
conditions, the bound ligand fraction Pb is ^ 1 , so that 
Ravg reflects only the bound state. Then bound and free 
measurements are cleanly separated, and their compar¬ 
ison is trivial. Tight-binding ligands are essentially part 
of the protein, hence, they share the latter’s concentra¬ 
tion and basic relaxation properties. Thus, to selectively 
detect the ligand NMR signals, the ligand and protein 
should have different isotope labeling schemes, such as 
labeling of the ligand and no labeling of the protein. 
Standard *H-detected heteronuclear relaxation pulse 
schemes would then automatically edit for the ligand. 
For ligands bound to larger protein targets (Mr > 
20 , 000 ), protein perdeuteration is advisable to reduce 
intermolecular *H-*H dipolar cross-relaxation that 
would decrease experimental sensitivity. 

The other simplifying scenario is weak binding, in 
which Ky) > 10 pM with a molar excess of ligand. 
Lx > Px. These conditions are common in fragment- 
based NMR screening, and NMR studies of protein- 
protein interactions that use small peptides as proxies 
of one of the protein interaction sites. Critically, weak 
binding promotes fast binding exchange on the chemical 
shift timescale (kgx > Am). In this special case, the 
observed relaxation rates Ri,avg and R 2 ,avg have the 
simple forms 

Ri,avg = PpRi.p + PbRi.b 

Ri.AVG = PpR2.P + PbR 2,B + Rex 

Ri f, R 2 _f and R 13 , R 2 ,b reflect the intrinsic rates of 
the free and bound states caused by ps-ns bond vector 
dynamics. Rgx is an additional contribution to R 2 ,avg 
from ps-ms chemical exchange (vide infra). For free- 
precession and fast exchange (kgx > Am), Rgx is 


While fast exchange may be mathematically con¬ 
venient, it may not in fact prevail. One can then 
“nudge” kgx toward faster exchange, by increasing 
the amount of protein, raising the temperature, or 
both. Equation 3 shows that the fast-exchange Ravg 
depends on the bound and free states. Ri,avg is the 
simplest case - a simple population-weighted average 
of the intrinsic rate constants. To extract R^b. one 
plugs in Rif (measured separately) and Pb (e.g., deter¬ 
mined from ligand titration experiments). Extracting 
R 23 is more complex since R 2 ,avg can have an addi¬ 
tional Rgx term. 

With an excess of ligand over protein (Lx > Px), the 
possibility of natural abundance relaxation mea¬ 
surements arises. With higher Bq fields and cryogenic 
probes, quantitative natural abundance relaxation 
data are feasible for ligand concentrations ^ 1 mM. 
Benefits of natural abundance are: (1) The likelihood 
of two adjacent nuclei is quite low (^ 0 . 01 %), thus 
reducing artifacts from homonuclear scalar 

couplings and dipolar cross-relaxation. (2) The mea¬ 
surements remain viable in pharmaceutical research 
settings in which isotope-enriched ligands are often 
scarce. Of course, while natural abundance Is feasible, 
it is not compulsory. If *^C-enriched ligands are forth¬ 
coming and have labeling schemes that mitigate arti¬ 
facts due to dipolar and scalar coupling, they 

are preferable. 

The Lx > Px condition promotes the selective 
observation of ligand signals. But even with this 
excess, protein perdeuteration is desirable for enhanc¬ 
ing the sensitivity and accuracy of the relaxation 
experiments, particularly in the case of natural abun¬ 
dance measurements. Deuteration also mitigates 
proton spin-flips that interchange the doublet 
members - a process deleterious to the accuracy of 
several relaxation pulse-sequences that exploit their 
differential relaxation properties. 

A caveat of working with a ligand excess is that the 
fast-exchange Ri^avg and R 2 ,avg are not equally sen¬ 
sitive to the bound state. Eigure 1 depicts this inequity 
via simulations of aromatic Ri and R 2 versus 
rotational correlation time ic for an isotropically tum¬ 
bling ligand. The free and bound ligand correspond to 
short Tc (150 ps/rad) and long (20 ns/rad), respec¬ 
tively. A ligand excess means Pf > Pb; thus, for the 
bound state to be “heard” in the population-weighted 
averages of Eq. 3, the intrinsic bound-state rate, we 
require Rb > Rp- Eigure 1 shows this may not be the 
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Protein-Ligand Dynamics, 
Fig.1 Simulation of aromatic 
'^C Ri and R 2 versus the 
overall rotational correlation 
time To for a rigid isotropic 
tumbler. The assumed 
relaxation mechanisms are the 
dipole-dipole 

interaction (rcn = 0.109 nm) 
and the anisotropic '^C 
chemical shielding. The red 
dashed-dotted line 
corresponds to the free ligand 
Tc (150 ps/rad) and the blue 
dashed line corresponds to Xc 
of the ligand bound to a large 
protein (20 ns/rad) 


free bound 


free bound 




case for longitudinal rates like Rj, which maximize for 
Tc 1 /cocj which is small compared to the Tc typical of 
a receptor-bound ligand. By contrast, the standard sin¬ 
gle quantum transverse relaxation rates, R 2 , increase 
monotonically with Tc, and thus, molecular mass. This 
means that even if Pb < Pp, R 2 ,b can still contribute 
significantly to the observed R 2 ,avg- 

Figures 2a, b provide experimental evidence of 
these effects for the aromatic CH bonds of a small 
ligand, 2-phenoxy-benzoic acid. The clear bars are 
the isolated ligand, while the filled bars are the ligand 
in the presence of its binding receptor, p38 MAP kinase 
(Mr = 43,000). The R 2 ,avg values clearly change from 
R 2 ,f, whereas the Ri,avg values are nearly indistin¬ 
guishable from Ri b- Thus, R 2 ,avg would seem to be 
the more sensitive probe of the bound state. 

Yet, quantitative extraction of R 23 from R 2 ,avg can 
remain complicated because of the possibility of an 
additional Rex term. What would be convenient is 
a relaxation parameter that is amplified in the bound 
state, like R 2 , but retains the simple averaging depen¬ 
dence as Ri.avg (lacks Rex)- Cross-correlated relaxa¬ 
tion rate constants are a good approximation of what 
we seek: these rate constants are differences in the 
relaxation rates of individual members of a multiple! 
(Goldman 1984). An example is cross-correlation 
between the CSA and heteronuclear DD relax¬ 

ation mechanisms, which causes the Ri and R 2 stimu¬ 
lated by the CSA and DD mechanisms to differ for the 


two doublet members. By contrast, it does not 
affect Rex- Thus, if we measure rj,^, the difference of 
these doublet-specific R 2 rates, the Rgx contribution 
drops out. The result is transverse relaxation rate con¬ 
stant, 77xy, avg, that is amplified in the bound state but 
has same simple population-weighted average 

^.i->>,AVG = + Pb?7a-3>,B- (5) 

The first ligand cross-correlated measurements used 
and *^N-enriched ligands (Blommers et al. 1999; 
Carlomagno et al. 1999). Natural abundance mea¬ 
surements are also possible (Peng 2003). In particular. 
Fig. 2c shows an example of natural abundance 
aromatic DD/^^C CSA cross-correlation 

measurements to compare free versus bound flexibility 
for 2-phenoxy-benzoic acid. The bars plot rj^y for indi¬ 
vidual CH bond vectors of the ligand aromatic rings. 
The clear bars are the free ligand (ryx^ p); the shaded 
bars are PB'ilxy,B for the ligand bound to p38 MAP 
kinase. Critically, taller and shorter bars indicate lesser 
and greater internal mobility. Thus, the bound ligand 
retains some internal flexibility about the intuitive 
torsion axes. 

The above examples demonstrate that *^C NMR 
relaxation can profile differences in site-specific ps-ns 
flexibility of bound ligand. These profiles could be 
used to compare changes in ps-ns flexibility for 
a series of ligands targeting a common receptor. Such 
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Protein-Ligand Dynamics, 
Fig. 2 Natural abundance 
relaxation parameters for the 
aromatic CH bonds of 
2-phenoxy-benzoic acid 
(upper left) in the absence 
(clear bars) and presence 
(filled bars) of its binding 
receptor, p38-MAP kinase. 

(a) Ri; (b) R2; (c) cross- 
correlated rixy using 
continuous-wave spin¬ 
locking. In (c), the solid bars 
are PnCIxy. while the clear bars 
are r|xy,F- The dashed lines 
denote axes of internal 
torsions. Conditions were 
1 mM ligand, 50 pM U-‘^N/ 
70% perdeuterated p38, 

18.8 T, T = 295 K 


4 

un n 
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CH Bond CH Bond 


comparisons would allow correlation of ps-ns flexibil¬ 
ity with binding affinity. 

Ligand Motions on the |xs-ms Timescale 

The interaction of a ligand with its target receptor can 
also entail chemical exchange dynamics on the ps-ms 
timescale. For example, flexible ligands often undergo 
conformational reorganization upon binding. Thus, 
a ligand described by the two-state process of Eq. 1 
might undergo a conformational exchange process in 
which the ligand toggles between free versus bound 
conformational ensembles (Perola and Charifson 
2004). Two techniques particularly useful for investi¬ 
gating exchange dynamics are: (1) exchange spectros¬ 
copy (EXSY) (Jeener et al. 1979) and (2) transverse or 
rotating-frame relaxation dispersion studies (Deverell 
et al. 1970; Palmer and Massi 2006). 

EXSY is effective when the exchange is slow 
(kex < Aoj), and the exchange-coupled states 
give well-resolved peaks. 2-D EXSY produces 
cross-peaks between these “parent” peaks that lay 
bare the exchange dynamics. Flowever, EXSY can 
fail if the fractional populations of the exchanging 
nuclei are strongly skewed. Eor example, in many 
binding studies, Lj > P^; the bound ligand resonances 
may then be too small or broad to detect. 


A powerful alternative is relaxation dispersion. Dis¬ 
persion experiments detect the chemical shift modula¬ 
tion resulting from a nucleus exchanging between states 
with different intrinsic shifts. This chemical shift mod¬ 
ulation adds to the transverse relaxation already caused 
by the aforementioned DD and CSA mechanisms, and it 
is the origin of the Rgx term in R 2 ,avg (Eqs. 3 and 4). 
The appealing strength of the dispersion experiment is 
that they are performed on the visible (major) species 
resonances; yet, they still report on the minor states that 
may escape direct observation. In effect, dispersion 
measurements of the major state interrogate the “invis¬ 
ible” minor state (Korzhnev and Kay 2008). Critical to 
the accuracy of these measurements are the use of com¬ 
pensated Carr-Purcell-Meiboom-Gill (CPMG) or Rip 
schemes (Loria et al. 2008). 

Relaxation dispersion experiments measure effec¬ 
tive transverse relaxation rates R 2 ,eff versus a spin-lock 
field with strength, Vgff (Hz). The exchange contribu¬ 
tion to R 2 ,eff has a dependence on Vgff. For each 
resonance, one measures and fits this frequency depen¬ 
dence (dispersion profile) to model functions derived 
from a kinetic model (e.g., two-state binding). In the 
best cases, the fit yields the exchange rate constants kgx, 
fractional populations, and chemical shift differences 
A® between the major (visible) and minor (invisible) 
states (Palmer and Massi 2006; Korzhnev and Kay 
2008; Loria et al. 2008). This analyses also defines 
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Protein-Ligand Dynamics, Fig. 3 Natural abundance 
relaxation dispersion profiles for three phospho-serine ligands 
of Pin-1: (•)substrate FFpSPR, filled circles and solid curves; 
(O) trans-locked inhibitor, open circles, and dashed lines; (A), 


cis-locked inhibitor, open triangles, and dashed-dotted lines. 
Measurement conditions were 18.8 T, 295 K, 1 mM ligand, 
and 10 pM U-’'’N-70% perdeuterated Pinl 


Rex, which can then be used to extract the bound state 
R 2 ,b in Eq. 3. 

By itself, the dispersion only announces the exis¬ 
tence of an exchange process that modulates the 
nucleus’ chemical shift. For the two-state model, this 
modulation involves a nucleus toggling between two 
chemical shifts separated by Acu. If the only concern is 
determining kgx, then knowing the origins of the Aco is 
not so critical - it suffices that it exists. But if we are 
interested in exchange related to conformational 
dynamics (conformational exchange), then we must 
tie the chemical shift differences, Aco, to conformation. 

In this context, dispersion measurements are 
appealing, since the aliphatic chemical shifts are 
strongly dependent on local torsion angles (Spera and 
Bax 1991). This dependence has fueled our interest in 
relaxation dispersion to study ligand conforma¬ 
tional reorganization caused by protein interaction 
(Zintsmaster et al. 2008; Namanja et al. 2010). Figure 3 


illustrates such measurements for three ligands of 
Pinl, a mitotic regulator (Mr ^ 18,500) that catalyzes 
the cis-trans isomerization of pThr/Ser-Pro (pT/S-P) 
motifs of other cell cycle proteins. The three ligands 
included a 5-residue phospho-serine (pS) substrate 
FFpSPR, and two inhibitor analogs with the central 
pS-P core locked in either the cis or trans configuration 
(Wang and Etzkom 2006). The locking prevents cis- 
trans isomerization. The isolated ligands gave no dis¬ 
persion (flat profiles). Addition of Pinl caused the 
responses of Fig. 3. The substrate (filled circles) gave 
strong dispersion for the targeted pS-P linkage and the 
C-terminal R5-Ca. Fits of those dispersions to the gen¬ 
eral two-state exchange functions gave kex, Pb, and A® 
values consistent with substrate conformational 
dynamics due to protein binding or cis-trans isomeri¬ 
zation. By contrast, the inhibitors gave little or no 
dispersion, suggesting less conformational reorganiza¬ 
tion upon binding for these conformationally 
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constrained inhibitors. Thus, to the extent that local 
conformation dominates the aliphatic shifts, the 
comparative study shows that (1) the dispersion exper¬ 
iments can map ps-ms ligand motions important for 
activity; (2) the experiments are sensitive to ligand 
modihcations that would perturb such motions. 

The above dispersion studies were performed for 
weak-binding interactions >10 pM) with a molar 
excess of ligand. This enabled the dispersion measure¬ 
ments to be carried out at natural abundance *^C. As 
stated, natural abundance avoids complications from 
dipolar and scalar couplings and strengthens 
the method’s appeal to pharmaceutical research set¬ 
tings in which ligand isotope enrichment can be 
impractical. The very same dispersion methods are 
applicable for tight-binding ligands. However, since 
the ligand concentration would be that of the protein 
(typically < 1 mM), enrichment would be needed 
for sensitive detection. 


Summary 

We have described NMR relaxation experiments 
that provide site-specihc descriptions of ligand dynam¬ 
ics relevant to protein interaction. The experiments pro¬ 
vide atomic-resolution data to guide systematic efforts 
aimed at correlating ligand flexibility with activity. 

Ligand flexibility is usually regarded as detrimental 
to complex formation. This is due to the entropic cost 
expected from “freezing out” the ligand motions upon 
binding. Accordingly, one typically tries to remove 
flexibility by cyclization and other methods of rigidi- 
flcation. But knowing how the ligand should be rigid- 
ifled can be challenging if the “offending” motions are 
not mapped. The NMR relaxation experiments can 
guide the rigidiflcation by exposing the motions rele¬ 
vant for complex formation. This information can be 
invaluable when a high-resolution 3-D structure of the 
protein-ligand complex is pending. 

We note that a blanket policy of rigid ligands may 
not always be optimal for several reasons: (1) A totally 
rigid ligand might freeze out protein motions, leading 
to an unfavorable entropic component to the binding 
free energy; thus, some residual ligand flexibility, some¬ 
how commensurate with that of the protein, 
may actually promote tighter binding. (2) Ligands with 
some residual flexibility may better co-adapt to protein 
active sites that change according to drug pressure 


(Freire 2002). (3) Finally, tight binding is just one aspect 
of drug design. A dmg must also be bio-available; often 
this means tinkering with the ligand scaffold in ways 
that may compromise the tightest possible binding 
(Navia and Chaturvedi 1996). In fact, a predictive under¬ 
standing of what physicochemical properties can 
enhance oral bioavailability is still missing. While heu¬ 
ristic rules have provided guidance (Lipinski et al. 1997), 
they often implicitly assume a single ligand structure. It 
is conceivable that some residual flexibility may 
better enable a ligand to meet the often conflicting 
requirements of binding potency and transport properties 
important for oral bioavailability. As stated by 
Hirschmann (Hirschmann 1991): 

“Discovering an active compound is relatively easy, 
discovering an important new drug remains unbeliev¬ 
ably difficult.” 

To help mitigate this difficulty, we should expand 
our abilities to manipulate structure and flexibility, as 
needed, to optimize molecular behavior. The ligand 
dynamics studies described here, together with protein 
dynamics investigations, can provide the experimental 
data to help reach this goal. 

Cross-References 

► NMR in Drug Discovery - Introduction 

► Protein Dynamics and Allostery 

► Protein NMR - Introduction 

► Residual Dipolar Coupling 

► Relaxation Dispersion 

► Statistical Analysis and Prediction of Protein- 
Protein Interactions and Binding Sites 

► Thermodynamics of Biomolecular Interactions 

► ZZ-Exchange 

References 

Blommers MJJ, Stark W, et al. Transferred cross-correlated 
relaxation complements transferred NOE: structure of an 
IL-4R-derived peptide bound to STAT-6. J Am Chem 
Soc. 1999;121:1949-53. 

Carlomagno T, Fell! IC, et al. Transferred cross-correlated relax¬ 
ation: application to the determination of sugar pucker in an 
aminoacylated tRNA-mimetic weakly bound to EF-Tu. J Am 
Chem Soc. 1999;121:1945-8. 

Detlefsen DJ, Hill SE, et al. Molecular flexibility profiling using 
NMR specti'oscopy. Curr Med Chem. 1999;6:353-83. 
Deverell C, Morgan RE, et al. Studies of chemical exchange by 
nuclear magnetic relaxation in the rotating frame. Mol Phys. 
1970;18(4):553-9. 



Protein-Protein interactions - Interpreting Data from Large-Scale Screens 


2083 


P 


Freire E. Designing drugs against heterogenous targets. Nat 
Biotechnol. 2002;20:15-6. 

Goldman M. Interference effects in the relaxation of a pair of 
unlike Spin-1/2 Nuclei. J Magn Reson. 1984;60:437-52. 

Hajduk PJ, Greer J. A decade of fragment-based drug design: 
strategic advances and lessons learned. Nat Rev Drug 
Discov. 2007;6(3):211-9. 

Hirschmann R. Medicinal chemistry in the golden age of biol¬ 
ogy: lessons from steroid and peptide research. Angew Chem 
IntEdEngl. 1991;30:1278-301. 

Jeener J, Meier BH, et al. Investigation of exchange processes by 
two-dimensional NMR spectroscopy. J Chem Phys. 1979;71: 
4546-53. 

Korzhnev DM, Kay LE. Probing invisible, low-populated states 
of protein molecules by relaxation dispersion NMR spectros¬ 
copy: an application to protein folding. Acc Chem Res. 
2008;41(3):442-51. 

LaPlante SR, Aubry N, et al. Transferred '^C T, Relaxation at 
Natural Isotopic Abundance: A Practical Method for Deter¬ 
mining Site-Specific Changes in Ligand Flexibility upon 
Binding to a Macromolecule. 1 Am Chem Soc. 2000; 122: 
12530-5. 

Lipinski CA, Lombardo F, et al. Experimental and computa¬ 
tional approaches to estimate solubility and premeabiity in 
drug discovery and development settings. Adv Drug Deliv. 
1997;23:3-25. 

Loria IP, Berlow RB, et al. Characterization of enzyme motions 
by solution NMR relaxation dispersion. Acc Chem Res. 
2008;41(2):214-2L 

Namanja AT, Wang XJ, et al. Toward flexibility-activity rela¬ 
tionships by NMR spectroscopy: dynamics of Pinl ligands. 
1 Am Chem Soc. 2010;132(16):5607-9. 

Navia MA, Chaturvedi PR. Design principles for orally bioavail- 
able drugs. Drug Discov Today. 1996;1:179-89. 

Palmer 3rd AG, Massi F. Characterization of the dynamics of 
biomacromolecules using rotating-frame spin relaxation 
NMR spectroscopy. Chem Rev. 2006; 106(5): 1700-19. 

Peng JW. New probes of ligand flexibility in drug design: trans¬ 
ferred *^C CSA-dipolar cross-correlated relaxation at natural 
abundance. I Am Chem Soc. 2003; 125(36):! 1116-30. 

Perola E, Charifson PS. Conformational analysis of drug-like 
molecules bound to proteins: an extensive study of ligand 
reorganization upon binding. I Med Chem. 2004;47(10): 
2499-510. 

Spera S, Bax A. Empirical correlation between protein backbone 
conformation and Ca and CP '^C nuclear magnetic reso¬ 
nance chemical shifts. I Am Chem Soc. 1991;113:5491-2. 

Wang XI, Etzkom FA. Peptidyl-prolyl isomerase inhibitors. 
Biopolymers. 2006;84(2): 125^6. 

Zintsmaster IS, Wilson BD, et al. Dynamics of ligand binding 
from *^C NMR relaxation dispersion at natural abundance. 
1 Am Chem Soc. 2008;130:14060-L 


Protein-Lipid Interaction 

► Membrane Protein Structure 

► Transmembrane Helix Orientation and Dynamics 


Protein-Protein Interactions 

► Complement Factor H and Its Ligands: A Multidis¬ 
ciplinary Approach to Interactions and Modeling 

► Heteromeric Versus Homomeric Association of Pro¬ 
tein Complexes 

► Linear Motifs in Protein-Protein Interactions 

► Mass Spectrometry: Mapping Large Stable Protein 
Complexes 

► Oligomeric Proteins 

► Plasticity at Protein-Protein Interaction Interfaces 

► Statistical Analysis and Prediction of Protein- 
Protein Interactions and Binding Sites 

► Thermodynamics of Biomolecular Interactions 


Protein-Protein Interactions - 
Interpreting Data from Large-Scale 
Screens 

Marc R. Wilkins 

Systems Biology Initiative, School of Biotechnology 
and Biomolecular Sciences, University of New South 
Wales, Sydney, NSW, Australia 

Synonyms 

Interactome analysis; Protein fragment complementa¬ 
tion analysis; Two-hybrid screens 

Definition 

Large-scale screens of protein-protein interactions have 
been undertaken with two-hybrid techniques or variants 
thereof. Protein complexes have also been purified on a 
proteome-wide scale and their constituent proteins iden¬ 
tified by mass spectrometry. The analysis of data from 
these screens should be interpreted differently to that 
from focused studies of one interaction or protein com¬ 
plex. False positive interactions may be present and 
technical limitations in methods of measuring interac¬ 
tions may also lead to false negative interactions. The 
combination of data from multiple large-scale screens, 
particularly if orthogonal approaches have been used, is 
useful to produce high-quality datasets and build accu¬ 
rate interaction networks for any cell type. 
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Interpreting Data from Large-Scale Screens 
of Protein-Protein Interactions 

Approaches for Large-Scale Interaction Screens 

Protein-protein interactions can be screened, on 
a proteome-wide scale, using two-hybrid techniques 
(Fields and Song 1989) or related variants. These 
approaches test if any two proteins of interest can 
interact. Variants include the split uhiquitin system 
and protein fragment complementation assays involv¬ 
ing enzymes such as dihydrofolate reductase. In all 
cases, each protein of interest is fused with the first or 
second complementary fragment of a reporter protein. 
If the proteins of interest interact, the fragments of the 
reporter protein are brought into close proximity. This 
leads to the generation of a measurable signal; this may 
involve the expression of a gene, the reconstitution of 
enzymatic activity, or the capacity to fluoresce. In 
large-scale screens, libraries are generated such that 
all mRNAs are fused to a gene sequence coding the 
first fragment of a reporter. A second library is also 
created whereby all mRNAs are fused to the second 
fragment of a reporter. Typically, this is done such that 
each library is a set of plasmids which can then he 
mixed, co-transformed, and co-expressed in a host cell. 
Transformants that show reporter activity can then 
have their plasmids sequenced to identify the two 
interacting proteins. If done in sufficient numbers, 
and with a generally applicable technique, this can 
permit the interaction of all pairs of proteins in 
a proteome to be tested. 

In addition to two-hybrid approaches, protein-pro¬ 
tein interactions can also be screened by the analysis of 
protein complexes. In these approaches, protein com¬ 
plexes are purified from the cell by use of affinity 
techniques or by strategies that can purify many pro¬ 
tein complexes in parallel. The constituent proteins of 
each complex are then identified by digesting to pep¬ 
tides and analysis by tandem mass spectrometry. With 
affinity-based purification, large-scale screens have 
been undertaken by the systematic engineering of one 
type of affinity tag onto each protein in a proteome. 
This has been done by the generation of thousands of 
different strains, each with one protein engineered to 
carry the affinity tag. Use of an identical tag on each 
protein permits a single purification approach to he 
used in all purifications, although a separate purifica¬ 
tion and analysis is required for each strain. 
This approach has been done extensively in yeast 


(e.g., Gavin et al. 2006; Krogan et al. 2006) whereby 
~500 complexes have been defined. 

Large-scale screens have been done for proteins 
from many species, including those from viruses, bac¬ 
teria, yeast, and man. However, the interactions of 
yeast proteins have been most intensively studied. 
This includes five large-scale two-hybrid screens and 
four large-scale screens and analyses of protein com¬ 
plexes (e.g., Berlin et al. 2007; Gavin et al. 2006). 
Results from yeast analyses are therefore most infor¬ 
mative to help understand how data from large-scale 
screens should be interpreted. 

Replicates and Controls in Large-Scale Screens 

The quality of data from large-scale screens needs to be 
evaluated differently to data from studies on one pair of 
proteins. Large-scale screens are designed to 
“oversample” interactions; they are designed to measure 
the interactions of proteins multiple times. This capacity 
to provide replicates within one screen is a notable 
strength. For example, the affinity purification of a com¬ 
plex may be done a number of times in a screen, with 
different proteins in the complex being engineered with 
the fusion tag in each case. Library-based protein frag¬ 
ment complementation analyses, done on a large scale, 
will also have multiple replicates per pair of proteins. 
However, the set of controls that one might specifically 
design and use for a focused study of protein-protein 
interaction will not be available. Equally, the fact that 
any interaction appears multiple times in a screen does 
not mean that the interaction is a true positive, but could 
be a false positive which is seen reproducibly. 

False Positives and False Negatives 

False positives in large-scale screens can arise from 
multiple sources, including the following: (1) Two- 
hybrid assays involving complementation of frag¬ 
ments of a transcription factor can artificially activate 
the expression of a reporter gene, particularly when 
transcription-associated proteins are being tested. 
(2) Two-hybrid or protein fragment complementation 
assays typically involve the overexpression of pro¬ 
teins; the elevated protein concentrations can lead to 
promiscuous interactions. (3) Large-scale screens can 
remove spatial and temporal constraints on protein 
expression; this forces proteins which may not usually 
be present at the same time or place in the cell to be co¬ 
expressed. (4) Proteins that contain structural elements 
common to many proteins, namely, protein interaction 
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domains, may mediate interactions with proteins car¬ 
rying complementary domains even though these are 
not the natural interaction partners of these proteins. 
(5) In assays involving affinity or other purihcation of 
complexes, followed by mass spectrometry, purifica¬ 
tions can be contaminated with other proteins. These 
are often proteins of high abundance, proteins such as 
chaperones whose task it is to interact with many other 
proteins in the cell, or proteins whose physicochemical 
characteristics (e.g., high hydrophobicity) lead to 
widespread interactions upon cell lysis. These com¬ 
mon contaminants, however, are easily recognized by 
tallying the percent of purifications in which they 
appear (e.g., Gavin et al. 2006). 

Large-scale screens, while appearing comprehen¬ 
sive, will also contain false negatives. The precise 
number of interactions missed will be impossible to 
estimate until such time that a consensus set of inter¬ 
actions for any species or cell type, or interactome, is 
defined. However, it appears that most techniques for 
measuring pairwise interactions have a sensitivity of 
20-35% (Braun et al. 2009). Some false negatives will 
occur for technical reasons. For example, the creation of 
some fusion proteins can disrupt their capacity to inter¬ 
act, a library of proteins in a large-scale screen may be 
unrepresentative or incomplete, or the screen may be 
incompatible with certain types of proteins, for exam¬ 
ple, membrane proteins. Yeast two-hybrid techniques 
also require proteins of interest to be readily transported 
into the nucleus and thus trigger the reporter gene; this 
can be difficult for larger or highly hydrophobic pro¬ 
teins. Transient interactions, as opposed to those that are 
stable, may not be easily detected. Other false negatives 
can have a biological basis. For example, certain pro¬ 
teins will not interact in a pairwise fashion without the 
presence of other proteins. This is a limitation when the 
complementation screen being used tests the interac¬ 
tions of protein pairs. In other cases, a protein may 
require processing or posttranslational modification 
before it can interact. In the analysis of protein com¬ 
plexes, there will be many interactions which do not 
occur in the cell except under specific triggers or condi¬ 
tions. As most large-scale screens have been undertaken 
in only one biological condition to date, this will limit 
the total number and type of interactions detected. 

Multiple Lines of Evidence 

It is challenging to accurately assess the quality of large- 
scale screens of interactions and thus fully interpret their 


results. The emerging paradigm to generate high-quality 
data, and thus permit careful evaluation of large-scale 
screens, is to combine multiple lines of evidence for 
every interaction. In its simplest form, interactions 
supported by evidence from multiple sources are of 
high confidence. Those that show only one line 
of evidence are treated as low confidence until other 
evidence becomes available. Evidence that can be used 
to support that from large-scale screens includes results 
from other large-scale screens (with similar or orthogo¬ 
nal techniques, both in vivo and in vitro), biological 
evidence (e.g., gene co-expression, co-localization, 
functional relationships, and evidence of the same inter¬ 
action seen in related species), or interactions from 
small-scale screens in the literature and literature- 
curated interactions in databases (e.g., MIPS, Pagel 
et al. 2005). On this basis, high-quality datasets of 
interactions have been generated for S. cerevisiae pro¬ 
teins (e.g., Yu et al. 2008) and are being generated for 
proteins in human cells (e.g., Braun et al. 2009). When 
visualized, high-quality data generates networks with 
clear features including protein dimers, large protein 
complexes, and hub proteins that participate in binding 
with many other proteins (Fig. 1). 

Interactions Form Networks, But of a Specific Kind 

Finally, it is worth commenting on what protein inter¬ 
action networks, built from large-scale screens or oth¬ 
erwise, actually represent. They represent a composite 
set of interactions of proteins, summarized from hun¬ 
dreds to thousands of experiments. They summarize all 
protein complexes which occur inside a cell, even 
though they may not all happen at once. However, 
these networks do not currently model signal transduc¬ 
tion pathways; this is because transient kinase- 
substrate interactions are not measured effectively by 
fragment complementation assays or in the analysis of 
purified protein complexes. Further, the protein-pro¬ 
tein interaction networks do not provide deep insights 
into gene regulatory networks; this is because they do 
not capture signal transduction events or measure the 
binding of proteins to nucleic acids. Finally, they do 
not provide details of metabolic pathways; these 
involve interactions between enzymes and small mol¬ 
ecules and an understanding of how different enzymes 
act on the products of other enzymes to alter the struc¬ 
tures of such molecules. Nevertheless, protein-protein 
interaction screens and networks do provide 
a fundamental and important view of the protein 
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Protein-Protein Interactions - Interpreting Data from 
Large-Scale Screens, Fig. 1 The yeast interactome visualized 
with GEOMI (Ho et al. 2008), with data from the filtered yeast 
interactome dataset (FYI, Bertin et al. 2007). Proteins are 
represented by spheres and interactions between proteins are 
shown as lines. Stable protein complexes are seen as dense 
areas of the networks with multiple interconnections. Colors 
have been used to show the subcellular localization of the pro¬ 
teins, according to a large-scale study which GFP-tagged the 
majority of proteins in the yeast proteome (Huh et al. 2003). 
Colors: blue = nucleus, green = mitochondrion, 

gold = cytoplasm, and red — unknown localization 

complexes and machines that are the architectural 
underpinnings of the cell. 

Cross-References 

► Mass Spectrometry in Proteomics 

► Post-Translational Modifications 
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Introduction 

Many vital cellular processes such as proteolysis, DNA 
repair, DNA replication, and protein folding are 
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carried out by large protein complexes. These assem¬ 
blies are established by protein-protein interactions, 
which are driven by noncovalent forces such as hydro¬ 
gen bonding, dispersion forces, and salt bridges. The 
precise stoichiometry and topology of protein com¬ 
plexes results in the formation of biologically active 
entities. It is therefore important to determine and 
characterize the structure of these assemblies in order 
to gain a fundamental understanding of their biological 
function and thus their role in cell biology. Numerous 
techniques are available to study protein-protein inter¬ 
actions, including ultracentrifugation, circular dichro- 
ism (CD), surface plasmon resonance (SPR), affinity 
chromatography, gel electrophoresis, nuclear mag¬ 
netic resonance (NMR), and X-ray crystallography. 

Mass spectrometry (MS), a technique that measures 
the mass/charge of gas phase ions, can be used to 
elucidate elements of biomolecular structure and inter¬ 
actions. Molecular mass is a fundamental property of 
a molecule, and it is commonly used to identify pep¬ 
tides and proteins as well as protein complexes. More¬ 
over, dissociation of protein complexes into their 
building blocks can provide the identity of each indi¬ 
vidual subunit and allow the creation of a protein- 
protein interaction map revealing their organization 
within the oligomer. The works of a number of 
researchers have successfully demonstrated the ability 
of MS to measure noncovalently bound protein assem¬ 
blies, like the ribosome, a 2 MDa protein-RNA com¬ 
plex (McKay et al. 2006), tetradecameric GroEL 
(Robinson and Sobott 2006), and the 20S proteasome, 
a 690 kDa 28-mer (Loo et al. 2005). The success of MS 
in probing noncovalent protein associations is 
accredited to electrospray ionization (ESI), a gentle 
ionization technique first applied to biomolecule by 
Fenn (1989), who has awarded the Nobel Prize in 
Chemistry in 2002, for his work. The development of 
ESI in combination with instrumental improvements 
(better sensitivity, resolving power, and mass accuracy) 
has led to a revolution in biological mass spectrometry. 

Electrospray Ionization 

ESI, which allows the aerosolization of a liquid solu¬ 
tion, is capable of ionizing and transferring large, polar 
biomolecules into the vapor phase. The solution phase 
origin of ESI samples allows the study of proteins and 
protein complexes, as well as the investigation of their 


interactions, from a native-like environment. Under 
carefully controlled conditions, many protein com¬ 
plexes survive desolvation and pass into the mass 
spectrometer intact. Mass measurement of the full 
protein assembly provides important information on 
stoichiometry, whilst MS/MS and partial denaturation 
techniques allow interrogation of subunit structure 
(Robinson and Sobott 2006). 

Nano-Electrospray Ionization 

The development of nano-electrospray ionization 
(nano-ESI), which is a miniaturized variant of ESI, has 
improved the sensitivity of the technique, while at the 
same time allowing the use of purely aqueous solutions 
which can be difficult to spray using conventional ESI. 
Sample consumption is dramatically decreased since 
flow rates of a few nL/min are used, while in conven¬ 
tional ESI sample is infused at flow rates between 3 and 
5 pL/min. As an additional benefit, nano-ESI tolerates 
high buffer concentrations, which often suppress signal 
intensity in conventional ESI. Molar concentrations of 
ammonium acetate are compatible with nano-ESI (Rob¬ 
inson and Sobott 2006). 

Noncovalent Interactions Probed by MS 

Protein-protein interactions in solution are driven by 
noncovalent interactions, which include hydrogen 
bonds, dispersion forces, and salt bridges. However, 
during the transition from solution to gas phase, in the 
MS, electrostatic forces are strengthened while hydro- 
phobic effects are weakened in the absence of solvent. 
Hence, the dominant forces maintaining noncovalent 
complexes in the gas phase may differ from those in 
solution. It is believed that hydrophobically driven 
complexes are particularly difficult to observe by MS, 
although recent work has shown that such complexes 
can be detected (Daniel et al. 2002; Liu et al. 2009). 

MS-Based Approaches for the Evaluation of 
Noncovalent Interactions 

Chemical Cross-Linking 

The use of chemical cross-linking as a tool to 
probe noncovalent protein interactions goes back to 



p 


2088 


Protein-Protein Interactions Studied by Mass Spectrometry 


the mid-1950s. Zahn and Meienhofer confirmed the 
solution structure of insulin, which was believed at 
the time to be a dimer based on hydrogen/deuterium 
exchange experiments. The chemical cross-linking 
technique, as applied to biomolecules, is based on the 
addition of a cross-linking reagent to a protein or 
a protein complex in order to form a covalent bond 
between two amino acids that are close in space. Cross- 
linking can occur between amino acids that are either 
presented in an individual biomolecular chain or in 
neighboring chains. This results in the preservation of 
the nascent intramolecular or intermolecular interac¬ 
tions, respectively. Only the side chains of eight amino 
acids (arginine (Arg), lysine (Lys), aspartic acid 
(ASP), glutamic acid (Glu), cysteine (Cys), histidine 
(His), methionine (Met), tryptophan (Trp), and tyro¬ 
sine (Tyr)) can participate in a covalent bond formed 
by a cross-linker. Many cross-linking agents, showing 
different degrees of group selectivity have been 
reported (Kaltashov and Eyles 2005). 

The cross-linked complex can be directly analyzed 
by MS or digested prior analysis by MS. Different 
strategies for the detection of cross-linked proteins, 
or peptides by MS, have been developed. Muller 
et al. (2001) was the first to describe an isotope labeling 
strategy. In this method, the addition of stable isotope 
labeled cross-linked agents in proteins can discrimi¬ 
nate cross-linked form non-cross-linked peptides. 
Putatively, cross-linked peptides can be confirmed by 
reductive cleavage of the linker, and detection of the 
associated mass shift. Finally, Back et al. (2002) 
described the use of MS-labile cross-linkers, based on 
the production of a diagnostic marker ion formed dur¬ 
ing a dissociation process. 

Covalent Chemical Labeling Approaches 

Covalent chemical labeling approaches are based on 
the formation of a covalent bond between solvent 
exposed amino acid side chains and a label reagent. 
These chemical modifications alter the mass of the 
protein and specifically shift the mass of the solvent 
accessible amino acid side chains, allowing protein- 
protein binding surfaces to be identified. In a typical 
MS experiment, the modified protein assemblies are 
digested and the resulting proteolytic peptides are typ¬ 
ically separated by liquid chromatography (LC) and 
then are analyzed by MS. Using a bottom-up method, 
the position of the resulted peptide fragments can be 
determined and therefore the solvent exposed regions 


as well as the nonexposed regions, where protein- 
protein interactions are taking place, can be mapped. 

The label reagent can either be nonspecific or amino 
acid-specific. In the latter case, the label forms cova¬ 
lent bond only with the side chain of a specific amino 
acid. For example, organic acid anhydrides and vicinal 
dicarbonyl compounds particularly modify lysine and 
arginine residues, respectively. The interaction 
between a full-length glycosylated HIV gbl20 and 
a soluble segment of human CD4 has been investigated 
by covalent labeling approach and the use of amino 
acid-specific labels in combination with LC and tan¬ 
dem MS (Kaltashov and Eyles 2005). 

Hydroxyl Radical Footprinting 

Hydroxyl radicals, which represent an irreversible, 
nonspecific label, react with solvent exposed amino 
acid side chains resulting in oxidative side chain mod¬ 
ifications. The rate of amino acid side chains modifi¬ 
cations depends on their solvent accessibility and 
their relative reactivity to hydroxyl radical attack, 
which conforms to the following reactivity order: 
Cys > Met > Trp > Tyr > Phe > Cys > His > 
Leu ^ lie > Arg ^ Lys ^ Val > Ser ^ Thr ^ Pro > 
Gin ^ Glu > Asp ^ Asn > Ala > Gly. Hydroxyl 
radicals can be generated either: (1) chemically, by 
the reaction of hydrogen peroxide (H 2 O 2 ) with 
a transition metal ion complex such as iron-EDTA, 

(2) electrochemically, within an ESI source with 
the use of oxygen as the nebulizer gas, and 

(3) radiolytically, by radiolysis of water using radio¬ 
active elements such as *^^Cs, or by a high-intensity 
radiation source such as synchrotron X-rays (Kiselar 
and Chance 2010). 

Hydrogen/Deuterium Exchange 
Hydrogen/deuterium exchange (HDX) entails the 
exchange of protein’s backbone amide protons with 
deuterium atoms. The rate of hydrogen exchange 
depends on solvent accessibility as well as the presence 
of hydrogen bonding. Hence, amide hydrogens that 
are either involved in secondary structures (helices 
or P-sheets) or are buried or are hydrogen bonded 
have much lower exchange rates compared to back¬ 
bone amide hydrogens exposed to solvent-accessible 
environment. Following the introduction of ESI, HDX 
MS has become an ideal experimental tool for studying 
protein and protein complexes structure. In a typical 
HDX ESI-MS experiment, the proteolytic fragments of 
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a biomolecular entity that are usually chromatograph- 
ically separated are examined before and after hydro¬ 
gen exchange. An interesting example presenting the 
potentials of the use of HDX MS in probing protein- 
protein interactions is the revelation of the supramo- 
lecular structure of the HIV-1 capsid protein assembly. 
This methodology, whilst relatively straightforward 
experimentally, results in reversible labeling and so 
deuterium back-exchange must be closely monitored 
and corrected (Kaltashov and Eyles 2005). 

Noncovalent Protein-Protein Interactions 
Studied by Mass Spectrometry 

Composition, Stoichiometry, and Topology 

The transfer of large intact protein complexes into the 
gas phase allows the accurate determination of their 
mass but often provides incomplete information about 
their composition, stoichiometry, and topology. The 
dissociation of these complexes into their building 
blocks can provide the identity of each individual 
subunit and allow the creation of a protein-protein 
interaction map revealing their organization within 
the oligomer. Dissociation into subcomplexes can be 
achieved in solution, prior to ionization, by increasing 
the temperature (Benesch et al. 2003) or through 
a change in ionic strength with increasing concentra¬ 
tions of ammonium acetate or the stepwise addition of 
organic solvents such as dimethylsulfoxide, methanol 
or acetonitrile (Levy et al. 2008). Although these 
methods are effective for obtaining partial or complete 
disassembly of protein complexes, complementary gas 
phase dissociation through tandem MS approaches is 
also commonly used. Several activation techniques 
have been applied for this purpose with the most 
widely used being collision-induced dissociation 
(CID). In this approach, the protein complex ion of 
interest is accelerated against neutral gas atoms or 
molecules and its kinetic energy is partially converted 
into internal energy by the multiple collisions experi¬ 
enced. If sufficient internal energy is accumulated, 
interactions between subunits will be disrupted and 
dissociation occurs (Benesch et al. 2007). Several stud¬ 
ies on homo-oligomers have revealed a common dis¬ 
sociation pathway by CID, consisting of ejection of 
unfolded monomers carrying away a high number of 
charges. To illustrate this, the dissociation process of 
raHSP16.9, a small heat shock protein from wheat 


composed of 12 subunits, is depicted in Fig. 1 . A single 
charge state (32 h-) of the raHSP16.9 12-mer was iso¬ 
lated by the quadrupole of a Q-TOF MS, and the 
accelerating voltage gradually increased. A monomer 
with an average charge of 14.6-1- is ejected from the 
assembly leaving a stripped 11-mer with a low average 
charge of 17.4-I-. Further increase in voltage results in 
the removal of a second monomer from the 11-mer 
with a similar asymmetry in charge partitioning rela¬ 
tive to their mass, resulting in a monomer and a 10-mer 
with average charge states of 7.0-1- and 10.4-I-, respec¬ 
tively. Although this distribution of charges is asym¬ 
metric relative to the mass of the dissociation products, 
it is proportional to their surface areas, which is con¬ 
sistent with the unfolding of the leaving monomer in an 
intermediate state during the process, uncovering 
a higher surface area that will accept more charges 
prior to its separation (Benesch et al. 2006). Lower 
charge states of a protein complex are usually more 
stable and less easily dissociated than higher charge 
states. Alternative dissociation pathways for lower 
charge states have been observed with subunits being 
disrupted without unfolding or with the loss of peptide 
fragments without intact subunit dissociation (Pagel 
et al. 2010). 

As an alternative to CID, protein complex ions can 
be accelerated against a surface, in an activation tech¬ 
nique known as surface-induced dissociation (SID). 
This collision provides a more efficient kinetic to 
internal energy conversion in a single step, resulting 
in a more effective dissociation with lower energy 
provided to the system. Contrary to CID, charge 
partitioning in SID is usually symmetric, with the 
number of charges in a precursor homo-oligomer ion 
being equally distributed by the dissociation products. 
For these reasons, this technique has the potential to be 
an alternative or complement to CID in structural stud¬ 
ies of noncovalent protein complexes (Wysocki et al. 
2008). 

Hemoglobin 

Hemoglobins from higher organisms have been exten¬ 
sively studied by many biophysical techniques. These 
complexes are formed by a pair of a and a pair of () 
subunits, each subunit containing a heme group. 
Important insights into the folding and assembly of 
these complexes have been achieved by an online 
dialysis system connected to an ESI mass spectrome¬ 
ter. The hemoglobin tetramer was initially denatured in 
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Protein-Protein Interactions Studied by Mass Spectrome¬ 
try, Fig. 1 Dissociation of TaHSP16.9 (a) CID of the quadru- 
pole isolated TaHSPlO.O 12-mer leads to the formation of highly 
charged monomers and lowly charged 11-mers. Higher collision 
energies result in the removal of a second monomer from the 


11-mer and the formation of 10-mer. (b) Relative abundance of 
the 12-mers, 11-mers and 10-mers at increasing accelerating 
voltages, (c) Dissociation scheme showing the average charge 
state of each species. Reproduced from Benesh et al. 2006 by 
permission. © American Chemical Society, 2006 


solution by an increase in the pH to 10.0 and addition 
of acetonitrile. The dissociated highly charged a and (3 
monomers, present in both holo (with heme) and apo 
(without heme) forms were detected. A beta dimer, 
formed by a holo and an apo subunit, was also 
detected, appearing to be in equilibrium with their 
separated components. The solution was gradually 
dialyzed into a purely aqueous environment of pH 8.0 
and changes in the oligomeric species present were 


monitored with time. The unfolded apo a subunit was 
hrstly converted into the holo form by heme binding 
and then folded to its native conformation before 
interacting with holo or apo (3, forming two 
heterodimers that differed in the presence or absence 
of heme in the P subunit. The aP dimers with a heme 
group in both subunits interacted later, forming the 
hnal tetrameric assembly (Boys and Konermann 
2007). As illustrated by this study, all the interactions 
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between the hemoglobin components can be 
maintained in the gas phase by ESI-MS, making it 
a useful tool to study association and dissociation 
processes in solution. 

Proteasome 20S 

The proteasome is a protein assembly whose main 
function is the degradation of damaged, misfolded, or 
short-lived proteins. The 20S core particle is composed 
of 28 subunits arranged into four stacked rings, each 
containing seven subunits. The outer rings are com¬ 
posed of noncatalytic a subunits while the inner rings 
are composed of P subunits which are responsible for 
the catalytic activity. This ocvPyPva? stoichiometry was 
confirmed by ESTMS with the analysis of the full 20S 
proteasome from Methanosarcina thermophila (mea¬ 
sured mass 690.5 kDa, theoretical mass 689.3 kDa). 
CID applied either to a single ay ring or to the full 20S 
proteasome resulted in the dissociation of a highly 
charged a subunit leaving a ae or a ayPyPyae stripped 
complex, carrying much less charges than its precursor 
(Loo et al. 2005). This study on the 20S proteasome is 
a good example of how ESTMS can be applied in the 
study of large protein machineries with important roles 
inside the cell. 

Eukaryotic Translation Factor 3 

The eukaryotic initiation factor 3 (eIE3) is 
a multimeric protein complex with an approximate 
mass of 800 kDa. It is the largest and most complex 
of the eukaryotic initiation factors and is involved in 
ribosome biogenesis and protein synthesis. The mass 
spectrum of the intact complex together with comple¬ 
mentary proteomics analysis has revealed that it is 
composed of 13 subunits present at unit stoichiometry. 
Gradual increase of the ionic strength of the solution 
dissociated the complex into distinct modules reveal¬ 
ing specific interactions between the different subunits. 
Additional information from CID, immunoprecipita- 
tion, and comparison with the yeast eIE3 allowed the 
construction of a full interaction map and a possible 
tridimensional model. The interaction of the eIE3 com¬ 
plex with the hepatitis C virus (HCV) internal ribo¬ 
some entry site (IRES), a 332-nucleotide RNA 
molecule, was also analyzed. A full eIF3:IRES com¬ 
plex was detected and CID was applied, revealing 
a different dissociation pattern of the eIF3 complex 
upon IRES binding. The dissociation of some 
subcomplexes was not affected by IRES binding 


while in some others it was clearly diminished 
suggesting a direct interaction with IRES. A structural 
model of the elF3:IRES interaction was then 
constructed based on this different dissociation pattern 
(Zhou et al. 2008). The potential of ESTMS to obtain 
structural information from high complexity 
multiprotein systems is well demonstrated by this 
study, establishing this technique as an important 
structural biology tool. 

Viral Capsid Assemblies 

In an innovative study published in 1996, viral ions 
were generated in the gas phase by ESI and recovered 
by a collector within the vacuum. Analysis of the 
collected particles by transmission electron micros¬ 
copy revealed the retention of the spherical and rod¬ 
like structures of rice yellow mottle virus (RYMV) and 
tobacco mosaic virus (TMV), respectively. Also sur¬ 
prising was the confirmation of the TMV viability by 
inoculation and infection of tobacco plants after recov¬ 
ery (Siuzdak et al. 1996). This study suggested has that 
ESTMS can be used for the study of very large supra- 
molecular complexes in the Mega-Dalton scale, 
including viral capsids. The hepatitis B virus (HBV) 
capsid is an example of such type of complex that has 
been studied by this technique. The HBV capsid pro¬ 
tein assembles to form icosahedral capsids with trian¬ 
gulation numbers of T = 3 or T = 4, composed of 180 
or 240 subunits, respectively. Although HBV capsids 
have been extensively studied by other biophysical 
techniques, valuable information about their physical 
properties and assembly has been obtained by ESTMS. 
Mass spectra of in vitro assembled HBV capsids 
showed two well-resolved sets of peaks with m/z 
values around 22,000 and 25,000, allowing the deter¬ 
mination of each capsid mass with very high accuracy. 
Upon application of CID, a dimer of the capsid protein 
is ejected, which is consistent with the fact that these 
proteins are dimers in solution and within the particles. 
The same experiment made with a cysteine-free 
mutant of the capsid protein showed the ejection of 
monomer instead of dimer, with a lower energy 
applied, revealing the role of a covalent disulfide 
bond in the stability of both the dimer and full assem¬ 
bly. For capsids expressed and assembled in 
Escherichia coli that unspecifically package small 
RNAs from the host, the stability was even higher 
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with no dissociation detected under CID (Uetrecht 
et al. 2008). 

Other Macromolecular Assemblies 

Several other noncovalent protein assemblies have 
been studied by ESI-MS. One of the most extensively 
studied complexes and frequently used model is 
GroEL, a chaperonin assembly with a mass of 
800 kDa, composed of 14 subunits organized in two 
heptameric rings (Robinson and Sobott 2006). ESI-MS 
have also been successfully applied to the study of 
membrane complexes by applying nanoESI to micellar 
solutions containing the complex and stripping it from 
the micelle lipid molecules by an increase in collisional 
energies, retaining the noncovalent interactions within 
the protein assembly (Barrera et al. 2009). Other mac¬ 
romolecular machineries such as the ribosome, com¬ 
posed of RNA and protein subunits, have also been 
successfully studied in the gas phase by ESI-MS, show¬ 
ing the potential of this technique to characterize 
noncovalent complexes not composed exclusively of 
proteins (Robinson and Sobott 2006). 

Summary 

Mass spectrometry (MS) has emerged as a powerful 
analytical method for the study of noncovalent protein- 
protein interactions. This entry highlights MS-based 
approaches for the analysis of these interactions, 
giving a number of examples where protein 
noncovalent complexes have been successfully studied 
by MS. 
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Introduction 

Protein-protein interactions (PPI) are central to the 
signal transduction events that drive cellular prolifera¬ 
tion, intercellular communication, cell death, and 
many other cellular processes (Rual et al. 2005; 
Stumpf et al. 2008). Not surprisingly, modulating 
these protein interaction networks can frequently 
cause or mitigate disease states; developing agonists 
or inhibitors for protein-protein interactions is there¬ 
fore a critical focus for drug discovery. This encyclo¬ 
pedia entry will outline some of the challenges with 
discovering drug leads for PPI targets, highlight 


important biophysical methods used to evaluate pro¬ 
tein/small molecule complexes, and summarize two 
examples of PPI-inhibitor discovery. 

Historically, protein-protein interactions have been 
considered “undruggable” targets for several reasons, 
including: (1) there are few naturally occurring small 
molecule protein-protein interaction inhibitors; 
(2) protein interaction surfaces are frequently large 
and flat; (3) proteins interact with many partners, and 
classes of proteins are similar, making selectivity and 
specificity challenging; (4) compounds that bind to 
these large interfaces tend to be large themselves, 
making drug-like properties such as permeability and 
bioavailability more difficult to obtain. 

To emphasize the first two issues, we contrast PPI 
with G-coupled protein receptors (GPCR) and 
enzymes, which form the largest classes of drug tar¬ 
gets. GPCRs and enzymes generally have natural small 
molecule ligands or substrates, which can provide 
a starting point to develop small molecule inhibitors. 
In contrast, protein-protein interactions involve large 
binding surfaces (typically 700-1500 A^ on each side 
(Lo Conte et al. 1999)), often comprised of multiple, 
discontiguous regions of protein structure. It is there¬ 
fore no small matter to use the natural partner as 
a starting point for design. Second, unlike GPCRs, 
the potential small molecule interaction sites on pro¬ 
tein-protein interactions are flat. One way to compare 
binding surfaces is by pocket volume; a recent study by 
Fuller et al. shows that PPI tend to have two to eight 
pockets with an average volume of 55 A^, while the 
substrate pockets in enzymes average 260 A^, with 
many drugs occupying a pocket volume > 600 A^ 
(Fuller et al. 2009). This contrast suggests that small 
molecules could have trouble finding a suitable bind¬ 
ing pocket on the PPI surface. 

Although these challenges might appear insur¬ 
mountable, careful inspection of the structures of PPI 
can help us select the most tractable targets. First, in 
spite of large structural protein interaction surfaces, 
there are usually smallerinterfaces, termed 
“hotspots,” that are responsible for most of the binding 
affinity (Wells and McClendon 2007). Some of these 
protein hotspots are found on a single region of protein 
structure, such that a small peptide can substitute for 
the whole protein; to date, these types of interfaces 
appear to be more tractable targets for small molecules 
(Table 1). Second, crystal structures of small molecule 
inhibitors bound to PPI targets show that protein 
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Protein-Protein Interactions - Inhibition, Table 1 Examples of PPI inhibitors 

Protein-protein interaction 

ASA of 
PPI (A^)^ 

Small molecule 

Molecular 
weight (Da) 

Kd (nM) 

Ligand 

efficiency 

(AC/HAC)'’ 

Reference 

Small, contiguous epitopes 

ZipA/FtsZ 

541 

Compound-1 

424 

12,000 

0.22 

Rual et al. (2005) J. Med. 
Chem. 48, 1498 

XIAP (BIR3)/caspase-9 

472 

Compound-2 

442 

5 

0.35 

Cost et al. (2004) J. Med. 
Chem. 47,4417 

XIAP (BIR3)/caspase-9 

472 

Compound-20 

486 

26 

0.29 

Sun et al. (2008) J. Med. 
Chem. 51, 7169 

Bcl-XL/Bak 

825 

ABT737 

813 

1 

0.22 

Bruncko et al. (2007) 

J. Med. Chem. 50, 641 

MDM2/p53 

686 

Null in-2 

686 

140 

0.23 

Vassilev et al. 

(2004) Science 303, 844 

MDM2/p53 

686 

TDP222669 

582 

80 

0.31 

Crasberger et al. (2005) 

J. Med. Chem. 48, 909 

Large, discontiguous epitopes 

FKBP12/TGFR 

869 

9o 

625 

19 

0.23 

Dubowchik et al. 

(2001) Org. Lett. 3, 3987 

E2/E1 

946 

Compound-23 

608 

6 

0.25 

Wang et al. (2004) J. Biol. 
Chem. 279, 6976 

IL-2/IL-2R 

898 

SP4206 

662 

60 

0.22 

Raimundo et al. (2004) 

J. Med. Chem. 47, 3111 


solvent accessible surface area buried at the protein interface 
^LE free energy of binding divided by heavy atom count 


interaction surfaces are frequently able to adapt to the 
small molecule. Low energy conformational 
rearrangements in loops and side chains allow the hot 
spot to bury the small molecule deeper into the protein 
surface than the structure of the PPI would suggest. 
Thus, not only are the sizes of hotspots closer to the 
size of a low-molecular weight drug, but hotspots are 
also able to adapt to bind compounds with structures 
very divergent from the native protein partners. 

Advances in two technical areas have also facili¬ 
tated the hunt for PPI inhibitors. First, the availability 
of biophysical methods has allowed new screening 
approaches based on binding (rather than PPI inhibi¬ 
tion) and better characterization of the mechanism of 
binding for candidate inhibitors. Approaches such as 
x-ray crystallography, ITC, SPR, or NMR allow us to 
rapidly determine the binding dynamics, stoichiome¬ 
try, and binding site of small molecule-protein inter¬ 
actions. Second, new chemical approaches, including 
fragment-based lead discovery (FBLD) and 
peptidomimetics, seek to increase the diversity of com¬ 
pounds in small molecule libraries, thereby increasing 
the chances of finding the rare PPI inhibitor by com¬ 
pound screening or design. 


Biophysical Methods to Characterize 
Protein/Molecule Interactions 

The goal of a PPI-inhibitor screen is to find compounds 
that bind to a discrete site on one of the protein part¬ 
ners, leading to disruption of the PPI. Screens to dis¬ 
cover PPI inhibitors therefore often monitor the loss of 
the PPI using proximity-based assays such as fluores¬ 
cence polarization (FP), time resolved fluorescence 
resonance energy transfer (TR-FRET), or Alphascreen 
(Perkin Elmer). While these assays are suitable to 
high-throughput screening, they are also subject to 
artifacts due either to promiscuous inhibition by aggre¬ 
gation or to inhibition of the signal rather than the 
interaction (Arkin and Wells 2004; Shoichet 2006; 
Baell and Holloway 2010). It is difficult to distinguish 
these artifacts using the standard tests for reversibility 
and mechanism of inhibition that are used for enzyme 
inhibitors (Copeland 2003). Researchers have there¬ 
fore turned to biophysical methods to confirm binding 
of the small molecule to the target, and to determine 
stoichiometry, the dissociation constant (Kx^), and site 
of binding. (Mayr and Bojanic 2009). Eour informa¬ 
tive, commonly used methods are described here. 
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Isothermal titration calorimetry (ITC) is a method 
in which ligand is injected into a protein sample and 
the heat of binding is measured. ITC provides infor¬ 
mation about the stoichiometry of binding, A'd, and the 
enthalpic and entropic contributions to binding energy. 
No binding site information is provided by ITC. The 
low throughput and large sample requirement for ITC 
limits its use for primary screening. However, with the 
recent availability of automated sampling, throughput 
is now 20-30 samples per day, allowing ITC to be used 
for validating HTS hits and for optimizing compounds 
based on thermodynamic data (Freire 2009). 

Surface plasmon resonance (SPR) detects the bind¬ 
ing of ligands to immobilized protein targets. Although 
SPR has traditionally been used to characterize pro¬ 
tein-protein interactions, recent advances in sensitiv¬ 
ity allow small molecule binding to be detected 
directly. Increases in throughput also allow SPR to be 
used to screen small libraries and to confirm binding of 
HTS hits. SPR monitors binding stoichiometry, 
and binding kinetics (Myszka and Rich 2000), all crit¬ 
ical parameters in validating and optimizing com¬ 
pounds. For example, binding kinetics elucidate 
whether the compounds bind reversibly and with 
rapid kinetics. Slow off rates might be advantageous 
for tight-binding drug leads; however, slow kinetics or 
irreversibility is an indication of an undesirable mode 
of interaction for low affinity binders. SPR consumes 
small amounts of protein, but does not provide infor¬ 
mation about binding site. 

Nuclear Magnetic Resonance (NMR) can be used 
for both screening and hit validation (Pellecchia et al. 
2008). Ligand-detected methods can confirm binding 
of small molecules; however, they do not provide 
stoichiometry and they can be subject to false positives 
from aggregating compounds. Protein-detected 
methods are more informative, but require soluble, 
small, and well-behaved proteins that can be expressed 
at high levels when isotopically labeled. After 
assigning the resonances in the multidimensional 
NMR spectrum to specihc amino acid residues, the 
binding site and A'd can be determined. However, 
chemical shift changes arising due to conformational 
changes instead of direct binding can be misleading. 

Definitive information about binding site frequently 
comes from a crystal structure of the ligand in complex 
with the protein target. Because high concentrations 
are typically used in crystallography, a soluble small 
molecule with low affinity can be often co-crystallized 


or soaked into a protein crystal. A crystal structure can 
also reveal subtle artifacts. For example, the crystal 
structure of Pinl with a small molecule inhibitor 
revealed that two molecules were stacked on top of 
each other at the ligand-binding site through 
a 7t-stacking interaction (Potter et al. 2010). Crystal¬ 
lography can be used for screening, but it is a low-to- 
moderate throughput method with a requirement for 
large amounts of material. An orthogonal method is 
required to measure and optimize potency. 

Examples of PPI Inhibitors 

Despite the difficult nature of inhibiting PPI, there are 
some examples of well-validated small molecule 
inhibitors that could lead to drugs (for a recent review 
see Berg 2008). The PPI in these examples include 
several interaction surfaces that are small, continuous 
peptide segments, as well as some with large, 
discontiguous surfaces (Table 1). Due to the adaptive 
nature of protein interaction surfaces, small molecules 
do not usually replicate the protein binding partner 
atom-for-atom (Fry 2008). Rather, the small molecule 
inhibitors tend to adopt a distinct, rigid conformation 
that may help in orienting atoms to interact with the 
protein target (Wells and McClendon 2007; Fry 2008). 
The examples shown in Table 1 reveal two additional 
features of PPI inhibitors. First, these chemicals tend to 
have molecular weights on the high end for drug-like 
molecules. Second, inhibitors tend to have similar 
ligand efficiencies (defined as binding energy divided 
by heavy atom count); these efficiencies are low com¬ 
pared to drugs (GPCR antagonists have efficiencies 
^0.6), reflecting the properties of PPI described 
above. Herein, we summarize the properties of inhib¬ 
itors of XIAP/Caspase-9 and IL-2/IL2-R; these two 
systems represent ends of the spectrum in terms of 
the complexity of the interface and the ligand effi¬ 
ciency of the best inhibitors. 

XIAP: Small, Contiguous Epitope 

In Table 1, Mdm2/p53, Bcl-2/Bcl-xl, ZipA/FtsZ, 
and XIAP/Caspase-9 are PPI comprised of a small, 
contiguous portion of protein sequence from one 
protein that binds into a concave groove on its 
partner. The X-linked inhibitor of apoptosis/second 
mitochondria-derived activator of caspase (XIAP/ 
Smac) represents a particularly small peptide epitope 
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Protein-Protein Interactions - Inhibition, Fig. 1 Crystal 
structures of XIAP with SMAC peptide and with small molecule 
inhibitor. Left: Crystal structure of the BIR3 domain of XIAP 
with the tetrapeptide binding region from SMAC. Right. Struc¬ 
ture of the domain with a small molecule SMAC mimetic. The 
BIR domain is shown as a surface in gray, and the ligand 
(peptide or small molecule) is shown as sticks in yellow. The 
surface residues that interact with the peptide are colored green, 
while the surface residues that interact with the small molecule 
are colored orange 

and yielded a peptidomimetic inhibitor with 
a particularly high ligand efficiency for binding to 
XIAP (Oost et al. 2004; Park et al. 2005; LaCasse 
et al. 2008; Cohen et al. 2009; Wang 201 1). 

XIAP is a member of a family of proteins that bind 
to caspases and inhibit the initiation of apoptosis, or 
programmed cell death. It is overexpressed in many 
human cancers and may contribute to resistance to 
chemotherapy. XIAP binds through its BIR3 domain 
to caspase-9, sequestering the pro-apoptotic caspase-9 
in a catalytically inactive conformation. Smac binds to 
XIAP, inhibiting the interaction between XIAP and 
caspase-9 and freeing caspase-9 to initiate apoptosis. 
In principle, compounds that mimic Smac would sim¬ 
ilarly act to inhibit the XIAP/caspase-9 interaction, 
thereby supporting apoptotic cell death. Smac’s bind¬ 
ing epitope is a continuous extended region of the 
N-terminus (Fig. 1), and the first four residues 
(AVPI) confer most of the binding energy. The AVPI 
peptide served as the starting point for a structure- 
based drug design program at Abbott Laboratories 
(Oost et al. 2004; Park et al. 2005). The peptide inter¬ 
actions were recapitulated with capped tripeptides that 
contained unnatural amino acids and bound to the 
BIR3 domain of XIAP with low nanomolar affinity 
(Fig. 1) and a high ligand efficiency of 0.35 
(Table 1). These molecules induced apoptosis in can¬ 
cer cell lines, and small molecules with this mecha¬ 
nism of action are currently in clinical trials for cancer. 
Peptidomimetic design of inhibitors was successful for 
XIAP/Smac, and appears to be especially well suited to 
small, concentrated epitopes. 



Protein-Protein Interactions - Inhibition, Fig. 2 Crystal 
Structure of IL-2 with IL-2R and with small molecule inhibitor. 
Left. Crystal structure of IL-2 with its receptor IL-2Ra. Right: 
Structure of the domain with a small molecule inhibitor, SP4206. 
IL-2 is shown as a surface in gray, and the ligand (receptor or 
small molecule) is shown in yellow. The surface residues that 
interact with the reeeptor are colored green, while the surface 
residues that interact with the small moleeule are colored 
orange. 


IL-2: Large, Discontiguous Epitope 

Shown in Table 1, HPV-18 E1/E2, FKBP12/TGFR, 
and IL-2/lL-2Ra are examples of PPI with interaction 
surfaces involving multiple, discontiguous regions of 
protein sequence. For the IL-2/IL-2Ra interaction, 
a series of eight compounds with nanomolar affinity 
were derived from fragment-based lead discovery 
methods (Braisted et al. 2003; Raimundo et al. 2004) 
and structure-based design. 

Interleukin-2 (IL-2) is a cytokine that plays a key 
role in the immune response by activating T-cells 
through binding to its 3-chain receptor, IL2-Rtxpy 
(see Fig. 2). Scientists at Roche and Sunesis Pharma¬ 
ceuticals used a combination of peptidomimetic 
design, screening, and structure-based optimization 
to develop compounds that bound to IL-2 at the 
IL-2Ra-binding site with ^5 pM. The Sunesis 
group then used a fragment-based technique, called 
Tethering, to identify a new small molecule binding 
site in a flexible region of IL-2 adjacent to the PPI 
hotspot. Linking the fragments that bound in this site 
to the hotspot-binders led to a small molecule with 
Xj = 60 nM. Although the small molecule occupies 
about half of the surface area buried by IL-2Ra, it 
binds with similar affinity to lL-2Ra (10 nM versus 
60 nM; Fig. 2) and binds to the receptor hotspot 
(Rickert et al. 2005; Thanos et al. 2006). The differ¬ 
ences in the structure of IL-2 when bound to IL-2Ra 
and the small molecule inhibitor indicate that the IL-2 
binds the small molecule more deeply than the receptor 
surface, and that the compound mimics discontiguous 
segments of IL2-Ra. 
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Summary 

Protein-protein interactions have historically been 
considered difficult drug discovery targets due to the 
predominantly large, flat interfaces between mole¬ 
cules. Recently, significant progress has been made 
toward identifying therapeutically relevant inhibitors 
of protein-protein interactions. These advances have 
been realized mainly due to the structural and biophys¬ 
ical characterization that has enabled the design of 
specific, high-affinity, and well-behaved small mole¬ 
cule ligands. Analysis of PPI and their small molecule 
inhibitors demonstrates the importance of “hotspots” 
of protein binding and conformational adaptivity for 
molecular recognition. 
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► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 
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► NMR Studies of Macromolecular Interactions - 
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► Statistical Analysis and Prediction of Protein- 
Protein Interactions and Binding Sites 
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Mechanism of Association 

Structures of protein complexes and information 
about their thermodynamic characterization describe 
a “frozen” view of the complex, which ignores the 
kinetic nature of protein-protein association and dis¬ 
sociation. For a two state binding reaction = kjk^, 
were A'd is the affinity in units of Mol, k^ is the 
dissociation rate constant in units of 1/s, and k.^ is the 
association rate constant in units of l/(Mol x s). 
The affinity describes the concentration where 50% 
of the protein is bound and is independent on the 
reaction path. Conversely, the rate constants that relate 
to the reaction path may include intermediates and 
transition states (Fig. 1). 

The observed rate constants of protein association 
span six-orders of magnitude, from 10^ to 10^ M“*s“\ 
The diffusion limited collision rate between two 
globular proteins is estimated to be ^10^° M“*s“\ 
However, for two proteins to bind, their interfaces 
have to be oriented with high precision. A relative 
rotation of as little as a few degrees or a relative trans¬ 
lation by a few Angstroms is sufficient to break all 
specific interactions between the two proteins. The rate 
of association of a protein complex is limited by 
diffusion and geometric constraints of the binding 
sites, and may be further reduced by subsequent 
chemical processes. The random chance for two 
patches of 1 A^ to collide; assuming a total protein 
surface area of 2000 A^ is 1 in 4 x 10®. However, 
complex formation requires many residues to interact 


Energy 



A + B , A:B , AB 


k_.^ k_2 




Protein-Protein Interactions: Kinetics, Fig. 1 Free energy 
diagram describing the pathway of protein-protein binding. 
Two proteins (A and B) in solution form an encounter complex, 
[A:B], which following structural rearrangement and 
desolvation develops into the final complex, [AB] 


simultaneously, reducing significantly the change of 
this to occur. Thus, protein-protein association rates 
should be enhanced by having a capture zone, leading 
to the formation of the specific complex. An early 
attempt to satisfy the observed rate of association 
using computer simulations was done by Northrup 
and Erickson (1992), who tried to explain the relatively 
faster rates of association by assuming that binding is 
speeded up by multiple collisions of large bodies that 
are proximate to each other (in the order of 10) and that 
association can be assumed to occur once two to three 
correct inter-protein interactions are formed. These 
assumptions were used to simulate binding using 
Brownian Dynamic simulations (Gabdoulline and 
Wade 1997). The obtained rates were in reasonable 
agreement with the experimental rates. A somewhat 
broader view was taken by Zhou (Vijayakumar et al. 
1998), who assigns a region where association occurs 
in a diffusion-controlled limit and defines it as the 
encounter complex. 

A basic mechanistic question is whether association 
can be viewed as a simple diffusion limited reaction, 
or whether it involves a reaction limited com¬ 
ponent, accounting for desolvation and structural 
rearrangement of the interfaces. Experimental evi¬ 
dence suggests that binding goes through an interme¬ 
diate (encounter complex) and binding transition state 
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(Schreiber et al. 2009). Therefore, for the following 
discussion on association, we will view it as a three- 
step process that includes an intermediate state (A*B), 
in which the two proteins have near-native separations 
and orientations (often referred to as transient or 
encounter complex). A basic kinetic scheme describ¬ 
ing three state binding is: 

A-f B AB (Scheme 1) 

k_i k_2 


clear that a simple distinction between a diffusion- 
controlled, reaction-controlled, or mixed regime for 
a given association reaction is not appropriate, as this 
depends on the protein concentrations. At low protein 
concentrations, the observed rate of association 
increases linearly with concentration. However, at 
high protein concentration the encounter complex 
may be populated, thus association is reaction 
controlled. 


ki in this scheme is the only step that depends on the 
concentrations of the reactants. All the other rates 
include the formation and breaking of interactions 
and are thus at least partially reaction controlled. At 
physiological protein concentrations, the rate-limiting 
step for association is ki*k 2 , thus it is the formation of 
the transient complex times the time of evolution of the 
transient complex to form the final complex. For dis¬ 
sociation, the rate-limiting step is k _2 (Kiel et al. 2004; 
Schreiber et al. 2009). Assuming steady state, 
the overall rates for association and dissociation 
therefore are: 


. _ kihlAWB] 

“ k-,+k2 


kik-ilAB] 

k-i+k2 


^ojf 


= k-2 


( 1 ) 


Assuming a standard state (1 Mol concentration) 
and ^2 ^ k -2 the association rate constant can be 
written in its familiar form: 


k\k2 

k-i -f ^2 


( 2 ) 


which is bounded by the diffusion-controlled rate con¬ 
stant, ky, for reaching the transient complex. This limit 
is reached when conformational rearrangement is fast 
relative to the dissociation of the transient complex 
(i.e.,^ 2 >^-i), leading to 


k\k2 

k-\ 


(3) 


In the opposite limit k 2 ^ k_i, conformational 
rearrangement or reaction becomes rate-limiting and 


kon ~ k\ 


(4) 


Note that kilk_x is the equilibrium constant for 
forming the transient complex. From Eq. 1 it becomes 


Electrostatic Rate Enhancement of 
Binding Rates 

In the transient complex the two protein molecules 
must satisfy translational/rotational constraints, 
which severely hinder the diffusion-controlled rate 
constant ka„. In the absence of any biasing force, the¬ 
oretical estimates put the resulting “basal” value, ka„Q, 
in the range of 10"^-10^ M“*s“* (Shaul and Schreiber 
2005). To go beyond the basal rate constant and reach 
values of 10-10^ M“*s“* intermolecular long-range 
electrostatic forces must be present. Experimental 
measurements have shown that the electrostatic 
attraction speeding up the diffusion process is not 
affecting the rate of ^2 (Kiel et al. 2004). An analysis 
of many protein-complexes has shown that electro¬ 
static repulsion between the protein partners is rare, 
in about half of the cases electrostatic attraction had 
little influence, in another 30% electrostatic attraction 
speeded up association by up to 50-fold and in the 
remaining 20% electrostatic attraction had a major 
influence on the rate of association (Shaul and 
Schreiber 2005). This was observed in cases were 
speed was important, like for barstar inhibiting 
the RNAse activity of barnase (Schreiber and 
Eersht 1993) or toxins blocking the Shaker potassium 
channel. The charge-complementarity argument 
can also be made to rationalize the excess basic resi¬ 
dues on fasciculin, the snake toxin-targeting 
AChE (Bui and McCammon 2006). AChE uses 
a negative electrostatic surface around the entrance to 
the active-site gorge for the fast binding of its posi¬ 
tively charged substrate. Electrostatic rate enhance¬ 
ment is between the better-simulated phenomenon 
that can accurately be simulated and predicted using 
a number of different methods (Zhou 1997; Selzer 
et al. 2000; Gabdoulline and Wade 2001; Schlosshauer 
and Baker 2004). 
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Characterizing the Binding Encounter 
Complex 

The binding encounter complex was only recently 
experimentally assigned. Using paramagnetic NMR 
spectroscopy (Volkov et al. 2006) suggested that the 
structure of the encounter complex of the electron 
transfer complex of yeast cytochrome c peroxidase 
(CcP) and iso-1-cytochrome c is very short lived, 
with a dominant structure supporting electron transfer 
and a dynamic encounter complex. The results support 
the view that the conformational space sampled by the 
protein molecules during the dynamic part of the inter¬ 
action is localized around the Cc position in the dom¬ 
inant orientation. Tang et al. did a similar study for the 
association between the phosphocarrier protein Hpr, 
and three proteins in the bacterial phosphotransferase 
system using paramagnetic relaxation enhancement 
(PRE) NMR (Tang et al. 2006). However, they found 
a much broader definition of the encounter complex, 
which was spread across its adjacent surface, with 
electrostatic attraction being the main driving force in 
its stabilization. However, at higher salt, the encounter 
complex became more specific (Suh et al. 2007). This 
is not surprising, as electrostatic forces at low ionic 
strength are very strong, and can promote nonspecific 
binding (Selzer et al. 2000). A very different method 
was applied by Harel et al., who used double-mutant 
cycles probing the transition state for binding, AAGint^, 
as a measure of the transient interactions formed 
between the proteins during the association reaction: 

AAG^nt = AAG^x^a.y^a — AAG^x^a 

- AAG*y^a (5) 

Here, X and Y represent the wild type residues and A 
represents a mutant. Like <l)-value analysis, also this 
technique was first applied successfully to protein¬ 
folding studies (Horovitz and Fersht 1990). From an 
analysis of a large number of double-mutant cycles 
calculated for the activated complexes of barnase/ 
barstar and TEMl/BLIP, a number of significant inter¬ 
actions during association were retrieved. These were 
used to model the structure of the association transition 
states (Harel et al. 2007). It was shown that the transi¬ 
tion state for association was stabilized by electrostatic 
interactions, with the ensemble of structures spread out 
around the final complex. In neither case short-range 


interactions were formed during the transition state, 
suggesting a solvated transition state. The average 
transition state structure was not necessarily located 
exactly on the binding site, but may be shifted toward 
one side of the interface. These data suggest a certain 
pathway for association (energy funnel), which would 
help in speeding up association. Further studies on the 
nature of the encounter complex have suggested that 
encounters can be specific (occupying a small region) 
or diffusive (distributed over a larger region on the 
surface). Moreover, many encounter complexes may 
be futile, not leading at all to complex formation (Harel 
et al. 2009). These futile encounters can be eliminated 
or enhanced, without any cost in terms of rates of 
association. 

Summary 

The association of proteins to form a complex is 
a multi-step process, which starts by random collisions 
of the individual proteins. Multiple collisions and rota¬ 
tional diffusion bring the proteins to an orientation that 
is close to that of the native complex, leading to the 
formation of a transient complex. This part of the pro¬ 
cess is diffusion-controlled, and strongly affected by 
electrostatic interactions. Computer simulations and 
experimental data suggest that the transient complex 
develops into the native complex through a transition 
state. Structurally, the transient complex and the tran¬ 
sition state seem to be similar; however, the transition 
state is smaller, with the encounter complex occupying 
also futile areas that will not further develop into 
a complex. The most important parameter that affects 
the rate of association is the electrostatic force, which 
can act over a distance. However, local surface 
complimentarity and desolvation play also an important 
role, and in some cases mutations can be located that 
change the rate of association due to these factors. 

Cross-References 

► Differential Transition State Stabilization - 
Nonempirical Analysis 

► Double-Mutant Cycle Analysis 

► Electron Paramagnetic Resonance Spectroscopy 

► Steady-State Enzyme Kinetics 

► Transient-State Kinetic Methods 
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Introduction 

Exploration of sequence space is the mechanism by 
which proteins - and, by extension, species - evolve. 
Two general classes of sequence changes are: (1) local 
changes, involving substitutions of residues, or short 
insertions and deletions, and (2) larger scale reorgani¬ 
zations of domains. 

A domain is a compact subunit of a protein, often but 
not always formed of a contiguous stretch of polypep¬ 
tide chain. Most proteins in eukaryotes are comprised of 
multiple domains. The choice of domain, the number of 
repeats, and the order of the domains within the protein, 
all are common mechanisms whereby proteins can 
develop novel functions, or can combine functions 
associated with individual domain or domain types. 

Challenges in studying protein evolution include 
several related goals: 

1. A reliable method to decide which proteins are 
homologous. 

2. An understanding of the allowable evolutionary 
pathways that connect homologous proteins. 
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3. An understanding of what constrains the divergence 
of protein sequences, structures, and functions. 
From the point of view of the genome, the simplest 
change that may affect the structure or function of 
a protein is a single nucleotide polymorphism (SNP). 
A SNP in a protein-coding region may have one of the 
several effects on the amino-acid sequence: 

1. It may be what is usually called a silent mutation. 
That is, it may change a codon for one amino acid to 
another codon for the same amino acid. Although 
this indeed has no effect on the amino-acid 
sequence, it may have secondary effects on the 
folding process, or even on the final structure; per¬ 
haps silent - although a well-established term - is 
really too strong a characterization. 

2. A SNP may be a missense mutation, if it changes 
a codon for one amino acid to a codon for a different 
amino acid. The effect on the protein structure may be 
mild or severe: This will depend on (a) the position of 
the mutated residue within the stmcture - for instance, 
whether it is on the surface or in the interior - and (b) 
on the degree of change in physicochemical properties 
of the amino acid. Some amino-acid changes tend to 
be conservative, for example, leucine to isoleucme. 
Others change radically the size or charge of the side 
chain. Nonconservative mutations at positions in pro¬ 
tein interiors would be expected, /7rima/ade, to have 
more severe effects. 

3. The most dramatic effect that a SNP can have on 
a protein is a change from a codon that specifies an 
amino acid to a stop codon, or the reverse. Changing 
a codon for an amino acid to a stop codon will 
produce a truncated protein. Such a protein is 
extremely unlikely to be functional, or even to 
retain any semblance of the original native struc¬ 
ture. Conversely, a change of a stop codon to 
a codon that specifies an amino acid will cause 
“read-through,” and produce an extended protein. 
This may include the original protein structure, but 
the C-terminal tail may well cause dysfunction, 
instability, and even misfolding of the protein. The 
hemoglobin a-chain variant Hb Constant Spring is 
a protein in which the terminal stop codon has 
mutated to a codon for glutamine, producing 
a protein 172 residues long rather than the normal 
141. However, many proteins are engineered by 
addition of a green-fluorescent protein tag, in 
order to determine cellular location, and often the 


structure and activity of the original protein sur¬ 
vives. This should be considered an example of 
a domain fusion which is a common natural occur¬ 
rence. Note the difference between addition of 
a complete domain - which usually works - and 
extension by an unselected sequence as in Hemo¬ 
globin Constant Spring. 

SNPs outside of coding regions may also affect pro¬ 
tein structure and function. A mutation in a splice site 
may cause errors in processing of mRNA, producing an 
aberrant splice variant. Mutations in control regions can 
affect expression rates. Even if the individual proteins 
produced retain native structure and function, the muta¬ 
tions may affect the amount of protein produced. 

Insertions and deletions of nucleotides can also 
have mild or severe effects. Insertion or deletion of 
a multiple of three nucleotides will cause the localized 
insertion or deletion of one or more amino acids; this 
may be tolerable. Insertion or deletion of a number of 
nucleotides not equal to a multiple of three will disturb 
the reading frame. All translated residues after the site 
of insertion or deletion will no longer correspond to the 
original coding sequence. Such a protein would not 
show a normal structure and function. Changes of 
this type can however contribute to evolution: During 
the process of domestication of rice, a truncation in the 
Rc gene produced loss-of-function in a protein that 
promoted expression of an enzyme that produced red 
pericarp color, causing as a result the white grain color 
that was selected (Sweeney et al. 2006). 

The Relationship between Sequence 
Divergence and Structure Divergence 

Every amino acid substitution, or insertion or deletion, 
perturbs the structure of the protein. In studying evo¬ 
lution of natural proteins, we know that if the structure 
or even the function changed radically, the mutation 
may be lethal, and if so we will not see the protein in 
a viable organism. Much observable protein evolution 
at the domain level proceeds through small but pro¬ 
gressive divergence in sequence and structure. Gene 
duplication allows evolution to “eat its cake and have it 
too”: A functional copy can be retained while the other 
copy can diverge. 

Each mutation adds to the stress on the native struc¬ 
ture. Native protein structures of globular proteins are 
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compact assemblies. Their interiors are as well packed 
as crystals of amino acids, despite the constraints of the 
connected main chain. Any non-silent mutation must 
necessarily change at least the size and/or shape and/or 
charge of at least one side chain. How the protein 
structure reacts to the stress thereby created depends 
on where the mutations appear. 

Mutations of surface residues tend to make minimal 
changes in protein stmctures. However, they can affect 
protein interactions. These interactions may be with sub¬ 
strate molecules, in which cases mutations of active-site 
residues can alter function even if they do not alter overall 
structure. Changes in interactions with other proteins may 
change properties of aggregation. Sickle-cell anemia was 
the first example of a protein-aggregation disease, caused 
by mutations in a surface residue P6Glu ^ Val in human 
hemoglobin. Many other protein-aggregation diseases 
are now known, including Huntington’s disease and 
Alzheimer’s disease. 

The tertiary structure of a protein domain generally 
includes secondary structure elements - helices, and 
strands of sheet, connected by loops. Loops tend to be 
at or near the surfaces of proteins. They can often 
accommodate amino-acid substitutions, and insertions 
and deletions, relatively easily. It is mutations in the 
interior of the protein, at the interfaces between ele¬ 
ments of secondary structure, that pose the greatest 
challenges to protein structures. As a rule of thumb, 
the packings of interfaces between secondary struc¬ 
tures are tight enough that they can accommodate the 
odd extra methyl group, but not much more. Mutations 
in interior interfaces tend to cause shifts and rotations 
of secondary structure elements in contact. Helices 
tend to be more rigid than strands of sheet. 

As a result, the distributions of accepted mutations 
in natural evolution, and of the changes in structure, 
vary throughout a domain. In order to see the regular¬ 
ities, it is necessary to distinguish the portion of the 
structure that retains its basic topology - although it 
may be subject to distortion - from those other parts of 
the structure that are more plastic. 

Evolutionary Structural Change within 
Domains 

Families of related proteins tend to retain common 
folding patterns. However, although the general 


folding pattern is preserved, distortions increase as 
the amino-acid sequences progressively diverge. 
These distortions are not uniformly distributed 
throughout the structure. Usually, a large central core 
of the structure, including major elements of secondary 
structure and the active site, retains the same qualita¬ 
tive fold, and other parts of the structure change con¬ 
formation more radically. As a simple analogy, 
consider the letters B and R. As structures, they have 
a common core which corresponds to the letter P. 
Outside the common core they differ: At the bottom 
right B has a loop and R has a diagonal stroke. 

Figure 1 compares spinach plastocyanin and 
cucumber stellacyanin. (For other illustrations of struc¬ 
tural comparisons of homologous proteins, and discus¬ 
sion of classification schemes, see chapter 4 of Lesk 
(2001). That book contains a large number of pictures 
of protein structures suitable for browsing). 

Additional specific constraints on protein evolution, 
dictated by function, vary from case to case. For plasto¬ 
cyanin and stellacyanin, the distortions in even the dou- 
ble-P-sheet structure are much larger than those in the 
copper-binding site itself. In many cases, as here, these 
constraints operate at the global level of the domain, 
rather than at the local level only. Note that the copper¬ 
binding site is created by regions of the protein that are 
distant in the sequence but of course close together in 
space. That is, although there can be local shifts and 
rotations of interacting secondary structural elements, 
these shifts and rotations must be coupled over long 
ranges, to maintain the integrity of an active site. 

Systematic studies of the structural differences 
between pairs of related proteins have defined 
a quantitative relationship between the divergence of 
the amino-acid sequences of the core of a family of 
structures, and the divergence of structure (Chothia 
and Lesk 1986). As the sequence diverges, there are 
progressively increasing distortions in the main-chain 
conformation, and the fraction of the residues in the 
core usually decreases. Until the fraction of identical 
residues in the sequences drops below about 40-50%, 
these effects are relatively modest. Almost all the 
structure remains in the core, and the deformation of 
the main-chain atoms is on the average no more than 
1.0 A. With increasing sequence divergence, in most 
cases more regions refold entirely, reducing the size of 
the core, and the distortions of the residues remaining 
within the core increase in magnitude. 
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Proteins: Relationship Among Divergence of Sequence, 
Structure, and Function, Fig. 1 Two related proteins that 
share the same general folding pattern, but differ in detail. 
Circles represent copper ions, (a) Spinach plastocyanin [Protein 
Data Bank entry lag6], (b) cucumber stellacyanin [Ijer]. (c, d) 
Supeiposition, showing (c) the entire structures, and (d) only the 
well-fitting core. The main secondary structural elements of 


A correlation between the divergence of 
sequence and structure applies to all families of pro¬ 
teins. Figure 2a shows the changes in structure of the 
core, expressed as the root-mean-square (rms) devia¬ 
tion of the main-chain atoms after optimal superposi¬ 
tion, plotted against the sequence divergence, 
expressed as the % conserved amino acids of the core 
after optimal alignment. The points correspond to pairs 
of homologous proteins from many related families. 
(Those at 100% residue identity are proteins for which 
the structure was determined in two or more crystal 
environments, and the deviations show that crystal 
packing forces - and, to a lesser extent, solvent and 
temperature - can modify protein conformation 
slightly). Figure 2h shows the changes in the fraction 
of residues in the core as a function of sequence diver¬ 
gence. The fraction of residues in the cores of distantly 
related proteins can vary widely: In some cases, the 
fraction of residues in the core remains high, in others, 
it can drop to helow 50% of the structure. 


these proteins are two (l-sheets packed face-to-face. It is seen 
in the supeiposition that several strands of [l-sheet are conserved 
but displaced, and that the helix at the right of the cucumber 
stellacyanin structure has no counterpart in the spinach plasto¬ 
cyanin structure. Even the (relatively) well-fitting core shows the 
conservation of folding topology but nevertheless reveals con¬ 
siderable distortion (From Desk 2007) 


A family of proteins is a set for which similarities of 

sequence and/or structure provide evidence that they 

are related hy evolution. 

Evolutionary Relationships among Proteins: 
Homologues, Orthologues, and Paralogues 

• Proteins are homologous if and only if they are 
descended from a common ancestor. 

• Homologues in different species, descended from 
a single ancestral protein, are orthologues. 

• Homologues in the same species, arising from gene 
duplication, are paralogues. Their descendants are 
also paralogues. After gene duplication, one of the 
resulting pairs of proteins can continue to provide 
its customary function, releasing the other to 
diverge, to develop new functions. Therefore, infer¬ 
ences of function from homology are more secure 
for orthologues than for paralogues. 
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Proteins: Relationship Among Divergence of Sequence, 
Structure, and Function, Fig. 2 Relationships between diver¬ 
gence of amino-acid sequence and three-dimensional structure 
of the core, in evolving proteins, (a) Variation of rms. deviation 
of the core with the percent identical residues in the core, (b) 

The globin family contains both orthologues and 
paralogues. An ancestral monomeric hemoglobin 
gene duplicated about 450-500 Ma ago to form 
paralogous a and P chains. Subsequent divergence 
produced orthologous a chains and orthologous P 
chains in horse and human (and many other species). 

Differential gene loss can make it difficult to distin¬ 
guish orthology from paralogy. What if humans had 
lost their a chains and horses their P chains? The 
remaining human p and horse a chains would appear 
to be orthologues. Full-genome information can often 
resolve such ambiguities. 

Because only rarely can we observe homology 
directly, we must infer homology from similarity in 
sequence, structure, and function. This has two 
implications: 

• We must calibrate our measures of similarity to 
provide thresholds for confident conclusion about 
homology. To exclude non-homologues that never¬ 
theless show some degree of similarity, we must set 
the criteria fairly tight. 

• It would be very useful to know how far evolution¬ 
ary divergence can proceed. This would allow us to 
assess whether the choice of strict criteria for infer¬ 
ence from similarity to homology leads us to miss 
many genuine homologues that are highly diverged. 
One method for showing that two very dissimilar 


b 



Variation of size of the core with the percent identical residues in 
the core. This figure shows results calculated for 32 pairs of 
homologous proteins of a variety of structural types. (Adapted 
from Chothia and Lesk (1986) 

proteins are actually homologues would be to 
exhibit a series of intermediate proteins that form 
a connected path between them, such that each 
successive pair on the path has high similarity. 
This is not always possible. 

In the evolution of individual protein domains, 
a succession of mutations leads to progressive diver¬ 
gence of sequence and structure. The results presented 
in Fig. 2 describe observations of structural changes in 
proteins that retain sufficient similarity - in structure if 
not in sequence - for us to be confident that they are 
genuinely related, or homologous. Criteria for recogniz¬ 
ing distant homologues are extrapolations from compar¬ 
isons of proteins sufficiently closely related in structure 
and function that homology is not in question. For 
example, sperm whale and seal myoglobin have 127/ 
153 = 83% identical residues in an optimal pairwise 
amino-acid sequence alignment, have very similar 
structures including the same cofactor (the heme 
group) bound by corresponding residues, and have iden¬ 
tical functions. These are certainly homologues. 

For distant relatives, homology is recognizable 
more confidently in structure than in sequence. In 
most cases, the basic folding pattern of the family 
remains the same fold; plastocyanin and stellacyanin 
are examples. Usually, there is at least a small set of 
residues conserved at least in physicochemical 
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character, although it usually requires knowledge of 
the structure to appreciate their role in creating the 
common architecture of the fold. This, together with 
functional cues, gets most of the job done - the job 
being to distinguish homologous from nonhomologous 
proteins. 

What continues to create difficulty are: 

• Cases in which folds are tantalizingly similar in the 
absence of any sequence similarity above the noise 
level, and no analogy in function. The case of phy- 
cocyanin and the globins is an example (Pastore and 
Lesk 1990). 

• Cases of similar sequences with different folding 
patterns. These are relatively rare. 

The Relation between Protein Evolutionary 
Divergence and Classifications of Protein 
Structure 

Proteins from the same family have similar structures. 
However, proteins from different families often con¬ 
tain recurrent structural themes. Folding patterns of 
protein domains are built by combining local struc¬ 
tures, such as a-helices and p-sheets. Some features 
of protein structures are what they are because the laws 
of physics and chemistry would not allow them to be 
otherwise. Some are imposed by the mechanism of 
evolution. In addition, historical accident has played 
a large role in creating the roster of folding patterns 
observed in nature. It is by no means easy to sort out 
these effects. 

Nevertheless, we can compare and classify the con¬ 
formations of apparently unrelated proteins, on the 
basis of secondary structures and their folding patterns. 
Several web sites, such as SCOP and CATH, offer such 
classifications. They offer classifications 
encompassing all known protein structures, a useful 
thing to have. However, within the hierarchy of such 
a classification, only the relationships among classes of 
proteins within the same family necessarily reflect 
evolutionary divergence. At higher levels of the hier¬ 
archy, the classification is based purely on architec¬ 
tural similarity, independent of provable - albeit often 
suspected - evolutionary relationship. 

Although the idea that the basic common core fold 
of a protein remains intact during evolution is widely 
accepted, there are known exceptions. In a classic 
paper, Grishin (2001) exhibited the structures of the 


C-terminal domain of catabolite gene activator protein 
(CAP) and the HIN recombinase DNA-binding pro¬ 
tein. The former is an all-helical protein and the latter 
contains a P-sheet. Comparing only these two struc¬ 
tures, one would unhesitatingly conclude that their 
folding patterns are markedly different, and infer that 
they are not homologous. However, the N-terminal 
domain of biotin repressor and the C-terminal domain 
of ribosomal protein Lll are intermediate between 
them. Successive pairs in this series have more similar 
folding patterns. It is not possible easily to rule out that 
there is an evolutionary pathway between CAP and 
HIN recombinase (even if the N-terminal domain of 
biotin repressor and the C-terminal domain of ribo¬ 
somal protein Lll do not themselves lie on it), nor, 
conversely, to be confident that these four structures 
are homologues. 

How Do Proteins Evolve New Functions? 

Mechanisms of protein evolution that produce altered 
or novel functions include: (1) divergence, (2) recruit¬ 
ment, and (3) “mixing and matching” of domains. 

Divergence. Many mutations conserve function but 
modulate specificity. For example, the trypsin family 
of serine proteinases contains a specificity pocket, 
a surface cleft complementary in shape and charge 
distribution to the side chain adjacent to the scissile 
bond. Mutations tend to leave the backbone conforma¬ 
tion of the pocket unchanged but to affect the shape 
and charge of its lining, altering the specificity. Lactate 
and malate dehydrogenases are another example. 
Visual pigments tune spectral sensitivity by point 
mutations in residues interacting with the 
chromophore. 

Such families of proteins illustrate the kinds of 
structural features that change, and those that stay the 
same. In some cases, the catalytic atoms occupy the 
same positions in molecular space, although the resi¬ 
dues that present them originate in different points in 
the fold. In other cases, the positions in space of cata¬ 
lytic residues are conserved even though the identities 
and functions of the catalytic residues vary. In these 
cases, there appears to be a set of conserved functional 
positions within the space of the molecule. 

Several enzyme families show an even greater 
degree of divergence. The Apurinic/Apyrimidinic 
Endonuclease superfamily, a large diverse group of 
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phosphoesterases, includes members that cleave DNA 
and RNA, and lipid phosphatases. Even catalytic res¬ 
idues vary between different subfamilies of this group. 
For example, a His essential for function of DNA 
repair enzyme DNasel is not conserved in Exonuclease 
III. 

Conversely, many functions are provided by 
unrelated proteins. Chymotrypsin and subtilisin have 
produced the same catalytic mechanism for proteolysis 
by convergent evolution. 

Recruitment. Many people ask how much a protein 
must change its sequence before its function changes. 
The answer is: Not at all! There are numerous exam¬ 
ples of proteins with multiple functions: 

Eye lens proteins in the duck are identical in 
sequence to active lactate dehydrogenase and enolase 
in other tissues, although they do not encounter the 
substrates in the eye. They have been recruited to 
provide a structural and optical function. Several 
other avian eye lens proteins are identical or similar 
to enzymes. In some cases, residues essential for catal¬ 
ysis have mutated, proving that the function of these 
proteins in the eye is not enzymatic. In those species, 
the coexistence of mutated, inactive, enzymes in the 
eye and active enzymes in other tissues implies that the 
gene must have been duplicated. 

The activity of phosphoglucose isomerase 
(= neuroleukin = autocrine motility 
factor = differentiation and maturation mediator) 
depends on location. This protein functions as 
a glycolytic enzyme in the cytoplasm, but as a nerve 
growth factor and cytokine outside the cell. 

Divergence and recruitment are at the ends of 
a broad spectrum of changes in sequence and function. 
Aside from cases “pure” recruitment such as the duck 
eye lens proteins or phosphoglucose isomerase, in 
which a protein adopts a new function with no 
sequence change at all, there are examples on the one 
hand of relatively small sequence changes correlated 
with very small function changes, (which most people 
would think of as relatively pure divergence), rela¬ 
tively small sequence changes with quite large changes 
in function, (which most people would think of as 
recruitment), but also many cases in which there are 
large changes in both sequence and function. 

“Mixing and matching” of domains: Many large 
proteins contain tandem assemblies of domains or 
modules which appear in different contexts and orders 
in different proteins. Censuses of genomes suggest that 


many proteins are multimodular. Of 4,401 genes in E. 
coli, 287 correspond to proteins containing 2, 3, or 4 
modules. The structural patterns of 510 E. coli 
enzymes involved in metabolism of small molecules 
can be accounted for in total or in part by 213 families 
of domains. 

Multidomain proteins present particular problems 
for assignment of function in genome annotation, 
because the domains may possess independent func¬ 
tions, or may modulate one another’s function, or act 
in concert to provide a single function that may depend 
on the domain composition and even order. On the other 
hand, in some cases, the presence of a particular domain 
or combination of domains is associated with a specihc 
function. For example, NAD-binding domains appear 
almost exclusively in dehydrogenases. 

Protein Evolution at the Level of Domain 
Assembly 

Comparisons of protein sequences and structures show 
that the domain is an important unit of protein evolu¬ 
tion. Domains appear in different proteins in different 
combinations. Thereby, from a relatively small roster 
of domain families, evolution can assemble a large 
number of complete proteins. 

Based on known protein structures, it has been 
possible to define approximately 2,000 domain super¬ 
families. The population of domains encoded by 
known genes is unevenly distributed. Nine domain 
superfamilies account for 20% of the matched domains 
in human genes. These include: 


Most common domains 

Number of matches 
in human genome 

CH2 and C2HC zinc fingers 

3,693 

Immunoglobulin 

1,778 

P-loop nucleoside triphosphate hydrolase 

1,024 

G-protein-coupled receptors: family A 

824 

Fibronectin type III 

802 

EGF/laminin 

697 

Cadherins 

686 

Protein kinases 

539 

PH domains 

491 


From Chothia and Gough (2009) 


Similar results apply to other eukaryotic genomes: 
Fugu, Drosophila melanogaster, and Caenorhahditis 
elegans, although the rank order is not the same. 
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Yang and Bourne (2009) have shown that the 
domain prevalence profiles characterize different taxa 
in the tree of life. Indeed this allows inference of when 
in evolutionary history domains were “invented” or 
lost. Most domains appeared early on in the history 
of life. As organisms developed greater complexity in 
both structure and metabolism, it became more com¬ 
mon to develop novel protein functions by making new 
combinations, of greater numbers of domains, than by 
creating new ones. (For organisms deep in the tree of 
life, close to the root, it is more difficult to analyze 
microscopically the evolutionary distribution of 
domains, because of the increased amount of horizon¬ 
tal gene transfer, including but not limited to endosym- 
biosis events producing subcellular organelles - 
mitochondria and chloroplasts - followed by gene 
transfer from organelle to nucleus). 

Conclusion 

Included in the 70,000 protein structures now known 
are many families of domains in which the molecules 
maintain the same basic folding pattern over ranges of 
sequence similarity from near-identity down to well 
below 20% conservation. The proteome of an organ¬ 
ism depends not only on the particular sequence - and 
hence the particular structure and function - of indi¬ 
vidual domains, but on the combinations and spatial 
relationships of domains within complex proteins. 

The general response to mutation is structural 
change. However, many mutations produce structural 
changes that retain the basic characteristics of structure 
and function. It is characteristic of biological systems 
that the objects we observe to have a certain form arose 
by evolution from related objects with similar but not 
identical form. These forms must, therefore, be robust: 
The ability to tolerate variation is a necessary property 
of evolving life. 

Cross-References 

► Alignment of Protein Sequences 

► Homology Modeling of Protein Structures 

► Protein Structure Comparison Methods 

► Protein Structure Prediction and Structural 
Annotation of Proteomes 

► Structural Impact of SNPs 
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Synonyms 

Extracellular matrix components, Glycoconjugates, 
Glycoproteins 

Definition 

Proteoglycans are glycoconjugates consisting of a pro¬ 
tein core carrying O-linked glycosaminoglycan side 
chains. They are constituents of the cell surface and 
of the extracellular matrix and play roles in matrix 
structure and organization, embryonic morphogenesis, 
and cell surface signal transduction. 

Basic Characteristics 

The distinguishing feature of the group of glycopro¬ 
teins known as proteoglycans is the presence of one or 
more O-linked glycosaminoglycan (GAG) side chains. 
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Proteoglycans, Fig. 1 Structures of the glycosaminoglycans. 
Glycosaminoglycans (GAGs) are linear polysaccharides com¬ 
posed of repeating disaccharides in which a hexosamine unit 
alternates with a hexose (for keratan sulfate only) or uronic acid. 
All the GAGs other than hyaluronan (1) are found 0-linked 
through serine or threonine to a protein core. Hyaluronan is 
also the only GAG with a completely regular, unsubstituted 
structure. 

The glucosamine-containing GAGs are heparan sulfate (HS) 
and its highly sulfated variant heparin, both of which contain 
structures 2 and 3. After polymerization, structure 2 may be 
de-N-acetylated and re-N-sulfated; (l-D-glucuronate may be 
epimerized to a-L-iduronate; sulfotransferases then may add 
sulfate esters at C-2 of iduronate and/or C-6 of glucosamine. 
Complete transformation to structure 3 gives the main repeat unit 
of heparin. Partial transformation occurs to give domains of 
variable length and sequence in heparan sulfates, which are 
highly variable between tissues and species. 

The galactosamine-containing GAGs are the chondroitins A 
(chondroitin-4-sulfate, C4S), B (known as dermatan sulfate), 
and C (chondroitin-6-sulfate, C6S), shown in structures 4, 5, 
and 6, respectively. Most CS samples contain mixtures of these 
structures, often in the same polysaccharide chain. 

Keratan sulfate, KS (structure 7), does not contain uronic 
acid, and its degree of sulfation is variable 

Both protein core and GAG side chain are important to 
the physiological roles of proteoglycans. 

Structures of the GAGs are presented in Fig. 1 

GAGs are multifunctional macromolecules. They 
are highly charged, linear polyelectrolytes (Seyrek 
and Dubin 2010), and can be thought of as featureless, 
flexible rods of negative charge with bulk properties 
such as “charge density”; this can be useful when 


considering the structural properties of matrix proteo¬ 
glycans. On the other hand, they are also capable of 
containing saccharide sequences with high specificity 
for particular protein sites, and it is this set of proper¬ 
ties which dominates the behavior of, for example, 
heparin as an anticoagulant through its interaction 
with the serine protease inhibitor antithrombin. Anal¬ 
ysis of detailed heparan sulfate sequences is a subject 
of much current interest (Turnbull et al. 2010). 

Classification of Proteoglycans 

There is no formal classification scheme for the diverse 
protein cores of the PGs. In an influential review (lozzo 
1998), the matrix proteoglycans are described in three 
categories: basement membrane proteoglycans 

(perlecan, agrin, bamacan), small leucine-rich proteo¬ 
glycans (SLRPs), and “hyalectans,” proteoglycans 
which bind both hyaluronan and lectin (aggrecan, 
versican, neurocan, brevican). A second classic review 
(Bernfield et al. 1999) has described the cell-surface 
proteoglycans; the transmembrane protein syndecans, 
and the GPTanchored glypicans. These proteoglycans 
bear heparan sulfate chains and are involved in inter¬ 
actions between the cell and its environment. The 
proteoglycan serglycin, found in mast cell granules, 
does not fall into any of the above families. 

Proteoglycans as Structural Elements in the 
Extracellular Matrix 

Proteoglycans such as aggrecan, versican, and 
brevican, along with the protein collagen, and the 
polysaccharide hyaluronan, are major components of 
extracellular matrix (ECM). Far from being simple 
“filler” between cells, the matrix provides appropriate 
structural properties according to tissue type (for 
example, strength and flexibility for cartilage and 
tendon, transparency for the cornea) and modulates 
the movement of cells and proteins in response to 
change (for example, in embryonic development and 
inflammatory response to injury or infection). 

The proteoglycan aggrecan is abundant in connec¬ 
tive tissue, associated through its link protein domains 
to hyaluronan (itself a GAG, but with no protein core) 
(Fig. 2a). These complexes form a hydrated medium 
separating collagen fibrils. Aggrecan bears numerous 
chondroitin chains, packed closely so as to form an 
anionic GAG “brush.” The combination of tensile 
strength of collagen and compressibility of the 
hydrated GAG/proteoglycan medium is a common 
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Proteoglycans, Fig. 2 The diversity of form and function in 
proteoglycans. Simplified cartoon representations, accurate 
neither to scale nor to sequence, intended to give an impression 
of the molecular architecture of some proteoglycan types, 
(a) Aggrecan in complex with hyaluronan in the extracellular 
matrix. The thickly chondroitin and keratan substituted central 
section of aggrecan has globular domains at the N- and 
C-termini; two link protein (LP) domains interact with 
hyaluronan to give a huge, hydrated macromolecular complex 
in the extracellular matrix, (b) A generic SLRP: decorin or 
biglycan. The central sequence of leucine-rich repeats (LRRs) 


is flanked by two cysteine-containing sequences, with one or two 
GAG chains - either dermatan sulfate or chondroitin sulfate in 
the example shown but often keratan sulfate in, for example, 
lumican and keratocan. (c) Syndecan and glypican on the cell 
surface; syndecan has a transmembrane domain and a functional 
cytoplasmic domain, whereas glypican is GPTanchored (shown 
in red), (d) A heparin-substituted serglycin molecule, carrying 
an extremely high concentration of negative charge, necessary 
for the efficient packing of mast cell granules with basic serine 
proteases. Note that the monosaccharide representation is a 
simplified version of that in Fig. 1 
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structural feature responsible for the functionality of 
tendon and cartilage. 

The SLRP group of small proteoglycans is impli¬ 
cated in the formation and relative orientation of col¬ 
lagen fibrils in ECM. For example, several SLRP 
family members are necessary for the strictly regular 
distribution of collagen fibrils which ensures transpar¬ 
ency of the cornea. The protein cores are important for 
this function (Hassell and Birk 2010). The SLRPs form 
a large family of proteoglycans, many members having 
been first described in the relatively recent past 
(Schaefer and lozzo 2008); though the examples 
shown in Fig. 2b are substituted with chondroitin or 
dermatan sulfates, many SLRPs, including those in the 
comeal stroma, bear keratan sulfate. 

Heparan Sulfate Bearing Proteoglycans on the Cell 
Surface and Basement Membrane 

In addition to their structural roles in the ECM, the 
proteoglycans have a strong influence on the movement 
of small proteins between cells and on their reception at 
the cell surface and signal transduction into the cell. 
The heparan sulfate (HS) bearing proteoglycans of the 
cell surface (syndecans, glypicans) and basement mem¬ 
brane (such as perlecan) are associated with these func¬ 
tions. In embryonic development, regulation of the 
diffusion of morphogens is essential, and it has become 
clear that HS-bearing proteoglycans, whether on the 
cell surface or in the matrix, are involved in the shaping 
of morphogen gradients (Yan and Lin 2009). Similarly, 
matrix GAGs form guides for developing blood vessels 
and neurones. Though HS is the GAG most commonly 
associated with regulation of growth factors and other 
morphogens, some forms of CS may also play similar 
roles, particularly in the nervous system. 

Cell surface HS, borne on the transmembrane pro¬ 
teoglycan syndecan, has long been known to form a 
part of the cell surface receptor complex for the fibro¬ 
blast growth factors FGF-1 and FGF-2, and syndecans 
can also act as signaling molecules in their own right 
(Multhaupt et al. 2009). The GPI-anchored glypicans 
are more closely associated with morphogen interac¬ 
tions (such as Wnt, Hh, and bone morphogenetic pro¬ 
teins) (Filmus et al. 2008). Simplihed illustrations of 
syndecan and glypican architectures are shown in 
Fig. 2c. 

Perlecan is the exemplary basement membrane pro¬ 
teoglycan; this very large multi-domain protein bears 
HS chains at its N-terminus. Like the glypicans, it is 


essential for correct morphogenesis and is associated 
with the pericellular area. 

Serglycin 

Serglycin is the proteoglycan found in mast cell gran¬ 
ules. The associated glycosaminoglycan is either 
heparin or highly sulfated chondroitin, attached in the 
mid-region of the serglycin core to form a dense brush 
(Fig. 2d) (Kolset and Tveit 2008). Serglycin is essen¬ 
tial for the correct, tight packaging of mast cells pro¬ 
teases within the granule, and may also have a function 
in the activity of these proteases after their release, as 
part of the immune/inflammatory reaction. 

Cross-References 

► Carbohydrate NMR Spectroscopy 

► Glycoconjugates in Cell Function and Therapeutics 

► Glycoproteins 

► Glycosaminoglycan (GAG) 

► Molecular Dynamics Simulations of Carbohydrates 
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Synonyms 

Concerted proton-electron transfer; Electron transfer; 
Electron/proton transfer (EPT); Hydrogen atom trans¬ 
fer (HAT); Sequential proton loss electron transfer 
(SPLET) 

Definition 

Proton-Coupled Electron Transfer (PCET) reactions 
are broadly classified as redox processes where the 
rate and/or energetics are affected by one or more 
protons. This includes processes in which protons and 
electrons transfer among one or more reactants, in one 
or more kinetic steps, and processes in which protons 
modulate electron transfer (ET) processes, even if they 
do not transfer. 

Basic Characteristics 
Introduction 

Electron transfer (ET) and proton transfer (PT) are the 
two most fundamental chemical reactions. Many, per¬ 
haps most, redox reactions are coupled to changes in 
proton content. This type of redox reaction is broadly 
referred to as “proton-coupled electron transfer” or 
“PCET.” PCET reactions can involve multiple protons 
(H"^) and electrons {e~), such as the AYClAe~ oxidation 
of 2 H 2 O to O 2 , or a single proton and electron, such as 
the biologically important oxidation of tyrosine to 


tyrosyl radical. The broad and diverse set of reactions 
that is now termed PCET are integral to myriad pro¬ 
cesses, from biological catalysis and ► oxidative stress 
to large-scale industrial syntheses to energy transduc¬ 
tion (Hynes et al. 2007; Allemann and Scrutton 2009). 
This is a broad and rapidly evolving field; this 
entry reflects the most widely used definitions and 
theory at present. 

Classes of PCET Reactions 

Proton-coupled redox reactions are by no means a new 
concept. The Nernst equation of 1889 describes how 
aqueous redox potentials vary with pH when protons are 
involved. From another perspective, physical and 
organic chemists have been studying free radical 
“hydrogen atom transfer” reactions for over a century, 
and a hydrogen atom (H") consists of H"^ - 1 - e~. 
Only relatively recently was it realized that the 
seemingly disparate set of HV^“ transfer reactions are 
connected in many ways (Hynes et al. 2007). 
This section describes the different subsets of PCET 
that are currently and historically of the greatest interest. 

PCET, as originally formulated, referred to those 
reactions where H'*' and e~ transfer in a single (con¬ 
certed) kinetic step. As noted above, it has since been 
recognized that a great many redox reactions occur with 
changes in proton content, and the term “PCET” has 
been applied to of these reactions, regardless of mech¬ 
anism. Thus, it is now preferred to use the broad defi¬ 
nition of PCET given above (Huynh and Meyer 2007). 

Unfortunately, the nomenclature in this field is still 
in flux. Researchers have come to PCET from many 
different backgrounds and have often developed their 
own definitions to describe a given subset of PCET 
reactions. In many cases, different terminology is 
being used to describe the same set of reactions 
(Reece and Nocera 2009). For reactions in which 
one electron and one proton are transferred in 
a single kinetic step, Saveant and coworkers have 
proposed the term, “concerted proton-electron trans¬ 
fer” (CPET), that makes the mechanistic implication 
explicit. CPET is equivalent to the “CEP” term (con¬ 
certed electron/proton) used by Hammarstrom et al. 
and the “EPT” term (electron/proton transfer) used by 
Meyer et al. (Reece and Nocera 2009). The foregoing 
discussion will use the term CPET. 

One subset of CPET is hydrogen atom transfer 
(HAT). In the older physical organic literature, there 
was no need to define HAT as it was self-evident that 
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this referred to reactions involving concerted transfer 
of H' from a donor to an acceptor. However, it has now 
been realized that transition metal complexes can also 
undergo HAT reactions and that there is substantial 
overlap between HAT and other CPET reactions. 
This has led to the appearance of new definitions and 
new thinking about HAT (Hynes et al. 2007). 
Consider, for example, the degenerate H' self¬ 
exchange reactions between toluene and benzyl radical 
versus phenol and phenoxyl radical. In the toluene 
case, the H"^ and e~ start in the same bond and end in 
the same bond. In the phenol/phenoxyl reaction, how¬ 
ever, the transferring proton is in the molecular plane, 
but the transferring electron is in an orthogonal n 
symmetry orbital (Mayer 2004). These, and other, 
distinctions have led some to propose more restrictive 
definitions for HAT, such as restricting “HAT” 
processes as those there the proton and electron must 
come from the same bond (Huynh and Meyer 2007). 
(In the organic chemistry literature, “PCET” is some¬ 
times used for CPET reactions that are not considered 
to be HAT.) However, the restrictive definitions are 
often problematic, for example, leading to reactions 
being HAT in one direction but not another. In alkane 
oxygenations by the (porphyrin‘'^)Fe^'^ = O active site 
in ► cytochrome P450s, for instance, H"^ and e~ (H‘) 
are removed from a single C-H bond but are separated 
in the (porphyrin)Ee*'^(OH) product. Thus, it is pre¬ 
ferred to use the older definition, that hydrogen atom 
transfer refers to what is transferred. 

Finally, there are also concerted transfers of H"^ -i- e~ 
where the proton and electron transfer to/from 
different reagents, which are termed “separated 
CPET” or “multiple site EPT.” One important example 
is the oxidation of tyrosine Z (Yz) in the catalytic cycle 
of ► photosystem II, which is thought to occur by long- 
range electron transfer to PhSO*"^ concerted with proton 
transfer to a nearby histidine residue. Separated CPET 
events likely also occur in i*- ribonucleotide reductases 
and many other enzymatic reactions. Electrochemical 
CPET processes could also be classified as separated 
CPET since electron transfer occurs at the electrode 
and proton transfer occurs to/from a species in solution 
(Costentin 2008). The understanding of the differences 
and similarities between separated CPET and HAT are 
still emerging. CPET reactions can also occur to 
a molecule or active site in which the redox and acid/ 
base sites are spatially separated, such as a heme iron 
center and the heme propionate group. 


X-H > X- 


E°(XH+^0) 





Proton-Coupled Electron Transfer, Fig. 1 Thermochemical 
“square scheme” for a PCET reagent 


Thermochemistry of PCET Reactions 

Unlike electron transfer reactions, which can be 
uniquely dehned by a standard redox potential, PCET 
reactions require consideration of the energetics for 
both proton transfer and for electron transfer. Many 
biophysical redox potentials are reported at a single 
pH, but it would be preferable to measure the pH 
dependence of E and therefore the proton stoichiome¬ 
try. From the Nemst equation, E varies by [59 mV x 
(# protons involved)/(# electrons involved)] per pH 
unit change at 298 K. Thus, E for a 2e“/2H'^ couple 
such as quinone/hydroquinone varies by 59 mV/pH, 
while the le“/2H'^ semiquinone radical anion/hydro- 
quinone couple varies by 118 mV/pH. 

There are four thermochemical parameters that 
define the free-energy landscape of a le^/lH"^ PCET 
couple HX/X. These are £°(X°/“), £°(HX'"'°), and the 
pXaS of HX and HX"^, as shown in the widely used 
“square scheme” (Fig. 1). The diagonal of the scheme is 
the solution bond dissociation free energy (BDFE), 
which can be determined from £'°(HX'^^°) and 
pA:a(HX'"), or from E°{X°'~) and pA:a(HX'") (or from 
a \e~l\¥E aqueous redox potential) (Warren et al. 
2010). HAT reactions have been traditionally understood 
using homolytic bond dissociation enthalpies (BDEs) 
(Warren et al. 2010; Mayer 2011). However, it has 
recently been found that entropic contributions can be 
significant in some PCET reactions, especially those 
involving 3d transition metal ions. Therefore, we encour¬ 
age use of BDFEs as the best single measure of the 
thermochemistry of a le^/lH"^ PCET couple (Warren 
et al. 2010). 

Theoretical Treatments of CPET Reactions: Proton 
Tunneling 

Understanding the concurrent transfer of e~ and H"^ has 
required the development of new theoretical 
treatments (Hynes et al. 2007). The most widely 
employed framework is Hammes-Schiffer’s multistate 
continuum theory, which conceptually builds on 
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nonadiabatic Marcus theory (Hammes-Schiffer and 
Soudackov 2008). The proton is treated quantum 
mechanically, with a Born-Oppenheimer-type separa¬ 
tion between the motions of the transferring proton 
versus the other nuclei. CPET reactions are described 
as nonadiabatic transitions between electron-proton 
vibronic states. The rate constant (k) is related to the 
driving force (AG°), the reorganization energy (1), the 
electronic coupling between the reactant and product 
states, the vibrational overlaps between the initial and 
final proton wave functions, and other factors. Multi¬ 
state continuum theory - and most theoretical 
treatments of CPET - explicitly includes quantum 
mechanical tunneling of the proton. Proton tunneling 
has recently been implicated in a number of enzymatic 
reactions, including PCET reactions, and is part of 
a reexamination of the nature of enzymatic catalysis 
(Hynes et al. 2007; Allemann and Scrutton 2009). 

A simple version of the Marcus approach, the Mar¬ 
cus cross relation, has successfully been applied to 
small-molecule HAT reactions (Warren and Mayer 
2010; Mayer 2011). Rate constants are predicted usu¬ 
ally to within an order of magnitude for a variety of 
reactions of both organic compounds and transition 
metal complexes. While this treatment ignores the 
subtleties of PCET, its rough success indicates that 
typically the driving force (AG°) and the intrinsic 
barrier (/,) are the primary determinants of CPET rate 
constants. 

Conclusion 

Proton-coupled electron transfer (PCET) reactions are 
ubiquitous in biological and chemical transformations. 
Great strides have been made in the past 25 years in 
uncovering the diversity and complexity PCET reac¬ 
tions. Still, much is yet to be done to define the scope 
and intimate details of the different classes of PCET. 
This is a rapidly evolving field, and new discoveries 
will further reveal the importance and value of PCET 
reactivity, from energy transduction to free radical 
biochemistry. 

Cross-References 

► Coupled Electron Transfer 

► Cytochrome P450 - Computational Studies 

► Cytochrome c Oxidase (Complex IV) 


► Electron Hopping in Biomolecules 

► Electron Transfer Theory 

► Flavins 

► Ubiquinol-Cytochrome c Oxidoreductase (Complex 
III) 

► Mitochondrial Electron Transport 

► Peroxidases 

► Photosynthetic Electron Transport 

► Photosystem I 

► Photosystem II 

► Quinone Cofactors 

► Reactive Oxygen Species 

► Redox Potential 

► Ribonucleotide Reductase 

► Superoxide Dismutases 

References 

Allemann RK, Scrutton NS, editors. Quantum tunnelling in 
enzyme-catalysed reactions. Cambridge: Royal Society of 
Chemistry; 2009. 

Costentin C. Electrochemical approach to the mechanistic study 
of proton-coupled electron transfer. Chem Rev. 2008; 108: 
2145-79. 

Hammes-Schiffer S, Soudackov AV. Proton-coupled electron 
transfer in solution, proteins, and electrochemistry. J Phys 
Chem B. 2008;112:14108-23. 

Huynh MHV, Meyer TJ. Proton-coupled electron transfer. Chem 
Rev. 2007;107:5004-64. 

Hynes JT, Klinman JP, Limback H-H, Schowen RL, editors. 
Hydrogen-transfer reactions, vol. 1^. Weinheim: Wiley- 
VCH; 2007. 

Mayer JM. Proton-coupled electron transfer: a reaction chem¬ 
ist’s view. Annu Rev Phys Chem. 2004;55:363-90. 

Mayer JM. Understanding Hydrogen Atom Transfer: from 
Bond Strengths to Marcus Theory Acc Chem Res 2011;44: 
36^6. 

Reece SY, Nocera DG. Proton-coupled electron transfer 
in biology: results from synergistic studies in 
natural and model systems. Annu Rev Biochem. 2009;78: 
673-99. 

Warren JJ, Mayer JM. Predicting organic hydrogen atom transfer 
rate constants using the Marcus cross relation. Proc Natl 
Acad Sci USA. 2010;107:5282-7. 

Warren JJ, Tronic TA, Mayer JM. Thermochemistry of proton- 
coupled electron transfer reagents and its implications. Chem 
Rev. 2010;110:6961-7001. 


Protoporphyrin IX 

► Hemes 




Pulsed EPR: Principles and Examples of Applications to Hemeproteins 


2115 


P 


PS 2 

► Photosystem II 

PS I 

► Photosystem I 

PS II 

► Photosystem II 

PS1 

► Photosystem I 

Pseudocontact and Contact Shifts 

► Hyperfine Shifts 

PTR2 

► Membrane Transport Proteins: The Proton-Depen¬ 
dent Oligopeptide Transporter Family 

Pulsed Electron Double Resonance 
(PELDOR) 

► DEER of Metalloproteins 


Pulsed Electron Spin Resonance 
(Pulse ESR) 

► Pulsed EPR: Principles and Examples of Applica¬ 
tions to Hemeproteins 


Pulsed EPR: Principles and Examples of 
Applications to Hemeproteins 

Ines Garcia-Rubio 

Laboratory of Physical Chemistry, ETH-Zurich, 
Ziirich, Switzerland 

Synonyms 

Pulsed electron spin resonance (Pulse ESR); Time- 
domain EPR 


Definition 

Pulse electron paramagnetic resonance (pulse EPR) 
is a spectroscopic technique that entails observing 
the time evolution of the magnetization of a paramag¬ 
netic species in the presence of a static magnetic field 
when it is excited by one or several resonant micro- 
wave pulses. This kind of excitation involves applying 
intense electromagnetic radiation (normally in the 
order of 1 kW) in the range of the microwaves for 
a very short period of time (typically a few tens of 
nanoseconds). As for continuous-wave ► EPR, the 
frequency of the excitation must satisfy the resonance 
condition, that is, it must match the energy gap 
between the different electron-spin states. 

Introduction 

In ► nuclear magnetic resonance, pulse techniques 
(ET-NMR) have practically replaced continuous- 
wave methods because of their substantial gain in 
sensitivity. However, ET-EPR does not enjoy 
the same sensitivity improvement because EPR spectra 
are normally very broad and, especially in the case of 
transition metals, it is not possible to excite the 
entire spectrum with one microwave frequency. 
On the other hand, pulse EPR opens the door to manip¬ 
ulate magnetization almost at will and to use 
multidimensional techniques, which results in addi¬ 
tional information about the structure and dynamics 
of the paramagnetic system complementing the CW 
spectrum. 
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Practically all of the EPR techniques have nowa¬ 
days pulse variants (Schweiger and Jeschke 2001): free 
induction decay or echo-induced EPR (pulse field- 
sweep EPR), electron-nucleus double resonance 
(► Pulsed ENDOR and ESEEM: Principles and Exam¬ 
ples of Applications to Eleme Proteins), double elec¬ 
tron-electron resonance (► Pulsed ENDOR and 
ESEEM: Principles and Examples of Applications to 
Eleme Proteins, also known as DEER), 
etc. Additionally, a genuine pulse method, 
► ESEEM, has become customary to study weak 
hyperfine interactions. Pulse techniques can be used 
to regain the information about electron-nucleus and 
electron-electron interactions buried in inhomoge¬ 
neous broadenings and measure relaxation properties 
of the spin system in a direct way. Both of these 
capabilities can yield interesting structural and 
dynamic information. Pulsed EPR has been applied 
with great success in the field of biophysics, 
metalloprotein science, and specifically to the study 
of heme proteins and heme model systems (Prisner 
et al. 2001; Calle et al. 2006; Harmer et al. 2009; van 
Doorslaer 2009). Examples of these are given below to 
illustrate the use of pulse techniques. 

Although pulse EPR has specific characteristics and 
technical requirements, its principles and theoretical 
foundations are very similar to the ones of ► NMR. 
The rigorous quantum mechanical description of the 
experiments requires the density matrix formalism 
(Ernst et al. 1986; Schweiger and Jeschke 2001), how¬ 
ever, the following simple vector picture can provide 
a more intuitive understanding of the inner functioning 
of some of the pulse EPR experiments. 

Effect of a Microwave Pulse on 
a Paramagnetic System 

Inhomogeneous Broadening 

Normally, the linewidths in the EPR spectra of heme 
proteins and, in general, the spectrum of most para¬ 
magnetic systems, is due to the so-called inhomoge¬ 
neous broadening. This concept refers to the fact that 
the spectral line is made up from several overlapping 
narrower lines resonating at slightly different field (see 
Pig. 2). This overlap may appear due to g-strain (Hagen 
1989), very important in the case of heme proteins, 
unresolved hyperfine interactions with the large number 
of magnetic nuclei normally surrounding the 


paramagnetic species, or other broadening mechanisms. 
Each of these narrower lines is generated by a sub¬ 
ensemble of spins (spin packet) that resonate exactly at 
the same magnetic field or, for a fixed value of the 
magnetic field, that resonate at the same microwave 
frequency and whose linewidth (homogeneous 
linewidth) is determined by the spin-spin relaxation 
time, T 2 . 

Excitation Through Resonant Microwave Pulses. 
Selective and Nonselective Pulses 

The intrinsic angular momentum of an electron (spin. 
Til) has associated an also intrinsic magnetic moment 
related to the former as m = —g^figs, where fig is the 
Bohr magneton and g^, is the giromagnetic ratio of the 
electron. In the presence of a static magnetic field Bg, 
the magnetic moment experiences a torque that tends 
to align it with the magnetic field 

f = m X 5o, (1) 

at 

which makes the magnetic moment precess about the 
magnetic field like a spinning top on the ground. This 
precession would continue indefinitely if there were 
not relaxation mechanisms, analogous to the friction in 
the spinning top, that put the magnetization back 
to its equilibrium position along Bg, whose direction 
is chosen to be z. The frequency of rotation is 
and is called the Larmor frequency of the 
electron, which is the EPR resonance frequency of the 
free electron. In a macroscopic sample, the magneti¬ 
zation is the sum of the magnetic moments of all 
paramagnetic entities in the sample, these are not free 
electrons, their g-values differ from the one of the free 
electron and so varies their precession frequency in 
a magnetic field. 

In order to study the evolution of the magnetization 
of a spin packet in a pulse EPR experiment, it is usual 
to use a coordinate system that rotates around z with 
the frequency of the microwave radiation fflmw If the 
spin packet is in resonance, the frequency of the rotat¬ 
ing frame is exactly the Larmor frequency of that spin 
packet. Therefore, in this rotating frame, no precession 
is observed, the effect of the static magnetic field Bg is 
cancelled, which means that working in the rotating 
frame is equivalent to having an additional magnetic 
field B = opposite to Bg. Hence, not considering 
the effects of relaxation, the magnetization in the 
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pulse 

Pulsed EPR: Principles and Examples of Applications to 
Hemeproteins, Fig. 1 (a) The microwave radiation in the 
resonators used for EPR is linearly polarized. A linearly polar¬ 
ized wave (magnetic field shown in blue) can be decomposed as 
the sum of a left- (shown in green) and right- (shown in red) 
circularly polarized waves. In the rotating frame one of them will 

rotating frame is static. Similarly, the result of resonant 
microwave radiation circularly polarized in the xy 
plane is, in the rotating frame, the application of 
a static magnetic field B i in the xy plane with magni¬ 
tude equal to the amplitude of the microwave magnetic 
field. Then, the result of resonant microwave radiation 
is to rotate the magnetization around Bi ( 1 ) with 
a frequency coi = Pulse EPR involves manipu¬ 
lating the magnetization in this manner with micro- 
wave pulses. For example, if the initial state is the 
thermodynamic equilibrium, the magnetization is par¬ 
allel to the magnetic held with magnitude according to 
the Boltzmann population of the spin states and the 
application of resonant microwaves with the micro- 
wave magnetic held in the x axis will rotate the mag¬ 
netization in the yz plane (Fig. lb). 

The fact that the periods of microwave irradiation 
(pulses) are hnite in time makes their excitation prohle 
broader than a single frequency. As it can be seen in 


n 

pulse 

be static and will produce resonance and the other one will be 
seen as rotating at twice the Larmor frequency, not in resonance 
and therefore with no effect on the magnetization, (b) Effect of 
resonant microwave radiation on a spin packet initially in ther¬ 
mal equilibrium 

Fig. 2, the length of the pulses is related to their 
excitation prohle in the frequency domain by 
a Fourier Transform. This gives rise to the possibility 
of using sharp and short pulses with a broader excita¬ 
tion prohle (nonselective) or softer and longer pulses 
with narrower excitation prohle (selective pulses). The 
nonselective/selective designation refers to the ability 
of the pulse to excite more, or fewer, spin packets of an 
inhomogeneously broadened line. 

Response of the Magnetization to a | Pulse: FID 

The signal detected in pulse EPR is the magnitude of 
the transverse magnetization in the rotating frame. 
When a homogeneously broadened EPR line is excited 
by a I pulse (pulse length tp = 5 ^), the magnetization 
of the spin packet, initially in the direction z, is hipped 
to the xy-plane (transverse plane) and produces 
a signal in pulse EPR. If there were no relaxation, 
the transverse magnetization would be constant and 
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Pulsed EPR: Principles and 
Examples of Applications 
to Hemeproteins, Fig. 2 (a) 

Inhomogeneous broadening, 
(b) Continuous and pulsed 
excitations and their excitation 
profiles. Note that the 
frequency excitation 
bandwidth of the pulse is 
inversely proportional to its 
length; continuous irradiation 
is the limit of a very long 
(selective) pulse whose 
excitation profile is just one 
frequency 
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continuous 
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pulsed 

irradiation 
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the signal would be a continuous offset but, due to the 
interactions of the electron spins with the environment, 

(1) the magnetization in the xy plane decays and 

(2) returns to its equilibrium position along z. These 
two macroscopic processes are characterized by two 
relaxation times, T 2 (transverse relaxation time), which 
is the characteristic time at which the transverse mag¬ 
netization vanishes, and Ti (longitudinal relaxation 
time) inversely proportional to the rate at which the 
magnetization returns to z. As a result of relaxation, the 
magnitude of the transverse magnetization after 
a I pulse decays with a rate proportional to the 
linewidth of the CW-EPR signal in the frequency 
domain (T 2 ), this signal is known as free induction 
decay (FID), see Fig. 3. 

When the EPR line is inhomogeneously broadened, 
as in most cases, every spin packet resonates at a 
different frequency, so after a selective | pulse they 
do not all process with the same frequency o\. As a 
result, they will rotate with angular frequency 
Am = (Os — fflmw in the rotating frame. If the sum of 
all individual spin packets is considered after some 
time the dephasing is such that the magnetization of 
the different spin packets cancels each other out mak¬ 
ing the decay of the total transverse magnetization 
much faster (see Fig. 3). The decay of the FID does 


not reflect then the relaxation time but the shape of the 
EPR line and the excitation profile of the pulse. Nor¬ 
mally, the decay is so fast that it is not possible to detect 
it, as the spectrometer has a certain “dead time” after 
the microwave pulses where no detection is possible. 
Detection of the FID can be made possible by reducing 
the range of Am using selective pulses, see Fig. 3. 

Response of the Magnetization to a 11 Pulse: 

Hole Burning 

When a homogeneously broadened system is excited 
with a microwave n pulse {tp = ^), the magnetization 
is inverted, rotated 180°, so it is still in the direction of 
the magnetic field but its sense is reversed with respect 
to the thermal equilibrium situation. The spin states with 
the magnetic moment pointing against the magnetic 
field are more populated than the ones pointing in the 
same direction. The situation is reversed with respect to 
the continuous-wave EPR experiment, so if the spec- 
tram were measured a negative line would be obtained. 

If the line presents inhomogeneous broadening, 
a selective n pulse inverts the magnetization of only 
a small number of spin packets around camw according 
to its excitation bandwidth, it “burns a hole” in the 
polarization pattern (see Fig. 3). This longitudinal 
magnetization relaxes to its equilibrium situation 
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Pulsed EPR: Principles and 
Examples of Applications 
to Hemeproteins, Fig. 3 (a) 

On the left, CW-EPR lines 
with different sorts of 
broadening are shown in the 
frequency domain. On the 
right, the response of the 
corresponding system to 
a microwave pulse excitation 
(excitation prohle shown in 
blue). The grey squares 
represent the defense pulse of 
the spectrometer which covers 
and protects the detector after 
the microwave pulses until its 
energy has been dissipated. 
The measurement can only 
start after this ring-down time 
causing the so-called dead 
time of the spectrometer 
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with the time constant T^. Ideally, the n pulse does not 
result in transverse magnetization but prepares the spin 
system in a nonequilibrium population of states that 
can be subsequently manipulated and brought to detec¬ 
tion by other pulses. 

This is the case of the experiment ELDOR detected 
NMR (Schweiger and Jeschke 2001) presented in 
Fig. 4 where a nonequilibrium population of the spin 
states is prepared by a selective n pulse resonant on 
a partially forbidden EPR transition of a paramagnetic 
center with hyperfine interaction. The spectral holes 
burnt by the hrst pulse contain the information about 
the hyperfine coupling and can be read out with 
another selective n pulse of variable frequency. In 
this manner, pulse EPR can be used to obtain the 
EPR spectrum of one spin packet that was buried inside 
the total EPR linewidth. 

Electron Spin Echoes 

The dephasing of the transverse magnetization occur¬ 
ring after a | pulse can be inverted by a ti pulse. By 


means of a ti pulse, the transverse magnetization of the 
different spin packets is rotated 180° and the evolution 
pattern results in the mirror image of the pattern before 
the 71 pulse (see Fig. 5). The evolution of the spin 
packets will result in the transient refocusing of the 
magnetization when the evolution periods before and 
after the n pulse are the same (t in the figure). This 
phenomenon is called electron spin echo (ESE), whose 
intensity is proportional to the longitudinal magnetiza¬ 
tion before applying the hrst pulse. 

The sequence | — t — ti is perhaps the simplest giv¬ 
ing rise to an echo (primary echo, also called 
Hahn echo) but many of the experiments in pulse EPR 
are based in the detection of electron spin echoes (pri¬ 
mary-, stimulated-, refocused- echo...) (Schweiger and 
Jeschke 2001). One of the most simple ESE experiments 
is the echo-detected EPR, which consists in the detection 
of the intensity of any spin echo (in Fig. 6, the Hahn- 
echo) as a function of the magnetic held, which is swept 
as in CW-EPR. The display of echo intensity versus 
magnetic held gives a spectmm very similar to an absorp¬ 
tion CW spectmm and is more sensitive to broad and 
extended features than the standard CW-EPR (Fig. 6). 
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Pulsed EPR: Principles and Examples of Applications to 
Hemeproteins, Fig. 4 ELDOR detected NMR. (a) Micro- 
wave pulse sequence of the experiment and population dia¬ 
grams for a particular spin packet of the ensemble with an 
electron spin S = V 2 interacting with a nuclear spin I = V 2 . 
The integrated intensity of the FID (blue) is measured as 
a function of the microwave frequency of the read-out pulse 
(red), (b) The inhomogeneous EPR line (blue) is shown 
together with the contribution of the particular spin packet 

Direct Measurement of Relaxation Times 

The effect of a 71 pulse on a paramagnetic system in 
thermal equilibrium is to flip the magnetization which 
will relax to the original position with a time constant 
Ti- The time constant can be directly obtained if the 
longitudinal magnetization is probed during this relax¬ 
ation process, via a Hahn echo, for example. This 


which is excited on a semi-forbidden transition by the first 
microwave pulse (red), (c) ELDOR detected NMR spectrum 
of metmyoglobin. The nuclear spin transition frequencies can 
be extracted directly from the spectrum as in an ► ENDOR 
spectrum. The hypertine interaction of the heme iron with the 
protons and '"'N nuclei of the active site of myoglobin was 
obtained and interpreted in terms of the electron spin density 
distribution in the heme (Taken from Fittipaldi et al. 2008), 
V represents frequency in FIz units, that is, v = 2n(o 

experiment to measure the spin-lattice relaxation is 
called inversion recovery (see Fig. 7a). Another way 
to measure (longer) Ty relaxation times is repeating 
the Hahn echo sequence at different speeds, that is, 
measure the echo intensity versus repetition rate. After 
the echo formation, the transverse magnetization 
defocuses again and returns to the equilibrium along 
the z axis with a time constant T 1 (see Fig. 7b). If the 
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Pulsed EPR: Principles and Examples of Applications to Hemeproteins, Fig. 5 Sequence of microwave pulses and formation 
of a primary spin echo 


Pulsed EPR: Principles and 
Examples of Applications 
to Hemeproteins, 

Fig. 6 Echo detected EPR in 
cyt Cg of cyanobacteria. 

(a) Sequence of microwave 
pulses for the experiment. 

(b) EPR of the photosynthetic 
cytochrome Cg from Anahaena 
PCCl 19 measured using the 
standard CW-EPR experiment 
(lower trace) and the Hahn- 
echo sequence (upper traces). 
The echo-detected EPR helped 
in quantifying and 
characterizing a second highly 
anisotropic low-spin species 
only barely noticeable in the 
CW spectrum. The features 
corresponding to the principal 
axes of the g-tensor of the 
second species are labeled 
with an arrow. The additional, 
broad hump in the spectrum is 
caused by the effect of the 
interaction between electron 
and nuclear spins (► Pulsed 
ENDOR and ESEEM: 
Principles and Examples of 
Applications to Heme 
Proteins). As a result of the 
analysis of their g-values, the 
two species were assigned to 
two different conformations of 
the heme axial ligands 
methionine and histidine 
(Taken from Garcia-Rubio 

et al. 2003) 
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a 

inversion pulse detection sequence 




Pulsed EPR: Principles and Examples of Applications to Hemeproteins, Fig. 7 Pulse sequences to measure longitudinal and 
transverse relaxation times, (a) Inversion recovery, (b) Variation of shot repetition time, (c) Primary echo decay 


sequence is repeated too fast the magnetization does 
not have time to return to equilibrium and the echo 
intensity gets smaller. The time constants obtained 
from these experiments have to be interpreted care¬ 
fully, as the decay traces of the experiments are con¬ 
tributed to by a number of processes that take the spins 
off-resonance without changing the longitudinal mag¬ 
netization (Eaton and Eaton 2000a). Processes like 
motion of an anisotropic paramagnetic center, nuclear 
spin flip-flops, electron-nuclear cross relaxation, etc., 
generally named ► spin diffusion contribute to the 
spectra as an apparent relaxation. The experiment 
called saturation recovery is designed to minimize 


the contribution of spin diffusion by applying 
a strong and long pulse that saturates the spin system 
before detecting its relaxation. 

The decay of the transverse magnetization is char¬ 
acterized by the time constant 7’2- In order to measure 
this decay, transverse magnetization is formed by a | 
pulse, then allowed to evolve for a certain time, and 
then refocused by a ti pulse. The spin packets that have 
undergone a spin-spin flip-flop process during the evo¬ 
lution time have lost their phase and will not be 
refocused by the second pulse decreasing the intensity 
of the echo. By measuring the intensity of the Hahn 
echo as a function of the interpulse delay (Fig. 7c) one 
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Pulsed EPR: Principles and Examples of Applications to 
Hemeproteins, Fig. 8 Separation of paramagnetic species by 
way of the relaxation time, (a) Pulse sequence for the rj-flltered 
field-sweep experiment, (b) Spectrum of the freeze-quench reac¬ 
tion of cyt P450eam with peracid. The T,-filter is used to separate 
the signal of P450 (ferric heme iron plus a tyrosine radical) from 
a background signal from the resonator based on the different 


relaxation times of the different paramagnetic species. The upper 
spectrum is the echo-induced EPR, the lower spectrum is the 
spectrum measured with the sequence shown in A choosing the 
time delay f to eliminate the background signal, (c) Echo- 
induced spectrum of the empty resonator (upper spectrum), the 
lower spectrum shows the elimination of the background signal 
by subtraction of the empty resonator signal 


Pulsed EPR: Principles and 
Examples of Applications 
to Hemeproteins, Fig. 9 (a) 

Sequence used to measure 
of the nitroxyl label attached 
to methemoglobin. 

(b) Temperature dependence 
of l/Tn, of the nitroxyl signal 
in oxyhemoglobin (□), 
cyanomethemoglobin 
(+ and A), and imidazole- 
methemoglobin (•) (Taken 
from Budker et al. 1995) 
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can obtain an estimate of 72- Here, spin diffusion pro¬ 
cesses, especially nuclear spin diffusion, contribute to 
the apparent relaxation as well (characterized by a time 
constant T^). Note that in order to be able to detect any 
echo, the time should be long enough to at least be 
able to overcome the dead time of the spectrometer. At 
high temperatures, transition metal ions have a very 
short T 2 , therefore pulse EPR experiments in transition 
metal ions are performed at very low temperatures (for 
heme proteins 6 K and below). Deuteration of the 
solvents reduces the nuclear spin diffusion and 
increases T^, which can result in increased perfor¬ 
mance of certain pulse EPR experiments. 


The difference in relaxation times of overlapping 
paramagnetic species can be used to separate their 
spectra. Eor example, if in a Hahn-echo field-sweep 
experiment x ^ for one of the species, one 
can eliminate its contribution to the spectrum. The 
same is achieved when an inversion pulse is inserted 
at the beginning of an echo-detected field-sweep 
experiment if the delay between the first and second 
pulses is carefully selected to the time point where the 
longitudinal magnetization of one of the overlapping 
centers is zero (Maly and Prisner 2004), see Pig. 8. 
This Ti- filter can also be applied to other pulse EPR 
experiments such as ► ENDOR and ESEEM. 
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The relaxation times of heme centers are deter¬ 
mined by the combination of different processes, one 
of them being the interaction of the ferric iron with 
another paramagnetic center. In the study shown in 
Fig. 9, the relaxation time Tm of a nitroxyl label attached 
to cyano-methemoglobin was studied as a function of the 
temperature. At low and high temperatures, the relaxation 
is driven by processes not related to the interaction 
between the paramagnetic centers but in the region 
around 25 K, when the relaxation time of the iron is 
comparable to the iron-nitroxyl spin-spin coupling, the 
iron provides a very efficient relaxation mechanism for 
the nitroxide. By measuring also the temperature- 
dependent relaxation times of the iron, a distance of 
15 A between the two electron spins was determined 
(Budker et al. 1995; Eaton and Eaton 2000b). 

Summary 

Pulse EPR records the evolution of magnetization of 
a paramagnetic sample excited using microwave 
pulses. It can be used to resolve electron-nucleus or 
electron-electron interactions that are buried in 
inhomogenously broadened CW-EPR lines, to sepa¬ 
rate or simplify EPR spectra and to measure spin 
relaxation times. In this entry, the basic principles of 
pulse EPR are discussed using a simple magnetization 
vector picture. Examples of some of the most common 
experiments (excluding ENDOR and ESEEM) are 
illustrated with examples of heme proteins. 

Cross-References 

► Continuous Elow 

► Cytochrome P450 - Computational Studies 

► Cytochromes 

► DEER of Metalloproteins 

► Electron Transfer Cofactors 

► Electron Transport Chains in Bacteria 

► EPR and ENDOR of radicals and Metalloproteins in 
Biological Systems 

► EPR Spectroscopy: General Principles 

► NMR 

► NMR Methods for Kinetic Analysis 

► Paramagnetic Metalloproteins 

► Photosynthetic Electron Transport 

► Proton-Coupled Electron Transfer 


► Pulsed ENDOR and ESEEM: Principles and 
Examples of Applications to Heme Proteins 

► Spin Diffusion 
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Synonyms 

Electron paramagnetic resonance spectroscopy; Elec¬ 
tron spin echo envelope modulation; Pulsed nuclear 
electron nuclear double resonance electron spin echo 
ENDOR 


Definition 

Pulse ENDOR is the pulse variant of the technique 
electron nuclear double resonance spectroscopy, 
which aims to obtain nuclear spin transition frequen¬ 
cies of magnetic nuclei coupled with an electron spin. 
The method is based on the fact that the intensity of an 
electron spin echo (► Pulsed EPR: Principles and 
Examples of Applications to Hemeproteins) formed 
after the excitation of a paramagnetic system with 
resonant microwave pulses can be altered by 
a radiofrequency pulse resonant on a nuclear spin 
transition. The ENDOR spectrum displays differences 
in the echo intensity as a function of the 
radiofrequency pulse frequency. 

The term ESEEM spectroscopy includes a group of 
techniques which, as well, are used to obtain nuclear 
spin transition frequencies. Their basic principle relies 
on the fact that the intensity of an electron spin echo is 
modulated by the transition frequencies of interacting 
nuclear spins. The measurement of the echo intensity 
as a function of the proper interpulse delay (or delays 
in the case of two-dimensional experiments) in a 


sequence of resonant microwave pulses results in 
a decaying oscillatory signal that contains 
nuclear spin transition frequencies. The ESEEM spec¬ 
trum is obtained through Fourier transform of the time 
trace. 

Introduction 

Nuclear spin transition frequencies contain information 
about hyperfine and nuclear quadrupole interactions. 
Quadrupole splittings provide knowledge about local 
charge densities; in turn, the hyperfine interaction 
between electron and nuclear spins in the environment 
can be used to study the electron spin distribution, as 
well as the structure of the paramagnetic site as it 
depends on the nature, distance, and relative orientation 
between both spins. Normally, the NMR and NQR 
spectra of nuclei strongly interacting with paramagnetic 
species are broadened beyond detection (short relaxa¬ 
tion times [► Electron Paramagnetic Resonance Spec¬ 
troscopy]) and have poorer sensitivity than EPR; 
especially in those cases, EPR is the technique of 
choice. However, in the continuous wave EPR (cw- 
EPR [► EPR]) spectrum, sometimes the hyperfine split¬ 
tings are buried in the inhomogeneous linewidth 
(► Pulsed EPR: Principles and Examples of Applica¬ 
tions to Hemeproteins; ► EPR and ENDOR of Radicals 
and Metalloproteins in Biological Systems) of the EPR 
line. This is the case, for example, of ferric heme pro¬ 
teins where the interaction between the iron spin and 
nitrogen and hydrogen nuclei in its environment is not 
resolved in the EPR spectram. Pulse ENDOR and 
ESEEM can overcome inhomogeneous broadening 
and are used to study NMR transitions with EPR 
sensitivity. 

The cw-ENDOR (► EPR; ► EPR and ENDOR of 
Radicals and Metalloproteins in Biological Systems) 
experiment in which the spin system is continuously 
and simultaneously irradiated by microwave (MW) and 
radiofrequency (rf) fields is also able to provide 
such resolution enhancement; however, pulse ENDOR 
is normally exploitable in a much broader temperature 
range and allows the manipulation of the spin systems to 
simplify complicated spectra. On the other hand, as 
cw-EPR compared to pulse EPR, cw-ENDOR can 
offer better sensitivity in some cases. 

ESEEM does not have an equivalent cw experi¬ 
ment and is a genuine pulsed EPR method using 
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Pulsed ENDOR and ESEEM: 
Principles and Examples of 
Applications to 
Heme Proteins, 

Fig. 1 (a) Davies and Mims 
pulse ENDOR sequences - 
both experiments combine 
excitation with MW and rf 
pulses, (b) Diagram of the 
population of states in the 
Davies ENDOR experiment 
for an S = 1/2 ,1 = Vz system. 
The numbers in circles label 
the time point in the pulse 
sequence 
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a single MW frequency which makes it technically 
less demanding than pulse ENDOR. Although both 
techniques are used to obtain the same information, 
that is, nuclear spin transition frequencies through 
detection of changes in the intensity of an electron 
spin echo, the physical processes involved in the two 
experiments are different and so are the intensities 
(the magnitude of the echo change) in the two 
kinds of experiments. Actually, ESEEM and 
ENDOR often complement each other; ESEEM is 
very sensitive to low nuclear spin transition frequen¬ 
cies, while ENDOR sensitivity increases with 
increasing frequencies. ESEEM needs anisotropic 
interactions to provide broadband excitation of 
nuclear transitions, while ENDOR provides good per¬ 
formance when the system presents isotropic hyper- 
fine interactions by inducing selective nuclear spin 
transitions. 


The interpretation of the nuclear spin transition 
frequencies is made in the frame of the spin Hamilto¬ 
nian constructed only with spin operators (► EPR; 
► EPR and ENDOR of Radicals and Metalloproteins 
in Biological Systems). The details of the interactions 
are considered parameters (g-, A-, or Q-matrices) of 
this effective Hamiltonian. The experimental nuclear 
spin transition frequencies allow the determination of 
the interaction matrices of the spin Hamiltonian, which 
then have to be interpreted in terms of geometry, 
distances, or spin distribution using a microscopic 
model. 

The more common pulse ENDOR and ESEEM 
experiments and examples of its application to hemo- 
proteins are described below; the reader interested in 
more experiments is referred to the literature at the end 
of the chapter (Schweiger 1982, 1991; Dikanov and 
Tsvetkov 1992; Schweiger and Jeschke 2001; Gromov 
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Pulsed ENDOR and ESEEM: Principles and Examples of 
Applications to Heme Proteins, Fig. 2 (a) Microwave 
pulse sequences for two-pulse ESEEM, three-pulse ESEEM, 
and HYSCORE. The echo intensity is measured as a function 
of the time delays colored in red which are varied during the 
experiments, (b) Schematic diagram showing the cross- 

and Harmer 2007; Harmer et al. 2009; Van Doorslaer 
2008, 2009). 


Principles of Pulse ENDOR 

The two most commonly used pulse ENDOR experi¬ 
ments are Mims and Davies ENDOR. Their pulse 
sequences are shown in Eig. 1. To gain some insight 
into the inner functioning of the experiments, we will 
describe the effect of the different pulses in a simple 
system of an electron spin S = 1/2 coupled to a nuclear 
spin I = 1/2, which could be, for example, a low-spin 
ferric heme iron interacting with a proton. In the pres¬ 
ence of a magnetic field, the spin system has four levels 
which are labeled in Fig. lb according to the third 
component of the electron and nuclear spins. 
The states with different ms are much more separated 
in energy (about 10,000 times) than the states which 
have the same ms and different mj due to the 
stronger magnetic moment of the electron compared 
to the one of the nucleus. According to the Boltzman 
law, in a situation of thermal equilibrium, the states 
with ms = —1/2 are more populated than the ones with 
ms = -1-1/2. This population difference produces 


correlation peaks in HYSCORE experiments displaying signals 
from an I = Vi nucleus. If the hyperfine coupling is smaller than 
twice the nuclear Larmor frequency, the cross-peaks appear in 
the right quadrant, if the hyperfine coupling is larger than twice 
the Larmor frequency of the nucleus, they appear in the left 
quadrant 

a macroscopic magnetization in the direction of the 
magnetic field and is represented in the figure as black 
and white squares for the more and less populated 
states, respectively. In the Davies ENDOR experiment, 
the system is excited with a selective MW 7i-pulse 
(► Pulsed EPR: Principles and Examples of Applica¬ 
tions to Hemeproteins) resonant with one of the elec¬ 
tron spin transitions which has the effect of inverting 
the magnetization (and the population difference of the 
two states). The subsequent 7r/2 and % pulses generate 
an electron spin echo with intensity proportional to the 
population difference (read-out sequence). If 
a selective rf ti- pulse resonant with one of the nuclear 
spin transitions is inserted between the first and second 
MW pulses, it inverts the population difference 
between the two nuclear states. This equalizes the 
populations of the states of the excited electron spin 
transition and, consequently, the echo disappears. If 
the rf pulse is not resonant with any nuclear spin 
transition, it does not affect the spin system and an 
inverted echo is detected. The spectrum displays dif¬ 
ferences in the echo intensity as a function of the rf 
pulse frequency, and lines in the ENDOR spectrum 
correspond to nuclear spin transition frequencies in 
any of the electron spin manifolds. Mims ENDOR 
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Pulsed ENDOR and ESEEM: 
Principles and Examples of 
Applications to 
Heme Proteins, 

Fig. 3 Three-pulse (b) and 
two-pulse (a) ESEEM spectra 
of P450cam (H2*’0) and 
P450cam (H 2 O) and their 
ratio. MW frequency was 
10.68 GHz and g = 2.42, 

T = 210 ns in the three- 
pulse ESEEM (From 
Thomann et al. 1995) 


a 3P 9 = 2.42(10.68 GHz) 



Frequency (MHz) 


b 2P 9 = 2.42(10.68 GHz) 



experiment is based on the detection of the stimulated 
echo (► Pulsed EPR: Principles and Examples of 
Applications to Hemeproteins) as a function of the 
frequency of a selective rf 7t-pulse applied between 
the second and third MW pulses. The ENDOR signal 
intensities in this experiment are modulated by the 
factor cos(At) which means that there are “blind 
spots” in the spectrum when i = 27tn/A, where A is 
the hyperfine coupling parameter. This blind spot 
behavior makes it difficult to interpret spectra with 
large hyperfine couplings. On the other hand, Mims 
ENDOR uses nonselective MW pulses, so more elec¬ 
tron spins are involved in the experiment, and the echo 
amplitude is larger. These arguments together with the 
fact that Davies ENDOR is not very sensitive to weak 
couplings bring in the following rule of the thumb; 
Davies ENDOR is better suited for large coupling 
and Mims ENDOR for smaller hyperfine couplings. 

ESEEM Principles and Most Commonly Used 
Pulse Sequences 

This technique is based on the so-called nuclear modu¬ 
lation effect. When the intensity of the electron spin 
echoes is recorded as a function of a certain time interval 


in the MW pulse sequence, some oscillations can be 
observed in the echo intensity trace. Their frequencies 
are related to the nuclear spin transition frequencies 
because this nuclear modulation effect is due to the 
coupling between the electron spin and the nuclear 
spins, but it only occurs when anisotropic interactions 
are present. Therefore, ESEEM spectroscopy is not 
applied to samples in the liquid phase where the 
couplings are isotropic due to motional averaging. 

There is a large variety of ESEEM experiments 
in one and two dimensions; in Eig. 2, the MW 
pulse sequences of the most common two- and 
three-pulse echo modulation experiments, together 
with the one of the two-dimensional HYSCORE, 
are shown. The two-pulse ESEEM experiment is 
the most simple and consists in observing the decay 
of the primary echo, that is, measuring the intensity of 
the echo generated by the sequence of MW pulses 7 i /2 
-1 - 7 t as a function of the interpulse delay t. Assuming 
an electron spin S = Yi coupled with a nuclear 
spin I = 1/2, the expression of the echo intensity in 
the two-pulse echo experiment, which can be obtained 
using the spin density operator formalism (Schweiger 
and Jeschke 2001), contains the nuclear spin transition 
frequencies of both electron spin manifolds and also 
their sum and difference. The modulation amplitudes 
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[v-v(^H)] MHz 


Pulsed ENDOR and ESEEM: Principles and Examples of 
Applications to Heme Proteins, Fig. 4 Top-. Davies 
ENDOR of protons in compound I of CPO in H 2 O {black line) 
and in D 2 O {red line). The black brace indicates the largest 
exchangeable doublet (A ~ 10 MHz) and the red brace the 
largest nonexchangeable doublet (A ~ 6 MHz). Bottom-. Mims 
ENDOR of compound I of CPO in D 2 O (Taken from Kim et al. 
2006) 

for all four frequencies, given by the so-called 
modulation depth parameter, are zero if the hyperfine 
interaction is isotropic or the magnetic field is oriented 
along a principal axis of the hyperfine tensor and are 
maximal when the angle between the directions of 
nuclear spin quantification in the two different spin 
manifolds is 90°. The decay of the spin echo with the 
phase memory time (fast for transition metal ions) 
imposes an upper limit to frequency resolution in this 
experiment. 

The echo formed after the three-pulse sequence 7t/2 
-I - 7t/2 - T - 71/2 (stimulated echo) also shows 
a nuclear modulation effect. When the echo intensity 
is recorded as a function of the time interval T, only the 
nuclear spin frequencies in the two manifolds are 
observed (no sum or difference), weighted by the mod¬ 
ulation depth parameter and a factor cos((Ba_b't) which 
leads to “blind spots” at certain values of x. Normally, 
three-pulse ESEEM experiments enjoy a much better 


resolution as the echo decays with the longer longitu¬ 
dinal relaxation time T 1 . 

The resolution obtained in a three-pulse ESEEM 
experiment can be increased (and the interpretation 
made easier) by expanding the experiment into 
a second dimension. The most popular variant of the 
two-dimensional ESEEM techniques is the 
ElYSCORE (hyperfine sublevel correlation) experi¬ 
ment (Fig. 2b), which basically is a three-pulse 
ESEEM in which the evolution time T has been 
divided in two by an MW tt pulse. The result is that 
the transient states evolve in two different electron spin 
manifolds. In the HYSCORE experiment, the intensity 
of the echo is recorded as a function of the delay 
intervals tj and t 2 , which are varied independently. 
The modulation depth in this experiment is also mul¬ 
tiplied by an oscillatory factor that produces blind 
spots for certain frequencies depending on the value 
of I. Two-dimensional Fourier transform of the echo 
intensity data produce a two-dimensional frequency 
spectrum where the signals are pairs of symmetric 
cross-peaks (or ridges in the case of powder samples 
or frozen solutions) with respect to the diagonal. Each 
of the two coordinates of the signals in the HYSCORE 
spectrum corresponds to nuclear spin transition fre¬ 
quencies in a different electron spin manifold. There¬ 
fore, using HYSCORE, one can not only separate 
spectra otherwise overlapped due to the unfolding of 
the spectrum into a second dimension but obtain fur¬ 
ther information as to the assignment of the nuclear 
frequencies to one of the spin manifolds as well. 

Examples of Pulse ENDOR and ESEEM 
Applications to Heme Proteins 

All of the ENDOR and ESEEM studies in hemopro- 
teins aim at resolving the hyperfine interactions 
between the electron spin of the paramagnetic iron in 
the heme group and magnetic nuclei in its environ¬ 
ment. Already in the heme group, there are a number of 
those, four directly coordinating ''^N from the pyrrole 
rings and several protons (pyrrole and mesoprotons). 
When the coupled nuclei have nuclear spin I > i^, the 
nuclear quadrupole interaction is also a valuable piece 
of information that can be obtained. The next examples 
illustrate the kind of information that can be extracted 
from nuclear spin transition frequencies using pulse 
ENDOR and ESEEM experiments. 
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Pulsed ENDOR and ESEEM: Principles and Examples of 
Applications to Heme Proteins, Fig. 5 HYSCORE spectra 
showing the contributions of the weakly coupled ’"'N and to 
the ferric iron in metmyoglobin. (a) X-band HYSCORE spec¬ 
trum Bq = 320 mX, T — 128 ns. (b) Q-band HYSCORE spec¬ 
trum Bo — 412 mX (g = g-L°^^), t = 80 ns. (c) Q-band 
HYSCORE spectrum Bq = 562 mX, x = 80 ns. Xhe experimen¬ 
tal spectra are shown alone on the left and together with the 


superimposed frequency calculations on the right. Xhe red cir¬ 
cles correspond to the calculated nuclear frequencies for the Ns 
of the coordinating His, and the cyan crosses correspond to the 
ones calculated for the of the distal His. Xhe pattern of nuclear 
frequencies calculated for '"*C are displayed in green for the 
mesocarbons and brown for P-pyrrole carbons. Xwo simulation 
possibilities for a-pyrrole carbons are displayed in blue and 
magenta (Xaken from Garcia-Rubio et al. 2008) 
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The first example (Thomann et al. 1995) shows how 
two- and three-pulse ESEEM was used to prohe the 
axial coordination of the suhstrate-free resting state of 
cytochrome P450cam (► Cytochrome P450 - Compu¬ 
tational Studies). The studies were performed in 
enzyme frozen solutions prepared in natural- 
abundance and *^0-enriched water, and ESEEM traces 
were recorded for several values of the magnetic field. 
The paramagnetic species is in this case a low-spin 
(S = 1 / 2 ) ferric iron. When the electron spin interacts 
with more than one nucleus, the modulation of the echo 
is given by the multiplication of the modulation that 
would be produced by each nucleus independently 
(product rule). In the study, the interest was focused 
on the modulation signals of (1 = 5/2), so the 
ESEEM time traces of the enzyme in '^0-enriched 
water were divided by the ones in natural-abundant 
water to obtain only the modulation due to 
(Eig. 3). Simulations of the ratio spectra provided the 
hyperfine (A) and nuclear quadrupole (Q) interaction 
tensors. Based on Q, it was concluded that confined 
water, and not OH“, occupies the 6th coordination site 
of iron. The orientation constrains were attributed to 
hydrogen bounding to other water molecules or protein 
side chains or to orbital overlap. 

In the next selected example (Kim et al. 2006), the 
authors used Davies and Mims ENDOR to investigate 
a reaction intermediate (compound I) of the enzyme 
chloroperoxidase (► Peroxidases) (CPO). In this case, 
the paramagnetic species is a heme Pe(IV) iron 
interacting through exchange coupling with a free rad¬ 
ical which was supposed to be localized either in the 
porphyrin or in the sulfur axial ligand. The Davies 
ENDOR spectrum shown in Eig. 4 reveals the hyperfine 
coupling with an exchangeable proton assigned to the 
peptide hydrogen bonds from closeby alanine and proline 
residues to the sulfur. The experiment with the enzyme 
prepared in deuterated water shows the corresponding 
signals of deuterons and a further proton coupling 
(labeled in the figure with a horizontal bar) that was 
assigned to the p protons of the cysteinyl residue. The 
hyperfine coupling of this proton allows to estimate an 
upper bound for the spin density on the sulfur and con¬ 
clude that the radical is predominantly on the porphyrin. 

The potential of HYSCORE is demonstrated in the 
next example (Garcia-Rubio et al. 2008; Eittipaldi 
et al. 2008). In this study, the weak hyperfine interac¬ 
tions of high-spin (S = 5/2) ferric heme iron of 
metmyoglobin with distant nuclei were obtained. In 


Fig. 5, X-band and Q-band HYSCORE spectra taken 
at various magnetic field values throughout the aniso¬ 
tropic EPR spectrum of metmyoglobin are shown. As 
it can be seen, the spectra are crowded with signals that 
were not present in the ENDOR spectra and would 
have never been possible to resolve in an ESEEM 
spectrum in one dimension. Due to the orientation 
selection and the multifrequency approach, the hyper¬ 
fine and quadrupole tensors of two noncoordinated *"^N 
nuclei (assignments in the figure caption) and of the 
three kinds of porphyrin carbons (in isotope natural 
abundance, 1.1%) were determined. The hyperfine 
couplings of were translated to spin densities in 
carbon a and n orbitals. The spin density distribution 
allowed proposing a mechanism of spin delocalization 
via mixing of the semioccupied iron orbital and the 
occupied frontier orbital hi of the porphyrin. 

Summary 

Hyperfine and nuclear quadrupole interactions can 
reveal structural and spin density information of a 
paramagnetic center. Pulse ENDOR and ESEEM are 
two complementary techniques used to measure hyper¬ 
fine splittings nonresolved in standard EPR spectra due 
to inhomogeneous broadening. The principles of most 
common pulse ENDOR and ESEEM experiments are 
discussed, and several examples in the field of hemo- 
protein research are shown to illustrate the potential of 
these techniques. 

Cross-References 

► Cytochrome P450 - Computational Studies 

► Electron Paramagnetic Resonance Spectroscopy 

► EPR 

► EPR and ENDOR of Radicals and Metalloproteins 
in Biological Systems 

► EPR Spectroscopy: General Principles 

► Peroxidases 
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Synonyms 

NMR; PFG 


Definition 

The pulsed field gradient {PFG), as the name suggests, 
is a spatially dependent magnetic field (Bg (r)) applied 


across the NMR sample volume, for a short duration 
(typically < 10 ms), during which the magnetic field 
(Bq) becomes spatially inhomogeneous. The applica¬ 
tion of PFG for a short time (x) introduces a spatially 
dependent phase (<t> (i, r)) for the transverse magneti¬ 
zation that depends on Bg(r) as well as the gyromag- 
netic ratio (y) and the coherence order (p) of the 
individual j nucleus involved, and is given by 

0 (1,r) = 5 Bg(r) 

j 

The J dependence of the phase factor makes the 
gradient less efficient for nuclei with smaller y. This 
means that the spread of resonance frequencies across 
a sample on application of same gradient will be four 
times less for *^C than for *H nuclei. The origin of 
z-axis is assumed to be the center of the sample, which 
extends over the region ±r/2. Here, s is the shape 
factor and is defined as 



Here, A(t) is a function that defines the envelope of the 
gradient pulse. The initial applications of PFGs suf¬ 
fered from switching on the gradient pulse, which were 
to induce eddy currents in the nearby magnet bore and 
probe assembly. These currents perturb the spectrum 
by generating an additional magnetic field. This prob¬ 
lem could be overcome with the introduction of 
actively shielded gradient coils, which offer short 
recovery times, for the reestablishment of homoge¬ 
neous magnetic field after the application of a PFG. 
Since then, PFGs have been extensively used in high- 
resolution NMR. Most of the applications capitalize on 
the fact that the phase generated by the application of 
PFG depends on the coherence order and can be 
reversed by applying a gradient in opposite direction 
(by reversing the sign of the shape factor). 

Today, PFGs are used for spatial encoding of coher¬ 
ences in diffusion experiments and selection of desired 
coherence and elimination of unwanted coherences in 
high-resolution NMR spectroscopy by dephasing and 
rephasing the coherences. The use of PFGs allows 
minimization of phase cycling steps. They have the 
potential to select the desired signal even in one scan. 
This in turn helps in reducing the total instrument time. 
This revolutionized high-resolution NMR by 
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facilitating various aspects such as solvent suppres¬ 
sion, sensitivity enhancement, measurement of trans¬ 
lational diffusion, quadrature detection, shimming and 
selection of the desired coherence. 
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Push Through One-Way Valve 
Mechanism of Viral DNA Packaging 
Nanomotor 
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Pharmaceutical Sciences and Markey Cancer Center, 
University of Kentucky, Lexington, KY, USA 

Synonyms 

Molecular motors 


Definition of Nanomotor 

A motor is a device that converts any form of energy 
into mechanical energy for motion. The supplied 
energy can be achieved from an internal combustion 
or an arrangement of coils and magnets that converts 
electrical current into mechanical power. In our every¬ 
day life, we are surrounded by both combustion engines 
and electric motors. These belong to the category of 
man-made macroscopic motors. A popular technologi¬ 
cal revolution over the past few decades, however, is to 
scale down machines and appliances. This trend is 
likely to continue with the creation of many varieties 
of nanodevices. Many of these devices will require 
nanomotors (Grigoriev et al. 2004) to stimulate motion. 
Therefore, we need to downscale the way we think of 
motors to the nanoscale or molecular range. Nano- or 
molecular motors exhibit wide variability in structure. 
Most are quite complex, but in general, the structures 
consist of the same essential components: a mechanical 
frame, consisting of both moving and static parts, and 
an energy supply. The mechanical frame is formed by 
proteins in most molecular motors. However, the DNA- 
packaging motor of bacteriophage phi29 also includes 
a hexamer of RNA molecules as one of the essential 
components (Guo et al. 1987a, 1998). This motor has 
been assembled artificially using biomimetic 
approaches (Guo et al. 1986). 

Structure of Viral DNA Packaging 
Nanomotor Constructed In Vitro 

In viral replication, the long double-stranded genome 
of linear dsDNA viruses is packaged within 
a preformed protein shell called procapsid. This entro- 
pically unfavorable genome packaging process is 
accomplished by an ATP-driven viral nanomotor 
(Fig. 1). Most viral DNA packaging motors are com¬ 
posed of three modules, a dodecameric connector core 
with a channel for DNA translocation and two DNA 
packaging components, one of which is an ATPase 
(Guo et al. 1987b). In bacteriophage phi29, a virus 
that infects the bacterium Bacillus suhtilis, one of 
these two components is RNA (Guo et al. 1987a) 
and the other component is the protein enzyme gpl6 
(Guo et al. 1986, 1987b). A number of models have 
been proposed covering the main features of the mech¬ 
anism of DNA packaging (Guo and Lee 2007). 
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Push Through One-Way Valve Mechanism of Viral DNA 
Packaging Nanomotor, Fig. 1 Illustration of phi29 DNA 
packaging through the connector channel (a) during in vivo 


virion assembly; (b) embedded in membrane in vitro, (c) Struc¬ 
ture of connector/DNA complex (Adapted from ling et al. 
(2010) with permission from American Chemical Society) 


Mechanism of Motor Action 

Extensive research has shown that genomic DNA 
packaging in phi29 is similar to other hexameric pro¬ 
teins in the AAA-i- family that translocate along strands 
of dsDNA. Recent evidence supports the claim that the 
ATPase gpl6 is hexameric; years of work have verified 
that pRNA is hexameric as well (Guo et al. 1998; 
Shu et al. 2007; Xiao et al. 2008). Simply, the motor 
functions through two distinct roles: gpl6 ATPase for 
pushing dsDNA toward the portal and the connector 
for one-way valve control to prevent built-up pressure 
from inside the capsid to expel the dsDNA during the 
packaging process. Gpl6 threads dsDNA into the cap¬ 
sid, using ATP as energy and pRNA as a foothold, 
while the connector opens and closes in a coordinated 
function during each hydrolysis step to clamp and 
position dsDNA entry. 

Operation of Motor Components 

Many years have passed in a quest to understand the 
mechanism of viral DNA packaging and several dif¬ 
ferent models have been proposed. These include, but 
are not limited to, the following: (1) DNA compression 
and relaxation (Khan et al. 1995; Oram et al. 2008; 
Sabanayagam et al. 2007), (2) force of osmotic pres¬ 
sure (Serwer 1988), (3) ratchet mechanism (Fujisawa 
and Morita 1997), (4) Brownian motion (Astumian 
1997), (5) hvefold/sixfold mismatch connector 


rotating thread (Hendrix 1978), (6) supercoiled DNA 
wrapping (Grimes and Anderson 1997), (7) sequential 
action of motor components (Chen and Guo 1997; 
Moffitt et al. 2009), (8) electro-dipole within central 
channel (Guasch et al. 2002), (9) connector contraction 
hypothesis (Morita et al. 1995), and (10) connector 
rotation model (Guasch et al. 2002). However, none 
of these models have been conclusively supported by 
experimental data. There are cases where some models 
have been validated in one viral system but refuted in 
other systems. The hvefold/sixfold mismatch connec¬ 
tor rotation model, hrst proposed in 1978, was popular 
since it described the perpetual motion of the motor 
(Hendrix 1978). This mechanical motor prototype has 
been prevalent based on the mismatch between the 
hvefold symmetry of capsid at the vertex and sixfold 
symmetry of the connector, which is a 12-subunit 
dodecamer. Years later, however, another model was 
proposed in which the connector revolved during DNA 
packaging (Guasch et al. 2002). In this model, the 
positively charged lysine residues inside the highly 
negatively charged connector channel were thought 
to interact favorably with the negatively charged phos¬ 
phate backbone of DNA in the process of driving the 
DNA into the procapsid. However, both the proposed 
connector rotation model (Guasch et al. 2002) and the 
hvefold/sixfold mismatch connector rotating thread 
model (Guasch et al. 2002; Simpson et al. 2000) have 
been invalidated by single-molecule studies through 
huorescent labeling of the connector (Hugel et al. 
2007) and by testing with connector cross-linking 




Push Through One-Way Valve Mechanism of Viral DNA Packaging Nanomotor 


2135 


P 



Antibody to 
procapsid 
Connector — 


Quartz slide 
Procapsid 

pRNA 


Fluorophore 

Coverslip 


Objective 



144 

=j 

108 

< 

72 

'55 

36 

c 

B 

0 

c 

0) 

o 

c 

0 

102 

o 

05 

0 

68 

O 

D 

34 

LL 

0 










10 


,i: . Cy3 




12 


15 


12 


18 


Time (min) 



Coverslip 


0) 642 
o 

I 640 
b 

0) 638 


636 





100 200 300 400 

Time (s) 


500 


600 


Push Through One-Way Valve Mechanism of Viral DNA 
Packaging Nanomotor, Fig. 2 Single-molecule observation 
of phi29 DNA packaging motor, (a) Single-molecule fluores¬ 
cence photobleaching assay of pRNA. Top: the schematic of 
experimental setup. Bottom: fluorescence intensities time trajec¬ 
tory of differently labeled pRNA in one motor complex. 


(b) Direct observation of DNA motion during packaging. Top: 
schematic of the experimental setup. Bottom: Sequential images 
of a fluorescent particle attached to DNA and time trace of 
distance moved during DNA packaging (Adapted from Shu 
et al. (2007) with permission from European Molecular Biology 
Organization) 




experiments (Baumann et al. 2006; Maluf and Feiss 
2006). 

In 1998, it was proposed by Guo that the mechanism 
of viral DNA packaging motors is similar to that of the 
hexameric AAAh- ATPases that translocate along the 
strands of DNA (Guo et al. 1998). Recent evidence 
revealed that both the DNA-packaging pRNA (Fig. 2a) 
(Guo et al. 1998; Shu et al. 2007; Xiao et al. 2008) and 
the ATPase gpl6 of bacteriophage phi29 DNA pack¬ 
aging motor form hexamer rings (unpublished). Recent 
studies have revealed that the motor connector core is 
a valve that acts to control DNA traffic unidirection- 
ally. DNA is packaged through the motor channel 
section with the aid of the DNA packaging enzyme 
by energy provided by ATP hydrolysis in a mechanism 
similar to the hexameric AAA-i- family (Fang et al. 
2012; Geng et al. 2011; Schwartz et al. 2011). Single¬ 


molecule techniques have been applied to study phi29 
DNA packaging mechanisms. Motion of the DNA 
during packaging by a single phi29 motor was directly 
observed (Chang et al. 2008; Shu et al. 2007) (Fig. 2b). 
In addition, using optical tweezers, DNA packaging 
speed and force were measured for an individual motor 
complex (Moffitt et al. 2009; Rickgauer et al. 2008; 
Smith et al. 2001) (Fig. 3). A mechanism of coordina- 
tive interaction among motor subunits was proposed to 
elucidate the packaging process (Moffitt et al. 2009). 

As documented in many phage systems (Casjens 
2011; Guo et al. 1987b; Guo and Lee 2007; Zhang 
et al. 2012), the ATPase, or terminase as it is called 
in many systems, plays a key role in transporting the 
DNA into the procapsid. However, the role of the 
connector in DNA translocation remained 
unidentified. Recently, the use of a new method to 











p 


2136 


Push Through One-Way Valve Mechanism of Viral DNA Packaging Nanomotor 



Push Through One-Way Valve Mechanism of Viral DNA 
Packaging Nanomotor, Fig. 3 A. Schematic of the 
magnetomechanical system of phi29 DNA packaging. B. (a) 
DNA tether length against time for two different motors, (b) 


DNA packaging rate against the amount of DNA packaged 
relative to the original phi29 DNA length (Adapted from 
Chang et al. (2008) with permission from American Institute of 
Physics) 


measure dsDNA translocation through the portal pro¬ 
tein embedded in a lipid bilayer has facilitated the 
proposal of a pump and valve mechanism for bacteri¬ 
ophage phi29 DNA packaging motor (Guo and Lee 
2007) (for review, please see Haque and Guo 2011) 
which has been validated in recent reports (Fang et al. 
2012; Geng et al. 2011; Jing et al. 2010). In this model, 
the connector does not rotate, and the DNA is pushed 
through the connector channel section by section into 
the procapsid with the aid of a DNA packaging enzyme 
or terminase. The results are consistent with the “Push 


Through a One-way Valve” model proposed recently 
by Guo and coworkers (Fang et al. 2012; Schwartz 
et al. 2011), that suggests that the connector and 
ATPase gpl6 act separately but in a coordinated fash¬ 
ion to complete the packaging task. In this model, 
dsDNA packaging occurs through the combination of 
two individual tasks. The first one is the active pushing 
mechanism provided by the ATPase gpl6 (Aathavan 
et al. 2009; Guo et al. 1987b; Schwartz et al. 2011) 
bound to pRNA (Lee and Guo 2006; Zhao et al. 2008) 
(Fig. 6), which acts as a fulcrum to provide a base for 
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Push Through One-Way Valve Mechanism of Viral DNA at one voltage polarity (Adapted from Jing et at. (2010) with 
Packaging Nanomotor, Fig. 4 DNA translocation through permission from American Chemical Society) 
a single membrane-embedded connector channel only occurred 


the active motion during the ATP hydrolysis step. In 
the case of other dsDNA phage systems, the large 
subunit of the two terminases (Guo et al. 1987b) 
grips the motor component to push the dsDNA during 
the hydrolysis of ATP to ADP and inorganic phos¬ 
phate. The second step in the packaging task is the 
control of the direction of DNA migration using 
a channel to serve as a one-way valve (Jing et al. 
2010). The successful insertion of the connector chan¬ 
nel into a bilayer lipid membrane made it possible to 
study dsDNA translocation through the channel 
using a single-channel conductance assay (Fig. lb) 
(Jing et al. 2010; Wendell et al. 2009). Events of 
DNA translocation only occurred at one voltage polar¬ 
ity, demonstrated by numerous current blockades, 
while no events occurred at the other voltage polarity 
(Jing et al. 2010) (Fig. 4). Moreover, it was demon¬ 
strated that multiple channel insertions in the lipid 
membrane affects the frequency of DNA 


translocations due to their different orientation 
arrangements (Jing et al. 2010) (Fig. 5). These phe¬ 
nomena demonstrate that phi29 connector only allows 
DNA trafficking in one direction. Furthermore, the 
orientation of the embedded connector in DNA trans¬ 
location was probed and translocation was determined 
to take place from the narrower N-terminal end toward 
the wider C-terminal end. 

Although the protein sequences of many viral con¬ 
nectors share little homology, the global structures are 
morphologically similar (Guasch et al. 2002; Febedev 
et al. 2007; Olia et al. 2011; Simpson et al. 2001), 
suggesting that viral DNA packaging might utilize 
similar mechanisms. Extensive studies revealed that 
the structure of phi29 connector is a cone-shaped ring 
surrounded by 12 subunits of gplO (Badasso et al. 
2000; Guasch et al. 1998, 2002; Guo et al. 2005; 
Jimenez et al. 1986; Simpson et al. 2000, 2001). 
The wider end, which is buried inside the procapsid. 
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Push Through One-Way Valve Mechanism of Viral DNA Packaging Nanomotor, the connector in the bilayer (Adapted from Jing et al. (2010) with permission from 

Fig. 5 Change in DNA translocation frequency related to the orientation of the con- American Chemical Society) 

nectors in bilayers upon insertion of multiple connectors. Insets depict the orientation of 
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Push Through One-Way Valve Mechanism of Viral DNA 
Packaging Nanomotor, Fig. 6 “Push through one-way 
valve” mechanism of phi29 DNA packaging. ATPase gpl6 
pushes DNA into the procapsid using energy from ATP hydro¬ 
lysis. The connector channel exercises a unidirectional one-way 
traffic control to prevent DNA slipping during the packaging 
process (Adapted from Fang et al. (2012) with permission from 
Biophysical Journal) 


is 13.6 nm and the narrower protruding end is 6.8 nm. 
The narrowest inner diameter of the connector channel 
is 3.6 nm while the wider end of the channel is 6.2 nm 
which allows dsDNA to enter during DNA packaging 
and to exit during host infection. Since the highly 
negatively-charged channel wall is decorated with 
four lysine rings (K200, K209, K234, K235) scattered 
inside the channel, these basic lysine residues were 
proposed to interact with the negative phosphate back¬ 
bone of dsDNA during DNA packaging (Guasch et al. 
2002). The active role of the connector as well as the 
potential function of the lysine residues in DNA pack¬ 
aging have also been proposed in many DNA packag¬ 
ing models (Grimes and Anderson 1997; Guasch et al. 
2002; Simpson et al. 2000). It was unexpectedly found 
that neither mutation of the basic residues or changing 
of pH to 4 or 10 did not measurably impair DNA 
translocation or affect the one-way traffic property of 
the channel, suggesting that the positive charges in the 


lysine ring of the channel wall do not play a critical 
role in gearing the dsDNA. The discovery that the 
lysine residues of the connector are not critical for 
dsDNA translocation argues against the connector 
gearing or rotation model but lends support to the 
“Push through a one-way valve” mechanism. 

This mechanism is consistent with and supported by 
the recent hnding in the T4 phage system showing that 
dsDNA was compressed if DNA entry was blocked 
(Oram et al. 2008; Ray et al. 2010a, b; Sabanayagam 
et al. 2007). Although the authors interpreted that the 
force for compression was due to the torsional force 
from the coiled DNA at the external end, compression 
can also be attributed to ATPase pushing. In fact, their 
result supports our report in 1998 that the function of 
ATPase gpl6 was similar to the DNA-tracking motors 
of AAA-i- family that twist or ride along the dsDNA 
(Guo et al. 1998). Furthermore, in the T4 system, both 
ends of the packaged DNA were reported to remain 
outside of the procapsid (Ray et al. 2010a, b). If the 
motor would have generated a pulling instead of push¬ 
ing function, it is difficult to envision how the initiation 
of DNA translocation begins when both ends are 
located outside the capsid (Ray et al. 2010a, b). The 
“Push Through a One-way Valve” model is also con¬ 
sistent with the “supercoiled DNA wrapping model” 
(Grimes and Anderson 1997) that demonstrated the 
coiling of DNA, and the “ratchet model” in phage T3 
(Fujisawa and Morita 1997) that specified the direc¬ 
tional control of DNA translocation. 


Conversion of Chemical Energy from ATP 
Hydrolysis to Physical Motion 

The sequential action among the ATPase and addi¬ 
tional motor components is the most important event 
in force generation. It has been revealed that the con¬ 
tact of ATPase gpl6 to ATP resulted in its conforma¬ 
tional change to a higher binding affinity toward 
dsDNA (Schwartz et al. 2011). It was also found that 
ATP hydrolysis led to the departure of dsDNA from 
the ATPase/dsDNA complex, an action that might be 
the key step to push dsDNA to move along the con¬ 
nector channel (Schwartz et al. 2011). Motion of the 
ATPase gpl6 along the DNA is the force generation 
step, and it seems that gpl6 only translocates along one 
strand of the dsDNA with a 5' to 3' orientation 
(Aathavan et al. 2009). 
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Synonyms 

Molybdopterin 


Definition 

Pyranopterin, or molybdopterin as it is frequently 
referred to in the literature, is the organic component 
of molybdenum- and tungsten-containing enzymes 
(other than the molybdenum-containing nitrogenases) 
that coordinates to and modulates the properties of the 
metal center. While not formally redox-active, the 
pyranopterin is frequently found positioned within 
the polypeptide such that it intervenes between the 
molybdenum center and other redox-active sites in 
the protein. 

Basic Characteristics 

Molybdenum is broadly distributed in the aerobic bio¬ 
sphere owing to the high water-solubility of molyb¬ 
date, Mo 04 ^, in water. Likewise, tungsten is broadly 
in the anaerobic biosphere, and the two metals are 
frequently found to catalyze similar reactions, e.g., 
the oxidation of aldehydes to carboxylic acids 
(Hille 1996). With the notable exception of the molyb¬ 
denum-containing nitrogenases, all molybdenum- and 
tungsten-containing enzymes possess an organic 
cofactor that coordinates the metal via an enedithiolate 
group. This cofactor is derived from guanosine triphos¬ 
phate (GTP) in a multistep biosynthetic pathway and is 
variously referred to in the literature as pyranopterin or 
molybdopterin, and sometimes tungstopterin when 
tungsten is present. The cofactor consists of a pterin 
fused to a pyran ring possessing the enedithiolate and 
a phosphorylated side chain (Fig. 1). In eukaryotes, the 
cofactor exists as shown but in prokaryotes it is fre¬ 
quently elaborated as the dinucleotide of guanine, 
cytosine, adenosine, or infrequently inosine, and 
abbreviated (e.g., MGD for molybdopterin guanine 
dinucleotide). 

Although examples exist of enzymes possessing 
a molybdenum center as the sole redox-active center 
(e.g., the plant sulhte oxidases and dimethyl sulfoxide 
(DMSO) reductase from organisms such as 
Rhodobacter sphaeroides), most often they are found 
as part of large multi-subunit (or at least multi-domain) 
systems with multiple redox-active centers. The X-ray 
crystal structures of representative members of each of 
the three families of molybdenum-containing 
enzymes, as well as the two families of tungsten- 
containing enzymes, have been determined and 
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Pyranopterins, Fig. 1 The structure of the pyranopterin found 
in molybdenum- and tungsten-containing enzymes 


provide a structural context in which to understand 
biological electron transfer between centers that are 
held more or less fixed with respect to one another 
within a fully defined environment. As the molybde¬ 
num-containing enzymes are by far the best character¬ 
ized, electron transfer involving the molybdenum 
center in the case of a representative of each of the 
three families of enzymes is discussed. 

The first family of molybdenum-containing 
enzymes includes xanthine oxidoreductase and related 
enzymes that are involved in the hydroxylation of 
purines and other aromatic heterocycles, or aldehydes; 
the best studied member is xanthine oxidoreductase 
(XOR) from cow’s milk, which is a homodimer of 
^300kDa. The molybdenum center of oxidized XOR 
can be represented as LMo'^*OS(OH), where 
L represents the bidentate enedithiolate of the 
pyranopterin cofactor. The metal coordination geome¬ 
try is distorted square-pyramidal, with an apical 
Mo=0; the equatorial Mo-OH is catalytically labile 
and is transferred to substrate in the course of oxidative 
hydroxylation, being replaced by solvent hydroxide at 
the completion of the catalytic sequence. Upon reduc¬ 
tion, the equatorial Mo=S becomes reduced, giving an 
LMo^'^0(SH)(OH) center. In addition to 
a molybdenum center, each subunit of the enzyme 
possesses two [2Fe-2S] ► iron-sulfur clusters of the 
spinach ferredoxin variety and ► flavin adenine dinu¬ 
cleotide (FAD). The X-ray crystal structure of bovine 
XOR (Enroth et al. 2000) reveals that each of these 
redox-active centers resides in a separately folded 
domain of the protein and their layout within the 
subunit is such that there is a clear electron transfer 
sequence Mo ^ Fe/S I ^ Fe/S II ^ FAD (the two 
iron-sulfur clusters being designated Fe/S I and II on 
the basis of their EPR signals). Thus, after reduction by 
substrate, the molybdenum center, electrons pass indi¬ 
vidually from the molybdenum center via the Ee/S 
clusters to the EAD, where they are removed by 



Pyranopterins, Fig. 2 The orientation of the molybdenum 
center (right) and proximal [2Fe-2S] cluster (left) in bovine 
xanthine oxidoreductase (PDB 3B9J). Molybdenum is indicated 
in gray, iron in orange, and inorganic sulfur in yellow. The 
pyranopterin cofactor and iron-sulfur cysteines are rendered in 
the CPK coloring scheme. The distance between the molybde¬ 
num ion and the sulfur of the nearest cysteine ligand is 12.4 A 

reaction with ► nicotinamide adenine dinucleotide 
(NAD"^) or O 2 , depending on whether the enzyme is 
in a dehydrogenase or oxidase configuration (this 
determined by the conformation of the EAD-binding 
domain of the protein and dependent on factors such as 
the extent of proteolysis and oxidation of thiols to 
disulfides in the course of purification). Reoxidation 
of the molybdenum involves deprotonation of the Mo- 
SH group, and the process represents an example of 
proton-coupled electron transfer, a general phenome¬ 
non in molybdenum-containing enzymes. 

In the crystal structure of XOR, the nearer Ee/S I is 
disposed relative to the molybdenum center as shown in 
Fig. 2, with the iron-sulfur center adjacent to the distal 
amino group of the pyranopterin (in the structure of the 
closely related aldehyde oxidoreductase from 
Desulfovihrio gigas, this amino group is thought to 
hydrogen bond to one of the iron-coordinating cysteine 
residues of the cluster; Romao et al. 1995). The distance 
from the molybdenum atom to this nearest cysteinyl 
sulfur (which can be considered to represent the edge 
of the redox-active cluster) is 12.7 A. Electron transfer 
from the molybdenum center to the iron-sulfurs has 
been examined using pulse radiolysis to rapidly intro¬ 
duce reducing equivalents (via radiolytically generated 
A-methylnicotinamide radical, MeN*) and subsequently 
follow their migration through the protein, electron 
transfer from the molybdenum to Ee/S I being found 
to occur with a pH-independent rate constant of 
8,500 s“' at 25°C (Hille and Anderson 1991). This 
value is actually somewhat slower than expected on 
the basis of the distance between the electron transfer 
partners (Page et al. 1999), indicating that while the 
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pyranopterin intervenes between the molybdenum and 
Fe/S I, it does not provide an exceptionally effective 
pathway for electron transfer. The principal reason for 
this appears to have to do with the only limited 
TT-conjugation of the pyranopterin ring system, and the 
fact that electron egress from the redox-active dxy 
orbital of the molybdenum is principally ct rather than 
Ti in nature with respect to the enedithiolate ligand 
(Jones et al. 1999). 

Electron transfer has also been extensively studied 
in chicken sulfite oxidase, the eponymous member of 
a family of molybdenum-containing enzymes 
possessing an LMo'^^ 02 (S-Cys) core in the oxidized 
enzyme, with one apical and one equatorial Mo=0. 
Again, reduction results in protonation to give 
LMo^'^0(OH)(S-Cys), it being the equatorial (and cat- 
alytically labile) Mo=0 that becomes protonated. 
The enzyme, a homodimer-like xanthine oxidoreduc- 
tase, also has a fo-type cytochrome. Using flash photol¬ 
ysis to rapidly introduce an electron into the enzyme, 
rate constants for intramolecular electron transfer of 
500-1,000 s“* (depending on the experimental condi¬ 
tions) at 25°C have been determined. Interpretation of 
these values in the context of standard ► electron 
transfer theory is complicated by the fact that in the 
crystal structure of the protein, the heme domains, 
connected to the molybdenum-binding portion of the 
protein by a 12 amino acid long tether, have swung 
away from the molybdenum center with a heme- 
molybdenum distance in excess of 30 A (Kisker et al. 
1997). This distance is too great for electron transfer to 
occur on the timescale observed, and the electron 
transfer-active conformation of the heme domain is 
not known. An examination of the surface topology 
suggests that it is possible for the heme domain 
to swing into place over the solvent access channel to 
the active site, bringing the exposed heme edge to 
within ^10 A of the molybdenum and in a position 
similar to that seen by the heme-containing subunit of 
the related sulfite dehydrogenase from Starkeya 
novella (Kappler and Bailey 2005). A second position 
also exists at the back of the molybdenum-binding 
domain, however, with the distal amino group of the 
pyranopterin some 6.5 A from the surface of the 
pyranopterin intervening between heme and molybde¬ 
num (at the longer distance of ^17 A). Uncertainties 
regarding the orientation of the heme domain notwith¬ 
standing, on the basis of the disposition of the 
pyranopterin in the molybdenum-binding domain, it 


is not possible to identify a uniquely effective electron 
transfer pathway involving the pyranopterin. That 
motion of the heme domain is essential for catalysis 
is reflected in the viscosity-dependence of electron 
transfer in sulfite oxidase, reflecting the fact that the 
electron transfer event can be thought of as being 
conformationally ► gated electron transfer (i.e., occur¬ 
ring within only a limited number of protein configu¬ 
rations that are part of a much larger ensemble). 

The final family of molybdenum-containing 
enzymes that utilize pyranopterin consists of DMSO 
reductase and related enzymes. The molybdenum cen¬ 
ter of oxidized DMSO reductase from R. sphaeroides 
has a molybdenum coordination sphere with two 
equivalents of pyranopterin coordinated to the metal 
as L 2 Mo'^^ 0 ( 0 -Ser) in a trigonal prismatic coordina¬ 
tion geometry; the two equivalents of the pterin cofac¬ 
tor are designated P and Q on the basis of their 
disposition in the polypeptide fold. Upon reduction, 
the Mo=0 group is lost altogether, and the molybde¬ 
num coordination sphere assumes an approximately 
square-pyramidal coordination geometry with the ser¬ 
ine ligand occupying the apical position. While the 
enzymes from Rhodohacter species possess 
a molybdenum center as their sole redox-active center, 
other members of this family contain a number of iron- 
sulfur clusters, one of which always lies near the 
Q pterin (typically within ^6.5 A). It appears that the 
electronic properties of the Q pterin, as distinct from 
the P pterin, direct electron transfer out of the molyb¬ 
denum center toward its nearest redox partner (in the 
case of several enzymes in this family, the physiolog¬ 
ical direction of electron flow is toward, rather than 
away from, the molybdenum center). Among the dif¬ 
ferences between the P and Q pterin, the former is 
thought to have an enedithiolate group that is exten¬ 
sively TT-delocalized, while the latter has more discrete 
dithiolate character on the basis of their resonance 
Raman signatures (Garton et al. 1997). Unfortunately, 
there is no case in which intramolecular electron trans¬ 
fer involving a ft/s-enedithiolate molybdenum center 
has been investigated. 

Cross-References 

► Coupled Electron Transfer 

► Electron Transfer Cofactors 

► Electron Transfer Proteins; Overview 
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► Electron Transfer Theory 

► Electron Transfer Through Proteins 

► Gated Electron Transfer 

► Iron-Sulfur Clusters 

► Proton-Coupled Electron Transfer 

References 

Enroth C, Eger BT, Okamoto K, Nishino T, Nishino T, Pai EE. 
Crystal structures of bovine milk xanthine dehydrogenase 
and xanthine oxidase: structure-based mechanism of conver¬ 
sion. Proc Natl Acad Sci USA. 2000;97:10723-8. 

Feng C, Kedia RV, Hazzard JT, Hurley JK, Tollin G, Enemark JH. 
Effect of solution viscosity on intramolecular electron transfer 
in sulfite oxidase. Biochemistry. 2002;41:5816-21. 

Garton SD, Hilton J, Oku H, Crouse BR, Rajagopalan KV, 
Johnson MK. Active site and catalytic mechanism of 
Rhodohacter sphaeroides dimethyl sulfoxide reductases as 
revealed by resonance Raman spectroscopy. J Am Chem 
Soc. 1997;119:12906-16. 


Hille R. The mononuclear molybdenum enzymes. Chem Rev. 
1996;96:2757-816. 

Hille R, Anderson RE. Electron transfer in milk xanthine oxidase as 
studied by pulse radiolysis. J Biol Chem. 1991;266:5608-15. 

Jones RM, Inscore EE, Hille R, Kirk ML. Freeze-quench magnetic 
circular dichroism spectroscopic study of the “very rapid” inter¬ 
mediate in xanthine oxidase. Inorg Chem. 1999;38:4963-70. 

Kappler U, Bailey S. Molecular basis of intramolecular electron 
transfer in sulfite-oxidizing enzymes is revealed by high- 
resolution structure of a heterodimeric complex of the cata¬ 
lytic molybdopterin subunit and a c-type cytochrome 
subunit. J Biol Chem. 2005;280:24999-5007. 

Kisker C, Schindelin H, Pacheco A, Wehbi WA, Garrett RM, 
Rajagopalan KV, Enemark JH, Rees DC. Molecular basis of 
sulfite oxidase deficiency from the structure of sulfite oxi¬ 
dase. Cell. 1997;91:973-83. 

Page CC, Moser CC, Chen X, Dutton PL. Natural engineering 
principles of electron tunnelling in biological oxidation- 
reduction. Nature. 1999;402:47-52. 

Romao MJ, Archer M, Moura I, Moura JJG, LeGall J, Engh R, 
Schneider M, Hof P, Huber R. Crystal structure of the xan¬ 
thine oxidase-related aldehyde oxido-reductase from 
Desulfovihrio gigas. Science. 1995;270:1170-6. 



Q 


q (Queer)-Channels 

► HCN Channels: Biophysics and Functional 
Relevance 


Quenched-Flow Methods 

Corinne Lionne 

CPBS, UMR5236 CNRS, Universities of Montpellier 
I and II, Montpellier, Cedex 5, France 

Synonyms 

Rapid kinetic methods 

Introduction 

Transient kinetics methods are essential for the char¬ 
acterization of enzyme reactions, that is, the chemistry 
and the rates of inter-conversion of the intermediates 
that make up their pathways. Optical methods such as 
the ► stopped-flow method have been used extensively 
toward this end. However, many enzyme systems are 
optically silent and, furthermore, when there is a signal 
it may be difficult to assign to a particular intermediate. 
In many cases, the quenched-flow method is a solution 
to these problems. 

The quenched-flow method is a chemical sampling 
method used in the milliseconds to seconds time range. 
Experiments are carried out in especially constructed 
apparatus in which the enzyme and substrate(s) are 


mixed, the reaction mixtures aged for milliseconds to 
tens of seconds, and the reaction stopped (quenched) 
by a suitable agent, typically acid. The quenched reac¬ 
tion mixtures are then assayed at leisure. Assays can be 
chosen that are both specific and sensitive such as 
HPLC. The quenched-flow technique has been applied 
to several enzyme systems (for reviews, see Barman 
and Travers 1985; Johnson 1995; Barman et al. 2006). 

How to Start and Stop the Reaction? 

There are different modes: time-delay, continuous, or 
pulsed. The principle of each mode is described and 
illustrated below. In all three modes, reservoir syringes 
contain solutions of enzyme, substrate(s), buffer, or 
quencher. 

The time-delay mode was hrst described by Fersht 
and Jakes (1975). Enzyme and substrate(s) are mixed 
by pushing the plungers of the “enzyme” and “sub¬ 
strate” reservoir syringes. The mixture occurs in 
the mixer Ml (Eig. 1) and the reaction proceeds in 
a delay line (between Ml and M2). At a given time, 
a second drive pushes the plungers of the “buffer” and 
“quencher” reservoir syringes. The buffer chases the 
reaction mixture into mixer M2, where the reaction is 
stopped by a quenching agent (typically acid), in 
a collection tube or loop after which the quenched 
reaction mixture is collected and analyzed. To obtain 
the time course of the reaction, the experiment is 
repeated at different times of quenching. 

The second mode is based on the optical ► contin¬ 
uous flow method of Hartridge and Roughton (1923). 
The principle is to mix the two solutions and then to 
observe a change in spectroscopic properties of the 


G.C.K. Roberts (ed.). Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
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Before experiment 





At time f chosen 



Apparatus is ready 
for experiment at 
another time t 



Reaction mixture 
quenched at time t 


Quenched-Fiow Methods, Fig. 1 Principle of the time-delay 
mode 


mixture at different place along the reaction tube dur¬ 
ing the flow. In the continuous mode of the quenched- 
flow method, the observation chamber is replaced by 
a second mixer in which the quenching agent arrives. 
Different aging times are obtained by varying either 
the flow rate or the volume of the line between the first 
and the second mixer, as illustrated in Fig. 2. In the 
continuous flow mode, large reaction mixture volumes 
are needed, typically 1 mL. 

Considerable amount of solutions may be saved if 
enzyme and substrates are preloaded in two sample 
loops located between the two reservoir syringes and 
the first mixer Ml and if the reservoir syringes is filled 
in with buffer (Eccleston et al. 1985). In this way, 
reaction mixtures of less than 50 pL can be obtained. 

Recent devices combine the two modes into 
a pulsed mode that offers the advantage of a unique 
reaction line (between Ml and M2). This reaction line 
has to be small enough to allow small quenching time 
with a continuous fast flow. Longer quenching times 
are obtained by pushing the samples by successive 
small steps that are separated by delay pauses. The 
final quenching time is equal to the duration of pushing 
steps plus the duration of the delay pauses between 
them multiplied by the number of steps that are neces¬ 
sary to All in the reaction line. For example, if the 
reaction line is 4 pL and each pushing step corresponds 
to 1 pL of reaction mixture, then the time of quenching 
is equal to four times the duration of each step plus four 
times the duration of the delay between each step 
(Fig. 3). The importance of the pulsing is that it causes 
turbulent flow to ensure good mixing and an aging 
process that is linear with distance along the tube. 

In addition to the technical difficulties common 
to all rapid mixing techniques (e.g., the chemical inert¬ 
ness of the device material or the precision and unifor¬ 
mity of temperature control), particular care should 
be taken for efficient mixing. This is because two 
mixing steps are necessary in quenched-flow devices: 
one to trigger the reaction, the other to stop it. Different 
configurations have been used for the mixers. With 
solutions of equal and low viscosity, a simple 
T-mixer in which the jets that carry the two reagents 
are slightly offset is sufficient. With a 1.6-mm diame¬ 
ter reaction tube, flow rates below 1 m/s (that corre¬ 
sponds to 2 mL/s) or above 6 m/s (12 mL/s) need to be 
avoided, because they may lead to laminar flow or 
cavitation problems, respectively. Poor mixing leads 
to false transient lag or burst phases, depending if it 
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Quenched-Flow Methods, 
Fig. 2 Principle of the 
continuous flow mode 



Successive 

micro-pulses 



Reaction mixture quenched at time 

t= nb of pulses x (duration of pulse + delay) 



Possible washing of the reaction tube between time points 


Quenched-Flow Methods, Fig. 3 Principle of the pulsed mode 

concerns the (enzyme -i- substrate) or the (reaction 
mixture -i- quencher) mixing, respectively. A conve¬ 
nient test to check for the quality of the mixing is to 
use the simple reaction of alkaline hydrolysis of 
2,4-dinitrophenyl acetate (DNPA). Sodium hydroxide 


is mixed with DNPA and the reaction quenched with 
hydrochloric acid. The formation of dinitrophenol is 
assayed in a spectrophotometer by measuring the 
absorbance at 320 nm. The observed rate constant of 
the hydrolysis reaction depends on the concentration 
of sodium hydroxide used. This simple and cheap 
assay can be used at different flow rates to determine 
the lower and upper limit of flow rate in a particular 
apparatus (see Eccleston et al. 2008 for details). The 
complete reaction profile can be compared with that 
obtained using a stopped-flow spectrophotometer. 

Poor quenching is not always due to a bad mixing, it 
may be due to the choice of quencher. The quenching 
has to be total, irreversible, and fast. The most popular 
quenching agent is strong acid, in which non-covalent 
enzyme product intermediates dissociate and therefore 
total product, i.e., free plus enzyme-bound product is 
determined. In the case of acid stable enzyme, protein 
proteolysis (e.g., by pepsin in the case of RNase, 
Guasch et al. 1989) may be used in addition to acid. 
When intermediates of interest are unstable in acid 
(e.g.. Naught and Tipton 2005, with phosphoglucomu- 
tase), strong alkali may be used as quencher. In case of 
irreversible ligand binding, for example, the binding of 
ATP to myosin (Geeves and Trentham 1982; Barman 
et al. 1983), radioactive substrate chase with an excess of 
unlabelled substrate was also used successfully. EDTA 
may be used in the case of divalent metal-dependent 
reaction (Kurzmack et al. 1977). Different quenching 
agents are reviewed in detail in Barman and Travers 
(1985). Other physical methods, such as filtration, rapid 
freezing, or rapid evaporation, are less commonly used. 
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How to Detect Reaction Intermediates? 

The choice of experimental conditions and of the sam¬ 
pling time scale is important for obtaining transient 
phases. These may represent decay of substrate, forma¬ 
tion and decay of reaction intermediates, or formation of 
product. Ideally, one would like to assay the concentra¬ 
tion of each intermediate, substrate, and product of 
a reaction. In practice, it is usually the concentration of 
total product (which may be protein bound as well as 
free) that is determined. In order to be able to detect 
a transient phase, one has to work on a sampling time 
that is earlier than the reciprocal of the overall catalytic 
rate constant of the reaction (l/kcat)- Experiments may be 
performed under single- or multi-turnover conditions, in 
which the enzyme concentration is higher or lower than 
the substrate concentration, respectively. In both condi¬ 
tions, it must be high enough to allow the detection of 
intermediates. In multi-turnover conditions, the enzyme 
is able to perform several turnovers, but the detection of 
intermediates is only possible during the first turnover. 
Subsequent turnovers may give information on the 
steady state (Iq-at and K^) but are less informative on 
the reaction mechanism. 

It is often difficult to match these conditions with 
the native enzyme and its physiological substrates 
under normal conditions. Therefore, perturbations of 
the system may be interesting. The enzyme itself can 
be modified, either chemically or by directed mutagen¬ 
esis. Analogues of substrate, product, or transition 
state may be used. Finally, the experimental conditions 
may be modified: pH, ionic strength, temperature. 
A powerful perturbation approach is cryoenzymology 
(Barman and Travers 1985), i.e., experimenting with 
enzymes at low temperature (under 0°C) and in the 
presence of an organic solvent that acts as antifreeze. 
Together, low temperature and solvent may have an 
advantageous effect. It is a method that can allow for 
the accumulation of intermediates not observable 
under normal conditions by slowing down their kinet¬ 
ics of formation, by changes in the rate limiting steps, 
or by shifts in equilibria. For example, with creatine 
kinase, lag and burst phases of phosphocreatine pro¬ 
duction were measurable at 4°C, but not at 25°C 
(Travers et al. 1979). Furthermore, solvent alone may 
have interesting effect on the reaction. In the case of 
muscle myosin, the use of 40% ethylene glycol in 
the buffer allowed to measure the saturation kinetics 
of ATP binding, therefore confirming the two-step 


Quenched-Flow Methods, Table 1 Quenched-flow modes: 
Typical reaction time limits 



Reaction time limits 

Maximum time 

Mode 

Minimum 

Maximum 

limited by 

Time-delay 

~0.2 s 

Several 

Diffusion of reagents into 



minutes 

mixers 

Continuous 

3 ms 

300 ms 

Minimum flow rate for 
good mixing 

Pulsed 

6 ms 

-30 s 

Time to collect samples 


binding mechanism (Barman et al. 1983). With 
3-phosphoglycerate kinase, the addition of 30% metha¬ 
nol allowed to change equilibria in the reaction, which 
resulted in the accumulation of an intermediate that was 
not detectable in absence of solvent (Geerlof et al. 1997). 

Comparison of Quenched-Flow and Other 
Rapid Mixing Techniques 

Compared to other rapid mixing techniques such as 
stopped-flow, the quenched-flow technique suffers 
from the fact that it is time consuming because it is 
a point by point method (Table 1). For each time 
course, 20-30 reaction mixtures quenched at different 
times have to be investigated. However, with the 
quenched-flow method even an optically silent reac¬ 
tion can be studied and the signal detection is specific: 
chemical species are assayed specifically and therefore 
kinetics measured can be assigned to individual steps 
in the reaction pathway. The analysis of a particular 
reaction intermediate can be ensured by the use of 
a suitable quencher. Finally, opaque, even particulate 
systems, can be studied by quenched-flow technique. 

Cross-References 

► Continuous Flow 

► Stopped-Flow Techniques 

► Transient-State Kinetic Methods 
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Definition 

Hybrid quantum mechanical/molecular mechanical 
(QM/MM) theoretical approaches are commonly used 
for computing, in an accurate and efficient manner, the 
vibrational IR and Raman spectra of protein-bound 
cofactors. 


Introduction to the QM/MM Approach 

Vibrational infrared (IR) and ► Raman spectroscopy 
are techniques largely used to probe condensed phase 
biological systems such as macromolecules in solution 
or cofactors embedded into a protein-binding pocket 
(Siebert and Hildebrandt 2008). ► Vibrational spectra 
contain a huge amount of valuable structural and 
dynamical information concerning the molecular sys¬ 
tem under study (solute) and its immediate environ¬ 
ment (solvent). However, decoding condensed phase 
spectral fingerprints are in many cases problematic, 
requiring the guidance of computational methods for 
a deeper interpretation of the experimental results 
(Siebert and Hildebrandt 2008). 

Often, vibrational studies focus on a localized 
region of a large bio-molecular system, e.g., the pro¬ 
tein-active site. If this region is small enough (less than 
200 atoms) a vibrational spectrum can be calculated 
with the necessary accuracy using quantum mechani¬ 
cal methods. This task is generally performed with 

► Density Functional Theory (DFT) since it offers 
sufficiently accurate results at a reasonable computa¬ 
tional cost. The quality of the calculated spectrum 
increases significantly when interactions with the 
immediate environment - the protein-binding pocket 
of a cofactor or the surrounding solvent waters - are 
appropriately taken into account (Speranskiy and 
Kumikova 2004). This was clearly shown for the 
heme-CO complex of myoglobin and hemoglobin 
(Cui and Karplus 2000), for the Raman spectra of 
tetrapyrrole chromophores embedded in photosensi¬ 
tive proteins (Mroginski et al. 2007) and for the IR 
spectra of quinone chromophores in the photosynthetic 
reaction center of bacteria (Nonella et al. 2003). 
A solid strategy to include intermolecular effects and 
account for environmental features is provided by 

► quantum mechanical/molecular mechanical (QM/ 
MM) hybrid methods, in which the solute is treated 
by an accurate quantum mechanical (QM) approach 
and the solvent environment by a computationally effi¬ 
cient molecular mechanics (MM) force field. 

Calculation of Vibrational Spectra 

Within a QM/MM hybrid approach, calculation of the 
vibrational spectra of condensed phase molecules can 
be performed using a variety of methods, either based 
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on the diagonalization of the Hessian matrix (Normal 
Mode Analysis, NMA), on multivariate statistical 
analysis or on time correlation functions (Schmitz 
and Tavan 2006). 

Normal Mode Analysis (NMA) and Instantaneous 
Normal Mode Analysis (INMA) 

The Normal Mode Analysis (NMA) is a conventional 
technique widely used in computational chemistry for 
the calculation of vibrational frequencies and normal 
modes of vibrations within the assumption of harmonic 
approximation. It is used in purely quantum mechani¬ 
cal, QM/MM, and empirical approaches. The standard 
procedure in the former two cases is to diagonalize the 
mass-weighted force constant matrix (Hessian matrix) 
of the molecule obtained from the second derivative 
of the total energy with respect to the atomic displace¬ 
ments (Cui and Karplus 2000). Purely quantum mechan¬ 
ical and QM/MM approaches differ in the way the energy 
is calculated. The normal modes of vibrations and the 
fundamental frequencies of the molecule are computed 
from the eigenvectors and eigenvalues of the Hessian 
matrix, respectively. Since the starting point for NMA is 
a stable conformation of the system that represents 
a minimum on the potential energy surface, this approach 
is valid only when the first derivatives of the energy with 
respect to displacement of the atoms are zero. 

Within the QM/MM framework and depending 
on the system to be studied, one may opt between 
different strategies for computing the vibrational fre¬ 
quencies. If one is interested on the vibrational spectra 
of the entire molecule, Cui and Karplus (Cui and 
Karplus 2000) proposed to compute and diagonalize 
the full Hessian matrix of macromolecule at the QM/ 
MM level (Fig. 1). This approach is referred to as the 
Full Hessian Vibrational Analysis (FHVA). Due to the 
high density of modes associated to biological mole¬ 
cules the FHVA is applicable to relatively small sys¬ 
tems such as, for example, the heme-CO complex of 
myoglobin and hemoglobin (Cui and Karplus 2000). If 
only part of the macromolecule vibrational spectra is 
relevant, for example, that of the active site of an enzyme 
or the vibrational spectra of the chromophore site of 
a photoreceptor, a partial Hessian vibrational analysis 
can be performed (PHVA), where only the subblock of 
the Hessian matrix associated with the fragment of inter¬ 
est is computed and diagonalized (Li and lensen 2002). In 
the case of a QM/MM description of the molecule, the 
calculation of the Hessian matrix within the PHVA 
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QM/MM Calculations of Vibrational Spectra, Fig. 1 Struc¬ 
ture of the QM/MM Hessian matrix. E is the total energy of the 
system, qj represents the coordinates of the QM fragment and Qj 
the coordinates of the MM fragment 

method is reduced to the submatrix involving the set of 
coordinates of the QM fragment (see Fig. 1) (Mroginski 
et al. 2007). 

In the NMA approach, spectral intensities are sep¬ 
arately calculated through the first spatial derivatives 
of either the molecular dipole moment (IR spectrum) 
or the molecular polarizability (Raman spectrum) 
(lensen 2001). Shape and width of the spectral bands 
are not directly accessible, since calculations are 
formally performed at 0 K. 

The prerequisite of finding a minimum energy 
structure is one of the mayor drawbacks of the NMA 
approach. Especially for large flexible molecules in 
complex environments it becomes a non-trivial task 
to identify all equilibrium conformations which may 
contribute to the vibrational spectrum. 

At room temperature, the environment which sur¬ 
rounds a solute molecule constantly fluctuates, adopting 
different conformations (Fig. 2). These thermal fluctua¬ 
tions of the environment induce changes in the polariza¬ 
tion of the solute’s electron distribution which modifies 
its intramolecular force field and leads to shifts of the 
vibrational bands. NMA calculations performed using 
a set of protein stmctures extracted out of a molecular 
dynamics (MD) trajectory showed that thermal stmctural 
changes of the solvent, represented by the protein matrix, 
may produce shifts of 20 cm” * (and even higher) of the 
vibrational modes of the solute molecule (Mroginski et al. 
2010). 
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QM/MM Calculations of Vibrational Spectra, Fig. 2 Top-. 
Thermal fluctuation of the Phycoerythrocyanin protein matrix 
(yellow) and its phycoviolobilin chromophore (red) during an 
MD simulation. Bottom-. QM/MM-calculated Raman spectra 
using the NMA approach for 12 different snapshots from MD 
trajectory (red) showing the dispersion of vibrational modes; 
sum over all individual spectra (black) (Mroginski et al. 2010) 

A variant of the NMA approach for describing the 
vibrational spectra of cofactors in fluctuating protein 
environments is the so-called Instantaneous Normal 
Mode Analysis (INMA) (Nonella et al. 2003; Schmitz 
and Tavan 2006). According to the INMA approach, for 
each normal mode of vibration, an average frequency and 
its standard deviation, and average intensities (IR or 
Raman) are estimated out of a set of the spectra, each 
and one of them computed with the standard NMA 
approach using stractures from different snapshots of an 
MD trajectory. In order to guarantee a minimum energy 
configuration of the solute molecule required for the 
computation of its Hessian, the instantaneous structures 
of the solvent at each snapshot are kept fixed and only the 
solute molecule is allowed to relax in the rigid solvent 
cage by energy minimization. The INMA spectmm is 
finally generated as sum of Gaussian line shapes, 
where each band represents a specific vibrational mode. 


These Gaussian functions are centered at the mode’s 
average frequency, with a bandwidth given by its stan¬ 
dard deviation and a height determined by the average IR 
or Raman intensity. 

In addition, since in the INMA the degrees of freedom 
of the solute are decoupled from those of the solvent, this 
approach does not capture dynamical effects (motional 
narrowing or vibrational dephasing) which affect the 
bandwidths (Schmitz and Tavan 2006). 

Despite its simplicity, the INMA approach has been 
successfully applied to several biological systems for 
predicting, for example, the IR spectrum of the retinal 
chromophore in bacteriorhodopsin (Babitzki et al. 
2009) and the Raman spectra of p-Lactamase inhibi¬ 
tors intermediates bound to a mutated 6-Lactamase 
(Miani et al. 2009). 

Both, the NMA and the INMA approaches are based on 
the harmonic approximation. The anharmonic effects 
which are neglected in the theoretical treatment can be 
incorporated into the computed vibrational frequencies 
and transition matrix elements via perturbation theory 
(Schmitz and Tavan 2006). Another way of correcting the 
intrinsic errors arising from the harmonic approximation 
and from the computational method used for the calcula¬ 
tion of the vibrational frequencies is to scale the computed 
frequencies (Scott and Radom 1996) or the force constants 
(Rauhut and Pulay 1995) with empirical parameters. 

Fourier Transform of the Time Correlation 
Function (FTTCF) 

Alternative to NMA approaches, the Fourier Transform 
of Time Correlation Functions (FTTCF) can also 
provide vibrational spectra (McQuarrie 2000). The 
autocorrelation function of the dipole moment or the 
polarizability can be calculated from QM/MM molecu¬ 
lar dynamics simulations at finite temperature. Under 
the assumption that the perturbation of the system due 
to the external applied field is small (linear response 
approximation), frequency, intensity, and band shape of 
the vibrational spectrum are obtained directly from the 
Fourier transformation of autocorrelation functions. 

According to the Nyquists theorem for Fourier 
transforms (McQuarrie 2000), the spectral resolution 
Av obtained by the FTTCF method depends on the 
total length AT of the analyzed MD trajectory, such 
that Av « AT“*. Thus, to achieve a spectral resolution 
of 1 cm”', a 30 ps-trajectory is required. 

An advantage of the FTTCF approach compared to 
the NMA-based methods is that dynamical effects of the 
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environment as well as vibrational anharmonicities are 
directly included in the computation of the vihrational 
spectra as a consequence of the conformational sampling 
obtained from the MD trajectory. Therefore, the FTTCF 
method can render correct line widths and line shapes. 

Until recently, the main drawback of FTTCF 
methods was the difficulty to assign the calculated 
bands to individual atomic displacements or vibra¬ 
tional modes. This problem was overcome by Martinez 
and co-workers (Martinez et al. 2006), who derived 
normal modes as linear combination of non-redundant 
internal coordinates, such that the Fourier transform of 
the auto-correlation function (or power spectrum) of 
these modes are as localized as possible in frequency. 
The approach was shown to he a generalization of the 
NMA method to anharmonic systems at finite temper¬ 
atures and to be equivalent to PMA (see next section) 
when using Cartesian coordinates. Quantitative assign¬ 
ment of the vibrational spectrum was finally carried 
out with the potential energy distribution (FED) 
approach, as used in standard normal mode analysis. 

Principal Mode Analysis (PMA) and Time-Resolved 
Filtered Generalized Virial (TF-GV) Approach 

Finally, condensed phase spectra can be calculated 
by applying multivariate statistical analysis on quantum 
or QM/MM molecular dynamics trajectories. According 
to the Principal Mode Analysis (PMA) approach, as 
proposed by Wheeler and co-workers (Wheeler and 
Dong 2003), average mass-weighted normal modes and 
associated frequencies are derived from MD trajectories, 
after diagonalization of the covariance matrix of the 
atomic fluctuations. Schmitz subsequently revised the 
PMA method, correcting the main artifacts in frequency 
position and intensity by introducing mode-specific tem¬ 
peratures and bandpass filtering in the so-called Time- 
resolved Filtered Generalized Virial (TF-GV) approach 
(Schmitz and Tavan 2006). This method was shown to 
yield accurate harmonic spectra as well as dynamical 
effects of the fluctuating environment, which allowed 
distinguishing homogeneous and inhomogeneous contri¬ 
butions to line shapes. 

Treatment of the Link Atom 

In cases where the QM and MM fragments are linked 
by covalent bonds, it is important to model these bonds 
in adequate manner. The most common procedure is to 


replace these bonds by new bonds between the QM 
fragments and artificially introduced hydrogen atoms. 
These hydrogen atoms are called link atoms and serve 
to saturate the electronic structure of the QM fragment. 

The presence of link atoms in the QM region intro¬ 
duce additional degrees of freedom so that their 
contributions to the QM/MM Hessian matrix have to 
be projected out before the normal mode analysis. 
A projection scheme has been proposed by Cui 
and Karplus (Cui and Karplus 2000) for partially 
correcting the Hessian matrix. Despite the projection, 
the presence of link atoms still have an implicit influ¬ 
ence on the Hessian elements of the boundary QM 
atom, contaminating in particular the low frequency 
vibrational modes. For example, in the case of protein- 
bound tetrapyrroles, it was shown that in particular the 
low frequency modes helow 600 cm“* were affected 
by the presence of link atoms (Mroginski et al. 2007). 

The scaled position link atom method (SPLAM), 
proposed by Tavan and co-workers (Eichinger et al. 
1999), removes the additional degrees of freedom 
introduced by the link atom by fixing the orientation 
and length of the artificial C-H bond, such that it 
describes the electronic and mechanical properties of 
the original C-C bond in the best way. This approach 
has been initially tested on an ethane molecule 
(Eichinger et al. 1999) and successfully applied for 
predicting the IR spectrum of the retinal chromophore 
in bateriorhodopsin (Babitzki et al. 2009). 

Modeling of the Solvent Environment 

As mentioned above, according to the QM/MM 
formalism the solvent environment is described with 
a computationally efficient MM force field approach. 
These conventional empirical force fields (e.g., 
CHARMM, GROMACS, AMBER) describe the electro¬ 
static interactions in terms of fixed, usually centered on 
atoms, charges (Jensen 2001). Thus, a general limitation 
inherent in such a simple description is the lack of 
dynamic electronic polarizability. The neglect of polariz¬ 
ability in standard MM force fields may cause energetical 
and structural artifacts in the MD simulation of macro¬ 
molecules which may consequently induce errors in the 
computation of the corresponding vibrational frequencies 
and spectral intensities (Schmitz and Tavan 2006). This 
in particular is tme for charged solute molecules, such as 
the protonated Schiff base of bacteriorhodopsin, where 
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QM/MM Calculations of Vibrational Spectra, Fig. 3 Exper¬ 
imental resonance Raman spectrum of the Pr state of CphlA2 
(top) compared with the Raman spectra obtained from QM/MM 
calculations (blue) and from QM calculations (red) using 
a ZZZasa and a ZZZssa conformation of the Phycocyanobilin 
chromophore (Mroginski et al. 2009). A-B, B-C, and C-D stand 
for the C=C stretching modes at the methine bridges between 
rings A and B, rings B and C, and rings C and D, respectively. 
NH ip represents the in-plane deformation mode of the N-H 
groups at rings B and C 


the significant underestimation of the strength of the 
reaction field is undoubtedly reflected on to the vibra¬ 
tional spectra (Babitzki et al. 2009). 

In proteins, the enlargement of the QM region to 
such an extent that it also includes the immediate sol¬ 
vent environment is in most cases computationally 
expensive and does not necessarily improve the quality 
of the computed QM/MM spectra (Babitzki et al. 2009). 

The use of polarized (Babitzki et al. 2009) or polar¬ 
izable MM force fields (Halgren and Damm 2001), in 
which the charge distribution of the solvent is ade¬ 
quately adapted to that of the solute and is allowed to 
vary during an MD simulation, represents a viable 
solution for improving the accuracy of vibrational 
spectra calculation within the QM/MM framework 
and at relatively low computational cost. 


Example of "In Vacuo" Versus "In Condense 
Phase" Calculations 

Theoretical calculations performed on Cphl Phyto¬ 
chrome (Mroginski et al. 2009) are a clear demon¬ 
stration of the necessity of employing QM/MM 
methods for computing the vibrational spectra 
of cofactors in proteins. The Cphl Phytochrome 
is a biological photoreceptor which harbors a 
phycocyanobilin (PCB) chromophore in a ZZZssa 
configuration (Fig. 2, Chap. ► Resonance Raman 
Spectroscopy of Protein - Cofactor Complexes). 
For this system, the QM calculations of the Raman 
spectra of an isolated model compound of the PCB 
cofactor predict the wrong conhguration of the chro¬ 
mophore. As shown in Fig. 3, the Raman spectrum 
obtained with a ZZZasa conhguration coincidentally 
provides a mode pattern in the marker band region 
similar to the QM/MM-calculated spectrum of 
the ZZZssa chromophore in the protein and thus in 
very good agreement with the experimental data. 
In particular, these results show that the neglect of 
the protein environment in the computation of 
the vibrational spectra of protein-bound cofactors 
may induce band shifts of up to 30 cm“* and hence 
blur the analysis and interpretation of the experimen¬ 
tal data. 


Summary 


Calculation of the vibrational IR and Raman spectra 
of cofactors immersed in a protein environment 
can be efficiently and accurately performed using 
a hybrid quantum mechanics/molecular mechanics 
(QM/MM) approach where only the electronically 
important region, the solute, is treated at an accurate 
QM level and its environment is described with 
a computationally cheap MM force held. Among the 
various methods to compute vibrational spectra, the 
conventional NMA formalism offers “quick” and 
“cheap” results, that can be used for normal mode 
analysis and the vibrational assignment of experimen¬ 
tal spectra. However, anharmonic and dynamical 
effects arising from thermal huctuations of the sys¬ 
tem, neglected in the NMA approach, can be computed 
via FTTCF method which provides excellent band 
shapes in addition to vibrational frequencies and 
intensities. 
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The accuracy of the QM- /MM-vibrational calcula¬ 
tions depends not only on the precision of the QM 
method but also on the description of the boundary 
region and the polarizing properties of the MM 
force field. 


Cross-References 

► Infrared Spectroscopy of Protein Structure 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 
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Synonyms 

Hybrid quantum mechanical/molecular mechanical 
methods 


Definition 

The combination of quantum mechanics (electronic 
structure) and molecular mechanics (empirical force 
field) methods, used for modeling, e.g., chemical 
reactions in large systems. 

Basic Characteristics 

Combined quantum mechanical/molecular mechanical 
methods, introduced for proteins by Warshel in the 
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mid 1970s (Warshel and Levitt 1976), have become 
a popular method for the investigation of large bio¬ 
molecules (Senn and Thiel 2009; Ranaghan 
and Mulholland 2010). QM/MM methods offer the 
advantage of lower computational cost than a 
QM calculation for a large system, and allow modeling 
of processes such as chemical reactions, which cannot 
be modeled simply by typical MM methods. Part of the 
system, e.g., the active site of an enzyme, is treated by 
a quantum mechanical electronic structure method, 
which allows the modeling of bond making and break¬ 
ing. The bulk of the system is treated more simply, but 
usually in atomic detail, by an empirical molecular 
mechanics (MM) force field. The system of interest is 
divided into a small (^lOs of atoms) QM region and 
a usually much larger (^1,000 s of atoms) MM region. 
In a typical QM/MM calculation, the total energy of 
the system can be expressed as: 

7?total = Eqm + T/mm + £qm/mm 

+ ^BOUNDARY- 

(Other possible combinations of energy terms may 
be used in the dehnition of a combined QM/MM 
energy, see below). Eqm and £mm are the energies of 
the QM and MM regions, respectively, calculated in 
a standard way at those levels. £boundary is a term to 
account for the fact that the system may have to be 
truncated for practical reasons. This term is included to 
represent the effects of any omitted parts of the system 
and bulk solution. £qm/mm describes the interaction 
between the QM and MM regions and can be treated in 
various ways. 


polarized by the MM region using the technique of 
“electrostatic embedding,” in which the partial charges 
of the MM atoms are included in the QM Hamiltonian. 
More sophisticated schemes also exist where polariza¬ 
tion of the MM region by the QM atoms is also 
included, but they are significantly more computation¬ 
ally expensive and therefore not so widely used. Anal¬ 
ysis of polarization effects in biologically relevant 
hydrogen bonded complexes showed that QM/MM 
methods gave good agreement with QM calculations 
(Senthilkumar et al. 2008), giving encouraging results, 
demonstrating that important properties of key biomo- 
lecular interactions can be treated well at the QM/MM 
level employing good-quality levels of QM theory. 

In biomolecules, the interaction between the 
QM and MM regions can be divided into bonded and 
nonbonded interactions. Bonded interactions between 
QM and MM atoms are usually treated at the MM level 
and are typically applied whenever at least one 
QM atom is involved in the bond, bond angle or dihe¬ 
dral (Field et al. 1990). Nonbonded interactions 
can be further separated into electrostatic and non¬ 
electrostatic interactions. Electrostatic interactions 
are accounted for by the point charges of 
the MM atoms (see above), and the non-electrostatic 
interactions are included by van der Waals 
(e.g., Lennard-Jones potential) term. The accuracy of 
the QM/MM interaction relies on the choice of the MM 
point charges and the van der Waals parameters. These 
depend on the MM force field: It is common practice 
to use the standard MM parameter set associated with 
the force field. MM parameters often must be devel¬ 
oped for nonstandard molecules such as ligands and 
cofactors. 


Q 


Coupling Between the QM and MM Regions 

There are two particularly important considerations for 
the treatment of the interaction between the QM and 
MM regions: (1) the treatment of non-covalent inter¬ 
actions between QM and MM atoms, and (2) where 
covalent bonds exist between atoms in the QM and 
MM regions, the treatment of bonds at the frontier 
between the two regions. The simplest QM/MM 
implementations involve “mechanical” embedding of 
the QM region in the MM environment and the inter¬ 
action between the two regions is treated at the lower 
level of theory (e.g., the ONIOM method). Most cur¬ 
rent QM/MM calculations allow the QM region to be 


Partitioning of the QM and MM Regions 

In most enzyme reactions, the dehnition of a suitable QM 
region requires the “breaking” of a covalent bond at the 
frontier between the QM and MM regions. This requires 
special consideration because the QM part treats elec¬ 
trons explicitly whereas the MM region does not, so 
potentially leaving the QM atom with an unpaired elec¬ 
tron. Different methods have been developed to over¬ 
come this problem, which generally involve either the 
addition of an atom or pseudo-atom to hll the valence of 
the QM atom or a special treatment of the frontier 
bond orbital. Approaches include: the link atom method 
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(Singh and Kollman 1986), generalized hybrid orbitals 
(Gao et al. 1998), or the local self-consistent field (LSCF) 
method (Thery et al. 1994). The link atom approach is the 
simplest and most widely used partitioning scheme. 
Konig et al. (2005) tested a number of different QM/ 
MM partitioning methods based on the link atom 
approach, including studies of enzymatic reactions. The 
effect of the link atom method was assessed through the 
calculation of several properties such as deprotonation 
energies, dipole moments, and energetic of proton trans¬ 
fer reactions. Activation barriers and reaction energies for 
proton transfer reactions were found to be fairly insensi¬ 
tive to the particular link atom scheme, because of can¬ 
celation of errors, for reactions in the gas phase and in 
enzymes. This is encouraging in indicating that a simple 
QM/MM partitioning scheme can give useful results in 
QM/MM calculations on enzymes. 

Cross-References 

► Computational Enzymology 

► ONIOM 
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Quantum Chemical Cluster Approach to 
Modeling Enzyme-Catalyzed Reactions 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Definition 

An approach to modeling enzyme reactions that 
involves selection of key residues in the active site 
and using quantum-mechanical methods to calculate 
minima and transition states of the reaction. 

Basic Characteristics 

The quantum-mechanical cluster approach for model¬ 
ing enzyme reactions is based on selecting a limited 
number of atoms from the enzyme to represent a model 
of the active site. The method can be seen as an alter¬ 
native to QM/MM approaches for calculating potential 
energy barriers and reaction energies in enzyme 
reactions. It has proved very useful in modeling 
enzyme-catalyzed reaction mechanisms, for example, 
identifying likely mechanisms (including novel mech¬ 
anisms) and ruling out other proposals as unlikely. 
Applications of the method have been reviewed (Sieg- 
bahn and Himo 2009; Noodleman et al. 2004). Initial 
positions of the atoms in the model are typically based 
on those found by protein X-ray crystallography. All 
other atoms in the enzyme are ignored. The selected 
atoms are then treated quantum mechanically at the 
highest computationally affordable level, which usu¬ 
ally means hybrid density functional theory (DFT): 
Many DFT functionals provide a good combination 
of accuracy and computational efficiency. To account 
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for the conformational restraints of the enzyme envi¬ 
ronment, atoms in the periphery of the model are usually 
held fixed to their initial positions. As an approximation 
of effects of the enzyme environment, a polarizable 
continuum model (or alternative similar method) is 
often used: The sensitivity of the results to the choice 
of the dielectric constant can be tested. In early studies, 
typically the models used contained 20-30 atoms and 
geometries of minima and transition states were opti¬ 
mized without further positional restraints. Currently, 
models with up to 200 atoms are possible in relatively 
standard calculations. The larger model size should 
improve accuracy in some respects, but also introduces 
several new problems for modeling. For example, the 
larger the model is, the larger the number of local 
minima will be. 

Cross-References 

► Macromolecular Crystallography: Overview 

► QM/MM Methods 
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Quantum Effects/Tunneling 

Kara E. Ranaghan 

Centre for Computational Chemistry, School of 
Chemistry, University of Bristol, Bristol, UK 

Definition 

Quantum mechanical tunneling is the penetration of 
a particle into a region forbidden in classical mechan¬ 
ics as the particle lacks the energy to reach it. The 
phenomenon of tunneling is a quantum mechanical 
effect: it can only be explained when the wave-like 
nature of matter is considered. 


Basic Characteristics 

Kinetic isotope effects (KIEs) are the main experimen¬ 
tal indicator of quantum effects in enzyme-catalyzed 
reactions (Allemann and Scrutton 2009). Most enzyme 
reactions involve the transfer of hydrogen (H) in some 
form, i.e., as a proton (H-t), hydride (H—), or hydrogen 
atom. Replacement of the transferring particle by 
a heavier isotope of hydrogen (tritium (T) or deuterium 
(D)), or isotopic substitution elsewhere (e.g., at other 
positions of the substrate) can lead to measurable 
(sometimes large) changes in reaction rate (i.e., 
a KIE), for a chemical reaction step or the overall 
reaction. In many cases, KIEs can be understood 
mainly in terms of the difference between the zero- 
point energies of the reactants and TS. A number of 
experimental studies of enzyme kinetics have shown 
that in some cases, the observed rate coefficients for 
protio-, deuterio-, or tritio-substrates (^h> and k-x, 
respectively) cannot be accounted for in this way, and 
that the observed rates must instead be due in large part 
to tunneling through the classical barriers (Pu et al. 
2006; Ranaghan and Mulholland 2010). In some cases, 
the primary kinetic isotope effects found on substitu¬ 
tion of hydrogen by the heavier isotopes deuterium or 
tritium are very large and sometimes show unusual 
apparent temperature dependence. One of the largest 
primary H/D KIEs for an enzyme reaction (^80 at 
room temperature) has been observed for soybean 
lipoxygenase-1 (SLO-I); it is only weakly temperature 
dependent (Knapp et al. 2002). 

A controversial issue is the potential role of protein 
dynamics in tunneling in enzymes (Glowacki et al. 
2012; Hay and Scrutton 2012). Some workers believe 
that the temperature dependence of the KIEs indicates 
that motions of the protein promote tunneling and it 
is sometimes suggested that quantum effects cannot 
be reconciled within the framework of transition state 
theory (TST) (Klinman 2009; Hay and Scrutton 2012). 
TST (and variational transition state theory, VTST, 
with multidimensional corrections for tunneling) 
have been successfully applied to several enzymes 
which exhibit significant kinetic isotope effects 
(Ranaghan and Mulholland 2010). Glowacki et al. 
have shown that a multidimensional energy landscape 
with enzymes existing in two states with different 
reactivities can explain the available kinetic data for 
SLO-1 and aromatic amine dehydrogenase (AADH) 
within the framework of TST (Glowacki et al. 2012). 
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Molecular calculations also indicate that, while tunnel¬ 
ing occurs in enzymatic reactions, the effect is not 
signihcantly catalytic because the degree of tunneling 
is similar in enzymes and their equivalent reference 
reactions, i.e., in solution (Kamerlin and Warshel 
2010). The apparent temperature independence of 
measured KIEs in many enzymes has also been 
questioned, because the narrow range of temperatures 
accessible experimentally may be too small to estab¬ 
lish temperature independence with any certainty 
(Siebrand and Smedarchina 2004). 

Modeling enzyme-catalyzed reactions involving 
quantum tunneling is challenging, because of the inher¬ 
ently multidimensional nature of tunneling, and the diffi¬ 
culty of carrying out quantum simulations of the 
dynamics of (parts of) large polyatomic systems such as 
enzymes, incorporating the effects of the protein and 
solvent environment on the reaction at the active site. 
Early theoretical approaches (and indeed many current 
studies) applied simple, empirical low-dimensional 
models, which can be of heuristic value in rationalizing 
experimental observations and suggesting hypotheses, 
but ultimately may be of little predictive value. Such 
simplihed models may also be based on incorrect 
assumptions, or fail to include essential details. To test 
hypotheses, interpret (and predict) experimental results, 
and analyze such reactions in detail, atomistic simulations 
based on the fundamental physics will be (and are prov¬ 
ing to be) indispensable. 


Cross-References 

► Dihydrofolate Reductase - Computational Studies 

► Kinetic Isotope Effects 

► Methylamine Dehydrogenase (MADH): Studies 
with Variational Transition State Theory (VTST) 

► Protein Dynamics 

► Protein Dynamics in Catalysis - Computational 
Studies 
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Quantum Mechanical Simulations of 
Biopolymer Vibrational Spectra 

Timothy A. Keiderling 

Department of Chemistry, University of Illinois at 
Chicago, Chicago, IE, USA 

Synonyms 

Simulations of IR and Raman spectra 


Definition 

Quantum mechanical (QM) simulations of vibrational 
spectra use fundamental QM methods to determine the 
force field (EE) and predict the frequencies and inten¬ 
sities of IR and Raman spectral transitions with varying 
degrees of approximations appropriate to biopolymer 
applications. 

Introduction 

Vibrational spectroscopic methods provide an impor¬ 
tant experimental data for studies of biological mole¬ 
cules, peptides, proteins, nucleic acids, and lipids, and 
are often interpreted via empirical correlation of fre¬ 
quencies and intensities with molecular properties 
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Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra, Fig. 1 Amide vibrational modes. Sche¬ 
matic representation of three normal modes of vibration of the 
amide group which are most important for peptide and protein 
secondary structure analyses: (a) amide I, (b) amide II, (c) amide III 

(Hering and Haris 2009; Krimm and Bandekar 1986; 
Mantsch and Chapman 1996; Tsuboi 1987). However, 
to develop deeper understanding of the observed spec¬ 
tra and the changes they undergo when structure or 
environment vary, theoretical simulations of the spec¬ 
tra are needed. Initially, empirical models for force 
field (FF) calculations based on parameters derived 
from fitting spectra of small model molecules were 
extended to larger biomolecules, sometimes including 
transition dipole coupling (TDC) interactions (Krimm 
and Bandekar 1986). In the last decade or so, ab initio 
quantum mechanical (QM) computations of complete 
FFs have provided a powerful means for determination 
of vibrational frequencies and intensities independent 
of empirical parameters (Krimm 1999; Pulay 1995). 
Most of these developments have addressed peptide 
vibrations, in particular their amide-based modes, such 
as illustrated in Fig. 1 , which provide sensitive probes of 
the secondary structure and can be utilized in both infra¬ 
red (IR) and Raman studies. However, alternative (nec¬ 
essarily larger and more approximate) computational 
efforts for simulating spectra for nucleic acids and even 
complex protein stmctures are now available (Kubelka 
et al. 2009). Models of lipid model vibrations focussed 
on the hydrocarbon tails as derived from FFs which 
developed over many years from polymer studies. 

While IR and Raman biopolymer studies primarily 
depend on frequencies of component bands, the inten¬ 
sities and polarizations of individual modes provide 
added insights necessary for modeling bandshapes of 
overlapping transitions. In addition, polarization prop¬ 
erties enhance structural sensitivity, such as utilized in 
linear dichroism (LD) of oriented samples, including 
lipid bilayers. Polarization is also critical for study 
of solution phase chiral molecules with vibrational circu¬ 
lar dichroism (VCD) and Raman optical activity (ROA), 


whose sign patterns are conformationally determined. 
QM modeling of intensities provides theoretical tools 
to fully model these highly sensitive conformational 
probes (Kubelka et al. 2009). 

As vibrational spectra have become more broadly 
used, more subtle aspects of their ability to probe 
structure arose. Thus, it became necessary to interpret 
small changes in frequencies and intensities due to 
environmental variations, such as protonation of spe¬ 
cific sites in a protein, or conformational changes, such 
as unfolding a helical segment. New tools such as 
nonlinear 2-D IR and related Raman methods have 
exposed couplings for specific modes not previously 
identifiable within collective polymer modes. Selec¬ 
tive isotope labeling gives vibrational spectra a site- 
specific structural sensitivity, even though such spectra 
are not normally resolvable. For these spectral proper¬ 
ties, QM modeling provides a model-free interpreta¬ 
tion, which is not possible with empirical models. 

The development of QM-based FFs and intensity 
parameters for peptides and their application to vibra¬ 
tional spectra are the topics of this review. At present, 
while empirical methods are still employed to interpret 
spectra and particularly coupling between local modes, 
these remain restricted to assumptions based on models 
for selected modes. Detailed description of complete 
vibrational spectra have mostly involved calculations 
using density functional theory (DFT) methods, which 
provide better agreement with the experiment, some par¬ 
tial accommodation for configuration interaction, and 
quicker, less expensive computations than traditional 
methods based on Hartee-Fock theory and its extensions. 
Simulations of vibrational spectra for larger biopolymers 
are approximated by transfer of DFT vibrational param¬ 
eters using various methods. Corrections of simulated 
spectra for solvent and other structural perturbations 
have also been proposed. This section provides a simple 
overview of vibrational theory and its biopolymer appli¬ 
cations for linear vibrational spectra analyses; more 
extensive reviews are available elsewhere (Choi and 
Cho 2009; Krimm 2000; Kubelka et al. 2009). 

Empirical Calculations of Peptide Vibrational 
Properties 

A TDC mechanism to couple identical, repeating local 
amide oscillators into exciton-like distributed modes 
provided an initial representation of spectral differences 
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for a-helix and P-sheet conformations for peptides in the 
IR (Miyazawa 1960). This general approach has been 
broadened by inclusion of various coupling mechanisms 
(through-bond, hydrogen bond, and TDC) and exten¬ 
sion to heterostructure models such as found in globular 
proteins. Inclusion of other oscillators, in a more general 
approach, was particularly useful in applying this 
method to DNA modeling, where the bases alternate 
and thus the oscillators are not identical (Tsuboi 1987). 
Methods based on dipole coupling automatically predict 
IR intensities by mixing the transition dipoles of the 
individual oscillators, according to the contributions of 
each local oscillator in the coupled mode. 

In a more comprehensive approach, all atom empir¬ 
ical force helds (FF) were derived by adjusting internal 
coordinate force constants (bond stretches and bends) 
based on the detailed structure of the molecule to fit 
observed frequencies of small molecules and transfer 
of those to larger (biopolymer) models. Systematic 
development of the force fields for polypeptides was 
undertaken by Krimm and coworkers by refinement of 
the empirical force field parameters to fit polypeptide 
experimental data (Krimm and Bandekar 1986). Long- 
range coupling of local modes via TDC was also incor¬ 
porated into these FF, which improved agreement with 
the experiment. Alternatively, ab initio QM FF param¬ 
eters from model amides or small peptides have been 
transferred to form a polypeptide FF (Colvin et al. 
2006; Krimm and Reisdorf 1994; Krimm 1999). 
Approximations of the IR spectral intensities within 
these empirical approaches can be derived from bond 
dipoles or atomic partial charges partitioned among the 
modes derived from the empirical FFs. These calcula¬ 
tions are only sufficient for qualitative IR spectra, such 
as differentiating strongest and weakest bands, but are 
less useful for Raman (which require polarizability 
tensors) or for VCD and ROA. In the latter, spectral 
bandshapes are critically dependent on the relative 
intensities and the sign patterns of the differential 
spectral contributions from band components, which 
require a QM model to properly model through-bond 
effects (coupling) on the wavefunction. 


Quantum Mechanical Calculations of 
Vibrational Spectra 

In the standard Bom-Oppenheimer (BO) approxima¬ 
tion approach to molecular vibrational potential 


surfaces, the total molecular wavefunction is 
represented as T(r,R) = (l)^i{r,R)Xv(R) where );u(R) 
represents the wavefunction corresponding to nuclear 
motion, primarily vibrations, and (/)ei(r>R) is the elec¬ 
tronic function. Solving the electronic Schrbdinger 
equation yields an electronic energy for the ground 
state, £(R/), which combined with the nuclear repul¬ 
sion, V„„iRi), forms an effective (average) potential 
energy term for the nuclei. Since this is a complex 
function of all the nuclear position variables, R,, the 
potential is normally expanded in their displacements, 
AR,, from equilibrium, and only the quadratic (har¬ 
monic) term is retained, yielding a Hamiltonian: 



( 1 ) 


where R, are the nuclear momenta and the Cartesian 
force constant matrix. 


Fij = 


d^s 

dRidRj 


( 2 ) 


is the force field (FF, also referred to as the Hessian). 
Fij is the key property computed in QM electronic 
structure calculation programs, as obtained either by 
finite displacement or direct analytical evaluation of 
the second derivatives. 

In the harmonic approximation, the multidimensional 
Hamiltonian can be reduced to a sum of independent one¬ 
dimensional operators, h/;(Qk), by a unitary transforma¬ 
tion to normal coordinates, to yield a vibrational 
equation of 


3N 


3N 




dQj 


(OiQi ]Xi{Qi) 


= ErXiiQi), i=l,2,...,3N 


(3) 


The vibrational frequencies and motions then reflect 
the harmonic oscillator energies, £,,,, and wavefunctions, 
Xi, respectively, as analytical solutions to the 1-D 
harmonic oscillator: 


= Ti(Oi ( 2 + 'X-oi (“^i) = " 


( 4 ) 
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where d, is the quantum number, and 

A|//u ((^,) are the normalized Hermite polynomials. 

Once the molecular FF, is known, calculation 
of the vibrational frequencies and normal modes of 
vibration is straightforward: the Cartesian force con¬ 
stant matrix is mass weighted and diagonalized and 
the resulting diagonal constants are related to the 
square of the fundamental transition quantum ener¬ 
gies (u, = 0^1), = ! for each normal mode /). If QM 
FF calculations are done in Cartesian coordinates, six 
eigenvalues have E, = 0 and represent the rotational 
and translational motions. The accuracy of vibrational 
frequencies and normal modes is determined by the 
accuracy of the FF, and the structural model, which 
then become the major issues for vibrational spectral 
calculations. 

IR intensities are expressed as the dipolar strength, 
Duo, the square of the electric transition dipole 
moment: 

Dm = fio„ ■ fiM = Ia'ooI^ = (5) 


where denotes the electric dipole moment operator 
and D is the vibrational quantum number. The 
harmonic selection rules allow only Avj = ±1, usually 
0^1 transitions, for one mode, Qj, at a time. 
Expressing this as a function of the normal 
modes, Qj, the ath component (e.g., x, y, z) of the 
transition dipole moment for the normal mode j 
becomes: 


(XiHlxo) 




(6) 


where 5 is a representation of the nuclear motion, 
or the normal mode, and the atomic polar tensor, 
P^(a,P), for atom A (APT or “dipole derivative”) is 
defined as: 


with being the electronic, and 

the nuclear (where Z^e is the nuclear charge 

of atom A), contributions to the APT. The 


electronic contribution to the dipole derivative can 
be represented as: 

dH\ ( d^c \ \ 

pDa.) 0 \dRA,p) 0 / 

( 8 ) 

where tpg is the electronic ground state wavefunction, 
F is the electric field, and the electric field derivative 
of the Hamiltonian is the dipole operator: (0/// 
= —Rei.x- The derivatives can be obtained either 
numerically or, more accurately and efficiently, by use 
of analytical derivative techniques. 

Raman intensities can be derived from the square of 
the change of polarizability in the mode, Qj, but must 
also account for excitation frequency and for the 
experimental setup (polarizations) yielding various 
possible expressions. Eor VCD and ROA, the magnetic 
dipolar and electric quadrupolar interactions must also 
be included, which represent higher order interactions 
with the field. VCD is simpler, with its intensity being 
proportional to the rotational strength, R„o, 



Dm = Im(po„ ■ m^) = ^ \{Xu\Ea\Xo){Xo\m^\Xv)\ 

a 

(9) 


where are the components of the magnetic dipole 
moment operator. This poses a special problem 
since the magnetic, Wq,, matrix elements can be origin 
dependent, which is typically avoided by the use of 
gauge invariant atomic orbitals (GIAO). Finally, since 
absorption (IR, VCD) is a one-photon process and 
scattering (Raman, ROA) is a two-photon process, 
the latter has added measurement possibilities to dis¬ 
criminate possible intensity mechanisms. 

Ab initio FF and intensity computations initially 
used Hartree-Fock (HF) methods with finite deriva¬ 
tives, but then adopted more efficient and accurate 
analytical methods (Pulay 1995). HF-level calculations 
are systematically in error (frequencies ^10% too high), 
so a series of scaling methods were incorporated to facil¬ 
itate comparison to experiment. Higher level-correlated 
calculations (e.g., MP2) provide improved results, but 
with much greater computational cost. DFT methods, 
which optimize wavefunctions to yield independent 
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particle electron densities, p, and include correlation cor¬ 
rection, were then adapted to IR, Raman, VCD and now 
ROA calculations enabling more accurate modeling even 
for larger molecules (Kubelka et al. 2009; Stephens and 
Devlin 2007). Accuracy approaching that of MP2 calcu¬ 
lations could be obtained with some DFT functionals 
using CPU times comparable to those needed for HF 
calculations. However, DFT incorporates the functional 
effectively as a parameter, since there is not a systematic 
method like the variational principle for its choice. Expe¬ 
rience now provides guidance based on conditions 
whereby the results obtained with selected standard func¬ 
tionals and basis sets have been shown to be reliable. DFT 
provides a compromise between accuracy and computer 
resources, which is ideal for biopolymer applications. 
A detailed account of DFT methodology is beyond the 
scope of this entry but is widely available (Bartolotti and 
Hurchick 1996; Becke 1995; McGill 1998; Parr and 
Yang 1995; St-Amant 1996). Several QM codes have 
been developed that incorporate DFT calculations, 
allowing the user to access these tools relatively straight¬ 
forwardly by patterning new calculations on standard 
approaches used for other related systems. Perhaps 
the most commonly used package is the Gaussian suite 
of programs, which, although quite complex and multi¬ 
faceted, is user accessible. CADPAC and the Dalton 
programs are more specialized for molecular electromag¬ 
netic property calculations, and the latter is freely avail¬ 
able. Turbomole and the Amsterdam density functional 
package (ADF) can also compute IR and VCD 
intensities. 

The quality of DFT calculations depends on the 
exchange-correlation (XC) energy, Excip), and devel¬ 
opment of approximations for exchange and correla¬ 
tion density functionals is a key for success of DFT 
methods as reviewed by Becke (Becke 1995). For 
biopolymer calculations, generalized gradient approx¬ 
imation (GGA) functionals, which correct for over¬ 
estimation of binding energies by addition of terms 
dependent on the gradient of the density, are useful. 
Combining GGA functionals to include the exchange 
of Becke (B) and the correlation of Lee, Yang and 
Parr (LYP) or Perdew and Wang (PW91) yield the 
often used BLYP and BPW91 functionals (Parr and 
Yang 1995). PW91 uses no empirical parameters, 
therefore, it is rigorously ah initio, while the Becke 
exchange functional contains a parameter that was fit 
to the exact exchange energies of noble gas atoms. 
In addition, many labs use hybrid functionals, which 


incorporate the Fock exchange integral and have added 
parameters, such as the widely employed B3LYP. 
While B3LYP gives good agreement across a range 
of vibrational modes, BPW91 is faster and gives better 
amide frequencies for peptides. 

Both HF and DFT methods use linear combinations 
of atomic orbitals (LCAO) to form molecular orbitals 
(MO), and most quantum chemical programs use 
Gaussian-type AOs (GTOs). Larger basis sets provide 
better results but at the cost of longer computational 
times. T is a major issue for biopolymer studies since 
cost (CPU time) for FF calculations grows as >N^, 
where N is the number of basis functions used. Incom¬ 
plete basis sets can cause significant FF error, so selec¬ 
tion is important. Most FF calculations are done 
satisfactorily with polarized split valence basis sets 
(double and triple zeta), including added functions 
with higher angular momentum, for example, 6-3IG 
(d) (or 6-3IG*), which adds a d-type function for all 
heavy (second row) atoms. Diffuse basis sets, such as 
6-31-i-i-G, contain larger radius 2s orbitals for hydro¬ 
gen and 3p orbitals for second row atoms which can aid 
in computing polarizability, H-bonds and solvation 
effects, but impose a significant computational burden. 
Effects of solvent can be partially accommodated by 
inclusion of a polarized continuum model (PCM) cor¬ 
rection, which improves the EE and particularly amide 
frequencies, but better results are obtained by inclusion 
of explicit solvent to represent H-bonds to water 
(Kubelka et al. 2005). 

Model IR and VCD Simulations 

Already at the dipeptide level, the computed spectra 
for constrained helical or sheet conformations have 
characteristic features of the specific secondary struc¬ 
tures due to the dominance of near-neighbor coupling. 
Tripeptides give similar results, and show a clear dis¬ 
crimination between helix, sheet, and coil (disordered) 
forms, as seen in Fig. 2; however, FFs for these short 
peptides do not focus intensity into single delocalized 
modes as is characteristic of exciton coupling in poly¬ 
mers (Bour et al. 2000; Kubelka et al. 2002, 2009). 

Development of improved computer systems, 
codes, and DFT methods made it possible to compute 
vibrational spectra for moderately large peptide struc¬ 
tures that include H-bonds between residues. As shown 
in Fig. 3, with increased length of the peptide, the 
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Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra, Fig. 2 Simulated tripeptide IR, VCD, 
Raman spectra. Simulated IR {top), VCD (middle) and Raman 
(bottom) spectra for triamides Ac-Ala 2 -NH-CH 3 in (a) a-heli- 
cal, (b) 3 10 -helical, and (c) left-handed 3 1 -helical (PolyProII- 


like) conformations, calculated at the DPT: BPW91/6-31G* 
level. Vertical lines indicate the positions and relative intensities 
of the normal modes. Envelopes represent sums over Lorentzian 
broadened bandshapes centered on the mode positions to give 
a representative spectrum in molar units (for IR and VCD) 



Wavenumber (cm-i) 


Q 


Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra, Fig. 3 Simulated length and optimiza¬ 
tion effects, helical IR and VCD. Simulated IR and VCD spectra 
for longer ot-helical model oligopeptides at the DPT BPW91/6- 
31G(d) level: (a) idealized a-helical hepta-amide Ac-fAla)^- 


NPI-CPI 3 , (b) idealized a-helical undeca-amide Ac-(Ala)io- 
NH-CPI 3 , and (c) fully optimized deca-amide Ac-(Ala) 9 -NH- 
CPI 3 . Intensity representation for the modes and envelop as 
in Pig. 2 


IR intensity builds in one component and the amide 
I becomes the dominant spectral feature, which is 
a characteristic of extended repeating structures. The 
differences between the fully minimized and an ideal 
oligomer (Fig. 3h, c) are minor. Many simulations of 
vibrational spectra for other oligopeptide structures. 


including various helices, sheets, and mixed structures 
such as hairpins based on idealized geometries as 
well as on x-ray and NMR-derived structures are now 
available (Kubelka et al. 2009). 

Some general patterns emerge from these analyses 
that fit experimental observations quite well. However, 
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Wavenumber (cm”'') 


Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra, Fig. 4 Simulation of IR, VCD, and 
Raman for long peptides using transferred FF parameters. 
Raman {top), IR {middle) and VCD {bottom) spectra of an 
alanine 21 -amide peptide, Ac-Ala 2 o-NFI-CFl 3 , calculated by 


CCT transfer of FF, APT, and AAT parameters from shorter 
fragments having (ct),v|/) appropriate for an (a) a-helix, (b) 3io- 
helix, and (c) 3i-helix. The large intensity in the Raman at 
~ 1,500 cm“* is due to the -CFI 3 groups, which are computed 
at too high wavenumber and overlap the amide II position 


the DFT-calculated frequencies for the amide I, II, 
and III modes are all significantly high, the amide 
I being the worst. This error is intrinsic to the method 
and is primarily due to neglect of solvation effects, 
which if corrected, result in reduced error that is more 
uniform for all the modes. However, correcting for 
solvent is not vital, since the shifts do not significantly 
affect the local mode interactions, which are the 
important part of the FF for structural analyses. 
Early methods scaled the FF to better align computa¬ 
tions with experiment, but since such scale factors are 
empirical parameters, it is better to account for the 
error in the analyses, since it is well-understood. As 
noted above, longer helical sequences have sharper 
bands and thus appear to be more intense, mostly due 
to conservation of dipole moment now focused in 
a few excitonic modes, but in part due to coupling to 
the helical macrodipole. Similar behavior is seen in 
sheets, with very flat extended sheets giving rise to 
sharp, intense amide 1 bands at lower frequencies. 
In addition as the helices and sheets get longer or 
more extended, the dominant amide I frequency 
does tend to decrease. Conversely, the frequency 
increases when a sheet becomes more twisted, 
since with increased twist the H-bond pattern 
becomes very distorted for multiple strands (Kubelka 
and Keiderling 2001). 


Larger Systems, Approximations 

DFT-based methods have been extended to very large 
systems including proteins (Grahnen et al. 2010), 
models of fibrils, and nucleic acids by fragmenting 
the structure and transferring FF and APT parameters 
from DFT calculations on the fragments to the larger 
biopolymer structure (Fig. 4). Our methods have used 
sequences of peptides that encompass local interac¬ 
tions between residues at the DFT level, and can 
include longer range interactions at a lower level, 
much as would be done with a QM/MM approach 
that uses classical molecular mechanics (MM) for the 
lower level. An alternate method, initiated by Torii and 
expanded by Cho and coworkers has created maps of 
residue-residue interactions as a function of 4>, v|/ 
backbone torsional angles worked out using a series 
of HF or DFT level calculations. FF parameters were 
transferred from a dipeptide or other simple structure 
to the target larger peptide (or protein) structure 
while adding geometry-specific coupling interactions 
obtained from the interaction map. A transform into 
local modes is needed to create a set of uniform oscil¬ 
lators that can be coupled. The FF obtained tends to 
represent the modes of interest, mostly amide I in the 
results so far available (Choi and Cho 2009; Rhee et al. 
2010). This method allows much quicker calculation 



































Quantum Mechanical Simulations of Biopolymer Vibrational Spectra 


2165 


of spectra for a wide variety of structures. Thus, it is 
adaptable to coupling the QM vibrational spectral rep¬ 
resentation with molecular dynamics (MD) represen¬ 
tations of the ensemble of structures in a fluctuating or 
unfolding system, and thus allows one to model 
dynamics as sensed via spectra. 

An alternative way of dealing with the solvent 
effects has used MD to represent the solvent around 
the peptide. These geometries are converted to an 
effective electric field sensed at the amide group. 
A correction to the FF is parameterized to fit the shifts 
of the diagonal frequencies to the electric field from the 
solvent as the environment fluctuates. This approach 
has gained wider interest as a means of representing 
dynamics in the fluctuating peptide - solvent environ¬ 
ment (Rhee et al. 2010). 

Summary 

Analyses of vibrational spectra have moved from an 
empirical base to a fully ah initio foundation in the last 
decade or more. With such tools, the relationship of 
spectra and structure can be probed in ever more detail, 
allowing biophysical studies to probe more aspects of 
the dynamics and structure of biopolymers. 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Homology Modeling of Protein Structures 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Infrared Spectroscopy of Protein Structure 

► Isotope Edited Raman and Infrared Difference 
Spectroscopy Studies of Protein Structure 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 

► Polarized Infrared Spectroscopy 

► Protein Structural Models - Evaluating Quality 

► QM/MM Methods 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Raman Optical Activity Studies of Structure and 
Behavior of Biomolecules 


► Vibrational Circular Dichroism of Biopolymers 

► Vibrational Spectroscopy with Neutrons 

► Vibrational Spectroscopy, A Short History of 
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Quasi-elastic Light Scattering (QELS) 

► Multiangle Light Scattering from Separated Sam¬ 
ples (MALS with SEC or FEE) 


Quinone Cofactors 

Victor L. Davidson 

Burnett School of Biomedical Sciences, 

College of Medicine, University of Central Florida, 
Orlando, EL, USA 


Definition 

A quinone is a cyclic organic compound that contains 
two carhonyl (C=0) groups that are in the same aro¬ 
matic ring. Quinone cofactors associate with enzymes 


and participate in electron transfer or catalysis or both. 
Quinone cofactors may also be formed by posttransla- 
tional modification of amino acid residues of proteins. 

Basic Characteristics 
Membrane-Bound Quinones 

Quinone molecules within cell membranes (Fig. 1) 
act as electron transfer shuttles between proteins that 
function in respiration and photosynthesis. During 

► mitochondrial electron transport, ubiquinone (also 
called Coenzyme Q) accepts electrons from both 

► Complex 1 and ► Complex II and donates electrons 
to ► Complex III. Ubiquinone is also present in 

► bacterial electron transfer chains, although some 
anaerobic respiratory chains use menaquinone instead. 
During photosynthetic electron transfer, plastoquinone 
transfers electrons from the ► photosystem II reaction 
center to the ► cytochrome bf complex. 

Pyrroloquinoline Quinone (PQQ) 

PQQ (Fig. 2) is an endogenous cofactor which is uti¬ 
lized by several bacterial dehydrogenases that use as 
substrates a variety of alcohols, sugars, and aldehydes 
(Matsushita et al. 2002). This cofactor is relevant to 
electron transfer because in contrast to most dehydroge¬ 
nases, PQQ-dependent enzymes do not use ► nicotin¬ 
amide adenine dinucleotide (NAD"^) or NADP"^ as 
electron acceptors. Instead, soluble PQQ-dependent 
dehydrogenases donate electrons to other soluble ► elec¬ 
tron transfer proteins, usually c-type ► cytochromes, and 
membrane-bound PQQ-dependent dehydrogenases donate 
electrons to ubiquinone (Davidson 2004). The importance 
of PQQ to human metabolism and nutrition is a topic of 
controversy. Although no PQQ-dependent enzymes have 
been found in mammals, there is evidence from experi¬ 
mental animal models that it may be an essential nutrient 
(Rucker et al. 2009). 

Protein-Derived Quinone Cofactors 

Several enzymes have been identified that possess 
protein-derived quinone cofactors (Fig. 3) that are 
formed by posttranslational modification of tyrosine 
or tryptophan residues (Davidson 2011). 

Topaquinone (2,4,5-trihydroxyphenylalanine qui¬ 
none, or TPQ) is the prosthetic group of copper- 
containing amine oxidases present in plants, mammals, 
fungi, and bacteria. These amine oxidases utilize a wide 
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Quinone Cofactors, 

Fig. 1 Membrane-bound 
quinones 
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Quinone Cofactors, Fig. 2 Pyrroloquinoline quinone (PQQ) 


range of substrates and are involved in a wide range of 
physiological functions (Floris and Mondovi 2009). The 
biosynthesis of TPQ which requires the insertion of two 
oxygens into a specihc tyrosine residue, has been shown 
to be a self-processing event which requires molecular 
oxygen as well as copper (Brazeau et al. 2004). The 
overall reaction catalyzed by the TPQ-dependent copper 
amine oxidases is the conversion of a primary amine 
substrate to its corresponding aldehyde plus ammonia 
with molecular oxygen and water as co-suhstrates and 
H 2 O 2 as an additional product. 

Lysine tryptophylquinone (LTQ) is the protein- 
derived cofactor of mammalian lysyl oxidase, an 



Quinone Cofactors, Fig. 3 Protein-derived quinone cofactors. 
The posttranslational modifications to form the cofactors are 
indicated in red 


important enzyme in the metabolism of connective tis¬ 
sue which catalyzes modification of elastin and collagen 
(Kagan and Li 2003). LTQ is formed by posttransla¬ 
tional modification in which one atom of oxygen is 
incorporated into the tyrosine ring and a covalent bond 
is formed between the C5 carhon of the tyrosine ring 
and the side-chain nitrogen of a lysine residue. 

Typtophan tryptophylquinone (TTQ) is the protein- 
derived cofactor of bacterial amine dehydrogenases 
(Davidson 2001). To form TTQ, two oxygens are 
incorporated into a tryptophan side chain to generate 
a quinone, and that side chain also is covalently 
cross-linked to another tryptophan residue. These 
TTQ-dependent dehydrogenases are soluble enzymes 
localized in the periplasmic space of gram-negative 
bacteria. Each is an inducible enzyme that allows the 
host bacterium to utilize particular primary amines 
as a sole source of carbon and energy. Each enzyme 
catalyzes the oxidative deamination of a primary 
amine to yield the corresponding aldehyde and ammo¬ 
nia. During the reductive half-reaction, the quinone 
cofactor transfers electrons to a specific redox protein, 
usually a ► cupredoxin, which mediates the transfer of 
these electrons to the membrane-bound respiratory 
chain. 

Cysteine tryptophylquinone (CTQ) is the protein- 
derived cofactor of bacterial quinohemoprotein amine 


Q 
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dehydrogenases. These enzymes contain CTQ as well 
as two covalently bound c-type ► hemes. To form 
CTQ, two oxygens are incorporated into a tryptophan 
side chain to generate a quinone, and that side chain 
also is covalently cross-linked to a cysteine residue. 
These enzymes catalyze the oxidative deamination of 
a primary amine to yield the corresponding aldehyde 
and ammonia. During the reductive half-reaction, the 
quinone cofactor transfers electrons via the two hemes 
also present in the enzyme to a specific redox protein, 
usually a c-type ► cytochrome, which mediates the 
transfer of these electrons to the membrane-bound 
respiratory chain. 

Cross-References 

► Chlorophylls and Light-Harvesting Complexes 

► Cupredoxins 

► Cytochromes 

► Electron Transfer Proteins: Overview 

► Electron Transport Chains in Bacteria 

► Hemes 

► Mitochondrial Electron Transport 

► NADH-Ubiquinone Oxidoreductase (Complex I) 

► Nicotinamide Adenine Dinucleotide (NAD) 


► Photosystem II 

► Succinate Dehydrogenase (Complex II) 

► Ubiquinol-Cytochrome c Oxidoreductase 
(Complex III) 
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Raman Imaging of Biological Samples 

► Raman Spectroscopy and Microscopy of Cells and 
Tissues 


Raman Microspectrometry of Cells and 
Tissues 

► Raman Spectroscopy and Microscopy of Cells and 
Tissues 


Raman Optical Activity Studies of 
Structure and Behavior of Biomolecules 

Laurence Barron 

Department of Chemistry, University of Glasgow, 
Glasgow, Scotland, UK 

Synonyms 

Vibrational optical activity 

Definition 

Raman optical activity measures optical activity in 
vibrational transitions of chiral molecules. It provides 
information about conformation and absolute configu¬ 
ration, and aqueous solution structure and behavior of 
biomolecules like peptides, proteins, carbohydrates, 
nucleic acids, and viruses. 


Basic Characteristics 

Raman optical activity (ROA) probes molecular 
chirality via a small circular polarization dependence 
of Raman scattering (Nafie 1997; Hug 2002; 
Barron 2004). Whereas UV-visible circular dichroism 
(► Circular Dichroism Spectroscopy of Biomacro¬ 
molecules) measures electronic optical activity, ROA 
measures vibrational optical activity and so, like the 
complementary technique of vibrational circular 
dichroism (► Vibrational Circular Dichroism of Bio¬ 
polymers), provides more stereochemical information. 

ROA is due to different scattering probabilities 
for right and left hand circularly polarized light. 
ROA originates in interference between light waves 
scattered via the polarizability and optical activity 
molecular property tensors. In 1971, L. D. Barron and 
A. D. Buckingham published the definitive theory of 
ROA and introduced the dimensionless circular inten¬ 
sity difference (CID) zl = (/^ —/^)/(/’^-f 7^) as 
a suitable experimental quantity, where 7*^ and 7^ are 
the scattered intensities in right- and left-circularly 
polarized incident light. ROA was first observed in 
1972 by L. D. Barron, M. P. Bogaard, and A. D. 
Buckingham in small chiral organic molecules. 

ROA is also manifested as a small circularly polarized 
component in the scattered beam using unpolarized inci¬ 
dent light (Nafie 1997; Barron 2004). Within the far- 
from-resonance approximation, measurement of this cir¬ 
cular component, called scattered circular polarization 
(SCP) ROA, as (7r — 7l ) / (7r -f 7l ), where 7r and 7l are 
the intensities of the right- and left-circularly polarized 
components of the scattered light, provides equivalent 
information to the CID measurement, called incident 
circular polarization (ICP) ROA. 


G.C.K. Roberts (ed.), Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 
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Raman Optical Activity Studies of Structure and Behavior of Biomolecules 


In terms of the electric dipole—electric dipole 
molecular polarizability tensor and the electric 
dipole—magnetic dipole and electric dipole—electric 
quadrupole optical activity tensors G'afi and A^py, the 
ICP Rayleigh optical activity for backscattering in an 
isotropic sample is (Barron 2004) 

„ = 

c [45a2 + 

The tensor component products have been averaged 
over all orientations of the molecule: a and G' are the 
isotropic invariants, and P{oc)^, P{G')^, and P{A)^ the 
corresponding anisotropic invariants. This equation 
applies to Rayleigh scattering. For Raman scattering, 
the molecular property tensors are replaced by 
corresponding vibrational Raman transition tensors. 
Although ROA may be measured using different 
scattering directions, backscattering boosts the ROA rel¬ 
ative to the background Raman intensity and is usually 
the best experimental strategy for biomolecules in aque¬ 
ous solution (Barron et al. 2000). 

Although ICP ROA instruments established ROA 
in stereochemistry and biomolecular science (Barron 
et al. 2000), studies were restricted by the delicate 
nature of the measurements. This situation improved 
following the introduction of a new design of 
instrument based on the use of the SCP strategy 
(Hug 2002). A commercial instrument based on this 
design that incorporates a sophisticated artifact sup¬ 
pression protocol is available (the GAimXRAMAN from 
BioTools, Inc.). 

By comparison with observed spectra, calculated 
ROA spectra (Ruud and Thorvaldsen 2009) can pro¬ 
vide the complete solution structure (conformation, 
absolute configuration, conformational populations) 
of molecules containing up to around 200 atoms. 
ROA calculations may be performed using the 
DALTON (http://www.kjemi.uio.no/software/dalton/ 
dalton.html) and Gaussian (http://www.gaussian.com) 
software packages. 

ROA of Biomolecules 

ROA spectra of biomolecules in aqueous solution, 
measured over the spectral range ^200-1,800 cm“\ 
provide more informative, less complex spectra than 
conventional infrared or Raman since the largest sig¬ 
nals are often associated with vibrations that sample 


the most rigid chiral parts of the structure (Barron et al. 
2000). These are usually within the backbone and often 
generate ROA band patterns characteristic of the back¬ 
bone conformation. Proteins are favorable in this 
respect since signals from the peptide backbone usu¬ 
ally dominate the ROA spectrum, unlike the parent 
Raman spectrum in which bands from the amino acid 
side chains often obscure the peptide backbone bands. 
Carbohydrate ROA spectra are similarly dominated by 
signals from skeletal vibrations, in this case centered 
on the constituent sugar rings and connecting glyco- 
sidic links. Although the parent Raman spectra of 
nucleic acids (► Infrared Spectroscopy of DNA) are 
dominated by bands from the intrinsic base vibrations, 
their ROA spectra tend to be dominated by bands 
characteristic of the stereochemical dispositions of 
the bases with respect to each other and to the sugar 
rings, together with signals from the sugar-phosphate 
backbone. 

Individual protein ROA bands may be assigned to 
elements of secondary structure, but the presence of 
bands originating in loops and turns also leads to 
overall ROA band patterns characteristic of the fold. 
Figure 1 displays the backscattered SCP ROA spectra 
of three proteins with different folds, namely human 
serum albumin (SCOP class: all a; fold: serum- 
albumin-like), human immunoglobulin G (all (3; 
immunoglobulin-like), and bovine P-casein (natively 
unfolded). The ROA band pattern for serum albumin 
reflects the large amount of extended a-helix it con¬ 
tains, and that for immunoglobulin G reflects the large 
amount of antiparallel P-sheet within each of its 12 
P-sandwich domains. The ROA spectrum of P-casein 
is similar to that of disordered polypeptides, which are 
thought to contain large amounts of poly(L-proline)II 
(PPII) helix (Barron et al. 2002). Some side-chain 
vibrations, especially those associated with tryptophan 
and phenylalanine, also generate useful ROA signals. 

The many structure-sensitive bands in protein ROA 
spectra makes them suitable for multivariate analysis 
techniques such as principal component analysis 
(PCA) and nonlinear mapping (NLM) to deduce struc¬ 
tural relationships among proteins. A two-dimensional 
NLM plot for a set of 80 polypeptide, protein and virus 
ROA spectra shows tight clustering corresponding to 
different types of structure (Zhu et al. 2006). Clusters 
corresponding to the following structural classes are 
observed: all a, mainly a, ap, mainly p, all P, mainly 
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Natively unfolded 


Raman Optical Activity Studies of Structure and Behavior 
of Biomolecules, Fig. 1 Backscattered SCP Raman (/r + IC) 
and ROA (/r — /l) spectra of (a) human serum albumin, (b) 
human immunoglobulin G, and (c) bovine P-casein, all in 


aqueous solution and recorded on the ChiralRAMAA instrument. 
Experimental conditions: laser wavelength 532 nm, laser power 
at the sample ~500 mW, protein concentrations ~50 mg/mL 
(sample volume ~30 pL), and acquisition times ~5 h 


disordered/irregular and all disordered/irregular. The 
average standardized ROA spectra of the polypeptides 
and proteins within each structure class in the NLM 
plot are shown in Fig. 2. Each has several distinct 
features characteristic of the type of structure. The 
average of all disordered/irregular ROA spectrum is 
similar to that of PPII helix. 

ROA is also useful in the study of proteins that are 
unfolded in their native functional state. Residual strac- 
ture in such intrinsically disordered proteins (IDPs) may 


be characterized using multivariate analysis and ah initio 
simulation of ROA spectra (Zhu et al. 2008). ROA is also 
valuable for studying misfolding (► Infrared Spectros¬ 
copy of Protein Folding, Misfolding and Aggregation) 
and disease (► Amyloid formation, ► Infrared Spectros¬ 
copy of ftotein Folding, Misfolding and Aggregation) 
associated with IDPs (Barron et al. 2002). Intact glyco¬ 
proteins (► Glycoproteins), which are difficult to study 
using conventional techniques, give ROA spectra with 
clear bands originating in both the polypeptide and 
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Wavenumber/cm ^ 

Raman Optical Activity Studies of Structure and Behavior 
of Biomolecules, Fig. 2 Averages of the backscattered ROA 
spectra (ICP and SCP) in aqueous solution for the seven main 
protein structure classes within a set of 80 polypeptide, protein, 
and virus ROA spectra (Adapted from Zhu et al. [2006]) 


carbohydrate components and which provide informa¬ 
tion about the structure of both (Barron 2006). ROA 
spectra may also be obtained for most types of virus in 
aqueous solution, including filamentous, helical, rod 
shaped, and icosahedral, from which information about 
the structure of the protein and nucleic acid constituents 
of an intact virus may be deduced (Blanch et al. 2002). 

Dynamics and Behavior 

ROA intensity depends on absolute chirality in the 
arrangement of bonds, with cancelation of contribu¬ 
tions from structures with “opposite” chirality as 
a mobile structure explores the range of accessible 
conformations, resulting in enhanced sensitivity to 
dynamics of biomolecular structure. In contrast, 
observables that are “blind” to chirality, such as 


conventional Raman intensities, are generally additive 
and consequently less sensitive. This enhanced sensi¬ 
tivity of ROA to mobility is valuable in monitoring 
partially structured intermediate states accessed by 
proteins as they unfold under the influence of heat 
and/or reduced pH (Barron et al. 2002), and in 
detecting glass-like transitions in nucleic acids (Barron 
et al. 2000). 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► (Glyco)Protein Folding Disorders 

► Glycoproteins 

► Infrared Spectroscopy of DNA 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Raman Spectroscopy 

► Vibrational Circular Dichroism of Biopolymers 
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Definition 

After a brief discussion of the theoretical background of 
Raman scattering, this entry considers the instrumental 
and experimental aspects of this technique. Starting from 
the light sources, biters, spectrometers, and detectors 
used for standard Raman and resonance Raman spectros¬ 
copy, the discussion leads to sophisticated ways to sup¬ 
press luminescence background like deep UV excitation 
and temporal bitering. 


Introduction 

The discovery of inelastic light scattering on molecular 
samples was recognised by the award of the Nobel Prize 
in 1930 to C.V. Raman, after whom the spectroscopy 
based on inelastic light scattering is conventionally 
named Raman spectroscopy. Though the effect has 
been known for more than 80 years by now, only the 
last 1 or 2 decades witnessed a drastic increase in analytic 
applications of Raman spectroscopy in particular in the 
belds of biochemical analytics and biomedical sciences. 
With the availability of highly sensitive detectors and in 
combination with laser scanning microscopy or near- 
beld optics, the use of metal nanoparticles for signal 
enhancement and the exploration of polarization effects, 
Raman scattering has matured to become a powerful 
analytic technique due to its molecular sensitivity, its 
ease of implementation, and the fact that in contrast to 
IR absorption spectroscopy the presence of water is not 
hindering its applicability. While the applications of 
Raman spectroscopy originated in physics and chemis¬ 
try, these applications have reached out toward biology 
and medicine unraveling the molecular signature of dis¬ 
eases and individual bacteria (Schlucker and Kiefer 
2010). Furthermore, the combination of ultrafast 


spectroscopy and Raman spectroscopy reveals the 
ultrafast dynamics of structural rearrangements during 
chemical reactions and Raman reaches for the stars - 
with the upcoming implementation of Raman spectrom¬ 
eters in the Mars-missions of ESA and NASA (Schmitt 
and Popp 2006). Finally, nonlinear Raman scattering 
methodologies (Long and Kiefer 1982) such as 
coherent anti-Stokes Raman scattering have been devel¬ 
oped, which in combination with laser scanning micros¬ 
copy allow rapid, i.e., in vivo chemical imaging of (live) 
samples (Burkacky and Zumbusch 2008; Krafft et al. 
2009). 

While the following entries in this encyclopedia are 
going to be concerned with details of the various 
aspects of contemporary Raman spectroscopy in bio¬ 
physics, this entry will present some general and basic 
aspects of Raman instrumentation and experimental 
considerations. Starting from the molecular origin of 
the signal, general challenges to be met in Raman 
spectroscopy will be presented and experimental strat¬ 
egies to cope with them will be laid out. 

The Raman signal originates from the inelastic 
scattering of light in (molecular) samples (Long 
2002). Classically, the description of the effect starts 
with the formula for the induced dipole moment p., 
which can be written as: 


fi = oiE 


( 1 ) 


The molecular dipole ji is induced by the external 
electric field E, which is characterized by its vector 
amplitude Eq and its oscillation frequency (Oq, and 
which can be expressed as: 


R 


E = Eq ■ cos(cuof) (2) 

The proportionality factor between the external 
field and the induced dipole moment in Eq. 1 is the 
molecular polarizability, a. This quantity is physically 
related to the extent to which the driving field is able to 
disturb the electron density of the sample out of its 
equilibrium conhguration, i.e., the configuration in the 
absence of an external field. Nuclear motion, however, 
will modify the polarizability as the electron density 
adiabatically adjusts to the instantaneous nuclear 
geometry of a molecule to overall minimize the energy 
of the system. Thus, the polarizability is expanded into 
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a Taylor series around the equilibrium nuclear geom¬ 
etry Qo, where Q represents the group of all individual 
normal modes q. 



1 / a^a 

■ <? + T 




2 \dqdq'J,„^„ 


■qq' + 0{q^) 


(3) 


In turn, oscillation with a characteristic frequency 
(o^ along each normal coordinate q can be excited, thus 

q = qo- cos{(Dqt) (4) 


Inserting Eqs. 2-4 into Eq. I yields: 


n{t) 


oto + 



■ qo ■ cos {(Oqt'j 


■ Eo ■ cos(cooO (5) 


Now the sum over all normal modes is reduced to 
a single mode q and the expansion in Eq. 3 is truncated 
after the linear term. Thus, Eq. 5 can be rewritten as: 


/t(f) =a.o-Eo- cos(cL>oO + ^ 

qO-Eo- COs[(fflo - OOq) t] 
qo - Eo • cos[(mo + «?) ?] 


90 

1 
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( 6 ) 


Equation 6 constitutes the conceptual core of Raman 
scattering as a limiting case of general light scattering. 
The time-dependent induced dipole moment presents 
a secondary source of (scattered) radiation, which con¬ 
tains three distinct frequency terms: 

The first one, i.e., lo ■ Eo ■ cos(ffloO> oscillates at the 
frequency of the incident laser radiation and is respon¬ 
sible for scattered light at this frequency, i.e., elastic 
Rayleigh scattering. 

The second term in Eq. 6 oscillates at a difference 
frequency, i.e., the difference between the laser and the 
molecular normal mode frequency. As a consequence, 
this induced dipole moment radiates light with a longer 
wavelength than the excitation frequency. This term 
representing inelastic light scattering is the source of 
the Stokes scattering. The last term is characterized by 
the sum of frequencies mo + cOp. It also presents 
a source for inelastic light scattering; however, it is 
a source for radiation which is blueshifted with respect 


to the laser frequency. In analogy to Stokes scattering, 
the process giving rise to this radiation is called 
anti-Stokes scattering. 

Having laid out the conceptual fundamentals of 
Raman scattering it is straight forward to pinpoint the 
challenges to be faced in the experimental implemen¬ 
tation of Raman scattering. These challenges are also 
the reason why Raman scattering despite being known 
for 80 years has only blossomed very recently: As the 
inelastic light scattering present higher terms in the 
Taylor-expansion of the polarizability tensor com¬ 
pared to the elastic light scattering, the cross section 
for Raman scattering is low and orders of magnitude 
lower than the cross section for Rayleigh scattering. 
This imposes the first two central aspects of modem 
Raman instrumentation: the use of light sources with 
a high spectral density of photons, i.e., narrow-band 
high-power light sources, which became available 
with the development of robust and easy to handle 
continuous-wave (cw) lasers, and powerful filters, 
which allow for the efficient suppression of Rayleigh 
scattered light in the Raman spectrum. Contemporary 
Raman spectroscopy setups use cw lasers for excita¬ 
tion. These sources combine various benefits for 
Raman spectroscopy, first of all the narrow spectral 
bandwidth that is narrower than the spectral width of 
a typical Raman transition, i.e., ^10 cm“\ Thereby, 
the choice of light source does not limit the spectral 
resolution obtained and hence does not obscure the 
structural information to be deduced from the Raman 
spectrum. Eurthermore, cw lasers provide a high inten¬ 
sity in their narrow emission window, which is utmost 
beneficial for the investigation of a weak scattering 
process. Additionally modem cw lasers are robust, 
easy to handle, and stable with respect to output 
power and pointing stability over extended periods of 
time, which present mediocre advantages of using 
them for spectroscopic purposes. 

As the cross-section for inelastic light scattering is 
significantly lower than for elastic light scattering, the 
Rayleigh scattered light overwhelms the actual Raman 
signal if insufficient stray light suppression is applied. 
This is of particular importance when aiming for low- 
wave-number Raman spectroscopy, e.g., for the inves¬ 
tigation of the collective vibrations in proteins or size 
and shape effects of nanostructures (Kramer 2005). 
Here the development of sharp-edge long-pass filters or 
notch filters, i.e., filters blocking a very narrow spectral 
range while being transparent to the radiation both at 
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Fig. 1 Schematically depicts 
a scattered-light spectrum, 
which contains contributions 
from Stokes-, anti-Stokes-, 
and Rayleigh scattering. The 
shaded area indicates the 
spectral range that needs to be 
filtered in order to suppress the 
overwhelming contributions 
from elastic light scattering 
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longer and shorter wavelengths, was important for 
the outreach of Raman spectroscopy for the following 
reason; Previous to the wide spread and commercial 
availability of high-quality optical filters for Raman 
spectroscopy, suppression of Rayleigh scattered light 
was most efficiently performed using triple¬ 
monochromators with internal slits. In order to achieve 
significant suppression of Rayleigh scattered light, nar¬ 
row slit-widths needed to be used leading to low light 
levels at the detector. Furthermore, most spectra were 
recorded in point scanning mode using photomultipliers. 
While those detectors are very sensitive and provide 
a high dynamic range, the serial acquisition of a full 
spectrum takes a long time. Hence today’s most widely 
used detectors for Raman spectroscopy are so-called 
CCD chips, very similar to the detectors used in modem 
cameras. These detector have several distinct advan¬ 
tages, firstly they provide the possibility to acquire spe¬ 
cific spectral regions in one single exposure. This means 
the number of detector elements along the spectral axis 
should be as high as possible. Modem detectors provide 
between 1,000 and 2,000 detector elements allowing the 
simultaneous detection of the entire spectral region, (e.g., 
200-4,000 cm"*) with one exposure and a spectral reso¬ 
lution sufficient for most applications in condensed mat¬ 
ter. Compared to photomultipliers, CCD cameras can be 
operated from the UV region close to 1.1 pm. The upper 
limit is due to the band gap of silicon, which is the basis 
material of CCDs. Connected to the material is also the 
extremely high quantum efficiency, particularly in the 
visible region, that can reach values of more than 90%. In 


combination, this makes CCD elements the detector of 
choice for Raman spectroscopy (Fig. 1). 

For Raman spectroscopic applications the particular 
choice of the excitation wavelength often turns out to 
be important as, by tuning the laser wavelengths into 
resonance with a molecular electronic transition of 
interest, the Raman scattering cross section of vibra¬ 
tions coupled to the respective electronic transition can 
be selectively enhanced by several orders of magni¬ 
tude. This effect is referred to as resonance Raman 
scattering (RRS) and among other applications fre¬ 
quently employed to investigate structure and dynam¬ 
ics of proteins (van Manen et al. 2008; Wood and 
McNaughton 2008). In order to employ the full flexi¬ 
bility of the RRS effect it is highly desirable to work 
with light sources that combine a narrow spectral 
bandwidth with a high spectral density of photons as, 
e.g., cw lasers and with the large range of wavelength 
tenability that is typically achieved with pulsed lasers. 
Though some solid state and diode lasers can be tuned 
quite substantially, the development of fully tunable 
narrow-band light sources in the UV-Vis-NIR spectral 
range presents a significant defiance in modem Raman 
instrumentation. When considering RRS another 
significant problem to be dealt with in Raman spec¬ 
troscopy arises: luminescence suppression. As the 
excitation wavelength is chosen to coincide with an 
electronic transition, population is transferred from the 
electronic ground state to higher lying electronic 
states. Hence, in many samples luminescence will 
take place and will potentially show significant 
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spectral overlap with the Stokes-shifted scattered light. 
However, depending on the sample the luminescence 
light might be up to nine orders of magnitude more 
intense than the Raman scattered light. Consequently, 
the luminescence will dramatically overwhelm the 
Raman signal. As this is not only a problem of RRS 
but also often occurs in the presence of autofluorescent 
chromophores when using Raman spectroscopy to 
investigate biological tissue, various strategies have 
been developed to circumvent this problem. The most 
straightforward solution to cope with autofluorescence 
in biological tissue is the use of near-IR light for the 
excitation of the Raman scattering, i.e., typically diode 
laser excitation at 785 nm. In this spectral rage, barely 
any chromophore absorbs and hence no additional 
light emission will interfere with the Raman signals. 
Though this is a suited approach also with respect to 
issues of phototoxicity, it is not applicable, when pur¬ 
suing RRS with the aim of actually working in reso¬ 
nance with an electronic transition of the analyte. In 
this case, different experimental approaches need to be 
pursued in order to record luminescence-free RRS 
spectra. In order to do so the spectral shape and the 
temporal dependence of Raman scattering and lumi¬ 
nescence are employed, i.e., two separate intrinsic 
characteristics with respect to which both light-matter 
interactions differ conceptually. As mentioned before 
vibrational Raman bands of analytes in solution or 
complex biological matrices typically show linewidths 
of ^10 cm”*, while luminescence bands can easily be 
1,000 cm”* broad. This difference in spectral band¬ 
width can be employed for luminescence subtraction 
using a frequency-modulated excitation laser, e.g., 
a current-driven diode laser. If the wavelength of the 
laser line used to excite the Raman scattering is mod¬ 
ulated to alternate between two settings separated by as 
little as 1 or 2 nm, this corresponds to a spectral shift in 
the order of some 10 cm”*, which appears as an alter¬ 
nating offset in the Raman spectra recorded on 
a wavelength scale. However, the luminescence spec¬ 
trum will remain unaffected as luminescence in far 
most cases takes place from the vibrationally relaxed 
lowest lying electronically excited state, i.e., is not 
affected by modulating the excitation wavelength. 
Thus, if the detection occurs in-phase with the modu¬ 
lation of the excitation wavelength, subsequently 
recorded Raman spectra can be subtracted and lumi¬ 
nescence-free spectra can be generated. Another 
option is to use the aforementioned fact that emission 


in far most cases occurs from the vibrationally relaxed 
lowest lying electronically excited state and to excite 
RRS spectra with much higher energy in the far-blue 
flank of the absorption spectra. Thereby, the Raman 
bands appear redshifted with respect to the excitation 
light by 1,000-1,800 cm”* considering the spectral 
region of molecular fingerprints, while the lumines¬ 
cence on the other hand is shifted by more than, e.g., 
3,000 cm”* with respect to the excitation line. Conse¬ 
quently, the Raman spectrum appears in a totally dif¬ 
ferent spectral window than the luminescence and 
hence can be detected without disturbing contributions 
from luminescence. In doing so it could be shown, e.g., 
that antimalaria active substances could be localized in 
plants by luminescence background-free UV RRS 
(Frosch et al. 2007) (Fig. 2). 

An alternative solution to luminescence suppression 
builds on the different temporal structure of Raman 
scattering and luminescence. While Raman scattering 
is an instantaneous process, i.e., the signal is generated 
at the time of interaction between the incident light and 
the sample, luminescence appears delayed and its 
appearance is correlated with the excited-state lifetime 
of the chromophores, which typically fall into the nano¬ 
second (fluorescence) to micro- and millisecond range 
(phosphorescence). Hence, time-gated detection can be 
employed to separate luminescence background from 
the Raman signal. In order to do so pulsed lasers have 
to be employed, which are typically chosen to have pulse 
durations between some 100 fs and some ps, thereby 
roughly matching the natural linewidth of vibrational 
Raman transitions of analytes in solution or biological 
matrices. Conventionally, a so-called Kerr gate is 
employed (Rulliere et al. 1998), where the Raman 
scattered light is focused in a medium with a high 
nonlinear polarizability and is overlapped with a strong 
gate pulse. The signal and the gate pulse are polarized 
perpendicularly and a polarizer is inserted into the opti¬ 
cal path of the signal to block the signal after the Kerr 
medium. If both signal and gate pulse interact simulta¬ 
neously in the Kerr medium, the polarization of the 
signal is rotated slightly and transmission through the 
polarizer is induced. Thereby, a selected time-window 
can be detected and hence the luminescence, which 
appears delayed, is suppressed. A conceptually analo¬ 
gous time gating of the Raman signal can be achieved 
using femtosecond stimulated Raman scattering, a tech¬ 
nique recently developed (McCarmant et al. 2007) and 
featured in the entry ► Ultrafast Raman Spectroscopy. 
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Fig. 2 Presents a resonance 
Raman spectrum in the 
presence of fluorescence and 
corrected for the luminescence 
(a). Panels (b) and (c) 
schematically depict the 
experimental procedure to 
reduce the luminescence 
contributions (broad 
spectrum) to the resonance 
Raman spectrum (sticks) by 
modulation of the excitation 
frequency. Panels (d) and (e) 
summarize the experimental 
approach to separate Raman 
(sticks) and fluorescence 
contributions (line) by 
employing the differences in 
the temporal dependences of 
the signals 



Both of these techniques rely on a rather complex optical 
setup and are difficult to combine with imaging technol¬ 
ogies due to the complexity of the optical paths. For 
these reasons, contemporary research aims at the devel¬ 
opment of multichannel detectors, which can be time¬ 
gated with a high temporal resolution and allow for 
shutter times in the order of below 1 ns. Such detectors, 
which will be able to electronically time-gate the detec¬ 
tion of Raman scattered signal and suppress most of the 
(delayed) luminescence, will have the potential to 
greatly simplify the experimental setup needed for 
RRS spectroscopy on the one hand and allow for 
a combination of imaging technologies with lumines¬ 
cence suppression on the other hand. 


Cross-References 

► Nonlinear Raman Spectroscopy: Coherent 
Anti-Stokes Raman Scattering (CARS) 

► Raman Optical Activity Studies of Structure and 
Behavior of Biomolecules 

► Raman Spectroscopy and Microscopy of Cells and 
Tissues 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 

► Surface-Enhanced Raman Spectroscopy for 
Bioanalytics 

► Ultrafast Raman Spectroscopy 
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Summary 

In conclusion, recent and ongoing improvements with 
respect to available experimental equipment and 
instrumentation have boosted and are continuing to 
spread the applications of Raman spectroscopy in bio¬ 
physics and the life sciences as well as in the field of 
biochemical and biomedical analytics (Strehle and 
Popp 2006), which will be detailed in the following 
entries of this encyclopedia. 
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Synonyms 

Cancer diagnosis with Raman spectroscopy; Raman 
imaging of biological samples; Raman microspec¬ 
trometry of cells and tissues 

Definition 

The combination of Raman spectrometers and micro¬ 
scopes offers prospects as a powerful imaging tool. 


Advantages include that the method gives detailed 
biochemical and structural information on the molec¬ 
ular levels in a label-free and nondestructive way. 
Here, the applications to characterize single cells and 
tissues are described. 

Introduction 

Instrumental progress of Raman spectroscopy as 
described in (► Raman Spectroscopic Instrumentation, 
Experimental Considerations) enables to collect Raman 
spectra of single cells and tissues within seconds. Com¬ 
bination with microscopes gives diffraction-limited lat¬ 
eral resolution in the submicrometer range which offers 
prospects of Raman microscopy as a powerful imaging 
tool. Generally, more bands are observed in Raman 
spectra of tissues and cells than in other optical spectra 
because numerous vibrations of biomolecules can be 
excited simultaneously. Therefore, more potent informa¬ 
tion about the biochemistry, composition, and molecular 
structure of the underlying sample may be collected 
without external labels, with minimal preparation, with¬ 
out damaging the samples and even under in vivo con¬ 
ditions. As Raman spectra have been recognized to 
provide a hngerprint for the structure and compositions 
of biomolecules such as proteins, lipids (► Vibrational 
spectroscopy of biomembranes and lipids), carbohy¬ 
drates, and nucleic acids (► Raman of DNA, RNA), 
a sensitive and specific fingerprint can also be obtained 
for cells and tissues that are composed of these biomol¬ 
ecules. Similar as for IR spectroscopy (► Infrared Spec¬ 
troscopy: Data Analysis, ► Infrared Spectroscopy of 
Cells, Tissues, and Biofluids), Raman spectra and 
Raman images need to be analyzed by statistical 
methods to extract the relevant information of the 
strongly overlapping spectral contributions and to corre¬ 
late the fingerprint with the type and state of the sample. 
The interest in Raman spectroscopy to assess cells and 
tissues is demonstrated by several reviews that appeared 
within the last years. They summarized biomedical 
applications (Krafft and Sergo 2006), disease recognition 
(Krafft et al. 2008b), noninvasive analysis of single cells 
(Notingher and Hench 2006; Swain and Stevens 2007), 
and Raman and CARS spectroscopy of cells and tissues 
(Krafft et al. 2009). This chapter is organized into an 
experimental part describing Raman microscopic sys¬ 
tems for cell and tissue studies. Then Raman spectra 
and images of single cells and tissues in dried and 
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non-dried state are presented. Spectral contributions are 
assigned to the main cellular constituents (nucleus, cyto¬ 
plasm, and lipid vesicles) and main soft tissue classes 
(connective tissue, muscle tissue, epithelial tissue, and 
nervous tissue). 

Raman Microscopic Systems for Cell and 
Tissue Studies 

The coupling of a Raman spectrometer with 
a microscope offers two main advantages. First, lateral 
resolutions can be achieved down to the Abbe’s limit 
of diffraction below 1 pm. High axial resolutions are 
obtained by confocal microscopes. Second, maximum 
sensitivity can be achieved because the photon flux 
(photon density per area) of the focused laser beam 
onto the sample is at maximum and the collection 
efficiency of scattered photons from the sample is at 
maximum. The lateral resolution, the diameter of the 
focused laser, and the collection efficiency depend on 
the numerical aperture (NA) of the microscope objec¬ 
tive which is the product of the sine of the aperture 
angle and the diffraction index. High NA objectives 
have small working distances below 1 mm in air. The 
working distance of water immersion objectives can be 
larger due to the larger diffraction index of water. 
Oil immersion objectives in combination with oil 
should be avoided because spectral contributions of 
oil overlap with the Raman spectrum of the sample. 
Most Raman spectrometers for cell and tissue studies 
utilize the near-infrared advantage. Even trace 
amounts of fluorescent molecules in biological mate¬ 
rial can cause an intense signal due to several orders of 
magnitude larger efficiency. Such an autofluorescence 
background often masks the weaker Raman signals 
when using visible lasers for excitation. As most tis¬ 
sues and body fluids show minimum absorption in the 
wavelength interval from 700 to 900 nm, the excited 
autofluorescence is at minimum and the penetration of 
the exciting and scattered radiation is at maximum. 
Furthermore, it has been reported that excitation laser 
in the visible range (400-600 nm) induces cell death 
within a few seconds of exposure, whereas living cells 
are not affected using near-infrared laser excitation. 
Partial absorption of the focused laser light harbors 
the risk to induce thermal tissue damage. A typical 
state-of-the-art Raman spectrometer which is commer¬ 
cially available from several manufactures uses a diode 


laser at 785 nm for excitation, focuses the radiation 
onto the sample by a high NA microscope objective, 
collects the scattered light in backscattering geometry, 
separates the elastically scattered light by a notch or an 
edge filter, disperses the inelastically scattered radiation 
in a grating spectrograph, and detects the spectrum by 
near-infrared-optimized CCD detectors. 

Raman imaging combines the spectral and the spatial 
information. Most Raman images are collected in the 
point-mapping mode. Here, the laser focus is moved by 
a scanning mirror or the sample is moved by a motorized 
stage to the next position of a predefined grid. The total 
exposure time depends on the number of spectra, the 
exposure time per spectrum, and time for moving the 
stage. Raman imaging using laser-line illumination or 
wide-field illumination has been developed to reduce 
acquisition time. In the first case, the spatial data is 
registered on the detector on a line parallel to the entrance 
slit of the spectrometer, and the spectral information is 
dispersed perpendicularly. The other spatial dimension of 
an image is recorded by scanning in the direction perpen¬ 
dicular to that line. This so-called line-mapping registra¬ 
tion mode is faster because only one dimension instead of 
two dimensions in the point-mapping mode has to be 
scanned. In the second case, the parallel registration 
approach employs intense, global sample illumination. 
The inelastically scattered light from the sample is 
projected onto a two-dimensional CCD detector. Most 
wide-field Raman imaging spectrometers use filters to 
select the wavelength such as dielectric, acousto-optic 
tunable, and liquid-crystal tunable filters. 

Raman Images of Cells 

Single cells are very suitable objects for Raman 
spectroscopy because of the high concentrations of bio¬ 
molecules in their condensed volume. Protein concen¬ 
tration as high as 250 pg/pL, and DNA and RNA 
concentrations in the range of 100 pg/pL have been 
reported. These values depend on the cell type, the 
phase of the cell cycle, and the location within the cell. 
Raman imaging of cells can complement current tech¬ 
niques in cell biology to study cellular and subcellular 
processes and structures or to identify cell differentiation 
and cell type. The most frequently used biophotonic 
method for single-cell studies is fluorescence micros¬ 
copy which depends on exogenous fluorophores because 
most biomolecules cannot directly be detected. Problems 
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Raman Spectroscopy and 
Microscopy of Cells and 
Tissues, Fig. 1 Raman 
microspectroscopic imaging 
of a single, dried Hep G2 cell. 
Mean cluster spectra from 600 
to 1,800 cm“* (a): cytoplasm 
(trace 1), nucleus (2), nucleoli 
(3), cai'otenoid compound (5), 
and polystyrene bead (6). 
Magnified difference 
spectrum (trace 4 = 2-1) 
shows spectral contribution of 
nucleic acids. 
Photomicrograph (b) and 
cluster membership map (c). 
Color code: cytoplasm (blue, 
cyan, brown), nucleus 
(yellow), nucleoli (orange), 
polystyrene bead (black), 
carotenoid compound 
(magenta) 




result from the limited stability, bleaching, and restricted 
accessibility of external markers. In principle, Raman 
imaging can overcome many restrictions because 
Raman signals are based on inherent vibrational proper¬ 
ties of the cell’s biomolecules. 

Dried Hep G2 Cell 

Hep G2 is a perpetual cell line which was derived 
from the liver tissue of a 15-year-old male with a well- 
differentiated hepatocellular carcinoma. A photomicro¬ 
graph (b) and a Raman image (c) of a dried Hep G2 cell 
are shown in Fig. 1 . The Raman image of 51 x 61 spectra 
was collected with a step size of 500 nm and an 
exposure time of 6 s per spectrum using a Raman 
microspectrometer with 785 nm single-mode laser for 
excitation. After low-intensity spectra representing the 
medium around the cell were removed and the remaining 
spectra were normalized, the data set was analyzed by 
k-means clustering. This algorithm groups the spectra 
into k clusters (here: 7) according to their similarity 
which is calculated from the Euclidian distance in 
a given spectral range. The cluster memberships are 
color-coded for display. Averaging all spectra within 
a cluster offers the advantages that pixel-to-pixel 


variations are reduced and the signal-to-noise ratio is 
improved compared to a single spectrum. Five spectra 
(trace 1, 2, 3, 5, 6) and a difference spectram (trace 
4 = 2-1) are displayed in Fig. la. Trace 1 represents the 
cytoplasm (cyan cluster). Spectral contributions of pro¬ 
teins dominate. Bands are assigned to the aromatic amino 
acid side chains (Phe: 622, 1,003, 1,032, 1,208 cm”*; 
Tyr: 644, 830, 853, 1,208 cm”*; Trp: 759 cm”*), ali¬ 
phatic amino acid side chains (1,338, 1,448 cm”*), and 
the peptide backbone (amide III at 1,267 cm”*, amide I 
at 1,657 cm”*). Spectral contributions of lipids are 
resolved at 1,303, 1,130, 1,060, 716, 700 cm”*. 
The latter bands are not labeled. The other cytoplasm 
spectra (blue and brown clusters) only slightly deviate 
and will not be discussed here. Trace 2 represents the 
nucleus (yellow cluster). Beside protein bands that resem¬ 
ble those in the cytoplasm, spectral contributions of DNA 
are evident at 783 and 1,099 cm”*. As most DNA bands 
overlap with protein bands, a difference spectrum was 
calculated. Then positive difference bands are assigned to 
nucleotides (A: 728, 1,335, 1,484, 1,576 cm”*; C: 783, 
1,245 cm”*; T: 783,1,374, 1,673 cm”*; G: 1,484,1,576 
cm”*) and the phosphate backbone (1,099 cm”*). The 
intensities of DNA bands increase in trace 3 which 
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Raman Spectroscopy and 
Microscopy of Cells and 
Tissues, Fig. 2 Raman 
microspectroscopic imaging 
of a macrophage in medium. 
Representative spectra from 
200 to 1,800 and from 2,700 to 
3,400 cm^* (a): lipid droplet 
(trace 1) and medium 
(2) without processing, 
difference spectra lipid droplet 
minus medium (3), and 
foreign particle minus medium 
(4). Photomicrograph (b) and 
chemical map at 2,852 cm“* 
(c). A gray-scaled chemical 
map at 545 cm^ ’ was included 
in (c) to visualize the position 
of the foreign particle 
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represents nucleoli (orange cluster). Traces 5 and 6 
represent special features (magenta and black clusters, 
respectively). Bands at 1,158 and 1,529 cm“* are 
assigned to carotene. An inherent property of carotene 
compounds is their high Raman cross-section which 
makes them easily detectable in Raman spectra even at 
low concentrations. Intense bands at 621, 1,002, 1,032, 
1,584, and 1,603 cm~* are typical for polystyrene. The 
unexpected detection of this polymer head demonstrates 
the power of Raman microspectroscopic imaging. The 
location and distribution of numerous molecules can be 
assessed by their spectroscopic fingerprint without exog¬ 
enous labels. 

Macrophage in Medium 

Macrophages comprise a special class of white blood 
cells (leukocytes). As phagocytes, they engulf and 
digest cellular debris and pathogens. They are also 
involved in the uptake of lipids by various mechanisms 
that create the progressive plaque lesion of atheroscle¬ 
rosis. Figure 2 shows a photomicrograph (b) and 
a Raman image (c) of a macrophage in medium. The 
Raman image of 78 x 72 spectra was registered with 
a step size of 700 nm and using a water immersion 


objective to focus the 785 nm single-mode laser onto 
the cell and to collect the scattered light. Typical fea¬ 
tures of the macrophage are lipid droplets that line the 
margin of the cell. The most intense Raman hands of 
lipids are CH 2 stretch vibrations near 2,900 cm“*. The 
chemical map in Fig. 2 plots the intensity distribution 
of these bands. Under the experimental conditions, 
larger droplets are resolved, smaller droplets could 
not be resolved. Higher lateral resolution can be 
achieved by shorter excitation wavelength and smaller 
step size. Unmanipulated Raman spectra represent the 
lipid droplets and the medium around the cell in Fig. 2a 
(trace 1 [black] and 2 [gray], respectively). Spectral con¬ 
tributions are assigned to the substrate calciumfluoride 
(322 cm”*) and water (1,640 and above 3,000 cm”*). The 
overlay of both spectra indicates that the spectral contri¬ 
butions of the lipid droplets are less intense. Subtracting 
the spectmm of the medium compensates bands of the 
substrate, water, and also constant spectral contributions 
of optical elements in the light path. The resulting differ¬ 
ence spectmm (trace 3) reveals pure lipid bands with 
a very low background. The bands are assigned to the 
choline group (716, 869 cm” ), phosphate group 
(1,080 cm”*), ester group (1,742 cm”*), and CC and 
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CH vibrations of fatty acids (1,063, 1,130, 1,299, 
1,439, 2,852, 2,886 cm“'). Bands at 1,266, 1,657, and 
3,012 cm” * are indicative of unsaturated fatty acids. The 
Raman spectrum (trace 3) is consistent with the phospho¬ 
lipid phosphatidylcholine. The Raman spectrum of 
a foreign particle at the bottom part of the cell (trace 4) 
shows numerous bands in the low wavenumber region at 
545,437,369, and 298 cm”*. The distribution of the band 
at 545 cm”* has been included in Fig. 2c as a gray-scale 
chemical map. Figure 2 shows a macrophage at work 
engulhng a foreign particle. 

Raman Images of Tissue 

Tissue is a cellular organizational level intermediate 
between cells and a complete organism. Hence, tissue 
is an ensemble of cells, and the principles described 
above can be transferred; that means, Raman spectros¬ 
copy can complement established tools in histology 
and histopathology to study tissue. Diseases and other 
pathological anomalies lead to chemical and structural 
changes in tissues on the molecular level which also 
change the Raman spectra and which can be used as 
sensitive, phenotypic markers of the disease. As these 
spectral changes are very specihc and unique, they can 
be considered as a fingerprint. Early reports in the 
literature regarding the utility of Raman spectroscopy 
to biomedical problems were based on acquisition of 
spectral data only at single points which required an 
a priori knowledge of the location or a preselection of 
the probed position. Since the inhomogeneous nature 
of tissue on the microscopic level was not considered 
in these early studies, an accurate correlation between 
the histopathology of the sampled area and the 
corresponding spectra was only possible for homoge¬ 
nous tissue on the macroscopic level. Considerable 
progress was made since high throughput, and more 
sensitive instruments became available for Raman 
microspectroscopic imaging. They enable to micro¬ 
scopically collect larger number of spectra from larger 
sample populations in less time, improving statistical 
signihcance and spatial specihcity. 

Colon Tissue Section 

A photomicrograph (b) and a Raman image (c) of 
a colon tissue section are shown in Fig. 3. The sample 
was prepared from a frozen specimen using a cryotome 
without further manipulation. One should be aware 


that each processing step such as embedding or fixation 
might alter the chemical composition and, conse¬ 
quently, the Raman spectra. After transfer onto on 
a suitable substrate and vaporization of the water con¬ 
tent in a dry atmosphere, the proteins are precipitated 
to form an insoluble mass that is resistant to degrada¬ 
tion. This fixation of thin tissue sections by air drying 
works surprisingly well. The choice of substrates 
depends on the excitation wavelength. Glass gives 
high spectral background at 785 nm excitation. Quartz 
and calciumfluoride are preferred due to their lower 
background signals. 

Colon tissue contains all four major tissue types 
such as muscle tissue, connective tissue, epithelium, 
and also nerve cells. K-means clustering identihes 
the gross architectural division of colon tissue. The 
mucosa (blue) is the innermost, highly specialized 
layer of the gastrointestinal tract which is responsible 
for absorption and secretion, important processes in 
digestion. Some mucus (magenta) which is secreted 
by epithelial glands is found at the bottom part. The 
submucosa (brown) consists of a dense irregular layer 
of connective tissue. Below the submucosa, a circular 
muscle layer (yellow) and a longitudinal muscle layer 
(olive) are identihed. Serosa (red), which belongs to 
the connective tissue class, is found at the top. Nerves 
and ganglia are the neurons of the enteric nervous 
system that controls the gastrointestinal tract. Ganglia 
are often connected with each other to form a complex 
system known as plexus. However, the step size in 
the 79 X 79 Raman image was 62 pm which was not 
sufficient to resolve nerve cells. In a related report, 
smaller step sizes of 10 pm and 2.5 pm were applied 
to identify nerve cells in ganglia and subcellular details 
(Krafft et al. 2008a). 

The cluster-averaged spectra reveal information 
about the underlying spectral properties (Fig. 3a). 
Compared with the previous spectra of single cells, the 
Raman spectrum of submucosa (trace 1) contains addi¬ 
tional spectral contributions, most evident at 759, 816, 
854,920,937, and 1,245 cm”*. These bands are assigned 
to collagen which is a major component in connective 
tissue. Collagen has an unusual amino acid composition 
with high content of hydroxyproline, proline, and glycine, 
and a special coiled-coil triple helix secondary structure. 
The reduced intensities of collagen-associated bands con¬ 
stitute the main difference between submucosa (trace 1) 
and muscle tissue and between longitudinal muscle 
layer (trace 2) and circular muscle layer (trace 3). 
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Raman Spectroscopy and 
Microscopy of Cells and 
Tissues, Fig. 3 Raman 
spectroscopic imaging of 
a colon tissue section. Mean 
cluster spectra from 600 to 
1,800 cm“* (a): Submucosa 
(trace 1), longitudinal (2) and 
circular muscle tissue (3), 
mucosa (4), and mucus (5). 
Photomicrograph (b) and 
cluster membership map (c). 
Color code: serosa (red), 
circular muscle (olive), 
longitudinal muscle (yellow), 
submucosa (brown), mucosa 
(blue), and mucus (magenta) 
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Smooth muscle tissue often must be stretched, so elastic¬ 
ity is an important attribute. Therefore, smooth muscle 
cells secrete a complex extracellular matrix including 
collagen. The distinction of circular versus longitudinal 
muscle based on the relatively small decrease of collagen 
bands is imperfect due to the relative low signal-to-noise 
ratio (SNR) of the Raman spectra. The SNR of single 
spectra could be increased by longer exposure times, 
which however would also increase the total acquisition 
time. As the current exposure times were already as long 
as 62 h, a further increase was impracticable. The colla¬ 
gen-associated bands further decrease in the Raman spec¬ 
tra of mucosa (trace 4). More intense Raman bands at 780 
and 1,092 cm”' point to higher DNA content in mucosa 
which indicates a high metabolic activity. Together with 
the fact that the mucosa is the outermost layer of the colon 
and it is exposed to various, potentially carcinogenic sub¬ 
stances, this explains why the carcinogenesis most often 
occurs from the mucosa and the epithelial layer. The 
Raman spectrum of mucus (trace 5) is markedly different 
from the previous spectra. Additional Raman bands are 
found at 845, 859, 1,063, 1,125, 1,140, 1,235, 1,279, 
1,396, and 1,468 cm”*. The spectral signature is 


consistent with carbohydrates and peptides. The main 
constituent of mucus is mucin which is known to consist 
of highly glycosylated peptides. Furthermore, some 
bands point to poly(ethyleneglycol) 4,000 which 
was administered before surgery to flush the gastrointes¬ 
tinal tract. 

Non-dried Murine Brain Tissue 

An important step in the development of new diagnostic 
methods is their application in animal models. A mouse 
model was used to test the detection of brain metastases 
by Raman spectroscopic imaging. Brain metastases of 
malignant melanomas were induced by injection of 
tumor cells into the carotid artery. Murine brains were 
sectioned, and the sampling area was covered by 
a calciumfluoride window to prevent the surface from 
drying during data acquisition. Raman images were 
acquired by a Raman system which was coupled to 
a fiber-optic probe. The specimens were placed onto 
a motorized stage, the radiation from a 785 nm multimode 
laser was focused by the probe to spot of 60 pm diameter, 
the scattered light was collected by the probe and spectra 
were sequentially registered at a step size of 120 pm with 
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Raman Spectroscopy and 
Microscopy of Cells and 
Tissues, Fig. 4 Raman 
spectroscopic imaging of 
murine brain tissue. Mean 
cluster spectra from 400 to 
1,800 and from 2,750 to 3,550 
cm^* (a): fiber tracts (1), 
cerebellum (2), tumor margin 
(3), and tumor (4). Cluster 
membership maps of normal 
specimen (b) and specimen 
with metastasis of malignant 
melanoma (c) Abbreviations: 
fiber tracks (FT), cerebellum 
(CB), mesencephalon (M), 
cistern (C), ventricle (V) 



8 s exposure time. The laser intensity of 100 mW was 
found to be nondestructive. The clusters in the color- 
coded membership maps were obtained by k-means clus¬ 
tering and can be assigned to anatomical features (Fig. 4b 
and c). Fiber tracts corresponding to white matter could be 
identified in the cerebellum and around a ventricle. 
A cistern was resolved between the left and the right 
brain hemisphere and below the mesencephalon. In 
a second specimen, brain metastases of malignant mela¬ 
nomas were found near the cistern. The result demon¬ 
strates the ability of Raman imaging to recognize 
the transition between normal brain tissue and brain 
tumors with high accuracy, but without additional labels. 
Further details including FTIR images were previously 
reported (Krafft et al. 2007). Mean cluster spectra of fiber 
tracts (trace 1, orange cluster), cerebellum (trace 2, blue 
cluster), tumor margin (trace 3, black cluster), and tumor 
(trace 4, gray cluster) are shown in Fig. 4a. Bands at 3,240 
and 3,430 cm~* are assigned to water. The Raman spec¬ 
trum of fiber tracts show the typical signature of white 
matter with maximum spectral contributions of lipids and 
cholesterol. Spectral contributions of cholesterol are evi¬ 
dent at 427,544,608, and 700 cm” *. The intensity ratio of 
lipid-to-protein bands decreases in Raman spectra of gray 
matter which is represented by the blue and cyan cluster in 
normal mouse brain tissue. Raman spectra of brain 


metastases can easily be distinguished from normal tissue 
by intense spectral contributions of the pigment melanin 
at 457, 599, 727, 976, 1,124, 1,400, and 1,592 cm”'. 
Whereas melanin bands have almost the same intensity 
as the other biomolecules of brain tissue at the margin, 
they dramatically increase in the central part of the metas¬ 
tasis. The pigment partially absorbs the excitation wave¬ 
length 785 nm which gives a pre-resonance effect in the 
spectral range 400-1,800 cm”*. It is important to note 
that the expression of the pigment melanin is a molecular 
property of the primary tumor cells. That means the 
secondary tumors contain the molecular information of 
the primary tumor. As Raman spectroscopy probes the 
molecular fingerprint of tissues and cells, this opens the 
exciting perspective to develop a Raman-based approach 
for identification of the primary tumor of metastases. 

Summary 

Compared with the related vibrational spectroscopic 
technique infrared spectroscopy, Raman spectroscopy 
offers several advantages that are important for studying 
cells and tissues. First, the spatial resolution of Raman 
microscopy is higher, which is a consequence of 
the shorter wavelengths. Infrared spectroscopy uses 
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typically radiation between 2.5 and 25 pm to probe 
vibrations from 4,000 to 400 cm~*. Raman spectra are 
usually excited with visible and near-infrared lasers. 
Second, water strongly absorbs infrared radiation 
which gives penetration depths of micrometers in non- 
dried cells and tissues. As the Raman signals of water are 
weak, near-infrared lasers can penetrate several millime¬ 
ters into tissue. This property is utilized in an approach 
called Spatially Offset Raman Scattering (SORS) that 
was pioneered by Pavel Matousek (Matousek et al. 
2006). Third, fiber-optic probes are available for cou¬ 
pling with Raman spectrometers. However, dedicated 
biomedical probes need to be developed in the future. 
Fiber materials for the typical infrared wavelength range 
have unfavorable properties and are expensive. In spite 
of the advantages, fewer Raman than infrared publica¬ 
tions are reported in the context of cells and tissues due to 
the weak Raman effect, Raman spectra of low signal-to- 
noise ratio, more complex Raman instrumentation, and 
longer image acquisition. Enhancement effects have 
been applied such as resonance Raman (► Resonance 
Raman Spectroscopy of Protein-Cofactor Complexes), 
surface-enhanced Raman scattering (► Surface- 
Enhanced Resonance Raman Spectroscopy in Electron 
Transfer Studies, ► Surface-Enhanced Raman Spectros¬ 
copy for Bioanalytics), and nonlinear Raman spectros¬ 
copy such as coherent anti-Stokes Raman scattering 
(► Nonlinear Raman Spectroscopy: Coherent Anti- 
Stokes Raman Scattering (CARS)), and stimulated 
Raman scattering (► Ultra fast Raman Spectroscopy 
and Stimulated Raman Scattering) to compensate low 
signal intensities. Together with progress in instrumenta¬ 
tion, the prospects of Raman spectroscopy for biomedical 
applications are recognized to increase in the future. 
Recent textbooks are recommended for more details 
in vibrational spectroscopy in life science (Siebert and 
Hildebrandt 2007), vibrational spectroscopy for medical 
diagnosis (Diem et al. 2008), biomedical vibrational 
spectroscopy (Lasch and Kneipp 2008), and infrared 
and Raman imaging (Salzer and Siesler 2009). 

Cross-References 

► Cell Sensing with Raman Spectroscopy 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► Infrared Spectroscopy of DNA 

► Nonlinear Raman Spectroscopy: Coherent 
Anti-Stokes Raman Scattering (CARS) 


► Raman Spectroscopic Instrumentation, 
Experimental Considerations 

► Raman Spectroscopy 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 

► Surface-Enhanced Raman Spectroscopy for 
Bioanalytics 

► Ultrafast Raman Spectroscopy 

► Vibrational Circular Dichroism of Biopolymers 

► Vibrational Spectroscopy with Neutrons 

References 

Diem M, Griffiths PR, Chalmers JM. Vibrational spectroscopy 
for medical diagnosis. Chichester: Wiley; 2008. 

Krafft C, Codrich D, Pelizzo G, Sergo V. Raman and FTIR 
microscopic imaging of colon tissue: a comparative study. 
J Biophoton. 2008a; 1(2): 154-69. 

Krafft C, Dietzek B, Popp J. Raman and CARS microspectroscopy 
of cells and tissues. Analyst. 2009;134(6):1046-57. 

Krafft C, Kirsch M, Beleites C, Schackert G, Salzer R. Methodol¬ 
ogy for fiber-optic Raman mapping and FT-IR imaging of 
metastases in mouse brains. Anal Bioanal Chem. 2007; 
389(4): 1133^2. 

Krafft C, Sergo V. Biomedical applications of Raman and infrared 
spectroscopy to diagnose tissues. Spectroscopy. 2006;20(5-6): 
195-218. 

Krafft C, Steiner G, Beleites C, Salzer R. Disease recognition 
by infrared and Raman spectroscopy. J Biophoton. 2008b; 
2(l-2):13-28. 

Lasch P, Kneipp J. Biomedical vibrational spectroscopy. 
Hoboken: Wiley; 2008. 

Matousek P, Draper ER, Goodship AE, Clark IP, Ronayne KL, 
Parker AW. Noninvasive Raman spectroscopy of human 
tissue in vivo. Appl Spectrosc. 2006;60:758-63. 

Notingher I, Hench LL. Raman microspectroscopy: a noninvasive 
tool for studies of individual living cells in vitro. Expert Rev 
Med Devices. 2006;3(2):215-34. 

Salzer R, Siesler HW. Infrared and Raman spectroscopic imaging. 
Wiley-VCH: Weinheim; 2009. 

Siebert F, Hildebrandt P. Vibrational spectroscopy in life science 
(Tutorials in biophysics). Weinheim: Wiley-VCH; 2007. 

Swain RJ, Stevens MM. Raman microspectroscopy for non¬ 
invasive biochemical analysis of single cells. Biochem Soc 
Trans. 2007;35:544-9. 


Rapid Kinetic Methods 

► Continuous Elow 

► Kinetics: Relaxation Methods 

► Quenched-Flow Methods 

► Stopped-Elow Techniques 

► Transient-State Kinetic Methods 





R 


2186 


Rapid Scan FTIR 


Rapid Scan FTIR 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 


Rapid Scan X-Ray Fluorescence (RSXRF) 

► X-Ray Fluorescence Imaging: Elemental and 
Chemical Speciation Mapping of Biological Systems 


Reaction Paths and Rates 

Ron Elber 

Institute for Computational Engineering and Sciences, 
Department of Chemistry and Biochemistry, 
University of Texas at Austin, Austin, TX, USA 

Introduction 

Processes in molecular biology are selected by 
kinetics, namely, the rate in which one type of 
molecule is converted to another. Processes that 
happen in a cell are the quickest possible and are 
not necessarily the route that leads to the most stable 
products. Hence, understanding kinetics in biophys¬ 
ics is essential for consideration of function. Kinetic 
control keeps the cell far from equilibrium, as it 
should be, and allows for a timely response to envi¬ 
ronmental changes. 

Quantification of kinetics is done with measure¬ 
ments of rate, the change in concentration, or number 
of molecules per unit time. In this entry different 
approaches are considered for the computation of 
rates and the mechanisms that determine them. The 
complete collection of cell processes and their rates 
makes it possible to generate models of cells and their 
behavior and is studied in the field of Systems Biology 
(Alon 2006). Collecting all the kinetic information is 
however tedious and difficult due to the large number 
of measurements required. This is the point in which 
computational estimates of rates can be invaluable. 
Indeed, significant theoretical and computational 
research focuses on calculations of rates. 


The Rate 

Consider the rate, dria/dt. It is the change in the pop¬ 
ulation, nj,t), of a state a, as a function of time t. The 
rate of transition from state to state, say oc to fS that is 
written as dn^^p/dt is of particlar interest. The state is, 
for example, the concentration of a particular molecule 
in the cell or having a protein in an active conforma¬ 
tion. One of the most efficient ways of computing rates 
is to use information provided by the reaction path 
approach. This is the focus of the present entry. 

Reactions in general are motions between two sta¬ 
ble states a and fi on a potential energy surface U in 3N 
dimensions, where N is the number of particles. The 
energy surface has two minima Ua and Up at coordi¬ 
nates I'a and rp. Note the difference between a state a 
which is a set of configurations and the minimum 
coordinate which is just one structure. If a single 
structure is used to represent the state a then /o, is 
a natural choice. The conformations of the a state 
may be defined as the set of all coordinates r that 
following local minimization of the energy U(r) con¬ 
verge to Tj,. A more rigorous definition of states uses 
timescale separation in which the timescale to exit 
a state is signihcantly longer than the time of visiting 
any other position within the state. 

The Steepest Descent Path 

A reaction path is defined as a curvilinear coordinate 
Rap{s) which is connected at its two end points to the 
reactants and products, respectively. It is a function of 
a parameter s {s G [0,1]) such that s = 0 is the mini¬ 
mum of the reactant (r^) and 5=1 the product (rp). The 
reaction path is a curve that minimizes a cost function. 
Different measures of cost lead to potentially different 
paths. One of the most popular definitions of reaction 
path, with a particular choice of cost, is the Steepest 
Descent Path (SDP Heidrich (1995)). Eor q^pis) to be 
the SDP, the gradient of the potential at any point along 
the path is exactly parallel to the path slope (Pig. 1). 

^U(qfp‘’{s)) II ( 5 ) 5 e [0, 1] (1) 

A simple interpretation of the SDP is the curvilinear 
coordinate connecting minima and rp with a barrier 
(a high energy point along the path) which is as low as 
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Reaction Paths and Rates, 
Fig. 1 A two-dimensional 
potential (Fukui et al. 1975) 
presented as a contour plot 
where blue represents the 
lowest energy values. The 
potential has three minima and 
two saddle points. Also shown 
is a discrete representation of 
the Steepest Descent Path 
(SDP). The discrete points are 
connected to guide the eye 



possible compared to alternative pathways. This bar¬ 
rier is a saddle point with a single negative eigenvalue 
of the Hessian (//). The Hessian is the second deriva¬ 
tive matrix of the potential energy {d^U/dzidz^ and 
its eigenvalues determine the local curvature of the 
potential. At a minimum the eigenvalues are positive 
and at a maximum they are negative. This choice of 
a path is appealing, since at sufficiently low tempera¬ 
tures, trajectories that transition from one state to the 
next are likely to pass near a lowest energy barrier. 
A lowest energy barrier is provided by the SDP. As 
a result, the SDP is a plausible qualitative description 
of the process, and its energy barrier (especially if it is 
much larger than the thermal energy kT) is the bottle¬ 
neck of the process. 

How to calculate the SDP is a problem that can be 
addressed in two major ways: by mode following (Wales 
2003), and by optimization of a whole path (Olender and 
Elber 1997). A formal definition in which the SDP 
is obtained by minimizing the energy starting from 
a predetermined saddle point is not considered here. 
If the saddle point is already known, the problem of 
determining the SDP is straightforward. Therefore, this 
definition is not helpful computationally. 

In mode following, a stationary point of the poten¬ 
tial energy is sought, that is, a configuration with zero 
net force. A displacement A from the current position 
vector r is computed such that Vf/(r-f A) = 0. 
The force —VU at the shifted point is 
Vf/(r -f A) = VU{r) + H{r) ■ A. With this expansion 


of the force at hand the Newton Raphson minimization 
step is A = —//^'V[/(r). The Hessian matrix//(/•) can 
be written as a sum of one rank matrices: H — 
where e, is an eigenvector of H, and an eigehvalue. 
To ensure stability of the algorithm, eigenvalues that 
are zeros (e.g., six overall translations and rotations of 
the molecular system) are scaled to values larger than 
the maximum eigenvalue found in the Hessian of the 
system \ = Ak + C Xk = Q C Imax to obtain the 
scaled Hessian. 

Finding a reaction coordinate is, however, different 
from minimization. Along a reaction path, the system 
may progress up in energy, however during a minimiza¬ 
tion the energy is monotonically nonincreasing. The 
search for the path starts, without loss of generality, 
at a minimum energy coordinate in which all the 
eigenvalues of the scaled Hessian are positive. 
A small eigenvalue, Xj, is shifted to a negative 
value Xj = Xj — rj < 0. The modified Hessian is 
H — — V^j^j + C^ ete'p. and now A is given 

/ k 

by A = —H^^VU{r). Since lyis negative the step is 
uphill on the energy surface in the direction of Cj while 
the energy is minimized in all other orthogonal directions 
i 7 ^ j. Once a new coordinate set is obtained, the calcu¬ 
lation of the displacement is repeated until a complete 
path between fixed points (minima and saddle points) is 
recovered. 

Mode following is popular in studies of relatively 
small molecules and therefore also in ab-initio 
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calculations. Considerable success in applying these 
protocols to larger molecular systems was also made 
(Wales 2003). Nevertheless, it is difficult to apply 
mode following for large biological molecules. First, 
the creation, manipulation, and storage of the Hessian 
matrices is computationally expensive for large mole¬ 
cules. Second, on rough energy landscapes, typical of 
biological macromolecules, there are many local min¬ 
ima and saddle points that may hinder the correct path. 
The mode following approach is making a local choice 
based on the local force and the local Hessian. The 
choice does not incorporate information on the end 
points. If multiple SDPs exist and the interest is in 
a reaction coordinate connecting two predetermined 


states (or minima) then mode following may not be 
efficient. 

A number of techniques were therefore introduced 
to compute reaction coordinates while exploiting 
information on the two end points. Some of these 
approaches hnd an optimal path which is not the 
SDP. To keep the current discussion focused, only 
algorithms that provide the SDP are considered. One 
such approach is the Locally Updated Planes (LUP) 
(Ulitsky and Fiber 1990) that simulatenously rehne the 
whole path between minima and rp. With the defi¬ 
nition of the SDP (Eq. 1) at hand an equation for 
quenching an arbitrary initial path to an SDP is: 


lim 

f—»oo 


d 

dt 


q^fi{s,t) = -Vt/ • 



Vqo,p{s,t)S/q^fi{s,t) 

t)\^ 



0 qo.p{s,t ^ oo) 




( 2 ) 


For computational purpose, the curvilinear path 
between the two minima is approximated by 
a discrete set of configurations along the path 
q^fiis) (Fig- !)■ With a discrete 


description of the path at hand a quenching procedure 
for the discrete path is: 


lim 

t—^OO 


dzj 

dt 


-VU(z,-) 


I - 


(z;+i — z,'_i)(z/+i — z,'_i)' 


|(Zi+i -z,-_i) 


z,(f ^ (X)) ^ zf®^ 


(3) 


Equation 3 couples first order differential equations for 
all z,-s with the exception of the first and last points that 
are held fixed. These differential equations are inte¬ 
grated as a function of the dummy variable t until the 
path is fixed. A stationary solution of Eq. 3 is the 
steepest descent path. A similar concept was put for¬ 
ward later by the Nudge Elastic Band (NEB) method 
(Jonsson et al. 1998) and by the zero temperature string 
(Weinan et al. 2002). In particular, the last two algo¬ 
rithms explicitly enforce a small difference between 
sequential points z, and Zi+j either by a penalty 


function or using Lagrangian constraints. These 
enforcements add to the numerical stability of the 
LUP algorithm. 

Minimum Free Energy Path 

The more recent string method went further by the 
construction of a minimum free energy path (MFEP) 
(Ren et al. 2005). Equation 3 is modified to: 


lim 

f—>-oo 


d 

dt 


q^xisis, t) 




^ V<7„/j(^, t)' 


V<7a/i(^,t)^ . 

ds 
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c = 0 (4) 
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where is a Lagrange multiplier ensuring uniform 
distribution of points along the path. The average 
(...)^ is over all degrees of freedom with the excep¬ 
tion of selected coarse variables. For example, in 
a description of peptide folding, the backbone tor¬ 
sions are a natural choice for coarse variables and 
the MFEP is computed in the subspace of the 
coarse variables. Rather than a single (minimum 
energy) path to decide the weight of a process, an 
average (typically canonical) over the fine variables 
is calculated resulting in an average coordinate and 
an average force. The MFEP includes contributions 
from entropy and better reflects the dynamics of the 
system at finite temperatures. On the other hand, 
the SDP and the minimum free energy path are 
static objects. One is a geometrical concept (SDP), 
and the other an equilibrium time-independent 
property (MEEP). It is therefore still not obvious 
how the knowledge of a reaction path is translated 
into estimates of time-related properties such as 
rate, which is the ultimate connection to experiment 
that is sought. 


Functional Optimization 

Other approaches to compute minimum energy/free 
energy paths that exploit information about the 
end points are available. Perhaps the most significant 
alternatives are contributions based on optimization 
of functionals. One example is the following: Let 
S[( 7 c(y 9 (i)] be a functional of a path (5'[q'a/;(^)] = 
1 

f The SDP is the path with the min- 

0 

imal value for the functional (Olender and Elber 1997). 
The advantage of this approach is the availability of 
a functional to minimize. In a minimization, a large 
change to the path can be made and accepted as an 
improvement as long as the value of the functional 
is reduced. In contrast, the LUP method (Eq. 2) 
does not have a monitoring function that is mono- 
tonically decreasing for better paths. Therefore, 
small steps must be used in LUP when integrating 
the quenched equations of motions. Hence, func¬ 
tional optimization has the advantage of stability of 
calculation and the availability of an absolute qual¬ 
ity measure. 


Transition State Theory 

Rate of change from state a (reactants) to state /I 
(products) is measured as the number of trajectories 
that enter state /I from state a per unit time and stay 
there at the time of the measurement. The definition 
requires a boundary in conformation or phase space 
for the state ji in order to determine the time of entry. 
Eor the definition of this boundary, the reaction path 
approach may come in handy. It is suggested to place 
the entry to /I as the hypersurface normal to the 
reaction coordinate at the highest energy (or free 
energy) point of the reaction coordinate between 
a and fi. This hypersurface is hard to reach and if it 
is crossed once (on the way forward) chances that it 
will be crossed again in the same trajectory are slim. 
Hence, the existence of a surface of no return is 
assumed. The oldest theory for rate calculations 
that exploits this observation is the Transition State 
Theory (TST) (McQuarrie and Simon 1997). 
The special hypersurface mentioned above is called 
the Transition State (TS), even though it is not 
a state in the usual sense. A “state” is expected to 
be metastable and the TS is not. 

In TST, the number of molecules that pass through 
the TS at unit time is counted. The probability density 
(probability per unit length) of being at the top of the 
barrier is the ratio of the partition function of the 
system constrained to the barrier divided by the parti¬ 
tion function of the reactants. The rate R is given by the 
product of this ratio and a frequency factor at the TS. 


U{x) 
2kT kT 
dr =dx ■ dp 


R = 


M pexp 


h Q 


pM-^p\ JJ^exp[-£'] 

2kT J ^ J^exp[-£] 


(5) 


where p is the momentum and M is the diagonal mass 
matrix. The notation x’ and p’ means the phase 
subspace at the TS fixing, the value of the reaction 
coordinate. The integration over {x^p) is over the 
phase space of the reactant. A phase space volume 
elemet is denoted by dF and the total energy by E. 
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The partition function Q’ is of the system constrained 
to the TS. The reaction path is making a useful 
contribution to the calculations of Eq. 5 by determining 
the TS subspace {x’,p’). 

The critical assumption of TST is the commitment 
of the trajectories at the TS to move on to the product 
state. It implies, for example, that if a trajectory starts 
at the TS with velocities pointing forward, it will end 
up in the product state with probability one (ideal TST 
conditions). This is however not always the case and it 
is possible to find trajectories that turn around and 
end up in the reactant after crossing the TS. This 
phenomenon of recrossing makes the rate estimates 
of TST an upper bound. Recrossing is more likely if 
the energy barrier of the TS is not large compared to 
thermally accessible energies. Recrossing is also likely 
if the energy landscape is rough including a number of 
minima and barriers along the reaction path. In these 
cases, the TST concept may not be useful. 

Transmission Coefficient 

Obviously, there is some gray area in which 
recrossings are important but a dominant barrier is 
still present; hence, at some level, the TST approach 
is expected to be useful. An example is when the TS 
position along the reaction path is not identified accu¬ 
rately. A correction for TST in this domain is the 
transmission coefficient k (Chandler 1978) in which 
the rate R is replaced by kR. The transmission coeffi¬ 
cient is the fraction of trajectories ending at the product 
starting from ideal TST conditions. The transmission 
coefficient provides, in principle, an exact rate. How¬ 
ever, an efficient calculation of the transmission 
coefficiet depends on how quickly the system settles 
in the product or the reactant sides. If the TS is easily 
recrossed (many times) and settling time is long, esti¬ 
mating the rate with the transmission coefficient will 
be expensive. This is the case for a low barrier and/or 
rough energy landscape with a broad distribution of 
barrier heights. 

Sampling Along Pathways 

Recent simulation approaches exploit more fully 
reaction paths, exploitation that goes beyond the single 
value of q used in TST. In these approaches. 


trajectories are computed between hypersurfaces 
along the reaction coordinate, and the fluxes through 
the interfaces are estimated as a way to compute rates. 
By now there are numerous variations of these 
technologies, and descriptions of all of them is beyond 
the scope of the present entry. A detailed discussion is 
given to only one class of these methods (Transition 
Interface Sampling (van Erp et al. 2003), Forward Flux 
(Allen et al. 2005), or Milestoning (Kirmizialtin 
and Fiber 2011)). 

A trajectory is a time-continuous motion that obeys 
particular dynamical laws. A reactive trajectory does 
not necessarily follow a reaction path which is 
a condensed time-independent presentation of the 
process. The reaction path does not have inherent 
formulation of kinetics. On the other hand, following 
the dynamics explicitly has its limitations. A trajectory 
with a single transition from reactant to product is 
insufficient to account for experimental measurements. 
Measurements are typically averages over a large 
number of events. In contrast, a single reaction path 
with the help of statistical rate theory, such as TST, 
approximately accounts for an ensemble of events. 

In time-dependent calculations, a large ensemble of 
trajectories with many transition events is needed to 
study the process in a statistically meaningful way. 
A large number of trajectories or events is expensive to 
compute. Moreover, the broad timescales of biological 
processes (e.g., protein conformational transitions are in 
the range of microseconds and milliseconds) make the 
calculations of individual transitional events expensive 
and in many cases impossible with available computa¬ 
tional resources. The combination of the reaction path 
approach (which does not contain explicit and expensive 
to compute time information) with trajectory fragments 
is particularly promising for extending the timescale of 
rate calculations. 

In interface-based methods, trajectories are initiated 
at the interface similar in spirit to the calculation of the 
transmission coefficient in TST. However, a sequence of 
interfaces along the reaction coordinates is considered. 
Trajectories are terminated (and recorded) when they hit 
a forward or backward interface or Milestone. The idea 
of partitioning phase space to cells and computing rates 
beween cells was adopted by numerous techniques 
including the Macrostate model (Shalloway 1996), 
Milestoning (Kirmizialtin and Fiber 2011), the 
Markov State Model (Chodera et al. 2007), and more 
(Huber and Kim 1996). In one particular implementation 
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(Milestoning (Kirmizialtin and Elber 2011)) trajectories 
are used to determine a local transition operator between 
interfaces, Kij{t). Let «, be the number of trajectories 
initiated at interface i typically obtained by sampling 
constrained to the Milestone. Let «y(f) be the 
number of trajectories initiated at Milestone i at 
time r = 0 and reached for the first time another 
Milestone (/) after time t. The local transition 
operator is estimated as Kij{t) = riijit) / iii. 
The flux, G, is computed from the integral 

t 

equation Gy(f) = r]jS{t^) + f Gj(f)Kij{t — l')dt' 

i 0 

(Faradjian and Elber 2004), where rjj is the initial popu¬ 
lation of state /. The overall mean first passage time (t) is 
an integral over the flux entering the final state weighted 

OO 

by time: (t) = J tGf{t)dt. The inverse of the mean first 

0 

passge time is an estimate for the rate constant. 
Because only short trajectories are computed to estimate 
the local transition operators, the calculations are orders 
of magnitude more efficient than straightforward 
Molecular Dynamics. 

Summary 

Computations of reaction pathways on complex energy 
surfaces in conjunction with short Molecular Dynamics 
trajectories that connect Milestones or interfaces supple¬ 
ment experimental techniques. Computations are likely 
to enhance understanding of cell function at the molec¬ 
ular level. Examples for conformational transition in 
biological molecules investigated with reaction path 
approaches can be found in Elber (2011). 
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Definition 

Reaction-induced infrared difference spectroscopy 
records changes in infrared absorption which are 
caused by a reaction. Applied to proteins, it studies 
the relationship between protein structure and protein 
function. The chapter summarizes ways to initiate pro¬ 
tein reactions, interpretation of infrared difference 
spectra, enzyme activity measurements, and applica¬ 
tions of the method to bacteriorhodopsin, cytochrome 
c oxidase and related heme copper oxidases, Car'^- 
ATPase, pyruvate kinase, and the Alzheimer’s peptide. 
These selected examples investigate biological pro¬ 
cesses like proton pumping, ion transport, ligand bind¬ 
ing, catalysis, and (mis)folding of polypeptides. 

Introduction 

Elucidating the molecular mechanism of protein reac¬ 
tions is a major challenge for the life science commu¬ 
nity. Applying infrared spectroscopy to this topic 
exploits several of its advantages: (1) high time reso¬ 
lution (<1 ps), (2) universal applicability from small 
soluble proteins to large membrane proteins, (3) the 
high molecular information content, and (4) a sensitiv¬ 
ity high enough to detect a change in bond strength of 
one bond in a large protein. 

In favorable cases, the transition between two states 
of a biomolecule can be observed in the infrared 
absorption spectrum. More usually, the effects are so 
small that subtracting spectra obtained from different 
samples is not sensitive enough (see the chapter by 
Deng and Callender [► Isotope Edited Raman and 
Infrared Difference Spectroscopy Studies of Protein 
Structure], however, for a special technique). Instead, 
the reaction between the two states of interest has to be 
initiated directly in one sample (Braiman and Roth¬ 
schild 1988; Zscherp and Barth 2001; Eabian and 
Mantele 2002; Kdtting and Gerwert 2005; Berthomieu 
and Hienerwadel 2009). This technique is termed reac¬ 
tion-induced infrared difference spectroscopy. In 
a typical experiment, the biomolecule is prepared in 
a stable state A and a spectrum of this state is mea¬ 
sured. Then the reaction is triggered, the protein pro¬ 
ceeds to state B and again the spectrum is recorded. 
Erom the spectrum recorded before the start of the 
reaction (state A) and the spectrum recorded after the 
reaction (state B) a difference spectrum is calculated. 


This reflects the changes in infrared absorption associ¬ 
ated with the reaction. Instead of only one final state B, 
several transient states may be adopted in the course of 
the reaction and these can be followed by time- 
resolved infrared spectroscopy [► Eourier Transform 
Infrared Spectroscopy for Biophysical Applications: 
Technical Aspects]. An example is shown in Eig. 1. 

A difference spectrum shows bands only from 
“active” groups, that is, those that are affected by the 
change from state A to B. Negative bands are charac¬ 
teristic of the state before the reaction (A) and positive 
bands of the state(s) after or during the reaction (B). All 
“passive” residues are invisible in the difference spec¬ 
trum, so that the number of observed groups is dramat¬ 
ically reduced compared to the absorption spectrum. 
This simplification enables an interpretation on the 
molecular level, for example, concerning length and 
strength of individual bonds in a large protein. 

Triggering Protein Reactions 

A cracial problem in reaction-induced difference spec¬ 
troscopy is how to trigger the protein reaction of interest. 
The number of methods for this has constantly increased 
in the last decades and the main approaches are (Zscherp 
and Barth 2001; Eabian and Mantele 2002; Kdtting 
and Gerwert 2005; Siebert and Hildebrandt 2008; 
Berthomieu and Hienerwadel 2009): 

1. Inducing photoreactions by illumination (Noguchi 
2007). 

2. Releasing compounds from photosensitive precur¬ 
sors - named caged compounds - by illumination 
(Cepus et al. 1998; Goeldner and Givens 2005). 

3. Rapid mixing techniques (Eabian and Naumann 2004; 
Kdtting and Gerwert 2005). These are difficult to 
apply because of the viscous consistency of a typical 
sample and the small path length of less than 10 pm 
due to the strong *H20 absorption (<50 pm for ^H20). 

4. Attenuated total reflectance [► Eourier Transform 
Infrared Spectroscopy for Biophysical Applica¬ 
tions: Technical Aspects] (ATR) measurements 
(Rich and Iwaki 2007). In an ATR experiment, 
a sample is placed on a crystal and infrared radiation 
is passed into the crystal at an angle such that the 
light undergoes total internal reflection at the crys¬ 
tal-sample interface. The total reflection process 
generates an evanescent wave that extends about 
one wavelength away from the crystal surface and 
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Reaction-Induced Infrared 
Difference Spectroscopy, 
Fig. 1 Structural changes in 
the active sarcoplasmic 
reticulum Ca^'^-ATPase 
obsei'ved by time-resolved 
infrared difference 
spectroscopy. Photolytic 
release of ATP from caged 
ATP triggered the transition 
from the bound form to 
the ATP complex (1), the first 
phosphorylated intermediate 

(2) , and the second 
phosphorylated intermediate 

(3) . The spectra are the 
absorption of the actual state 
minus the absoiption of the 
initial state 



senses the absorption of the sample in this layer. 
The sample is usually a him, prepared by drying, 
with a buffer solution above it. The buffer can be 
exchanged to induce reactions in the sample. In 
a recent extension of the ATR technique, 
a dialysis membrane separates a sample compart¬ 
ment close to the ATR crystal and a reservoir. In 
this way, the medium in the reservoir can be altered 
without disturbing the protein sample in the sample 
compartment. A him is not required, making the 
technique also applicable for soluble biomolecules. 

5. Temperature- and pressure-jumps to study unfolding 
of proteins [► Infrared Spectroscopy of Protein Fold¬ 
ing, Misfolding and Aggregation] (Callender, and 
Dyer 2006). 

6. Equilibrium electrochemistry to initiate redox 
reactions using an ultrathin-layer spectroelectro- 
chemical cell. 

7. Photoreduction by photoexcitable electron donors 
to induce redox reactions. 


Interpreting Difference Spectra 

Infrared difference spectra are sensitive to the same 
factors that affect absorption spectra. They provide 
molecular information on 

• Changes of chemical structure, for example, 
changes of protonation state and the conversion of 
substrates to products in enzymatic reactions 

• Changes in conformation 


• Changes in the interaction with the environment 

• Changes in conformational and environmental 

heterogeneity, for example, when a molecule 

binds to a protein 

The sensitivity of the infrared spectrum to many 
different factors is an advantage and a drawback at the 
same time: at hrst, an infrared difference spectmm may 
seem hopelessly crowded with many overlapping bands. 
Therefore, most features in a difference spectrum are not 
readily interpretable and a hrst step is to regard the 
spectra as a hngerprint of the stmctural change (Barth 
2007). Similar approaches have a long tradition in huo- 
rescence and absorption spectroscopy. The approach can 
be used to identify and compare reaction intermediates 
and to classify and quantify conformational changes. 

For a molecular interpretation of the spectra, the 
difference bands need to be assigned to particular 
molecular groups. A hrst, tentative assignment can be 
based on the spectral position (Barth 2007) of the 
observed bands. However, several assignments are 
possible for most difference bands and additional 
experiments are necessary to base the assignments on 
more solid grounds. These comprise modihcation of 
the sample (mutation, isotopic labeling[^ Isotope 
Edited Raman and Infrared Difference Spectroscopy 
Studies of Protein Structure], and modihcations to sub¬ 
strates or protein cofactors), study of model compound 
spectra, and comparison with computed spectra 
(► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra and ► QM/MM Calculations of 
Vibrational Spectra. There are at least three QM/MM 
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Reaction-Induced Infrared Difference Spectroscopy, 
Fig. 2 Infrared spectra of ATP and of its hydrolysis products 
ADP and phosphate 

contributions in the entire encyclopedia, only one is in 
the vibrational spectroscopy section. The link should 
refer to the latter entry). 

Enzyme Activity Measurements with 
Infrared Spectroscopy 

Enzymatic activity is usually measured indirectly 
because substrate and product cannot be distinguished 
in the UV or visible spectral range. This “state of the 
art” of activity measurements implies that for a “new” 
enzymatic reaction, an activity assay or a suitable 
substrate analogue has to be developed. This is often 
time-consuming and techniques are highly desired that 
monitor enzymatic reactions directly. 

Infrared spectroscopy can provide such a direct, 
“online” monitoring of enzymatic reactions. When a 
molecular structure is modified in an enzymatic reac¬ 
tion, the infrared spectrum is altered and changes in 
infrared absorption can be followed to monitor the 
progress of the reaction (Barth 2007). As example. 
Fig. 2 shows the infrared spectra of ATP and its hydro¬ 
lysis products ADP and phosphate. They are clearly 
different because ATP hydrolysis converts a P02~ 
group into a P 03 ^~ group and this has been used 
to measure ATPase activity. In contrast to infrared 
spectroscopy, UV spectroscopy cannot discriminate 
between reactants and products in this case. 


Selected Mechanistic Studies of Proteins 

Overview 

In the following, reaction-induced infrared difference 
spectroscopy is illustrated using three different proteins 
as examples. Reference to the original work mentioned 
here can be found in a previous review (Barth 2007). All 
three proteins pump ions, driven either by light, redox 
potential, or ATP. Therefore they required different tech¬ 
nical solutions for the application of the difference prin¬ 
ciple to the biological system: bacteriorhodopsin 
highlights light-induced difference spectroscopy, cyto¬ 
chrome c oxidase has been studied with a variety of 
techniques, and the Ca^'^-ATPase has been triggered by 
photolytically induced concentration jumps. Other pro¬ 
teins commonly studied with difference spectroscopy are 
rhodopsin, photosynthetic reaction centers, and Ras pro¬ 
teins (Goeldner and Givens 2005; Kotting and Gerwert 
2005; Callender and Dyer 2006; Noguchi 2007; Siebert 
and Hildebrandt 2008; Berthomieu and Hienerwadel 
2009). Recent work in our group investigated binding 
of the substrate phosphoenolpymvate to pymvate kinase 
using dialysis-coupled ATR spectroscopy (Kumar and 
Barth 2010) and aggregation of the Alzheimer”s peptide 
(amyloid (3 peptide, AP), which was initiated by a sudden 
pH drop upon photolysis of caged sulfate (Peralvarez- 
Marin et al. 2008; Mandal and Barth, unpublished). In 
the former work, bands of the bound substrate were 
identified in the difference spectmm by isotopic labeling. 
Their spectral positions indicated only little change in 
bond strengths compared to an aqueous environment and 
thus suggested that the substrate adopts a reactive con¬ 
formation only after binding of the second substrate 
ADP. Regarding short variants of the Ap peptide, we 
observed the formation of two different aggregates with 
P-sheets. One of them was able to grow by recruiting 
monomeric peptides as seen by a downshift of the main 
P-sheet band near 1,620 cm” *. The antiparallel architec¬ 
ture of these aggregates contrasts with the parallel archi¬ 
tecture of the fibrils of the full-length peptide. 

Bacteriorhodopsin 

Bacteriorhodopsin is a light-driven proton pump of 
Haloarchea. Light is absorbed by the chromophore 
retinal, which is covalently linked to Lys-216. Light 
absorption causes isomerization of the retinal from 
aW-trans to 13-cA. The strained photoisomerized state 
of the protein relaxes back to its ground state in a series 
of reaction steps that are accompanied by proton 
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transfer from the intracellular side to the extracellular 
side of the plasma membrane. 

Bacteriorhodopsin was one of the first proteins stud¬ 
ied with reaction-induced infrared difference spectros¬ 
copy since its photocycle can be conveniently triggered 
by light (Maeda 1995; Heberle 1999; Dioumaev 2001; 
Kotting and Gerwert 2005; Siebert and Hildebrandt 
2008). The robustness of bacteriorhodopsin made it 
possible to apply time-resolved techniques which 
require numerous repetitions of the flash-induced exper¬ 
iment on a single sample, like single-wavelength 
measurements with a conventional light source, (strobo¬ 
scopic FTIR spectroscopy [► Fourier Transform Infra¬ 
red Spectroscopy for Biophysical Applications: 
Technical Aspects] and the step scan [► Fourier 
Transform Infrared Spectroscopy for Biophysical 
Applications: Technical Aspects]) technique. A time 
resolution in the submicrosecond range and trapping of 
intermediates at low temperature enabled characteriza¬ 
tion of the structural changes upon proton pumping. Of 
particular significance were the identification of the 
proton transporting groups - Asp-85, Asp-96, and 
a water cluster close to Glu-194 and Glu-204 - and the 
characterization of their p/i^a changes in the photocycle. 
Another important finding was that the 13-cA confor¬ 
mation of retinal is twisted shortly after photoexcitation 
and relaxes subsequently to the conformation adopted 
during most of the remaining photocycle. 

Cytochrome C Oxidase and Related Heme-Copper 
Oxidases 

Cytochrome c oxidase [► Cytochrome c Oxidase 
(Complex IV)] is the terminal electron acceptor in the 
respiratory chain and reduces an oxygen molecule 
to water. The reaction needs the input of four electrons 
from cytochrome c and four protons from the 
mitochondrial matrix or bacterial cytoplasm. In 
addition, four protons are pumped from matrix to 
intermembrane space in mitochondria or from cyto¬ 
plasm to periplasm in bacteria. The enzyme contains 
four redox-active metal centers: Cua, heme a, heme 03 , 
and Cub. The latter two constitute the catalytic center 
where the reduction of oxygen takes place. 

Cytochrome c oxidases and related heme-copper 
oxidases have been studied with a variety of infrared 
spectroscopic techniques as reviewed previously 
(Gennis 2003) including photodissociation of carbon 
monoxide (CO), electrochemistry, photoreduction, 
and ATR spectroscopy. 


Work on CO photodissociation benefits from CO 
absorbing in a spectral window without overlapping 
absorption from other protein components. CO binds 
to heme in the dark, dissociates upon illumination, 
and binds subsequently to Cub at low temperatures. At 
room temperature, it can leave the enzyme upon pho¬ 
todissociation and appear in the bulk solution. Already 
in 1981, CO photodissociation experiments have led to 
significant conclusions regarding the nature of the 
binding sites, their distance, and the structural hetero¬ 
geneity of the CO complexes. 

As cytochrome c oxidase pumps and consumes pro¬ 
tons, much work has been devoted to identify the 
protonation sites. Glu-286 of the Rhodohacter 
spaeroides oxidase and corresponding residues in the 
Pamcoccus denitiificans oxidase and in cytochrome 
boj, from E. coli are located at the end of one of the 
proton pathways to the catalytic center. Infrared dif¬ 
ference spectroscopy has shown that they are proton- 
ated in the oxidized and reduced states and involved in 
electron transfer reactions. Glu-286 undergoes proton¬ 
ation changes during catalysis, indicating that its pK^ is 
modulated by a changing protein environment. 

Evidence for the participation in proton transfer 
reactions has also been obtained for the ring 
D propionate of heme a, for a nonconserved Asp or 
Glu residue of bovine heart oxidase (possibly Asp-51), 
for Asp-75 of E. coli cytochrome hos and for a Tyr, 
possibly belonging to the unusual, covalently linked 
His and Tyr residues close to Cub. 

Ca^^-Atpase 

The sarcoplasmic reticulum Ca2-i—ATPase [> Fi- 
ATPase: Fundamental Properties and Structure and 
► ATPase: Overview] belongs to the family of P-type 
ATPases, which are major players in active transport 
across biological membranes. It couples the pumping 
of two Ca^"^ to the hydrolysis of one ATP molecule. 
The ATPase exemplifies the use of caged compounds 
in infrared spectroscopy, which began in 1990. Mostly, 
the photolytic release of ATP from caged ATP or of 
Ca^"^ from a caged Ca^"^ was used to trigger several 
partial reactions of the enzyme (Barth 2008). 

One study mapped the interactions between protein 
and substrate ATP. ATP binding induces a change in 
protein conformation which alters the amide I 
absorption [► Infrared Spectroscopy of Protein Struc¬ 
ture] (1,700-1,610 cm”*) as shown in Fig. 3. These 
changes are different when particular functional 
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Reaction-Induced Infrared Difference Spectroscopy, 
Fig. 3 Mapping the ATP binding site of the Ca^'^-ATPase 
with infrared spectroscopy. Left and right panels', difference 
spectra caused by binding of ATP and ATP analogues: ATP 


{black), ADP (green), 3'-deoxyATP (purple), ITP (red), and 
2'-deoxyATP (blue). Middle panel', summary of the detected 
interactions 


groups of ATP are modified. Therefore, the conforma¬ 
tional change upon nucleotide binding depends, to 
a surprising degree, on individual interactions between 
ATPase and nucleotide. It has been concluded that the 
ATPase interacts with the y-phosphate, the ribose 
hydroxyls and the amino function of ATP. This has 
later been confirmed by X-ray crystallography. The 
missing of individual interactions produces more than 
just local effects: it affects the entire conformational 
change upon binding. This suggests a concerted 
conformational change for which all interactions 
need to be in place. 

Transfer of the nucleotidic y-phosphate to Asp-351 
follows nucleotide binding. In this reaction, an inter¬ 
esting observation has been made with ITP: the phos¬ 
phorylation spectrum obtained with ITP shows 
additional signals in the amide I region as compared 
to ATP, which are similar to ATP binding signals. In 
contrast, the previous step of ITP binding gave much 
smaller signals than observed for ATP binding. Thus in 
the phosphorylation reaction with ITP, the enzyme 
seems to catch up on a conformational change that 
could not be achieved in the binding step because the 
interactions between protein and base moiety were 
impaired. 


In another study, the second ATPase phosphoenzyme 
intermediate has been investigated with the aim to under¬ 
stand its high rate of hydrolysis of the aspartylphosphate 
ester bond. An oxygen isotope exchange at the phosphate 
group produced a difference spectrum, shown in Fig. 4a, 
to which only the isotope exchange reaction contributed. 
This selectively observed the phosphate vibrations 
against a background of ^50,000 protein vibrations. 
The experiment exemplihes that difference spectroscopy 
drastically simplifies the spectrum and that this enables 
to determine the vibrational frequencies of particular 
groups. The latter Is the prerequisite for a molecular 
interpretation. 

The phosphate frequencies revealed that the sclssile 
bond between the covalently bound phosphate group 
and the protein is lengthened by ^0.1 A by the protein 
environment, which substantially weakens the bond 
and facilitates its hydrolysis as illustrated in Fig. 4b. 
Weakening and elongation of the P-0 bond is not 
accomplished by external mechanical forces that pull 
the bond apart. Instead it is an in-built response of 
aspartyl phosphate to a shift of interactions from phos¬ 
phate to aspartyl oxygens, with only subtle changes in 
distances required. This provides an elegant “handle” 
for the enzyme to control hydrolysis. 
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Reaction-Induced Infrared Difference Spectroscopy, 
Fig. 4 Measuring bond lengths with infrared difference spec¬ 
troscopy. (a) Difference spectrum of the ’^O to '*0 isotope 
exchange at the phosphate group. Negative bands define the 
band positions of the '^O isotopomer. (b) Molecular interpreta¬ 
tion of the above data. The green aiTow indicates the P-0 bond 
elongation induced by the enzyme environment. Narrow dashed 
lines indicate weaker interactions compared to the model com¬ 
pound acetyl phosphate in water. The thick dashed line illus¬ 
trates stronger interactions. Bond elongation is caused by weaker 
interactions to the terminal phosphate oxygens and stronger 
interactions to the aspartyl oxygens with the main effect at the 
bridging oxygen 

Outlook 

Reaction-induced infrared difference spectroscopy stud¬ 
ies the molecular mechanism of proteins in exquisite 
detail. This holds even for systems which are well char¬ 
acterized by other high-resolution methods because 
“FTIR spectroscopy is a more sensitive method to detect 
small structural perturbations than crystallography at the 
available resolutions” (Lanyi and Schobert 2007). New 
methods to perturb proteins or to initiate protein reac¬ 
tions will expand the number of systems that can be 
investigated and will also pave the way for biotechnol¬ 
ogy applications. Of particular interest here are a number 
of mixing devices that have recently been developed and 
which aim at making infrared spectroscopy as univer¬ 
sally applicable as UV-visible spectroscopy. Thus, infra¬ 
red spectroscopy will continue to contribute to our 
understanding of proteins. 


Summary 

Reaction-induced infrared difference spectroscopy of 
proteins enables detailed characterization of enzyme 
function on the level of single bonds of proteins, cofac¬ 
tors, or substrates. Discussed are methods to initiate 
protein reactions in infrared samples, general aspects 
of spectra interpretation, measurements of enzyme 
activity, and exemplary studies of protein function. 

Cross-References 

► ATPase: Overview 

► Cytochrome c Oxidase (Complex IV) 

► Fi-ATPase: Fundamental Properties and Structure 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Infrared Spectroscopy of Protein Structure 

► Isotope Edited Raman and Infrared Difference 
Spectroscopy Studies of Protein Structure 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 


References 

Barth A. Infrared spectroscopy of proteins. Biochim Biophys 
Acta. 2007;1767:1073-101. 

Barth A. Structural dynamics of the Ca^'^-ATPase studied by 
time-resolved infrared spectroscopy. Spectroscopy. 2008;22: 
63-82. 

Berthomieu C, Hienerwadel R. Fourier transform infrared 
(FTIR) spectroscopy. Photosynth Res. 2009;101:157-70. 

Braiman MS, Rothschild KJ. Fourier transform infrared tech¬ 
niques for probing membrane protein structure. Annu Rev 
Biophys Biophys Chem. 1988;17:541-70. 

Callender R, Dyer RB. Advances in time-resolved approaches to 
characterize the dynamical nature of enzymatic catalysis. 
Chem Rev. 2006;106:3031^2. 

Cepus V, Ulbrich C, Allin C, Troullier A, Gerwert K. Fourier 
transform infrared photolysis studies of caged compounds. 
Methods Enzymol. 1998;291:223^5. 

Dioumaev AK. Infrared methods for monitoring the protonation 
state of carboxylic amino acids in the photocycle of bacteri- 
orhodopsin. Biochem (Moscow). 2001;66:1269-76. 

Fabian H, Mantele W. Infrared spectroscopy of proteins. In: 
Chalmers JM, Griffiths PR, editors. Handbook of vibrational 
spectroscopy. Chichester: Wiley; 2002. 

Fabian H, Naumann D. Methods to study protein folding by 
stopped-flow FT-IR. Methods. 2004;34:28^0. 






R 


2198 


Reactive Oxygen Species 


Gennis RB. Some recent contributions of FTIR difference spec¬ 
troscopy to the study of cytochrome oxidase. FEES Lett. 
2003;555:2-7. 

Goeldner M, Givens R, editors. Dynamic studies in biology. 
Weinheim: Wiley-VCFI; 2005. 

Fleberle J. Time-resolved ATR/FT-IR spectroscopy of mem¬ 
brane proteins. Recent Res Devel Appl 
Spectrosc. 1999;2:147-59. 

Kotting C, Gerwert K. Proteins in action monitored by time- 
resolved FTIR spectroscopy. Chemphyschem. 2005;6:881-8. 

Kumar S, Barth A. Phosphoenolpyruvate and Mg^"^ binding to 
pyruvate kinase monitored by infrared spectroscopy. 
Biophys J. 2010;98:1931^0. 

Lanyi JK, Schobert B. Structural changes in the 
L photointermediate of bacteriorhodopsin. J Mol Biol. 
2007;365:1379-92. 

Maeda A. Application of FTIR spectroscopy to the structural 
study on the function of hacteriorhodopsin. Isr J Chem. 
1995;35:387^00. 

Noguchi T. Light-induced FTIR difference spectroscopy as 
a powerful tool toward understanding the molecular mecha¬ 
nism of photosynthetic oxygen. Photosynth Res. 
2007;91:59-69. 

Peralvarez-Marm A, Barth A, Graslund A. Time-resolved infra¬ 
red spectroscopy of pH-induced aggregation of the 
Alzheimer AP(l-28) peptide. J Mol Biol. 2008;379:589-96. 

Rich PR, Iwaki M. Methods to probe protein transitions with 
ATR infrared spectroscopy. Mol Biosyst. 2007;3:398^07. 

Siebert F, Hildebrand! P. Vibrational spectroscopy in life 
science. Weinheim: Wiley-VCH; 2008. 

Zscherp C, Barth A. Reaction-induced infrared difference spec¬ 
troscopy of the study of protein reaction mechanisms. Bio¬ 
chemistry. 2001;40:1875-83. 


Reactive Oxygen Species 

Julio F. Turrens 

Department of Biomedical Sciences, Pat Capps Covey 
College of Allied Health Professions, University of 
South Alabama, AL, USA 

Synonyms 

Oxygen free radicals 


Definition 

The expression “Reactive Oxygen Species” (ROS) is 
used to describe a variety of chemical species derived 
from molecular oxygen (O 2 ). Some ROS involve O 2 
(in the ground state or electronically excited) as well as 


molecules containing partially reduced oxygen. 
Although the term ROS is usually used as a synonym 
of “oxidants,” some of them act often as “reductants” 
instead. Thus, the reactivity and chemical structure of 
different ROS varies substantially. The chemical struc¬ 
ture of the most common ROS and their toxicity to 
biological systems are described. 

Introduction 

Aerobic organisms utilize O 2 as a terminal acceptor in 
the electron transport chain for electrons produced 
during the oxidation of nutrients required for energy 
production. During this process, O 2 will accept a total 
of four electrons, becoming fully reduced and produc¬ 
ing two molecules of water (H 2 O). However, these 
four electrons are not transferred in a single step but 
as four independent single-electron exchanges. This 
peculiar chemical behavior of O 2 is a direct result of 
its electronic configuration. In its ground state, the 
outer electron layer of O 2 contains two unpaired elec¬ 
trons, both of them with the same spin and therefore 
located in separate orbitals (Fig. 1). This type of 
electronic configuration is called “triplet state.” 

Chemical bonds involve pairs of electrons with 
opposite spins. The two unpaired electrons in 
triplet O 2 (both with the same spin) cannot react with 
a chemical bond since the target pair has two electrons 
with opposite spins. Out of the four electrons involved 
in the process, three will have the same spin, and 
therefore this attack could effectively produce only 
one bond in which the electrons have opposite spins. 
In other words, O 2 in the ground state can only 
accept one electron at a time to pair up with each 
unpaired electron. 

Examples of ROS 

Singlet oxygen ('O 2 ): This species is a form of O 2 that 
results from a rearrangement of the two unpaired elec¬ 
trons found in the outer layer of triplet O 2 . In this 
process, one of the unpaired electrons in triplet O 2 
absorbs energy and changes its spin. The resulting 
species may have both electrons as a pair in one orbital 
(*Ag 02 ) or remain in two separate orbitals but 
with opposite spins (*Zg’^02) (Fig. 2). Since the two 
electrons in the outer layer have opposite spins. 




Reactive Oxygen Species 


2199 


P 


^ ^ .• 

11 11 ^ 

11 „ 

Reactive Oxygen Species, Fig. 1 Molecular orbitals for the 
outer electron layer in an oxygen molecule, following the 
general principles of LCAO (Linear Combination of Atomic 
Orbitals). In the ground state (triplet), the last two electrons fall 
in separate orbitals, which gives oxygen its biradical 
characteristic. This configuration explains why the reduction of 
oxygen must occur in single-electron steps 
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Reactive Oxygen Species, Fig. 2 Electronic configurations 
for two different forms of singlet oxygen. In both cases, the outer 
layer contains two electrons with opposite spins making this 
species very reactive toward other chemical bonds. (This figure 
was adapted from Puig Muset 1975) 


*©2 can react in one step with other chemical bonds 
since, in this case, both ^©2 and the target bond have 
opposite spins. Pairs of n electrons such as those pre¬ 
sent in double bonds (for example, in unsaturated fatty 
acids) are particularly susceptible to oxidation by *© 2 , 
making '©2 a very powerful oxidant in biological 
systems. Its formation depends on energy transfer 
from other excited molecules, including dyes such as 
rose Bengal or biomolecules such as porphyrins (Greer 
2006). For example, patients suffering from porphyria 
accumulate porphyrins in the skin which in turn cause 
tissue damage as they become electronically excited 
upon exposure to light. Scavengers rich in double 
bonds, such as P-carotenes, are effective antioxidants 
against '© 2 . 

Ozone (©3): This is another form of ©2 in the 
ground state. It is produced in the presence of ultravi¬ 
olet light through the reaction of molecular ©2 with 
atomic oxygen, ©zone also reacts with double bonds in 


organic molecules, causing them to split into carbonyls 
through a chemical process known as “ozonolysis.” It 
is very unstable, and although it is not metabolically 
produced in biological systems, it is used in industry as 
a disinfectant because of its reactivity toward organic 
matter (Bromberg and Koren 1995). Accumulation 
of ©3 in the stratosphere also protects the earth from 
ultraviolet rays (UV-B irradiation). 

Superoxide anion (©2 ); This species is produced 
as a result of a one-electron reduction of ©2 in the 
ground state. The single electron will pair up with 
one of the two unpaired electrons in the outer layer, 
forming © 2 ”. This species is quite stable and may act 
either as an oxidant or a reductant. As a reductant, it 
may react with transition metals, which in turn may 
reduce hydrogen peroxide (H 2 © 2 ) producing one 
hydroxyl radical (©H ), one of the most powerful 
oxidants found in nature (see below). Two molecules 
of © 2 ~ may also react with each other in a process 
known as “dismutation” (a reaction in which one mol¬ 
ecule acts as an oxidant and the other as a reductant), 
yielding H 2©2 and © 2 . Therefore, although © 2 ” is 
not very reactive, it is responsible for the formation 
of ©H • by being both a precursor of hydrogen peroxide 
and a reductant of transition metals (Liochev and 
Fridovich 1999). Furthermore, © 2 ” may also react 
with nitric oxide (another radical present in biological 
systems) to produce another powerful oxidant known 
as peroxynitrite. Aerobic species have evolved various 
antioxidant enzymes that catalyze the dismutation 
of © 2 ~ to Fl 2©2 and © 2 , preventing the reaction of 
© 2 ~ with transition metals, thus protecting the cell 
against ©H- formation. These metalloenzymes are 
known as ► superoxide dismutases and include metals 
such as Cu, Fe, Mn, or Ni in their active sites 
(Fridovich 1995; Zelko et al. 2002; Turrens 2010). 
The reduction of ©2 to © 2 ~ in biological systems 
may be catalyzed by a variety of redox enzymes. 
The ► nicotinamide adenine dinucleotide phosphate 
(NADPH)-dependent oxidases located on the cell 
membrane of polymorphonuclear cells is one the 
most relevant physiological sources of © 2 ” and serves 
as a first line of defense against infection (Babior 
1999). In the ► mitochondrial electron transport 
chain, © 2 ~ is produced at several points as a result of 
direct reaction of ► electron transfer cofactors with ©2 
(Turrens 2003). 

Hydrogen peroxide (H 2 © 2 ): The reduction of ©2 
by two electrons (© 2 ^) produces H 2 © 2 , a very stable 
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molecule. Its toxicity is due primarily to the reaction 
mentioned in the previous paragraph in which reduced 
metals split the molecule to form OH-. Various 
antioxidant enzymes (► Catalases and Catalase- 
Peroxidases and ► peroxidases (Chelikani et al. 2004; 
Steinbrenner and Sies 2009; Turrens 2010) have evolved 
to eliminate H 2 O 2 , maintaining a low intracellular steady- 
state concentration of H 2 O 2 . The most common source of 
H 2 O 2 in biological systems results from the dismutation 
of © 2 ” anion (either spontaneous or catalyzed by super¬ 
oxide dismutases). 

Hydroperoxides (ROOH): These organic peroxides 
are chemically related to H 2 O 2 . They are usually 
formed in vivo through chemical chain reactions such 
as the process known as “lipid peroxidation.” The 
chain reaction starts by the abstraction of H atoms 
from organic molecules by an oxidant such as OH-. 
The resulting radicals (R ) react with molecular O 2 to 
produce peroxyl radicals (ROO ) which in turn 
abstract a H atom from another molecule, producing 
a new radical and restarting the cycle. The formation of 
lipid hydroperoxides derived from fatty acids in the 
phospholipid bilayer disrupts the membrane and may 
cause leakiness and cell death. Hydroperoxides are 
metabolized to alcohols by specific ► peroxidases 
(Steinbrenner and Sies 2009). 

Hydroxyl radical (OH ): The reduction of O 2 by 
three electrons causes the bond between both oxygen 
atoms to split, producing one OH- and one hydroxyl 
anion (OH~). This small radical is extremely reactive 
and nonspecific. It may abstract a hydrogen atom from 
any biological molecule (nucleic acids, proteins, 
lipids, etc.). As mentioned above, it may be produced 
through the reaction of reduced transition metals with 
H 2 O 2 or directly through the interaction of ionizing 
radiation and water. Since 02~ is both a reductant of 
transition metals and the precursor of H 2 O 2 , its forma¬ 
tion is indirectly responsible for the formation of OH • 
in biological systems. Given its high reactivity, the 
only antioxidant defense against these powerful radi¬ 
cals is through their reaction with low molecular 
weight antioxidants such as ascorbic acid, cysteine, 

► glutathione, uric acid, and vitamin E. 
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Synonyms 

Ion channels 


Definition 

Reconstitution should be defined as the process of 
reintroduction of a purified membrane protein into 
an artificial phospholipid membrane. However, in 
practice, this term is applied to the transfer of either 
a purified protein or a vesicle of native membrane, 
containing ion channels, into an artificial membrane 
system that permits the characterization of channel 
function. 


Introduction 

Why Reconstitute? 

Over the last 50 years classical macroscopic and 
single-channel electrophysiological approaches have 
facilitated enormous progress in our understanding of 
the identity and function of ion channels (Hille 2001). 
The vast majority of channels studied using these 
approaches are present in the surface membranes 
of cells; however, ion channels are also present 
in a number of intracellular membrane systems 
and organelles (Szewczyk and Just 2010). Some 
of these can be investigated using conventional elec¬ 
trophysiological approaches following the isolation 
and manipulation of nuclei or mitochondria (Szewczyk 


and Just 2010; Foskett et al. 2007). However, several 
important ion channels cannot be studied using con¬ 
ventional approaches. Some are expressed in mem¬ 
brane systems that are not amenable to standard 
electrophysiological techniques, for example, chan¬ 
nels in endoplasmic or sarcoplasmic membranes of 
eukaryotic cells or channels in bacterial cells. Simi¬ 
larly, the function of channels that have been isolated 
from native membranes to enable biochemical or 
structural characterization cannot be readily assessed 
using standard voltage clamp methodologies. In all of 
the above cases, functional characterization of these 
channels requires their incorporation into some form of 
artificial membrane. In addition to expertise in conven¬ 
tional electrophysiology, workers in this field require 
skills in membrane protein and lipid biochemistry. 

The reconstitution of an ion channel into a 
man-made membrane also offers advantages for the 
determination of some of the more abstruse aspects of 
channel function as many of the approaches provide 
the investigator with complete control of both the 
composition of the membrane into which the channels 
are incorporated and the ionic environment at both 
sides of the membrane. As a consequence, the function 
of a channel can be characterized in unphysiological 
ionic environments and the influence of membrane 
lipids, cofactors present in the channel-accessory 
protein complex and membrane surface charge may 
be assessed. 

Without doubt, the single most significant advance 
in our understanding of the mechanisms governing 
the selection and translocation of ions through mem¬ 
brane channels and the structures involved in their 
gating has come with the determination of channel 
structure at atomic resolution. This was made possible 
by the recognition that homologues of eukaryotic ion 
channels are present in bacteria, the purification 
and crystallization of these bacterial ion channels, the 
determination of the functional properties of these 
channels following reconstitution, and the correlation 
of function with protein structure (MacKinnon 2003; 
Sansom et al. 2002). 

Given that reconstitution is the only way in which the 
function of many important ion channels can be assessed, 
it might appear unnecessary to highlight limitations of 
the technique and in tmth these are not really limitations 
but obstacles that must be addressed if meaningful infor¬ 
mation is to be obtained. The major impediment is that in 
all cases the channels are removed to a greater or lesser 
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extent from both the environments, and in some cases the 
membrane, in which they exist in the cell. Isolation or 
purification of a membrane protein requires cell disrup¬ 
tion, separation of the native membrane population 
enriched in the channel in question from other membrane 
systems, and ultimately solubilization of the membrane 
with a detergent followed by a usually complex and 
lengthy separation of the channel from other solubilized 
membrane proteins. Channel protein denaturation is pos¬ 
sible at various stages during this procedure. The choice 
of detergent employed to solubilize the channel is par¬ 
ticularly important and the purification of a novel mem¬ 
brane channel must involve the testing of a range of 
detergents and conditions during solubilization. Wher¬ 
ever possible the integrity of the channel should be 
monitored during purification and the function of the 
purified channel protein fully characterized in the final 
reconstituted system. 

Starting Material 

Channels in Membranes of Eukaryotic Cells 

Ion channel reconstitution has been used extensively 
to monitor the function of ion channels present in 
intracellular membrane networks of native tissues, 
most notably the sarcoplasmic reticulum membrane 
system of striated muscle tissue (Miller 1978; Smith 
et al. 1985) and the endoplasmic reticulum of brain 
(Bezprozvanny 2005). The technique has also been 
used to assess the function of a large number of 
both native and recombinant channels from other 
membrane systems (Favre et al. 1999). 

Characterization of function requires the isolation of 
the membrane system in which the channel of interest is 
expressed. This is achieved using cell fractionation tech¬ 
niques following cell disruption. The standard method is 
to subject disrupted cells to differential and density gra¬ 
dient centrifugation and to identify the membrane vesicle 
population enriched in the channel of interest by ligand 
binding or by monitoring a specific function. Once iden¬ 
tified, membrane vesicles containing the channel can be 
incorporated into an artificial membrane to allow the 
characterization of function (see later). 

Native membrane vesicles will contain, in addition 
to the channel of interest, various other ion channels 
and membrane proteins. The incorporation of more 
than one species of channel into an artificial membrane 
complicates the interpretation of channel activity and 


severely limits the detailed characterization of channel 
function. A specific channel may be “isolated” in 
the artificial membrane by limiting the availability 
of permeant ions (e.g., Smith et al. 1985), alternatively 
the channel under investigation can be separated 
from other membrane components following detergent- 
mediated solubilization. This approach has the added 
advantage that isolation of the channel of interest is likely 
to reveal closely associated accessory proteins that may 
play a role in regulating channel function as part of 
a multimolecular complex. Isolated intracellular mem¬ 
brane channel proteins may be stable in a solubilization 
buffer containing low concentrations of detergent; alter¬ 
natively, they may require reconstitution into unilamellar 
proteoliposomes by detergent removal in the presence of 
excess exogenous lipid. The method used for detergent 
removal will depend upon the properties of the detergent 
used (Seddon et al. 2004). These processes are shown 
schematically in Fig. 1 . 

Bacterial Channels 

Routinely, recombinant, histidine-tagged, channel 
proteins are expressed in bacterial cells of choice. Fol¬ 
lowing expression, the cells are broken apart and mem¬ 
branes solubilized using an appropriate detergent. The 
solubilized channel protein is purified by metal affinity 
chromatography and reconstituted into unilamellar 
proteoliposomes by detergent removal in the presence 
of excess exogenous lipid. These processes are shown 
schematically in Fig. 1. 

Techniques Used for Channel Reconstitution 
and Characterization 

The methods outlined above provide the starting 
material for the characterization of function of purified 
channel proteins or ion channels normally inaccessible 
or not amenable to conventional electrophysiological 
methodologies. The function of these channel proteins 
can then be investigated using one or more of the 
following methods. 

Macroscopic Ionic Flux 

The functional state of populations of isolated channel 
proteins can be assessed following their reconstitution 
into proteoliposomes by monitoring channel-mediated 
radioactive isotope uptake. This technique was initially 
developed to characterize the function of purified 
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Eukayrotic tissue, eukaryotic cells 
in culture or prokaryotic cells 


disrupt cells 




Cell homogenate 


isolate membrane fraction enriched 
in channel of interest 




^Membrane vesicles 


solubilise with appropriate 
detergent 




Soluble membrane proteins 


separate channel of interest from 
other solubilised proteins 


Pure channel protein in 
detergent 


reconstitute channel by removal 
of detergent 




^Reconstituted pure channel 


proteins 




Reconstitution of Ion Channels from Intracellular Mem¬ 
branes and Bacteria Not Amenable to Conventional Elec- 
trophysiological Techniques, Fig. 1 A scheme outlining the 
methods used for the preparation of material for functional 
assays of reconstituted ion channels. Cell disruption - Intact 
tissue homogenized using a blade homogenizer; eukaryotic 
cells in culture subjected to hypo-osmotic shock followed by 
homogenization by repeated passage through a 23 gauge needle; 
prokaryotic cells are broken apart by sonication or french press. 
Membrane isolation - Achieved by differential and/or density 
gradient centrifugation. Solubilization of membrane proteins - 
Membrane vesicles are incubated with a detergent that liberates 
the channel of interest from the membrane. Membrane proteins 
are notoriously difficult to purify in a functional state and it is 
usual to trial a number of detergents and combinations of deter¬ 
gents and lipids when isolating a membrane protein for the first 
time. Channel purification - Once solubilized, the channel of 
interest may be separated from other components of the mem¬ 
brane. Procedures include separation based on size or density; 
for recombinant proteins separation is often achieved by affinity 
chromatography by engineering a polyhistidine tag onto either 
the N- or C-terminus of the protein. Channel reconstitution - 
Purified channel proteins are stabilized by reincorporation into 
a membrane. Membrane proteins insert into small unilamellar 


eukaryotic ion channels and has been more recently 
adopted to investigate the function of populations of 
purihed prokaryotic channels (Nimigean 2006). 

Electrophysiological Characterization 

The function of purified channels or channels in native 
membrane vesicles can be studied under voltage clamp 
conditions following their incorporation into an artifi¬ 
cial membrane. For simplicity, these approaches can be 
divided into two themes: planar phospholipid bilayers 
and patch-clamped liposomes. More detailed accounts 
of the techniques described here can be found in (Wil¬ 
liams 1994,1995; Favre et al. 1999; Kapoor et al. 2008). 


Planar Phospholipid Bilayers 

In this technique, ion charmels are incorporated into a 
membrane formed on a hole in a partition that separates 
two fluid hlled chambers (Fig. 2). The material from 
which the partition is made, its orientation (vertical or 
horizontal), the size of the hole, and the method of its 
manufacture can vary as can the method used to form the 
phospholipid bilayer and the composition of the bilayer. 
Planar phospholipid bilayers can be formed using two 
approaches. 

(a) Painted hilayers: Individual or mixtures of pure 
phospholipids are dispersed in the nonpolar sol¬ 
vent, «-decane (phospholipids are mixed in chlo¬ 
roform, dried with a stream of N 2 , and resuspended 
in «-decane at 20-50 mM). For a stable bilayer to 
form, the area surrounding the hole in the partition 
must be primed with a small volume of the 
«-decane phospholipid dispersion. Once the prim¬ 
ing dispersion has dried the chambers on either 
side of the partition can be filled with an appropri¬ 
ate electrolyte solution and the bilayer can be 
formed. This is done by drawing a small quantity 
of phospholipid dispersion across the hole using 
some sort of implement (a small plastic or glass 
rod, a pipette tip, or a small paintbrush with most 
bristles removed). Using this method, the hole 
joining the two chambers is occluded with a thick 
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liposomes as detergent is removed in the presence of added 
phospholipids. The method used for detergent removal will 
depend on the properties of the detergent used to solubilize the 
membrane. Assays of function - Both membrane vesicles and 
reconstituted pure channel proteins (#) can be used to monitor 
channel function using ionic flux or electrophysiological tech¬ 
niques (see text for details) 
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Reconstitution of Ion Channels from Intracellular Mem¬ 
branes and Bacteria Not Amenable to Conventional Elec- 
trophysiological Techniques, Fig. 2 A cartoon depicting the 
method used for the incorporation of channel proteins into 
planar phospholipid bilayers (not to scale), (a) Formation of 
the bilayer - A bilayer is formed on a hole in a partition 
separating two fluid filled chambers. As explained in the text, 
bilayers can be formed either from a dispersion of pure phos¬ 
pholipids in an organic solvent or by the apposition of two 
preformed monolayers, (b) Addition of channel proteins - 
Following bilayer formation native membrane vesicles or 
reconstituted pure channel proteins are added to the solution 
at one side {cis) of the bilayer. A small proportion of the 
added vesicles adhere to the bilayer (the pre-fusion state) and 
vesicles can be encouraged to coalesce with the bilayer by the 
establishment of an osmotic gradient across the bilayer, 
(c) Incorporation of channels into the hilayer and monitoring 
function - The fusion of a vesicle with the bilayer transfers 
component proteins to the bilayer; the orientation of the 
channel in the bilayer is determined by its orientation in 
the native membrane vesicle or proteoliposome. Once present 
in the bilayer, ion translocation through the channel is moni¬ 
tored under voltage clamp conditions (for more details see 
Williams 1995). 


film of phospholipid dispersion that is in equilib¬ 
rium with a much larger annulus of dispersion 
around the hole. The film covering the hole thins 
spontaneously as the bulk of the lipid and «-decane 
drains away into the annulus driven by Plateau- 
Gibbs border suction and London-van der Waals 
attraction. Eventually, the majority of the hole is 
covered by a phospholipid bilayer into which func¬ 
tional ion channels can be incorporated. Clearly, the 
thinning process is fundamental to the success of 
this approach; a membrane-spanning ion channel 
will not function in a phospholipid dispersion that 
is thicker than a bilayer. Therefore, the thinning 
process must be monitored closely and this can be 
done in two ways. The formation of the bilayer can 
be followed by observing the light reflected from the 
film wifh a suifably placed lens and lighf source. 
Initially, the thick film will reflect a considerable 
amount of light and will appear multicolored (equiv¬ 
alent to the multicolored film seen from light 
reflected from an oil-covered puddle). As the film 
thins it reflects less and less light until it becomes 
just two phospholipid molecules thick (>5 nm) and 
reflects no light at all, at which point it appears black 
and is termed a black lipid membrane (BLM). Alter¬ 
natively, the thinning process can be followed by 
monitoring capacitance. The increase in area of 
a region two phospholipid molecules thick will pro¬ 
duce a rise in the capacitance of the system that can 
be estimated by applying a low-frequency triangular 
voltage wave across the film. The capacitance of the 
final bilayer should be 0.4 pF/cm^ and therefore it is 
necessary to know the surface area of the hole on 
which the bilayer is formed. 

(b) Folded hilayers: Painted bilayers will inevitably 
contain trace amounts of nonpolar solvent; this con¬ 
tamination can be reduced by forming the bilayer by 
the apposition of two pure phospholipid monolayers. 
In this technique, phospholipid monolayers are 
formed at the surfaces of electrolyte solutions in 
the chambers on either side of the partition. The 
surface of the solutions is below the hole in the 
partition and the bilayer is formed on the hole by 
increasing the volumes of the solutions in both 
chambers, so raising the monolayers to cover the 
hole. Phospholipid monolayers are routinely formed 
by applying pure phospholipids in a volatile solvent 
such as n-pentane to the surface of the electrolyte 
solution; as the solvent evaporates the monolayer 





























Reconstitution of ion Channels 


2205 


R 


forms at the air-water interface. Before bilayer for¬ 
mation, the area surrounding the hole should be 
primed with a small volume of 0.5% v/v hexadecane 
in hexane. When the priming solution has dried, the 
solution in the first one and then the other chamber is 
raised, lifting the monolayers to cover the hole so 
forming the bilayer. 

Incorporation of Channels into Planar Phospholipid 
Bilayers 

Once a stable bilayer has been constructed using either 
the painting or folding technique, endogenous eukaryotic 
or prokaryotic channels or recombinant channels can 
be incorporated. For intact intracellular membrane 
vesicles or purified channel proteins reconstituted into 
proteoliposomes, the transfer of channels to the phos¬ 
pholipid bilayer is achieved by fusion. Channel- 
containing vesicles are added to the solution at one side 
of the bilayer (the side of the bilayer to which vesicles are 
added is conventionally referred to as cis, whilst the other 
is termed trans, see Fig. 2) and the solution stirred vig¬ 
orously. Vesicles adhere to the bilayer forming a pre¬ 
fusion state and the probability of occurrence of this 
state can be increased, if the vesicles or bilayer contain 
acidic phospholipids, by the presence of millimolar con¬ 
centrations of divalent cations. Adhering vesicles are 
induced to fuse with the bilayer by the creation of an 
osmotic gradient across the bilayer and between the solu¬ 
tion and the membrane vesicle. Addition of an osmoticant 
to the cis solution favors the flow of water from the trans 
chamber. Water entering vesicles in the pre-fusion state 
on the bilayer will induce them to swell until they rupture 
and coalesce with the bilayer. On fusion, channels present 
in the membrane vesicle are transferred to the bilayer and 
the orientation of the channels in the bilayer will be 
determined by their orientation in the native vesicle or 
proteoliposome. As the fusion process is driven by 
the transmembrane osmotic gradient, it can be stopped 
by dissipating the gradient. This can be achieved either by 
raising the osmotic strength of the trans chamber or, 
preferably, by lowering the osmotic strength in the cis 
chamber by perfusion. This has the added advantage of 
removing un-incorporated channels. 

Purified channel proteins in isolation buffer contain¬ 
ing detergent can incorporate directly into a bilayer if 
added to the cis solution and stirred. It is advisable to 
concentrate the channel protein so that very small vol¬ 
umes of sample can be added to minimize the concen¬ 
tration of detergent to which the bilayer is exposed. 


Patch-Clamped Liposomes 

The function of eukaryotic intracellular membrane chan¬ 
nels and bacterial channels can also be monitored follow¬ 
ing their incorporation into large unilamellar liposomes 
suitable for patch-clamp recording. This technique offers 
potential advantages over reconstitution into planar phos¬ 
pholipid bilayers. Most significantly, patch-clamping 
provides considerably enhanced resolution. Planar bila¬ 
yers are large (rarely less than 100 pm in diameter) and 
are therefore subject to mechanical and electrical noise, 
factors which are minimized when charmels are in 
a bilayer at the tip of a conventional patch electrode. 
Secondly, planar bilayers will contain very small quanti¬ 
ties of organic solvent and these are not present if this 
alternative method is used. The key to this approach is the 
incorporation of isolated membrane vesicles or purified 
channel proteins into liposomes that are large enough 
to patch-clamp. Two methods have been developed to 
permit this (summarized in Fig. 3). 

(a) Freeze and thaw: In this method, detergent- 
solubilized membrane proteins are incorporated 
into small unilamellar liposomes by detergent 
removal in the presence of excess lipids. The 
resulting proteoliposomes are transformed into 
large multilamellar liposomes by cycles of freezing 
in liquid N 2 and thawing at room temperature. The 
formation of structures suitable for patch-clamping 
is signaled when the initially clear suspension 
becomes turbid (Tank et al. 1982). Freezing results 
in the formation of ice crystals that disrupt the lipo¬ 
somes and during the thawing process liposome 
fragments combine to produce large multilamellar 
structures. The inclusion of a cryoprotectant such as 
sucrose in the solution in which the liposomes are 
suspended should be avoided as this will prevent the 
formation of large ice crystals and hence the forma¬ 
tion of patchable proteoliposomes. 

(b) Dehydration-rehydration: This procedure was 
developed initially to permit the study of intracellular 
eukaryotic membrane channels. As in the freeze- 
thaw method, detergent-solubilized membrane pro¬ 
teins are incorporated into small unilamellar 
liposomes by detergent removal. The transformation 
of these to something that could be patch-clamped 
is achieved by dehydration of the sample in a buffer 
containing 5% (w/v) ethylene glycol, which prevents 
complete drying of the sample. When most of the 
water in the sample has been removed the remaining 
film is covered with a small volume of the desired 
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Reconstitution of Ion Channels from Intracellular Mem¬ 
branes and Bacteria Not Amenable to Conventional Elec- 
trophysiological Techniques, Fig. 3 A scheme outlining the 
methods used to incoiporate channel proteins into liposomes 
suitable for patch-clamp recording, (a) Freeze-thaw - Following 
solubilization, membrane proteins are incorporated into small 
unilamellar proteoliposomes by detergent removal and these are 
transformed into large structures that can be patch-clamped by 
cycles of freezing in liquid N 2 and thawing slowly at room 


temperature, (b) Dehydration-rehydration - Channels are incor¬ 
porated into small unilamellar liposomes either by fusion of 
native membrane vesicles with liposomes or by reconstitution 
of solubilized channels as the result of detergent removal. Large 
structures that can be patch-clamped are produced by partial 
dehydration of the sample followed by slow rehydration. 
Proteoliposomes suitable for patch-clamp recording are formed 
toward the edge of the rehydrated sample. 


recording solution and the sample left at 4°C 
overnight to rehydrate. Following rehydration, large 
proteoliposomes suitable for patch-clamping are seen 
at the edges of the sample (Criado and Keller 1987). 

A variation on this procedure has been developed to 
permit the characterization of bacterial ion channels 
without exposure to detergents. In this approach, iso¬ 
lated membrane vesicles are fused with liposomes 
produced by cycles of freezing and thawing of small 
unilamellar liposomes initially formed by the sonica- 
tion of lipid mixtures. Dehydration and rehydration 
of the bacterial membrane vesicles and liposomes 
produces large proteoliposomes amenable to patch- 
clamp investigations (Delcour et al. 1989). The most 
recent development of this technique is the incorpora¬ 
tion of purified recombinant wild-type and mutant 
bacterial channels into small unilamellar liposomes 
by detergent removal and the production of large, 


patch-clampable, proteoliposomes by dehydration 
and rehydration (Cortes et al. 2001). 

Irrespective of the method used to incorporate chan¬ 
nel proteins into large proteoliposomes, the properties 
of the channels of interest are monitored using conven¬ 
tional patch-clamp techniques (Gibb 1995). 

Summary 

While the functional properties of the majority of ion 
channels can be investigated using conventional elec- 
trophysiological techniques, many important channels 
are expressed in membranes systems that are inacces¬ 
sible to these approaches. Alternative strategies must 
be employed for the investigation of the function of 
channels of intracellular membranes and channels 
of prokaryotic cells. In addition, our understanding of 
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the mechanisms governing channel function has been 
revolutionized in recent years by the availability of 
structural detail at atomic resolution. Crystallization 
of channel proteins involves isolation of the proteins 
from the native membrane, and for the meaningful 
interpretation of structural information it is essential 
that the functional integrity of the purified channel is 
characterized. In all the above cases, channel function 
can be investigated following reconstitution into 
a man-made membrane. This entry provides an intro¬ 
duction to the procedures currently in use for the 
reconstitution and characterization of both eukaryotic 
and prokaryotic ion channels inaccessible to conven¬ 
tional electrophysiology. 

Cross-References 

► Analysis of Macroscopic Currents 

► Fluorescence Techniques for Studying Ion Channel 
Gating; VCF, FRET, and LRET 

► Patch-Clamp Recording of Single Channel Activity; 
Acquisition and Analysis 

► Potassium Channel Selectivity and Gating at the 
Selectivity Eilter; Structural Basis 
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Redox Potential 


Definition 

Redox potential is a measure of the propensity of 
a chemical or biological species to either acquire or 
lose electrons through ionization. 


Basic Characteristics 

A species with a higher reduction potential possesses 
a higher tendency to acquire electrons and be reduced. 
Conversely, a species with a higher oxidation potential 
possesses a higher tendency to lose electrons and be 
oxidized (Zanello 2003). The reduction and oxidation 
potentials for a given species are identical in value but 
opposite in sign. In this chapter, only reduction poten¬ 
tials will be used to define redox properties, i.e., 
a higher reduction potential has a more positive value 
(see Fig. 1). Factors such as ionic strength, pH, tem¬ 
perature, or partial pressure of gasses can affect reduc¬ 
tion potentials. As such, for a fair comparison 
reduction potentials are typically measured under the 
standard condition of 25°C, 1 M concentration for 
reduced/oxidized species, or a partial pressure of 
1 atm for each gas, unless otherwise noted. These 
measured potentials are then compared relative to a 
standard hydrogen electrode (SHE) or normal hydro¬ 
gen electrode (NHE), which is set as 0 V. The differ¬ 
ence between the reduction potential of the compound 
of interest and the SHE is defined as the standard 
reduction potential. The difference in the reduction 
potential between an electron donor and acceptor will 
determine the driving force, or free energy, for electron 
transfer {ET) with electrons typically flowing sponta¬ 
neously from a donor with more negative reduction 
potential to an acceptor with more positive reduction 
potential. As all redox reactions are equilibrium pro¬ 
cesses, it is important to note that electrons can flow in 
reverse, i.e., from more positive potential to more 
negative, depending on the value of the equilibrium 
constant for the redox reaction and the relative con¬ 
centration of the reactants. 

Relationship Between Redox Potentials and 
Electron Transfer Reactions 

Redox potentials play a major role in determining the 
spontaneity of ET reactions. The difference in reduc¬ 
tion potentials between the donor and acceptor (AE) 
define the driving force (AG) for ET according to (1), 
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Redox Potential, Fig. 1 Ranges of biologically relevant redox 
potentials and the protein types that cover a given potential 
range. The position of common references used in electrochem¬ 
istry, ferrocene (Fc), saturated calomel electrode (SCE), Ag/ 
AgCl, and the standard hydrogen electrode (SHE) are also 
indicated 


where n is the number of electrons transferred and F is 
the Faraday constant. 

AG = -n ■ F ■ AE (1) 

According to ► electron transfer theory, the AG of 
the redox reaction influences the ET rate (kEr)- In (2), 
a higher drive force promotes faster ET, reaching 
a maximum k^r when AG is equal to the reorganization 
energy (T) for the reaction. After this point, k^r actu¬ 
ally decreases with increasing AG. This region is 
known as the Marcus inverted region (see entry on 
► Electron Transfer Theory). Therefore, proper mea¬ 
surement of redox potentials and an understanding of 
the factors that influencing the redox potentials is crit¬ 
ical in fine-tuning ET efficiency. 
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Techniques Used to Measure Redox Potentials 

Several techniques are available for measuring reduc¬ 
tion potentials, including cyclic voltammetry (CV), 
spectrochemical titration, and spectroelectrochemistry 
(Zanello 2003; Kaim and Klein 2008). In CV, a work¬ 
ing electrode, an auxiliary electrode, and a reference 
electrode are immersed in an electrolyte solution 
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Redox Potential, 

Fig. 2 Example cyclic 
voltammogram (CV) of 
a variant of Pseudomonas 
aeruginosa azurin collected by 
protein film voltammetry 
(Marshall et al. 2009). This 
data is plotted by the 
American convention with 
negative potential on the right. 
Redox events are identified by 
a change in current at a given 
potential and the center of that 
peak is determined as shown 
in red. The redox potential is 
typically reported as an E„„ 
defined as the average between 
the potential of the oxidative 
and reductive signal 


Oxidizing 
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containing the redox active species of interest. A given 
potential range is then scanned, and the amount of 
current measured at the working electrode is plotted 
versus potential. The redox potential is then calculated 
based on the positions of any oxidation or reduction 
peaks, identified by an increase in the observed current 
at a given potential (see Fig. 2). While CV typically 
displays strong signals for most small molecules and 
some biomolecules that can communicate with the 
electrode efficiently, CV does not work well for most 
biomolecules. To overcome this limitation, protein 
film voltammetry (PFV) has been invented in order to 
increase CV signal and to lessen the effect of diffusion 
on the kinetics of ET (Butt and Armstrong 2008). In 
PFV, a protein can be directly adsorbed onto the work¬ 
ing electrode, suspended in a conductive polymer 
matrix and allowed to dry on an electrode, or tethered 
to an electrode by a small molecule linker. Certain 
proteins will bind to electrode surfaces simply through 
natural affinities, such as the binding of a hydrophobic 
portion of a protein to a graphite electrode, or electro¬ 
static interactions between the protein surface and the 
electrode. 

Spectrochemical titration and spectroelectro- 
chemistry are similar to one another in that spectral 
changes in the species of interest upon oxidation or 
reduction can be used to identify the redox event (see 
Fig. 3). Any spectral feature that changes from the 
oxidized to reduced states, such as the visible spec¬ 
trum, may be used to calculate a redox potential by 
either technique (Kaim and Klein 2008). 
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Redox Potential, Fig. 3 Spectroelectrochemistry experiment 
using a variant of the heme protein myoglobin (Yeung et al. 
2009). The visible spectra of the reduced and oxidized states of 
the heme are distinct from one another, and as the potential of 
a working electrode is varied, the spectrum changes. By deter¬ 
mining the relative ratio of reduced to oxidized species at each 
potential, the reduction potential of the protein may be 
determined 


R 


The techniques differ in that spectrochemical titra¬ 
tion involves the controlled addition of another redox 
active compound with known potential to either 
oxidize or reduce the species of interest, whereas 
spectroelectrochemistry typically utilizes a working 
electrode, like CV, and spectral changes are observed 
as a function of the applied potential to the working 
electrode. Spectroelectrochemistry has the added 
advantage in that the current at each potential can 
also be monitored, allowing for the calculation of 
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other parameters, such as the number of electrons 
transferred. For both techniques, the redox potential 
of the molecule of interest can then be calculated by 
applications of the Nernst equation (3), where z is 
moles of electrons transferred, R is the gas constant, 
and F is the Faraday constant. 


E = E° 


R ■ T 


• In 


[red] 

[ox] 


(3) 


Factors That Influence the Redox Potentials 

Every redox active species has an intrinsic redox 
potential which is determined by the relative energy 
of the oxidized and reduced states. In redox active 
metal complexes, the magnitude and direction of the 
reduction potential is determined by factors such as the 
identity of the metal ions, the differential binding of 
ligands to the oxidized and reduced metal ions, the 
geometry of metal complex, and interactions with sol¬ 
vent. These considerations also apply to redox active 
metals in ► electron transfer proteins. For example, 
► iron-sulfur clusters generally possess lower reduc¬ 
tion potentials than ► cupredoxins (see Fig. 1) and 
replacing the manganese ion in manganese ► superox¬ 
ide dismutase with iron resulted in a decrease of reduc¬ 
tion potential by >500 mV (Miller 2008). Noncovalent 
interactions around the redox active species in proteins 
may also significantly influence redox potential (Mauk 
and Moore 1997) allowing proteins to use relatively 
few redox active ► electron transfer cofactors, yet fine- 
tune the redox potentials of those cofactors across 
a wide range of biologically relevant potentials. 

Electrostatic interactions in proteins that influence 
redox potentials include nonspecific screening effects 
by ionic species in solution, dipoles within both the 
protein and solvent, internal charges that are not 
exposed to solvent, and charges on the surface exposed 
to solvent (Mauk and Moore 1997; Warshel et al. 
1997). The importance of net surface charges depends 
upon the type of charged group and the size of the 
protein, with more dramatic effects observed for redox 
species buried less deep within a protein or in smaller 
proteins. For example, replacement of a neutral valine 
within van der Waals contact distance of the heme in 
the protein myoglobin with a negatively charged aspar¬ 
tate reduces the heme redox potential by 200 mV 
(Varadarajan et al. 1989). Furthermore, the interactions 
between solvent or protein dipoles have been shown to 
cause dramatic changes in the redox potential of iron in 


superoxide dismutase (Miller 2008) and high-potential 
iron-sulfur protein (Dey et al. 2007). 

Increasing Hydrophohicity can selectively destabi¬ 
lize the redox state with higher charge and thus raise 
the reduction potential. The reduction potential of the 
cupredoxin azurin was fine-tuned in this way by sys¬ 
tematically changing the axial methionine ligand of 
copper to progressively more and less hydrophobic 
residues. A linear correlation between the reduction 
potential of the protein and the hydrophohicity of the 
axial ligand was observed (Garner et al. 2006). 

Hydrogen bonding to and around the metal¬ 
coordinating amino acids can also fine-tune the electron 
density on the ligands or the geometry of the ligands, 
making it possible to either increase or decrease reduc¬ 
tion potentials predictably, as shown in cupredoxins 
(Machczynski et al. 2002; Marshall et al. 2009) and 
iron-sulfur proteins (Zuris et al. 2010). 

The above interactions have also been shown to 
be additive or cooperative, allowing for predictable 
tuning of the reduction potential of proteins across 
a broad range of biologically relevant reduction poten¬ 
tials (Marshall et al. 2009; Zuris et al. 2010). 


Cross-References 

► Cupredoxins 

► Electron Transfer Cofactors 

► Electron Transfer Proteins; Overview 

► Electron Transfer Theory 

► Hemes 

► Iron-Sulfur Clusters 
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Regulation of Protein Surfaces Through 
Posttranslational Modification 
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Synonyms 

Chemical modihcation of proteins 


Definition 


Posttranslational Modifications 

Posttranslational modification (PTM) refers to the 
chemical modihcation of proteins following transla¬ 
tion. Many types of modihcations are covered by this 
description including: the addition of simple chemical 
groups (such as phosphate, acetyl, methyl, and sulfate 
groups), or more complex molecules, such as AMP, 
ADP-ribose, sugars, lipids, and small proteins (such as 
ubiquitin and ubiquitin-like proteins). PTMs also 
encompass the modihcation of specihc amino acids 
(e.g., deimination of arginine residues to form citrul- 
line, or proline isomerization) and the cleavage of 
a peptide bond (i.e., proteolysis). 

PTMs modify activities, half-lives, the intracellular 
localization of the target proteins, and can have 
a profound effect on the ability of proteins to make 
interactions with proteins or other biomolecular targets. 
For certain interactions one or more specihc PTMs must 
be in place to effect binding, whereas other specihc 
PTMs can block binding. Although some PTMs are 
effectively permanent (such as proteolysis), many others 
are reversible. Most PTMs are brought about by the 
action of enzymes, and additional enzymes demodify 
the target. For example, kinases add and phosphatases 
remove phosphate groups, and acetyltransferases add 
whereas deacetylases remove acetyl groups. These and 
numerous other sets of modifying/demodifying enzymes 
are normally carefully balanced in any cell type to 
regulate the required network of interactions. 

Dysregulation of PTMs are often frequent contrib¬ 
utors to disease, including numerous types of cancer 
(e.g., Berger et al. 2006). Thus, inhibitors of enzymes 
involved in PTM are important dmg targets. For example. 
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many inhibitors of histone deacetylases (HDACs) are 
currently undergoing clinical trial to treat cancers and 
neurodegenerative diseases such as Parkinson’s and 
Alzheimer’s diseases (e.g., Kazantsev and Thompson 

2008) . PTMs are also effective strategies by which 
some pathogens act on host organisms (e.g., see reviews 
Randow and Lehner 2009; Ribet and Cossart 2010). 
A classic example of this is the ADP-ribosylation of 
elongation factor-2 by diphtheria toxin (Collier and 
Cole 1969). The effect of this PTM is to inactivate protein 
translation which results in cell death. 

Although the full extent of PTM is unknown, it is 
generally assumed that most, if not all proteins, 
undergo some type of modification following transla¬ 
tion. Advances in quantitative mass spectrometry (MS) 
have allowed PTMs to be studied at a proteome scale, 
revealing widespread modification that changes in 
response to various stimuli. For example, 3,600 lysine 
acetylation sites were found on 1,750 proteins 
(Choudhary et al. 2009), and 6,600 phosphorylation 
sites on 2,244 proteins (Olsen et al. 2006). At present, 
the proteome-wide studies tend to be restricted to 
one or two types of PTM, but the case studies of 
a few well-studied proteins show that a single protein 
can be affected by numerous different PTMs. Histone 
protein tails can be modified by acetylation, methyla- 
tion, phosphorylation, ADP-ribosylation, deimination, 
ubiquitination, sumoylation, and proline isomerization 
(reviewed in Kouzarides 2007). Combinations of 
PTMs on histone tails are thought to form a “histone- 
code” that regulate interactions with DNA and nuclear 
proteins to control chromatin states which play roles in 
gene expression, DNA repair, and cell division (e.g., 
Hoffman and Vu 2005). p53 has been identified as 
being modified by at least 50 individual PTMs, often 
in response to stresses (e.g., Anderson and Appella 

2009) . Various combinations of these PTMs accumu¬ 
late in response to others, and the different modifica¬ 
tions can cooperate to modulate the function of p53 
(Meek and Anderson 2009). 

Information on types and masses of many PTMs 
can be found on http://au.expasy.org/tools/findmod/ 
findmod_masses.html, and at PTM sites identified 
from high-resolution mass spectrometric data at least 
nine different species can be found on the PHOSIDA 
database (http://www.phosida.com) (Gnad et al. 2011). 

More detailed entries can be found covering two 
very common modifications that affect protein-protein 
interactions, phosphorylation, and ubiquitinylation. 


Cross-References 

► Mass Spectrometry in Proteomics 
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► Protein Lysine Methyltransferase - Computational 
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Relaxation Dispersion 

Rieko Ishima 

Department of Structural Biology, University of 
Pittsburgh School of Medicine, Pittsburgh, PA, USA 

Synonyms 

Carr-Purcell-Meiboom-Gill spin echo; Rotating Frame 
(RiP) Relaxation Dispersion 

Definition 

In the NMR field, the term “relaxation dispersion” 
refers to the dependence of the spin relaxation rate on 
the magnetic field strength. Typically either the static 
field, Bq, provided by the spectrometer magnetic or the 
radio-frequency (RF) field, Bj, generated by the probe 
transmitter coil is varied over a wide range. The depen¬ 
dence of spin relaxation on Bq has been used to char¬ 
acterize relaxation mechanisms and spectral density 
functions, while the dependence on Bi has been used 
to study chemical exchange. 

Recent high-resolution biomolecular NMR applica¬ 
tions of “relaxation dispersion” have focused upon stud¬ 
ies of chemical exchange to characterize slow motions 
(on the milli-microsecond timescale) in biomolecules. 
These measurements are made either by recording trans¬ 
verse relaxation rates (R 2 ) as a function of the effective 
field strengths defined by the spacing of CPMG pulses, 
or by recording Rip values as a function of the (on- or 
off-resonance) B 1 field strength. These recent dispersion 
experiments have revealed populations, stractures, and 
exchange rates of protein minor conformers, whose 
NMR signals are too weak to observe directly, and 
have thereby uncovered slow conformational dynamics 
involved in enzymatic reactions and protein folding 
pathways. 

Relaxation dispersion data for studies of chemical 
exchange is most often analyzed using the two-site 
exchange model, which yields the forward and reverse 
exchange rates (from which values of site populations 
and the exchange correlation time are obtained), the 
difference in chemical shifts of the two sites (5co/27t), 
and their intrinsic relaxation rate (R 2 °). Accurate 
optimization of these parameters normally requires 


recording relaxation dispersion profiles for multiple 
nuclei in a protein or at two (or more) magnetic field 
strengths. Otherwise, the dispersion data may be ana¬ 
lyzed by combining some of the parameters as one 
term. Parameter optimization is carried out solving 
the Bloch-McConnell equation or using its analytical 
solutions. When the exchange model involves three or 
more sites, these equations are solved numerically. 

Cross-References 

► CPMG 

► Kinetics: Relaxation Methods 

► NMR Studies of Macromolecular Interactions - 
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Synonyms 

Alignment in liquid crystal; Anisotropic magnetic 
alignment; Molecular alignment in a magnetic field 

Definition 

Small residual magnetic nuclear dipolar interactions 
can be observed in solution by Nuclear Magnetic Res¬ 
onance Spectroscopy when the molecule, containing 
the interacting nuclei, experiences a small net of ori¬ 
entation with respect to a magnetic field. This dipolar 
interaction contains useful structural information that 
is long range in nature. 

Introduction 

Nuclear magnetic resonance (NMR) spectroscopy is 
a very sensitive tool to probe interactions between 
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magnetic nuclei. These interactions, such as dipolar, 
quadrupolar, and chemical shift anisotropy can be 
quite large, in the order of 10-10^ Hz. In solid state 
NMR, where the molecules are not under constant 
motion, these interactions can be measured directly 
and they provide information on local molecular 
geometry with respect to the external magnetic field. 
In solution NMR, however, fast Brownian rotational 
diffusion, which is desirable to create sharp resonance 
lines, also causes these interactions to average to zero. 
Therefore, they cannot be measured directly. Instead 
the primary sources of structural information in solu¬ 
tion NMR are the Nuclear Overhauser Effect (NOE), 
isotropic chemical shifts, and scalar coupling. These 
provide enough information to solve structures in solu¬ 
tion. They, however, are only sensitive to local geom¬ 
etry. The NOE is restricted to probing distances under 
5 A, while scalar coupling is limited to a few chemical 
bonds. Thus, in the case where the molecule gets large 
(greater than 25 kD), or in molecular complexes where 
their interaction interfaces might be rather small, or in 
RNA and DNA where the number of NMR structural 
data is low, it becomes more difficult to obtain high- 
resolution structure relying solely on these types of 
information. Therefore, it would be beneficial if 
nuclear interactions, such as dipolar coupling, could 
be reintroduced in solution and provide additional 
structural information. 

With the availability of a high field magnet, 
Bothnerby and colleagues were able to show that small 
molecules in solution weakly aligned in the magnetic 
field due to their susceptibility anisotropy (Bothnerby 
et al. 1981). This resulted in small but measureable 
dipolar couplings of a few Hertz. Subsequently, small 
dipolar couplings were also observed for larger mole¬ 
cules, such as a heme-containing protein myoglobin 
(Tolman et al. 1995), a DNA duplex (Kung et al. 1995), 
and a protein Ubiquitin (Tjandra et al. 1996). These 
proved that it was possible to reintroduce useful nuclear 
interactions in solution by a slight alignment of the mol¬ 
ecule in the magnetic field. The challenge was to make it 
generally applicable and to increase the degree of align¬ 
ment to permit the observation of larger, and therefore 
easier to measure, dipolar couplings. This required a fine 
balance, since too large of an alignment would result in 
substantially strong couplings that would lead to broad 
resonance lines and complex spectra. One solution was 
to add a liquid crystalline phase to the protein solution. 
The liquid crystalline molecule possesses significant 


susceptibility anisotropy and would align cooperatively 
in the field. The protein, through its interaction with the 
liquid crystalline phase, will also experience a degree of 
alignment (Tjandra and Bax 1997) (Fig. 1). 

A dipolar interaction Hj between two nuclei I and 
S depends on the distance and direction of the 
intemuclear vector r and is given by: 


Hd 


47ir^ 



( 1 ) 


In the case where I and S are different types of nuclei, 
the above equation simplifies (through secular approxi¬ 
mation) such that only the z-components remain: 

Mo _ y^y/^Vo 0^1 

Anr^ 

= - f ° 5,4(3cos^g - 1) (2) 

where 9 is the angle between intemuclear vector r and 
the static magnetic field, r is the distance between the 
two nuclei, /tg is magnetic permeability, Ti is the 
Planck’s constant, and ji and are the gyromagnetic 
ratios for the 7 and S nuclei, respectively. The orienta¬ 
tion of the intemuclear vector changes in solution due 
to fast rotational diffusion. Therefore, the effective 
dipolar interaction is an average of both time and all 
possible states of the ensemble (Fig. 2). 

If the intemuclear distance is constant, 


= - W'^y» S,;,(p,(cos8)) (3) 

Am 

where P 2 {x) is the Legendre polynomial. In isotropic 
solution the (7’2(cos 9)) averages to zero. Under aniso¬ 
tropic condition (7*2 (cos 6)) will not average to zero 
and will lead to observable dipolar coupling which 
provides information on the average orientation of 
the intemuclear vector. Defining the Residual Dipolar 
Coupling (RDC) constant as D, 




(3cos^0 — l) -|--/?sin^0cos2<l) 


(4) 


where is the magnitude and R is the rhombicity of 
the alignment tensor, while 0 and O are polar 
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Residual Dipolar Coupling, 
Fig. 1 An illustration of 
alignment induced by liquid 
crystalline phase. The disk¬ 
shaped bicelles align with their 
normal perpendicular to the 
magnetic field (Bq) and in turn 
can induce alignment in the 
protein through steric 
interaction 


Bo 
i \ 



coordinates defining the orientation of the dipole in the 
alignment coordinate system (Principal Axes System, 
PAS). In the PAS, the alignment tensor is diagonal and 
its elements are defined as 

A^ = -Da{\-\5R) (5) 

=-D„(l + 1.5/?) 

Azi = 2 / 5 ^ 

where jA^-l > |A 3 ,_v| > \A^x\- 

Alignment Methods 
Intrinsic Alignment 

Each molecule contains a certain degree of magnetic 
susceptibility anisotropy, thus a tendency to align in 
a high magnetic field. Paramagnetic molecules have 
strong anisotropic susceptibility. Tolman et al. showed 
that myoglobin, a protein with a paramagnetic iron- 
containing heme, aligned in a magnetic field (Tolman 
et al. 1995). After measuring the Jhn at two different 


magnetic fields they were able to extract Dnh RDCs on 
the order of a few Hz. Meanwhile Kung et al. demon¬ 
strated that double-stranded DNA also aligned in 
a magnetic field, which led to measurable RDCs 
(Kung et al. 1995). The susceptibility anisotropy in 
a diamagnetic molecule comes from groups such as 
aromatic rings and peptide backbone. Their effects can 
be additive if they are arranged in a regular stacked 
manner, such as the bases of a DNA duplex. Proteins in 
general contain irregular geometry of their peptide 
backbone and do not align in the magnetic field. 
A protein in complex with a DNA duplex, however, 
will exhibit magnetic alignment and result in measur¬ 
able RDCs for the protein. Another approach involves 
metal-binding proteins, where the metal ligands can be 
replaced with paramagnetic lanthanides (Bertini et al. 
2002). Alternatively, the protein can be modified 
to include special tags to artificially incorporate 
lanthanides. 
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Indirect Alignment 

In general protein’s intrinsic alignment is very weak 
and does not allow for practical measurements of 
RDCs. An alternative approach is to create an 
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Residual Dipolar Coupling, 
Fig. 2 An illustration to show 
orientation averaging in 
dipolar interaction. The 
dipolar interaction is the 
average of all the possible 
orientations of the 7 S nuclear 
interaction vector, defined 
within the xyz molecular 
frame, with respect to the 
magnetic field Bq 



anisotropic medium by adding molecules with 
a substantial susceptibility anisotropy that will align 
in the magnetic field. The interaction between the 
protein and these solvent molecules will induce 
a net alignment. This was first illustrated by adding 
bicelles into the solution (Tjandra and Bax 1997). 
Bicelles are formed by a mixture of lipids (DMPC 
and DHPC) and at higher than room temperature 
form disk-like particles. These particles align in 
a high magnetic field and can induce alignment of the 
protein. Since bicelles cooperatively align and form 
liquid crystalline phase at low concentration (a mini¬ 
mum of 3%), their addition to the mixture does not 
affect the quality of the protein signal. In order to 
measure RDC, a reference measurement in isotropic 
conditions is required. In the case of bicelles, the same 
sample can be used as a reference by lowering the 
temperature, where the lipid mixture behaves as an 
isotropic solution. Since then a number of alternate 


anisotropic media, that use different mechanisms to 
align molecules, have been developed to suit various 
applications. An excellent listing of them, their char¬ 
acteristics, and applicability can be found in the review 
by Prestegard et al. (2004). The amount of alignment 
experienced by the protein can be adjusted by modu¬ 
lating its interaction with the solvent molecules either 
by varying their concentration or buffer condition, or 
by modifying the protein itself. 

A general guideline in optimizing sample condition 
for RDC measurements is to achieve enough alignment 
without affecting the NMR resonances. The upper 
limit of an alignment is where the NMR resonances 
show poor quality. This is typically due to line broad¬ 
ening that occurs when the alignment is too strong, and 
long-range dipolar couplings introduce multiplets in 
the NMR resonances. In addition increased viscosity 
of the medium can also decrease the transverse relax¬ 
ation times as well as exchange broadening of the 
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Residual Dipolar Coupling, 
Fig. 3 Heteronuclear 
J-coupled spectra from 
anisotropic and isotropic 
samples. Spectra showing 
residues G35 and T55 of 
Ubiquitin with 10 mg/ml Pfl 
phage (a) and without (b). 
There is slight resonance 
broadening from long-range 
proton-proton dipolar 
interaction that could be 
observed. The difference in 
the J values from the two 
spectra provides the RDCs. 
Since J^h is negative, the D^h 
values are 8.5 and — 6.8 Hz for 
T55, and G35, respectively 


a b 



protein due to its interaction with the medium. A good 
target is an alignment that results in RDC values that 
are one order of magnitude larger than the experimen¬ 
tal error. This typically corresponds to a degree of 
alignment in the order of 10”^ or RDC values 

in the range of 10-30 Hz. 

Measurements of RDC 

The RDC manifests itself as an apparent change in the 
measured coupling constant between the two nuclei. 
To extract the RDC values, measurements of the 
J-couplings in anisotropic and isotropic conditions 
are required and their difference is the RDC. Any 
methods that can be used to measure J-couplings are 
applicable for RDCs as well. Corresponding 
spectral regions of Ubiquitin from a sample containing 
Pfl phage and one without are shown in Fig. 3. The 
RDCs can be calculated by taking the difference of the 
J values between the two spectra. The resulting exper¬ 
imental Dnh are plotted as a function of residue num¬ 
ber in Fig. 4. 

RDC is an ensemble and time average of the dipolar 
interaction between two nuclei. This averaging not 
only includes the Brownian rotation, but internal 
motions of the protein as well. The time average covers 
motions in timescales up to the measurement time of 
the coupling (1/D ^ ms). Typically flexible parts of 
a molecule will have smaller dipolar couplings than 
rigid segments. The amplitude of these types of 
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Residual Dipolar Coupling, Fig. 4 Range of measured Di,jh 
values for Ubiquitin. The measured Dnh values in a sample 
containing 10 mg/ml Pfl phage are plotted as a function of 
residue number 

motions can be represented by the generalized order 
parameter (S) that scales the RDC. In some special 
circumstances where multiple conformers might exist 
in solution, a more complex description of the RDC 
will be needed to average over all of the possible 
conformers in the ensemble. 

Possible Uses of RDC 

It is useful to illustrate why RDC provides unique 
structural information compared to other NMR 
restraints. RDC of directly bonded nuclei, where the 
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Residual Dipolar Coupling, 
Fig. 5 The Dnh values define 
the orientation of the NH 
bond. Each bond orientation is 
defined to the same alignment 
coordinate axes (Axx, Ayy, and 
Axx), thus providing a relative 
orientation between them 
irrespective of their distances. 
In contrast, no NOE 
information is available for 
residues V26 and Q31 since 
they are further than 5 A apart 


A 


zz 



distance between them is known, depends only on the 
orientation of their bond relative to the principal axes 
system (PAS) and the degree of alignment. Figure 5 
shows a short helical segment of a protein Uhiquitin in 
the PAS. A measured Dnh value for a residue, such as 
V26, will position its NFl bond in the PAS with asso¬ 
ciated cylindrical coordinates 026 and (^ 26 - Since the 
orientations of NFl bonds for different residues based 
on their Dnh values are described in the same coordi¬ 
nate system, for instance, for V26 (026 and ^ 26 ), 130 
(030 and cjtso), and Q31 ( 03 i and 4>3i)i their relative 
orientations are determined and are independent of 
distance separating them. In contrast, NOE informa¬ 
tion will not be available to provide information 
linking residues V26 to 130 or Q31 since the distance 
between them is over 5 A. Therefore, RDCs contain 
long-range structural information, making them espe¬ 
cially suitable for unique applications that will be 
discussed below. 

One immediate application of RDCs is to validate 
a known structure. By fitting measured RDCs to 
a structure the five elements of the alignment matrix 
can he determined. They will contain the alignment 
magnitude (D„), rhombicity (R), and three Euler angles 
defining the principal axes system in the molecular 
frame. The calculated RDCs can be compared to the 
measured values to determine how well the structure 


agrees with the experimental data. The most conve¬ 
nient method to quantify the agreement is to use the 
Q factor, 

mens _ j^calc^^^ ^ 

^ J^meas ^ ^ ^ 

where the angular brackets signify average over all of 
the RDCs. The Q value can vary between 0 and 1 with 
lower value indicating a better agreement. The best ht 
for the Dnh values for Uhiquitin obtained in 10 mg/ml 
Pfl phage is shown in Fig. 6. The Q factor associated 
with that fit is 0.23 to the Ubiquitin X-ray structure 
(Vijay-Kumar et al. 1987). 

Using the same approach, Annila et al. (1999) 
showed that RDCs measured for an unknown protein 
structure could be used to choose the best representa¬ 
tive from a library of known structures. This is a very 
powerful method in identifying a possible three- 
dimensional fold of a protein quickly. 

Alternatively if the structure is known, an alignment 
tensor may be predicted with the assumption of steric 
or electrostatic interaction with the alignment medium 
using the program PALES (Zweckstetter 2008). The 
predicted RDCs can be compared to the measured 
values and can extend the use of RDCs in structural 
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Residual Dipolar Coupling, Fig. 6 A correlation plot between 
measured and expected D^h values for Ubiquitin. The RDC 
values shown in Fig. 4 are fit to the X-ray structure. The optimal 
values for £)„ = —13.1 Hz, R = 0.67, and the Euler angles 
describing the alignment tensor with respect to the X-ray coor¬ 
dinate frame is a = 89°, (3 = 122°, and j = 48° 



Dnh (Hz) 

Residual Dipolar Coupling, Fig. 7 The histogram distribu¬ 
tion of Dnh for ubiquitin. The shape is similar to powder pattern 
encountered in solid state NMR. The positions of its extrema 
correspond to the diagonal elements of the alignment tensor Axx, 
Ayy, and A^^ 


analysis to determine oligomeric states, flexibility, 
multiple conformations, and structural homology. 

In multi-domain proteins, where the structure of 
individual domains are known but their relative orien¬ 
tation cannot be defined due to lack of NOE informa¬ 
tion, RDCs are very useful. The alignment tensor for 
individual domain can be determined independently by 
fitting their experimental RDCs. In systems where the 
domains are rigidly coupled, there should only be one 
alignment tensor. Therefore, the and R values for all 
of the domains should be the same and by imposing 
their alignment tensors to be collinear, the relative 
orientations between the domains can be determined. 
Inter-domain flexibility can also be assessed the same 
way. Varying and R values for different domains 
indicates that they do not behave as a single rigid 
molecule. The ratio between their values can 
provide an estimate for their relative flexibility. 

The more general use of RDCs is to determine or 
refine an unknown structure. In this case, an estimate of 
the magnitude of the alignment tensor is required. This 
can be achieved by examining the distribution of the 
measured RDCs. Figure 7 shows the histogram distri¬ 
bution for measured Dnh values of Ubiquitin in Pfl 
phage. Its extrema are associated with NH bonds ori¬ 
ented parallel to the alignment principal axes, therefore 
they provide the principal values of the alignment 
tensor: Axx = —3.8, Ayy = 24.6, and = —26.6. 
They do not follow the traceless property of the 


alignment tensor. This is due to the incomplete orien¬ 
tation sampling of the measured dipolar couplings. In 
general, using a combination of different backbone 
RDCs: Dnh, Dnc, DcaHa, and Dcacs with proper 
scaling, can solve this problem. Nevertheless, using 
the above values the alignment magnitude 
Da = —13.3, and rhombicity R = 0.56, can be esti¬ 
mated. They can be used as starting values for 
a structure calculation using programs such as Xplor- 
NIH (Schwieters et al. 2003). During the calculation 
orientation of each NH bond vector as well as the overall 
alignment tensor will be optimized to provide the best 
agreement with its measured RDC. A comparison 
between structures calculated with and without RDCs 
is shown in Fig. 8. A striking improvement in the preci¬ 
sion of the stmctures with the addition of RDCs can be 
observed. 

Another novel proposal was to use RDCs to deter¬ 
mine de novo structures. This requires multiple RDCs 
for each peptide plane. There are at least two different 
strategies. First is the molecular fragment replacement 
method where measured RDCs are used to pick 
a fragment from a library of high-resolution structures, 
whose predicted backbone RDCs agree with measured 
values (Delaglio et al. 2000). This is done sequentially 
with overlapping fragments. They are then combined 
to form an initial fold which can further be refined with 
additional NMR data. Another approach is called 
meccano, where the backbone structure is built up 
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Residual Dipolar Coupling, Fig. 8 Superposition of 20 low¬ 
est energy structures calculated without (a) and with (b) RDCs 
for factor-H N-terminal domains 1-3. The structures are least 
squared fit using the backbones of domains 1-3. The RMSD for 
structures calculated without RDCs is 3.2 A, while those calcu¬ 
lated with RDCs resulted in RMSD of 0.8 A (Maciejewski et al. 
201 1). The CCP domains 1, 2, and 3 are indicated in (b) 


from peptide planes, which are assumed to be rigid and 
can be oriented in the alignment frame with multiple 
RDCs (Hus et al. 2000). 

Summary 

The ability to measure RDCs by aligning biomolecules 
in a magnetic field has opened up new possibilities in 
addressing a broad range of questions from fundamental 
physical/chemical properties of molecules to molecular 
mechanism. The type of structural information RDCs 
provide uniquely suits them as tools to validate strac- 
tures, determine relative domain orientation, and de novo 
structure determination. More importantly, since RDCs 
are time and ensemble averaged, they are excellent 


reporters for motions in biomolecules that can occur at 
multiple timescales and in some cases even provide 
information on the possible conformational ensembles. 
Clearly RDCs will contribute in how NMR can be used 
to relate structure and dynamics of molecules to their 
functions. 
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Definition 

Cofactors in proteins are organic molecules or metal 
ions that are bound to proteins and serve as active sites 
for various biological functions. The Raman bands of 
cofactors that exhibit electronic transitions separated 
from those of the protein matrix can be selectively 
enhanced upon resonant excitation. This resonance 
Raman spectroscopy allows determining structures 
of protein-bound cofactors and monitoring their struc¬ 
tural changes during biophysical and biochemical 
processes. 

Introduction 

With the development of lasers, powerful monochro¬ 
matic light sources became available for Raman spec¬ 
troscopy in the beginning of the 1970s. This technical 
breakthrough led to a renaissance of this spectroscopic 
technique since it is now possible to choose excitation 
lines in resonance with an electronic transition of 
a molecule. Under these conditions, the vibrational 
modes solely of this chromophore are selectively 
enhanced resulting in a substantial increase of the 
Raman scattering quantum yield. The advantages 
of this resonance Raman (RR) effect for studying 
biological systems were soon recognized: RR spec¬ 
troscopy allows probing the vibrational spectra of 
cofactors in proteins with high selectivity and high 
sensitivity and without interference by Raman bands 


of the optically transparent macromolecular protein 
matrix. 

Meanwhile, thousands of original papers and 
review articles have been published documenting 
the contribution of RR spectroscopy to elucidating stmc- 
ture-dynamics-function relationships in protein-cofactor 
complexes. The various classes of proteins that have been 
studied include quite different cofactors such as redox- 
active metalloporphyrins (hemes), iron-sulfur centers, 
nonheme iron complexes, and copper sites, as well as 
photoactive retinals, open-chain tetrapyrroles, or chloro¬ 
phylls. This account is dedicated to point out the potential 
and limitations of the RR technique and to highlight a few 
approaches applicable to specific classes of protein- 
cofactor complexes, focussing on heme proteins and ret¬ 
inal- and tetrapyrrole-containing proteins. For more com¬ 
prehensive reviews, the reader is referred to the literature 
(Siebert and Hildebrandt 2007; Czemuszewicz and Spiro 
1999; Spiro 1987; Spiro 1988). 

General Considerations for the Design of 
Resonance Raman Experiments 

Although RR spectroscopy probes the vibrational 
modes of the electronic ground state, the RR intensities 
are related to the properties of the resonant electronic 
transition. In general, the enhancement of the Raman 
bands depends on the proximity of the excitation line 
with respect to the energy of the electronic transition 
and its oscillator strength. Thus, the first prerequisite 
for exploiting the advantages of the RR effect is 
a sufficiently strong electronic transition of the cofac¬ 
tor that is in addition separated from those of the 
aromatic amino acid side chains to avoid interference 
of the cofactor RR spectrum with scattering and emis¬ 
sion contributions from the protein. This condition 
is fulfilled for essentially all organic cofactors and 
open-shell transition metal complexes that exhibit 
71 ^ TT* and charge-transfer transitions above 
350 nm, respectively. However, when the excitation 
line approaches the 0^0 electronic transition from 
lower energies, the population of the electronically 
excited state increases such that unwanted photophysical 
and photochemical processes of the cofactor may gain 
importance. The decay to the electronic ground state may 
be radiationless as in the case of cofactors containing 
open-shell transition metals, or via emission of light, 
usually fluorescence that falls in the same spectral region 
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as the Stokes-shifted RR bands. If this process occurs 
with a quantum yield larger than ca. 10~^, as it is the case 
for open-chain tetrapyrroles, chlorophylls, or flavins, 
fluorescence may obscure the RR signals for which 
quantum yields may be, at best, in the order of 10~ . 

There are various strategies to circumvent fluores¬ 
cence or to recover the RR spectrum in the presence of 
fluorescence (Siebert and Hildebrandt 2007). The most 
simple approach is to shift the excitation line away 
from the fluorescing excited state either to shorter 
wavelengths (resonance with a higher lying electronic 
state) or to longer wavelengths (pre-resonance 
enhancement). Further strategies employ optoelec¬ 
tronic blocking of the fluorescence using a fast Kerr 
shutter or to subtract the fluorescence background 
using a shifted excitation difference technique. 

RR experiments of photoreceptors such as retinal or 
tetrapyrrole proteins face the additional challenge that the 
exciting laser beam may also cause a photochemical 
reaction of the cofactor, followed by thermal relaxation 
processes. Experiments that are directed to probe the 
parent state, that is, the unphotolysed state, require con¬ 
ditions to ensure the lowest possible photoconversion 
parameter, which is the product of the photochemical 
rate constant Iq and the residence time of the molecules 
in the exciting laser beam At (Siebert and Hildebrandt 
2007; Althaus et al. 1995). This can be achieved by a low 
photon flux (i.e., low laser power) and a rapid movement 
of the sample volume through the laser beam, using 
either rotating cuvettes or flow systems. 

In photo-inactive and non-fluorescing molecules, 
the major portion of the incident radiation energy that 
is absorbed is converted to thermal energy via radia¬ 
tionless transitions. For photon fluxes typically used in 
continuous wave (cw) experiments (i.e., with continu¬ 
ous illumination), the energy is rapidly dissipated 
to the environment which may cause a local heating 
of the irradiated volume of a stationary sample. 
The effects of laser-induced heating may be even 
more severe in pulse-laser experiments. Thus, also 
for photo-inactive molecules, a moving sample device 
is recommended to avoid laser-induced heating. 

Time-Resolved Resonance Raman 
Experiments 

The essential element of the set-up is a device to 
synchronize either the probe or the detection event 


with the initialization of the process to be studied. 
The way how synchronization is accomplished 
depends on the nature of the process and the time 
resolution of interest. A particular wide time range 
from femtoseconds to seconds can be covered in the 
case of photoinduced reactions using an additional 
photolysis laser beam that triggers the reaction. For 
the photolysis (pump) beam the photoconversion 
parameter Iq-Ai should be large to ensure an efficient 
photoconversion but it should be kept as small as 
possible for the probe beam to avoid secondary photo¬ 
reactions (Althaus et al. 1995). Ultrafast events mainly 
referring to exited state reactions of the cofactor 
(<10~" s) are probed by stimulated Raman spectros¬ 
copy (Dasgupta et al. 2009). Subsequent relaxation 
processes that occur on the long picosecond to nano¬ 
second timescale are based on pump-probe experi¬ 
ments with pulsed laser excitation in which 
synchronization of the pump and probe event is 
achieved via optical delay lines (Ondrias and Simpson 
1996). Slower processes are accessible by cw-laser 
experiments in which pump and probe beams are 
focussed to a flowing sample but spatially displaced 
with respect to each other. Thus, the flow rate of the 
sample and the beam displacement defines the time 
resolution of the experiment. Alternatively, time- 
resolution may be achieved by optoelectronic gating 
of the cw laser beams which offers the additional 
advantage of expanding the dynamic range of the 
experiments substantially (Naumann et al. 2006). 

A more general approach for time-resolved RR 
spectroscopy that does not rely upon photo-triggering 
is based on rapid-mixing devices, applicable to 
probe, for instance, enzymatic processes or protein 
(un)folding. In these experiments, the solution leaves 
the mixing chamber with a constant velocity and enters 
an optical cell for probing the RR spectra. The time- 
resolution is then controlled by the spatial displace¬ 
ment of the probe beam with respect to the mixing 
chamber, whereas the limit of the time-resolution 
depends of the mixing time (Yeh et al. 1998). As an 
alternative, the mixing device may be coupled to 
a freeze quench set-up which allows terminating 
the reaction at variable delay times by injection of 
the solution into a bath of isopentane at ca. 150 K. 
Subsequently, the various samples stabilized at 
different steps of the reaction are transferred 
to a cryostat for RR spectroscopic measurements 
(Oellerich et al. 2000). 
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Resonance Raman Spectroscopy for 
Analyzing Cofactor Structures 

RR spectra contain detailed information about the 
molecular structure of the cofactor and its interactions 
with the protein environment. However, up to now, it is 
still a challenge to extract structural data from the 
vibrational spectra. The choice of the most appropriate 
strategy depends on the class of protein-cofactor 
complexes, the availability of three-dimensional 
structure models, genetically engineered protein 
variants, or isotopically labeled cofactors. 

Heme Proteins 

Heme proteins are ideal target proteins for RR spec¬ 
troscopy since the heme cofactor exhibits a strong 
electronic transition in the violet spectral region and 
a weaker one between 500 and 570 nm. Both transi¬ 
tions can be used to measure RR spectra although the 
underlying scattering mechanism is different (A- and 
B-term mechanism) (Siebert and Hildebrandt 2007; 
Czernuszewicz and Spiro 1999; Spiro 1988). The 
structure of the heme cofactor is relatively rigid and 
there are only minor structural differences among the 
various heme proteins, brought about by specific inter¬ 
actions with the protein environment. The changes of 
structural parameters and electronic properties that can 
nicely be monitored by RR spectroscopy refer to the 
heme iron and its axial ligands. Extensive studies on 
metalloporphyrins of known high resolution structures 
allowed identifying characteristic marker bands that 
are highly sensitive toward subtle changes of the 
heme core size and the electron density in the tetrapyr- 
role macrocycle (Parthasarathi et al. 1987). These 
parameters vary with changes of the oxidation and 
spin state of the heme iron as well as with the number 
and type of the axial ligands. The transition from the 
ferric to the ferrous state causes an increase of the 
electron density in the anti-bonding 7r*-orbitals of 
the porphyrin, specihcally at the C-N bonds, which 
hence experience a weakening of the respective force 
constants. As a result the mode V 4 , which is dominated 
by the C-N stretching coordinate, displays a distinct 
frequency downshift, typically by 10 cm“* or more. 
The transition from a six-coordinated low-spin ( 6 cLS) 
configuration of the ferric heme iron to the six- 
coordinated high spin ( 6 cHS) configuration, brought 
about by the exchange of strong-held by weak-held 
ligands, is associated with a considerable increase of 


the radius of the iron. To keep the iron in the porphyrin 
plane, the macrocycle has to expand which causes 
a weakening of C-C stretching force constants of 
the pyrrole rings. Modes which include these C-C 
stretching coordinates such as the mode V3, therefore, 
shift down in frequency, typically by ca. 20 cm~*. If 
one of the axial ligands is removed, then the remaining 
ligand pulls the iron somewhat out of the porphyrin 
plane such that the steric distortions of the macrocycle 
are less severe and the frequencies of the respective 
modes are just in between those of the 6 cLS and 6 cHS 
configuration. Also the redox transition is accompa¬ 
nied by a change of iron radius. While the ferrous 6 cLS 
iron just fits into the porphyrin core, the ferric 
6 cLS iron is smaller such that the macrocycle 
contracts, leading to an upshift of the respective 
marker modes. 

On the basis of these relationships, it is possible to 
probe functionally important processes of heme pro¬ 
teins, including electron transfer, substrate and ligand 
binding, and biocatalysis (Siebert and Hildebrandt 
2007; Hildebrandt 1992; Han et al. 2000). The uptake 
and release of molecular oxygen by hemoglobin and 
myoglobin (Mb) represents a biological function that is 
directly coupled to changes of the geometry and elec¬ 
tron density distribution of the heme. These changes 
are sensitively reflected by the RR spectrum as illus¬ 
trated in the case of Mb. O 2 binding is only possible in 
the reduced deoxy-Mb, which exists in the 5cHS state 
(Fig. lb). Oxygen occupies the vacant axial coordina¬ 
tion site under preservation of the formal oxidation 
state of the heme iron. However, the coordinated O 2 
possesses empty 7 t*-orbitals that may accept electron 
density from the porphyrin. As a consequence, the 
electron density marker mode V 4 undergoes an upshift 
from 1,356 to 1,376 cm“' (Fig. Ic), which is 
even higher in frequency than the mode of the ferric 
Met-Mb (1,370 cm”*. Fig. la). 

Also in heme-copper oxidases, oxygen binding to 
a ferrous 5cHS heme is a crucial step in the enzymes’ 
function. In this case, however, the bound O 2 is 
stepwise reduced and eventually converted to water 
(Han et al. 2000; Kitagawa 2000). Details of the cata¬ 
lytic cycle have been elucidated in time-resolved RR 
experiments monitoring not only the marker bands of 
the porphyrin but also the Fe-ligand modes that appear 
in the lower frequency range between 900 and 
500 cm“*. The nature of these bands can readily be 
assigned on the basis of isotope shifts and 
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Resonance Raman Spectroscopy of Protein-Cofactor 
Complexes, Fig. 1 RR spectra of (a) met-Mb, (b) deoxy-Mb, 
and (c) oxy-Mb obtained with 413-nm excitation 

thus allow identifying the intermediate states as 
a prerequisite for understanding the molecular mecha¬ 
nism of the catalytic oxygen reduction. 

Tetrapyrrole Binding Proteins 

Albeit chemically related to hemes, open-chain tetra¬ 
pyrrole display quite different properties (Fig. 2). Due 
to the lack of a bound open-shell transition metal ion 
and the significantly more flexible structure, fluores¬ 
cence and photochemical reactions represent the 
most efficient decay channels of these cofactors in 
the excited state. These excited state properties consti¬ 
tute a considerable challenge for RR spectroscopic 
characterization of protein-tetrapyrrole complexes. 
This is particularly true for phytochromes, a class of 


O 



Resonance Raman Spectroscopy of Protein-Cofactor 
Complexes, Fig. 2 Structural formula of the tetrapyrrole 
cofactor of phytochrome in the parent state Pr. The red arrow 
indicates the position of photoinduced double-bond isomeriza¬ 
tion that constitutes the first step of the Pr —> Pfr photoconversion 

tetrapyrrole-proteins serving as sensory photorecep¬ 
tors (Mroginski et al. 2007). Upon light absorption, 
the tetrapyrrole chromophore undergoes a double¬ 
bond photoisomerization at a methine bridge, followed 
by thermal relaxations of the cofactor and the protein 
which eventually brings the protein to its physiologi¬ 
cally active state. To circumvent fluorescence and 
uncontrolled photochemical reactions induced by the 
Raman probe beam, various strategies have been 
employed. Using near-infrared (1,064 nm) excitation 
avoids population of the first electronic transition at ca. 
700 nm and thus subsequent excited-states processes, 
but is sufficient for a pre-resonance enhancement of the 
Raman bands of the cofactor over the those of the 
protein matrix. These conditions do not only allow 
probing the unphotolyzed state but also the photocycle 
intermediates after inducing the photoprocess in cryo¬ 
genic experiments. In this way, all (intermediate) 
states of the photoconversion of phytochrome could 
be trapped and characterized by RR spectroscopy 
(Fig. 3) (Kneip et al. 1999). Each of the individual 
states displays a characteristic vibrational band pattern 
reflecting the respective chromophore structure and the 
specific inferactions with the protein environment. 
There are three spectral regions of particular interest. 
The bands between 1,580 and 1,650 cm“^ originate 
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Resonance Raman Spectroscopy of Protein-Cofactor 
Complexes, Fig. 3 RR spectra of the various states of the 
Pr —> Pfr photoconversion of plant phytochrome phyA obtained 
with 1,064-nm excitation and measured at 80K. Spectra of the 
intermediate states were obtained upon photolysing the sample 
at various temperatures between 130 and 240 K, followed by 
mutual subtraction to yield the “pure” spectra of the respective 
states (Kneip et al. 1999). The spectra are shown in the order of 
the reaction sequence. The contributions from Raman bands of 
the protein matrix are indicated by “P.” These contributions are 
weak for Pr, Lumi-R, and Pfr, but relatively strong for the two 
Meta-R species due to the weaker electronic transitions and the 
resultant poor resonance enhancement of the cofactor Raman 
bands 


from modes dominated by the C = C stretching coor¬ 
dinates of the methine bridges, which are indicators for 
the methine bridge geometry of the tetrapyrrole. Here, 
the most pronounced changes are noted for the transi¬ 
tion from Pr to Lumi-R indicating the double-bond 
isomerization at the C-D methine bridge. Between 


1,500 and 1,580 cm“\ the protonated tetrapyrrole 
displays two bands attributable to the N-H in-plane 
bending modes of the inner pyrrole rings B and C, 
which are missing when one of these rings is 
deprotonated as in the case of Meta-Rc. In the region 
around 800 cm“\ the RR spectra of all species show 
a strong band that is assigned to the C-H out-plane 
deformation of the C-D methine bridge and assumed 
to be sensitive for the torsion of this part of the tetra¬ 
pyrrole. However, a more specific correlation of the 
spectral changes in these regions with structural 
parameters of the cofactor is not possible without sup¬ 
port by a reliable normal mode analysis. Due to the 
lack of three-dimensional structure models, such the¬ 
oretical approaches were initially restricted to quantum 
chemical methods which were employed to calculate 
Raman spectra of the large number of theoretically 
possible cofactor geometries differing, inter alia, with 
respect to the configuration (Z/£) and conformation 
(syn/anti) of the three methine bridges (Fig. 2). 
The comparison with the experimental spectra of 
various phytochrome adducts including isotopically 
labeled tetrapyrroles then allowed valuable conclu¬ 
sions about the gross chromophore structure in the 
protein (Mroginski et al. 2007). With the first success¬ 
ful determination of three-dimensional structures of 
the parent states of phytochrome, advanced theoretical 
methods (QM/MM) can be employed that take into 
account the effect of the protein environment on the 
cofactor (Mroginski et al. 2009). Further development 
of these methods opens the possibility to comprehen¬ 
sively extract the structural information from the RR 
spectra to refine existing structural models at the 
cofactor sites and, moreover, to generate structural 
models for protein states not accessible to the crystal¬ 
lographic analysis. 
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Time-Resolved Resonance Raman 
Spectroscopy for Analyzing Cofactor 
Dynamics in Photoreceptors 

The most comprehensively studied photoreceptor is 
bacteriorhodopsin (BR), a bacterial membrane-bound 
protein that serves as a light-driven proton pump to 
convert light energy into chemical energy. Proton 
pumping is coupled to a photoinduced reaction cycle 
that starts with the photoisomerization of the retinal 
cofactor. The photocycle is completed within a few 










































R 


2226 


Resonance Raman Spectroscopy of Protein-Cofactor Complexes 



Av/cm-1 

Resonance Raman Spectroscopy of Protein-Cofactor 
Complexes, Fig. 4 Time-resolved RR spectra of the various 
states of the photocycle of SrII from Natronohacterium 
pharaonis. The spectra were obtained by the gated cw pump- 
probe technique using a 514-nm pump beam and various probe 
beams to meet the resonance condition for the species of interest. 
The individual experiments afforded spectra including contribu¬ 
tions from various species. Spectra of the pure species as shown 
in this figure were obtained by subtraction procedures guided by 
a component analysis (Naumann et al. 2006) 


milliseconds. These kinetic properties as well as the 
high stability constituted BR as an ideal test protein for 
developing time-resolved spectroscopic techniques to 
characterize the vibrational signature in the various 
intermediate states of the photocycle and to monitor 


the dynamics of the cofactor structural changes 
(Althaus et al. 1995). Applying time-resolved RR to 
sensory rhodopsin II (SrII), a related retinal protein, 
represents a challenge inasmuch as the recovery time 
of SrII is longer by a factor of nearly 1,000. As 
a consequence, the dynamic range to be covered in 
cw pump-probe experiments comprises more than six 
orders of magnitude which can be most elegantly 
achieved by employing the optoelectronic gating 
technique (Naumann et al. 2006). 

The time-resolved RR spectra of the intermediate 
states of SrII that are formed in this time range, as well 
as the parent state, are shown in Fig. 4. For each species 
the most prominent band in the spectra originates from 
a C = C stretching mode of the polyene chain of the 
cofactor. The frequency of this band follows an inverse 
linear relationship with the wavelength maximum of 
the electronic absorption band and thus is most appro¬ 
priate to monitor the time evolution of the various 
intermediates in kinetic RR studies of the photocycle. 
For determining the cofactor structure, however, the 
analysis of the “fingerprint” region between 1,150 and 
1,300 cm“* which involves modes including C-C 
stretching and C-H in-plane coordinates (Althaus 
et al. 1995) is more important. These modes are diag¬ 
nostic for the trans/cis configuration and syn/anti 
conformations of the polyene double and single 
bonds, respectively. 

Summary 

In this account on the RR spectroscopy of protein- 
cofactor complexes, a few approaches and examples 
have been described in an attempt to highlight partic¬ 
ular promising applications of this technique. The 
selection is certainly not complete and, in addition, 
biased by the author’s preference for specific fields in 
molecular biophysics. However, the interested reader 
may readily find furfher illustrative examples and 
more comprehensive treatises in the literature cited in 
this contribution. 

Cross-References 

► Nonlinear IR Spectroscopy of Proteins 

► QM/MM Calculations of Vibrational Spectra 

► Reaction-Induced Infrared Difference Spectroscopy 
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► Surface-Enhanced Resonance Raman Spectroscopy 
in Electron Transfer Studies 

► Ultraviolet Resonance Raman (UVRR) Spectroscopy 
Studies of Structure and Dynamics of Proteins 
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Retinal Prosthesis 

Patrick Degenaar 

School of Electrical Electronic and Computer 
Engineering, Newcastle University, Newcastle upon 
Tyne, UK 

Synonyms 

Artificial eye; Retinal implant; Visual prosthesis 


Definition 

A retinal prosthesis is a device which returns sight to the 
blind by selective stimulating remaining functional layers 
in the retina which still connect to the visual cortex. As 
a method of treatment, it is predominantly applicable to 
those with the blinding disease retinitis pigmentosa. 

Basic Characteristics 

A retinal prosthesis fundamentally consists of an imag¬ 
ing system to acquire the visual scene, a processing 
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Fig. 1 Retinitis pigmentosa. 
The normal retina consists of 
photoreceptor layers, 
processing neurons 
(horizontal, bipolar, and 
amacrine cells) and 
communication cells (retinal 
ganglion cells). In retinitis 
pigmentosa, the rods are 
dysfunctional, which 
eventually causes death of 
surrounding cones. As there 
are more rods in the periphery, 
the effect is a tunnel vision 
which slowly decreases in 
tunnel dimension until vision 
is lost 


Normal Retina I Retina with RP 



Retinitis Pigmentosa 


Largely fuctional 


Some remaining 
function? 


Photoreceptors 


Total dysfunction in 
late stages 


> 


engine, and a neural stimulator to transfer the signals 
to the visual system. Additional components include 
optics, power supply, and in the case of implantable 
prosthesis, a transmission system between the external 
and implantable components. 

Clinical Applicability 

According to the Royal National Institute for the Blind 
(RNIB) between 2% and 6.5% of the population, 
depending on ethnicity, will have disorders of the retina 
which can render them legally blind (RNIB 2012). Addi¬ 
tionally, the RNIB estimates that the cost to the economy 
due to lost earnings and social care support for the blind 
is around £18,000 per person per year. 

While refractive errors can be corrected with glasses 
and cataracts (opacity in the lens) via surgery, degener¬ 
ation of the retina is much less treatable. Nevertheless, in 
recent years, pharmaceuticals have become available to 
slow the degeneration of those with so-called wet condi¬ 
tions whereby blood vessels in the eye leak. Future gene 
therapy and stem cell approaches will help, but for those 
who have lost their ability to see, visual prosthesis pres¬ 
ently holds the best hope for visual restoration. The target 
population for retinal prosthesis is those who still have 
a functional connection between the retina and the visual 


cortex. The primary target condition is therefore retinitis 
pigmentosa (prevalence 1:3,500) which is described in 
Fig. 1. 

History 

Stimulation of nerves related to sight results in phos- 
phenes, i.e., apparent flashes of light (Dagnelie 2011). 
The first reports of such phosphene generation in the 
literature date as far back as the fifth century BC 
(Griisser and Hagner 1990) when deformation of the 
eyeball presumably resulted in retinal ganglion cell stim¬ 
ulus. However, for many years, it was assumed that 
retinal degenerative disorders destroyed all communica¬ 
tion between the eye and brain. As such the early focus 
was on visual cortical prosthesis. The first attempts 
started by Forster, a German physiologist, in 1929, who 
inserted electrodes into the visual part of patient’s brains. 
More serious attempts began in 1968 by an American 
researcher called Brindley who implanted an array of 
electrodes into a 50-year-old woman. Since then, despite 
trials in the early 1970s under Dobelle, progress has been 
very limited. Although an Australian called Tassicker 
patented (US patent: 2760483) a concept of retinal pros¬ 
thesis in 1956, it was not until 1992 that such an 
approach was considered viable. In this year. Stone and 
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Fig. 2 The main concept of 
visual prosthesis. The 
prosthesis conceptually 
consists of image acquisition 
(camera), image processing 
(mobile phone processor), 
a stimulator (electrode array or 
optoelectronic array), and 
some form of power 
transmission such as RF. 
Target areas for stimulation 
are then either the retina, the 
optic nerve, the lateral 
geniculate nucleus (visual part 
of the thalamus), or the visual 
cortex 




Image 

acquisition 


Image/retinal 

processor 


Stimulator 


Power 

supply 


coworkers discovered that patients with a blinding dis¬ 
ease called retinitis pigmentosa still had functioning 
retinal ganglion cells. Subsequently, the dominant 
focus of the visual prosthesis community thus shifted 
to retinal prosthesis. 

Forms of Visual Prosthesis 

The human visual system consists of the eye, the optic 
nerve, and the visual processing centers in the thala¬ 
mus and visual cortex of the brain. The eye basically 
consists of an optical component, which performs the 
optomechanics (i.e., focusing and movement) and an 
image acquisition/processing component known as 
the retina. The communication cells (retinal ganglion 
cells) of the retina then project down the optic nerve to 
the visual thalamus, which sorts the incoming infor¬ 
mation. The visual cortex then processes the informa¬ 
tion to allow the individual to recognize his/her 
environment (Bharath and Petrou 2008). Any damage 
to this sophisticated but delicate system through 
trauma or disease will impair subsequent vision, and 
any upstream components would need to be replicated. 

A conceptual visual prosthesis is seen in Fig. 2. An 
image acquisition system (camera) is required to acquire 
the visual scene. A processing engine is then required to 


encode the information and a stimulator is required to 
transfer visual information to the target component of 
the visual system. Some form of power supply or battery 
is then required to allow the components to function. 
Much of the external image acquisition and processing 
can be implemented using components found on modem 
mobile phones. Therefore, the key limiting factor in the 
quality of vision returned lies with the stimulator com¬ 
ponent. Generally if the optic nerve, which carries sig¬ 
nals from the eye toward the visual cortex, is destroyed, 
then prostheses targeting the visual cortex or the visual 
thalamus would need to be considered. If the optic nerve 
is intact, then a prosthesis stimulating the retina or optic 
nerve would be less invasive than entering the brain. 

Electronic Retinal Prosthesis 

Since the discovery that functional layers remained in 
patients with retinitis pigmentosa, there have been 
numerous approaches developed to insert arrays of 
electrodes to the retina in order to return functional 
vision. Epi-retinal approaches have placed the implant 
between the vitreous and the retinal ganglion cells 
while subretinal approaches have placed the implant 
between the inner retina and the retinal pigment epi¬ 
thelium. The subchoroidal approaches have placed the 
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Fig. 3 Optogenetic retinal 
prosthesis. In the case of 
optogenetic retinal prosthesis, 
a virus is injected into the eye 
to make one of the remaining 
cell layers photosensitive. 
High brightness miniature 
screens using, for example, 
microLED arrays, then project 
the appropriate stimulating 
light pattern into the eye 



implant behind the choroid, and sub-scleral approaches 
have placed the electrodes on the surface of the eyeball 
(Tombran-Tink et al. 2007; Dowling 2009). So far, 
rigid planar chips have been implanted. The field of 
view returned to the patient is thus limited by the size 
of the chip which in turn is limited by the curvature of 
the eye. Within the returned field of view, the resolu¬ 
tion of returned vision is related to the number of 
stimulating points. The quality of the returned vision 
is related to how effectively the electrodes generate 
visually encoded signals in the target cells. 

It was initially assumed that simply implanting 
arrays of stimulating electrodes in the eye would 
bring back sight. The early devices did indeed generate 
phosphenes, but there were many engineering, mate¬ 
rials, and clinical challenges. As such, it has taken till 
2012 for the first devices to become commercially 
available. Returned vision is presently rudimentary, 
but advances in retinal implant surgery, new electrode 
materials and topographies, stimulation biophysics, 
and implantable circuit designs in recent years will 
progress the field further. 

There are many groups now working in the field, 
particularly in the USA, Germany, Japan, China, and 
Australia. There are also presently two companies 
competing to bring electronic implants to market. Ret¬ 
ina AG, a German company based in Tubingen focuses 
on a subretinal approach, implanting chips of up to 
1,500 electrodes underneath the retina. They have 
already demonstrated short-term human trials whereby 


the patient could read large individual characters with 
black and white contrast. Second Sight, a California 
based company focuses on the epi-retinal approach 
and has successfully implanted chips chronically for 
some years. Their 60 electrode Argus implants are 
implanted between the vitreous humor and the retina 
with a special retinal tack to hold them in place. Both 
systems incorporate radio frequency coils to wirelessly 
transmit power and visual signals from the external 
headset to the implanted chip. 

Optogenetic Retinal Prosthesis 

The most radical in retinal prosthesis design could result 
from the new field of optogenetics. In 2003, a group 
investigating the basic properties of algae discovered 
a protein called channelrhodopsin (Georg Nagel et al. 
2003) which is now revolutionizing the neuroscience and 
neuroprosthetic field. Channelrhodopsin can be geneti¬ 
cally incorporated into nerve cells and acts as an optical 
switch allowing the cell to be controlled with pulses of 
light. As the eye is transparent, if remaining cell layers in 
the retina were rendered light sensitive, they could be 
stimulated from the outside without the need for implan¬ 
tation, as described in Fig. 3. As such a European team 
being led by groups in Newcastle, UK, and Basel, Swit¬ 
zerland, are developing this new approach. In the end, it 
could be administered by a simple injection of virus into 
the retina, followed by the fitting of a headset with 
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sufficient brightness to stimulate the target cell layers. As 
present virtual reality headsets typically have 40° fields 
of view and resolutions similar to modem mobile phone 
displays, returned vision may be significantly improved 
over electronic approaches. Also without the requirement 
for hours of surgery, it may be significantly cheaper than 
electronic approaches. 


Reverse Transcriptase 

► HIV-l Reverse Transcriptase - Computational 
Studies on the Polymerase Active Site 


Perspectives 

In the coming years, retinal prostheses will become 
increasingly capable. However, most in the community 
accept that it is highly unlikely that present approaches 
will return what is accepted as fuUy functional vision. 
Presently, returned vision can be described as handful of 
flashing dots (Dagnelie 2011). Simple high contrast objects 
can be perceived, but face recognition, normal reading, and 
many normal daily visual tasks will still be very difficult 
and perhaps best performed via assistive devices. Never¬ 
theless, in these early years, if functional vision, sufficient 
for navigation and simple tasks, can be returned, the impact 
on individual patients will he very high. 
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Abbreviations 

DARR Dipolar-assisted rotational resonance 
DOPE 1,2-dioleoyl-5«-glycero-3- 
phosphoethanolamine 
EFG Electric field gradient 

EPR Electron paramagnetic spin resonance 

FTIR Fourier transform infrared 
GPCR G protein-coupled receptor 
MAS Magic angle spinning 

Meta I Metarhodopsin I 

Meta II Metarhodopsin II 

POPC 1 -palmitoyl-2-oleoyl-5«-glycero-3- 

phosphocholine 
TM Transmembrane 
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Synonyms 

Seven-transmembrane domain receptors; Solid-state 
NMR 


Introduction 

Rhodopsin is responsible for the dark (scotopic) vision of 
vertebrates, and is a member of the vast protein family of 
G protein-coupled receptors (GPCRs) that regulate 
many important functions of the human body. Although 
they are the targets of more than 30% of known drugs 
and pharmaceuticals, the structures of most GPCRs are 
currently unknown. Conventional approaches are limited 
hy difficulties in crystallization for X-ray analysis, as 
well as the requirement for detergent solubilization in 
high-resolution NMR spectroscopy. Notably, rhodopsin 
is one of the few GPCRs for which the X-ray stractures in 
different states have been recently determined (Park et al. 
2008; Scheerer et al. 2008; Choe et al. 2011; Standfuss 
et al. 201 1). Activation of rhodopsin has been studied by 
Fourier transform infrared (FTIR) methods which detect 
short-range local interactions (Mahalingam et al. 2008), 
together with solid-state ^^C NMR spectroscopy for 
short distance restraints (Ahuja et al. 2009a), and by 
site-directed spin-labeling experiments that are sensitive 
to longer distances (Altenbach et al. 2008). This entry 
reviews the application of solid-state NMR methods for 
obtaining structural and dynamical information for the 
retinal ligand of rhodopsin pertinent to its activation 
mechanism (Stmts et al. 2007, 2011a; Ahuja et al. 
2009a). Solid-state NMR spectra provide knowledge of 
the retinal stracture whereas NMR relaxation methods 
reveal information about its dynamics. Combining the 
results of solid-state NMR spectroscopy with com¬ 
plementary FTIR, *^C NMR, and spin-labeling methods 
yields a unified picture of the rhodopsin activation 
mechanism (Stmts et al. 2011b). 

Retinal Structure in Rhodopsin Is 
Determined Using Angular and Distance 
Restraints from Solid-State and NMR 

Studies of aligned bilayers containing rhodopsin with 
selectively deuterated retinal (Fig. la) allow one to 
determine the orientation of the quadrupolar coupling 
tensor of deuterium nuclei (Struts et al. 2007) 


with respect to the membrane normal (vector no in 
Fig. lb). Orientations of the deuterated methyl groups 
calculated in this way can be used for modeling of the 
retinal stmcture in the binding pocket of rhodopsin. 
For rapidly spinning methyl groups, the principal axis 
of the motionally averaged quadrupolar tensor coin¬ 
cides with the rotation axis (symmetry axis) of the 
methyl group M, and thus determines the average 
value of the angle 0.md between M and no (Fig. 1). 
Solid-state NMR spectra of rhodopsin and 
metarhodopsin I obtained for the different methyl 
groups are presented in Fig. 2a, b (Struts et al. 2007). 
The fitting parameters used in the simulations of the 
theoretical NMR spectra include the C-C^Hs 
methyl bond orientation, the width of the orientational 
distribution of the local membrane normals (mosaic 
spread), the residual deuterium quadrupolar coupling, 
and the NMR line broadening. Reduction of the 
quadrupolar interactions of deuterium nuclei with the 
electric field gradient (EFG) in the molecule occurs 
due to the fast spinning of the methyl groups and 
reorientation of their symmetry axes. The NMR 
line shape is calculated by randomly generating the 
angle of rotation of the rhodopsin molecule and angle 
of deviation of the membrane normal from the average 
value, and subsequently averaging the spectra over all 
possible orientations (Struts et al. 2007). The above 
method has been applied to aligned membranes 
containing rhodopsin (Brown et al. 2010) or bacterio- 
rhodopsin (Brown et al. 2007), as well as to aligned 
DNA fibers (Nevzorov et al. 1999). For rhodopsin, 
analysis of the angular-dependent, solid-state 
NMR spectra gives the following orientations of the 
retinal C5-, C9-, and C13-methyl groups with respect 
to the protein long axis: 70 ± 3, 52 ± 3, and 68 ± 2° in 
the dark state, and 72 ± 4, 53 ± 3, and 59 ± 3° in the 
Meta I state (Fig. 1) (Struts et al. 2007). 

Solid-state NMR spectroscopy is complemen¬ 
tary to rotational resonance distance measure¬ 
ments (Verdegem et al. 1999) in that orientational 
restraints are provided, together with distance restraints 
that allow site-specific structural information to be 
obtained analogous to solution NMR. Magic angle spin¬ 
ning (MAS) rotational resonance NMR utilizes dipolar 
recouplmg or magnetization transfer between nuclear 
spins that arises when the rotation speed of the sample 
matches the difference for the isotropic shifts of two 
chemically nonequivalent nuclei. Both the NMR line 
splitting and magnetization exchange depend on the 
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Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy, Fig. 1 Site-specific NMR spectroscopy inves¬ 
tigates the structure and functional dynamics of retinal within the 
binding pocket of rhodopsin in a native-like membrane environ¬ 
ment. (a) Chemical structure of 11-cA retinal chromophore indicat¬ 
ing numbering scheme for /?-ionone ring and polyene chain, 
(b) Theoretical analysis of geometry and ps-ns mobility of 
retinylidene methyl groups within the binding cavity of rhodopsin. 
Rotational dynamics of methyl groups are treated as axial threefold 


hops or continuous rotational diffusion. Euler angles fl/./ indicate 
orientation of the C-^H bond (where P denotes principal axis of 
EFG) within the internal frame of the methyl threefold axis (/). 
Angles G/mCO describe time dependence of fluctuations of the C-^H 
bond relative to the average methyl rotor axis system (M). Angles 
O.MD are for average methyl axis to membrane normal no {D is the 
membrane system), or alternatively CIml for the methyl axis to 
laboratory frame (L). Finally the angles Clot are for the membrane 
to laboratory frame as defined by the main magnetic field Bo 
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Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy, Fig. 2 Solid-state NMR spectroscopy char¬ 
acterizes the structure of the retinal ligand bound to rhodopsin. 
Examples of solid-state NMR spectra for dark (a) and Meta I 
(b) states of rhodopsin regenerated with 1 l-cw-retinal specifically 
^H-labeled at C5-, C9-, and C13-methyl positions in aligned 


POPC membranes (1:50 molar ratio). The spectra are measured 
for the orientation of the membrane normal parallel to the mag¬ 
netic field at temperatures of — 150°C and —100 °C for rhodopsin 
and Meta I, respectively, (c) The solid-state NMR structure pro¬ 
posed for retinal in the dark state of rhodopsin. (d) The solid-state 
NMR structure proposed for retinal in the Meta I state 


R 


intemuclear distance. One can then determine the dis¬ 
tance either by calibration with similar reference com¬ 
pounds, or by simulation of NMR spectra or exchange 
plots with known NMR parameters (chemical shielding 
tensor, scalar coupling constants, zero-quantum spin- 
spin relaxation time). 

Using retinals doubly labeled at the 

C8-C16/C17, C8-C18, C10-C20, and C11-C20 


positions, the spatial structure of the C10-C11=C12- 
C13-C20 motif and rotation of the /1-ionone ring with 
respect to the polyene chain have been examined in the 
dark and Meta 1 states. Distances between carbons 
C10-C20 and Cl 1-C20 calculated from rotational 
resonance NMR were 0.304 ± 0.015 and 0.293 ± 
0.015 nm, correspondingly, in the dark state, and 
>0.435 and 0.283±0.015 nm in the Meta I state 
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(Verdegem et al. 1999). Distances for carbons C8- 
C 16/C 17 and C8-C18 were reported to be 0.405 ± 
0.025 and 0.295 ± 0.015 nm in the dark state, and 
effectively unchanged in the Meta I state (Spooner 
et al. 2003). The twist for the C12-C13 bond was 
estimated to be 44 ± 10° in the dark state and a relaxed 
all-rran^ retinal structure was observed in the Meta 
I state (Verdegem et al. 1999). It has been established 
that the C5=C6-C7=C8 torsion angle characterizing 
the fi-ionone ring orientation was —28 ± 7° in the dark 
state, and a similar orientation was revealed in the 
Meta I state (Spooner et al. 2003). 

Investigation of the 3D retinal structure was carried 
out within the framework of a three-plane model 
(Struts et al. 2007) (Fig. 2c). Within each of the three 
planes (A, B, and C) deviation of the torsion angles 
from the ideal 11-cw or a\l-trans configuration was 
neglected (the polyene chain was assumed to be flat 
within the planes). Only torsion angles between the 
different planes were taken into account (Fig. 2c). 
Comparison of the NMR structure with crystallo¬ 
graphic data indicates that such a model corresponds 
well to the X-ray structure of retinal within the binding 
pocket of rhodopsin in the dark state (Struts 
et al. 2007). Notably the methyl group orientations 
are obtained directly from the NMR line shape; they 
are not related to the above model (“model-free”), and 
hence can be compared directly with the crystallo¬ 
graphic results. Such a comparison indicates a very 
good agreement between the NMR (Struts 
et al. 2007) and the X-ray data (Okada et al. 2004). 

Referring to Fig. 2 we see that the three planes used 
for the retinal modeling (Struts et al. 2007) encompass 
the /1-ionone ring (plane A) and the polyene chain on 
both sides of the C12-C13 bond (planes B and C). The 
methyl group orientations calculated from the orienta¬ 
tion-dependent NMR line shapes provide a set of 
angular restraints that allow one to determine the 
dihedral angles between the planes, together with the 
orientation of the retinal ligand with respect to rhodop¬ 
sin. The torsion angle Xi.k between two planes with 
common bond C—C* is given by (Struts et al. 2007): 


Xi,k = cos^ 


^cos 6 i cos — cos 


sin 9j sin Q 


i,k) 


_[ / cos Ok cos Og'’^^ — cos O^g^'' 


( 1 ) 


sin Ok sin Og'^^ 


where O^g and O^g^ are the orientations of the individual 
methyl group axes with respect to the membrane nor¬ 
mal, and Oi and Ok are angles of the two methyl axes 
relative to the C—Ck bond. The C—C^, bond angle to the 
membrane normal is 0 g’'‘\ and is calculated from the 
C9-methyl orientation, together with the transition 
dipole (Struts et al. 2007). 

Orientations of the methyl groups obtained from 
NMR and the orientation of the electronic transition 
dipole moment of the retinal are used as angular restraints 
(Struts et al. 2007). The distances between carbons CIO- 
C20 and carbons C11-C20 calculated from NMR 
(Verdegem et al. 1999) are used as additional geometri¬ 
cal constraints in the extended model. Using this 
approach, various solutions were obtained for the torsion 
angles between the planes A, B, and C, giving a total of 
64 combinations for the relative orientations of the 
planes, and 128 configurations and orientations for the 
overall orientation of retinal (Struts et al. 2007) within 
the ligand-binding cavity. However, most of these solu¬ 
tions can be discarded, as they do not satisfy the 
NMR distance constraints between the atoms C8-C18, 
C8-C16/C17 (Spooner et al. 2003), C10-C20, and 
Cl 1-C20 (Verdegem et al. 1999), and also the signs of 
the torsion angles obtained from circular dichroism (CD) 
(Fujimoto et al. 2002). The resulting NMR retinal 
structure in the dark state is shown in Fig. 2c (Struts et al. 
2007). One of its pecuharities is the twist around the 
C11=C12 bond. Note that attempting to calculate the 
retinal structure without the twist failed, since the retinal 
molecule does not fit into the binding cavity of the crystal 
structure. This proposal can explain the ultrafast isomer¬ 
ization of retinal within the binding pocket. Overall, 
the NMR structure of retinal and its orientation in 
the binding pocket are in very good agreement with the 
crystallographic data (Okada et al. 2004). 


NMR Structure of Retinal in the Preactivated 
Meta I State Shows Relaxation of Torsional 
Twisting 

In metarhodopsin I, the possible solutions for the torsion 
angles are: ±32 ± 7 and ±57 ± 7° for the angle between 
planes A and B (the /1-ionone ring and the polyene 
chain), and ±173 ± 4° for the angle between planes 
B and C (Struts et al. 2007). Thus eight combinations 
of the angles and the corresponding retinal structures are 
conceivable. Fitting of the retinal into the rhodopsin 
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Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy, Fig. 3 Solid-state NMR evaluates distance 
restraints that change upon light activation of rhodopsin. (a) Two 
views of the retinal binding pocket illustrating the distance 
range of the DARR NMR experiment (Patel et al. 2004). The 
spheres around the -labeled atoms on tyrosine and serine 
with a 5.5-A radius indicate the approximate detection limit of 
the DARR experiment. The *^C-labeled retinal methyl carbons 


are shown by black spheres, (b) Representative expanded 
regions of 2D DARR NMR spectra of rhodopsin {black) and 
Meta II {red) showing cross peaks between the retinal and 
protein *^C labels (Patel et al. 2004). The protein and retinal 
labels are: 4'-'^C tyrosine, 3-'^C serine, and 14,15-*^C retinal. 
The spectra were obtained at —80 °C. (Adapted with permis¬ 
sion from Patel et al. (2004). Copyright 2004 National Acad¬ 
emy of Sciences, USA) 


binding pocket in the dark state demonstrates that most 
of the eight possible structures have the /i-ionone ring 
position different from the dark state (Struts et al. 2007). 
On the other hand, according to electron crystallography 
data, the /1-ionone ring does not change its position in the 
Meta I state versus the dark state (Ruprecht et al. 2004). 
Only two of the possible solutions for the NMR 
retinal structure in the Meta 1 state have the /i-ionone 
ring in a position similar to the dark state: One is shown 
in Fig. 2d with the torsion angles between the planes 
A-B and B-C of —32 and 5-173°, respectively. The 


other has corresponding torsion angles of 32 ± 7 and 
— 173 ± 4°, which can be obtained from the first struc¬ 
ture as a mirror image reflected in a vertical plane 
(Struts et al. 2007). 

Electron crystallography does not reveal any 
noticeable displacements for the seven transmembrane 
helices of rhodopsin in the Meta I state compared to the 
dark state, and hence the shape of the retinal binding 
pocket should be essentially the same. In that case, the 
second of the above retinal structures seems unlikely 
due to multiple steric clashes within the binding 



















R 


2236 


Rhodopsin Activation Based on Soiid-State NMR Spectroscopy 


pocket. That leaves us with the only possible retinal 
structure in the Meta I state presented in Fig. 2d. This 
structure indicates only a single steric clash with 
the side chain of Trp265. As further discussed below, 
the main changes in the retinal conformation in the 
Meta I state include rotation of the polyene chain 
adjacent to the C13-methyl group toward the C9- 
methyl group, straightening and elongation of the 
polyene chain, and a resulting displacement of the 
fi-ionone ring toward helix H5. The displacement of 
the /?-ionone ring, which is also observed by X-ray 
crystallography and NMR (Nakamichi and 

Okada 2006; Ahuja et al. 2009a) in different rhodopsin 
photointermediates, is assumed to play a crucial role in 
the receptor activation. The rotation of the (proximal) 
part of the polyene chain containing the Cl 3-methyl 
group may contribute to destabilization of the ionic 
lock involving the protonated Schiff base, and thus 
facilitate its deprotonation. 


Conformational Changes in the Retinal 
Binding Pocket Are Revealed by Rotational 
Resonance ^^C NMR 

Another solid-state NMR technique that has proved to 
be quite useful for structural studies of membrane pro¬ 
teins is 2D dipolar-assisted rotational resonance 
(DARR) NMR spectroscopy (Ahuja et al. 

2009b). It allows one to determine close contacts 
between selectively labeled carbon atoms in the pro¬ 
tein. If the distance between the labeled atoms is 5 A or 
less (Fig. 3a) one can observe a cross peak in 2D 
DARR NMR spectra between ^^C-labels (Fig. 3b); 
for larger distances a cross peak does not appear in 
the spectrum. A number of experiments have mapped 
distance changes between the labeled carbons in the 
retinal and amino acids in the binding pocket in 
metarhodopsin II with respect to the dark state. 
Labeled amino acids included tyrosine, serine, cyste¬ 
ine, glycine, threonine, and others. The active Meta II 
state was stabilized by solubilizing rhodopsin in deter¬ 
gent micelles (1% «-dodecyl maltoside solution). The 
DARR NMR spectra revealed displacement of the 
retinal fi-ionone ring in the rhodopsin binding pocket 
toward helix H5 (Ahuja et al. 2009a) and the E2 loop 
toward the extracellular side of the membrane (Ahuja 
et al. 2009b), in agreement with NMR data (Brown 
et al. 2010 ). 


Retinal NMR Relaxation Times Show 
Site-Specific Differences in Mobility in the 
Rhodopsin Dark State 


Utilization of solid-state NMR relaxation is useful for 
membrane protein studies, because it provides infor¬ 
mation on local dynamics and intramolecular interac¬ 
tions of biomolecules that is generally unavailable 
using other methods (Struts et al. 2011a, b). Dynamics 
of the methyl groups of the retinal in the dark, Meta I, 
and Meta II states of rhodopsin have been studied by 
solid-state NMR relaxation. Experimental temper¬ 
ature dependences of the deuterium relaxation times of 
Zeeman (Tiz) and quadrupolar {T ig) order are summa¬ 
rized in Eig. 4. Relaxation times Tiz and Tiq are 
characterized by the spectral densities of thermal fluc¬ 
tuations of the quadrupolar coupling tensor near the 
resonance frequency: 

l/riz = ^7t2;f^[/i(mo)+4/2(2mo)] (2) 

i/rie = ^7t'ze/iK) (3) 


Here Xq is the quadrupolar coupling constant, 
denotes the spectral density {m = 1 , 2 ), and (Oq is the 
nuclear resonance (Larmor) frequency. In terms of 
generalized model-free analysis (Brown 1982), the 
spectral densities J^(ma}Q) are given by; 



where m = mcoQ and m = 1,2 (Brown 1982). The 
quantities in square brackets are the mean-square ampli¬ 
tudes of motion, and the corresponding reduced spectral 
densities are designated as (cu). In (4) D^'(Q,,) denotes 
the Wigner rotation matrix of rank j (Brown 1996), where 
= {aij, flij,yjj) are the Euler angles describing the 
relative spatial orientations of coordinate systems 
i,j = Referring to Eig. lb, the label P means 

the principal axis system of the EEG tensor for an 
nucleus (principal z-axis is parallel to C-^H bond); / is for 
an intermediate coordinate frame for methyl rotation (z- 
axis is C 3 symmetry axis of C-C^H 3 bond); M represents 
the coordinate frame for the average methyl orientation 
(z-axis is average C-C^H 3 bond direction); and L signifies 
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Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy, Fig. 4 Solid-state NMR relaxation times 
reveal site-specific differences in the mobility of retinal within 
the binding cavity of rhodopsin. Spin-lattice (Tjz) and 
quadrupolar-order {Tiq) relaxation times are plotted against 
inverse temperature for rhodopsin in POPC membranes (1:50 
molar ratio) in the dark state with retinal ^H-labeled at (a, b) the 


C5-methyl, (c) the C9-methyl, and (d) the Cl 3-methyl groups. 
The experimentally determined Tjz and Tiq times were fit 
simultaneously using analytical models for molecular dynamics 
(Brown 1982) (see text): axial threefold jump model (rate con¬ 
stant k, solid lines)', continuous rotational diffusion model (coef¬ 
ficients D|| and with solid lines for = 0 and dashed lines 
forD^=D„) 


R 


the laboratory frame (z-axis is external magnetic field). 
For unoriented (powder-type) samples, averaging 
over all the methyl orientations is performed 
(Nevzorov et al. 1998). Assuming exponential relaxation, 
the reduced spectral densities are given by; 

jl^\co) = + oP'Tiy^) where try are the (rota¬ 

tional) correlation times for the Wigner rotation matrix 
elements. 

Further analysis requires implementation of 
a motional model to describe the molecular fluctuations 
(Brown 1982). In the case of A-fold hops about a single 
axis, the correlation times read (Brown 1982; Torchia and 


Szabo 1982): 1 /t,.^ ^ 1/t,. = 4 ^sm^(7i:r/A) (axial 
averaging is assumed) and yield l/r,. = 3k for a methyl 
group. For continuous diffusion in a potential of mean 
torque (Trouard et al. 1994), the rotational correlation 
times are written as (Brown 1982): 
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Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy, Fig. 5 Deuterium NMR relaxation illuminates 
changes in dynamics of the retinylidene methyl groups upon 
light activation of rhodopsin. Spin-lattice relaxation times (Tiz) 
are plotted against inverse temperature for rhodopsin in POPC 
membranes (1:50 molar ratio) with retinal ^H-labeled at the C5-, 
C9-, or Cl 3-methyl groups, respectively in (a, c) the Meta I, and 
(b, d) the Meta 11 states. The relaxation data are fit both with an 


axial threefold jump model (rate constant k) as well as a diffusion 
model (axial coefficient Du with D^ = 0). For the C5-methyl 
group in Meta I either different rotational diffusion constants 
were assumed (Du ^ Dj^, not shown) or different conformers 
(Struts et al. 2011b). Following 11-ctT to tmns isomerization, 
a reduction occurs from three distinct methyl environments in 
the dark state to Ri two environments in Meta I and Meta II, 
suggesting a more planar retinal conformation 


Here the diffusion coefficients D^^ and 7)^ quantify 
axial and off-axial rotations of a methyl group. The 
moments and the mean-squared moduli 

(I£)®(Q/m)I^) depend on the second- and fourth-rank 
order parameters (Trouard et al. 1994). In a strong- 
collision approximation, the orientation changes by 
any amount, and the correlation times correspond to 
those of a rigid rotor (Brown 1982): 
1 hrq 1 hr = (>D^ + (D|| — D. For either 
a threefold jump or continuous diffusion model, the 
rotational correlation times are inversely related to 
A exp(—EJRT), where A is the preexponential factor 
(ko or Dq), and is the activation energy (barrier) 
for methyl rotation (Struts et al. 2011a). Both models 
have been used to ht experimental relaxation data 


(Struts et al. 2011a). In the dark, Meta I, and Meta II 
states the temperature dependences of the T iz relaxa¬ 
tion times can be fit using either the jump or continuous 
diffusion models. Measurement of both Zeeman and 
quadrupolar-order relaxation allows one to obtain 
information on the anisotropy of the local molecular 
motion (Struts et al. 2011a). 

Rotational correlation times are found in the range of 
2-12 ps at 30 °C and 3^5 ps at —60 °C, depending on 
the position of the methyl group. The T^z and Tiq 
temperature dependences correspond to the activation 
energy for methyl rotation and are related to the height 
of the rotational potential barriers. The unexpectedly 
low activation energy for the C9-methyl group 
can be explained by compensation of intra-retinal 1-6 
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interactions with hydrogens H7 and HI 1 (Fig. 2c), since 
the corresponding rotational potentials are offset with 
respect to one another by about 180°. Higher activation 
energies for the C5- and C 13-methyl groups are the 
result of intra-retinal 1-7 interactions with the H8 and 
HIO hydrogens, respectively. Lastly, the high activation 
energy for the C5-methyl group indicates a 6 -s-cis con¬ 
formation of the /?-ionone ring. Contributions due to 
interactions with the side chains in the binding pocket 
of rhodopsin in the dark state are secondary due to 
large distances from the methyl groups (>4 A) 
(Fig. 5). 

Light-Induced Changes in Retinal Dynamics 
Occur in the Meta I and Meta II States 

Noticeable changes in the dynamics of the retinal methyl 
groups are observed in the Meta I and the Meta II states 
after photoisomerization (Stmts et al. 201 la). The Meta 
I and Meta II states were quantitatively cryotrapped in 
POPC and POPCiDOPE (3:1 ratio) bilayers, respectively 
(Stmts et al. 2011a). The largest photoinduced changes 
take place for the C9-methyl group, whose activation 
energy increases sequentially from the dark to the 
Meta I state, and then to Meta U. At the same time, the 
Ea value for the Cl 3-methyl group decreases progres¬ 
sively in the Meta I and the Meta U states. Consequently, 
after isomerization to yield trans retinal the activation 
energies and spin-lattice relaxation times become com¬ 
parable for both the Cl3- and C9-methyl groups, which 
appears logical because both groups are now on the same 
side of the molecule, and have similar rotational poten¬ 
tials (Fig. 2d). The decrease of the activation energy for 
the Cl 3-methyl group can be explained analogously to 
the low Eg value for the C9-methyl group in the dark 
state, viz. by the compensation of intra-retinal 1-6 inter¬ 
actions with hydrogens Hll and H15, whose rotational 
potentials are in antiphase (180° phase shift; see above). 
The larger activation energy for the C9-methyl group and 
decrease in its mobility after retinal isomerization can be 
attributed to the changes in the retinal conformation, and 
possible interactions with the surrounding side chains 
(e.g., Tyr268 or Thrll8) in the binding pocket. Note 
that the observed E^ values do not indicate any steric 
clashes of the C9-methyl and the C13-methyl groups in 
the Meta I and Meta U states. For the C5-methyl group 
some interesting and unusual changes are observed in the 
Tiz temperature dependence in the Meta I state 


(Stmts et al. 2011a). Overall, the activation energy 
for C5-methyl rotation does not change much in the 
transition from the dark to active Meta II state. 
Consequently, one can conclude that the conforma¬ 
tion and the environment of the /?-ionone ring remain 
similar after rhodopsin activation (Struts et al. 2011a). 

Rhodopsin Activation Is Illuminated by 
Combining Crystallography with Magnetic 
Resonance Methods 

Rhodopsin activation is the result of retinal conforma¬ 
tional changes that disrupt a system of hydrogen¬ 
bonding networks stabilizing the inactive state, first 
unlocking the receptor around the retinal, and then in 
other more distal domains. The receptor is transformed 
into its intrinsic “natural” active state, which has 
been proposed to resemble the ligand-free opsin struc¬ 
ture, with the formation of new hydrogen-bonding 
networks. Transformation of the photonic energy 
into thermal motions that include helical fluctuations 
facilitates transitions over the barriers separating 
the inactive, preactivated, and active states. However, 
a detailed picture of the activation mechanism is more 
complicated, and not completely understood at present 
(Patel et al. 2004; Vogel et al. 2005; Nakamichi and 
Okada 2006; Stmts et al. 2007, 201 la; Altenbach et al. 
2008; Mahalingam et al. 2008; Ahuja et al. 2009b; 
Choe et al. 2011; Standfuss et al. 2011). Figure 6 
illustrates changes in the retinal binding pocket in the 
activation process. In Fig. 6a an electron density map 
of the constitutively active rhodopsin mutant El 13Q in 
complex with a peptide derived from the carboxyl 
terminus of the a-subunit of the G protein transducin 
(the GaCT peptide) is presented (Standfuss et al. 
2011). Figure 6b shows the 3D stmcture of E113Q 
(PDB code: 2X72) superimposed with ground-state rho¬ 
dopsin (PDB code: IGZM). However a different retinal 
conformation was observed for opsin regenerated with 
all-rraw-retinal to form the putative active Meta II state 
(Choe et al. 201 1). Superposition of this retinal stmcture 
(PDB code: 3PQR) with the ligand conformation in 
rhodopsin (PDB code: 1U19) (Okada et al. 2004) is 
presented in Fig. 6c. Last of all Fig. 6d shows the ^H 
NMR stmcture of retinal in the Meta I state overlapped 
with the binding pocket in the dark state (Stmts et al. 
2007, 2011b). A relaxed all-trans conformation of 
the ligand and displacement of its fi-ionone ring are 
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Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy, Fig. 6 Local changes in retinal initiate large- 
scale protein rearrangements leading to receptor activation, (a) 
Electron density map and (b) PDB structure (accession code 
2X72, blue) of constitutively active rhodopsin mutant E113Q 
with the GaCT peptide (Standfuss et al. 2011) compared to 
rhodopsin structure in dark state (PDB code IGZM, green). 
Note that all-fra/ti-retinal (yellow) is not covalently bound to 
Lys296. (Adapted with permission from Standfuss et al. 2011.) 
(c) Retinal conformation in the Meta II state (PDB code 3PQR, 
blue, protein is shown in orange) compared with the dark state 


clearly observed in the Meta I (Fig. 6d) and Meta II 
(Fig. 6b, c) states. 

Data for the NMR retinal structure and dynamics 
in different states can be combined with the results of 
other structural and biophysical studies (Fujimoto et al. 
2002; Patel et al. 2004; Vogel et al. 2005; Nakamichi 
and Okada 2006; Altenbach et al. 2008; Mahalingam 
et al. 2008; Ahuja et al. 2009a; Choe et al. 2011; 
Standfuss et al. 2011) to yield the following proposed 
mechanism of rhodopsin activation (Struts et al. 2011b). 
Photoisomerization of retinal leads to the rotation of the 


(PDB code 1U19, red, protein is green) (Okada et al. 2004; Choe 
et al. 2011). (d) Data from NMR spectroscopy (Struts et al. 
2011a) and other biophysical measurements suggest the C 13- 
methyl and the C=NH'^ moieties of the protonated Schiff base 
change their orientation upon ligand isomerization. The /!- 
ionone ring with the C5-methyl group remains in its hydropho¬ 
bic pocket, but its displacement due to elongation of retinal 
triggers allosteric movements of helices H5 and H6 that lead to 
rhodopsin activation (see text). (Figure produced with PyMOL 
(http ://py mol. sourceforge .net/)) 


Cl 3-methyl group and the adjacent part of the retinal 
polyene chain toward the /?4-sheet on the extracellular 
loop E2 (Nakamichi and Okada 2006; Struts et al. 2007). 
In addition, the polyene chain of the retinal straightens 
and elongates (Struts et al. 2007; Choe et al. 2011). 
Straightening of the polyene chain leads to a steric 
clash with Trp265, and its displacement toward the cyto¬ 
plasmic end of the protein and toward helix H5. On the 
other hand, retinal elongation leads to a translation of the 
/i-ionone ring toward helix H5. Rotation of the retinal 
polyene chain with the Cl 3-methyl group at the Schiff 
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base end of the ligand destabilizes the hydrogen-bonding 
networks connecting the E2 loop, helices H1-H3, H6, 
H7, and retinal in the inactive state (this includes the 
ionic lock between the protonated Schiff base and its 
counterion Glull3) (Okada et al. 2004; Vogel et al. 
2004). Additionally, the ionic lock of the protonated 
Schiff base with its (complex) counterion may be 
destabilized by the ll-cis to trans retinal isomerization, 
which changes the pAl^ of the protonated Schiff base 
(Mahalingam et al. 2008). As a result, the hydrogen¬ 
bonding network rearranges (Choe et al. 2011) and the 
Schiff base deprotonates, breaking the ionic lock and 
allowing rearrangement of the helical bundle. Displace¬ 
ment of the /1-ionone ring (whose C5-methyl group is in 
the vicinity of Glul22) toward helix H5 is apparently 
coupled to rearrangement of another hydrogen-bonding 
network between Glul22 on H3 and His211 on H5 
(Vogel et al. 2005; Struts et al. 2007; Choe et al. 201 1), 
which initially stabilizes the inactive conformation. It is 
also coupled to rotation of the cytoplasmic end of the 
helix H5 closer to H6 (Ahuja et al. 2009a). This motion 
destabilizes the ionic lock between the helices H3 and 
H6 involving Glul34, Argl35, and Glu247. Conse¬ 
quently, the activating rotation of the helix H6 occurs 
accompanied by transient opening of the transducin 
binding site (Scheerer et al. 2008; Struts et al. 
2011a, b). The displacement of Trp265 on helix H6 due 
to a steric clash with the retinal polyene chain after 
photoisomerization may facilitate H6 helical rotation in 
the process of activation (Struts et al. 2007). 

Molecular Aspects of Large-Scale Functional 
Protein Motions in Rhodopsin Activation 

In the sequence of the events in rhodopsin activation, 
the early photointermediates (photo-, batho-, 
lumirhodopsin) are followed by the major reactions: 
Meta I ^ Meta II„ ^ Meta 11* -i- HaO"" ^ Meta IlftH-", 
where Meta II* and Meta II/jH"^ are active conforma¬ 
tional substates. All of the substates (from Meta I to 
Meta II*H'^) are in thermodynamic equilibrium after 
the last state in the activation is reached. The above 
picture of rhodopsin activation naturally raises the 
question: Are these the only substates of rhodopsin in 
the activation process? An important aspect is how the 
local isomerization of retinal (Struts et al. 2007) initi¬ 
ates the large-scale helix fluctuations in the Meta 
I-Meta II equilibrium. Induced fit of retinal to the 


binding pocket gives a highly twisted conformation 
of the ligand in the dark state (Struts et al. 2007). Our 
MD simulations (Huber et al. 2004) clearly indicate 
that in the dark state not only the cytoplasmic loops but 
also the extracellular loops of rhodopsin undergo 
/ts-ms timescale motions. On the other hand, the TM 
helical core is more stable (Huber et al. 2004). Large- 
scale functional protein motions, such as the 
Meta I-Meta II fluctuations of rhodopsin, have large 
activation barriers; they occur infrequently due to their 
collective nature because they involve displacements 
of many atoms within the protein on the ms timescale. 
Rhodopsin has been proposed to be partially unfolded 
in the Meta II state (Brown 1994), so an ensemble of 
conformers is expected. 

According to the above activation mechanism, 
allosteric interactions trigger concerted movements of 
the TM helices, together with changes in the cytoplasmic 
loops that expose the Gt recognition sites. Extension of 
helix H5 (Park et al. 2008) and a rigid-body rotation of 
H6 (Altenbach et al. 2008) lead to an overall elongation 
of the protein (Botelho et al. 2006), accompanied by its 
expansion within the lipid bilayer (Attwood and 
Gutfreund 1980). Stemming from a highly locked recep¬ 
tor core, activation of rhodopsin generates an opening of 
the protein, whereby the collective, large-scale protein 
fluctuations are paralleled by a gain in partial molar 
volume (Attwood and Gutfreund 1980). A concomitant 
loosening of the receptor core enables increased water 
penetration to the ligand-binding cavity. These conclu¬ 
sions differ from the conventional view of a single light- 
activated receptor (R*) conformation. The notion of 
a dynamically activated receptor explains how the 
large-scale protein fluctuations in rhodopsin activation 
are initiated by the local retinal dynamics. 

The presence of multiple active substates in the 
Meta I-Meta II equilibrium may explain the different 
orientations of the retinal in the Meta II state reported 
by different groups (Patel et al. 2004; Ahuja et al. 
2009a; Choe et al. 2011; Standfuss et al. 2011). In the 
Meta II X-ray structure obtained by soaking opsin 
crystals with all-fraMi'-retinal, the retinal is rotated by 
«180° about its long axis compared to rhodopsin in the 
dark state (Pig. 6c) (Choe et al. 2011). By contrast, the 
distance constraints obtained from dipolar-assisted 
rotational resonance NMR spectroscopy for Meta II 
(Patel et al. 2004; Ahuja et al. 2009a) indicate an 
opposite orientation of the retinal ligand, which is 
supported by the X-ray crystal structure of the active 
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El 13Q rhodopsin mutant (Standfuss et al. 201 1). Sub¬ 
stantial further work is needed to firmly establish the 
nature of the activated state(s) of rhodopsin; however, 
it is clear that solid-state NMR probes different tiers of 
the protein energy landscape (Frauenfelder et al. 
2009). Solid-state NMR has an important role to play 
in combination with X-ray diffraction, because it is 
exquisitely sensitive to dynamics (Struts et al. 2007). 
The implementation of innovative new crystallogra¬ 
phy and magnetic resonance approaches can be 
expected to contribute to a more complete understand¬ 
ing of the structure and functioning of rhodopsin and 
other membrane proteins. These subjects are certain to 
occupy membrane biophysicists for years to come. 

Summary 

Rhodopsin is a representative member of the G protein- 
coupled receptors, which are responsible for signal trans¬ 
duction across cellular membranes and are of great 
pharmaceutical interest. However, many details of the 
activation mechanism remain unknown or contradictory, 
notably the positioning and orientation of the bound 
retinal in the active Meta II state. Solid-state NMR spec¬ 
troscopy allows investigation of the local structure and 
molecular motion of the retinal ligand in different rho¬ 
dopsin intermediates, and illuminates large-scale func¬ 
tional protein dynamics. To characterize light-induced 
changes in structure and mobility of rhodopsin, site- 
specific and labels are introduced into the retinal 
ligand. Solid-state NMR spectra and NMR relaxation 
times (spin-lattice, Tiz, and quadmpolar-order. Tig) in 
the dark, Meta I, and Meta II states of rhodopsin are 
measured by cryotrapping the various states in membrane 
lipid bilayers. The solid-state NMR data together with 
Fourier transform infrared and spin-labeling results sug¬ 
gest that local changes in the retinal structure trigger 
activating helical fluctuations of the receptor. The Meta 
I-Meta II equilibrium of rhodopsin entails a complex 
energy landscape with multiple substates rather than 
a simple activation switch. 

Cross-References 

► Membrane Proteins in Phospholipid Bilayers: 

Structure Determination by Solid-State NMR 

► Rhodopsin 


► Rhodopsin: Stability and Structural Organization in 
Membranes 

► Solid-State NMR 
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Rhodopsins - Intramembrane Signaling 
by Hydrogen Bonding 

Hideki Kandori 

Department of Frontier Materials, Nagoya Institute of 
Technology, Showa-ku, Nagoya, Japan 

Synonyms 

FTIR spectroscopy; Hydrogen bonding; Light-signal 
transduction 

Definition 

Rhodopsins are members of the seven-transmembrane 
receptor family, in which a retinal molecule is embed¬ 
ded as the chromophore (Sharma et al. 2006; Hofmann 
et al. 2009). Rhodopsins convert light into energy or 
a signal. Such a function is initiated by photoisome¬ 
rization of the retinal chromophore, followed by struc¬ 
tural changes of protein (Kandori 201 1). There are two 
types of rhodopsins, microbial (type 1) and visual 
(type 2) rhodopsins. Microbial rhodopsins show wide 
variety of functions, such as light-driven outward 
proton and inward chloride pumps (light-energy con¬ 
version), light-gated cation channels, and photosensors 
that activate either transmembrane or soluble trans¬ 
ducers (light-signal transduction) (Fig. 1). Visual rho¬ 
dopsin is a member of the G protein-coupled receptor 
family that has evolved into a photoreceptive protein in 
visual cells of vertebrates and invertebrates. 

There is no sequential homology between microbial 
and visual rhodopsins, which possess an aW-trans and 
11-cw retinal, respectively. Nevertheless, similar acti¬ 
vation mechanism has been implicated. In microbial 
and visual rhodopsins, a retinal molecule is commonly 
attached to a lysine residue in the seventh helix via 
protonated Schiff base linkage (Fig. 2). The protonated 
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Schiff bases have a positive charge, which is stabilized 
by its counterion, deprotonated Asp/Glu or chloride 
ion. Deprotonation of the Schiff base during photore¬ 
action is coupled to the function in many rhodopsins. 
Based on these similarities, it is believed that there is 
a common mechanism for the function of microbial 
and visual rhodopsins. Figure 2 indeed shows the pres¬ 
ence of water molecules in the Schiff base region of 
light-driven proton pump (BR), chloride pump (HR), 
negative photo taxis sensor (SRII), and visual sensor 
(bovine rhodopsin). It is thus believed that water- 
containing hydrogen-bonding network plays important 
roles in the functional process of rhodopsins, where 
protein-bound water molecules stabilize the ion-pair 
structure between the Schiff base and its counterion 
(Kandori 2000). In addition, local hydrogen-bonding 
alteration that is initiated by retinal photoisomerization 
results in global protein structural changes for each 
function, whose understanding is the main focus in 
the biophysics of rhodopsins. 

FTIR spectroscopy to monitor hydrogen-bonding 
alterations in rhodopsins. Light-induced difference 
Fourier-transform infrared (FTIR) spectroscopy is 
a powerful tool to study protein structural changes in 
rhodopsins (Kandori 2000). In particular, the method is 
good at monitoring hydrogen-bonding alterations, 
where spectral changes are normally detected at 
<1,800 cm“'. Although X-H (0-H or N-H) stretches 
at >1,800 cm“* are a direct probe of the hydrogen 
bond, strong absorption of water involved in the 
sample easily masks such information. It is however 
known that films of rhodopsins exhibit a normal 
photocycle under suitable hydration conditions. In 
addition, protein conformation changes are conve¬ 
niently obtained as difference IR spectra since rhodop¬ 
sins are a photoreceptive protein. These facts allowed 


differences in IR spectra in the frequency region not 
only <1,800 cm~* but also in the water-stretching 
region (4,000-1,800 cm”*) to be obtained. 

By use of low-temperature FTIR spectroscopy, pro¬ 
tein-bound water molecules have been studied in micro¬ 
bial rhodopsins such as light-driven proton pumps, 
chloride pumps, and phototaxis sensors (Furutani et al. 
2005a). The importance of hydrogen-bonding alteration 
in light-energy storage provided an unexpected finding, 
i.e., positive correlation between strong hydrogen bond 
of water and proton pump activity (Kandori 2010). 
Water 0-D stretching vibrations in D 2 O appear at 
2,700-2,100 cm” * dependent on their hydrogen-bonding 
strength, and the frequency of strongly hydrogen-bonded 
water appears at <2,400 cm”* (Kandori 2010). Compre¬ 
hensive FTIR analysis revealed that proton-pumping 
rhodopsins possess strongly hydrogen-bonded water mol- 
ecule(s). Therefore, strongly hydrogen-bonded water is 
the functional determinant of light-driven proton pump. It 
is likely that destabilization of water-containing hydro¬ 
gen-bonding network plays an important role for light- 
energy storage, from which global protein structural 
changes take place for unidirectional transport of protons. 

Light-signal transduction from receptor to trans¬ 
ducer in archaeal negative phototaxis. In the light- 
signal transduction of negative phototaxis of halophilic 
bacteria, hydrogen-bonding alteration of specific amino 
acid plays an important role (Fig. 3). SRII, an archaeal 
negative phototaxis receptor, forms a complex with 
Htrll, its transducer protein, in membranes and transmits 
light signals through the change of the protein-protein 
interaction. By use of difference FTIR spectroscopy at 
77 K, it was found that the O-H group of Thr204 in SRII 
to be D/H-unexchangeable, and the specific hydrogen¬ 
bonding alteration of Thr204 takes place upon retinal 
photoisomerization only in the presence of Htrll 
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Rhodopsins - Intramembrane Signaling by Hydrogen 
Bonding, Fig. 2 X-ray crystallographic structures of the Schiff 
base region in BR, HR, SRII, and bovine rhodopsin (PDB entries 
are 1C3W, IJGJ, 1E12, and 1L9H, respectively). Each mem¬ 
brane normal is approximately in the vertical direction of the 
figures. Upper and lower regions correspond to the cytoplasmic 


and extracellular sides, respectively. Green spheres represent 
water molecules. Hydrogen atoms and hydrogen bonds (dotted 
lines) are supposed from the structures, while the numbers are 
the hydrogen-bonding distances in A (This figure is modified 
from ref. Eurutani et al. (2005a)) 


R 


(Sudo et al. 2003). Difference FTIR spectra at 77 K also 
revealed the presence of a steric constraint between the 
C 14 -H group of retinal and Thr204, which is relayed into 
the interaction surface with Htrll. Although difference 
FTIR spectra at 250 K were surprisingly similar notwith¬ 
standing the presence of Htrll, hydrogen bond of Asn74 
in Htrll with Tyrl99 of SRII is strengthened, providing 
a light-signaling pathway from retinal of the receptor to 
Asn74 of the transducer through Thr204 and Tyrl99 
(Fig. 3) (Fumtani et al. 2005b). Transducer activation is 
likely to involve a relaxation of Thr204 in the receptor 
and hydrogen-bonding alteration of Asn74 in the 


transducer, during which the helices of the transducer 
perform rigid-body motion without changing their sec¬ 
ondary structures. 

Significant temperature dependence between 250 K 
and 293 K was observed for the amide-I vibrations of 
helices only for the SRII/Htrll complex, where the 
amplitude of amide-I vibrations was reduced at room 
temperature (Kamada et al. 2006). ^^C-labeling of SRII 
or Htrll revealed that such spectral changes of helices 
originate from SRII, not Htrll. The hydrogen-bonding 
alteration of Asn74 in Htrll was temperature indepen¬ 
dent. On the other hand, temperature-dependent 
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Rhodopsins - Intrametnbrane Signaling by Hydrogen 
Bonding, Fig. 3 (Top) A light-signaling pathway from the 
receptor to the transducer in archaeal negative phototaxis. The 
pathway is from Lys205 (retinal) of the receptor to Asn74 of the 
transducer through Thr204 and Tyrl99. (Bottom) Temperature 
dependence observed for the amide-I vibrations of helices only 


for the receptor/transducer complex, where the amplitude of 
amide-I vibrations was reduced at room temperature. Although 
the hydrogen-bonding alteration of Asn74 in transducer was 
temperature independent, temperature-dependent structural 
changes of helices were diminished for the complex of receptor 
with the G83 mutants of transducer 


stractural changes of helices were diminished for 
the complex of SRIl with the G83C and G83F mutants 
of Htrll. Gly83 connects the transmembrane helix 
and cytosolic linker region in a flexible kink near the 
membrane surface of Htrll, and its replacement by Cys 
or Phe abolishes the photosensory function. Thus, Gly83 
plays an important stractural role in the activation 
processes of the ppR/pHtrll complex. 

Summary 

In summary, retinal photoisomerization causes steric 
constraint between chromophore and protein in 


rhodopsins, resulting in global protein structural 
changes such as translational movement of helices. 
Since there is a water-containing hydrogen-bonding 
network in the active center of rhodopsins, the Schiff 
base region, its hydrogen-bonding alteration plays 
important roles in the structure-function relationship 
in rhodopsins. Presence of strongly hydrogen-bonded 
water is the functional determinant of light-driven 
proton pump, in which weakened hydrogen bond by 
light is used for light-energy storage. Coupling of 
steric constraint and hydrogen-bonding alteration 
drives protein structural changes in microbial and 
visual light sensors, leading to activation of transducer 
proteins. 











Rhodopsin - Stability and Characterization of Unfolded Structures 


2247 


Cross-References 

► Channelrhodopsin 

► Sensory Rhodopsin I 

► Sensory Rhodopsin II: Signal Development and 
Transduction 
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Rhodopsin - Stability and 
Characterization of Unfolded Structures 

Arpana Dutta and Judith Klein-Seetharaman 
Department of Structural Biology, University of 
Pittsburgh School of Medicine, Pittsburgh, PA, USA 

Synonyms 

Dim-light mammalian photoreceptor protein; Light- 
activated conformation of rhodopsin; Metarhodopsin- 
II; Protein fluorescent dye labeling; Rhodopsin; 
Transducin 


Definition 

Rhodopsin is a mammalian membrane protein of the 
G-protein-coupled receptor family, functioning to 
mediate response to light. 

Unfolded states refer to the ensemble of conforma¬ 
tions occupied at initial stages of folding in vivo, 
achieved in vitro by denaturation of a folded structure. 
Ideally, we would like to describe the states with the 
least amount of restriction in conformational space, to 
understand early events of folding. 

Introduction 

Rhodopsin is a membrane protein (MP) present in the 
disks of rod outer segments of the retina, a layer of 
cells in the back of the vertebrate eye, and enables dim- 
light vision. It is comprised of seven transmembrane 
(TM) helices, three extracellular (EC) loops facing the 
intradiscal space, and three cytoplasmic (CP) loops 
with the C-terminal tail facing the cytoplasm (Fig. 1). 
A chromophore, 11-cA retinal, a vitamin A derivative, 
is bound to lysine 296 in the TM domain via 
a protonated Schiff base linkage. The chromophore- 
protein interaction, indicative of a natively folded con¬ 
formation, is responsible for the 500-nm peak observed 
in absorption spectra of rhodopsin along with 
the 280-nm protein peak. The isomerization of 
ll-cii retinal to all-trans retinal upon illumination of 
rhodopsin leads to formation of the light-activated 
Metarhodopsin-II state which interacts with the 
G-protein, transducin, and triggers visual signaling. 

Motivation: Misfolding of Rhodopsin in 
Retinitis Pigmentosa 

More than 150 mutations in rhodopsin have been asso¬ 
ciated with the retinal degenerative disease, retinitis 
pigmentosa (RP), and several related diseases. Most of 
these mutations lead to misfolding and/or instability of 
rhodopsin which has been postulated to be the initiat¬ 
ing event ultimately leading to death of rod cells and 
eventually the entire retina causing initially night 
blindness followed by progressive loss of vision. 
There is no cure for this disease. The hope is that 
understanding the fundamental mechanisms of the 
folding of rhodopsin and how these mutations alter 
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Rhodopsin - Stability and Characterization of Unfolded 
Structures, Fig. 1 Secondary and 3D structure representation 
of rhodopsin. (a) Secondary structure representation of rhodop¬ 
sin. The disulfide bond between cysteines at positions 110 and 


187 is shown in red dotted line and palmitoylation sites at 
Cys322 and Cys323 are encircled in red. (b) 3D structure of 
rhodopsin (PDB ID: 1L9H) showing TM helices in red, loops in 
green, P-sheet in yellow, and 11-cis retinal in cyan 


the folding/stability of rhodopsin can help devise new 
strategies to combat RP. A critical aspect of character¬ 
izing folding mechanism of proteins in vitro is to 
characterize unfolded states because it has been 
shown that these states have native/nonnative inter¬ 
residue interactions (called residual structure) that 
influence the folding of a protein by acting as 
a folding nucleus (Dobson 2001). Therefore, an under¬ 
standing of the early stages of folding of rhodopsin 
can help determine whether the debilitating effect of 
the misfolding RP mutations are due to formation of 
different inter-residue interactions in the initial stages 
of folding than in wild type. 

Proposed Folding Models 

The held of MP folding is at a nascent stage compared 
to that of soluble protein folding. Initially, folding 
studies on bacteriorhodopsin, a bacterial MP, have 
led to the formulation of the “two-stage hypothesis” 
(Popot and Engelman 1990). According to this model 
(Fig. 2), TM helices are formed independently in the 
hrst stage and formation of tertiary contacts among 
them occurs in the second stage of folding to generate 


the three-dimensional structure of a helical TM pro¬ 
tein. Rhodopsin shares structural similarity but no 
sequence similarity with bacteriorhodopsin. However, 
its folding cannot be fully explained by the two-stage 
hypothesis. Cell-based, computational, and in vitro 
folding studies with rhodopsin have suggested that 
interactions between EC and TM domains are impor¬ 
tant in the early stages of folding. Based on these 
studies, a model of MP folding called the “long-range 
interaction model” (Fig. 2) was proposed which 
emphasizes the importance of interactions between 
residues that are far apart in sequence but come 
close in space during early stages of folding of a MP 
(Tastan et al. 2007). This model necessitates detailed 
characterization of early folding stages of rhodopsin 
to identify possible inter-residue interactions. To 
characterize such states, the folding studies that 
have been carried out so far are detailed in the fol¬ 
lowing sections. 

Cell-Based Folding Studies 

Early studies on the folding of rhodopsin involved 
a fragment-based experiment in which sets of 
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Folded bacteriorhodopsin 

Rhodopsin - Stability and Characterization of Unfolded 
Structures, Fig. 2 Folding models of membrane proteins. Left: 
Two-stage theory of folding depicting the formation of indepen¬ 
dent helices followed by formation of their tertiary contacts to 
form the native structure. Right. Long-range interaction theory 
of folding depicting the formation of tertiary interactions 
between different regions of a protein to form a nonnative 


fragments containing different number of TM 
helices of rhodopsin separated at different loops were 
cotransfected and their ability to fold into native struc¬ 
ture was determined (Ridge et al. 1995). Folding into 
native structure of rhodopsin was tested spectrophoto- 
metrically by measuring the ratio of absorbance at 500 
and 280 nm which should be 1.7-2.2 for a correctly 
folded structure. It was found that rhodopsin could he 
refolded in cells from fragments where at least one 
fragment contained a minimum of three TM helices. 
This indicates that individual helices cannot act as 
independent folding domains and specific inter-helical 
interactions are necessary during folding. It was fur¬ 
ther observed in these experiments that only fragments 
that were separated at the second and third CP loops 
could participate in reconstituting rhodopsin indicating 
that loop-loop and/or helix-loop interactions are 
indispensable during folding. Additional evidence 
supporting the importance of loops comes from muta¬ 
tional studies which show that mutations are not well 


Long-range interaction model 



Folding involves tertiary 
interactions within and between 
helices and loops 



Folded mammalian rhodopsin 

structure that triggers folding of the whole protein. As these 
folding models are largely based on folding studies of bR and 
rhodopsin, the protein structures are shown here as representa¬ 
tive MPs (Reprinted with permission from Tastan et al. 
Comparison of stability predictions and simulated unfolding 
of rhodopsin structures. 2007;83(2): 351-363. Copyright 
© 2009-2012 Wiley) 


tolerated in the EC loops and the large number of RP 
mutations in the EC region as compared to the CP 
region of rhodopsin (Tastan et al. 2007). 

In Silico Folding Studies 

Floppy Inclusion and Rigid Substructure Topography 
(FIRST) was used to predict inter-residue interactions 
in early stages of folding of rhodopsin (Rader et al. 
2004). The FIRST method involves breaking H-bonds 
in a protein in order of their increasing strength and 
identifying rigid and flexible regions in the protein at 
each step. The most rigid cluster that remains toward 
the end of denaturation is representative of a “folding 
core” which is a model for the structure that is formed 
in the early stages of folding of a protein to help initiate 
the process of folding. It was shown for rhodopsin that 
its TM helices do not unfold independently of each 
other and at the end of the unfolding process rigid 
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regions remain the largest of such regions consist of 
residues from EC loops and TM helices. The folding 
core, when mapped onto the crystal structure of rho¬ 
dopsin, lies at the interface of the EC and the TM 
domains and includes the disulfide bond, indicating 
the potential importance of these regions in folding of 
rhodopsin. Mutations of more than 90% of the residues 
in the folding core cause misfolding of rhodopsin, 
underlining the importance of the core (Rader et al. 
2004). These studies support the theory that interac¬ 
tions between EC and TM domains are necessary for 
forming a stable core that then triggers formation of the 
native structure of rhodopsin. 

Single Molecule Spectroscopy Folding 
Studies 

In vitro folding experiments that are carried out to 
characterize early folding states include disrupting 
the native structure to study the interactions that exist 
in the resulting unfolded state. This latter state is the 
closest possible analog, which can be obtained in vitro, 
to the early stages of protein folding. The most com¬ 
mon methods to obtain these states are by denaturing 
the protein with chemical denaturants (see below) or 
by force. Using single molecule force spectroscopy 
and dynamic force spectroscopy, rhodopsin in rod 
outer segments has been unfolded by pulling it from 
one end with a cantilever and measuring the resulting 
deflections of the cantilever (Sapra et al. 2006). Erom 
these deflections, a force-distance curve is obtained 
showing the force required to pull a particular protein 
segment which in turn reflects the rigidity or stability 
of the segment. Most of the stable structural segments 
observed on pulling rhodopsin span across TM helices 
and EC loops, indicating that these regions form stable 
interactions in largely unfolded states of rhodopsin. 
The rigid structural segments discovered by dynamic 
force spectroscopy, spanning TM and EC regions, 
were found to agree with that of the folding core 
residues identified by FIRST analysis. 

In Vitro Folding Studies Using Chemical 
Denaturants 

There have been very few chemical denaturation 
experiments with MPs, compared to soluble proteins. 


mainly due to the difficulty to perturb these systems in 
their stable hydrophobic environment with chemical 
denaturants (Booth and Curnow 2006). Another chal¬ 
lenging issue has been application of biophysical 
methods to study these unfolded states due to (a) their 
usually large size, (b) associated lipids/detergents 
which add to their size and background noise in 
biophysical measurements, and (c) interaction of dena¬ 
turants with lipids/detergents, leading to complicated 
interpretation of results. Stability of unfolded states for 
long periods of time, i.e., absence of aggregation of 
unfolded states, is another concern in such studies. 
These hindrances have triggered the need for a more 
systematic approach that addresses these issues while 
attempting to understand the nature of unfolded states. 

For characterization of largely unfolded states of 
rhodopsin, chemical denaturing conditions were opti¬ 
mized to obtain a condition under which the protein 
would be largely unfolded but without aggregation 
(Dutta et al. 2010b). CD spectroscopy was used to 
determine secondary structure changes in rhodopsin 
upon addition of different denaturants. Decrease in 
mean residue ellipticity at 222 nm, indicating decrease 
in amount of helical content in rhodopsin, was found to 
follow the order TEA > GuHCl > urea > SDS -i- 
urea > SDS. Next, the propensity of rhodopsin to 
aggregate under these denaturing conditions was tested 
by (a) measuring dependence of ellipticity at 222 nm 
on protein concentration, (b) detecting presence of 
oligomers on silver-stained SDS-PAGE, and (c) distri¬ 
bution of peaks on ID NMR spectra. Only SDS and 
SDS -I- urea denatured states did not aggregate even 
after a significant decrease in secondary structure, thus 
qualifying as suitable states for further biophysical 
characterization. 

Global tertiary structural changes in SDS-denatured 
states of rhodopsin were characterized using retinal- 
protein interaction as a probe, tryptophan fluorescence, 
and cysteine accessibility (Dutta et al. 2010a). These 
changes were found to correlate with changes in micel¬ 
lar structure of SDS with increase in SDS concentra¬ 
tion. SDS concentrations below its critical micelle 
concentration induced opening of the helical bundle 
of rhodopsin, leading to disruption of retinal-protein 
interaction. In presence of SDS spherical micelles, 
a compact intermediate shown by burial of cysteines 
and tryptophans and a decrease in overall hydrody¬ 
namic size of the denatured state was formed. In higher 
SDS concentrations where cylindrical micelles 
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a Stage 1 (SDS concentration range 0.05%-0.3%) 



X SDS 



Native state 



Initial opening (softening) 
of helix bundle 
(Trp fluorescence increase, 
cysteine accessibility, 
slow surface segment 
dynamics) 



Full opening of helix bundle 
(loss of retinal-protein 
contacts) 


b Stage 2 (SDS concentration range 0.3%-3%) 



^ SDS 



Native state 



Full helix bundle opening 
(loss of retinal-protein 
contacts, 

Trp fluorescence increase) 


Protein structural change and 
formation of compact state 
(partial Trp fluorescence 
decrease, burial of cysteines, 
micelle size) 


Rhodopsin - Stability and Characterization of Unfoided 
Structures, Fig. 3 Model of SDS -induced denatiiration of rho¬ 
dopsin. (a) Stage 1 represents addition of 0.05% SDS to rhodop¬ 
sin whereby an initial slow opening of helical bundle on 
a timescale of seconds occurs. This is followed by complete 
opening of the bundle on a timescale of minutes, (b) Stage 2 
refers to denaturing concentration of SDS beyond 0.05% up to 


3% whereby an opening of the helical bundle occurs in millisec¬ 
onds followed by structural changes in rhodopsin, leading to 
formation of a compact state seen on a second timescale 
(Reprinted with permission from Dutta et al. Characterization 
of membrane protein nonnative states. 2. The SDS-unfolded 
states of rhodopsin. 2010a;49(30): 6329-6340. Copyright 
© 2010 American Chemical Society) 




formed, burial of cysteines and tryptophans was seen 
along with large degree of unfolding and a significant 
increase in overall backbone flexibility indicated by 
ID 'H NMR. Therefore, these experiments indicated 
that a compact intermediate formed during SDS- 
induced unfolding of rhodopsin followed by formation 
of largely unfolded states with residual structure bury¬ 
ing specihc cysteines and tryptophans (Fig. 3). 


Relevance for Folding of Other 
Transmembrane Helical Proteins 

Folding studies on MPs so far have mostly focused on 
unfolding and refolding the protein to gather thermody¬ 
namic and kinetic folding parameters. Such refolding 
studies have been done for bacterial MPs bacteriorho- 
dopsin, KcsA, DAGK and DsbB, and EmrE and for the 
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LHCII complex from higher plants. In all these cases, 
except bacteriorhodopsin, refolding was accomplished 
only from a partially unfolded, near-native state (sum¬ 
marized in Dutta et al. 2010b). In contrast, opsin, which 
showed a decrease of 50% in its helical content in the 
presence of urea, failed to refold from such a largely 
unfolded state. Therefore, refolding of MPs is typically 
feasible only from a minimally denatured state which is 
structurally very similar to the native state but becomes 
difficult when large portions of the structure are 
unfolded, the latter resembling early stages of folding 
more closely. These difficulties have resulted in fewer 
folding studies of MPs. A different approach involving 
biophysical and structural characterization of unfolded 
states, shown by the in vitro folding studies with rhodop¬ 
sin, has opened up opportunities for systematic charac¬ 
terization of denatured states of MPs that potentially 
mimic early folding stages. 

Future Work 

The presence of residual structure in unfolded states 
and its importance in folding has been well character¬ 
ized for soluble proteins (Dobson 2001). It has been 
postulated that formation of native inter-residue inter¬ 
actions in initial stage of folding helps in reducing the 
number of conformations to be sampled to attain native 
state, thereby leading to a more efficient folding mech¬ 
anism. A similar contribution of inter-residue interac¬ 
tions in folding of large MPs can be expected but needs 
to be experimentally demonstrated. 

Mapping residues involved in forming residual 
structure In largely unfolded states of rhodopsin and 
other MPs upon chemical denaturation needs to be 
carried out. Identification of rigid regions will also 
enable correlation with the “folding core” predicted 
by long-range interaction model. Structural character¬ 
ization of denatured states of different MPs will enable 
comparison of the mechanistic details and indicate 
whether there are general principles followed by all 
MPs during folding. For example, comparison of struc¬ 
tural characterization of denatured states of bacterio¬ 
rhodopsin and rhodopsin will explain the differences, 
if any, in the folding pattern of these two proteins and 
hence explain the two different folding models. Such 
residue level characterization of denatured states 
can be carried out by EPR spectroscopy where site- 
specific labeling enables collection of residue-specific 


information. Further, recent advances in NMR spec¬ 
troscopy have enabled structural characterization of 
sparsely populated folding intermediates in soluble 
proteins (Korzhnev et al. 2010). 

Cross-References 

► Atomic Force Microscopy 

► Atomic Force Spectroscopy 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Dynamics of Helix 8 in GPCR Function 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Membrane Proteins; Structure and Organization 

► Protein Dynamics and Allostery 

► Protein NMR - Introduction 
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Rhodopsin: Stability and Structural 
Organization in Membranes 

Andreas Engel and Krzysztof Palczewski 
Department of Pharmacology, School of Medicine, 
Case Western Reserve University, Cleveland, 

OH, USA 

Synonyms 

Retinal purple; Rhodopsin; Seven-Transmembrane 
domain receptors 

Definition 

Rhodopsin (Rho) is the seven-transmembrane- 
spanning receptor that converts light into a chemical 
signal in the eye (Palczewski 2006). 

Basic Characteristics 

Composed of protein moiety called opsin and cova¬ 
lently linked ll-c/i-retinal, Rho is densely packed in 


disk membranes that are stacked like micron-sized 
pancakes in the rod outer segments of retinal photore¬ 
ceptors cells. Upon absorption of a single photon Rho 
transitions to the activated state, Rho*, which binds the 
heterotrimeric transducin (Gt) inducing GDP-to-GTP 
exchange and Gt activation and dissociation. Down¬ 
stream events amplify the photon-induced signal to 
ultimately produce a detectable electric pulse, the 
visual signal processed by vertebrate brain. 

The wealth of stractural, biophysical, and biochemical 
data on its function makes Rho a paradigm for under¬ 
standing G-protein-coupled receptors (GPCRs) and their 
activation. Its structure has been solved first to 2.8 A 
(reviewed in Palczewski 2006) and more recently 
improved to 2.2 A resolution (Okada et al. 2004). Although 
significant progress has been achieved in the high- 
resolution structure determination of multiple GPCRs 
and their various forms (reviewed in Salon et al. 2011), 
complete description of their native state is still lacking. 

Structural Organization of Rhodopsin in the 
Membrane 

High-resolution images of native disk membranes 
obtained by atomic force microscopy (AFM) revealed 





Rhodopsin: Stability and Structural Organization in Mem¬ 
branes, Fig. 1 (a) Atomic force microscopy of disk mem¬ 
branes adsorbed to mica reveals densely packed rhodopsin 
arranged in row of dimers. Row center-to-center distance is 
8.4 nm and dimer center-to-center distance in a row is 3.8 nm. 
Scale bar: 200 nm (Adapted from Fotiadis et al. 2003). 
(b) Negatively stained disk membranes adsorbed to carbon 


film reveal similar lines as those observed by AFM. Power 
spectra were calculated from a circular region adsorbed 
directly to the carbon film {top left insets) and from another 
one lying on a disk membrane {top right insets). Both diffrac¬ 
tion patterns document crystallinity irrespective of the 
support. Scale bars: 150 nm and 2.5 nm“* {insets) (Adapted 
from Liang et al. 2003) 
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Rhodopsin: Stability and Structural Organization in Mem¬ 
branes, Fig. 2 Model of the Rho-Gt heteropentamer (Adapted 
from Liang et al. 2003) 


that densely packed Rho forms rows of dimers (Fig. la; 
Fotiadis et al. 2003). Multiple AFM and transmission 
electron microscopy (TEM) experiments subsequently 
confirmed the oligomeric state of native Rho and opsin 
under fundamentally different conditions (Fig. lb; 
Liang et al. 2003; Suda et al. 2004), corroborating the 
existence of Rho dimers and higher oligomers in disk 
membranes. AFM data provided direct information 
about the packing arrangement of Rho and facilitated 
the construction of a working model for the active 
pentameric Rho-Gt complex (Fig. 2). 

Understanding the chemistry of photoactivation 
requires the structure of the Rho-Gt, Rho-arrestin, 
and Rho-rhodopsin kinase complex to be solved. 
To this end Rho needs to be purified in its native 
state, photoactivated, and incubated with purified Gt. 
Cross sections of different intermolecular interfaces 
have been derived from the Rho packing model 


(578 A^, 333 A^, and 146 A^), indicating weak 
intermolecular interactions within the membrane. 
Blue native gel electrophoresis demonstrated that 
even mild detergents, such as n-dodecyl-P-o-maltoside 
(DDM), disrupt Rho dimers when used at excessive 
concentrations (Fig. 3a; Suda et al. 2004). Milder 
detergents conserve different oligomeric forms of 
native Rho. For example, n-hexadecyl-p-o-maltoside 
(HDM) preserves higher oligomers consisting of 
worm-like rows of Rho dimers (Fig. 3b; Jastrzebska 
et al. 2006). Functional analyses demonstrated an 
approximately tenfold increase of Gt activation by 
FIDM-solubilized Rho compared to DDM-solubilized 
Rho, underlining the functional importance of its 
native, membrane-bound-like state. 

Solubilization of native Rho dimers and in vitro 
reconstitution of the Rho*-Gt complex in the nucleo- 
tide-free arrested state made it possible to isolate and 
characterize the active Rho*-Gt complex (Jastrzebska 
et al. 201 1). Its mass determined by scanning transmis¬ 
sion electron microscopy is 221 ± 12 kDa, compatible 
with a complex comprised of two Rho molecules, one 
Gt and a detergent micelle. The 21 A molecular enve¬ 
lope calculated from projections of negatively stained 
Rho*-Gt complexes accommodates two Rho mole¬ 
cules and one Gt heterotrimer (Fig. 4), corroborating 
the heteropentameric structure of the Rho*-Gt com¬ 
plex. Structures of Rho with other partner proteins 
such as rhodopsin kinase and arrestin have yet to be 
determined. 

Compelling evidence for the dimer concept of 
GPCRs comes from fully functional receptor pairs 
of which one receptor is defective in ligand binding 
and the other in G-protein coupling (Rivero-Muller 
et al. 2010). Functional complementation assays 
showed that the minimal signaling unit consists of 
two class A receptors and a single G-protein (Han 
et al. 2009). Accumulating evidence demonstrates 
that each monomer in this complex is not equivalent 
and retains its unique properties (Salon et al. 2011). 
Whereas atomic scale structural information is now 
available for several GPCRs (Salon et al. 201 1), Rho 
is the only GPCR that is highly abundant in disk 
membranes allowing the native oligomeric state to 
be assessed directly by high-resolution microscopies. 
Biochemical and biophysical analyses of Rho oligo¬ 
mers as well as the Rho-Gt complex are hampered by 
their fragility. Nevertheless, experimental evidence 
accumulates that native Rho exists as a dimer or 
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Rhodopsin: Stability and 
Structural Organization in 
Membranes, Fig. 3 (a) Blue 
native gel of unbleached disk 
membranes solubilized in 
0.3 % (lanes 1-3) and 0.6 % 
DDM (lanes 4-6) at protein 
concentrations of 0.9 mg/ml 
(lane 1 and 4), 0.6 mg/ml (lane 
2 and 5), and 0.3 mg/ml (lane 3 
and 6). Labels on the right 
indicate Rho tetramers (R4), 
trimers (R3), dimers (R2), and 
monomers (Rl), and opsin 
dimers (02) and monomers 
(Ol). The arrow marks empty 
DDM micelles (Adapted from 
Suda et al. 2004). (b) 
Negatively stained preparation 
of HDM-solubilized disk 
membranes revealing worm¬ 
like protein aggregates that 
resemble the rows of dimers 
(Fig. 1) in morphology and 
dimensions. Scale bar: 150 nm 
(Adapted from Jastrzebska 
et al. 2006) 



Rhodopsin: Stability and 
Structural Organization in 
Membranes, Fig. 4 Model 
of the Rho-G, heteropentamer 
built from squid rhodopsin 
dimer (PDB ID: 2Z73) and 
nucleotide-bound Gt (PDB ID: 
IGOT) by fitting the 
components to the molecular 
envelope calculated from 
projections of negatively 
stained Rho-G, complexes. 
Scale bar: 50 A. (Adapted 
from Jastrzebska et al. 2011) 
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higher oligomer in disk membranes, and that the 
active native Rho-Gt complex is a heteropentamer. 
The functional consequences of this arrangement 
remain to be elucidated. 

Cross-References 

► Atomic Force Microscopy 

► Dynamics of Helix 8 in GPCR Function 

► Membrane Proteins: Structure and Organization 

► Rhodopsin - Stability and Characterization of 
Unfolded Structures 

► Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy 

► Rhodopsins - Intramembrane Signaling by 
Hydrogen Bonding 
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Definition 

Ribonucleotide Reductase is the enzyme that catalyzes 
the reduction of ribonucleotides into deoxyribonucle- 
otides (EC: 1.17.4.1). 

Basic Characteristics 

Radical enzyme, nucleotide diphosphates as sub¬ 
strates, allosterically regulated. 

Introduction 

Most of the living forms have their genetic information 
encoded into DNA molecules. The exploit and main¬ 
tenance of this information, through DNA replication 
and repair, depends on the availability of deoxyribo- 
nucleotides. These compounds cannot be obtained 
by external sources and the only way by which they 
can be synthesized is through the reduction of 
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Ribonucleotide Reductase 
- Computational Studies, 
Scheme 1 General reaction 
catalyzed by ribonucleotide 
reductases 



HO OH 

Ribonucleotide 


SH SH 

Cys225 Cys462 


S — S 

^_L 

Cys225 Cys462 



HO H 

Deoxyribonucleotide 



Reductant Species 


2'-ribonucleoside 5'-diphosphates (ribonucleotides) 
into 2'-deoxyribonucleoside 5'-diphosphates (deoxyri- 
bonucleotides). This pathway is strictly conserved in 
all organisms and is catalyzed hy a peculiar group of 
radical enzymes called ribonucleotide reductases 
(RNRs). 

RNRs are mechanistically fascinating enzymes 
because of their free radical chemistry, unusual 
metallic cofactors, and complex regulatory mecha¬ 
nisms (Stubbe 1990). The reduction process catalyzed 
by all RNRs is supposed to be very similar and occurs via 
a free radical mechanism that involves the replacement 
of the C2'-hydroxyl group on the ribose moiety by 
a hydrogen atom. RNR acts in both purine and pyrimi¬ 
dine nucleotides (Sjoberg 1997). In the end of the cata¬ 
lytic process the corresponding 2'-deoxyribonucleotide 
diphosphates and an oxidized enzyme are obtained. In 
order to promote the enzymatic turnover, a disulphide 
bridge between C225 and C462 needs to be reduced 
by external reductants (Scheme 1) (Nordlund and 
Reichard 2006). 

Once the resulting 2'-deoxyribonucleoside 
5'-diphosphates are phosphorylated they become the 
precursors needed for both synthesis and repair of 
DNA. This means that any interference with RNR 
activity can block DNA replication and DNA repair - 
a condition that ultimately can trigger cell death. This 
key role turns RNR into an attractive and primary 
target for therapeutic intervention in several human 
diseases, in which cell proliferation is abnormal and 
responsible for pathologic condition. As such, its 
inhibitors are used in antitumor, antiviral, and 
antibacterial therapies. 

In order to enhance the development of new drugs 
several studies have been carried out over the last three 


decades aiming to expand the understanding of this 
enzyme (Cerqueira et al. 2005). For this purpose 
computational methods, and particularly quantum 
chemical calculations, have become indispensable 
complements to the available structural biology 
studies and biochemical experiments. One advantage 
is that they can provide structural information on tran¬ 
sition states and snapshots of molecules in the act of 
reaction, and the direct detection of such transient 
events is extremely difficult by normal physical 
methods. This is particularly important in the case of 
RNR since it is a radical enzyme, and these methods 
enable us to identify unstable intermediates, and get 
a clear evaluation of the chemical mechanisms 
involved in these processes, with atomic detail (Pereira 
et al. 2006). 

The following section will briefly describe the cur¬ 
rently knowledge provided by computational studies 
addressed to investigate the catalytic and inhibitory 
mechanisms of RNR. They will also highlight how 
these studies have provided useful information for 
further developments in this field. 


R 


The Catalytic Mechanism 

In spite of the apparent simplicity of the catalytic 
mechanism that is depicted at Scheme 1, the reduction 
of ribonucleotides into deoxyribonucleotides is rather 
complex. In the last 20 years, many studies were 
conducted aiming to understand the mechanistic por¬ 
trait of this reaction (Kolberg et al. 2004). However, 
this was not an easy task due to the radical transient 
nature of several intermediates and the limitations of 
the experimental methodologies to monitor the 
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Ribonucleotide Reductase - Computational Studies, metallic cofactor located at protein R2 where the radical is 
Fig. 1 Left. Predicted three-dimensional structure of the RNR generated. Right. Currently accepted mechanism for the reduc- 
holoenzyme (a) Active site of RNR located at protein Rl, (b) tion of the natural substrate 


formation of such species. Taking this into account, 
several computational studies were conducted in order 
to shed some light into the catalytic mechanism and try 
to ht, into a unified mechanism, all the transient inter¬ 
mediates identified by experimental methods. Based 
on these studies, the mechanism that is currently 
accepted for the reduction process is depicted at 
Fig. 1 (Cerqueira et al. 2004). 

The mechanism is initiated when the radical is 
transferred from protein R2, where it is generated, to 
protein Rl, where the active site is located (Fig. 1). 
Taking into account the available experimental data, it 
is known that the active site residue Cys439 is the one 
that receives the radical and is responsible for the 
beginning of the reduction process. 

The mechanism involves six steps, in which sev¬ 
eral hydrogen atoms and proton exchanges take 
place between the substrate (the ribonucleotide) 
and a set of residues of the active site, namely, 
Cys439, Glu441, Cys225, and Cys462. In the course 
of the reaction one water molecule is produced and 
a radical anionic disulfide bridge is transiently 
formed. This last intermediate was a subject of 
great discussion in the literature, but it was ulti¬ 
mately identified through EPR (Lawrence et al. 
1999) and its role elucidated by computational 
methods (Cerqueira et al. 2004). This intermediate 
is later oxidized to a disulfide bridge, which remains 
unchanged until the end of the reduction process. It 
is proposed that the disulfide bridge is reduced by 
reductive species (such as the thioredoxin or 


glutathione) after product release. Only after this 
step the enzymatic turnover takes place. 

The Inhibitory Mechanism of 2'-Nucleotide 
Analogues 

RNR inhibitors are commonly administrated as inac¬ 
tive prodrugs that only become active after two phos¬ 
phorylation steps promoted by cell kinases. The 
resulting compounds are recognized by RNR as 
normal substrates and react in the active site leading 
to abnormal products that inactivate the enzyme. RNR 
tolerates a wide range of modifications on the ribose 
ring, but the most interesting ones are the 2'-analogues 
that have activity in vivo (Cerqueira et al. 2007). These 
compounds differ from the natural substrate in the 
group that is bound to C-2'. Examples include 
the monohalogenated 2'-deoxy-2'-fluoronucleosides 
(FdN) and 2'-deoxy-2'-chloronucleosides (C/dN), the 
dihalogenated 2'-deoxy-2',2'-difluoronucleosides 
(F 2 dN), the 2'-azido-2'-deoxynucleosides (A^^dN), 
the 2'-deoxy-2'-methylenenucleosides (C// 2 dN), the 
2'-deoxy-2'-fluoromethylenenucleosides (CHFcM), 
and the 2'-deoxy-2'-mercaptonucleosides (5//dN). 

In the last decade, almost all the 2'-deoxynucleoside 
analogue inhibitors were studied by computational 
methods (Perez et al. 2010), including gemcitabine 
(2',2'-difluoro-2'-deoxycytidine) that is approved by 
FDA and EMEA for the treatment of several types of 
cancer. The results have shown that in spite of 
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Ribonucleotide Reductase - Computational Studies, Table 1 Available experimental and computational results used to 
classify RNR nucleoside analogues in three main classes, taking into account their mode of inactivation 


Class of inhibitors 

Inhibitors 

Inhibition 

Protein Rl 

Protein R2 

Furanone derivatives 
detection (nm) 

Activity at physiologic 
conditions 

Class 1 

C/^fNDP 

Yes“ 

No 

320 

No 


Fr/NDP 

Yes“ 

No 

320 

No 


SHdNDP (anaerobic) 

Yes“ 

No 

320“ 

No 


C//2<fNDP 

Yes“ 

No 

326 and 366 

No 

Class 2 

iV,rfNDP 

Yes“ 

Yes 

320 

Data not available 


CHFdNDP 

Yes“ 

Yes 

334 

Yes 

Class 3 

FjrfNDP 

Yes'’ 

Yes” 

- 

Yes 


S//r/NDP(aerobic) 

Yes 

Yes 

- 

Yes 

“In the presence of reductive species 
'’In the absence of reductive species 

“The formation of the furanone derivative has been proposed by computational 

means 



the apparent similarity of these compounds, the nature 
of the chemical entity that is bound to C-2' can 
completely change the course of the reduction process 
and lead to different types of inhibition, as it is 
observed experimentally. These studies have also 
shown that the mode of inhibition is dependent on the 
type of mechanism that is followed. This usually 
means that if different compounds follow analogous 
mechanisms, they will produce similar inhibitory 
effects. If they follow different pathways the type of 
inhibition will be usually different. 

Considering all the available data (derived experi¬ 
mentally and computationally), it is now possible to 
divide the 2'-nucleoside inhibitors in three distinct 
classes (Table 1). The classification is based on the 
type of inhibition. The inhibition process generally 
involves the irreversible inactivation of protein Rl, 
protein R2, or both. 

The inhibitors of Class I include all the compounds 
that inhibit protein Rl due to the formation of 
a furanone derivative. Once in solution, this compound 
reacts with an active site residue and blocks its 
entrance, precluding any further reduction. From all 
the three classes, these inhibitors are the less efficient 
in in vivo conditions, because they are inefficient in 
the presence of reductive species (widely available at 
physiological conditions), which react with the 
furanone derivatives. 

The inhibitors of Class II contain N3dNDP and 
CHFdNDP. These compounds have similar character¬ 
istics to those ones of the first class but also inhibit 
protein R2. These characteristics improve their 


cytotoxic properties and enhances their efficiency 
under physiologic conditions, since the inhibition pro¬ 
moted at R2 is independent of the presence of reduc¬ 
tive species. The main difference between the 
inhibitors of this class and the ones of class 1 is the 
nature of the substituents bound to C-2'. These sub¬ 
stituents tend to be more reactive toward the active 
site, which allows them to remain active after the 
formation of the furanone derivative contrarily to 
what happens to the ones of the first class. 

The inhibitors of class III include all the compounds 
whose inhibitory pathway does not involve the forma¬ 
tion of furanone derivatives. Generally, the enzyme is 
inhibited through the covalent bonding of a reaction 
intermediate to an active site residue, leading to 
the destruction of the radical that is essential for the 
catalysis. The concerted inhibition is very efficient 
and generally leads to a complete inactivation of the 
enzyme. The compounds that are already used clini¬ 
cally and therefore already approved by FDA and 
EMEA agencies for the treatment of some types of 
cancer belong to this class. 

Future Developments in This Field 

Regarding the key role of RNR as a critical target 
in antitumor, antiviral, and antibacterial therapies, 
together with the large number of RNR inhibitors that 
were developed/patented in the last 30 years, several 
theoretical and computational studies were conducted 
to understand how the enzyme inhibition occurs. 
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This was a major milestone in this field because, in 
spite of large amount of experimental data that was 
available at that time, the correct mechanism through 
which inhibition occurs was still poorly understood. 
This knowledge was rather urgent because the FDA 
and the EMEA agencies had already approved some 
RNR inhibitors for the treatment of several types of 
cancer but the mechanism by which they act and 
inhibit RNR was still not fully understood. In addition, 
there were many unrelated experimental data that 
could not be fitted into a unified mechanism, and this 
information was crucial for the development of new 
and more potent RNR inhibitors. 

The current knowledge now allows looking for the 
future of RNR inhibition in a new way. The developed 
tools and the available information allows for the 
prediction of the type (and eventually the mechanism) 
of inhibition of new 2'-nucleoside analogues. These 
fundamental guidelines can now be used to embark in 
a more rational drug discovery process to find more 
potent inhibitors of RNR. 

Cross-References 

► Computational Enzymology 

► Computational Studies of Enzyme Motions 

► Differential Transition State Stabilization - 
Nonempirical Analysis 

► DNA Repair and Recombination 

► EPR 

► Machinery of DNA Replication 

► QM/MM Calculations of Vibrational Spectra 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 
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Definition 

Catalytic domains present in modular, multidomain 
RNA-binding proteins carry out enzymatic reactions. 
These proteins also contain dsRNA-binding domains 
that interact with duplex regions of RNA. 

Basic Characteristics 

A subset of the large family of RNA-binding proteins 
contains catalytic domains. These multidomain enzymes 
play important roles in regulating gene expression, 
nucleic acid metabolism, immunity, and other cellular 
processes. In particular, enzymes that contain RNA- 
binding proteins are core components of the RNA inter¬ 
ference machinery and function in related pathways 
involving small RNAs. In most cases, RNA binding is 
mediated by the presence of one of more dsRNA-binding 
motifs (dsRBMs). The dsRBM binds to dsRNAs and to 
RNAs that contain duplex regions, but it does not interact 
with single-stranded RNAs or with RNA-DNA hetero¬ 
duplexes. A prominent group of enzymes that all contain 
the dsRBM are endonucleases of the RNase III family 
that function to cleave dsRNAs. These proteins contain 
one or two endonuclease domains, a dsRBM and, in 
some cases, additional domains are present (Fig. 1). 
Members of this family include the prokaryotic 
and yeast RNase III enzymes, Dicer, which functions in 
the siRNA pathway and Drosha, which mediates the first 
endonucleolytic cleavage reaction in the miRNA 
pathway. Another class of catalytic RNA-binding pro¬ 
teins are the adenosine deaminases that act on RNA 
(ADARS). These enzymes convert adenosines to 
inosines in RNA duplexes present within both coding 
and noncoding RNAs. The A ^ 1 editing reaction within 
coding regions of mRNAs can lead to a change in protein 
sequence. However, ADARS most frequently target 
introns and 3'- and 5'-UTR regions in mRNAs and also 
interact with the RNA interference pathway. ADARS 


contain a conserved C-terminal catalytic domain and 
between one and three N-terminal dsRBMs (Fig. 1). 
Some ADARS also contain arginine rich domains that 
bind ssRNA, and one ADAR contains two Z-DNA bind¬ 
ing domains. Finally, the enzyme protein kinase R (PKR) 
plays a key role in the innate immunity response to viral 
infection and is also implicated in the control of cell 
growth and differentiation. PKR contains two dsRBMs 
and a protein kinase domain (Fig. 1). In contrast to the 
RNase III and ADAR families, where the dsRBMs are 
involved in the recognition of the RNA substrate, the 
catalytic domain present in PKR does not act on RNA 
but instead functions to phosphorylate protein substrates. 
In PKR, dsRNA binding is not directly involved in 
substrate recognition but serves to activate the kinase 
domain. In this article, we will focus on PKR because it 
has been extensively investigated using a variety of 
structural, biophysical and biochemical approaches. 

PKR is induced by interferon in a latent form 
and is activated upon binding dsRNA to undergo 
autophosphorylation and subsequently phosphorylate 
cellular substrates. PKR is also activated by 
5'-phosphorylated RNAs containing a short dsRNA hair¬ 
pin flanked by single stranded regions as well as by the 
protein activator PACT. The most well-characterized 
cellular substrate of PKR is the alpha subunit of eukary¬ 
otic initiation factor eIF2. Phosphorylation of eIF2tx 
inhibits protein synthesis in virally infected cells. PKR 
contains two copies of the dsRBM at the N-terminus 
(dsRBM 1 and dsRBM2) and a C-terminal kinase domain, 
with a ^90 amino acid unstructured linker lying between 
these domains (Fig. 2a). PKR is the only RNA-binding 
protein that contains a kinase domain. However, it 
belongs to a family of stress-activated enzymes that all 
contain a conserved kinase domain that phosphorylates 
eIF2a linked with different regulatory domains. 

In the NMR structure of an N-terminal construct of 
PKR containing both dsRBMs, each of the motifs 
adopts the canonical aPPPa fold and they are 
connected by an unstructured linker of ^ 20 amino 
acids (Nanduri et al. 1998). The overall structures of 
the two domains are quite similar, with a backbone root 
mean square deviation of 2 A (Nanduri et al. 1998). 
However, NMR relaxation measurements indicate that 
dsRBM 1 exhibits greater motional flexibility on the ms 
timescale than does dsRBM2 (Nanduri et al. 2000). 
They also differ with respect to RNA-binding 
affinity, with dsRBM2 binding much more weakly 
than dsRBMl (Tian and Mathews 2001). However, 
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■NI^NNNN-5' 
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3'-NN-5' 3'-NNNN-5' 



RNA-Binding Proteins - Catalytic Domains, Fig. 1 Repre¬ 
sentative RNA-binding proteins containing catalytic domains. 
Left, domain organization of S. cerevisiae Rntlp (RNase III), 
H. sapiens ADAR2 (adenosine deaminase acting on RNA), and 
H. sapiens PKR (protein kinase R). Right: reaction catalyzed by 


the three enzymes. RNase III enzymes cleave dsRNA leaving 
two or three nucleotide 3'-overhangs. Adenosine deaminases 
convert adenosines to inosines. The major substrate of PKR is 
eukaryotic initiation factor 2a 


constructs containing both motifs bind dsRNA about 
30-fold more strongly than dsRBMl alone, indicating 
cooperation between the two motifs (Tian and 
Mathews 2001; Ucci et al. 2007). NMR shift perturba¬ 
tion studies show that only dsRBMl interacts with 
a 20 hp dsRNA, whereas both motifs are perturbed 
upon binding to an activating 40 bp dsRNA (Ucci 
et al. 2007). The crystal structure of a complex of the 
PKR kinase domain with eIF2a have been solved (Dar 
et al. 2005). The catalytic domain has a typical protein 
kinase fold consisting of an N-terminal lobe that is 
mostly P sheet and a C-terminal lobe that is predomi¬ 
nantly helical. The 90 residue region lying between 
the dsRNA-binding motifs and the kinase has low 
sequence complexity with a high content of uncharged, 
polar residues. Small angle scattering (VanOudenhove 
et al. 2009) and NMR (McKenna et al. 2007) measure¬ 
ments indicate that this linker is flexible in solution. 

The regulation of PKR by RNA activators has fasci¬ 
nated researchers for over 30 years. In the original 
autoinhibition model for PKR activation, dsRNA bind¬ 
ing induces a conformational change which leads to the 
release of the dsRNA-binding domain from the kinase, 
thus relieving the inhibition of the latent enzyme 
(Fig. 2b). However, more recent structural and biophys¬ 
ical data now favor a model where dsRNA principally 
functions to induce dimerization of PKR via the kinase 
domain (Fig. 2c). Equilibrium and kinetic analysis of the 


binding of fluorescent nucleotide analogues to PKR 
reveals that ATP can indeed interact with the latent 
enzyme (Lemaire et al. 2006). Furthermore, nucleotide 
binding affinities and kinetics are only weakly affected 
upon binding PKR to an activating dsRNA or by PKR 
autophosphorylation. Thus, the active site is freely acces¬ 
sible in both latent and activated PKR. NMR shift per¬ 
turbations measurements (Gelev et al. 2006; Nanduri 
et al. 2000) indicate that dsRBM2 binds to the kinase 
domain. However, this interaction is very weak (Ander¬ 
son and Cole 2008) and argues against an autoinhibition 
model in which latent PKR is locked in a stable, closed 
conformation. Indeed, small-angle scattering analysis 
indicates that latent PKR adopts multiple open and 
closed conformations in solution (VanOudenhove et al. 
2009). 

A variety of structural, biophysical, and biochemi¬ 
cal data suggest that PKR dimerization plays a key role 
in the activation mechanism (Fig. 2c). A defining fea¬ 
ture of PKR is the “bell-shaped” curve for activation, 
where low concentrations of dsRNA activate but higher 
concentrations are inhibitory. These results can be ratio¬ 
nalized in a model where low concentrations of dsRNA 
favor assembly of multiple proteins - possibly assem¬ 
bling as dimers - on a single dsRNA, whereas higher 
dsRNA concentrations dilute PKR monomers onto sep¬ 
arate molecules of dsRNA. PKR is capable of dimerizing 
in the absence of dsRNA, and this reaction is capable of 
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RNA-Binding Proteins - Catalytic Domains, Fig. 2 PKR 

domains and their interactions, (a) Domain organization and 
structures. The N-terminal regulatory domain comprises two 
dsRNA-binding motifs, dsRBMl and dsRBM2, connected by 
an unstructured linker. Each of these motifs adopts the canonical 
aPPPa fold in the NMR structure of the dsRBD (PDB code: 
1QU6). In the crystal structure of a complex of the PKR kinase 
domain and eIF2a (PDB code: 2A1 A), the kinase domain has the 
typical bilobal structure observed in other eukaryotic protein 


kinases and dimerizes via the N-terminal lobe (note, the sub¬ 
strate eIF2a was omitted from the figure for clarity), (b) 
Autoinhibition model for PKR. The interaction of dsRBM2 
with the kinase domain blocks the activity of latent PKR. Bind¬ 
ing to dsRNA activates PKR by removing dsRBM2 from the 
kinase, (c) Dimerization model for PKR activation by dsRNA. 
The binding of PKR to dsRNA of at least 30 bp induces PKR 
dimerization via the kinase domain, resulting in activation 


R 


activating autophosphorylation in the absence of dsRNA. 
The structure of the PKR kinase domain complexed with 
eIF2a revealed a dimerization interface localized on the 
N-terminal lobe (Dar et al. 2005) An allosteric pathway 
linking the dimer interface and the PKR kinase active 
site has been proposed. Mutations designed to disrupt 
intermolecular interactions at this interface block PKR 
autophosphorylation. Like PKR, the kinase domain from 
Mycobacterium tuberculosis PknB forms a back-to-back 
dimer and recent structural and biochemical indicate 


that dimerization allosteric ally activates the kinase 
(Lombana et al. 2010). 

PKR and other RNA-binding proteins containing 
catalytic domains are regulated by a complex network 
of labile protein-protein and protein-RNA interac¬ 
tions. In order to quantitatively characterize the stoi¬ 
chiometries, affinities, and free-energy couplings that 
govern the assembly of the relevant macromolecular 
complexes, it is critical that the interactions are mea¬ 
sured at equilibrium using rigorous biophysical 
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methods. We have found analytical ultracentrifuga¬ 
tion, isothermal titration calorimetry, and fluorescence 
anisotropy measurements to be particularly useful in 
our studies of protein self-association, domain interac¬ 
tions, and protein-RNA interactions. 
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Synonyms 

Arrest; Elongation; Initiation; Multi-subunit RNA 
polymerase; Pausing; Promoter escape; Termination; 
Transcription cycle 

Definitions 

Multi-subunit RNA polymerases (RNAPs) synthesize 
a RNA chain from a complimentary DNA template. 
The ternary elongation complex (TEC) is composed of 
RNAP, DNA, and nascent RNA. Template and non¬ 
template DNA separate to form the transcription 
bubble. Because RNAPs translocate along the DNA 
template during polymerization, RNAPs are consid¬ 
ered to be molecular motors. 


Introduction 

According to the central dogma of molecular biology, 
information flow in biological systems runs from DNA 
to RNA to protein. ► DNA polymerases (DNAPs) and 
RNA polymerases (RNAPs) are enzymes that copy 
genetic material through a template-dependent polymer¬ 
ization process. DNA-dependent DNAPs synthesize 
a complementary DNA from a DNA template, and 
DNA-dependent RNAPs synthesize a complementary 
RNA from a DNA template. RNA synthesis is referred 
to as “transcription,” rather than “replication,” because 
a RNA is made from a DNA template, subtly changing 
the chemistry of the genetic code. Both DNAPs and 
RNAPs utilize an analogous 2-Mg^'^ mechanism in their 
polymerization reaction. 

Transcription in eubacteria is carried out by 
a single RNAP. Eubacterial RNAPs have hve core 
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RNA Polymerases and Transcription, Fig. 1 The yeast (Sc) NT non-template DNA, GMPcPP a GTP analogue, gi-een 
RNAP II TEC. The ten subunit RNAP II is shown, lacking spheres Mg^"^, gray spheres Zn^*. Structural figures were 
subunits Rpb4 and 7 (PDB 2NVT). R RNA, T template DNA, drawn using visual molecular dynamics 


enzyme subunits: P' (catalytic), p, a dimer of 
a subunits, and a ® subunit. Archaea also have 
just one RNAP, which may consist of 13 subunits, 
homologous to the subunits in the three types of 
eukaryotic RNAPs. Eukaryotes have three nuclear 
RNAPs: RNAP I synthesizes ribosomal RNAs; 
RNAP II synthesizes mRNAs and some non-coding 
RNAs; RNAP III synthesizes small nuclear RNAs, 
including 5 S ribosomal RNA, tRNAs, and some 
► microRNA precursors. The mitochondrial RNAP is 
a single subunit enzyme homologous to bacteriophage 
RNAPs. This review focuses on multi-subunit RNAPs 
(Fig. 1), which are highly conserved in eubacteria, 
archaea, and eukarya. 

The Transcription Cycle 

Because RNA synthesis lies at the heart of biological 
information flow, transcription is highly regulated. 
The transcription cycle progresses from initiation to 
promoter escape to elongation to termination to 


reinitiation (Fig. 2). After synthesis, RNAs are sub¬ 
ject to numerous regulatory events including internal 
and external processing, covalent modification, 
polyadenylation, and sequestration. RNAPs initiate 
transcription from a promoter DNA sequence. Pro¬ 
moter recognition involves accessory factors most of 
which are dissociated during the elongation phase of 
the cycle. Promoter escape is the process of RNAP 
leaving the promoter and dissociating from accessory 
initiation factors. In some cases, abortive initiation 
occurs, when RNAP fails to free itself from the 
promoter to productively extend a RNA chain. Elon¬ 
gation is the chemical polymerization phase in which 
RNA is synthesized. In cells, elongation is modu¬ 
lated by many accessory factors. Multi-subunit 
RNAPs are highly “processive,” meaning that once 
in a productive elongation phase, RNAP tenaciously 
retains its DNA template and nascent RNA chain. 
Termination is usually an active process. After the 
nascent RNA is released, the transcription bubble 
collapses, and RNAP cannot restart the elongation 
of the same RNA molecule. 


R 
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RNA Polymerases and Initiation 

Transcription, Fig. 2 The Promoter DNA sequence 

transcription cycle Initiation factors 



Eiongation 
Elongation factors, 
Anti-termination factors 
Pausing, arrest, re-activation 


Initiation of Transcription 

Bacterial RNAPs 

The topic of transcriptional initiation is too broad and 
complex to fully address here, so a short version is 
offered. Essentially, during initiation RNAPs must 
convert from a promoter-specific DNA-binding pro¬ 
tein to a protein with little DNA sequence recognition 
during elongation. Generally, RNAPs utilize accessory 
factors for promoter recognition and initial DNA 
strand separation. Some promoters have binding sites 
for protein factors that suppress or activate initiation. 
Bacterial RNAPs utilize sigma factors for core pro¬ 
moter recognition. Sigma factors participate directly in 
DNA strand separation at the promoter, and sigma 
factors dissociate from elongating RNAP. The exiting 
RNA chain, for instance, helps to displace a sigma 
factor from bacterial RNAP. 

Promoter escape is the transition between initiation 
and productive elongation. At many bacterial promoters, 
short, abortive transcripts are repeatedly released, indi¬ 
cating that reinitiation can out-compete commitment to 
stable elongation. Another way that promoter escape has 
been visualized is by “scmnching.” Using fluorescence 
resonance energy transfer (► FRET) probes to map 
RNAP movement during initiation and elongation, it 
was shown that RNAP compresses the growing DNA 
single-stranded bubble, rather than translocating forward 
and jumping back (the alternate “excursion” model) 
(Kapanidis et al. 2006). The transition from abortive 


initiation to elongation therefore occurs when RNAP 
translocates forward from the promoter, releases strong 
promoter contacts, and stably fills its RNA-DNA hybrid. 

Eukaryotic RNAP II 

Each of the three eukaryotic RNAPs utilizes a distinct 
promoter architecture and set of transcription initiation 
factors. As an example, RNAP 11 utilizes enhancer¬ 
binding factors, repressors, adaptor factors, mediator 
factors, and general initiation factors (Sikorski and 
Buratowski 2009; Taatjes 2010). In addition to 
sequence, promoter architecture involves positioned 
► histones and regulated covalent modifications of 
histones, transcription factors, and DNA. DNA strand 
separation at the promoter generally requires general 
initiation factor TFllH, a complex factor that includes 
two subunits with ATP-dependent DNA helicase activ¬ 
ities. TFllH-dependent DNA strand separation appears 
to be required for initiation and promoter escape by 
RNAP 11. Other RNAPs initiate without a TFllH-like 
helicase. 


RNAP Structure and Rate-Limiting Steps in 
Transcription Elongation 

Polymerization Reaction 

Multi-subunit RNAPs utilize nucleoside triphosphate 
(NTP) substrates, which are added at the RNA 3' end 
as nucleoside monophosphate (NMP) units, releasing 
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RNA Polymerases and Transcription, Fig. 3 The 2-Mg^* 
mechanism of RNA synthesis. The image shown is from 
a bacterial TEC (PDB 205 J). RNA is elongated by attack of 
the 3'-0 of the RNA on the alpha-phosphate of the NTP substrate 
(white arrow’), releasing pyrophosphate. The / and i + I 
translocation registers are indicated (Mg is green) 

diphosphate (pyrophosphate) (Fig. 3) (Brueckner et al. 
2009a, b; Komberg 2007). NMP addition to the grow¬ 
ing RNA chain, therefore, is in the 5' ^ 3' direction, 
according to the numbering of the ribose sugar of the 
ATP, GTP, CTP, or UTP base. Two Mg^"^ ions are held 
by acidic groups in the RNAP structure and by the NTP 
triphosphate. Polymerization occurs by in line attack 
of the 3'-0 of the ribose sugar on the alpha-phosphate 
of the substrate NTP, releasing diphosphate/pyrophos¬ 
phate. This polymerization reaction is very analogous 
to that catalyzed by single-subunit RNAPs (i.e., bacte¬ 
riophage T7 RNAP) and DNAPs. 

Like some DNAPs and single-subunit RNAPs, 
multi-subunit RNAPs have a mechanism for opening 
and closing of the RNAP active site (Fig. 4) (Erie and 
Kennedy 2009; Kireeva et al. 2010; Zhang et al. 2010). 
In the case of multi-subunit RNAPs, opening and clos¬ 
ing involves a conformational rearrangement of the 


trigger loop (Fig. 4; middle and right panels). In 
a closed TEC, the trigger loop becomes more ordered 
with additional helical stmcture. In the open TEC, the 
trigger loop is more disordered. The closed TEC is the 
catalytic structure and requires a loaded cognate NTP 
substrate. RNAP has a deeply buried active site. 
Molecular dynamics simulation indicates that RNAP 
utilizes the trigger loop to enclose and dehydrate the 
active site base pair (Seibold et al. 2010). Closing of 
the RNAP active site strips water from the dNMP-NTP 
base pair, stabilizing the substrate NTP in the active 
site. Many enzymes including some DNAPs and sin¬ 
gle-subunit RNAPs utilize active site closing mecha¬ 
nisms that likely regulate specific water placement 
during catalysis. 

NTP Loading to the Active Site 

The RNAP active site is not readily accessible to NTP 
substrates, and two possible NTP loading routes are 
currently debated. The localization of the cognate and 
non-cognate NTP substrates in the TEC structure has 
been interpreted as evidence of NTP diffusion through 
the secondary pore in the open TEC (Fig. 4; left 
panels). Electrostatic diffusion modeling, however, 
suggests that the pore is deep, narrow, and negatively 
charged, and passive diffusion of negatively charged 
NTP-Mg^"^ through this channel is predicted to be 
slow. If the secondary pore represents the primary 
route for the incoming NTP, substrate loading should 
be rate-limiting for transcription in vivo and in vitro. 
However, pre-steady-state kinetic analyses clearly 
demonstrate that RNAP TECs bind and stably 
sequester the incoming NTP much faster than the 
phosphodiester bond is synthesized. Moreover, elon¬ 
gation factors that occlude the secondary pore do not 
necessarily slow down RNA synthesis. Taken together, 
these observations suggest that NTP loading might 
occur through a different route, for example, via the 
downstream DNA cleft. The secondary pore might 
represent the expulsion route for pyrophosphate and 
incorrect NTP substrates (Erie and Kennedy 2009; 
Zhang et al. 2010; Kireeva et al. 2010). 

Translocation 

Translocation of RNAP along the DNA results in the 
displacement of the recently incorporated NMP base 
paired to the template DNA base from the substrate¬ 
binding site, enabling the subsequent NTP to load to 
the active site. The closed conformation of the TEC 
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RNA Polymerases and Transcription, Fig. 4 The open and 
closed conformations of the trigger loop in the bacterial RNAP 
TEC. Top panels represent the closed TEC (C); Lower panels 
represent the open TEC (O). Leftmost panels show that the 
secondary pore is almost closed in the closed TEC and that 
the RNAP active site is deeply buried. Blue: beta'; tan: beta; 


is constrained for translocation along the DNA tem¬ 
plate, and the open TEC is likely the translocating 
form (Fig. 4). Translocation might represent a rate- 
limiting step in transcription elongation. Two models 
have been considered for translocation; the passive 
thermal ratchet and the NTP-driven (or allosteric) 
translocation. According to the passive thermal 
ratchet model, the trigger loop opens spontaneously 
after chemistry, pyrophosphate is released prior 
to NTP loading, and NTPs load only to a post- 
translocated TEC. The post-translocated state is 
stabilized by the incoming cognate NTP, but the 
translocation is not allosterically driven by the 


yellow: DNA and RNA; red: NTP. Middle and right panels show 
two views of the nucleic acids and active site. Yellow: bridge 
helix (BH); orange: trigger loop (TL); green: Mg^"'". The ATP 
substrate in the active site is indicated. Images are derived from 
PDB 2PPB and 205 J 


incoming NTP (Erie and Kennedy 2009; Zhang 
et al. 2010; Kireeva et al. 2010). 

The NTP-driven (allosteric) translocation model 
posits that the incoming NTP acts as an allosteric 
effector supporting a conformational change that 
involves opening of the trigger loop and freeing the 
RNAP thermal ratchet for translocation. In this way, 
pyrophosphate release is coupled to trigger loop open¬ 
ing and translocation and is dependent on the incoming 
NTP. Because templated binding of the incoming 
NTP facilitates pyrophosphate release from RNAP, it 
appears that a NTP and pyrophosphate must be on 
template simultaneously. Further kinetic analyses of 
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RNA Polymerases and 
Transcription, Fig. 5 The 

transcription bubble. The 
image is from a 
post-translocated yeast 
RNAP 11 TEC lacking 
a substrate NTP. The 
non-template DNA strand was 
placed by FRET 
measurements. R RNA, 

T template DNA, NT 
non-template DNA, BH bridge 
helix. Dimensions of the 
bubble and RNA-DNA hybrid 
are indicated. In a 
pre-translocated TEC the 
RNA-DNA hybrid is 9 nt. 

The ; and i + 1 translocation 
registers are indicated 
(The PDB file was from 
J. Michaelis) 



transcription elongation are required to establish 
whether pyrophosphate release and/or translocation is 
rate-limiting in transcription and if the incoming NTP 
actively promotes translocation. 

Transcription Bubble Maintenance and DNA 
Unwinding 

RNAP initiates a RNA chain on double-stranded DNA, 
necessitating separation of DNA strands to expose the 
DNA template. As RNAP elongates a RNA chain, 
a transcription bubble is maintained (Fig. 5). The 
length of the bubble has been measured as 12-13 
nucleotides (Andrecka et al. 2009). The length of the 
nascent RNA hybrid with DNA is 8-9 bases. The DNA 
is bent in the active site; additional bending of the 
DNA may occur at the upstream edge of the transcrip¬ 
tion bubble. RNAP moves on the DNA template from 
the “upstream” to the “downstream” direction, the 
5' ^ 3' directionality of the RNA chain, which is the 
direction of RNA synthesis (Fig. 5). At the down¬ 
stream end of the bubble, DNA must be continually 
unwound, and, at the upstream end of the bubble, DNA 
must continuously be rewound. RNAP movement 
along the template results in accumulation of negative 
supercoils upstream and positive supercoils down¬ 
stream from RNAP. 

Pausing, Backtracking, Arrest, Termination 

Transcription elongation is tightly regulated, and tem¬ 
porary stopping (pausing) of RNAP plays an essential 


role in the regulation of gene expression (Landick 
2006; Kireeva et al. 2010). In prokaryotes, transcrip¬ 
tional pausing is essential for attenuation mechanisms. 
In eukaryotes, pausing in the vicinity of the promoter 
and the regulated, factor-dependent restart of tran¬ 
scription has recently emerged as a critical mechanism 
of gene regulation. In addition to pausing, an irrevers¬ 
ible catalytic inactivation of RNAP, or transcription 
arrest, which occurs without dissociation of RNAP 
from the template, has been observed in vitro. In the 
cell, transcription arrest occurs when RNAP encoun¬ 
ters an obstacle (a DNA-binding protein or a DNA 
lesion). 

The best characterized mechanism for the revers¬ 
ible or irreversible catalytic inactivation of RNAP is 
backtracking, a unique property of multi-subunit 
RNAPs. During backtracking, the RNAP moves 
upstream along the DNA, extruding the 3' end of the 
RNA from the active site and maintaining the 8-9 bp 
RNA-DNA hybrid. The backtracking mechanisms of 
RNAP catalytic inactivation were initially demon¬ 
strated in vitro using biochemical approaches. Arrest 
and backtracking have been demonstrated in vivo and 
further confirmed by crystallographic data and single¬ 
molecule analyses. Some pauses of prokaryotic RNAP 
and promoter-proximal pauses of eukaryotic RNAP II 
occur by a backtracking mechanism. 

Backtracked TECs may spontaneously return to 
the active conformation. Alternatively, they may be 
reactivated by spontaneous or factor-assisted cleavage 
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RNA Polymerases and 
Transcription, Fig. 6 TFIIS 
regulation of RNAP II 
elongation. T template DNA, 
NT non-template DNA, 

R RNA, BH bridge helix, 
ns TFIIS, /// domain III 
(a ribbon). 290-DE-291 
is in red (PDB 2NVT and 
lYlV) 



of the extruded transcript, resulting in resetting of the 
active site and the RNA 3' end. RNAP II transcript 
cleavage factor TFIIS has a DE motif at the end of 
a Zn^"^ ribbon that reaches through the secondary pore 
into the RNAP II active site and that holds a Mg^"^ close 
to the location of Mg-B (Brueckner et al. 2009a, b). 
Positioning Mg-B close to the more strongly held 
catalytic RNAP II Mg-A creates the active site of 
a RNAP II-TFIIS endonuclease that can cleave the 3' 
end of the nascent RNA, generally in dinucleotide 
increments (Fig. 6). Bacterial Gre factors function 
similarly to TFIIS. 

Not all the pausing events involve backtracking. 
Bacterial RNAP pausing in the leader regions of 
operons encoding biosynthetic genes, such as the his¬ 
tidine and tryptophan operons, undergo structural 
rearrangement of the active site, described as fraying 
of the RNA 3' end, in response to RNA hairpin forma¬ 
tion. The regulation of hairpin-dependent pausing is 
multi-partite, and the details of the mechanism are still 
being elucidated. Another example of RNAP catalytic 
inactivation, which does not involve backtracking, is 
the stalling of the TEC in front of a DNA lesion. 

The exceptional stability of the TEC poses a signifi¬ 
cant problem for RNAP recycling, and transcription 


termination is an active process. In prokaryotes, tran¬ 
scription may be terminated on well-defined sequences 
(terminators), which encode a strong RNA hairpin 
followed by a stretch of 8-9 nt oligo-U tract. Another 
mechanism requires involvement of a specific termina¬ 
tion factor rho. Termination by eukaryotic RNAPs is less 
well studied. RNAP II is likely to terminate on oligo-U 
tracts with assistance of specific termination factors that 
destabilize the TEC by a “torpedo” mechanism. 

RNAP as a Molecular Motor 

RNAP, a powerful and precise molecular motor, runs on 
a DNA template with a defined but short translocation 
step length of about 3.4 A. ► Single-molecule RNAP 
elongation experiments have been done to determine: 
(1) the translocation step length; (2) the order of addition 
of NTPs; (3) force-velocity relationships; (4) pausing, 
backtracking, and arrest; (5) responses to pyrophosphate; 
and (6) termination (Herbert et al. 2008). The effects of 
TFIIS on RNAP II elongation and histone barriers to 
elongation have been analyzed. Using single-molecule 
approaches, it is challenging technically to observe the 
3.4 A step. Generally, “molecular tweezers” have been 
used and RNAP elongation has been observed indirectly 
through the movement of tethered beads. TFIIS helps to 
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RNA Polymerases and Transcription, Fig. 7 The RNAP II transcription cycle is highly regulated 


rescue RNAP II TECs facing overwhelming resisting 
applied force. Bacterial RNAPs are exceptionally resis¬ 
tant to opposing external force, suggesting that forward 
translocation may be rate-limiting. 

Transcription as a Mini-Cell Cycle: Accessory 
Factors and Covalent Modifications 

RNAP II elongation is heavily regulated by accessory 
protein factors, covalent modification of histones, 
RNAP II and other proteins, and formation of higher 
order gene architectures (Fig. 7). Once again, this 
subject moves beyond the scope of this review, so it 
is dealt with in an abbreviated manner. The analogy 
that we suggest is that RNAP II is regulated during 
elongation by myriad accessory factors, cyclin- 
dependent kinases, ► phosphatases, ubiquitin ligases, 
methyltransferases, prolyl isomerases, proteases, and 
acetyltransferases, in stages with checkpoints, simi¬ 
larly to a cell cycle (Perales and Bentley 2009; 
Buratowski 2009; Kim et al. 2010). The transcription 
cycle has redundant layers of control, so it can function 
and be regulated in a tangle of contexts. RNAP II has 


a carboxy terminal domain (CTD) of its largest subunit 
(Rpbl) with the consensus sequence l-YSPTSPS-7, 
repeated 52 times in human RNAP II. S2 (serine 2) is 
the target of PTEF-b, a cyclin-dependent kinase asso¬ 
ciated with processive elongation. TFIIH includes 
a cyclin-dependent kinase that targets S5 (and S7). 
The Ssu72 and Rtrl phosphatases remove the S5P 
(and S7P) marks. In early elongation, there appears to 
be a checkpoint at which the S5P (and S7P) marks are 
largely removed, the 5' end of the RNA is covalently 
capped, and PTEF-b is recruited to increase S2 ► phos¬ 
phorylation to support processive elongation. Proline 
isomerization regulates CTD modification and acces¬ 
sory factor binding. RNAP II, the CTD, and nascent 
RNA are interaction points for many regulatory elon¬ 
gation factors including processivity factors, RNA 
► splicing factors, and 3' end processing and termina¬ 
tion factors. Chromatin is differently modified and 
regulated near the promoter, immediately downstream, 
within the bulk of the gene, and near the gene 3' end. 
Highly active genes appear to orient promoters and 
gene 3' ends in close proximity, perhaps to facilitate 
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transcriptional reinitiation after termination. Some 
genes associate with the nuclear pore to facilitate 
mRNA transport. Often, knocking out a single elonga¬ 
tion factor is not sufficient to abrogate the elongation 
control proteome, indicating signihcant complexity 
and redundancy and therefore robustness of function. 
Different genes may regulate checkpoints differently 
and recruit factors in different orders utilizing different 
primary points of control, indicating that a universal 
model for all genes or for classes of genes may not be 
applicable. 

Biophysical Studies 
Kinetics 

Millisecond phase kinetic measurements have been 
done of RNAP elongation (Kireeva et al. 2009). 
This has been used as a means to characterize mutant 
RNAPs and elongation factors. ► Quench flow 
methods have been used to show that quenching 
with HCl and EDTA provides a different kinetics 
of elongation. HCl quenching shows the timing of 
phosphodiester bond synthesis, and EDTA quenching 
the timing of stable NTP-Mg^"^ sequestration, which 
precedes bond formation. These experiments show that 
the processive transition between the formation of one 
bond and the next can be rate-limiting and apparently 
highly NTP-dependent. ► Stop flow experiments show 
that pyrophosphate release is coupled to the templated 
loading of the incoming NTP substrate. 

FRET 

Because the non-template DNA strand is missing in 
X-ray structures, FRET probes have been used to 
determine the structure of the non-template DNA 
strand in TECs (Andrecka et al. 2009). FRET has 
also been used to analyze promoter escape by bacterial 
RNAP (Kapanidis et al. 2006). It appears that FRET 
could be used to measure the kinetics of thermal 
ratchet translocation by RNAP and the NTP- 
dependence of translocation, although at the time of 
writing this has not been reported. 

Molecular Dynamics Simulation 

Molecular dynamics is a means to extend the understand¬ 
ing and description of otherwise static and dehydrated 
X-ray crystal stmctures. Applied to RNAP structures, 
molecular dynamics shows long range conformational 


coupling in RNAP (Seibold et al. 2010; Huang et al. 
2010). Dynamics and bending of the bridge helix have 
been observed in RNAP with a closed trigger loop. 
Normal mode analysis and molecular dynamics simula¬ 
tion indicate that an open trigger loop favors transloca¬ 
tion and a closed trigger loop favors catalysis and resists 
translocation. Single subunit bacteriophage T7 RNAP 
has been analyzed through a translocation step. Opening 
of the active site induces a steric clash that facilitates 
forward translocation, which appears to be driven largely 
by thermal motions. 

Quantum Mechanics 

For DNAPs, quantum mechanics simulations have been 
reported to track the chemical step of the polymerization 
reaction. Presumably, a similar analysis could be done 
for RNAPs. Because DNAP and RNAP mechanisms are 
analogous, the quantum mechanical treatment, which 
involves core atoms, should be largely applicable to 
RNAPs. These studies indicate that chemistry is 
supported by extraction of the 3'-OH hydrogen, making 
the 3'-0 a more potent nucleophile to attack the alpha- 
phosphate of the NTP. Protons may jump to the catalytic 
Mg or to an ordered water and then to the pyrophos¬ 
phate leaving group to support chemistry. 

Drug Targets 

Bacterial RNAPs have been shown to be inhibited by 
a number of antibiotics including: (1) rifampicins, 
(2) streptolydigin, (3) Myxopyronin, (4) mycrocin 
J25, and (5) tagetatoxin. Rifampicin is a mainline phar¬ 
maceutical against human tuberculosis. Eukaryotic 
RNAP II is potently inhibited by alpha-amanitin, 
a toxin derived from the death cap mushroom. These 
drugs have been important mechanistic and structural 
probes for RNAPs (Brueckner et al. 2009a, b). 

Mutagenesis 

Extensive mutagenesis is reported of RNAPs (Trinh 
et al. 2006; Hein and Landick 2010). Recently, the 
“RNAP factory” automated approach for saturation 
mutagenesis at particular amino acid positions (all 
19 possible substitutions at a position) was reported for 
segments of the bridge helix and trigger loop of an 
archaeal RNAP. Mutations are identified that affect 
many aspects of RNAP function including: (1) transcrip¬ 
tion initiation; (2) elongation, pausing, and termina¬ 
tion; (3) fidelity; (4) inhibitor response; and (5) factor 
binding. 
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Summary 

Multi-subunit RNAPs are among the largest and most 
dynamic biological molecules known, and RNAPs are 
powerful molecular motors with surprising properties. 
RNAPs utilize a double-stranded DNA, which is 
unwound to expose one strand to act as the template 
DNA, so RNAPs unwind and rewind DNA. RNAPs are 
highly processive, and the TEC is exceptionally stable. 
The 8-9 nt RNA-DNA hybrid is essential for the TEC 
stability. DNA bends at nearly a right angle within the 
TEC. RNAPs, therefore, tightly bind and bend DNA. 
A complementary RNA strand is synthesized using 
NTP substrates by a 2-Mg mechanism. RNA chains 
are synthesized with exceptionally high fidelity, 
so NMP misincorporation and frameshift errors are 
uncommon events, and genetic information is tran¬ 
scribed faithfully during synthesis. Eidelity is achieved 
by the correct substrate selection in the active site and 
post-incorporation editing mechanisms. Because of the 
very large size of RNAPs, these enzymes present 
a challenge for structural studies, biophysical tech¬ 
niques, and dynamic modeling. RNA synthesis is 
highly regulated, and protein-protein, protein-DNA, 
and protein-RNA interactions hgure prominently in 
RNAP control. The RNAP system, therefore, presents 
opportunities and challenges for biophysicists. 

Cross-References 

► DNA Polymerase 

► FRET 

► Elistone-Lysine A-methyltransferase 

► mRNA Localization 

► Phosphatases - Computational Studies 

► Protein Phosphorylation 

► Quenched-Flow Methods 

► Single-molecule methods 

► Splicing 

► Stopped-Flow Techniques 
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Rotating Cross-Bridge Model 

Dilson E. Rassier 

Departments of Kinesiology and Physical Education, 
Physics and Physiology, McGill University, Montreal, 
QC, Canada 

Synonyms 

Actomyosin cycle; Cross-bridges; Molecular motors 

Definition 

The rotating cross-bridge model describes the interac¬ 
tion between myosin and actin, and the conversion of 
energy derived from ATP hydrolysis into mechanical 
work. 

Basic Characteristics 

The rotating cross-bridge model was initially devel¬ 
oped to explain the mechanisms of muscle 


contraction (Huxley 1957) and represents the first 
attempt to describe the interaction between myosin 
and actin filaments. In the original model, myosin 
was assumed to attach to actin and form cross¬ 
bridge between the filaments. Once attached, the 
cross-bridges would cause sliding of the actin fila¬ 
ment and force production. The model was subse¬ 
quently refined by including transitions in the 
myosin states once attached to actin (Huxley and 
Simmons 1971), similar to conformational changes 
that are observed currently in many molecular 
motors. Such conformational changes were 
achieved by the rotation of an elastic element 
within the myosin cross-bridges due to strain during 
the power stroke (Huxley and Simmons 1971). 

Currently, the cross-bridge model provides 
a framework for the interpretation of most linear 
molecular motors, including different classes of myo¬ 
sin, kinesin, and dynein. Detailed explanations of indi¬ 
vidual steps of the cross-bridge cycle are given in other 
articles of the Encyclopedia of Biophysics (► ATP- 
Driven Mechanical Work Performed by Molecular 
Motors; ► Myosin Work and Motility: Mechanism); 
the reader is encouraged to look for these 
articles. Eurthermore, there are reviews (Elinas 
et al. 2012; Sweeney and Houdusse 2010) and 
comprehensive books (Howard 2001; Woledge 
et al. 1985) with in-depth information of the acto¬ 
myosin cycle. 

The rotating cross-bridge model has three 
main components, which were introduced over the 
years in parallel with the development of techniques 
and experimental methods that allowed scientists to 
advance our understanding of muscle contraction and 
motility of molecular motors (Howard 2001). 

1. The initial cross-bridge model, that includes the 
myosin attachment to actin and the myosin power 
stroke, describing how myosin generates force and 
slides the actin filaments (Huxley 1957; Huxley and 
Simmons 1971). 

2. The biochemical steps behind the ATPase cycle, 
which describes how the nucleotides resultant 
from ATP hydrolysis (P; and ADP) control the 
attachment and detachment of myosin to actin 
filaments (Lymn and Taylor 1971). The mechanical 
attachment of myosin to actin catalyzes a chemical 
step (ATP breakage and release of P; and ADP), 
which in turn regulates the unbinding of myosin 
cross-bridges from actin. Although myosin 
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Rotating Cross-Bridge 
Model, Fig. 1 Structural 
basis of the myosin power 
stroke, showing high- 
resolution structures of 
myosin and actin (Rayment 
et al. 1993b) in the pre-power 
stroke state (before P; and 
ADP release), the rigor state, 
and the post-rigor state (after 
a new molecule of ATP hinds 
to myosin). The myosin 
attaches to actin, and after 
release of P; and ADP, it 
induces changes in the 
converter domain situated in 
the light chain region of the 
molecule - the basis for the 
myosin power stroke. Then 
ATP binds to myosin forcing 
detachment from actin. After 
repriming of the lever arm, the 
myosin is again ready for 
a new attachment cycle 
(Adapted from Sweeney and 
Houdusse (2004), with 
permission). RLC regulatory 
light chains, ELC essential 
light chains 



hydrolyzes ATP in the absence of actin, myosin- 
actin interaction accelerates the release of the P; and 
ADP, responsible for the power stroke and force 
generation. 

3. The swinging lever arm mechanism and mechanical 
myosin cycle, which describes how structural 
changes in small regions of the myosin molecule - 
closely associated with the ATP hydrolysis sites - 
are largely amplified into conformations of the 
entire myosin molecule (Rayment et al. 1993a, b) 
(Fig. 1). The description of the crystal structure of 


myosin was determinant for the connection 
between the cross-bridge cycle, ATPase kinetics, 
and structural conformational changes that happen 
in the myosin molecule. 

The interconnection between the cross-bridge 
cycle, the ATPase cycle and the conformational 
changes in myosin molecules (Fig. 2) provides the 
basis for motility of many molecular motors. These 
features have been incorporated successfully in motil¬ 
ity models for myosin, kinesin, and dynein (Howard 
2001 ). 
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Rotating Cross-Bridge Model 
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Acto-myosin rigor state 



Pi release state 

Product 

release 



Acto-myosin:ADP:P| 
Pre-powerstroke state 




ATP 'J actin 
binding detachmsnt 


Myosin:ATP 
Post-rigor state 


Actin rebinding 


Repriming 

„ I 


ATP 

hydrolysis 


Myosin:ATP / ADPiP, 
Pre-powerstroke state 


I 

Myosin:ATP 
actin detached 
state 



Rotating Cross-Bridge Model, Fig. 2 Biochemical and struc¬ 
tural states in the actomyosin cycle (figure from Llinas et al. 
(2012), with permission). In the central panel, a schematic rep¬ 
resentation of the ATPase cycle proposed by Lymn and Taylor 
(1971) is shown. In the outer panel, a structural view of the 
myosin-actin cycle for the corresponding ATPase cycle is 
shown, (a) In the rigor state, the myosin motor domain (gray) 
is strongly bound to the actin-filament (blue arrow). The myosin 
lever arm is composed of the converter sub-domain (green) and 
two light chains: essential light chains (ELC, red) and regulatory 
light chains (RLC, light violet). The helix spanning the motor 


domain (dark blue helix) and controlling the converter orienta¬ 
tion is straight in this state, (b) When ATP binds to myosin, it 
induces a conformational change causing the myosin motor 
domain to release from the actin filament. Then myosin 
undergoes a recovery stroke, repriming the lever arm. (c) After 
ATP hydrolysis, a conformational change in the myosin head 
closes the nucleotide-binding pocket, necessary for ATP hydro¬ 
lysis. The relay helix adopts a kinked conformation. The myosin 
head then rebinds to the actin filament, which induces release of 
Pi and ADP. Force is generated, causing displacement of the 
actin filament 


Cross-References 

► ATP-Driven Mechanical Work Performed by 
Molecular Motors 

► Dynein Family Classification 

► Kinesin Superfamily Classification 

► Myosin Family Classification 

► Myosin Work and Motility: Mechanism 

► Myosin; Fundamental Properties and Structure 
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Saccharide Surfactants 

► Carbohydrate Surfactants 

Sampling Methods in Infrared 
Spectroscopy 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

SAMT Mitochondrial 5-Adenosyl- 
Methionine Transporter 

► Mitochondrial Transport Protein Family 

SANS 

► Small Angle Neutron Scattering 

► Small Angle Neutron Scattering (SANS) Software 

SAR by NMR 

Robert Powers and Jennifer C. Copeland 
Department of Chemistry, University of 
Nebraska-Lincoln, Lincoln, NE, USA 

Synonyms 

NMR; Structure-activity relationship by NMR 


Definition 

Structure-activity relationship (SAR) by NMR is 
a technique developed in 1996 by Stephen Fesik at 
Abbot Laboratories. SAR by NMR is the first experi¬ 
mental demonstrati on of the ► fragment-based 
approach to dmg discovery. The method uses ► NMR 
spectroscopy to probe the surface area surrounding 
a protein’s active site for ligand binders. A small, stmc- 
turally diverse chemical library is screened by NMR 
with the goal of identifying ligands that bind proximal 
to each other in the protein’s active site. Chemically 
linking these small chemical building blocks is generally 
expected to yield potent molecules that can be evolved 
into a dmg. SAR by NMR is considered to be a protein- 
targeted approach to ► NMR dmg discovery since the 
protein’s NMR spectrum is monitored for evidence of 
ligand binding. The target protein restricts the applica¬ 
tion of SAR by NMR since the method is limited by 
protein size (typically <40 kDa), requires a few hundred 
milligrams of pure, *^N-labeled protein, and is depen¬ 
dent on the availability of a protein structure and NMR 
assignments. Additionally, the function and active site 
of the protein are typically known. In its original incep¬ 
tion, SAR by NMR utilized mixtures of fragments (com¬ 
pounds <300 kDa) at high concentrations that would 
weakly interact (Kd > 100 pM) with a *^N-labeled 
protein. Two-dimensional *H-*^N heteronuclear single 
quantum coherence (HSQC) spectra were obtained to 
observe a binding event. 2D *H-*^N HSQCs are rela¬ 
tively rapid NMR experiments that provide a peak for 
each amide (*H-*^N correlation) or amino acid, except 
proline, in a protein. The relative position (chemical 
shift) of each peak is extremely dependent on the local 
environment of the amide (amino acid). Thus, a change 
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Scalar Coupling 




in the local environment due to the close proximity of 
a bound ligand will result in a chemical shift change, 
which can be mapped onto the protein structure in order 
to determine the binding site for the first fragment. 
A second binding site is determined by either 
performing the NMR fragment-based screen in the pres¬ 
ence of the first fragment or by using the original binding 
information to find a second binding site proximal to the 
first. NMR, X-ray, or modeling are then routinely used 
to generate a co-structure of the protein bound to each 
fragment. The structural information is used to design 
a linker that combines both fragments while maintaining 
their proper orientation for optimal binding to the pro¬ 
tein. The new compound derived from the linked frag¬ 
ments results in a higher binding affinity (Kq < 1 pM) 
since the binding energy is the sum of the binding 
energies from each individual fragment. The fragments 
can also be chemically modified to optimize their indi¬ 
vidual binding affinities. A fragment-based library with 
chemical linking dramatically increases the coverage of 
stmctural space. While a fragment-based library is typ¬ 
ically small (<10^), the number of virtual compounds 
searched by SAR by NMR is large (>10*^) since it is 
a product of the number of compounds, linkers, and 
subsites within the ligand-binding site. 

Cross-References 

► Fragment Screen 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR in Drug Discovery - Introduction 

► Protein NMR - Introduction 


Scalar Coupling 

► J-Based NMR Correlation Spectroscopy of Biolog¬ 
ical Solids 

► J Coupling 


Scanning Electron Microscopy 

► Electron Microscopy: Classical Sample Preparation 


Second Harmonic Generation 
Microscopy (SHG) 

Paolo Bianchini and Alberto Diaspro 

Department of Nanophysics, Italian Institute of 

Technology(IIT), Genoa, Italy 

Department of Physics, University of Genoa, Genoa, 

Italy 

Synonyms 

Frequency doubling microscopy; Second harmonic 
imaging microscopy; SHG; SHIM 

Definition 

Second harmonic generation microscopy is a label 
free nonlinear imaging technique. Biological mole¬ 
cules such as collagen, myosin, tubulin, starch are 
some of the targets for this method due to their 
structure. 

Basic Characteristics 

Aside from two-photon excitation fluorescence 
microscopy (2PEFM), that is the primary imaging 
technique for thick tissue imaging, second harmonic 
generation (SHG) microscopy can also be used to 
image specific kinds of biological samples. SHG imag¬ 
ing was one of the earliest forms of biological 
nonlinear microscopy, proposed and demonstrated 
decades ago (Gannaway and Sheppard 1978). SHG is 
a two-photon coherent nonlinear optical process that 
arises from polar molecules ordered in noncentrosym- 
metrical assemblies. It is known that several endoge¬ 
nous proteins exhibit SHG (Zoumi et al. 2002). The 
orientation of collagen fibers within tissues such as 
tendons, ligaments, bones, or tumors is of primary 
importance. Since SHG is a nonlinear process, it pro¬ 
vides intrinsic optical sectioning to point-scanning 
microscopy, high in-depth penetration (up to 
500 pm), while drastically reducing out-of-focus 
photobleaching and phototoxicity (Campagnola and 
Loew 2003). SHG is therefore well suited for in situ 
and in vivo studies. Nevertheless, the interpretation of 
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the contrast is somewhat tricky as it results from the 
combination of various local second-order nonlinear 
properties (local symmetry and organization), orienta¬ 
tion of the microstructure with respect to both the 
direction of propagation and polarization of the inci¬ 
dent beam (Chen et al. 2012). Quantitative determina¬ 
tion of the orientation fields in biological tissues is at 
the basis of characterization and of understanding of 
the general anisotropic properties that regulate 
mechanical and rheological properties, cell prolifera¬ 
tion and migration, diffusion. 

The response of the material medium is specified by 
means of the polarization P and depends on the ampli¬ 
tude E of the electric field of the applied optical wave. 
Under the simplest circumstances, this relationship can 
be expressed in the time domain as 

P{t) = £o + .. .j (1) 

whereis the linear susceptibility is the second- 
order susceptibility is the third-order susceptibil¬ 
ity, etc. Second harmonic generation occurs as a result 
of the second-order response described by 

In 2PE fluorescence microscopy, the emitted signal 
is determined by the intensity profile of the excitation 
beam. In SHG, the phase plays a crucial role in the 
characteristics of the emitted signal and cannot be 
overlooked. Near the focus region, produced by 
a high numerical aperture objective, the phase fronts 
do not appear to travel as quickly as in a collimated 
beam. As a result of this phase lag, called a Gouy’s 
shift (Chen et al. 2012), the SHG from a focused beam. 


instead of propagating on-axis, now propagates off- 
axis and actually also back. This may be explained by 
momentum conservation (or, in nonlinear optics par¬ 
lance, phase-matching). It should be emphasized that 
the SHG angular pattern can be different depending on 
the order and distribution of the radiating molecules 
(Odin et al. 2008). 


Backward and Forward SHG 

A century ago, Gustav Mie discovered that objects 
extended along the optic axis (z-direction) for 
a distance on the order of the emission wavelength 
exhibit linear scattering that is forward directed. 
This principle holds true for SHG (Mertz and Moreaux 
2001; Zoumi et al. 2002) and other nonlinear processes 
as well. It results from the fact that extended scatterers 
emitting synchronously with a forward-going 
fundamental generally remain phase matched only in 
the same forward-going direction. However, backward 
SHG generated signal can be collected (Diaspro et al. 
2002; Zoumi et al. 2002). The phenomenon is 
somewhat complicated with the use of a highly 
focused illumination beam, but the general 
rule remains (Bianchini and Diaspro 2008, 2009) 
(Fig. 1). 

Emitted SHG is generally anisotropic because of 
phase-matching constraints of coherent scattering. 
E/B depends primarily upon the axial extent of 
the scatterer as compared to the illumination 
wavelength. 



SHG backward 


SHG forward 


Fluorescence 


Second Harmonic Generation Microscopy (SHG), 
Fig. 1 SHG and TPEF images of a section of a mouse tendon. 
The first line shows a lateral view of the specimen, a three 
channels acquisition and the overlay. The second line is the 


axial view taken in the center of the above images. The excita¬ 
tion wavelength is 860 nm. The SHG differences between back¬ 
ward and forward are clearly visible (Bianchini and Diaspro 
2008) 
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Microscope Design 

Since the SHG emission directions could bring 
additional information to the bulk image, both the 
backward and forward directions (BSHG, FSHG) 
can be usefully exploited for imaging purposes. 



Second Harmonic Generation Microscopy (SHG), 
Fig. 2 Simplified scheme layout of a typical nonlinear' optical 
laser scanning microscope 


Confocal microscope is usually not designed and 
optimized to perform SHG experiments, hence 
some small changes should be done to the optical 
setup. A good starting point is the possibility of 
adapting a spectral confocal microscope since 
effective spectral separation is one of the key of 
success for SHG imaging. 

Figure 2 shows a general optical scheme of a second 
harmonic microscope setup. 

Keeping in mind that the SHG wavelength is 
always half the illumination wavelength, the 
FSHG signal is collected exploiting the transmis¬ 
sion pathway of the microscope. This channel has 
to be endowed of a stop hlter avoiding transmitted 
excitation light and a high quality narrow band 
filter that selects FSHG signal against fluorescence 
or autofluorescence. This mode allows collecting 
most part of the generated SH signal at least in 
the case of thin specimens (less than 500 mm). 
Usually a common microscopy photomultiplier is 
appropriate to obtain high quality images at appre¬ 
ciable signal to noise ratio. 

The BSHG is acquired using the confocal laser 
scanning head. The wavelength of the light collected 
by each channel is selected by a spectrophotometer, 
avoiding the use of filters and enabling spectral vali¬ 
dation (Fig. 3). 



Second Harmonic Generation Microscopy (SHG), 
Fig. 3 Frontal section of a medial collateral ligament in the 
rat knee. The images show a mosaic obtained by nonlinear 
optical microscopy technique. The specimen was excited at 
880 nm. The two-photon excited fluorescence emitted by 
nonflbrillar material was acquired in the epi-channel in the 


spectral range from 470 to 600 nm and is displayed in blue. 
The second harmonic signal generated by collagen fibers was 
acquired in the transmission and epi-channel, both in the spectral 
range from 430 to 440 nm and they are displayed in green and 
red, respectively (Franchi et al. 2010) 
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Cross-References 

► Sum Frequency Generation Vibrational 
Spectroscopy 

► Two-Photon Excitation Fluorescence Microscopy 
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Sedimentation Equilibrium Analytical 
Ultracentrifugation 

Arthur J. Rowe 

National Center for Macromolecular Hydrodynamics, 
School of Biosciences, University of Nottingham, 
Sutton Bonington, Leicestershire, UK 

Synonyms 

Analytical ultracentrifugation 

Definition 

The observation and interpretation of the equilibrium 
distribution of solute particles in a centrifugal field. 

Introduction 

Sedimentation equilibrium analysis is performed on 
a solution of macromolecular solute particles, using an 
Analytical Ultracentrifuge, in which the solution to be 
analyzed is placed in a cell located in a rotor which can 
be accelerated to speeds of up to 60,000 revolutions 
per minute (r.p.m). The cell has high-quality plane 
windows - optical quartz or sapphire - permitting the 
employment of optical systems to record the distribu¬ 
tions obtained under the g-force employed. The 
geometry of the cell in which the solution is placed 
and the available modes of visualization of the solute 
distribution are identical to those employed for sedi¬ 
mentation velocity analysis (► Sedimentation Velocity 
Analytical Ultracentrifugation). The single, most 
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conspicuous difference between the two methods lies in 
the rotor speed employed. In sedimentation velocity 
analysis, the aim, at least initially, is to maximize the 
resolution in detection of the range of species present, 
hy employing the highest rotor speed attainable: 
although for higher molecular weight (molar mass) 
samples, whose rate of sedimentation may he excessive, 
the rotor speed will be deliberately moderated, in order 
to permit sufficient time for accumulation of data, as 
a set of “scans” logged to disc through an online 
computer at fixed time intervals. In contrast, for sedi¬ 
mentation equilibrium analysis, a rotor speed is 
employed which for the range of solute molecular 
weights under study allows for the sedimentation poten¬ 
tial at any radial position in the cell to become equal to 
the chemical potential at that position, and under these 
conditions the solute distribution patterns become time- 
invariant. This, hy definition, means that a state of 
equilibrium has been attained, and the system can then 
he analyzed on the basis of classical chemical thermo¬ 
dynamics. A practical issue, discussed below, is that this 
equilibrium state can only be achieved within a sensible 
timescale if the height of the solution column is limited 
to around 20% or less of the cell capacity as employed 
for velocity analysis (i.e., to around 80 pL as compared 
to 400 pL). 

Historically, the development of the sedimentation 
equilibrium approach had lagged behind that of the 
velocity method (Svedberg and Pedersen 1940). This 
may be related to the fact that the need for the use of 
short columns of solution had yet to be appreciated. 
Primarily the method was used, prior to the 1970s, for 
the estimation of molecular weight values for biolog¬ 
ical macromolecular solutes and their complexes. For 
this purpose there were obvious attractions: the method 
was “free solution” based, without dangers inherent in 
matrix interactions; and it was an “absolute” method, 
calling for no calibration routines, and having a sound, 
thermodynamic basis. From the 1960s onward, 
however, it became clear that the interaction of solute 
components (both homo- (A -i- A) and hetero- (A -i- B), 
and also with low molecular weight ligands or 
effectors) could be studied very effectively by this 
technology, especially as more and more the solutes 
being studied were the products of recombinant DNA 
technology, and hence their molecular weight values 
were known parameters for a system under study. In 
current practice, a whole range of interaction 
parameters, both simple associative (Law of Mass 


Action - Ka, the “association constant”) and thermo¬ 
dynamic (“virial” terms) can often be extracted from 
sedimentation equilibrium data. The range of problems 
that can be tackled using sedimentation equilibrium is 
now large and growing. For full details of these 
methods the reader is referred to the range of 
a general and specialized review, a selection of which 
is given below (Laue and Stafford 1999; Lebowitz 
et al. 2002; Schuster and Laue 1994; Scott et al. 2005). 

The Theoretical Basis of Sedimentation 
Equilibrium 

The equation describing the flux of solute at any radial 
position r is given by the “Lamm Equation” 

(5cr/dt)j. = -(l/r){a/dr[sa)^r^Cr - Dr(5cr/dr)J}( 

( 1 ) 

where s is the sedimentation coefficient of the solute, 
(B the angular velocity of the rotor (radians per second), 
Cf the concentration of solute at radial position r, and 
D its translational diffusion coefficient. This equation 
is valid for a single solute, non-self-associating, and at 
infinite dilution. Since the ratio s/D defines (for a solute 
of known partial specific volume) the molar mass 
(commonly referred to as the molecular weight, M), 
it follows that numerical methods which can fit the 
above equation to a data set in (Cr, r) must yield an 
estimate for M. As is described below it is actually 
currently possible to achieve this, under conditions 
where the above conditions for validity are obeyed. 

However, the “classical” approach to extracting 
estimates for M (and other parameters) has relied 
upon the observation that since equilibrium is defined 
by zero flux at all radial values, then within the inner 
bracketed term in Eq. 1 the two terms present must be 
numerically equal, that is, the sedimentation potential 
must equal the chemical potential at all r;. This equality 
can then be recast in a number of ways, most usually as 

Cr = Ca exp[0.5a(r^ - r^)] (2) 

where 

cj = M(l - vp)«)VRT (3) 

a dimensionless parameter related to the buoyant mass 
of the particle, and (Ca, ta) are a reference data pair. 
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which have often been dehned as referencing the radial 
value of the solution meniscus, but may in fact be at 
any user-defined position, R is the gas constant, v is 
the partial specific volume of the protein, p is the 
solvent density (not the solution density - this has 
been proved), and T the temperature (°K). 

Equation 2 is of course limited in its validity in the 
same way and in respect to the same manner as is 
the Lamm Equation (Eq. 1). While for many solutes 
(e.g., “globular” proteins) the departure of their dilute 
(<1 mg/mL) solutions from non-ideality can be 
neglected, we need to note that there are (a) other sol¬ 
utes, usually of extended shape, for which this assump¬ 
tion cannot be made; and (b) there are macromolecular 
solutes - including simple “globular” proteins - which 
can self- or cross-interact, even at high dilution. This has 
led historically to the use of the concept of an “apparent 
molecular weight” (Mapp), which is simply the opera¬ 
tional result of applying a basic equation (Eq. 2) to a data 
set acquired, without regard as to whether this is a valid 
procedure or not. Thus for example, if significant ther¬ 
modynamic non-ideality is present, then we may write, 
for a single, non-self-interacting solute component, 
following classical approaches 

(l/Mapp) = (l/M)*(l-b2BMc) (4) 

where B is a “second virial coefficient,” and the 
product 2BM a second virial term. 

In more general terms, this should be written for 
a single species (Wills and Winzor 2005) as 

(1/M,pp) = (l/M)*(l-f2Bi,iMc-f 3CMc-f ...) 

(5) 

where the subscript to the second virial term signifies 
fhaf this is a 1-1 interaction, arising from both solute 
volume (always) and solute charge effects (often, espe¬ 
cially at low solvent ionic strength). The coefficient 
C (and higher order) terms arise from multiple, 
non-contact interactions. In all these cases, the sedi¬ 
mentation potential is considered to be unchanged for 
the single solute species defined, as weight can never 
be a concentration-dependent parameter. Thus in prac¬ 
tice the a term in Eq. 2 must be modified by the 
incorporation of the “non-ideality” term given in 
Eq. 5, or its more basic form (Eq. 4): 

c, = Caexp[0.5(a/(1 4- 2BMCr))*(r^ - r^)] (6) 


We see an immediate problem with fitting any 
equation of this type, in that it is transcendental in 
form, since the variable c^ appears in both sides of the 
equation. Such equations are incapable of explicit 
solution, and can only be htted using nonlinear 
algorithms by a process of iteration. This approach is 
hazardous for the present system, where the error sur¬ 
face is notoriously uneven. The problem becomes even 
more intractable if multiple species are present. 
Algebraic approximations have, therefore, had to be 
employed to make the retrieving of parameters under 
non-ideal states feasible. These are briefly discussed 
below. 

How Sedimentation Equilibrium Runs Are 
Performed 

The two optical systems available in the Beckman 
XL-I instrument (uv/visible light absorption and Ray¬ 
leigh interference) are both widely used in this context, 
and since the final distribufion whose details are to be 
logged is time-invariant, there is no handicap in 
recording the distribution by both optics. This being 
said, for a solute having no usable optical density, there 
would be no point in so doing. On the face of it, the 
precision in signal attainable by Rayleigh interference 
optics is considerably superior to that from absorption 
optics - as shown, for example, by Ang and Rowe 
(2010). But these authors also show that the effect 
of the “shot noise” in the basic signal (fringe 
displacement/optical density) is of surprisingly little 
importance in comparison to knowledge of the abso¬ 
lute level of the signal. In particular, the fact that the 
optical baseline (TI - Time Independent signal) is an 
unknown parameter in interference optics, whereas it 
can be defined wifhin close limifs using absorption 
optics, gives the latter a built-in advantage. Eigure 1 
illustrates the difference between the data recorded by 
the two different optical systems. 

The procedure is then that AUC cells are loaded 
with a suitable volume of solution, to give a column 
height of 3 mm or less, with an amount - carefully 
matched, especially for interference optics - of 
solvent. Nomograms are available which indicate an 
optimal rotor speed, and various computational aids 
indicate an approximate time by which the signal at 
any given radial position will have become time- 
invariant. Einal scans are then taken and logged to 
disk, prior to analysis by whatever algorithm/program 
is most appropriate. 
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Sedimentation Equilibrium Analytical Ultracentrifugation, 
Fig. 1 Illustration of the mapping of a solute concentration 
difference in an AUC cell at the end of a sedimentation equilib¬ 
rium experiment. Using absorption optics, a single trace is 
obtained (shown in red), and the baseline can in most cases be 
taken to be very close to the electronic zero, that is, the y = 0 line 


in the illustration. Hence “true” values of the absorption at given 
radius are known. In contrast, interference optics yield a family 
of parallel fringes, whose phase shift in the y-direction with 
respect to radius indicates the relative changes in solute concen¬ 
tration, but which do not allow the absolute fringe increment at 
any radial value to be computed 


The Analysis of Sedimentation Equilibrium Data 

Standing historical sequence on its head, we first 
consider the analysis of a (Cf, r) data set via the use of 
fitting through numerical methods of the Lamm 
Equation (Eq. 1). This derives basically from the use 
of such methods in the analysis of sedimentation 
velocity data, implemented in the software SEDEIT 
(see, e.g., Schuck 2002). This approach owes much 
of its success and widespread adoption to the fact that 
a very large number of successive scans (typically 
^300) can be analyzed in a single analytical process, 
that Is, the system is “information-rich.” In contrast, 
the equilibrium data is a single data set, by definition, 
and reduction of noise - of whatever origin - can only 
be effected to a limited degree, by globally fitting data 
from a number of final patterns at different solute 
concentrations and rotor speeds. Ideally, one would 
like to fit the entire approach to equilibrium, but thus 
far this has defied attempts to derive an analysis which 
could be the basis of an algorithm. Eor dilute systems, 
where thermodynamic and hydrodynamic non-ideality 
effects can be disregarded, the SEDPHAT software 
based upon this approach (Vistica et al. 2004) is simple 


to use, stable, and involves no numerical approxima¬ 
tions beyond those inherent in the fitting of Lamm 
Equation solutions. However, if one has to consider 
the possibility of the presence of significant 
non-ideality, then problems arise. Both hydrodynamic 
and thermodynamic non-ideality must be considered 
(the former for the sedimentation coefficient, s, term 
in Eq. 1 , and both for the term in the diffusion coeffi¬ 
cient, D). In neither case is there uniformly accepted 
theory available, and to float values of coefficients in 
semi-empirical algebraic terms increases the number 
of floated parameters to a level which may be 
unsupportable. 

Reverting then to approaches based upon the fitting 
of Eq. 2 and its many modifications, the most widely 
used software has been the NONLIN software 
(Johnson et al. 1981), found in the Beckman ORIGIN 
software supplied with the instrument (McRorie and 
Voelker 1993). Obviously, to define the molecular 
weight of a monodisperse single solute one simply 
fits directly using Eq. 2, normally with a baseline offset 
(E) specified as a parameter (essential for interference 
optics). Figure 2 shows a typical fit, to a distribution of 
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Sedimentation Equilibrium Analytical Ultracentrifugation, 
Fig. 2 A fit using an IDEAL 1-type function to data obtained 
using interference optics from a preparation 1.0 mg/mL of 
RNAse A. Error analysis was performed, with 500 iterations, 
confidence intervals 68.3%. The parameter distribution shows an 
estimated best fit value of a = 1.402 ± 0.003 (theoretical value 
a = 1.392). Unpublished analysis from a study by Ang and 
Rowe (2010) 

RNAse at a concentration (1 mg/mL) where non¬ 
ideality is minimal. The residuals - which show no 
systematic trend with respect to radial position - 
indicate a good ht. The slight (positive) error in a is 
in agreement with the known fact that a very 
small amount of reversible dimer is present (Ang and 
Rowe 2010). 

To analyze systems where dynamic equilibria may 
be present, NONLIN uses an ingenious approach, in 
which the monomer (not the total) concentration at 
a reference radial position is a variable parameter. 
This leads to a fitting function in which up to three 


Ka values can be fitted, at least under optimal condi¬ 
tions. For a monomer-dimer equilibrium, for example, 
the relationship used is given by 

Cr = c„,a exp[0.5*c(r2 - r^)] 

+ c^,Ka,2exp[0.5*a(r2-r2)] 

where c^ a is the monomer concentration at the refer¬ 
ence radius a, and Ka _2 is the equilibrium constant. 
A further additional term is added for each additional 
species, identical to the second term in Eq. 7 but 
written with the order of the term in c^ a raised 
to n (for an n-mer association) and the appropriate 
Ka value indicated. 

This approach enables a range of interactions to be 
incorporated into the treatment, and hence a range of 
models to be tested. Caution is needed, however. Sim¬ 
ply to add more and more parameters and then look for 
the “best distribution of residuals” is not valid, since 
“more parameters = better ht” is a general observation 
in curve-htting experience. Optimal practice is to 
evaluate by “bootstrapping” in a full error analysis 
the extent to which sensible parameter distributions 
(cf. Figs. 2 and 3) are obtained for all of 
the Ka parameters and the other parameters employed. 
Modern curve-htting software increasingly makes this 
operation very simple to perform. 

The treatment of thermodynamic non-ideality in 
NONLIN and similar software is achieved by an 
approximation, in which the second virial coefficient 
(B) appears in a term additive to that associated with 
contact interaction(s). This means that even for 
a simple monomer-dimer interaction one cannot hoat 
both interaction terms (K^ and BM). However, so long 
as the former term is an order of magnitude or more 
greater than the latter - which will normally be the case 
for “globular” proteins at least - then hxing the B value 
at a level typical for such is solute will be adequate so 
far as estimating K^ is concerned. 

For ultra-weak interactions however (K^ >0.1 mM, 
where K^ is the dissociation constant), when the 
contact-interactive and the thermodynamic interactive 
terms are of a similar order of magnitude, then 
approaches such as NONLIN cannot be applied (see 
Rowe 2011). A solution in this case arises from the 
observation that although the family of transcendental 
equations such as Eq. 6 cannot be fitted directly, 
they can easily be fitted indirectly via their 
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Fig. 3 A fit using the INVEQ 
algorithm to data obtained 
using interference optics from 
a preparation ~2.3 mg/mL of 
YFP. Error analysis was 
performed, with 500 
iterations, confidence intervals 
68.3%. The parameter 
distributions show S.D. values 
of ±11% for the 
thermodynamic interaction 
term BM, and ±13% for K^. 
Unpublished analysis from 
a study by Zeng et al. (2006) 




(non-transcendental) inverse - the INVEQ algorithm. 
Figure 3 illustrates typical results from an analysis of 
this kind, applied to the self-association of Yellow 
Fluorescent Protein (YFP). 

Summary 

Sedimentation equilibrium analysis shares with 
sedimentation velocity analysis the advantage of 
being a “free solution” method, in which matrix inter¬ 
actions can be neglected. Its fundamental theoretical 
base is simple, and soundly based in thermodynamics. 
While it operates in a less data-rich environment that 
does its counterpart sedimentation velocity mode, it 
requires a lower sample loading volume, which in turn 
enables the use of multi-channel cells. Thus, it is pos¬ 
sible to perform up to 24 analyses in a single run in an 
8-hole rotor, as compared to 7/8 only using velocity 
mode, and while run times are longer, many equilib¬ 
rium runs are performed overnight. A further 
advantage lies, as we have seen, in the way in which 
ultra-weak interactions (Kj >0.1 mM) can be studied 
on a routine basis, at a level of sensitivity attained by 


no other biophysical method, and all interaction stud¬ 
ies based upon equilibrium data are likely to be free of 
pressure effects, on solute and solvent alike, arising 
from the much higher g-forces and longer solution 
columns needed for velocity work. 

The range of systems which have been and are 
being studied, and the modifications of the basic 
technology required, is vast. Topics such as nucleic 
acid-protein binding via multi-wavelength scanning, 
solvent density matching to assess amounts of bound 
ligand, floatational equilibrium analysis of low-density 
solutes, rapid analysis of unstable samples via 
approach-to-equilibrium monitoring, assessment of 
activation energies via multiple temperature 
analysis, the description of the various average weight 
distributions for solutions of polymers including car¬ 
bohydrates and mucus glycoproteins (Harding and 
Rowe 2010), the use of fluorescence optics to probe 
inter alia the behavior of labeled solute in crowded 
solutions - all these and other approaches can be found 
described in the general reviews cited. Sedimentation 
equilibrium analysis is of course not by definition 
suited to the analysis of reaction kinetics. But as 
a “AG” method, it well complements results 
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from a “AH” method such as ITC. And - not to be 
forgotten - there are still large complexes of proteins 
especially whose structure may be suggested by mod¬ 
ern electron microscopic methods, but whose mass 
analysis can be problematic for mass spectroscopy. 
Yet these yield to the simplest approach of all for the 
presently described methodology. 


Cross-References 

► Sedimentation Velocity Analytical 
Ultracentrifugation 
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Synonyms 

Analytical ultracentrifugation 

Definition 

The experimental observation and analytical interpre¬ 
tation of the temporal evolution of spatial distributions 
of dissolved particles in a centrifugal field. 

Basic Principle 

Sedimentation velocity (SV) experiments always take 
place in an ultracentrifuge, usually starting with 
a spatially uniform initial equilibrium of dissolved 
particles in a solution volume that forms a cylinder 
segment from a “meniscus” air-water interface closest 
to the axis of rotation to a “bottom” radius, which is the 
highest radius of the solution column defined by the 
sample holder (Fig. 1). In a basic SV configuration, 
experiments are conducted such that the start of the 
centrifugation will place the system far from its cen¬ 
trifugal equilibrium, essentially allowing one to visu¬ 
alize the free fall of the ensemble of particles 
and analyze the dynamics of this process. (This is 
in contrast to sedimentation equilibrium analytical 
ultracentrifugation, which is concerned with 
the thermodynamic equilibrium attained after long 
time - see Rowe (► Sedimentation Equilibrium 
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Fig. 1 In analytical 
ultracentrifugation, a sector¬ 
shaped solution column (top) 
held in the centrifugal held is 
probed with light 
perpendicular to the plane of 
rotation (red arrows) such that 
the concentration of dissolved 
macrosolute (indicated on 
a light to dark gray scale) can 
be recorded along the radial 
direction (blue arrows). An 
example for a concentration 
prohle featuring 
a sedimentation boundary is 
shown in the middle panel. 
The concentration prohles in 
SV typically form 
sedimentation boundaries that 
migrate with time, and may 
resolve multiple boundaries as 
shown in the bottom panel 



light source 



detector 
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Analytical Ultracentrifugation)). This typically results 
in a sedimentation boundary, which is a localized 
region of steep concentration gradients monotonically 
increasing from a “solvent plateau” (a region void 
of solutes) toward a “solution plateau” (a region of 
constant concentration). With time, the sedimentation 
boundary translates and changes shape, while 
dissolved particles accumulate at the bottom. If the 
density of the particles is less than that of the solvent, 
flotation will take place with accumulation at the 
meniscus. In contrast to preparative ultracentrifuga¬ 
tion, recovery of the material is not intended. 

As the name of the technique suggests, the most 
prominent parameter derived from this is the velocity 
of sedimentation of the dissolved particles. It is 
normalized to the centrifugal field and reported as 
sedimentation coefficient (short “s-values”) 


in units of Svedbergs, with 1 S = 10~'^ s, with v 
denoting the linear velocity, cu the rotor angular veloc¬ 
ity, and r the distance from the center of rotation. For 
ideally sedimenting, monodisperse particles, the sedi¬ 
mentation velocity can naively be taken as the velocity 
of the migrating sedimentation boundary. The sedi¬ 
mentation coefficient depends on the mass, density, 
and translational frictional coefficient of the particle. 
Indeed, for particles of known mass and density, the 
translational frictional coefficient can be very precisely 
determined. However, SV is more general in that 
it observes the sedimentation of the entire ensemble 
of particles, which gives rise to characteristic mono- 
modal or multi-modal boundary shapes that evolve 
with time, in a fashion that is governed jointly by the 
diffusion properties of the particles, their polydisper- 
sity in mass, friction, and/or density, and their physical 
and chemical interactions. 

Applications 

SV was developed in the 1920s by The Svedberg and 
colleagues (Svedberg and Pedersen 1940) and since 
then used in a great variety of configurations for 
many different types of applications. The centrifugal 
field in current analytical ultracentrifuges can be 
adjusted between 1,000 and 300,000 g, which sets 
wide limits for the size of the particles under study. 


ranging from dissolved mono-atomic ions and small 
molecules below 1 kDa up to micron-sized particles in 
the GDa range. Accordingly, SV was central in the 
discovery and characterization of macromolecules in 
chemistry, biochemistry, and molecular biology. 

The characterization of the size distribution of 
polymers and colloidal dispersions has been among 
the classical, industrial applications of SV. In polymer 
physics, SV has been used to study the relationship 
between mass and hydrodynamic friction and the 
conformation of linear and branched polymers. 
Currently, it is used in supramolecular chemistry to 
study the size distribution, density, composition, and 
stability of complexes and self-assembled systems. 
Similarly, it has been applied in the characterization 
of size distribution and composition of various types of 
nanoparticles. 

Applications of SV in the biological sciences 
comprise all classes of biological macromolecules, 
including carbohydrates, proteins, nucleic acids, but 
also self-assembled structures such as micelles, 
liposomes, and whole microorganisms. SV was a key 
method in laying the foundations of modern 
biochemistry and molecular biology, starting with the 
demonstration of the macromolecular nature, globular 
shape, and monodispersity of proteins. It was also 
a key method in the elucidation of cellular 
machinery such as the ribosome, the discovery of 
semiconservative replication and DNA supercoiling, 
and in the study of protein allostery and ligand-induced 
conformational changes. 

In the last decade, SV has experienced 
a renaissance, as it is one of the most powerful methods 
for the solution characterization of reversible protein 
interactions with regard to the number of complexes, 
their stoichiometry, binding constants (detectable in 
the range from low nM to mM range), and gross 
shapes. In particular, it is very suitable for the study 
of reversible multi-protein complexes with more than 
two components and multiple complex species 
(reviewed in Schuck 2007). In addition, SV has 
recently become an important industrial tool for deter¬ 
mining the purity of protein pharmaceuticals. 

Experimental 

The essential aspect of SV is the measurement of the 
solute concentration distribution during centrifugation. 
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This is accomplished with analytical rotors and sample 
holders with transparent windows that allow light to 
transverse through the sample solution parallel to the 
axis of rotation (Fig. 1). The optical system is triggered 
with the revolution of the rotor such that data are 
accumulated only at a specific rotor angle when 
a particular sample solution is located in the focus of 
the optical system. In different systems, the sample is 
either illuminated in a narrow radial range that is 
scanned through the sample volume, or the whole sam¬ 
ple is illuminated at once and imaged onto a camera. 
Many different optical systems have been developed 
over time, but most widely used have been refractive 
index sensitive imaging systems, such as Schlieren and 
Rayleigh interference systems, and absorption optical 
scanning systems equipped with monochromator (for 
specific detection between 200 and 800 nm), radially 
moveable slit, and photomultiplier. 

A key requirement for the meaningful interpretation 
of SV is that sedimentation occurs in the absence of 
wall effects and convection. By carrying out SV with 
sector-shaped sample volumes aligned parallel to 
the centrifugal field, wall effects can be limited to the 
bottom of the cell and the “back-diffusion” region 
generated by the steep concentration increase toward 
the bottom of the cell. Convection is experimentally 
minimized by careful thermal equilibration. It is often 
further suppressed by small density gradients arising 
from the sedimentation of small concentrations of co¬ 
solvents, which are usually added in order to shield 
charges and eliminate long-range electrostatic forces 
between particles influencing their sedimentation 
behavior. 

The sedimentation velocity will scale with the 
solvent viscosity, rj, and particle buoyancy (1 — vp), 
where v is the partial-specific volume and p the solvent 
density. Changes in the solvent density across the 
sample volume due to solvent compressibility are 
usually small and negligible at the modest hydrostatic 
pressures typically achieved in SV (^20-25 MPa), but 
they become significant at the highest rotor speeds 
and can be computationally corrected for. In practice, 
s-values are often transformed to standard conditions 
of water at 20° C 

»7exp(l - VPlO.w) 

*^20,^ — “^exp 7 ^ (2) 

»720,w( 1 - VPexpj 


in order to eliminate the solvent viscosity and density 
dependence and allow comparison with hydrodynamic 
predictions. Hydrodynamic models may be based, for 
example, on protein crystal structures, small angle 
scattering models, or molecular dynamics simulations. 
These may be used to predict translational friction 
coefficients and s-values based on the Kirkwood- 
Riseman-Bloomfield method, as implemented in the 
software HYDROPRO (Garcia de la Torre et al. 2000), 
or the boundary element Stokes flow method, as 
implemented in the software BEST (Aragon and 
Hahn 2006). 

There are subtle aspects as to what exactly consti¬ 
tutes the “sedimenting particle,” as the macroscopic 
view of a solid rigid body in a continuous solvent 
breaks down on the macromolecular size range usually 
studied in AUC. The “sedimenting particle” should 
better be regarded as an effective particle comprised 
of a dynamic system of transiently co-sedimenting 
molecules, which is intricately connected to the 
properties of the solvent and particle/solvent interac¬ 
tions. For typical proteins, the volume of the interfacial 
region where the solvent is perturbed is on the same 
order of magnitude as the dry protein volume 
(Halle and Davidovic 2003). Therefore, the hydration 
dynamics and the effect of transiently bound water 
dissipating energy and increasing the hydrodynamic 
friction of the particle are very significant. For exam¬ 
ple, for proteins, this can be empirically accounted for 
by an increase of the effective contour by ^ 1 A above 
the van-der-Waals contour seen in the crystal structure 
(Garcia de la Torre et al. 2000; Aragon and Hahn 
2006). Similarly, ions (in particular in poly-electro- 
lytes) or other co-solvents contribute to the 
sedimenting particle system in multicomponent sol¬ 
vents (reviewed in Ebel 2007). The effect of 

preferential solvation on the “particle” (such as 
preferential exclusion of sucrose from the protein 
hydration shell) can be very large. This can 

pragmatically be expressed with the help of 
thermodynamic density increments that are 

operationally defined by equilibrium dialysis and 
densimetry. (Weighing required for densimetry links 
the microscopic determination of molecular weights to 
a macroscopic weight measurement.) Through the 
solvent dependence of the buoyancy of the effective 
sedimenting particle, SV-AUC can provide insights 
into macromolecular solvation (Ebel et al. 2000). 
In the absence of significant preferential solvation 
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effects, the density increments of proteins can be 
approximated with the help of partial-specific volumes 
predicted from amino acid composition and the 
chemical composition of other moieties. 

The idea of an effective sedimenting system 
also applies to macromolecules undergoing rapid 
self-association or forming heterogeneous complexes 
with other macromolecules in multicomponent 
solutions (Schuck 2010a). With regard to the dynamic 
nature of macromolecules or macromolecular 
complexes, what is macroscopically observed is 
expected to be a time-average state on the time scale 
of the SV-AUC experiment, typically 1,000-10,000 s. 

Traditionally, SV-AUC is concerned with close-to- 
dilute solutions, where the (hydrodynamic) volume of 
the suspended particles occupies less than 1% of the 
solution volume, and the sedimentation process is 
reflective of the properties of the individual particles 
rather than that of the suspension as a whole. In this 
close-to-dilute regime, hydrodynamic repulsive 
particle-particle interactions and retardation of 
sedimentation from the counterflow of solvent in 
the fixed-volume samples exist, but these effects can 
be well accounted for with a linear concentration 
dependence of the sedimentation coefficients on 
(weight) concentration. This takes the form 

5 = 5o(l-^sw), (3) 

with H’ denoting the weight concentration and ks 
denoting a nonideality coefficient (reviewed in 
Harding and Johnson 1985). Hydrodynamic theory 
predicts the magnitude of ks to be 6.55 x v/, (Batchelor 
1972) (with v/, a hydrodynamic partial-specific 
volume), buf this value has proven difficult to experi¬ 
mentally confirm, as ks is sfrongly shape dependent 
(Rowe 1977) and dependent on the interparticle dis¬ 
tance distribution (Batchelor and Wen 1982; 
Solovyova et al. 2001), which may be modulated by 
long-range repulsive electrostatic interactions on the 
one hand, or by transient binding reactions or attractive 
inter-particle potentials on the other hand. For most 
practical work on compact particles such as folded 
macromolecules, the high sensitivity of the optical 
systems and the precision of the computational data 
analysis allows one to work sufficiently close to the 
dilute limit such that even the linear hydrodynamic 
concentration dependence is usually negligible. This 
is not the case in the study of weak macromolecular 


binding reactions that require high concentrations. 
However, at not too high concentrations and practical 
concentration gradients in SV, and for molecules 
where transport from diffusion is at least comparable 
to that from sedimentation, chemical reactions are 
decoupled from the hydrodynamics of sedimentation. 


Theory and Data Analysis: Noninteracting 
Mixtures 

From the balance of gravitational and frictional forces 
(Coriolis forces are negligible), we find thaf fhe sedi- 
menfation coefficienf is dependenf on the particle mass, 
m, its density, p (or the buoyancy (1 — vp), respec¬ 
tively), and its translational friction coefficient, /. 

s = m{l-vp)/f (4) 

For a single (hypothetically non-diffusing) particle 
initially located at the meniscus r^, the definition of the 
5-value in Eq. 1 leads to the exponential trajectory 

r{t) = r„,exp{s(o^t) (5) 

This also describes the boundary position of 
a monodisperse suspension of ideally sedimenting, 
non-diffusing particles. As a consequence of the expo¬ 
nential and radial movement increasing the average 
interparticle distance, the solution plateau region in 
the sample will experience a drop in concentration 
from the initial concentration, Cq, 


c{t) = co&xp[—2soj^t) (6) 

termed “radial dilution.” Now considering diffusion, the 
boundaries will experience broadening proportional to 
the square root of time, and overall the evolution is 
described by a diffusion/convection equation in radial 
geometry, known as the Lamm equation 


dc 

Ift 


1 ^ 
r dr 





(7) 


It can be solved very accurately and efficiently 
using finite element Lamm equation solutions 
(Brown and Schuck 2008). Examples for the shapes 
of the sedimentation profiles are shown in Eig. 2. 




2294 


Sedimentation Velocity Analytical Ultracentrifugation 





radius (cm) 


d 


e 


f 


1 


0.5 


0 

1 


0.5 


0 




radius (cm) 


Sedimentation Velocity Analytical Ultracentrifugation, 
Fig. 2 Examples for the shapes of Lamm equation solutions. 
The profiles are calculated for particles of different sizes and 
sedimentation properties. All conditions are calculated for 
a rotor speed of 50,000 rpm, and 50 concentration profiles are 
shown at different (in each case equally spaced) time intervals. 
Later scans are indicated by a higher color temperature, 
(a) Small molecules with a sedimentation coefficient of 0.2 S 
and a diffusion coefficient of 6 x 10^® cm^/s, in At = 300 s 
intervals. Similar values are frequently observed for sedimenting 


buffer salts, (b) Sedimentation of a peptide of 1 kDa and 0.3 S, 
At = 1,000 s. (c) A small protein of 10 kDa and 1.5 S, At = 500 s. 
(d) A protein of 100 kDa and 6 S, At = 300 s. (e) Particle of 1 
MDa and 30 S, At = 50 s. (f) A floating particle with a sedimen¬ 
tation coefficient of —3.0 S and a diffusion coefficient of 
2.71 X 10“^ cm^s. Such data patterns may be obtained, for 
example, with large emulsion or lipid particles. In flotation, the 
radial dilution is replaced with a radial increase in concentration 
in the plateau region of successive profiles 


For particles governed by Brownian motion, the 
relaxation response to concentration fluctuations and 
to an external force are the same, which means that the 
frictional coefficients for sedimentation and diffusion 
are identical, leading to the Svedberg equation 


s M(1 — vp) 
D ~ 


( 8 ) 


It states that from the displacement of the 
sedimentation boundary (sedimentation coefficient) 
and the diffusional broadening of the boundary 
(diffusion coefficient), the particle’s buoyant 
molecular weight M(1 — vp) can be extracted from 
the dynamics of the sedimentation process. For 
particles less dense than the solvent, negative 
sedimentation coefficients and buoyant molecular 
weights occur, leading, as noted above, to a flotation 
process that is entirely analogous to the case of 
sedimentation. 


In practice, the boundary spread is exquisitely 
sensitive to heterogeneity of the sample under study, 
which often requires one to abandon the description 
with a discrete, monodisperse particle-size model. 
Several approximate descriptions for the 
sedimentation of polydisperse mixtures have been 
during the long history of SV. Recently, the exact 
description via integral equations became 
computationally feasible. In the simplest form, 
a continuous distribution of sedimentation coefficients 
is considered as 


a{r,t)= j c{s)x^{s,D{s)p-j)ds + h{r) + P{t) (9) 


with c(s) denoting the differential sedimentation coef¬ 
ficient distribution, where c{s)ds is the concentration of 
particles with sedimentation coefficient between 5 and 
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s + ds (Schuck 2000). r, t) is a normalized 

Lamm equation solution, usually exploiting a scaling 
law D(s) that relates sedimentation and diffusion coef¬ 
ficients. This is normally available from prior 
knowledge of the nature of the sample, for example, 
D{s) ^ for compact particles. In the definition of 
Eq. 9, a(r,t) is the experimental sedimentation data that 
is directly fit in a least-squares sense, and h{r) and P(t) 
are systematic time-invariant and radial-invariant sys¬ 
tematic noise components typical for SV data, which 
can be determined unambiguously while solving this 
Fredholm integral equation. Solving the integral equa¬ 
tion requires maximum entropy or Tikhonov regulari¬ 
zation, which can be enhanced with Bayesian prior 
knowledge. The scaling law also contains pre¬ 
exponential scaling factors (for compact particles 
determined by the average frictional ratio, ///o), that 
can be fit by nonlinear regression of Eq. 9. 

Several variations of this approach are possible and 
useful for different applications. First, the scaling law 
D{s) also allows converting the sedimentation coeffi¬ 
cient distribution c{s) into a molar mass distribution 
c{M). Further, scaling-law free models have been 
developed, such as the general c{s*), and the two- 
dimensional size-and-shape distribution cis,f). With 
regard to the latter, it should be noted, though, that 
the extra distribution dimension is rarely justified and 
often strongly ill-conditioned, since experimental data 
only provide sufficient information for a single diffu¬ 
sion parameter per sedimentation boundary. Finally, 
for mixtures of spectrally distinguishable components, 
a multi-signal Ck{s) distribution has been described for 
simultaneously extracting molar sedimentation coeffi¬ 
cient distributions for each component from the global 
data sets acquired at multiple signals (Balbo et al. 
2005). An example for the combined spectral and 
hydrodynamic resolution of a three-component mix¬ 
ture of unlabeled proteins is shown in Fig. 3, and the 
use of this method to unravel multiple binary and triple 
protein complexes formed in three-component mix¬ 
tures of interacting proteins is illustrated in Fig. 4 
below. 

These distribution models can routinely provide 
a precision of the sedimentation process to within 1% 
or better. Typical SV data sets are sufficiently infor¬ 
mation rich to allow exquisite sensitivity and high 
hydrodynamic resolution, for example, baseline- 
separation of monomer and dimers of small proteins. 
In contrast to many diffusion-based methods, the 


resolution and sensitivity of SV increases strongly 
with particle size. 

The linear distribution model Eq. 9 is based on 
ideally sedimenting, noninteracting solutes. It is appli¬ 
cable in good approximation also to the sedimentation 
of discrete macromolecular systems with rapid chem¬ 
ical reactions, with the caveat that some peaks in the 
distribution represent reacting system that have to be 
interpreted as effective particles (see below) (Schuck 
2010b). However, for nonideal sedimentation with 
hydrodynamic repulsive interactions, the Fredholm 
integral equation Eq. 9 becomes nonlinear and for 
practical purposes at present computationally virtually 
intractable. Phenomenologically, in this case, the 
boundary ceases to migrate as predicted by Eq. 5, 
boundary self-sharpening occurs, and the relative mag¬ 
nitudes of the boundary components become distorted 
from the Johnston-Ogston effect (Fujita 1975). In this 
case, a rigorous weighted-average sedimentation coef¬ 
ficient can still be determined through the transport 
method. 


iu. = - 


Q}^r^c{rp) dt 


c{r,t)rdr (10) 


which is purely based on mass transport through an 
imaginary plane at radius i-p in the plateau region. Any 
model that can fit the raw data well (including c(^), 
irrespective of its motivation) will be faithful to the 
integral in Eq. 10, and therefore can report s^. The 
concentration dependence of can be studied to 
reveal the magnitude of kg, which can inform on mac¬ 
romolecular shape or/or weak attractive/repulsive 
interactions (see above). 


Theory and Data Analysis: Interacting 
Macromolecular Systems 

SV provides a lot of information on reversible 
interacting systems under dilute or close-to-dilute 
conditions. Insight into the rich and interesting phe¬ 
nomenology of sedimenting reacting systems is 
essential in the study of macromolecular interactions 
by SV. In the dilute case, the sedimentation process 
can be described by an extension of the Lamm 
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Sedimentation Velocity Analytical Ultracentrifugation, 
Fig. 3 Top Panel: Sedimentation coefficient distributions c{s) 
derived from sedimentation velocity profiles of a mixture of IgG 
and aldolase. The inset shows the raw sedimentation signals 
acquired at different time points with interference, absorbance 
at 280 nm, and absorbance at 250 nm (top to bottom), at a rotor 
speed of 48,000 rpm at 26°C. The c(x) distributions are calcu¬ 
lated separately for the refractive index data (solid line), the 
absorbance data at 280 nm (dotted line), and the absorbance 
data at 250 nm (dashed line). To facilitate comparison of the 
peak positions, the vertical dotted line indicates the peak of the 
c(s) distribution from the interference data. Middle Panel: 


Global multi-wavelength analysis and decomposition into the 
component sedimentation coefficient distributions Ct(r) for the 
IgG sample (red) and the aldolase (blue). The extinction coeffi¬ 
cients for these two components at the three signals were 
predetermined from separate experiments with IgG and aldolase 
alone, which resulted in the sedimentation coefficient distribu¬ 
tions indicated by the dotted lines. Lower Panel: Calculated 
component sedimentation coefficient distributions from 
a mixture (solid lines) of IgG (red), aldolase (blue), and BSA 
(green, right ordinate) and comparison with the distributions 
obtained from the individual proteins (dotted lines) (Data taken 
from Balbo et al. 2005) 
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Sedimentation Velocity Analytical Ultracentrifugation, 
Fig. 4 Example of the multi-signal C/i(s) analysis of a triple 
protein mixture of a viral glycoprotein (green), its cognate 
receptor (blue), and a heterogeneous antigen-recognition 
receptor fragment (red). The content of each protein component 
in the different s-ranges is obtained from the global analysis 
of sedimentation data acquired with the interference optics 
and with the absorbance system at two different wavelengths 


(data not shown), using two chromophorically labeled and one 
unlabeled protein. Solid lines show the C/i(s) analysis of the triple 
mixture. The analogous distributions of each protein alone are 
shown as dashed lines. The formation of coexisting binary 
complexes of the receptor/viral glycoprotein (6.5-7 S), cognate 
receptor/antigen-recognition receptor (7 S), and a ternary com¬ 
plex of the three with 1:1:1 stoichiometry ( ~8.5 S) can be 
discerned (Data as shown in Schuck 2007) 


equation to a system of radial reaction/diffusion/con¬ 
vection equations 


dck 1 d 
dt r dr 


I 2 2 
I CkSkO) r 



qt ( 11 ) 


where the spatial migration of all species k is coupled 
via their local chemical reaction fluxes (Fujita 
1975). The boundary patterns generated by this equa¬ 
tion fall into two classes, dependent on the timescale of 
the reaction kinetics relative to the timescale of the SV 
experiment (there is only a narrow transition region). 

For slow kinetics on the timescale of sedimentation, 
the process of sedimentation/diffusion is decoupled 
from the chemical reaction, and SV exhibits bound¬ 
aries corresponding to the individual species. Accord¬ 
ingly, the different complexes may be resolved from 
the unbound material by c{s) or Ck{s), as illustrated in 
Fig. 4. 

For fast kinetics on the timescale of sedimentation, 
coupled sedimentation and reaction occurs, and the 
sedimentation boundaries are reflective of the entire 
interacting system, with free and complex species 
sedimenting in a concerted fashion as effective 


particles, with concentration-dependent s-values 
between those of the free and the fully complexed 
species. In the limit of fast kinetics, generally, n- 
component systems exhibit n boundaries. 

Gilbert was the first to describe the properties of the 
single sedimentation boundary of self-associating sys¬ 
tems (Gilbert 1959). Due to the concentration gradient, 
the boundary is bimodal at concentrations exceeding 
the (effective) dissociation equilibrium constant Ko, 
and essentially mono-modal below. The boundary sep¬ 
aration increases with higher oligomerization schemes. 

For two-component mixtures, there will usually be 
an “undisturbed” boundary formed by a single compo¬ 
nent at the s-value of that component, and always 
a “reaction boundary” containing a mixture of all spe¬ 
cies (Fig. 5) sedimenting with an s-value between that 
of the faster sedimenting free species and the largest 
complex species. 

The sedimentation patterns of multicomponent 
mixtures are explained by effective particle theory 
(EPT), which is derived from the application of trans¬ 
port considerations to the coupled Lamm equation 
system Eq. 11 for fast kinetics (Schuck 2010a). 
It leads to the recognition of the ergodicity of the 
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Sedimentation Velocity Analytical Ultracentrifugation, 
Fig. 5 Sedimentation coefficient distributions c(s) representing 
the boundary patterns of the interacting system of species 
A (3.5 S) reversibly interacting with B (5 S) forming transient 
complexes AB (6.5 S), shown at different total loading concen¬ 
trations. The vertical lines indicate the s-values of the free and 
complex species. Top: Dilution series with equimolar concen¬ 
trations at 0.1 Kd {blue), 0.3 {pink), (green), 3 Kq (red), 
10 Kd (cyan). The c(s) distributions are normalized relative to 
the total loading concentrations. The undisturbed boundary can 


be recognized from the constant peak at 3.5 S, and the reaction 
boundary from the concentration-dependent s-value between 
that of B and the complex AB. Bottom'. Titration series of 
a constant total concentration CAtot = of the smaller species 
A with increasing concentrations Catot of 0-1 Kd (blue), 0.3 Kq 
(pink), Kd (green), 2.366 Kq (red), 10 (cyan). Distributions 
are not normalized. This concentration trajectory crosses the 
phase transition. Here, the undisturbed boundary at low concen¬ 
trations is at 3.5 S, but it disappears in the red trace, and 
reappears at 5 S in the cyan trace (For details, see Schuck 2010a) 


reaction boundary, in that the time-average velocity of 
all components match that of the reaction boundary 
velocity. The salient feature of the reaction boundary is 
depicted in Fig. 6: It shows that the component A sed¬ 
iments slower when in the free state and, therefore, 
must recombine earlier than B to match the time- 
average velocities. This results in an asymmetry in 
the concentration of free species co-sedimenting in 
the reaction boundary. 

From this condition, the velocity and composition 
of the reaction boundary can be derived unambigu¬ 
ously, in addition to the properties of the undisturbed 
boundary, all of which depend on the reaction stoichi¬ 
ometry, binding constant, and loading concentrations, 
as well as the relative sedimentation coefficients. For 
example, with “A” denoting the slower sedimenting 
component, the s-values of the reaction boundary are 

{ {caSa + cabSab)I{ca + cab) for cb>4(ca) 

{cbSb +CabSab)!{cb +Cab) else 

( 12 ) 


with c denoting the molar species concentration as 
predicted by mass action law for given total concen¬ 
trations, and with the asymmetric phase transition at 




Kca{vab - Va) + (vg - Vh) 
K{vab - vb) 


(13) 


where K is the equilibrium binding constant, and v are 
the species sedimentation velocities (Schuck 2010a). 
Failure to recognize the nature and qualitative features 
of coupled reaction/sedimentation systems can lead to 
qualitative misinterpretation of experimental data 
(Fujita 1975). 

The data analysis of interacting systems typically 
proceeds in different stages. Generally, the sedimenta¬ 
tion prohles of the reaction boundaries of rapidly 
interacting two-component mixtures can still be 
approximated well with a single y-value and 7)-value, 
except close to the phase transition condition, and 
consequently, the linear decomposition of the bound¬ 
ary profiles with the Fredholm integral equation Eq. 9 


















Sedimentation Velocity Analytical Ultracentrifugation 


2299 



- > 

time 


Sedimentation Velocity Analytical Ultracentrifugation, 
Fig. 6 Cartoon of the effective particle A- ■ B (encircled in 
red). Indicated is the fractional time that A (green triangle) 
and B (blue shape) spend in the complex (grayed time intervals) 
or free (white background). The representation is faithful with 


regard to relative concentrations, velocities, and species life¬ 
times of the system depicted in Fig. 5. Component A spends 
a smaller fraction of time free than B, resulting in a match of 
their time-average velocities. An animation is shown in the 
Supporting Information SI (For details, see Schuck 2010a) 


into a sedimentation coefficient distribution c(i) is 
usually very successful. The molecular weights 
implied by c{M) for the reaction boundary are that of 
an “effective particle,” in between that of the larger 
free species and the largest complex. The integration of 
the sedimentation coefficient distribution leads to the 
weighted-average sedimentation coefficients 
i'vv(consistent with the definition of Eq. 10) and the 
dependence of on loading concentration may be 
modeled with theoretical isotherms based on mass 
action law. This can reveal estimates for the binding 
constants, as well as s-values of the complexes. 

Further, the boundary pattern can usually be classi¬ 
fied regarding the reaction kinetics, that is, whether it is 
slow or fast, after inspection of the concentration 
dependence of the cis) peaks. This offers more detailed 
quantitative interpretations exploiting the characteris¬ 
tic boundary patterns. Because slow systems hydrody- 
namically separate all species, the species populations 
as a function of loading concentration can be directly 
measured and interpreted with mass action law-based 
population models. For fast systems, analogously the 
amplitudes and s-values of the undisturbed boundary 
and reaction boundary can be modeled as a function of 
loading concentrations with isotherms based on EPT. 
Finally, in a most detailed model, solutions of the 
coupled Lamm equations can be fitted directly to 
experimental data, provided sufficiently monodisperse 
reactants and products. For multicomponent systems 
that are of unknown reaction scheme, the multi-signal 
Ck{&) approach can reveal the number of complexes and 
their composition or stoichiometry. 

All concepts above are implemented in the software 
SEDFIT and SEDPHAT for simulation and rigorous 
analysis of raw experimental data. 


Summary 

Sedimentation velocity analytical ultracentrifugation 
is a very powerful tool for the study of macromolecules 
and nanoparticles, with regard to their size distribution, 
their shape or solution conformation, and sometimes 
their composition. Further, it is a delicate and highly 
informative tool for the study of reversible macromo- 
lecular interactions in free solution, allowing the char¬ 
acterization of binding constants or virial coefficients, 
and binding kinetics. For multicomponent systems 
exhibiting multiple complexes, it offers the opportu¬ 
nity to study the number and stoichiometry of com¬ 
plexes formed. In the last decade, the technique has 
experienced a transformation due to significant 
theoretical and computational developments. This has 
led to improved accuracy and resolution, and opened 
the way to many new types of applications. 
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Introduction 

During the past two decades, the use of miniaturized 
systems for the manipulation and processing of fluid 
samples has gained significant interest because of their 
chemical, biological, and biomedical applications. 
This interest in large part has been driven by concom¬ 
itant advances in the areas of genomics, proteomics, 
drug discovery, high-throughput screening, and diag¬ 
nostics, with a clearly defined need to perform rapid 
measurements on small sample volumes. At a basic 
level, microfluidic activities have been motivated by 
the fact that physical processes can be more easily 
controlled when instrumental dimensions are reduced 
to the micron scale. Key additional benefits include the 
ability to process small volumes of fluid, enhance 
analytical performance, reduce instrumental foot¬ 
prints, lower unit costs, and facilitate integration of 
functional components within monolithic substrates 
and the capacity to exploit atypical fluid behavior in 
both time and space. 

In broad terms, microfluidic systems can operate 
within either a continuous-flow regime or a droplet/ 
segmented flow regime. Continuous-flow operation is 
well suited for use in centrifugal fluidics and large- 
scale integration applications; however, issues related 
to Taylor dispersion, fluid interaction with channels 
walls, cross-contamination, and long channel lengths 
limit their use as an effective tool in a variety of 
situations. To this end, the use of segmented flow 
systems has become increasingly popular and over¬ 
come all of the above through the formation of discrete 
and isolated droplets with volumes on the pico- to 
nanoliter. The development of such platforms for 
biophysics and biotechnology applications has been 
explored extensively and aspects of these studies will 
be highlighted herein. 




Segmented Flow Microfluidics 


2301 


s 


Droplet-Based Microfluidics 

Droplet-based microfluidics deals with the generation 
and manipulation of discrete droplets contained within 
an immiscible continuous phase. There are two types 
of platforms that use this concept; segmented flow 
microfluidics and digital microfluidics. The latter 
deals with the manipulation of discrete nL-pL volume 
droplets on planar surfaces using electrical or acoustic 
forces. Actuation, splitting, merging, and dilution of 
droplets are typical procedures performed within such 
platforms (Berthier 2008). A discussion of research in 
this area lies outside the scope of the current work. On 
the other hand, segmented flow microfluidics makes 
use of two immiscible phases to create discrete 
volumes that reside a move within a continuous flow. 
In the case of gas-liquid segmented flow microfluidics, 
gas is injected into a continuous stream of liquid, thus 
creating a segmented flow, consisting of liquid plugs 
separated by gas bubbles. Conversely, in liquid-liquid 
segmented flow microfluidics, droplets of the dis¬ 
persed phase are produced as a result of the interaction 
of the flow-generated shear force and the interfacial 
tension at the fluid-fluid interface (Casadevall i Solvas 


a Capillary 


and deMello 2011; Teh et al. 2008). Significantly, 
microfluidics allows for the production of monodis- 
perse droplets at rates in excess of tens of KHz and 
for independent control of each droplet (in terms of 
their size, position, and contents). These unique fea¬ 
tures have great potentials in biomedical engineering. 

The most common microfluidic droplet generation 
strategies are presented in Fig. 1. A co-flow 
microfluidic device (Fig. la) generates droplets of 
one fluid in another immiscible fluid using two coaxial 
capillaries. In a T-junction microfluidic device 
(Fig. lb), two immiscible fluids intersect perpendicu¬ 
larly and form an interface at the junction where the 
droplets are created. Finally, in a flow-focusing 
microfluidic chip (Fig. Ic), the two phases are forced 
in parallel through a narrow nozzle (or channel 
constriction) where the dispersed phase can be pinched 
off. In a flow-focusing device, drop formation can be 
occurred at the nozzle via a “dripping” process or in 
a jetting mode, where the inner fluid (dispersed phase) 
forms a thin stream (or jet) that breaks into drops 
downstream of the nozzle. The transition from 
dripping to jetting can be controlled through variation 
of the flow rates of the two phases (Utada et al. 2007). 


ON 
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Microfluidics, 

Fig. 1 Droplet generation 
strategies: (a) co-flow in 
a capillary format; 

(b) T-junction in a planar chip 
format; (c) flow-focusing in 
a planar chip format 
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Segmented Flow Microfluidics: Unit 
Operations 

The formation of droplets in microfluidic devices has 
been extensively studied for both liquid-liquid and 
gas-liquid systems. Nevertheless, the use of segmented 
flows in complex chemical/biological processing relies 
on the ability to perform a range of integrated, unit 
operations in high throughput. 

Reactions play a central role in chemistry and biol¬ 
ogy and the control of reagent concentrations, reaction 
times, and mixing is of great importance. In a simple 
sense, droplets may be considered as individual 
chemical reactors, and as such need to perform the 
same functions as large-scale reactors. That is to say 
reagents need to be added at known concentrations and 
at different points in time, the reaction must be incu¬ 
bated for the correct time and the product must be 
assessed. These functions can be accomplished by 
simple unit operations. 

The introduction of multiple reagents to droplets 
can be easily controlled by the variation of their flow 
rates (Theberge et al. 2010). Efficient mixing of 
reagents within any reactive process is an important 
process. Mixing of reagents inside droplets can be 
extremely rapid because typical diffusion distances 
are short. However, when droplets move within 
a straight channel, mixing occurs in each half of the 
droplet but not between the halves (Song et al. 2006). 
Using curved or winding channels to transport droplets 
allows efficient reorientation of droplets with respect 
to the flow direction. This in turn accelerates reagent 
mixing through chaotic advection (Fig. 2a). 

Droplet merging is a fundamental process allowing 
the combination of reagents to initiate or quench 
a chemical reaction (Niu et al. 2011; Casadevall 
i Solvas and deMello 2011). Merging can be achieved 
using either passive or active strategies. Passive 
droplet merging is based on the control of the channel 
geometry (Fig. 2b) to induce coalescence. Alterna¬ 
tively, active merging is achieved through the applica¬ 
tion of external stimuli such as electric forces, surface 
acoustic waves, and the thermocapillary effect which 
employs the temperature dependency of surface 
tension to guide a droplet in a microfluidic channel 
(Gu et al. 2011). Similar strategies can be applied to the 
(reverse) process of droplet splitting (Fig. 2c). 

It is significant to note that since the residence time 
of droplets inside microfluidic devices is typically 


between a few seconds and a few minutes, droplet 
storage or incubation is important in situations where 
biological processes must be screened over longer time 
periods (often hours to weeks). The use of droplet 
traps, chambers, or external tubes (Casadevall 
i Solvas and deMello 2011; Teh et al. 2008; Theberge 
et al. 2010) allows multiple droplets to be stored 
(Fig. 2d) in parallel for extended periods of time. 

Considering that on-chip generation and efficient 
manipulation of nano- to femtoliter droplets is getting 
reliable, there is a need to develop online rapid detec¬ 
tion methods of high sensitivity and low detection 
limits to analyze thousands of small droplet volumes. 
Accordingly, different on-chip analytical tools have 
already been applied to gain information on physical 
and chemical properties of microdroplet contents 
(Theberge et al. 2010). Fluorescence imaging is the 
dominant method to follow thousands of individual 
droplets over time but also it is widely used to charac¬ 
terize chemical or biological processes arising inside 
their volume. Finally, Raman and mass spectroscopy, 
electrophoresis, and electrochemical detection are 
label-free detection approaches, which have success¬ 
fully been integrated in microfluidics for the extraction 
of the required analytical information. 

A final unit operation, which is of clear utility, is 
droplet sorting that offers the possibility to isolate 
droplets of interest from a larger population. Through 
use of electric fields (dielectrophoresis), acoustic 
waves, and light-induced heating, it is feasible to 
guide specific droplets into a predetermined path 
(Fig. 2e) (Guet al. 2011). 

Applications of Segmented Flow 
Microfluidics 

Since segmented flow microfluidic platforms can 
perform a variety of unit operations, they have found 
significant application in chemistry and biology since 
it is an attractive platform for the integration of func¬ 
tional components (Theberge et al. 2010; Casadevall 
i Solvas and deMello 2011; Song et al. 2006). 

For example, segmented flow microfluidic tools 
have been shown to have significant utility in synthe¬ 
sizing functional micro- and nanoparticles (Hung and 
Lee 2007; Song et al. 2006). Synthesis of monodis- 
perse particle populations is desirable in almost all 
situations, and relies on the precise control of particle 
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Segmented Flow Microfluidics, Fig. 2 A summary of com¬ 
mon unit operations performed in segmented flow microfluidics. 

(a) Mixing inside the droplets enhanced by winding channels; 

(b) passive, channel-geometry-mediated droplet merging 
(Reprinted with permission from Niu et al. (2008). Copyright 
2008, Royal Society of Chemistry); (c) passive droplet splitting 


(Reprinted with permission from Link et al. (2004). Copyright 
2004, American Physical Society); (d) trapping of droplets 
(Reprinted with permission from Huebner et al. (2008b). Copy¬ 
right 2008, Royal Society of Chemistry); (e) electrosorting of 
droplets (Reprinted with permission from Niu et al. (2007). 
Copyright 2007, American Institute of Physics) 


nucleation and growth. The ability to control both heat 
and mass transfer within small-volume environments 
provides a direct route to producing particle 
populations of low polydispersity. Indeed, it has been 
shown in numerous studies that the adoption of seg¬ 
mented flow microfluidic reactors leads to a better 
control of both nanoparticle size and size distribution 
when compared to standard macroscopic and continu¬ 
ous-flow methods (Zhao et al. 2011). A variety of 
functional particles of controlled size, shape, and com¬ 
position have been synthesized using segmented flow 
microfluidic reactors. These include Janus particles, 
microcapsules (Fig. 3a) with polymerized shells, 
microgels, porous or patterned particles, compound 
semiconductor nanoparticles, and vesicles (liposomes 
and polymersomes) (Wang et al. 2011; Kumacheva 
and Garstecki 2011; Fernandez-Nieves et al. 2011). 
The latter category constitutes an interesting and valu¬ 
able tool in drug delivery and screening. Vesicles 
themselves are powerful tools in drug delivery 


applications since different biomolecules such as 
DNA, proteins, and cells can be attached to the 
membrane (Wang et al. 2011). 

The exploration of cellular-membrane processes 
using a synthetic approach is very attractive in seg¬ 
mented flow microfluidics. Droplet interface bilayers 
can mimic the properties of lipid bilayer cell mem¬ 
branes. This is crucial part of the cell that regulates 
function through the activity of membrane-bound 
compounds (proteins and ion channels) (Casadevall 
i Solvas and deMello 201 1). Moreover, with the devel¬ 
opment of “artificial” biological and macromolecular 
systems in microdroplets, it is now feasible to create 
biosensors to detect toxins, or to perform intracellular 
processes such as gene expression. 

Microdroplet platforms have already provided for 
significant advances in many fundamental molecular 
biological processes, including the polymerase chain 
reaction (PCR), with applications in diagnostics and 
sequencing. PCR can be carried out in laboratory-scale 
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Segmented Flow Microfluidics, Fig. 3 (a) Monodisperse 
microcapsules with different configurations (Reprinted with per¬ 
mission from Kim et al. (2011). Copyright 2011, Wiley-VCH 
Verlag GmbH& Co. KGaA). (b) Continuous-flow PCR in which 
droplets pass through the inner circles in the hot zone for initial 
denaturation of the template and then from the periphery where 
primer annealing and template extension occur. The droplets 


then pass to the center, where the DNA is denatured and another 
cycle begins. Finally, the droplets exit the device after 34 cycles 
(Reprinted with permission from Schaerli et al. (2009). Copy¬ 
right 2009, American Chemical Society), (c) Stochastic loading 
and ordered encapsulation of cells in droplets (Reprinted with 
permission from Edd et al. (2008). Copyright 2008, Royal 
Society of Chemistry) 


PCR cyclers as well as in continuous-flow and batch 
uniphasic microfluidic systems. However, these 
approaches are beset by issues related to reagent and 
enzyme adsorption on channel walls (leading to 
reduced reaction yields), cross-contamination between 
multiple samples and reagent dispersion (in the case of 
continuous-flow methods) (deMello 2001). Compart- 
mentalization of reactions in water-in-oil emulsions 
significantly enhances amplification efficiencies, due 
to the prevention of analyte/surface interactions 
(Fig. 3b) (Casadevall i Solvas and deMello 2011). 
Additionally, many PCR reactions can be performed 
consecutively in high throughput (Schaerli et al. 2009). 


Segmented flow microfluidics can also be applied to 
cell and macromolecule encapsulation, since the load¬ 
ing of defined numbers of macromolecules into drop¬ 
lets can be performed in a controlled way. Many 
droplet platforms have been successful in performing 
experiments with bacterial, yeast, mammalian, 
and human cells, and even with organisms such 
as Caenorhahditis elegans (Huebner et al. 2007; 
Lindstrdm and Andersson-Svahn 2010). Such plat¬ 
forms have the following advantages over conven¬ 
tional techniques: (a) Monodisperse droplets are 
generated with ease, allowing quantitative analysis of 
metabolite or reagent concentration changes, (b) In situ 
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reagent addition by droplet fusion allows complex 
biological procedures to be carried out. (c) Reaction 
or metabolic products are confined within the droplet 
volume preventing their escape or dilution due to dif¬ 
fusion. Since loading cells into droplets is a purely 
random process, it is dictated by Poisson statistics. 
The probability of a droplet containing k cells is 
A^exp(—where X is the average number of 
cells per droplet. This means that to minimize the 
number of droplets containing more than one cell 
requires very low average loading densities, meaning 
that most droplets contain no cells. This constraint 
significantly reduces the number of usable drops in 
an analytical process. Fortunately, self-organization 
phenomena in high-aspect ratio microchannels have 
been used to ensure single-cell encapsulation 
(Fig. 3c) in 70-80% of produced droplets (Edd et al. 
2008). Accordingly, segmented hows have great 
potential in high-throughput screening, where the abil¬ 
ity to conduct cell-based expression and enzymatic 
measurements allows characterization of biological 
systems in vitro (Huebner et al. 2008a). 

Present Issues and Future Challenges 

Segmented how microhuidics has become an impor¬ 
tant and rapidly developing biological research tool. 
System downscaling and the associated improvements 
in mass and thermal transfer have been key. However, 
there are still problems that must be solved, to ensure 
the widespread adoption of droplets as basic tools in 
biological research. 

The first challenge relates to the development of 
new materials for the fabrication of micro-structured 
devices. Today, polydimethylsiloxane (PDMS) is the 
dominant material for the fabrication of microfiuidic 
devices. It is elastomeric, optically transparent in the 
visible region of the electromagnetic spectrum, 
biocompatible, hydrophobic, and permeable to gases 
(advantageous in cell-based experimentation). None¬ 
theless, its poor compatibility with nonpolar solvents, 
and the diffusion of non-cross-linked oligomers to the 
channel surface impede its use in many (nonbiological) 
applications. Glass microfiuidic devices provide supe¬ 
rior solvent compatibility; nevertheless, they are more 
difficult, costly, and time consuming to fabricate. 
Accordingly, the introduction of alternative materials 
or the modification of existing PDMS devices (use of 


perfluoropolyether, poly(methylmethacrylate), and 
polycarbonate) is an area of much investigation 
(Wong and Ho 2009; Renckens et al. 201 1). 

Since monodisperse picoliter-volume droplets can 
be formed rapidly and in large numbers, the ability to 
probe each and every droplet with a sensitive detection 
method is key to extracting the vast amount of infor¬ 
mation contained within. A variety of detection 
methods have been used in conjunction with seg¬ 
mented flows, including fluorescence spectroscopy, 
electrochemistry, vibrational spectroscopies, and 
mass spectrometry (Casadevall i Solvas and deMello 
2011). The collection of data (both chemical and 
biological) from these detection methods is feasible; 
however, the use of multiple detection methods oper¬ 
ating simultaneously offers maximum information 
recovery, and will undoubtedly be an area of much 
investigation. 

Under many circumstances, droplets within a seg¬ 
mented flow offer almost perfect compartmentaliza- 
tion of reaction volumes without inter-droplet 
communication. Although this is essentially valid for 
reactions involving large biomolecules, droplets can 
be porous to smaller molecules (Wootton and deMello 
2010). Accordingly, the development of new surfac¬ 
tants and immiscible (fluorinated) oil phases will be 
key to controlling molecular confinement in, or trans¬ 
port between droplets (Baret 2012). Interestingly, 
a droplet-based microfiuidic system has been devel¬ 
oped for the oxidation of methane (CH 4 ) to methanol 
(CH 3 OH), using oxygen as the oxidant. Droplets of 
catalyst solutions were passed through Tefion tubes 
using a carrier fluid. Pressurized CH 4 and O 2 were 
introduced via diffusion through the Tefion tubing 
and, in the presence of active catalysts, reacted to 
produce methanol. At high temperatures, methanol 
can diffuse through the carrier fluid to a neighboring 
droplet that contains a colorimetric indicator to report 
activity of the catalyst (Kreutz et al. 2010). 

Finally, it is noted that biological processes 
often occur over timescales that are long compared 
to typical residence times (of droplets) in chip- 
based microfiuidic systems. This means that strate¬ 
gies for incubating and storing droplets for extended 
time periods are highly desirable. Much work is 
ongoing in this area and includes the development 
of droplet traps and novel surfactant systems 
(Casadevall i Solvas and deMello 2011; Theberge 
et al. 2010 ). 
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Summary 

There is little doubt that the development of 
segmented flow microfluidic systems has enabled 
significant advances in chemical and biological 
experimentation. The ability to produce and manip¬ 
ulate thousands of monodisperse, sub-nanoliter 
droplets in both time and space has the potential to 
transform the manner in which high-throughput 
and quantitative experimentation is performed. The 
potential for using droplets in a broad range of 
applications, such as single-cell analysis, genomics, 
medical diagnostics, nanomaterial production, and 
proteomics, is both significant and clear. 

Cross-References 

► Droplet-on-Hydrogel Bilayer (DHB) 

► Microfluidics for Single Molecule Detection 

References 

Bare! JC. Surfactants in droplet-based microfluidics. Lab Chip. 
2012;12(3):422-33. 

Berthier J. Microdrops and digital microfluidics. Norwich: 
William Andrew; 2008. 

Casadevall i Solvas X, deMello A. Droplet microfluidics: recent 
developments and future applications. Chem Commun. 
2011;47:1936. 

deMello AJ. DNA amplification: does ‘small’ really mean 
‘efficient’? Lab Chip. 2001;l:24N-9. 

Edd JF, Di Carlo D, Humphry KJ, Koster S, Irimia D, Weitz DA, 
Toner M. Controlled encapsulation of single-cells into 
monodisperse picolitre drops. Lab Chip. 2008;8:1262M. 
Femandez-Nieves A, Wyss H, Mattsson J. Microgel 
suspensions. Fundamentals and applications. Weinheim: 
Wiley-VCH; 2011. 

Gu H, Duits MHG, Mugele F. Droplets formation and merging in 
two-phase flow microfluidics. Int J Mol Sci. 2011; 12:2572-97. 
Huebner A, Srisa-Art M, Holt D, Abell C, Hollfelder F, deMello 
AJ, Edel JB. Quantitative detection of protein expression in 
single cells using droplet microfluidics. Chem Commun. 
2007;12:1218-20. 

Huebner A, Olguin L, Bratton D, Whyte G. Development of 
quantitative cell-based enzyme assays in microdroplets. Anal 
Chem. 2008a;80:3890-6. 

Huebner A, Bratton D, Whyte G, Yang M, deMello AJ, Abell C, 
Hollfelder F. Static microdroplet arrays: a microfluidic 
device for droplet trapping, incubation and release for 
enzymatic and cell-based assays. Lab Chip. 2008b;9:692-8. 
Hung L, Lee A. Microfluidic devices for the synthesis of 
nanoparticles and biomaterials. J Med Biol Eng. 
2007;27:1-6. 


Kim SH, Hwang H, Lim CH, Shim JW, Yang SM. Packing of 
emulsion droplets: structural and functional motifs for multi- 
cored microcapsules. Adv Fund Mater. 2011;21:1608-15. 

Kreutz JE, Shukhaev A, Du W, Druskin S, Daugulis O, Ismagilov 
RF. Evolution of catalysts directed by genetic algorithms in 
a plug-based microfluidic device tested with oxidation of 
methane by oxygen. J Am Chem Soc. 2010;132:3128-32. 

Kumacheva E, Garstecki P. Microfluidic reactors for polymer 
particles. Chichester: Wiley-VCH; 2011. 

Lindstrom S, Andersson-Svahn H. Overview of single-cell 
analyses: microdevices and applications. Lab Chip. 2010; 
10:3363-72. 

Link D, Anna S, Weitz D, Stone H. Geometrically mediated 
breakup of drops in microfluidic devices. Phys Rev Lett. 
2004:92:054503. 

Niu X, Zhang M, Peng S, Wen W, Sheng P. Real-time detection, 
control, and sorting of microfluidic droplets. Biomicro¬ 
fluidics. 2007; 1:044101. 

Niu X, Gulati S, Edel JB, deMello AJ. Pillar-induced droplet 
merging in microfluidic circuits. Lab Chip. 2008;8:1837ML 

Niu X, Gielen F, Edel JB, deMello AJ. A microdroplet dilutor for 
high-throughput screening. Nat Chem. 2011;3:437M2. 

Renckens TJA, Janeliunas D, van Vliet H, van Esch JH, Mul G, 
Kreutzer MT. Micromolding of solvent resistant microfluidic 
devices. Lab Chip. 2011;11:2035-8. 

Schaerli Y, Wootton RC, Robinson T, Stein V, Dunsby C, 
Neil MAA, French PMW, deMello AJ, Abell C, 
Hollfelder F. Continuous-flow polymerase chain reaction of 
single-copy DNA in microfluidic microdroplets. Anal Chem. 
2009;81:302-6. 

Song H, Chen DL, Ismagilov RF. Reactions in droplets in 
microfluidic channels. Angew Chem Int Ed. 2006; 
45:7336-56. 

Teh S-Y, Lin R, Hung L-H, Lee AP. Droplet microfluidics. Lab 
Chip. 2008;8:198. 

Theberge AB, Courtois F, Schaerli Y, Fischlechner M, Abell C, 
Hollfelder F, Huck WTS. Microdroplets in microfluidics: an 
evolving platform for discoveries in chemistry and biology. 
Angew Chem Int Ed. 2010;49:5846-68. 

Utada AS, Chu L-Y, Femandez-Nieves A, Link DR, Holtze C, 
Weitz DA. Dripping, jetting, drops, and wetting: the magic of 
microfluidics. MRS Bull. 2007;32:702-8. 

Wang J-T, Wang J, Han J-J. Fabrication of advanced particles 
and particle-based materials assisted by droplet-based 
microfluidics. Small. 2011;7:1728-54. 

Wong 1, Ho C-M. Surface molecular property modifications for 
poly(dimethylsiloxane) (PDMS) based microfluidic devices. 
Microfluid Nanofluid. 2009;7:291-306. 

Wootton RCR, deMello AJ. Microfluidics: exploiting elephants 
in the room. Nature. 2010;464:839M0. 

Zhao C-X, He L, Qiao SZ, Middelberg APJ. Nanoparticle 
synthesis in microreactors. Chem Eng Sci. 2011 ;66:1463-79. 


SEIR Spectroscopy 

► Surface Enhanced Infrared Absorption Spectroscopy 




Selective Plane Illumination Microscopy (SPIM) 


2307 


Selected Fit 

► Molecular Recognition: Lock-and-Key, Induced Fit, 
and Conformational Selection 


Selective Plane Illumination Microscopy 
(SPIM) 

Zeno Lavagnino*, Francesca Celia Zanacchi*’^ and 
Alberto Diaspro*’ 

'Department of Nanophysics, Italian Institute of 

Technology (IIT), Genoa, Italy 

^Department of Physics, University of Genoa, Genoa, 

Italy 

Synonyms 

Individual light sheet microscopy; Orthogonal plane 
fluorescence optical sectioning (OPFOS); Thin laser 
light sheet microscope (TSLM); Ultramicroscopy 

Definition 

Light-sheet-based microscopy is a fluorescence optical 
microscopy technique which employs a planar illumi¬ 
nation and an orthogonally oriented detection path. 

Basic Characteristics 

The concept of light sheet illumination applied to opti¬ 
cal microscopy was reported by Richard Zsigmondy 
for gold particle analysis in colloidal solutions in his 
Nobel Lecture on December 11, 1926. The particles, 
too small to be visible in an ordinary light microscope 
and usually suspended in a liquid, were illuminated 
with a “strong” light beam perpendicular to the optical 
axis of the microscope developed with H. Siedentopf in 
Jena (Siedentopf and Zsigmondy 1903), also known as 
ultramicroscope. Gold particles scatter light and their 
movements were seen only as flashes against a dark 
background while their structure was still not resolved. 
In 1993, a similar idea was further developed by Voie 
et al. (1993) with the Orthogonal Plane Fluorescence 


Optical Sectioning (OPFOS) microscope, in which 
tissue imaging was performed using an optical system 
based on two independent illumination and detection 
optical paths. The same concept and optical arrange¬ 
ment were adapted in a very effective way at Stelzer’s 
lab developing the Selective Plane Illumination Micro¬ 
scope (SPIM) (Huisken et al. 2004). The principle 
behind this technique relies on the basic idea of prim¬ 
ing fluorescence exploiting a planar illumination of the 
entire focal plane of the detection lens, which can be 
obtained illuminating the sample with a comparatively 
thin light sheet from the side. The static light sheet 
illuminates the sample, creating a plane of fluorescent 
light, and a widefield detection scheme is used to 
collect the fluorescence signal on a CCD camera, 
according to the vignette reported in Fig. 1 . 

The light-sheet-based excitation beam can be pro¬ 
duced in different ways: using a rectangular slit, 
a cylindrical lens or rapidly scanning a Gaussian 
beam in the back focal plane of the illumination objec¬ 
tive (Digital Scanned Laser Microscopy, DSLM) 
(Keller et al. 2008). The sample can be mounted in 
agarose gel and can be moved within the focal plane in 
order to perform three-dimensional imaging. Further¬ 
more, the ability to rotate the sample around its 
perpendicular axis, allows the reconstruction of the 
entire 3D tomography of the sample, by Multi View 
Reconstruction (MVR) technique (Huisken et al. 
2006). Light-sheet-based imaging methods combine the 
advantages of widefield fluorescence optical microscopy, 
such as fast acquisition and high dynamic range, with 
three-dimensional optical sectioning capabilities. Thus, 
they provide the capability to perform fast 3D imaging of 
living samples reducing photobleaching effects and rep¬ 
resent an incredibly powerful tool for imaging biological 
structures in depth. Furthermore light-sheet-based fluo¬ 
rescence microscopy provides a large number of advan¬ 
tages in comparison to other standard fluorescence 
techniques, such as an increased isotropy of the Point 
spread function of the system, compared to standard 
single-lens-based techniques, and an improved resolution 
while using low NA detection objectives. 

Several variants of light-sheet-based techniques, 
such as ultramicroscopy, single plane illumination 
microscopy (SPIM) (Huisken et al. 2004), digital 
scanned laser microscopy (DSLM) (Keller 
et al. 2008), and Planar illumination microscopy 
(Holekamp et al. 2008) have been further developed 
to provide 3D high resolution imaging of large 
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Selective Plane 
Illumination Microscopy 
(SPIM), Fig. 1 Light sheet 
illumination (blue) and 
widefield detection (green) in 
the SPIM. Illumination: CL 
cylindrical lens, t light sheet 
thickness, w light sheet width. 
Detection: OL objective lens, 
NA numerical aperture, fov 
field of view, wd working 
distance (From Huisken 
et al. 2006) 



biological specimens like embryos and mouse epithe¬ 
lial tissue. A combination of SPIM and tissue clearing 
can be utilized for brain imaging. Moreover, a recently 
developed individual molecule localization (IML) 
imaging technique (Celia Zanacchi et al. 2011a), 
based on light sheet illumination microscopy, allows 
to extend the application of super-resolution to thick 
biological samples in depth. In the last decade, SPIM 
and its variations on the theme knew an intense growth 
in optical fluorescence microscopy field and were 
extended with other techniques to further enhance 
their performances. As a matter of fact, many quanti¬ 
tative fluorescent techniques like Single Particle 
Tracking (► SPT), Fluorescence Correlation Spectros¬ 
copy (► Fluorescence Correlation Spectroscopy) 
(Wohland et al. 2010), or Fluorescence Lifetime Imag¬ 
ing Microscopy (FLIM) (Greger et al. 2011) were 
exploited with a light-sheet-based architecture. 
Recently, it was shown that selective plane illumina¬ 
tion schemes can be coupled also with > Two-Photon 
Excitation (Truong et al. 2011; Celia Zanacchi et al. 
201 lb) to further improve the penetration depth capa¬ 
bilities of such techniques. 


Cross-References 

► Fluorescence Correlation Spectroscopy 

► Fluorescence Three-Dimensional Optical Imaging 

► Fluorescence: General Aspects 

► Individual Molecule Localization Techniques 

► Optical Fluorescence Microscopy 

► Single Particle Tracking 

► Two-Photon Excitation Eluorescence Microscopy 
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Self-Assembled Protein Fibrils 

► Amyloid Formation 


Self-Assembly 

► Bacterial Flagellar Motor: Overview 

► Lipid Organization, Aggregation, and Self-assembly 

► Membrane Proteins: Folding and Stability 


Self-Consistent Charge Density Function 
Tight Binding (SCC-DFTB) 

► Carboxypeptidase A - Computational Studies 

► Protein Splicing - Computational Studies 

► Semiempirical Quantum Mechanical Methods 


SEM 

► Electron Microscopy: Classical Sample Preparation 


Semiempirical Quantum Mechanical 
Methods 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Synonyms 

Self-Consistent Charge Density Function Tight Bind¬ 
ing (SCC-DFTB) 

Definition 

Semiempirical quantum mechanical methods are quan¬ 
tum chemical electronic structure calculation techniques 
that are typically based on a formalism for ab initio 


quantum mechanics (e.g., molecular orbital) or density 
functional theory. They apply further approximations 
and use some parameters from empirical data. Although 
they are less accurate than higher level calculations, they 
are typically much faster. A popular approximation is 
called Modified Neglect of Diatomic Overlap (MNDO), 
which is applied in the AMI and PM3 methods often 
applied in QM/MM studies on enzymes. A more recent 
method that is often applied in ► QM/MM methods for 
enzyme-catalyzed reactions is self-consistent charge 
density functional tight binding, or SCC-DFTB, 
a semiempirical method based on the principles of 
density functional theory. 

Cross-References 

► QM/MM Methods 


Sensor 

► Cell Sensing with Raman Spectroscopy 


Sensory Rhodopsin I 

John L. Spudich 
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Synonyms 

Rhodopsin 

Definition 

Phototaxis defined in a general sense is motility 
behavior of an organism in response to light stimuli 
such as intensity gradients or light direction. 

Sensory rhodopsin I (SRI) is a visual pigment-like 
light-sensor that mediates color-sensitive phototaxis by 
the unicellular halophilic archeaon Halohacterium 
salinarum. 
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Sensory Rhodopsin I 




Basic Characteristics 
Introduction 

Sensory rhodopsin I (SRI), the first light-sensor 
discovered in a microorganism (Bogomolni and 
Spudich 1982), is a phototaxis receptor modulating 
the swimming behavior of H. salinarum in light 
intensity gradients. Like visual pigments, SRI is 
a membrane-embedded protein with seven transmem¬ 
brane helices forming a pocket for the chromophore 
retinal (vitamin A-aldehyde), which is covalently 
attached in a protonated Schiff base linkage to the 
s-amino group of a lysyl residue in the seventh helix. 
SRI uses photoisomerization of the retinal from 
all-rran^ to 13-cw as the photoreception mechanism. 
Other microbial sensory rhodopsins homologous to 
SRI and using the same photoreception mechanism 
have been found subsequently, including the algal 
light-sensors Chlamydomonas sensory rhodopsins 
A and B (CSRA and CSRB, also known as channelr- 
hodopsins-1 and -2) that have been shown to mediate 
phototaxis in C. reinhardtii (Sineshchekov et al. 2002) 
(see ► Channelrhodopsin). For reviews of these and 
other microbial sensory rhodopsins, see Jung (2007), 
Klare et al. (2008), and Spudich and Spudich (2008). 
For a comprehensive review of the rhodopsin super¬ 
family, see Spudich et al. (2000). 

Physiological Function 

An unusual feature of SRI among light-sensors is that it 
discriminates color. Its physiological function in the 
haloarchaeal cells is to mediate phototactic attraction 
to orange light used for photoenergy capture by light- 
driven ion pumps in the cytoplasmic membrane. How¬ 
ever, if the orange light is accompanied by potentially 
damaging near-UV light, SRI mediates strong repel¬ 
lent (avoidance) responses. Therefore the cells avoid 
direct sunlight such as occurs on the surface of salt 
marshes or solar evaporation ponds that haloarchaea 
inhabit and accumulate in regions in which the near- 
UV light has been attenuated, e.g., beneath algal 
patches in their environment. The mechanism of this 
color discrimination by the SRI protein is based on its 
alternative 1-photon and 2-photon photochemical 
reactions (Spudich and Bogomolni 1984). Single¬ 
photon excitation (orange) and 2-photon excitation 
(orange followed by a near-UV photon absorbed by 
a photocycle intermediate) produce opposite effects 
(inhibition and activation, respectively) on the 


histidinyl kinase CheA. CheA, named for its participa¬ 
tion in chemotaxis behavior, modulates switching 
of the rotation direction of the flagellar motors of the 
cell (Fig. 1). 

Molecular Structure and Function 

Sensory rhodopsin I is assembled in a 2:2 complex 
with the membrane-embedded transducer Htrl. Htrl is 
a homodimer that binds CheA at its distal end in the 
cytoplasm. CheA controls swimming reorientation 
frequency of the cells by phosphorylation of the 
CheY response regulator protein. Structural models 
of the receptor-transducer complex have been based 
on X-ray crystal structures of the closely related 
mid-spectrum repellent phototaxis receptor sensory 
rhodopsin II (SRII) in complex with Htrll, also from 
haloarchaea (see ► Sensory Rhodopsin II: Signal 
Development and Transduction). 

Evidence from biophysical techniques such as laser 
flash photolysis, electron paramagnetic resonance 
(EPR), and time-resolved Eourier transform infrared 
spectroscopy combined with mutagenesis has demon¬ 
strated two molecular processes to be critical for signal 
relay from the SRI and SRII receptor subunits to their 
cognate Htr transducer subunits. One is a protein con¬ 
formational change similar to that occurring in the 
photocycle of the light-driven proton pump bacterio- 
rhodopsin (BR). Light induces BR to release its Schiff 
base proton to the outside followed by 
a conformational change that (1) switches the accessi¬ 
bility of the Schiff base from the outside to the cyto¬ 
plasmic side and (2) opens a cytoplasmic channel to 
ensure reprotonation from the cytoplasmic side. The 
largest protein motion involved is a light-induced out¬ 
ward movement of helix E involved in the opening of 
the cytoplasmic channel. Evidence is compelling that 
similar conformers as in the BR pumping cycle are 
crucially involved in signaling by both SRI and SRII, 
repellent signaling entailing the switch of the Schiff 
base from outward-to-cytoplasmic accessibility and 
outward helix F displacement, and attractant signaling 
entailing the opposite, switch of the Schiff base from 
cytoplasmic-to-outward accessibility and inward helix 
F motion (Fig. 2) (Sineshchekov et al. 2008; Klare 
et al. 2008; Sasaki et al. 201 1). 

A second chemical process required for SRI and 
SRII signaling, not present in the BR photochemical 
reaction, is a steric conflict of the retinal with the 
protein during the photoisomerization process. This 
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Sensory Rhodopsin I, Fig. 1 Color-discriminating photo¬ 
chemical reaction cycle of H. salinarum SRI and its coupling 
to the flagellar motor. Orange and violet arrows indicate light 
reactions. Wavelength maxima in nanometers are indicated for 
the unphotolyzed state SRI and photointermediate species. 
S designates I-photon intermediates and S’’ the 2-photon inter¬ 
mediate shown (the existence of a brief 4I0-nm S'’ state imme¬ 
diately prior to S'^SIO has also been detected). Indicated 


signaling states transmit signals through the transducer Htrl 
which binds the histidinyl kinase CheA. The repellent signal 
transiently activates CheA which phosphorylates CheY. 
CheY-P binds to the flagellar motor switch and induces 
a reversal of the cell’s swimming direction. The attractant signal 
transiently suppresses CheA activity, thereby suppressing swim¬ 
ming reversals. The 1-photon photocycle halftime is 800 ms and 
the 2-photon photocycle halftime is 80 ms 


Kinase-Activating 

Conformer 


inwardiy Connected 
Schiff base & 
Outward Position of 
Helix F 


SRI-Htrl attractant 
conformer 


2-photon product of 
SRI-Htrl attractant 
conformer 

SRII-Htrll 

photoproduct 



High swimming 
reversal frequency 



( His-kinase ) 

Low swimming 
reversal frequency 


Kinase-inhibiting 
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Outwardly Connected 
Schiff base & 
Inward Position of 
Helix F 


SRI-Htrl repellent 
conformer 

1-photon product 
of SRI-Htrl attractant 
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SRII-Htrll 


Sensory Rhodopsin I, Fig. 2 Model of the role of retinylidene 
Schiff base (SB) connectivity conformers in dark states and 
functional intermediates of SR-Htr complexes. Connectivity of 
the Schiff base means the direction, inward or outward, of Schiff 


base proton release or uptake following photon absorption. 
Product and photoproduct mean photointermediates of 
photocycles that end in initial thermostable conformations 
(Redrawn from Sineshchekov et al. 2008; Sasaki et al. 2011) 


“steric trigger” in SRI occurs between the 13-methyl 
group of the retinal and a protein residue, very likely 
Leu84 based on modeling the SRI structure with that of 
SRII. Without this interaction SRI does not form 
a primary photoproduct and returns from the excited 
state to the ?i\\-trans retinal ground state without con¬ 
formational changes or signaling function. Results 
from low temperature flash photolysis suggest 
a model in which the retinylidene 13-methyl group’s 


steric contact with Leu84 functions as a fulcrum to 
permit movement of one or both ends of retinal to 
overcome an energy barrier against isomerization. 
The steric trigger in SRII (studied in Natronomonas 
pharaonis SRII) is very different from that of SRI in its 
nature in that (1) it occurs between residue Thr204 and 
C14H in the retinal polyene chain (Ito et al. 2008), and 
(2) its absence does not prevent retinal isomerization 
nor a photochemical reaction cycle including 
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deprotonation of the retinylidene Schiff base, but does 
prevent signal relay to Htrll. Furthermore when the 
SRII steric trigger residue is engineered into BR linked 
to Htrll, the artificial BR-Htrll complex mediates 
robust repellent phototaxis responses (Sudo and 
Spudich 2006). Without the triggering Thr residue, 
BR-Htrll does not produce detectable phototaxis, indi¬ 
cating the conformational change discussed above is 
insufficient for signal relay to the Htr transducer. The 
interplay between the contributions of the steric trigger 
and canonical conformational change to the signaling 
process remains to be elucidated. 


Cross-References 

► Channelrhodopsin 

► Flash Photolysis 

► Membrane Proteins: Structure and Organization 

► Sensory Rhodopsin II: Signal Development and 
Transduction 
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Synonyms 

Phoborhodopsin 


Definition 

Sensory rhodopsin II belongs to the microbial rhodop¬ 
sins, which constitute a family of seven-helix mem¬ 
brane proteins with the chromophore retinal. Members 
of this family are distributed throughout the Bacteria, 
Archaea, and Eukaryota. These photoactive proteins 
use a common structural design for two distinct 
functions: light-driven ion transport and phototaxis. 
The sensors start a signal transduction chain similar 
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to that of the two-component system of euhacterial 
chemotaxis. The connecting membrane protein 
between the photoreceptor and the following cytoplas¬ 
mic signal cascade is formed by a transducer molecule 
that binds tightly and specifically to its cognate 
receptor by means of two transmembrane helices 
(TMl and TM2) (Gordeliy et al. 2002). 

Basic Characteristics 

The discovery of purple membrane from 
Halobacterium salinarum and its constituent bacterio- 
rhodopsin (bR) more than 40 years ago (Oesterhel and 
Stoeckenius 1971) caused an intense interest in the 
mechanism and function of the light-activated proton 
pump. This seven-helix membrane protein carrying a 
retinal chromophore was originally thought to 
be a bacterial rhodopsin - hence its name - providing 
a model system for vision in vertebrates. However, it 
soon became evident that, despite structural similari¬ 
ties, the functions were quite different, with rhodopsin 
being a photosensor and bR being a photoreceptor, 
converting light energy into a proton gradient (for 
a review on microbial rhodopsins see Klare et al. 
(2007) or Spudich (2006)). 

Further research on H. salinarum the archaeal 
strain, from which bR was first isolated, found other 
rhodopsin-like pigments with functions other than pro¬ 
ton pumping. The first evidence was provided by 
action spectra of halobacterial phototaxis. In this 
work, two photosystems were recognized, enabling 
the bacteria to avoid harmful blue light, while seeking 
optimal conditions for the function of bR. Subse¬ 
quently, two responsible pigments were identified. 
Sensory rhodopsin I, SRI, (Bogomoln and Spudich 
1982) enables H. salinarum to swim to favorable 
light (above 500 nm), and, in a two photon process, 
to flee conditions of UV light. A second sensor, 
sensory rhodopsin II, SRII, (also named 
phoborhodopsin), is responsible for avoiding blue 
light, which is especially important for the bacteria to 
escape conditions of oxidative stress. Further investi¬ 
gations showed ion pump activity different to that of 
bR (Matsuno-Yagi and Mukohata 1977) which turned 
out to be a chloride pump. Thus, in the early 1980s, 
four different pigments had been identified in 


H. salinarum. These can be grouped info two subfam¬ 
ilies: bR and HR, which function as ion pumps, gener¬ 
ating an ion gradient across the membrane, and SRI 
and SRII, which serve the bacteria as photoreceptors to 
find favorable light conditions at low oxygen concen¬ 
tration. Originally thought to be confined fo Archaea, 
they have been detected in all three kingdoms of life 
and the family microbial rhodopsin was coined. Most 
of the data presently available were obtained from 
sensory rhodopsin II from Natronomonas pharaonis 
(ApSRII) (see Klare et al. 2007; Spudich 2006). 

After excitation by blue-green light NpSRII trig¬ 
gers, by means of a tightly bound transducer protein 
(ApHtrll), a signal transduction chain homologous to 
the two-component system of euhacterial chemotaxis 
(Fig. la). Two molecules of ApSRII and two molecules 
of A/pHtrll form a 2:2 complex in membranes as shown 
by electron paramagnetic resonance and X-ray 
structure analysis (Fig. lb. Video 1). VpHtrll displays 
not only a high homology to the cognate transducer of 
sensory rhodopsin I (SRI) but also to enterobacterial 
chemotactic receptors. The cytoplasmic domain of 
VpHtrll - as deduced from data of chemotactic 
receptors - is assumed to consist of a coiled-coil rod. 
Its tip is involved in binding of a histidine kinase, 
CheA, and in the middle of the rod dimer methylation 
sites are found which are involved in adaptation 
processes. Further investigations on VpHtrll enabled 
to determine the X-ray structures of the photocycle 
intermediates K and late M (M2) explaining the 
evolution of the signal in the receptor after retinal 
isomerization and the transfer of the signal to the 
transducer in the complex. The formation of late 
M has been correlated with the formation of the sig¬ 
naling state (Moukhametzianov et al. 2006; Gushchin 
et al. 2011). From these data, a mechanism for the 
light-induced activation of the signaling complex was 
deduced. On excitation by light, retinal isomerization 
leads in the K state to a rearrangement of a water 
cluster that partly disconnects helices C and G of the 
receptor. In the transition to late M the changes in the 
hydrogen bond network proceed further. In addition, 
the transfer of a proton from the Schiff base to Asp 75 
weakens the electrostatic forces between helices C and 
G. Thus, in late M state, an altered tertiary structure 
establishes the signaling state of the receptor. The 
transducer responds to the activation of the receptor 
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Sensory Rhodopsin II: 
Signal Development and 
Transduction, Fig. 1 (a) 

After light activation the 
signal starts from the receptor 
SRII (red), is transferred to 
Htrll {green), propagates 
along the coiled-coil structure 
and activates the His kinase 
CheA which phosphorylates 
CheY and CheB. CheY binds 
to the flagellar motor for 
changing the direction of 
rotation. CheB demethylates 
aspartic acid groups in the 
middle of the coiled-coil 
structure which is 
continuously methylated by 
CheR. (b) Side view of the 2:2 
complex of SRII and Htrll in 
red and green, respectively 
(ES extracellular side, CS 
cytoplasmic side). The 
hydrophobic core of the 
complex is indicated by dotted 
lines. The complex is colored 
according to B-factor 
mobility: lightQess mobile), 
dark (mobile) 



by a clockwise rotation of about 15° of helix TM2 and 
a displacement of this helix by 0.9A° at the 
cytoplasmic surface (see Video 2). 

Although the development of the signal from the 
triggering of the receptor by light to the activation of 
the transmembrane domain of the transducer is quite 
well understood, the further transmission to the activa¬ 
tion of the His kinase CheA and the mechanism of 
signal amplification is not clarified. The homology 
with the cytoplasmic methyl-accepting domains of 
chemotactic receptors like Tar suggests a common 
mechanism of signal transfer. 

For an understanding of the signal transfer, the 
membrane proximal linker region is of decisive impor¬ 
tance. It contains two HAMP domains (conserved in 
histidine kinases, adenylyl cyclases, methyl-accepting 
chemotaxis proteins, and phosphatases) for which 
a role as signaling transducing unit was proposed. 
Electron paramagnetic resonance data indicated that 
the HAMP domain is engaged in a “two-state” 


equilibrium between a highly dynamic (dHAMP) and 
a more compact (cHAMP) conformation (Doebber 
et al. 2008). The structural properties of the cHAMP 
as proven by mobility, accessibility, and intra-trans- 
ducer-dimer distance data are in agreement with the 
four helical bundle NMR model of the HAMP domain 
from Archaeoglohus fulgidus (Hulko et al. 2006). In 
addition, a recent molecular dynamics study enlightens 
the structural and dynamical aspects of both HAMP 
domains (Gushchin et al. 201 lb). 

It has been shown that chemoreceptors form trimer 
of dimers which cluster into higher aggregates. It is 
possible that also photoreceptors might form such 
arrays. If this can be proven, a mechanism of transfer 
and amplification of an incoming signal can be 
envisioned. In fact, the notion that a two-state 
equilibrium between populations with different 
degrees of dynamics can be the language of the 
activation/deactivation of the cytoplasmic domain 
was also put forward by (Hazelbauer et al. 2008). 
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Sensory Rhodopsin II: Signal Development and Transduc¬ 
tion, Video 1 Movie of the structure of the complex SRII/Htrll 
in redIgreen. Darker parts of the complex protrude out of the 
membrane surface; lighter parts are within the membrane. The 
retinal is in purple color 



Sensory Rhodopsin II: Signal Development and Transduc¬ 
tion, Video 2 Movie of part of the complex: Ground and 
signaling states of helices F and G are shown as a string presen¬ 
tation in red and yellow, respectively. The other parts of SRII are 
depicted in ribbon or surfaces. Ground and signaling states of 
Htrll are colored in gray and green, respectively 


Cross-References 

► Channelrhodopsin 

► Rhodopsin: Stability and Structural Organization in 
Membranes 

► Sensory Rhodopsin I 
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Definition 

Simulation of spin-label EPR spectra comprises a set 
of computational methods that allow extraction of 
magnetic and dynamic parameters such as rotational 
correlation times from EPR spectra by way of comput¬ 
ing theoretical EPR spectra for given parameter sets 
employing various levels of spin physics and 
reorientational dynamics theory and using 
least-squares htting algorithms to fit these spectra to 
the experimental ones. 

Introduction 

The EPR spectrum (► Spin-Labeling EPR of Proteins: 
Dynamics and Water Accessibility of Spin-Label Side 
Chains, ► EPR Spectroscopy: General Principles) of 
a nitroxide spin label attached to a biomolecule 
(► Chemistry of Spin Labeling, ► Spin Labeling of 
DNA and RNA, ► Spin-Labeling EPR of Proteins: 
Dynamics and Water Accessibility of Spin-Label 
Side Chains) is affected by the reorientational dynam¬ 
ics (► Spin-Labeling EPR of Proteins: Dynamics and 
Water Accessibility of Spin-Label Side Chains) of the 
nitroxide ring. The dynamics happens on three levels 
(see Pig. 1): (a) the spin label reorients locally with 
respect to the protein (or other hiomolecule) it is 
attached to, (b) the protein rearranges internally via 
side chain and backbone motions, and (c) the protein 
tumbles globally with respect to the matrix, which is 
either an isotropic solution or an anisotropic environ¬ 
ment like a lipid bilayer. The magnetic parameters of 
the spin label (► Spin-Labeling EPR of Proteins: 
Dynamics and Water Accessibility of Spin-Label 
Side Chains, ► EPR Spectroscopy: General Principles; 
effective g value and hyperfine coupling), determined 
by the orientation of the label relative to the external 
magnetic field, vary in time and lead to dynamic broad¬ 
ening effects in the EPR spectrum. 

Information about the dynamic processes can be 
extracted from the EPR spectra by computational 
methods that simulate the spectra based on a model 
and fit the model to the experimental data. The 
appropriate theory level for spectral simulations 
depends on the timescale of the dynamics, summarily 
characterized by the rotational correlation time Tc, 
compared to the timescale of the EPR experiment, 
which is about the inverse of the EPR spectral width 



Simulation of Spin-Label EPR Spectra, Fig. 1 Local, 
internal, and global reorientational motions of nitroxide spin 
label and protein in a matrix 
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Simulation of Spin-Label EPR Spectra, Fig. 2 Motional 
regimes for nitroxide spin labels 


Act). Several motional regimes are distinguished: the 
rigid and the isotropic limits, and the slow-motion and 
fast-motion regimes (see Fig. 2). 

Slow-Motion Regime 

If the reorientational motion of the spin label is on 
a timescale similar to the EPR timescale, the EPR 
spectrum is broadened. Simulation methods for this 
regime differ in how the different levels of orienta¬ 
tional dynamics are treated and can use either 
deterministic or stochastic models. Deterministic 
models are based on atomistic molecular dynamics 
(MD) simulations (► Molecular Dynamics Simula¬ 
tions of Carbohydrates) and are able to treat complex 
local and internal dynamics. Stochastic dynamics (SD) 
models describe the reorientational motion as diffusion 
of a single-particle rigid rotor and can account for 
simple local or global rotational dynamics. The most 
common stochastic model is Brownian rotational dif¬ 
fusion, a random walk in three-dimensional orienta¬ 
tional space. The anisotropy of the reorientational rates 
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due to the nonspherical shape of the spin label is 
described by an anisotropic diffusion tensor whose 
orientation is tied to that of the nitroxide. For a free 
spin label in solution, the local environment is 
isotropic. On the other hand, it is generally anisotropic 
for a spin label bound to a protein or other biomolecule, 
resulting in preferential alignments and inaccessible 
orientations. The free energy of the label is a function 
of its orientation /2 and is described by an orientational 
potential 

In SD models, the global dynamics of the protein 
can be treated at several levels. The MOMD 
(microscopic order macroscopic disorder) approach 
assumes the protein molecules in the sample are 
randomly oriented and immobile on the timescale of 
the local spin label dynamics, which is treated 
stochastically. The SRLS (slowly relaxing local 
structure) model allows for global stochastic dynamics 
of the protein in the matrix (Earle and Budil 2006). 

Stochastic Liouville equation 

Simple SD models such as Brownian diffusion in 
a restricting potential are best simulated by solving 
the stochastic Liouville equation (SEE) which 
describes the joint time evolution of both the quantum 
spin degrees of freedom and the classically treated 
label orientation, starting with transverse magnetiza¬ 
tion and an equilibrium orientational probability 
distribution of an ensemble of spin labels determined 
by U(S1). A very efficient SEE solver was developed 
by Freed and coworkers (Earle and Budil 2006). The 
orientational probability distribution and the potential 
are expressed as linear combinations of rotational basis 
function. The computational effort scales with the 
number of basis functions, which increases with 
decreasing diffusion rate and increasing complexity 
of the potential. In principle, the SEE approach can 
be used to simulate a rigid-limit spectrum. However, 
the basis size required for achieving converged spectra 
is large, so that dedicated rigid-limit methods are 
preferred. 

Trajectories 

As a more general alternative to solving the SEE for 
a stochastic model, EPR spectra can be computed from 
sets of explicit orientational trajectories that describe 
the change in time of the orientation of the label. This 
method is applicable to trajectories obtained from both 
stochastic and deterministic models. A trajectory 


determines the time dependence of the magnetic 
parameters and therefore the EPR spectrum of 
a tumbling spin label. Given a trajectory, the time 
evolution of the magnetization following a 90° pulse 
is computed using spin density matrix or Bloch 
magnetization vector propagation (Sezer et al. 2008). 
From the resulting free induction decay (FID), the EPR 
spectrum is obtained by Fourier transformation. To 
generate a converged spectrum, FIDs of a set of trajec¬ 
tories generated from a range of possible initial 
orientations of the spin label have to be combined. 
The appropriate time resolution of the trajectories is 
determined by the spectral width via the Nyquist 
criterion. The length of the trajectory is determined 
by the required spectral resolution and has to be of 
the order of the transverse relaxation time T 2 (► Pulsed 
EPR: Principles and Examples of Applications to 
Hemeproteins) to yield accurate EPR spectra. 

The computation of SD trajectories is fast, and 
the number and length of trajectories required for 
accurate spectral simulations are easily obtained. 
MD trajectories are computationally significantly 
more demanding. The expense of computing long 
MD trajectories can be avoided by relying on extrapo¬ 
lation (Oganesyan 2007), or by deriving an effective 
orientational potential from short MD trajectories and 
then using it to generate SD trajectories (Steinhoff and 
Hubbell 1996) or to solve the SEE (Budil et al. 2006). 
A single MD trajectory can be reused several times by 
rotation or resampling. In general, despite being much 
slower than the SEE approach, trajectory methods are 
superior in the complexity of dynamics than can be 
modeled. Global dynamics cannot rely on MD simu¬ 
lations, so stochastic models are used (DeSensi 
et al. 2008). 

The methodology for simulating slow-motional 
EPR spectra of two coupled spin labels attached to 
the same macromolecule is not as developed as for 
single labels. There exist methods based on the SEE 
(Zerbetto et al. 2007) and on trajectories (Hustedt 
et al. 1997) for the simple case of a tumbling protein 
with two rigidly attached labels. 

Rigid Limit 

In the rigid limit, the spin labels are immobilized. 
Methods for the simulation of rigid-limit spectra of 
spin labels are identical to the ones used for transition 
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metal centers (see ► Simulation of Transition-Metal 
EPR Spectra). Perturbation theory methods based on 
the high-field approximation are usually adequate. 

The solid-state spectrum of two coupled spin labels 
is broadened relative to the single-label spectrum by 
the distance-dependent dipolar coupling between the 
two nitroxides and can be simulated using either 
matrix diagonalization or, within the high-field 
approximation, perturbation theory (Hustedt 
et al. 1997). Spectra have to he integrated over 
nonuniform or partially correlated orientational distri¬ 
butions. A simplified decoupled method that convolves 
a single-nitroxide spectrum with a spectrum (Pake 
pattern) representing the dipolar coupling between 
the two nitroxides can be used to estimate distances 
between spin labels in doubly labeled systems 
(► Interspin Distance Determination by EPR; 
Steinhoff et al. 1997). 

Fast-Motion Regime and Isotropic Limit 

If the reorientational motion of a nitroxide spin label is 
fast compared to the EPR timescale, the EPR spectrum 
consists of three lines of different widths that depend 
on the details of the reorientational dynamics. 
Analytical expression based on Redfield perturbation 
theory can be used to simulate such spectra 
(Atherton 1993). The algorithms used for the 
slow-motion regime can be used as well. In the isotro¬ 
pic limit, where the motion is orders of magnitude 
faster than the EPR timescale, the Breit-Rabi equation 
affords an exact method without recurrence to 
perturbation theory approximations. 


Cross-References 

► Chemistry of Spin Labeling 

► EPR Spectroscopy: General Principles 

► Interspin Distance Determination by EPR 

► Molecular Dynamics Simulations of Carbohydrates 

► Pulsed EPR: Principles and Examples of 
Applications to Hemeproteins 

► Simulation of Transition-Metal EPR Spectra 

► Spin Labeling of DNA and RNA 

► Spin-Labeling EPR of Proteins: Dynamics and 
Water Accessibility of Spin-Label Side Chains 
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Definition 

Simulation of transition-metal EPR spectra comprises 
a set of computational methods that allow extraction of 
magnetic spin Hamiltonian parameters from one or 
a set of EPR spectra by way of computing theoretical 
EPR spectra for given parameter sets employing vari¬ 
ous levels of spin physics theory and using least- 
squares fitting algorithms to fit these spectra to the 
experimental ones. 

Introduction 

Solid-state EPR spectra of biological transition-metal 
centers and clusters contain a wealth of information on 
geometric and electronic structure. This information is 
extracted in two steps. Eirst, the spectrum is simulated 
and htted using a phenomenological spin Hamiltonian 
model (Weil and Bolton 2007; Mabbs and Collison 
1992; Pilbrow 1990), yielding a set of magnetic and 
electronic parameters. In the second step, these param¬ 
eters are interpreted and correlated to the geometric 
and electronic structure of the spin center using either 
analytical tools like ligand held theory or numerical 
quantum chemical calculations (Kaupp et al. 2004). 

To simulate an EPR spectrum, a spin Hamiltonian 
model including number and types of spins and type 
and symmetry of the coupling tensors is chosen at the 
outset. The corresponding EPR spectrum is then com¬ 
puted as the sum of spectra from all the differently 
oriented spin centers in the sample, which in turn are 
sums over all transitions (energy level pairs). Each 
energy level pair from each spin center generally 
gives rise to a single resonance line with a certain 
position, intensity, width, and shape. This sum-over- 
lines approach for spectral simulations is valid in 
almost all situations encountered in practice, except 
for looping transitions, where the microwave quantum 
is able to bridge two energy levels for a set of orienta¬ 
tions, but not for others (Gaffney and Silverstone 
1998). 

Line Positions 

EPR line positions can be computed at two levels of 
theory, perturbation theory (PT) and numerical matrix 
diagonalization (ND). PT uses closed-form analytical 


solutions for the energy levels as a function of the 
magnetic held to compute the resonance helds (Weil 
and Bolton 2007). These expressions are valid under 
a set of assumptions, most importantly that the electron 
Zeeman interaction is much larger than any other 
interaction (high-held approximation). Often, the 
nuclear Zeeman and quadrupole interactions are 
neglected as well. PT approaches are very fast and 
the method of choice for simple systems containing 
one electron spin. Second-order methods are more 
accurate than hrst-order methods, and third-order 
methods are occasionally employed. PT methods 
have been implemented in many programs, but very 
few of them validate the assumptions for the particular 
input parameters. ND methods construct and numeri¬ 
cally diagonalize the full spin Hamiltonian matrix of 
the system of coupled spins for several held values to 
obtain energy levels over a held range. The resulting 
energy level diagram is then used to locate resonance 
line positions. Computationally, ND methods are sig- 
nihcantly more expensive than PT methods, but are 
accurate since no mathematical approximations or 
physical assumptions are involved. 

Line Intensities 

The intensity of an EPR absorption line is determined 
by several factors: (a) the excitation bandwidth factor, 
(b) the transition probability, (c) the l/g factor, and (d) 
the polarization factor, (a) The excitation bandwidth 
factor describes to what degree a transition with energy 
difference AE is excited by the given microwave quan¬ 
tum For continuous-wave EPR spectra, this is 
a simple delta function, (b) The transition probability 
is given by the squared matrix element of the transition 
operator and is computed from eigenvectors obtained 
either from PT or ND. The transition operator depends 
on the held conhguration of the resonator. The com¬ 
mon Anis = ± 1 experiments are performed in resona¬ 
tors where the magnetic held component of the 
incident microwave is perpendicular to the static mag¬ 
netic held. Eor detecting Am^ = 0 transitions in inte¬ 
ger-spin systems, resonators with a parallel 
conhguration are employed and require a different 
operator, (c) The l/g factor is a correction factor that 
accounts for the fact that EPR spectra are held sweeps 
rather than frequency sweeps. This factor is transition 
dependent and is the inverse of the held derivative of 
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the energy level separation, (dAit/clS)”*, which 
reduces to 1/g for a spin-1/2 system, (d) Lastly, the 
line intensity depends on the polarization, that is, 
population difference, between the two energy levels 
involved. Under thermal equilibrium, this is tempera¬ 
ture dependent and given by the Boltzmann distribu¬ 
tion. For a spin-1/2 system, the polarization factor can 
usually be neglected since it affects only the overall 
intensity, but not the shape of the spectrum. For high- 
spin systems with large zero-field splittings measured, 
its inclusion is essential for accurately modeling the 
relative intensities of different lines in low- 
temperature spectra. 

Line Widths and Shapes 

Individual resonance lines have nonzero widths for 
several reasons, (a) Lines show homogeneous lifetime 
broadening due to relaxation. The resulting line shape 
is Lorentzian. (b) Many splittings due to very small 
electron-nuclear and electron-electron couplings are 
unresolved in the EPR spectrum and give rise to 
a residual dipolar line broadening. In spectral simula¬ 
tions, this inhomogeneous broadening is summarily 
modeled by a Gaussian shape with orientation- 
dependent width, (c) Lines are also broadened due to 
structural heterogeneity between nominally identical 
spin centers in the sample. This gives rise to spin 
Hamiltonian parameter distributions called strains 
(g strain, D strain, etc.). In simulations, they are taken 
into account either by using analytical expressions and 
Gaussian shapes if they are narrow or by explicitly 
integrating over the distributions if they are wide, 
(d) The line width and shape is also affected by field 
modulation. Small-amplitude modulation is modeled 
by using derivatives of the line shapes, whereas 
overmodulation is simulated by convoluting the EPR 
absorption spectrum with a mathematical function 
describing the modulation. 

Frozen Solutions and Crystals 

EPR spectra of biological spin centers are most often 
acquired in frozen solutions. There, the orientations of 
the spin centers are random, and there is no preferential 
alignment in space. The EPR spectrum is the sum of 
the EPR spectra of all centers. Even though a typical 


sample contains lO'^-lO*^ differently oriented spin 
centers, only a small subset (N = 10-10^) is used in 
a simulation. The computation timescales linearly with 
N, but simulated spectra converge to the true powder 
spectra only very slowly with increasing N. If N is too 
small, the spectrum shows spurious peaks due to the 
incomplete sampling of the orientational distribution. 
Methods for the choice of the N orientations range 
from Monte Carlo to sets that minimize the total repul¬ 
sion energy between orientations. Except the worst 
ones, all methods perform roughly similarly. Interpo- 
lative methods that use spectral data computed for two 
close orientations to efficiently obtained spectral data 
for orientations in between are applied in several pro¬ 
grams (Hanson et al. 2004; Stoll and Schweiger 2006). 
They substantially increase the convergence rate and 
allow N to be much smaller than in brute-force sum¬ 
mation approaches. 

Occasionally, protein single crystals are studied. 
Depending on the space group of the crystal, there 
might be up to 24 differently oriented protein sites in 
the unit cell. In addition, the protein might contain 
several symmetry-related spin centers. To compute 
a single-crystal EPR spectrum, spectra of all the unit 
cell sites and the spin centers have to be simulated and 
added. 


Fitting 

In order to fit a spin Hamiltonian model to an experi¬ 
mental EPR spectrum, least-squares fitting algorithms 
are used that iteratively simulate spectra starting from 
a set p of initial guess parameter values for the chosen 
spin Hamiltonian model and then adjust p until the 
deviation between the simulated spectrum ysim(p) and 
the experimental spectrum yexp is minimal. Methods 
differ in two respects: (a) in the way the error between 
simulation and experiment is computed, (b) in the 
algorithm that chooses successive parameter sets, (a) 
The error is always computed as the sum of the squared 
residuals between experiment and simulation. How¬ 
ever, jsim and yexp are either used directly, or they 
are integrated, double integrated, or even Fourier 
transformed before the computation of the residuals. 
Integrals and double integrals tend to have larger con¬ 
vergence radii, whereas the Fourier transform can be 
advantageous for spectra with many narrow lines, 
(b) There is a vast gamut of available fitting 
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algorithms. They fall into two classes, global and 
local. Global methods such as Monte Carlo and 
genetic/evolutionary algorithms are able - at least in 
principle - to locate the global minimum. Local 
methods, such as the Simplex and Levenberg/ 
Marquardt algorithms, can only find a minimum close 
to the starting set of parameters, but are more efficient. 
Hybrid methods that combine elements of both classes 
are becoming more popular due to their robustness. 

Programs 

Currently, the most widely adopted general software 
packages for EPR spectral simulations are Simfonia 
(implementing PT only, commercial), XSophe (ND 
only, commercial) (Hanson et al. 2004), and EasySpin 
(both PT and ND, noncommercial) (Stoll and 
Schweiger 2006). Many others are used, but are mostly 
tailored to specific applications. 

Cross-References 

► Simulation of Spin-Label EPR Spectra 
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Synonyms 

Fluorescent intermittency; Photoblinking; Photoswitch 


Definition 

Light-emitting species, including organic dye molecules, 
fluorescent proteins and semiconductor particles, are 
prone to enter transient dark states, collectively referred 
to as “blinking” events. Depending on the nature of 
the fluorophore species, the illumination source and 
intensity, as well as the environmental conditions, 
blinking events can range from rare to frequent and last 
from microseconds to minutes. Blinking events are 
distinguished from photobleachmg, the irreversible loss 
of fluorescence, by their temporary nature. 

Temporary dark states can arise for a variety of 
reasons. In the case of organic fluorophores, transient 
dark states may occur due to reversible isomerization of 
the conjugated n orbital system, resulting in temporary 
formation of a low-fluorescent isomer and thus cycles of 
bright and dark states. Organic fluorophores may 
also undergo intersystem crossing, a low probability 
electronic transition to a triplet state, which is thought 
to arise from a “forbidden” inversion of an excited 
electron’s magnetic dipole moment, or spin. Triplet 
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states have a relatively long lifetime and high energy 
(Turro et al. 2010), rendering the fluorophore vulnerable 
to downstream reactions. For instance, organic 
fluorophores in the triplet state are prone to reactions 
with reducing or oxidizing agents that generate radical 
ions (Vogelsang et al. 2008). Such entities can exhibit 
substantially longer dark states, often lasting seconds to 
minutes depending on fluorophore environment. 

Inorganic semiconductor particles, such as Quantum 
Dot (QD) species, are prone to temporary dark states 
through the ejection of one or more electrons from the 
particle’s core. Such events leave behind a positively 
charged “hole,” rendering the particle nonfluorescent 
until the QD is neutralized by acquiring one or more 
free electrons from the environment (Frantsuzov et al. 
2008). The probability of electron loss and electron gain 
depends on the size and composition of the semicon¬ 
ductor particle and can be influenced by environmental 
conditions. 

The blinking of fluorescent proteins can be caused 
by photoisomerization of the chromophore within the 
protein core to distinct protonation states (Remington 
2006). Correspondingly, the probability and duration 
of blinking events vary substantially depending on the 
nature of the chromophore, the surrounding residues in 
the protein and the protein fold. 

Under physiological conditions, blinking events are 
transient in nature and relatively rare. This is due to the 
presence of molecular oxygen in solution which can 
photo-oxidize and chemically damage fluorophores 
that have entered triplet, dark states (Levitus and Ranjit 
2011). Such reactions typically result in photobleaching 
(see the chapter of single fluorophore photobleaching). 
While the removal of molecular oxygen can substan¬ 
tially reduce the rate of photobleaching, such efforts 
tend to exacerbate the duration of blinking events by 
reducing the number of relaxation pathways out of the 
triplet state. As blinking events reduce the flux and 
number of photons emitted, substantial efforts have 
been made to mitigate their occurrence and/or duration. 
Chemical additives, such as p-mercaptoethanol (BME), 
Trolox (a vitamin E derivative), cyclooctatetraene 
(COT), 4-nitrobenzyl alcohol (NBA), n-propyl gallate, 
and l,4-diazabicyclo[2.2.2]octane (DABCO), have 
come into widespread use for this purpose as they 
serve to facilitate relaxation from triplet dark states 
when they arise. By reducing the duration of blinking 
events, such compounds also tend to reduce the proba¬ 
bility of photobleaching (Levitus and Ranjit 2011). 


Such compounds may operate as electron donors as 
they have also proven effective for the suppression of 
QD blinking (Hohng and Ha 2004). It is important to 
note, however, that fluorophore blinking can be lever¬ 
aged as a means of identifying single, light-emitting 
species in a complex milieu. Such efforts have recently 
led to fundamental breakthroughs in fluorescence imag¬ 
ing, referred to collectively as super-resolution imaging. 
This approach takes advantage of blinking phenomena 
to localize single fluorophores with high spatial resolu¬ 
tion. In so doing, the limits of optical resolution can be 
extended below the Rayleigh diffraction limit into the 
nanometer regime. (See the chapter of ► Photoactivated 
Localization Microscopy (PALM) and ► Stochastic 
Optical Reconstruction Microscopy for details.) 

In addition to biomedical applications that rely 
heavily on fluorophores, the tunable control of blinking 
(and correspondingly photobleaching) may also have 
an impact on disparate fields such as quantum comput¬ 
ing, electronics and photovoltaics, that each stand to 
benefit from enhancements in the performance of light- 
emitting species. 

Cross-References 

► Photoactivated Localization Microscopy (PALM) 

► Single Fluorophores Photobleaching 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Stochastic Optical Reconstruction Microscopy 
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Definition 

Photobleaching describes any process that leads to the 
permanent loss of fluorescence from light-emitting 
fluorescent species, including organic dye molecules, 
semiconductor particles such as quantum dots, and 
fluorescent proteins. As such processes terminate the 
flux of photons emanating from a fluorophore, 
photobleaching serves as a key limiting factor in 
applications relying on light emission from 
a fluorophore. Such applications include medical 
imaging technologies and diagnostics (Diaspro et al. 
2006). 


Basic Characteristics 

Photobleaching is caused by chemical damage of the 
fluorophore species. Such processes are enhanced by 
prolonged illumination and photo-induced damage 
occurring during cycles of photon absorption and emis¬ 
sion. The observed rates of fluorophore photobleaching 
depend strongly on the physical properties of the 
fluorophore species, the illumination intensities used, 
and the environment in which the fluorophore is probed 
(Diaspro et al. 2006). While semiconductor particles are 
particularly resistant to photobleaching (Chan and Nie 
1998), organic fluorophores and fluorescent proteins are 
highly prone to photodamage. When illuminated, such 
species have a probability of photobleaching on the 
order of 10“® (Lakowicz 2006), meaning that a typical 
organic fluorophore or fluorescent protein can only 
emit 10^-10^ photons prior to photobleaching. Such 
considerations have broad implications for the time- 
scales and signal-to-noise ratios of fluorescence 


imaging using such species, and correspondingly prac¬ 
tical limitations in their utility as effective tools in 
research and industry. 

Photobleaching Mechanisms 

For organic fluorophores, the principal photobleaching 
pathway involves reactions with singlet oxygen. 
Whereas fluorescence normally entails cycles of excita¬ 
tion from the singlet ground state into the singlet excited 
state and its return to the ground state via non-radiative 
or radiative relaxation pathways, excursions can occur 
from the singlet excited state to triplet excited states 
(Fig. 1). For organic fluorophores, such processes are 
typically described as resulting from a low-probability 
intersystem crossing event, occurring when an excited 
electron undergoes the inversion of its angular momen¬ 
tum or spin (Fig. 1). As triplet-state fluorophores have 
relatively high energy and long lifetime, they are capa¬ 
ble of reacting with molecular oxygen, generating reac¬ 
tive singlet oxygen, which can subsequently oxidize the 
dye molecule. For example, in the case of cyanine 
fluorophores, a singlet oxygen molecule can react with 
its conjugated polymethine chain, damaging the 
fluorophorogenic center (Levitus and Ranjit 2011). 
Alternatively, organic fluorophores can photobleach 
by the photolysis from higher singlet excited states 
(Fig. 1) (Eggeling et al. 2005). In the case of fluorescent 
proteins, photoactivation of the fluorogenic center and 
the generation of singlet oxygen and/or other reactive 
oxygen species contributes to photobleaching as well as 
side reactions that may lead to downstream phototoxic 
effects (Greenbaum et al. 2000; Remington 2006). 
Semiconductor quantum dots are the most photostable 
of the three types of fluorescent species discussed here, 
being nearly 100 times more resistant to photobleaching 
than organic dyes and fluorescent proteins (Chan and 
Nie 1998). Upon excitation, the photoinduced oxidation 
of the quantum dot core is responsible for its 
photobleaching (van Sark et al. 2001). The precise 
mechanisms of fluorophore photobleaching, and how 
to control photochemical reactions in each class of 
molecules, are active areas of research. 

Strategies to Minimize Photobleaching 

Photoinduced oxidation plays an important role in the 
photobleaching mechanism and typically involves 
a fluorophore’s reaction with molecular oxygen. 
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Single Fluorophore Photobleaching, Fig. 1 Jablonski dia¬ 
gram of a fluorophore, showing multiple pathways for 
photophysical and photochemical processes. Upon excitation, 
a fluorophore absorbs a photon and jumps to the excited singlet 
state (1), from which it can decay to ground state through 
radiative (2) or non-radiative (5) pathways. Alternatively, the 


singlet excited molecule can transit, via intersystem crossing (4), 
to a triplet state, which decays to the ground state (5) after 
quenching by molecular oxygen or organic triplet state 
quenchers. In multiple-photon excitation or in FRET experi¬ 
ments, higher excited states can be reached (6), from which 
photolysis (7) may occur 


Consequently, substantial reductions in the rate of 
photobleaching can be afforded by any means of 
reducing the concentration of molecular oxygen pre¬ 
sent during fluorescence imaging. Most commonly, 
this is achieved by degassing procedures and/or enzy¬ 
matic strategies designed to specifically convert solu¬ 
bilized oxygen into inert or less reactive molecular 
species (Diaspro et al. 2006). 

However, molecular oxygen is also a highly effi¬ 
cient triplet state quencher (TSQ). Consequently, 
removing oxygen prolongs the triplet state lifetime of 
the fluorophore and therefore tends to exacerbate 
blinking. A common strategy to resolve this problem 
is the addition of alternative TSQs or other protective 
chemical species in solution in combination with the 
oxygen scavenging system. Thiol-containing com¬ 
pounds, like P-mercaptoethanol (BME), are widely 
used to attenuate the triplet state, but such methods 
tend to induce blinking of the fluorophores (Levitus 
and Ranjit 2011). Other commonly used and effective 
additives include a water-soluble derivative of vitamin 
E (Trolox), cyclooctatetraene (COT), 4-nitrobenzyl 
alcohol (NBA), n-propyl gallate, and 1,4- 
diazabicyclo[2.2.2]octane (DABCO), which reduce 
photobleaching in a concentration- and dye-dependent 


manner (Levitus and Ranjit 2011). Trolox is thought to 
act through a redox process (Levitus and Ranjit 201 1), 
while the detailed mechanism of other protective 
chemicals remains unclear. Recently, a more mecha¬ 
nistic approach, termed reducing and oxidizing system 
(ROXS), has been developed (Levitus and Ranjit 
201 1). In this system, a fluorophore in the triplet state 
is first converted to an ionic radical and then quickly 
recovered to the ground state by a combination of the 
oxidizing and reducing agents. ROXS has been shown 
to effectively quench the triplet state, thereby reducing 
the probability of fluorophore photobleaching. Alter¬ 
native approaches for controlling fluorophore perfor¬ 
mance entails imaging under conditions where 
molecular oxygen is present to minimize blinking 
while adding reducing chemicals, like cysteamine, 
to eliminate oxygen-free radicals that cause 
photobleaching (Campos et al. 2011). 

Cross-References 

► Single Lluorophore Blinking 

► Single-Molecule Lluorescence Resonance Energy 
Transfer 
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► Electron Cryo-microscopy of Molecular Machines 


Single Particle Tracking 
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Synonyms 

Nanovid microscopy; Single-molecule methods; SPT 


Definition 

Single Particle Tracking is a microscopy technique 
whereby the motion trajectories of individual particles 
or molecules are visualized. 


Basic Characteristics 

Single Particle Tracking (SPT) is a fluorescence 
microscopy technique developed in the 1980s with 
the aim to study the movement of individual particles 
or molecules from a time series of microscope images. 
The particles of interest in SPT have a size in the range 
of 10-1,000 nm. In order to distinguish the particles or 
molecules from the background, they are usually 
labeled with one or more fluorescent molecules. 
A light source illuminates the sample, so that the 
particles emit fluorescence light. An objective lens 
captures the fluorescence, and guides it to a camera 
that records the images. 

The conventional SPT setup is basically 
a fluorescence microscope with a high power light 
source and a fast and sensitive camera (Braeckmans 
et al. 2010). The light source can be a mercury arc 
lamp, but a laser is used more often since it offers high 
power within a narrow spectral region. Contrary to 
a confocal laser scanning microscope, where the laser 
beam is focused to a diffraction limited spot, in SPT the 
laser is defocused to illuminate a larger region in the 
sample. The objective lens typically has a high numer¬ 
ical aperture (NA), so that as much as possible fluores¬ 
cence light emitted by the individual particles is 
collected. Low light levels and rapid particle move¬ 
ment necessitate the use of fast and sensitive (up to 
single photon sensitivity) cameras. An electron multi¬ 
plying charge coupled device (EMCCD) or a scientific 
complementary metal-oxide-semiconductor (SCMOS) 
camera is most often used. 

After recording the movie, the position of every 
particle in each image is determined. Eirst the particle 
spots are separated from the background by image 
processing algorithms, for example, by applying 
a combination of smoothing filters and background 
subtractions. The particles that have a size below the 
microscope resolution (usually around 250 nm for high 
NA objective lenses) will appear in the image as spots 
with a width around that resolution, called the micro¬ 
scope point spread function (PSF), regardless of the 
particle size. This is a fundamental property of any 
optical system. Fortunately, by fitting, for example, 
a Gaussian distribution to the PSF, the particle location 
can be determined with a much better precision than 
the resolution itself (in the order of 10 nm). Once all 
particle positions in each image have been determined, 
they can be connected to form the trajectories. 
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Single Particle Tracking, 
Fig. 1 (a) A fluorescence 
microscope image from 
a movie, showing 200 nm 
diameter fluorescent 
nanospheres diffusing in 
water, (b) The trajectories of 
the single nanospheres that 
contain at least 10 positions, as 
calculated from the movie 
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see Fig. 1. Several algorithms for building these tra¬ 
jectories exist, a simple one is the nearest neighbour 
algorithm that connects a certain particle position to 
a particle position in the previous image if it is the 
closest one. 

The particle trajectories can be further analyzed to 
yield information on the particles. It is common prac¬ 
tice to calculate the mean square displacement (MSD) 
of a trajectory (Saxton and Jacobson 1997). Plotting 
the MSD in function of the corresponding time lags 
reveals information on the type of motion. In case the 
plot is a straight line, the particle is diffusing freely and 
the diffusion coefficient can be extracted from the 
slope of the line (Qian et al. 1991). The diffusion 
coefficient in turn allows to calculate the size of the 
individual particle or the viscosity of the medium, 
according to the Stokes-Einstein relation. An SPT 
experiment therefore allows to derive a size 
distribution that is based upon individual size measure¬ 
ments, which is more detailed than the distributions 
provided by commonly used ensemble averaging 
techniques, such as dynamic light scattering (DLS). 
Besides a linear MSD plot, a quadratically increasing 
behavior can also be found, this indicates the presence 
of motion with a constant velocity. Other deviations 
from linearity in the MSD plot can be associated with 
certain forms of anomalous diffusion. 

SPT has been specifically useful in cell biology, 
where it has been applied extensively to study the 
behavior of lipids and proteins in the plasma mem¬ 
brane of living cells, allowing, for example, to deduce 
information on the membrane organization in lipid 
rafts (Wieser and Schutz 2008) . The technique has 
also been used inside living cells, for example, to study 


the transport of proteins or vesicles along microtubuli, 
or the infection pathways of viruses (Levi and Gratton 
2007). SPT of particles undergoing Brownian motion 
has been of interest in pharmaceutical research to mea¬ 
sure the size of drug complexes in biological fluids, 
and in rheology to assess the properties of complex 
fluids. In colloidal sciences, SPT has been used to 
study phase transitions. 
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Synonyms 

Single Protein Production system in bacteria 

Definition 

The Single Protein Production (SPP) system was ini¬ 
tially developed in Inouye’s laboratory in 2005 and has 
been substantially improved since then. In the SPP 
system, live E. coii cells are converted into 
a bioreactor producing only a single protein of interest 
in a high yield. It works as follows: when MazF, an 
ACA-specific mRNA endoribonuclease or interferase, 
is induced in E. coii cells, almost all cellular mRNAs 
are removed so that the cell growth is completely 
arrested. However, importantly, the MazF-induced 
cells retain full metabolic activity for RNA and protein 
synthesis. Therefore, when the mRNA for a protein of 
interest is engineered to be devoid of ACA sequences 
without altering the amino acid sequence of the protein 
(note that any mRNA can be engineered to be ACA- 
less without changing the original amino acid 
sequences) and the mRNA is induced in the MazF- 
induced cells, the cells start to produce only the protein 
from the ACA-less mRNA in a high yield. 

Since cellular proteins of E. coii are not produced in 
the SPP system, isotope-enriched compounds such as 
a single- or double-labeled amino acid(s) if added in 
the culture medium are incorporated only into the 
target protein. Therefore, this system is the method of 
choice for preparation of isotope-labeled protein for 
NMR study. Notably, the protein labeled by the SPP 
system can be directly applied for NMR study in prin¬ 
ciple without any purification. This is particularly 
important for NMR study of membrane proteins as 
only the membrane protein of interest is labeled with 
isotopes allowing one to use the whole membrane 
fraction for solution NMR or solid-state NMR. 


In addition, one may use whole cells to perform 
whole-cell NMR or In-cell NMR to study the structure 
of a single protein in intact cells by using the SPP 
system. 

The major features of the SPP system are summa¬ 
rized as follows: Since the cells in the SPP system are 
not growing, (a) the culture can be condensed as much 
as 20 to 100-fold (1-1 culture can be condensed to 
50-10 ml) without affecting the protein yield; (b) 
toxic compounds like D 2 O have no negative effects 
on the protein yield; (c) as a result, one can achieve 
signihcant cost saving (more than 99%) with materials 
including D 2 O and/or any other isotope-enriched com¬ 
pounds such as uniformly *^N-enriched amino 
acids; (d) a dual-inducible SPP system is available 
using a MazF mutant protein (either His or Trp residue 
in MazF is replaced with other amino acid residues 
without changing its RNA cleavage specihcity). MazF 
is first induced in the absence of either His or Trp 
(during this period, the target protein cannot be made 
as it contains His and/or Trp). In this fashion, produc¬ 
tion of E. coii cellular protein is completely eliminated 
before expression of the target protein, (e) Using 
amino acid auxotroph strains in the SPP system, 
the protein of interest can be effectively labeled 
with a specific amino acid(s) as well as toxic amino 
acid analogs which do not exert toxicity in the SPP 
system. 

Cross-References 

► Isotope Labeling 

► Membrane Proteins: Structure and Organization 

► NMR Methods for Kinetic Analysis 


Single-Dye Tracing 

► Single-Particle Tracking 


Single-Molecule Biophysics 

► Site-Specific Protein Labeling for Single-Molecule 
Fluorescence Studies 

► Single Molecule Methods 
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Single-Molecule Fluorescence 
Resonance Energy Transfer 

Alexander E. Johnson-Buck', Mario R. Blanco^ and 
Nils G. Walter' 

'Department of Chemistry, The University of 
Michigan, Ann Arbor, MI, USA 
^Department of Cellular and Molecular Biology, The 
University of Michigan, Ann Arbor, MI, USA 

Synonyms 

Fluorescence resonant energy transfer (FRET); Single¬ 
molecule Forster resonance energy transfer; Single¬ 
pair fluorescence resonance energy transfer 

Definition 

Single-molecule fluorescence resonance energy trans¬ 
fer (smFRET) is a technique used to measure nanome¬ 
ter-scale distances between specific sites on an 
individual molecule, usually as a function of time. 

Introduction 

In this article, we review smFRET, a powerful tech¬ 
nique for measuring distances and monitoring dynam¬ 
ics at the molecular scale. In this technique, the 
researcher monitors distances between two or more 
individual fluorescent labels by measuring how effi¬ 
ciently electronic energy is transferred between them, 
a phenomenon known as ► Edrster resonance energy 
transfer (ERET). Its power lies in its ability to detect 
distance changes as small as ^0.3 nm in individual 
molecules (or pairs of molecules) within 
a heterogeneous population (Roy et al. 2008). 
Although it requires a specialized fluorescence micro¬ 
scope, smFRET is employed in many laboratories 
worldwide and has been used to address questions 


about topics ranging from intermolecular interactions 
to macromolecular folding and catalysis. 

Although smFRET has only been technically pos¬ 
sible since the mid-1990s, its theoretical foundations 
were laid much earlier through the pioneering work of 
Theodor Edrster in the 1940s and through others who 
elucidated the distance dependence of ERET and its 
use as a molecular ruler (Stryer 1978). A separate 
branch of inquiry, the first single-molecule measure¬ 
ments of ion channels using the ► patch clamp tech¬ 
nique, already established some of the core aspects of 
analyzing time-lapsed recordings from individual mol¬ 
ecules in the 1970s and early 1980s (Sakmann and 
Neher 2009). By the mid-1990s, improvements in 
fluorescence detection brought about by ► total inter¬ 
nal reflection fluorescence microscopy made it possi¬ 
ble to measure the weak emission of single organic 
fluorophores under ambient conditions and, soon 
thereafter, smERET was realized (Ha et al. 1996). 

We begin with a discussion of the theory of ERET 
as it applies to single-molecule experiments, followed 
by a brief description of a typical smFRET experiment, 
including the necessary equipment and materials. We 
then discuss some of the major insights gained from 
smERET, as well as some of its strengths and limita¬ 
tions. Finally, we conclude by commenting on recent 
and ongoing developments in the field. 

Photophysical Basis of smFRET 

In 1948, Forster developed the theory governing the 
non-radiative transfer of energy from one electroni¬ 
cally excited molecule to another nearby molecule. 
When a fluorescent molecule (fluorophore) is excited 
by a photon of the proper energy, it returns to the 
ground state via one of several possible pathways: It 
can dissipate its energy by emitting a photon, by trans¬ 
ferring energy to other molecules without emitting 
a photon (non-radiatively), or by reacting chemically 
(Fig. 1). In FRET, one form of non-radiative energy 
transfer, the excited “donor” fluorophore transfers its 
excitation energy to a nearby “acceptor” fluorophore 
through an electronic dipole-dipole interaction. The 
excited acceptor may then return to its ground state 
via one of various pathways, including by emitting 
a photon (fluorescence). 

Eor ERET to occur efficiently, the fluorescence 
emission spectrum of the donor must overlap 
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Single-Molecule Fluorescence Resonance Energy 
Transfer, Fig. 1 Simplified electronic diagram of FRET. 
Solid lines represent rapid electronic transitions, and dashed 
lines represent slow transitions. A FRET donor can be excited 
from its ground state (G) to an excited state (E) by a photon of 
energy hv {blue). The excited donor can then return to the ground 
state, emitting a photon of lower energy (i.e., it can fluoresce, 
green)', enter a nonfluorescent triplet state (T); permanently 
photobleach (B); or donate its energy to a nearby acceptor 
molecule by FRET. The excited acceptor can then similarly 
emit a photon (red), enter a triplet state, or photohleach 



Wavelength 


Single-Molecule Fluorescence Resonance Energy 
Transfer, Fig. 2 Desirable spectral properties for a FRET 
donor-acceptor pair. For specific fluorescent excitation and 
detection, the absorption spectra (solid curves) of the donor 
and acceptor should overlap little, as should their emission 
spectra (dashed curves). Flowever, for efficient FRET to occur, 
the emission spectrum of the donor should overlap considerably 
with the absorption spectrum of the acceptor 

considerably with the absorption spectrum of the 
acceptor, i.e., the fluorophores must be in resonance 
(Fig. 2); the two molecules must be within a certain 
distance of one another (typically <10 nm); and their 
transition dipole moments must be in (partial) align¬ 
ment. At the same time, their absorption spectra should 
be separate enough that the donor can be excited with 
high specificity, and their emission spectra sufficiently 


separate to ensure specihc detection of both donor and 
acceptor. 

Since FRET is a dipole-dipole interaction, its effi¬ 
ciency E depends on the sixth power of the separation r 
between the donor and acceptor as: 

£ = r-V(r-®+V) (1) 

where the Forster radius Rq, corresponding to the sep¬ 
aration at which energy transfer is 50% efficient, is: 

Ro = (jK^QQn-''y'^ X 9.7 x lO^A (2) 

The value E, which can be calculated from the 
relative fluorescence of the donor and acceptor (see 
Data Processing and Analysis, below) thus reports on 
the distance between the two fluorophores. The value 
of Rq depends on the overlap integral J between the 
donor emission and acceptor excitation spectra, the so- 
called orientation factor k describing the relative ori¬ 
entations of the donor and acceptor, the quantum yield 
Qo of donor fluorescence in the absence of acceptor, 
and the refractive index n of the medium in which the 
interaction takes place. If the fluorophores are freely 
rotating at a rate faster than the excited state lifetime of 
the donor, = 2/3, and Rq is constant for a given 
donor-acceptor pair in aqueous solution. This can be 
conhrmed for a particular system by measuring the 
fluorescence anisotropy of both fluorophores. 

Experimental Design and Data Acquisition 

In order to use smFRET to measure conformational 
changes or interactions between molecules, the mole- 
cule(s) of interest are labeled, usually at speciflc sites, 
with donor and acceptor fluorophores that report on 
a distance of interest. The ideal fluorophore is stable 
under high photon flux, has high molar absorptivity 
and fluorescence quantum yield, and undergoes mini¬ 
mal “►blinking” (spontaneous excursions into 
nonfluorescent states). The fluorophores used are gen¬ 
erally small (<1 nm) organic molecules, the two most 
common being the cyanine dyes Cy3 and Cy5, though 
the Alexa Eluor and ATTO series of dyes appear com¬ 
parable. Quantum dots and fluorescent proteins can 
also be used in smERET, but their use has been more 
difficult to implement because they are larger and, 
in the case of fluorescent proteins, less photostable 
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Single-Molecule Fluorescence Resonance Energy 
Transfer, Fig. 3 Distance dependence of FRET. Efficiency of 
energy transfer (E) is a steep function of distance at values 
near So 

(Roy et al. 2008). Positions of the fluorophores should 
ideally be chosen such that the distance r between them 
is close to the Forster radius since that is where FRET 
efficiency is most sensitive to changes in r (Fig. 3). 

All organic fluorophores eventually photobleach, per¬ 
manently losing their fluorescence properties 
through reaction with molecular oxygen (Figs. 1 and 4). 
This is useful because the instantaneous loss of fluores¬ 
cence signal upon bleaching is evidence that the fluores¬ 
cence originates from a single molecule. However, 
because photobleaching also reduces the time window 
of observation, it is often desirable to delay it as much as 
possible. So-called oxygen scavenging systems reduce 
the concentration of oxygen in solution by catalyzing its 
reaction with substrates other than the FRET donor and 
acceptor. The most common systems use either: (1) the 
enzymes glucose oxidase and catalase in combination 
with the substrate glucose, or (2) the enzyme 
protocatechuate dehydrogenase and its substrate, 
protocatechuic acid. A related problem is blinking, 
a term used for temporary nonfluorescence caused by 
excursions to kinetically trapped triplet states (Eig. 1). 
When oxygen, a good triplet state quencher, is removed 
from solution using a scavenger system, other additives 
(Trolox, P-mercaptoethanol) are often employed to dra¬ 
matically reduce blinking. 

Observation of the weak fluorescence signal from 
single molecules requires: (1) high-powered illumina¬ 
tion, (2) means of reducing or rejecting background 
fluorescence that would otherwise greatly diminish the 


signal-to-noise ratio, and (3) sensitive detection. To 
meet requirement (1), the illumination is almost 
always provided by the high-powered, monochromatic 
light of lasers. Requirement (2), the reduction of back¬ 
ground fluorescence, is usually achieved by exciting 
only a small volume of the sample by means of one of 
the following illumination schemes: ► total internal 
reflection fluorescence (TIRE), confocal, highly 
inclined, and laminated optical sheet (HILO), near¬ 
field scanning optical (NSOM), or zero-mode wave¬ 
guides (Walter et al. 2008). Most common of these 
illumination schemes is total internal reflection fluo¬ 
rescence (TIRE), which reduces background fluores¬ 
cence by illuminating only that part of the sample that 
is within ^100 nm of the surface of the microscope 
slide or coverslip. This requires immobilizing the mol¬ 
ecules of interest at the illuminated interface, which is 
usually achieved using specific, high-affinity binding 
such as the streptavidin-biotin interaction. To resolve 
single molecules, they must be immobilized from 
a very dilute (^100 pM) solution, resulting in 
a surface density no larger than about 0.2 pm“ . Sur¬ 
face immobilization has the added benefit of allowing 
one to observe the same molecule over several sec¬ 
onds, minutes, or even hours. Einally, requirement (3), 
sensitive detection, is usually provided by an electron 
multiplying charge-coupled device (EMCCD) camera 
in the case of wide-field illumination, such as TIRE, or 
avalanche photodiodes in the case of point-detection 
schemes like confocal microscopy. 

An smERET experiment also requires optics for 
filtering out stray excitation light and directing fluores¬ 
cence from the donor and acceptor into separate detec¬ 
tion channels. Scattered excitation light is removed 
from the detection path using high-optical density fil¬ 
ters that only transmit certain frequency bands. Sepa¬ 
ration of donor and acceptor signals is accomplished 
using dichromatic (dichroic) mirrors, which reflect 
specific frequencies of light and transmit others. Addi¬ 
tional mirrors are then used to direct the emission 
signal onto the detector(s) so that the donor and accep¬ 
tor emission can be measured simultaneously. 

Data Processing and Analysis 

Processing of smERET data from raw camera movies 
involves: (1) locating and matching corresponding 
donor and acceptor signals in the field of view. 
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Single-Molecule Fluorescence Resonance Energy 
Transfer, Fig. 4 Simulated typical FRET time trace of a single 
molecule. The fluorescence intensity counts of the donor and 
acceptor change in a discrete, anticorrelated fashion, reporting 
on underlying molecular distance changes that are detected as 
transitions between high and low FRET efficiency (E^pp). Upon 
photobleaching, the donor and acceptor intensity counts instan¬ 
taneously fall to ~0, evidence that the signal originated from 
a single FRET pair 

(2) matching each donor signal with its acceptor signal 
(channel registration), and (3) determining the FRET 
efficiency as a function of time for each molecule 
throughout the movie (Fig. 4). Since the donor and 
acceptor channels of the CCD image do not typically 
have a linear correspondence, they are generally 
mapped using a higher-order polynomial transforma¬ 
tion to ensure correct assignment of donor-acceptor 
pairs. Usually so-called fiduciary markers (such as 
fluorescent beads) that are visible in both channels 
are used to establish this mapping. 

The apparent FRET efficiency Eppp can be calcu¬ 
lated as: 


Ia 

X y 


(3) 


gives an estimate as to the relative distances between 
donor and acceptor in different FRET states (Roy et al. 
2008). 

Analysis of the Eppp versus time data varies greatly 
depending on the behavior of the sample. The typically 
low signal-to-noise ratios of single-molecule detection 
often make it challenging to determine the number and 
values of different FRET states in a given molecule. To 
facilitate the assignment of FRET states, ► hidden 
Markov modeling (FIMM) or nonlinear filters may be 
applied. Histograms of FRET efficiency are usually 
generated from the time traces of hundreds of mole¬ 
cules under the same conditions and can often be fit 
with multiple Gaussian distributions to estimate the 
number of states and their values of Eppp (Fig. 5), 
thus providing information about the equilibrium prop¬ 
erties of the system (Roy et al. 2008). 

If the molecules show transitions between discrete 
Eapp values over time (i.e., dynamics), either HMM or 
analysis of the time spent in each state (dwell times) 
can be used to estimate the underlying kinetics of the 
system (Fig. 5). Genuine FRET transitions are charac¬ 
terized by inversely proportional changes in acceptor 
and donor signal intensity, or anticorrelation (Figs. 4 
and 5), a property that can be used to filter out spurious 
transitions due to other photophysical phenomena such 
as changes in the local environment of one of the 
fluorophores. Transition density plots, which simulta¬ 
neously display the probability of all possible transi¬ 
tions between different FRET states, are often 
compiled based on HMM to provide a global view of 
the observed dynamics. If certain transitions have slow 
kinetics or appear only infrequently in time traces, 
evidence of them may be suppressed by faster (more 
frequent) transitions; in this case, the probability that 
a transition occurs at least once within a given mole¬ 
cule may instead be plotted (Blanco and Walter 2010). 


where Ip, and Ij^ are the total number of photons emitted 
per movie frame by the donor and acceptor, respec¬ 
tively. The parameter y, which depends on the relative 
quantum yields and detection efficiencies of the donor 


and acceptor, can be calculated as y = 


, where A /_4 


and A/£) are the changes in intensity of the acceptor and 
donor intensity upon photobleaching of the acceptor. If 
the Forster radius is known and if anisotropy of both 
fluorophores can be ruled out (see Strengths and limi¬ 
tations, below), Eapp can be related to absolute dis¬ 
tances according to equation (1). If not, E^pp still 


Insights, Strengths, and Limitations 

Single-molecule FRET has several strengths. First, the 
steep distance dependence of E^pp near Rq, and speci¬ 
ficity of the energy transfer between donor and accep¬ 
tor make it possible to monitor very specific 
interactions and events. Second, the ratiometric prop¬ 
erty of smFRET renders it one of the most sensitive 
and robust single-molecule techniques. Third, when 
using appropriate oxygen scavengers, blinking 
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the apparent FRET efficiency E^pp is calculated in each frame and plotted as 
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suppressants, and acquisition hardware, smFRET can 
be used to monitor molecular events occurring on 
timescales ranging from ^1 ms to hours. Because of 
these properties, smFRET is particularly well-suited to 
monitoring conformational changes in single 
immobilized molecules over long periods of time, but 
has been used to study phenomena ranging from the 
dynamics of motor proteins and RNA enzymes to 
structural transitions in DNA nanomachines and tran¬ 
sient intermolecular interactions. 

Perhaps the greatest advantage of smFRET, as with 

► single-molecule spectroscopy in general, lies in its 
ability to discern heterogeneous behavior within 
a population of molecules at equilibrium without the 
need for rapid mixing or other synchronization. In 
studies of conformational dynamics in protein and 
RNA enzymes, smFRET has repeatedly revealed het¬ 
erogeneous behavior; that is, the kinetics of conforma¬ 
tional transitions varies over time or from molecule-to- 
molecule. Some molecules even appear to have multi¬ 
ple native states with the same activity but subtly 
different structures that do not interconvert. This 
stands in contrast with the classical view of macromol¬ 
ecules as having a unique native state with a single, 
well-defined structure and behavior (Hwang et al. 

2009) . 

There are also several limitations to consider with 
smFRET. First, and perhaps most fundamentally, 
this technique generally reports on only a single 
dimension of interest, though strides have been made 
toward extending it to two or more dimensions (see 

► Recent Developments and Extensions of smFRET, 
below). Due to this fact, data from smFRET experi¬ 
ments must be carefully interpreted in light of all 
available structural and functional information about 
the system, such as that obtained by X-ray crystallog¬ 
raphy, mutational studies, and structural footprinting. 
Second, the millisecond time resolution is too slow to 
detect some important molecular dynamics. Third, 
only ^10^ photons can be collected from even very 
stable organic fluorophores before photobleaching 
occurs so that a finite number of observations can be 
made on a given molecule. Furthermore, even with the 
addition of anti-blinking agents such as Trolox, the 
acceptor may occasionally enter a dark state, giving 
the appearance of an excursion to a state with E^pp = 0; 
such dark states should be excluded from analysis or 
their influence quantified using controls (Benitez et al. 

2010 ) . 


Another concern is that rotational constraint of the 
donor or acceptor fluorophore can interfere with accu¬ 
rate distance measurements; for instance, common 
organic fluorophores have been shown to stack at the 
ends of nucleic acid duplexes, influencing the apparent 
FRET efficiency. To reduce the likelihood of such 
interactions, a short flexible organic linker (generally 
an alkyl moiety) can usually be added between the 
molecule of interest and each fluorophore. Still, if 
accurate absolute distance information is required, 
fluorescence anisotropy measurements are needed to 
ensure that the anisotropy values of the donor and 
acceptor are acceptably low (generally <0.2). If only 
relative distance information is needed, higher anisot¬ 
ropy values are tolerable since Eapp is still generally 
a monotonic function of distance (Roy et al. 2008). 
Finally, it is important to verify that the surface immo¬ 
bilization, if used, does not perturb the behavior of 
interest; this can be done by comparing results with 
those of ensemble FRET assays in solution, conducting 
single-molecule activity assays, or comparing results 
using different immobilization strategies. 

Recent Developments and Extensions of 
smFRET 

As the complexity and heterogeneity of molecular 
dynamics becomes more apparent, efforts are in pro¬ 
gress to extend the capabilities of smFRET and com¬ 
bine it with other techniques for a more comprehensive 
real-time picture of molecular events. For instance, 
three-color smFRET has been developed to simulta¬ 
neously monitor the distances between a single donor 
and two distinct acceptor fluorophores (Roy et al. 
2008). A challenge in extending such approaches to 
measure a larger number of distances is finding multi¬ 
ple acceptors whose emission spectra are sufficiently 
distinct to resolve their signals. To partially bypass this 
issue, switchable smFRET was developed to monitor 
energy transfer from one donor to multiple acceptors 
by photochemically switching each acceptor on and off 
in succession (Uphoff et al. 2010). 

Recently, smFRET has also been used in conjunc¬ 
tion with other single-molecule techniques, as pro¬ 
posed by Shimon Weiss (Weiss 1999). For example, 
a combination of smFRET and electrical recording was 
used to monitor dimerization of single ion channels, 
thus simultaneously providing structural and 
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functional information. Optical or magnetic trapping 
has been used to manipulate single molecules while 
monitoring their dynamics by smFRET (Hwang et al. 
2009). 

Finally, efforts are in progress to perform smFRET 
measurements in vivo to study the behavior of mole¬ 
cules in their native environment. Such efforts are 
complicated by the significant background 
autofluorescence within the cell, as well as the limited 
ability to control the photophysics of fluorophores 
in vivo, as is done in vitro through oxygen scavenging 
and additives. Nevertheless, smFRET has been 
employed to study a small number of intracellular 
systems, including protein-protein interactions at the 
cell membrane, where TIRF illumination can signifi¬ 
cantly reduce background fluorescence. 

Summary 

Single-molecule FRET is a powerful technique with 
the unique ability to monitor dynamic processes in 
single molecules over distances of <10 nm and time- 
scales of milliseconds to hours. It exploits the steep 
distance dependence (^^r®) of Forster energy transfer 
and high sensitivity of ratiometric fluorescence detec¬ 
tion to measure the distance between specific molecu¬ 
lar sites over time in single molecules. Although it 
requires some specialized equipment, smFRET is 
now widely used and has revealed kinetics, mechanis¬ 
tic details, and heterogeneous behaviors that are 
masked in traditional assays by ensemble averaging. 

Cross-References 

► CNS (Crystallography and NMR System) 

► Fluorescence and FRET in Membranes 

► Fluorescence Labeling of Nucleic Acids 

► Helicases 

► Hidden Markov Modeling in Single-Molecule 
Biophysics 

► Magnetic Tweezers 

► Optical Tweezers 

► Patch-Clamp Recording of Single Channel Activity: 
Acquisition and Analysis 

► Protein Fluorescent Dye Labeling 

► Single Fluorophore Blinking 

► Single Fluorophores Photobleaching 


► Single-Molecule Methods 

► Single-Molecule Spectroscopy 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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Synonyms 

Two-color FIONA 


Definition 

Single molecule high resolution colocalization 
(SHREC) is a technique for fluorescence microscopy 
that allows precise distances to be measured well 
below the diffraction limit of light microscopy. 

Basic Characteristics 

Microscopy is often used to investigate biological 
systems such as cells and macromolecular complexes, 
and distances measured between two components 
are frequently used as a proxy for their functional 
connection. Over the past decade, the spatial resolution 
of isolated fluorescent probes has improved immensely 
(Yildiz et al. 2003; Betzig et al. 2006; Mortensen et al. 
2010), requiring that colocalization techniques of two 
fluorescent probes achieve high resolution. 

Historically, Forster resonance energy transfer 
(FRET) has been used to infer proximity below 
^10 nm in biological samples. At the other end of 
the spectrum, diffraction-limited light microscopy 
has allowed distance measurements above ^250 nm. 
SHREC measures distances between these extremes 
without requiring special equipment (Churchman 
et al. 2005; Churchman and Spudich 2007). Of the 
colocalization techniques (Lacoste et al. 2000; 
Michalet et al. 2001; Koyama-Honda et al. 2005; 
Pertsinidis et al. 2010), SHREC has the highest 
reported colocalization resolution and is among the 
easiest to apply. Of particular note, one exception is 
a recent strategy that allows distance resolutions 


of 0.5 nm, but did require the use of a closed-loop 
feedback control system (Pertsinidis et al. 2010). 

Theory 

A biological sample is typically differentially labeled 
with probes of distinct emission frequencies. For dual¬ 
color far-field microscopy, an image produced in the 
sample plane is split into two images by wavelength 
that are recorded separately. The different channels 
will not directly correspond to each other because of 
their separate imaging paths. Aberrations arise due to 
the different lenses and mirrors used in the two optical 
paths and due to chromatic effects. In theory, one 
channel’s image would simply need to be transformed 
with respect to the other in a linear conformal manner 
including only a rotation, a scaling, and a translation. 
However, the imperfect nature of the optical elements 
induces aberrations that need to be mapped in a local 
nonlinear manner if colocalization resolution in the 
nanometer range is desired (Reference [Pertsinidis 
et al. 2010] explores the types of aberrations that 
occur). The local weighted mean method described 
by Goshtasby has been used successfully in the imple¬ 
mentation of SHREC (Goshtasby 1988). While this 
may not be the only mapping method that would 
produce such high resolution results, an advantage 
is that the software program Matlab has built the 
algorithm into functions that are found within the 
image processing toolbox. 

SHREC requires that a mapping is calculated from 
a large dataset of control points, which consists of pairs 
of positions that correspond to one another. The map¬ 
ping is then applied to subsequent images of the 
biological sample of interest. In practice, the mapping 
is determined from images of a fiducial marker; an 
object that, when put in the sample plane, can be seen 
in each of the imaging channels. Fluorescent beads are 
an excellent choice because they are bright and 
relatively photostable. Additionally, if the fluorescent 
bead is diffraction limited, then the image it produces 
in each imaging channel is the point spread function 
(PSF) of the microscope and the position of the bead 
can be found to high precision by a fit with a two- 
dimensional Gaussian function (Mortensen et al. 
2010). By imaging the fiducial bead over the 
entire field of view, a group of control points can be 
determined and a mapping can be calculated. 

An important point to note is that there is an error 
associated in the measurement of the control points 
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(Mortensen et al. 2010). A successful mapping can be 
performed only when the control point error is smaller 
than that sought for colocalization error. Many 
fluorescent heads are bright enough to he localized 
with high resolution. It is necessary, however, to verify 
that is the case in each channel. 


Cross-References 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Labeling of Nucleic Acids 

► Protein Fluorescent Dye Labeling 

► Single Particle Tracking 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Single-Molecule High-Resolution Imaging with 
Photobleaching (SHRImP) 
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Synonyms 

NALMS (nanometer-localized multiple single-mole¬ 
cule fluorescence microscopy) 

Definition 

Single-molecule high-resolution imaging with 
photobleaching (SHRImP) is a single-molecule tech¬ 
nique that uses the quantal photobleaching of two 
fluorophores to measure distances between 10 and 
100 nm with nanometer precision. 

Basic Characteristics 

Before the development of single-molecule imaging 
techniques, a gap existed in the ability of fluorescence 
microscopy to measure distances in biological sys¬ 
tems. Distances smaller than 10 nm could be probed 
with fluorescence resonant energy transfer (FRET) and 
distances larger than «250 nm could be resolved using 
conventional microscopy. The SHRImP technique and 
its cousin, NALMS, bridge this gap by circumventing 
the Rayleigh criterion. In conventional light micros¬ 
copy, the Rayleigh criterion limits the minimum 
resolvable level of detail in an image. According to 
diffraction theory, a point source of light - no matter 
how small the “point source” is - has a finite width of 
A,/(2 * numerical aperture) or approximately 250 nm. 
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Single-Molecule High- 
Resolution Imaging with 
Photobleaching (SHRImP), 
Fig. 1 Sequential 
photobleaching of a doubly 
labeled DNA oligomer 
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Single-Molecule High- 
Resolution Imaging with 
Photobleaching (SHRImP), 
Fig. 2 Application of the 
SHRImP algorithm 




The Rayleigh criterion says that two point sources of 
light - two fluorophores, for example - are resolvable 
when the maximum of one source coincides with the 
first diffraction minimum of the other. For a 1.45 NA 
visible-light objective, this means that the smallest 
resolvable feature in an image is separated by approx¬ 
imately 250 nm. The SFIRImP technique combines 
single-molecule localization with photobleaching to 
resolve distances smaller than this. 

To measure distances using SFIRImP, two identical 
fluorophores are attached to the structure of interest 
and excited by a light source. An EMCCD camera or 
other low-noise imaging device is used to record the 
emitted fluorescence over time. After prolonged exci¬ 
tation, the fluorophores will photobleach, that is, cease 
to emit light (Fig. 1). This process is the result of 
irreversible chemical damage or covalent modification 
to the fluorophores. SHRImP and NALMS use the 
sequential, stochastic photobleaching of the two or 
more fluorophores used as probes to localize each 
fluorophore individually. After the first fluorophore 


photobleaches, the FIONA (fluorescence imaging 
with one nanometer accuracy) technique can be used 
to locate the center of the point spread function of the 
single remaining fluorophore. This is accomplished by 
collecting enough photons from a single dye so that, 
when fit with a Gaussian function, the mean of the 
emission peak can be located very precisely. (See 
► Fluorescence Imaging with one-nanometer accuracy 
(FIONA) section for more details.) By subtracting the 
image of the final remaining fluorophore from images 
containing the fluorescence of the two dyes 
(before either photobleached), the point spread func¬ 
tion of the first photobleached dye can be 
reconstructed. FIONA can then be applied to localize 
this dye (Fig. 2). From the positions of the two 
SHRImP fluorophores, the dyes’ separation can be 
computed. 

The SHRImP technique, however, does require that 
the fluorophores be stationary over the time course of 
the entire measurement. They must be immobilized to 
obtain reliable distance measurements since the two 
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fluorophores are localized at different times. Another 
technique, called single-molecule high-resolution 
colocalization (SHREC) (Churchman et al. 2005; 
Toprak et al. 2010), overcomes this problem by using 
two fluorophores or quantum dots that emit at spec¬ 
trally different wavelengths. Each emission is col¬ 
lected simultaneously, and therefore distance changes 
that are shorter than the photobleaching time (but 
slower than the integration time) can be measured. 
However, chromatic aberration must be carefully 
accounted for with these techniques. Another caveat 
of SHRImP (and SHREC) has to do with collecting 
high quality data. Although the application of SHRImP 
(and SHREC) only requires that the fluorophores be 
easily resolved as bright puncta against a uniform 
background, maximum precision is obtained by max¬ 
imizing the signal-to-noise ratio of the fluorophores 
over the background. In practice, this optimization 
often requires imaging bright, photostable dyes, with 
total internal reflection microscopy (TIRFM). 

The application of SHRImP allows greater preci¬ 
sion than could be obtained by fitting the emission of 
two fluorophores with a sum of Gaussian functions. 
For separations between 10 and 100 nm, the overlap 
between the two point spread functions of single 
fluorophores is significant. A fit with a sum of Gauss- 
ians would involve six fitting parameters. An attempt 
to simultaneously fit this many parameters to a single 
fluorescent spot would yield a poor fit. However, the 
point spread function of a single dye can be fit using the 
FIONA technique with nanometer precision. The pre¬ 
cision that can be obtained by SHRImP is limited by 
the same factors that limit the precision with which 
a single spot can be localized (photon shot noise, 
background noise, and finite pixel size of the detector) 
but is also affected by experimental conditions that 
may change over time such as drift of the microscope 
stage, or sample and temporal fluctuations in the emit¬ 
ted fluorescence of the dyes. The precision that can be 
obtained with SHRImP depends on the choice of 
fluorophores, but for bright organic dyes such as Cy3 
with a lifetime of 10° photons, 8-10 nm precision can 
be obtained. 

SHRImP has been applied to a variety of systems 
using several fluorescent labelling techniques. It has 
been used to measure the distance between the labeled 
ends of DNA oligomers labelled with Cy3 dyes 
(Gordon et al. 2004; Qu et al. 2004). It has also been 
applied to measure the inter-head separation of myosin 


VI dimers tagged with eGFP (Balci et al. 2005), and 
the clustering of Cy5 labelled viral receptors in cell 
membranes (Bakker et al. 2007). The use of photo- 
bleaching to resolve and localize many individual 
fluorescent probes has been extended beyond the 
SHRImP with super-resolution techniques such as 
PALM and STORM, which only localize a relatively 
small number of fluorophores at one time and form 
a composite image of many sequentially localized 
fluorophores with sub-diffraction limit detail (Toprak 
et al. 2010). 


Cross-References 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Photoactivated Localization Microscopy (PALM) 

► Single-Molecule High-Resolution Colocalization 
(SHREC) 

► Stochastic Optical Reconstruction Microscopy 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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► Anti-Brownian Traps 
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Synonyms 

RNA; mRNA FISH; smFISH; Single-molecule FISH 

Definition 

Single-molecule RNA FISH is a method for detecting 
individual RNA molecules within cells via fluores¬ 
cence microscopy. 

Introduction 

Fluorescent in situ hybridization targeting ribonucleic 
acid molecules (RNA FISH) is a methodology for 
detecting and localizing particular RNA molecules in 


fixed cells. This detection utilizes nucleic acid probes 
that are complementary to target RNA sequences 
within the cell. These probes then hybridize to their 
targets via standard Watson-Crick base pairing, after 
which one may detect them via fluorescence micros¬ 
copy, either through direct conjugation of fluorescent 
molecules to the probe or through fluorescent signal 
amplification schemes. Recent advances in RNA FISH 
have increased the specificity and sensitivity of the 
method to enable the detection of individual RNA 
molecules, providing very accurate measurements of 
RNA abundance and localization at the single cell or 
even subcellular level. While most applications thus 
far have been in fixed cells, advances in probe technol¬ 
ogy have led to the ability to detect single RNA mol¬ 
ecules in living cells. 

Development of RNA In Situ Hybridization 

The protocols for in situ hybridization (ISH) targeting 
RNA molecules are well established and conceptually 
simple. All RNA ISH protocols essentially involve 
bathing the sample in a high concentration of nucleic 
acid probe (or probes) that are complementary in some 
way to the target RNA species, driving hybridization 
of the probe to the target, a principle derived from the 
development of DNA ISH (Gall and Pardue 1969). 
After hybridization, one washes away excess unbound 
probe, in principle leaving only those probes specifi¬ 
cally bound to the target molecule. Differences 
between the variants of RNA ISH typically pertain to 
the type of nucleic acid used for the probe and the type 
of labeling scheme used to detect the probe via micros¬ 
copy. Initially, researchers used radiolabeled cDNA 
probes complementary to the appropriate target (Har¬ 
rison et al. 1973). Issues with radiolabeling include the 
low spatial resolution and difficulties associated with 
the handling and stability of radioactive materials. 
Thus, the development of fluorescence-based (FISH) 
approaches using DNA or RNA probes provided 
a major step forward in the field, first applied to DNA 
FISH (Bauman et al. 1980) and then RNA FISH 
(Singer and Ward 1982). Rapidly, researchers adopted 
the approach of generating cDNA or RNA probes (via 
enzymatic amplification or nick translation) containing 
modified bases (Langer et al. 1981) that allowed the 
conjugation of various haptens or even fluorophores, 
thus facilitating either indirect or direct detection via 
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Single-Molecule RNA FISH, 
Fig. 1 Image of two different 
species of mRNA detected by 
single-molecule RNA FISH 
(Raj et al. 2008; image 
courtesy of Virzhiniya Lekova 
and Arjun Raj). Green spots 
correspond to Suzl2 mRNA 
and red spots correspond to 
JAZFl mRNA (nuclei stained 
blue by DAPI) 



fluorescence microscopy. These probes did present 
some challenges, however, because it was hard to 
ensure a consistent and high degree of labeling from 
experiment to experiment. Moreover, the sensitivity 
of that style of fluorescent probes is generally poorer 
than that of radiolabeled probes due to cellular 
autofluorescence. This is problematic because many 
important mRNAs (such as those encoding transcrip¬ 
tion factors) are often present at very low abundances, 
often on the order of a few molecules per cell or less. 
Subsequent developments in RNA FISH methodology, 
however, have resolved many of these problems by 
refining RNA FISH to the point where it can 
detect single RNA molecules, enabling the direct 
quantification of RNA species of very low abundance 
and providing absolute measurements of RNA copy 
number. 

Methods for Single-Molecule RNA FISH 

Broadly, current methods for single-molecule RNA 
FISH fall into two categories: those that use some 
form of signal amplification and those that rely on 
direct detection of signal. Direct detection involves 
labeling the probes themselves with fluorophores. In 
order to achieve single-molecule sensitivity, the 
probes must have enough fluorescence to be detectable 
above background autofluorescence. One technique is 
to use a set of short single-stranded DNA 


oligonucleotides complementary to various regions of 
the target RNA, each labeled with one or more fluo¬ 
rescent moieties (Femino et al. 1998; Raj et al. 2008). 
The binding of multiple probes localizes enough 
fluorophores to the target RNA such that the RNA is 
easily visible as a fluorescent spot via fluorescence 
microscopy (Fig. 1). The benefit of using several oli¬ 
gonucleotide probes at the same time is that the off- 
target binding of a single oligonucleotide in the probe 
pool will either be undetectable or readily distinguish¬ 
able from the much brighter spots corresponding to the 
true RNA, thus reducing the chances of false positives. 
False negatives are similarly unlikely, for even if 
a single probe out of the pool fails to bind, the rest 
are likely to bind. Recent research indicates that the 
use of nucleic acid chemistries with tighter and more 
specific binding properties such as linked nucleic acids 
(LNA) may be able to produce similar results using 
a single probe (Taniguchi et al. 2010), although the 
detection of just one or few fluorescent molecules may 
only be feasible in cases like bacteria where cellular 
autofluorescence and other sources of background are 
minimal (as opposed to probe pools, which work in 
a variety of contexts (Raj et al. 2008)). In general, the 
signals produced via these methods are low and require 
the use of sensitive CCD cameras and high NA optics 
for detection. 

In order to circumvent the limitation of low signals 
from the relatively small numbers of fluorescent mol¬ 
ecules targeted to mRNA in these direct detection 
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methods, researchers have also developed a large vari¬ 
ety of schemes to amplify signals from individual 
molecules. Some of these are relatively simple exten¬ 
sions of the direct detection methods, such as detection 
of the probe by fluorescently labeled antibodies 
targeting specific haptens incorporated in large num¬ 
bers into an RNA probe (Pare et al. 2009). Others 
involve targeting nucleic acid probes with a single or 
few haptens with antibodies conjugated to enzymes; 
those enzymes in turn will act upon a substrate in such 
a manner as to create a fluorescent product that will 
become covalently linked to surrounding molecules 
(Kerstens et al. 1995). Yet other methods amplify 
signals by using DNA polymerase and circular tem¬ 
plates to locally create long, repetitive single-stranded 
DNA tracts in situ, which one then targets with short 
oligonucleotide probes (Larsson et al. 2010). Such 
methods have the advantage of labeling targets to 
such a degree that the signals are easily visible even 
by eye, precluding the need for expensive optical 
equipment. Also, they are able to reliably detect short 
RNA molecules such as miRNA (Lu and Tsourkas 
2009), and researchers have even demonstrated the 
ability to detect single-base differences in RNA mole¬ 
cules (Larsson et al. 2010). However, such methods are 
somewhat prone to lower detection efficiencies owing 
to the large number of steps in such protocols, each of 
which has some probability of failure. Some reports 
indicate that such issues are relatively minor (Lu and 
Tsourkas 2009), while other amplification methods 
detect only a small fraction of the target mRNA 
(Larsson et al. 2010). Another point of comparison 
between direct detection and amplihcation is the abil¬ 
ity to detect different species at the same time by using 
spectrally distinguishable fluorescent moieties: Many 
amplification methods are limited to a single “color” 
owing to the necessity of using a single enzyme/ 
substrate pair, whereas direct detection can utilize the 
plethora of organic dyes with different spectral prop¬ 
erties currently available. The adoption of these vari¬ 
ous methods for performing FISH in fixed cells is 
likely to depend on the particular demands of the 
application at hand. 

FISH in Living Cells 

Traditionally, researchers have considered FISH to be 
a methodology that only applies to fixed cells as the use 


of oligonucleotides in living cells has proven to be very 
challenging for a host of reasons. Chief among these 
are the fact that cells quickly sequester short single- 
stranded DNA oligonucleotides in the nucleus for 
rapid degradation, and the inability to wash away 
unbound probe, leading to high background. A recent 
study has circumvented these issues by designing 
a degradation-resistant variant of a “molecular bea¬ 
con,” which is a short oligonucleotide probe with 
a fluorophore on one end and a quencher on the other 
(Tyagi and Kramer 1996). The probe is designed in 
such a way that the fluorophore and the quencher are in 
close proximity when the probe is not bound to the 
target but are far apart when it is bound to the target; 
thus, the probe is only fluorescent when bound to the 
target. This reduces the background to the point where 
one can detect individual mRNA molecules in living 
cells (Vargas et al. 2005). Further refinements of these 
methods may open the door to new applications of 
these live-cell FISH techniques. 

Applications 

RNA FISH has a large number of biological applica¬ 
tions but is particularly useful as an assay for spatial 
aspects of gene expression. One scenario is one in 
which there is significant cellular heterogeneity, pre¬ 
cluding more traditional bulk biochemical assays such 
as qRT-PCR or northern blots. For instance, it is com¬ 
monly used to detect gene expression in the study of 
developmental biology, where it allows for the detec¬ 
tion of gene expression in particular sets of cells within 
the context of a whole developing organism. It also has 
applications in fully developed tissue that contains 
a variety of cellular types that would otherwise be 
averaged together. It is also useful in the study of 
subcellular localization of RNAs. Examples include 
the asymmetries in mRNA distributions in the devel¬ 
oping Drosophila melanogaster syncytium (Fowlkes 
et al. 2008), the localization of ASHl mRNA to the 
daughter cell during yeast cell division (Long et al. 
1995), and the localization of beta-actin mRNA to the 
leading edge of cells during cellular migration 
(Lawrence and Singer 1986). The development of sin¬ 
gle-molecule RNA detection via FISH allows for the 
accurate quantification in these contexts and allows for 
the detection of the localization of even low- 
abundance mRNAs and noncoding RNAs (Khalil 
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et al. 2009). Single-molecule counting also provides 
absolute quantification, which is useful even in cases 
like cell lines in which bulk methods are in common 
use, providing, for instance, very accurate measure¬ 
ments of variability in gene expression (Raj et al. 
2006; Raj and van Oudenaarden 2009). On the 
horizon, the development of live single-molecule 
RNA FISH methods promises to bring a much deeper 
temporal understanding of transcription and RNA 
dynamics. 

Summary 

The advent of single-molecule sensitivity represents in 
many ways the ultimate fruition of RNA FISH. 
Through the establishment of simple, reliable methods, 
RNA FISH represents in many ways a new gold stan¬ 
dard for RNA quantification. On the horizon, the 
development of live cell versions of FISH promise to 
animate the already rich three-dimensional picture 
RNA FISH can yield in fixed cells. With applications 
across a range of biological fields, RNA FISH is 
a powerful tool for studying gene expression and 
RNA biology. 

Cross-References 

► Fluorescence 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Labeling of Nucleic Acids 

► mRNA Localization 

► RNA Polymerases and Transcription 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Single-Molecule Methods 

► Single-Particle Tracking 

► Splicing 
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Single-Molecule Sequencing 

Jonas Korlach and Stephen W. Turner 
Pacific Biosciences, Menlo Park, CA, USA 

Definition 

Single-molecule sequencing refers to techniques 
that can read the base sequence directly from individ¬ 
ual strands of DNA or RNA present in a sample of 
interest. 

Basic Characteristics 

Nucleic acid sequencing represents one of the 
most important techniques in the study of biological 
systems - the order of the four different bases in the 
genome and transcriptome dehne the genetic blueprint 
of an organism. In addition to this basic hereditary 
genetic information, DNA also contains epigenetic 
modihcations that are variable with respect to space 
(the organism’s different tissues and cell types) and 
time (different stages in the organism’s life cycle). 
While sequencing has already altered the way biolog¬ 
ical research is undertaken, extensive and cost- 
effective DNA and RNA sequencing, when combined 
through powerful computational methods with proteo- 
mics, biomolecular dynamics, and a host of other phe¬ 
notypic information, is expected to precipitate a radical 
expansion of our understanding of biology, and to 
improve human health, food and energy supplies, and 
environmental conservation. 

Decoding the sequence of bases in a population of 
identical DNA molecules became technically feasible in 
the late 1960s (reviewed in (Wu 1993)).These ensemble 
(non-single-molecule) sequencing technologies have 
steadily grown to provide ever greater DNA sequencing 
throughput at reduced cost (reviewed in [Mardis 2008; 
Shendure and Ji 2008]). Concurrently, there has also 
been great interest in pushing the DNA input sensitivity 
to the ultimate analytical limit - obtaining the sequence 
from an individual DNA molecule. The principal driv¬ 
ing forces behind these efforts are: 

1. Increasing intrinsic sequencing speeds. Ensemble 

DNA sequencing technologies rely on precisely 


gating the identical processing of many DNA mol¬ 
ecules during sequencing to assure synchronicity 
within the clonal DNA population. This is com¬ 
monly achieved by terminating various enzymatic 
activities (such as DNA polymerization or ligation) 
after one or more bases, interrogation of the result, 
removal of the termination agent, and repetition of 
this cycle. The multi-step nature of this protocol 
(often including several washing steps) results in 
relatively slow cycle times and thus low intrinsic 
DNA sequencing speeds - often many minutes per 
base sequenced. 

2. Increasing the length of DNA sequencing reads. In 
ensemble-based sequencing techniques, the DNA 
sequence quality decreases with sequence length 
because the yield for the gated addition of each 
base is less than 100%, thereby resulting in the 
population of molecules becoming less synchro¬ 
nized with each processing cycle. This limits the 
readlength of these techniques to a range of only 
several hundred bases. Single-molecule sequencing 
interrogates each DNA molecule independently, 
thereby avoiding this problem of loss of synchrony 
or dephasing, resulting in longer readlengths with 
uniform sequence quality. 

3. Avoiding processing steps prior to sequencing. 
Ensemble DNA sequencing technologies routinely 
require amplihcation techniques, such as polymer¬ 
ase chain reaction (PCR) or whole-genome ampli¬ 
hcation (WGA). These steps have the potential to 
introduce limitations and errors, e.g., limiting the 
size of DNA fragments, introducing population bias 
through uneven amplihcation efficiencies, and base 
sequence errors introduced during the amplihcation 
process. They also increase the overall time and 
cost of sequencing. 

4. Resolving heterogeneous DNA mixtures. Com¬ 
monly, natural samples contain a mixture of differ¬ 
ent DNA molecules, e.g., maternally and paternally 
inherited alleles, environmental microbial samples, 
tissue biopsies containing both healthy and tumor 
cells, or forensic samples from crime scenes. To 
address this problem, some ensemble-based 
sequencing techniques employ sample preparation 
steps that proceed through dilution and subsequent 
amplihcation steps that produce clonal populations 
(originating from a singly diluted molecule) for 
sequencing. In addition to the concerns around 
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Single-Molecule Sequencing, Table 1 Commercialization of different single-molecule sequencing methods 

Company 

Method 

Website 

Halcyon Molecular 

Transmission electron microscopy (TEM)-based SMS 

www.halcyonmolecular.com 

Helicos Biosciences 

True Single-Molecule Sequencing (tSMS'‘'“) 

www.helicosbio.cora 

IBM 

Transistor-based nanopore DNA sequencing 

www.ibra.com 

Life Technologies 

Qdot-based SMS 

www.lifetechnologies.com 

NobleGen Biosciences 

Optical readout nanopore DNA sequencing 

www.noblegenbio.com 

Oxford Nanopore Technologies 

Nanopore DNA sequencing 

www.nanoporetech.com 

Pacific Biosciences 

Single-Molecule, Real-time (SMRT™) DNA sequencing 

www.pacificbiosciences.com 

Reveo 

Scanning tunneling microscopy (STM)-based SMS 

www.reveo.com 

ZS Genetics 

Transmission electron microscopy (TEM)-based SMS 

www.zsgenetics.cora 


amplification described in (3), these steps typically 
break the original DNA fragments into small pieces, 
thereby confounding the task of correctly assem¬ 
bling the native constitution of the heterogeneous 
DNA sample. 

5. Reading DNA base modifications as part of 
sequencing. Base modifications are present in the 
genomes of virtually all organisms, as well as 
viruses and phages. They include epigenetic 
markers essential for growth and development 
(5-methylcytosine (5-mC), 5-hydroxymethylcytosine 
(5-hmC)), bacterial identity marks important in 
host-phage interactions (5-mC, 5-hmC, glucosylated 
5-hmC, 6-methyladenine, 4-methylcytosine), or 
products of DNA damage (e.g., 8-oxoguanine, 
BPDE adducts (a major form of damage from ciga¬ 
rette smoke), thymine dimers) (Murray 2002; Siede 
et al. 2005; Tollefsbol 2010). Currently, the vast 
majority of these modifications cannot be sequenced 
with ensemble-based sequencing techniques, and 
where possible require complex chemical steps 
(e.g., bisulfite conversion of 5-methylcytosine). 
Single-molecule sequencing technologies aim to 
read these important DNA marks as an integral part 
of the sequencing method. 

Several different single-molecule DNA sequenc¬ 
ing techniques have been developed or are currently 
in development (reviewed in [Xu et al. 2009; Schadt 
et al. 2010b]), and are being commercialized 
(Table 1). 

There are currently two principal approaches; 
(1) DNA polymerase-based single-molecule DNA 
sequencing, and (2) direct inspection of individual 
DNA strands. 


DNA Polymerase-Based Single-Molecule 
Sequencing 

Gated Single-Molecule Sequencing-By-Synthesis 
Gated sequencing-by-synthesis - at the heart of many 
ensemble-based DNA sequencing approaches - has 
been adapted to the single-molecule regime in 
a technology commercialized by Helicos Biosciences. 
In its principle, individual DNA molecules are 
extended and detected one base at a time through the 
use of DNA polymerase and fluorescently labeled, 
terminating nucleotides that carry a cleavable moiety 
to remove the termination group after the detection 
step. The technology meets the desired performance 
criteria (3) and (4) described above - avoiding ampli¬ 
fication prior to sequencing, and interrogating DNA 
molecules asynchronously, thereby allowing the 
sequencing of mixed samples. However, because 
sequencing is still based on gating the polymerase 
activity, it does not meet criteria (1) and (2), with 
cycle times of ^2 h per base, and average readlengths 
of ^35 bases. 

Single-Molecule, Real-Time DNA Sequencing 
Engineered by nature, DNA polymerases are arguably 
the most advanced sequencing machines. Over 
millions of years of molecular evolution, they have 
been optimized to rapidly and faithfully replicate 
genomes. Their speed, accuracy, processivity, and sen¬ 
sitivity to base modifications can be harnessed directly 
in single-molecule, real-time sequencing approaches 
using DNA polymerase - not as a reagent consumable, 
but as the sequencing engine itself. 

Single-molecule sequencing by monitoring such 
uninterrupted activity of DNA polymerase is realized 
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in Single-Molecule, Real-Time (SMRT™) DNA 
sequencing commercialized by Pacific Biosciences. 
Phospholinked nucleotides - carrying the fluorescent 
label attached to the terminal phosphate moiety - are 
employed to allow continuous DNA synthesis, as the 
fluorescent label is removed by the natural enzymatic 
activity of the DNA polymerase during base incorpo¬ 
ration. The enzyme/DNA complex is placed in 
a photonic nanostructure called a zero-mode wave¬ 
guide (ZMW), which confines the observation volume 
to the immediate vicinity of the enzyme, thereby mak¬ 
ing the polymerase activity visible against the solution 
background of diffusing labeled nucleotides, and mak¬ 
ing gating and washing steps unnecessary. Individual 
DNA molecules are independently and asynchro¬ 
nously sequenced on arrays containing tens of thou¬ 
sands of ZMW, with intrinsic sequencing speeds of > 1 
base per second, and readlength averaging >1,000 
bases. On topologically circular DNA template mole¬ 
cules, the polymerase will undergo rolling-circle DNA 
synthesis, allowing a consensus sequence to be 
obtained from multiple passes on the same molecule 
(Travers et al. 2010). Using this circular consensus 
sequencing approach, the error-free sequencing of an 
individual DNA molecule greater than 1 kb in length 
(segment 7 of influenza strain HlNl (A/New York/ 
1682/2009); 1,027 bp) has been reported (Schadt et al. 
2010a). The polymerase kinetics is affected by the 
DNA template structure, allowing the sequencing and 
identification of epigenetic markers and other DNA 
base modifications (Flusberg et al. 2010). 

Another real-time sequencing approach using quan¬ 
tum dots for labeling polymerase molecules and nucle¬ 
otides in conjunction with fluorescence resonance 
energy transfer as an alternative observation volume 
confinement strategy is pursued by Life Technologies, 
but little information is available about the feasibility 
of this approach. 

Direct Inspection of Individual DNA Strands 

Nanopore-Based Single-Molecule Sequencing 
In nanopore-based single-molecule sequencing 
approaches, DNA molecules are individually driven 
through nanoscale pores of dimensions that only 
permit the passage of single-stranded DNA in strict 
linear sequence. Nanopores can be of biological 
origin - membrane protein complexes such as 
a-hemolysin or Mycobacterium smegmatis porin 
A (MspA) protein - or they can be created 


synthetically from solid state materials, such as silicon 
nitride, aluminum oxide, multilayer graphene sand¬ 
wiches, or carbon nanotubes. 

The pore contains a detection region capable of 
differentiating the four different bases, thereby gener¬ 
ating a temporal signal corresponding to the sequence 
of bases as they traverse the pore. Different types of 
signals are being evaluated, including differential ionic 
current blockades through the pore, tunneling currents 
across the pore, or capacitance changes. Using an 
a-hemolysin protein nanopore with a covalently 
attached cyclodextrin adapter molecule as the detector 
reading head, the four bases and 5-methylcytosine 
could be distinguished as free nucleotide 
monophosphates by measuring the residual pore 
current (Clarke et al. 2009). 

Precise control over the DNA transport through the 
pore is crucial to the success of these methods, and has 
in some instances been implemented by the use of 
enzymatic activities, such as polymerases or exonucle¬ 
ases positioned close to the pore’s entrance. Using this 
approach, the real-time detection of several sequential 
nucleotide additions by a DNA polymerase has been 
measured by differential current blockage of an 
a-hemolysin pore as the DNA template strand was 
drawn through the nanopore lumen during replication 
(Chu et al. 2010; Lieberman et al. 2010). To address 
potential challenges related to achieving large multi¬ 
plex factors of independently monitoring the small 
electrical signals from each nanopore, one alternative 
strategy has been reported which allows for the parallel 
optical readout from multiple nanopores (McNally 
et al. 2010). 

Microscopy-Based Single-Molecule Sequencing 
Advanced electron microscopy techniques are being 
pursued to directly image the sequence of bases in 
a DNA template molecule. The different approaches 
include transmission electron microscopy (TEM) for 
direct chemical detection of atoms that would uniquely 
identify the different bases. Other approaches include 
prior differential chemical labeling of the bases with 
compounds such as iodine and bromine to enhance 
detectability and contrast. A third approach encom¬ 
passes the imaging of single-stranded DNA molecules 
by scanning tunneling microscopy (STM), and detec¬ 
tion of the four bases by differential tunneling current 
between the atomically sharp STM tips, the base, and 
a conductive surface. Using this approach, an 
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“electronic fingerprint” of guanine bases in single- 
stranded DNA molecules of known sequence has 
been demonstrated (Tanaka and Kawai 2009). 


Cross-References 

► DNA Polymerase 

► Electron Microscopy 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Single-Molecule Methods 

► Zero-Mode Waveguides 
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Introduction 

We are used to conceptualizing chemical reactions and 
biochemical processes in terms of individual mole¬ 
cules. We do so on various levels of complexity, rang¬ 
ing from simple textbook cartoons to the atomic detail 
of ► molecular dynamics simulations (► Molecular 
Dynamics Simulations of Lipids). But, remarkably, 
the vast majority of our knowledge about these systems 
has been derived from experiments on large ensembles 
of molecules, which yield only average values of 
observable properties. The sequence of molecular 
events describing the underlying reactions is typically 
inferred from testing models by systematic variation of 
parameters. For kinetic ensemble studies, the reactions 
need to be synchronized, which is often difficult. The 
concept of observing single molecules is particularly 
appealing for processes with a large degree of confor¬ 
mational and dynamic heterogeneity, which abound in 
biological systems. 

Only rather recently has it become feasible to 
investigate biological systems on the level of single 
molecules with spectroscopic methods. These tech¬ 
niques offer a fundamental advantage beyond our 
mere fascination for the direct observation of molecu¬ 
lar processes: they can resolve and quantify the 
properties of individual molecules or subpopulations, 
which are often inaccessible in ensemble experiments. 
► Fluorescence spectroscopy (► Fluorescence and 
FRET in Membranes) has been a particularly popular 
technique, owing to its extreme sensitivity and versa¬ 
tility. For example, single-molecule spectroscopy 
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allows individual molecules to be localized and their 
motions to be tracked; in combination with ► Forster 
resonance energy transfer (FRET) (► Fluorescence 
and FRET in Membranes), inter- and intramolecular 
distances and distance distributions can be quantihed; 
time-resolved measurements allow the determination of 
mechanisms for a wide range of biological processes, 
such as binding, assembly, conformational changes, or 
the action of biomolecular machines; dynamics and 
diffusive processes can be investigated on timescales 
down to nanoseconds with ► fluorescence correlation 
spectroscopy (FCS) (► Fluorescence Correlation Spec¬ 
troscopy); and many of these methods can be applied 
both in vitro and in vivo. The use of single-molecule 
detection has also led to important applications, such as 
super-resolution microscopy (► Stimulated Emission 
Depletion Microscopy (STED), ► Photoactivated 
Localization Microscopy (PALM)), and high- 
throughput DNA sequencing (► Single-Molecule 
Sequencing). In this entry, some of the history, the 
basic underlying concepts of single-molecule spectros¬ 
copy and analysis, and the most common techniques are 
summarized. 

Historical Development and Principles of 
Optical Single-Molecule Detection 

Remarkably, some of the very first single molecules 
detected were proteins. In 1970, step-like changes in 
ion conductance were observed for artificial lipid films 
containing small numbers of gramicidin A molecules. 
Since then, the recording of such signals from single 
channels (► Patch-Clamp Recording of Single Chan¬ 
nel Activity: Acquisition and Analysis) has matured to 
a well-established electrophysiological technique 
(Sakmann and Neher 1995). Due to its comparatively 
long history, the single-channel field provides a wealth 
of examples and analysis methods for the stochastic 
kinetics observed in single molecules. A very different 
type of method allowed the first observation of single 
atoms and molecules on surfaces in the early 1980s; 
scanning tunneling microscopy and ► atomic force 
microscopy (AEM) (► Atomic Eorce Microscopy), 
where a very fine tip is used to probe the surface of 
a sample with atomic resolution. The simplicity of the 
instrumentation quickly made AFM a standard 
method, and soon it allowed the imaging of samples 
in solution, including proteins. In the late 1990s, the 


first mechanical single-molecule experiments on bio¬ 
molecules were reported, and the investigation of 
biomolecular processes with ► AEM (► Atomic 
Eorce Microscopy) and ► optical tweezers (► Optical 
Tweezers) or ► magnetic tweezers (► Magnetic Twee¬ 
zers) has developed into a large field in biophysics. 
Even though these methods are sometimes referred to 
as “force spectroscopy” (► Atomic Eorce Spectros¬ 
copy), this entry focuses on the study of the interaction 
between electromagnetic radiation and matter for 
probing individual molecules, in particular the absorp¬ 
tion and emission of visible light. 

Already in the 1970s, it became possible to measure 
fluorescence from single atoms in dilute atomic beams, 
that is, in the gas phase, where the background 
problem is minimal. Observing single molecules in 
the condensed phase is much more challenging 
because Rayleigh and Raman scattering produce 
a large background. Additionally, the ubiquity of con¬ 
taminants makes great demands on the purity of the 
matrix. Einally, an atom in vacuum is a chemically 
very stable system, even in its excited state, whereas 
fluorescent molecules in the condensed phase survive 
only a limited number of excitation-emission cycles 
before they are irreversibly destroyed - a process 
termed ► photobleaching (► Single Fluorophores 
Photobleaching). Single-molecule detection in a solid 
or liquid matrix therefore requires additional mea¬ 
sures. First of all, the method must provide a signal 
from an individual molecule that exceeds the back¬ 
ground from the large number of matrix molecules. 
This requirement makes fluorescence a very attractive 
approach, where a dye molecule resonantly interacts 
with the excitation light. Due to the Stokes shift, the 
emitted light can be selected spectrally from scattered 
light, for example, with interference filters, which can 
provide ratios of transmitted and reflected intensities at 
different wavelengths in excess of 10®. Since the back¬ 
ground signal from matrix molecules is proportional to 
the number of matrix molecules contained in the 
observed volume, a common strategy for the detection 
of signal from individual molecules is to reduce the 
size of the detection volume as much as possible. Such 
spatial selection can be achieved, for instance, by very 
tightly focusing a laser beam into the sample, com¬ 
bined with confocal detection (Rigler et al. 1993), or 
by total internal reflection fluorescence (TIRF) 
(► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies), which allows excitation 
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Single-Molecule Spectroscopy, Fig. 1 Schematics of 
common single-molecule instruments using (a) confocal detec¬ 
tion and (b) total internal reflection fluorescence (TIRF). 
(a) Fluorescently labeled molecules diffusing in solution 
(or immobilized on the surface) are excited in a diffraction- 
limited focal volume, and the emitted fluorescence is collected 
by the same objective that forms the laser focus. A confocal 
pinhole acts as a spatial filter to remove out-of-focus light. The 
collected photons can then be sorted by their spectroscopic 
properties, for example, polarization and color (illustrated here 
for a typical four-channel single-molecule FRET experiment). 
The photons are focused onto sensitive point detectors, for 
example, avalanche photodiodes, and the signal is recorded by 

of a ^200 nm layer of the sample in the evanescent 
field of a reflected laser beam (Fig. 1). A further reduc¬ 
tion of the observation volume can be achieved with 
► zero mode waveguides (► Zero-Mode Wave¬ 
guides), which allows measurements at higher concen¬ 
trations of fluorescent sample. 

The first detection of individual chromophores in 
the condensed phase was achieved in 1989 at liquid 
helium temperature with modulated absorption 
spectroscopy (Moerner and Kador 1989). Shortly 
after, ► fluorescence spectroscopy (► Fluorescence 
and FRET in Membranes) was shown to yield 


suitable counting electronics. The same type of geometry is 
typically used for FCS. (b) Fluorescently labeled molecules 
close to the glass surface are excited in the evanescent field of 
a laser beam reflected at the glass/solution interface. Fluores¬ 
cence signal is collected through the objective and imaged onto 
a suitably sensitive array detector (e.g., a CCD image sensor). 
Spectral information can be obtained by splitting the image, for 
example, according to color (as shown here). A common alter¬ 
native to this prism-type configuration is the generation of an 
evanescent field at the lower interface by coupling the laser beam 
into the objective from below (similar as in a), but laterally 
displaced from the optical axis. 

a superior signal-to-noise (Orrit and Bernard 1990). 
Within a few years, successful fluorescence experi¬ 
ments at room temperature followed, leading to an 
explosion of the field (Zander et al. 2002), and opening 
the way for single-molecule detection of suitably 
labeled biomolecules (Weiss 1999). The introduction 
of ► FCS (► Fluorescence Correlation Spectroscopy) 
with a confocal detection scheme and single-molecule 
sensitivity was an important step for single-molecule 
studies in solution (Magde et al. 1972; Rigler and 
Elson 2001; Rigler et al. 1993). The demonstration of 
► ERET (► Fluorescence and FRET in Membranes) in 
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individual, labeled DNA molecules in 1996 (Ha et al. 
1996) triggered a revival of the “spectroscopic ruler” 
(Stryer 1978), enabling distance measurements in sin¬ 
gle biomolecules (Selvin and Ha 2008). ► Surface- 
enhanced Raman spectroscopy (► Surface-Enhanced 
Raman Spectroscopy for Bioanalytics) is a technique 
based on the enhancement of light scattering 
by molecules adsorbed to metal surfaces and can 
also be used to detect single molecules (Zander et al. 
2002 ). 

Single-Molecule Spectroscopy Techniques 

Labeling for Single-Molecule Spectroscopy 

Since most chromophores intrinsic to biomolecules do 
not have sufficiently high fluorescence quantum yields 
or photostability for current single-molecule spectros¬ 
copy, extrinsic ► labeling (► Site-Specific Protein 
Labeling for Single-Molecule Fluorescence Studies, 
► Fluorescence Labeling of Nucleic Acids) with 
organic fluorophores, fluorescent proteins, or semicon¬ 
ductor nanocrystals (► Quantum Effects/Tunneling) is 
often required. Fluorophores such as rhodamine and 
cyanine dyes with emission in the visible or near- 
infrared range are commonly used and typically 
attached to biomolecules by derivatizing thiol or 
amine groups. The growing family of fluorescent pro¬ 
teins (Tsien 2009) allows the labeling of proteins by 
genetically encoding suitable fusion proteins, which 
can then be detected both in vitro and in vivo. How¬ 
ever, their photophysical properties are currently still 
inferior to those of the best organic fluorophores (Sauer 
et al. 2011). ►Quantum dots (►Quantum Effects/ 
Tunneling) are frequently used for imaging and local¬ 
ization studies because of their extreme photostability 
combined with broad and tunable absorption spectra 
(Rigler and Vogel 2008). 

Single-Molecule Detection and Localization 

At the core of many single-molecule techniques is the 
ability to detect and localize individual biomolecules 
in free solution, on surfaces or membranes, or in the 
context of larger assemblies or cellular structures. The 
methods most commonly used are total internal reflec¬ 
tion fluorescence (TIRF) (► Total Internal Reflection 
Fluorescence Microscopy for Single-Molecule Studies) 
or confocal excitation and detection (Fig. 1). Near-held 
scanning optical microscopy (SNOM/NSOM), which 


also exploits the evanescent held generated via a small 
aperture, has proven less suited for the investigation of 
soft matter. An advantage of the ► TIRF (► Total 
Internal Rehection Fluorescence Microscopy for Sin¬ 
gle-Molecule Studies) approach is the possibility to 
monitor the signal from many molecules at the same 
time, but detection is limited to molecules close to the 
surface where rehection occurs, and the speed of cur¬ 
rent cameras sufficiently sensitive for single-molecule 
detection limits the time resolution to the millisecond 
range (Selvin and Ha 2008). With confocal detection, 
molecules can usually only be sampled sequentially, 
but with the possibility of three-dimensional scanning 
and time resolution in the sub-nanosecond range 
(Selvin and Ha 2008; Schuler and Eaton 2008). 

The ability to localize individual molecules has 
been used extensively to track (► Single Particle 
Tracking) the diffusive motion of biomolecules, such 
as proteins or lipids in membranes, nucleic acid¬ 
binding factors, or motor proteins. With the known 
shape of the point spread function of emission from 
an individual molecule, the position of the emitter can 
be localized (► Fluorescence Imaging with One Nano¬ 
meter Accuracy (FIONA)) with much greater preci¬ 
sion than the diffraction limit. The uncertainty in the 
position decreases with the number of detected pho¬ 
tons (AO essentially as and is thus largely 

limited by the photostability of the fluorophore. 
The same principle for localization is employed in 
superresolution microscopy techniques in 
combination with the photoactivation of fluorophores 
(► Photoactivated Localization Microscopy (PALM)). 
► Stimulated Emission Depletion (STED) Microscopy 
is also based on the localization of individual 
molecules. 

Single-Molecule Spectroscopy: Photophysics and 
Photochemistry 

Under cryogenic conditions, embedded in solids, the 
emission lines of individual molecules can be widely 
distributed around the average wavelength due to local 
heterogeneities in the surrounding matrix (inhomoge¬ 
neous broadening). Changes in the local environment 
over time can then be observed as spectral diffusion, 
that is, changes in the resonance frequency for absorp¬ 
tion and emission of the chromophore over time 
(Basche et al. 1997). Such behavior has been investi¬ 
gated also for biological systems, for example 
individual photosynthetic pigment-protein complexes 
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(Zander et al. 2002). In solution, the dynamics of the 
solvent are too rapid to observe spectral diffusion, 
and the spectral properties of individual molecules 
resemble those observed with ensemble methods. 

Several processes that can often be ignored in 
ensemble spectroscopy of biomolecules have to be 
taken into account for single-molecule studies. Partic¬ 
ularly important is the process of ► photobleaching 
(► Single Fluorophores Photobleaching), the irrevers¬ 
ible light-induced chemical damage of a fluorophore, 
because it limits the observation time for the individual 
molecule. Related phenomena, collectively referred to 
as “blinking” (► Single Fluorophore Blinking), lead to 
discrete fluctuations of the fluorescence intensity with 
time. They arise from intermittent transitions of the 
fluorophore to a less or nonfluorescent state, and can 
either be detected directly in the fluorescence time 
trace as transitions from “on” to “off’ states, or as 
contributions to the fluorescence correlation function. 
Such “off’ states can be caused by a variety of mech¬ 
anisms, for example by transitions to triplet states, the 
transient formation of radicals by electron transfer, 
static fluorescence quenching, or isomerization of the 
chromophore. Since the resulting changes in fluores¬ 
cence intensity are often difficult to distinguish from 
the process of interest, such as conformational changes 
in a labeled protein, suitable additives are frequently 
used to minimize their contribution (Selvin and Ha 
2008). 

Single-Molecule Forster Resonance Energy 
Transfer (FRET) and Photoinduced Electron 
Transfer (PET) 

A goal of many single-molecule studies is the obser¬ 
vation of conformational transitions or molecular 
interactions that may be difficult to synchronize and 
observe in ensemble studies. This requires methods for 
probing intra- and intermolecular distances on the 
nanometer length scale. The most popular method is 

► FRET (► Fluorescence and FRET in Membranes, 

► Single-Molecule Eluorescence Resonance Energy 
Transfer, also called single-pair FRET, spERET), 
which is based on the radiationless transfer of excited 
state energy from a suitable donor to an acceptor chro¬ 
mophore. With the known dependence of the rate of 
transfer on the inverse sixth power of the distance, the 
fluorescence emission of donor and acceptor can be 
related quantitatively to the separation of the 
chromophores. In this way, distances, distance 


distributions, and the underlying dynamics can be 
monitored in single molecules on length scales of 
^5 nm, and on timescales from nanoseconds to 
minutes (Selvin and Ha 2008; Sauer et al. 2011; Rigler 
and Vogel 2008). Applications include protein-nucleic 
acid interactions, conformational changes in biomole¬ 
cules, RNA folding, and protein folding, to name but 
a few (Selvin and Ha 2008; Schuler and Eaton 2008; 
Sauer et al. 2011). Particularly useful for a quantitative 
analysis is the combined use of fluorescence count 
rates, fluorescence lifetimes, fluorescence anisotropy, 
correlation functions, and other derived parameters 
(Sisamakis et al. 2010). 

A complementary approach for the investigation of 
macromolecular dynamics is PET, which is based on 
the static ► quenching (► Quenched-Flow Methods) 
of a fluorophore by another group, frequently a trypto¬ 
phan residue (Sauer et al. 2011). Since PET requires 
very close contact of dye and quencher, dynamics 
over a shorter distance range can be probed than with 
► single-molecule FRET (► Single-Molecule Fluores¬ 
cence Resonance Energy Transfer), but usually no 
distance information can be extracted. In combination 
with correlation analysis, timescales down to nanosec¬ 
onds are accessible. 

Fluorescence Correlation Spectroscopy (FCS) 

All of the processes described above can lead to fluc¬ 
tuations (► Fluorescence Fluctuation Spectroscopy) in 
fluorescence intensity. An important way of quantify¬ 
ing the underlying dynamics is ► correlation analysis 
(► Fluorescence Correlation Spectroscopy) (Rigler 
and Elson 2001; Zander et al. 2002). This type of 
analysis can provide access to a wide range of times, 
for short times currently only limited by the time 
resolution of the detection system (typically in the 
picosecond to nanosecond range), and for long times 
by the observation time of the molecules. Although 
correlation analysis is frequently applied also to mea¬ 
surements of immobilized molecules, the most com¬ 
mon application is in confocal measurements on freely 
diffusing molecules. The accessible concentration 
range of fluorescently labeled molecules is much 
greater than in typical single-molecule experiments, 
from picomolar to low micromolar (with an optimal 
contrast in the nanomolar range), which facilitates 
many applications both in vitro and in vivo. 
A process very frequently investigated by ► FCS 
(► Fluorescence Correlation Spectroscopy) is 
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translational diffusion, both in three dimensions or, for 
example, within a membrane, but ► FCS 
(► Fluorescence Correlation Spectroscopy) can 

equally be used to study conformational changes 

probed by ► FRET (► Single-Molecule Fluorescence 
Resonance Energy Transfer) (Schuler and Eaton 2008; 
Rigler and Vogel 2008) or PET (Sauer et al. 2011), 
rotational diffusion, rates of chemical reactions, fluid 
flow, and a large range of photophysical and photo¬ 
chemical reactions (Rigler and Elson 2001). Many var¬ 
iants of ECS have been developed, including 

► fluorescence cross-correlation spectroscopy 

(► Eluorescence Cross-correlation Spectroscopy), 

► dual-focus ECS (► Dual-Beam Eluorescence Cross- 
Correlation Spectroscopy), scanning FCS, or image 
correlation spectroscopy (Rigler and Elson 2001; 
Sauer et al. 201 1). 

From Ensembles to Single Molecules: 
Kinetics and Equilibria 

In an infinitely large homogeneous system (a good 
approximation for a standard experiment in a test 
tube, and commonly used in chemical kinetics: 
100 pi of a 1 pM sample still contain 6 x 10*^ mole¬ 
cules) consisting of a set of n subpopulations (“states”) 
separated by sufficiently large free energy barriers, the 
concentrations of molecules in these states will evolve 
in time according to a set of coupled, first order, ordi¬ 
nary differential equations: the reaction rate equations. 
In this case, the overall reaction is thus treated as 
a continuous, deterministic process. The time course 
of the reaction will be described by a sum of n-1 
exponential terms. Their individual time constants, 
however, will be complex algebraic expressions of 
all the rate constants in the mechanism and therefore 
have, in general, no simple physical significance. 

In fact, however, the system is neither continuous 
nor deterministic. It is discrete, because molecules 
come in integral numbers, and molecular populations 
only change by integer amounts. Moreover, the system 
is stochastic, because the transitions of individual mol¬ 
ecules from one state to another are ultimately initiated 
by thermally driven diffusive fluctuations and molec¬ 
ular collisions in the system. If we observe the signal 
from a sufficiently small number of molecules, these 


fluctuations about the average behavior become large 
enough to be measured, and can be used to extract 
information about the dynamics of the system, an 
insight conceived in the context of light scattering 
experiments and membrane channel recordings, and 
now used routinely, for instance, in ► ECS 
(► Eluorescence Correlation Spectroscopy). Let us 
illustrate this point with the example of a simple 
protein folding process (but the same principles apply 
to other reactions). Suppose we observe N = 1,000 
protein molecules under conditions corresponding to 
their unfolding midpoint. The probability of an 
individual molecule being folded is p = 0.5, so the 
variation in the number of folded molecules is given by 
the standard deviation of the binomial distribution as 
[N p (7-f>)]'^^ ~ 16. The number of protein molecules 
that are folded at equilibrium is therefore not constant, 
but it is 500 ±16, where the standard deviation reflects 
the random fluctuations in the number of folded 
molecules from instant to instant. 

The smaller the number of molecules we observe, 
the larger the relative fluctuations of the signal will be. 
Ultimately, if only a single molecule is observed 
(► Kinetics: Single-Molecule Techniques), we expect 
step-like transitions between the states that the mole¬ 
cule can populate. In the simplest example of a two- 
state reaction 

U^N (1) 

where equilibration within each state is fast relative to 
the transition rates between the states, the molecule 
will reside in either the folded (N) or the unfolded state 
(U) for extended periods of time, with intermittent 
rapid jumps between them. The average dwell times 
in N and U are given by the inverse rate coefficients of 
the unfolding reaction and the folding reaction kf, 
respectively. 

If we consider the temporal behavior of an individ¬ 
ual molecule with transitions between a folded and an 
unfolded state, the term “fraction of molecules folded 
at equilibrium” must be replaced by the “fraction of 
time for which the molecule is folded,” a quantity that 
can be measured accurately from a single-protein mol¬ 
ecule only with observation times much longer than 
the effective relaxation t time of the two-state system. 
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that is, the inverse sum of the folding and unfolding 
rate constants kf and 


lim 

Ar^O 


U{t+^t) - U{t) 
At 


dU{t) 

dt 


-xU{t) 


(5) 


t={kf + k,)-' ( 2 ) 

Although observation times in single-molecule 
fluorescence experiments at non-cryogenic tempera¬ 
tures are limited by the photo stability of current 
chromophores, examples of this behavior are 
available, such that a quantitative kinetic and thermo¬ 
dynamic analysis of the reaction from an individual 
molecule are feasible. 

In thinking about the ► kinetics of a single 
molecule, an important question is: given the protein 
molecule is, say, unfolded at time t, what is the 
unfolded state lifetime Tu of the molecule? More gen¬ 
erally, we would like to know the probability density 
functions of the times the protein molecule stays 
folded or unfolded. To this end, we first define U(t) 
as the probability that the molecule remains unfolded 
throughout the time from 0 to t. If we now assume that, 
for an unfolded molecule, the probability P of folding 
during a short time interval Af is just proportional to 
the length of this time interval (Af must be small 
relative to 1/a, so we can neglect processes such as 
multiple transitions during Af), we get the conditional 
probability 

P{t+ Af I unfolded at f) = a Af, (3) 

where a is a constant, independent of time. This 
latter assumption means that the probability of 
folding is independent of what has happened prior to f. 
This is a fundamental characteristic of this type of 
random process (a ► Markov process, ► Markov 
Model). As the molecule can either fold or not fold 
during Af, and the probabilities of these two alternatives 
must add up to 1, we thus also know that the probability 
of remaining unfolded during Af is 1^ Af. We can then 
express the probability that the molecule remains 
unfolded during the entire interval from 0 to f -i- Af as 

t/(f-f Af) = t/(f)(l-aAf), (4) 

taking into account the Markov assumption. After 
slightly rearranging the equation, we get in the limit 
of Af ^ 0 


by the definition of the derivative. As (7(0) = 1, 
we obtain as a solution 

U{t) = e-" (6) 

and the corresponding probability density function of 
the unfolded state lifetime 

p{t) = ae-^‘ (7) 

We recognize immediately that our constant a is 
the inverse time constant Tu, and thus the rate coeffi¬ 
cient kf. It is this connection with the single-molecule 
world that illustrates why exponential decays are 
so abundant in chemical kinetics. The concepts of 
stochastic processes and statistical physics are thus 
crucial for the description of single-molecule experi¬ 
ments, and a wide range of theoretical methods for 
their rigorous analysis have been developed (Barkai 
et al. 2009). 

Current Developments and Future Prospects 

Even though many types of single-molecule 
experiments have reached a high level of maturity, as 
illustrated by high throughput applications (► Single- 
Molecule Sequencing) and the availability of 
commercial instrumentation (Resch-Genger 2008), 
the methodology is still advancing at a high rate. 
Essential components of the instrumentation, includ¬ 
ing lasers, detectors, and counting electronics, are 
continuously being improved, and many extensions 
of existing single-molecule techniques have been 
developed. Examples are the combination of single¬ 
molecule spectroscopy with ► microfluidics 
(► Microfluidics for Single Molecule Detection), 
► atomic force microscopy (► Force-Fluorescence 
Spectroscopy), ► particle trapping, and ► zero mode 
waveguides (► Zero-Mode Waveguides). With the 
development of free electron lasers (► Single- 
Molecule Methods), the prospect of obtaining much 
more detailed structural information from single 
molecules than currently possible is on the horizon. 
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Summary 

Single-molecule spectroscopy can be used to resolve 
and quantify the properties of individual molecules or 
subpopulations even in heterogeneous assemblies 
or environments, which are often inaccessible in 
ensemble experiments. Fluorescence spectroscopy 
has been a particularly popular single-molecule tech¬ 
nique, owing to its extreme sensitivity and versatility. 
For example, single-molecule spectroscopy allows 
individual molecules to be localized and their motions 
to be tracked; in combination with Forster resonance 
energy transfer (FRET), inter- and intramolecular dis¬ 
tances and distance distributions can be quantified; 
time-resolved measurements allow the determination 
of mechanisms for a wide range of biological pro¬ 
cesses, such as binding, assembly, conformational 
changes, or the action of biomolecular machines; 
dynamics and diffusive processes can be investigated 
on timescales down to nanoseconds with fluorescence 
correlation spectroscopy; and many of these methods 
can be applied both in vitro and in vivo. Besides its 
increasing use in biophysical research, single¬ 
molecule spectroscopy has also led to important 
applications, such as super-resolution microscopy and 
high-throughput DNA sequencing. 

Cross-References 

► Atomic Force Microscopy 

► Atomic Force Spectroscopy 

► Automated Patch Clamp: Advantages and 
Limitations 

► Dual-Beam Fluorescence Cross-Correlation 
Spectroscopy 

► Fluorescence and FRET in Membranes 

► Eluorescence Correlation Spectroscopy 

► Eluorescence Cross-Correlation Spectroscopy 

► Eluorescence Fluctuation Spectroscopy 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Labeling of Nucleic Acids 

► Force-Fluorescence Spectroscopy 

► Magnetic Tweezers 

► Markov Model 

► Microfluidics for Single Molecule Detection 

► Optical Tweezers 

► Photoactivated Localization Microscopy (PALM) 


► Quantum Effects/Tunneling 

► Quenched-Elow Methods 

► Single Eluorophore Blinking 

► Single Eluorophores Photobleaching 

► Single Particle Tracking 

► Single-Molecule Eluorescence Resonance Energy 
Transfer 

► Single-Molecule Fligh-Resolution Colocalization 
(SHREC) 

► Single-Molecule Fligh-Resolution Imaging with 
Photobleaching (SHRImP) 

► Single-Molecule Methods 

► Single-Molecule Sequencing 

► Single-Particle Tracking 

► Site-Specific Protein Labeling for Single-Molecule 
Fluorescence Studies 

► Stimulated Emission Depletion (STED) Microscopy 

► Surface-Enhanced Raman Spectroscopy for 
Bioanalytics 

► Surface-Tethered Lipid Vesicles 

► Torque 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 

► Zero-Mode Waveguides 
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Single-Particle Tracking 

Arash Sanamrad and Johan Elf 
Uppsala University, Uppsala, Sweden 

Synonyms 

Single-molecule methods; Single-dye tracing 


Definition 

The use of optical microscopes and tracking algo¬ 
rithms to obtain movement trajectories of individual 
particles or molecules in vitro or in living cells. 

Basic Characteristics 

Single-particle tracking (SET) has become an impor¬ 
tant class of techniques for studying how individual 
particles or molecules move in vitro or in living cells. 
This has been made possible by the development of 
bright and photostable probes, quantitative labeling 
techniques, simple and stable fluorescence micro¬ 
scopes as well as sensitive and fast cameras. 

The study of individual particles is important when 
the particles are heterogeneous, for example, because 
they are in different conformations, in different bind¬ 
ing states, or in different microenvironments. In such 
cases, information is lost by averaging over the ensem¬ 
ble of particles. SET can be used to study how diffusion 
properties change on different length- and timescales 
and can reveal, for instance, that diffusion is fast on 
a short length-scale but slow on a long length-scale. 
One of the advantages of SET is that it keeps track of 
the identities of particles over time, which is essential 
if one is interested in for how long time a particle is 
bound to an immobile target or in a particular mode of 
translocation. One good example is single-virus track¬ 
ing in living cells, which made it possible to identify 
the different phases of infection and the mechanism of 
cell entry (Rust et al. 2008). In vitro SET has made it 
possible to study for example how motor proteins walk 
on actin filaments and microtubules with nanometer 
resolution (Selvin et al. 2008). Here, the single-particle 
nature and time resolution of SET are essential to 
resolve the individual discrete steps, which clearly 
would be lost by averaging over a population of parti¬ 
cles or over time. 

Eluorescence-based SET typically requires a stable 
epifluorescence or total internal reflection fluorescence 
(TIRE) microscope with laser excitation, a high- 
numerical-aperture objective and a sensitive and fast 
charge-coupled device (CCD) camera. Optimal illumi¬ 
nation conditions depend very much on the probe and 
the sample. However, just like any type of fluorescence 
imaging, high-precision tracking requires a good sig- 
nal-to-background ratio. Therefore, laser excitation is 
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often necessary to excite the fluorophores at high inten¬ 
sities at the optimal wavelength while simultaneously 
minimizing the excitation of intrinsically fluorescent 
cell components. If the process under investigation can 
be monitored close to the coverslip by surface immo¬ 
bilization or by studying membrane-associated cellular 
processes, TIRF illumination is often used because 
fluorescence excitation is restricted to the ^150-nm 
zone closest to the coverslip and out-of-focus 
autofluorescence is minimized. The objective should 
have a high numerical aperture to collect the maximum 
amount of light. Furthermore, the camera needs to be 
sufficiently fast and sensitive and it is therefore advan¬ 
tageous to use an electron-multiplying CCD 
(EMCCD) camera, which is very sensitive and rela¬ 
tively fast, or a complementary-metal-oxide semicon¬ 
ductor (CMOS) camera, which is very fast but not as 
sensitive. 

In addition to the signal-to-background consider¬ 
ations, SPT involves imaging particles that are mov¬ 
ing, possibly in living cells. The frame rate and 
illumination exposure time are therefore important. 
The optimal exposure time is a trade-off between 
collecting the large number of photons needed for 
precise determination of the location and minimizing 
the broadening of the fluorescence spot caused by the 
movement of the particle during the exposure time. For 
fluorescent proteins and most fluorescent dyes, irre¬ 
versible photobleaching limits the number of time 
points in the trajectories. For this reason, the frame 
rate can be varied independently of the exposure time 
in order to capture dynamics on different timescales 
before the particles bleach (Elf et al. 2007). Tracking in 
three dimensions is significantly more difficult than 
two-dimensional tracking. The additional information 
about the z-positions of individual particles can be 
obtained through methods based on, for example, opti¬ 
cal astigmatism (Kao and Verkman 1994) or multiple- 
plane detection. 

Particles are labeled in different ways for different 
applications (Giepmans et al. 2006). Eor accurate 
tracking over long time periods, it is important to use 
a probe that gives a strong signal and does not bleach 
easily. In this case, fluorescent beads, quantum dots or 
colloidal gold are popular choices. Colloidal gold is 
nonfluorescent but gives a very good contrast in bright- 
held and thermal imaging. The problem with these 
bleach-resistant probes is that they will in some cases 
interfere with the motion or function of the labeled 


particles due to their sizes (5-100 nm). Additionally, 
it can be difficult to maintain the solubility and to 
ensure a specific one-to-one labeling of the particles 
of interest. By comparison, fluorescent proteins 
can be genetically fused to a protein of interest and 
expressed in the cell type of interest. Their major 
drawbacks are the limited brightness and the poor 
photostability. In addition, the fluorescent protein 
may interfere with the activity of the protein of inter¬ 
est. In order to overcome the limitations of fluorescent 
proteins, new techniques are being developed to spe¬ 
cifically label small peptide motifs with bright and 
photostable organic dye molecules (Fernandez-Suarez 
and Ting 2008). 

Once the particles have been imaged over time, 
their locations should be determined in the individual 
frames and the locations of each particle should be 
connected into a trajectory that can be analyzed. 
Because of the diffraction-limited optics of light 
microscopes, photons from a small particle are distrib¬ 
uted according to an approximately Gaussian distribu¬ 
tion of width ^250 nm in the image plane. However, 
since many photons are collected from the same parti¬ 
cle, the center of the distribution, which corresponds to 
the actual position of the particle, can be determined 
with very high precision by fitting a Gaussian function 
to the image of the particle (Fig. 1). Background noise 
in the image due to autofluorescence, Raman scatter¬ 
ing, readout noise and dark current from the camera as 
well as pixelation noise reduce the fitting precision 
(Thompson et al. 2002) and need to be considered 
when optimizing the experiment for precise localiza¬ 
tion of the particles. 

Accurate tracking is problematic at high particle 
densities. The problems of high particle densities can 
be solved to some extent by using robust tracking 
algorithms; see for example Jaqaman et al. and refer¬ 
ences therein (Jaqaman et al. 2008). In order to over¬ 
come these problems, Manley et al. and Niu et al. have 
combined SPT with the use of photoconvertible 
proteins (Manley et al. 2008; Niu and Yu 2008). This 
makes it possible to have a large population of 
nonfluorescent molecules, to convert just a few at 
a time and to image these until they bleach. When 
combined with stroboscopic illumination the use of 
photoconvertible proteins makes it possible to obtain 
trajectories for hundreds of individual freely diffusing 
protein molecules in the cytoplasm of individual bac¬ 
terial cells (English et al. 2011). 
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Single-Particle Tracking, 
Fig. 1 Nine consecutive 
frames of a time-lapse movie 
of a freely diffusing protein in 
an exponentially growing 
Escherichia coli cell and 
a Gaussian fit to frame 3 



The movement of the particles can be characterized 
by estimating the mean square displacements (MSDs) 
from the trajectories, MSD(t) = <(r(t -i- 1 ) — r(t))^>, 
where r is the position of the particle and x is the time 
interval. The MSDs provide information about how the 
particles move on different timescales and can be 
compared to theoretically derived or simulated MSD 
curves corresponding to different hypotheses about par¬ 
ticle movement. For example, an unconfined diffusion 
model in two dimensions will give MSD(t) = 4Dt“, 
where D is the diffusion coefficient of the particle, a = 1 
for normal diffusion, a < 1 for subdiffusion and a > 1 
for superdiffusion. The analysis can also reveal that the 
motion is directed or confined (Saxton and Jacobson 
1997), with different implications for the molecular 
mechanism that causes the movement. 

Cross-References 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Labeling of Nucleic Acids 

► Live-Cell Single-Molecule Imaging 


► Protein Fluorescent Dye Labeling 

► Single-Molecule Fligh-Resolution Colocalization 
(SHREC) 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 
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Single-Stranded RNA Recognition 
Domains 
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Synonyms 

RNA; RNA recognition domains; Zinc fingers 


Definition 

Single-stranded RNA binding domains are protein 
domains that bind RNA in a single-stranded or linear 
conformation. 

Introduction 

RNA recognition by ssRNA binding proteins is a core 
event in mRNA metabolism that confers specificity to 


regulatory mechanisms and allows the coordination of 
the different steps of mRNA synthesis, processing, 
export, localization, translation, and degradation. 
Most ssRNA binding domains are relatively small 
(40-100 amino acids). Some have a unique fold or 
are found in a few copies across the genome while 
others are present in many hundreds of different pro¬ 
teins, often in multiple copies. They recognize ssRNA 
and - in the case of KH, RRM, and ZnF - ssDNA, but 
also sense the structural context the ss sequence is 
embedded in. Important open questions include the 
code of sequence recognition, the role of multiple 
synergistic interactions, and the role of local and 
inter-domain motions. Here we present a few impor¬ 
tant examples of protein-ssRNA complexes that illus¬ 
trate the diversity and the specificity of the interactions 
and we highlight some of the most important tech¬ 
niques used. The molecular characterization of these 
systems helps us to understand how ssRNA binding 
domains may be used to control gene expression. 

RRM (RNA Recognition Motif) 

RRMs (also called RNA Binding Domain or RBDs) 
are the most abundant eukaryotic RNA binding 
proteins and play a role in all of the steps of post- 
transcriptional regulation. A canonical RRM motif 
has a Pa(3(3aP topology (Fig. 1) and spans approxi¬ 
mately 90 residues. This central core is sometimes 
flanked by additional a-helices or P-strands that can 
extend or sequester the RNA binding surface as well as 
mediate RRM(s) functional oligomerization and 
change the relative arrangement of neighboring 
domains. In most RRM domains, the p-sheet RNA 
binding surface contains two conserved motifs 
(RNPl and RNP2) important to RNA binding 
(Table 1). Typically, two aromatic residues within 
these motifs stack on two RNA bases. This recognition 
unit can be extended to the two flanking RNA bases, 
while a positive charged residue on the P-sheet forms 
a salt-bridge with the RNA phosphodiester backbone. 
Due to the conformational flexibility of the ssRNA 
targets, an induced fit of the RNA to the protein surface 
is normally observed, where the loss of conformational 
entropy is compensated by the strong enthalpic contri¬ 
bution from intermolecular hydrophobic interactions 
and H-bonds, with the latter been crucial to sequence 
recognition. Indeed sequence recognition is achieved 
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Zn-finger Pumilio 



La 


Single-Stranded RNA Recognition Domains, 
Fig. 1 Structural folds of ssRNA binding domains: RRM 
(PDB accession code: 2AD9), KH (PDB 2HH3), Zn-flnger 
(PDB IZNF), Pumilio (PDB 1M8Z), La (PDB 1S7A) 


Single-Stranded RNA Recognition Domains, Table 1 Over¬ 
view of the secondary structure and conserved elements of the 
ss-RNA-binding proteins described in this review 


Protein 

Secondary structure 

Conserved elements 

RRM 

Pl«a[i2P.t«bP4 

RNPl six residues 

RNP2 eight residues 

KH 

paappa type I 

paap 

VIGXXGXXI 

Zn-flnger 

PPa 

X2CysX2 4CysX]2HisX3 4 sHis 

Pumilio 

aaaaaaa 

PUP repeat 
three a-helices 

La 

a(5aa(5(i 

Winged a-tum-a 


by a combination of stacking interactions and protein- 
RNA H-bonds. However, deviations from this basic 
mode of binding are found in many RRM-RNA 
complexes: for example, some RRM proteins use 


additional secondary structures or loops to recognize 
structural elements and to reach nM Kds (Clery et al. 
2008). In other cases, the binding of multiple RRM 
domains to the same RNA targets is used to reach 
the necessary affinity and specificity. The study of the 
RRM domains from the UlA and PTB proteins 
described below highlights how this motif can recog¬ 
nize target RNAs and act not just as a recruiting 
module but as an architectural factor. 

UlA protein (Fig. 2) uses the first of two RRM 
domains to recognize two different physiological 
targets, a small RNA stem-loop within the U1 small 
nuclear ribonucleoprotein (snRNA) and an internal 
loop within the 3'UTR of its own mRNA, the 
polyadenylation internal element (PIE). In the 
UlA-PIE interaction, a change in the orientation of 
the C-terminal helix away from the P-sheet surface of 
the protein mediates the dimerization of the protein on 
the 3'UTR and the recruitment of polyadenylation 
factors, creating a self-regulatory loop for UlA expres¬ 
sion. Binding of UlA to both its known targets is very 
tight (Kd ^ nM) and NMR and X-ray structural data 
(Oubridge et al. 1994; Allain et al. 1996) show that the 
interaction is mediated by both the recognition of 
a number of nucleotides within the RNA hairpin’s 
terminal loop and by contacts with the first base pair 
of the stem. The loop itself is mainly disordered in its 
free form and folds upon binding; the RNA bases 
create a tightly packed hydrophobic core with amino 
acid side chains of the protein p-sheet. Nitrocellulose 
filter binding assays showed that at least 70% of the 
free energy of binding derives from nonelectrostatic 
interactions. Extensive NMR analysis of protein 
dynamics in the UlA system coupled with thermody¬ 
namic data by CD and ITC have shown that the flexi¬ 
bility of the free protein can be used to lower the 
energy barrier of the interaction, while the entropic 
penalty derived from the loss of conformational free¬ 
dom in the loop nucleotides is compensated by 
a favorable enthalpic contribution. 

The polypyrimidine tract-binding protein (PTB) 
(Fig. 2) is an important factor in the regulation of 
alternative splicing and provides an interesting exam¬ 
ple of RNA recognition by multiple RRM domains. 
The isolated RRM domains of PTB recognize short 
ssRNA sequences using the surface of the P-sheet. 
However, while RRMl and 2 are free to rearrange 
their relative orientation in solution, RRM3 and 4 
interact via a large mainly a-helical hydrophobic 
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456 
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PTB1 



11 91 


210 282 


hUIA 



49 119 174 240 


424 491 


hNova-1 



32 99 130 196 


406 473 



KHSRP 


144 208 233 299 

322 38 

6 424 491 

1 

KH1 


KH2 


KH3 


KH4 

711 







h HiV-1 


h PUM1 


390 40Z 

-43 


1435 


428 
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1168 



PUM-HD 


n86 


99 111 


187 225 


334 


hLa 


La - RRM1 


RRM2 


408 


Single-Stranded RNA Recognition Domains, Fig. 2 Domain composition of representative ssRNA-binding proteins: PTB, 
hUlA, hNOVA, KSRP, hHIV-1, hPUM, hLA 


interface. Each of the four RRM domains of PTB 
interacts with 3-5 nucleotides in an extended confor¬ 
mation. NMR titration assays showed that the Kd of 
complexes between RRM2, RRM3, and RRM4 and 
polypyrimidine varies between 20 and 50 pM while 
RRMl binding is substantially weaker. On the con¬ 
trary, EMSA assays have shown that the four-RRM 
construct bind the RNA target(s) (i.e., sequences 
containing 15-20 pyrimidines) with a Kd in the 
nanomolar range. Although PTB RRM domains do 
not recognize a structural element during splicing reg¬ 
ulation, the use of multiple domains allows RNA rec¬ 
ognition with affinity and specificity comparable to the 
one achieved by structure-specihc proteins. Further, in 
the PTB system RNA binding by multiple domains has 
also an architectural role, as simultaneous binding to 
RRM3 and RRM4 implies a 180° change in the direc¬ 
tion of the RNA chain. This change, which has been 
validated by fluorescence and NMR spectroscopy, has 


been linked to the looping out of specific exons flanked 
by pyrimidine tracts. A further possible function for 
RRM3-RRM4 interaction is the general increase in the 
microsecond dynamics of the two RRMs when bound 
to RNA - as observed from *^N NMR relaxation mea¬ 
surements - which could provide a compensation for 
the entropic loss due to the interaction with the RNA. 

KH (K Homology) 

KH-containing proteins are common both in eukary¬ 
otes (type I KH with paappa topology) (Fig. 1) and 
bacteria (type II KH with appaap topology) (Table 1) 
and are associated with both transcriptional and post- 
transcriptional regulations. The KH motif is smaller 
than the RRM one, spanning about 70 residues. As for 
the RRM domains, the canonical KH fold can be 
flanked by additional elements. For example, in the 
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STAR family of proteins, the KH domain is flanked by 
a QUA2 motif that packs against the a-helical surface 
of the KH. In KH domains, nucleic acid recognition is 
mediated by a conserved hydrophobic cleft, flanked on 
one side by a short invariant GXXG segment and on 
the other by a variable loop. The interaction is based 
mainly on Van der Waals and hydrophobic contacts 
and, to a lesser extent, on electrostatic interactions 
(Valverde et al. 2008). KH domains interact with 
their target with different degrees of affinity and spec¬ 
ificity. They can recognize specifically up to four 
nucleobases although a larger number of nucleotides 
can be involved in the interaction. The initial analysis 
of the NMR and X-ray structures of a number of KH 
domains coupled to the evaluation of their sequence 
specificity, mainly by EMSA and SELEX, had indi¬ 
cated that the two nucleobases in the central positions 
(positions 2 and 3) were required to be either an A or 
a C. However, the in-depth analysis of the sequence 
specificity of the KSRP KH3 domain by scaffold inde¬ 
pendent analysis (SIA) (Beuth et al. 2007) and NMR 
has shown that a G is also tolerated in this position. So 
far, KH domains have not been shown to recognize 
structural elements within RNA physiological targets 
and their binding affinities to ssRNA are at least two 
orders of magnitude lower than the ones observed for 
UlA and Eox-1 RRM domains (e.g., UlARRMl). It is 
only when the canonical KH fold is extended by 
a flanking QUA2 motif that a higher affinity is reached. 
The more open RNA recognition surface and the larger 
size of the RRM domains are probably the key to the 
recognition of a broader range of targets and to a higher 
affinity. 

Most KH domains are found in multiple copies in 
eukaryotic proteins. The Nova and KSRP proteins 
exemplify how the different domains can be used syn- 
ergistically to recognize the RNA targets and are ana¬ 
lyzed below. The Nova-1 and Nova-2 splicing factors 
(Fig. 2) are the first multi-domain RNA binding pro¬ 
teins of the kind where a clear correspondence has 
been found between the in vitro determined sequence 
specificity (by SELEX and EMSA) and the ensemble 
of in vivo ssRNA targets (as defined by Cross-Linking 
Immuno-Precipitation - CLIP). Nova-1 and 2 proteins 
contain three KH domains connected by linkers of 
different lengths and the X-ray structure of Nova-1 
KH3 domain in complex with RNA was the first struc¬ 
ture to be solved of a KH domain in complex with 
nucleic acids (Lewis et al. 2000). EMSA and 


nitrocellulose filter binding assays indicated that the 
three-domain protein binds the RNA target with high 
affinity (Kd close to nanomolar to the target RNA) but 
the affinity of the isolated domains for the ssRNA 
target sequences is at least three orders of magnitude 
lower. Interestingly, the NMR analysis of the dynam¬ 
ics of the Nova KH3 backbone showed that a coupling 
may exist between protein oligomerization and RNA 
binding, with the dimerization of the domain leading to 
reduced high frequency motions in the RNA binding 
groove and potentially decreasing the entropic loss 
upon protein binding. 

Nova-1 recognizes sequences containing multiple 
copies of the YCAY tetranucleotide, while the four- 
KH domain KSRP (Fig. 2) protein recognizes a very 
broad range of sequences. The two proteins share 
a similar overall affinity for their targets (nanomolar 
Kds) and a combinatorial type of binding is observed 
also for the KSRP protein. Both Nova and KSRP have 
a number of structurally independent NA binding units 
joined by flexible linkers that allow the protein to 
recognize different RNA sequences and structures, 
although, solution structural studies have shown that, 
in KSRP, the two central KH2 and KH3 domain sand¬ 
wich the linker and assume a well-defined orientation, 
while the KH1-KH2 and KH3-KH4 domains are 
connected by long and flexible linkers. Interestingly, 
it has been shown that in KSRP the same domain can 
play a very different role in the recognition of the 
different physiological targets (Trabucchi et al. 
2009). This observation introduces a further element 
of complexity when evaluating the role of different 
domains of the same protein in RNA binding. 

Finally, several KH containing proteins display sta¬ 
ble di-domain units which act as RNA architectural 
factors. X-ray and CD data have shown that in 
Eubacteria the binding of the type II KH1/KH2 
domains of NusA protein to Nbox disrupt a short 
helix that plays an important role in antitermination. 
In vertebrates, interaction of two target sequences 
within the mRNA 3'UTR zipcode element with type 
I KH3/KH4 domains of IMPl results in a 180° change 
in the direction of the RNA backbone. This change in 
the direction of the RNA chain or looping of the RNA 
is similar to the one observed for the interaction 
between PTB RRM3/RRM4 and RNA. These two 
studies clarify that the use of inter-domain contacts to 
impose a change of direction on to the RNA chain is 
found both in KH and in RRM domains. 




2362 


Single-Stranded RNA Recognition Domains 




Zn-Fingers 

Zinc fingers (Fig. 1) are the most abundant NA-binding 
domain in eukaryotic genomes (Klug 2010). They can 
be classified based on the arrangement of the cysteines 
and histidines coordinating the zinc atom (e.g., CCHC, 
CCCH) (Table 1) and are, in most cases, <40 amino 
acids long. Zinc fingers can recognize ds and ssNA 
sequences with high specificity. We focus here on 
a few examples of ZnF-ssRNA recognition that high¬ 
light the general principles of the recognition and the 
role of inter-domain linkers in targeting specific RNA 
sites. 

In HIV-1, genome packaging is mediated primarily 
by interactions between the nucleocapsid NC domain 
of the viral Gag polyprotein (which contains two zinc 
fingers. Fig. 2) - and a ^110 nt segment of the viral 
RNA known as the T' which contains four stem-loops 
(SL1-SL4) connected by relatively short linkers. ITC 
studies have shown that the Kd of the complex between 
HIV-1 NC and SL2 is 110 nM and the one of the 
complex with SL3 is 170 nM. NMR data show that in 
the free HIV-1 NC protein the two zinc knuckle 
domains are connected by a flexible basic linker but, 
upon binding to the RNA target hairpins (SL2 or SL3), 
form extensive inter-domain interactions (De Guzman 
et al. 1998). The flexibility of the free NC protein is 
probably important to bind different stem-loops in an 
adaptive manner. Similarly, the disordered RNA 
nucleotides in the apical loop become ordered upon 
binding. Solution NMR revealed that tight binding 
of HIV-1 NC to SL2 (and SL3) is mediated by 
a large number of hydrogen bonds and that sequence 
specificity is defined by the recognition of solvent 
exposed guanosines both in HIV and the related 
MCV virus. Indeed, the use of a high number of 
H-bonds to define sequence specificity is a general 
hallmark of ZnF. A further contribution to recognition 
is provided by the inter-domain linkers and by the 
relative arrangement of the domains on the RNA, as 
exemplified by the TISlld and MBNLl proteins 
discussed below. 

TISlld (TPA-inducible sequence 11) binds to the 
AU-rich element (ARE) of the 3' untranslated region 
(3'-UTR) of target mRNAs and recruits the mRNA 
degradation machinery. The Kd of the complex 
between TISlld and a 20-mer ARE oligonucleotide 
from a target 3' UTR, determined by fluorescence 
anisotropy assays, is ^19 nM. The NMR structure of 


the tandem CCCH zinc finger domain of TISllD in 
complex with a shorter SELEX-optimized RNA 
nonamer shows that each finger contacts a separate 
5'-UAUU-3' sequence within the RNA molecule and 
that the complex is stabilized via a combination of 
electrostatic, hydrogen bonding, and stacking interac¬ 
tions, the latter between conserved aromatic side 
chains and the RNA bases. Contrarily to what is 
observed for the HIV-1 NC CCHC zinc fingers, NMR 
*H-*^N heteronuclear NOE data show that the central 
part of the linker is flexible and unaffected by binding 
and that the domains do not make contact upon 
interacting with the RNA. A second CCCH ZnF- 
containing protein is the human Muscleblind-like 1 
(MBNLl), which has been shown to regulate alterna¬ 
tive splicing events. The protein contains four highly 
conserved zinc finger RNA binding domains arranged 
in pairs and separated by three linkers of variable 
length. EMSA data indicated that zinc fingers 3 and 4 
(ZnF3/4) bind a 18 nt pre-mRNA fragment of hTNE 
with 1:1 stoichiometry (Teplova and Patel 2008). 
Quantitative estimation of binding affinity by protein 
fluorescence intensity showed that this 18-nt fragment, 
which contains a 10-nt spacer between two GCU ZnE 
target sites, form a complex with Kd = 40 ± 10 nM. 
The main difference between the TISlld-RNA com¬ 
plex and the MBNLl-RNA one is observed in the 
linker regions connecting the tandem zinc finger 
domains. The linker between MBNLl ZnF3 and 
ZnF4 form a rigid antiparallel P-sheet in both the free 
and RNA-bound state and the two fingers interact via 
a hydrophobic surface forming a single compact unit, 
with the two RNA binding surfaces away from each 
other and the direction of the RNA chains in the two 
binding sites been antiparallel. On the contrary, the 
linker connecting TISlld ZnEl and ZnF2 adopts an 
extended conformation in the complex with the RNA 
binding surfaces been in register and therefore been 
able to bind consecutive UAUU sequences. 

Pumilio and La Motifs 

A Pumilio homology domain (PUM-HD) (Fig. 1) con¬ 
tains eight tandem repeats of 36 amino acids plus two 
short N- and C-terminal sequences (Table 1). Pumilio/ 
FBF (PUF) containing proteins regulate translation 
and mRNA stability in a variety of eukaryotes. They 
interact with the 3' untranslated region of target 



Single-Stranded RNA Recognition Domains 


2363 


s 


transcripts recognizing a core sequence of 5'-UGUR-3' 
(Wickens et al. 2002). 

In human Pumilio (Fig. 2), the Puf domain recog¬ 
nizes a pair of 32 nucleotide Nanos response elements 
(NREl and NRE2) within the 3' untranslated region of 
hunchback (hh) mRNAs. Eluorescence anisotropy, 
analytical ultracentrifugation, and EMSA data demon¬ 
strated that each NRE contains two binding sites. Box 
A (GUUGU) and Box B (AUUGUA) (Zamore et al. 
1997) that can be bound cooperatively by two Puf 
domains with nanomolar Kd. This 2 Pumilio; 1 NRE 
complex has been proposed to represent the functional 
regulatory unit in vivo and fluorescence data have 
shown that mutation of the RNA binding sites lowers 
the affinity of three orders of magnitude. The structure 
of the human Puf protein, pumilio 1, in complex with 
a high-afhnity 10 nt NRE ssRNA has been determined 
by X-ray crystallography (Wang et al. 2002). 
A comparison with the structure of free Pumilio 
shows that the protein undergoes little conformational 
change upon RNA binding. RNA recognition is medi¬ 
ated by the concave surface of the protein, with the 
bases contacting protein moieties and the phosphate 
groups facing the solvent. The remarkable number of 
stacking and hydrogen bonding interactions between 
the human Pum domain and the NRE RNA together 
with the general architecture of the complex explains 
the high affinity and specificity of Pumilio-RNA inter¬ 
actions. One important difference between RRM, KH, 
and ZnE motifs and Pumilio is that, while the smaller 
domains contact the RNA target using both side chains 
and backbone groups, Pumilio-RNA recognition is 
mediated by groups of the protein side chains. This 
recognition mode may facilitate the future design of 
sequence-specihc Pumilio proteins. 

The La protein is a nuclear phosphoprotein that 
recognizes the UUU-OH 3' of nascent RNA polymer¬ 
ase III transcripts. Specihc recognition is mediated by 
the La motif (Fig. 1), a ^80 amino acid domain (Curry 
and Conte 2006) (Table 1) which sports a highly con¬ 
served patch of mainly aromatic surface residues, crit¬ 
ical for high-affinity RNA binding. Reported Kd 
values for the full-length human La protein (hLa) 
(Fig. 2) RNA complex range from 5 to 37 nM 
depending on the RNA and assay conditions. High 
affinity and specificity RNA binding by LA require 
both the La domains and the first of two RRM domains, 
RRMl. One of the more interesting features of La 
recognition is its ability to distinguish RNAs with 


UUU-OH 3' from those with UUU-P 3'. The prefer¬ 
ence for the hydroxyl over phosphate group allows La 
protein to distinguish nascent polymerase III tran¬ 
scripts from degraded RNA molecules and to discrim¬ 
inate between 3' terminal and internal polyuridine 
stretches. For the wild-type protein, the amount of 3' 
UUU-OH competitor required to dissociate 50% of the 
protein-RNA complex was 9.9 nM, while for the 
3'UUU-P competitor it was 988 nM. 

The crystal structure of the N-terminal domain of 
human La, consisting of La and RRMl motifs bound to 
(UGCUGUUUU-OH) shows that the two domains 
form a functional RNA binding scaffold for binding 
of UUU-OH 3' ends. Interestingly, neither the recog¬ 
nition helix of the winged-helix La motif nor the (3 
sheet face of the RRMl motif, the typical nucleic acid 
binding surfaces of these domains, are involved in 
UUU-OH recognition. Instead La uses the edge and 
back of these motifs for UUU-OH recognition, making 
the canonical surface available for additional RNA 
interactions such as the ones necessary for the rescue 
of structurally impaired pre-tRNAs. The majority of 
the hydrogen bonding and stacking contacts of the 
UUU-OH end segment are with the La motif rather 
than with RRMl or with the inter-domain linker. How¬ 
ever, the linker, which is flexible in the free state, 
becomes rigid upon complex formation, indicating 
a role in recognition. A change in the flexibility of 
the linker is confirmed by NMR transverse and longi¬ 
tudinal relaxation rate measurements showing that the 
domains in the free La-NTD are able to tumble inde¬ 
pendently in solution whereas the binding of RNA 
stabilizes a single orientation. 

Summary 

The analysis of RNA binding by RRM, KH, ZnF, PUF, 
and LA domains described above have identified 
a number of key principles in protein-RNA recogni¬ 
tion. The entropic loss linked to RNA binding and the 
intrinsic charges of the RNA molecules play an impor¬ 
tant role in the interactions. Base stacking and hydro¬ 
gen bonding are at the basis of sequence specificity, but 
this specificity is not very high in many isolated RRM 
and KH domains. It is now clear that ssRNA recogni¬ 
tion is often a synergistic process with multiple 
domains from the same or different proteins participat¬ 
ing in the interaction. A range of biophysical methods. 
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from fluorescence, to ITC, from CD, to NMR have 
been used to characterize the very different RNA bind¬ 
ing affinities and specificities of single domains and 
full-length multi-domain proteins. Similarly, NMR 
and other biophysical techniques (e.g., FRET) have 
provided a description of motions on different time- 
scales and of different amplitudes - and in some cases, 
linked these motions to binding energies. High- 
resolution structural techniques and a broad range of 
biophysical methods have provided a significant 
insight in the recognition of ssRNA by protein domains 
and current work aims to integrate these data within 
a quantitative framework. 
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Definition 

Labeling proteins with fluorescent dyes is crucial for 
most cutting-edge single-molecule fluorescence inves¬ 
tigations of complexity in protein folding, misfolding, 
and function. Several current and emerging methods 
and chemistries for site-specific labeling at single and 
multiple sites are discussed. Developments in this area 
are expected to be essential for a broad application of 
these methods in protein biophysics. 

Single-Molecule Experiments and 
Fluorescent Labeling 

Single-molecule fluorescence and manipulation 
methods have revolutionized a number of areas of 
biophysics and chemistry during the last two decades 
(Deniz et al. 2008; Michalet et al. 2006; Joo et al. 2008; 
Ferreon et al. 2010). The detailed snapshots and 
movies of molecular structure, movement, and func¬ 
tion generated using these methods are powerful tools 
for understanding biophysical mechanisms with rele¬ 
vance to basic biology and disease. In particular, 
single-molecule fluorescence methods are relatively 
non-perturbing and have wide spatial and temporal 
dynamic range, making them broadly applicable both 
in vitro and in vivo. Most often, these experiments 
require attachment of extrinsic fluorescent moieties to 
the biomolecules of interest. For example, site-specific 
attachmenf of bright and photostable dyes is used to 
probe local structural properties or localization in vitro 
or in cells, and attachment of two dyes is used for 
detailed structural studies in single-molecule FRET 
(smFRET) experiments (Deniz et al. 2008; Ferreon 
et al. 2010). Thus, site-specific labeling of proteins is 
often critical to successful performance of these 
increasingly popular experiments. Here, some key 
concepts and chemistries used to obtain appropriately 
labeled proteins are discussed. 

Individual Labeling Chemistries and 
Methods 

A few chemistries have been used for site-specific 
protein labeling (Fig. 1), with the basic idea being 
that the chemistry (1) be specific to one protein reac¬ 
tive moiety and orthogonal (unreactive) to those of the 


other amino acid reactive groups in the protein not 
targeted for labeling, and (2) be localized to one or 
two points of interest (Ferreon et al. 2010). The 
methods involve reaction of natural or unnatural 
amino acid side chains or terminal functionalities 
with dyes functionalized with the appropriate comple¬ 
mentary functional groups. 

Reaction with protein thiols (of cysteine side- 
chains) is the most commonly used method for dye 
attachment to proteins. The corresponding reactive 
moiety on the dye is often a maleimide, although 
iodoacetamides and thiol moieties (for disulfide for- 
mafion) are also utilized. This method is often used 
because of the orthogonal chemistry and the relatively 
low abundance of exposed cysteines in proteins. 
Hence, many small systems are either cysteine-free 
or can be prepared in this way; introduction of cyste¬ 
ines at specific sites then permits specific labeling. 

Another very prevalent chemistry in proteins is that 
of lysine primary amines. However, these are generally 
far more abundant, and hence with a few exceptions, 
site-specific labeling has rarely been attempted using 
this method. The N-terminal amine in proteins 
however has special characteristics, with a pKa lower 
than those of the lysine side chains. This special 
chemistry has been used for site-specific reaction, 
for example, via conversion to an aldehyde, although 
then the attachment site is confined to the protein 
terminus. 

Another emerging class of protein chemistry is that 
of unnatural amino acids, whose side chains encode 
reactivities orthogonal to those of natural proteins. 
These represent a powerful expansion of the attach¬ 
ment chemistry repertoire, and can be used to comple¬ 
ment natural chemistries and in cases such as larger 
proteins with multiple cysteines. Incorporation of 
unnatural chemistries can be accomplished via total 
synthesis, ligation of peptide fragments, or via genetic 
encoding methods. For example, a ketone side 
chain can undergo orthogonal specific reaction with 
hydroxylamines or hydrazides (Brustad et al. 2008). In 
addition, the “Click Chemistry” (Kolb et al. 2001) 
1,3-dipolar cycloaddition reaction between alkyne 
and azide functional groups is seeing increasing use. 

Finally, several methods for enzymatically 
performing site-specific chemisfry have been devel¬ 
oped recenfly, especially with a view of labeling 
proteins on or in cells (Hinner and Johnsson 2010). 
For example, phosphopantetheinyl-transferase, biotin 
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Site-Specific Protein 
Labeling for Single- 
Molecule Fluorescence 
Studies, Fig. 1 Scheme 
showing a few common and 
emerging methods for 
attachment of dyes site- 
specifically to proteins for 
single-molecule studies. The 
3D smFRET histogram at the 
right bottom directly revealed 
the complex binding-folding 
landscape of the intrinsically 
disordered neural protein 
a-synuclein that was labeled 
using a dual-cysteine approach 
(Adapted from Ferreon et al. 
2009. Proc. Natl. Acad. Sci. 
USA. 106:5645-50) 
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ligase, lipoic acid ligase, and soitase have all been used 
to transfer reactive groups or tags to short (several 
amino acid) motifs introduced into proteins. HaloTag, 
SNAP-Tag, and CLIP-Tag methods have also been 
used for making attachments to proteins, although in 
the context of a larger tag size (here the tag is an 
enzyme that can react with an appropriately 
functionalized dye). These and related methods 
should see increasing use, especially in the context 
of tracking and functional experiments on proteins in 
live cells. 

Site-Specific Labeling at Multiple Sites 
on a Protein 

The need for multiple site-specific labeling increases 
the difficulty of preparing suitable protein samples 
substantially. For example, smFRET experiments, 
which are very powerful for studies of protein folding 
and dynamics studies, generally require attachment of 
one donor and one acceptor dye to a protein. For 
smaller systems, this has most often been accom¬ 
plished using a dual-cysteine approach. This is com¬ 
monly done by statistical labeling, resulting in four 
labeling isomers, two of which are dual-labeled. This 
can be acceptable, though in some cases, large changes 


in dye photophysics due to local protein environments 
could make further purification essential. Purification 
of individual dual-labeled isomers has sometimes been 
achieved using different methods chosen on a case- 
by-case basis. In a recent example, a dual-cysteine 
approach has been used to label the intrinsically disor¬ 
dered protein a-synuclein, and to reveal a complex and 
tunable binding-folding landscape (Fig. 1) (Ferreon 
et al. 2009; Gambin et al. 201 1). 

A more powerful and general approach would be to 
use two or more orthogonal chemistries for multiple 
labeling. For example, cysteine and ketone chemistries 
have been used in tandem recently for preparing site- 
specifically dual-labeled T4-lysozyme for smFRET 
studies (Brustad et al. 2008). Any combination of the 
labeling methods discussed in the previous section 
could be used for this approach. A complementary 
technique is to site-specifically label fragments of the 
protein using cysteine or other chemistry and then use 
ligation to prepare the dual-labeled full-length protein. 
An early example of this approach was the dual¬ 
labeling of the protein CI2 for smFRET measurements, 
permitting the first direct smERET observations of 
two-state folding (Deniz et al. 2000). Synthetic or 
semisynthetic approaches could be used in this case 
(Tiefenbrunn and Dawson 2010; Vila-Perello and 
Muir 2010). 
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Additional Considerations 

For the general community, labeling of proteins has 
been greatly facilitated by the commercial availability 
of dyes linked to appropriate reactive functionality, as 
is the case for many of the common chemistries 
described here. As new chemistries are demonstrated 
to be generally useful, it is anticipated that the 
corresponding reactive dyes will also become avail¬ 
able for general use. 

An important requirement of labeling is that it be 
non-perturbative, which should be tested by compari¬ 
son of one or more properties of the labeled and 
unlabeled protein. Correspondingly, more than one 
labeling position should ideally be chosen and tested, 
and labeled species showing significant perturbation 
should be eliminated from consideration for further 
experiments. 

The extensive field of autofluorescent proteins 
which are used ubiquitously in cell biology research 
is also important to note. The interested reader is 
directed to a review (Tsien 2009) and citations within 
for more information. 


Concluding Remarks 

Considerable advances have been made with site- 
specific labeling for single-molecule fluorescence 
experiments during the last several years. Given the 
popularity of these methods, it is anticipated that pro¬ 
gress in this area will accelerate even further in the near 
future. More gentle, selective and efficient reaction 
conditions (Kolb et al. 2001; Jewett and Bertozzi 
2010) and corresponding insights from computational 
analysis (e.g., [Chenoweth et al. 2009]), improved 
means of introducing orthogonal chemistries, novel 
chemistries and reactions (Kanan et al. 2004), smaller 
peptide tags, and better dyes will all positively impact 
this important prerequisite step in single molecule 
fluorescence experiments both in vitro and in cells. 
Finally, improved labeling methods will be especially 
critical for multicolor smFRET experiments (Gambin 
and Deniz 2010) or combination fluorescence/ 
manipulation studies, where attachments needed at 
more than two points in a protein raise additional 
challenges. Here, a more conventional triple-cysteine 
approach would likely generate an impossibly 


complex mixture of labeling isomers; hence combina¬ 
tions of orthogonal chemistries or other novel strate¬ 
gies will need to be used in this case. 
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Introduction: The Skin Is a Barrier Composed 
of Stacked Bilayers of Ceramides in the Fully 
Extended Conformation 

Terrestrial life was only made possible through the 
adaptive evolution of a waterproof barrier in the integ¬ 
ument of organisms. In man, like in all other land¬ 
living higher vertebrates, this barrier is constituted by 
a uniquely organized lipid material situated between 
the cells of the horny layer of skin (Blank 1952; 
Breathnach et al. 1973; Elias and Friend 1975) 
(Figs. 1, 2a). Recently, the lipid material’s molecular 
organization has been determined in situ with the aid 
of a novel experimental approach: high-resolution 


cryo-electron microscopy of vitreous tissue section 
(CEMOVIS) defocus series combined with molecular 
modeling and electron microscopy simulation (Iwai 
et al. 2012). The lipid material is organized in an 
arrangement not previously described in a biological 
system - stacked bilayers of fully extended ceramides 
with cholesterol molecules associated with the 
ceramide sphingoid moiety (Iwai et al. 2012). 

Below follows a brief account of the structure- 
function relationships of the skin barrier. 

Skin Lipid Composition and Phase State 

The lipids consist of a heterogeneous mixture of satu¬ 
rated, long-chain ceramides, free fatty acids, and cho¬ 
lesterol in a roughly 1:1:1 molar ratio (Wertz and 
Norlen 2003). More than 300 different species have 
been identified in the ceramide fraction alone 
(Masukawa et al. 2009). 

The most characteristic features of the stratum 
corneum lipid composition (Wertz and Norlen 2003) 
are (1) extensive compositional heterogeneity with 
broad, but invariable, chain length distributions (20- 
32 C; peaking at 24 C) in the ceramide fatty acid and 
free fatty acid fractions, (2) almost complete domi¬ 
nance of saturated very long hydrocarbon chains 
(C20:0-C32:0), and (3) large relative amounts of cho¬ 
lesterol (about 30 mol%). 

These are the same as the factors that typically 
stabilize lipid gel phases. It has therefore been pro¬ 
posed that the stratum corneum lipid structure exists as 
a single and coherent gel phase (Norlen 2001b). The 
viscous gel-like behavior of the lipid structure has 
recently been demonstrated by its remarkable mallea¬ 
bility in situ (Iwai et al. 2012). 

Molecular Structure Determination In Situ in 
Native Biological Tissues 

With CEMOVIS (Fig. 2), the native tissue is preserved 
down to the molecular level, and the micrograph pixel 
intensity is directly related to the local electron density 
of the specimen (Dubochet et al. 1988; Al-Amoudi 
et al. 2004; Norlen et al. 2009). As biomolecules are 
essentially composed of atoms with similar atomic 
weight (carbon, nitrogen and oxygen), they generally 
possess small inter- and intramolecular differences in 
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Skin Lipids, Fig. 1 Schematic drawing of skin. Left part. 
schematic cellular-scale drawing of epidermis. Mid part, molec¬ 
ular-scale drawing of the lamellar lipid structure occupying the 
space between the cells of the stratum comeum. Right part. 


atomic model of the lipid structure’s repeating unit, composed 
of two mirrored subunits, each composed of one fully extended 
ceramide (CER), one cholesterol (CHOL), and one free fatty 
acid (FFA) molecule 


electron density. However, for orderly arranged 
molecular assemblies, such as lipid head groups and 
tails in membranes, even small differences in shape 
and atomic composition may be amplified due to inter¬ 
ference effects that appear in the image phase contrast. 
During image acquisition, phase contrast is made vis¬ 
ible using defocus (cf. e.g., Fanelli and Oktem 2008). 

At very high magnification (pixel sizes of a few 
Angstrom), complex interference patterns can be 
resolved in CEMOVIS micrographs (Iwai et al. 2012) 
(Fig. 3a-c, g-i). This can be exploited by recording 
CEMOVIS micrographs repeatedly at the same posi¬ 
tion of the tissue sample while increasing stepwise the 
microscope’s defocus, thus ensuring that differences in 
the recorded micrographs are due exclusively to the 
different defocuses used (Iwai et al. 2012). Simulated 
electron micrographs (cf. Rullgard et al. 2011) 
(Fig. 3d-f, j-1) can then be generated at corresponding 
defocuses for different molecular models, and com¬ 
pared to the CEMOVIS micrographs (Iwai et al. 2012). 


Skin Lipid Structure 

CEMOVIS has recently shown that the skin lipid struc¬ 
ture expresses an asymmetric two-band ^11 nm 
(10-12 nm) repeating unit consisting of alternating 
narrow (^4.5 nm) and broad (^6.5 nm) bands, where 


the 4.5-nm bands express a higher averaged image 
intensity (i.e., are darker) than the 6.5-nm bands 
(Fig. 2e) (Iwai et al. 2012). 

In order to determine the underlying molecular lipid 
organization, a combination of CEMOVIS defocus 
series, molecular modeling, and electron microscopy 
simulation was employed (Iwai et al. 2012). The idea 
was to identify a lipid organization that would be 
consistent not only with the observed asymmetric 
two-band intensity profile but also with the intensity 
interference pattern changes observed upon changing 
the microscope’s defocus during image acquisition 
(Fig. 3). 

The lipid organization that emerged from the anal¬ 
ysis was a bilayer structure of fully extended (splayed 
chain) ceramides with the sphingoid moieties interfac¬ 
ing. Both cholesterol and the free fatty acids were 
distributed selectively: cholesterol at the ceramide 
sphingoid end and the free fatty acids at the ceramide 
fatty acid end (Iwai et al. 2012) (Fig. 1). 

A notable feature of the skin’s lipid structure is that 
it comprises a bilayer rather than an arrangement of 
stacked monolayers. The preference for the bilayer 
organization could result from the biological processes 
involved in its formation. The lipid matrix is formed 
from stacked bilayers of glycosylceramides in the hair¬ 
pin conformation in a hydrated environment, after 
deglycosylation and dehydration (Holleran et al. 


s 
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Skin Lipids, Fig. 2 The CEMOVIS intensity pattern of the 
skin’s lipid structure, (a) Medium magnification CEMOVIS 
micrograph of the interface between two cells in the midpart of 
the stratum comeum. Note that in CEMOVIS the tissue is 
unstained, and that the pixel intensity is directly related to the 
local electron density of the sample. The stacked lamellar pattern 
represents the extracellular lipid matrix. Dark ~10-nm dots 
represent keratin intermediate filaments that fill the intracellular 
space. Note that the extracellular lipid matrix locally expresses 
extensive folding, (b) High magnification CEMOVIS micro¬ 
graph of the extracellular space in the midpart of the stratum 
comeum. The intensity profile (e) of the lipid matrix was 
obtained by fuzzy distance-based image analysis. The red stars 

1993; Norlen 2001a; Al-Amoudi et al. 2005). During 
skin barrier formation, the initial hairpin glycosyl- 
ceramide bilayer organization may therefore carry 
over into the extended (splayed chain) ceramide 
bilayer organization by a flip of one of the two cer¬ 
amide alkyl chains together with cholesterol (Iwai 
et al. 2012). The extension of the two ceramide chain 
moieties in opposite directions aids the separation of 
both the ill-matched cholesterol and free fatty acids 
(Norlen et al. 2007), and the ill-matched ceramide C24 
fatty acid and C18 sphingoid moieties, into different 
bands within the close-packed lipid bilayer structure. 

Skin Lipid Formation 

In order to appreciate the structure-function relation¬ 
ships of the skin barrier in vivo, it is important to 


in (b) represent the manually chosen start and end points for 
fuzzy distance-based path growing, (c) The red line represents 
the traced-out path. Stacked lines mark extracted intensity pro¬ 
files. (d) Enlargement of the central part of (b). Note that the 
electron dense bands are composed of dark 1-3-nm dots from 
which thin, weak lines protrude 2-3 nm into the lucent areas. The 
same pattern is also evident in (b). (e) Reversed, averaged pixel 
intensity profile obtained from the extracted area in (c). Peaks in 
correspond to dark bands, and valleys to lucent bands in (d). 
Black arrows in (b) denote electron lucent narrow bands at the 
center of the 6.5-nm bands. Section thickness ~50 nm (a-d). 
Scale bar (a): 100 nm. Pixel size in (a-d): 6.02 A (Adapted from 
Iwai et al. (2012) with permission) 

understand skin lipid formation, as the skin’s lipid 
structure may represent a “frozen-in” or 
“immobilized” open biological system rather than a 
primary minimum energy order equilibrium system. 
Skin lipid formation is also central from a dermatolog¬ 
ical standpoint, since barrier malformation may be an 
etiological factor in barrier-deficient skin conditions 
such as eczema, psoriasis, and “dry skin.” 

It has recently been proposed that skin lipid 
formation proceeds via (1) membrane synthesis in 
the trans-Golgi of a membrane system with cubic- 
like symmetry, followed by (2) morphologically 
continuous (non-fusion dependent) secretion of 
the cubic-like membrane system into the extracel¬ 
lular space, (3) phase transition from cubic-like 
to lamellar membrane morphology, (4) dehydration, 
(5) condensation, and (6) lipid chain 
rearrangement from a folded (hairpin) to an 
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Skin Lipids, Fig. 3 Electron microscopy simulation of stacked 
bilayers of fully-extended ceramides with cholesterol molecules 
associated with the ceramide sphingoid moiety accurately 
accounts for the interference pattern changes observed in 
sequential CEMOVIS micrograph defocus series, (a-c) High 
magnification CEMOVIS micrographs (first exposition images) 
of the extracellular space in the midpart of the stratum comeum 
obtained at —0.5 pm (a), —2 pm (b), and —5 pm (c) defocus. At 
very low defocuses (—0.5 pm; (a)), CEMOVIS intensity patterns 
can only be observed at very high magnification (< 1.88 A pixel 
size). At very high defocuses (—5 pm) (c), image resolution is 
low but still allows for resolution of the ~ 11 -nm repeating unit, 
(d-f) represents corresponding atomic model (cf. Eig. 1, mid- 


and right parts) electron microscopy simulation images recorded 
at —0.5 pm (d), —2 pm (e), and —5 pm (f) defocus, (g-i) 
Sequential CEMOVIS micrograph defocus series obtained at 
very high magnification (1.88 A pixel size), (j-1) represents 
corresponding atomic model (cf. Fig. 1) electron microscopy 
simulation images recorded at — 1 pm (j), —2 pm (k), and —3 pm 
(1) defocus. It is shown that the atomic model in Fig. 1 accurately 
accounts not only for the major features of the CEMOVIS 
micrographs (a-f) but also for the interference intensity pattern 
changes observed upon varying the microscope’s defocus during 
image acquisition at very high magnification (g-l). UF: under 
focus; Pixel sizes in (c, f): 3.31 A; (b, e): 6.02 A; and (a, d and 
g-l): 1.88 A (Adapted from Iwai et al. (2012) with permission) 
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extended (splayed chain) stacked bilayer conforma¬ 
tion (Norlen 2001a; Iwai et al. 2012). CEMOVIS 
supports the proposed continuity of the lipid secre¬ 
tion system as well as the proposed structural asso¬ 
ciation of non-lamellar and lamellar lipid 
morphologies (Norlen et al. 2003; Al-Amoudi 
et al. 2005). However, structure determination of 
the intermediate stages of skin lipid formation may 
require access to native molecular resolution tomo¬ 
graphic 3D data in situ (molecular tissue TOVIS 
(cf. Norlen et al. 2009)), a developing technology 
that may not yet have reached its full potential. 


Skin Lipid Function 

Current knowledge suggests that a stacked, fully 
extended (splayed chain) ceramide bilayer arrange¬ 
ment (Fig. 1) with a high cholesterol content and a het¬ 
erogeneous, saturated, long-chain lipid composition 
may represent an optimized barrier material for skin. 
This is because it may be largely impermeable to water 
as well as to both hydrophilic and lipophilic sub¬ 
stances, because of its condensed chain packing and 
its alternating lipophilic (alkyl chain) and hydrophilic 
(head group) regions. Likewise, it may be resistant to 
both hydration and dehydration because of its lack of 
exchangeable water between lipid leaflets. It may also 
be resistant toward temperature and pressure changes 
because of its heterogeneous lipid composition and 
high cholesterol content, which stabilize gel-like 
chain packing and thereby prevent both lateral domain 
formation and induction of “pores” or non-lamellar 
morphologies. Further, this bilayer arrangement 
may account for stratum corneum cell cohesion 
without advocating specialized intercellular adhe¬ 
sion structures such as desmosomes. The arrange¬ 
ment may hence allow for sliding of stratum 
corneum cells to accommodate skin bending. 
Finally, as the interaction between the individual 
layers of the lipid structure involves only hydrocar¬ 
bons, the layers may be relatively free to slide with 
respect to one another, making the lipid structure 
pliable. The fully extended ceramide bilayer 
arrangement with high cholesterol content and het¬ 
erogeneous saturated long-chain lipid composition 
may thus meet the barrier needs of skin by being 
simultaneously impermeable and robust. 


Summary 

The permeability barrier of skin consists of a uniquely 
organized lipid material situated between the cells of the 
homy layer of skin. It was recently shown that this lipid 
material is organized in an arrangement not previously 
described in a biological system: stacked bilayers of 
fully extended ceramides with cholesterol molecules 
associated with the ceramide sphingoid moiety. 

The physical state of the skin’s lipid structure has 
been proposed to be that of a single and coherent gel 
phase. Furthermore, the lipid structure may be formed 
via a phase transition from cubic-like to a stacked 
lamellar morphology, followed by a flip of the 
constituent lipid components from a folded (hairpin) 
to an extended (splayed chain) ceramide bilayer 
conformation. 

The skin’s lipid structure is responsible for both the 
skin’s low permeability toward water and hydrophilic 
and lipophilic substances, and the barrier’s robustness 
toward environmental stress, such as hydration and 
dehydration, temperature and pressure changes, 
stretching, compression, bending, and shearing. 

Cross-References 

► 3D Electron Microscopy Based on Cryo-Electron 
Tomography 

► Chemical Diversity of Lipids 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 

► Electron Microscopy of Membrane Lipids 

► Electron Microscopy: Cryo-Preparation 

► Functional Roles of Lipids in Membranes 

► Lipid Bilayer Asymmetry 

► Lipid Bilayer Lateral Pressure Profile 

► Lipid Domains 

► Lipid Flip-Flop 

► Lipid Lateral Diffusion 

► Lipid Organization, Aggregation, and Self- 
assembly 

► Membrane Fluidity 

► Membrane Lipid Electrostatics 

► Molecular Dynamics Simulations of Lipids 

► Neutron Scattering of Membranes 

► Phase Transitions and Phase Behavior of Lipids 

► Thermodynamics of Lipid Interactions 
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Dependent Oligopeptide Transporter Family 

Small Angle Neutron Scattering 

Giuseppe Zaccai 

CNRS and Institut Laue Langevin, Grenoble, France 

Synonyms 

Low angle neutron scattering; Low Q neutron scatter¬ 
ing; SANS 

Definition 

Neutron scattering at low scattering vector values from 
a dilute solution of identical particles contains infor¬ 
mation on the molar mass, radius of gyration, and 
shape of the particle. 

Introduction 

Diffraction and small angle scattering are described in 
some detail in Sect. G of reference (Serdyuk et al. 
2007), and in the book edited by Glatter and Kratky 
(1982). In a diffraction experiment, waves of radiation 
scattered coherently by different objects interfere to 
give rise to an observable pattern from which the 
relative arrangement (or structure) of the objects can 
be deduced. In Small Angle Scattering experiments to 
study shape, the scattering from macromolecules in 
solution is analyzed under two basic assumptions: 
(1) The solution is sufficiently dilute that there is no 
correlation between macromolecule positions or orien¬ 
tations; waves scattered by atoms within the same 
particle interfere in diffraction, but not those scattered 
from atoms in different particles. (2) The solution is 
monodisperse; all the particles are identical. The con¬ 
ditions can be relaxed in specific studies. For example, 
interparticle interference can be observed and 
interpreted in concentrated solutions to study attractive 
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or repulsive forces between particles; scattering from 
polydispersed solutions can be interpreted in certain 
cases, by applying a model. This entry, however, con¬ 
centrates on cases where the two assumptions are 
fulfilled. 

The equations governing Small Angle Neutron 
Scattering (SANS) are similar to those of Small 
Angle X-ray Scattering (SAXS) (see ► X-Ray Scatter¬ 
ing of Lipid Membranes), with X-ray scattering ampli¬ 
tudes replaced by neutron scattering amplitudes. 
Because of the different interaction with matter of the 
two radiation types, however, there are important dif¬ 
ferences in experimental design and practice that give 
each method advantages and disadvantages over the 
other, and make them strongly complementary for the 
study of biological macromolecules in solution. On the 
practical side, in contrast to SAXS, SANS beams can 
have large cross-sections ( ^cm^) and long wave¬ 
lengths (>10 A) with negligible absorption, which 
compensates to some extent for the lower flux of neu¬ 
tron sources compared to X-ray sources. SAXS and 
SANS are low-resolution methods. At low resolution, 
atoms are not distinguishable and can be grouped to 
define scattering density in a volume; the contrast of 
part of a particle with respect to another or with respect 
to the solvent is proportional to the difference between 
their scattering densities. 

Neutrons are scattered by atomic nuclei and are 
sensitive to different isotopes. The different scattering 
powers of H and deuterium have been especially useful 
in biophysical studies of macromolecular complexes, 
because deuterium labeling of particle components or 
solvent permits contrast variation. Coherent scattering 
is defined as the case in which waves scattered by a set 
of atoms interfere to give a single wave in a given 
direction, resulting from the sum of individual wave 
amplitudes with appropriate phases; if the scattering is 
incoherent, each atom scatters independently of the 
others, and the resultant intensity is the sum of indi¬ 
vidually scattered intensities. In SANS, only coherent 
scattering contains structural information, and incoher¬ 
ent scattering contributes to the background. Incoher¬ 
ent neutron scattering arises mainly from spin 
incoherence and is largest for the *H nucleus. The 
background from H 2 O solutions is therefore much 
higher than from D 2 O solutions. 

Atomic scattering cross-sections for X-rays and 
neutrons are extremely weak ( ^10~ cm^), and, 

even with the most intense sources currently available. 


about 10*^ macromolecular particles are required in 
order to observe an interpretable signal. For a particle 
of 60 kDa molecular mass, this corresponds to a 
solution of 0.1 mL with a concentration of the order 
of mg/mL. 

SANS Without Equations 

The following questions are addressed by SANS: 

1. What is the effective association state (is it 
a monomer, a dimer,...) of a protein or other mac¬ 
romolecules in solution or within a membrane 
environment? 

2. What is its shape or conformation (is it compact and 
globular or ellipsoidal, long and narrow, or flat and 
broad, star-shaped or branched... is its structure in 
solution similar to its crystal structure...)? 

3. Do different macromolecules in solution interact to 
form a complex or not? 

4. How do the answers to points 1, 2, 3 vary as 
a function of solvent conditions (pH, salt, ligand, 
temperature...)? Are there modifications in associ¬ 
ation state or conformational changes, with respect 
to different conditions or previously determined 
crystal structures? 

5. The method of contrast variation in small angle 
neutron scattering allows one to render visible 
only one component within a complex structure. It 
is then possible to address questions 1^ for 
individual components within, for example, 
a macromolecular machine made up of various 
proteins, a protein-nucleic acid complex interac¬ 
tion, or a membrane protein in a lipid or detergent 
environment. By rigid body modeling, domains or 
subunits of known structure can be placed within 
a complex. 

An important point is that the answers to the differ¬ 
ent questions above are obtained independently of each 
other. 

Practical Requirements for a SANS 
Experiment 

1. On the order of 100 pL of solution containing a few 
mg/mL of macromolecule 

2. An accurate measurement of the macromolecular 
concentration (in mg/mL) 
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3. Access to a neutron small angle camera and com¬ 
petent help to plan the experiment. Instrumentation 
is available following acceptance of proposals at 
various neutron sources, including 
in Europe: 
http://www.ill.eu/ 

http://www.frm2.tum.de/en/index.html 
in the USA: 

http://www.ncnr.nist.gov/index.html 
http://neutrons.ornl.gov/ 
in Australia: 

http://www.ansto.gov.au/research/bragg_institute 

Experimental Protocol 

The coherent scattering due to the macromolecular 
particles only in the sample solution is obtained after 
normalizing for detector response and absolute scale, 
correcting for transmission and subtracting the back¬ 
ground (due mainly to the incoherent scattering of the 
solvent). The background scattering can be measured 
from a sample of solvent alone. Slit scattering, etc., is 
negligible. The background is mainly due to incoher¬ 
ent scattering, which is flat in the angular range 
observed, so that it can be averaged to subtract 
a constant value, with significant improvement in sta¬ 
tistical precision. Absolute scale is obtained either by 
calibration of the incident beam or by using the inco¬ 
herent scattering of a sample of H 2 O, which provides 
adequate accuracy for most experiments (Jacrot and 
Zaccai 1981). In order to obtain scattering per particle 
on an absolute scale, knowledge of sample mass con¬ 
centration (mg/mL) on an absolute scale is essential. If 
protein concentration is measured by UV absorption 
spectroscopy at 280 nm, the extinction coefficient 
should have been calibrated previously by amino acid 
analysis, for example. 


Basic Equations of SANS 

The Full Scattering Curve 

The coherent scattering from N particles in solution is 
given by: 

/w(G)=/v(|F(Q)|2^ (1) 


where Q is the scattering vector (of magnitude 
471 sin 6 

- - - , 20 IS the scattering angle and A the wave- 

k 

length); F(Q) is the scattered wave amplitude 
from the particle in one given orientation; and 

|F(Q)p^ is the scattered wave intensity averaged 

over all particle orientations. 

The one particle scattering intensity is calculated 
from particle structure and solvent composition by 
using the Debye formula: 

(iF(Q)P) = EEte-A)(/.-A.)^ 

( 2 ) 

where the double sum is over all atom pairs (/, k) and rjk 
is the distance separating a pair. Equation 1 becomes 

In{Q) = 

i k 

= ^fj - P°^j) (3b) 

where in, is the amplitude contrast in volume v, of the 
particle;^ is the sum of atomic scattering amplitudes in 
vf, and p° is the scattering amplitude density of the 
solvent. 

Example calculation: p° for H 2 O is 

{(2 x -0.374-f 0.58) x 10^‘2}/{30x lO^^^jcm^^ 
= —5.6 X lO^cm^^ 

where —0.374 x 10~*^ cm is the scattering amplitude 
of H, 0.58 X 10“^^ cm that of O. The volume of a water 
molecule is 30 A^ or 30 x 10“^“* cm^ (calculated from 
the density of water, its molecular weight, and 
Avogadro’s number). The same calculation for D 2 O 
yields 63.3 x lO'^ cm“^. 

The contrast may also be expressed in terms of 
local scattering amplitude or length density pj =fj Ivj, 
then 

= (Pj - p°)u (3c) 

where (pj—p°) is scattering length density contrast. 
The scattering length density of any compound or 
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chemical group may be calculated by a useful tool at 
www.ncnr.nist.gov/resources/sldcalc.html. 

It is useful to express N in terms of the particle 
concentration C in g/L. 


In a mechanical analogy, Rq is the radius of gyra¬ 
tion of the nij distribution. It can also be written as 

Rg^ = mpf I (7b) 


In{Q) 

C 


sin Q>~ik 


(4) 


where (mol“*) is Avogadro’s number and M is the 
particle molar mass (g/mol). It can be shown that the 
double sum in Eq. 4 is proportional to then 


In{Q) 

c 


cx M. 


(5) 


The Guinier Approximation, the Forward 
Scattered Intensity, and the Radius of Gyration 

By expanding (sin Qi'jkjlQrjk in the Debye equation 
for a single particle in terms of a series in powers of 
Qrj!^, a very good approximation to the scattering curve 
is obtained for small values of Q: the Guinier 
approximation 

/(e)=/(0)exp(-l/3f?G"G") 

ln/(G) = ln/(0) - l/3/?G"e2 

where 7(0) and R(^ are experimental parameters 
derived from the intercept and slope respectively of 
In I versus Q^. When the two conditions (no 
interparticle interaction and monodispersity) presented 
in the introduction are satisfied, then 


7(0) 


Rg^ 



The 7(0) value, therefore, depends only on particle 
and solvent composition (contrast) and not on 
atomic organization in the particle (structure). The 
Rg value is a first description of structure: it 
describes the distribution of contrast about its center 
of mass. 


where rj ^ is measured from the center of mass of the ot, 
distribution. 

Note that the radius of gyration always appears as 
its square {Rc^) in the equations. This is important. 
Since the contrast amplitudes, mj, may be positive or 
negative, physically meaningful negative R(^ values 
may be obtained. 

Equation 6 is known as the Guinier approximation. 
It is generally valid for the scattering curve of a particle 
of any shape, provided QRq is smaller or equal to 
about 1. Eor a sphere, the scattering curve goes below 
the approximation as QRq increases beyond 1; for an 
elongated or very flat particle, it deviates above the 
approximation curve above QRq equal to 1; for an 
ovoid shape (ellipsoid of axial ratio 1:1:1.7), the scat¬ 
tering curve follows the approximation to well beyond 

QRc-^- 


Asymptotic Behavior of the Scattering Curve at 
Large Q Values. The Porod Relation 

If the particle has a well-defined boundary, that is, 
a well-defined envelope beyond which there is homo¬ 
geneous solvent, at high Q values, I{Q) oscillates 
around an asymptotic line corresponding to: 


lim 7(G)g4 
G —> oo 27t7(0) 


( 8 ) 


This is known as the Porod relation. S is the external 
surface area of the particle envelope. At high Q values 
the scattering curve decreases sharply, as Q~'^. In prac¬ 
tice, the formal Q tends to oo in Eq. 8, and corresponds 
to QRg > ^5. A precise measurement of 7(G) in this 
scattering vector range would allow a determination 
of S. In practice, however, Eq. 8 is used in order to 
adjust the experimental background level (e.g., due to 
scattering from the solvent alone) so that after back¬ 
ground subtraction from the solution scattering, 7(G) 
follows a Q~^ dependence. This is because the particle 
scattering intensity is very weak at large Q values (two 
to three orders of magnitude weaker than 7(0) and very 
sensitive to the experimental error in the background 
measurement). 







Small Angle Neutron Scattering 


2377 


s 


The Distance Distribution Function 

The different approaches to analyze the full scattering 
curve are the same for SANS and SAXS and are 
described in the chapter by D. Svergun and Manfred 
Roessle ► Small Angle X-ray Scattering, and the chap¬ 
ter on ► Small Angle Neutron Scattering (SANS) Soft¬ 
ware by P. Callow. In particular, there are advantages 
in calculating the inverse Fourier transform of I{Q), 
a function that depends on the distance r between 
component pairs in the particle (see chapter by Clatter 
in reference (Clatter and Kratky 1982)). 

. ^ 1 /■“ 7 . ^ sinQr 

’''<'■' = 25/. e7(e)-^^e (« 

The particle volume is V. The function }'(j )is 
a correlation function, defined as the average of the 
product of two scattering density contrast values sep¬ 
arated by a distance r. 

?('■) = {{piXj) - P°)(p(ri-) - P°)) (10) 

where r = |ry — 

The integral is in fact equal to an integral from 0 to 
Dmax> the maximum distance in the particle, since scat¬ 
tering density contrast is zero for r greater than that value. 

It is written as 


p{r) = 4Tir^Vy{r) (11) 


The function p(r) is called the pair-distance distri¬ 
bution function of the particle. It represents the number 
distribution of “distances” joining volume element 
pairs in the particle, weighted by the product of their 
scattering density contrast. Equation 9 becomes 


P{r) 



n 



sin Qr 


dQ 


( 12 ) 


The pair distribution function is the inverse Fourier 
transform of I(Q). Provided, therefore, that the scatter¬ 
ing curve can be integrated from zero to infinity, it will 
be possible to calculate the p{r) function of the particle. 
The measurement of I{Q) to 2 = 0 is of course not 
possible, and it is also impossible to measure the scat¬ 
tering curve to infinitely large Q. The evaluation of the 
integral can be performed if one can fill in the missing 
parts of the curve by extrapolation in the context of 
the single particle assumptions (Comment G2.2): for 


example, the curve can be extrapolated to 2 = 0, by 
using the Guinier approximation, and by using the 
Porod relation for the high Q end. 


Contrast Variation 

The concept of contrast was defined above. It is 
contained in Eqs. 3b, 3c. The contrast match point 
corresponds to where p, = p°, and the j component 
becomes invisible. An important point is that the vol¬ 
ume concerned may be large (e.g., the entire protein or 
nucleic acid component in a protein-nucleic acid com¬ 
plex) provided it can be considered as homogeneous at 
the resolution of the experiment. In which case, the 
average scattering density can be considered as valid 
for the entire component. In the protein-nucleic acid 
example, if the protein is large compared to the nucleic 
acid, it cannot be considered as homogeneous, because 
scattering density fluctuations about the mean value 
are significant for the different amino acids. Similarly 
for the lipid bilayer in a membrane protein-lipid com¬ 
plex, the scattering density fluctuation between head- 
group and hydrocarbon chains is large and over 
a distance that is similar to the protein dimension. It 
will not be possible, therefore, to match out the lipid in 
this case. Deuteration could help in both examples, 
because it tends to homogenize scattering density. 
This was applied in a study of a membrane protein in 
lipid vesicles (Hunt et al. 1997), in the study of protein- 
RNA distribution in the ribosome (Ramakrishnan 
1986), and of the muscle protein complex troponin 
(King et al. 2005). 

Contrast in SANS is modified by changing the par¬ 
ticle p values and/or solvent scattering density, p°. 
p values are modified by deuterium labeling or by 
changing the incident radiation. Solvent scattering 
density is changed by deuterium labeling. The funda¬ 
mental assumption of a contrast variation experiment 
is that the particle structure is not modified by chang¬ 
ing the contrast conditions. 

Mean neutron scattering amplitude densities of bio¬ 
logical macromolecules, and water as functions of D 2 O 
percentage in the solvent are given in Eig. 1 . Because 
atomic compositions are fairly similar within each 
macromolecular family, it is usually quite reliable to 
assume the appropriate mean value for a protein or 
nucleic acid of unknown composition. Thus, protein 
is matched out at close to 40% D 2 O, 70% nucleic acid. 
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Small Angle Neutron Scattering, Fig. 1 Scattering length 
density of various molecules and water as a function of D 2 O 
percentage in the solvent 

10% lipid, and 20% detergent (not shown). If the 
resolution/homogeneous component condition is satis¬ 
fied, therefore, one of the components in a protein- 
nucleic acid or membrane protein complex could be 
matched out to observe the other. 


Contrast in the Guinier Approximation 

The forward scattered intensity and radius of gyration 
obtained from the Guinier approximation are both 
contrast-dependent. Combining Eqs. 3c and 7a, 

m = {{p-p°)vf ( 13 ) 

The square root of the forward scattered intensity is 
linear with solvent scattering density, p°, and it crosses 
zero at the particle contrast match point, where p° = p, 
the mean scattering length density in the particle. Note 
that the forward scattered intensity is not structure 
dependent, so that Eq. 3 is valid in general and not 
only for a homogeneous particle. 

The contrast dependence of the radius of gyration is 
given by the Stuhrmann equation, which separates 
scattering length density in the particle into the mean 
value and fluctuations pp about the mean 

Rg^ = Rv^ + :^ j PpirydV + :^{J pp{r)rdV}^ 

(14a) 


where 

Rg^ = Rv^ + oi/p -/ + yp-^ (14b) 

Ry^ is the radius of gyration of a particle of the same 
shape as the particle under study but of homoge¬ 
neous density. 

a is the second moment of the scattering length density 
fluctuations, pp, in the particle. A positive value of a 
denotes that the higher density is toward the periph¬ 
ery of the particle, and vice versa. 
flI p'^ is the separation between the center of scattering 
mass of the density fluctuations in the particle and 
the center of mass of the particle shape, when the 
mean contrast conditions correspond to p. The plot 
of R(^ versus 1/p is a parabola from which a and P 
can be deduced. 

a positive: higher scattering density in the particle 
lies further away from its center on average than lower 
scattering density. 

a negative: lower scattering density in the particle 
lies further away from its center on average than higher 
scattering density. 

P — 0, the centers of scattering mass and density 
fluctuations coincide. 

The a parameter is more straightforward to derive 
than an accurate value for /?, which depends more on 
the low contrast data. 

Important Checks 

The long wavelength and sample-detector distances 
used make SANS particularly well adapted for the 
observation of the Guinier region of the scattering 
curve on an absolute scale, even for large particle at 
very small Q values. This region of I(Q) is very impor¬ 
tant in biological macromolecule studies and it should 
be examined carefully as In 1 versus and not as log I 
versus log Q, as often done in polymer studies. Before 
fitting the full scattering curve to derive a shape, an 
accurate measurement of the Guinier region is a good 
check that there is no aggregation and that the two 
conditions (no interparticle effect and monodipersity) 
are fulfilled. The effective molecular mass of the par¬ 
ticle should be calculated from the /(0)/C after absolute 
calibration of the direct beam and concentration mea¬ 
surement (see above), and the radius of gyration will 
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already have information on whether the particle is 
compact or elongated or flat. 

In contrast variation experiments, several con¬ 
trast conditions should be measured and the square 
root of the forward scattered intensity from the 
Guinier region plotted to check for linearity and 
to determine that contrast match points are indeed 
as expected from the biochemical preparations. 
A nonlinear dependence would be an indication of 
one or both of the two conditions not being 
fulfilled. 


Summary 

The two assumptions required for the interpretation of 
the scattering curve in terms of a single particle struc¬ 
ture are (1) the solution is infinitely dilute, that is, there 
is no correlation between the positions or orientations 
of the particles in the solution; (2) the solution is 
monodisperse, that is, all the particles in the solution 
are identical. 

At small scattering vector (Q) values the scattering 
curve can be interpreted in the Guinier approximation, 
from which the radius of gyration of the particle and its 
molar mass are derived independently and in a model- 
independent manner. 

At large scattering vector values the intensity 
scattered by a particle with a well-defined border 
with solvent follows Porod’s 2“"* Law. 

The distance distribution function in the particle 
p(r) is calculated from the Fourier transform of the 
scattered intensity I(Q). 

Contrast variation approaches allow one to focus on 
the shapes of different parts of a complex particle; in 
SANS, contrast is varied by deuterium labeling of the 
particle or solvent. 


Cross-References 

► Neutron Scattering of Membranes 

► Small Angle Neutron Scattering (SANS) Software 

► Small Angle X-ray Scattering 

► Wide and Small Angle X-Ray Scattering 

► X-Ray Scattering of Lipid Membranes 
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Small Angle Neutron Scattering (SANS) 
Software 

Phil Callow 

Institut Laue Langevin, Grenoble Cedex 9, France 

Synonyms 

SANS 


Definition 

Small angle neutron scattering (SANS) data analysis 
software enables important structural information to be 
extracted from the small angle neutron scattering pat¬ 
terns measured from biological molecules in solution. 

Introduction 

In recent years increasingly sophisticated software has 
become available for the interpretation of small angle 
neutron scattering (SANS) data from biological mole¬ 
cules in solution. This entry will address the main 
methods of analyzing this type of SANS data and 
discuss the most commonly used software. It is not 
intended to be a comprehensive list of all SANS data 
analysis software available, but rather an overview of 
what can now be done to interpret SANS data. The 
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featured software is freely available, comparatively 
easy to use and is supported for most common operat¬ 
ing systems. It should also be noted that many research 
groups have developed their own software for analyz¬ 
ing SANS data and are willing to share it; however, this 
often requires close collaboration between groups as it 
requires expert handling to be used correctly. 

The key difference between the small angle scatter¬ 
ing of neutrons and X-rays is that neutrons are 
scattered by the nuclei of atoms rather than electrons 
as is the case for X-rays. As a result neutrons are not 
only sensitive to the composition of the sample but also 
to the isotopes present. Naturally occurring biological 
macromolecules have characteristic neutron scattering 
length densities which are different for different mol¬ 
ecules, for example, DNA has a very different scatter¬ 
ing length density compared to protein. It so happens 
that the scattering length densities of biological mole¬ 
cules fall between the scattering length densities of 
hydrogenated water (H 2 O) and deuterated water 
(D 2 O). Therefore it is possible, by varying the hydro¬ 
gen and deuterium content of the solvent, to contrast 
match or “make invisible” different components of 
a complex biological system (e.g., a protein-DNA 
complex). In this way multiple SANS datasets (or 
a “contrast series”) may be collected for a single sam¬ 
ple, providing valuable information concerning the 
arrangement of the molecules present in the sample. 
Specific deuteration of the components of the system 
may also be used to further enhance the structural 
information gained. 

While it is possible to analyze and model a single 
small angle scattering curve from X-rays and neutrons 
in a similar manner, the SANS technique is at its most 
powerful when a contrast series is analyzed simulta¬ 
neously. The software described in this entry enables 
this type of analysis to be performed. 

As the processing of raw data collected at a SANS 
instrument is usually performed by instrument and/or 
institute-specific software this entry will only deal 
with the interpretation of data once it has been 
processed into one-dimensional absolute intensity ver¬ 
sus momentum transfer (Q) data. 

Primary Data Evaluation 

Several model independent parameters can be 
extracted from SANS data once it has been processed. 


These include the radius of gyration (Rg) of the particle 
in solution, its scattering intensity extrapolated to zero 
angle (Iq), and its volume. Each of these values can be 
determined by using the program PRIMUS (Konarev 
et al. 2003), which is particularly user-friendly and in 
its latest form even has automated functions for deter¬ 
mining radius of gyration and Iq values. However, care 
should be taken to confirm that any results obtained 
meet the criteria defined by the Guinier approximation 
(Guinier 1939). 

Particle Distance Distribution Function (P(r)) 

The intra-molecular distances within a monodisperse 
solution of biological molecules can be determined 
using the program GNOM (Svergun 1992). The 
function can be used to determine the maximum 
dimension of the particle in solution and also gives 
a crude indication as to its shape, for example, the 
distance distribution function may indicate whether 
the molecule is compact, elongated, or hollow. 

Modeling SANS Data Using Simple 
Geometric Shapes 

Form factors exist for many simple geometric shapes 
from which the small angle neutron scattering curve 
can be calculated and compared to experimental SANS 
data. A number of software packages exist for this 
purpose in particular SASfit (Kohlbrecher), FISH 
(Heenen), and the NCNR SANS package (Kline 
2006). This software is particularly powerful as it 
enables the fitting of multiple SANS datasets, where 
the neutron contrasts of the different components in the 
system are varied, against a single model. This enables 
not only the dimensions of the particle to be 
determined but also the scattering length densities of 
different regions within the particle to be calculated. 

Modeling SANS Data Using High-Resolution 
Structures 

Where an atomic resolution structure of a biological 
molecule or macromolecular complex has been 
obtained using other techniques, such as X-ray or neu¬ 
tron crystallography or NMR, it is possible to calculate 
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the small angle neutron scattering data that this 
molecule or complex would provide in solution. By 
comparing this modeled scattering pattern with that 
obtained experimentally it is possible to determine 
whether there are conformational changes as a result 
of changing sample environment conditions. The 
modeling software used to do this is called CRYSON 
(Svergun et al. 1998). This software is particularly 
useful as it takes into account the local neutron scat¬ 
tering length densities within the high-resolution struc¬ 
ture and the hydrogen/deuterium content of the solvent 
thus providing a means of simulating a neutron con¬ 
trast variation experiment for a biological complex of 
different molecules, such as a protein-DNA complex 
or a protein-protein complex where one of the proteins 
has been deuterated. 

Rigid Body Modeling of SANS Data 

It is frequently the case that atomic resolution struc¬ 
tures are available for the domains of a particular 
protein or for the subunits of macromolecular com¬ 
plex. Small angle neutron scattering and the use of 
the neutron contrast variation technique are powerful 
tools that allow the relative positions of these compo¬ 
nents to be determined and the overall structure of the 
molecule or complex to be determined. This method is 
known as rigid body modeling for which there are two 
programs available, MASSHA (Konarev et al. 2001) 
and SASREF (Petoukhov and Svergun 2005). 
MASSHA allows the manual and automated manipula¬ 
tion of atomic structures and low-resolution models 
with respect to each other and then calculates the 
small angle scattering pattern that the resulting structure 
would give and compares it to a single experimental 
dataset. SASREF, on the other hand, performs quater¬ 
nary structure modeling of a complex formed by sub¬ 
units with known atomic structure against solution 
scattering data. More importantly, it can simultaneously 
fit multiple small angle scattering data from both X-ray 
and neutron datasets from sub-complexes of the system 
or from a neutron contrast variation series. 

Ab Initio Modeling of SANS Data 

Where there is no atomic resolution structural data 
available for a biological molecule or macromolecular 


complex it is possible to build a model solely on the 
basis of small angle scattering data. This process is 
called ab initio modeling. A volume is defined on the 
basis of the maximum dimension of the molecule 
determined from the particle distance distribution 
function and is populated with densities or “beads” 
that either correspond to scattering length density of 
the solvent or that of the molecule being studied. By 
comparing the scattering pattern that would result from 
this shape with experimental data the distribution of 
densities corresponding to the biological molecule can 
be refined and a shape determined that would result in 
the measured small angle scattering data. The software 
that is used to do this for small angle neutron scattering 
is called MONSA (Svergun 1999), which is an 
extended version of the most widely used ab initio 
modeling program for small angle X-ray scattering 
DAMMIN. MONSA allows the “beads” within the 
model to be assigned different scattering length 
densities thus leading to a multiphase model for 
multi-component complexes (such as protein-DNA 
complexes, protein-RNA complexes, or protein- 
protein complexes where one of the proteins has been 
selectively deuterated). The model is fitted simulta¬ 
neously to multiple scattering curves which may have 
come from small angle X-ray scattering as well as 
a SANS neutron contrast variation series. 

Summary 

Through the use of the software described above it is 
now possible to extract significant structural informa¬ 
tion concerning a single biological molecule or 
a macromolecular complex from small angle neutron 
scattering data, whether this is in the form of single 
structural parameters (such as the radius of gyration) or 
by rigid body or ab initio modeling techniques. As 
biologists look at ever increasingly complicated sys¬ 
tems it is likely that this software and small angle 
neutron scattering will play a key role in the future of 
structural biology. 


Cross-References 

► Small Angle Neutron Scattering 

► Small Angle X-ray Scattering 
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► Small-Angle X-Ray Solution Scattering: Software 
Tools for Biological Structure Analysis 

► X-Ray Scattering of Lipid Membranes 
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Small Angle X-ray Scattering 

Manfred Roessle and Dmitri I. Svergun 
European Molecular Biology Laboratory 
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Synonyms 

Wide and small angle X-Ray scattering 


Introduction 

Small-angle X-ray scattering (SAXS) is a universal 
technique to study the structure of various 


noncrystalline systems on a nanometer scale (Feigin 
and Svergun 1987; Glatter and Kratky 1982). For bio¬ 
logical objects, analysis of the low-resolution structure 
of macromolecules and their complexes is among the 
most important SAXS applications. The notion of 
“small-angle” scattering comes from reciprocality of 
Fourier transformation, describing the scattering 
process leading to an inverse relation of the scattering 
angle and the size of the object. In structural 
studies, relatively hard X-rays with the wavelength 
X from about 0.05 to 0.2 nm are employed. Thus, in 
► crystallography (► X-Ray Diffraction and Crystal¬ 
lography of Oligosaccharides and Polysaccharides), 
large diffraction angles are recorded yielding atomic 
resolution models, while for the investigation of the 
entire proteins with dimensions of several tens of nm, 
small angles have to be measured. In structural biol¬ 
ogy, SAXS is employed to study disordered systems 
such as proteins or complexes in solution, weakly 
ordered systems like lipid layers and micelles, and 
ordered assemblies such as biological fibers like hair 
and silk. 


Basic Principle of X-Ray Scattering 

For a single elastic scattering process, the amplitude Aj 
of scattered X-ray photons can be described as a plain 
wave scattered by an atom: 

Aj = 

where hj is the scattering density, the is the atomic 
coordinate, and kp is the wave vector of the incident 
wave (Fig. la). 

The total scattering amplitude of a particle is then 
described by summation of all the atomic scattering 
amplitudes Aj within the particle: 

A(k) = ^ 

j 

and by defining ~s = as the momentum 

transfer vector: 

A(T)=^ 

j 
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Small Angle X-ray Scattering, Fig. 1 (a) Scattering principle: a wave with wave vector Uq is scattered by the an atom at ~r j (b) 
scattering intensity /fij of a globular protein and (c) distance distribution function p(r} 


This scattering amplitude is experimentally not 
accessible, but the scattering intensity I(T); the product 
of A(5) with its complex conjugate A*(T): 

A4* = /(?) = 

k j 

where are the ~rj^ vectors pointing from the jth to the 
kth scattering center. The atomic scattering factors 
are for X-rays proportional to the number of electrons, 
and the heavier the atom, the more it scatters 
(in contrast to Neutrons, where this dependence is 
irregular [► Neutron Scattering of Membranes]). 
A continuous electron density function can be 
defined as where the dv^- describes the 

volume element surrounding the scattering center k 
and the particle structure is conveniently represented 
by the electron density distribution The recon¬ 

struction of this distribution from the scattering data is, 
generally speaking, the main aim of the structural 
studies. The nominal resolution of the possible recon¬ 
struction d depends on the maximum value of the 
momentum transfer in the experimental data Sn,ax and 
can be estimated as d = 27i/Sn,ax (here, the modulus of 
the scattering vector is |y| = ^ sin 20, where s 20 is 


the scattering angle). In crystallography, resolution 
down to 0.1 nm is possible; SAXS studies usually 
provide low resolution (1-2 nm). 

X-Ray Scattering from Dilute Solutions 

In solutions, suspensions and emulsions, particles are 
mostly able to move freely. Solutions of purified pro¬ 
teins are usually studied at low (below 1%) protein 
concentrations and are, thus, dilute monodisperse solu¬ 
tions. In this case, the proteins do not interfere with 
each other and no attractive or repulsive interactions 
are present. The scattering from the solvent is 
subtracted (Fig. lb) such that the net signal from 
the solute is defined by the excess electron density 
Ap(r) = Apii(r)—ps, where p^ is the scattering density 
of the solvent. The average value of the excess density 
is called contrast (p), which is typically very small for 
biological objects containing light atoms only. Note 
that measurements in solvents with different p^ lead to 
the change in the recorded intensity, and this allows 
one to extract additional information about the solute. 
This technique called contrast variation is in principle 
applicable for X-rays (e.g., by adding salts or glycerol 
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to the solvent) but is most useful in neutron scattering 
exploiting the difference in the scattering amplitude 
between hydrogen and deuterium (► H/D Exchange). 

In an ideal dilute solution, the scattering intensity 
from the entire sample will be isotropic and propor¬ 
tional to the scattering from the single particle aver¬ 
aged over all o rientations Q. After the integration of 
the term (e^ )q = , the particle scattering inten¬ 

sity can be expressed as (Debye 1915): 

k j 

One has to note that the vectors s and r are now 
reduced to the absolute values s and r which leads to 
a significant loss of information in the SAXS patterns 
compared, e.g., to the crystallographic data. The SAXS 
patterns from particles in solution are rapidly decreas¬ 
ing functions, usually displayed as one-dimensional 
scattering curves as shown in Fig. lb. 


The Pair Distance Distribution Function 

An important function directly related to the scattering 
intensity is a so-called pair distance distribution func¬ 
tion of the particle p(r), calculated by the Fourier 
transformation of the scattering pattern: 


The Radius of Gyration Rg 

At very small values of s (e.g., very small angles), the 
scattering intensity l(s) be can extrapolated by 
expanding the term + 

a McLaurain series resulting in 


/(.) = 

0 


p{>-) 



dr 


p{r) dr 

0 


p{r)r^dr H- 

0 


oo 

The integral J p{r) r^dr describes the second 
0 

moment of the p(r) distance distribution function, and 
in analogy with mechanics, a radius of gyration can be 
introduced 


„2 _ 1 C p(''y 

* 2 rp(r)dr 

such that the intensity is represented by 


/(s) ^ 




dr 


p(r) 


1 

2n 


OO 

/(s) sr sin srds 

0 


which is the serial expansion for the exponential func¬ 
tion, and the so-called Guinier approximation (Guinier 
and Fournet 1955) can be written as 


The p(r) function has a direct physical meaning. 
Indeed, taking an arbitrary point r, inside the particle, 
a probability 'y(r) can be introduced that a point r, at 
a distance r = Ir, — r,l in an arbitrary direction from the 
given point will itself also be in the particle. The pair 
distance distribution function p(r) = r^ xipfr,) Ap(rj) 
Y(r) reflects how often a distance appears within the 
particle and is weighted by the product of the electron 
density contrast at the two points. Figure Ic shows 
a typical p(r) function from a protein in solution. The 
p(r) is thus directly related to the shape and structure of 
the particle (Glatter 1977), being a spherically aver¬ 
aged Patterson function employed in crystallography. 
Obviously, p(r) equals to zero beyond the maximal 
dimension D^ax of the particle such that p(r) is defined 
in the range [0, D^ax] (see ► Wide and Small Angle 
X-Ray Scattering). 


l{s) ^ 

In practice, the Guinier plot is constructed of In I{s) 
versus (Fig. 2a), and the values of In (/q) and Rg are 
determined from the linear fit: 

In 7 ( 5 ) = \n{Io)-^s^ 

Note that this approximation holds in the range of 
validity of the McFaurain series, and for most of the 
practical applications one takes the range 
(Rg ■ s) < 1.2. Rg is historically the first structural 
parameter determined from the SAXS data (Guinier 
and Fournet 1955), yielding direct information 
about the particle size and shape. The value of /q 











Small Angle X-ray Scattering 


2385 


s 



Small Angle X-ray Scattering, Fig. 2 (a) A typical Guinier 
plot for a protein in solution. From the slope of the linear 
approximation, the radius of gyration Rg is estimated, the 1(0) 


from the intersection of the linear line with the intensity axes, (b) 
Kratky plot of the folded protein, indicated hy the bell shape of 
the function 


is proportional to the squared number of electrons 
in the particle and to the particle concentration so 
that absolute measurements allow one to determine 
the molecular mass of the solute (Mylonas and 
Svergun 2007). 

Porod Invariant and Kratky Plot 

Another important overall value computed from the 
SAXS data is the so-called Porod invariant (Porod 
1951, 1982): 

OO 

s^I{s) ds = Q 

0 

which, contrary to Rg, depends only on the particle 
volume and not on its form, and, for homogeneous 
particles, the volume can be computed as V = 27t^ I 
(0)/Q. Here, the behavior of the scattering intensity at 
higher scattering vectors plays a significant role. 
Indeed, the Guinier approximation is valid for small s 
values, i.e., large interparticle distances. As the I(s) is 
related by Fourier transformation with the p(r) func¬ 
tion, the higher angle part of the I(s) corresponds to the 
small interparticle distances. Therefore, in the high s 
regime, the scattering from the internal structure and in 
particular near the particle surface S is dominating the 
signal. For compact globular particles, the scattering 


intensity at higher angles decays as s ^ and the specific 
particle surface ^ can be derived: 

o lim s‘^l{s) 


The SrW value can be estimated directly from the 
asymptotic behavior of the plot s‘^I{s) versus s (Porod 
plot) at high s (Porod 1982). 

The Kratky plot (I(i')-i’^ versus s) (Glatter and 
Kratky 1982) can be used to identify disordered states 
and distinguish them from globular particles. As indi¬ 
cated above, the scattering intensity of a globular pro¬ 
tein has a Gaussian-like shape at small s and decays as 
s“"^ at high s yielding a bell-shaped Kratky plot with a 
well-defined maximum. In contrast, an ideal Gaussian 
chain has an s~^ asymptotic dependence of l{s) and 
displays a plateau at large s values. In the case of an 
unfolded protein, the Kratky plot also presents a pla¬ 
teau over a specific range of s, which is followed by 
a monotonic increase. 

X-Ray Scattering of Interacting Systems 

The above considerations were made under the 
assumption of freely moving particles in an ideal solu¬ 
tion without any interactions. In a general case of 
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Small Angle X-ray Scattering, Fig. 3 Principle of scattering from interacting particles. The structure factor S(s) is dominating the 
scattering intensity I(s) 


interacting particles, the scattering from the ensemble 
is described by two functions: the form factor F(s) and 
the structure factor S(s). The form factor represents the 
shape and form of the scattering particle, while the 
structure factor depends on the relative positions of 
the particles in solution. The experimental resulting 
scattering intensity l(s) is the product: 

I{s) = F{s) X 5(s) 

The assumption of freely moving particles leads to 
a simplification from T —> 5, and in the special case of 
an ideal solution (e.g., monodispersity, no interac¬ 
tions), the structure factor S(s) is equal to unity and 
consequently I(s) arises of the form factor F(s) only. If 
the particles are well organized like in the case of an 
ideal crystal, the scatterers are fixed on dehned posi¬ 
tions within the crystal and the structure factor 
becomes the major contribution to the scattering pat¬ 
tern. The theory of crystal diffraction is described 
elsewhere (► X-Ray Diffraction and Crystallography 
of Oligosaccharides and Polysaccharides), and par¬ 
tially organized systems will be considered in the fol¬ 
lowing sections. 

Scattering of Micelles and Lipids 

The assumption made for moving particle still holds, if 
the particles are able to orient themselves randomly, 
but with preferred locations within the scattering 
ensemble. A schematic view of a form factor and 
structure factor for such a case is shown in Fig. 3. 

One has to remind that only the scattering intensi¬ 
ties I(s) are accessible experimentally and the 
deconvolution of l(s) into its factors F(s) and S(s) is 
possible, strictly speaking, only for nearly spherically 
symmetric particles (although it can be employed also 
in other cases if detailed information about the system 
is available) (Brunner-Popela and Glatter 1997). 


The regular arrangement of the particles in the scatter¬ 
ing ensemble results in maxima in the structure factor 
whose positions correspond to the repeating distances 
between the particles. 

Scattering of Well-Organized Systems 

If the scattering ensemble becomes more organized, 
the assumption of independently scattering particles 
does not hold. In this case, the intensity is anisotropic 
and may contain of defined peaks, from which the 
orientation and the distances between the particles 
with respect to each other can be determined. Figure 4 
shows an X-ray scattering pattern of aligned silk hbers. 
Silk proteins in the fiber consisf of a well-folded 
P-sheef-rich rigid core with elastic nonfolded 
subdomains (Roessle et al. 2004). These domains are 
well aligned, and the core regions are located on fixed 
posifions befween fhe flexible protein chains. This 
organization of folded and unfolded domains is 
responsible for the mechanical properties such as strain 
and fiber strength and is as well reflected in the scat¬ 
tering pattern. The distance between the particles is 
derived from the positions of the scattering peaks, but 
the fixed orientation of the particles is reflected by the 
position of the peaks with respect to each other. The 
spatial resolution in the studies of organized systems is 
normally higher than in a S AXS experiment, and larger 
angles in the wide-angle X-ray scattering (WAXS) 
range are often recorded. 

Examples 

Scattering of Dilute Macromolecular Solutions 

Measurements of SAXS from biological macromole¬ 
cules in solution allow for structure determination of 
proteins, nucleic acids, and complexes at a 1-2-nm 
resolution. For solution scattering, no high-quality 
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Small Angle X-ray 
Scattering, Fig. 4 Scattering 
from a well-organized silk 
fiber. The high scattering 
intensity peaks represent 
the different distances within 
the fiber 



crystals are required and objects in a very broad range 
of molecular mass (from a few kDa to GDa) can be 
studied (Svergun and Koch 2003; Vachette et al. 2003; 
Mertens and Svergun 2010). In solution, one can read¬ 
ily change the environmental conditions such as tem¬ 
perature, pH value, or salinity and add ligands or 
substrates. Modern solution scattering analysis soft¬ 
ware permits rapid shape analysis by ab initio and 
rigid body modeling. In many structural biology inves¬ 
tigations, solution scattering gives complementary 
information to high-resolution structural data derived 
from protein crystallography or nuclear magnetic res¬ 
onance (► NMR). 

In the example below, SAXS is employed to reveal 
structural changes of a protein upon ligand binding 
(Schmid et al. 2009). The pathogenic bacteria 
Yersiniae pestis utilize a special secretion system to 
inject outer membrane proteins into host cells in order 
to undermine their immune response. One target 
is phosphoenolpyruvate carboxylase (PEPC), a 
homotetrameric protein complex of 98 KDa, to which 
a small bacterial protein YscMl (12 kDa) bind as 
ligand. 

The scattering patterns from the two states and the 
ab initio models reconstructed by the program 
DAMMIN are presented in Fig. 5. The shape of the 
nonliganded state (a) differs significantly from that of 
the ligand bound state (b). YscMl induces 


conformational changes that reduce the twisting of 
the two dimers, leading to a more planar arrangement 
of the four subunits. These conformational changes 
may represent transitions from a relaxed “R” state to 
a tense “T” state, which are assumed to control the 
activity of allosteric enzymes. 

Scattering of Lipids and Nanoparticles 

As SAXS is able to provide structural information 
about large particles (up to tens or even hundreds nm 
in size), the technique is ideally suited for various 
nanostructured systems, in particular, lipid systems 
and nanoparticles (Pabst et al. 2010). As an example. 
Fig. 6a shows the SAXS pattern from the solution of 
BaTi 03 nanodisks (Zhu et al. 2005). The prominent 
peak at about 0.7 nm“* is a result of stacking the 
nanodisks into long rods. The center to center distance 
d of the disks in the rod calculated as <7 = — is about 
9 nm. The small-angle part of the scattering data 
largely corresponds in this case to the form factor of 
the particle. The scattering intensity is however 
a product of the form factor with the structure factor. 
In this example, the form factor is visible as slight 
modulation on the structure factor peak as seen in 
inlet of Fig. 6a. This rod packing and conformational 
changes on the nanodisks upon varying the environ¬ 
mental conditions such as temperature and pH value 
can be investigated in detail. 









2388 


Small Angle X-ray Scattering 







s (nm '') 


Small Angle X-ray Scattering, Fig. 5 The different binding (b) represent the SAXS envelope of PEPC with YscMl. The 

states of PEPC. Left: Scattering intensities of PEPC (gray) and of binding of the ligand switches all four suhunits in a different 

PEPC in complex with YscMl (black). Right: The solution state, which indicates an allosteric binding system of the entire 

structures of the protein without YscMl (a) and the structure protein 
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Small Angle X-ray Scattering, Fig. 6 (a) Scattering profile of 
the BaTiOs nanodisks. At small angles, the form factor is still 
visible, but the regular stacking raises a structure factor peak at 


Important types of systems analyzed by SAXS are 
lipid membranes and vesicles. In the following exam¬ 
ple (Howe et al. 2008), the binding of small peptides to 
a lipid membrane is investigated. The membrane orga¬ 
nization can be classihed into two groups, which yield 
distinctly different SAXS prohles: 

1. Lamellar membrane organization: The SAXS 
reflections are grouped in equidistant ratios, i.e., 1, 
1/2, 1/3, 1/4, etc., of the lamellar repeat distance dj. 

2. Cubic membrane organization: The different space 
groups of these nonlamellar three-dimensional 
structures differ in the ratio of their spacing. The 



s (nm '') 


0.7 nm *. (b) Scattering functions of lipids with cubic organi¬ 
zation (gray profile) and lamellar stacking (black profile) 


relation between reciprocal spacing Shu = -;r~ ^nd 

^hkl 

lattice constant a is: 

/ fh^ + k^ + P\ 

« = ]l [ —i— j 

and hkl are integer numbers, which describe the crys¬ 
tallographic planes and orientations in a lattice (called 
Miller indices in X-ray crystallography [► X-Ray 
Diffraction and Crystallography of Oligosaccharides 
and Polysaccharides]). 
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In Fig. 6b, an example of a lamellar and a cubic 
membrane organization is shown (Howe et al. 2008). 
In this study, the interactions of small peptides with the 
lipids were investigated. These peptides cause 
a reorganization of the lipids depended on their bind¬ 
ing behavior. The reflections at 6.4, 5.22, and 3.3 of the 
gray profile are l/-\/2, l/v/S, and l/v/8 of a periodicity 
at 9.2 nm, and thus, this pattern corresponds to a cubic 
structure of space group which was found to play 
an essential role for bioactivity of lipids. The lamellar 
organization of the intact lipid bilayer reveals itself in 
pronounced equidistant maxima (black profile in 
Fig. 6b). In this case, the peptides cause a lipid stacking 
of about 4.8 nm, and they may intercalate into the lipid 
layer influencing its biological activity. 
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Small-Angle X-Ray Solution Scattering: 
Software Tools for Biological Structure 
Analysis 

Dmitri I. Svergun and Petr V. Konarev 
European Molecular Biology Laboratory, Hamburg 
Outstation, Hamburg, Germany 

Introduction 

Small-angle scattering of x-rays (SAXS) is a powerful 
method for the low-resolution structural analysis of 
biological macromolecules in solution. The technique 
provides three-dimensional low-resolution structures, 
using ab initio and rigid body modeling, and allows 
a researcher to assess the oligomeric state of proteins 
and protein complexes. In addition, SAXS is 
a powerful tool for structure validation and the quan¬ 
titative analysis of flexible systems, and is highly com¬ 
plementary to the high-resolution methods of X-ray 
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crystallography (► Macromolecular Crystallography: 
Overview) and NMR (►NMR). At present, SAXS 
analysis methods allow one to perform automated 
and rapid characterization of protein solutions in 
terms of low-resolution models, quaternary structures, 
and oligomeric composition. A brief overview of soft¬ 
ware tools available for the analysis of proteins in 
solution using SAXS is presented here. 

Primary Data Reduction and Processing 

The scattering patterns from dilute solutions of macro¬ 
molecules are typically presented as one-dimensional 
(ID) curves (Fig. 1, Box I) and correspond to the 
scattering from a single macromolecule averaged 
over all orientations. The data manipulations with ID 
data sets and the primary data analysis steps (radial 
averaging, buffer subtraction, merging, extrapolation 
to zero concentration, etc.) can be done by several 
program packages available to the scientihc commu¬ 
nity via the Internet, e.g., FIT2D, GRASP, FISH, 
SASFIT, SCATTER, IRENA, or PRIMUS (see e.g., 
references in Konarev et al. 2006). 

Overall SAXS Parameters and Rapid Sample 
Characterization 

Several overall invariant geometrical and weight 
parameters can be extracted directly from the scatter¬ 
ing curves enabling a rapid sample characterization 
(Box II in Pig. I). These parameters include the molec¬ 
ular mass (MM), radius of gyration (R^), hydrated 
particle volume (Vp), and maximum particle diameter 
(Dmax) (Peigin and Svergun 1987; Putnam et al. 2007; 
Jacques and Trewhella 2010). 

The Pourier transformation of the scattering inten¬ 
sity yields the particle distance distribution function 
p(r) conveniently visualizing the peculiarities of the 
particle shape (Box III in Pig. 1). Thus, globular parti¬ 
cles yield bell-shaped p(r) with a maximum at approx¬ 
imately while multi-domain particles display 

shoulders or maxima due to inter-domain separations. 
The distance distribution functions are usually com¬ 
puted using so-called indirect transformation methods, 
e.g., programs IPT or GIPT, the latter taking into 
account possible interparticle interference or GNOM 
(Konarev et al. 2006). 


Ab Initio Methods 

Reconstruction of three-dimensional (3D) structure 
from isotropic scattering data ab initio is possible 
for monodlsperse ensembles of particles without 
interactions (►NMR Studies of Macromolecular 
Interactions - Introduction), and only at a low resolu¬ 
tion (1-2 nm). At this resolution, the search is usually 
limited to homogeneous models (shape determina¬ 
tion). In the past, shape modeling has been done on 
a trial-and-error basis by computing scattering patterns 
from different shapes built from hnite volume ele¬ 
ments (e.g., beads) and comparing them with the 
experimental data. Starting from the end of the 
1990s, with the improved number-crunching power 
of computers the burden of model building has been 
delegated to the latter. The idea of the finite element 
shape determination is to start from an initial (usually 
random) model configuration and modify the model in 
small steps to finally construct a compact and 
interconnected model providing a good fit to the exper¬ 
iment. Different flavors of Monte Carlo minimization 
(e.g., genetic algorithm or simulated annealing) are 
employed and different strategies using limited and 
unlimited search volumes are implemented in com¬ 
puter programs (such as DAPAI_GA, SAXS3D, 
SASMODEP, DAMMIN/DAMMIF, GASBOR, see 
Petoukhov and Svergun 2007 for references). A pro¬ 
gram MONSA permits one to build inhomogeneous 
bead models of multicomponent particles from multiple 
scattering data sets (Konarev et al. 2006) (Boxes IV-V 
in Pig. 1) from SAXS and SANS experiments (► Small 
Angle Neutron Scattering (SANS) Software). 

Ab initio shape determination became now 
a standard tool of SAXS data analysis. Given the loss 
of information due to the spherical average of the 
scattering data, unambiguous shape reconstruction is 
fundamentally impossible and care must be taken not 
to overinterpret the data. 

The ab initio methods, seeking to reconstruct a 3D 
shape from a ID scattering pattern cannot provide 
a unique solution and, when running the programs 
multiple times, somewhat different models are 
obtained. A comparison of these models ensures that 
the most persistent features are identified from 
a number of models that fit the data equally well but 
show variation in shape. The programs SUPCOMB 
and DAMAVER (see references in Petoukhov and 
Svergun 2007) are available to identify the most 
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probable/representative ensemble member and to gen¬ 
erate the average model. In addition, one must always 
remember that the scattering patterns are invariant to 
handedness such that an enantiomorphic model must 
always be considered (SUPCOMB/DAMAVER do 
allow for this enantiomorphism). 

Computation of Scattering from High- 
Resolution Models 

If a high-resolution structure or homology model of 
a macromolecule or a complex is available, the theoret¬ 
ical scattering curve can be computed compared with 
the measured SAXS data. This computation should take 
into account, generally speaking, three terms: the 
atomic scattering in vacuo, the excluded volume (the 
volume occupied by the biomolecule in solution that is 


inaccessible to solvent), and the scattering arising from 
the hydration layer. Obtaining a good agreement with 
the experiment suggests that the overall structure of the 
high-resolution model does coincide with the structure 
of the macromolecule in solution. This approach is often 
used to validate predicted models, to identify biologi¬ 
cally active conformations and crystallographic oligo¬ 
mers (► Protein Oligomers), and to analyze possible 
differences in the quaternary structure between crystal 
and solution (Box VI in Fig. I). Several programs are 
available for the computation of the theoretical scatter¬ 
ing curves from the atomic models (e.g., CRYSOL, see 
references in Petoukhov and Svergun 2007). New 
approaches are being developed for the accurate predic¬ 
tion of wide angle scattering data (WAXS) where infor¬ 
mation on both tertiary and secondary structure 
(► Protein Secondary Structure Prediction in 2012) 
can potentially be extracted (Yang et al. 2009). 
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Rigid Body Modeling 

The assembly of macromolecular complexes can be 
studied through the docking of individual components 
into ah initio shapes (► HADDOCK). However, as the 
resolution of SAXS-derived shapes is low, it is more 
reliable to model the assembly of such complexes 
through direct refinement against the scattering data. 
A number of interactive and automated approaches 
(programs ASSA, MASSHA, etc.) (Box VI in Fig. 1) 
have been developed using SAXS to determine the 
positions and orientations of subunits within macro- 
molecular complexes (Konarev et al. 2006; Perkins et 
al. 2008). From the known structures (or homology 
models) (► Homology Modeling of Protein Structures) 
of subunits, the theoretical scattering of a complex can 
be rapidly calculated using spherical harmonics for¬ 
malism. This formalism is also applied to automated 
rigid body modeling in the programs SASREF and 
BUNCH (Petoukhov and Svergun 2005) (Boxes 
VI-VII in Fig. 1) and is applicable both for X-rays 
and neutron scattering (Petoukhov and Svergun 2007) 
(► X-Ray Diffraction and Crystallography of Oligo¬ 
saccharides and Polysaccharides, ► Neutron Scatter¬ 
ing of Membranes). 

SASREF is a comprehensive automated rigid body 
modeling program allowing for simultaneous fitting of 
multiple scattering curves (e.g., when multiple con¬ 
structs such as deletion mutants have been measured). 
The use of symmetry, orientational constraints (e.g., 
from residual dipolar couplings measured by NMR 
(► NMR)), inter-residue contacts (e.g., from mutagen¬ 
esis or cross-linking experiments) and inter-subunit 
distances (e.g., from FTIR and FRET) 
(► Fluorescence and FRET in Membranes) and are 
supported. SASREE requires that complete high- 
resolution models (or reliable homology models 
(► Homology Modeling of Protein Structures)) of 
all of the subunits are available for the rigid body 
modeling of a protein complex (► Protein Com¬ 
plexes). When the structures of linkers or entire 
domains are unknown, an alternative approach 
combining both rigid body modeling and ab initio 
methods is used. The program BUNCH allows one to 
additionally model missing regions in macromolecular 
complexes and multi-domain proteins by representing 
these fragments as chains of dummy amino acid 
residues. 


Analysis of Mixtures 

SAXS allows for the quantitative description of mix¬ 
tures (e.g., oligomeric equilibria and assembly pro¬ 
cesses). For mixtures and polydisperse solutions of 
noninteracting particles the resulting scattering pattern 
is a sum of the contributions from each component of the 
mixture 4(5), weighted by the volume fraction v^. of that 
component. Determination of the latter fractions is the 
most typical task in the analysis of mixtures. Several 
methods have been developed to help simplify the anal¬ 
ysis of equilibrium mixtures using SAXS (Konarev et al. 
2006). In the program OLIGOMER, volume fractions 
are calculated provided the scattering patterns of the 
components are known (e.g., computed from known 
structures of components). Other methods used to deter¬ 
mine the number of independent components include 
singular value decomposition (SVD) and multivariate 
curve resolution (MCR-ALS) (Box VIII in Fig. 1). 

Flexible Systems 

Flexible systems constitute a great challenge for all 
structural methods, as one cannot search for a single 
model representing the particle structure, given that 
many different conformations may coexist with each 
other. Typical examples are intrinsically unfolded pro¬ 
teins and modular multi-domain proteins with flexible 
linkers that can be represented as ensembles of struc¬ 
tures in solution. In an ensemble optimization method 
(EOM) (Bemado et al. 2007), a large pool of random 
configurations is generated and a genetic algorithm is 
employed to select the ensembles of conformers, mix¬ 
tures of which yield the scattering best fitting the 
experimental data. The properties of these ensembles 
are compared to those of the random pool to provide 
quantitative information about flexibility and proper¬ 
ties of the system (Box IX in Fig. I). 

SAXS and Complementary Techniques 

Although SAXS can be used as a standalone structural 
tool, the technique is most efficient when synergistically 
employed together with other structural, biophysical, 
and biochemical methods and bioinformatics. In fact, 
most of the programs mentioned above are using 
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complementary information from other methods to 
build or validate the SAXS models. Approaches to 
structural modeling are currently being developed, 
which aim at building consensus models based on as 
much complimentary information as possible. An inte¬ 
grated modeling platform (IMP) is one such project 
using SAXS data as an additional spatial restraint (For¬ 
ster et al. 2008). 


Summary 

Small-angle scattering of x-rays (SAXS) is a powerful 
method for the low-resolution structural analysis of 
biological macromolecules in solution. The technique 
provides three-dimensional low-resolution structures, 
using ab initio and rigid body modeling, and allows 
one to assess the oligomeric state of proteins and 
protein complexes. In addition, SAXS is a powerful 
tool for structure validation and the quantitative anal¬ 
ysis of flexible systems, and is highly complementary 
to the high-resolution methods of X-ray crystallogra¬ 
phy and NMR. At present, SAXS analysis methods 
allow one to perform automated and rapid characteri¬ 
zation of protein solutions in terms of low-resolution 
models, quaternary structure, and oligomeric compo¬ 
sition. The brief overview of software tools available 
for the analysis of proteins in solution using SAXS is 
presented. 


Cross-References 

► Fluorescence and FRET in Membranes 

► HADDOCK 

► Homology Modeling of Protein Structures 

► Macromolecular Crystallography: Overview 

► Neutron Scattering of Membranes 

► NMR 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► Protein Complexes 

► Protein Oligomers 

► Protein Secondary Structure Prediction in 2012 

► Small Angle Neutron Scattering (SANS) Software 

► Wide and Small Angle X-Ray Scattering 


References 

Bemado P, Mylonas E, Petoukhov MV, Blackledge M, Svergun DI. 
Structural characterization of flexible proteins using small- 
angle X-ray scattering. J Am Chem Soc. 2007; 129:5656-64. 

Feigin LA, Svergun DI. Structure analysis by small-angle x-ray 
and neutron scattering. New York: Plenum; 1987. 

Forster F, WebbB, Krukenberg KA, TsurutaH, AgardDA, Sail A. 
Integration of small-angle X-ray scattering data into stmctural 
modeling of proteins and their assemblies. J Mol Biol. 
2008;382:1089-106. 

lacques DA, Trewhella J. Small-angle scattering for structural 
biology - expanding the frontier while avoiding the pitfalls. 
Protein Sci. 2010;19:642-57. 

Konarev PV, Petoukhov MV, Volkov VV, Svergun DI. ATSAS 
2.1, a program package for small-angle scattering data anal¬ 
ysis. J Appl Crystallogr. 2006;39:277-86. 

Perkins SI, Okemefuna AI, Fernando AN, Bonner A, Gilbert HE, 
Furtado PB. X-ray and neutron scattering data and their 
constrained molecular modelling. Methods Cell Biol. 
2008:84:375^23. 

Petoukhov MV, Svergun DI. Global rigid body modelling of 
macromolecular complexes against small-angle scattering 
data. Biophys J. 2005;89:1237-50. 

Petoukhov MV, Svergun DI. Analysis of X-ray and neutron 
scattering from biomacromolecular solutions. Cuit Opin 
Struct Biol. 2007;17:562-71. 

Putnam CD, Hammel M, Hura GL, Tainer JA. X-ray solution 
scattering (SAXS) combined with crystallography and com¬ 
putation: defining accurate macromolecular structures, con¬ 
formations and assemblies in solution. Q Rev Biophys. 
2007;40:191-285. 

Yang SC, Park S, Makowski L, Roux B. A rapid coarse residue- 
based computational method for X-Ray solution scattering 
characterization of protein folds and multiple conformational 
states of large protein complexes. Biophys J. 2009;96:4449-63. 


Small Molecule Inhibitors of Protein- 
Protein Interactions 

► Bcl-2 Inhibitors 


smFISH 

► Single-Molecule RNA FISH 


Sodium Channels 

► ENAC, Degenerins, ASICs, and Related Channels 

► Voltage-Gated Sodium and Calcium Channels 







2394 


Sodium-Driven Flagellar Motor: Structure and Mechanisms 
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Synonyms 

Energy transduction; Molecular motors; Rotary motor 

Definition 

Ion-driven motor of flagella is very unique and is 
a force generation machine driven hy ion flow. The 
flagellar motor, which is a molecular machine com¬ 
posed of many proteins, is embedded in inner mem¬ 
brane of bacteria and generates torque by ion flux. 

Introduction 

Many motile bacteria move by rotating flagella. These 
are filamentous organs extending from the cell body. 
A flagellum consists of three parts, the filament, the 
hook, and the basal body. The filament functions as 
a helical propeller, and the basal body acts as a rotary 
motor embedded in the cell membranes. The hook 
works as a universal joint between these two structures 
that transmit torque smoothly. The motor can run 
reversibly in both directions: clockwise (CW) and 
counterclockwise (CCW). Energy for rotation of the 
motor comes from the electrochemical gradient of 
specific ions across the cytoplasmic membrane and 
motors are classified info two types by the coupling 
ions: the H'^-driven motor (in Escherichia coli. 
Salmonella enterica, and Bacillus suhtilis), and the 
Na'^-driven motor (in alkalophilic Bacillus and marine 
Vibrio). Motor torque is generated by the interactions 
between rotating part (rotor) and nonrotating part (sta¬ 
tor) of the motor that is coupled to the ion translocation 
through the channel in the stator. Only five proteins 
appear to be responsible for torque generation among 
more than 50 gene products involved in flagellar 
assembly and function. Three of them are rotor pro¬ 
teins, FliG, EliM and FliN, located at the cytoplasmic 


side of the basal body and form the C ring structure. 
The other two are integral membrane proteins that 
form the stator channel complex, MotA and MotB 
for H'^-driven motors, and Pom A and PomB for 
Na'^-driven Vibrio motors. In Vibrio, additional two 
proteins, MotX and MotY, are also essential for torque 
generation (McCarter 1994a, b; Okunishi et al. 1996). 

The Na'^-driven motor has advantages for the study 
of motor function because the Na’^-motive force can be 
more easily manipulated, by simply changing the Na"^ 
concentration in the medium, and the motor function 
can be inhibited by specific Na"^ channel blockers, 
which are also powerful tools for the study of the 
energy-coupling mechanism of the motor (Yorimitsu 
and Homma 2001). 

The Na'^-driven motor was first reported in 
alkalophilic Bacillus that lives optimally and is vigor¬ 
ously motile at pH 10-11. Under alkaline conditions, 
the H’^-motive force is small because the intracellular 
pH of these bacteria is maintained at pH 8-9 by 
homeostatic mechanisms, suggesting that the energy 
source for the motor is not the H’^-motive force. 
Indeed, only Na"^ is effective at supporting motility of 
alkalophilic Bacillus, and motor rotation was proved to 
be dependent on the sodium motive force. Besides 
alkalophiles, Na'^-driven motors are found in the 
marine bacterium. Vibrio alginolyticus. When grown 
in liquid environments, this bacterium swims by using 
a Na’^-driven single polar flagellum. This motor can 
rotate at extraordinary high speed, up to 1,700 Hz 
(Magariyama et al. 1994). Since genetic manipulation 
of this organism has been established, most of the 
studies for the Na'^-driven motor have been conducted 
in this bacterium. Although the basic mechanism for 
torque generation is shared with H'^-driven motor,the 
Na’^-motor in V. alginolyticus has several components 
and properties distinct from H'^-type motors. In 
this entry, we review our current knowledge of the 
Na'^-driven motor, mostly based on the studies from 
V. alginolyticus. 

The Basal Body of the Na'^-Driven Polar 
Flagellar Motor of V. alginolyticus 

Structural features of the flagellar basal body have 
been extensively studied in the Salmonella motor. 
The Salmonella motor consists of several ring struc¬ 
tures (L, P, MS, and C rings) and a rod that is connected 
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Sodium-Driven Flagellar Motor: Structure and Mecha¬ 
nisms, Fig. 1 (a) Model of the flagellar basal body in Vibrio. 
H ring formed by FlgT and T ring formed by MotX and MotY are 
shown as light blue. The PomA/B complex is shown as orange. 
(b) Model of the stator assembly in the sodium-driven polar 
flagellar motor of Vibrio. MotY is indicated by the surface 
model colored by blue for N-terminal and yellow for C-terminal 
domain. MotX, PomA and PomB are colored by orange, (i) 
Mature MotY forms a complex with MotX at the N-terminal 


domain and diffused around in the periplasmic space with low 
affinity to peptidoglycan layer. The complex can be tightly 
associated with the basal body via the N-terminal domain of 
MotY to form the T ring, (ii) The PomA/PomB stator complex 
migrates to the basal body. The PomA/PomB complexes are 
located around the rotor through association with MotX in 
T ring. OM outer membrane, PG peptidoglycan layer, IM inner 
membrane 


to the hook. Three of these rings, the L, P, and MS 
rings, are embedded in the outer membrane, peptido¬ 
glycan layer and inner membrane, respectively, while 
the C ring is attached to the cytoplasmic side of the 
basal body. The electron microscopic observation of 
the purified hook-basal body of the polar flagellar 
motor of V. alginolyticus revealed that overall struc¬ 
ture of the basal body was similar to that of Salmonella, 
but when carefully observed, there was an additional 
new ring structure that appeared as an appendage on 
the periplasmic side of the P ring. This ring, later 
named T ring, had not been previously observed in 
the basal bodies of other Gram-negative bacteria 
(Terashima et al. 2006). Components of the purified 
hook-basal body were analyzed by immunoblot and, 
unexpectedly, MotX and MotY, essential components 
of torque generation in the Na"^ motor, were detected. 
They were co-fractionated with the basal body proteins 
by sucrose density gradient centrifugation, suggesting 
that MotX and MotY are basal body components. 
Immunoelectron microscopic observation conhrmed 
this idea, showing that the gold-labeled particles were 
observed around the L, P, or T rings. Furthermore, the 
T ring structure was not observed in strains that lacked 


MotX and/or MotY. These results indicated that MotX 
and MotY form the T ring structure. Recently, we 
found an H ring in the basal body of V. alginolyticus 
and showed that the flgT gene was essential for the 
formation of the H ring, which seemed to be a scaffold 
for forming the T ring structure (Terashima et al. 
2010a) (Fig. la). The next question was to identify 
the role of T ring in torque generation. Since MotX 
and MotY are essential for rotation, T ring must be 
involved in motor function. Clues to answer this ques¬ 
tion came from the fact that the localization of the 
PomA/PomB stator complex to the motor was not 
observed in the strains that lack MotX or MotY 
(Terashima et al. 2006). This suggests that T ring is 
involved in the incorporation and/or stabilization of 
the stator complex in the motor. Mutational and bio¬ 
chemical analyses showed that external side of the 
T ring is formed by the MotX, and MotX and may 
interact with PomB. The crystal structure of MotY 
showed two distinct domains, an N-terminal domain 
(MotY-N) and a C-terminal domain (MotY-C) 
(Kojima et al. 2008). Expression of MotY-N in 
AmotY strain only slightly restored motility, indicating 
that MotX interacts with MotY-N. From this evidence. 
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it was proposed that PomA/PomB stator complexes 
assemble around the rotor through association with 
MotX in the T ring and binding to the PG layer via 
the C-terminal domain of PomB (Fig. lb). 

Structure and Function of the Na'^-Driven 
Stator Complex 

As described above, the stator of the Na’^-driven polar 
flagellar motor is composed of PomA and PomB. They 
are orthologs of Mot A and MotB, the fP^-driven 
stator proteins. PomA and PomB form a complex 
with 4A:2B stoichiometry, and Na"^ flux through the 
purified PomA/PomB complex reconstituted into the 
proteoliposome has been demonstrated (Sato and 
Homma 2000), indicating that this complex couples 
Na"^ flux to rotation. PomA has four transmembrane 
(TM) segments with a large cytoplasmic loop, and 
PomB has a single TM segment with a large 
C-terminal periplasmic region, that contains the 
peptidoglycan-binding motif responsible for anchoring 
the stator complexes around the rotor. The TM of 
PomB contains the essential and conserved aspartate 
(Asp24), and it is proposed that protonation of this 
aspartate in MotB trigger the conformational changes 
in the cytoplasmic domain of MotA that act on the 
rotor to drive rotation by a power stroke mechanism. 
Therefore, a similar mechanism of torque generation in 
the Na'^-motor is suggested with Na"^ binding to Asp24 
of PomB being essential for torque generation. 

There are no high-resolution structural data of the 
stator complex because its strongly hydrophobic 
nature makes it difficult to obtain crystals. Instead, 
the arrangement of the TM segments of MotA and 
MotB from E. coli has been investigated by systematic 
disulfide cross-linking studies. The results uncovered 
the initial picture of the MotA/MotB stator complex; 
a symmetric dimer of MotB segments is at the center of 
the complex, and TMs of four MotA molecules are 
arranged around the MotB dimer. Cysteine residues 
introduced in the third and fourth TMs (TM3 and 
TM4) of MotA can form disulfide bridges between 
those introduced in the single TM of MotB, which 
contains the proton-accepting Asp residue, suggesting 
that proton-conducting channel is formed by these 
three segments. This arrangement seems to fit into the 
PomA/PomB complex: the TM3 of PomA positions 
very near the TM of PomB. 


Phenamil, a potent inhibitor for eukaryotic epithe¬ 
lial sodium channel, also works as a specific inhibitor 
for the rotation of Na’^-driven motor. Mutations that 
allow motor to rotate in the presence of phenamil were 
mapped near the cytoplasmic face of the TM3 of 
PomA (D148Y) and TM of PomB (P16S), and when 
replaced to Cys, these two residues can form a disulfide 
bridge (Kojima et al. 1999; Yakushi et al. 2004). These 
results suggest that residues in TM3 of PomA and TM 
of PomB (including critical Asp24) may participate in 
forming a Na'^-binding site near the cytoplasmic face. 
Systematic Cys replacement of the residues located in 
the periplasmic loops of PomA revealed that Prol72, 
which are located in loop3-4, forms cross-linked 
dimer under the oxidation condition, so loop3^ of 
two PomA molecules are positioned side by side in 
a PomA/PomB complex (Yorimitsu et al. 2000). 
Recently, it was reported that mutation in Asp24 
could be suppressed by introduction of negatively 
charged residue at position 194 in PomA (N194D, in 
TM4), suggesting that an introduced Asp 194 in PomA 
can also serve in ion conduction and Asp24 of PomB 
seems to be positioned optimally for Na'^-binding 
(Terashima et al. 2010b). Therefore, these residues 
are essential components of the Na'^-binding pocket 
in the ion-conducting pathway of PomA/PomB stator 
complex (Fig. 2). Indeed, Na"^ binding to Asp24 of 
PomB was directly observed by the attenuated total 
reflection Fourier-transform infrared (ATR-FTIR) 
spectroscopy (Sudo et al. 2009). 

Which part of the stator proteins determines ion 
specificity? The Fl'^-driven stator of the Rhodohacter 
sphearoides is composed of MotA (RsMotA) and 
MotB (RsMotB), with RsMotA being remarkably 
similar to PomA of V. alginolyticus. Both of them are 
composed of 253 amino acids and their identity is more 
than 40% over their entire length, whereas RsMotB is 
only similar to the TM region of PomB. When RsMotA 
was expressed in the pomA strain of V. alginolyticus, 
the motor was functional and driven by Na’^-motive 
force (Asai et al. 2000). On the other hand, cells are 
nonmotile when RsMotB was expressed together with 
RsMotA in the pomB strain of V. alginolyticus. There¬ 
fore, determinants of ion specificity in the stator must 
exist in PomB (and MotB). A series of chimeric pro¬ 
teins, consisting of N-terminal RsMotB and C-terminal 
PomB (termed MomB), was constructed. Some of the 
MomB constructs, whose entire TM was derived 
from RsMotB, functioned as Na’^-type stator when 
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Sodium-Driven Flagellar Motor: Structure and Mecha¬ 
nisms, Fig. 2 The model of the ion flux pathway, (a) Asp24 
of PomB and Asp32 of MotB form Na* and H* binding sites, 
respectively, Cys31(PomB)-Leul83(PomA) and Ala39(MotB)- 
Met206(MotA) serve as a gate-keeper and regulator for H* and 
Na* flux, respectively. The apparent gate size for and PomA/ 
PomB is 14 -19 larger than that for MotA/MotB, 
corresponding to the volume difference between H and Na 

coexpressed with RsMotA in the SpomAB strain of V. 
alginolyticus (Fig. 3). In this case, four proteins, 
RsMotA, MomB, MotX, and MotY, are involved in 
the Na’^-driven rotation. Interestingly, a MomB con¬ 
struct, whose junction is near the periplasmic end of 
TM segment, functions better as Li*-driven motor than 
as Na*-driven motor. Later, it was found that 
a chimeric protein of PotB, a fusion of N-terminal 
TM of PomB and the C-terminal periplasmic segment 
of E. coli MotB, functions as an Na'^-type stator in E. 
coli only when coexpressed with PomA (Asai et al. 
2003). In this case, just two proteins PomA and PotB 
are required for Na'^-driven rotation. These lines of 
evidence indicate that cytoplasmic and TM domains 
of PomA/PomB complex are sufficient for Na*-driven 
motility, and periplasmic C-terminal part of B subunit 
(MotB or PomB) determines the requirement of MotX 
and MotY for function. Probably, the size of channel 
pore is varied in these chimeric stator complexes, 
altering the ion specificity. In agreement with this 


(ca. 15 A^). (b) Putative membrane topology of TM3 and TM4 
of PomA and the TM of PomB. The essential aspartate of PomB 
(Asp-24) is labeled by red. The residue targeted by the suppres¬ 
sor of the D24N substitution (N194D in PomA) is labeled in light 
blue. Mutation sites, PomA-D148 and PomB-P16, for the 
phenamil resistant and PomA-P172 for Cys cross-linking are 
labeled by yellow 


idea, a comparative mutational study of E. coli and 
Vibrio stators suggested that the periplasmic sides of 
the single TM segment of PomB and MotB form part of 
the ion flux pathway and pore with MotA and PomA. 
Recently, alkalophilic Bacillus clausii was shown to be 
motile in a pH range from 7 to 11 although its genome 
encodes only one Mot proteins (BCl-MotAB). Motility 
assay of this organism revealed that BCl-MotAB 
couples motor rotation to Na* at the high end of its 
pH range but uses H"^ at lower pH, indicating that 
BCl-MotAB has a naturally bifunctional stator that 
preferentially uses either H"^ or Na"^ across the full pH 
range. Mutational analysis identified three residues 
near the periplasmic end of the TM segment of MotB 
that confer ion specificity in this stator (Terahara et al. 
2008). These results are consistent with the chimeric 
study of the PomB/MotB proteins, indicating that ion 
specificity is determined at the region preceding the 
critical aspartate in the ion-conducting pathway of the 
stator (Fig. 2). 
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Sodium-Driven Flagellar 
Motor: Structure and 
Mechanisms, 

Fig. 3 Chimeric motors with 
the Na"^- and the H'^-driven 
components. The stator 
composed of MotA and MotB 
of motor (a) and PomA and 
PomB of Na’^ motor (b) 
expressed in the mot mutant of 
E. coli and the pom mutant of 
V. alginolyticus, function as 
and Na"^ motor, 
respectively. The stator 
composed of PomA and PotB 
which is chimeras between 
PomB and MotB (c) expressed 
in the mot mutant functions as 
Na"^ motor. The stator 
composed of MotA and 
MomB which is chimeras 
between MotB and PomB 
(d) expressed in the pom 
mutant functions as Na"^ 
motor. The dotted boxes show 
the inner membrane 


a in E. coli b in Vibrio 



Interaction Between Stator and Rotor 

To generate torque, the stator units must interact with 
rotor. As described above, on the rotor side FliG, FliM 
and FliN are involved in torque generation, with FliG 
most closely involved in torque generation, especially 
at the C-terminal domain that contains well-conserved 
charged residues. On the stator side, MotA and PomA 
have a large cytoplasmic loop, and this region also 
contains conserved charged residues. Mutational 
studies of the E. coli motor showed that electrostatic 
interactions between these conserved charged residues 
on the rotor and stator are important for motor rotation: 
disruption of these electrostatic interactions impairing 
motility. However, in V. alginolyticus their role in 
motor function is still ambiguous, since mutations in 
these charged residues in FliG or PomA results in only 
minor effects on torque generation, but there are sig- 
nihcant effects on motility when such mutations are 
introduced into the Na'^-driven hybrid and chimeric 
E. coli motor engineered to use rotor and stator 


components of Vibrio, suggesting that rotor-stator 
interface in Vibrio motor is somewhat more robust 
against mutation (Yakushi et al. 2006). The crystal 
structure of the middle and C-terminal domains of 
FliG from Thermotoga maritima (FliG-MC) and the 
recently reported full length FliG from Aquifex 
aeolicus (FliG-FL) revealed that the charged residues 
are exposed to the surface and clustered on a prominent 
ridge at the globular C-terminal domain (Lee et al. 
2010). To allow electrostatic interaction with charged 
residues in MotA or PomA, FliG is likely to be posi¬ 
tioned with this C-terminal ridge near the stator. 

Assembly of the Functional Motor 

V. alginolyticus has a single polar flagellum, so com¬ 
ponents required to build it are localized to a single cell 
pole. The mechanism for directing the MS ring, the 
first component for flagellar assembly, to the pole 
remains unknown, but two proteins, the putative 
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GTPase FlhF and the putative ATPase FlhG, have been 
shown to be involved in the process. The number of 
polar flagella is positively regulated by FlhF and neg¬ 
atively regulated by FlhG. FlhF is localized at flagel¬ 
lated cell pole, whereas FlhG was diffused throughout 
the cytoplasm. Immunoprecipitation demonstrated that 
FlhF and FlhG interact with each other. According to 
these results, a current model proposes that FlhF func¬ 
tions in polar location of the flagellum, and FlhG 
interacts with FlhF to prevent FlhF from localizing to 
the pole and thus FlhG negatively regulates the flagel¬ 
lar number in V. alginolyticus cells (Kusumoto et al. 
2008). 

Once FlhF and FlhG have determined flagellar 
position, assembly of the motor proceeds from inner 
structure to the outer one with tight coupling to gene 
expression. Motor assembly is completed when about 
a dozen stator complexes assemble around the rotor. 
To produce fully functional motor, these multiple 
stator units have to be incorporated at appropriate 
positions around the rotor and anchored there to gen¬ 
erate torque on the rotor. However, the detailed mech¬ 
anism of stator assembly remains largely unknown. 
To investigate the assembly mechanism of the stator, 
V. alginolyticus, with itsa Na’^-driven single polar 
flagellum, is useful. Since the motor is located at the 
base of the single polar flagellum, stator assembly 
around the rotor is easily evaluated by following its 
polar localization when fused with GFP. Using this 
system, it was found that stator assembly is a more 
dynamic process than expected with the polar locali¬ 
zation of the stator depending on the extracellular 
Na"^ concentration (Fukuoka et al. 2009). The polar 
localization of stator proteins was easily and reversibly 
regulated, simply by changing the concentration of the 
Na"^ in the buffer. In addition, the polar localization of 
the stator proteins was drastically reduced under con¬ 
ditions that collapsed the sodium motive force or in the 
presence of inhibitor phenamil. These results suggest 
that stator assembly requires not only Na"^ but also 
a sodium motive force (Fig. 4). Similar observation 
was reported in Shewanella oneidensis (Paulick et al. 
2009). This organism uses two different stator systems 
to drive rotation of its single polar flagellum, the 
H’^-driven MotA/MotB stators and the Na'^-driven 
PomA/PomB stators. Intriguingly, under the condition 
of high Na"^, PomA/PomB is more efficiently incorpo¬ 
rated into the motor, whereas MotA/MotB is 
incorporated in place of PomA/PomB in response to 


decreasing Na"^ concentration. These finding revealed 
the dynamic stator assembly process in response to the 
environmental conditions. 

Torque of the Na'^-Driven Motor 

To understand the mechanism of motor rotation, we 
need to know its basic properties: the power input and 
output, and their relationships. The measurement of the 
power input to a single motor is hindered by technical 
difficulties, but power output can be measured as 
“rotation” of the flagellar filament. Using E. coli, 
a method was developed for the precise measurement 
of motor rotation in which the cell body was fixed to 
the glass surface and a polystyrene bead was attached 
to a stub of one of its flagellar filaments. By using 
beads of different sizes and changing the viscosity of 
external medium, one can measure motor torque under 
a wide range of speeds. The results showed that an 
essentially constant torque and then a linear decline as 
the speed increased. In a high-speed regime, torque 
was strongly dependent on temperature, as seen by 
a downward shift in the “knee” value, the region of 
the transition from constant torque to declining torque, 
at lower temperature. Comparative studies have been 
reported for the Na’^-driven polar flagellar motor of 
V. alginolyticus and later for Na'^-driven PomA/PotB 
hybrid motor in E. coli. Basically similar torque-speed 
relationship to the wild-type E. coli motor was 
observed in these two reports (Sowa et al. 2003; 
Inoue et al. 2008). In the case of PomA/PotB stator 
with full expression in E. coli, the torque is approxi¬ 
mately constant (at 2,200 pN nm) from stall up to 
a “knee” speed of 420 Hz, and then falls linearly with 
speed, extrapolating to zero torque at 910 Hz (Inoue 
et al. 2008). The overall shape of the torque-speed 
curve is quite similar to the H'^-driven E. coli motor 
described earlier, but the effect of the concentrations of 
the coupling ion on torque-speed curves of V. 
alginolyticus was different from those of E. coli. The 
curves for E. coli did not change even when the exter¬ 
nal environment had a pH value in the range of 4.7-8.8. 
However, when external NaCl concentrations were 
changed, the generated torque was changed over 
a wide range of speeds for Vibrio motor. This differ¬ 
ence seems to be derived not from the mechanism of 
flagellar motors but from cell homeostasis when the 
concentration of the external coupling ion is varied. 
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Sodium-Driven Flagellar 
Motor: Structure and 
Mechanisms, Fig. 4 Model 
of the stator assembly in the 
Na'^-driven polar flagellar 
motor of Vibrio. PomA and 
PomB are colored by red and 
blue, respectively. White 
circles with D in the PomB 
transmembrane region 
indicate the aspartic acid 
residue for Na'^-binding site. 
Green circles indicate sodium 
ions, (a) PomA and PomB 
form a 4A:2B complex 
independent of Na"^. (b) The 
complex diffuses around in the 
inner membrane, and the 
PomA/PomB complex in 
which Na"^ is bound can locate 
around the rotor through 
association with MotX in 
T ring or FliG. (c) The bound 
Na"^ is released from the 
complex and generates 
a rotational force. The stator 
complex is released from the 
rotor 




The Na'^-driven motor is likely to generate torque 
using basically the same mechanism as the H'^-driven 
motor. 

Steps in Rotation of the Na'^-Driven Motor 

The rotation of the motor occurs by a sequence of 
discrete molecular events, presumably including the 
stages of ion binding in the stator followed by confor¬ 
mational changes in the stator that drive the rotor. 
Therefore, one can imagine that the motor behaves 
like a stepping motor. Direct observation of steps in 
the motor has been achieved by making a motor rotate 
as slow as possible, again using the PomA/PotB hybrid 
E. coli motor (Sowa et al. 2005). Motor rotation was 


detected by back-focal-plane (BFP) interferometry of 
500-nm diameter beads attached spontaneously to 
sticky flagellar filament stub or by high-speed video 
recording of 200-nm diameter fluorescent beads. By 
using these experimental setups, the stepping rotations 
were observed at speeds below 7 and 40 Hz in BFP and 
fluorescence experiments, respectively. The step size 
was analyzed from these observations, and it was 
revealed that there is a peak at 13.7° (26 per revolution) 
and —10.3° (35 per revolution) for forward and back¬ 
ward steps, respectively. Observation of 26 steps per 
revolution in the flagellar motor is consistent with the 
periodicity of the ring of FliG protein, suggesting that 
the motor steps along each FliG molecule in the rotor. 
It was reported that a wild-type E. coli cell with an 
H'^-motive force of about 150 mV drives a 1 pm bead 
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with an estimated 280 pN nm per unit, suggesting that 
maximal step size for one ion in one unit would be 5° if 
all the input energy was consumed for the torque 
generation. Thus, there may be smaller substeps, 
or alternatively mechanical step may be coupled to 
several ion translocations. 

Summary 

Since protons are difficult to manipulate, the Na"^- 
driven motor is more useful to study ion-binding site 
and ion-conducting pathway in the stator channel. 
Using this advantage, direct demonstration of Na"^ 
binding to critical aspartate in PomB was achieved, 
and mutational studies identified several important 
residues for ion binding nearby this residue. Moreover, 
stator assembly was found to be dependent on the Na"^ 
concentration in the medium. The chimeric Na'^-driven 
stator complex, PomA/PotB, makes it much easier to 
work on the Na’^-motor in E. coli, thereby opened to the 
field for biophysical assays, which have led to the 
measurement of the stepping motion of the motor. 
Currently many microbial genomes are available and 
homologous proteins to PomA and PomB have been 
identified in several uncharacterized bacteria. In the 
near future, comparative studies of such Na'^-driven 
motors will provide a more detailed mechanism of 
torque generation coupled with Na"^ influx. However, 
an atomic model of the stator complex and establish¬ 
ment of simultaneous measurement system to 
investigate input-output relationships are required for 
understanding full mechanism of motor function. 

Cross-References 

► Bacterial Flagellar Motor: Biochemical and 
Structural Studies 

► Bacterial Flagellar Motor: Overview 

► Electron Microscopy of Motor Structure and 
Possible Mechanisms 

► Membrane Proteins: Structure and Organization 
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Definition 

Members of the Sodium-Substrate Symporter or 
Solute-Sodium Symporter (SSS) family (TCDB 2. 
A.21 and GO term 0015370, and the human gene 
family SLC5) utilize the potential energy from the 
Na"^ electrochemical gradient to drive the uphill 
cotransport of a variety of molecules into the cell in 
all organisms. These diverse solutes transported by 
specihc members of the family include sugars, amino 
acids, organo-cations, nucleosides, inositols, vitamins, 
urea, or anions. 

Introduction 

Symporters catalyze the transfer of one substance 
across a biological membrane with the synchronized 


transport of a different substance in the same direction. 
Unlike primary active transporters, which consume 
ATP, secondary active transporters use electrochemi¬ 
cal gradients as the energy for transport of the second 
species against their concentration gradient 
(► Membrane Transport, Energetics and Overview). 
Members of the SSS family of transporters use 
a sodium electrochemical gradient, generated by the 
Na'^/K'^ ATPase, to transport a diverse set of 
molecules. 

These membrane transport proteins work via an 
alternating access mechanism in which the substrate 
binds to one face of the membrane protein. After 
energy input a conformational change exposes the 
substrate to the opposite face of the membrane where 
it is released (Fig. 1). The ability of the protein to 
accumulate the substrate inside the cell depends on 
the stoichiometry of the sodium and substrate fluxes. 
For the same sodium electrochemical potential gradi¬ 
ent, a symporter that couples substrate transport with 
two sodium ions has the theoretical ability to concen¬ 
trate the substrate 20-fold higher than the one that is 
only coupled to a single sodium ion (Wright et al. 
2011). 

The SSS gene family contains hundreds of members 
from both prokaryotic and eukaryotic origin (Fig. 2), 
which are primarily grouped based on their 
sequence similarity. The crystal structures of an 
SGLT from Vibrio cholera (vSGLT) (Watanabe et al. 
2010; Faham et al. 2008), a member of the SSS 
family, demonstrated that proteins from unrelated 
gene transporter families (► Mhpl, the Na’^-Hydantoin 
Membrane Transport Protein) have a similar core 
structure, suggesting a common transport mechanism 
(Abramson and Wright 2009). 

The Human Genes: The SLC5 Family 

There are 12 SLC5 genes in the human genome (Fig. 2 
and Table 1). All, with the exception of hSGLT3, are 
sodium-substrate cotransporters; hSGLT3 is not 
a cotransporter but acts as a glucose sensor. The other 
11 members are sodium-coupled transporters for 
sugars (SGLTl, SGLT2, SGLT4, and SGLT5), inosi¬ 
tols (SMITl and SMIT2), anions (NIS), short chain 
fatty acids (SMCTl and SMCT2), choline (CUT), and 
vitamins (SMVT). Apart from being sodium 
symporters, these multifunctional proteins act as 
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Sodium-Substrate 
Symporter Family, 

Fig. 1 A six-state model for 
sodium-substrate cotransport 
(Abramson and Wright 2009). 
Sodium ions bind to the 
outward-facing conformation 
of the protein opening the 
external gates to the substrate¬ 
binding site (2). Substrate 
binding is followed by 
a conformational change that 
closes the extracellular 
binding cavity and exposes 
substrate and ions to the 
intracellular side. Then, 
substrate and ions are released 
and the protein returns to the 
outward-facing conformation 
to restart the transport cycle 



water and urea channels (SGLTl, NIS) and as tumor 
suppressors (SMCT) (Ganapathy et al. 2008). The 
biological function of the SLC5 genes is reflected in 
their expression pattern (Table 1). For example, 
SGLTl is highly expressed in the small intestine, 
where its role is to absorb glucose and galactose from 
the diet. NIS is expressed in the thyroid gland where its 
role is to concentrate iodide for the synthesis of the 
hormone thyroxin. 

Disease and Therapeutic Applications 

Mutations in the hSGLTl gene cause Glucose Galac¬ 
tose Malabsorption (GGM), a rare metabolic disorder 
characterized by a defect in intestinal glucose and 
galactose absorption (Wright et al. 2011). Newborn 
children with GGM who are on a milk diet suffer 
from severe diarrhea and dehydration that may end in 
death unless lactose, glucose, and galactose are 
completely removed from their diet. In addition, 
SGLTl plays a central role in oral rehydration therapy 
(ORT) for the treatment of acute diarrhea in patients 


infected with cholera. Mutations in the hSGLT2 gene 
cause Familial Renal Glucosuria (FRG), a rare autoso¬ 
mal recessive condition characterized by elevated uri¬ 
nary glucose excretion but with no other systemic 
consequences. The absence of any other clinical phe¬ 
notype in FRG patients has encouraged the pharma¬ 
ceutical industry to develop SGLT2 inhibitors to lower 
blood glucose levels in diabetes patients (Wright et al. 
2011 ). 

Mutations in the hNIS gene cause a congenital 
iodide transport defect (ITD), an autosomal recessive 
disorder characterized by hypothyroidism, goiter, low 
thyroid iodide uptake, and low saliva/plasma iodide 
ratio. In addition, there is an increasing interest in the 
role that hNIS plays In the uptake of radioiodide used 
for the diagnosis and treatment of both thyroid and 
breast cancer (Bizhanova and Kopp 2009). Other 
members of the SSS family related to disease are 
hSMITl associated with the pathogenesis of Down 
syndrome, and hSMCTl, a tumor suppressor gene, 
the expression of which is silenced in tumors of 
colon, thyroid, stomach, kidney, and brain (Ganapathy 
et al. 2008). 
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41 SGLT family members 
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Sodium-Substrate Symporter Family, Fig. 2 Phylogenetic 
tree of the SSS family. Unrooted phylogenetic tree showing 
selected prokaryote and eukaryote members of the SSS family 
and emphasizing in red members of the human family (SLC5) 
and in blue Vibrio parahaemolyticus SGLT (vSGLT) (Wright 
and Turk 2004). The 45 homologs were aligned in CLUSTAL 
W, Legend to Tree: Afu Archaeoglobus fulgidus, Archaea, Bha 
Bacillus halodurans, Gram+, Bsu Bacillus subtilis 55 kDa, Gram 
-t, Cel Caenorhabditis elegans, Cho choline tr., Dme Drosophila 
melanogaster, E59 Escherichia coli hypothetical 59 kDa, E62 E. 
coli hypothetical 62 kDa, Eco Escherichia coli, Eugu Eugu 
rubripes, h human, Hin Haemophilus influenzae, hyp hypothet¬ 
ical, Lpo Limulus polyphemus, horseshoe crab, Mth 


Methanothermobacter thermautotrophicus; Archaea, NIS 
Iodide tr., paa phenylacetic acid tr.. Pah Pyrococcus ahyssi, 
Archaea, Pae Pseudomonas aeruginosa, panP Pantothenate tr., 
Pmu Pasteurella multocida, Ppu Pseudomonas putida, putP 
Proline tr., rah rabbit, rna from cRNA, Sau Staphylococcus 
aureus, Gram-r, See Saccharomyces cerevisiae, Sco Streptomy- 
ces coelicolor, SMIT myoinositol tr., SNSTl Nucleoside tr., 
SAATl Amino acid tr., now SGLT3, Sty Salmonella 
typhimurium, Syn Synechocystis, Cyanobacteria, v Vibrio 
parahaemolyticus, Vch Vibrio cholerae, Vit Vitamin tr., Xau 
Xanthohacter autotrophicus, dhlC, Xen Xenopus laevis (Modi¬ 
fied from Wright & Turk 2004) 


Overall Structure of vSGLT 

vSGLT is a 14 transmembrane helical protein with 
a core formed by two inverted repeats composed of 


TMl-5 and 6-10 (Faham et al. 2008). The inverted 
repeats are related by a ^153° rotation parallel to the 
membrane plane with an RMSD of 3.9 A between the 
two (Fig. 3a). Even though the two bundles are 
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Sodium-Substrate Symporter Family, Table 1 The human SLC5 family 


Gene name 

Protein name 

Gene locus 

Substrates 

Tissue distribution 

Link to disease 

SLC5A1 

SGLTl 

22ql3.1 

Glucose, galactose 

Small intestine, trachea, kidney, heart, 
brain, testis, prostate 

Glucose galactose 

malabsorption 

(GGM) 

SLC5A2 

SGLT2 

16pl2-pll 

Glucose 

Kidney, brain, liver, thyroid, muscle, 
heart, salivary gland 

Familial renal 
glucosuria (FRG) 

SLC5A3 

SMITl 

21 q22.12 

Myoinositol 

Brain, heart, kidney, lung 

Down’s syndrome 

SLC5A4 

SGLT3 

22ql2.2-ql2.3 

Glucose 

Small intestine, skeletal muscle 


SLC5A5 

NIS 

19pl3.2-pl2 

r, SeCN^, SCM-, 
C 103 ^, NOj^ 

Thyroid gland, salivary glands, gastric 
mucosa, lactating mammary gland 

Iodide transport 
defect (ITD) 

SLC5A6 

SMVT 

2p23 

Pantothenate, 
biotin, lipoate 

Ubiquitous 


SLC5A7 

CHT 

2 ql2 

Choline 

Putamen, medulla, spinal cord 


SLC5A8 

SMCTl 

12q23.1 

Short-chain fatty 
acids, lactate, 
pyruvate, 
nicotinate 

Intestine, kidney, brain, retina, thyroid 
gland 

Silenced in 
tumors of colon, 
thyroid 

SLC5A9 

SGLT4 

lp32 

Glucose, mannose 

Small intestine, kidney, liver, brain, 
lung, trachea, uterus, pancreas 


SLC5A10 

SGLT5 

17pll.2 

Mannose, fructose 

Kidney cortex 


SLC5A11 

SMIT2^ 

16pl2.1 

Myoinositol, d- 
chiro-inositol 

Brain, kidney, heart, skeletal muscle, 
spleen, liver, placenta, lung 


SLC5A12 

SMCT2 

llpl4.2 

Lactate, pyruvate, 
nicotinate 

Intestine, brain, retina, skeletal muscle 



“SMIT2 also known as SGLT6 


Structurally conserved, the sequence conservation 
between them is low. A group of seven central helices 
constitute the functional sugar and ion sites, with helix 
1 shared between the two sites making it paramount for 
the “communication” between them. This basic archi¬ 
tecture is shared by a number of transporters spanning 
different gene families with no sequence similarity, 
including LeuT of the neurotransmitter sodium 
symporter (NSS) family, BetP of the betaine/chlo- 
rine/camitine transporters (BCCT) family, Mhpl of 
the nucleobase-cation-symport-1 (NCSl) family, and 
AdiC of the basic amino acid/polyamine antiporter 
(APA) family (Fig. 3b) (► Membrane Transport Pro¬ 
teins; The Five-Helix Inverted Repeat Superfamily). 
The ligands for each of these transporters differ con¬ 
siderably in their chemistry, but the binding site loca¬ 
tion is well conserved, approximately halfway across 
the membrane bilayer in the center of the core domain. 
The vSGLT structures are in the inward-occluded and 
inward-open conformations, but structures of other 


transporters captured at different stages of the transport 
cycle show discrete conformational changes related to 
the alternating access mechanism (► Membrane 
Transport Proteins: The Five-Helix Inverted Repeat 
Superfamily). 

Sugar-Binding Site 

Galactose is bound at the center of vSGLT occluded 
from the extracellular and intracellular compartments 
by two highly conserved hydrophobic gates. Polar side 
chains from residues on TMl, TM2, TM6, TM7, and 
TMIO coordinate all OH-moieties of galactose and 
Y263 of the intracellular gate stacks with the pyranose 
ring. The opening and closure of these gates are deter¬ 
minants for sugar binding from the extracellular side 
followed by release to the intracellular side, consistent 
with an alternating access view of membrane transport 
(Jardetzky 1966). 
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Sodium-Substrate 
Symporter Family, 

Fig. 3 Structures of sodium 
symporters (Abramson and 
Wright 2009). (a) The two 
structural repeats in the core 
domain have an inverted 
topology where TM1-TM5 
are related to TM6-TM10 by 
an apparent twofold symmetry 
around an axis through the 
center of the membrane plane. 
A superposition of vSGLTs 
TM1-TM5 and inverted 
TM6-TM10 yields an RMSD 
of 3.9 A for Ca atoms, (b) 
Cylinder representation of 
vSGLT, BetP, LeuT, Mhpl, 
and AdiC structures at the 
same orientation. All four 
structures have the 10 helix 
core domain. TM1-TM5 and 
TM6-TM10 are colored red 
and blue, respectively, with 
additional helices on the N- 
terminus and C-terminus 
colored gray 




TMl-5 


vSGLT 


LeuT 



TM6-10 Overlay 


Mhpl 


Na'^-Binding Site and Substrate Release 
Mechanism 

Sodium not only provides the energy for the transport 
of monosaccharides in SGLT, it is also involved in the 
discrete steps associated with the weakening of the 
binding of galactose and its concomitant release to 
the intracellular milieu. In the inward-occluded con¬ 
formation, at the intersection of TMl and TM8, ^ 8 A 
from the sugar-binding site, Na"^ is coordinated by 
residues from both helices (162 and 165 from TMl, 
and A361 and S365 from TM8) (Fig. 4a). Additionally, 
galactose is coordinated by a number of residues, 
including N64 and E88, and its path toward the 
cytoplasm is blocked by Y263 (Fig. 4b), whereas 


in the substrate-free inward-open conformation, helix 
1 kinks ^ 13°, away from the Na"^ site, and N64 forms 
hydrogen bonds with both Y263 and E88 (Eig. 4c) 
(E88 in turn is hydrogen bonded to the substrate in 
the inward-occluded conformation). In addition to 
the static structures mentioned, molecular dynamics 
simulations showed that Na"^ is released after 9 ns 
from its binding site, followed by an opening of the 
pathway for galactose to the intracellular milieu, 
enabling galactose release after 110 ns. The consol¬ 
idation of the structural data and molecular dynam¬ 
ics simulations offers an attractive model, in which 
sodium release triggers the aforementioned confor¬ 
mational changes that ultimately lead to sugar 
release. 
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Sodium-Substrate Symporter Family, Fig. 4 (a) In the Na2 

site, Na"^ is coordinated by residues from TMl and 8 (in gray). 
Conformational changes in the inward-open structure (in colors) 
resulting in a 13° kink in the unwound segment of TMl prevent 
sodium coordination, (b) In the inward-occluded structure, 


Tyr263 blocks the exit of galactose through a hydrogen bond 
with Asn 64 on TMl (c) In the absence of galactose, the galac¬ 
tose-binding residue Asn 64 hydrogen bonds to Glu 88 and Tyr 
263, maintaining an open pathway from the intracellular space to 
the substrate-binding site 


Summary 

The SSS family of sodium-substrate symporters con¬ 
tains 12 human genes that are involved in the physiol¬ 
ogy and pathophysiology of organs such as kidney, 
intestine, heart, brain, and thyroid gland. The crystal 
structure of a bacterial membrane vSGLT has 
a common fold with six unrelated symporters and 
antiporters. We predict that these share a common 
mechanism of substrate transport. 

Cross-References 

► Membrane Transport Proteins: The Five-Helix 
Inverted Repeat Superfamily 

► Membrane Transport, Energetics and Overview 

► Mhpl, the Na'^-Hydantoin Membrane Transport 
Protein 
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Bernhard Brutscher 

Biomolecular NMR Spectroscopy group, Institut de 
Biologic Structurale - J.-P. Ebel, Grenoble, France 

Definition 

Band-Selective Optimized Flip Angle Short Transient 
(SOFAST) HMQC is an NMR technique optimized for 
fast and sensitive 2D heteronuclear or 

NMR spectroscopy. The main features of the SOFAST 
HMQC experiment are (1) the use of shaped 'H pulses 
that manipulate only a subset of proton spins in order to 
enhance their spin-lattice relaxation (Ti) via dipolar 
interactions with the unperturbed proton spins; and 
(2) Frnst angle excitation to achieve maximal sensitiv¬ 
ity for high repetition rates of the pulse sequence. 
SOFAST HMQC experiments are preferably applied 
either in the fast-pulsing regime, where the recycle 
delay is reduced to a few milliseconds in order to 
allow for 2D acquisition times of a few seconds, or in 
the optimal sensitivity regime, where the recycle delay 
and flip angles are adjusted to yield the highest possi¬ 
ble overall sensitivity. Sensitivity enhancements 
observed for SOFAST HMQC with respect to 
a common reference experiment are typically up to 
one order of magnitude if both experiments are run in 
the fast-pulsing regime, while sensitivity gains of 
30-100% are observed when both experiments have 
been individually optimized for maximum sensitivity. 
A number of variants of the SOFAST HMQC experi¬ 
ment have been proposed that are optimized for 
various purposes and experimental situations, for 
example, the standard SOFAST provides maximal sen¬ 
sitivity, the FTA-SOFAST should be used together 
with a fast-mixing device for kinetic measurements, 
the ST-SOFAST avoids probe heating due to 
heteronuclear decoupling, and an ultraSOFAST 
version allows recording of a 2D spectrum in a 
single scan. 

SOFAST HMQC has been developed in the context 
of biomolecular NMR applications where it provides 
new opportunities for high-throughput protein (or 
nucleic acid) screening, in vivo studies of protein mod¬ 
ifications or metabolomic changes, as well as real-time 
studies of molecular kinetics, such as protein folding. 


H/D exchange processes, and conformational transi¬ 
tions of regulatory RNAs or protein enzymes. It may 
also be useful for the study of other proton-rich 
macromolecule s. 

Cross-References 

► H/D Exchange 

► In-Cell NMR 

► Multidimensional NMR Spectroscopy 

► NMR 

► Nucleic Acid NMR - Introduction 

► Protein NMR - Introduction 


Solid-Phase Detection Techniques 

Jacob Piehler 

Fachbereich Biologic, University of Osnabriick, 
Osnabriick, Fower Saxony, Germany 

Synonyms 

Biosensors; Surface plasmon resonance 

Definition 

Methods for monitoring interactions with molecules 
immobilized on a solid support in real time. 

Introduction 

Solid-phase assays for detection of biomolecular inter¬ 
actions and for the assessment of binding affinities 
have a long tradition in biochemistry. These assays 
are based on immobilizing one interaction partner on 
the surface of a solid support (e.g., a polymer bead), 
followed by selectively capturing an interaction 
partner from solution. Interaction partners can be 
identified by methods such as Western blot or mass 
spectrometry, and (apparent) binding constants can be 
determined by quantitative detection such as enzyme- 
linked immunosorbent assay (EFISA). The key feature 
of these solid-phase assays is the simple removal of 





Solid-Phase Detection Techniques 


2409 


s 


non-bound species. Solid-phase detection techniques 
combine this simple separation method with the possi¬ 
bility to monitor the interaction of the analyte with an 
immobilized ligand in real time. For this purpose, 
a variety of physical transducer methods have been 
developed during the past two decades, many of 
which do not require additional spectroscopic labels 
on the interaction partner. Because interactions are 
monitored in real time, the ► kinetics of biomolecular 
interactions can be readily explored by solid-phase 
detection techniques (Fig. lb). Moreover, equilibrium 
constants can be readily determined even for transient 
interactions. 

The principle setup of solid-phase detection tech¬ 
niques is depicted in Fig. 1. Next to a rugged signal 
transducer for sensitive detection of molecules binding 
to the surface, sample handling by a (micro-) fluidic 
system plays an important role for obtaining clean 
binding curves with a minimum amount of sample. 
Moreover, label-free solid-phase detection of biomol¬ 
ecules requires biochemical engineering of the sub¬ 
strate surface for functional immobilization of 
biomolecules (in particular proteins) and for minimiz¬ 
ing nonspecific interactions. 

Detection Techniques 

A key challenge for solid-phase detection techniques is 
the selective detection of biomolecules bound to 
a substrate surface. For unbiased interaction kinetics 
(see below), reliable detection of a few pg/mm^ is 
required. This can be achieved without requiring addi¬ 
tional spectroscopic probes by a variety of optical and 
acoustic interrogation techniques. Optical solid-phase 
detection methods are fundamentally based on the dif¬ 
ference in the refractive index n between water («: 
1.333 at 589 nm) and organic molecules («: 1.4-1.5 
at 589 nm). Upon binding of biomolecules to the trans¬ 
ducer surface, the local refractive index at the surface 
increases, which affects the propagation of light. This 
can be detected by so-called microrefractometers, 
which selectively probe the optical properties close to 
the surface. Such surface-sensitive optical interroga¬ 
tion is achieved by the ► evanescent field, which is 
generated by illumination using total internal reflec¬ 
tion. Owing to the exponential decay of the electric 
field along the surface normal with a decay length (1/e) 
of 50-200 nm, the evanescent field is most sensitive to 


a 




Solid-Phase Detection Techniques, Fig. 1 (a) Principle and 
components of solid-phase detection techniques. Specific bind¬ 
ing of biomolecules from solution to an interaction partner 
immobilized into a biocompatible polymer interface is detected 
in real time by a signal transducer, (b) Typical binding curve 
providing information on the association kinetics, the equilib¬ 
rium response, and the dissociation kinetics 


changes in optical properties close to the surface. An 
evanescent field is created upon total internal reflection 
of light at a boundary to lower refractive index 
medium. A number of different optical transducer 
have been employed for sensitive detection of changes 
in the refractive index close to the surface (Gauglitz 
2005). Among these, surface plasmon resonance (SPR) 
is by far the most popular method (Tanious et al. 2008). 
This technique is based on excitation of a collective 
vibration of delocalized electrons in a thin metal film 
(typically 40-50 nm gold) on the surface of a dielectric 
support (typically glass). This surface plasmon can be 
excited by illumination with light in total internal 
reflection (Fig. 2a). 
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Solid-Phase Detection Techniques, Fig. 2 Basic principles 
of SPR. (a) Upon total internal reflection illumination, a surface 
plasmon is excited in a thin gold layer at a characteristic reso¬ 
nance angle. The electric field of the surface plasmon (shown in 

At a constant wavelength, the efficiency of excita¬ 
tion depends on the incidence angle. At the resonance 
angle, a maximum absorbance of the total internally 
reflected light beam is observed. Changes in the refrac¬ 
tive index upon binding of molecules to the surface of 
the gold layer change the resonance angle. Thus, bind¬ 
ing of molecules to the substrate surface can be 
followed determining the resonance angle for surface 
plasmon excitation. The basic approach for measuring 
SPR is depicted in Fig. 2. By illumination through 
a focusing lens and detection of the reflected light on 
a CCD array, a spectrum of different incidence angles 
can be measured simultaneously without moving parts. 
Thus, the resonance angle can be determined rapidly 
with possible time resolutions in the millisecond 
regime. Parallel SPR detection within a line of mea¬ 
surement spot is readily possible by employing 
a 2-dimensional CCD array. In recent years, various 
approaches for SPR imaging have been established for 
label-free solid-phase detection in a highly 
multiplexed array format (Scarano et al. 2010). Other 
optical techniques for label-free detection include grat¬ 
ing couplers, frustrated total internal reflection (reso¬ 
nant mirror), and photonic crystals. Next to this broad 
spectrum of optical methods for label-free detection 
(Gauglitz 2005), also acoustic methods are available 
for monitoring protein binding to surfaces. Among 
these, the quartz crystal microbalance (QCM) is the 
most prominent technique, which detects changes in 
the resonance frequency of a piezoelectric crystal upon 
surface binding. By probing both the resonance fre¬ 
quency and the heat dissipation (QCM-D), more 


b 



red) decays exponentially from the surface of the gold layer into 
the adjacent medium (buffer), (b) Upon binding of biomolecules 
in proximity of the surface, the increase in the refractive index 
change leads to a shift in the resonance incidence angle 

information on the molecular surface architecture can 
be obtained (Ferreira et al. 2009). 

Signals of both optical and acoustical transducers 
increase proportional to the mass of material adsorbed 
to the surface. For this reason, these techniques are 
frequently referred to as “mass-sensitive.” Typical 
detection limits, e.g., for SPR are 1-10 pg/mm^, 
which corresponds to ^l%o of a protein monolayer. 
The sensitivity in terms of detectable molecules per 
surface area depends on the molecular mass of the 
analyte. Macromolecules such as proteins are readily 
detectable with these techniques, while the detection of 
small molecules is much more challenging. Typically, 
background signals associated to changes in the bulk 
refractive index during injection of the sample rather 
than the detector noise define the detection limit. This 
is particularly critical for transient, low-affinity inter¬ 
actions, because high analyte concentration is 
required, while the binding signal is difficult to dis¬ 
criminate from the background. Based on background 
referencing using multichannel detection systems, the 
interaction of small molecules with immobilized pro¬ 
teins can be detected (Karlsson 2004). 

Sample Handling 

Key benefits of solid-phase binding assays arise from 
the simple and versatile sample handling. Since very 
small quantities of proteins are bound to the surface, 
minimum amounts of sample are required. This is 
a key advantage for biomolecular interaction analysis, 
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samples 



Solid-Phase Detection Techniques, Fig. 3 Sample handling 
in a flow-through system, (a) Typical setup for a single channel 
flow through system. Two syringe pumps independently control 
sample loading and buffer flow, respectively. By using an injec¬ 
tion valve and a sample loop, the sample is injected into the flow 


chamber, into which the transducer is mounted, (b) Ideal rect¬ 
angular concentration profile (black line) and realistic concen¬ 
tration (red line) profile involving a delay volume and a finite 
rise time 


since often only very small quantities of the samples 
are available. For quantitative kinetic analysis, the 
concentration of the analyte and the diffusional mass 
transport to the surface should be kept constant over 
a defined period of time. For this purpose, a flow¬ 
through format has proven most suitable. Typically, 
the sample is injected into a constant buffer stream 
using an injection valve (Fig. 3), thus minimizing 
signal perturbations due to changes in the flow rate. 
Rectangular concentration profiles within the flow cell 
(Fig. 3) are desired for reliable kinetic analysis. Flow- 
ever, on the way into the flow cell, the injected sample 
is mixed with the running buffer. Therefore, a certain 
fraction of the sample needs to be pumped through the 
flow cell until the final concentration is reached at the 
detection area. These rise times within the flow cell 
ultimately limit the time resolution of solid-phase 
detection methods. Low rise times can be reduced by 
increasing the flow velocity. For a given sample vol¬ 
ume, however, the total contact time decreases with the 
sample decreases with increasing flow velocity. Next 
to maintaining a constant concentration of the analyte, 
the flow-through format ensures that the diffusion 
layer between sample and surface stays constant over 
time. For optimum mass transport to the transducer 
surface, a minimum thickness of the diffusion layer is 
desired, which can be achieved by high flow velocities. 
Thus, high flow velocities are ideal both for optimum 
concentration profiles and for optimum mass transport. 
In order to minimize sample consumption, 
microfluidics is applied with flow cells of less than 


100 nl volume, which allows binding experiments 
with sample volumes of a few 10 pi. Microfluidics 
furthermore enables for parallel addressing multiple 
detection spots with different samples. Label-free 
solid-phase detection systems with multiplexed 
addressing of several dozens of spots are already com¬ 
mercially available. Even for probing the interaction 
with a single immobilized ligand, microfluidics pro¬ 
viding a reference channel without the ligand allows 
for efficient measurement of background signals (such 
as bulk refractive index and/or nonspecific binding), 
which are required for correction of the binding curve. 

Surface Modification 

While the solid-phase assay format offers substantial 
advantages for detection and samples handling, it also 
brings several challenges. The bare surface of the 
transducer material nonspecifically interacts with pro¬ 
teins and other biomolecules, which cannot readily be 
discriminated from specific interaction of the analyte 
with the immobilized ligand. Moreover, proteins 
directly adsorbed to the transducer surface denature 
due to multiple interactions with the solid support 
and thus lose their capability to recognize interaction 
partners. For this reason, surface modification with 
indifferent hydrophilic polymers is required for 
rendering the transducer biocompatible and shield 
against nonspecific interactions. Moreover, chemical 
or biochemical functional groups for directed 
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immobilization of proteins and other biomolecules are 
required. Polymeric carbohydrates such as dextrans or 
alginates as well as poly(ethylene glycol) (PEG) have 
been successfully employed for surface modihcation. 
Dextran layers form extended hydrogels ensuring good 
accessibility of immobilized biomolecules. Moreover, 
higher surface concentrations of immobilized proteins 
can be obtained in these hydrogel layers compared to 
a rigid 2-dimensional assembly. In some cases, how¬ 
ever, interactions can be affected by hindered diffusion 
within dextran hydrogels. In such cases, a planar pre¬ 
sentation of the immobilized interaction partner on 
PEG polymer brush is preferable. Next to polymers, 
also lipid layers very effectively shield against 
nonspecific binding. By suitable surface modification, 
spontaneous spreading of lipid vesicles into mono- or 
bilayers can be achieved. These approaches are partic¬ 
ular attractive for exploring the function of proteins in 
the context of a membrane environment. 

Biomolecules can be immobilized into such poly¬ 
meric surface architectures either by direct covalent 
attachment or by functional groups for non-covalent 
capturing. Typically, covalent coupling chemistries 
are directed toward amine or thiol groups, which are 
often available on the surface of proteins. Coupling 
biomolecules through thiol moieties is more efficient 
and selective, while amine-specific coupling is 
more generic. For coupling proteins through amine 
groups, surfaces functionalized with carboxylate 
groups are employed, which are pre-activated as 
N-hydroxysuccinimide esters for spontaneous forma¬ 
tion of amide bonds. In hydrogel-based surface 
architectures, electrostatic preconcentration can be 
exploited for increasing the efficiency of protein cou¬ 
pling at low concentrations. However, protein immo¬ 
bilization by direct coupling through thiol or amine 
residues may severely affect the interaction with target 
molecules in case the residues are in or close to the 
binding site. Moreover, the stability of proteins 
immobilized on surfaces is limited, and covalently 
attached molecules cannot be exchanged. Because 
high and homogeneous functionality of the 
immobilized protein is required for quantitative anal¬ 
ysis, immobilization by non-covalent capturing of pro¬ 
teins through affinity tags often is more suitable. This 
can be achieved with antibodies, which are very robust 
proteins with long lifetimes on the surface. Antibodies 
are readily immobilized by amine-specific coupling 
chemistry and often bind target proteins and peptide 


tags with very high affinity. In many cases, proteins 
captured by antibodies can be removed by relatively 
harsh conditions without affecting the functional integ¬ 
rity of the immobilized antibody. Thus, repetitive 
cycles of protein immobilization and surface regener¬ 
ation are possible without the need to exchange the 
transducer slide. Suitable antibodies against affinity 
tags of typical fusion proteins are commercially avail¬ 
able. Other, reversible protein immobilization 
approaches make use of the oligohistidine tag, which 
is the most popular affinity tag for preparative purifi¬ 
cation of recombinant protein. By surface functiona¬ 
lization with metal chelators for immobilizing 
transition metal ions, His-tagged proteins are readily 
captured to the surface, which can be selectively eluted 
with a competitor such as imidazole. Next to site- 
specific and reversible immobilization, non-covalent 
protein immobilization enables to selectively capture 
proteins from crude samples matrices - e.g., directly 
from cell culture supernatants or even cell lysates. 
A critical issue in non-covalent immobilization 
schemes, however, is a slow dissociation of the 
immobilized protein, which may bias data analysis. 
Also, the binding stability of such capturing systems 
depends on the experimental conditions, i.e., tempera¬ 
ture, pH, ionic strength, redox potential, as well as 
transition metal ions or chelating compounds in the 
buffer. 

For the immobilization of membrane proteins on 
solid supports, lipid-based surface architectures have 
been developed. Peripheral membrane proteins can 
be readily adsorbed to lipid mono- or bilayers, which 
are directly fused onto the transducer surface. For 
capturing vesicles containing integral membrane pro¬ 
teins, polymer cushions functionalized with fatty acid 
chains can be used. More oriented organization of 
integral membrane proteins on solid support has 
been obtained by site-specific protein capturing 
followed by reconstitution into lipid membranes on 
the chip. 

Binding Assays and Data Analysis 

Solid-phase detection techniques provide a variety of 
assays for quantitative interaction analysis. Most 
prominently, by monitoring biomolecular interactions 
in real time, the rate constants of association (^on) and 
dissociation (koff) can be determined from such 
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binding curves (Karlsson et al. 1991). For a typical 
reversible bimolecular interaction 

^on 

A-bB AR 

^off 

with A immobilized on the surface and B as free 
analyte in the injected sample, perfect pseudo-first- 
order kinetics can be assumed due to the flow-through 
format: the analyte concentration (cg) is maintained 
constant throughout the injection period, independent 
on the absolute concentration of the analyte and the 
surface density of the immobilized interaction partner. 
Another important feature of solid-phase detection is 
the possibility to directly monitor the dissociation of 
complexes during rinsing with running buffer. Thus, 
the concentration of free analyte is maintained at zero, 
and the dissociation kinetics can he treated as an 
irreversible first-order reaction. By fitting 
a monoexponential decay to the dissociation phase, 
the dissociation rate constant can be determined very 
reliably. From the association phase, the apparent rate 
constant = ko„ x Cg + k^ff can he determined. Since 
koff can be determined independently from the disso¬ 
ciation phase, kon can be determined from a single 
binding experiment. However, in order to confirm the 
validity of the applied model, binding curves at differ¬ 
ent analyte concentrations are required. Thus k^n and 
koff can be independently determined from binding 
curves measured under different conditions. Ideally, 
global fitting of multiple binding curves is applied for 
determination of a single k^^ and koff respectively, from 
a set of binding curves. Next to a more reliably deter¬ 
mination of the rate constants, concentration- 
dependent binding curves can be used for determina¬ 
tion of the equilihrium constant of the interaction from 
the equilibrium surface loading using Scatchard plot 
analysis. Since the equilibrium surface loading is 
obtained from fitting of the association kinetics, equi¬ 
librium constants can be determined even for slow 
interactions, which do not reach equilibrium during 
the injection period. Thus, equilibrium constants can 
be determined independently from the kinetic rate 
constants and from the equilibrium surface loading. 

Solid-phase detection techniques are useful for 
exploring interactions in multiprotein complexes by 
sequential injection of different interaction partners. 
Because of the mass sensitivity, the stoichiometry of 


complexes can be inferred from the relative binding 
signals under saturating conditions. Cooperative bind¬ 
ing of multiple interaction partners can be explored, 
though mass-sensitive detection does not allow to dis¬ 
criminate different interaction partners. Moreover, the 
equilibrium constants of interactions in solution can be 
readily probed by binding inhibition assays (Piehler 
et al. 1997). While the solid-phase format provides 
versatile assay formats, it also causes some fundamen¬ 
tal limitations: prior to the interaction on the surface, 
analyte molecules need to transfer across a diffusion 
layer, which is depleted due to binding to the surface 
(Fig. 4a). For unbiased detection of interaction kinet¬ 
ics, diffusional mass transport to the surface must 
be substantially faster than the association kinetics 
(Glaser 1993). In case of hiomolecular interactions 
involving macromolecules and high association rate 
constants, often binding curves are observed, which 
are biased by mass transport limitation. While the 
association rate constant is responsible for mass trans¬ 
port limitations both the association and the dissocia¬ 
tion kinetics are affected. Optimized fluidics with 
high flow velocities is required to optimize mass 
transport to the surface (see above). Moreover, 
minimum amounts of immobilized ligand should be 
applied in order to minimize sample depletion at the 
interface. By fitting kinetic models including a mass 
transfer rate, kinetic rate constants can he determined 
even for binding curves biased by mass transport 
limitation. 

Summary and Outlook 

Label-free solid-phase detection techniques are partic¬ 
ularly useful for quantitative protein interaction anal¬ 
ysis. Suitable surface modification and coupling 
chemistries are required for functional protein immo¬ 
bilization and for minimizing nonspecihc interactions. 
With optimized surface architectures, protein captur¬ 
ing as well as interaction analysis from complex matri¬ 
ces is possible. Not only equilibrium binding 
constants, but also kinetic rate constants of protein 
complexes can be determined. By sensitive, parallel 
detection and suitable sample handling by 
microfluidics, versatile and multiplexed quantitative 
interaction analysis is possible. Moreover, interaction 
partners captured by the immobilized ligand can be 
further analyzed by mass spectrometry. However, 
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time [s] 

Solid-Phase Detection Techniques, Fig. 4 Solid-phase 
binding assays in a flow-through format, (a) Binding of analyte 
molecules to an immobilized ligand involves diffusion through 
diffusion layer, (b) Analyte binding at different concentrations to 
an immobilized ligand 


conditions for ligand immobilization and assay format 
need to be optimized for extracting reliable quantita¬ 
tive data. For this reason, solid-phase detection tech¬ 
niques are well suitable for quantitative screening of 
interaction with defined protein ensembles such as 
mutant libraries, but less so for proteome-wide 
interactomics. 


Cross-References 

► Evanescent Wave Microscopy 

► Kinetics: Overview 


References 

Ferreira GN, da-Silva AC, Tome B. Acoustic wave biosensors: 
physical models and biological applications of quartz crystal 
microbalance. Trends Biotechnol. 2009;27:689-97. 

Gauglitz G. Direct optical sensors: principles and selected appli¬ 
cations. Anal Bioanal Chem. 2005;381:141-55. 

Glaser RW. Antigen-antibody binding and mass transport by 
convection and diffusion to a surface: a two-dimensional 
computer model of binding and dissociation kinetics. Anal 
Biochem. 1993;213:152-61. 

Karlsson R. SPR for molecular interaction analysis: a review 
of emerging application areas. J Mol Recognit. 2004;17: 
151-61. 

Karlsson R, Michaelsson A, Mattsson L. Kinetic analysis of 
monoclonal antibody-antigen interactions with a new bio¬ 
sensor based analytical system. J Immunol Methods. 
1991;145:229M0. 

Piehler J, Brecht A, Giersch T, Hock B, Gauglitz G. Assessment 
of affinity constants by rapid solid phase detection of equi¬ 
librium binding in a flow system. J Immunol Methods. 
1997;201:189-206. 

Scarano S, Mascini M, Turner AP, Minunni M. Surface plasmon 
resonance imaging for affinity-based biosensors. Biosens 
Bioelectron. 2010;25:957-66. 

Tanious FA, Nguyen B, Wilson WD. Biosensor-surface plasmon 
resonance methods for quantitative analysis of biomolecular 
interactions. Methods Cell Biol. 2008;84:53-77. 


Solid-State NMR 

► J-Based NMR Correlation Spectroscopy of Biolog¬ 
ical Solids 

► M2 Proton Channel from Influenza A: Example of 
Structural Sensitivity to Environment 

► Magic Angle Spinning Solid-State NMR on Proteins 

► Membrane Proteins in Phospholipid Bilayers: 
Structure Determination by Solid-State NMR 

► Oriented-Sample NMR of Membrane Proteins: Sen¬ 
sitivity Enhancement and Spectroscopic Assignment 

► Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy 

► Solid-State NMR Structural Studies of Outer Mem¬ 
brane Proteins 






























Solid-State NMR Structural Studies of Outer Membrane Proteins 


2415 


s 


Solid-State NMR Structural Studies of 
Outer Membrane Proteins 

Francesca M. Marassi, Yi Ding and Yong Yao 
Sanford Burnham Medical Research Institute, La Jolla, 
CA, USA 

Synonyms 

Solid-state NMR 

Definition 

Outer membrane proteins are found embedded in the 
outer membranes of bacteria and mitochondria. 

Phospholipids are the principal components of bio¬ 
logical membranes; they are composed of a polar 
headgroup linked via a phosphodiester group to hydro¬ 
carbon chains. 

Introduction 

The membrane proteins found in the outer membranes 
of bacteria and mitochondria form a major functional 
class, with prominent roles in molecular transport, 
cellular homeostasis, and bacterial pathogenesis. 
Since these proteins form the first line of communica¬ 
tion between cells or mitochondria and the external 
environment, they are important targets for structure 
determination and structure-based drug design. Solid- 
state NMR spectroscopy has made important contribu¬ 
tions in this area. 

Structural Biology of Outer Membrane 
Proteins 

Several outer membrane proteins have had their struc¬ 
tures determined by ► x-ray crystallography or ► solu¬ 
tion NMR spectroscopy. With very few exceptions, 
they adopt transmembrane p-barrel architectures 
(Schulz 2002), thus satisfying the requirements for 
membrane insertion: all backbone ► hydrogen bonds 
are compensated and buried in the hydrophobic core of 
the phospholipid bilayer, nonpolar side chains point 


toward the membrane, while hydrophilic side chains 
point toward the polar barrel interior or the membrane 
exterior. The P-barrel architecture of outer membrane 
proteins is described by the number of P-strands - 
typically between 8 and 22 - and the average inclina¬ 
tion angle of the P-strands relative to the barrel axis. 
Many outer membrane proteins can be produced 
recombinantly into inclusion bodies and refolded into 
detergents or phospholipids to reconstitute the native 
conformation and function for structural, biophysical, 
and biochemical studies. 

Although x-ray crystallography and solution NMR 
spectroscopy have provided important information 
about outer membrane protein structure they cannot 
provide information about the intimate relationship 
that exists between membrane proteins and the sur¬ 
rounding phospholipids. This is because in both these 
cases the proteins are removed from their natural mem¬ 
brane environment and placed either in a crystalline 
lattice or a detergent solution. Since it is increasingly 
well recognized that the surrounding phospholipid 
bilayer is essential for preserving the native structure 
and function of integral membrane proteins, obtaining 
structural information in this environment is an impor¬ 
tant goal of structural biology. 

Advantages of Solid-State NMR 
Spectroscopy 

The principal advantage of ► solid-state NMR spec¬ 
troscopy is that it can determine structures of integral 
membrane proteins in phospholipid bilayer mem¬ 
branes under physiological conditions, without the 
need for protein modifications, detergents, very low 
temperatures, or other factors that have the potential of 
distorting protein structure, dynamics, and activity. 
Furthermore, the spatial orientation adopted by mem¬ 
brane proteins in cell membranes is an intrinsic compo¬ 
nent of their biological structure and function. Such 
directional information is lost in both the crystalline 
and micelle samples used for structure determination 
by X-ray diffraction and solution NMR spectroscopy. 
However, it is retained in samples where the proteins are 
embedded in lipid bilayer membranes, such as those 
suitable for solid-state NMR spectroscopy. 

The two principal solid-state NMR techniques, OS 
(oriented sample) solid-state NMR and MAS (magic 
angle spinning) solid-state NMR, each have distinct 
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Solid-State NMR Structural Studies of Outer Membrane 
Proteins, Fig. 1 OS solid-state NMR spectra and membrane 
orientation of E. coli OmpX. (a, b) One-dimensional chem¬ 
ical shift spectra and two-dimensional 'H-*^N separated local 
field spectra obtained for uniformly (u) or selectively (F) '^N- 
labeled OmpX embedded in phospholipid bilayers, doped with 
Yb^’"^ to induce a perpendicular orientation of the bilayer plane 
relative to the magnetic field. Assigned resonances provide 
DC and CSA orientafion restraints for structure 


refinement. Spectra were recorded on a 500 MHz Bruker spec¬ 
trometer with 4-8 mg of ’^N labeled protein embedded 
in ^47 mg of 14-O-PC (l,2-0-ditetradecyl-in-glycero-3- 
phosphocholine) plus ~9 mg of 6-0-PC (1,2-0-dihexyl-in- 
glycero-3-phosphocholine). (c) Membrane orientation of 
OmpX derived from the experimental data. The P-barrel axis, 
h, is tilted ~7° from the membrane normal, n. Amide N atoms 
are shown as red spheres for membrane-embedded Phe residues 
and black spheres for water-exposed Phe residues 


advantages: the ability to measure accurate orientation 
restraints for structure determination (OS), and the 
ability to resolve and assign multiple peaks through 
the use of labeled samples (MAS). These 

advantages can be combined in a joint OS-MAS 
approach that has been used to determine the structures 
of helical membrane proteins in phospholipid lipo¬ 
somes (Das et al. 2012). 

Solid-State NMR Structural Studies of Outer 
Membrane Proteins 

OS solid-state NMR has been used to characterize the 
structure of the outer membrane protein OmpX from 
Escherichia coli in planar phospholipid bilayers that 
were macroscopically oriented in the magnetic field 
(Mahalakshmi and Marassi 2008). Homologues of 
OmpX in the human pathogens Yersinia, Enterohacter, 
Klehisiella, and Salmonella are important outer mem¬ 
brane virulence factors that regulate the ability of 
pathogenic Gram-negative bacteria to resist the 
human host defense system. The structure of OmpX 
has been determined by both X-ray crystallography 


(Schulz 2002) and solution NMR (Fernandez et al. 
2004). It forms an eight-stranded P-barrel with 
a four-stranded /?-sheet that protrudes outside the 
membrane where it is thought to mediate host cell 
adhesion and invasion during pathogenesis. 

The NMR spectra obtained from selectively 
*^N-Phe labeled OmpX enabled values of the ’H-*^N 
dipolar coupling (DC) and *^N chemical shift anisot¬ 
ropy (CSA) to be measured and assigned to specific 
molecular sites (Fig. la-c). These provide accurate 
orientation restraints that enable the transmembrane 
orientation of the OmpX P-barrel to be experimentally 
determined (Mahalakshmi and Marassi 2008). OmpX 
traverses the membrane with a ^7° tilt, matching the 
hydrophobic thickness of the P-barrel with that of the 
lipid bilayer (Fig. Id). 

High-resolution MAS solid-state NMR spectra can 
be obtained for both E. coli OmpX and the homologous 
outer membrane protein Ail from Y. pestis, the etio- 
logic agent of plague (Fig. 2). The correlation 

MAS spectra of OmpX and Ail in proteoliposomes 
show several resolved peaks; Ail has four Thr, and 
four signals are observed with shifts in the 
expected region (Fig. 2b). In both spectra, peaks from 
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Solid-State NMR Structural Studies of Outer Membrane 
Proteins, Fig. 2 MAS solid-state NMR spectra of uniform 
labeled OmpX and Ail in phospholipid 
proteoliposomes. (a, b) coiTelation spectra showing 

the entire range of *^C shifts in the proteins, (c) One-dimensional 


’^C and '^N spectra of OmpX. Spectral regions containing peaks 
from Ala (A), He (I), Ser (S), and Thr (T) residues are in red 
boxes. Spectra were obtained on a 700 MHz Bruker spectrome¬ 
ter with 2-A mg of '^C/'^N-labeled protein in ~20 mg of 
14-0-PC 


Ala, lie, Ser, and Thr populate the regions expected for 
P-sheet conformation. The measurement of 
rotationally averaged powder patterns, obtained by 
recoupling '^CA CSA, CSA, DC, and 

DC in *^C-detected three-dimensional sepa¬ 
rated local field (SLF) experiments (Das et al. 2012; 
Tycko 2011) provide orientation-dependent frequen¬ 
cies that can be used as restraints in structure 
determination. 

MAS solid-state NMR has been used to characterize 
a number of outer membrane proteins of bacterial and 
mammalian origin. Motional-hltered two-dimensional 
solid-state NMR experiments have provided structural 
and dynamics information about the E. coli outer mem¬ 
brane protein BamA, a major component of the bacte¬ 
rial outer membrane protein assembly machinery 
(Renault et al. 201 1). 

Whole cell preparations, as well as isolated mem¬ 
brane fractions and reconstituted proteoliposomes 
were used with MAS to obtain structural information, 
based on chemical shifts, about PagL from Pseudomo¬ 
nas aeruginosa, an enzyme integral to the bacterial 
outer membrane that removes a fatty acyl chain from 
lipopolysaccharide (Renault et al. 2012). 

High-resolution MAS solid-state NMR ^H-'^C cor¬ 
relation spectra have been reported for the uniformly 
^H/*^C labeled outer membrane porin OmpG which 
facilitates sugar uptake by E. coli. These results 


demonstrate the feasibility of structural studies for 
this protein (Lalli et al. 2011). 

MAS solid-state NMR has been used to provide 
structural and functional information about the volt- 
age-dependent anion channel (VDAC), which is found 
in mammalian mitochondrial outer membranes where 
it constitutes the major pathway for the transport of 
metabolites and is a major regulator of mitochondrial 
homeostasis (Bauer et al. 2011; Schneider et al. 2010). 
A combination of isotopic labeling schemes enabled 
resonances from the VDAC N-terminal ► a-helix to be 
assigned. The resulting experimental chemical shifts 
enabled a structural model to be generated for this 
region of the protein incorporated in phospholipid 
bilayers. The data argue against conformational flexi¬ 
bility, which was proposed to exist in this region of the 
protein embedded in detergent micelles highlighting 
the importance of performing structural studies in the 
natural membrane environment (Schneider et al. 
2010 ). 

MAS solid-state NMR experiments were also used 
to obtain information about the interaction between 
human VDAC and erastin, a small molecule that is 
selectively lethal to certain tumor cells. Comparison 
*^C-^^C correlation spectra obtained in the absence or 
presence of erastin suggests that erastin may interact 
specifically with the center of the P-barrel pore of 
VDAC (Bauer et al. 2011). 
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Summary 

Solid-state NMR spectroscopy is capable of providing 
information about the structure and dynamics of inte¬ 
gral membrane proteins including those found in the 
outer membranes of bacteria and mitochondria. The 
principal advantage of this technique is that it can be 
performed with samples that closely match the native 
membrane environments of the proteins, and thus pro¬ 
vide useful insights about the intimate relationship 
between protein and lipids. 

Cross-References 

► J-Based NMR Correlation Spectroscopy of 
Biological Solids 

► M2 Proton Channel from Influenza A: Example of 
Structural Sensitivity to Environment 

► Magic Angle Spinning Solid-State NMR on Proteins 

► Membrane Proteins in Phospholipid Bilayers: 
Structure Determination by Solid-State NMR 

► Oriented-Sample NMR of Membrane Proteins: 
Sensitivity Enhancement and Spectroscopic 
Assignment 

► Rhodopsin Activation Based on Solid-State NMR 
Spectroscopy 
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► Absorbance Spectroscopy: Spectral Artifacts and 
Other Sources of Error 


Sparse Sampling in NMR 

Wiktor Kozmihski 

Faculty of Chemistry, University of Warsaw, 
Warsaw, Poland 

Synonyms 

Nonuniform sampling 


Definition 

The continuous signals have to be sampled discretely 
at specific time points. The conventional sampling of 
oscillatory signal requires the constant interval acqui¬ 
sition with at least two data points per period of the 
fastest oscillation. This rule, known as Nyquist theo¬ 
rem, limits the distance between sampling points and, 
therefore, maximum achievable sampling time for 
a given number of points. This, in turn, may result in 
the insufficient resolution in the frequency domain. 
The sampling limitation becomes important in 
multidimensional NMR, where the total number of 
points is the product of their numbers in each dimen¬ 
sion separately. As a result, the time required for 
acquisition of 3-4D NMR experiments might be pro¬ 
hibitively long. Both experiment duration and desired 
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resolution can be optimized by the use of sparse sam¬ 
pling. The trivial method of sparse sampling is trunca¬ 
tion of a signal, usually followed by extrapolation. The 
sparse sampling can also be applied in order to extend 
the sampled space in several ways. Among them, two 
are of the particular importance: sampling at constant 
intervals, but along the radius in a time domain, or 
randomly. The first option is utilized in projection 
spectroscopy, and requires the algebraic decoding 
of peak frequencies, or the reconstruction of 
multidimensional spectrum. The second (random sam¬ 
pling) enables to obtain of a fully dimensional 
spectrum featuring improved resolution or acquired 
faster than conventionally. The application of sparse 
sampling causes the presence of spectral artifacts. 
They appear in the regular form (e.g., ridges, rings) 
for the regular sampling, and resemble noise for the 
case of random sampling. The sparsely and randomly 
sampled data sets need to be processed using the Discrete 
Fourier Transformation (DFT), or reconstructed using 
maximum entropy. Multidimensional Decomposition 
or Compress sensing methods. The reconstruction 
methods allow one to obtain spectra quite similar to 
conventional ones, but in signihcantly reduced time. 
The DFT is far less computationally demanding; 
however, in the presence of intensive peaks the weaker 
ones could be obscured by artifacts. The important 
feature of off-grid random sampling and DPT processing 
is independence of artifact intensity on the degree of 
sparseness, and therefore, it makes feasible experiments 
of very high dimensionality and resolution. The sparse 
sampling is employed also in MRI. 

Cross-References 

► Maximum Entropy Reconstruction 

► Multidimensional NMR Spectroscopy 

► NMR 

► NMR Methods for Kinetic Analysis 

► Triple Resonance NMR 


Spatiotemporal Organization of Cyclic 
Nucleotides 

► Ion Channels: New Tools to Track Cyclic Nucleo¬ 
tide Changes in Living Cells 


Spectral Simulations 

► Simulation of Spin-Label EPR Spectra 

► Simulation of Transition-Metal EPR Spectra 


Spectrophotometric Error 

► Absorbance Spectroscopy: Spectral Artifacts and 
Other Sources of Error 


Spectroscopic Assignment 

► Oriented-Sample NMR of Membrane Proteins: Sen¬ 
sitivity Enhancement and Spectroscopic Assignment 

► Protein NMR Resonance Assignment 


Spectroscopy of Chiral Molecules 

► Circular Dichroism Signals: Qualitative Description 
of Origins 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 


Spectroscopy of Oriented Systems 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 


Sphingolipids and Gangliosides 

Thomas Kolter 

LiMES - Program Unit Membrane Biology & Lipid 
Biochemistry, Universitat Bonn, Bonn, Germany 

Definition 

Sphingolipids contain a sphingoid base within their 
structure. Gangliosides are sialic-acid-containing 
glycosphingolipids. 
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Introduction 

Sphingolipids are characteristic eukaryotic lipids 
which - with few exceptions - do not occur in bacteria. 
The name “sphingolipids” was introduced by 
H. E. Carter (1947) and referred to a lipid class that 
contains a sphingoid base. Today, sphingolipids repre¬ 
sent one of the categories into which lipids have been 
divided, together with fatty acyls (► Fatty Acids, 
Alkanols, and Diacylglycerols), glycerolipids 
(► Glycerolipids: Chemistry), glycerophospholipids, 
sterol lipids, prenol lipids, saccharolipids, and 
polyketides. Sphingolipids themselves have been classi¬ 
fied into several major classes: sphingoid bases, 
ceramides, phosphosphingolipids, phosphonosphin- 
golipids, neutral glycosphingolipids, acidic glycosphin- 
golipids, basic glycosphingolipids, amphoteric 
glycosphingolipids, arsenosphingolipids, and others 
(Fahy et al. 2005). 



OH 

Sphingosine 



OH 

Sphinganine 


NH 2 OH 

(CHaligCHa 

OH 

D-ribo-Phytosphingosine 

Sphingolipids and Gangliosides, Fig. 1 Representative 
structures of sphingoid bases 


Structures 

Sphingoid Bases 

The name sphingoid base (from “sphinx”) was coined 
by J. L. W. Thudichum in 1884 for long chain bases, 
the characteristic building blocks of sphingolipids. 
More than 60 different sphingoid bases have been 
characterized from animals, plants, and microorgan¬ 
isms, mostly as structural component of sphingolipids 
and glycosphingolipids. The by far most abundant 
member in mammals is D-eryt/tro-sphingosine 
(sphing-4-enine) with a chain length of 18 or 20 carbon 
atoms, followed by sphinganine and D-ribo- 
phytosphingosine (Fig. 1). Alkyl chain lengths may 
vary in dependence of cell type and organism and are 
usually between 14 and 22 carbon atoms. Additional 
double bonds, hydroxyl-, or methyl groups might also 
be present. In mammals, the free bases themselves 
occur only in trace amounts (1-10 nmol/g wet weight). 
Additional sphingoid bases are found in plants, such 
as (£)- and (Z)-sphing-8-enine and (4E,8E)- and 
(4£,82)-sphinga-4,8-dienine (Pata et al. 2010). 

Derivatives of sphingoid bases are the sphingoid 
base 1-phosphates, lysosphingomyelins, lysoglyco- 
sphingolipids, A-methylated sphingoid bases, and 
others (Fig. 2). Sphingosine is the precursor of 
sphingosine-1-phosphate (Fyrst and Saba 2010), 
which is an important primary and second messenger 
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(CH2)i2CH3 
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Sphingosylphosphorylcholine 
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Sphingolipids and Gangliosides, Fig. 2 Selected lysosphin- 
golipids. Structures of representative members are shown 


in vertebrates, insects, and plants. It is formed in 
response to interferon, tumor necrosis factor a, or 
other extracellular stimuli by phosphorylation of sphin¬ 
gosine. Sphingosine-1-phosphate can be transported out 
of the cell by members of the ABC transporter family 
and bind to the G-protein-coupled receptors S IPl-S 1P5 
on cell surfaces. The sphingosine-1-phosphate signal is 
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Ceramide (N-Acyl-sphingosine) 


O 



O ,NH 

Acylceramide ("Ceramide 1" or "Cer EOS" of the human skin) 
Sphingolipids and Gangliosides, Fig. 3 Structures of two molecular ceramide species 


terminated by phosphatase-mediated cleavage of the 
phosphate moiety, or by a lyase-meditated, pyridoxal 
phosphate-dependent cleavage of the sphingoid back¬ 
bone between C2 and C3. Sphingosine-l-phosphate is 
an autocrinic and paracrinic regulator with intracel¬ 
lular and extracellular effects. Within a cell, it antag¬ 
onizes ceramide-mediated effects, acts as a mitogen 
and inhibits apoptosis, and regulates calcium ion 
homeostasis and cell growth. 

Sphingosylphosphorylcholine (Nixon et al. 2008) is 
a mediator substance much less characterized than 
sphingosine-l-phosphate. It belongs to the so-called 
lysosphingolipids, which lack the amide-bound fatty 
acid usually found in sphingolipids bearing carbohy¬ 
drate or phosphorylcholine head groups. In patients 
with defects of sphingolipid and glycosphingolipid 
degradation, cytotoxic lysosphingolipids accumulate 
and contribute to the pathogenesis of these diseases 
(Kolter and Sandhoff 2006). They can occur also under 
normal conditions; for example, lysosulfatide is not 
only a storage lipid in metachromatic leukodystrophy 
but also present in high-density lipoproteins from nor¬ 
mal probands were it is thought to lead to nitrogen 
oxide release and protection from heart diseases. 

Ceramides 

Acylation of sphingoid bases gives rise to 
A-acylsphingosines (ceramides), A-acylsphinganines 
(dihydroceramides), A-acyl-4-hydroxysphinganines 
(phytoceramides), and acyl ceramides (Fig. 3). 
Ceramides occur as structural components of the mem¬ 
brane lipid sphingomyelin and of glycosphingolipids 


and in free form as intermediates of sphingolipid 
metabolism. As signaling molecules, they can be 
formed in response to intracellular and extracellular 
stimuli and mediate mostly antimitogenic effects such 
as cell differentiation and apoptosis (Bartke and 
Hannun 2009). In higher concentrations, ceramides 
are found in the human skin, where they are required 
for the maintenance of the water permeability barrier. 
Skin ceramides differ in their structure from those 
found in other tissues. They may contain very long 
acyl chains, which can be ra-hydroxylated and acylated 
(acyl ceramides), especially with linoleic acid (Fig. 3). 
Skin ceramides can also be covalently bound via their 
CD-hydroxy group to proteins of the so-called cornified 
envelope of the stratum corneum. Two different 
nomenclatures are in use for skin ceramides (Uchida 
and Holleran 2008). 

Ceramide-1-phosphates, the sphingolipid analogs 
of phosphatidic acids, are formed by ceramide 
kinase-mediated phosphorylation of ceramide. They 
can act as an intracellular second messenger and, for 
example, mediate inflammation and cell survival, or as 
extracellular signaling substances that stimulates cell 
migration (Arana et al. 2010). 

Phosphosphingolipids 

Phosphosphingolipids are the ceramide phosphoryl- 
cholines (sphingomyelins), ceramide phosphoryletha- 
nolamines, and ceramide phosphoinositols (Fig. 4). 
The by far most important ceramide phosphocholines 
in mammals are the sphingomyelins, which occur in 
higher animals, but not in microorganisms and plants. 
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Sphingolipids and Gangliosides, Fig. 5 Structures of 
selected monoglycosylceramides 


They have the same hydrophilic head group like phos¬ 
phatidylcholine and are the most abundant mammalian 
sphingolipids. They contribute to about 10% of the 
lipids in the human brain. The sphingoid- and fatty 
acid composition of sphingomyelins depends on spe¬ 
cies and cell type. In mammalian tissues, the sphingoid 
base is usually sphingosine (sphinganine in eye 
lenses), while polyunsaturated fatty acids are found 
in sphingomyelins of testes and spermatozoa 
(Sandhoff 2010). In lipid bilayers, sphingomyelins 
form complexes with cholesterol. 

Ceramide phosphorylethanolamines are the 
phosphosphingolipids of insects such as Drosophila, 
while inositol phosphorylceramides are found in yeasts 
and fungi, where they can be further glycosylated with 
mannosyl residues. 


in number, identity, linkage, and anomeric configura¬ 
tion of the corresponding carbohydrates and within the 
structures of their hydrophobic ceramide anchor. In 
addition to the species dependence, glycosphingolipids 
form cell-type-specific patterns, which change with 
cell growth, differentiation, viral transformation, 
ontogenesis, and oncogenesis (Yu et al. 2009). In 
contrast to plants, where most glycolipids are linked 
to diacylglycerol, the majority of mammalian 
glycolipids contain a ceramide backbone. Despite the 
great variety of naturally occurring glycosphin¬ 
golipids, evolutionary-related organisms contain 
glycosphingolipids that can be assigned to a limited 
number of so-called series. Only few series are found 
within a certain cell type. This can be attributed to the 
expression and specificity of the glycosyltransferases 
the catalyze glycosphingolipid formation. 


Glycosphingolipids 

The lUPAC recommends the name glycosphingolipid 
for any lipid containing a sphingoid base and one or 
more sugars. A great variety of glycosphingolipids has 
been characterized from different species. They differ 


Monoglycosylceramides 

Glycosphingolipids in which only one glucosyl or 
galactosyl residue is connected in P-anomeric linkage 
to ceramide are called cerebrosides (Fig. 5). Glucosyl- 
ceramide (GIcCer) is the biosynthetic precursor of most 
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Sphingolipids and Gangliosides, Table 1 Selected glycosphingolipid series 

Series 

Structure 

Name 

Abbreviation 

Gala 

GalQ(l,4GaipiCer 

Galabiaosylceramide 

GaOse2Cer 

Ganglio 

Gal P1,3GalN Ac P1,4Gal P1,4Glc P1 Cer 

Gangliotetraosylceramide 

GgOse4Cer 

Globo 

GalN Ac P1,3Gala 1,4Gal P1,4Glc P1 Cer 

Globotetraosylceramide 

GbOse4Cer 

Isoglobo 

GalNAcpi,3Galal,3Gaipi,4GlcpiCer 

Isoglobotetraosylceramide 

iGbOse4Cer 

Lacto 

Gal P1,3GlcN Ac P1,3Gal P1,4Glc P1 Cer 

Lactotetraoslyceramide 

LcOse4Cer 

Neolacto 

Gaip 1,4GlcNAc P1,3Gaip 1,4GlcP 1 Cer 

Neolactotetraoslyceramide 

nLcOse4Cer 

Muco 

Gaip 1,3Gal P1,4Gaip 1,4Glc P1 Cer 

Mucotetraoslyceramide 

McOse4Cer 



Sphingolipids and Gangliosides, Fig. 6 Structure of ganglioside GQlb, one of the major gangliosides of adult human brain. 
Ac Acetyl 


mammalian glycosphingolipids and occurs also in 
photosynthetic tissues of plants. Galactosylceramide 
(GalCer) is a member of the gala-series glycosphin¬ 
golipids that occur in higher concentrations in the white 
matter of the brain, where they are formed by oligoden¬ 
drocytes. Monoglycosylceramides with different sugars 
or conhgurations have been isolated from lower 
organisms; for example, an a-galactosyl(phyto)ceramide 
has been isolated from the marine sponge Agelas 
mauritianu. This substance (KRN7000) displays highly 
immunogenic properties which are mediated by binding 
of the KRN7000-CD1 complex to the T-cell receptor of 
natural killer cells. 

Oligoglycosylceramides 

The glycosphingolipid series of higher animals are 
derived from lactosylceramide or galactosylceramide 
(Table 1). 


Glycosphingolipids with (partial)structures 
corresponding to those given in the table are assigned 
to the respective series. 

With the exception of lysoglycosphingolipids, most 
glycosphingolipids are electrically neutral or bear 
a negative charge. Among the negatively charged 
glycosphingolipids, the gangliosides and sulfoglyco- 
sphingolipids (sulfatides) are most abundant. 

Gangliosides 

Gangliosides are sialic-acid-containing glycosphin¬ 
golipids. In higher animals, sialic acids are found 
within glycosphingolipids especially of the ganglio, 
lacto, and neolacto series. Most gangliosides belong 
to the ganglio series (see Table 1) and can contain 
sialic acids bound to the inner or the outer galactose 
residue. Structures can be deduced from Figs. 6 and 7. 
Also, lacto and neolacto series glycosphingolipids can 
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Sphingolipids and Gangliosides, Fig. 7 Ceramide biosynthesis in mammals. SPT: serine palmitoyltransferase, KR: 
3-ketosphinganine reductase, LASS: sphinganine A-acyltransferases encoded by lassl-lass6. DESl dihydroceramide desaturase 


contain sialic acid residues in various positions, for 
example, attached in a2,3 linkage to the terminal 
galactose residue (LMl), and also in a2,6 linkage to 
the GalNAc residue (3',6'-isoLDl). Sialic acids are 
derivatives of neuraminic acid, such as 
A-acetylneuraminic acid. A-glycolylneuraminic acid, 
the so-called Hanganutziu-Deicher antigen, is usually 
not found in human glycosphingolipids. Sialic acids 
can be attached to several positions within the core 
structures of the ganglio, lacto, and neolacto series, but 
also to galactosylceramide in ganglioside GM4. Like 
in most other glycoconjugates, sialic acids occur only 
in a-anomeric configuration. Acetylation and 
lactonization of sialic acids contribute to further struc¬ 
tural complexity found in gangliosides. G-acetylated 
sialic acids in gangliosides appear to be markers for 
growing cells and tissues (Schauer 2010). 

Nomenclature 

The nomenclature of gangliosides has been introduced 
by Svennerholm in 1963 and has been approved by the 
lUPAC. The name contains information about the 
series (e.g., G = ganglio), the number of neutral 
sugars (5-n), and of sialic acid residues (“A” = 0, 
“M” = 1,“D” = 2, etc.). 

Analysis and Structure Elucidation 

Together with other glycosphingolipids, gangliosides 
can be isolated from tissues by chloroform-methanol 
extraction (Lacomba et al. 2010). In a subsequent par¬ 
tition step developed by Folch with the system chloro¬ 
form/methanol/aqueous solution, gangliosides 
partition into the aqueous phase. This can be followed 
by separation into neutral and negatively charged spe¬ 
cies via anion exchange chromatography, subsequent 


separation by HPLC or thin layer chromatography and 
quantification by staining and densitometry. Structure 
elucidation is achieved by classical analytical tech¬ 
niques such as MALDI and electrospray mass spec¬ 
trometry (► Lipidomics) in combination with 
additional methods like enzymatic degradation or 
staining by glycan-specific antibodies. 

Ganglioside Function 

Gangliosides are especially abundant in the nervous 
system, where they contribute to 10-12% of the lipid 
content (Posse de Chaves and Sipione 2010). They form 
complex patterns that change with cell differentiation 
and organ development. For example, brain develop¬ 
ment is accompanied by a shift from simple 
gangliosides like GM3 (NeuAca2,3Gaipi,4GlcpiCer) 
and GD3 (NeuAca2,8NeuAca2,3Gaipi,4GlcpiCer) to 
more complex ones like GDla (NeuAca2,3Gaipi,3Gal 
NAcpl,4 (NeuAca2,3)Gaipi,4GlcpiCer) and GTlb 
(NeuAca2,3Gaip 1,3GalNAc P1,4(NeuAca2,8NeuAca 
2,3)Galpl,4GlcpiCer) during synaptogenesis and 
myelination (Yu et al. 2009). The exact biological 
function of these highly complex ganglioside patterns 
is unclear, but the phenotypes of knockout mice defi¬ 
cient in the biosynthesis of certain gangliosides indica 
te a vital role of the diversity for reproduction and 
neural function (Kolter et al. 2002). While even 
complex systems like cultured cells can survive 
without glycosphingolipids, they are required for the 
development of multicellular organisms. 

Gangliosides can mediate recognition events with 
soluble or membrane-bound lectins outside the cell. 
For example, the myelin-associated glycoprotein 
MAG recognizes NeuAca2-3Gaipi-3GalNAc termini 
on axonal gangliosides like GDla and GTlb. Several 
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infectious diseases (Hanada 2005) are influenced by 
sphingolipids and glycosphingolipids, which can, for 
example, act as cellular receptors and co-receptors for 
viruses, bacteria, and microbial toxins. Examples are 
ganglioside GMl as the receptor for cholera toxin and 
globotriaosylceramide as receptor for verotoxins. 

It is assumed that glycosphingolipids are not homo¬ 
geneously distributed on the cell surface but, together 
with glycosylphosphatidylinositol-anchored proteins, 
sphingomyelin, and cholesterol, segregate into mem¬ 
brane domains (rafts), which are the physiological 
surroundings of many membrane proteins (► Lipid 
Domains). Gangliosides can modify the activity of 
pharmacologically relevant receptors like that for insu¬ 
lin, epidermal growth factor, or nerve growth factor. 
They can influence acquired human diseases 
(Wennekes et al. 2009; Cowart 2009). For example, 
in patients with metabolic syndrome, palmitate- 
induced stimulation of de novo synthesis of ceramide 
and ganglioside GM3 is assumed to have a deleterious 
effect on insulin response. 

Metabolism 

Biosynthesis 

The first step of sphingolipid biosynthesis is 
the pyridoxal-phosphate-dependent condensation of 
L-serine with a fatty acid thioester with coenzyme 
A (CoA), mostly palmitoyl-CoA (Fig. 7). This 
rate-determining step is catalyzed by serine palmitoyl¬ 
transferase and leads to the formation of 
3-ketosphinganine, which is rapidly reduced to 
sphinganine by ketosphinganine reductase. Acylation 
to dihydroceramides with different acyl chain lengths 
is carried out by acyltransferases, which are encoded by 
lass genes (homologues of the yeast longevity assurance 
gene lagl). Mammals contain six different 
acyltransferases that display specificities for coenzyme 
A thioesters of different fatty acids. The lass genes are 
expressed in a cell-type-specific fashion so that different 
cell types form ceramides of different acyl chains. In 
animals, dihydroceramides are desaturated to ceramides 
or oxidized at the 4-position of the sphinganine moiety 
to phytoceramides. Sphingosine itself is not a product of 
de novo sphingolipid biosynthesis, but is formed during 
sphingolipid degradation. The steps leading to ceramide 
formation take place at the cytoplasmic leaflet of the 


membrane of the endoplasmic reticulum. From there, 
ceramide is either transported to the Golgi apparatus via 
vesicular flow or mediated by the transport protein 
CERT. Glucosylceramide is formed at the cytoplasmic 
leaflet of the Golgi apparatus. It can undergo 
a translocation through the Golgi membrane and 
become a substrate of glycosyltransferases with active 
sites in the lumen of the Golgi apparatus. Glucosyl¬ 
ceramide is the only glycosphingolipid which can also 
have a cytoplasmic orientation. All other glycosphin¬ 
golipids face the anticytoplasmic face of cellular mem¬ 
branes and are therefore projecting their carbohydrate 
part to the outside of a cell when they are present in the 
plasma membrane (van Meer and Hoetzl 2010). Forma¬ 
tion of lactosylceramide and members of the 
glycosphingolipid series derived from it, but also of 
galactosylceramide and of sphingomyelin occurs at the 
luminal leaflet of the Golgi membrane. Biosynthesis of 
more complex glycosphingolipids continues by the step¬ 
wise addition of single carbohydrate residues. In the 
biosynthesis of gangliosides, lactosylceramide and gan¬ 
gliosides GM3, GD3, and GT3 serve as precursors for 
complex gangliosides, which can bear no (0 series), one 
(a series), two (b series), or three (c series) sialic acid 
residues on the inner galactose moiety (Fig. 8). The 
transferases that catalyze early glycosylation steps 
show high specificity toward their glycolipid substrate, 
while those acting more downstream are less specific. 

Degradation 

The major sites of constitutive glycosphingolipid deg¬ 
radation are the endosomes and lysosomes. 
Glycosphingolipid degradation occurs in a stepwise 
manner at the surface of intraendosomal and 
intralysosomal membranes. It requires the presence 
of bis(monoacylglycero)phosphate as a negatively 
charged endolysosomal lipid, an appropriate lipid 
composition of the inner membranes, and the presence 
of lipid transfer proteins like the saposins or the GM2 
activator protein that make the membrane-bound sub¬ 
strates accessible to the hydrolases present in the lyso- 
some (Kolter and Sandhoff 2010). 

Glycosphingolipid Storage Diseases 

Mutations in the genes encoding lysosomal enzymes or 
sphingolipid activator proteins lead to defects in 
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Fig. 8 Ganglioside 
biosynthesis in the Golgi 
apparatus. 

ST sialyltransferase; 

T = transferase; 

GalNAc = W-Acetyl- 
galactosamine. Steps for 
which mice with engineered 
defects in glycosyl 
transferases have been 
characterized are indicated by 
vertical bars 
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Sphingolipids and Gangliosides, Table 2 Lysosomal (glyco)sphingolipid and ganglioside storage diseases 


Sphingolipidosis 

Deficient protein 

Major storage compound 

Dehciency of lipid transfer proteins 

GM2 activator, prosaposin, 
saposins B or C 

Different lipids 

Fabry 

a-Galactosidase A 

Globotriaosylceramide (Gal-a 1,4-Gal P1,4-Glc P1 -1 Cer) 

Farber 

Acid ceramidase 

Ceramide 

Gaucher 

Glucosylceramide 

P-glucosidase 

Glucosylceramide 

GMl gangliosidosis 

GMl P-galactosidase 

Ganglioside GMl (Gaipi,3GalNAcpi,4(NeuAca2,3) 
Gaipi,4GlcpiCer) 

Krabhe 

Galactosylceramide 

P-galactosidase 

Galactosylceramide 

Metachromatic leukodystrophy 

Arylsulfatase A 

Sulfatide 

(Gal-3’-sulfate)piCer 

Niemann-Pick, type A, B 

Acid sphingomyelinase 

Sphingomyelin 

Sandhoff 

P-hexosaminidases A and B 

Ganglioside GM2, globotetraosylceramide 
(GalN Ac P1,3 Gala 1,4Gal P1,4Glc P1 Cer) 

Tay-Sachs 

P-hexosaminidases A and S 

Ganglioside GM2 (GalNAcpi,4(NeuAca2,3) 
Gaipi,4GlcpiCer) 
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glycosphingolipid degradation. This gives rise to in 
part very severe human diseases, the sphingolipidoses. 
Table 2 gives an overview on the inherited diseases 
associated with defects in lysosomal sphingolipid deg¬ 
radation. With the exception of Fabry disease, the 
sphingolipidoses exhibit an autosomal recessive 
mode of inheritance. The cell-type-specific expression 
of glycosphingolipid series and the residual activity of 
the variant patient proteins account for the vast differ¬ 
ences between the phenotypes of the diseases (Kolter 
and Sandhoff 2006). Secondary to this, additional 
alterations contribute to the pathogenesis (Vitner 
et al. 2010). Enzyme replacement therapy is available 
for nonneuronopathic forms of Gaucher disease and 
for Fabry disease; other approaches for causal therapy 
include substrate reduction therapy, pharmacological 
chaperones, and bone marrow transplantation. 


Cross-References 

► Chemical Diversity of Lipids 

► Fatty Acids, Alkanols, and Diacylglycerols 

► Functional Roles of Lipids in Membranes 

► Glycerolipids: Chemistry 

► Glycolipids 

► Lipid Domains 

► Lipid Flip-Flop 

► Lipidomics 

► Membrane Traffic 

► Skin Lipids 
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Definition 

Transfer of spin polarization among the same kinds of 
nuclear spins (like-spins), for example, protons or *^C, 
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within a macromolecule or across interacting mole¬ 
cules. This occurs as a result of dipolar coupling 
among like-spins, when they are densely packed in 
relative proximity within a few to several Angstroms 
distance in space. 

The dipolar spin Hamiltonian between two like- 
spins contains I/l 2 ~ and Ii~l 2 ^ terms besides other 
terms. The action of these terms is to cause mutual flips 
of the two spins so that the polarization of the first spin 
is transferred to the polarization of the second spin 
located in close distance (cross relaxation). If the 
second spin interacts with third spin through dipolar 
coupling, then the same mechanism may cause the 
transfer of polarization from the second to the third, 
and likewise polarization may continue to propagate 
diffusively among spins until the extra polarization is 
lost to the lattice (thermal motions) via the Tj process. 
This phenomenon is observable in many ways and is 
called “spin diffusion,” as the polarization diffuses 
over several or more spins. The rate of cross relaxation 
depends on the internuclear distance r, the NMR fre¬ 
quency CO, and the characteristic time constant of the 
fluctuation of dipolar interaction. When raxc ^ 1, 
which could often be the case for proteins in solution 
whose molecular weight exceeds 10,000 Da and the 
NMR frequency exceeding several hundred MHz, the 
cross relaxation rate is roughly proportional to xjx^. 
Consequences of spin diffusion; 

1. Longitudinal relaxation times Ti of dipolar-coupled 
protons in a macromolecule tend to become uni¬ 
form, making proton T i no longer a sign of local 
motion (Kalk and Berendsen 1975). For most 
strongly coupled protons in proteins, the effective 
Ti becomes on the order of ^1 s as a result of spin 
diffusion to the relaxation sinks provided by methyl 
group rotations and internal motions of surface res¬ 
idues (Akasaka et al. 1990). 

2. The shortened effective Ti (on the order of a second) 
for most protons even of a high-molecular-weight 
protein makes the pulse repetition time for signal 
accumulation to be in a reasonable time range (a few 
seconds) instead of perhaps 10-100 s if no spin 
diffusion were to take place. 

3. Selectively saturating the magnetization (nulling 
polarization) of one particular spin by 
radiofrequency irradiation will cause saturation of 


the magnetization of its neighboring spin and 
decrease its NMR signal (cross saturation). Cross 
saturation of the nearest-neighbor proton 
magnetization is called Nuclear Overhauser Effect 
or NOE, a crucial parameter for NMR structural 
elucidation of biopolymers in solution. To gain 
NOE values of nearest protons, one must reduce 
spin diffusion as much as possible, for example, 
by taking truncated NOE, etc. 

4. The region of high internal mobility (roXc < 1) in 
a macromolecule will be left out of the effect of 
cross saturation. This phenomenon (selective spin 
dijfusion) may be used to map the high mobility 
regions of a macromolecule (Akasaka et al. 1978). 

5. Spin diffusion can also take place across molecules 
that interact with each other (intermolecular spin 
dijfusion) (Akasaka 1979). Cross saturation is being 
used for selecting ligands that interact with 
a macromolecule and for identifying contact resi¬ 
dues in a protein-protein complex. 

6. In solid state, long-range distance restraints obtain¬ 
able from spin diffusion are used for determining 
the protein structure (Castellani et al. 2002). 

Cross-References 

► Cross Saturation/Transferred Cross Saturation 
(TCS) 

► Nuclear Overhauser Effect 
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Synonyms 

Electron paramagnetic resonance spectroscopy; EPR; 
MTSL and MTSSL; Site-directed spin-labeling 

Definition 

Spin Label: A spin label is a small and stable organic 
radical that can be attached to macromolecules. 

EPR: A spectroscopic technique that detects 
unpaired electrons and yields information about geo¬ 
metric and electronic structures as well as dynamics. 


Basic Characteristics 

In order to study the structure, folding, or dynamics of 
DNA or RNA with EPR, they have to contain unpaired 
electrons. Since nucleic acids usually do not contain 
such paramagnetic centers they have to be artificially 
incorporated. One strategy can be to replace diamag¬ 
netic metal ions with paramagnetic ones, e.g., the 
exchange of Mg^"^ by Mn^"^ (Hunsicker-Wang et al. 
2009) or the incorporation of artihcial metal ion bind¬ 
ing sites (Clever et al. 2010). More commonly, organic 
radicals are used that are covalently and site specih- 
cally attached to the oligonucleotide (Berliner 2010). 
This technique is called site-directed spin labeling 
(SDSL) and the organic radicals mainly used are 
based on nitroxides (Pig. 1). Nitroxides are thermody¬ 
namically stable and kinetically protected by four 
bulky methyl groups meaning that they are stable for 
long times and that the N-O group usually does not 
undergo reactions with the biomolecule. The electron 
spin density resides to over 90% within the NO group 


making them very good candidates for EPR-based 
long-range distance measurements because of the 
validity of the point dipole model. In addition, the 
three line continuous wave EPR spectrum is sensitive 
to dynamics, local pH-values, and solvent viscosity 
and a careful analysis of such spectra does thus report 
on these properties. 

In the following, examples of nitroxide spin labels 
and the corresponding chemistry to bind them to the 
nucleic acids are discussed. Generally, SDSL of nucleic 
acids is based on incorporation of modified nucleotides 
into the oligonucleotide where the modification can be 
at the base, the sugar or the phosphate moiety. The 
modification may already be a spin label or 
a functional group that reacts with a correspondingly 
functionalized nitroxide. In the latter case, the nitroxide/ 
nucleotide coupling reaction can be performed either 
during or after the synthesis of the DNA or RNA strand. 
Having successfully labeled the nucleic acid, it is 
important to check that the spin label does not induce 
perturbations of the structure or function of the 
oligonucleotide. Usually, this is done by means of 
UV-VIS spectroscopy of thermal denaturation, circular 
dichroism measurements, or biochemical assays. 

Spin Labeling the Phosphate Backbone 

SDSL of the phosphate backbone has been achieved by 
reacting the iodo-modified (Eig. 1, 1 ) nitroxide with 
RNA or DNA containing a commercially available 
phosphorothioate modification at a position within 
the oligomer sequence where one wants to have the 
spin label. Using this approach, the interaction 
between a GAAA tetraloop and its RNA receptor 
(Qin et al. 2001) and the interspin distances between 
pairs of nitroxides on DNA were investigated 
(Cai et al. 2006). An advantage of this method is its 
sequence independence, the formation of two stereo¬ 
isomers and the reactivity of the phosphorothioate 
group can be of disadvantage. 

Spin Labeling the Sugar Moiety 

The sugar moiety has been labeled by reacting an 
isocyanate derivative of tetramethylpiperidyl-N-oxy 
(TEMPO) with RNAs containing the 2'-amino-uridine 
modification (Eig. 1, 2 ). The label and the 2'-amino- 
uridine phosphoramidite are commercially available 
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Spin Labeling of DNA and RNA, Fig. 1 Examples of 
nitroxide spin labels used for spin labeling of nucleic 
acids. (1) Thiol-reactive nitroxide, 3-iodomethyl-( 1-oxy- 
2,2,5,5-tetramethylpyrroline), reacted with a phosphorothioate 
modification. (2) Isocyanate tetramethylpiperidyl-N-oxy reacted 
to 2'-amino-uridine. (3) 2,2,5,5-Tetramethylpyrroline-l-oxyl- 


3-acetylene reacted with 5-iodo-uridine. (4) 4-Amino 
tetramethylpiperidyl-N-oxy reacted to correspondingly modi¬ 
fied cytidine. (5) Azidoisoindoline-N-oxy reacted to alkyne 
modified uridine. (6) ^ spin label. (7) Analogue of the ^ spin 
label that binds non-covalently to nucleic acids 


and the yields are reported to be high. This labeling 
approach has been successfully used to study TAR 
RNA dynamics (Edwards et al. 2002) and, in 
combination with PELDOR, to unravel different 
RNA conformations (Schiemann et al. 2003). In each 
case the label has been shown to induce negligible 
perturbation to the secondary structure of duplex 
RNAs. 

Spin-Labeling Nucleobases 

The nucleobase moiety can be labeled via several 
strategies. One possibility is the palladium-catalyzed 
cross-coupling of the spin label 2,2,5,5-tetramethyl- 
pyrrolin-l-yloxyl-3-acetylene (TPA) (Fig. 1, 3) to an 
oligonucleotide containing either 5-iodo-uridine, 
5-iodo-cytidine, or 2-iodo-adenosine. The reaction 
gives high yields and is performed with the DNA or 
RNA strand still on solid support. The two-atom 
alkyne linker makes the label fairly rigid which is 
advantageous for PELDOR or dynamics measure¬ 
ments. An application example for this label is the 
nanometer distance ruler for oligonucleotides 


(Piton et al. 2007). A one-atom linker between 
nucleobase and nitroxide is achieved with 4-amino- 
2,2,6,6-tetrametylpiperidin-l-oxyl (Fig. 1, 4). This 
label can either be incorporated on the 
phosphoramidite level by preparing the spin-labeled 
2'-deoxycytidine phosphoramidite or postsynthetically 
by reacting it with complementary modified guano- 
sine, adenosine, or cytidine introduced into RNA 
oligomers via the corresponding and commercially 
available phosphoramidites. Both strategies have 
been used, the former to identify single-base mis¬ 
matches in DNA duplexes (Cekan and Sigurdsson 

2009) , the latter to probe the secondary structure of 
RNAs (Sicoli et al. 2010). Most recently DNA has 
been site-specihcally spin labeled by postsynthetically 
reacting azide functionalized nitroxides to oligonucle¬ 
otides containing alkyne-modihed deoxyuridine or 
deoxyadenosine via click chemistry (Fig. 1, 5). Exam¬ 
ples of applications of this strategy include EPR mea¬ 
surements on canonical DNA duplexes (Ding et al. 

2010) and on those containing mismatches or abasic 
sites (Jakobsen et al. 2010). 
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Measuring dynamics and distances on nucleic acids 
using spin labels is often complicated by the inherent 
flexibility of the linkers between the spin label and the 
nucleoside. By fusing a nitroxide isoindol moiety to 
a modified cytidine the rigid spin label Q has been 
designed (Fig. 1, 6) and its phosphoramidite incorpo¬ 
rated into DNA during solid-phase synthesis (Barbate 
et al. 2007). The spin label has been shown to form 
a Watson-Crick base pair with guanine without causing 
a structural perturbation to the DNA duplex (Edwards 
et al. 2011). The rigid spin label q (Fig. 1,7), which is 
the base moiety of Q without the sugar-phosphate 
group, has been site-specifically incorporated into 
DNA through non-covalent interactions (stacking and 
hydrogen bonding) and shown to bind to abasic sites 
opposite guanine (Shelke and Sigurdsson 2010). 
The q spin label eliminates the chemistry of spin label¬ 
ing and since oligonucleotides containing abasic sites 
are commercially available spin labeling with a rigid 
spin label is easily achieved without excessive training 
in spin-labeling chemistry. 

In summary, nitroxide spin labels have been suc¬ 
cessfully incorporated at various sites within nucleic 
acids. Which label one chooses will strongly depend 
on the need to minimize structural and dynamical 
alterations and the questions to be answered. Together 
with the broad range of continuous wave and pulsed 
EPR methods, nitroxide spin labels have been shown 
to provide valuable information on local dynamics and 
to yield precise structural information on both DNA 
and RNA constructs. 


Cross-References 

► EPR Spectroscopy: General Principles 

► Interspin Distance Determination by EPR 

► PELDOR and DEER 

► Residual Dipolar Coupling 

► Spin Labeling 
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Definition 

Spin-label electron paramagnetic resonance (EPR) is 
a spectroscopic technique which detects the response 
of a paramagnetic system to resonant microwave radi¬ 
ation. For its timescale and sensitivity, it is particularly 
appropriate to study lipid membranes. 

Introduction 

Spin-label EPR spectroscopy (► Chemistry of Spin 
Labeling) is one of the most efficient tools to 
investigate structural, dynamical, and molecular 
properties of lipid and protein components in natural 
and model membranes (► Membrane Dynamics; 
Berliner 1976; Marsh 1981; Marsh and Horvath 1989; 
Hemminga and Berliner 2007). 

The success and the importance of the EPR 
technique for membrane studies lies in the fact that 
its timescale matches the one characteristic of 
molecular motion of lipids and proteins. The motional 
sensitivity of spin-label EPR extends from the T 2 
spin-spin relaxation timescale (corresponding to 
rotational correlation times Tr « 10”^' to 10~* s) to 
the T 1 spin-lattice relaxation timescale (corresponding 
to Tr « 10“^ to 10“^ s). In this extended time window 
lie both slow translational motions (lateral lipid and 
protein diffusion, anisotropic protein rotations) and 
fast anisotropic lipid rotations (long axis rotations, 
angular motions, chain segmental motions). 

Most biomembrane studies are performed with 
continuous wave cw-EPR at 9 GHz (X-band) in the 
linear regime, but also under progressive saturation or 
saturation transfer (► EPR Spectroscopy: General 
Principles). In recent years, significant progress has 
been made in the investigation of biological 
membranes by using high-field/high-frequency EPR 
and pulsed Fourier transform FT-EPR (► Pulsed 
EPR: Principles and Examples of Applications to 
Hemeproteins; Borbat et al. 2001; Marsh et al. 2005; 
Bartucci et al. 2006; Hemminga and Berliner 2007). 
These different EPR methods in combination with the 
availability of distinct spin-label species allow to make 
in-depth studies of different aspects in the field of 
membrane biophysics. 

This entry focuses on the application of spin-label 
EPR to lipid bilayer membranes. Emphasis is given to 


lipid chain order and dynamics, to intramembrane 
polarity and water accessibility profiles, and to lipid- 
protein interactions. 

Spin Labels for Lipid Membrane EPR Studies 

Natural membranes as well as membrane model 
systems do not possess an intrinsic paramagnetism 
which is necessary for EPR spectroscopy. The 
technique is applied by using magnetic reporter groups 
or “spin labels.” Label moieties routinely used in 
membrane systems are based on the stable free radical 
of the nitroxide group - NO. Different nitroxide spin 
labels for studying the lipid component in membranes 
can be synthesized and are also commercially 
available. They consist of spin probes and spin-labeled 
lipids and steroids (Fig. 1). 

The water-soluble spin probes TEMPO and DTBN 
(Fig. la, b) partition in a temperature-dependent 
manner between the aqueous and the fluid hydrocarbon 
membrane phases. They are well suited for studying 
membrane fluidity, phase transitions associated with 
lamellar bilayer structures, and lateral phase 
separations in lipid membranes. 

A strong impulse for the use of EPR in membrane 
research is given by the synthesis of spin-labeled 
lipids. In spin-labeled lipids, the spin-label reporter 
group can be covalently attached at the polar head 
(Fig. Ic) or to a C-n atom along the hydrocarbon 
chain of a lipid molecule (n = 5 in Fig. Id and 
n = 14 in Fig. le). To avoid spin-spin interaction 
and minimize bilayer perturbation, the spin-labeled 
lipids are dissolved at low concentration (0.5-1% of 
total lipids) in the membrane dispersions. They mix 
well with membrane lipids and orient themselves with 
the polarhead directed toward the polar/apolar 
interfacial bilayer region and the hydrocarbon tail in 
the hydrophobic membrane interior. 

In general, the headgroup labels (an example is 
TEMPO-stearate in Fig. Ic), besides being sensitive 
to perturbations at membrane surface, are also useful in 
studying transbilayer flip-flop, asymmefric distribution 
of lipid molecules, and lateral diffusion rates. 

A wide variety of commonly occurring biological 
lipids can be labeled along the hydrocarbon chain. 
Stearic acid spin labels bear the doxyl moiety at 
selected carbon atom along the acyl chain (n-SASL); 
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Spin-Labeling EPR of Lipid Membranes, Fig. 1 Structures 
of common spin labels used in EPR studies of bilayer lipid 
membranes, (a) 2,2,6,6-tetramethylpiperidine-l-oxyl, TEMPO; 
(b) <fi-fc;t-butyl nitroxide, DTBN; (c) 2,2,6,6-tetramethyl- 
piperidin-l-oxyl-4-yl octadecanoate, TEMPO-stearate; (d) «-(4, 
4-dimethyloxazolidinyl-N-oxyl)stearic acid, ;i-steaiic acid spin 
label, w-SASL; (e) l-acyl-2-(«-(4,4-dimethyloxazolidinyl-N-oxyl) 
stearoyl)-sn-glycero-3-phosphocholine, ;i-phosphatidylcholine 


spin label, n-PCSL; (f) l,2-dipalmitoyl-sn-glycero-3- 
phosphocholine, DPPC; (g) 4',4'-dimethylspiro(5a-cholestane- 
3,2'-oxazolidin)-3'-yloxyl, cholestane spin label, CSL; (h) 17(3- 
hydroxy-4',4'-dimethylspiro(5a-androstane-3,2'-oxazolidin)-3'- 
yloxyl, androstanol spin label, ASL. (i) Orientation of the 
magnetic axes (x, y, z) of the doxyl spin-label group relative to 
the acyl chain axis (dashed line) 


double-chained glycerolipids with different polar 
heads can bear the doxyl group at any carbon position 
down the sn-2 acyl chain between n = 4 and 14, as well 
as at C-16 position. These are the «-PXSL: n-PCSL, 
-PESL, -PGSL, -PSSL, -PISL, and -PASL are 
spin-labeled phosphatidylcholine, phosphatidyletha- 
nolamine, phosphatidylglycerol, phosphatidylserine, 
phosphatidylinositol and phosphatidic acid, 
respectively. n-PXSL have advantages over the 
n-SASL because they intercalate in the membranes in 


register with the phospholipids and they possess 
a phospholipid headgroup specificity. Depending on 
the membrane composition, the most appropriate 
spin-labeled lipid species can be chosen which, once 
inserted in the biomembrane, then behaves as the 
endogenous lipids of the parent membrane matrix. 
An unlabeled dipalmitoylphostatidylcholine (DPPC) 
lipid molecule is shown in Fig. If. 

Other more complex chain-labeled lipid species can 
be synthesized. They differ with regard to headgroup, 
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fatty acyl chain composition, and type of linkage 
between glycerol backbone and fatty acyl residues. 
Examples are spin-labeled glycolipids, distinct species 
of gangliosides, and a variety of lyso and acyl 
derivatives of cardiolipin. 

Chain-labeled lipids are employed for studying 
segmental lipid chain motions, lipid/protein interac¬ 
tions, and several molecular and physical properties of 
membranes including transmembrane polarity profiles. 

The spin-labeled steroids, cholestane and 
androstanol spin labels (Fig. Ig, h), can be 
incorporated in membrane dispersions and locate in 
the bilayer hydrophobic zone. Because of the orienta¬ 
tion of the nitroxide axes (y parallel to the steroid long 
axis), they are mainly used in oriented biomembrane 
preparations. They can be considered analogues for 
studying the role of cholesterol in membranes. 

Details on spin-label synthesis, sample handling, 
and application to membrane studies are reported in 
(Berliner 1976; Marsh 1981, 2008; Marsh and 
Watts 1982). 


Application of Spin-Label EPR to Lipid 
Bilayer Membranes 

Lipid Chain Order and Dynamics 

Due to the dominating contribution of the axially 
symmetric '^N-hyperfine interaction to the spectral 
anisotropy and to its optimum timescale, 9-GHz 
cw-EPR spectroscopy (► EPR Spectroscopy: General 
Principles) of chain-labeled lipids is particularly 
appropriate to study transverse chain ordering and 
rotational dynamics of lipid molecules in biological 
membranes (see, e.g.. Marsh 1981; Marsh and Horvath 
1989). 

The basic structural arrangement of biomembranes 
is the lipid bilayer which represents a highly dynamical 
matrix by virtue of the segmental rotation within the 
lipid chains. Lipids spin-labeled at a selected position, 
«, along the chain are used to characterize the 
segmental chain mobility and its profile in membranes. 

The EPR spectra of chain-labeled lipids in bilayer 
membranes in the fluid phase show a progressive 
motional averaging of the hyperfine anisotropy 
(► EPR Spectroscopy: General Principles) as the 
label moiety is stepped down the acyl chain from the 
first segments adjacent to the apolar/polar interface 


toward the chain terminus (see Fig. 2, left panel). 
This behavior is indicative of a gradient of increased 
rotational mobility on proceeding toward the bilayer 
midplane. The chain segmental flexibility profile is 
a hallmark of lamellar bilayer assemblies in the liq¬ 
uid-crystalline phase. It has been observed for lipid 
membranes, including phospholipids bilayers, biolog¬ 
ical membranes, and lipid/protein complexes (Hubbell 
and McConnell 1971; Berliner 1976; Marsh 1981; 
Marsh and Watts 1982; Bartucci et al. 1993). 

The fundamental molecular process that gives rise 
to membrane mobility or fluidity is the progressive 
trans-gauche rotational isomerism about the C-C 
bonds of the chain. A mobility gradient is also recorded 
in lamellar bilayer gel phases with the lipid chains 
nearly in the extended all-trans configuration. The 
mobility becomes more evident on increasing the 
temperature within the gel phase. 

The flexibility profile in membranes can be 
delineated by following the dependence on labeling 
position, n, of the outermost peak separation, 2Amax> 
in the conventional EPR spectra (see Fig. 2, right 
panel). 2Amax depends on both the amplitude 
(i.e., order) and rate of motion and is a useful parameter 
for characterizing lipid chain dynamics under all 
conditions of immobilization. The effective order 
parameter, 5eff, determined from the hyperfine 
splittings of the axially anisotropic spectra and/or 
spectral simulations can also be used. 

Chain dynamics of bilayer membranes on the 
nanosecond timescale can be studied at low cryogenic 
temperatures by using 9-GHz 2-pulse echo-detected 
(ED-) EPR spectroscopy (► Pulsed EPR: Principles 
and Examples of Applications to Hemeproteins; Erilov 
et al. 2004; Bartucci et al. 2006). ED-EPR spectra 
of lipid spin labels in membranes reflect the depen¬ 
dence of the echo decay on the angular reorientation 
of the spin label. The lineshapes of ED spectra 
as a function of the interpulse separation directly 
reflect the amplitude and rate of rotational motion 
of the spin-labeled chain segments. The ED-EPR 
method exploits the sensitivity of spin-label EPR to 
rotational motions in a time-resolved manner. Partially 
relaxed ED-spectra have been recorded for n-PCSL in 
DPPC membranes containing equimolar cholesterol 
(Erilov et al. 2004). The ED-spectra at 210 K of 
14-PCSL in DPPC/cholesterol membranes are 
reported in Fig. 3. 
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Spin-Labeling EPR of Lipid 
Membranes, Fig. 2 {Left) 
cw-EPR spectra of 
phosphatidylcholine spin-label 
positional isomers, «-PCSL, in 
fluid bilayer membranes of 
dipalmitoylphosphatidylcholine 
(DPPC). Spectral width = 

100 G and T = 50°C. (Right) 
Chain flexibility profile of the 
hyperfine splitting, 2A^^, of 
n-PCSL in fluid bilayer 
membranes of DPPC 


?A 




They show typical features of isotropic librational 
motion characterized by rapid oscillations (correlation 
time in the nanoseconds) of small angular amplitude 
(<15°). The positional dependence of the relaxation 
rates, deduced from the ED-spectra, varies in the direc¬ 
tion of increasing cumulative torsional amplitude 
toward the end of the chain. A transition from 
cumulative segmental librations to concerted chain 
motion takes place at about 200 K (Erilov 
et al. 2004). These librational chain motions must be 
present, in addition to larger-scale rotational motions, 
in the higher temperature phases of lipid membranes. 

The orientational sensitivity of nitroxide cw-EPR 
spectra increases on going to higher microwave 
operating frequencies (Borbat et al. 2001; Marsh 
et al. 2005; Hemminga and Berliner 2007). At 
94-GHz (W-band), the axial Zeeman anisotropy of 
nitroxide spin labels exceeds the '"^N-hyperfine 


anisotropy. At this microwave frequency, the spectra 
of chain-labeled lipids in membranes are sensitive to 
nonaxial rotations and to both lateral and transverse 
ordering of the phospholipid chains, which affect the 
X-, y-elements and the z-elements, respectively, of the 
spin-label g-tensor. High-frequency EPR is therefore 
particularly suitable for investigating lipid membranes 
where in-plane lipid domain formation is expected to 
occur. 94-GHz EPR results on the temperature and 
chain-position dependence of «-PCSL in fully 
hydrated membranes of dimyristoylphosphati- 
dylcholine containing 40 mol% cholesterol are 
reported in (Kurad et al. 2004). 

Polarity and Water Penetration Profiles in Lipid 
Membranes 

The accessibility of the lipid-membrane interior to 
water and the resulting transmembrane polarity profile 
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Spin-Labeling EPR of Lipid 
Membranes, 

Fig. 3 Two-pulse electron 
spin echo sequence (7i/2- x -n- 
T -echo) and echo-detected 
(ED-) EPR spectra at 210 K 
for different interpulse 
separation time, x, of 14-PCSL 
in bilayer membranes of 
DPPC + 50 mol% cholesterol 



are relevant to structure/function relationships in mem¬ 
branes. They impact on the permeability and transport 
properties of biomembranes, and on the stability and 
function of integral membrane proteins. 

Intramembrane polarity prohles can be determined 
with EPR of chain-labeled lipids by following the 
dependence on chain position, n, of polarity-sensitive 
parameters (for a review, see Marsh 2010). cw-EPR 
determinations of membrane polarity are based on the 
measurements of the 14N-hyperfine isotropic splitting 
constant, ao, in fluid membranes, of the component 
of the hyperfine tensor in frozen membranes, and of the 
g-values (► Spin-Labeling EPR of Proteins: Dynamics 
and Water Accessibility of Spin-Label Side Chains, 
► EPR Spectroscopy: General Principles). The three 
parameters reflect environmental polarity via the 
dependence on the net unpaired electron spin density 
on the nitroxide group: an increase of polarity in the 
vicinity of the spin-label group leads to an increase in 
the hyperfine couplings and to a decrease of g-values. 

By using fatty acids and phospholipid spin-labels, 
Griffith et al. (1974) first described the shape of the 
hydrophobic barrier in microsomal membranes and in 
bilayers of extracted lipids. Successively, profiles at 
high spatial resolution have been obtained in bilayer 
model membranes, with and without high content of 
cholesterol, employing lipids spin-labeled either 


systematically down the chain or in the polar-head 
region. cw-EPR measurements were performed at 
9 GHz (Subczynski et al. 1994; Marsh 2001) and at 
94 GHz, where the highest sensitivity of g-values to 
polarity is attained for the gxx tensor element (Kurad 
et al. 2003). The cw-EPR polarity-sensitive parameters 
show a sigmoidal dependence on chain position, n, in 
which a transition takes place from a high-polarity 
region adjacent to the lipid head groups to 
a low-polarity region at the bilayer midplane in the 
center of the membrane. The characteristics 
(i.e., position, width, and magnitude) of the transition 
depend on the membrane lipid composition, particu¬ 
larly on cholesterol content, and on the lipid phase. An 
example of polarity profiles of ao of chain-labeled 
lipids in fluid bilayer membranes is given in Pig. 4. 

The parameters ao, A^z, and gxx indirectly reflect 
transmembrane water penetration via their dependence 
on hydrogen bonding and dielectric constant. Direct 
determination of water (D 2 O) penetration at a given 
depth in the membrane is possible by ^H-electron 
spin-echo envelope modulation (► Pulsed ENDOR 
and ESEEM: Principles and Examples of Applications 
to Heme Proteins) of chain-labeled lipids (Erilov et al. 
2005; Bartucci et al. 2006). The electron spin-echo 
decay shows rapid oscillations arising from weak 
hyperfine interaction of the electron spin of the label 
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Spin-Labeling EPR of Lipid 
Membranes, Fig. 4 Polarity 
profiles of the isotropic 
'"'N-hyperfine coupling 
constant, ao, of n-PCSL in 
fluid bilayer membranes of 
DPPC and of 

DPPC + 50 mol% cholesterol 
(Figure adapted with 
permission from Marsh (2001) 
Proc Natl Acad Sci USA 
98:7777-7778. “Copyright 
2001 by the National 
Academy of Sciences, 

U.S.A”) 
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Spin-Labeling EPR of Lipid 
Membranes, Fig. 5 ESEEM 
spectra at 77 K of 5-PCSL 
(upper) and of 14-PCSL 
(lower) in membranes of 
DPPC -r 50 mol% cholesterol 
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with matrix 'H protons and slow modulations which 
arise specifically from the dipolar hyperfine interaction 
with the nuclear spin of the D 2 O molecules. The 
extent of accessibility of water at the label is directly 
determined by the intensity I(D 20 ) of the signal in the 
ESEEM frequency spectrum. In bilayer model mem¬ 
branes, with and without cholesterol, in the frozen state 
at 77 K, I(D20) is relatively large for positional iso¬ 
mers at the polar/apolar interface and is almost 


vanishingly small at the bilayer center. The ESEEM 
spectra of two extreme positions of chain labeling, i.e., 
5-and 14-PCSL, in membranes of DPPC - 1 - 50% cho¬ 
lesterol are given in Pig. 5. 

The water-penetration profiles display a sigmoidal 
dependence on chain position that is characteristic 
of transmembrane polarity profiles established by 
other, less direct, spin-label methods presented 
above. The pulsed-EPR technique allows to separate 
















2438 


Spin-Labeling EPR of Lipid Membranes 




and distinguish free, unbonded water from water sin¬ 
gly and/or doubly H-bonded to the nitroxides (Erilov 
et al. 2005). 

Lipid-Protein Interactions 

EPR of chain-labeled lipids in lipid-protein mix¬ 
tures (► Membrane Dynamics) allows one to 
study the influence of proteins on lipid bilayer 
properties. 

In the case of integral membrane proteins, the EPR 
spectra of lipids spin-labeled at C-n positions close to 
the chain terminus (i.e., n = 12, 14 or 16) in mem¬ 
branes have evidenced the existence of boundary 
lipids. The spectra consist of two components with 
different rotational mobility. An inner, isotropic 
component is due to more mobile lipids in the bulk 
bilayer regions and an outer, more anisotropic, 
component is due to the motionally restricted boundary 
lipids in direct contact with the hydrophobic 
intramembranous surface of the proteins. Resolution 
and quantification of the two components by spectral 
subtraction yields information on the stoichiometry 
and selectivity of lipid/protein interaction. The 
boundary lipids possess conformational disorder 
and are in exchange with bulk lipids with rates in the 
range from 10^ s“' (lipids with no selectivity) to 
10® s“^ (lipids with the highest specificity). These 
rates are slower than the diffusional lipid-lipid 
exchange rate in fluid phosphatidylcholine bilayers 

~ 10 ^ S ^). 

For proteins that bind at the polar surface and 
partially insert into the hydrocarbon interior of 
membranes, the motional restriction imposed on the 
lipids by the protein is measured by the outer hyperfine 
splitting, 2An,ax> of the motionally restricted 
component in the spectra of chain-labeled positional 
isomers. Comparison of A^ax-values relative to those 
in lipid bilayers alone then allows one to estimate the 
extent of penetration and the location of the protein in 
the hydrophobic membrane region. 

Peripheral, surface-associated proteins cause 
limited perturbation of the mobility of the lipid chains. 
This is revealed by changes in 2Amax in the single¬ 
component spectra of lipids spin-labeled close to the 
polar/apolar interface (i.e., n = 5). 

A useful presentation of spin-label EPR data for 
different proteins and distinct species of lipids is 
given in a recent review (Marsh 2008). 


Summary 

Applications of multifrequency approaches of 
cw-EPR and pulsed EPR methods to the study of 
chain-labeled lipid membranes are presented. The 
segmental lipid chain order and dynamics, the trans¬ 
membrane polarity, and permeation water profiles 
and some aspects of lipid/protein interactions in 
membranes are addressed. 
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► Chemistry of Spin Labeling 

► EPR Spectroscopy: General Principles 

► Membrane Dynamics 

► Pulsed ENDOR and ESEEM: Principles and 
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Applications to Hemeproteins 
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Water Accessibility of Spin-Label Side Chains 
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Synonyms 

Site-directed spin-labeling 


Definition 

By introducing a side chain carrying a paramagnetic 
center (spin label, ► chemistry of spin labeling) 
at a desired site in a protein, one can obtain via 


EPR information on the dynamics of the side 
chain, its water accessibility, and the polarity of its 
microenvironment. Scanning those properties along 
the polypeptide chain can reveal details of the second¬ 
ary and tertiary structure of the protein under 
investigation. 

Introduction 

The subject of this entry is the application of site- 
directed spin labeling (SDSL) electron paramagnetic 
resonance (EPR) to characterize dynamics and water 
accessibility of specific sites in proteins. Some practi¬ 
cal advantages of the use of EPR to study dynamics 
and water profiles in proteins should be emphasized. 
First of all the nitroxide label is relatively small and 
highly flexible, which enables its introduction at spe¬ 
cific sites in proteins with minor perturbation of the 
protein structure. Second, SDSL EPR can be applied to 
proteins of any size for which no structural details exist 
(e.g., no NMR or X-ray structures are available), or for 
which only one conformation is known with atomistic 
details, and one wants to investigate the changes in 
dynamics and water accessibility during the conforma¬ 
tional transitions accompanying the protein’s physio¬ 
logical activity (e.g., transport of substrates, light 
activation, dimerization, oligomerization, membrane 
insertion). Moreover, the technique is highly sensitive 
and can be performed at any temperature, enabling 
measurements on as little as 100 pmol of protein 
(10 pi of 10 pM spin-labeled protein). The relevant 
timescale in conventional X-band EPR at 9.5 GHz 
(from 10 ps to 100 ns) is adequate to follow side 
chain reorientations and fast backbone dynamics. In 
this dynamic range also the relatively fast protein tum¬ 
bling processes (in the case of small water soluble 
proteins below 20 kDa) can be detectable. However, 
in the case of spin-labeled proteins embedded into 
membrane bilayers or solubilized in detergent 
micelles, the rotational correlation time of the overall 
tumbling exceeds the sensitivity range of the tech¬ 
nique; thus one can focus only on the internal motion 
of the spin label and the dynamics of the secondary 
structure associated with it. 

EPR offers a valuable tool to investigate under 
conditions close to the physiological ones the 
structural flexibility of a wide range of proteins, 
a property intimately correlated to their function. 
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Spin-Labeling EPR of Proteins: Dynamics and Water 
Accessibility of Spin-Label Side Chains, Fig. 1 The 

nitroxide spectrum, (a) Zeeman and hyperfine splitting scheme 
for a '^'N nitroxide radical (S = 1/2, I = 1). (b) Structure of 
a pyiToline-type nitroxide. The coinciding g and A tensor frames 
are shown relatively to the pz orbital of the N atom. Due to the 
anisotropy of the hyperfine interaction with the ’"'N nucleus 


(I = 1), different orientations with respect to the magnetic field 
leads to different hyperfine splitting as shown in the panel, (c) 
Separate effects of the hyperfine and g anisotropy on each mj line 
are computed analytically. At the bottom absorbance and first- 
derivative “powder spectrum” at X band (Adapted from http:// 
www.epr.ethz.ch/education/epr2007.pdf) 


EPR is a complementary technique to ► X-ray and 
► NMR, in fact, unlike X-ray, where the dynamic 
regions of crystals are only poorly defined and NMR, 
where large proteins embedded in membranes are dif¬ 
ficult to resolve, EPR offers an adequate tool to map 
elements of secondary and tertiary structure around 
specific sites in large membrane proteins and protein 
complexes with an exquisite sensitivity for flexible 
regions. A disadvantage of SDSL EPR is that the 
information collected are site-specific and one needs 
to go through the mutagenesis and labeling steps for 
each position under investigation. 

The Nitroxide EPR Spectrum 

Nitroxide radicals are mostly pyrroline- or piperidine- 
type radicals carrying an unpaired electron delocalized 
in the n. orbital system of the N-0 bond which is 
stabilized by steric hindrance due to the four methyl 
groups preventing dimerization. Several labeling strat¬ 
egies and spin labels are available at present. However, 
the extreme specificity for the free thiol of cysteines, 
a stoichiometric reaction at most sites, and the 


relatively small size and flexibility of the modified 
side chain made the (l-Oxyl-2,2,5,5-tetramethyl-A3- 
pyrroline-3-methyl) methanethiosulfonate spin label 
(MTSSL) the most popular choice for SDSL applica¬ 
tions. The typical ► continuous wave EPR spectrum of 
a nitroxide radical consists of three hyperfine lines due 
to the interaction of the electron spin (S = 1/2) with the 
'^N nucleus (I = 1) of the pyrroline or piperidine ring 
(Fig. la). The hyperfine splitting of a nitroxide spec¬ 
trum depends on the relative orientation of the 
nitroxide molecular frame with respect to the 
external magnetic field Bq. Crystallographic studies 
of pyrroline-type radicals revealed the planar structure 
of the radical ring, with the N-O bond lying in the 
C-N-C plane. The coinciding hyperfine and g tensor 
principal axis systems (PAS) are represented in Fig. lb 
on a pyrroline ring according to the standard 
nomenclature. 

The orientation-dependent properties of the nitroxide 
spectrum are determined by the anisotropy of the A and 
g tensors. For MTSSL, the hyperfine values are Axx 
Ayy = 0.45-0.50 mT and A^^. = 3.3-3.7 mT and the 
g values are gxx = 2.0081-2.0091, gyy = 2.0061, and 
gzz = 2.0021. The large range of A^^ and gxx values 
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Spin-Labeling EPR of Proteins: Dynamics and Water 
Accessibility of Spin-Label Side Chains, Fig. 2 Effects of 
high frequency (high Bg) on the nitroxide spectrum, (a) Simula¬ 
tions of powder spectra performed with the Easyspin function 
“pepper” at S (4 GHz, 0.14 T), X (9.5 GHz, 0.34 T), Q (34 GHz, 
1.2 T), and W (95 GHz, 3.4 T) bands. The g and A parameters 


resolved in the W-band spectra are shown, (b) Simulations of 
nitroxide spectra in the intermediate motional regime performed 
with “chili” for isotropic reorientational motion with Xc = 1 ns. 
(c) Nitroxide spectra in the fast motional regime simulated with 
“garlic” for isotropic reorientational motion with x^ = 10 ps 


correlates with their dependency on the polarity of the 
nitroxide microenvironment, as it will he explained 
later in the chapter. Due to fast averaging of the 
anisotropies during the free tumbling of the nitroxide in 
water at room temperature, the resulting spectrum shows 
three narrow lines separated by an isotropic hyperfine 
splitting Uiso (1/3 of the trace of the A tensor) and 
centered at an average g value (1/3 of the trace of the 
g tensor). The typical room temperature ► continuous 
wave EPR spectrum of a freely tumbling MTSSL in 
water consists of three equally spaced narrow lines 
(hyperfine splitting aj^o 1.5 mT, peak-to-peak linewidth 
0.16 mT). 

In the rigid limit, e.g., at low temperature, the 
weighted superpositions of all possible molecular ori¬ 
entations leads to the so-called powder spectrum which 
is dominated by the singularities arising at the princi¬ 
pal axis directions (Fig. Ic). The powder spectrum is 
characterized by a central mj = 0 line with a frequency 
dispersion solely caused by g anisotropy and by 
a high field mj = — 1 line broader than the low field 
mi = -1-1 line due to the relative order of the A and 
g values (Azz » Axx, Ayy and g^^ < gyy < gxx). 

In conventional X-band EPR the g anisotropy is not 
resolved, and only the A^^ hyperfine parameter can be 
extracted from the fitting of the spectra. For all other 
parameters of the A and g tensor, one needs to detect 
the spectra at higher frequency. Figure 2a shows the 
rigid-limit nitroxide spectra at different bands: 
S (4 GHz), X (9.5 GHz), Q (34 GHz), and 


W (95 GHz) simulated with the Easyspin (Stoll and 
Schweiger 2006) function “pepper.” W-band frequen¬ 
cies are sufficient to completely resolve all nitroxide 
spectral features. In the other extreme of motion, 
namely, the fast isotropic motion, the multifrequency 
comparison of the spectra simulated with the Easyspin 
function “garlic” (correlation time 10 ps) shows no 
relevant high-frequency advantages. The spectra 
show three narrow lines centered at gj^o with 
a splitting aiso (Fig. 2c) with the only recognizable 
change being a minor broadening on the third line of 
the spectrum at higher frequency. In the two motional 
extremes the nitroxide spectra differ tremendously. In 
intermediate regimes the partial averaging of the 
anisotropic properties leads to a gradual change in the 
spectral features. The lineshape of the cw spectra 
encodes all information on the complicated anisotropic 
overall reorientational dynamics of the nitroxide mol¬ 
ecule, usually referred to as a general “mobility” of the 
spin label. Figure 2b shows a simulation for an inter¬ 
mediate isotropic correlation time of 1 ns. The broad¬ 
ening of the spectral lines due to incomplete averaging 
of the hyperfine anisotropy is already evident at 
S band. Comparing the spectra at different bands in 
Fig. 2b to the fast motion spectra (Fig. 2c) reveals that 
high-frequency spectra are more sensitive to fast 
motions, due to the higher Zeeman spectral resolution. 
On the contrary, slow motions which may be already 
close to the rigid-limit case at high frequencies can be 
still detected at lower frequencies. 
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Spin-Label Side Chain Mobility and Protein 
Structure 

The sensitivity of the EPR spectra to the 
reorientational motion of the nitroxide side chain 
attached at specific sites in the protein has been exten¬ 
sively reviewed (Berliner 1976; Berliner 1979; Berli¬ 
ner and Reuben 1989; Bordignon and Steinhoff 2007). 
Reorientational motions of spin labels that are fast on 
the EPR timescale effectively average the anisotropic 
magnetic interactions leading to motionally narrowed 
spectral features. At X band, this corresponds to cor¬ 
relation times less than 1 ns. Typically, side chain 
motions at the N- and C-terminus of a protein, or in 
extended flexible loops fall into this category. Eor 
these fast motions, a pseudo-isotropic rotational corre¬ 
lation time can be calculated by comparing heights and 
widths of the three hyperfine lines of the spectrum 
according to Kivelson (Kivelson 1960). Intermediate 
motions result in partial averaging of the anisotropic 
magnetic interactions leading to broadened spectral 
features. This corresponds to effective correlation 
times in the 1-5 ns time window, characteristic of 
motions of residues at the protein surface or in small 
loops (Pig. 3a, position 164). The global tumbling of 
small globular proteins in solution also falls into this 
range (<20 kDa). Slower motions (5-100 ns), charac¬ 
teristic of buried sites in proteins and of global tum¬ 
bling of large proteins in solution (>100 kDa), lead to 
resolution of spectral features with increased apparent 
hyperfine splittings and spectral linewidth (Pig. 3a, 
position 170). Very slow motions (>1 ps), character¬ 
istic of the tumbling of transmembrane proteins in 
a lipid bilayer, lead to rigid-limit EPR spectra. Confor¬ 
mational changes in many proteins also occur in the 
microsecond timescale or longer, thus the spectrum 
will be the weighted sum of the spectra corresponding 
to each protein conformation. Pigure 3 shows some 
examples of different spin-label mobilities and the 
semiquantitative approach which is often used to 
extract spin-label dynamics based on lineshape analy¬ 
sis. Generally the term “mobility” is used to define the 
effects on the EPR spectral features due to the motional 
rate, amplitude, and anisotropy of the overall 
reorientational motion of the nitroxide label. Two 
mobility parameters are presented: the inverse of the 
spectral second moment and the inverse of the central 
EPR line (Pig. 3b). Both parameters are increasing 
functions of the spin-label mobility and can be used 


to give insights in the protein secondary structure, 
based on the available correlation between mobility 
and protein structure obtained from a large number of 
studies. Although the semiquantitative analysis is 
usually enough to draw conclusions on secondary 
structure and conformational changes (Pig. 3c), inter¬ 
pretative problems may arise if spectra show compli¬ 
cated features with multiple components, which in 
literature are often identified as “immobile” and 
“mobile” componenfs (Fig. 3c, arrows). 

Clearly, the reorientation of the spin label is 
a complex function of the spin-label molecular 
structure and the primary, secondary, tertiary, and 
eventually quaternary structure of the protein under 
investigation and it is strictly anisotropic. Addition¬ 
ally, a distribution of motional states can be concluded 
from those spectra which exhibit more than one com¬ 
ponent, which is the majority of the cases encountered 
experimentally. Simulations of the cw spectra were 
obtained by ► molecular dynamics simulations 
(Steinhoff and Hubbell 1996; Beier and Steinhoff 
2006; Oganesyan 2007; Sezer et al. 2009; Oganesyan 
2011) or using a slow motional theory based on the 
stochastic Liouville equation. The theoretical model 
developed by Freed (SRLS, slowly relaxing local 
structure) is the most advanced up to date (Liang and 
Freed 1999; Zhang et al. 2010). Two principal dynamic 
modes are included in SRLS, the global tumbling of 
the protein and the internal motion consisting of back¬ 
bone fluctuations and side chain isomerizations. The 
application of the SRLS model is shown to be able to 
separate the effects of faster internal modes of motion 
from slower overall motions only if spectra at different 
frequencies are simultaneously fit. Nevertheless, even 
with this approach up to three distinct spectral compo¬ 
nents are often required to account for the spectral 
features at all frequencies at different temperatures, 
showing the intricate and composite interaction 
between the label and the protein environment. 

Accessibility of the Spin-Label Side Chain 
and Protein Topography 

To obtain insights in the protein topography with 
respect to the water and membrane interfaces (in case 
of membrane proteins), the nitroxide dynamics in 
a spin-labeling site scan (several single point mutation 
along the polypeptide chain) do not suffice. The 
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Spin-Labeling EPR of Proteins: Dynamics and Water 
Accessibility of Spin-Label Side Chains, Fig. 3 Spin label 
“mobility." (a) Ribbon representation of bacteriorhodopsin with 
two spin-labeled positions. The clouds define more than 90% of 
the accessible space of the N-O group as obtained from MD 
simulations (Beier and Steinhoff 2006). The motion of the spin 
label attached at position 164 is only slightly restricted 
(“mobile” position) while the one at 170, buried in the interior 
of the protein, is strongly restricted (“immobile”). Experimental 
spectra {black) and MD-based simulations {green) (Beier and 
Steinhoff 2006). (b) Plot of the mobility parameters 
(inverse of the second moment) versus AHq^* (inverse of the 
central line width) extracted from the experimental spectra of 
a set of positions investigated in bacteriorhodopsin. The topo¬ 
logical regions in the mobility plot are drawn according to 
Mchaourab et al. (1996), Isas et al. (2002) and are not intended 
to be exhaustive of all mobilities experimentally observed 
(Adapted from Bordignon and Steinhoff 2007). (c) Available 


crystal structures for the vitamin B 12 ► ABC transporter in the 
absence (PDB code 1L7V) and presence of the binding protein 
BtuF (PDB code 2QI9). No structural data exist for the ATP- 
bound state of the complex. The R1 side chains attached at the 
cytoplasmic and periplasmic gates (positions 141 and 168, 
respectively) are calculated and visualized using the MMM 
software (http://www.epr.ethz.ch/software/index) based on 
a rotamer library approach (Polyhach et al. 2011). On the right, 
cw EPR spectra detected at room temperature on the transporters 
reconstituted in liposomes showing the different dynamics of the 
gates upon BtuF binding, and after the conformational change 
induced by ATP binding. The decrease in mobility for position 
168 upon BtuF binding and the further spin-label immobilization 
in the ATP state is apparent from the spectra. Opposite effects 
are observed in the cytoplasmic region of the protein (position 
141) giving insights in the gate mechanism (Reproduced exper¬ 
imentally according to Goetz et al. 2009) 


lineshape analysis can be complemented with the mea¬ 
sure of the accessibility of the nitroxide probe toward 
the bulk water and the membrane bilayer. Several EPR 
techniques are available to obtain this important piece 
of information. Advantages and disadvantages of each 
approach will be described in the following. The most 
common technique is power saturation cw EPR, which 
consists in detecting the EPR spectra at increasing 


microwave power to induce saturation (Altenbach 
et al. 1990). The amplitude of the central line is then 
plotted versus the square root of the incident micro- 
wave power, and the saturation curve is analyzed to 
obtain the empirical P 1/2 value (the power at which the 
intensity of the line is half of what it should be in the 
absence of saturation). The P 1/2 value in a system 
described by the Bloch equations is inversely 
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proportional to the product of the effective relaxation 
times Ti-T 2 . The saturation behavior is then strictly 
dependent on nitroxide relaxation, which in turn is 
characteristic of the microenvironment of the spin 
label and can be modified by addition of paramagnetic 
species soluble in the water or lipid phases. The satu¬ 
ration curve is detected in gas-permeable TPX capil¬ 
laries first under N 2 atmosphere, second in the presence 
of oxygen (O 2 preferentially accumulates in the apolar 
membrane bilayer), and third in the presence of water- 
soluble fast relaxing paramagnetic agents such as 
chromium oxalate (negatively charged) or Ni-EDDA 
(neutral). Collisions with the paramagnetic reagents 
increase the nitroxide relaxation rates via Heisenberg 
exchange (predominant mechanism) which is propor¬ 
tional to the reagent concentration [R]. 



The difference in P 1/2 in the presence and absence of 
paramagnetic species reflects the relative collision fre¬ 
quency or the relative “accessibility” of the spin 

labels toward the exchange reagent. 

A p _ pt? _ pO ^ _ ^ ^ 

1/2 1/2 1/2 ~ ji ) 

T 2 being much smaller than the encounter complex 
lifetime, one can assume that T 2 ^ « T 2 °. The T 2 
dependence of P 1/2 is accounted for by dividing the 
Pi /2 values by the EPR linewidth, and normalizing 
them to a standard measurement to obtain the param¬ 
eter n (Bordignon and Steinhoff 2007). 

One example of differential effect of oxygen and 
Ni-EDDA is provided in Eig. 4a. The trend of accessi¬ 
bility on a site scan can reveal periodical patterns, thus 
elements of secondary structures. Advantages of this 
method are: (1) fast measurements on commercial cw 
spectrometers, (2) little amount of sample required, 
and (3) simple data analysis. Disadvantages are: 

(1) indirect effects of paramagnetic quenchers are 
measured instead of the direct effects of water, 

(2) the water soluble reagents are bulky and do not 
sample the same cavities as the water molecules, 

(3) some reagents are charged, and this can affect the 


approach toward charged membrane bilayers, (4) when 
multiple spectral components are present, only aver¬ 
aged accessibility can be obtained, and (5) calibration 
with a standard required. 

A second technique consists in the direct measure of 
the changes induced by paramagnetic agents in the 
spin-lattice relaxation T 1 by saturation recovery 
(Pyka et al. 2005). In this case an intense saturation 
pulse of microwave is delivered at room temperature at 
a frequency corresponding to the central EPR line and 
the return of the spectral intensity is monitored with 
a weak cw observing microwave held. Advantages are: 
(1) direct measurement of Tj, (2) selective Ti mea¬ 
surement on different spectral components. Disadvan¬ 
tages are: (1) dedicated pulse EPR spectrometer 
required, (2) poor long-term stability of the 
experiment. 

A third approach is to detect directly the accessibil¬ 
ity of the nitroxide toward deuterated water molecules 
by comparing the amplitude of the ESEEM oscillations 
in low-temperature experiments (Volkov et al. 2010). 
The advantages are: (1) direct measurement of the 
water molecule accessibility, (2) high sensitivity, and 
(3) high reproducibility. Disadvantages are: (1) cryo¬ 
genic temperature, (2) difficulty to distinguish buried 
protein sites from membrane embedded ones. 

A fourth approach measures by NMR the local 
hydration dynamics around nitroxides at room temper¬ 
ature via Overhauser dynamic nuclear polarization 
(DNP). Key steps to pave the way for the application 
to spin-labeled proteins were performed by the group 
of Han (Armstrong and Han 2009). Advantages of this 
new approach could be: (1) direct measurement of the 
water accessibility, (2) highly amplified proton nuclear 
signal. Disadvantages: (1) DNP dedicated setup 
required, (2) nonconventional methodology, thus 
absence of standard calibration data, and (3) difficulty 
to distinguish buried protein sites from membrane- 
embedded ones. 

Polarity and Proticity of the Spin-Label 
Microenvironment 

Analogously to the well-known change in fluorescence 
of a chromophore due to a change in solvent polarity, 
the Azz principal value of the '"^N hyperflne tensor, and 
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Spin-Labeling EPR of Proteins: Dynamics and Water 
Accessibility of Spin-Label Side Chains, Fig. 4 Accessibility 
and polarity of the nitroxide microenvironment, (a) Left, sche¬ 
matic view of a transmembrane protein with a spin label pointing 
toward the bulk water. The concentration gradients of the para¬ 
magnetic reagents are depicted with the arrows. Middle, exam¬ 
ple of three saturation curves. The dotted line is the linear 
regression of the first points of the curves. Right, schematic 
drawing of an alpha-helical periodical pattern which can be 


identified by a site scanning over the helix length, (b) Left, plot 
of gxx versus for few positions in bacteriorhodopsin. Position 
162, water exposed and 46 in the middle of the protein channel, 
are highlighted. The other points in the plot are detected for 
nitroxides in different solvents (Adapted from Brutlach et al. 
2006). Middle, W-band cw spectra detected at 160 K for the two 
highlighted residues in the structure {right panel, PDB 3HAO). 
The spectral changes due to decrease in polarity are indicated 
by arrows 


the gxx principal value of the g tensor vary with polarity 
(electric field effects) and proticity (propensity to form 
H-bonds). Generally, a polar environment shifts to 
higher values (for MTSSL = 3.3-3.7 mT going 
from apolar-aprotic to polar-protic solvents) whereas 
gxx is decreased (for MTSSL gxx = 2.0091-2.0081, 
respectively) (Bordignon and Steinhoff 2007). The A^^ 
component can be easily obtained from low tempera¬ 
ture cw X-band EPR spectra of spin-labeled proteins in 
frozen samples. In contrast, the principal g-tensor com¬ 
ponents and their variation due to solute-solvent inter¬ 
actions can be determined with high accuracy only at 
W band or higher frequencies (Fig. 2a). Theoretically, 
both gxx and A^^ are expected to be linearly dependent 


on p,c°, the n spin density on the oxygen atom of the 
nitroxide group on account of the sum condition 
Pn+Pn = 1 : 


4 ~ 

Azz — UnPr. 


Sxx — + ■ 


no)p^^c% 

AF 


where = 2.0023 is the free-electron g value; ^(O) is 
the oxygen spin-orbit coupling parameter; is the 
LCAO coefficient of the 2py atomic orbital 
contributing to the oxygen lone-pair MO; and AE„„* 
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is the n ^ 71* excitation energy. The spin density on 
the nitrogen atom increases in the presence of an elec¬ 
tric held along the N-0 bond direction. This together 
with changes induced in the lone-pair orbital energy 
leads to the polarity dependence described above. The¬ 
oretical calculations supported by experimental evi¬ 
dences showed the remarkable dependence of gxx 
with H-bonding to the N-0 group. By plotting gxx 
versus extracted from low-temperature W-band 
spectra, one can easily recognize polarity and proticity 
prohles of sequentially spin-labeled proteins (Fig. 4b). 
It is worth mentioning that the gxx region of the W- 
band spectra is usually more complex than it would be 
if just one gxx component were present. In fact, both in 
spin-labeled lipids and proteins, respectively, up to two 
or three gxx components were observed, indicating 
different degrees of H-bonding to the N-0 group. 

Summary 

Site-directed spin-labeling EPR gives detailed informa¬ 
tion on the unnatural side chain carrying the unpaired 
electron in terms of dynamics and water accessibility. 
The information obtained can be related to the dynamic 
properties of the protein segment at which the spin label 
is attached, giving insights into protein structural ele¬ 
ments and their modification during protein activity 
under conditions close to the physiological ones. More¬ 
over, the sensitivity of the hyperfine and g tensor on 
polarity and H-bonds to the nitroxide moiety gives 
a unique opportunity to monitor water permeation pro¬ 
files in membrane proteins and correlate conforma¬ 
tional changes to water rearrangements. 
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Introduction 

The technique of spin trapping was first introduced as 
a term in the paper by Janzen and Blackburn published 
in the Journal of the American Chemical Society in 
1969 (Janzen and Blackburn 1969) following the pub¬ 
lication of an earlier paper in the same journal (Janzen 
and Blackburn 1968). The technique they developed 
was the use of an unsaturated compound (C-nitroso or 
nitrone) to react with the radical(s) to be detected via 
a free radical addition reaction to give a new radical 
with a longer lifetime, known as a spin adduct. The 
unsaturated compound used to trap the radicals is 
known as the spin trap. The history and development 
of the technique is covered in a review article (Janzen 
and Haire 1990). The development of this technique 
has greatly facilitated the detection of radical interme¬ 
diates, which would otherwise be difficult to detect. 
The methods commonly employed with ESR 


spectroscopy are either the generation of a high steady 
state radical concentration by continuous irradiation, 
or by use of continuous flow of aqueous reagents or the 
stabilization of radical intermediates by use of low 
temperatures. Spin trapping has greatly aided the 
detection of radical intermediates in aqueous solution 
at room temperature, and using small reaction volumes, 
which is an advantage for studies of biological systems 
where the material generally available for study 
can often be limited in supply (reviewed Davies and 
Timmins 1996; Burkitt 2004). 

Theory and Applications 

The choice of spin trap is usually a C-nitroso 
compound for chemical systems and nitrones for 
biological systems. Typical nitroso compounds in 
common use are 2-methyl-2-nitrosopropane (MNP), 
and 3,5-dibromo-4-nitroso-benzenesulfonic acid 
(DBNBS). 

Nitroso Spin Traps 

The reaction of a radical X‘ with a nitroso compound 
results in the addition of the radical to the nitrogen 
atom with the formation of an aminoxyl or nitroxide 
free radical as shown below in reaction Scheme 1. 

The direct bonding of the added radical to the nitro¬ 
gen atom of the nitroso spin trap enables the interac¬ 
tions due to the chemical structure of the added radical 
(hyperhne interactions) to be observed in the ESR 
spectrum. Spin adducts of nitroso compounds, by con¬ 
trast with nitrones, give more information about the 
structure of the primary radical trapped due to the 
direct attachment of the radical to the aminoxyl nitro¬ 
gen, resulting in some spin density being delocalized 
on to the added radical. The spectrum is characterized 
by a triplet (three line) splitting due the interaction of 
the nitrogen nucleus (spin 1) with the unpaired elec¬ 
tron, which is delocalized between the nitrogen and 
oxygen atoms. The number of lines resulting 
from interaction of an unpaired electron with a spin 
I is 21 -I- 1. The hyperhne splitting is described as a(N) 
as shown in Fig. 1. 

The trapping of a radical via a -CH moiety results in 
a 1:1 doublet splitting of the nitrogen splitting on the 
basis of a simple interaction with one spin Vi nucleus, 
and resulting in six lines of equal intensity. The 
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Spin-Trapping: Theory and Applications, Scheme 1 Reac¬ 
tion of a nitroso spin-trap with a radical X to form a nitroxide 
spin-adduct 

^ a(N) ^ 





Spin-Trapping: Theory and Applications, Fig. 1 Stick dia¬ 
gram to indicate the hyperfine splitting when an unpaired elec¬ 
tron of a nitroxide radical is spin-coupled to the nuclear spin of 
the adjacent imminoxyl nitrogen 

nitrogen splitting a(N) is the distance between the 
outer lines of each doublet as shown in Fig. 2, and 
a splitting due to the hydrogen interaction is a(H). 

However, additional nuclei are more complicated, 
and the spectrum depends on the radical structure. For 
example, if a methyl group is bonded to the spin trap, 
then the three equivalent methyl protons (each spin I/ 2 ) 
result in the further splitting of the nitrogen lines into 
1:3:3; 1 quartet resulting overall in a quartet of triplets 
as shown in Fig. 3. 

For simplihcation, the 1:3:3:1 methyl proton split¬ 
ting arises due to the possibility of one situation where 
the three protons are spin up; one situation where they 
are all spin down; three of two spin up and one spin 
down arrangements (all equivalent energy transitions) 
and three of two spin down and one spin up (again all 
equivalent). The trapping of a CH 3 CH 2 ' radical results 
in a further 1:2:1 triplet splitting of the three nitrogen 
lines due to the combinations of Hi spin up/ H 2 spin 
down and H 2 spin up/ Hi spin down being equivalent 
(Fig. 4). 

Nitrones 

Nitrone spin traps differ from nitroso spin traps in their 
reaction with the radical at a carbon atom alpha to the 


^ a(N) ^ 


Spin-Trapping: Theory and Applications, Fig. 2 Stick dia¬ 
gram to indicate the hyperfine splitting when the unpaired elec¬ 
tron of a nitroxide radical is spin-coupled to the nuclear spins of 
the alpha-nitrogen and beta-hydrogen of a nitroso-trapped — CH 
moiety 



Spin-Trapping: Theory and Applications, Fig. 3 Stick dia¬ 
gram to indicate the hyperfine splitting when the unpaired elec¬ 
tron of a nitroxide radical is spin-coupled to the nuclear spins of 
an alpha-nitrogen and the beta-hydrogens of a nitroso-trapped 
—CH 3 moiety 


aminoxyl nitrogen. This is illustrated by the reaction 
shown in Scheme 2. The unpaired electron spin density 
is now one atom removed from the bonded radical 
and this results (usually) in no resolvable hyperfine 
interactions from the trapped radical but solely 
a 1:1:1 triplet from the nitrone nitrogen further 
split into doublets by the P-hydrogen located on 
the spin trap. Typical nitrone spin traps in 
use are 5,5-dimethyl-1-pyrroline-N-oxide (DMPO), 
N-t-butyl-a-phenylnitrone (PEN), and a-(4-pyridyl-l- 
oxide)-N-t-butyl-phenylnitrone (POBN). 
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Spin-Trapping: Theory and Applications, Fig. 4 Stick dia¬ 
gram to indicate the hyperfine splitting when the unpaired elec¬ 
tron of a nitroxide radical is spin-coupled to the nuclear spins of 
an alpha-nitrogen and the beta-hydrogens of a nitroso-trapped 
—CH 2 CH 3 moiety 



Nitroxide radical (spin-adduct) 
DMPO - 5,5-dimethyl-1-pyrroline-A/-oxide 

Spin-Trapping: Theory and Applications, Scheme 2 Reac¬ 
tion of DMPO with a radical R to give a nitroxide spin-adduct 


Advantages and Disadvantages of Nitroso Versus 
Nitrone Spin Traps 

Nitroso compounds tend to be more toxic than 
nitrones, the lifetimes of the spin adducts tend to he 
short and artifacts can he a problem due to their pho¬ 
tosensitivity. They are used to detect carbon-centered 
radicals, but heteroatom radicals (e.g., O and S) are 
difficult to detect due to short lifetimes of the spin 
adducts. Nitroso compounds generally absorb light in 
the 350^50 nm range, making their use in photolytic 
systems problematic. Photolysis of the C-N bond in 
MNP results in the formation of di-t-butyl nitroxide 
radicals from reaction of a t-butylnitroxide radical with 
MNP. Nitroso compounds can also undergo nonradical 
“ene” reactions, to form ultimately nitroxides, partic¬ 
ularly in lipid-rich environments. Their particular 
advantage is that they give more information on radical 


structure than nitrones, hut the magnitude of the cou¬ 
pling constants are very solvent dependent making 
comparison of spectra obtained using different solvent 
systems difficult. By contrast, nitrones trap a wider 
range of radicals with less associated artifacts. The 
radical is identified, however, by the P-H splitting 
which as indicated is relatively invariant for the spin 
traps PBN and POBN. The nitrone spin trap of choice, 
particularly for studies of biological systems, is DMPO 
where the P-H splitting is strongly dependent on the 
type of atom center of the radical trapped. For exam¬ 
ple, carbon adducts are typically 2.0-2.4 mT, oxygen 
adducts 1.0-1.5 mT, and sulfur adducts 1.4-1.8 mT 
(Fig. 5). DMPO is often the choice of trap for detecting 
oxygen radicals, particularly in biological systems. 
The hyperfine splittings of a wide range of radical 
adducts of commonly used spin traps are comprehen¬ 
sively summarized in a review (Buettner 1987). 

Rates of Trapping Versus Decay 

For a spin trap to be effective at stabilizing radicals, it 
is necessary that the rate of trapping is high and rate of 
decay low (high stability of the adduct). Some rates of 
trapping of different radicals by the spin trap DMPO 
are shown in Fig. 6, along with measured half-lives 
(Finkelstein et al. 1980). DMPO reacts most rapidly 
with carbon-centered radicals and the hydroxyl radi¬ 
cal, and the associated adducts are relatively stable and 
have a long life being readily detected in both model 
in vitro systems and in cells and tissues, up to hours or 
even (for the hydroxyl adduct) days after the initiation 
of the oxidative insult. 

Other radicals spin-trapped relatively rapidly and 
sufficiently stable as a radical adduct to be observed 
during the ESR experiment are alkoxyl and peroxyl 
radicals. Alkoxyl adducts are trapped more readily 
than alkyl-peroxyl adducts and can be the predominant 
species in peroxidizing lipids, which generate peroxyl 
radical intermediates. Alkyl radical adducts result 
from the decay of initially formed peroxyl radical 
adducts. Detection of alkoxyl adducts can therefore 
be indicative of the prior formation of peroxyl adducts 
in a biological system (Davies and Slater 1987; 
Dikalov and Mason 1999). 

A radical frequently studied in biological systems is 
the superoxide radical, which at physiological pH is 
not protonated. Both the rate of trapping of this radical 
by DMPO and the stability of the spin adduct at phys¬ 
iological pH are low, making the superoxide adduct 
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Spin-Trapping: Theory and 
Applications, Fig. 5 Stick 
diagram to indicate the 
hyperfine splitting when 
oxygen, sulphur and carbon- 
centred radicals are trapped by 
DMPO. The spectrum 
comprises a triplet of doublets 
due to the spin-coupling of the 
unpaired electron with the 
nuclear spins of nitrogen and 
hydrogen of DMPO. The beta- 
hydrogen splitting varies with 
the nature of the atom centre 
trapped 


Oxygen-adduct a(H) = 1.0 - 1.5 mT 


a(N) a(H) 


Sulphur-adduct a(H) = 1.4 - 1.8 mT 

a(N) "" ^i(H) ^ 


I I I I I Carbon-adduct a(H) = 2.0 - 2.4 mT 
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* In aqueous solution 

Spin-Trapping: Theory and Applications, Fig. 6 Measured 
rate constants (moP* dm^ s“*) for the spin-trapping of different 
radicals with DMPO (Finkelstein et al. 1980) 


difficult to detect in biological systems with DMPO. 
The protonated form, however, is both more rapidly 
trapped and stable and thus readily detected at low pH 
(Haywood and Linge 2004) as discussed in the next 
section. Thus, although DMPO is very useful for 
detecting and distinguishing oxygen radicals, it is 
important to bear in mind the different rates of trapping 
and adduct decay. By contrast with carbon and oxygen 
radicals, sulfur-centered radicals are more difficult to 
detect in biological systems (Buettner and Oberly 
1978). While they are readily trapped, the decay can 
be very rapid and occur over the duration of the scan. 
This instability of sulfur-centered adducts is often used 
to distinguish them from the more stable hydroxyl 


adduct, since sulfur adducts, particularly glutathione, 
can have similar hyperfine splittings to hydroxyl 
adducts and can be difficult to distinguish. 

Biological Applications of Spin Trapping 

The application of spin trapping to biology has grown 
considerably in recent years, and as mentioned above, 
the nitrone spin trap is the choice of spin trap for 
detecting oxygen radicals, which are most frequently 
studied in biological systems. A typical example of the 
effectiveness of DMPO in the trapping of oxygen 
radicals is shown by the spectrum in Fig. 7, which 
was obtained during the UVA irradiation of DL-dopa 
melanin and DMPO at pH 4.5 (Haywood and Linge 
2004). The hydroxyl adduct of DMPO (*) is partly 
obscured by the hydroperoxyl (protonated superoxide) 
adduct (marked with the stick diagram) but the outer 
lines are clearly separated and distinct. The hydroxyl 
adduct is characteristically a four line 1:2:2:1 spec¬ 
trum, due to the nitrogen hyperfine splitting a(N) and 
P-hydrogen splittings being equal and 1.49 mT 
(14.9 G). The outer lines of the hydroxyl adduct always 
occur at low and high field of the lines due to the 
superoxide adduct, since the latter hyperfine splittings 
(a(N) = 1.49 mT, a(H) =1.12 mT) are more charac¬ 
teristic of peroxyl radical adducts where a(N) > a(H) 
and six lines are detected. 

The presence of a hydroxyl adduct is, however, not 
always indicative of the actual trapping of the hydroxyl 
radical. A hydroxyl adduct can form in a number of 
ways other than via the trapping of OH', for example. 
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Fig. 7 Electron spin 
resonance spectrum of 
synthetic DL-dopa melanin 
and DMPO (180 M) in pH 4.5 
buffer during UVA-irradiation 


2.0 mT 


atMt atR-Hi 



from the decay of the superoxide radical adduct at pH 7 
(Buettner and Oberly 1978), from the reaction of the 
trap with singlet oxygen (which is not a radical spe¬ 
cies) (Nishizawa et al. 2004), and from oxidation of 
DMPO itself which result in the formation of DMPO 
radical cations which undergo nucleophilic 
addition reactions with water (Chignell et al. 1994). 
Also apparent in the spectrum in Fig. 7 is a y-hydrogen 
splitting (a(y-H) = 0.12 mT) from the proton of the 
hydroperoxyl radical, which is not often observed in 
nitrone adducts. Each of the six lines is further split 
into doublets due to this interaction. This spectrum was 
obtained by the author at pH 4.5 where the protonated 
superoxide radical is more stable than at physiological 
pH. Its biological relevance here is the putative acidic 
pH of the melanosome where melanin is synthesized, 
and this circumvents the problems of superoxide detec¬ 
tion at physiological pH where the superoxide adduct 
is unstable and rapidly decays to an adduct resembling 
the hydroxyl adduct. Being able to detect the 
hydroperoxyl radical in this system has enabled 
quantification of its photoformation in UVA-irradiated 
melanosomes of different pigmentation levels 
(Haywood et al. 2008a). 

Spin Trapping Macromolecules 

The rotational freedom of a nitroxide radical 
covalently bonded to the radical in a spin adduct has 
a significant influence on the ESR spectrum of 
the radical adduct. The spectrum shown above 
(Fig. 7) of the hydroperoxyl adduct of DMPO is an 
isotropic spectrum and reflects the trapping of here 
a low-molecular-weight radical species. This type 
of spectrum can also result from the trapping of 
high-molecular-weight species if the nitroxide radical 
is located on a site of the radical whereby it can 
move freely. If, however, its motion is restricted. 


a , 3.0 mT 


Anisotropic protein- 









Spin-Trapping: Theory and Applications, Fig. 8 Electron 
spin resonance spectra of skin samples of different melanin 
pigmentation and DMPO recorded subsequent to exposure to 
solar-simulated irradiation (15-30 min) (a) Caucasian (b) inter¬ 
mediate pigmentation and (c) Afro-Caribbean 


then the spectrum broadens and is anisotropic. 
This is often observed with radical adducts of 
high-molecular-weight biological molecules such as 
proteins and lipids. While the information obtained 
directly about the trapped radical is less (unless the 
trap is DMPO) the spectra of proteins, for example, are 
distinctive in mixtures of biological radicals and 
enables their distinction. This is shown in Fig. 8 in 
the spectra of UV-irradiated human skin samples of 
different levels of melanin pigmentation. Intact tissue 
is readily loaded with DMPO by simple incubation of 
the tissue in the spin trap in a phosphate buffer solu¬ 
tion. The presence of protein and lipid radical adducts 
is apparent by the presence of anisotropic and 
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relatively isotropic spectra, the latter of oxygen 
adducts. Using this technique, it was possible to dem¬ 
onstrate a broad decline in total formation of radical 
adducts in irradiated skin of different racial origin and 
increasing levels of melanin pigmentation. The broad 
anisotropic protein radicals can be distinguished from 
the isotropic lipid radical adducts, which here are mov¬ 
ing relatively rapidly. Also illustrated here is the 
decline in total radical adduct formation with increase 
in melanin pigmentation in the skin samples, highlight¬ 
ing the usefulness of spin trapping for studying UV 
photo-damage to skin (Haywood et al. 2008b, 201 1). 

Although anisotropic spectra of immobilized 
nitroxides do not contain much information about the 
radical trapped, it is a unique characteristic, however, 
of immobilized DMPO nitroxides that the P-H splitting 
pattern is retained in the anisotropic spectrum (Davies 
et al. 1991, 1993). This enables it possible to establish 
whether a carbon-, sulfur-, or oxygen-atom center of 
the radical has been trapped. 

This is illustrated by protein radical adducts of 
DMPO, which have been formed by the UVA irradia¬ 
tion of bovine serum albumin (BSA) and tendon col¬ 
lagen and elastin proteins. For BSA, the P-H splitting is 
predominantly that of a sulfur-centered radical adduct 
(1.8 mT), as is that of tendon collagen (shown in 
Fig. 9a). However, in elastin, spectra with both larger 
splitting of 2.1 mT (shown in Fig. 9b) and smaller 
(1.8 mT) splittings are detected dependent upon the 
spin trap concentration used suggesting the trapping of 
proteins via different radical centers of predominantly 
sulfur and carbon, respectively. These differences 
enabled the anisotropic spectrum in UVA-irradiated 
skin to be deduced to correspond to damage to collagen 
in the skin (Haywood et al. 2011). 

In summary, spin trapping as a technique to stabi¬ 
lize transient radical intermediates has found wide¬ 
spread application, particularly to studies of 
biological systems. While nitroso compounds can 
give more information about the radical trapped than 
nitrones, effectively trapping carbon-centered radicals 
but less effectively oxygen and sulfur heteroatom cen¬ 
ters, they are subject to artifacts particularly in photo- 
lytic systems. Nitrones are less associated with 
artifacts trapping carbon-, sulfur-, and oxygen- 
centered radicals but (with the exception of DMPO) 
give less information about the trapped radical than 
nitroso compounds. The most commonly used spin 
trap is the nitrone DMPO, which enables the 



Spin-Trapping: Theory and Applications, Fig. 9 ESR spec¬ 
tra obtained during the board spectral (solar) irradiation of (a) 
Tendon collagen (b) Bovin elastin pre-incubated with DMPO 


distinction of both low- and high-molecular-weight 
carbon, oxygen, and sulfur radicals on the basis of 
wide variation in P-H splitting. It is also amphiphilic 
and readily distributed to aqueous and nonaqueous 
compartments of cells, tissues, and molecular compo¬ 
nents of biological systems. While considerable work 
has been undertaken in the trapping and characteriza¬ 
tion of protein and lipid radicals, the trapping of DNA 
radicals has proved more elusive although the trapping 
of possible fragments of DNA oxidation have been 
reported (Hawkins et al. 2002; Haywood et al. 2011). 
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in Biological Systems 

► EPR Spectroscopy: General Principles 

► Pulsed EPR: Principles and Examples of 
Applications to Hemeproteins 

► Spin Labeling of DNA and RNA 

► Spin-Labeling EPR of Lipid Membranes 

► Spin-Labeling EPR of Proteins: Dynamics and 
Water Accessibility of Spin-Label Side Chains 
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Introduction 

Protein-protein interactions (PPI), which play an 
essential role in all aspects of biology, create a great 
diversity of macromolecular assemblies that mirrors 
the diversity of life itself. The study of PPI is of 
major interest to biology and health science, and 
many genetic, biochemical, and biophysical tools are 
available to detect and characterize them both in vivo 
and in vitro. Here, we report features that are observed 
in the three-dimensional structures of protein assem¬ 
blies, and concern geometric, physical-chemical, and 
evolutionary aspects of the interfaces between poly¬ 
peptide chains engaged in PPI. We also describe how 
these features can be used to predict the binding sites 
and build models of the assemblies, and present an 
assessment of the prediction methods. 

Properties of Protein-Protein Interfaces 

Three-Dimensional Structures of Protein 
Assemblies 

The Protein Data Bank (PDB, Table lA) stores the 
information on the three-dimensional structure of bio¬ 
logical macromolecules in the form of atomic coordi¬ 
nates issued from high-resolution X-ray 
crystallography and NMR studies. Whereas large 
assemblies such as the ribosome are comparatively 
few in the PDB, there are hundreds of crystal structures 
of binary protein-protein complexes, and thousands of 
oligomeric proteins in it that illustrate PPI at the atomic 
level, and reveal the physical interactions that take 
place at the interfaces between polypeptide chains 
(Jones and Thornton 1996; Kleanthous 2000; Janin 
and Wodak 2002; Russell et al. 2004; Janin et al. 
2008). Together with the desolvation of the interfaces, 
these interactions govern the stability of the assembly, 
and they give it the specihcity that its function requires. 
In a protein-protein complex, the PPI is non-obligate or 
► transient, as it occurs between pre-existing proteins; 
examples are an enzyme and a protein substrate or 
inhibitor, an antibody and a cognate protein antigen, or 
protein members of the same signal transduction path¬ 
way. An oligomeric protein is a protein with typically 
2-12 polypeptide chains, often homomeric (identical in 
sequence). Its assembly, commonly called a quaternary 
structure, rarely coexists with the monomer, so that the 
PPI is generally considered as obligate. 


The great majority of the relevant PDB entries are 
crystal structures, and in a crystal, the contacts that 
define the quaternary structure coexist with other that 
hold the crystal together. They may not exist in solu¬ 
tion and have no biological relevance. Similarly, func¬ 
tional interactions (PPI) coexist in a cell with all sorts 
of nonspecific contacts. Bona fide PPI and nonspecific 
contacts are of the same physical-chemical nature, and 
setting them apart may not be a trivial task, as the 
needed experimental data in solution may not exist, 
especially in high-throughput structural analysis. 
A number of Web servers and databases deal with 
the question (Table IB). PiQSi offers a survey of the 
biochemical literature on oligomeric proteins in the 
PDB, which is very valuable albeit far from compre¬ 
hensive. PQS, which covers the whole PDB, propose 
“probable quaternary structures” based on geometric 
and physical-chemical parameters derived from the 
atomic coordinates. The PDB itself shows “biological 
assemblies” that are taken either from PQS or from the 
authors of the structure. The assembly models of both 
PQS and the PDB must be used with caution, and their 
error rate is above 15% when they are assessed against 
curated sets. 

Geometry 

The buried surface area (BSA) is a convenient measure 
of the size of the interface between two (macro)mole- 
cules. The BSA is the area of the solvent accessible 
protein surface (ASA) lost in the assembly relative to 
the individual subunits. It is proportional to the number 
of water molecules released from contact with the 
protein surface, of amino acid residues and of protein 
atoms that form the interface, and also the number of 
atom-atom or residue-residue pairs in contact. The 
PISA database (Table IB) reports the ASA of all the 
protein chains in the PDB, and the BSA of all the 
pairwise contacts including the crystal packing 
contacts. 

Table 2 cites the average values of the BSA 
in curated sets of homodimeric proteins and protein- 
protein complexes. The homodimers, which are mostly 
obligate assemblies, have interfaces about twice as 
large as the complexes, and six times as large as the 
average crystal packing contact. A majority of the non- 
obligate interfaces bury 1,200-2,000 A^ and implicate 
17-30 residues per subunit, and a majority of the 
obligate ones, 1,500—4,000 A^ and 20-60 residues. 
However, the standard deviations are all large, and 
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Statistical Analysis and Prediction of Protein-Protein 
Interactions and Binding Sites, Table 1 PPI in databases 
and servers 


A. Protein structure 

Protein Data Bank 
(PDB) 

http://www.rcsb.org/pdb/ 

B. Derived data on quaternary structure and protein assemblies 

Biounit (PDB) 

http://www.rcsb.org/pdb/ 

ProtBuD 

http://dunbrack.fccc.edu/ProtBuD/ 

PQS 

http://pqs.ebi.ac.uk/ 

PISA 

http ://w w w .ebi. ac. uk/msd-srv/prot int/ 

3D Complex 

http://3dcomplex.org/ 

PiQSi 

http://www.piqsi.org/ 

C. Protein-protein interface analysis and comparison 

PROTORP 

http://www.bioinformatics.sussex.ac.uk/ 

protorp/ 

SCOPPI 

http://141.30.193.6/scoppi/ 

PiBase 

http://modbase.compbio.ucsf.edu/pibase/ 

Dockground 

http://dockground.bioinformatics.ku.edu/ 

3DID 

http://3did.irbbarcelona.org 

IBIS 

http://www.ncbi.nlra.nih.gov/Structure/ 

ibis/ibis.cgi 

MAPPIS 

http://bioinfo3d.cs.tau.ac.il/MAPPIS/ 

D. Protein-protein docking 

CAPRI experiment 

http://capri.ebi.ac.uk/ 

Zdock benchmark 

set 

http://zlab.bu.edu/zdock/benchmark. 

shtml 

E. Docking servers and software 

FTDock server and 
software 

http://www.sbg.bio.ic.ac.uk/docking/ 

ftdock.html 

DOT 

http://www.sdsc.edu/CCMS/Papers/ 
DOT sc95.html 

ZDock server and 
software 

http://zlab.bu.edu/zdock/ 

PIPER and ClusPro 

sei'ver 

http://cluspro.bu.edu/login.php 

GRAMM-X server 

http://vakser.bioinformatics.ku.edu/ 

resources/gramm/grammx/ 

pyDOCK server 
and software 

http://inb.bsc.es/gn6/PyDock/ 

HEX server and 
software 

http://www.loria.fr/~ritchied/hex/ 

ICM-DISCO 

http://www.molsoft.com/icm pro.html 

ATTRACT 

http://www.ibpc.fr/chantal/www/ptools/ 

RosettaDock server 
and software 

http://graylab.jhu.edu/docking/rosetta/ 

HADDOCK server 
and software 

http://www.nmr.chem.uu.nl/haddock/ 

3D-Garden server 

http://www.sbg.bio.ic.ac.uk/3dgarden 

PatchDock server 
and software 

http://bioinfo3d.cs.tau.ac.il/ 

SKE-DOCK server 

http://www.pharm.kitasato-u.ac.jp/bmd/ 
files/SKE DOCK.html 


{continued) 


Statistical Analysis and Prediction of Protein-Protein 
Interactions and Binding Sites, Table 1 (continued) 


F. Binding site prediction 

Evolutionary Trace 

http ://maramoth.bcm. tmc .edu/ETserver. 
html 

ProMate 

http://bioinfo.weizmann.ac.il/promate/ 

PRISM 

http://prism.ccbb.ku.edu.tr/prism/ 

PINUP 

http://sparks.inforraatics.iupui.edu/ 

PINUP/ 

PPI-Pred 

http://bmbpcu36.leeds.ac.uk/ppi pred/ 

PIER 

http://abagyan.ucsd.edu/PIER/ 

PPISP 

http://pipe.scs.fsu.edu/ppisp.html 

WHISCY 

http://nmr.chem.uu.nl/Software/whiscy/ 

SPIDDER 

http://sppider.cchmc.org/ 

ISIS 

http://cubic.bioc.columbia.edu/services/ 

isis/ 


individual values spread widely. There are homodimer 
interfaces that bury over 20,000 and implicate 300 
residues per subunit, and small ones with afiSA of only 
800-1,000 A? and 10 residues, like a crystal packing 
contact. In addition to their size, the databases of 
Table 1C report several geometric properties of bio¬ 
logical interfaces, such as their planarity and their 
connnectivity. Protein-protein interfaces tend to be 
planar, unlike the binding sites for small molecules 
that are often pocket-shaped. When a plane is htted 
through the interface atoms, the least-square distance 
to the plane is generally greater in homodimers than 
transient complexes (Jones and Thornton 1996), but 
this may just reflect the larger size of their interfaces. 
The connectivity describes how the interface atoms 
and residues are distributed on the protein surface. 
After grouping neighboring atoms or residues into 
patches with a geometric algorithm, nearly all biolog¬ 
ical interfaces with a BSA less than 2,000 A^ form 
a single connected patch. Larger interfaces are multi¬ 
patch, and often each patch implicates a different struc¬ 
tural domain in a multi-domain protein (Janin et al. 
2008). 

Chemical and Amino Acid Composition 

The chemical and the amino acid compositions of an 
interface may be evaluated by counting atoms and 
residues, or their contribution to the BSA (Table 2). 
Relative to the protein surface, the biological inter¬ 
faces are depleted of charged groups like the protein 
interior albeit less completely. Those of the 
homodimers, but not the transient complexes, are also 
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Statistical Analysis and Prediction of Protein-Protein Interactions and Binding Sites, Table 2 Observed properties of 
protein-protein interfaces 

Interface parameter 

Homodimers" 

Protein-protein complexes'’ 

Number of entries in data set 

315 

70 

Size*’ 

BSA (A^) 

3570 ± 2500 

1910 ± 760 

Amino acids at interface 

96 ±64 

57 ± 22 

Composition'' 

Nonpolar/ neutral /charged 

65/22/13 

58/28/14 

Polar interactions® 

H-bonds per interface 

Interface waters 

17 ± 15 

45 ±42 

10 ±5 

20 ± 13 

Conservation in evolution^ 

Shannon entropy, core/rim ratio 

0.87 ± 0.5 

0.82 ± 0.3 


“Data from Dey et al. (2010) 

'’Data from Janin et al. (2008) 

‘^BSA and number of interface residues per homodimer or complex it standard deviation; note that the PISA database (and many 
publications) reports BSA values per subunit, that is BSAI2; all amino acid residues that contribute to the BSA are counted as part of 
the interface 

‘'Percentage of the BSA contributed by nonpolar, neutral polar, and charged chemical groups. All carbon-containing groups count as 
nonpolar; oxygen-, nitrogen-, or sulfur-containing groups as polar; or charged in Lys, Arg, Asp, and Glu side-chains 
“Number ± standard deviation of the H-honds bridging the subunits and the water molecules in contact with hoth subunits 
^Ratio ± standard deviation of the average Shannon entropies of the core and rim residues in the interfaces of 121 homodimers and 52 
protein-protein complexes; core and rim are dehned as in Fig. 1; the Shannon entropy is s(/) = p(k) ln(p(A'), where p(A:) is the 
frequency of observation of amino acid type k at position i of a set of aligned amino acid sequences (Guharoy and Chakrabarti 2005) 


depleted of neutral polar groups, and enriched in non¬ 
polar (hydrophobic) groups. Most of the neutral polar 
groups are from the protein main chain, which contrib¬ 
utes about 18% of the BSA and is a full-fledged partner 
in all PPI types. On average, the nonpolar character 
increases in the order: solvent accessible surface < 
crystal contacts « non-obligate interfaces < obligate 
interfaces < protein interior. 

Interfaces and protein surface also differ in their 
average amino acid compositions. In Fig. 1, red bars 
express this difference in terms of residue propensities 
for the core of homodimer interfaces. Here, the core is 
defined as the set of residues that contain atoms 
completely buried at the interface, and it comprises 
about 60% of the interface residues, the remainder 
forming the rim. Aromatic (Phe, Tyr, Trp) and ali¬ 
phatic (Val, Leu, He, Met) residues have positive core 
propensities, meaning that they contribute more in 
proportion to the interface cores than to the solvent 
accessible surface. The charged residues Lys, Glu, and 
Asp do the opposite, and they have negative propensi¬ 
ties, whereas Arg is as abundant at the interface cores 
as on the protein surface. The core propensities are 
about the same in protein-protein complexes. There, 
the interface core comprises most of the ► binding hot 


■ interface core 

■ crystal contact 
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Statistical Analysis and Prediction of Protein-Protein 
Interactions and Binding Sites, Fig. 1 Interface propensities 
of amino acid residues. The propensity of amino acid type / to be 
at an interface v\s. the protein surface is p^ = In (fk'’/fk“) where 
fk’’ and fk'’ are the fractional contributions of residues of type k to 
the protein ASA and the interface BSA, respectively. Red bars 
represent the propensities to be part of the core of a homodimer 
interface, dehned as the set of residues that contain at least one 
atom that is fully buried (zero ASA) at the interface; the remain¬ 
der forms the rim. The propensities to be part of the interface rim 
are very similar to those to be at a crystal contact (green bars) 
(Adapted from Janin et al. 2008) 

spots that cause large affinity changes in site-directed 
mutagenesis experiments, whereas some residues out¬ 
side the core (or outside the interface) can contribute to 
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the ► binding kinetics. The rim propensities are weak 
in all cases, and similar to the crystal packing propen¬ 
sities shown for comparison as green bars in Fig. 1 . 

Conservation in Evolution 

Given a family of homologous proteins, a conserved 
position in their amino acid sequences suggests that the 
residue is implicated in the protein structure or func¬ 
tion. An obligate PPI is part of the protein structure, 
a transient one, of its function, and therefore, it is 
commonly assumed that the interfaces should be better 
conserved than the remainder of the protein surface. 
Yet, this can be grossly wrong: the antigen-combining 
site of an antibody is the most divergent region of its 
sequence. More generally, when a PPI implicates two 
protein families, each member of one family interacting 
with a different member of the other family, some of 
the sequence positions involved are conserved, 
and others substituted for the sake of specificity. 
A different selection pressure acts at each position, 
creating a branching pattern in the alignments. 
The Evolutionary Trace method exploits this pattern 
in order to identify functional sites (Lichtarge and 
Wilkins 2010). 

Another point to consider is that the PPI itself may 
not be conserved. Like myoglobin and hemoglobin, 
distant members of a protein family often have differ¬ 
ent quaternary structures: a majority of the protein 
pairs in the PDB do so below 40% sequence identity 
(Levy et al. 2008). As a consequence, interface con¬ 
servation is rarely obvious, even in obligate assem¬ 
blies. Excluding antibodies, the interface residues of 
protein-protein complexes are less divergent in evolu¬ 
tion on average than other surface residues, but the 
difference is significant only in subsets, and it is the 
most obvious at the interface cores (Guharoy and 
Chakrabarti 2005; Mintseris and Weng 2005). Figure 2 
illustrates the example of a non-obligate interface in 
which the core (defined as above from the buried 
atoms) is well conserved, but the rim is not. As 
shown in Table 2, this is generally the case in both 
the homodimers and the complexes. 

Predicting Physical Interactions and Binding 
Sites 

Determining the function(s) of a protein by experiment 
can be very slow compared to determining its sequence 



Statistical Analysis and Prediction of Protein-Protein 
Interactions and Binding Sites, Fig. 2 Conservation of the 
interface core in evolution. Transducin (PDB entry IGOT) is the 
trimeric GTPase associated with rhodopsin in the retina. The 
transient interaction between subunits Ga and GPy plays a major 
role in the visual signal transduction pathway, (a) The molecular 
surface of subunit Ga is drawn with the binding site for subunits 
GPy colored in red for residues of the interface core (residues 
that contain atoms fully buried at the interface), blue for those of 
the interface rim. (b) The interface is colored according to the 
evolutionary divergence of each residue, estimated by its 
Shannon entropy in aligned sequences; red', conserved residues 
with an entropy below average; pink: entropy near average; blue: 
divergent residues with an entropy above average. The coinci¬ 
dence of the red regions in the two panels indicates that the 
interface core is conserved, whereas the rim is not (Reprinted 
with permission from Nussinov and Schreiber 2009) 

or its structure in high-throughput studies, and this has 
made bioinformatic approaches very popular in recent 
years. In the field of PPI, this goes from predicting 
whether two proteins (or rather, two gene products) 
will interact, to locating the binding sites on their 
surface, to building a detailed atomic model of their 
assembly (Szilagyi et al. 2005; Nussinov and Schreiber 
2009; Zacharias 2010). The chances of success much 
depend on the proteins and what is known in advance 
about them. For instance, locating the antigen-binding 
site of an antibody is straightforward, whereas 
predicting the epitope it recognizes, is difficult even 
when the antigen has a known structure. In this 
particular case, residue conservation is useless, and 
the prediction must rely on geometry (shape comple¬ 
mentarity, planarity), physical-chemical properties of 
the protein surfaces, and biochemical data. Conversely 
with an enzyme, substrates and inhibitors are expected 
to bind at the active site, and this can often be located 
just on the basis of its shape and high degree of con¬ 
servation. In other types of PPI, the prediction will 




2458 


Statistical Analysis and Prediction of Protein-Protein Interactions and Binding Sites 




have to combine geometry, physical chemistry, and 
evolutionary considerations. 

Protein-Protein Docking 

If three-dimensional structures are available from 
either structural studies or modeling, a physical 
model of the assembly may be built by docking its 
components. All protein-protein docking procedures, 
including those implemented as Web servers 
(Table IE), first search for complementary surfaces 
by moving and rotating one component relative to the 
other. The search may use the Fourier correlation algo¬ 
rithm, or other algorithms that handle the proteins as 
rigid bodies. Alternatively, it may rely on Monte-Carlo 
simulated annealing or Molecular Dynamics, which 
allow some internal coordinates to change to simulate 
local conformational changes (Halperin et al. 2002; 
Gray 2006). Large conformational changes must be 
handled in a separate step (Bonvin 2006). They con¬ 
stitute a major difficulty in docking, the other being the 
many false positives produced by the search. All 
docking procedures have a second step that aims to 
identify the correct solutions on physical chemical 
(atom, residue and pair propensities, electrostatics, 
desolvation) and sequence conservation criteria. 
Nonstructural information from the literature can be 
very helpful at that stage, but as it may be incorrect or 
misleading, it should be treated as a soft restraint rather 
than a rigid constraint in the search. 

A set of PDB entries representing protein-protein 
complexes and their free components (Table ID) can 
be used to benchmark the performances of docking 
procedures. Moreover, the CAPRI (Critical Assess¬ 
ment of PRedicted Interaction) blind prediction exper¬ 
iment is designed to assess their reliability and 
accuracy in an objective way, by comparing the 
docking models of target protein-protein complexes 
to unpublished experimental structures. Forty-two tar¬ 
gets have been subjected to prediction up to now, and 
two-thirds have been correctly modeled by at least one 
participant group, the criterion being that the model 
locates the interface to within 2 A of its native position, 
and reproduces at least 30% of the native residue- 
residue contacts. The prediction failed on six targets, 
mostly due to conformation changes that could not be 
simulated properly, and yielded models of lesser qual¬ 
ity in the remainder. Thus, docking predictions have 
generally been successful in CAPRI, with the proviso 
that each predictor group was allowed to submit ten 


models of each target, and that no one correctly 
predicted more than half of the targets (Janin 2005; 
Lensink and Wodak 2010). 

Template-Based Modeling 

An alternative to docking is to use known structures as 
templates. In the simplest case, the templates can be 
identified by running a sequence similarity search on 
all the protein assemblies in the PDB, but the coverage 
is then very low, due to their small number. Other 
approaches search for a local structural similarity 
irrespective of the sequence, and compare the surface 
of the component proteins to all the known interfaces. 
Several databases store information on the interfaces in 
the PDB (Table 1C), and they offer tools for comparing 
their geometric and chemical features to those of 
a query interface. They are large, with over 200,000 
interfaces, but highly redundant, and they mostly con¬ 
tain interfaces between domains or between the sub¬ 
units of oligomeric proteins. The targets of binding site 
predictions are usually non-obligate complexes, and 
those are poorly represented with the exception of 
protease-inhibitor and antigen-antibody complexes. 

In the long run, new structural data will extend the 
field of application of template-based prediction 
methods, but at present, they are outperformed by 
docking in blind predictions. Their main application 
has been the large-scale modeling of the thousands of 
interactions that are detected in genome-wide genetic 
and biochemical experiments, and lack a structural 
basis. 

Structure-Based Binding Site Prediction 

Binding site predictions aim to identify the residues 
and regions of surface of a protein that are implicated 
in PPI (de Vries and Bonvin 2008; Ezkurdia et al. 
2009). In general, they do not require the partner to 
be known, and thus, they may identify sites relevant to 
other interactions than the one of interest. The interface 
of a non-obligate complex typically comprises 5-25% 
of the surface residues, that of a homodimer, 20% on 
average; in a larger assembly, there may be more 
residues at the interfaces than on the free surface. The 
criteria for their identification are first their accessibil¬ 
ity to solvent in the free subunit, then their propensity 
to be at a protein-protein interface, and their conser¬ 
vation in homologs. The servers of Table IF all use 
a combination of these criteria, plus some that are 
specific to each method, with weights that have been 
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optimized on a particular set of interfaces. Most pre¬ 
diction methods also favor residues that are close in 
space and form connected patches on the protein 
surface. 

The quality of the predictions can be judged by their 
specificity (how many residues in the predicted set are 
effectively at the binding site?) and sensitivity (how 
many of the binding site residues are in the predicted 
set?). When benchmarked on the same set of protein- 
protein complexes, all the methods show similar 
performances, achieving 24-28% specihcity at 50% 
sensitivity (de Vries and Bonvin 2008). Thus, if 
a binding site comprises 20 residues, they may be 
expected to identify 10, plus 25 or 30 false positives. 
Albeit far from perfect, such a prediction can be 
a useful starting point in docking, and help designing 
a site-directed mutagenesis experiment. 

Sequence-Based Methods 

Several of the servers in Table IF also accept the 
protein sequence as only input. The prediction then 
proceeds as in the structure-based mode, except that 
the accessibility to solvent of the residues must be 
predicted, and that they may not be grouped into 
patches. The performances are necessarily degraded, 
but it will be difficult to judge to what extent until an 
independent validation is done. 

Conclusion 

PPI are ubiquitous and highly diverse in nature. Our 
present knowledge is patchy, and those for which we 
have detailed structural and biochemical data represent 
only a small fraction. This sample nevertheless dis¬ 
plays a number of structural and evolutionary features 
that many other PPI must share, and the prediction 
methods that use these features achieve some success. 
As the experimental study of PPI is a very active field 
in biology and health science, we can expect to learn 
much in a near future and improve our capacity to 
predict both the binding sites and the modes of 
assembly. 

Summary 

The subunit interactions that we can observe in the 
three-dimensional structures of oligomeric proteins 


and protein-protein complexes have a number of fea¬ 
tures in common, some geometric: the size and shape 
of the interfaces; others physical chemical: their 
atomic and amino acid compositions; or evolutionary: 
the conservation of interface residues in homologous 
sequences. The methods that have been developed to 
predict the position of the binding sites on the protein 
surface make use of these features, and they comple¬ 
ment docking algorithms in the structural prediction of 
assembly modes. 


Cross-References 

► Heteromeric Versus Homomeric Association of 
Protein Complexes 

► Oligomeric Proteins 

► Protein Complexes: Definitions of Obligate Versus 
Transitory (Strengths of Binding) 

► Proteins: Relationship Among Divergence of 
Sequence, Structure, and Function 
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Introduction 

The stopped-flow technique was developed in the 
1940s by modihcation of the ► continuous-flow 
method with the principal advantage of more eco¬ 
nomic use of reagents (Chance 1943). Thus, instead 
of monitoring the reaction along the length of the 
reaction tube, the detection system was placed at 
a fixed point close to the mixing chamber and the 
reaction was followed in real time after abruptly stop¬ 
ping the drive syringes (Fig. 1). This modification was 
possible owing to improvements in the response time 
of the recording apparatus. The reaction is usually 
triggered by including a microswitch in the backstop 
of the stop syringe plunger. Signal capture has contin¬ 
ued to evolve through the use of analogue storage 
oscilloscopes, digital oscilloscopes to PCs with 


Arc lamp Cell Photomultiplier 



Stopped-Flow Techniques, Fig. 1 Schematic diagram of the 
key components of a stopped-flow spectrophotometer 

analog-digital (A/D) cards that sample at frequencies 
>10 kHz (Eccleston et al. 2001; Gibson 1969). 

For general use, sample cells have a volume of 
around 40 pL with dimensions of 2 x 2 x 10 mm, so 
allowing good sensitivity for absorption measurements 
with a 10 mm pathlength. Such a cell has a dead time of 
around 1-2 ms, which represents the earliest time point 
at which reactions can be monitored. Reducing the cell 
dimensions decreases the dead time, but also lowers 
the sensitivity, although with fluorescence detection 
this compromise may be acceptable. The drive syrin¬ 
ges are normally operated by gas pressure or stepper 
motors and are pushed as fast as possible, but below the 
limit where cavitation becomes a problem. Cavitation 
arises from local areas of low pressure which cause the 
formation of vacuum bubbles that interfere with 
mixing and optical recording. Usually drive syringes 
of equal volume are used, but solutions can be mixed in 
different ratios if required (e.g., denaturant dilution). 
The kinetics under observation reflects the concentra¬ 
tion of components after mixing, whereas preexisting 
equilibria reflect the concentration in the drive syrin¬ 
ges. To avoid ambiguity it is useful to record both 
values in the accompanying notes (e.g., 20/10 pM 
ligand was mixed with 2/1 pM protein). 

Three phases can be identified in the progression of 
the stopped-flow reaction (Fig. 2a). First, the old solu¬ 
tion from the previous reaction is washed out; second, 
there is a brief continuous-flow phase as new solution 
passes though the cell; and third, the flow is stopped 
and the reaction profile is measured. Normally the 
stopped-flow apparatus is set up so that the monitoring 
device is triggered at the start of the third phase. How¬ 
ever, it is useful to check the entire profile in order to 
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Stopped-Flow Techniques, Fig. 2 (a) The phases of 
a stopped-flow reaction. Normally the record is triggered and 
recorded at the time the flow stops defined by time = 0 s. (b) The 
effect of changing the shot volume from each of the two drive 
syringes from 60, 50, 40, 30, and 20 pL from left to right with 



Time (s) 

a 40 pL observation cell. Note that the most economic setting 
which gives full amplitude is 2 x 40 pL (i.e., total shot 
volume = double the cell volume). However, the rate constant 
derived from the 2 x 20 pL setup is similar, only the amplitude is 
reduced (Data courtesy of Prof. M. A. Geeves) 


optimize the total volume of reagents that is used for 
each push (typically twice the flow cell volume; 
Fig 2b). With digital recording devices, this is readily 
achieved using the pre-trigger option that allows data 
to be observed that was captured before the stop 
syringe triggered the microswitch. For reagents that 
are in short supply, the drive volume can be minimized 
so that the continuous-flow phase is absent, but at the 
risk of losing some of the amplitude of the reaction 
(Fig. 2b). 

The majority of instruments use optical detection 
(u.v. - visible absorption, ► fluorescence) or some 
variant (e.g., ► circular dichroism (Hooke et al. 
1994), fluorescence polarization (Gong et al. 2009)) 
with the principal advantages of sensitivity and high 
time resolution. However, the method has been 
adapted for other signals including heat (Millar et al. 
1987), conductivity, ► X-ray scattering (Panine et al. 
2006), and ► magnetic resonance (Hoeltzli and 
Frieden 1998). Several manufacturers market pur¬ 
pose-built stopped-flow spectrophotometers with 
broadly comparable performance, but differing in the 
details. These instruments can be supplied in various 
formats, but typically contain a Xe arc lamp source 
coupled to an excitation monochromator and use filters 
to select the emission wavelengths, together with 
a photomultiplier detector. Excitation paths can be 
switched to allow for dual absorption/fluorescence 
detection or two-channel emission. Diode array 


detectors are also available for recording time-resolved 
absorption spectra, but these are generally too insensi¬ 
tive to record emission spectra on the millisecond 
timescale. 

Performance 

A simple T mixer gives efficient mixing of the two 
reactants when they are of similar volumes and solvent 
density. However, in situations such as protein folding 
reactions in which a protein in high denaturant con¬ 
centration is diluted tenfold with a buffer, some resid¬ 
ual mixing may occur within the observation cell 
giving rise to artifacts. The efficiency of mixing can 
be checked by including a nonreactive dye in one 
syringe, which should give a flat response if its dilution 
is complete within the dead time of the apparatus. 

The dead time of the apparatus represents the time 
from mixing to the first observation. It can be checked 
using a variety of standard reactions, typically under 
pseudo first-order conditions such that the observed 
rate constant can be varied across the range of 10- 
1,000 s“' (Fig. 3). For an instrument with a dead time 
of 1 ms, a reaction with a rate constant of 1,000 s“ * will 
suffer a loss of 63% of the initial amplitude, compared 
with a reaction at 10 s“*. However, provided the 
signal-to-noise is adequate, the residual 37% of the 
reaction would be sufficient to determine an accurate 
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Stopped-Flow Techniques, Fig. 3 Determination of the dead 
time of a stopped-flow apparatus (Hiromi 1979). 70/35 pM 2,6- 
dichlorophenolindophenol (DCIP) was mixed with either 
2/1 mM (top trace) or 20/10 mM (bottom trace) ascorbic acid 
(concentrations before/after mixing) and the absorbance mea¬ 
sured at 524 nm. The signals after 2 ms were fitted with single 
exponential functions to yield rate constants of 47 and 428 s“', 
respectively. Note the extrapolated fitted lines (thin blue lines) 
intersect to give an initial absorbance of 0.24 cm“' at time zero 
and the observed signal (thick black lines) follows the fitted lines 
from 0.0015 s, the dead time of the apparatus (Data from EMBO 
Practical Course, Canterbury 2008) 

rate constant (assuming a monophasic exponential 
reaction). This scenario highlights the equal impor¬ 
tance of the sensitivity of detection in comparing 
stopped-flow instruments (Kurzawa and Geeves 
1996). Other key performance characteristics are not 
just the minimum shot volume required to give the full 
amplitude of the signal (typically 2 x 40 pL for a stan¬ 
dard 40 pL cell, Fig. 2a) but the initial sample volume 
required to fill the dead space of the instrument, which 
may be several hundred microliters. 

The sensitivity of a stopped-flow apparatus may be 
measured as the minimum detectable change in 
absorption or fluorescence, but this depends on 
a number of factors such as lamp output and stability, 
wavelength and wavelength range, pathlength(s), 
mechanical stability and the time range over which 
the reaction is followed. The shot noise of the detector, 
relative to the total photon count is reduced with longer 
accumulation times (i.e., slow kinetics are easier to 
measure than fast kinetics) or by averaging repeated 
traces. However, longer times may be accompanied by 
photobleaching or back diffusion of the contents of the 
cell, so that records do not usually exceed 100 s. Over 
and beyond this timescale use of a standard spectro¬ 
photometer with manual mixing is preferred. For 
absorbance measurements, lamp stability is important 


(as the optics are usually single beam) and in the near 
UV to visible region, a tungsten halide lamp may give 
better results than an arc lamp. The latter however is 
required for fluorescence detection, where the smaller 
size of an arc source allows more efficient illumination 
of the cell. Here the background signal with buffer 
alone determines the minimum detectable fluorescence 
change and is aided by the use of high-quality emission 
filters to exclude Raman scatter. For the majority of 
experiments, colored glass filters are adequate to block 
the excitation wavelength. For specific wavelengths, 
illumination intensity may be enhanced by using a 
Hg/Xe lamp. Slit widths can also be increased to 
achieve maximum illumination before photobleaching 
becomes significant. In some instances, high illumina¬ 
tion intensities are selected to give optimal signal-to- 
noise over short timescales, even though the record 
over longer timescales suffers from bleaching. 

Mechanical stability is also important as the shock 
wave on stopping the flow must be dissipated without 
interfering with the optics. Separating the reaction 
chamber from the excitation monochromator with 
a light guide assists in this regard. Another source of 
noise is the release of pressure from the ram which may 
occur tens of milliseconds after stopping the flow. The 
ability to change this delay to avoid a critical point in 
the reaction pathway is useful. 

A/D converters appropriate for stopped-flow 
recording capture data at 10-100 kHz, i.e., a time 
interval < dead time. For convenience of data han¬ 
dling, records may contain 400-1,000 points. This 
gives the possibility of oversampling, whereby points 
are collected at the maximum rate and averaged over 
the required time interval to maximize signal-to-noise. 
Thus for a 10 s time base with 1,000 points, 100 points 
at 10 kHz would be averaged to give the signal for each 
10 ms interval. However a linear time base with equal 
intervals is not usually optimal for records with the 
characteristics of exponential decay, because a large 
number of data bins are used in defining the asymptotic 
end point where there is little signal change. It is 
normally better to use a logarithmic distribution 
where an equal number of points are used to define 
each decade (typically, four decades are appropriate). 
This regime is obviously advantageous for capturing 
multiphasic reactions that are spread over several 
decades in time; however, it is also advantageous in 
resolving two closely associated transients whose rate 
constants only differ by twofold, because equal 
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weighting is given to the fitting of each transient 
(Walmsley and Bagshaw 1989). A split time base can 
also overcome some of the limitations of a single linear 
time base, but the choice of the split requires prior 
information about the reaction and can bias the ht to 
one phase over another. A logarithmic time base has 
conceptual limitations when charactering the dead 
time of the apparatus, as pre-trigger information can¬ 
not be shown as negative time. It is of note that, when 
combined with oversampling, a log time base gives 
noisy signals at the start of the reaction - however, this 
is an inherent property of the system since less time is 
available to average the signals when sampled at short 
intervals. 

The detection electronics will have some intrinsic 
limitation in response time, but this usually is much 
shorter than the interval between acquisitions of 
points. Under these conditions, the signal should be 
electronically filtered to prevent high-frequency noise 
being recorded or problems with aliasing. The latter 
refers to low-frequency “beat” signals generated by the 
interference of two high-frequency signals. When long 
time bases are required and the interval between acqui¬ 
sitions is extended, then either the signal can be 
oversampled (as described above) or the electronic 
filtering can be increased to allow only the passage of 
low-frequency signals. In general, the time constant of 
the filter should be at least ten times shorter than the 
transient of interest. Care is also required when the end 
point of the reaction has a very different signal ampli¬ 
tude to the initial reaction, since if the time constant is 
set too large, the washout phase may be slowed to an 
extent that it distorts the start of the desired reaction. 

Double-mixing stopped-flow devices are also avail¬ 
able that allows two solutions to be premixed and then 
a third solution added after a known delay of several 
milliseconds to seconds. Syringes may be driven by 
gas pressure which provides high flow rates. In this 
case, the ratio of the volumes mixed can be controlled 
by using drive syringes with different the cross- 
sectional area. Alternatively, the syringes may be 
driven by independent stepper motors that can be 
programmed to deliver different volumes. Using 
a four-syringe system even permits the reactants to be 
diluted with different amounts of buffer before mixing, 
thereby allowing a range of concentrations to be 
explored from a single set of stock solutions. However, 
the more complex the mixing options, the greater the 
potential for increased dead space. 


Experimental Design 

Stopped-flow spectroscopy is ideally suited to measur¬ 
ing the kinetics of binding reactions between two or 
more components on the milliseconds to tens of sec¬ 
onds timescale. However, by the appropriate design of 
the experiment, other forms of kinetics can be studied. 
The first requirement for stopped-flow analysis is to 
establish a signal (usually spectroscopic) for following 
the reaction. Artificial, chromophoric ligands and sub¬ 
strates can be used, and the kinetics of native ligand 
binding can be evaluated in competition assays with 
these. 

Binding Reactions 

As explained in the ► transient-state kinetic methods 
entry, binding reactions can be studied under second- 
order or pseudo first-order conditions. The latter has 
the advantage of requiring knowledge of only one of 
the component concentrations accurately (usually that 
of the small ligand which is used in excess over the 
protein) and also this measurement potentially reveals 
information about the reverse first-order dissociation 
reaction. The second-order association rate constant is 
derived from the slope of the plot of /robs versus con¬ 
centration of the ligand in excess, while the intercept 
defines the dissociation rate constant. Ideally the com¬ 
ponent in excess should be >10x the concentration of 
the other component, but even if this condition is not 
met, the estimated rate constant is likely to be within 
a factor of two of the correct value. Note that if the 
dissociation reaction (/r_i) is > 1,000 s~* (typical of 
weak binding reactions), then the observed rate con¬ 
stant for binding will be >1,000 s“* at all concentra¬ 
tions of the ligand (since ^obs = ^i[B] + ^-i) and thus 
unlikely to be resolved within the dead time of mixing. 
In these cases, ► relaxation methods with shorter dead 
times provide a more suitable approach. The binding of 
non-chromophoric native ligands may be studied by 
competitive binding with chromophoric ones. 

Displacement Reactions 

Dissociation reactions can, in principle, be investi¬ 
gated by dilution of a preformed complex to determine 
the dissociation rate constant. However, the observed 
rate constant will have a significant contribution from 
the reverse association reaction and may be difficult to 
analyze. A better approach is to displace the preformed 
complex with a competitive ligand. Under conditions 
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where the initial ligand is fully displaced by the com¬ 
peting ligand, the observed rate constant will reflect the 
dissociation rate constant of the first ligand. This can 
be achieved by using a high concentration of the com¬ 
peting ligand. Doubling the latter concentration should 
produce no further change in the signal amplitude nor 
observed rate constant, ^obs- If the displacement is only 
partial, ^obs will change because it reflects the kinetics 
of both ligands (see entry on ► Transient-State Kinetic 
Methods). The latter situation is useful to analyze the 
kinetics of a native ligand in competition with an 
artificial chromophoric ligand that provides the signal 
for monitoring the reaction. The use of high concen¬ 
trations of a chromophoric ligand to displace the native 
ligand may be problematic because the signal change 
will occur on a high background of free chromophoric 
ligand. However, in some cases, the signal may arise 
preferentially from the bound chromophoric ligand 
(e.g., a ► FRET-based assay), so that excess free chro¬ 
mophoric ligand does not interfere. 

Turnover Reactions 

Stopped-flow techniques are ideal for studying the 
transient kinetics of enzymes whose turnover numbers 
are less than 1,000 s“*. Substrate turnover can be 
followed under single or multiple turnover conditions 
(see ► Kinetics: Overview entry). 

Single turnovers require the enzyme to be at 
a concentration > if the observed kinetics are to 
reveal the kinetics of steps beyond the initial binding 
reaction (Fig. 4). In the case of multiple turnover 
reactions, spectroscopic signals derived from the sub¬ 
strate or product will also reveal a steady-state phase 
which should correlate with the kinetics derived from 
conventional ► steady-state assays with catalytic 
amounts of enzyme. However, in some cases conven¬ 
tional steady-state kinetics with manual mixing may be 
insufficient if the linear phase is too short to obtain an 
accurate initial rate measurement. Here, stopped-flow 
mixing may allow steady-state data to be measured at 
an earlier stage of the reaction (e.g., before an inhibi¬ 
tory product complex accumulates). Usually, stopped- 
flow methods are used to investigate the pre-steady- 
state phase of the reaction where lags and bursts define 
reactions that precede the “rate-limiting” step. Indeed, 
such studies often reveal that the steady-state rate is not 
dominated by a single elementary step, but a combina¬ 
tion of several elementary reactions. Care is required 
when the signal derives from a chromophoric product. 


since the reaction will continue to occur in the cell after 
the recording is finished. This high concentration of 
product needs to be washed out in the next shot of the 
drive syringes. Under these conditions, it may be 
advantageous to give two quick shots and monitor the 
reaction only of the second one. Stopped-flow analyses 
also strongly complement ► steady-state assays in 
their ability to study partial reactions (e.g., binary 
complex formation in a two-substrate enzyme reac¬ 
tion) and can confirm the order of binding deduced 
indirectly from steady-state assays. 

Jump Reactions 

Equilibrium reactions may be perturbed by jumping 
the conditions (e.g., pH, ionic strength) on rapid 
mixing in a stopped-flow apparatus. This is particu¬ 
larly important in studying ► protein folding kinetics, 
where denaturants may be rapidly added or diluted out. 

Analysis 

Commercial stopped-flow instruments are normally 
supplied with software for fitting the acquired kinetic 
profiles (i.e., signal versus time) to analytical equations 
(e.g., first-order single exponential functions, second- 
order equation) and many third-party programs are 
also available for this purpose. Such analysis can pro¬ 
vide estimates of rate constants; however, in general 
these are kobs values and do not necessarily equate to 
a single step in a reaction pathway. Analysis beyond 
this point perhaps provides the greatest challenge in 
applying kinetic methods because there is no one sim¬ 
ple set of rules to follow. Rather the most likely (and 
simplest) schemes must be tested and rejected or mod¬ 
ified to give a satisfactory explanation of the data. 
Experience of how elementary reactions couple in 
a reaction pathway may be gained by two complemen¬ 
tary methods. Reaction schemes can be simplified to 
the point where the dominant reaction steps allow an 
analytical solution (e.g., steps which are in rapid equi¬ 
librium are lumped together as a single state). Or the 
scheme can be examined by modeling such pathways 
using numerical integration, and the influence of each 
rate constant studied. Indeed, it is possible to fit the 
raw data globally to such models, but this approach 
is not advisable as a first step because it is easy to 
generate schemes which explain the data, but are 
underdetermined. That is, a rate constant value may 
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Stopped-Flow Techniques, Fig. 4 Single turnover of 0.1/ 
0.05 pM Cy3-EDA-ATP by 10/5 pM myosin II subfragment 1 
monitored by Cy3 emission (>570 nm) on excitation at 550 nm 
(Conibear et al. 1996). (a) Binding phase displayed on short time 
base to yield a pseudo first-order rate constant of 2 s“’. (b) 
Product release phase following hydrolysis displayed on a long 


be assigned to a step which accounts for the data but is, 
in itself, not unique. Thorough testing of the model 
using Monte Carlo methods helps to identify the con¬ 
fidence values of the individually derived parameters 
(see ► NMR). Furthermore, apart from rate constants, 
the observed signal will depend on the specific absorp¬ 
tion coefficient and/or quantum yield of each interme¬ 
diate which may be unknown. This gives further 
degrees of freedom to fitting the data, and hence further 
potential for generating an underdetermined model. 
Such parameters may be coupled, e.g., the value deter¬ 
mined for an equilibrium constant of a particular step 
will be coupled to the specific signal amplitude 
assigned to the complex, and the data may only allow 
a range of plausible values to be assigned. It is highly 
unlikely that the uncertainty in the value of a parameter 
is symmetrically distributed around its most likely 
value, and hence values retuned with a ± error should 
be viewed with suspicion. Such error limits may, how¬ 
ever, be useful in distinguishing better models from 
less good ones. 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Continuous Flow 

► Fluorescence 

► Fluorescence and FRET in Membranes 



time base to yield a rate constant of 0.025 s“*. Note the enhanced 
fluorescence at the end of the reaction indicates the product Cy3- 
EDA-ADP remains partially bound at equilibrium. The record 
was obtained in a single shot using a logarithmic time base (lO^"* 
to 10^ s) 


► Kinetics: Overview 

► NMR 

► Protein Folding 

► Steady-State Enzyme Kinetics 

► Transient-State Kinetic Methods 

► X-Ray Scattering of Lipid Membranes 
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Introduction 

An enzyme Is a biocatalyst, and as such, speeds up 
the rates of chemical reactions important for cellular 
function. Initial rate studies are invaluable in the study 
of enzymes. The dependence of the initial rate on 
reactant concentration in the absence and presence of 
inhibitors, as the pH and temperature are changed, or 
upon substitution of a light atom with a heavy atom in 
reactant is an invaluable tool in the elucidation of 
enzyme mechanism. There are a number of texts 
that deal with the theory of steady-state enzyme 
kinetics and their application to the study of enzyme 
mechanism. The treatment below is necessarily brief. 
For more information, the reader is referred to the text 
written by Cook and Cleland (2007). 

Michaelis-Menten Equation 

It was noted early on that the initial rate of an enzyme 
reaction depends in a hyperbolic manner on the sub¬ 
strate concentration. A rate equation describing this 
behavior was proposed by Leonor Michaelis and 
Maude Menten (1913), who were studying sucrase, 
and Briggs and Haldane (1925). Michaelis and Menten 
were following up on the work of Victor Henri, who 
proposed a similar treatment in 1903. The initial deri¬ 
vation assumed pre-equilibrium binding of substrate to 


enzyme followed by slow conversion of reactant to 
product. Briggs and Haldane reasoned there was no a 
priori requirement for pre-equilibrium binding, and 
reactant binding could contribute to rate limitation. 
The form of the rate equation does not change, but 
the definition of terms does. The equation is named 
for Michaelis and Menten. 

The derivation of the rate equation is straightfor¬ 
ward using the scheme shown below: 

kyS kp 

E . - ES-►E + P (1) 

k-i 

which describes binding of the substrate, S, to enzyme, 
E, to give an enzyme-substrate complex, ES, followed 
by conversion of ES to product. Steady-state kinetic 
studies generally monitor the second to minute time 
scale. A steady state is rapidly established such that the 
concentrations of E and ES are maintained at 
a constant concentration until the equilibrium position 
is approached. Thus, the concentrations of E and ES do 
not change with time, and the rate expressions for dES/ 
dt and dE/dt can be written as follows according to 
mechanism 1. 


— = ^i(S)(E) - (^_i +^2)(ES) = 0 (2) 

dt 

riF 

— = ki(S)(E)-(k_i)(ES) = 0 (3) 

dt 

Using the conservation equation for enzyme 
(Et= E + ES) and the above rate expressions, the 
Michaelis-Menten equation can be derived. The initial 
rate in terms of the appearance of P is: 

V = ^ = k2(ES) (4) 

Solving for E in the conservation equation and 
substituting for E in Eq. 2, solving for ES, 
and substituting for ES in Eq. 4 gives the following 
expression for the initial rate, v. 

kikzSE, kzSEt 

V =-=- () 

k^i+k2+kiS ^=^ + S 

Dividing numerator and denominator of Eq. 5 by 
gives the expression on the right side of Eq. 5. When S 
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is much greater than {k_i + k 2 )/ki, v is equal to ^ 2 Et- 
The rate is the maximum that can be attained at the 
concentration of enzyme used. Thus, ^ 3 Et is called the 
maximum velocity, V^ax- When the concentration of 
S is set at a value that will give Y2Vmax, S is equal to 
(k-i + k 2 )lki, called the Michaelis constant, K^- In 
terms of kinetic constants, the Michaelis-Menten 
equation is v = V^ax^KK^ + S). The initial rate is 
measured as close to time zero as possible, such that 
the time course for the appearance of P is linear. 
Operationally, as long as the substrate concentration 
does not decrease below 90%, the initial rate should be 
measured. In addition, the steady-state approximation 
requires that S > E,, or steady state cannot be 
achieved, at least for very long. It is assumed that the 
concentration of substrate is the hnal concentration in 
the reaction mixture prior to initiation of reaction. For 
most enzymes, reactant values are in the pM to mM 
range, while enzyme concentrations are generally in 
the nM range so that when the rate is measured using 
substrate concentrations around the K„j, the condition 
S > E, is easily met. Since the change in the concen¬ 
tration of substrate is very small near t — 0, the con¬ 
centration of substrate used to measure each initial rate 
is considered the concentration at the beginning of the 
reaction. It is assumed the substrate concentration has 
no error, but it is best to determine the concentration of 
the substrate stock solution as accurately as possible to 
minimize error. The limiting rate constants of the 
Michaelis-Menten equation are V^ax, as S tends 
to inhnity, and V^axlKm as S tends to zero. V^ax 
is a pseudo-zero-order rate constant with units of 
M/min, while VmaxlKm is a pseudo-first-order rate 
constant with units of min“\ The ratio of and 


Km and Kd 

It should be mentioned that K„j is the steady-state 
dissociation constant, measured under conditions 
where the enzyme is catalyzing its reaction. Each sub¬ 
strate in a reaction will have its own K^, determined at 
saturating concentrations of all other reactants and 
subscripted with the name of the substrate, e.g., the 
K„j for pyruvate in the lactate dehydrogenase reaction 
is A'pyruvate- The A'„ is usually not the thermodynamic 
dissociation constant for the enzyme-substrate 
complex. The definition of derived from the 


Michaelis-Menten approach is (k_i + k 2 )lki. The dis¬ 
sociation constant, Kj, for the ES complex is k_i/ki. If 
the rate of conversion of ES to products, ^ 2 , is slow, 
will be equal to Kj, while if ^2 > k_i, > K^. The 
case where is apparently not allowed. 

However, ^2 is not a simple rate constant and must 
also include formation of an EP complex, which then 
dissociates to E and P. 


E^^ES^^EP—►E-i-P (6) 

k_i k_2 

Derivation of the expression for using this 
mechanism gives the following. 


K„, 


Kd 


1 4 - 4 - ^ 



(7) 


Note that depending on the values of ko,, the off-rate 
constant for product and k_i, the off-rate constant 
for substrate, can be greater than, equal to, or less 
than Kd- 


Presentation of Kinetic Data 

Measurement of the initial rate as a function of sub¬ 
strate concentrations around its gives a rectangular 
hyperbolic function when v versus S is plotted. How¬ 
ever, it is often difficult to estimate Vmax from such 
a plot, especially with substrates of limited solubility. 
Estimates of each of the kinetic parameters with their 
associated standard error can be obtained by fitting the 
data using the Michaelis-Menten equation. There are 
a number of programs available for carrying out 
the nonlinear least squares fit, including EnzEitter, 
SigmaPlot, and even MSExcel. However, data are 
usually displayed as a Lineweaver-Burk plot, a linear 
transform of the Michaelis-Menten equation, Fig. 1. 



There are other linear transforms of the Michaelis- 
Menten equation, e.g., Eadie-Hofstee (v vs v/S) and 
Hanes-Wolf (S/v vs S), but the Lineweaver-Burk or 
double reciprocal plot is by far most commonly used 
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Steady-State Enzyme Kinetics, Fig. 1 Lineweaver-Burk or 
double reciprocal plot of initial velocity and substrate concen¬ 
tration. The graphical estimation of kinetic constants is shown 

and, as a result, the most easily recognized by the large 
majority of investigators in the held. One can easily 
read the double reciprocal plot (Cook and Cleland 
2007) 

The intercept and slope of the plot are equal to 
l/V'max and respectively. The abscissa 

intercept is —1//^„, the ratio of the slope and intercept. 


Overall Rate Equation and the Haldane 
Relationship 

The discussion above considered initial rate condi¬ 
tions, i.e., reaction in one direction, e.g., S to P, in 
the absence of product, P. Allowing for reversibility of 
the reaction, i.e., measuring the initial rate in the pres¬ 
ence of products, requires that the rate constant for 
addition of P must be present since its concentration 
is not zero at r = 0. 

E . - ES ^—- EP E + P (9) 

k_i k_2 k_3 

Under these conditions, the overall rate equation is 
given below. Addition of enzyme to a reaction mixture 
containing both S and P will give a hrst-order approach 
to the equilibrium position. The numerator of the equa¬ 
tion below rehects the approach to equilibrium, and 
depending on the values of the kinetic parameters and 
the concentrations of S and P, the reaction will proceed 
either toward S or P {K^s and K^p are values for S 
and P, respectively. 


Limiting Rate Constants 

The limiting pseudo-zero-order rate constant V„,ax is 
equal to Division of V„uix by Et gives the value of 
the rate constant for conversion of ES to E and P. The 
rate constant ^2 is not a microscopic rate constant 
representing an elementary step but is a net rate con¬ 
stant that includes a contribution from decomposition 
of the EP complex. U/Et is called the turnover number 
(TON) or kcat, and is an intrinsic property of any given 
enzyme, including isozymes if any exist. The highest 
TON was reported for the enzyme catalase, about 
10^ s“*, but values can be as low as < 1 s“\ The 
limiting pseudo-first-order rate constant V^axlK-m 
reflects the catalytic efficiency of an enzyme and 
includes steps for binding and catalysis. Umax/^mEt is 
a second-order rate constant with units of M“^s“* and 
includes all steps from addition of substrate to the first 
irreversible step, which is usually release of the first 
product since at f = 0, P = 0. The maximum value the 
second-order rate constant can have is the diffusion- 
limited rate constant for binding of substrate to 
enzyme, which is 10^-10^ M“^s“* for a small mole¬ 
cule combining with a large molecule. 


+ S + P 


( 10 ) 


A relationship exists between the equilibrium con¬ 
stant, and kinetic constants (Alberty 1953; 

Cleland 1982). At equilibrium, the net rate of produc¬ 
tion of A or P is zero, i.e., the initial rate, v, is equal to 
zero. It follows from the overall rate equation that 
ViS = V 2 {K„JK„,p)P. Solving for P/S (since the sys¬ 
tem is at equilibrium, this is actually Peq/Seq, which is 
equal to gives V\K„^plVjK-ms- The relationship 
between the equilibrium constant and kinetic constants 
is termed the Haldane Relationship. 


Higher-Order Reactions - Derivation of Rate 
Equations 

In the likely case one is studying an enzyme with more 
than one reactant, the initial rate is a function of both 
reactants and the initial rate is interdependent on 
the concentrations of the two (or more) substrates. 
The initial rate is measured as a function of one 
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substrate at different fixed levels of the second. How¬ 
ever, at any fixed concentration of one of the reactants, 
the initial rate measured as a function of the other will 
adhere to the Michaelis-Menten equation. The rate 
equations for bi-reactant mechanisms are necessarily 
more complex and will change depending on order of 
addition of substrates to the enzyme’s active site, and 
whether any chemistry can take place once one the 
reactants has combined with enzyme. The method 
used above for the derivation of Eq. 5 becomes quite 
cumbersome when applied to mechanisms with two or 
more reactants, particularly for branched pathways. 
There are two methods that are commonly used for 
derivation of complex rate equations. One is the algo¬ 
rithm method of King and Altman (1956), which can 
be used for any mechanism. Generally, one constructs 
geometric patterns with the number of sides equal to 
the number of enzyme forms in the pathway. For 
mechanism 6 above, for example, three enzyme 
forms are present, E, ES, and EP, and can be 
represented by a triangle. From the main geometric 
pattern, other patterns can be generated, each with 
one side missing. In the present case, three such pat¬ 
terns are possible. Distribution equations representing 
the fraction of each enzyme form are generated by 
multiplying all the rate constants that form the enzyme 
form of interest, summing the terms obtained from all 
of the patterns and dividing by Et, which is the sum of 
all terms generated from all of the patterns. The expres¬ 
sion for the initial rate in mechanism 6 is v = ^ 3 EP. 
One solves for EP in the distribution equation for EP 
and substitutes for EP in the initial rate expression to 
obtain the final rate equation. If there are branches in 
the mechanism, for example, resulting from random 
addition of reactants to enzyme, the equation is much 
more complex, as is the derivation. For more informa¬ 
tion, see the text by Cook and Cleland (2007), which 
provides a step-by-step treatment of the King-Altman 
method. Derivation of the rate equation for a linear or 
simple-branched mechanism can be carried out using 
a method developed by Cleland (1975), called the 
Method of Net Rate Constants. The method makes 
use of partitioning of intermediates along the reaction 
pathway. A net rate constant, k’, is assigned to each 
step in the mechanism, usually beginning with an 
irreversible step and working backward. For an irre¬ 
versible step, the net rate constant will be equal to the 
rate constant. In mechanism 6 , ^ 3 ’ = ^ 3 , while the net 
rate constant k 2 will be equal to ^2 times the partition 


ratio for EP toward E and P, kj,lik _2 + k^), and the net 
rate constant ki ’ is equal to kiS times the partition ratio 
for ES, k 2 ’/(k_i + ^ 2 ’). Once all of the net rate con¬ 
stants have been defined, the initial rate is the recipro¬ 
cal of the sum of the reciprocals of all of the net rate 
constants. For more information, see the text by Cook 
and Cleland (2007). For details on bi-reactant mecha¬ 
nisms and other aspects of initial rate studies, including 
the use of inhibitors and variation of rate with pH and 
isotopic substitution, the reader is referred to the book 
by Cook and Cleland (2007). 

Cooperative Kinetics 

The above discussion assumed Michaelis-Menten 
kinetics, which give linear Lineweaver-Burk (LB) 
plots. Nonlinear LB plots that are concave upward or 
concave downward can be observed and reflect 
a number of different possibilities. A random order of 
addition of two reactants with different rates of gener¬ 
ating a productive enzyme-reactant complex can give 
either a concave upward or concave downward 
LB plot. In addition, a nonlinear LB plot can reflect 
cooperative reactant binding. Positive cooperativity 
exhibits a sigmoid (S-shaped) v versus S plot, or 
a concave upward LB plot, and results from an increase 
in affinity of subsequent molecules of the substrate 
once the first one binds. Although there are a number 
of models that will explain this behavior, the thermo¬ 
dynamic “all-or-none” model of Monod et al. (1965) is 
referred to most commonly. Negative cooperativity 
results in a concave downward LB plot and stems 
from a decrease in affinity of subsequent molecules 
of the substrate once the first one (or two) binds. The 
MWC model cannot explain negative cooperativity, 
but the “sequential” model of Koshland et al. (1966) 
can be used to explain both positive and negativity 
cooperativity. Of the two types of cooperativity, 
negative cooperativity is seldom seen. 

Cross-References 

► Allostery - Allosterism 

► Binding Hotspots 

► Catalases and Catalase-Peroxidases 

► Kinetics: Overview 

► Transient-State Kinetic Methods 
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Synonyms 

Biosynthetic labeling; Regio- and stereoselective- 
isotope labeling method 

Definition 

The SAIL method is a technique that simultaneously 
alleviates the line-broadening and signal-overlapping 


problems complicating the analyses of NMR spectra of 
larger proteins. The method takes advantage of optimal 
isotope labeling patterns to obtain structural informa¬ 
tion much more efficiently and accurately, as com¬ 
pared to conventional isotope labeling approaches. 
The stereo- and regiospecifically labeled amino acids 
used in the SAIL method are chemically and enzymat¬ 
ically synthesized, and then incorporated into the 
targeted proteins by efficient protein expression 
methods, such as a cell-free system. By virtue of the 
improved spectral qualities obtained for the SAIL pro¬ 
teins, one can solve protein structures as large as 
50 kDa or even more, without compromising the struc¬ 
tural quality (Kainosho et al. 2006; Kainosho and 
Giintert 2009). 
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► Cell-Free Expression 

► NMR Studies of Macromolecular Interactions - 
Introduction 

► NMR-based Structural Proteomics 
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Definition 

Stimulated emission depletion (STED) microscopy is 
a far-field fluorescence imaging technique that funda¬ 
mentally breaks the diffraction barrier. In a STED 
microscope, the spatial resolution can be increased to 
the molecular scale or even beyond; Theoretically, its 
spatial resolution is unlimited. The resolution enhance¬ 
ment happens commonly along the lateral plane but 
can be realized also along the axial direction. 

Introduction 

Already more than a century ago, Ernst Abbe demon¬ 
strated that diffraction makes it impossible to focus light 
by any far-held optics more sharply than d = , 

where NA is given by the numerical aperture of the 
lens and X by the wavelength of the focus light. Exper¬ 
imentally the numerical aperture is limited to be 1.4 or 
1.5 (at max) and leaving the visible spectrum of light to 
the UV causes many other problems like less penetra¬ 
tion depth, sample toxicity, and complicated optics. In 
other words, even with high performance optics and 
green light excitation, the resolution is limited to be 
around 200 nm along the lateral dimensions and around 
500 nm along the axial direction. With STED micros¬ 
copy, a new technique came up during the last decade 
that is able to provide a resolution much below the Abbe 
limit. In the following, the basic principles of STED will 
be explained and some examples of experimental real¬ 
ization will be presented. To start with, the basic con¬ 
cept called RESOLET (reversible optical fluorescence 
transition) on which STED is based on will be intro¬ 
duced in the following section. This opens a new era in 
Optical Microscopy (Diaspro 2010a, b; Pawley 2006). 

The RESOLFT Concept 

A very successful concept of breaking the diffraction 
barrier has been realized in the RESOLET-technique 
(reversible saturable optical fluorescence transition) 
(Hell 2004; Hofmann et al. 2005). The main idea is 
based on the existence of two distinguishable states 
A and B. The transition between the states can be 
driven optically and is saturable. In this section, the 
states are described as the bright state B and the dark 
state A according to fluorescence microscopy where 


this concept circumvents the diffraction limit by selec¬ 
tively switching in space and time between these 
states. To achieve sub-diffraction resolution, two 
processes are required: the fluorescent marker can 
reversible switch between a bright and a dark state 
B ^ A and the shape of the switching beam has to 
feature at least a zero-intensity point (which can be 
easily created by interference effects). Two prominent 
implementations have been used, namely, a doughnut 
shape beam and stripe pattern. Starting with such a two 
level system, the basic of this idea is to have a (mostly) 
light driven transition that switches the molecules from 
the state B to state A with a rate ksA depending on the 
cross section (Jba and the intensity of the switching 
beam Iba, ^ba = (^baIba - This transition has to be opti¬ 
cally saturable. Due to the specially shaped switching 
light distribution, Iba only falls in areas within the 
periphery of zero-intensity points and only there mol¬ 
ecules in state B are brought to the dark state A. As 
mentioned before, the transition B ^ A has to be sat¬ 
urable. In other words, from a certain intensity level of 
the switching beam all molecules can be considered to 
be driven to the dark state. Typically, a parameter 
named saturation intensity 7^^' is introduced that 
deflnes the intensity of the switching beam needed to 
convert 50% of the molecules from B to A. lg‘^ is 
a system constant parameter which mainly depends 
on the effective cross section cjba- With an increase 
of Iba, owing to the saturation effect, not only mole¬ 
cules located in the maximum of the switching beam 
are switched dark but also molecules closer to zero- 
intensity points. The result is an effective focal area 
containing molecules in the bright state B which size is 
significantly smaller than the diffraction limit. What 
makes this method so important is the fact that no 
theoretical limit of the resolution is given: the final 
extension of the effective active (bright) volume is 
mainly governed by the efficiency of the switching 
process. This switching efficiency can directly be 
influenced by the intensity Iba of the irradiated light 
or by the cross section (Tba for the transition. 

Experimentally the RESOLET concept has been 
realized in forms of ground state depletion 
(Bretschneider et al. 2007), photoswitchable proteins 
(Grotjohann et al. 2011; Hofmann et al. 2005), sub¬ 
diffraction direct laser writing lithography (Eischer 
et al. 2010; Harke et al. 2012), and most prominently 
STED microscopy which will be discussed in the 
following section. 
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Fluorescence Switching by Stimulated 
Emission Depletion 

STimulated Emission Depletion (STED) microscopy 
is part of the RESOLET concept and was first proposed 
in 1994 by Hell and Wichmann (Hell 2003; Hell and 
Wichmann 1994). In this concept, switching between 
the fluorescent state Si (analogue to the bright state 
B in the RESOLET concept) to the ground state 
So (state A) happens by means of stimulated emission. 

A dye molecule is excited from the ground state 
So to a higher vibronic sublevel of the first excited state 
S”* by the excitation beam (Fig. la). Within several ps 
it decays via internal conversion to the lowest energy 
level of S 1 . From this state fluorescence occur in a time 
range of several ns. The switching beam in STED 
microscopy, commonly named STED beam, adds 
a competing de-excitation pathway to the molecule. 
It stimulates the molecule to release its energy across 
an energy transition which corresponds to the same 
energy, i.e., same wavelength, as the STED beam 
itself. Since the wavelength of the STED beam is 
chosen to be red shifted relative to the fluorescence 
(detected) wavelength the molecule appears dark to the 
fluorescence detector. Figure lb shows an example of 
wavelength combination for the crimson fluorescent 
microspheres (Invitrogen) typically used as calibration 
samples for STED microscopes working in the red to 
near infrared spectrum. Excitation wavelength is set to 
635nm and generates the corresponding fluorescence 
signal measured between 650 and 690 nm. The STED 
wavelength has to be chosen carefully in order to have 
a sufficient cross section for stimulated emission which 
in first approximation follows the emission spectrum 
of the molecule. In the other hand, STED wavelength 
has to be red shifted in order to avoid direct excitation 
of the molecule. In this example, the STED wave¬ 
length can be chosen between 730 and 770 nm. 

In order to characterize whether a specific dye or 
a wavelength combination shows a significant STED 
effect, the fluorescence level has to be measured for 
different STED intensity levels Isted applied (Fig. Ic). 
For this calibration measurement, Gaussian-shaped 
excitation and STED foci are overlaid in space and 
the fluorescence signal emitted from a specific sample 
(in this case the same fluorescent spheres as in Fig. lb) 
is measured. The curve shows efficient fluorescent 
reduction due to the STED beam and it represents 


a key experiment for STED microscopy since various 
important parameters can be estimated from this curve. 

As described in the previous section, the saturation 
intensity gives the level of STED intensity 
needed to switch off 50% of the fluorescence signal 
and it can be read out by this depletion curve. Also the 
information to which extent the fluorescence can be 
switched off can be directly seen in the measurement. 
A remaining fluorescent signal is to a certain extent 
(<15%) acceptable but it surely reduces signal to noise 
ratio during STED imaging (Galiani et al. 2012). 
In this example curve, the remaining fluorescence can 
be considered to be low enough for high resolution 
imaging. 

In order to achieve sub-diffraction resolution with 
a STED microscope the STED beam focus has to be 
modified to exhibit a central zero point in its center 
which can be realized by placing a phase plate into the 
STED beam path. Details on various phase masks will 
be discussed in the following section while in this 
section the focus is on the use of the “classical” phase 
mask creating a doughnut-shaped focal distribution 
improving resolution along the lateral plane. 
As described in the previous section, the saturation 
effect causes the switching of molecules very close to 
the center of the excitation focus. The final resolution 
of a STED microscope can be described as the size of 
the effective volume in which the molecules emit 
a fluorescence signal. Quantitatively, the resolution 
d can be described as follows (Harke et al. 2008a): 



dc denotes the confocal resolution and a represents the 
shape, especially the pattern steepness, of the STED 
focus. Both parameters are system constants. In order 
to achieve maximum resolution, the factor has to 

.STED 

be raised which can be done experimentally by simply 
increasing the STED intensity Isted- Experimental 
validation is shown in Fig. 2 in which the STED 
beam intensity was stepwise increased and the 
improved resolution of the STED microscope can be 
directly seen in the imaging of fluorescent beads (same 
as measured in Fig. 1). Since in lab conditions, avail¬ 
able laser power is limited, it is not always straightfor¬ 
ward to simply increase Isted- Alternatively, Isted 
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Stimulated Emission Depletion (STED) Microscopy, 
Fig. 1 (a) Jablonsky diagram of the involved states during 
STED transitions; (b) Absorption (blue) and emission 


(red dotted) spectra of crimson fluorescent spheres; (c) Fluores¬ 
cence reduction due to increasing STED light intensity (All data 
from Harke 2008) 



Stimulated Emission Depletion (STED) Microscopy, Fig. 2 STED microscopy imaging with increasing STED intensities 
(Harke et al. 2008a) (from left to right). Left panel confocal, scale bar 200 nm 


be reduced by further optimizing the dye system or the 
STED wavelength in order to improve the cross section 
for stimulated emission. 

1D, 2D, and 3D STED Imaging: Shaping the 
Focal Spot 

In a STED microscope, the fluorescence is switched 
off in regions where the STED beam falls. In order to 
gain resolution without losing signal the STED beam 
has to be of zero intensity in its center in order to 
maximize the fluorescence on-off contrast between 
the center and the periphery. However, apart from the 
central zero the STED focus can be shaped arbitrary. 
Experimentally the focus shaping is realized by using 
proper phase masks (or to be more flexible a spatial 
light modulator) that are simply placed into the STED 
beam paths. Three main phase masks (Fig. 3) have 


been elaborated so far in order to finally achieve ID, 
2D or 3D resolution enhancement (Harke 2008). Also 
combinations of phase plates can be used in order to 
either increase the total STED power by combining 
two identical phase plates (Wildanger et al. 2009) or 
in order to extent the dimensions in which the resolu¬ 
tion is enhanced (Harke et al. 2008b; Wildanger 
et al. 2009). 

STED Microscopy with Various Laser Sources 

In order to sufficiently switch the fluorescence signal, 
the STED effect has to compete with the spontaneous 
decay, i.e., fluorescence, which has a lifetime in the 
order of ns. That requires a high photon flux during the 
time the dye molecule populates the excited state Si. 
Typically, a high photon flux within a short time is 
provided by a pulsed laser source which is the reason 
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Stimulated Emission Depletion (STED) Microscopy, Fig. 3 The three main phase plates used for STED microscopy and possible 
combinations 


why STED has been first (Westphal and Hell 2005) 
implemented with ultrafast Titan-Sapphire (TiSa) 
lasers with a repetition rate of ^80 MHz and a pulse 
width of '^150 fs at laser output. In order to avoid any 
linear/nonlinear excitation of the dye by the STED 
beam itself, the pulses are temporally stretched by 
dispersion in regular single mode fiber. A pulsed exci¬ 
tation diode externally triggered by a photo diode 
signal of the TiSa laser guarantees excitation light 
with a synchronized frequency and a defined phase. 
Delay-time optimization, i.e., excitation pulse is 
placed immediately before STED pulse arrival, can 
be done electronically or optically. However, working 
with two synchronized pulsed laser systems requires 
some experimental precaution and increases the total 
costs of the setup. An alternative system is based on 
a pulsed white light laser source which serves as 


excitation and STED light source (Wildanger et al. 
2008), respectively, by using appropriate filters. 
Delay optimization can be realized by an optical 
delay line (Galiani et al. 2012). The laser system pro¬ 
vides outstanding resolution with reduced costs. 

With lowest costs as well as lowest experimental 
complexity a STED microscope can be realized by the 
use of continuous wave (CW) lasers (Willig et al. 
2007) which makes any synchronization and delay 
optimization obsolete. Due to naturally lower peak 
intensities with CW lasers the final resolution achieved 
cannot reach the resolution values obtainable by pulsed 
laser systems. Nevertheless, resolution values of 
<50nm has been reported. With a combination of 
a pulsed excitation source and a CW STED laser, the 
performance can be significantly enhanced by apply¬ 
ing time-gated detection (Vicidomini et al. 201 1). 
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Summary 

STED microscopy provides a theoretically unlimited 
resolution in a far-field fluorescence microscope. 
Its field of application is mainly but not only focused 
on imaging biological tissue. The wide portfolio of 
compatible dyes and compatible laser systems makes 
the method so powerful in an imaging facility. Lots of 
developments are still performed focusing on how to 
extract highest performance from a given system or 
how to make this method even more variable in terms 
of multicolor and high speed applications. 

Cross-References 

► Fluorescence: General aspects 

► Gated Stimulated Emission Depletion Microscopy 
(g-STED) 

► Two-Photon Excitation Stimulated Emission 
Depletion Microscopy 
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Synonyms 

Direct stochastic optical reconstruction microscopy 
(dSTORM) 


Definition 

Stochastic optical reconstruction microscopy 
(STORM) is a super-resolution imaging technique 
based on single molecule localization of photo¬ 
switchable fluorophores. 

The basic idea behind this technique relies on the 
possibility to switch a fluorophore between a “bright” 
and a “dark” state, thus collecting and localizing with 
nanometer accuracy the position of a sparse subset of 
emitters. The sample is labeled by indirect immuno¬ 
fluorescence with paired photoswitchers, whose spec¬ 
tral properties can be readily controllable and which 
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exhibits high brightness, a good thermal stability, and 
low fatigue rates. Similarly, a variant based on the 
basic concept of STORM and named direct STORM, 
employs conventional synthetic fluorophores and 
exploits the transition to dark states to obtain the 
sparseness of the collected molecules. Further devel¬ 
opments of STORM techniques allowed to perform 
multicolor and 3D super-resolution imaging achieving 
a lateral localization accuracy of approximately 25 nm 
and an axial one close to 50 nm in fixed samples. 

STORM belongs to the wide family of ► individual 
molecule localization based techniques (► IML), 
which comprises also ► photoactivation localization 
microscopy (PALM), ► Fluorescence photoactivation 
localization microscopy (FPALM), ► ground state 
depletion imaging (GSDIM), ► fluorescence imaging 
with one-nanometer accuracy (FIONA), ► single¬ 
molecule high-resolution imaging with 
photobleaching (SHRIMP), and *• SHREC. 
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Introduction 

Structural genomics (SG) can be defined as the 
systematic application of current technology to 
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structural studies driven by the availability of gene 
sequences. It is, in a way, an engineering approach 
to structural biology. The held arose in the wake of 
the success of genome sequencing efforts with the 
realization that amino acid sequence information was 
far outpacing our understanding of the relationship 
between sequence and structure. The question arose 
whether structural studies, as with sequencing, 
could be carried out more efficiently on a large 
scale. Pilot projects were started in Japan, Europe, 
and the USA to explore this question, and 
a key organizational meeting was held at Argonne 
National Laboratory in the USA. It was becoming 
clear that proteins exist as a hnite number of 
folds and structural classihcations and that structures 
could be predicted given a close enough sequence 
identity. In addition, the beam lines at synchrotron 
facilities were enabling very rapid single-crystal 
X-ray diffraction data collection. Structures could be 
solved efficiently by anomalous dispersion phasing 
through the incorporation of selenomethionine into a 
recombinant protein. These pilot projects led to the 
formation of SG projects in many countries around the 
world, the largest of which were supported by granting 
agencies in Japan (RIKEN, httpV/www.rsgi.riken.go. 
jp/rsgi_e/index.html) and the USA (Protein Structure 
Initiative (PSI) http://www.nigms.nih.gov/Initiatives/ 
PSI/) and by an international consortium of govern¬ 
ments, charitable foundations, and industry (the Stmc- 
tural Genomics Consortium (SGC), http://www. 
thesgc.org) (Edwards 2009). 

The underlying policies for the US SG effort were 
worked out at workshops sponsored by the National 
Institutes of Health. This led to a new way of 
conducting biological science in which targets and 
their progress were to be disclosed, solved 3D struc¬ 
tures were to be deposited and released promptly from 
the Protein Data Bank, protocols and methodology 
were to be exchanged freely, and clones and expres¬ 
sion plasmids were to be made available publically. 
The work at the large-scale PSI centers was conducted 
by a large team of scientists, and the large-scale and 
smaller centers formed a collaborative network. 

All SG projects rely on the availability of gene 
sequences and are limited by the extent of coverage 
and quality of sequencing and annotation. A popular 
initial approach was to go after the “low hanging fruit,” 
namely, those proteins, generally from bacterial 
species (often hyperthermophiles), that could be 


expressed and yielded structures. An early debate was 
whether ► nuclear magnetic resonance spectroscopy 
should play any role in SG because, unlike X-ray 
crystallography, NMR was not a high-throughput 
approach. However, the few centers that employed 
both NMR and X-ray soon were able to document 
that the two approaches are complementary, with 
NMR showing higher success rates with smaller 
proteins and X-ray with larger proteins. The major 
bottleneck for both approaches was shown early on to 
be protein production (Edwards et al. 2000). 

Target Selection 

Protein targets in SG have been chosen in various ways 
to reflect the focus of a center. An early emphasis 
of several SG centers was the discovery of new folds, 
but it soon became apparent that the total number of 
folds was smaller than anticipated. The major goal of 
the large-scale PSI SG centers was to explore the rela¬ 
tionship between protein sequence and structure. Their 
strategy was to sort all available sequences into clusters 
related by close sequence identity and to solvestructures 
of representative members of each large family. The 
advantage of this approach was that it maximized the 
leverage of structure prediction in providing structural 
models for a large number of proteins. 

The SG approach currently is being applied to 
a wide variety of target classes; several of these are 
listed below as examples: 

Cancer signaling proteins 

Comprehensive structural coverage of all proteins 
from a model organism such as Methanococcus 
jannaschii and Thermotoga maritima 
Drug discovery for infectious diseases (Anderson 
2009; Van Voorhis et al. 2009) 

Enzyme function (Gerlt et al. 2011) 

Epigenetic readers, writers, and erasers 
Kinases 

Membrane proteins 
Metabolic enzymes 

Metagenomic proteins (e.g., human gut, oceanic) 
Metal-binding proteins 
Mitochondrial proteins 
Phosphatases 

Protein design and synthetic biology 
Tuberculosis (Chim et al. 2011) 

Viral proteins 
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US Protein Structure Initiative 

The Protein Structure Initiative (PSI) supported by 
the Institute for General Medical Sciences, National 
Institutes of Health, was initiated in 2000. The first 
5-year phase (PSTI) supported nine pilot projects 
aimed at testing the concept of high-throughput 
structure determination and at identifying bottlenecks 
and approaches to overcoming them. The second 
5-year phase (PST2) funded four large-scale structure 
determination centers, four smaller, specialized centers 
focused on technology development, two membrane 
protein centers, and two ► homology modeling cen¬ 
ters. Toward the end of the 5-year period, a materials 
repository and a knowledge base were added. 

The current PSI program, PSI: Biology, began 
in 2010. The PSI: Biology program supports a network 
consisting of four “Centers for High-Throughput Struc¬ 
ture Determination,” nine “Centers for Membrane 
Protein Structure Determination,” twelve “Consortia 
for High-Throughput-Enabled Structural Biology 
Partnerships,” the “Materials Repository” (Cormier 
et al. 2011), and the “PSI Structural Biology 
Knowledgebase” (PSTSBKB) (http://www.sbkb.org/). 
As stated on the PSTSGKB website, the goal of PSI: 
Biology is “to apply the paradigm of high-throughput 
structure determination via highly organized networks 
of investigators to solve the three-dimensional structure 
of proteins and macromolecular complexes of substan¬ 
tial biological and biomedical problems and to use these 
structures to understand their biological function.” 
The PSTSBKB maintains “TargetTrack,” which docu¬ 
ments online all targets studied by the PSI and other 
cooperating SC programs, their progress and status, and 
the protocols used in both successful and unsuccessful 
trials. TargetTrack provides a wealth of information of 
use to persons interested in particular proteins or for 
longitudinal studies of how sequences succeed or fail. 

Technology Development and Structural 
Genomics 

An indisputable by-product of the international efforts 
in SC has been the development of technologies that 
have reduced sharply the costs of protein structure 
determination by X-ray crystallography and NMR 
spectroscopy. These developments have been funded 
directly by or have been stimulated by SC centers. The 


advances span the entire gamut of the protein structure 
determination pipeline: bioinformatics used in target 
selection and structure prediction, information man¬ 
agement systems, strategies for protein production 
and characterization, structural data collection and 
analysis, evaluation and validation of structures, and 
tracking the impact of centers. Prior to SC funding, 
it was difficult to find support for such technology 
development. But even more important was that SC 
pipelines provided the means for evaluating technolo¬ 
gies on a large number of targets so superior general 
approaches could be clearly identified, along with 
salvage pathways to rescue targets that failed. 

The “Technology Portal” of the PSTSBKB provides 
ready access to descriptions of methods used in high- 
throughput structural biology efforts along with refer¬ 
ences for further information. The Technology Portal 
can be searched by text queries or by browsing under 
several headings: target selection, reagents, cloning, 
protein expression, purification, crystallography, 
NMR, annotation/function, modeling, and dissemina¬ 
tion tools. The PSTSBKB hosts an online technology 
discussion forum for comments and questions to the 
community and a YouTube channel that presents 
video examples of methods and instrumentation. 

Bioinformatics advances have included algorithms 
for predicting the success of a target on the basis of 
its sequence. These have made use of predictors of 
disorder or ► structurally disordered proteins, informa¬ 
tion on regions of low complexity, and predicted 
transmembrane stretches. The SC centers have been 
major contributors of new families, superfamilies, and 
folds to the Structural Classification of Proteins (SCOP) 
database. This additional information on ► structure 
comparison has led to increased diversity of the 
known protein structure space, including new topolo¬ 
gies and structural features (Andreeva and Murzin 
2010), and has improved the ► biologically significant 
alignment of protein sequences. Structures determined 
by SC centers and put temporarily on hold at the World¬ 
wide Protein Data Bank (wwPDB) have provided valu¬ 
able targets for several ongoing competitions: Critical 
Assessment of Structural Prediction (CASP) 
(Kryshtafovych et al. 2011), Critical Assessment of 
PRedicted Interactions (CAPRI) (Janin 2010), and Crit¬ 
ical assessment of Automated Structure Determination 
by NMR (CASD-NMR) (Rosato et al. 2009). 
These competitions, respectively, have spurred the 
development and comparison of improved methods 
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for ► predicting protein structures, the ► structure of 
protein complexes, and automated ► structure determi¬ 
nation by NMR. In addition, substantial advances have 
been made in the validation of protein structures deter¬ 
mined by X-ray crystallography and NMR spectros¬ 
copy, and the wwPDB has sponsored “Validation 
Task Forces” for both approaches aimed at the devel¬ 
opment of freely available software to be used to assess 
the ► quality of protein structural models. 

The larger SG centers have developed laboratory 
information management systems (LIMS) used to 
organize, track, and disseminate work in centers 
(Haquin et al. 2008). The Sesame LIMS, which 
was developed at the Center for Eukaryotic Structural 
Genomics (CESG) and is freely available (http://www. 
sesame.wisc.edu), is in use at multiple PSI centers as 
well as traditional biological laboratories. Sesame 
provides a convenient method of capturing informa¬ 
tion that is mandatory or recommended for inclusion in 
the PSI-SBKB. 

The Joint Center for Structural Genomics (JCSG) 
has developed streamlined methods for cloning, 
including automated primer generation and ordering 
of oligonucleotides and the polymerase incomplete 
primer extension (PIPE) cloning method. The cost of 
gene synthesis has fallen to the point where it has 
become the method of choice for obtaining DNA cod¬ 
ing for targets that are difficult to clone. Although 
algorithms have been developed for codon optimiza¬ 
tion of synthetic genes and shown to improve yields in 
some cases, this approach has yet to be found useful in 
the context of SG. 

All of the cloning and expression vectors developed 
by the PSI are available from the Materials Repository 
(Cormier et al. 2011), and expression vectors are 
ordered as frequently as clones containing open 
reading frames coding to protein targets. The most- 
ordered clone has been that for a reagent: CESG’s 
vector for the production of tobacco etch virus (TEV) 
protease, which is widely used to cleave peptide or 
protein tags attached to structural targets. 

By far, the most successful approach to protein 
production has been to express a construct coding for 
the target fused to an oligohistidine tag in Escherichia 
coli cells and to isolate the product by immobilized 
metal ion affinity chromatography (IMAC). Members 
of a variety of SG groups collaborated on the publica¬ 
tion of a consensus “what to try first” description of 
protein production and purification based on this 


approach (Graslund et al. 2008). Unfortunately, 
one size does not fit all when it comes to protein 
production, and alternative approaches are needed to 
improve success rates. The CESG has demonstrated an 
excellent success rate with eukaryotic proteins by com¬ 
bining E. coli cell-based protein production with 
a robotics-driven wheat germ cell-free platform 
(Markley et al. 2009); targets that failed with one 
approach often succeeded with the other. The wheat 
germ cell-free platform also has shown promise for the 
production of ► membrane proteins (Beebe et al. 
2011). Additional approaches used in SG centers 
include protein production from yeast, mammalian, 
and insect cells. Both cell-based and cell-free 
approaches have been developed for the simultaneous 
production of two or more proteins that form com¬ 
plexes. The protein yield frequently can be improved 
by producing the target protein as a cleavable fusion 
with an oligohistidine tag or another protein, such as 
SUMO or maltose-binding protein, which may assist 
the protein to fold correctly. 

Protein targets for X-ray crystallography generally 
are labeled with selenomethionine to provide for 
single- or multi-wavelength anomalous diffraction 
(SAD/MAD) phasing of the collected data. Protein 
targets for NMR spectroscopy are labeled with stable 
isotopes (*^C, *^N, and possibly see: ► Stereo- 
Array Isotope Labeling (SAIL) Method). Improved 
labeling techniques based on autoinduction or cell- 
free technology have come from SG (Markley et al. 
2009). 

Once proteins are produced and isolated, it is impor¬ 
tant to characterize them for purity (gel electrophore¬ 
sis), expected molecular weight (mass spectrometry), 
dispersity (► dynamic light scattering), and oligomeric 
state (gel filtration). One can, in addition, use 
thermodenaturation to screen for protein stability as 
a function of buffer composition or to detect ligand 
binding, and purified proteins can be subjected to high- 
throughput screens for enzymatic activity (Vedadi 
et al. 2010). 

SG projects have stimulated technology develop¬ 
ment on several fronts for the benefit of ► macromo- 
lecular crystallography (Joachimiak 2009). The amount 
of protein required for crystallization trials has been 
reduced through the application of microfluidics and 
the miniaturization of crystal screening and ► crystal 
preparation. The operation of beam lines has been 
streamlined by the introduction of crystal 
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cryoprotection and robotics for crystal handling. 
A third-generation synchrotron beamline has the 
capability of turning out 300 structures/year. The SG 
programs have contributed to the establishment of 
SAD/MAD as a routine method for X-ray structure 
determination. Data for a structure can be collected 
from a single crystal, and phases can be extracted 
semiautomatically with anomalous signal phasing. 
Some beam lines now have mini-beam (5-20 pm diam¬ 
eter) capability, which improves signal-to-noise and 
effective diffraction limit while accommodating 
smaller crystals that are easier to produce. SG also has 
stimulated the development of powerful new software 
packages (e.g., HKL3000 and PHENIX) that enable 
rapid solution and refinement of X-ray structures. Soft¬ 
ware also has been developed to expedite PDB deposi¬ 
tions. Salvage pathways have been developed and/or 
refined at SG centers for the redesign of proteins resis¬ 
tant to crystallization. ► H/D exchange detected 
by mass spectrometry or in situ partial proteolysis com¬ 
bined with mass spectrometry can be used to identify 
unstructured regions. Another approach is surface 
entropy reduction through the replacement of clusters 
of two or three solvent-exposed high conformational 
entropy residues with alanine or other small residues. 

► Protein NMR spectroscopy has both made 
a contribution to SG and has benefitted from techno¬ 
logical advances stimulated by SG (Montelione et al. 
2009). The technological advances include methods 
for fast ► triple-resonance NMR data collection 
through ► sparse sampling, automated data analysis 
and assignment, signal recognition, secondary struc¬ 
ture determination, automated NOE assignment, 
improved methods for ► structure determination by 
NMR, and structure refinement and validation. An 
important by-product of the increased number of 
protein NMR structures deposited in the wwPDB 
with assigned chemical shifts deposited in the Biolog¬ 
ical NMR Data Bank (► BMRB) has been improved 
methods for determining structure from chemical 
shifts (see: ► Chemical Shift Index). These benefitted 
algorithms for automated assignment and secondary 
structure determination from NMR data and also 
approaches for determining protein structure from 
chemical shifts (Shen et al. 2008) and other sparse 
data (residual dipolar couplings and/or small angle 
X-ray scattering). On the NMR instrumentation side, 
► cryogenic probes have been developed that require 
smaller sample volumes, the sensitivity of standard 


cryogenic probes for 5-mm OD NMR tubes has 
increased, automated sample changers have been 
developed that can keep samples chilled prior to 
being moved to the NMR spectrometer for data collec¬ 
tion, and NMR sample tubes have been designed that 
yield improved sensitivity with salt-containing protein 
samples (salt frequently must be added to promote 
protein solubility and stability). 

Summary 

SG projects have demonstrated that protein structure 
determination can be streamlined by the application of 
engineering principles to develop efficient pipelines. 
This approach has reduced the costs of protein structure 
determination by a factor of five or more while increas¬ 
ing the quality of the resulting structures. SC centers 
have developed technology - much of which 
is applicable to smaller scale research - and have dis¬ 
seminated this technology freely to the scientific com¬ 
munity. The US PSI effort, which consists of a large 
network of researchers who share information and tech¬ 
nology, represents a novel approach to biological sci¬ 
ence - one that may serve as a model for other kinds of 
projects in the future. 

Protein structures deposited by SG projects now 
represent about 14% of those in the wwPDB; they 
represent a much higher fraction of unique structures 
than found in the PDB overall. SG structures 
have greatly increased our understanding of the relation¬ 
ship between sequence and structure and have improved 
our ability to model structures of proteins with related 
sequence (30% sequence identity or less). The majority 
of proteins associated with key molecular processes are 
now represented by a stmctural model. One of the 
brightest outcomes has the demonstration that structures 
of ► membrane proteins can be attacked by SG 
approaches. The success rate with membrane proteins, 
including GTP-binding protein-coupled receptors 
(GPCRs) (Katritch et al. 2012), is increasing steadily. 

However, there is much more to be discovered. 
The current challenges are to learn more about the 
relationship between structure and function and to 
improve our ability to deal with more difficult targets. 
There is a continuing need for higher efficiency 
in all parts of the SG pipeline. This is particularly 
true for human proteins and other proteins of high 
biological importance. Looking to the future, efficient 
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approaches are needed for investigations of molecular 
machines that consist of multiple proteins and protein- 
ligand complexes. In studying ► macromolecular 
interactions, hybrid approaches that make use of 
data from a variety of methodologies (EM, X-ray, 
NMR, SAXS, molecular modeling) appear particularly 
attractive (see: ► HADDOCK, ► Mass Spectrometry: 
Mapping Large Stable Protein Complexes). In addi¬ 
tion, it is important to leam more about the inherent 
flexibility of proteins, ► coupled folding and binding, 
multiple conformations that are populated in solution, 
and the sparsely populated (high-energy) conforma¬ 
tions that play a role in protein function (as can be 
visualized by NMR ► relaxation dispersion). Finally, 
the importance and diversity of RNA molecules is 
becoming more and more apparent. It may be time 
to apply high-throughput approaches to the discovery 
of sequence-structure relationships in RNA molecules. 
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Definition 

The simplest form of mutation is a single DNA base 
change, frequently referred to as a “single nucleotide 


polymorphism” (SNP). Strictly, this term should only 
be applied to single base changes that are observed in 
at least 1% of a “normal” population. However, it is 
frequently used to refer to any single base mutation and 
is used in that context here. Many SNPs occur in 
noncoding regions of DNA, where they may affect 
transcription, ► mRNA splicing, or mRNA stability. 
When a single base change occurs in an exon, it will 
fall into one of three classes: (1) a “synonymous” muta¬ 
tion which does not change the amino acid sequence of 
the resultant protein (although this may still affect 
expression, splicing, or mRNA stability), 
(2) a “nonsense” mutation resulting in premature ter¬ 
mination of the protein sequence, or (3) a “non-synon- 
ymous” (or “missense”) mutation (an nsSNP) resulting 
in a single amino acid change. At the protein level, an 
nsSNP results in a “single amino acid polymorphism” 
(SAAP, a term we use regardless of frequency in the 
population). Any amino acid change will have some 
structural effect on the protein, possibly leading to 
► misfolding. 

Introduction 

SAAPs result in a phenotype which can be classified in 
terms of its effect on the fitness of the individual; 

(1) beneficial - resulting in an increase in fitness, 

(2) neutral - having no visible effect on fitness, and 

(3) deleterious or pathogenic - resulting in a decrease 
in overall fitness. Interestingly, some pathogenic muta¬ 
tions introduce an amino acid which is the native 
amino acid in another species, a concept known as 
a “compensated pathogenic deviation” (Baresic and 
Martin 2011). 

The fraction of individuals having the mutation 
who exhibit the phenotype is referred to as the 
“penetrance” - 100% penetrance mutations are 
inherited in a Mendelian fashion, while lower pene¬ 
trance mutations often express their phenotype as 
a result of epistatic interactions with other mutations, 
or with the environment. 

Any change in phenotype is likely to result from 
structural effects that the mutation has on the protein 
structure. Such effects fall into three broad categories: 
{\) functional mutations - those which affect residues 
directly involved in protein function, (2) fold¬ 
preventing mutations - those which physically stop 
the protein from being able to fold up in the correct 
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way, (3) fold-destabilizing mutations - those which do 
not stop the protein from being able to fold up in the 
correct way, but which destabilize the correct fold with 
respect to misfolded or unfolded conformations. This 
is the most interesting category as it may be possible to 
design drugs to bind to the correctly folded form 
thus stabilizing it and restoring activity through the 
► Law of Mass Action. Since most proteins have 
a surprisingly small thermodynamic stability (between 
—3 and —10 kcal/mol) and most SAAPs lead to a A AG 
of 0.5-5 kcal/mol, it is not surprising that many SAAPs 
will destabilize the protein (DePristo et al. 2005). 

Structural Effects 

Structural effects of mutations include the following: 

• Introducing large residues that clash - Clearly, if 
a small residue is replaced by a larger (or more 
bulky) residue, there is a strong possibility that the 
replacement sidechain will clash with the surround¬ 
ing amino acids. This will destabilize the protein 
and is likely to prevent it from folding correctly. 

• Introducing a void - Conversely, if a buried large 
sidechain is replaced by a smaller sidechain, a void 
may be left in the protein. Voids are unfavorable 
both for enthalpic and entropic reasons and conse¬ 
quently the protein will be destabilized and is likely 
to change its conformation to hll the void. 

• Mutations to proline - Proline is unique among the 
amino acids as its cyclic sidechain, which links 
back onto the backbone nitrogen (strictly making 
it an “imino” acid), restricts the rotational freedom 
of the bond between the backbone nitrogen and the 
Cot (the cp angle). Consequently, if another amino 
acid, adopting a (p angle not allowed for proline, is 
mutated to a proline, some structural rearrangement 
will have to occur to accommodate the proline. 

• Mutations fl-om glycine - Glycine lacks a sidechain. 
This means that there is less conformational restric¬ 
tion on combinations of backbone cp and \|/ angles; 
in particular, positive tp angles are largely inacces¬ 
sible to the other amino acids. If a glycine adopting 
a backbone conformation not accessible to other 
amino acids is mutated to any other amino acid 
then some structural rearrangement of the backbone 
will have to occur. 

• Disruption of cis-prolines - The unique nature of 
proline also means that, unlike the other amino acids. 


the cA-isomer of the peptide bond preceding the 
proline has approximately the same stability as the 
trans-isomer. Consequently, a mutation of a cis- 
proline to any other amino acid will destabilize the 
protein stracture probably leading to a change in 
backbone conformation from the cis- to trans-isomer. 

• Introduction of hydrophobic residues on the 
suiface - Protein folding is driven by the hydropho¬ 
bic effect - an entropy-driven need to bury hydro- 
phobic residues and expose hydrophilic residues on 
the surface. Thus a mutation which introduces 
a hydrophobic residue on the surface will destabi¬ 
lize the protein and may result in some change in 
folding to bury the residue. 

• Disruption of sidechain hydrogen bonds - When 
hydrophilic (hydrogen-bond capable) residues do 
occur inside proteins, hydrogen-bonding capability 
is almost always fully satisfied; in other words, 
a buried sidechain capable of making a hydrogen- 
bond interaction almost always will be making 
a hydrogen bond (McDonald and Thornton 1994). 
Consequently, if a sidechain is involved in 
a hydrogen bond (with another sidechain, or with 
backbone) and the resultant sidechain after 
a mutation cannot maintain that hydrogen bond, 
then the protein will be destabilized. 

• Introducing a hydrophilic or charged residue in the 
protein core - For the same reasons that disrupting 
a hydrogen bond will destabilize a protein, introduc¬ 
ing a hydrophilic sidechain into the core will result in 
an unsatisfied hydrogen bond, thus destabilizing the 
protein. If the residue is charged, then it will be further 
destabilizing as the charge will not be satisfied. 

• Mutations to residues involved in an interface or 
binding - Proteins function through their interac¬ 
tions with ligands or other proteins; additionally 
many proteins function as ► protein complexes 
(homo- or hetero-multimers). Consequently a muta¬ 
tion to a residue involved in an interface, or making 
specific binding interactions with another protein or 
ligand, is likely to affect function. 

• Mutations to other functional residues - In addition 
to binding, residues may be involved in catalytic 
mechanisms or post-translational modihcations. 
Mutations to such residues may affect function. 

• Mutations to or from cysteine - Cysteines may 
be involved in disulphide bonds. Mutations to 
these may destabilize the protein. If a protein con¬ 
tains disulfides, then introducing an additional 
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cysteine may lead to scrambling of disulfides and 
misfolding. 

Residues which are highly conserved probably have 
some structural importance even if this cannot be 
determined directly. At the structural level, residues 
that are close to highly conserved patches may be 
functionally important even if they are not highly con¬ 
served themselves. 


Predicting and Visualizing Structural Effects 

A number of tools have been developed to predict and 
visualize the structural effects of mutations including 
the following: 

• TopoSNP (http://gila.hioengr.uic.edu/snp/toposnp/) 
provides sequence-hased metrics, but maps muta¬ 
tions to structure. 

• ModSNP accessible via Swiss-Prot web pages pro¬ 
vides information from Swiss-Prot features and 
maps the mutation to structure using ► homology 
models where necessary. 

• stSNP (http://ilyinlab.org/StSNP/) links SNPs with 
pathway information from KEGG and maps muta¬ 
tions to structure. 

• SNPejfect (http://snpeffect.vih.he/) focuses on 
information related to catalytic active sites, second¬ 
ary structure formation, aggregation, ► amyloid 
formation, solvent accessibility, and trans¬ 
membrane or cellular localization as well as phos¬ 
phorylation and glycosylation sites. 

• SAAPdh (http://www.hioinf.org.uk/saap/db/) pro¬ 
vides a detailed, pre-calculated, analysis of struc¬ 
tural effects of mutations as described above. 
A server for novel mutations is under development. 

Prediction of Phenotypic Effects 

The importance of mutations in disease has led to 
a number of methods for predicting whether mutations 
will be tolerated. Many of these rely only on sequence 
information (e.g., SIFT (http://sift.bii.a-star.edu.sg/) 
and Polyphen (http://genetics.bwh.harvard.edu/pph/)), 
but some make use of structural information including 
the following: 

• nsSNPAnalyzer (http://snpanalyzer.uthsc.edu/) uses 
solvent accessibility and secondary structure elements. 


• SNAP (http://www.rostlab.org/services/SNAP/) 
uses solvent accessibility to optimize sets of train¬ 
ing features. 

• PMUT (http://mmb2.pcb.ub.es:8080/PMut/) uses 
secondary structure to select between optimized 
neural networks. 

• SNPsSD (http://www.snps3d.org/) uses structural 
effects such as solvent accessibility, hydrophobic- 
ity, electrostatic interactions, and atomic packing to 
estimate thermodynamic stability. 

• PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2/) 
uses a combination of eight sequence-based and 
three structure-based features and contains models 
optimized for Mendelian-inherited and low- 
penetrance nsSNPs. 

Cross-References 

► Amyloid Formation 

► Homology Modeling of Protein Structures 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Law of Mass Action 

► Splicing 

► Subunit Composition of Protein Complexes 
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Introduction and Definition 

Structurally or intrinsically disordered proteins (IDPs 
for short) are typically defined as biologically active 
proteins that do not adopt a well-defined tertiary struc¬ 
ture or native fold when isolated in physiological solu¬ 
tion in the absence of interaction partners. They are 
sometimes also referred to as intrinsically unstructured 
proteins or natively unfolded proteins, and the same 
definition can be applied to regions of polypeptide that 
comprise a part of a larger protein, which are then 
referred to as intrinsically disordered regions (IDRs). 
Despite a general consensus regarding this definition, 
there is still considerable controversy regarding what 
constitutes an IDP since the definition can encompass, 
in the broadest sense, an entire continuum when 
applied to polypeptides, ranging from highly flexible 
and dynamic regions that behave nearly like ideal 
random coil models at one extreme to highly organized 
and sometimes nearly native molten globule-like spe¬ 
cies. Indeed, while the widespread existence of IDPs 
has only become appreciated relatively recently 
(Tompa 2009), the novelty of this appreciation is 
most evident in the case of highly disordered IDPs, as 
more ordered (yet nonnative) protein states have been 
recognized to exist and been studied for some time. 

IDPs Versus Disordered Polypeptides 

It is important to distinguish between “authentic” 
IDPs, which must be real proteins (or protein regions 
for IDRs) that are genetically encoded in some organ¬ 
isms and are biologically active, from arbitrary disor¬ 
dered polypeptides that do not constitute real proteins 
and which can represent anything from randomly gen¬ 
erated amino acid sequences to fragments of intact 
proteins that lack structure only because they have 
been excised from a more complete context (i.e., 
sequence) in which they would be able to fold. 

IDPs In Vivo 

Given that many IDPs do fold up into a well-defined 
structure when in complex with their protein binding 
partners, some have suggested that IDPs may never be 
found in their disordered “free” state in vivo and that 
their “intrinsically disordered” properties may 


therefore not be of great relevance to their biology. 
There are several lines of evidence that suggest this 
conclusion may be flawed, perhaps the most convinc¬ 
ing of which is that while many IDPs fold upon 
encountering binding partners (► Coupled Folding 
and Binding) (Wright and Dyson 2009), there are 
clear instances of IDPs or IDRs that do not, and that 
instead remain significantly disordered even in func¬ 
tional complexes (Mittag et al. 2010), strongly 
suggesting that this would be the case in vivo as well. 
Nevertheless, it will be difficult to settle this debate 
conclusively until robust determinations can be made 
regarding the conformations sampled by IDPs in 
authentic in vivo settings, and this goal therefore rep¬ 
resents one of the most important challenges in the IDP 
field. 

Prevalence of IDPs and the Structure/ 
Function Paradigm 

The existence of IDPs as a general structural class of 
proteins emerged from a convergence of studies cen¬ 
tered on individual examples, each of which yielded 
the (at the time) surprising result that an authentic and 
biologically intact protein was found to be highly 
dynamic and lacking in ordered stmcture (Wright and 
Dyson 1999), and more directed efforts, inspired in 
part by these early observations, to identify the primary 
sequence characteristics of IDPs and use these to probe 
genomic databases in order to ascertain the general 
prevalence of IDPs and IDRs (He et al. 2009). These 
latter efforts quickly lead to a surprising result: IDPs 
are not unusual rarities and instead are abundant 
through the proteomes of essentially all organisms, 
ranging from simple viruses to human beings. Indeed, 
the fraction of predicted protein sequences identified as 
IDPs or proteins containing IDRs of substantive length 
has been estimated to be as great as 30% in some 
genomes (Oldfield et al. 2005). This fundamental 
result presents a direct challenge to the classic struc¬ 
ture/function paradigm, which posits that protein bio¬ 
logical function is inextricably linked to (native) 
protein structure and which has underpinned much of 
structural biology since its earliest days. If structure is 
necessary for function, how can functional yet intrin¬ 
sically disordered proteins exist? Some view this as 
a trivial conundrum and postulate, as alluded to above, 
that IDPs are always found complexed to binding 
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partners in a well-ordered conformation and that adop¬ 
tion of such a structure is necessary for their function, 
effectively restoring the classic paradigm. A more gen¬ 
eral consensus in the field, however, is that even those 
IDPs that become fully ordered in some functional 
contexts require the property of intrinsic disorder to 
carry out the complete set of biological functions with 
which they are associated. In addition, as mentioned 
above, some IDPs retain a considerable amount of 
disorder even when in a functional complex with inter¬ 
action partners (Mittag et al. 2010). 


Why IDPs Exist 

Given the previous dominance of the classic structure/ 
function paradigm, a variety of attempts have proposed 
to explain why IDPs and IDRs should exist. Beyond 
their obvious suitability as flexible linkers between 
ordered proteins or domains, and as sites easily acces¬ 
sible for posttranslational modifications, various 
advantages ascribed to IDPs include (among others) 
increased binding promiscuity (the ability to interact 
with more partners than an equivalently sized globular 
protein), enhanced allostery (► Protein Dynamics and 
Allostery) (the ability to generate a larger coupling 
between binding events and conformational change) 
(Hilser and Thompson 2007), and improved respon¬ 
siveness (the ability to form interactions with high 
specificity but modest affinity that can therefore be 
easily reversed in response to environmental condi¬ 
tions or specific signals). 

While these reasons appear plausible in many cases, 
perhaps a more intriguing formulation of the problem 
may be: why should intrinsically ordered/structured 
proteins exist? There are certainly specific instances 
that appear to call for highly structured proteins, 
including, most prominently perhaps, the need for 
enzymes to create binding sites that are highly com¬ 
plementary to substrates, reaction intermediates, tran¬ 
sition states (► Differential Transition State 
Stabilization - Nonempirical Analysis), and/or prod¬ 
ucts in order to catalyze otherwise inefficient chemical 
processes. It is difficult (though not impossible) to 
imagine how a highly disordered protein could achieve 
effective catalysis. Structure may also be useful where 
highly organized molecular machines, with parts that 
have to be precisely placed and oriented with respect to 


one another and that can carry out work are needed. 
Many large molecular machines, however (e.g., ribo¬ 
somes and polymerases (► RNA Polymerases and 
Transcription), include proteins that are not well struc¬ 
tured outside the context of the intact assembly, so 
intrinsic structure is in fact not an absolute prerequisite 
for the assembly of large macromolecular machines. 
Structure is also thought to provide protection against 
aggregation as well as unregulated proteolysis. Yet 
IDPs (and even those involved in protein aggregation 
disorders (► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation), such as the Alzheimer’s 
protein tau or the Parkinson’s protein alpha-synuclein) 
can be highly soluble, aggregating (► Amyloid 
Formation) only under highly specific conditions, at 
high concentrations, and/or in the presence of facilitat¬ 
ing reagents. Furthermore, intrinsic disorder, in and of 
itself, does not appear to be a sufficient property for 
regulated proteolysis since IDPs can be successfully 
recovered from a variety of cell types, including mam¬ 
malian cells (Galea et al. 2009). Protection from deg¬ 
radation could be afforded by structuring upon binding 
to partners but to the extent that IDPs populate their 
disordered free states in vivo, such observations sug¬ 
gest that at least in contexts where proteolysis is tightly 
controlled, structure is not necessary for this purpose 
either. 


Biophysical Characterization of IDPs 

Understanding the properties, functions, and biology 
of IDPs requires characterization of their structural and 
physicochemical attributes. A host of biophysical 
methods, many of which have been routinely 
employed in the study of well-folded proteins or of 
unfolded or partially folded states of native proteins, 
have been applied to IDPs (Mittag and Forman-Kay 
2007; Eliezer 2009), and some general lessons have 
emerged. Most IDPs appear to contain at least some 
degree of local secondary structure (► Protein Second¬ 
ary Structure Prediction in 2012), often but not always 
transient and more commonly helical than strand-like. 
Such sites are often involved in intermolecular inter¬ 
actions and have sometimes been given names such as 
MoREs (molecular recognition elements) or MoRFs 
(molecular recognition features). In some cases, they 
can be considered to be folding nucleation sites for 



Structurally Disordered Proteins 


2487 


s 


intermolecular binding or folding-upon-binding 
(► Coupled Folding and Binding) reactions, in analogy 
to intramolecular folding nucleation sites. As such 
sites tend to be functionally important, identification 
of such sites can be useful in the characterization or 
even prediction of IDP function. Intriguingly, 
a number of IDPs have also been shown to possess 
a transient long-range or tertiary structure resulting 
from favorable interactions between parts of the pro¬ 
tein that are distant in sequence. The roles of such 
transient long-range structure in modulating IDP func¬ 
tion remain to be clearly elucidated. 

Examples of IDPs 

A number of specific IDPs have been subjected to 
extensive biophysical studies in light of their impor¬ 
tance in different areas of biology, and a few of these 
are briefly mentioned here as representatives of the 
class. The Parkinson’s disease linked protein alpha- 
synuclein is highly disordered in its free state but 
exhibits transient helical structure and long-range 
structure, assumes a helical but nonglobular structure 
when bound to lipid membranes, and forms classic 
amyloid fibrils (► Amyloid Formation) that are depos¬ 
ited in Lewy bodies in the brain (Eliezer 2008). The 
Alzheimer’s disease associated protein tau, also highly 
disordered in its free state, contains stretches of resid¬ 
ual helical and strand structure, binds to microtubules 
in a poorly understood conformation, and forms paired 
helical filament aggregates that are deposited in neu¬ 
rofibrillary tangles in the brain (Mandelkow et al. 
2007). The tumor suppressor protein p53, mutations 
in which are associated with nearly 50% of all human 
cancers, contains several disordered regions that are 
critical for the regulation of its function, including the 
N-terminal transactivation domain, long linker regions 
between its well-folded tetramerization and DNA- 
binding domains, and a regulatory C-terminal region 
(Joerger and Fersht 2010). The cyclin-dependent 
kinase inhibitor p27 (Kipl) contains a nascent helical 
structure between the kinase and cyclin interaction 
sites of its kinase inhibitory domain, as well as tran¬ 
sient discrete structure at the kinase interaction site 
(Galea et al. 2008). The C-terminal domain of the 
measles virus nucleoprotein contains a region of resid¬ 
ual helical structure that folds upon its interaction with 
the C-terminal X-domain of the phosphoprotein, as 


well as a second region that is involved in the interac¬ 
tion but appears to remain largely unstructured even in 
the complex (Longhi and Oglesbee 2010). 

Summary 

Intrinsically disordered proteins exist, are abundant in 
all organisms, function in numerous biologically and 
medically important pathways, and challenge the clas¬ 
sic protein structure/function paradigm. A crucial gap 
in our knowledge is to what extent IDPs and IDRs 
populate their disordered states in vivo, but it is clear 
that at least some IDPs maintain a significant degree of 
disorder even in the context of functional complexes. 
While distinct from unfolded, partially folded, or 
destabilized forms of intrinsically ordered/structured 
proteins, IDPs likewise can range from highly lacking 
in any structure, to containing elements of transient 
secondary and/or tertiary structure, and to containing 
well-formed secondary structure and molten globule¬ 
like organization. IDPs may possess some specific 
functional advantages over well-structured proteins 
but may in principle be able to carry out most of the 
functions observed for well-structured proteins, and 
one may consider that it is the existence of intrinsic 
structure, rather than its absence, that likely confers 
specific advantages or activities that are not achievable 
in its absence. The biological importance of IDPs will 
necessitate a reformulation of the classic protein struc¬ 
ture/function paradigm and in addition has profound 
implications for our ability to annotate (► Protein 
Structure Prediction and Structural Annotation of 
Proteomes), predict (► Statistical Analysis and Predic¬ 
tion of Protein-Protein Interactions and Binding Sites), 
and design (► Protein Design for Biosensors) protein 
function. 

Cross-References 

► Amyloid Formation 

► Coupled Folding and Binding 

► Differential Transition State Stabilization - 
Nonempirical Analysis 

► (Glyco)Protein Folding Disorders 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Protein Design for Biosensors 
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► Protein Dynamics and Allostery 

► Protein Secondary Structure Prediction in 2012 

► Protein Structure Prediction and Structural 
Annotation of Proteomes 

► RNA Polymerases and Transcription 

► Statistical Analysis and Prediction of Protein- 
Protein Interactions and Binding Sites 
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Synonyms 

Heteronuclear multidimensional nuclear magnetic res¬ 
onance spectroscopy; NMR; Macromolecular struc¬ 
ture; Protein structure 

Definition 

A brief overview of the nature of biological solution 
structure determination based upon NMR spectros¬ 
copy is provided. The overall approach is described 
in outline and the various data types that can be incor¬ 
porated into the calculations, including nuclear 
Overhauser effects, torsion angle estimation from 
J-couplings, H-bond prediction from H/D exchange 
rates, etc., are introduced. 

Introduction 

Over the past 30 years or so, developments in 
multidimensional NMR spectroscopy have provided 
the framework for the determination of the 3D struc¬ 
tures of small-to-medium-sized proteins and nucleic 
acids in solution (Cavanagh et al. 2007; Clore and 
Gronenborn 1998; Driscoll 2011; Giintert 2011; 
Kwan et al. 2011; Rule and Hitchens 2006; Vuister 
et al. 2011). This activity has contributed a substantial 
fraction of the coordinate sets deposited in interna¬ 
tional databases of macromolecular structure (e.g., 
the Protein Data Bank curated at RCSB-Rutgers and 
the European Bioinformatics Institute). The techniques 
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employed in NMR-based structure determination are 
not as mature as those used in macromolecular crys¬ 
tallography and, to some extent, are still undergoing 
development and refinement (Guerry and Herrmann 
2011; Grishaev et al. 2005; Kontaxis et al. 2005; 
Mueller et al. 2000; Rieping et al. 2005; Rieping 
et al. 2008; Shen et al. 2008). Not least is the fact that 
NMR data are exquisitely sensitive to the overall and 
internal molecular dynamics of the target molecular 
entity, and proper representation of the dynamical 
aspects of the structure is a matter of significant 
research in the field. Neverfheless, a series of core 
concepts associated with the NMR paradigm for struc¬ 
ture determination can clearly be recognized. The fol¬ 
lowing provides a brief description of these main 
principles, with particular focus upon the example of 
the structure determination of a single domain globular 
protein, with limited internal motion (other types of 
molecules are tractable by NMR but require a more 
nuanced approach to restraint identification and appli¬ 
cation, for which there is too limited space here to 
describe in detail). 


Outline of the Process 

The determination of the 3D solution structure of 

a protein requires the following steps: 

1. NMR resonance assignment 

2. Identification of structurally relevant NMR param¬ 
eters such as interatomic distances from NOEs or 
PREs, torsion angles from scalar coupling con¬ 
stants, RDCs, NH solvent exchange rates, etc. 

3. Encoding of the structurally relevant experimental 
parameters in terms of restraint energy terms as 
elements of a pseudo-realistic molecular dynamics 
(MD) potential energy function 

4. Optimization of the MD target function in such 
a way as to select a structural model consistent 
with both well-known stereochemical rules and 
the experimental restraint parameters 

5. Repetition of step (4) using different starting con¬ 
figurations to test convergence of the calculations 
by consideration of the fit to restraints and degree of 
coordinate precision 

6. Iteration of steps (1) through (5) until errors in 
resonance assignment or experimental restraint 
identification have been minimized 


7. If appropriate, deposition of chemical shifts, 
restraint lists, and atomic coordinates into the 
appropriate public databases (BioMagResBank, 
RSCB, PDBe) 

With reference to step (1), the principles of 
resonance assignment are described in greater detail 
elsewhere (e.g., see TRIPLE RESONANCE NMR). 
Let it suffice to say here that structure determination 
will proceed with greater efficiency the more com¬ 
plete the status of the identification of the chemical 
shifts represented in the spectrum. This objective 
includes backbone and side chain chemical shifts of 
all observable exchangeable and nonexchangeable 
protons. 

Distance Restraints from NOEs 

In the most general case, structure determination by 
NMR is dominated by the characterization of the 
intemuclear distances between proton nuclei, through 
detection of the corresponding dipole-dipole interac¬ 
tions that constitute the nuclear Overhauser effect 
(NOE) (Cavanagh et al. 2007; Rule and Hitchens 
2006; Vuister et al. 201 1). Eor a given pair of protons 
i and /, the magnitude of the NOE depends upon the 
cross-relaxation rate between the two nuclei that in 
turn is a function of the characteristic time for the 
reorientation of the i-j vector (tumbling time Xy) and 
the (dynamically averaged) intemuclear separation 
distance qy. Specifically, in the two-spin approxima¬ 
tion of cross relaxation (that strictly only applies to 
isolated spin pairs), and assuming behavior in the slow 
motion limit (relevant for ordered globular proteins) 
and a short NOESY mixing period 

NOE,y _ 

NOE,,y ,-l 

where “ref” refers to a reference pair of protons with 
known (fixed) separation. The inverse b^-power rela¬ 
tionship between NOE magnitudes and interproton 
distances means that it is usually the case that NOE 
effects are only detected for neighboring protons in the 
protein structure, with an upper limit for the observa¬ 
tion of an NOE cross peak corresponding to 
a separation no greater than 5-6 A. Moreover, because 
the isolated spin pair approximation cannot strictly 
apply in the presence of several neighbors (and next- 
nearest neighbors) and because of the heterogeneity of 
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dynamic averaging that occurs as a result of intramo¬ 
lecular fluctuations, the quantitative relationship 
between NOE,, and r,, is unreliable except in an 
approximate sense. Thus estimation of interproton 
distances based upon NOE cross peak magnitudes is 
usually conducted on a conservative basis, and very 
often only in terms of an upper distance bound, 
e.g., a weak NOE cross peak is taken to mean 
a separation less than 6 A, a medium cross peak 
4.5 A, and a strong cross peak 2.7 A. Eor a typical 
small (ca. 150 residue) protein, as many as 
2,500-4,000 interproton NOE cross-peak intensities 
might be collected from 2D, 3D, and perhaps even 
4D NOESY spectra. Despite the approximate nature 
of the distance estimates, taken together a large set of 
interconnected restraints can circumvent the lack of 
precision and provide a set of 3D structural models 
with an apparent resolution that reasonably represents 
the uncertainties in the NOE data. 

Torsion Angle Restraints 

Within the nuclear spins systems that constitute the 
building blocks of protein NMR spectra, NMR 
active nuclei are described as exhibiting scalar, or 
J-coupling as a result of the pairwise reciprocal influ¬ 
ence on the precise resonance frequency of one 
nucleus, via the intervening bonding electrons, by 
the instantaneous spin state of another (Cavanagh 
et al. 2007; Rule and Hitchens 2006; Vuister et al. 
2011). The presence of scalar coupling results in 
multiplet splitting of NMR resonances, revealed 
(line widths permitting) as hne structure in the ID 
NMR spectrum, and provides the basis for magneti¬ 
zation transfer exploited in multidimensional corre¬ 
lated NMR spectroscopy. In general, the magnitude 
of J-couplings declines with increasing separation 
of the nuclei involved, assessed as the number of 
intervening chemical bonds n: < */. 

From a structure determination perspective, the 
most useful J-couplings are of the V variety since 
the magnitude is related the corresponding dihedral 
torsion angle. For example, the value of the VnNHa 
coupling reports on the value of the polypeptide phi- 
angle for the C',_i-N,-C,“-C', torsion. The quantita¬ 
tive relationship between the .7-value and the torsion 
angle has been evaluated by reference to measure¬ 
ments made on proteins with known 3D structure and 


follows a general sinusoidal form, as predicted by 
Karplus: 

^J{6) = A.cos^(6) -t- B.cos{9) + C 

Thus, consensus sets of Karplus coefficients (A, B, 
and C) for different V couplings in proteins are 
available in the literature. NMR measurements of cou¬ 
pling constants usually require specific pulse 
sequences and sometimes line shape fitting. The nature 
of the relationship between J and d means that dispa¬ 
rate values of the angle can be equally consistent with 
the measured coupling constant. Thus, conversion of 
measured v values into structural restraints in the MD 
optimization scheme itemized above requires this 
potential degeneracy to be taken into account. 

H/D Exchange and H-Bonding 

Within globular protein structures, a major factor in 
determining the local order is the formation of 
hydrogen bonds. H-bonds constitute an important con¬ 
stituent of the description of nonlocal parts of the 
polypeptide chain, e.g., with the parallel or antiparallel 
pairing of P-strands within p-sheet structures. It has 
long been established that the presence of H-bonds is 
associated with a slowing of the rate of exchange of 
H”^ protons with bulk solvent. For example, when 
a protein is freeze-dried from a normal H 2 O solution 
and redissolved in D 2 O, the long-persisting H”^ reso¬ 
nances can safely be identified as taking part in strong 
H-bonds within the protein structure. Strictly the rate 
of exchange is a function of the sample pH and 
temperature, and so the speed with which such mea¬ 
surements must be obtained in order to obtain accurate 
demarcation between slow- and fast-exchanging H'^ 
protons can vary. However, experience has shown 
that it is possible in this way to reliably identify H- 
bond donor groups (Cavanagh et al. 2007; Rule and 
Hitchens 2006; Vuister et al. 2011). Unfortunately, 
identification of the corresponding H-bond acceptors 
is much less straightforward. In rare cases, direct deter¬ 
mination of H-bonding pairs can be performed by the 
detection of three-bond J-couplings (Vnc ) between 
a *^N atom in the donor group and the 13C' carbonyl 
carbon acceptor in double '^C,*^N-isotope-labeled 
samples. However, these couplings are small (—0.2 to 
—0.9 Hz) and therefore difficult to measure except 
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under favorable conditions when the resonance line 
widths are narrow (such as for small proteins or in 
perdeuterated samples). In the absence of such direct 
information, inference of the H-bond acceptor is 
required on the basis of the contextual evidence for 
the local secondary structure, e.g., from the pattern of 
backbone interproton NOEs, VeNHa coupling con¬ 
stants, and chemical shifts. Identified 

H-honds are typically incorporated into protein struc¬ 
ture determination algorithms by way of distance 
restraints on the separation of donor and the puta¬ 
tive acceptor atoms, often with accompanying distant 
restraints to bias the H-bond N-H^-acceptor geometry 
toward linearity. 

Residual Dipolar Couplings 

Like NOE magnitudes, V measurements and 
exchange rates are reporters of the local structure 
of the nuclei involved. The compilation of such “local” 
structure measures to obtain a larger 3D structural 
model is notoriously subject to the potential for cumu¬ 
lative error and an outcome in which the overall molec¬ 
ular shape is distorted from reality. For this reason, 
there has been much interest in developing approaches 
that permit the spectroscopist to obtain more global 
estimates of molecular shape from NMR observables. 
It has long been known that the NMR spectrum of an 
anisotropic solute dissolved in a liquid crystal matrix 
yields a situation in which dipolar couplings between 
spin-1/2 nuclei within the solute incompletely average 
to zero (as they would in isotropic solution). This 
happens because, over time, the rotational diffusion 
of the solute is biased by collisions with the liquid 
crystal matrix, leading to an averaged weak but never¬ 
theless preferred alignment in the sample tube on 
account of the overall molecular shape (and possibly 
electrostatic properties). The weak alignment is 
manifested as “residual dipolar couplings” (RDCs) 
that can be measured by comparison of the solute 
J-couplings obtained in the isotropic and aligned con¬ 
ditions (Cavanagh et al. 2007; Rule and Hitchens 2006; 
Vuister et al. 2011). With the advent of dilute liquid 
crystal phases that are compatible with biological mac¬ 
romolecules in aqueous solution, RDC measurements 
have become an accessible means to obtain restraints 
on the orientation of bond vectors within proteins 
relative to the tensor frame that describes the induced 


molecular alignment, and thereby to provide a basis for 
refinement of the overall molecular shape. 

On a practical note, a variety of liquid crystal media 
are available that can prove compatible with proteins, 
including binary mixtures of phospholipids that form 
elongated micelles, or “bicelles”; alkyl-poly(ethylene 
glycol)/hexanol Lo, phases; high concentrations of 
filamentous bacteriophage particles; anisotropically 
compressed (or stretched) polyacrylamide gels; and pur¬ 
ple membranes. In general, it is necessary perform trials 
with different alignment media in order to find one 
(or more) that is compatible with the target macromol¬ 
ecule and does not interact too strongly or lead to sample 
precipitation. Under these conditions, to obtain the 
RDCs the spectroscopist can use NMR pulse sequences 
(or variants thereof) that are used to extract scalar cou¬ 
plings. Thus, in general, the measured coupling /exp in 
the isotropic sample (iso) will be identical to the tme 
valuebut in the aligned (aniso) sample will be the sum 
of scalar and residual dipolar (D) terms: 

jiso _ riso 

‘^exp ~ 

jamso^jiso^D 

Thus, the value D can be obtained for any given pair 
of nuclei that yield a scalar coupling by measuring the 
difference between the measured couplings in the iso¬ 
tropic and aligned states. The spectroscopist is 
required to identify conditions where the degree of 
alignment of the solute is very small (^10~^) so that 
the residual dipolar couplings do not represent the 
dominant interaction between nuclei (and so retain 
hrst-order dipolar splittings) but of a sufficient 
magnitude to be measurable (e.g., with a maximum 
IDI ^ 30 Hz). On their own, the RDC values inform the 
relative orientation of the bond vectors of the 
J-coupled pairs of atoms, and contain no translational 
information. Their use is more commonly found in the 
refinement or validation of protein structures. For this 
purpose, one requires to obtain information about the 
relationship of the pattern of RDC values to the partial 
alignment of the macromolecules in the liquid crystal 
or, more precisely the diagonal elements of the Saupe 
matrix that describes the alignment tensor. The deter¬ 
mination of the alignment tensor from a set of 
experimental RDCs is a nontrivial problem, but 
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procedures have been worked out wherein the align¬ 
ment tensor principal components and Euler angles can 
be estimated with a numerical optimization procedure 
(in the case where the 3D structure is already known), 
or co-refined along with the fit to other NMR-derived 
structural restraints (in the case where the structure is 
not known). In the latter case, initial estimates of the 
alignment tensor principal values can be obtained by 
the histogram method. The degree to which the 3D 
structural model is consistent with the measured ori¬ 
entational restraints is typically described by the qual¬ 
ity factor (Q-factor): 


^J^meas _ J^pred'^ 

The Q-factor describes the root mean square differ¬ 
ence between the experimentally measured RDCs and 
the corresponding predicted values (based on the 3D 
structure and alignment tensor parameters) divided by 
the root mean square of the measured values. Q thus 
varies between 0 (perfect fit) and 1 (no fit). Experience 
shows that good quality protein crystal structures typ¬ 
ically yield Q ^ 0.2-0.25 when tested against 
experimental RDC measurements in solution, whereas 
NMR structures determined with NOE data alone yield 
significantly higher Q-factors {Q > 0.4). Thus, inclu¬ 
sion of orientational restraints from RDC measure¬ 
ments is generally expected to improve the quality of 
an NMR solution structure. 

It is worth noting that for a given pair of dipolar- 
coupled nuclei, the value of the RDC will be 
diminished by the degree of motional disorder, so in 
a strict sense the use of RDC values for structure 
determination or validation should take this aspect 
into account (e.g., by excluding RDC values for 
regions that are undergoing significant internal 
motion). At the same time, the profile of RDC values 
can be used to quantity the degree to which different 
globular domains within a multidomain protein are 
coupled together. Finally, it is important to realize 
that because there is a quadratic dependence of the 
RDC value upon the internuclear vector coordinates, 
there are intrinsic directional degeneracies in the rela¬ 
tionship between the two quantities. This degeneracy 
can be alleviated if RDC values can be obtained under 
two (or more) different modes of alignment such that 


the principal components of the alignment tensors are 
distinctly different. Thus, in general the information 
quotient of RDCs increases substantially if more than 
one dilute crystal medium can be made tractable with 
the target macromolecule. 

Chemical Shifts 

The chemical shifts of NMR resonances in proteins are 
extremely sensitive to the local conformation and the 
nature of the immediate “chemical” environment. 
After all, the increased dispersion of NMR resonances 
in the spectrum of a folded globular protein (as 
opposed to an unstructured polypeptide) occurs as the 
result of contributions to the chemical shift from 
the local backbone and side chain conformation, 
nonbonded contacts with neighboring residues, ring 
currents, H-bonds, electric fields, and interaction with 
the solvent (Robustelli et al. 2010). As a result, the 
chemical shift of a given nucleus in the structure (5obs 
differs from that which would occur in a random coil 
(5rc by an amount A(5, the secondary chemical shift 

^obs — 

Over many years, spectroscopists have settled on 
consensus values of (5rc and pursued methods to use 
the derived values of Ai) to infer aspects of 3D 
structure. One common exploitation of chemical shift 
information is with the program TALOS which uses 
I3ca, 13 15 ijja^ ij^N chemical shifts to 

search a database of chemical shift information 
obtained for proteins with high-resolution 3D struc¬ 
tures and to predict the backbone ip and i// angles. 
The most recent TALOS release, known as TALOS+ 
(Shen et al. 2009), includes a predictor of local 
dynamic disorder, also based on the chemical shift 
information, that indicates when the use of the 
predicted iplxf/ angles to restrain the backbone confor¬ 
mation is warranted. The state of the art in the use of 
chemical shifts is for ah initio calculations of the 
overall structure based upon the Monte Carlo molecu¬ 
lar fragment assembly, for instance, with programs 
such as CS-Rosetta (Shen et al. 2008). The approach 
relies upon building up the target structure using 
overlapping peptide fragments (derived from the 
PDB) that have either measured or predicted A5 chem¬ 
ical shift patterns that match those observed in the 
experimental data. 
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Structure Determination by NMR: Overview, Fig. 1 Deter¬ 
mination of protein 3D structure using NMR data proceeds by 
application of high-temperature restrained molecular dynamics 
using a simulated annealing schedule (MD-SA) to drive random¬ 
ized polypeptide coordinates (left) toward a structural model 
(right) that is consistent with standard stereochemical rules and 


(within the eiTor of their measurement) the input experimental 
restraints. Typically, the calculation is performed for several 
different randomized starting configurations, yielding different 
output models that - when best lit by atomic superposition - 
yield a “bundle” of output conformers each member of which is 
in agreement to a similar degree with the input experimental data 


Scattering Profiles 

Over the last few years, small angle X-ray (and to 
a lesser extent neutron) scattering measurements of 
molecular shape have been adopted as a means to 
help determine the solution structure of proteins and 
their complexes. In the case of small-angle X-ray scat¬ 
tering (SAXS), the scattering profile is a function of the 
difference in electron density between the solute and 
the solvent. The scattering curve maps the Fourier 
transform of the distribution of interatomic distances 
within the macromolecular target, which is related to 
the molecular shape and internal structure. As the 
density of the solvent layer surrounding the protein is 
slightly higher than that of the bulk solvent, the overall 
scattering includes a contribution from this layer and 
must be accounted for during the data analysis. In the 
context of its utility for NMR investigations, SAXS has 
the advantage that it can be applied in a relatively 
efficient manner; the SAXS measurements can be 
matched to the conditions used for the NMR experi¬ 
ments, and can be applied to the same isotope-labeled 
preparations. The program XPLOR-NIH (Schwieters 
et al. 2003), in particular, has a facility for using the 
SAXS scattering curve as an additional restraint term in 
its potential energy function for the structure 


determination procedure, alongside more traditional 
NMR restraints. In general, the SAXS data provides 
relatively low-resolution structural information (of the 
order of ^ 10 A) and so is particularly useful for systems 
in which there is scope for significant shape anisotropy 
or long-range flexibility such as multidomain proteins 
or multichain complexes (Grishaev et al. 2005). 

Combining Restraints 

Once the various experimental restraint types have 
been harvested together, they are typically incorpo¬ 
rated into molecular mechanics-type calculations to 
drive models of the polypeptide chain toward solutions 
that are both consistent with established stereochemi¬ 
cal rules of protein folding and satisfy the experimental 
restraints within the error margins of the relevant mea¬ 
surements (Fig. 1). 

Unlike the case with very high quality protein crys¬ 
tals that diffract to better than 1 A resolution, in NMR 
applications there is always an insufficient number of 
experimental data to determine ab initio the spatial 
relationship between all atoms in the chain. Thus 
NMR-based structure determination methods adopt, 
in simplified form, the known bond length, bond 
angle, and atomic radii parameters for amino acid 
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structure and that constitute parameter sets in molecu¬ 
lar dynamics simulation packages - in what might be 
called a “geometric parameter set” -and allow the 
chain to move under a Newtonian force held that 
includes terms that capture the deviation of the struc¬ 
ture from the experimental restraints. Thus the instan¬ 
taneous potential energy of the system within the 
calculation is described by the sum of geometric and 
restraint terms: 

^pot — ^bonds ^angles ^vdW “t" ^elec ^NOE 
“h ^H-bond f^RDC ^chemical shift 

^scatter 

Within this context, structure determination equates 
to optimization of itpot by varying the coordinate posi¬ 
tions of a model of the target protein chain(s). This is 
typically achieved by adopting artihcially high- 
temperature molecular dynamics during which 
self-interaction of the chain is controlled by manipu¬ 
lating the geometric and restraint term force constants 
and slowly allowing the system to “cool” in the 
computer while simultaneously ramping up the force 
constants. This approach (which is also routinely 
applied in X-ray crystallographic structure determina¬ 
tion) is known as molecular dynamical simulated 
annealing (MD-SA) and is adopted (with different 
peculiarities of implementation) in each of the well- 
known software packages used for NMR-based 
structure determination: CNS (Briinger et al. 1998), 
XPLOR-NIH (Schwieters et al. 2003), and CYANA 
(Giintert 2011). A “good outcome” of the application 
of MD-SA is that one obtains a set of models derived 
from different randomized starting conformations, 
each member of which has low energies in respect of 
both geometric and restraint terms and, when super¬ 
posed to each other in Cartesian space, yields a low 
average pairwise root mean square difference in coor¬ 
dinate position. It is commonly observed that for 
regions of the protein that correspond to a high density 
of experimental restraints the structure effectively con¬ 
verges to a common solution and that part of the 
structure can be regarded as well defined, yielding 
good confidence in the relative atomic positions. On 
the other hand for regions of the protein chain where 
there is a low density of experimental restraints, e.g., in 
exposed loops or at the chain terminii, the emergent 
models often yield a variety of nonsuperimposing solu¬ 
tions indicating that the local structure is not well 
defined by the data. Commonly such lack of definition 


will be consistent with intrinsic flexibility within those 
sections of the polypeptide chain, although verification 
of this interpretation should always be sought by inde¬ 
pendent means, e.g., by characterization of the chain 
dynamics through nuclear relaxation analysis. 

It is worthwhile to note that macromolecular struc¬ 
ture determination by NMR is still a developing area. 
The era of “structure genomics” and other develop¬ 
ments in the field is witnessing a significant effort to 
streamline what has traditionally been a rather “hands- 
on” business of spectrum analysis and iteration of 
(particularly) NOE cross-peak assignment (see step 
(6) in the scheme above). Of note, there are emerging 
paradigms (e.g., in the ARIA/CNS and CYANA pack¬ 
ages) that reduce the scope for subjectivity in NOESY 
spectrum analysis, can take into account second-order 
NOE effects (spin-diffusion), and can provide more 
objective measures of the fit of the structure to the 
experimental data (akin to the R-factor well known 
in the field of X-ray crystallography) (Guerry and 
Herrmann 2011; Giintert 2009; Giintert 2011; Rieping 
et al. 2007). 

Summary 

Protein structure determination by NMR spectroscopy 
has evolved over time to incorporate many different 
types of information that are available from NMR mea¬ 
surements. The field has developed from an original 
focus on interproton distance estimates from 
NOE spectroscopy to incorporate additional local (e.g., 
J, NH exchange, chemical shift) and global measures 
(RDCs, SAXS, paramagnet relaxation enhancements, 
rotational diffusion anisotropy, etc.) of the target struc¬ 
ture. It is worth mentioning here that unlike with X-ray 
crystallography, the parameters derived from NMR 
spectroscopy can address in great detail nonglobular 
states of protein chains, such as interdomain linkers, 
molten globules, and intrinsically disordered regions, 
as well as (in favorable cases) membrane-attached and 
membrane-embedded proteins, making the general 
approach extremely versatile. 

Cross-References 

► CNS (Crystallography and NMR System) 

► CYANA 



Structured Illumination Microscopy (SIM) 
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► Electron Paramagnetic Resonance Spectroscopy 

► H/D Exchange 

► Multidimensional NMR Spectroscopy 

► NMR 

► Nuclear Overhauser Effect 

► Nucleic Acid NMR - Introduction 

► Protein NMR - Introduction 

► Protein Segmental Labeling 

► Residual Dipolar Coupling 

► Triple Resonance NMR 

► TROSY 

► X-Ray Scattering of Lipid Membranes 
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Definition 

Structured illumination microscopy (SIM) refers to 
microscope techniques employing nonuniform illumi¬ 
nation of the sample. 

Introduction 

The conventional epi-fluorescent wide-held micro¬ 
scope features a uniform illumination of an extended 
sample region. A problem arises with this setup when 
specimens are used whose thickness in the direction of 
the optical axis (i.e., the z-direction) is greater than the 
objective’s depth of held. In this case, not only in-focus 
huorophores but also those lying above or below the 
plane of focus are excited. Since the objective is not 
able to distinguish from which part of the sample the 
emission light originates, light from out-of-focus 
huorophores is also detected. The result is poor quality 
in the hnal image, as out-of-focus structures appear 
blurred; furthermore, their emission light contributes 
to the background and leads to a reduction in image 
contrast (Fig. la). 

Removing out-of-focus light yields an optically 
sectioned image: a thin slice of a thick sample that 
only contains in-focus information (Fig. lb). Taking 
a stack of sectioned images allows for a three- 
dimensional reconstruction of the specimen. 

Sectioning SIM (sSIM) is a method to obtain opti¬ 
cally sectioned data (Neil et al. 1997), similar to that 
obtained from the widely used confocal microscope 
(Minsky 1988). A more advanced variant of SIM, 
subsequently referred to as high-resolution SIM 
(hrSIM) (Fleintzmann and Cremer 1999; Gustafsson 
2000), has the potential of providing optical sectioning 
combined with a significantly higher x-y resolution 
(Fig. 2), while at the same time being more light 
efficient and potentially faster than confocal 
microscopy. 

Setup and Data Acquisition 

Structured illumination microscopes are characterized 
by their use of structured light to illuminate the sample. 
Figure 3 shows the typical setup used for SIM. 
The principal component is the illumination mask, 
inserted at the field diaphragm position in the 


illumination path of a conventional fluorescence 
microscope. The illumination mask, which is typically 
a one-dimensional grating, is projected onto the 
sample and can be shifted along the lateral axis by 
well-defined displacements. Note that in opposition 
to spinning disc systems (Egger and Petran 1967) and 
programmable array microscopes (Flanley et al. 1999; 
Fleintzmann et al. 2001), there is no mask in the 
detection beam path in this setup. 

A typical data set obtained from a structured illu¬ 
mination apparatus comprises a z-stack of image sets, 
each corresponding to a different focus position in the 
sample. Each image set contains images acquired at 
different positions of the illumination pattern in sam¬ 
ple space. Variants of SIM like sSIM and hrSIM differ 
with respect to the setup (Eig. 3) as well as with respect 
to the required image processing (see below). 

Optical Sectioning - sSIM 

One benefit of sSIM is its property of removing out-of- 
focus light. During this process an algorithm is applied 
to the set of images taken at a specific focus position to 
yield an optical section of the sample. One set consists 
typically of three images, each taken at a different 
position of the illumination pattern in sample space. 
As mentioned before, it is common to use a one¬ 
dimensional grating as the illumination mask. To 
achieve optical sectioning, a relatively coarse grating 
(approx. 900 nm periodicity in sample space for 
a 63 X, 1.4 NA oil immersion objective) is used, illu¬ 
minated by an incoherent light source. After the first 
image is acquired, the grating is displaced linearly by 
one third of the grating period to capture the second 
image and another one third to capture the third image 
(Fig. 4). From these three images /i J 2 ,h the sectioned 
image Issim can for example be calculated using (Neil 
et al. 1997) 

Lisim = yj (7i — hf' + {h — h)^ + {h — h)^ 

It is possible to separate, based on the detected 
intensity variation over the three images, the part of 
the sample from which the light originated. Due to the 
use of incoherent light to illuminate the grating, the 
contrast of the illumination pattern in sample space is 
lower for out-of-focus planes. Consequently, the 
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structured Illumination 
Microscopy (SIM), 

Fig. 1 Two images of adult 
rat cardiomyocytes stained 
with monoclonal mouse 
antibodies against the thin 
epitope T12 and secondary 
Cy2-conjugated anti-mouse 
immunoglobulin antibodies. 
(a) Wide-field fluorescence 
image, (b) Sectioning SIM 
reconstruction 



Structured Illumination 
Microscopy (SIM), 

Fig. 2 Images of COS-7 cells 
labelled with MitoTracker® 
(samples generously provided 
by B. Qualmann and M.M. 
Kessels, Jena University 
Hospital/Friedrich-Schiller- 
University Jena, Germany). 

(a) Wide-field fluorescence 
image (b) High-resolution 
SIM reconstruction 



degree of modulation over the three images is strongest 
for the in-focus slice. Out-of-focus structures experi¬ 
ence a reduced degree of modulation, or, if they lie 
distinctly outside of the focal plane, no modulation 
at all. Accordingly, the intensity of light emitted by 
in-focus structures (fluorophore II in Fig. 5) is not 
constant over the three images, whereas that of light 
emitted by out-of-focus structures (fluorophore I in 
Fig. 5) is. The reconstruction algorithm compares the 
intensity in each pixel over the three images, retaining 
the modulated component and suppressing the 
unmodulated part. 

Modulation due to the structured illumination is 
further reduced if the images of neighboring 
fluorophores overlap. Since neighboring fluorophores 
may experience the intensity modulation with a differ¬ 
ent phase, the overlap of their images can result in 
a mutual demodulation. The degree of suppression 


thus depends on the three-dimensional distribution of 
fluorophores in the sample. Due to blurring, the mutual 
overlap and thus the demodulation effect is far greater 
for out-of-focus regions than for the focal slice. 

Hence, both incoherent illumination and the 
detected out-of-focus blurring contribute to the sec¬ 
tioning effect of sSIM. 

However, the need for a high-contrast illumination 
pattern in the focal slice calls, under incoherent 
illumination, for a relatively coarse grating to be 
used, which is too coarse to achieve a significant 
in-plane resolution improvement as featured by hrSIM. 

Resolution Improvement - hrSIM 

By utilizing a smaller grating in sample space and 
more complex image processing, hrSIM provides 
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Structured Illumination 
Microscopy (SIM), 

Fig. 3 Typical setup for SIM. 
The illumination mask, which 
is typically a one-dimensional 
grating, is inserted at the field 
diaphragm position in the 
illumination path of 
a conventional wide-field 
fluorescence microscope and 
projected onto the sample 
using an incoherent and 
a coherent light source for 
sSIM and hrSIM, respectively. 
Lenses LI, L2 and the 
objective are in telecentric 
arrangement. The tube lens L3 
relays the light onto the 
camera 


camera 


L3C "T] 


incoherent illumination (sSIM) 
coherent illumination (hrSIM) 
emission 








Q. 








objective 


grating 


L1 

n 


U 


■ illumination path 


sample 


I 


light source 



Structured Illumination Microscopy (SIM), Fig. 4 Three 
raw images necessary for the sSIM reconstruction algorithm. 
Each image is taken at a different position of the grating in 
sample space. For illustration purposes the same grid line is 


marked with a red bar in each image. Specimen: Adult rat 
cardiomyocytes, stained with monoclonal mouse antibodies 
against the titin epitope T12 and secondary Cy2-conjugated 
anti-mouse immunoglobulin antibodies 


optical sectioning while also yielding significantly 
improved resolution in the x-y plane. 

This resolution improvement is associated with the 
moire effect, which occurs when two high-frequency 
patterns are multiplied. Since the illumination of the 
sample corresponds mathematically to a multiplication 
of the illumination pattern (Fig. 6a) with fine details of 
the sample structure (Fig. 6b), hrSIM entails a moire 


effect. The multiple moire images (one of them is 
shown in Fig. 6c), obtained during the hrSIM data 
acquisition process, in combination with knowledge 
of the illumination pattern allow one to infer details 
of the sample structure that are not detected in 
a conventional microscope. 

For a more complete explanation, a description in 
Fourier space must be considered (see next section). 
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Structured Illumination Microscopy (SIM), Fig. 5 Simula¬ 
tion of a x-z cross section through sample space showing the 
three raw sSIM images taken at different grating positions and 
the effect on the light intensity emitted from fluorophores / and 


II. Due to the decreasing contrast of the illumination pattern for 
out-of-focus planes, the displacement of the grating leads to an 
intensity modulation for the in-focus fluorophore If, but not the 
out-of-focus fluorophore I 


Structured Illumination 
Microscopy (SIM), 

Fig. 6 The moire' effect 

(a) A high-frequency 
illumination pattern. If 
multiplied with a sample, as in 

(b) , containing fine details that 
cannot be resolved by the 
optical system, a moire pattern 
forms (c). The illumination 
pattern is known and can be 
removed computationally; 
through this procedure, which 
implies the acquisition of 
multiple moire images (c), 
each with a different position 
of the illumination pattern in 
sample space, otherwise 
undetectable structures can be 
uncovered 



In order to obtain a significant in-plane resolution 
improvement, an illumination pattern with a suffi¬ 
ciently fine structure (approx. 200 nm grating period¬ 
icity in sample space for a 63 x, 1.4 NA oil immersion 


objective) has to be used. To ensure a homogeneous 
effect in all in-plane directions, one usually acquires, at 
each focus position, three image sets, each with 
a different rotation of the grating in the x-y plane. 
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For highest performance in three dimensions one 
image set consists usually of five images with different 
positions of the illumination structure in sample space. 
Since it is of paramount importance that the illumina¬ 
tion pattern has high contrast in sample space, 
a coherent light source must be used. Coherent light 
allows for 100% illumination contrast at the finest 
possible generated illumination structure. 

In addition to the sectioning effect, a lateral resolu¬ 
tion improvement up to a factor of two is possible with 
this approach compared to conventional wide-field 
microscopy. A twofold enhancement of axial resolu¬ 
tion can also be achieved. 


Everything You Always Wanted to Know 
About Image Reconstruction, but Were 
Afraid to Ask 

A mathematical analysis allows the image reconstruc¬ 
tion process to be understood in more detail. The 
emitted intensity Ig„,{r), as a function of spatial coor¬ 
dinate r, is considered to be proportional to the local 
fluorophore concentration S{r) and to the local illumi¬ 
nation intensity ///;„ (r): 

/™(r) =5(r)/,7,„(r) (1) 

This multiplication in real space corresponds to 
a convolution in Fourier space: 

iem{k) = S{k) ® iillu{k) ( 2 ) 

with ^ denoting the Fourier transform of the respec¬ 
tive function, k the spatial frequency, and ® the 
convolution operator. Figure 7 illustrates this mathe¬ 
matical operation. Assuming a standing wave (i.e., 
sin^) illumination pattern Iiiiu{r) present in the sample, 
its Fourier transform Iiiiu{k) corresponds to three delta 
peaks. The convolution thus yields a sum of three 
Fourier-transformed objects (Fig. 7c). Note that two 
of the three so-called object components are displaced 
from the center position by the illumination patterns 
k-vector km,,. Only the conventional wide-field com¬ 
ponent remains unshifted. The position and shape of 


the illumination mask in real space determines the 
individual position, strength, and complex phase of 
the object components in Fourier space. 

The imaging process, a convolution of I emir) with 
the point spread function (PSF) h{r) in real space, can 
be described as a multiplication of the Fourier trans¬ 
form of the emitted intensity distribution /p„, {k) with 
the optical transfer function (OTF) h{k) in Fourier 
space: 

Img{k) = Iem{k) h{k), 

with Img{k) denoting the Fourier transform of the 
image. The OTF h{k) is characteristic for the optical 
system in use and defines the maximum frequency that 
can be transmitted. Figure 7d shows that only low 
object frequencies of the unshifted wide-field compo¬ 
nent lie within the detection passband. In consequence, 
high spatial frequencies, which correspond to fine 
details of the sample, are lost during the conventional 
wide-field imaging process (Fig. 7b). In hrSIM the 
shifted object components compensate for some of 
this loss. Due to their displacement in Fourier space, 
higher object frequencies are shifted into the transmit- 
table region of the detection passband and thus 
contribute to the image (Fig. 7d). 

Taking a set of multiple images allows for compu¬ 
tational separation of the individual components and 
shifting of them back to their correct position in 
Fourier space (Fig. 7e). For optimal performance, 
overlapping information, stemming from different 
separated components, is averaged with frequency- 
dependent weights (Fig. 7f). Finally, the reconstructed 
image can be obtained by an inverse Fourier transform. 

The linear sample response leads to the Abbe limits 
for detection and illumination being combined, yield¬ 
ing a new, extended Abbe limit. Since the Abbe limit 
defines the smallest sample structure that can be 
resolved by the optical system, this extension corre¬ 
sponds to a resolution enhancement. 

sSIM Versus hrSIM 

Due to the use of a relatively coarse grating, illumi¬ 
nated by an incoherent light source, sSIM is a method 
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Structured Illumination Microscopy (SIM), Fig. 7 Mathe¬ 
matical basis of image reconstruction in hrSIM (a) Uniform 
illumination in a conventional wide-field microscope leads to 
a delta peak (red bar) centered in Fourier space denoting the 
position of the Fourier-transformed object in the emitted light 
distribution, (b) The process of optical imaging corresponds to 
a multiplication of the emitted light distribution with the OTF 
(The shape of the OTF is assumed to be rectangular for illustra¬ 
tion purposes) in Fourier-space. The optical system acts therein 
as a low-pass filter. Fligh frequencies, i.e., |/:| > /:n,ax, are lost 
during the conventional wide-field imaging process, (c) The 
Fourier transform of the emitted light distribution in an hrSIM 


Detected 

b Intensity 

(OTF corrected) 



Detected 

d Intensity 

(OTF corrected) 



Reconstructed 
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device yields, in addition to the conventional component (blue), 
two side components (green) located at ± Uv/u- The frequency 
information in each of the side components is the same as in the 
conventional component but shifted, (d) Due to their displace¬ 
ment the low-pass filter acts asymmetrically with respect to 
the shifted side components and therefore lets pass object 
frequencies formerly rejected, (e) After computational separa¬ 
tion of the individual components both side components 
can be shifted to the correct position in Fourier space, 
(f) The combination of all components, using weighted averag¬ 
ing, leads to an improved passband and hence to an improved 
resolution 
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to obtain optically sectioned data by relatively simple 
experimental and computational means. Commercial 
devices, along with the requisite computer software, 
can he easily embedded in an off-the-shelf wide-held 
microscope and thus make it a relatively inexpensive 
and robust sectioning method compared to confocal 
microscopy. 

The heneht of hrSIM over sSlM is an additional 
resolution improvement in all three dimensions. In 
contrast to the latter method, the acquisition of 
a higher number of images and extensive image 
processing is indispensable in hrSIM. Due to the use 
of coherent laser light and a hne illumination structure, 
this method is also more delicate and costly. 


Outlook 

For further resolution improvement beyond hrSIM, 
it becomes necessary to circumvent the Abbe limit 
entirely. This can be achieved by breaking the linear 
dependence between illumination and emission inten¬ 
sity (Eq. 1). Nonlinear structured illumination methods 
like SPEM (Heintzmann et al. 2002), SSIM 
(Gustafsson 2005), higher harmonics microscopy 
(Eujita et al. 2002), and STED (Klar et al. 2000) 
retrieve theoretically unlimited spatial frequencies, 
which corresponds to infinite resolution in the 
reconstructed image. However, in practice, the signal- 
to-noise level limits the resolution. Other promising 
approaches utilizing separation of fluorophores in 
time and single molecule localization are Pointillism 
(Lidke et al. 2005), PALM (Hess et al. 2006), and 
STORM (Rust et al. 2006). 

The requirement for multiple image acquisitions 
in sSIM and hrSIM reduces their speed; in 
contrast, modern approaches like Multicolour SIM 
(Krzewina and Kim 2006) and polarization-coded 
SIM (Wicker and Heintzmann 2010) can achieve the 
same effect with a single exposure. 


Cross-References 

► Photoactivated Localization Microscopy (PALM) 

► Stimulated Emission Depletion (STED) Microscopy 

► Stochastic Optical Reconstruction Microscopy 
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Definition 

• ABC = ATP-binding cassette. ABC proteins are 
defined by conserved sequence motifs in the 
nucleotide-binding domain (“the ATP-binding 
cassette”), which are critical for the binding and 
hydrolysis of Mg-ATP. 

ATP-binding cassette transporters = ABC 
transporters = ABC importers and exporters 
Before 1992, ABC transporters were also referred 
to as traffic-ATPases. 

• ABC exporter catalyzes ATP-driven export of 
compounds from the cytoplasm to the external 
medium or organelle lumen. 

• ABC importer catalyzes ATP-driven uptake of 
solutes from the medium into the cytoplasm of 
prokaryotes. 

• SBP = Substrate-binding protein, originally 
referred to as periplasmic binding protein or 
substrate receptor. Some ABC transporters have 
the SBP fused to the transmembrane domain 
(TMD) and here the receptor is referred as 
substrate-binding domain (SBD). 

• TMD = Transmembrane domain. Two TMDs are 
present per functional ABC transporter; the TMDs 
can be separate proteins, fused to each other, fused 
to the NBD, and/or fused to the SBD. 

• NBD = Nucleotide-binding domain. Two NBDs 
are present per functional ABC transporter. 

• ECF = Energy coupling factor. A subset of ABC 
transporters (named ECF-type) does not require an 
SBP/SBD for solute import. Instead, they use 
an integral membrane substrate-binding protein 
(S-component). 

• ECF module = a tripartite complex with which the 
S-component associates to form a complete 
transporter. The module consists of a membrane- 
embedded subunit (EcfT or T-component) and two 
NBDs (EcfA or A-component). 

• Translocator = TMD plus NBD that facilitates the 
movement of substrate across the membrane. 


• Atypical ABC transporters are neither importers 
nor exporters, but fulfill different functions, 
e.g., ATP-gated channels like CFTR. 

Summary 

Most ABC importers known to date employ 
a soluble substrate-binding protein to capture the 
ligand and donate the molecule to the translocator. 
The SBP can be a soluble periplasmic protein or 
tethered to the membrane via a lipid moiety or pro¬ 
tein anchor or fused to the translocator. In the hybrid 
ABC transporters, multiple SBDs can be fused in 
tandem and provide several extracytoplasmic sub¬ 
strate-binding sites. A subset of ABC transporters 
employs a membrane-embedded S-component to 
capture the substrate. The S-component together 
with the ECF module also forms the translocation 
path for the substrate. Multiple S-components can 
associate consecutively with one and the same ECF 
module. An overview of the mechanism of substrate 
capture by different types of ABC transporters is 
presented, together with a scheme illustrating the 
alternating access mechanism for the overall trans¬ 
port process. 

Introduction 

ATP-binding cassette (ABC) proteins serve many 
functions, including the transport of nutrients into the 
cell, transport of compounds across organellar 
membranes, the secretion of proteins, antigen (peptide) 
presentation, cell volume regulation, regulation of pro¬ 
tein synthesis, detoxification, and antibiotic resistance. 
The vast majority of ABC proteins are part of 
complexes that mediate vectorial translocation, 
i.e., the systems transport molecules across cellular or 
organellar membranes. A smaller group of ABC 
proteins is associated with soluble (supra)molecular 
complexes and involved in DNA repair, recombina¬ 
tion, chromosome condensation and segregation, and 
translation elongation. Regardless of whether the ABC 
proteins are found in membrane transport or soluble 
(supra)molecular complexes, they provide a power 
stroke in which chemical energy is converted into 
mechanical energy (e.g., for a translocation or 
dislocation event). The mechanism of transport of 
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Substrate Capture by ABC Transporters, Fig. 1 Model of 
substrate capture and alternating access in ABC importers. 
A solute is captured by a soluble substrate-binding protein and 
transferred to the membrane-embedded translocator (top). Upon 
binding of ATP, the translocator is brought in the outward-facing 
conformation and is capable of accepting the substrate. Subse¬ 
quent isomerization of the binding site from out to in allows the 
substrate to be released on the trans side of the membrane. Upon 
hydrolysis of ATP and release of the reaction products (bottom). 


the system is reset for another translocation cycle. Although both 
Type I and Type U ABC importers operate according to the 
alternating access mechanism, the nucleotide-induced confor¬ 
mational changes are opposite. In the maltose system (Type I) 
binding of ATP triggers the outward-facing conformation, 
whereas in the vitamin B12 transporter (Type II) ATP binding 
elicits a conformational change from outward- to inward facing 
(loseph et al. 2011) 


ABC importers involves the binding and release 
of substrate from a dedicated extracytoplasmic 
substrate-binding protein and alternating access 
of the substrate-binding site in the translocator domain. 
A simplified scheme of solute translocation by ABC 
importers is depicted in Fig. 1 . In this entry, the empha¬ 
sis is on ABC proteins that mediate transport of solutes 
into prokaryotic cells and the focus is on the proteins or 
protein domains (SBP, SBD, S-component) that cap¬ 
ture the substrate from the medium and deliver it to the 
translocator. 


Classification of ABC Transporters 

ABC transporters have been subdivided into exporters 
and type I and type II importers. Exporters mediate 
transport of molecules from the cytoplasm to the exter¬ 
nal medium or organelle lumen. They bind their 
ligands directly within the TMD, without the need for 
accessory proteins. Access of the binding pocket is 
thought to be either from the membrane (lipophilic 
ligands) or cytoplasm (hydrophilic ligands), depending 
on the system (Bolhuis et al. 1996). Upon binding and 
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hydrolysis of Mg-ATP, the ligand is translocated 
across the membrane and released from the outward¬ 
facing conformation of the protein. Prototypical 
examples of ABC exporters are P-glycoprotein (Pgp), 
the transporter for antigen presentation (TAP), and the 
bacterial putative drug efflux system Savl866 
(Hollenstein et al. 2007). 

Type 1 and type II importers capture their ligand via 
so-called accessory substrate-binding proteins (SBP). 
Upon substrate capture, the SBP changes conformation 
from open to closed and, subsequently, docks onto the 
translocator. Through the binding of Mg-ATP in the 
NBDs, the translocator transits from an inward- to an 
outward-facing conformation, which allows the 
transfer of the substrate from the SBP to the TMD. 
Subsequent hydrolysis of ATP and release of ADP and 
inorganic phosphate (Pi) completes the reaction cycle 
and releases the substrate on the trans side of the 
membrane. Type I and type II importers differ in 
their architecture of the SBP and TMD, but also 
mechanistically these transporters appear to operate 
differently (Davidson et al. 2008; Lewinson 
et al. 2010). The maltose transporter MalFGK2E 
from E. coli is the paradigm for type I import 
(Davidson et al. 2008; Oldham et al. 2001), the vitamin 
B12 transporter BtuCDF from Escherichia coli is 
prototypical of a type II importer (Locher 
et al. 2002). For type I importers, it has been shown 
that a limited number of related SBPs may associate 
with one and the same translocator, enabling the 
system to import multiple distinct substrates. There is 
evidence that type I importers hydrolyze two 
molecules of ATP per substrate translocated (Patzlaff 
et al. 2003). The power stroke for transport is elicited 
by the binding of ATP; the hydrolysis of ATP and 
release of reaction products resets the system to the 
ground state (Davidson et al. 2008; Hollenstein 
et al. 2007). 

Here, the ECF transporters are defined as type III 
importers. ECE-type ABC transporters capture their 
substrates via membrane-embedded S-components, 
which associate with the ECE module (EcfT or 
T-component plus two NBDs, often referred to as 
A and A’ component). The type III importers transport 
micronutrients such as vitamins and trace metal ions, 
and multiple different S-components can associate 
with a single ECE module (Rodionov et al. 2009). 


How the substrate, bound via the S-component, is 
subsequently transported is poorly understood 
compared to type I and type II importers. Crystal 
structures are available for riboflavin- and 
thiamine-specihc S-components (Erkens et al. 2011; 
Zhang et al. 2010). Rearrangement of the membrane- 
embedded LI loop of the S-component may open 
a lateral gate for the vitamin facing the EcfT subunit. 
Repositioning of LI will perturb the binding site resi¬ 
dues, which will reduce the binding affinity and allow 
the vitamin to leave the S-component. It has been 
hypothesized that the actual translocation step takes 
place on the interface between EcfT and the 
S-component, in line with the import of substrates via 
type I and II ABC transporters as well as exporters. 

Structural Features of Substrate-Binding 
Proteins 

Substrate-binding proteins were first discovered in the 
periplasm of E. coli (Berger and Heppel 1974), 
a Cram-negative bacterium, and even to date they are 
often referred to as periplasmic binding proteins. The 
first SBP crystal structure, the L-arabinose-binding 
protein (ABP), was solved in 1974 (Quiocho 
et al. 1974). In general, the soluble SBPs are present 
in large excess over the translocator complexes in the 
membrane, allowing efficient capture of substrates and 
initiation of the translocation reaction. In 
Cram-positive bacteria and archaea, i.e., microorgan¬ 
isms lacking an outer membrane and periplasm, SBPs 
are exposed on the cell surface and attached to the 
cytoplasmic membrane via a lipid-anchor or 
a transmembrane-peptide (to date, the latter has been 
only observed in archaea), or they can be fused to the 
TMDs resulting in two SBDs per functional complex. 
In some cases, two or even three SBDs fused in tandem 
are linked to the TMDs and these systems have a total 
of four or six extracytoplasmic substrate-binding sites. 
Systems with SBDs fused to the TMDs can be also 
found in Cram-negative bacteria but less frequently 
than in Cram-positives (van der Heide and 
Poolman 2002). The linking of SBDs to the membrane 
or the fusing of multiple SBDs to the TMD increases 
the effective concentration of the substrate-binding 
sites near the translocator and may increase the 
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efficiency of transport. The mechanism of substrate 
binding of periplasmic, membrane-anchored, and 
TMD-fused SBDs is similar and so is the mode of 
action of the corresponding ABC transporters. 

The archetype SBP consists of two lobes connected 
by a linker or hinge (domain). The two lobes close and 
engulf the ligand upon substrate binding (mode of 
substrate capture: akin to that of a Venus’s Flytrap) 
(Quiocho and Ledvina 1996). Structures are available 
of SBPs in the open-unliganded, closed-unliganded, 
open-liganded, and closed-liganded forms, of which 
the latter conformation is thought to productively 
interact with the translocator complex. In solution, 
the equilibrium state of the SBPs is toward the open- 
unliganded conformation in the absence of substrate 
and closed-liganded in the presence of substrate. Muta¬ 
tional and structural analyses indicate that each lobe of 
the SBP binds to one of the TMDs (Davidson 
et al. 2008; Hollenstein et al. 2007), and conforma¬ 
tional changes in the NBDs upon binding of ATP are 
transmitted via the TMDs to the SBP. Thus, indirectly 
ATP is involved in the opening of the SBPs and 
substrate transfer to the translocator. 

SBPs are not only associated with ABC transporters 
but with a wide variety of translocation and signal 
transduction systems in both prokaryotic and 
eukaryotic organisms, including tripartite ATP- 
independent periplasmic (TRAP)-transporters, 
two-component regulatory systems, guanylate 
cyclase-atrial natriuretic peptide receptors, G-protein- 
coupled receptors (GPCRs), and ligand-gated ion 
channels (Berntsson et al. 2010). In addition, SBP 
domains are part of prokaryotic DNA-binding proteins 
involved in gene regulation. SBPs are very diverse in 
sequence and phylogenetic analyses based on multiple 
sequence alignments do not yield stable alignments. 
(The sequence identity of the proteins is often <20%.) 
However, the structures of SBPs are remarkably 
similar, which has been used to cluster the SBPs 
based on structural similarity instead of sequence 
similarity (Berntsson et al. 2010). The structures of 
more 100 SBPs have been superimposed in pairwise 
fashion to produce a structural distance tree. The SBPs 
were found to group into six defined clusters (A-F), 
three of which (cluster A, D, and F) were further 
subdivided. The analyses have shown that the proteins 
within the six clusters can be discriminated on the basis 
of the linker (hinge) region that connects the two 


protein lobes (Fig. 1): The generic secondary structure 
is (p) 4 / 5 (a)n-hinge-(P) 4 / 5 (a)n (Fukami-Kobayashi 
et al. 1999). There is little or no correlation between 
the structural clustering and functional classification, 
i.e., SBPs with very different substrate specificities are 
present in each cluster. Furthermore, in the individual 
clusters, the proteins are not necessarily homologous, 
as judged from the absence of significant sequence 
similarity. Clusters B and F contain the most diverse 
SBPs, associated with various types of transport and 
signal transduction systems; SBPs in cluster A are 
unique to type II ABC transporters; cluster C and 
D contain SBPs of type I ABC transporters; and 
SBPs in cluster E are found in TRAP transporters. 
Specific information on the six clusters is presented 
below (Fig. 2): 

Cluster A 

The distinguishing characteristic of cluster A is an 
a-helix serving as the hinge between the two domains. 
The rigidity of this helix is reflected in the small 
movement of both domains upon substrate binding. 
All of the SBPs in cluster A play a role in metal 
binding, either directly or as metal chelates. The 
cluster A SBPs associate with type II ABC importers. 

Cluster B 

Cluster B consists of SBPs that bind carbohydrates 
(such as ribose, glucose, and arabinose), branched 
chain amino acids, natriuretic peptides, and 
autoinducer-2 (AI-2). The SBPs in cluster B interact 
with type I ABC importers, two-component histidine- 
sensory complexes, and guanylate cyclase-atrial 
natriuretic peptide receptors. The hinge of the SBPs 
in cluster B is built of three distinct regions connecting 
the lobes. Homologous to the proteins in cluster B are 
the /ac-repressor type transcription factors, such as the 
LacR, PurR, and CcpA. 

Cluster C 

The cluster C SBPs interact with type I ABC trans¬ 
porters and bind diverse ligands including di- and 
oligopeptides, arginine, nickel ions, and cellobiose. 
They all have an extra domain. For AppA from 
B. suhtilis and OppA from L. lactis, it has been 
shown that the extra domain extends the 
oligopeptide-binding cavity in order to accommodate 
very large ligands. OppA from L. lactis binds peptides 
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Substrate Capture by ABC Transporters, Fig. 2 The differ¬ 
ent clusters of SBPs are shown with their distinct structural 
feature colored in orange, (a) Cluster A contains proteins having 
a single connection between the two domains in the form of 
a rigid helix, (b) Cluster B contains SBPs with three 
interconnecting segments between the two domains, (c) Cluster 
C contains SBPs that have an extra domain and are significantly 
larger in size when compared with the others, (d) Cluster 
D contains SBPs with two relative short hinges (e) Cluster 
E contains SBP associated with TRAP transporters which all 


C-term 



N-term 


C-term 



contain a large helix functioning as hinge region, (f) Cluster 
F contains SBPs with two hinges similar like cluster D; however, 
these hinges have almost double the length creating more flex¬ 
ibility inside the SBP. Please note that clusters A, D, and F can 
further be subdivided based on the substrate of the SBP (see 
text). The proteins used to illustrate the features in clusters A-F 
are BtuF (PDB code: 1N2Z), RBP (PDB code: IDRJ), OppA 
(PDB code: 3DRF), ModA (PDB code: lONR), UehA (PDB 
code: 3FXB), and FlisJ (PDB code: IFISL), respectively 
(Figure was taken from Bemtsson et al. 2010) 


with widely varying length (up to at least 35 residues) 
and sequence. The function of the extra domain in 
other SBPs is not known. 

Cluster D 

The discernible feature of these proteins is that their 
hinge-region consists of two short stretches, 4-5 amino 
acids long. This large group of SBPs binds a large 
variety of substrates: carbohydrates, putrescine. 


thiamine, tetrahedral oxyanion as well as ferric or 
ferrous iron. The subclusters found in this cluster 
correspond to the substrate specificity of the proteins 
(for details, see Berntsson et al. 2010). 

Cluster E 

In this cluster, all substrate-binding proteins are part of 
the TRAP transporter (tripartite ATP-independent 
periplasmic transporter) family. In contrast to ABC 




2508 


Substrate Capture by ABC Transporters 




transporters, the TRAP transporters use an electro¬ 
chemical ion gradient to fuel the uphill translocation 
of substrates. The remarkable feature of TRAP 
SBPs is a large single B-strand that is part of the 
two five-stranded 13-sheets of both lobes. All 
TRAP-dependent SBPs structurally characterized so 
far have conserved features, such as the strand order, 
typical of class II SBPs (P 2 PiP 3 pnP 4)5 an additional 
13-strand connecting both domains as well as the 
number and positioning of the flanking a-helices. 
A second distinguishing feature is a long helix that 
spans both domains. Such a long helix is found in all 
crystal structures of SBP proteins reported for TRAP 
transporters, although in some structures, this helix is 
interrupted by a kink. The known substrates of cluster 
E proteins are limited to ectoine, pyroglutamic acid, 
lactate, 2-keto acids, and sialic acid. 

Cluster F 

The distinguishing feature of the cluster F proteins is 
a hinge consisting of two segments connecting the 
two lobes. The linker stretches of cluster F proteins 
are significantly longer (8-10 amino acids) than the 
hinges of 4-5 amino acids observed in SBPs of cluster 
D. Possibly the longer linker provides more flexibility 
between the open and closed conformation. 
Cluster F SBPs bind a large variety of substrates 
ranging from trigonal planar anions (nitrate, bicar¬ 
bonate) to amino acids and compatible solutes such 
as glycine betaine. The overall structure of the pro¬ 
teins within cluster F is similar, but they can be 
subdivided based on their substrates (Berntsson 
et al. 2010). Noteworthy is the difference in primary 
sequence between OpuAC from B. suhtilis and 
closely related proteins in cluster F: A domain swap 
has taken place. 

Structural Features of S-Components 

The crystal structures of the riboflavin-specific 
S-component from Staphylococcus aureus and thia¬ 
mine-specific S-component from Lactococcus lactis 
have been determined (Zhang et al. 2010; Erkens 
et al. 2011). In both cases, the substrate was present 
in the protein. The overall fold of the two proteins is 
very similar (RMSD = 3.5 A for 145 Ca atoms). 


although at the sequence level, the two proteins are 
unrelated (14% sequence identity). The lack of 
sequence conservation between S-components of the 
ECF transporters is remarkable, because these proteins 
interact with a common partner, the shared ECF 
energizing module. ThiT and RibU contain six 
hydrophobic helical segments that cross the mem¬ 
brane. A part of the LI loop is also embedded in the 
lipid bilayer, which is needed because a-helix 2 is too 
short to span the entire thickness of the membrane. The 
position of LI is thought to play an important mecha¬ 
nistic role in the translocation of substrates across 
the membrane. 

The substrate-binding site is located in a pocket 
near the extracellular side of the membrane and lined 
by helices 4, 5, and 6 and the loops LI and L5. In the 
crystal structures of both RibU and ThiT, the substrates 
are almost completely occluded. Entrance of 
a substrate into the binding site from the external side 
of the membrane would require conformational 
changes of loops LI, L3, and L5, which form lids on 
top of the substrate. On the basis of the structures of 
ThiT and RibU, it seems unlikely that the 
S-components by themselves line a translocation 
path. The translocation pore is most likely formed at 
the interface of the ThiT and EcfT subunits, requiring 
the repositioning of LI. 

In conclusion, type I and type II ABC importers 
employ extramembranous substrate-binding proteins 
to capture substrate and deliver the molecule to the 
translocator. Type III ABC importers use an integral 
membrane protein (S-component) for the initial bind¬ 
ing of substrate; transport is thought to take place at the 
interface of the S-component and another integral 
membrane protein. Whereas full structures of 
type I and type II ABC importers w/wo SBPs are 
available, the translocation mechanism by type III 
importers awaits further structural analysis. 


Cross-References 

► CFTR, Ion Channel Evolved from ABC Transporter 

► Membrane Protein Function 

► Membrane Proteins; Structure and Organization 

► Membrane Transport, Energetics and Overview 
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Synonyms 

Menaquinol-fumarate reductase; Succinate-quinone 
oxidoreductase 

Definition 

Succinate dehydrogenase often referred to as complex 
II or succinate-ubiquinone oxidoreductase is the only 
membrane-bound member of the citric acid cycle (also 
called the tricarboxylic acid (TCA) cycle or the Krebs 
cycle). 

Basic Characteristics 

Succinate dehydrogenase or complex II in most organ¬ 
isms is composed of four distinct protein subunits 
(Cecchini 2003). These subunits are arranged in 
a hydrophilic domain exposed to the cytoplasm in 
bacteria or the matrix of mitochondria. The hydro¬ 
philic subunits are bound to the membrane through 
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interaction with two hydrophobic integral membrane 
subunits which span the inner mitochondrial mem¬ 
brane or irmer membrane in gram negative bacteria. 
The largest subunit (^66 kDa) is the hydrophilic SdhA 
flavoprotein subunit which contains a covalently 
bound > flavin adenine dinucleotide (FAD) cofactor 
and the catalytic active dicarboxylate binding site 
where succinate is oxidized to fumarate. The SdhA 
subunit is bound to the hydrophilic SdhB subunit 
(^27 kDa) which is an iron-sulfur protein which con¬ 
tains three distinct ► iron-sulfur clusters. The SdhB 
subunit is bound to the membrane through interaction 
with the hydrophobic SdhC (^15 kDa) and SdhD 
(^13 kDa) subunits. Amino acid residues from the 
SdhB, SdhC, and SdhD subunits provide a binding 
pocket where the redox active mobile electron carrier 
ubiquinone (UQ) binds and provides a second catalytic 
site in the enzyme where quinone reduction occurs. 
The SdhC and SdhD subunits provide the binding site 
for one additional redox center, namely, a 7»-type heme, 
with a histidyl residue from each subunit providing the 
ligation for the heme. It should be mentioned that the 
family of proteins that encompasses complex II has 
many variations in prokaryotes that may contain only 
a single membrane-spanning subunit and two fo-type 
hemes (Lancaster 2002); however, in all cases the 
hydrophilic SdhA and B-subunits are very similar. 

The edge-to-edge spatial separation of the redox 
centers of the protein (FAD, iron-sulfur clusters, qui¬ 
none, and heme) are all within the 14 A distance 
thought to be optimal for proficient electron transfer 
(Page et al. 1999). Figure 1 illustrates the various redox 
centers in succinate dehydrogenase and shows the spa¬ 
tial distance between the centers as exemplified by the 
Escherichia coli complex II. These distances and the 
general redox properties (Table 1) of all of the redox 
centers are the same regardless of the organism from 
which complex II is isolated. One exception, however, 
is the midpoint ► redox potential of the /j-type heme in 
mammals is approximately 180 mV lower than that 
found in E. coli complex II. The reason for the lower 
redox potential of the I^-type heme in mammals appar¬ 
ently is due to the more polar environment near the 
heme in mammals versus that found in E. coli complex 
II (Sun et al. 2005). 

The physiological reaction of complex II is the two 
electron oxidation of succinate to fumarate (succinate/ 
fumarate couple E^-j = -1-30 mV) coupled with the 
transfer of two electrons and two protons to reduce 





Succinate Dehydrogenase (Complex II), Fig. 1 Electron 
transfer pathway in E. coli complex II showing all redox centers. 
The FAD cofactor is shown in green, with the dicarboxylate 
oxaloacetate (OA) which is present at the active site indicated. 
The iron-sulfur clusters are shown as spheres with the sulfur 
atom in gold and the iron atom in orange. The ubiquinone 
molecule is shown in blue and the heme h in magenta. The 
nearest edge-to-edge distance in A between the redox centers is 
indicated by the dashed line 


ubiquinone to ubiquinol. As can be seen in Fig. 1, the 
FAD and quinone redox centers are connected by the 
chain of iron-sulfur clusters providing a relatively lin¬ 
ear electron transfer pathway to the quinone. The heme 
h is not directly located on this pathway but rather it 
appears that the pathway is branched with the nearest 
edge-to-edge distance from the [3Fe-4S] cluster to the 
quinone ~6.3 A as compared to the ~8.1 A distance to 
the heme h (Ruprecht et al. 2009). There is evidence 
that the heme h does not play a direct role in physio¬ 
logical electron transfer (Tran et al. 2007) although it 
clearly is in electronic communication with the qui¬ 
none and [3Fe-4S] cluster as evidenced by fast reaction 
kinetics (Anderson et al. 2005). This is still an area of 
controversy in the complex II family of enzymes. 

As seen in Table 1 , the redox potentials of the redox 
centers of succinate dehydrogenase are poised to 
enhance the removal of electrons from the flavin to 
the quinone. During the two electron oxidation of 
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Succinate Dehydrogenase (Complex II), Table 1 Midpoint {E^ j) potential of the redox centers ofE. coli complex II 

FAD [2Fe-2S] [4Fe-4S] [3Fe-4S] Ubiquinone h heme 

-80 mV +10 mV -175 mV +65 mV +90 mV +35 mV 


succinate electron paramagnetic resonance (► EPR) 
spectroscopy has shown that a anionic flavin 
semiquinone (FAD'“) is formed during turnover and 
the electrons from the flavin can then be sequentially 
delivered one at a time to the one electron accepting 
iron-sulfur clusters. The iron-sulfur protein subunit 
(SdhB) is comprised of two distinct domains one of 
which adopts a plant-type ferredoxin fold and contains 
the [2Fe-2S] cluster nearest the flavin. The second 
domain adopts a bacterial type ferredoxin fold 
and contains both the [4Fe-4S] and [3Fe-4S] clusters. 
The linear chain of iron-sulfur clusters provides an 
efficient electron transfer pathway from the flavin to 
the quinone. It is worth noting that the midpoint poten¬ 
tial of the [4Fe-4S] cluster in the middle of the chain is 
the lowest of any of the redox centers and this suggests 
it might provide a thermodynamic barrier to electron 
transfer. However, mutagenesis and theoretical studies 
have shown that the [4Fe-4S] cluster does directly 
participate in the electron transfer reactions and is not 
a significant barrier to efficient electron transfer from 
the FAD to quinone (Moser et al. 2006). Thus, elec¬ 
trons are delivered on a microsecond time scale from 
the FAD to the [3Fe-4S] cluster which then donates the 
electron to ubiquinone generating a ubisemiquinone 
(UQ‘~). Upon delivery of a second electron from the 
[3Fe-4S] cluster and two protons, the ubiquinone is 
reduced to ubiquinol (UQH 2 ) completing the reaction 
sequence. The source of the protons, whether they are 
delivered from amino acid residues of the protein back¬ 
bone near the quinone-binding site or associated water 
molecules remains unclear (Ruprecht et al. 2009). 

As suggested above the role of the heme h in elec¬ 
tron transfer in complex II remains enigmatic although 
it clearly has a structural role in maintaining the sta¬ 
bility of the enzyme (Tran et al. 2007). In the electron 
transfer pathway described above and as seen in Fig. 1, 
it is clear that an electron can reduce the oxidized 
heme h', however, it is not necessary for the physiolog¬ 
ical electron transfer reaction. It has been suggested 
that under certain conditions the heme h can serve as an 
electron sink to prevent electron leakage to oxygen 
which otherwise would result in the formation of 


damaging ► reactive oxygen species (Yankovskaya 
et al. 2003). Evidently further work is needed to clarify 
the role of the heme h. 
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► Electron Transport Chains in Bacteria 
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► Gated Electron Transfer 
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Synonyms 

Broadband sum frequency generation (BB-SEG); 
Lipid bilayer; Vibrational sum frequency spectroscopy 
(VSES); Vibrationally resonant sum frequency gener¬ 
ation (VR-SEG) 

Definition 

Sum frequency generation vibrational spectroscopy is 
a nonlinear optical spectroscopic technique to analyze 
surfaces and interfaces, including those of biological 
relevance. 

Introduction 

Sum Erequency Generation (SEG) vibrational spec¬ 
troscopy can be used to study many types of surfaces 


and interfaces, but this entry will focus on studies 
related to biology. Understanding biointerfaces is the 
key to understanding diverse topics such as biomedical 
materials, marine antifouling coatings, biosensors, 
antimicrobial peptides and membrane proteins 
(reviewed in. Ye et al. 2009). Many analytical tech¬ 
niques have been developed to examine various 
biointerfacial phenomena such as surface plasmon 
resonance spectroscopy, neutron reflection, atomic 
force microscopy, ellipsometry, attenuated total reflec- 
tance-Fourier transform infrared spectroscopy 
(ATR-FTIR) (reviewed in, Lee 2005). However, no 
single technique is capable of elucidating molecular 
structure of interfaces in situ with a submonolayer 
surface specificity. 

SEG has been developed into a powerful analytical 
technique to investigate surfaces and interfaces since 
1987 (Guyot-Sionnest et al. 1987). It has been widely 
applied to identify functional groups, deduce molecu¬ 
lar orientation distribution and investigate the dynam¬ 
ics of various types of interfaces including liquid- 
liquid, air-liquid, and liquid-solid interfaces. In 
a typical SEG experimental set-up, two pulsed laser 
beams (a fixed-frequency visible and a tunable- 
frequency infrared) are overlapped in space and time 
at the interface to generate an output beam at the sum 
frequency (Fig. la). The simplified energy level dia¬ 
gram of a SEG process is shown in Fig. lb. Using 
specific polarizations of each of the three beams, it is 
possible to probe specific components of the second 
order nonlinear optical susceptibility the sample. 
This quantity determines the contribution to the 
polarization of matter that is induced by the square of 
an electric field E(t), = Eq E^(t). Under the 

electric-dipole approximation, when inversion sym¬ 
metry is broken (as occurs at surfaces and interfaces), 
X*^^ is nonzero, and signals can be detected; this selec¬ 
tion rule is the basis for the intrinsic surface specificity 
for SEG. Thus, SEG can distinguish interfacial species 
from the bulk molecules, eliminating the need to per¬ 
form background subtraction of bulk signals. Also, 
SEG signal can be enhanced when the input IR beam 
frequency is tuned over the vibrational resonances of 
interfacial molecules, yielding peaks that correspond 
to vibrational spectra of interfaces. These peaks are 
characteristic of specific functional groups, and there¬ 
fore SEG signals can be detected without any fluores¬ 
cent labels that might change the chemical properties 
of the sample. 
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Sum Frequency Generation 
Vibrational Spectroscopy, 
Fig. 1 SFG experimental 
geometry and energy diagram. 

(a) Total reflection geometry. 

(b) Simplified energy level 
diagram of SFG process 
(Reproduced from Ye et al. 
(2009), with permission) 



Sum Frequency Generation 
Vibrational Spectroscopy, 
Fig. 2 SFG spectrum of 
a DSPC/DSPC-rfgs bilayer 
recorded with i-polarized 
sum-frequency, ^-polarized 
visible, and p-polarized IR. 
Insert: schematic of 
asymmetric lipid bilayer 
structure (Reproduced from 
Liu and Conboy (2004a), with 
permission) 



In the following text, SFG studies involving 
phospholipid hilayers (of relevance as model cell 
membranes) will be presented first. Then methods 
for quantifying the ordering and orientations of 
peptides and proteins associated with lipid bilayers 
will be discussed. Lastly, SFG research on proteins 
at polymer surfaces, with a focus on conforma¬ 
tional change and surface immobilization, will be 
introduced. 

Phospholipids Flip-Flop in Model Cell 
Membranes 

Extensive SFG research has been performed on fatty 
acids and lipid monolayers/bilayers. Some researchers 
believed that the exchange of lipids between the distal 
and proximal leaflets of cellular membranes was medi¬ 
ated by proteins named “flipases” or “flopases.” 
Although other techniques such as NMR, fluorescence 


and EPR have shown that the movement is also 
possible in the absence of proteins, these studies usu¬ 
ally used chemically modified lipids and the measured 
rates of flip-flop are usually very slow. Conboy 
and coworkers used SEG to observe the slow move¬ 
ment (“flip-flop”) of l,2-distearoyl-sn-glycero-3- 
phosphocholine (DSPC) in a planar supported lipid 
bilayer system, in the absence of protein in situ (Liu 
and Conboy 2004a). The regular lipid bilayer has 
inversion symmetry more or less. There is a water 
layer between the substrate and the lipid bilayer 
shown in Pig. la, therefore the lipid leaflet closer to 
the substrate and the lipid leaflet closer to the subphase 
are similar. In order to differentiate SPG signals from 
each leaflet in a supported lipid bilayer, usually 
a bilayer with one hydrogenated and one deuterated 
leaflet is used. 

As shown in Fig. 2, Conboy’s model system was 
partially deuterated in order to induce asymmetry for 
SFG study. As the two leaflets underwent flip-flop and 
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Sum Frequency Generation 
Vibrational Spectroscopy, 
Fig. 3 Representation of gel 
(blue) to liquid-crystalline 
(red) phase transition 
illustrating domain dislocation 
and domain size disparity 
which could give rise to 
membrane asymmetry. Also 
shown is the cancellation of 
the terminal CH 3 symmetric 
stretching mode in case of 
symmetric leaflets, i.e. before 
and after the phase transition 
(Reproduced from Liu and 
Conboy (2004b), with 
permission) 




“mixed,” the intensity of the SFG signal from the 
terminal CH 3 symmetric stretching mode of the inner 
layer (which is next to the substrate) decreased. Later, 
the authors measured the phase transition temperatures 
of different lipids (Liu and Conboy 2004b). Unlike the 
previous work, the bilayer used was symmetric (Fig. 3) 
and produced no signal when uniform in the gel phase 
and the lipid phase. When the temperature approached 
the phase transition temperature, heterogeneities arose 
that resulted in a break in symmetry along the mem¬ 
brane normal (and, consequently, detectable SFG sig¬ 
nals). Together, these bilayer studies show that SFG 
can be used to detect a biological process that changes 
the symmetry. As in the flip-flop study, this was done 
by introducing isotope labeled lipids to break the orig¬ 
inal symmetry of the lipid bilayer and study each 
leaflet of the bilayer simultaneously. Later on, other 
groups used similar isotope labeled lipid models to 
study the interactions between various molecules and 
the model cell membrane. For example, molecular 
interactions between an antibiotic oligomer and 
a lipid bilayer have been investigated using SFG, 
showing that the molecule can cut into the lipid bilayer 
like a knife (Chen et al. 2006). When the molecular 
structure of the oligomer is altered, the interactions 
with bilayers can be markedly varied (Avery et al. 
2009). SFG studies also showed that when melittin 
molecules interact with the lipid bilayer, the two leaf¬ 
lets are disrupted differently, one after the other. When 


tachyplesin I molecules interact with the bilayer, 
two leaflets were disrupted simultaneously (Chen and 
Chen 2006). 

Peptides with a-Helical or p-Sheet 
Secondary Structure in Lipid Bilayers 

Analogous to IR spectroscopy, the amide I band in 
SFG spectra provides information about the secondary 
structure of peptides and proteins. Moreover, polarized 
SFG spectroscopy can be adopted to deduce the orien¬ 
tation distribution. For an orientation angle 0, e.g., the 
tilt angle between an a-helical principal axis and the 
surface normal, ATR-FTIR can relate the signal inten¬ 
sity ratio measured using different polarized IR beams 
to <cos^0> (the ensemble average of cos^0). SFG 
spectroscopy measures different parameters, <cos 0 > 
and <cos^ 0 >, making it possible to characterize the 
orientation and distribution in more detail. In addition 
to providing more measurements than ATR-FTIR, 
SFG has a much lower detection limit due to its intrin¬ 
sic sensitivity. It can be used to study peptides with 
a low surface coverage and under conditions where 
ATR-FTIR could not detect any signal. For example, 
SFG can measure the orientation of magainin 2 at 
the concentration of 2.0 pM on POPC bilayer, but 
there is no ATR-FTIR signal detected (Nguyen et al. 
2009a). 
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Zhan Chen group has developed systematic meth¬ 
odologies for orientation analysis on interfacial a-heli- 
cal, 3io-helical and P-sheet structures using SFG 
(Nguyen et al. 2009b, 2010). Based on the methodol¬ 
ogy for a-helical peptides, Zhan Chen and his col¬ 
leagues have measured the average tilt angle of 
magainin 2 peptides in lipid bilayers, and found that 
on negatively charged (POPG/POPG) lipid bilayer, 
magainin 2 molecules adopt a transmembrane orienta¬ 
tion whereas on zwitterionic (POPC/POPC) lipid 
bilayer, a parallel orientation is preferred (Nguyen 
et al. 2009a). These results are of relevance to under¬ 
standing the antimicrobial activity and selectivity of 
magainin 2. Since SFG and ATR-FTIR measure dif¬ 
ferent orientation parameters, the combination of 
SFG and ATR-FTIR studies can be used to determine 
complicated orientation distributions. For example, 
combined studies on a-helical melittin indicate that 
melittin molecules adopt two distinct orientations, 
with ^30% roughly standing up while ^70% lie 
down and tilt slightly on the membrane surface. Such 
a complicated orientation distribution could not have 
been determined using only a single technique or mea¬ 
surement alone (Chen et al. 2007b). With the 
methodology for P-sheet peptides, Zhan Chen group 
successfully determined the tilt and twist angles of 
a P-sheet peptide tachyplesin I in DPPG/dDPPG lipid 
bilayers. Interestingly, unlike linear a-helical peptides, 
the chiral symmetry of the P-sheet structure allows the 
observation of chiral signals in additional polarizations 
than ssp and ppp (Nguyen et al. 2010). SFG has also 
been applied to investigate the orientation of 
alamethicin, which adopts a mixed a-helical and 3io- 
helical structure. It was found to adopt with 
a transmembrane orientation in fluid-phase lipid bila¬ 
yers. (Ye et al. 2010b) 

Peripheral Proteins 

Many efforts have been made to understand the 
structure and function of proteins associated with 
cellular membranes. Such functions are often 
dependent on the protein adopting a certain orien¬ 
tation, with is hard to measure in situ with most 
structure techniques. By extending the analysis 
methods from simple peptides to larger proteins, 
SFG was used to determine the membrane orienta¬ 
tion of GPy. This orientation was found to depend 


on the composition of the lipid bilayer, due to 
differences in how the protein interacts with each 
type of lipid (Chen et al. 2007a). 

Proteins and Peptides at Other Interfaces 

Biomedical technologies commonly involve artificial 
surfaces that protein can interact with. For example, 
the blood protein Factor XII can be activated on neg¬ 
atively charged surfaces, but not on neutral surfaces. 
The activation of Factor XII leads to blood coagulation 
through the intrinsic pathway. Factor Xll has been 
investigated using polarized SFG spectroscopy along 
with some other techniques. The results show that 
a negatively charged polymer surface can cause Factor 
XII activation by orienting the protein molecules on 
the surface (Chen et al. 2007c). 

Another important blood protein, fibrinogen, has 
been investigated comprehensively by Zhan Chen 
group to understand its molecular behavior on 
polymer interfaces. It was shown that fibrinogen 
changes its conformation over time after adsorption 
onto various surfaces, including a polyurethane, a 
silicone-polyurethane copolymer and a fluorinated 
polymer. After the initial adsorption, fibrinogen 
adopts a “bent” structure. On the polyurethane sur¬ 
face, fibrinogen slowly lies down and becomes 
linear. On the other two surfaces, fibrinogen 
becomes even more “bent” (Clarke et al. 2005). 
SFG has also been used to study fibrinogen on 
polystyrene surfaces. Orientation analysis on the 
SFG spectra along with maximum entropy function 
method supports that fibrinogen exhibits a broad 
distribution of orientations on polystyrene surfaces 
(Wang et al. 2006, 2008). 

Molecular orientation is also important to the 
understanding of biosensors based on immobilized 
enzymes, for which performance is mediated by the 
molecular structure (e.g. orientation) of the active 
site. SFG can be used to directly observe differ¬ 
ences in molecular orientation when a variety of 
immobilization methods are used. SFG results 
show that C-terminus cysteine modified cecropin 
PI peptides stand up when chemically immobilized 
onto a polystyrene maleimide (PS-MA) surface but 
adopt a multiple-orientation distribution when 
physically adsorbed on the polystyrene surface 
(Ye et al. 2010a). 
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Summary 

SFG is a powerful biophysical and analytical technique 
that can be used to study biological molecules at inter¬ 
faces; valuable and novel results have been obtained 
for lipids, peptides and proteins. SFG can be used to 
probe buried interfaces in situ with submonolayer sur¬ 
face specificity. It can be used to individually probe 
behavior of each leaflet in a lipid bilayer, measure the 
orientation and orientation distribution of peptides 
with different secondary structures at interfaces in 
situ, and examine the orientation of larger proteins. 
The results that were reviewed in this article represent 
only the earliest stages of SFG as a bioanalytical 
method; in the future, SFG has the potential to eluci¬ 
date highly detailed mechanism of complex interfacial 
biological processes. 
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Definition 

Superoxide dismutases (SODs) are enzymes that cata¬ 
lyze the proton-dependent disproportionation of two 
molecules of 02 ' to one molecule of O 2 and one of 
hydrogen peroxide (Eq. 1). 

202*-+ 2H+^ H 2 O 2 + O 2 (1) 

Introduction 

Although thermodynamically reactive, O 2 has accu¬ 
mulated to some 20% of our atmosphere in the 
presence of oxidizable substances, due to a kinetic 
barrier originating in 02 ’s triplet spin state. There¬ 
fore, biochemical mechanisms for utilization of O 2 
commonly proceed via a superoxide ( 02 *^) interme¬ 
diate. 02 ' is also produced spontaneously upon 
reaction of O 2 with radical intermediates and 
reduced transition metal ions. © 2 *^ retains the ther¬ 
modynamic reactivity of O 2 , and can satisfy the 
requirement of spin conservation by initiating free 
radical chain reactions. Thus, 02 * is the parent of 
a series of ► reactive oxygen species and reactive 
nitrogen species which underlie ► oxidative stress 
and the symptoms of many medical conditions 
including inflammation, neurodegeneration, arthritis, 
and aging. SODs therefore provide vital defenses 
against oxidative stress. 


Basic Characteristics 

The Significance of Dismutation 

The reaction catalyzed by SOD is very favorable, 
however, it requires catalysis because (1) it entails 
bringing together four reactants, (2) it is not energeti¬ 
cally realistic to bring two anions together, and 
(3) © 2 *^ itself is only moderately reactive. Alternative 
methods for eliminating © 2 *^ such as conversion of 
© 2 *^ to H 2©2 require a reducing equivalent and two 
protons. However the availability of reductants will be 
limited when a system is under acute oxidative stress. 
In contrast, SOD provides an essential stand-alone 
means of removing © 2 *^. 

Evolution has produced three independent families 
of SOD: Cu and Zn-utilizing SODs utilize a redox- 
active Cu supported on the outside of a beta barrel by 
histidine ligands and a Zn^"^ site; NiSODs employ 
a mononuclear Ni site bound by thiolate ligands at 
the terminus of a four-helix bundle; EeSODs and 
MnSODs have their metal ion bound by N and 
© ligands in the interface between domains of 
a mixed alpha and beta monomer. 

These SODs all employ a two-step ping-pong reac¬ 
tion in which each half-reaction involves only two 
substrates. In one, © 2 * is oxidized to ©2 (Eq. 2a) 
and in the other © 2 *^ is reduced to H 2©2 (Eq. 2b) 

© 2 *^ -f H+ -f SODox ^ ©2 + SODred • H+ (2a) 
02*- -f H+ + SODred • H+ ^ H2©2 + SODox (2b) 

Each SOD enables anionic © 2 *^ to acquire an 
electron (Eq. 2b) by redistributing and neutralizing 
charge, binding of © 2 *^ to the cationic metal ion or 
by supplying a proton to © 2 * ■ Each SOD employs an 
outer-sphere mechanism for at least one half-reaction, 
and each appears to tightly couple transfer of a proton 
to electron transfer, resulting in charge-neutral chem¬ 
istry that supplies a proton to © 2 *^ thus greatly favor¬ 
ing its reduction. Finally, all three SODs employ 
a transition metal ion to permit reconcilliation of spin 
differences between © 2 *^ and each of the products. 

Fe-Containing SODs (FeSODs) 

FeSOD is a dimer that binds one Fe per monomer at the 
interface between domains using residues from both, 
in addition to a coordinated solvent molecule (Fig. 1). 
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Superoxide Dismutases, Fig. 1 Cartoon of the fold {left), and 
active site (right) of FeSOD and MnSOD based on the coordi¬ 
nates lISA.pdb (Lah et al. 1995). The N-terminus of one mono¬ 
mer is yellow and the C-terminus of the other is blue. The active 
site Fe or Mn ion is orange and is coordinated by three conserved 


Flis’, an Asp“ and a molecule of coordinated solvent that is 
involved in a Fl-bond network with the conserved Gin and Tyr. 
This and all other molecular graphics were produced using 
Pymol (DeLano 2002) 
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Superoxide Dismutases, Scheme 1 Proposed mechanism for FeSOD based on (Miller 2003). Dashed lines denote H-bonds here 
and elsewhere 


For FeSOD, the mechanism is believed to proceed 
via 02 *^ coordination to Fe^"^ (Bull and Fee 1985; 
Miller 2001; Miller et al. 2005) followed by inner- 
sphere electron transfer to produce O 2 and Fe^"^. 
Reduction of Fe is tightly coupled to Fl"^ uptake (Bull 
and Fee 1985), converting coordinated OH to coordi¬ 
nated H 2 O (Scheme 1). 

FeSOD is believed to employ a second-sphere 
mechanism for the second half reaction, based on the 
demonstration that competitive inhibitors of this half 
reaction do not coordinate to Fe (Miller et al. 2005). 
The return to the Fe^"^ state is associated with proton 
release from coordinated solvent. Transfer of this to 
nascent peroxide greatly facilitates 02 *^ reduction. 
Deprotonation of coordinated solvent also restores its 
strength as a ligand and enables it to out-compete 
peroxide, which has been proposed to form a side-on 
complex in MnSOD (below). Rate-limiting product 


dissociation is also dependent on a (Bull and Fee 
1985). Protonation of HOT to neutral H 2 O 2 enables 
this product to escape the positive electrostatic poten¬ 
tial of the enzyme active site. 

Mn-Containing SODs (MnSODs) 

The overall structure of MnSOD is very similar to that 
of FeSOD, and the active site amino acids are similarly 
conserved (Fig. 1). Both sites couple proton transfer to 
electron transfer; however there are critical distinc¬ 
tions that are related to the identity of the metal ion 
and the different electronic configurations associated 
with versus Fe^'^'^^'^. While Fe^'^SOD is com¬ 

petitively inhibited at high pH by OH“ coordinating to 
Fe^"^, Mn^'^SOD suffers deprotonation of the conserved 
active site Tyr34 (Maliekal et al. 2002). In contrast to 
its inner-sphere binding to Fe^'^SOD, N 3 binds to Mn^ 
"^SOD predominantly in the outer-sphere (Jackson 
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Superoxide Dismutases, Scheme 2 Proposed mechanism for MnSOD based on Miller (2003) 


et al. 2004), so it has been proposed that substrate 
forms an outer-sphere E • S complex that nonetheless 
could employ a transient inner-sphere mechanism for 
electron transfer (Scheme 2). 

In the second half-reaction, ► EPR studies demon¬ 
strate that substrate analogs can coordinate directly to 
Mn^"^ with assistance from Tyr34 (Tabares et al. 2007) 
consistent with Mn^'^’s high-spin d^ electronic config¬ 
uration. Transfer of a proton from coordinated solvent 
to the distal O of nascent peroxide would favor elec¬ 
tron transfer to product but not product dissociation. 
Indeed, at high 02*^ concentrations some form of 
coordinated (hydro)peroxide has been shown to trap 
the enzyme in an oxidized but peroxide-coordinated 
state (Hearn et al. 2001). Deprotonation of coordinated 
solvent in conjunction with Mn oxidation would not 
only supply a proton for formation of hydroperoxide, 
but would also generate the stronger OH~ ligand which 
would discourage formation of a side-on peroxide 
complex and even promote HO 2 dissociation since 
Mn^"^ is a Jahn-Teller ion and would tend not to be 
six-coordinate. The inhibited complex could reflect 
a branching ratio affecting the site of protonation of 
nascent peroxide, as peroxide protonated on the distal 
O could bind or remain bound to Mn^"^ whereas pro¬ 
tonation of peroxide on the proximal O would essen¬ 
tially displace the Mn^"^ ion. 

Cu and Zn-Containing SODs (Cu,ZnSODs) 

Cu,ZnSOD is a homodimer composed of 16 kDa 
P-barrel monomers that each support an independent 
active site via external loops (Fig. 2). In the active site, 
a Cu ion is coordinated by His side chains in 
a geometry that alternates between distorted trigonal 
bipyramidal His 4 -Cu^'^ and trigonal planar His 3 -Cu'^. 
One of the Cu^"^ ligands, His63 (numbering of 


Saccharomyces cerevisiae) also coordinates the Zn^"^ 
ion, which is roughly tetrahedral and additionally coor¬ 
dinated by two more His’ and an Asp” (Fig. 2b). 
A conserved Argl43 at the entry to the active site is 
believed to play an important electrostatic role in sub¬ 
strate binding and possibly proton transfer to nascent 
product. 

Again, a pair of sequential reactions is proposed 
(Scheme 3). © 2 *^ is believed to coordinate to Cu^"^ 
based on spectroscopic and crystallographic studies, 
and reduce it via inner-sphere electron transfer 
(Reviewed by (Hart et al. 1999; Miller 2003)). The 
reduced Cu* is trigonal planar as His63 is released 
and takes up a proton. 02 *^ then binds reduced 
Cu,Zn-SOD in an anion-binding site formed by a 
conserved Argl43, thereby becoming at least 
partially protonated and ripe for reduction via an 
outer-sphere mechanism. Electron transfer to 02*^ is 
accompanied by transfer of a proton from His63 
to nascent peroxide. Rebinding of His to the Cu 
further promotes its oxidation and departure of perox¬ 
ide. Once again, tight coupling of proton and electron 
transfer results in a concerted reduction and proton¬ 
ation 02 *^ to HO 2 while avoiding high energy transi¬ 
tion states. 

Cu,Zn-SOD is implicated in several neurodegenera- 
tive and inflammatory conditions (Potter and Valentine 
2003; Ramirez et al. 2009). Debate continues as to 
the significance of loss of SOD activity, accumulation 
of protein aggregates and pro-oxidative activity of 
damaged or mutant Cu,ZnSODs (Potter and Valentine 
2003). Cu,Zn-SOD reacts with H 2 O 2 via at least three 
different mechanisms. Reaction with the active site Cu 
results in fragmentation of the protein at specific sites 
and oxidation of active site residues resulting in Cu 
release (Ramirez et al. 2009). Released Cu can catalyze 
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Superoxide Dismutases, 
Fig. 2 Cartoon of the yeast 
Cu,Zn-SOD monomer (left), 
and active site (right), based 
on a superposition of oxidized 
azide-bound Cu.ZnSOD 
(green C and Cu, lYAZ (Hart 
et al. 1999)) and reduced Cu, 
ZnSOD (cyan C and darker 
teal Cu, IJCV (Ogihara et al. 

1996)). Zn is colored gray. The 
azide is to the right of the Cu^"^ 
ion in the right-hand panel 
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Superoxide Dismutases, Scheme 3 Proposed mechanism of Cu,Zn-SOD, based upon (Half et al. 1999; Miller 2003) 


deleterious reactions, resulting in fragmentation of other 
proteins and SOD fragmentation at other sites (Ramirez 
et al. 2009). Finally, in the presence of HC 03 ~/C 02 , 
diffusible C 03 *^ is formed which oxidizes other 
proteins and other sites and leads to protein aggregation 
(Goss et al. 1999; Liochev and Fridovich 2004). 

Ni-Containing SODs (NiSODs) 

NiSOD was discovered in Streptomyces species (Youn 
et al. 1996), is implied in Actinomycetes and 
cyanobacteria, and is much less widely distributed than 
the other SODs (Wuerges et al. 2004). NiSOD is 
a hexamer of four-helix bundles, each of which subtends 
a Ni site at the N-terminus of the hrst helix (Barondeau 
et al. 2004; Wuerges et al. 2004) (Fig. 3, left). 
A H-bond network between adjacent monomers is crit¬ 
ical for stabilizing the active site and activity (Herbst 
et al. 2(309). The Ni^"^ is coordinated in square planar 
geometry by the protein’s terminal amine as well as 
a deprotonated backbone amide N and two Cys ^ S“ 
groups, whose electron-rich contributions to the coordi¬ 
nation sphere are believed to lower the ► redox potential 
of the Ni and favor Ni-centered chemistry over oxidation 


of the thiolates (Fiedler et al. 2005). The Ni^"^ state is 
five-coordinate with additional coordination of a side 
chain N of His 1 (Fig. 3, right). 

Mechanistic details have begun to emerge. The 
active site Ni cycles between the +3 and +2 states 
and 02 ' is believed to bind near Ni^"^, benefitting 
from H-bonds from the phenol of Tyr9 as well as 
backbone amides (Barondeau et al. 2004; Herbst 
et al. 2009) (Scheme 4). An outer-sphere mechanism 
is proposed for the first half reaction based on EPR 
demonstration that Ni^"^ does not coordinate N 3 “ or 
Cl~ (Barondeau et al. 2004; Herbst et al. 2009). 
Resulting O 2 would depart leaving Ni^'^SOD wherein 
the erstwhile axial imidazole ligand is dissociated 
(Wuerges et al. 2004) and has acquired a proton 
(Barondeau et al. 2004). The Ni^"^ formed upon reac¬ 
tion with a second 02 *^ recaptures the side chain of 
Hisl as a ligand. Although Hisl is too far from the 
presumed substrate binding site to transfer a proton 
directly to nascent peroxide, transfer via a Cys thiolate 
is deemed viable (Fiedler et al. 2005) and would make 
a proton available close to the Ni-proximal end of O 2 ' 
thus favoring dissociation of H 02 ~. 
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Superoxide Dismutases, 
Fig. 3 Fold {left) and active 
site {right) of Ni-SOD. For the 
fold, monomers A and C are 
shown, based on IQOD.pdb). 
The oxidized active site (green 
C and Ni, based on IQOD.pdb) 
is superimposed on the 
reduced active site (light-blue 
C, based on IQOK.pdb). 
Coordinates are those of 
(Wuerges et al. 2004). 

A solvent molecule above the 
Ni suggests a plausible site of 
substrate binding {right panel) 
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Superoxide Dismutases, Scheme 4 Mechanism of NiSOD, based on (Barondeau et al. 2004; Flerbst et al. 2009) 


Parallels and Differences among the SODs 

While the SODs described differ with respect to iden¬ 
tity of the redox-active metal ion and supporting amino 
acids, the four mechanisms described reveal common 
strategies for solving the problems posed by dispro¬ 
portionation of 02 *^ and the reactivity of the toxic 
H 2 O 2 product that results. Amino acids such as the 
Tyr in the active sites of Fe-, Mn-, and Ni- SODs are 
thought to play important roles in product dissociation 
thus diminishing the enzyme’s tendency to form 
inhibited complexes or to succumb to deleterious reac¬ 
tions with peroxide. Since proton transfer is critical, it 
is not surprising that constellations of crystallographi- 
cally-conserved waters and polar amino acid side 
chains are observed in all the SODs. In all three fam¬ 
ilies, a proton is released from a metal ion ligand upon 
metal ion oxidation. This tight coupling of proton 
transfer to electron transfer conserves the overall 
charge of the active site, while also providing 
a proton that is critical for reduction of O 2 ' in the 
outer-sphere and dissociation of resulting peroxide. 


Summary 

The existence of three independently evolved classes 
of superoxide dismutases offers a rare opportunity to 
compare multiple mechanisms to identify common 
themes that are therefore likely to be determined by 
the chemical and physical challenges of the reaction, 
whereas differences between the different SODs may 
be interpretable in terms of the different metal ion 
cofactors and protein contexts. Recurring mechanistic 
elements include a ping-pong mechanism in which the 
metal ion alternates oxidation state, proton coupled 
electron transfer, use of both inner- and outer-sphere 
substrate binding, devices for supplying protons in 
a positionally defined way, and electrostatic and steric 
contributions to substrate binding and specificity. 

Cross-References 

► Catalases and Catalase-Peroxidases 

► Coupled Electron Transfer 
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► Mitochondrial Electron Transport 

► Oxidative Stress 

► Proton-Coupled Electron Transfer 

► Reactive Oxygen Species 

► Redox Potential 
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Introduction 

Supported lipid bilayers are used in many biophysical 
investigations as well as in analytical devices. They 
are mechanically stable with defined properties, 
and they can be interfaced with means to monitor 
electric, mechanical, or diffusion-related properties 
(► Lipid Lateral Diffusion). Furthermore, supported 
lipid bilayers have been designed to exhibit well- 
defined shapes and curvature, which can he planar, 
spherical, or cylindrical depending on the nature of 
the support. They have been prepared as single bilayer 
deposits (Brian and McConnell 1984) or as stacks of 
lipid bilayers where only the outermost membranes are 
in contact with the solid support (Fig. 1) (Powers and 
Clark 1975). In the latter case, the mechanical align¬ 
ment of the membrane in direct contact with the 
mechanical support is transmitted throughout the 
stack through the interaction between subsequent bila¬ 
yers (Fig. le) (Aisenbrey et al. 2010). Supported lipid 
membranes come in many different physical arrange¬ 
ments, and the choice depends on the underlying 
application, physical method used for their investiga¬ 
tion and requirements on stability, ease of preparation, 
and so on (Fig. 1). The composition of the lipid mem¬ 
branes can be that of a native or of model membranes 
(► Chemical Diversity of Lipids, ► Functional Roles 
of Lipids in Membranes). Supported lipid bilayers 
have also been formed from cell membrane fragments 
(Bally et al. 2010; Perez et al. 2006). 

Applications of Supported Bilayers in 
Research and Industry 

Supported lipid bilayers are useful tools in research 
and used in analytical setups. Therefore, applications 
are manyfold and include the measurement of the 
association of molecules to ► membrane proteins 
using, for example, surface plasmon resonance (SPR) 
(Cooper 2002) or high-throughput diagnostic 
biosensing based on liposomal arrays (Bally et al. 
2010). Furthermore, nanopore settings have been 
designed for sensing the electrochemical properties 
of membrane ion channels (► Ion Channel Regulation 
by G-Protein-Coupled Receptors; Recent Advances 
with Optical Biosensors) when inserted into 


freestanding supported lipid bilayers (Fig. lb) (Bally 
et al. 2010; Reimhult and Kumar 2008). The possibil¬ 
ities of such supported lipid bilayers for high- 
throughput screening of ► membrane proteins for 
diagnostics and drug discovery are of considerable 
interest. Valuable insights into the properties of lipid 
membranes have been obtained from biophysical mea¬ 
surements on supported ► asymmetric lipid bilayers 
(Kiessling et al. 2009) or when the lateral phase sepa¬ 
ration (► Lipid Domains) was investigated using opti¬ 
cal spectroscopies and ► atomic force microscopy 
(AFM) (Ivanov et al. 2006; Loose and Schwille 
2009). Furthermore, settings have been designed 
where structural and functional properties of ► mem¬ 
brane proteins are investigated simultaneously 
(Naumann and Knoll 2008). 

Supported Lipid Bilayers Occur in Many 
Shapes and Characteristics 

Whereas single bilayer or monolayer deposits are com¬ 
monly explored types of supported lipid bilayer 
arrangements in analytical devices, stacks of supported 
lipid membranes (Fig. le) have considerable advan¬ 
tages when bulk amounts of material are to be investi¬ 
gated in an uniaxially oriented fashion by biophysical 
approaches such as ATR-FTIR ► (IR/ATR), oriented 
circular dichroism or solid-state ► NMR spectroscopy 
(► Structure Determination by NMR: Overview, 

► NMR Methods for Kinetic Analysis, ► NMR, 

► Multidimensional NMR Spectroscopy) (Aisenbrey 
et al. 2010; Arkin 2006). These methods have allowed 
the development of new strategies for the analysis of 
the structure, dynamics, topologies, and interactions of 
membrane proteins and peptides (Arkin 2006; 
Bechinger et al. 2011; Martin et al. 2003) (► Solid- 
State NMR, ► Lipid Lateral Diffusion). 

When compared to studying liposomes in solution 
or Langmuir monolayers, supported lipid bilayers have 
distinct advantages. First of all, the membranes can be 
aligned along the surface and biophysical investiga¬ 
tions performed in uniaxially oriented systems (Arkin 
2006; Bechinger et al. 2011; Martin et al. 2003). As 
stacks of macroscopic size can be prepared, consider¬ 
able amounts of membrane material can be investi¬ 
gated to boost the sensitivity of the experiment 
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Supported Lipid Bilayers, 
Fig. 1 Sketches supported 
lipid bilayer arrangements. 

(a) Membranes directly 
oriented along the support, 

(b) membranes suspended 
over nanosized openings, 

(c) polymer cushions or 
hydrogels create an additional 
spacing between the bilayer 
and the solid support, 

(d) bilayer tethered to a solid 
surface, and (e) stacks of 
oriented membranes on a solid 
support. Whereas the solid 
support is shown in gray, the 
hydrophobic interior is 
illustrated in green and 

a membrane-inserted protein 
in yellow 



(Bechinger et al. 2011). Second, the investigation of 
a lipid bilayer on solid surfaces permits the use of 
surface science tools and allows for the presentation 
of membrane receptors (► Membrane Proteins: Struc¬ 
ture and Organization) to cells or to molecules in the 
surrounding solution, both being important aspects for 
the development of biosensors and related devices 
(Oliver and Parikh 2010). Such setups also allow the 
dissection of molecular events, as they are easier to 
image by optical and surface science techniques such 
as SPR or ► AFM, the latter requiring direct physical 
contact (Loose and Schwille 2009). Finally, the 
supported lipid bilayers can be applied to surfaces of 


different shape such as flat, circular, tubular, or cylin¬ 
drical, thereby modulating in a controlled manner the 
spatial range of lipid lateral diffusion (► Lipid Lateral 
Diffusion) (Kam 2009) or the bilayer curvature of the 
membrane (Loose and Schwille 2009; Oliver and 
Parikh 2010). This allows specific applications 
such as membranes on beads (Tanaka and Sackmann 
2005) or to make them fit a particular technical 
device (Salnikov et al. 2010). Furthermore, with 
supported lipid bilayers, the miniaturization of the 
sample or the creation of asymmetric bilayers by the 
Langmuir-Blodget technique is possible (Kiessling 
et al. 2009). 
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The Preparation of Supported Lipid Bilayers 

In view of the versatility of shape, composition, and 
macroscopic arrangements of supported lipid hilayers, 
it is not surprising that a variety of methods exist for 
their preparation. They are obtained from Langmuir- 
Blodgett approaches as well as by vesicle fusion 
(Richter et al. 2006). Forming a monolayer by the 
Langmuir-Blodgett approach and fusing it with vesi¬ 
cles to form the second leaflet allow one to create 
asymmetric lipid bilayers as well as the directed recon¬ 
stitution of membrane polypeptides (Kiessling et al. 
2009; Li et al. 2009). This asymmetry is better pre¬ 
served when the bilayer is in direct contact with 
a polymer cushion (Fig. Ic) rather than the solid sup¬ 
port (Fig. la) (Kiessling et al. 2009). 

The preparation of multilayered supported mem¬ 
branes (Fig. le) can also be achieved by vesicle fusion 
(Aisenbrey et al. 2010; Powers and Clark 1975). As 
bulk amounts of material need to be deposited, 
a common technique is to slowly dry the vesicles 
onto a solid support and/or to remove bulk water by 
slow equilibration of a vesicle suspension at defined 
humidity (e.g., 93% r.h.). This method has allowed for 
the investigation of membrane-associated proteins and 
peptides in uniaxially oriented systems (Arkin 2006; 
Bechinger et al. 2011; Martin et al. 2003). 

When peptides interacting with stacks of bilayers 
are investigated (Fig. le) an alternative method con¬ 
sists in preparing a homogenous organic solution of 
lipids and peptides and to apply this mixture onto the 
solid support (Aisenbrey et al. 2010). Obviously, in 
such cases, the stability of the support in the presence 
of solvents merits consideration. The removal of all 
organic solvent by exposure to high vacuum is 
followed by equilibration of the membranes in defined 
humidity, stacking, and sealing of the sample 
(Aisenbrey et al. 2010). In general, this method results 
in better orientational heterogeneity when compared to 
the vesicle deposition but can only be used for poly¬ 
peptides and systems that remain stable and recover 
their native conformation after exposure to the respec¬ 
tive organic solvent mixtures. When stacks of mem¬ 
branes are to be prepared, special attention has to be 
paid to membrane hydration. Whereas the presence of 
bulk water results in the detachment of bilayers from 
the stack and thereby the loss of orientational order, 
insufficient hydration may lead to artifacts, unneces¬ 
sary steric constraints, and even protein denaturation. 


Therefore, equilibration in 93-95% r.h. has proven to 
be a suitable compromise for many applications which 
assures that most of the membrane-associated water is 
present (Aisenbrey et al. 2010). 

Mechanisms of Formation of Supported 
Lipid Bilayers 

A number of studies have been performed that inves¬ 
tigated the mechanisms of how supported lipid bilayers 
form with the goal to improve their properties (Richter 
et al. 2006). The formation of supported lipid bilayers 
was followed with the help of ellipsometry and quartz 
crystal microbalance (QCM) approaches to keep track 
of the deposition of material as well as ► AFM to 
monitor the continuity of the membranes and the pres¬ 
ence of remaining vesicles. The vesicle fusion tech¬ 
nique occurs in distinct steps, namely, vesicle 
adsorption, rupture, and fusion on the solid support 
(Fig. 2). Interestingly, vesicle fusion is enhanced by 
scanning with the ► AFM tip, and its shape and mode 
of action have an important influence on the tip- 
induced fusion process (Richter et al. 2006). Further¬ 
more, electrostatic interactions, including the presence 
of calcium ions, vesicle size, lipid composition, and 
surface properties (mica vs silica supports), all have an 
influence on the vesicle interactions and fusion. After 
the initial adsorption of the vesicles onto the surface, 
interactions with an “active edge” of a bilayer already 
in place lead to fusion with additional vesicles 
(Fig. 2b). Overall, it seems important that the interac¬ 
tions of the membranes with the surface and with each 
other are sufficiently strong to resist small and transient 
forces but weak enough to allow reshaping and 
movements. 

The Support and Tethered Lipid Bilayers 

A high degree of variation exists for not only the shape 
but also the chemical nature of the support (► Solid- 
State NMR). Indeed, substrates can be made of silica- 
based material (glass) - treated or pure (Oliver and 
Parikh 2010), mica, or organic polymers such as 
PEEK or high density polyethylene (Salnikov et al. 
2010). Eor ► ATR-FTIR applications, many different 
materials with different refractive indices and different 
softness exist. Crystals or materials made from Ge, Si, 
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Supported Lipid Bilayers, 
Fig. 2 Illustrates the stepwise 
adsorption of vesicles onto the 
solid support (a), followed by 
the spreading of a first 
membrane layer including an 
“active edge” (b), the fusion 
with other vesicles, and the 
formation of a continuous 
bilayer (c). The color coding is 
the same as for Fig. 1 



KRS-5, ZnS, and ZnSe, and diamond are the most 
widely used (Ivanov et al. 2004). Notably, these sur¬ 
faces are in general not atomically flat, and the bilayers 
may be in contact with small pins or edges with large 
parts being suspended between such protrusions 
(Ivanov et al. 2004; Kiessling et al. 2009). Due to the 
close contact between the substrate and the supported 
bilayer, its choice can have considerable effects on the 
physical properties of the supported membranes. For 
example, it has been observed that lateral diffusion 
within the membrane (► Lipid Lateral Diffusion) is 
reduced to about half its value due to friction with the 
support (Loose and Schwille 2009). Although, for 
many applications, the 1-2-nm water-filled cleft 
between the membrane and the support is sufficient 
(Fig. la) (Kiessling et al. 2009), it is also possible that 
molecules carrying bulky extramembranous domains 
are immobilized due to the limited thickness of the 
hydration layer of the bilayer. Such close contacts 
may even cause the denaturation of proteins. As 
a solution to this problem, hydrated polymer cushions 
and hydrogels (Fig. Ic) have been introduced 
(Kiessling et al. 2009; Tanaka and Sackmann 2005; 


Loose and Schwille 2009) or tethered membranes that 
through spacers fixed to the surface allow the control 
of the distance between support and lipid mem¬ 
brane (Fig. Id) (Tanaka and Sackmann 2005). The 
tethering of membranes is performed by linking 
thiophospholipids to gold surfaces, through hydropho¬ 
bic interactions, photo-cross-linking of polymerizable 
lipids, as well as the coupling between biotin and 
streptavidin or between His-tags and metal-chelating 
surfaces (Tanaka and Sackmann 2005; Naumann and 
Knoll 2008). Extensions of the technique have been 
performed by tethering vesicles to supported lipid bila¬ 
yers (► Surface-Tethered Lipid Vesicles) using DNA, 
or streptavidin/biotin, and include the use of electric 
fields to create gradients of vesicle composition (Chan 
and Boxer 2007). 

The intervening cushion layers made of hydrophilic 
tethers, hydrogels, polymers, and biopolymers 
(Fig. Ic, d) decouple the substrate and the proximal 
lipid monolayer/bilayer from frictional and electro¬ 
static coupling which can otherwise cause undesired 
overall and leaflet-dependent compositional asymme¬ 
try not originally present in the parent vesicles (Oliver 
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and Parikh2010). On the other hand, surface properties 
such as wettability, topography, dimensionality, and 
surface electrostatics can be altered dynamically and in 
a controlled fashion thereby tailoring the membrane 
properties in terms of spatial molecular distribution, 
compositional heterogeneity (► Chemical Diversity of 
Lipids, ► Functional Roles of Lipids in Membranes), 
lateral tension (► Lipid Bilayer Lateral Pressure 
Profile), packing density, curvature, and membrane 
morphology (Oliver and Parikh 2010). For example, 
electric field gradients can be used to deliberately 
couple substrate properties with the bilayer in order 
to create lipid asymmetries. When combined with 
recent advances in patterning and surface modifica¬ 
tions, surface alterations at the micro- and nanolevel 
are obtained (Oliver and Parikh 2010). Various kinds 
of technologies including photolithography, soft 
lithography, robotic arraying, microcontact printing, 
dip-pen nanolithography, and colloidal lithography 
have been used to prepare the surfaces and to create 
nanopatterns of liposome arrays or nanostructured 
membranes (Bally et al. 2010; Tanaka and Sackmann 
2005). 

The size of such domains corresponds to those 
created by the cytoskeletal organization or those 
found in lipid domains thereby providing important 
tools to study cell signaling and membrane physiology 
(Kam 2009). Free-standing bilayers by tethered sup¬ 
ports are particularly useful for ion channel sensing 
applications (Bally et al. 2010; Kiessling et al. 2009), 
in particular when combined with nanoporous supports 
where openings of 20-100 nm result in good stability 
of the freely suspended membranes (Fig. lb) 
(Reimhult and Kumar 2008). By using lysine-coated 
glass surfaces, it is possible to rip off membrane 
patches from cells to create free-standing bilayers for 
channel measurements (Fig. lb) (Schmidt et al. 2000). 

Summary 

In conclusion, supported lipid bilayers can be prepared 
in a variety of shapes, composition, and dimensions for 
a wide range of applications in academia and industry. 
The technique is used in combination with surface 
modification and surface analytical tools to assemble 
devices or to perform biophysical studies in defined 
environments. Supported lipid bilayers are a field of 
ongoing investigations, often in collaboration with 


other domains of research and engineering, to explore 
new tools and strategies in basic and applied research 
as well as in applications. 

Cross-References 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Infrared Spectroscopy of Protein Structure 

► Triple Resonance NMR 

► Vibrational Circular Dichroism of Biopolymers 
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Surface Enhanced Infrared Absorption 
Spectroscopy 
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Synonyms 

Monolayer spectroscopy; SEIR spectroscopy; Surface- 
enhanced infrared spectroscopy 

Definition 

Surface Enhanced InfraRed Absorption Spectroscopy 
(SEIRAS) is a strictly surface sensitive technique 
that exploits the electromagnetic properties of 


nanostructured metal films to enhance the vibrational 
bands of a molecular adlayer. This entry describes the 
basic properties, the experimental realization, and the 
application of SEIRAS to biological molecules bound 
to the solid surface. 

Introduction 

In nanotechnology, protein-surface interactions are piv¬ 
otal for the assembly of interfacial protein constructs, 
such as sensors, activators, and other functional compo¬ 
nents at the biological/electronic junction. A detailed 
mechanistic understanding of the protein-surface inter¬ 
action would be enormously valuable to these fields, and 
the ability to tailor specific protein-surface interactions 
would benefit nanoscale materials and bio-nano- 
assembly technologies. Surface enhanced infrared 
absorption spectroscopy (SEIRAS) is a very promising 
approach for monolayer analysis on the molecular level. 
Vibrational bands of surface-bound molecules are 
enhanced by a factor of about 100 when adsorbed onto 
a nanoscale-roughened metal surface. SEIRAS is strictly 
surface sensitive and when combined to an appropriate 
surface treatment, the technique represents a versatile 
tool for the analysis of protein structure and function on 
the level of a single monolayer. 

Mechanism of SEIRA 

Two theories are discussed to account for the observed 
surface enhancement (Osawa 2002). While the local 
electromagnetic (EM) field enhancement through sur¬ 
face plasma polariton (SPP) excitation of the nano¬ 
structured metal surface is mostly used to explain 
surface enhancement particularly in the visible region 
(e.g., surface enhanced Raman scattering: SERS), the 
enhanced demodulation of the absorption by the metal/ 
adsorbate layer (mixed effective medium) plays an 
equally important role in SEIRAS. Although the EM 
field is enhanced in the near to mid-infrared region by 
SPP excitation, the extent of enhancement is calculated 
to be only about 10 if a randomly oriented nanoparticle 
gold film is used (Osawa et al. 1986; Gersten and 
Nitzan 1985). This theoretical conclusion, however, 
contradicts the experimentally determined enhance¬ 
ment factor which is about 100 (Osawa 2002). More¬ 
over, the strong SEIRA effect observed on thin films of 
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transition metal such as Pt, Ni, Ir, Pd, or Zn provided 
additional evidence against the sole EM contribution 
because SPP excitation of these transition metals is not 
possible in the visible to mid-lR range. Effective 
medium theory takes into account the roughness of 
the SEIRA active metal surface. As the wavelength 
of IR radiation is much longer (2,500-10,000 nm) then 
the size of the surface structures (50-200 nm), a com¬ 
posite medium of metal/adsorbed molecule/environ¬ 
ment is probed. Such an effective medium shows 
broad absorption in the near-to-mid IR range due 
to the rough morphology of the metal surface. When 
the composite medium is excited by IR radiation, the 
oscillating dipole of the surface-adsorbed molecule 
couples to the induced dipole of the metal. 
As a consequence, the dielectric function of the metal 
is altered and the absorbance of the effective medium 
is stronger at the vibrational frequency of the adsor¬ 
bate. The higher absorptivities and volume fraction of 
the metal layer lead to the dominant contribution of 
the effective medium absorption to the surface 
enhancement. 

It is worthwhile to note that the morphology of the 
metal nanostructure signihcantly contributes to the 
enhancement observed by SEIRAS. It was recently 
reported (Adato et al. 2009) that a periodic Au nano¬ 
rod array generated signal enhancements in the order 
of lO'^-lO^. Such sensitivity allows for the determina¬ 
tion of adsorbate concentrations down to the 
zeptomole (10”^’ M) level. 

SEIRA spectra often appear to be different in rela¬ 
tive band intensities as compared to transmission or 
ATR spectra. This difference is related to the surface 
selection rule (Osawa 2002) which states that the 
vibrational bands of molecules whose dipole moments 
are perpendicular to the surface plane, are strongly 
enhanced. Those vibrational modes which oscillate 
parallel to the surface plane are weak or completely 
missing in the SEIRA spectra. The surface selection 
rule is very useful in determining the surface orienta¬ 
tion of the adsorbed molecules by comparing the rela¬ 
tive intensities of two vibrational bands whose 
molecular dipole moment is orthogonal. 


Optical Configuration 

The most common optical arrangement in SEIRAS is 
the metal underlayer configuration comprising three 



bulk electrolyte 

membrane monolayer 

linker molecule 
gold layer 

internal reflection element 
IR beam 


Surface Enhanced Infrared Absorption Spectroscopy, 
Fig.1 Surface-enhanced IR absorption spectroscopy (SEIRAS) 
using internal reflection geometry 


layers: sample, metal him, and supporting substrate. 
Any IR-transparent solid material can be used as 
supporting substrate. Transmission geometry is applied 
when the thickness of the metal him is less than 10 nm. 
With internal rehection (Eig. 1), highly refractive 
materials as Si, Ge, or ZnSe are used as substrate. 
Here, the metal him thickness can be increased up 
to several hundred nanometers. In applications where 
IR-intransparent materials, for example, glass, 
polymers, carbon, and metal, are employed as 
substrate, external rehection is preferable and the 
metal overlayer conhguration is typically used (metal 
particle/sample/substrate). Here, the thin metal layer 
(ca. 4-7 nm) is directly evaporated onto the sample/ 
support surface. 


Surface Modification 

Signal enhancement in SEIRAS is restricted to the 
immediate vicinity of the surface and rapidly drops off 
within 10 nm. Due to this optical near-held effect, it is 
essential that the sample is tethered to the metal him 
surface. Molecules with thiol (—SH) and carboxyl 
(—COO”) moieties exhibit high affinity to metal sur¬ 
faces and spontaneously form strong bonds. Self- 
assembled monolayers (SAMs) of thiols are commonly 
used because of the stable adhesion via quasi-covalent 
bond formation between the sulfur and the metal. Such 
SAM is composed of three parts, namely, the linker SH 
group, the spacer, and the terminal group. The spacer 
can be varied in length to adjust the distance between the 
functional group and the surface. It is preferred to keep 
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the length of the spacer molecule as short as possible in 
order to obtain larger surface enhancement. The termi¬ 
nal functional group of the linker can be used to add 
new chemical properties to the metal surface. 
Replacement of the functional group from positively 
charged (e.g., -NHs"^) to negatively charged 
(e.g., -COO”), or from hydrophobic (e.g., -phenyl) to 
hydrophilic (e.g., -OH) introduces new functionalities 
to the metal surface. The terminal group can also be 
designed to specifically adhere proteins via tags that are 
inserted in or appended to the protein by recombinant 
techniques. Examples of such functional groups are 
those known from affinity chromatography, like Ni- 
NTA, biotin, and others. As soon as the surface is 
functionalized by such moieties, proteins equipped 
with a suitable tag, like a poly-His or a Strep tag, are 
specifically tethered to the solid surface. By these 
means, all proteins will be unidirectionally orientated 
with respect to the solid surface (Ataka et al. 2004). This 
approach is mandatory when membrane proteins are 
studied because this important class of proteins is 
strictly vectorial in functionality. Moreover, the orien¬ 
tation of the protein is conveniently changed by 
inserting the tag at different positions of the protein 
sequence. 


Functional Studies of Proteins by SEIRAS 

Chemical modification of the metal surface is crucial 
when biological samples are studied because tethered 
proteins may degrade when they are directly adsorbed 
to the metal surface. Even if the protein is not denatured, 
it may malfunction when the orientation or the distance 
to the metal surface are not properly controlled. 
Maintaining the protein function after adsorption is 
important for functional studies of the protein monolayer 
by application of external stimuli. A critical point in such 
experiments is that SEIRA enhances signals from both 
the chemically modified layer and the adsorbed protein. 

SEIRAS is combined to electrochemical setups 
when the plasmonic metal film is employed as elec¬ 
trode. This is advantageous for direct transfer of elec¬ 
trons from/to the adsorbed molecules and SEIRAS 
monitors structural changes during the redox reaction 
in situ. This approach was exploited for the functional 
studies of redox-driven proteins, such as cytochrome c 
(Ataka and Heberle 2003) or [NiEe] hydrogenase 


(Wisitruangsakul et al. 2009). This application is fur¬ 
ther extended to whole cell studies, such as Ee^*^ reduc¬ 
ing bacteria Geohacter sulfurreducens (Busalmen 
et al. 2008). This bacterium is enriched with c-type 
cytochrome in the outer membrane. Direct contact to 
the electrode leads to electron transfer to the mem¬ 
brane surface, and SEIRAS monitors the structural 
changes of the cellular membrane. 

Monolayer detection of SEIRAS is perfectly suited 
for functional studies of purified membrane proteins, 
which exist as a monolayer in the native cell 
membrane. Application of a voltage on the tethered 
membrane protein through the electrode is suitable 
model system of membrane potential. The effect of 
the membrane potential on the reaction mechanism 
has been demonstrated by studies of the photoreaction 
of sensory rhodopsin II (Jiang et al. 2008, 2010). The 
results of these studies open new ways to study regu¬ 
latory mechanisms of membrane proteins at the atomic 
level. 


Summary 

It has been almost 20 years since SEIRAS appears as 
trace surface analytical tool but the use of SEIRAS for 
functional studies of proteins has been introduced in 
2003 (Ataka and Heberle 2003). Since then, the number 
of such applications rapidly increased. Improvement in 
conditions for specific and effective tethering of proteins 
to the plasmonic substrate surface helped to the accep¬ 
tance and usefulness of the methodology. Orientational 
control is conveniently achieved with affinity tags which 
are routinely introduced by molecular genetic tech¬ 
niques. An oriented sample is crucial when the inherent 
vectoriality of membrane proteins is addressed. Many 
membrane proteins are asymmetric in their functionality 
as they translocate ions or solutes preferably in one 
direction or as their stimulus operates only from one 
side. A protein monolayer is mandatory for performing 
time-resolved studies of the dynamics of those molecu¬ 
lar machines that act as rectifiers, like those involved in 
vectorial electron transfer across the membrane. The 
presented methodology is also applicable to membrane 
proteins which are gated by a transmembrane voltage 
and, thus, opens a new avenue to study the mechanism 
of these medically and pharmaceutically important class 
of proteins on the atomic level. 
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Cross-References 

► Surface-Enhanced Raman Spectroscopy for 
Bioanalytics 

► Surface-Enhanced Resonance Raman Spectroscopy 
in Electron Transfer Studies 
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Synonyms 

Surface PEGylation; PEG; poly (Ethylene Glycol); 
PEG; Poly (Ethylene Oxide) 

Definition 

Surface passivation means modification of a surface 
for minimal unspecific interaction with proteins. 


Introduction 

Examination of biomolecules attached to surfaces with 
sensitivities down to the single molecule level raises 
demands on the surface that comprise minimized 
unspecific interaction between surface and protein, 
low intrinsic fluorescence of the coating material and 
the ability to immobilize biomolecules in their func¬ 
tional conformation. A number of studies and methods 
have been developed to achieve that aim, including 
physisorption of proteins such as serum albumin with 
and without subsequent cross-linking, plasma-poly¬ 
merization, and electrostatically driven adsorption 
like layer-by-layer adsorption of polyelectrolytes or 
monomolecular layers from grafted comb-copolymers 
such as Poly(L-lysine)-g-poly(ethylene glycol) 
(Michel et al. 2005). This entry will focus on the 
covalent immobilization of polymer chains at surfaces 
with a special focus on coating materials based on poly 
(ethylene oxide) (PEO), as PEO-coated surfaces dis¬ 
play exceptional protein resistance (Sofia et al. 1998; 
Michel et al. 2005; Gasteier et al. 2007; Unsworth et al. 
2008). 
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Surface Passivation for Singie Moiecule Detection, 
Fig. 1 Strategies for surface grafting of polymers, (a) “Grafting 
from” techniques take advantage of the dense grafting that can 
be achieved by the surface immobilization of low molecular 
weight initiators, followed by polymerization of polymer chains 
from the surface, (b) “Grafting to” techniques with linear chains 
rely on the grafting of ready polymer chains via one reactive end 
to a surface. While the variation of molecular weight is bigger, it 
is harder to achieve a high grafting density that results in 
a polymer brush. This may be achieved by grafting at critical 
solubility with minimal hydrodynamic radius per chain or by the 
use of grafted comb-copolymers, (c) Branched structures such as 


star polymers possess the advantage of a high number of func¬ 
tional groups per molecule. However, a monolayer of high 
molecular weight multi-arm stars results in a dense sphere-like 
structure with defects and an overall polymer segment density 
lower than for linear grafted chains. In contrast, the use of low 
molecular weight star molecules with fewer arms and reactive 
end groups that allow interpolymeric cross-linking results in 
a dense polymer network with high polymer segment density 
that can be achieved with easy preparation techniques. More¬ 
over, functional end groups that cannot take part in network 
formation due to steric constraints remain in the film and may 
be used for functionalization 


Strategies for Surface Grafting of Polymers 

The general strategy to render surfaces inert toward 
proteins is to introduce a coating layer that prevents 
protein adsorption either thermodynamically, so that 
attractive surface interactions are overcompensated by 
repulsive interactions with the layer, or at least kinet- 
ically by creating a free energy barrier of sufficient 
height that cannot be overcome on relevant time scales 
(Halperin 1999). Different strategies may be pursued 
to achieve such coatings (Fig. 1). The key demand for 
minimization of unspecific profein adsorption is the 
generation of a high polymer segment density on the 
surface (Szleifer and Carignano 2000). This can be 
achieved elegantly by immobilization of low molecu¬ 
lar weight initiators followed by polymerization of 
polymer chains from the surface. As the packing den¬ 
sity that can be achieved with the initiators is high, the 


resulting grafting density of the polymer chains is 
equally high. This method has extensively been used 
using radical polymerization techniques also for PEO- 
modified (meth-)acrylate monomers (Barbey et al. 
2009) but is not directly applicable to PEO. Alterna¬ 
tively, ready polymer chains may be grafted to sub¬ 
strates. This method is commonly used for PEO and 
will be addressed in more detail in the following 
paragraph. 

Surface Grafting of PEO and Protein 
Repellence 

Protein-repellant PEO coatings are frequently made 
from long, randomly coiling linear chains terminally 
anchored to the surface. Only in the brush regime the 
grafting density of linear chains is high enough for the 
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Surface Passivation for 
Single Molecule Detection, 
Fig. 2 Suiface coating with 
reactive end-functionalized 
star polymers, (a) Schematic 
of the chemical composition of 
the NCO-sP(EO-iter-PO) 
system and the cross-linking 
reaction induced by water. 

(b) Schematic of a cross- 
linked sP(EO-^?ar-PO) surface 
coating where all isocyanate 
groups are hydrolyzed and 
aminolyzed, resulting in free 
amino groups {green) and urea 
bridges between the stars 
(yellow), respectively 


NCO-sP{EO-sfaf-PO) 



Mn = 2 kDa / arm; m = 0.8; n = 0.2 



attached polymer chains to stretch out perpendicularly 
to the surface, thus avoiding unfavorable monomer- 
monomer interactions and maintaining optimal salva¬ 
tion (Fig. 1). Consequently, the grafted chains provide 
adequate coverage and thickness to form a very effec¬ 
tive steric barrier against protein adsorption. Grafting 
density and chain length are thus the two essential 
experimental control parameters by which the degree 
of protein resistance is governed. It has recently 
been shown that, for molecular weights between 600 
and 2,000 g/mol, a grafting density of 0.5 linear 
OH-terminal PEG chains/nm^ is the threshold for min¬ 
imal protein adsorption (Unsworth et al. 2008). 

Star-Shaped Molecules 

Star-shaped PEG molecules (star PEG) have a central 
core region from which the PEG arms extend. Due to 
this constraint, their density is higher than that of 
a linear chain, offering the opportunity to produce 
PEG surfaces with higher grafting density. Moreover, 


the ends of the arms are preferentially located near the 
periphery due to the steric constraints in the interior of 
the star. Therefore, the probability is increased for end- 
functionalized groups to bind to the surface. Star PEG 
systems thus appear an attractive choice to confer 
protein resistance to surfaces. Indeed, star PEGs with 
70 arms and a molecular weight of 5,200 g/mol per arm 
(Sofia et al. 1998), with 24 arms and a molecular 
weight of 9,700 g/mol per arm, as well as with 72 
arms and a molecular weight of 4,500 g/mol per arm 
(Irvine et al. 1998) have been reported to pack closely 
on the surface and efficiently reduce protein adsorp¬ 
tion. However, the efficiency is reduced for small 
proteins such as cytochrome c in the case of the 
star PEG molecules with higher number of arms and 
higher overall molecular weight. This is explained 
by the spherical shape of the high molecular weight 
PEG stars with rigid poly-divinylbenzene core 
that forms a packing of spheres on the surface so 
that gaps remain between the stars that appear suffi¬ 
ciently large for small proteins to reach the surface and 
adsorb to it. 
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Surface Passivation for 
Single Molecule Detection, 
Fig. 3 Quality of swface 
passivation. Nonspecific 
protein adsorption on cross- 
linked sP(EO-stat-PO) films is 
compared to that on grafted 
linear PEO 5000 and 
physisorbed BSA using single 
molecules of Alexa Fluor 546 
labeled RNase H (a, density of 
spots) and Cy3-ATT0647N 
labeled nucleosomes 
(b, fluorescence images and 
normalized fluorescence 
signal intensities). 
Significantly higher levels of 
unspecific adsorption were 
observed on BSA than on PEO 
surfaces. sP(EO-itor-PO) 
surfaces showed negligible 
unspecific adsorption 
(Figure partially reprinted 
with permission from Groll 
et al. 2004, copyright 
American Chemical Society 
and Koopmans et al. 2008, 
copyright Wiley-VCH Verlag 
GmbH& Co. KGaA) 
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The NCO-sP(EO-sfot-PO) System 

Based on the results discussed above, protein-resistant 
properties of star PEO coatings may be improved by 
using smaller molecules with a lower amount of arms 
and a more flexible core. In addition, functionalization 
of the stars with reactive groups at the ends of the arms 
that enable intermolecular cross-linking will result in 
a more homogeneous lateral PEO density profile and 
higher surface coverage. This will also be of advantage 
for functionalization with molecules that may be 
introduced through reaction with the end groups. 
One system that meets these criteria is the NCO-sP 
(EO-5taf-PO)-coating system that is based on six-arm, 
star-shaped molecules with a sorbitol core, a backbone 
of statistically copolymerized ethylene oxide and 
propylene oxide in a ratio of 4:1, a molecular mass of 
2,000 g/mol per arm and reactive isocyanate end 
groups. The NCO-sP(EO-5far-PO) molecules can be 
dissolved in water and coatings can be applied from 


aqueous solutions. When the NCO-sP(EO-5far-PO) 
material is dissolved in water, hydrolysis of the isocy¬ 
anate groups at neutral pH leads to formation of amine 
groups that directly react with unreacted isocyanate 
groups to form urea bridges between the NCO-sP 
(EO-yfflt-PO) molecules (Fig. 2). Since the kinetics of 
amine addition to isocyanate is much faster than 
hydrolysis, urea bridge formation occurs preferentially 
until steric restrictions significantly lower the reaction 
probability. The aqueous NCO-sP(EO-i'taf-PO) solu¬ 
tion can be used for coating surfaces either by simple 
dip-, spin-, or spray-coating. After coating, the system 
requires at least 12 h for completion of the cross- 
linking reaction within the layer. During this time, all 
isocyanate groups hydrolyze and then either react with 
other isocyanate groups to form urea bridges or remain 
as free amino groups (Gasteier et al. 2007). These 
layers have proven extremely efficient in minimizing 
protein adsorption under single molecule fluorescence 
conditions (Fig. 3). 
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Surface Passivation for Single Molecule Detection, 
Fig. 4 Schematic illustration of the FRET technique used to 
monitor structural information of immobilized proteins. In the 
folded state, the dye moleeules, which are placed at specific sites 
on the protein, are close together. Upon exeitation of the 
donor dye (green), transfer efficiency of the energy to the accep¬ 
tor dye (red) is high. If the protein is unfolded, the distance 
between the dyes inereases and, consequently, the energy 
transfer efficieney decreases. The distanee dependence of the 
FRET efficiency can be used to infer details of the protein 
structure. The pictures show exemplary scanning confocal fluo¬ 
rescence microscopy images of RNase FI proteins immobilized 
on NCO-sP(EO-ifa?-PO)coated surfaces and the resulting 


single-molecule FRET histograms of RNase FI immobilized on 
NCO-sP(EO-itof-PO) and linear PEO 5000 coated surfaces under 
buffer conditions, followed by 6 M guanidinium chloride 
(GDmCl) and then re-immersion in buffer. The green fraction 
centered at zero FRET efficiency represents molecules that have 
no acceptor molecule due to incomplete labeling or dye 
photobleaching. The red fraction at high (~0.9) FRET efficiency 
depicts molecules in their folded state. The yellow fraction at 
low-to-intermediate FRET efficiency (^^0.4) represents 
molecules in their unfolded state (Figure partially reprinted 
with permission from Groll et al. 2004, copyright American 
Chemical Society) 
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Biofunctionalization for Single Molecule 
Experiments 

One particular advantage of the NCO-sP(EO-stof-PO) 
system is the change in reactivity from isocyanate 
groups, which are reactive toward nucleophilic groups 
such as alcohols, amines, and thiols, to amine groups 
during the layer preparation and curing of the coating. 
Addition of water-soluble compounds that bear nucle¬ 
ophilic groups to the aqueous NCO-sP(EO-5taf-PO) 
solution before coating thus results in covalent attach¬ 
ment of these molecules to the coating. After complete 
hydrolysis of the isocyanate groups, the compounds 
that are covalently bound to the layers are embedded in 
a coating that inhibits nonspecific interactions with 
proteins and cells, so that the immobilized molecules 
can interact specifically. This feature is achieved in 
a one-step layer preparation without the use of further 
chemical blocking-agents. When biocytin is intro¬ 
duced into the coatings in this way, surfaces can be 
produced where biotinylated proteins may be specifi¬ 
cally immobilized via streptavidin. Ribonuclease H, 
labeled with a ERET donor-acceptor pair of fluorescent 
dyes, has been immobilized on such surfaces in this 
way. Denaturation/renaturation of the immobilized 
protein was followed with fluorescence microscopy, 
and compared to biotinylated linear grafted PEO as 
well as biotinylated adsorbed BSA (Eig. 4). The results 
show that only on the sP(EO-^raf-PO) surfaces 
RNase H could be unfolded and refolded repeatedly. 
Most importantly, they reveal that the energy of fold¬ 
ing is essentially identical to that in solution (Groll 
et al. 2004). This demonstrates that very delicate struc¬ 
tural changes of proteins may be followed on such 
coatings. 


Cross-References 

► Atomic Force Microscopy (AFM) for Topography 
and Recognition Imaging at Single Molecule 
Level 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Surface-Tethered Lipid Vesicles 

► Total Internal Reflection Fluorescence Microscopy 
for Single-Molecule Studies 


References 

Barbey R, Lavanant L, Paripovic D, Schiiwer N, Sugnaux C, 
Tugulu S, Klok H-A. Polymer brushes via surface-initiated 
controlled radical polymerization: synthesis, characteriza¬ 
tion, properties, and applications. Chem Rev. 2009; 109(11): 
5437-527. 

Gasteier P, Reska A, Schulte P, Salber J, Offenhaeusser A, 
Moeller M, Groll J. Surface grafting of PEO-based star¬ 
shaped molecules for bioanalytical and biomedical applica¬ 
tions. Macromol Biosci. 2007;7:1010-23. 

Groll J, Amirgoulova E, Ameringer T, Heyes CD, Rocker C, 
Nienhaus GU, Moller M. Biofunctionalized, ultrathin coat¬ 
ings of crosslinked star-shaped poly(ethylene oxide) allow 
reversible folding of immobilized proteins. J Am Chem 
Soc. 2004;126:4234-9. 

Halperin A. Polymer brushes that resist adsorption of model 
proteins: design parameters. Langmuir. 1999;15:2525-33. 

Irvine DJ, Mayes AM, Satija KS, Barker GJ, Soha-Allgor SJ, 
Griffith LG. Comparison of tethered star and linear poly 
(ethylene oxide) for control of biomaterials surface proper¬ 
ties. Biomed Mater Res. 1998;40:498-509. 

Koopmans WJA, Schmidt T, van Noort J. Nucleosome immobi¬ 
lization strategies for single-pair FRET microscopy. Chem 
Phys Chem. 2008;9:2002-9. 

Michel R, Pasche S, Textor M, Castner DG. Influence of PEG 
architecture on protein adsorption and conformation. Lang¬ 
muir. 2005;21(26):12327-32. 

Sofia SJ, Premnath V, Merrill EW. Poly(ethylene oxide) grafted 
to silicon surfaces: grafting density and protein adsorption. 
Macromolecules. 1998;31:5059-70. 

Szleifer I, Carignano MA. Tethered polymer layers: phase tran¬ 
sitions and reduction of protein adsorption. Macromol Rapid 
Commun. 2000;21:423M8. 

Unsworth LD, Sheardown H, Brash JL. Protein-resistant poly 
(ethylene oxide)-grafted surfaces: chain density-dependent 
multiple mechanisms of action. Langmuir. 2008;24:1924-9. 


Surface PEGylation 

► Surface Passivation for Single Molecule Detection 


Surface Plasmon Resonance 

► Solid-Phase Detection Techniques 


Surface-Enhanced Infrared Spectroscopy 

► Surface Enhanced Infrared Absorption Spectroscopy 






Surface-Enhanced Raman Spectroscopy for Bioanalytics 


2537 


s 


Surface-Enhanced Raman Spectroscopy 
for Bioanalytics 

Duncan Graham, Karen Faulds and Jennifer 
A. Dougan 

Centre for Molecular Nanometrology WestCHEM, 
Department of Pure and Applied Chemistry, 
University of Strathclyde, Glasgow, Scotland, UK 

Synonyms 

DNA; SERRS; SERS 

Definitions 

Surface enhanced Raman spectroscopy for 
bioanalytics is the exploitation of the spectroscopic 
technique, SERS, for bioanalytical applications. Two 
examples of bioanalytical target classes which have 
benefited from SERS detection strategies are nucleic 
acid and enzymatic detection. 

DNA Detection by Surface-Enhanced Raman 
Scattering (SERS) 

The sensitive and selective detection of DNA is of 
fundamental importance to the improvement of ana¬ 
lytical and therapeutic methods for human health. 
Surface-enhanced (resonance) Raman scattering 
(SE(R)RS) spectroscopy has been applied to the 
sensitive and selective detection of DNA (Hering 
et al. 2008; Schlucker 2009). Arguably the greatest 
advantage that SERS spectroscopy offers as an analyt¬ 
ical tool is that the spectra provide distinct vibrational 
information about the molecule being interrogated, 
making the technique highly suitable for the instanta¬ 
neous detection of several analytes, that is, 
multiplexed analysis (Faulds et al. 2007b)(Cao et al. 
2002; Sun et al. 2007). While Raman scattering 
of DNA and RNA can be obtained directly from oli¬ 
gonucleotides, to achieve sequence-specihc analysis 
discriminatory labeling of the oligonucleotide is 
required in order to distinguish between 
specroscopically similar sequences (Barhoumi et al. 


2010). The inclusion of an appropriate dye label also 
makes use of a resonance enhancement contribution to 
the Raman effect from the label in addition to the 
surface enhancement effect and allows for multiplexed 
detection. Typically the scattering cross sections of the 
labels are much higher than the bases of the nucleic 
acids and as a result give greater sensitivity and dis¬ 
crimination as the signal relates to a specifically 
labeled sequence and not the general presence of 
a nucleic acid. Although fluorescence (coupled with 
the polymerase chain reaction (PCR)) remains the 
most popular technique for molecular diagnostics, 
fluorescent signals are spectrally broad (tens to hun¬ 
dreds of nanometers) with little molecularly specific 
information, making them less suitable for multiplexed 
analysis (Ni et al. 1999). In Raman spectroscopy, fluo¬ 
rescence is an unwanted effect but the ability of the 
metal surfaces used in SERRS to quench fluorescence 
allows a wide range of dyes which have been devel¬ 
oped for fluorescent labeling of DNA to be exploited in 
SERRS detection strategies (Table 1) (Graham and 
Eaulds 2008). Quantitative responses can be achieved 
when employing a wide range of commercially avail¬ 
able fluorescent dyes, such as the fluorescein-based 
dyes; FAM, TET, HEX, Yakima Yellow, rhodamine- 
based dyes; TAMRA, R6G, cyanine dyes; Cy3, Cy3.5, 
Cy5, Cy5.5, Cy7; and BODIPY TR-X and BODIPY 
650 (Stokes et al. 2007b). As an alternative, custom 
synthesized SERRS labels for oligonucleotides, such 
as benzotriazolyl azo dyes, have been developed with 
surface-seeking groups and chromophoric regions for 
ease of adsorption and SERRS identification (Graham 
et al. 2001). Nevertheless, the scale of research into the 
fluorescent modification of DNA means that in practi¬ 
cal terms the simplest approach of obtaining SERRS- 
active probes is to exploit the commercially available 
chemistry. It should be noted that when using nega¬ 
tively charged dyes, such as FAM, it is beneficial to the 
sensitivity of the SERRS response to incorporate 
propargylamine-modifled bases into the probe design. 
These propargylamino residues should be incorporated 
close to the dye label and studies show that as few as 
two units are required to improve sensitivity (Faulds 
et al. 2007a). At physiological pH the amino residues 
generate positively charged sites in the probe which 
improves the adsorption of the oligonucleotide to the 
(negative) colloidal surface resulting in better signal- 
to-noise ratio and lower detection limits. 





2538 


Surface-Enhanced Raman Spectroscopy for Bioanalytics 




•S .9 


•O "O 


3 3 
C 

§ S 

M £■ 

(U 

> . 
cS ' 

-c ;9' 

t - 

Ic 'T 

^ in 

c« <N 
(U — 
>> — 
^ 6 
~0 m 
0 ) — 
.a iK 

0<3 Q 

£ O 
•B CNJ 
c ^ 

^ >> 

£ 15 

2 3 

C/D 

3 

T3 ^ 
3 (L> 

I 


o 


S -rt 

■£ S 

> tL- 

^ "O 
"S 

'o 2 

fc- c/3 

S I 

o S 

O 'S 

•3^ s 
■s § 

i^-( CT' 
® X) 
O C 
£ ^ 
o >. 
00 .ts 


£ .S 


lU 


J -I 

>~k CS 

b£> O 

^ s 

£“ T3 

2 O 

(i g 


P3 

a o 
g 1 

°f I 

U a 


U 


o 

03 

<D 


o 

Z 



<D 

>> 


Q 

H 

P3 


m 












BT Dye Custom No Yes/No 

synthesized 


Surface-Enhanced Raman Spectroscopy for Bioanalytics 


2539 


s 


o 

c/3 

<D 


o 

Z 



<D 


Q 

F- 

CQ 


r- 


oo 


S 







2540 


Surface-Enhanced Raman Spectroscopy for Bioanalytics 




Surface-Enhanced Raman 
Spectroscopy for 
Bioanalytics, 

Fig. 1 Representation of the 
action of spermine 
hydrochloride in facilitating 
the adsorption of polyanionic 
DNA on the negatively 
charged “citrate” surface layer 
of colloids ((Graham and 
Faulds 2008) - Reproduced by 
permission of The Royal 
Society of Chemistry) 



To achieve meaningful SERRS spectra of dye- 
labeled oligonucleotides not only must a dye be 
selected and matched with an appropriate excitation 
source, but a suitable surface also must be used which 
complements not only the dye but also the wavelength 
of excitation (Stokes et al. 2007b). Nanoparticles of the 
coinage metals (typically gold and silver) are the most 
commonly used surface enhancement substrates for 
SERRS detection strategies (Schlucker 2009). Colloi¬ 
dal nanoparticles are often prepared via reduction of 
the corresponding metal salt leaving a surface layer, 
most commonly citrate or EDTA (Heard et al. 1983). 
These surface layers confer a net negative charge to the 
nanoparticle colloid. 

This, coupled with the polyanionic nature of the 
DNA backbone, requires some manipulation to pro¬ 
mote the required surface adsorption of the probe spe¬ 
cies and allow the generation of a meaningful SERRS 
spectra. The most widely applicable approach to over¬ 
come the issue of electrostatic repulsions is to create 
electrostatic layers. This involves the addition of spe¬ 
cies, such as poly-L-lysine, metal ions, or spermine 
hydrochloride, that form a layer of positive charge 
between the negative colloid surface and the 
polyanionic DNA (Fig. 1). It has been shown that 
spermine hydrochloride not only gives the maximum 
surface adsorption of DNA onto the nanoparticle sur¬ 
face, but simultaneously provides a mechanism for 
aggregation of the nanoparticles (Faulds et al. 2004). 
This is a convenient duality of purpose since achieving 
sensitive SERRS detection requires aggregation of the 
colloid. The aggregation process is necessary to gen¬ 
erate “hot spots” at the interstices of the aggregated 
particles from which the SERRS response is greatest 
(Michaels et al. 2000). The electrostatic layering 
method is applicable across a wide variety of dye 


labeled probes. Quantitative SERRS of dye-labeled 
oligonucleotides can thus be achieved at concentra¬ 
tions below monolayer coverage of the nanoparticle 
surfaces and, in some circumstances, is more sensitive 
than fluorescence (Table 2). 

The multiplexing capability of these direct- 
detection systems has been exemplified by 
a separation-free 5-plex - the simultaneous identifica¬ 
tion of five different dye-labeled oligonucleotides in 
one mixture by use of two excitation wavelengths 
(Fig. 2) (Faulds et al. 2007b). 

Further, a SERRS 6-plex has been shown where 
post-analysis chemometrics identified the absence or 
presence of each labeled oligonucleotide in a mixture 
of other similarly labeled probes and allowed the iden¬ 
tification of the labels present in a complex mixture 
(Faulds et al. 2008). 

Of course, in a diagnostically useful situation the 
target of an assay would not be labeled. Instead, a 
biological recognition event that generates a specific 
complex between labeled probe and target would take 
place to generate a unique SERS signal corresponding 
to the presence of the target nucleic acid. 

An early approach to DNA detection via SERRS 
was achieved by using PCR-based assays. 
The identification of the cystic fibrosis transmembrane 
conductance regulator (CFTR) gene was achieved in 
a SERRS 2-plex (Graham et al. 2002). A Raman 
reporter probe was attached to the primer and incorpo¬ 
rated into the amplified DNA as part of the PCR. Post- 
PCR, unincorporated primers were removed and the 
corresponding SERRS signal obtained from only the 
PCR product. Alternatively, non-dye-labeled PCR can 
be run with a biotinylated primer strand. Bead-based 
separation and subsequent hybridization to a short 
Raman active probe allowed the identification of 
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Surface-Enhanced Raman Spectroscopy for Bioanalytics, Table 2 Limits of detection shown for a range of labeled oligonu¬ 
cleotides (mean of the blank plus 3 x StDev of the blank divided by the slope of the calibration curve). Where no value is shown the 
limit of detection was greater than 1 x 10~^ M. N. M. = not measured ((Graham and Faulds 2008) - Reproduced by permission of The 
Royal Society of Chemistry) 


Dye label 

/tmax/nrn 

/lex- 514.5 nm 
SilverimoX dm^^ 

/lex: 632.8 nm 
SilverfmoX dm~ 

GoW/mol dm 

2ex: 785 nm 
SilverimoX dm-^ 

Fluorescence 
LoD/mol dm-^ 

FAM 

492 

2.7 X 10^'^ 

2.0 X 10-“ 



6.5 X 10-® 

TET 

521 

1.6 X 10^“ 

2.0 X 10-“ 



2.6 X 10-® 

R6G 

524 

1.2 X 10^'^ 

1.1 X 10-'“ 

— 

— 

3.5 X 10-® 

Yakima yellow 

526 

1.7 X 10-" 

— 

— 

— 

N.M. 

HEX 

535 

7.8 X 10-'^ 

1.2 X 10-“ 

— 

— 

1.1 X 10-® 

Cy3 

552 

2.6 X 10-'“ 

1.5 X 10-'“ 

— 

— 

4.6 X 10““ 

TAMRA 

565 

3.5 X 10-'^ 

1.8 X 10-'“ 



1.1 X 10-® 

Cy3.5 

581 

2.5 X 10-" 

7.5 X 10-'^ 

2.5 X 10-'“ 


N.M. 

ROX 

585 

8.1 X 10-" 

3.3 X 10-" 

1.1 X 10-“ 


2.3 X 10-® 

BODIPY TR-X 

588 

1.3 X 10-'“ 

7.9 X 10-'^ 

4.9 X 10-'“ 


N.M. 

PtcCo 

625 


3.2 X 10-" 

N.M. 


Not fluorescent 

PtcAl 

640 


2.8 X 10-" 

N.M. 


Not fluorescent 

Cy5 

643 


8.3 X 10-" 

1.7 X 10-“ 


3.1 X 10““ 

PtcZn 

680 

1.4 X 10-'“ 

3.2 X 10-" 

N.M. 


Not fluorescent 

Cy5.5 

683 


5.2 X 10-'^ 

7.3 X 10-" 


N.M. 

Cy7 

748 




5.8 X 10-" 

N.M. 

a 


514 nm 

b 


632 nm 





Surface-Enhanced Raman Spectroscopy for Bioanalytics, Fig. 2 SERRS Spectra of dye-labeled oligonucleotides 
(1.89 X 10^®) using (a) 514.5 nm or (b) 632.8 nm excitation (Reproduced with permission from Faulds et al. 2007b) 
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Surface-Enhanced Raman 
Spectroscopy for 
Bioanalytics, 

Fig. 3 Representation of the 
steps involved in the bead- 
based assay for the SERRS 
detection of Chlamydia 
trachomatis (Reprinted with 
permission from Monaghan 
et al. 2007) 


1. Sample DNA 



2. PCR using a biotinylated primer 



5. Capture biotinylated hybridized 
complex on streptavidin coated 
beads 



6. Heat to release detection probe 




3. Amplified biotinylatd target after 7. Detect detection probe using 

PCR reaction SERRS 



4. Hybridize SERRS labelled 
detection probe 


Chlamydia trachomatis PCR product (Monaghan et al. 
2007) (Fig. 3) and, indeed, the extension of this 
methodology to a low volume, microfluidics-hased, 
Lah-on-a-Chip detection format allowed the 
multiplexed analysis of three E. coli genes in 
a microfluidics system. 

Molecular beacon-based SERRS assays utilize 
a hybridization-based structural change of an oligonu¬ 
cleotide to provide a detectable change in SERRS 
response. One method used was to attach the beacon 
to a nanoparticle by a thiol modification. The inclusion 
of a SERRS label at the 3 ’ terminus produced a SERRS 
spectrum when the beacon was in the “closed” hairpin 
loop structure. Hybridization with target DNA opens 
the loop, producing a linear duplex with the 3’-dye 
label removed from the enhancing surface and 
a reduction, or elimination, of the SERRS signal 
(Wang and Vo-Dinh 2009). This “on to off’ SERRS 
system was extended to the multiplexed detection 
of breast cancer biomarkers. As an alternative 
method a benzotriazolyl SERRS linker was used to 
immobilize a beacon on a nanoparticle surface with 
a conventional fluorophore, EAM, at the other end of 


the oligonucleotide. In this format, hybridization with 
target DNA leads to the reduction of EAM-based 
SERRS and an increase in EAM fluorescence allowing 
the detection of the target sequence (Eig. 4) (Eaulds et 
al. 2006). 

In a novel approach, the observation that single- 
stranded DNA adsorbs more readily to nanoparticle 
surfaces than the corresponding duplex has been used 
in the development of an assay to detect the genes of 
bacteria responsible for hospital-acquired infections 
(HAIs). In this case hybridization of a labeled probe, 
in the presence of unlabeled target DNA, results in a 
low SERRS signal due to the poor adsorption of 
dsDNA, whereas controls run in the absence of target 
produce strong SERRS signals (Eig. 5). Locked 
Nucleic Acid (LNA)-based probes improved discrim¬ 
ination, allowing three different genes to be detected 
simultaneously in a closed tube assay (MacAskill et al. 
2009). 

Harnessing the organizing ability of DNA to aggre¬ 
gate nanoparticles provides an avenue through which 
to “switch on” the SERRS response by generating the 
“hotspots” that allow for SERRS detection of a DNA 
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Surface-Enhanced Raman 
Spectroscopy for 
Bioanalytics, 

Fig. 4 Representation of 
SERRS beacon approach for 
DNA detection ((Faulds et al. 
2006) - Reproduced by 
permission of The Royal 
Society of Chemistry) 
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Surface-Enhanced Raman Spectroscopy for Bioanaiytics, 
Fig. 5 SERRS assay based upon the principle that ssDNA 
adsorbs with greater affinity than dsDNA, used to 


simultaneously and homogenously detect three gene sequences 
for HAIs (Reprinted with permission from MacAskill et al. 2009, 
© 2011, American Chemical Society) 


target. In order to target unlabelled DNA, two batches 
of silver nanoparticles were dye-coded and 
functionalized with noncomplementary probe 
sequences of thiol-modified DNA. The inclusion of 


a dye on the nanoparticle surface means that hybridi¬ 
zation-induced aggregation with a target complemen¬ 
tary to both probes allowed sequence specific target 
detection by SERRS (Fig. 6) (Graham et al. 2008). 
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Surface-Enhanced Raman Spectroscopy for Bioanalytics, 
Fig. 6 The aggregation of dye-coated, oligonucleotide- 
modified nanoparticles by hybridization with an unlabelled 
target allowing detection by SERRS. Left, isolated 



nanoparticles; middle, aggregated nanoparticles in the presence 
of DNA; right, SERS from the aggregated nanoparticles and 
then intensity over time for match and mismatch (Reproduced 
with permission from Graham et al. 2008) 


Not all assays have been conducted in the solution 
phase; SERRS was used to detect the BRCAl breast 
cancer gene using a silver metal film as the enhancing 
surface (Allain and Vo-Dinh 2002). A rhodamine B 
labeled oligonucleotide was immobilized, by hybridi¬ 
zation with a capture probe, onto a silver metal film 
and observed by the characteristic SERRS response of 
the label. Thioctic acid-modified oligonucleotide cap¬ 
ture probes were immobilized on a gold nanostructured 
surface to detect, after hybridization, a BODIPY- 
labeled oligonucleotide (Stokes et al. 2007a). 

More importantly, unmodified target DNA has been 
detected on surfaces which can be interrogated for 
SERRS in a high throughput manner. Sandwich 
hybridization between an immobilized capture probe, 
unmodified target, and dye-labeled detection probe 
allows the identification of biologically useful target 
sequences (Fig. 7). Mirkin detected various disease¬ 
coding sequences by SERRS, including (amongst 
others): hepatitis A virus, human immunodeficiency 
virus (HIV), Ebola virus, and variola virus (smallpox) 
in a scanometric chip format (Cao et al. 2002). The 
reporter probe contained a SERRS-active dye and gold 
nanoparticle at the terminus. Colloid aggregation 
could not be achieved in order to induce the “hot 
spots” necessary for SERRS; therefore, silver deposi¬ 
tion was carried out across the surface which provided 
signal enhancement. Making use of a plasmonically 
tuned nanostructured gold surface eliminates the 
need for a SERRS “development” step. Stokes and 
coworkers immobilized, in a controlled manner 
via dip-pen nanolithography (DPN), a thioctic 
acid-modified oligonucleotide capture sequence and 
by sandwich hybridization detected an unlabelled 


1. '\/\i (Target DNA) 








Surface-Enhanced Raman Spectroscopy for Bioanalytics, 
Fig. 7 Scanometric SERRS analysis of immobilized probes for 
gene-specific detection by sandwich hybridization. The target 
DNA links the support and dye-labeled detection probe allowing 
biochemically specific analysis ((Cao et al. 2002) Reprinted with 
permission from AAAS) 


target by use of a Cy5-labeled reporter probe (Stokes 
et al. 2008). 


Enzyme Analysis by SERS 

In addition to SERRS detection of DNA sequences, 
other biomolecules such as enzymes can be accurately 
analyzed using SERRS. SERS can be directly obtained 
from enzymes themselves; however, of greater use is 
the ability of SERRS to detect the presence or activity 
of an enzyme (Larmour et al. 2010). The basic concept 
of enzymatic SERRS analysis is the conversion, by 
enzyme action, of a SERRS-inactive precursor into 
a strongly SERRS-active product. There are two 
main ways that this occurs. The first involves the 
production of a dye as a result of enzymatic action 
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Surface-Enhanced Raman 
Spectroscopy for 
Bioanalytics, 

Fig. 8 SERRS-based ELISA 
based upon the generation of 
SERRS-active azoaniline by 
peroxidase (POD) (Reprinted 
with permission from (Dou 
etal. 1997), © 2011, 
American Chemical Society) 



3000 2500 2000 1500 1000 500 
Wavenumber (cm“^) 


upon a substrate and subsequent SERRS detection. 
This has been used to a great effect in immunoassays. 
SERRS was applied to a standard enzyme-linked 
immunosorbent assay (ELISA) system in which 
the secondary antibody is labeled with peroxidase. 
The addition of the SERRS-inactive enzyme sub¬ 
strates, o-phenylenediamine and hydrogen peroxide, 
result in their conversion to the SERRS-active dye 
azoaniline only where the secondary antibody (and 
by extension antigen and enzyme) is present with 
resulting sensitivities in the ng ml”' range (Eig. 8) 
(Duo et al. 1997). Another SERRS-based ELISA sys¬ 
tem was developed for the detection of prostate- 
specific antigen (PSA) which is a biomarker for the 
detection of prostate cancer. By use of horseradish 
peroxidase, the substrates 2,2’-azino-bis(3-ethylben- 
zothiazoline-6-sulfonic acid) (ABTS) and hydrogen 
peroxide were converted into a green, oxidized product 
with an absorption maxima at 420 nm. The detection of 
this species by SERRS using gold nanoparticles and 
a standard ELISA kit gave detection limits in the pg 
ml”* range which is a significant improvement upon 
the accepted “decisive” limit of ng ml”* (Stevenson 
et al. 2009). The use of alkaline phosphatase to convert 
5-bromo-4-chloro-3-indolyl phosphate (BCIP) into an 
insoluble SERRS-active BCI dimer was exploited in 
an ELISA format to detect a C-reactive protein (CRP) 
by use of an alkaline phosphatase modified anti-CRP 
antibody (Campbell et al. 2008). In these cases, the 


production of the SERRS-active dye as a result of 
enzyme action was used to infer the presence of an 
antigen by use of the sandwich-based ELISA. 

A particularly elegant method of detecting enzyme 
activity/presence is the use of “masked” surface¬ 
seeking dyes. The enzymatic substrate is prepared by 
the generation of dyes with their surface-seeking moi¬ 
ety masked by an enzyme substrate. In the presence of 
the enzyme, the substrate is acted upon and the mask is 
cleaved from the dye thereby permitting adsorption of 
the dye to a suitable SERRS surface (Eig. 9). 
Benzotriazole-based dyes are particularly suited for 
this purpose. Benzotriazole exhibits a strong adsorp¬ 
tion affinity for silver nanoparticles prepared by the 
citrate reduction method, displacing the citrate surface 
layer, and producing strong SERRS signals. Better still 
was the development of masked 8-hydroxyquinoline- 
based dyes. The limits of detection were improved 
100-fold with respect to benzotrazolyl analogues 
(Ingram et al. 2007). A large range of enzymes have 
been studied by this method, including lipases. Pseu¬ 
domonas cepacia lipase was used due to its ability to 
hydrolyse 3-phenylbutyric esters. Upon addition to 
a benzotriazolyl azo modified through the surface¬ 
binding nitrogen by a 3-phenylbutyric ester, the 
enzyme activity released the unmasked dye producing 
strong SERRS signals (Fig. 10) (Moore et al. 2004). 
These enzymatic systems are particularly sensitive and 
show remarkable enantioselectivity with the enzyme 
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Surface-Enhanced Raman 
Spectroscopy for 
Bioanalytics, 

Fig. 9 Schematic 
representation of the 
“masked” dye technique for 
enzymatic SERRS analysis, 
where an enzyme-cleavable 
mask is applied to a surface¬ 
seeking dye, such as 
a benzotriazolyl or 
8-hydroxyquinoline-based 
azo. Only upon enzyme 
activity will the mask be 
cleaved and a SERRS signal 
obtained ((Graham D et al. 
Chem Commun. 2006; 
4363^371) - Reprodueed by 
permission of The Royal 
Society of Chemistry) 
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Surface-Enhanced Raman Spectroscopy for Bioanalytics, benzotriazolyl azo dye producing a highly SERRS-active prod 
Fig. 10 Time study of the enzyme action of Pseudomonas uct (Reproduced with permission from Moore et al. 2004) 
cepacia lipase upon a 3-phenylbutyric ester masked 
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acting preferably upon one substrate over the other. 
Other enzymes, in addition to the lipases have been 
studied, including the esterases, phosphatases, and 
galactosidases. 

Cross-References 

► Infrared Spectroscopy of DNA 

► Raman Spectroscopic Instrumentation, 
Experimental Considerations 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 

► Surface-Enhanced Infrared Spectroscopy 

► Surface-Enhanced Resonance Raman Spectroscopy 
in Electron Transfer Studies 
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Surface-Enhanced Resonance Raman 
Spectroscopy in Electron Transfer 
Studies 

Peter Hildebrandt 

Institute for Chemistry, Technical University 
of Berlin, Berlin, Germany 

Definition 

Redox proteins can be immobilized on biocompatibly 
coated metal electrodes. These devices allow probing 
electron transfer processes of the immobilized protein 
by stationary and time-resolved surface enhanced 
resonance Raman spectroscopic techniques. Thus it is 
possible to disentangle the interfacial redox processes 
in terms of electron tunneling and protein dynamics. 

Introduction 

Molecules immobilized on nanostructured Ag or Au 
surfaces experience an amplification of their Raman 
scattering by 4-5 orders of magnitude. This surface- 
enhanced Raman (SER) effect is due to the coupling of 
the radiation field with the surface plasmons of the 
metal (Kneipp et al. 2006). When the excitation line 
matches the resonance conditions of both the surface 
plasmons of the metal and an electronic transition of 
the adsorbate, the SER effect and the molecular 
resonance Raman (RR) effect are combined (surface- 
enhanced resonance Raman - SERR) resulting in 
a high sensitivity approaching the level required for 
single-molecule detection. 
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Cotton et al. were the first to apply this technique to 
study heme proteins on Ag electrodes demonstrating 
that SERR spectroscopy is capable to selectively probe 
the cofactor solely of the adsorbed proteins (Cotton 
et al. 1980). This pioneering work initiated many SER 
and SERR studies on various biomolecules (Cotton 
et al. 1992). However, it soon turned out that the direct 
interactions with metal surfaces may cause structural 
changes of the protein and eventually denaturation, 
dampening the interest in applying SERR spectros¬ 
copy to biomolecules (Smulevich and Spiro 1985). It 
took more than 10 years to overcome this problem by 
using biocompatible coatings of the electrode, in par¬ 
ticular self-assembled monolayers (SAMs) of 
alkanethiols (Hobara et al. 1998). These coatings do 
not only allow protein binding at a high surface cover¬ 
age and under preservation of the native protein struc¬ 
ture but, in addition, may be considered as simple 
models for biomembranes, offering the possibility to 
study membrane-bound proteins under conditions sim¬ 
ilar to their natural environment. Such devices have 
gained increasing importance for probing redox pro¬ 
cesses of proteins and enzymes not only by SERR 
spectroscopy but also surface-enhanced infrared 
absorption (SEIRA) spectroscopy and electrochemical 
techniques (Murgida and Hildebrandt 2008, 2006). In 
this account, the current state of the art of SERR 
spectroscopic of redox proteins will be described, 
mainly focusing on the heme protein cytochrome c 
(Cyt-c) which is an ideal model protein to demonstrate 
the potential and limitations of the technique. 

Experimental Strategies 

The design of successful SERR experiments of redox 
proteins must include three essential elements. First, 
the prerequisite for the surface enhancement of the 
Raman signals is a nanostructured metal surface 
which is usually obtained by electrochemical 
oxidation/reduction cycles that lead to roughness of 
ca. 50-100 nm. Due to the wavelength dependence of 
the plasmon resonances, Ag exhibits by far the best 
optical performance since it provides strong enhance¬ 
ment from the near UV (400 nm) down to the infrared 
spectral region (Sanchez-Gil and Garcia-Ramos 1998). 
Thus, it allows combining the SER and the RR effect 
for basically all redox-active proteins. This is unfortu¬ 
nately not the case for Au which exhibits a higher 


chemical and electrochemical stability and thus 
would be more appropriate for spectroelectrochemical 
studies of biomolecules. However, due to d^d transi¬ 
tions of the metal, an appreciable surface enhancement 
is only possible for wavelengths longer than 600 nm 
and thus outside the range of electronic transitions 
of most redox-active cofactors, including hemes. 
This drawback may be overcome in the future by 
using layered hybrid electrodes in which the 
optical properties of Ag and the electrochemical 
stability of Au can be advantageously combined 
(Feng et al. 2009). 

Second, the Ag surface must be covered by a thin 
molecular film to ensure biocompatible interactions 
with the redox protein. For immobilizing soluble pro¬ 
teins, the most versatile coatings are based on SAMs 
using mercaptanes of different alkyl chain lengths and 
different terminal functions for binding the proteins. 
These functions may be either negatively (carboxylate, 
phosphonate) or positively charged (amino), polar 
(hydroxyl), or hydrophobic (methyl), depending on 
the binding domain of the protein and the desired 
mode of SAM/protein interactions (Murgida and 
Hildebrandt 2006). Electrostatic and hydrophobic 
interactions constitute transient complexes whereas 
covalent cross-linking following electrostatic pre¬ 
alignment lead to a more rigid and durable 
SAM/protein complex. Membrane-bound proteins 
may be either immobilized in the solubilized form, 
together with its detergent shell directly on the metal 
surface, or incorporated in supported membranes fixed 
on the electrode. 

Third, precautions must be taken to avoid unwanted 
laser-induced processes such as photo-reduction or 
local heating in the laser spot. Thus, similar to RR 
spectroscopic experiments of proteins in solution, the 
sample must be moved through the laser spot 
which can be achieved either using rotating ring or 
disc electrodes (Siebert and Hildebrandt 2007). 

SERR experiments of redox proteins are usually 
carried out as a function of the electrode potential to 
probe the redox equilibria of the immobilized proteins 
(Murgida and Hildebrandt 2001a). Furthermore, SERR 
spectroscopy can be extended to the time-resolved 
mode by coupling the measurements with the potential 
jump technique (Wackerbarth et al. 1999; Murgida and 
Hildebrandt 2001b; Siebert and Hildebrandt 2007). 
Here, a rapid jump of the electrode potential (Ej to 
Ef) perturbs the original equilibrium at Ei of the 
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immobilized protein such that relaxation to the new 
equilibrium conditions at Ef is required. The relaxation 
process is probed within a given time interval after 
a delay time 5 following the potential jump. Immedi¬ 
ately after the probe event the potential jump is 
reversed to restore the original equilibrium at Ej. This 
sequence of potential jumps, delay times, and probe 
events is repeated many times until the accumulated 
signal is sufficient for a quantitative analysis of the 
spectrum in terms of relative spectral contributions of 
the species involved. 

These time-resolved experiments are carried out as 
a function of the delay time to determine the relaxation 
kinetics of the potential-jump induced processes. Eor 
redox proteins, these processes include electron trans¬ 
fer between the adsorbed protein and the electrode, 
possibly coupled to further reaction steps such as con¬ 
formational transitions which, unlike electrochemical 
methods, can as well be analyzed in time-resolved 
SERR experiments. 

Protein Adsorption 

Studying the electron transfer processes of redox pro¬ 
teins by SERR spectroscopy primarily requires a high 
surface coverage. A rational immobilization strategy 
may be employed in those cases when the three- 
dimensional structure is known. Then, as in the case 
of Cyt-c, molecular dynamics methods allow identify¬ 
ing preferred binding domains for electrostatic or 
hydrophobic interactions with the correspondingly 
functionalized SAMs (Eeng et al. 2008; De Biase 
et al. 2009). Cyt-c is small soluble heme protein with 
a redox potential of -i-O.Ol V versus Ag/AgCl. 
It exhibits a well-defined binding domain of various 
positively charged residues for efficient immobiliza¬ 
tion on carboxyl-terminated SAMs (Fig. 1). Compared 
to mammalian Cyt-c from horse heart, the dipole 
moment is distinctly larger for the protein from yeast 
(iso-1 Cyt-c) (Fig. 1) which leads to a more rigid 
fixation of the protein on the negatively charged 
SAM. Since protein dynamics is essential for an effi¬ 
cient electron transfer {vide infra), a release of these 
constraints may be achieved by reducing the electric 
field strength at the SAM/protein interface. This quan¬ 
tity decreases with increasing SAM thickness (number 
of methylene groups) and decreasing charge density of 
the SAM head groups. Thus, for a given SAM 



Surface-Enhanced Resonance Raman Spectroscopy in 
Electron Transfer Studies, Fig. 1 Structural models of cyto¬ 
chrome c from horse heart (Cyt-c; left) and yeast (iso-1 Cyt-c; 
right) indicating the main binding domain for electrostatic inter¬ 
actions with negatively charged SAMs. The arrows represent the 
molecular dipole moment (Feng et al. 2008). TML72 denotes the 
trimethylated Lys residue 72 in iso-1 Cyt-c 

thickness, one may lower the number of negatively 
charged head groups by the admixtures of hydroxyl- 
terminated mercaptanes (mixed monolayers) in order 
to increase the mobility of the bound protein (Feng 
et al. 2008). 

Stationary Experiments 

Cyt-c readily binds to SAM-coated Ag electrodes of 
carboxyl-terminated mercaptanes. For thick SAMs 
composed of mercaptanes with more than five methy¬ 
lene groups, the SERR spectra are basically identical to 
the RR spectra of the protein in solution, indicating that 
the native protein structure is not affected upon immo¬ 
bilization (Murgida and Hildebrandt 2001a). SERR 
spectra, measured as a function of the electrode poten¬ 
tial (Fig. 2), display the increase of the oxidized form at 
the expense of the reduced form with increasing poten¬ 
tial. The quantitative analysis of the spectra affords 
the potential-dependent distribution of the spectral 
contributions of the two species involved which, after 
conversion into relative concentrations, allows deter¬ 
mining the redox potential. Taking into account the 
interfacial potential drop, the redox potential of the 
immobilized protein was found to be the same as in 
solution, consistent with the preservation of the native 
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Surface-Enhanced Resonance Raman Spectroscopy in 
Electron Transfer Studies, Fig. 2 Stationary SERR spectra 
of Cyt-c immobilized on an Ag electrode coated with a SAM of 
mercaptounodecaonic acid. The spectra measured at different 
electrode potentials using 413-nm excitation were subjected to 
the component analysis with the red and blue traces referring to 
the reduced and oxidized state B1 of Cyt-c, respectively. The 
potentials refer to Ag/AgCl (3 M KCl) electrode and are 
corrected for the interfacial potential drop (Murgida and 
Hildebrand! 2001a) 


protein structure as concluded from the comparison of 
the SERR and RR spectra. 

Upon decreasing the number of methylene groups 
of the SAM, corresponding to an increase of the elec¬ 
tric field strength in the SAM/protein interface, there is 
an increasing contribution of non-native states of Cyt-c 
to the experimental SERR spectra due to species in 
which the Met-80 ligand is removed from the heme 
and eventually replaced by His-33 (or Elis-26) 
(Oellerich et al. 2002). These species can be identified 
on the basis of the oxidation, spin, and coordination 
state marker bands which are particularly enhanced 
upon excitation in resonance with the strong Soret 
transition of the heme (413 nm excitation). The quan¬ 
titative analysis of the spectra reveals that the confor¬ 
mational transition between the native (Bl) and the 


non-native (B2) states depends on the electric field. 
In the B2 state, the redox potential is lowered by ca. 
400 mV as determined by SERR spectroscopy 
(Murgida and Hildebrandt 2006). 

Complementary information about the behavior of 
the immobilized Cyt-c is obtained from the SERR 
spectra measured under pre-resonance with respect to 
the Q band of the heme. Under these conditions, the 
RR selection rules which control the Soret-band 
excited spectra are softened and, instead, Raman selec¬ 
tion rules play a considerable role in controlling the 
intensities of the heme modes. As a consequence, the 
surface enhancement of totally and non-totally sym¬ 
metric modes depends on the orientation of the heme 
with respect to the surface (Kranich et al. 2008). This 
relationship offers the opportunity to probe the orien¬ 
tational changes of the immobilized Cyt-c with respect 
to the electrode surface, induced by varying the 
electrode potential and the thickness of the SAM. 

Kinetic Studies 

In analogy to the stationary experiments, qualitatively 
different information may be obtained from potential- 
jump time-resolved experiments depending on the 
excitation conditions. They allow disentangling two 
essential steps in electron transfer: reorientation of 
Cyt-c and electron tunneling between Cyt-c and the 
electrode. 

Using Soret-band excitation, time-resolved SERR 
spectra monitor the oxidation and coordination marker 
bands and thus electron transfer and possible structural 
changes of the heme. For Cyt-c electrostatically bound 
to the Ag electrodes coated with carboxyl-terminated 
SAMs at low electric fields (vide supra) the only 
detectable process is electron transfer between the 
heme and the electrode (Fig. 3). The time-dependent 
spectral changes can be analyzed in terms of a one-step 
relaxation kinetics. If the potential jump is carried out 
to the redox potential, the kinetic analysis affords the 
heterogeneous electron transfer rate constant at 
a driving force of 0 eV. Varying the final potential, 
time-resolved SERR spectra probe the electron trans¬ 
fer kinetics as a function of the overpotential defined 
by Tj = IE° — EfI. Such experiments allow determining 
the reorganization energy of the electron transfer reac¬ 
tion (Murgida and Hildebrandt 2002). With increasing 
electric field strength, heme oxidation is coupled to the 
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Surface-Enhanced Resonance Raman Spectroscopy in 
Electron Transfer Studies, Fig. 3 Stationary and time- 
resolved SERR spectra of Cyt-c immobilized on an Ag electrode 
coated with a SAM of mercaptoacetic acid (Murgida and 
Hildebrandt 2001b). The spectra were measured with 413-nm 
excitation and subjected to the component analysis with the red 
and blue components referring to the reduced and oxidized Cyt-c 
state Bl, respectively. The top and bottom spectra represent 
stationary measurements at the initial and final potential, respec¬ 
tively, whereas the time-resolved spectrum (middle) was 
obtained at a delay time of 4 ms. The red and blue traces refer 
to the reduced and oxidized Cyt-c state Bl, respectively. 
The potentials refer to Ag/AgCl (3 M KCl) electrode and are 
corrected for the interfacial potential drop (Murgida and 
Hildebrandt 2001a) 

conformational transition B1^B2 leading to a more 
complex interfacial kinetics. 

Q-band excited spectra, on the other hand, allow 
probing protein re-orientation following the potential 
jump using the orientation-dependent intensity ratio of 
totally symmetric V 4 (Aig) and non-totally symmetric 
heme modes, for example, V 4 (Aig) versus Vio (Big) 
(Kranich et al. 2008). Protein re-orientation was 
found to be much faster than heme reduction/oxidation 
for thick SAMs (10 or more methylene groups) (Fig. 4, 




Surface-Enhanced Resonance Raman Spectroscopy in 
Electron Transfer Studies, Fig. 4 Kinetic data obtained 
from time-resolved SERR experiments of Cyt-c immobilized 
on Ag electrodes coated with SAMs of mercaptohexadecanoic 
acid (top) and mercaptohexanoic acid (bottom) for a potential 
jump from —0.07 V (EO to the redox potential E**. The relative 
concentrations of the reduced Bl state of Cyt-c (blue) and the 
intensity ratios of the totally symmetric V 4 (Aig) and non-totally 
symmetric V|o (Big) modes (red) were determined from the 
SERR spectra obtained with 413 nm (Soret-band) and 514-nm 
(Q-band) excitation, respectively (Kranich et al. 2008) 

top). These findings suggest that in this low-field 
regime electron tunneling is the rate-limiting step of 
the interfacial redox process. In fact, this conclusion is 
confirmed by analyzing the distance and overpotential 
dependence on the electron transfer rate which follows 
the predictions of Marcus theory (Murgida and 
Hildebrandt 2002). Upon decreasing the SAM- 
thickness corresponding to increasing electric field 
strength, however, the rate of protein re-orientation is 
slowed down whereas the rate of electron tunneling 
increases exponentionally. As a consequence, the 
values of the respective rate constants approach each 
other such that at SAMs with five methylene groups or 
less, the interfacial redox process appears to be primar¬ 
ily controlled by the orientation dynamics and the 
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orientational distribution of the protein (Fig. 4, 
bottom). Accordingly, one may distinguish between 
the electron tunneling (low held) and the gated regime 
(high held) (Fig. 4) (Kranich et al. 2008). A more 
detailed SERR spectroscopic analysis of the gated 
region demonstrates a decrease of the relaxation rate 
constant with increasing solution viscosity consistent 
with the view that large-scale protein motions repre¬ 
sent the rate-limiting steps. In addition, a signihcant 
kinetic isotope effect is determined at very short dis¬ 
tances, implying that re-organization of the hydrogen 
bond networks, possibly in the Cyt-c/SAM interface, is 
slowed down at very high electric helds (Murgida and 
Flildebrandt 2001b). This interpretation is in line with 
molecular dynamics simulations and electron transfer 
pathway analysis (De Biase et al. 2009). 

The interplay between protein dynamics and elec¬ 
tron tunneling constitutes a complex reaction mech¬ 
anism of the interfacial redox processes of redox 
proteins that does not only hold for Cyt-c but also 
for various other heme proteins and metalloproteins 
(Murgida and Hildebrandt 2006; Kranich et al. 2009; 
Zuo et al. 2009; Feng et al. 2008). This conclusion is 
supported by studies carried out with various 
spectroelectrochemical and electrochemical 
techniques but among them, only SERR spectros¬ 
copy allows direct probing of the individual reaction 
steps. 


Summary 

SERR spectroscopy probes the redox cofactor of 
protein immobilized on biocompatibly coated Ag elec¬ 
trodes. Stationary experiments carried out as a function 
of the electrode potential provide information about 
the redox and conformational equilibria of the protein, 
whereas the combination with the potential-jump tech¬ 
niques allows monitoring the electron transfer reac¬ 
tions and protein dynamics. As demonstrated for the 
model protein Cyt-c, these experiments allow 
disentangling the complex interfacial redox process 
in terms of individual reactions steps, that are not 
directly accessible to other spectroelectrochemical 
and electrochemical techniques. Thus, SERR spectros¬ 
copy is an important tool for elucidating details of 
structure-dynamics-function relationships of redox 
proteins. 


Cross-References 

► Cytochromes 

► Electron transfer through proteins 

► Resonance Raman Spectroscopy of Protein- 
Cofactor Complexes 

► Surface-Enhanced Infrared Spectroscopy 

► Surface-Enhanced Raman Spectroscopy for 
Bioanalytics 
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Synonyms 

Single-molecule methods; Surface immobilization; 
Vesicle encapsulation 


Definition 

Lipid vesicles tethered to a solid support can be used 
to encapsulate and immobilize single biomolecules, 
facilitating studies of their dynamics. 


Introduction: Lipid Vesicles 

Lipid vesicles are being employed for a variety of 
purposes in basic and applied biological research 
(Lasic and Barenholz 1996; Christensen et al. 2010). 
The spectrum of applications covers the whole range 
from in vivo drug delivery to sensing. This entry will 
focus on one particular, albeit novel application of 
lipid vesicles, as platforms for fluorescence studies of 
molecular dynamics on the single-molecule level. 

The bilayered envelope of lipid vesicles (Fig. 1, 
sometimes called liposomes) is formed of phospho¬ 
lipids, the natural components of biomembranes 
(Szoka and Papahadjopoulos 1980). Either a single 
phospholipid type or a mixture of lipids can be used, 
and additional additives such as cholesterol are often 
employed as well. These constituents determine the 
physical properties of the membrane. In addition, the 
method of preparation is important as well. Mixing 
phospholipids with an aqueous medium leads to the 
formation of onion-like multilamellar vesicles. 
A variety of methods have been conceived to generate 
unilamellar vesicles from these, including sonication 
and extrusion through filters with well-defined hole 
sizes (Macdonald et al. 1991). The latter method 
typically leads to the formation of small, unilamellar 
vesicles with radii of the order of 100-200 nm or 
less. In addition, using other techniques such as 
electroformation, one can generate giant unilamellar 
vesicles whose size can be several microns (Sharonov 
et al. 2008). Below we will focus on applications of 
small unilamellar vesicles (SUV) to study dynamics of 
individual molecules trapped within their lumen. 

Vesicle Immobilization for Single-Molecule 
Studies 

Biophysical studies of single-molecule dynamics 
characteristically focus on biomacromolecules and 
their function (Weiss 2000; Lavery et al. 2002; 
Michalet et al. 2006). Fluorescence methods have 
proven particularly powerful for such experiments, 
which aim at deciphering particular biological 
activities under conditions of minimal perturbation 
and maximal similarity to the in vivo environment. In 
many cases, the typical timescales of biological reac¬ 
tions are quite long, from milliseconds and up. Tracing 
such long-time dynamics of a biomolecule necessitates 
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Surface-Tethered Lipid Vesicles, Fig. 1 Single-molecule 
studies within a lipid vesicle. A 100 nm small unilamellar vesicle 
is used to trap an individual biomolecule under conditions that 
allow a biological reaction of interest to proceed. Here the 
folding reaction of a small protein is demonstrated (Rhoades 
et al. 2004). The vesicle is tethered to the surface, typically 
using biotin-avidin chemistry. Insertion of nanopores (e.g., the 
bacterial toxin a-hemolysin) into the vesicle walls facilitates 
exchange of materials in and out of the vesicle (Okumus et al. 
2009) 

a method for immobilizing it. While many chemical 
and biochemical methods have been devised for 
molecular immobilization (Roy et al. 2008), many of 
them suffer from the risk of inducing strong interac¬ 
tions between the molecule and its environment. For 
example, a protein molecule can be directly tethered to 
a surface, but unless the surface is made fully inert 
(which is difficult), its interaction with residues on the 
protein might change the latter’s energy landscape and 
affect its function (Friedel et al. 2006). 

In 2001, Boukobza et al. (2001) proposed that 
lipid vesicles can be used as mild vehicles for protein 
immobilization. An extrusion process was used to 


prepare 100 nm SUVs encapsulating individual copies 
of a protein molecule. Biotin-(strept)avidin chemistry 
was then used to tether the vesicles to a supported 
bilayer initially prepared on a glass surface, thereby 
facilitating studies under the microscope (Fig. 1). 
A single-molecule fluorescence polarization technique 
was used to show that the molecules are freely rotating 
within the vesicles, strongly suggesting that interaction 
with the vesicular walls is minimal and non- 
perturbative. This is very reasonable considering that 
membranous surfaces, like the vesicle walls, are 
abundant in cells, and are therefore likely to provide 
a friendly environment to biomolecules. It is important 
to note that a vesicle size of 100 nm makes it relatively 
small compared to a diffraction-limited laser spot, so 
that its whole volume is illuminated uniformly. At the 
same time, such a vesicle is much larger than a typical 
protein molecule (with a typical size of a few 
nanometers). 

Applications to Biomolecular Folding 
and Functional Dynamics 

The vesicle encapsulation method was used by 
Rhoades et al. (2003, 2004) to study protein folding 
on the single-molecule level. Although it is possible 
to study single-molecule folding of proteins directly 
tethered to a surface (Kuzmenkina et al. 2005; Chung 
et al. 2009), it is difficult to guarantee that surface 
interactions will not modify the folding landscape of 
the protein. Molecules of adenylate kinase from 
Escherichia coli (Rhoades et al. 2003) and the cold 
shock protein from Thermotoga maritima (Rhoades 
et al. 2004) were labeled with two fluorescent probes 
suitable for fluorescence resonance energy transfer 
(FRET). The two-state behavior characteristic of the 
folding of small proteins was clearly recovered from 
analysis of single-molecule FRET trajectories of the 
cold shock protein. In contrast, single-molecule trajec¬ 
tories of adenylate kinase showed heterogeneous 
behavior, which suggested a multiplicity of intermedi¬ 
ate states. More recently, Flaran and coworkers 
returned to study adenylate kinase molecules 
entrapped in vesicles (Pirchi et al. 2011). Using an 
automated single-molecule spectrometer they were 
able to acquire thousands of ERET trajectories, and 
analyzed them using hidden Markov modeling 
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methods. The results indicated that as many as six 
states are populated on the energy landscape of the 
protein. It was further shown that multiple pathways 
were involved in folding reactions of this protein. 
This picture is drastically different from the standard 
two-state behavior of the cold shock protein mentioned 
above. 

Heterogeneous, multi-state dynamics also charac¬ 
terize folding reactions of RNA molecules. Indeed, 
Tan et al. (2003) found heterogeneous folding kinetics 
in their studies of individual molecules of the hairpin 
ribozyme, with folding and unfolding rates varying by 
as much as 50-fold between molecules. In this study 
the ribozyme molecules were directly tethered to 
a polyethylene glycol layer on the surface, and the 
question arose as to the possible role of the surface in 
inducing heterogeneous behavior. Ha and coworkers 
therefore repeated their study of the hairpin ribozyme, 
but this time they trapped individual molecules within 
surface-tethered vesicles (Okumus et al. 2004). They 
were able to show that the RNA molecules did not 
interact with the vesicular walls, and were trapped 
in the lumen. Reassuringly, the same pattern of 
heterogeneous dynamics was again observed. Vesicle 
encapsulation was also employed to study native 
protein dynamics on the single-molecule level, includ¬ 
ing conformational changes in synatoptagmin 1 (Choi 
et al. 2010) and in HSP-90 (Ratzke et al. 2010). 

The above applications made use of the ability of 
lipid vesicles to conhne biomolecules under conditions 
of minimal interaction. A second interesting advantage 
of vesicle nanocontainers is their ability to maintain 
a very high local concentration of molecules, while the 
overall concentration in solution remains very low. 
This can be useful for studies of relatively weak 
biomolecular interactions. Such interactions require 
in principle high concentrations of substrates, which 
are a priori incommensurate with the requirements of 
single-molecule spectroscopy. Chen and coworkers 
(Benitez et al. 2008) demonstrated how vesicle immo¬ 
bilization overcomes this problem by studying the 
interaction of single copies of the copper chaperone 
Hahl and the Wilson disease protein within lipid 
vesicles. The dissociation constant for the interaction 
of these two proteins is on the micromolar range, and 
since a single molecule within a 100 nm vesicle is 
equivalent to a concentration of ^3 pM, they were 
able to observe repetitive events of association and 


dissociation, and characterize the rate constants 
involved in the reaction, even though the overall 
concentration of labeled molecules remained 
extremely small. 

Membrane Pores Open the Way to 
Nonequilibrium Experiments 

A shortcoming of the encapsulation method used in all 
studies discussed above was that it does not allow 
exchange of small-molecule reactants or of products 
of the reactions occurring within the vesicles. While 
some solutes can diffuse spontaneously across a lipid 
membrane (Lande et al. 1995; Cisse et al. 2007), this 
diffusion might be rather slow, and many others cannot 
cross a membrane at all. This essentially limits the 
vesicle immobilization method to equilibrium studies 
involving only the molecules included during the 
encapsulation process. Ha and coworkers demon¬ 
strated a solution to this problem in an elegant series 
of studies (Okumus et al. 2009; Ishitsuka et al. 2010). 
They introduced molecules of the bacterial toxin 
a-hemolysin into the membranes of preformed 
vesicles in order to render them permeable to small 
molecules such as ions (Fig. 1). To demonstrate facile 
exchange of solutes through a-hemolysin pores, they 
studied both RNA and DNA folding initiated by 
introduction of ions, and interaction of a helicase 
with its single-stranded DNA track. The latter case 
provided another example of the principle discussed 
above of studying macromolecular interactions at 
high effective local concentrations, while maintaining 
low overall concentrations: The helicase could be 
seen to dissociate from the DNA, but instead of 
diffusing away, as it would do in free solution, it 
quickly encountered the DNA molecule again and 
re-associated with it. 


Summary 

In conclusion, immobilization of biomolecules within 
surface-tethered lipid vesicles emerges as an interest¬ 
ing and useful method for studies of single-molecule 
reactions using fluorescence methods. Two important 
advantages of this method for immobilization over the 
more common techniques involving direct tethering to 
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a surface emerge from current studies. First, the inter¬ 
action with immobilizing surfaces is minimized within 
vesicles, a property that is particularly important for 
folding studies. Second, high effective local concen¬ 
trations are achieved, which can be particularly useful 
for studies involving association between two or more 
macromolecules. It is expected that these advantages 
will continue to attract new and exciting applications 
of vesicle immobilization to facilitate equilibrium and 
nonequilibrium single-molecule studies. 

Cross-References 

► Single-Molecule Spectroscopy 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 
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Definition 

SWEET transporters = a family of transporters for 
glycosides that typically use a uniport transport 
mechanism. Their physiological function is typically 
in cellular efflux, for example, as the first step in 
phloem loading in plants. 

Sugar (in) ^ Sugar (out) 

Basic Characteristics 

Soluble carbohydrates, principally derived from pho¬ 
tosynthesis, serve as the preferred nutrient for many 
unicellular organisms, and as the dominant transported 
form for carbon and energy in multicellular organisms. 
Almost all cells from all kingdoms use carrier proteins 
to transport soluble carbohydrates across the lipid bila¬ 
yers that enclose their cytoplasm. Today, at least four 
classes of carriers that mediate transport of sugars like 
glucose, maltose, or sucrose are known, namely: 
(1) the sodium-dependent glucose transporters of the 
solute-sodium symporter family (SSS TCDB 2.A.21, 
and SLC2); (2) the mono- and disaccharide trans¬ 
porters of the major facilitator superfamily (MES 
TCDB 2.A.1 and SLC5), exemplified by the lactose 
transport protein, LacY; (3) the sugar 
phosphotransferase superfamily (PTS TCDB 4. 
A. 1-4); and (4) the recently identified mono- and disac¬ 
charide transporters of the SWEET family (SLC50). 
The SWEET family has homologs in bacteria, in 
plants, and in animals/humans (Chen et al. 2010). 
Prokaryotic SWEET homologs are predicted to consist 
of only three transmembrane helices (Hu, Sosso, 
Erommer unpublished), while the eukaryotic trans¬ 
porters contain seven predicted transmembrane helices 
with a direct repeat of the bacterial three transmem¬ 
brane helix motif sandwiching a central helix (Chen 
et al. 2010). Their small size makes them ideal candi¬ 
dates for structural analyses by NMR or crystalliza¬ 
tion. The Erommer group has demonstrated 
monosaccharide (glucose, mannose, fructose) trans¬ 
port activities for a number of plant and animal/ 
human SWEETs as well as sucrose transport activities 
for several of the plant SWEETs (Chen et al. 2010, 
2012 ). 

The first member of the SWEET family was identi¬ 
fied by coexpressing Arahidopsis membrane proteins 


of as yet unknown function together with a Edrster 
resonance energy transfer (ERET) sensor for glucose 
in mammalian cells and recording glucose-induced 
FRET changes in the cytosol of the human cells 
(Chen et al. 2010). Similarly, the first sucrose transport 
activity was detected by coexpressing Arahidopsis 
membrane proteins of as yet unknown function 
together with a Forster resonance energy transfer 
(FRET) sensor for sucrose in mammalian cells (Chen 
et al. 2012). The first SWEETs from animals and 
humans were characterized using similar expression 
systems as well as Xenopus oocytes in combination 
with radiotracer studies (Chen et al. 2010). 

The transporters characterized so far appear to func¬ 
tion as uniporters, and appear to localize to the plasma 
membrane or, in the case of the human SWEET 1, to 
the Golgi apparatus (Chen et al. 2010). The transport 
activity of the plant homologs is characterized by low 
affinity and pH independence (Chen et al. 2010, 2012). 

Arahidopsis and rice SWEETs that belong to the 
phylogenically defined clade III (Chen et al. 2010, 
2012), function as sucrose transporters. The 
Arahidopsis SWEET 11 and 12 proteins are expressed 
in phloem parenchyma, and their physiological func¬ 
tion is to allow efflux of sucrose into the cell wall 
space, from where sucrose is loaded into the phloem, 
i.e., the conduit serving long distance translocation of 
sucrose in plants. Thus, SWEETs, together with the 
MES family SUT sucrose proton cotransporters, are 
responsible for the key steps in phloem loading 
(Riesmeier et al. 1992, 1993, 1994; Chen et al. 2012). 

Plant pathogens attack their hosts in order to gain 
access to their nutrient resources. Interestingly, 
SWEET gene expression is induced when 
Xanthomonas oryzae infects rice plants (Chu et al. 
2006; Yang et al. 2006; Antony et al. 2010; Chen 
et al. 2010, 2012; Liu et al. 2011). The pathogens use 
type III effector molecules that function as transcrip¬ 
tion factors to trigger SWEET expression, most likely 
to induce efflux of sugars at the infection site (Chu 
et al. 2006; Yang et al. 2006; Antony et al. 2010; Chen 
et al. 2010, 2012; Liu et al. 2011; Yu et al. 2011). 
Excitingly, when induction of SWEETs is blocked, 
plants are resistant to the pathogen. SWEETs corre¬ 
spond to the important recessive resistance loci Xal3 
and Xa25. 

It will be interesting to study the regulation of the 
SWEETs, their structure and their structure-function 
relationship, as well as their role in the physiology of 
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the different classes of organisms. It is fascinating to 
see that organisms that live extremely different life 
styles such as plants and humans use related proteins 
for similar functions, here cellular sugar efflux. 

Chen et al. (2010) hypothesized that the animal/ 
human SWEETs are also involved in cellular sugar 
efflux and that they may represent the elusive glucose 
efflux transporters required to transport glucose out of 
our intestinal cells into the blood stream for nutrition, 
as well as glucose efflux from liver cells as a key step 
for glucose homeostasis. Therefore, they may be 
important players in obesity and diabetes. 

Cross-References 

► Excitatory Amino Acid Transporter 

► Fluorescence and FRET in Membranes 

► Glutamate Transporter Family 

► Membrane Protein Function 

► Membrane Proteins: Structure and Organization 

► Membrane Transport, Energetics and Overview 

► Substrate Capture by ABC Transporters 

► SWEET Glucoside Transporter Superfamily 
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► Bacterial Flagellar Motor: Overview 


Switching 

► Bacterial Flagellar Motor: Overview 


Synaptic Potentials 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 


Synaptic Transmission 

► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 


Synchronous Correlation 

► Infrared Spectroscopy: Data Analysis 


Synchrotron Light 

► Infrared Spectroscopy using Synchrotron Radiation 
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► Infrared Spectroscopy using Synchrotron Radiation 


Synchrotron Radiation Circular 
Dichroism Spectroscopy 

B. A. Wallace 

Department of Crystallography, Institute of Structural 
and Molecular Biology, Birkbeck College, University 
of London, London, UK 

Synonyms 

SRCD spectroscopy 

Definition 

Synchrotron radiation circular dichroism (SRCD) spec¬ 
troscopy extends conventional circular dichroism (CD) 
spectroscopy (using laboratory-based instruments) by 
employing the intense beam produced by a synchrotron 
instead of a xenon arc lamp as its light source. The light 
flux in an SRCD beamline is far more intense (up to four 
orders of magnitude) than conventional CD instruments 
and extends to a wider wavelength range covering the 
vacuum and far- and near-ultraviolet regions (i.e., from 
^130 to 300 nm). 

Basic Characteristics 

Enhanced Features of SRCD Spectroscopy Relative 
to Laboratory-based CD Spectroscopy 

SRCD spectroscopy results in improved quality and 
quantity of data obtainable, thereby leading to 
improved and enhanced analyses. The advantages of 
using this method are that it enables 

1. Collection of additional data at wavelengths below 
the 182-190 nm cutoff limit normally achievable 
with conventional CD instruments (Fig. 1), thus 
enabling the measurement of additional electronic 
transitions and resulting in higher information con¬ 
tent in the spectra 


2. Detection of spectra with higher signal-to-noise 
levels, enabling either the use of smaller amounts 
of material or shorter data collection times, or more 
sensitive detection of small subtle differences 
between samples 

3. Measurements in the presence of absorbing com¬ 
ponents (such as buffers, salts, lipids, and 
detergents) 

Additional Applications Possible with SRCD 
Spectroscopy 

This newly developing method is playing an important 
role in both static and dynamic studies of protein confor¬ 
mations and their complexes. Thus far a number of new 
types of experiments have been possible using SRCD 
spectroscopy which were not feasible with conventional 
CD spectroscopy. Some examples are 

1. Improved accuracy in secondary structure deter¬ 
mination, especially for proteins involving beta 
sheet structures (possible due to the low wave¬ 
length data where contributions of beta-type struc¬ 
tures are more prominent) (Fig. 1) 

2. The ability to distinguish not only secondary struc¬ 
tural types, but also different protein fold motifs 
(possible if data down to at least 170 nm can be 
obtained) 

3. The ability to examine conformational differences 
between a wild-type protein and a single-point 
mutation of the protein (possible because of the 
high sensitivity and signal-to-noise levels) 

4. The ability to compare the structures of 
a membrane protein in lipid vesicles, detergent 
micelles, and under crystallization conditions 
(possible because of the penetration of the beam 
into samples with highly absorbing components) 

5. The identification of new folding intermediates in 
the process of thermal folding/unfolding processes 
(possible due to the low wavelength data accessi¬ 
ble, in which the intermediate structures produce 
distinct spectra) 

6. Monitoring of rapid protein folding events, espe¬ 
cially involving beta-sheet-containing structures 
(possible due to the high sensitivity and thus more 
rapid data collection, and the ability to monitor 
changes at lower wavelengths, which provide 
more accurate information on beta sheet structures) 

7. The identification of the orientation of peptides 
and proteins in membranes (possible using ori¬ 
ented SRCD) 
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Synchrotron Radiation Circular Dichroism Spectroscopy, 
Fig. 1 SRCD spectra of a mostly helical protein in red, a mostly 
beta sheet protein in blue, and a polyproline helix (collagen) in 
yellow. The low wavelength data in the green shaded area to the 
left of the vertical black line are generally only accessible using 
SRCD spectroscopy, whereas the data to the right of the line are 
accessible by both SRCD and conventional CD instruments 
(After Miles and Wallace 2006) 

8. Improved characterization of natively disordered 
proteins (possible because their characteristic 
spectral features are predominantly in the low 
wavelength region below 200 nm) (Fig. 1) 

9. The ability to characterize the carbohydrate com¬ 
ponents of glycoproteins (possible because sugar 
transitions, for the most part, occur only at wave¬ 
lengths below 200 nm) 

10. The ability to characterize the nature of protein- 
protein interactions in complexes involving both 
induced-fit and rigid-body interactions (the former 
is possible even when they only involve subtle 
secondary structural changes, and the latter 
because the quaternary interactions can be moni¬ 
tored at wavelengths below 190 nm) 

11. Asa sensitive measure of ligand binding, so it can 
be used to identify potential new drug candidates 
(possible due to the high sensitivity and signal-to- 
noise levels) 

Future applications of SRCD spectroscopy are 
expected to include more facile and more rapid dynam¬ 
ics studies involving fast kinetic reactions, and protein 
folding/unfolding/refolding, including temperature 
jump kinetic studies, and facilities for high throughput 
examination of samples, which will facilitate screen¬ 
ing for new pharmaceutical drug candidates. 

Availability 

SRCD beamlines are currently located at the following 
synchrotrons: Institute for Storage Ring Facilities 


(ISA, Denmark); National Synchrotron Light Source 
(NSLS, USA); National Synchrotron Radiation 
Research Center (NSRRC, Taiwan); Beijing Synchro¬ 
tron Radiation Facility (BSRF, China); Soleil Synchro¬ 
tron (France); Diamond Light Source (UK); Berliner 
Elektronenspeicherring-Gesellschaft fiir Synchrotron- 
strahlung (Bessyll, Germany); Hiroshima Synchrotron 
Radiation Center (HiSOR, Japan); ANKA Synchrotron 
(Germany); and National Institute of Advanced Indus¬ 
trial Science and Technology (TERAS, Japan). 


Information Sources 

Information on practical aspects of sample preparation 
and data analysis is reviewed in Miles and Wallace 
(2006), and examples of applications enabled by the 
use of SRCD can be found in Wallace (2010) and 
Wallace and Janes (2010). Eull discussion of the 
SRCD technique is described in a recent monograph 
(Wallace and Janes 2009). 


Cross-References 

► Circular Dichroism and Chirality 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► DNA-Ligand Circular Dichroism 

► Ear UV Protein Circular Dichroism 

► Near UV Protein CD 

► Oriented Circular Dichroism Spectroscopy 

► Protein Circular Dichroism Analysis 

► Protein Circular Dichroism Data Bank 

► Protein Eolding 
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Synchrotron-Induced X-Ray 
Fluorescence (SXRF) 

► X-Ray Fluorescence Imaging: Elemental and Chem¬ 
ical Speciation Mapping of Biological Systems 


Synthetic Cell Biology (in the Context of 
Carbohydrates) 

Chunjuan Liu, Paul M. Gardner and 
Benjamin G. Davis 

Department of Chemistry, Chemistry Research 
Laboratory, University of Oxford, Oxford, UK 

Definition 

Synthetic biology, (Elowitz and Lim 2010; Benner and 
Sismour 2005) a rapidly emerging area, applies both 
scientific and engineering knowledge and principles to 
biology to design and manipulate novel biological 
components and systems with novel or specialized 
functions. This nascent field presents a wide variety 
of potential applications such as tissue engineering, 
environmental alteration, biomaterial fabrication, and 
understanding of naturally occurring biological pro¬ 
cesses. While much effort has been applied to the 
manipulations of existing systems, e.g., genetically 
engineered cells and organisms, (Andrianantoandro 
et al. 2006) to fulfill escalating requirements, the cre¬ 
ation of a living organism or system from nonliving 
parts (potentially de novo) remains a striking 
unaccomplished challenge. 

Synthetic cells or artificial cells are minimum cells 
or cell-like constructs composed of parts that achieve 
partial (and ultimately full) cellular functions. Under¬ 
standably, primary efforts have been focused on the 
creation of such cells with basic biological functions, 
e.g., protein transcription and translation and metabo¬ 
lite generation. 

There are two main and complementary ways to 
construct synthetic cells, in principle, referred to on 
some hypothetical axis of complexity as “top-down” 
(making an existing living cellular system less com¬ 
plex and typically more defined for a given purpose) 
and “bottom-up” (assembly from building blocks of 


varying complexity and identity, none of which them¬ 
selves possess cell-like function alone). The top-down 
approach attempts to minimize, through reduction, 
gene/protein copy numbers while retaining activity 
(Hammer et al. 2008; Robbins et al. 2010). In 
contrast, the bottom-up approach is perhaps a more 
chemical process, which expands a hypothesis 
that complex and delicate biological systems start 
(and so can be recapitulated) with simple molecules 
(Eig. 1). An archetypal example of this philosophy 
comes from the work of Miller (1953) who generated 
low levels of amino acids by applying electrical dis¬ 
charge to a proposed prebiotic microenvironment. 

One top-down synthetic cell design was suggested 
by Venter et al. in 2010 (Gibson et al. 2010). Through 
stepwise creation of a bacterial chromosome using 
chemical and biological methods followed by transfer 
to a bacteria cell, native bacterium DNA was replaced 
in a dilution-type approach by a “synthetic” genome. 
Consequently, these cells started creating new cells 
increasingly under the control of the synthetic genome. 
Over the long term, this approach may facilitate the 
generation of fully synthetic cell machinery to produce 
useful products by simply adding genes to the synthetic 
genome of the bacterium. Although promising and 
attractive, these current techniques are in essence 
only able to mimic or copy the existing genomes and 
use standard strategies from molecular biology and 
microbiology; they do not yet create any new biolog¬ 
ical processes or principles. 

The bottom-up approach has often focused on an 
understanding of principles that might be of relevance 
to both the origin of life, the steps from chemistry 
to biology, and to what might constitute the hallmarks 
of life in cellular form. In this context, protocells 
have been built as model systems to probe concepts 
such as growth, division, adaption, and evolution. Sev¬ 
eral characteristics have been probed as suggested 
requisites. Eor example, in some protocells, two 
components have been combined. To explore 
information-carrying genetic molecules within mem¬ 
brane-bound environments, Szostak et al. (Zhu and 
Szostak 2009a, b; Mansy and Szostak 2008; Mansy 
et al. 2008; Fujikawa et al. 2005; Chen et al. 2004, 
2005; Szostak et al. 2001) have built lipid vesicles and 
inserted self-replicating RNA molecules. In such 
systems, the osmotic pressure, generated in the process 
of replication, accelerated the growth of fatty acid 
membranes. Consequently, cells containing a faster 
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Synthetic Cell Biology (in 
the Context of 
Carbohydrates), 

Fig. 1 Top -down and 
bottom-up approaches 
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replicating “genome” grew faster than cells with 
slower nucleic acid replication. Accordingly, simple 
physical principles coordinating replications between 
the nucleic acids and the cell membranes proved key in 
a crude form of selection. 

Despite the fact that they are one of the fundamental 
components of existing life, both strategic approaches 
to such synthetic cells have largely neglected the role 
of carbohydrates, (► Glycoproteins) other than to 
serve, incidentally, as backbones of nucleic acids. 
Yet, this role, while now evolutionarily conserved 
and unaltered, must too have been the product of 
some form of essential prebiotic (or cobiotic) carbohy¬ 
drate chemistry (Powner et al. 2009). It is quite widely 
agreed that RNA may be considered as a viable pre¬ 
cursor of both DNA and protein, a notion encapsulated 
in the so-called “RNA-world” hypothesis (Joyce 2002; 
Orgel 2004; Gesteland et al. 2006). This hinges, in part, 
on the observations that RNA can work as both catalyst 
(fulfilling the role typically occupied by protein) and 
genetic information carrier (fulfilling the role typically 
occupied by DNA). Chemically plausible pathways to 


the generation of key building blocks of RNA are now 
starting to emerge (Powner et al. 2009). 

Opposed to a gene-first scenario, Wachtershauser 
has proposed (Wachtershauser 1988, 1990) 

a “metabolism-first” scenario, involving metabolic 
cycles independent of genetic materials. Through 
such stable cycles, genetic information might then 
emerge. One aspect of this scenario predicts the idea 
of “flat life”; that the beginning of life was directly 
related to the chemical (metabolic) reactions occurring 
on a given (mineral) surface. 

Desirable hallmarks for metabolic systems include 
modes of chemical amplification, one of which might 
viably include autocatalysis. Few autocatalytic sys¬ 
tems exist in organic chemistry and even fewer in the 
aqueous environment that is likely to have sustained 
early life. The formose reaction, in which highly 
concentrated aqueous formaldehyde condenses to 
a complex mixture of carbohydrates and sugar- 
alcohols (Fig. 2), typically in the presence of a base, 
is one such rare example and was first described by 
Butlerov in 1861 (Butlerov 1861). Detailed studies of 
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Synthetic Cell Biology (in the Context of Carbohydrates), Fig. 2 Overview of the formose reaction based on the autocatalytic 
cycle proposed by Breslow (1959) and GCT-MS data of persilylated carbohydrate reaction products 


the reaction have been relatively limited in part due to 
the difficulties in separating and identifying compo¬ 
nents of the product mixture, named “formose” with 
the empirical formula for polymerization of formalde¬ 
hyde (CH20)„. The function of this reaction as 
a putative metabolism(a protometabolism) embedded 
in protocells has recently been explored (Gardner et al. 
2009). Functional generation of carbohydrate within 
these chemical cells or “chells” was confirmed and 
intriguingly displayed a different product profile to 
those found from formose reactions conducted in 
bulk solution. 

Such varying start points generate many useful 
hypotheses. To provide a realizable target within the 
field of artihcial cells, an approach to the recognition of 


artificial chemical systems that lead to such chemical 
cells has been proposed. This can be considered as 
a Gedankenexperiment that exploits a “cellular imita¬ 
tion game” to test protocells (Cronin et al. 2006). The 
concept of the test Is that the artificial cell in question 
should be able to “communicate” with natural cells 
(directly or via a relay) in such a way that it can be 
interrogated by the latter, in a language sufficiently 
sophisticated so as to appropriately distinguish 
between alternative outputs from realizable experi¬ 
ments. In this idealized cellular imitation game, inter¬ 
rogation could take place following any of a series of 
increasingly more complex and sophisticated mecha¬ 
nisms, starting perhaps with a relatively simple of 
language based on low-molecular-weight signaling 
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molecules (and ultimately moving toward mechanical 
transduction, biofilm formation and swarming patterns 
of behavior etc.)- 

One such “language” is used by bacteria to commu¬ 
nicate with each other by releasing signal molecules, 
referred to as autoinducers. Bacteria measure their 
population by detecting the concentration of such 
autoinducers, which in turn correlates with cell den¬ 
sity; this is termed “quorum sensing.” Quorum sensing 
(Bassler and Losick 2006) occurs in both Gram¬ 
positive and Gram-negative bacteria. Also, it controls 
various biological processes, such as bioluminescence, 
secretion of virulence factors, sporulation, and conju¬ 
gation. It has been shown that the model bacterium 
Vibrio harveyi generates autoinducer-2 (AI-2), 
a furanosyl boratediester which appears to be widely 
occurring for interspecies cell-cell communication. 

From the carbohydrate generation pool created by 
the protometabolic formose reaction (6-8 pmol of 
carbohydrate metabolic product per mL of vesicle 
solution), active compounds structurally related to 
AI-2 were discovered (Gardner et al. 2009). When 
these protometabolic products of the vesicular formose 
reaction were introduced into V. harveyi culture, the 
bacteria showed a clear luminescent response and con¬ 
firmed an ability to serve as the potential signal mole¬ 
cules released by the protocell. This then enabled 
coupling of a protometabolism with a signaling pro¬ 
cess between artificial and natural cells. When appro¬ 
priate active protocell-containing medium was added 
directly to active V. harveyi culture, resulting signifi¬ 
cant light production by V. harveyi indicated success¬ 
ful protocell-to-cell signaling mediated by 
carbohydrates. 

Summary 

To date, fewer researchers have examined abiotic (as 
judged by our current biologies) systems in the gener¬ 
ation of putative synthetic cells. The incorporation of 
atypical reactions (such as the formose, which requires 
harsh conditions not normally compatible with biol¬ 
ogy) and chemistries (such as previously unexplored 
assembly manifolds for ribosides (Powner et al. 
2009)), therefore represent alternative avenues involv¬ 
ing carbohydrates and even suggest possible chemical 
and functional origins of bioactive carbohydrates 


(Sacerdote and Szostak 2005; Weber 2000) (e.g., in 
communication and in the central structures of other 
key biomolecules). 

Cross-References 

► (Glyco)Protein Folding Disorders 

► Carbohydrate Nomenclature 

► Glycoproteins 
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Taurine/Alpha-Ketoglutarate 
Dioxygenase - Computational Studies 

Sam P. de Visser 

Manchester Interdisciplinary Biocenter and School of 
Chemical Engineering and Analytical Science, 
University of Manchester, Manchester, UK 

Synonyms 

2-Oxoglutarate-dependent dioxygenases; a-Ketoglutarate- 
dependent hydroxylases 

Definition 

Taurine/a-ketoglutarate-dependent dioxygenases (TauD) 

Basic Characteristics 

The most commonly investigated mononuclear nonheme 
iron containing dioxygenase is taurine/a-ketoglutarate- 
dependent dioxygenase (TauD); as such it has become 
the template of nonheme iron biochemistry (Costas et al. 
2004; Bollinger et al. 2005). These enzymes require a 
co-substrate, generally a-ketoglutarate, and molecular 
oxygen in their catalytic cycle to generate a high-valent 


iron(IV)-oxo species, which abstracts a hydrogen 
atom from the substrate that ultimately leads to either 
substrate hydroxylation, dehydrogenation, or haloge- 
nation. Interestingly, the a-ketoglutarate-dependent 
dioxygenases as well as the related a-ketoglutarate- 
dependent halogenases do not require a cofactor and/ 
or proton sources, which make them of huge interest 
to the biotechnology industry. Their biochemical 
functions include the biosynthesis of antibiotics in 
bacteria, for example, vancomycin and fosfomycin, 
oxygen sensing, responses to hypoxia, and DNA and 
RNA base repair mechanisms. 

Figure 1 displays a typical active site structure of 
a TauD enzyme as taken from the 10S7 protein data 
bank (pdb) file (O’Brien et al. 2003). The central iron 
atom is bound through interactions with three peptide 
ligands in a typical 2-His/l-carboxylate ligand motif 
from His 99 , Aspioi, and His 255 . This substrate bound 
pdb file contains a-ketoglutarate (aKG) that binds as 
a bidentate ligand with one oxygen atom trans to His 99 
and another trans to Aspioi. The sixth ligand position 
of the metal is reserved for molecular oxygen, which 
will bind in a later step in the catalytic cycle. Substrate 
taurine is not covalently bound to the metal but is held 
in position through many hydrogen-bonding interac¬ 
tions in a small substrate-binding pocket. Two phenyl¬ 
alanine rings (of Phei 59 and Phe 206 ) surround the 
substrate-binding pocket; the former has been identi¬ 
fied with substrate binding and orientation. 

The catalytic cycle of TauD enzymes in schemati¬ 
cally depicted in Fig. 2. The catalytic cycle starts from 
the resting state (A), where a water molecule fills the 
sixth ligand position of the metal. Entrance of taurine 
into the substrate-binding pocket, however, displaces 
this water molecule from the iron center to form 


G.C.K. Roberts (ed.), Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 
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Taurine/Alpha-Ketoglutarate Dioxygenase - Computa¬ 
tional Studies, Fig. 1 Extract of the active site of TauD with 
several key amino acids and substrates (ot—ketoglutarate and 
taurine) highlighted. Labels taken from the pdb file 


Structure B, and its position is taken over by molecular 
oxygen that binds as a ferric-superoxo form (C). 
Molecular oxygen, subsequently attacks the a-keto 
position of a-ketoglutarate to initially generate 
a bicyclic ring-structure (D), whereby the dioxygen 
bond is weakened. Thereafter, decarboxylation of a- 
ketoglutarate into succinate and CO 2 produces an iron 
(IV)-oxo active species (E). This is an efficient and 
versatile oxidant capable of abstracting hydrogen 
atoms from aliphatic groups. After hydrogen atom 
abstraction and formation of the iron(lll)-hydroxo spe¬ 
cies (F) a fast radical rebound step gives alcohol prod¬ 
ucts. These are released from the substrate-binding 
pocket and the sixth coordination site of the metal is 
reoccupied by a water molecule to close the catalytic 
cycle. 

Several research groups have investigated the cata¬ 
lytic mechanism of oxygen activation by TauD 
enzymes computationally (de Visser 2007; Topol 
et al. 2006; Borowski et al. 2004). Most studies report 
density functional theory calculations on the process 
starting from the ferric-superoxo species (C in Fig. 2) 
and focus on the mechanism leading to hydroxylation 
of taurine. Further quantum mechanics/molecular 
mechanics (QM/MM) studies on an enzyme model 
system supported the established mechanisms 
(Godfrey et al. 2008). Figure 2 displays the catalytic 
mechanism and relative energies obtained for this 


chemical process. These calculations start from an 
iron(lIl)-superoxo species (C) that has a septet spin 
ground state with the quintet spin state somewhat 
higher in energy. The distal oxygen atom of the 
superoxo group attacks the a-keto position of 
a-ketoglutarate to form the ring-structure (D) via a 
C-0 bond formation barrier TSi. This step costs 
about 10 kcal moF* in energy on both spin state 
surfaces. The spin state surfaces become degenerate 
in intermediates D. Decarboxylation from this struc¬ 
ture encounters a small barrier (TS 2 ) and is highly 
exothermic on both spin state surfaces and, therefore, 
irreversible. This reaction step initially gives an iron 

(III) -peroxosuccinate structure (D'), which is an 
intermediate that had not been anticipated from exper¬ 
imental studies. In structure D' the quintet spin state is 
well lower in energy than the nearest septet spin state, 
which stays that way during the rest of the reaction 
mechanism. In the final step of this mechanism, the 
dioxygen bond breaks to form succinate and an iron 

(IV) -oxo species (E) via an 0-0 breaking barrier TS 3 . 
The latter barrier is reasonably high, which implies that 
this structure should have a finite lifetime and therefore 
may live long enough to be able to be detected. 
So far, no experimental evidence of the iron(III)- 
peroxosuccinate structure has been found, however. 
DFT studies investigated its potential to act as an 
alternative oxidant of substrate hydroxylation in com¬ 
petition with the iron(IV)-oxo species. The hydrogen 
abstraction barriers of taurine by D', however, were 
found to be substantially higher than dioxygen bond 
breakage via TS 3 to form the iron(IV)-oxo species. 
Moreover, the iron(IV)-oxo species gives very low 
barriers for hydrogen atom abstraction from substrates. 
Consequently, computational studies identify the iron 
(IV)-oxo species as the sole oxidant of TauD enzymes. 

Although these computational studies give varia¬ 
tions in the relative energies of the individual struc¬ 
tures, the main conclusions are the same. Thus, the 
rate-determining step of the catalytic cycle after 
dioxygen binding to the iron center proceeds via bar¬ 
rier TSi. The mechanism contains an entropically 
driven process, namely, decarboxylation, which can¬ 
not be reversed. Relative energies of the local minima 
in the calculations predict the iron(IV)-oxo species to 
be the most stable. Subsequent studies using DFT 
calculations on model complexes and full QM/MM 
on an enzymatic system was performed on the 
next step of the reaction mechanism, where the 
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Taurine/Alpha-Ketoglutarate Dioxygenase - Computational Studies, Fig. 2 Catalytic cycle of taurine hydroxylation by a- 
ketoglutarate-dependent dioxygenases such as TauD 


iron(IV)-oxo species abstracts a hydrogen atom from 
taurine. Both DFT and QM/MM methods predicted 
low hydrogen abstraction barrier heights in the 
order of 6.7 kcal mol”'. Note that the calculated 
Fe-0 distances are in good agreement with those 
obtained from rapid freeze quench X-ray absorption 
studies (1.62 A). 

Recently, a series of systematic computational stud¬ 
ies were performed where substrate hydroxylation by 
TauD enzymes was compared to that of heme-based 
iron(IV)-oxo oxidants (de Visser 2006, 2010; Latih 
et al. 2009). These studies showed that nonheme iron 
(IV)-oxo oxidants are more effective and powerful 
oxidants in hydrogen atom abstraction reactions as 
compared to heme type iron(IV)-oxo porphyrin cation 
radical oxidants like the active species of cytochrome 
P450 enzymes. Trends in substrate hydroxylation 


reactions by the iron(IV)-oxo species of TauD 
enzymes showed that the rate determining hydrogen 
atom abstraction barrier correlates linearly with the 
strength of the C-H bond that is broken and the 
strength of the 0-H bond that is formed. Hence, 
these reactivity trends can be used to predict substrate 
activation barriers and rate constants for TauD 
enzymes. 

Cross-References 

► Computational Enzymology 

► Cysteine Dioxygenase - Computational Studies 

► QM/MM Calculations of Vibrational Spectra 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 
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Techniques Applied to Glycan Structure and Conformation 
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Techniques Applied to Glycan Structure 
and Conformation 

Elizabeth Hounsell 

School of Biological and Chemical Sciences, Birkbeck 
College, University of London, London, UK 

Synonyms 

Carbohydrates; Glycolipid analysis; Glycoprotein 
analysis; Oligosaccharide analysis and conformation; 
Polysaccharides 


Definition 

The techniques applied to glycan/oligosaccharide/ 
polysaccharide structure and conformation are the 
same as those used to analyze the protein to which 
they can be attached (► Glycoproteins), but with 
specific adaptations that have been honed over the 
years to adapt to the different functional groups of 
oligosaccharides, their diversity of monosaccharide 
type, linkages, and branching and anomericity. Mono¬ 
saccharide analysis methods, mass spectrometry, cir¬ 
cular dichroism (► Carbohydrate Circular Dichroism), 
NMR (► Carbohydrate NMR Spectroscopy) and X-ray 
crystallography (► X-Ray Diffraction and Crystallog¬ 
raphy of Oligosaccharides and Polysaccharides), and 
different forms of microscopy all give clues to parts of 
the structural and conformational puzzle. In some 
cases analysis of intact glycoproteins can be carried 
out. In others the different biophysical characteristics 
of carbohydrate, protein, and lipid require separation 
before analysis and reconstruction by molecular 
graphics with molecular dynamics to give the full 
picture of the cell surface and its interactions. The 
difference between analysis of proteins and carbohy- 
drates/glycosylation is that monosaccharides, unlike 
amino acids, can be linked in different ways (not just 
by one, the peptide linkage) and therefore it is neces¬ 
sary to have linkage data as well as constituent and 
sequence analysis. In many ways the conformation of 
the oligosaccharide is then defined by the internal 
branching rather than with proteins where the folding 
of the peptide chain is determined by the interaction of 
often distant amino acids in the sequence. 

Basic Characteristics 

Historically, the first analysis was by chemical 
methods followed by the chromatographies - paper 
(PC), thin layer (TLC), gas-liquid (GLC or GC), and 
high pressure/performance (HPLC). GC quickly 
became associated with chemical ionization (Cl) and 
electron impact (LI) mass spectrometry (MS) for anal¬ 
ysis at the monosaccharide level of volatile derivatives 
(see below) as the first hyphenated technique, GC-MS. 
At about the same time as the first HPLC methods 
became available (early 1980s) the first soft ionization 
mass spectrometry (MS) methods were beginning to be 
applied, LAB-MS and, later, ESI and MALDI-MS 
becoming the most popular. TLC-MS and LC-MS 
followed. TLC, HPLC, and MS once used for 
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monosaccharide analysis were now used routinely at 
the oligosaccharide level. Together these techniques 
give monosaccharide composition, linkage, and 
branching patterns, but for complete analysis including 
anomeric designation and conformational informa¬ 
tion NMR is required, the first routine applications 
being when 400 MHz magnets became available. 
NMR gives complementary information on sequence, 
linkage, and branching. This and other biophysical 
techniques for oligosaccharides, polysaccharides, 
and glycoconjugates followed, reviewed in this 
encyclopedia. The use of NMR analysis for a wide 
variety of structural and biophysical applications in 
glycoproteins, glycation, bacterial polysaccharides, 
natural products, agriculture, and industry has been 
reviewed yearly for the RSC Specialist Reports on 
NMR, the last being Hounsell (2010) from which the 
previous 10 years can be accessed. Earlier NMR stud¬ 
ies of oligosaccharides, glycoconjugates, and glyco- 
peptides are summarized in Hounsell (1995) and 
Hounsell and Bailey (1997). A good modem academic 
summary is in Sinnott (2007). 

In general, due to the different biophysical charac¬ 
teristics of oligosaccharides and the protein to which 
they are attached, the glycan part of glycoproteins have 
been analyzed by the methods discussed below and 
(Hounsell 1994, 1997, 1998) after their release by 
chemical or enzymatic methods. 0-linked glycosyla- 
tion (oligosaccharides attached to protein via the 
oxygen atom of serine or threonine amino acids) 
(► Glycan-to-Protein Linkages) is classically released 
by mild alkali with subsequent reduction. N-linked 
glycosylation (oligosaccharides linked via the nitrogen 
of the acetamido group of asparagine amino acids) can 
be released chemically by hydrazinolysis, but is more 
commonly removed enzymatically. Glycopeptides 
formed by protease digestion are also analyzed, in 
particular to identify the position of occupied glyco¬ 
sylation sites in the protein. ► Glycosphingolipids 
(glycolipids) are usually analyzed intact. Obviously 
all the methods discussed can be applied in the analysis 
of polysaccharides of different species (► Bacterial 
Lipopolysaccharide, OPS, and Lipid A, ► Bacterial 
Polysaccharide Structure and Biosynthesis, ► Polysac¬ 
charides: Biophysical Properties, ► Pectin Biophysics). 

Chromatography 

For oligosaccharides and glycolipids, TLC is still 
a convenient method of characterization although this 


requires derivatization before chromatography and 
hence is used for analysis alone. PC has largely lost 
favor although it was used very successfully for 
separation of underivatized oligosaccharides in large 
amounts for their early use in structural, antigenic, and 
biological studies. In column chromatography, separa¬ 
tion or analysis of underivatized oligosaccharides by 
molecular weight can be carried out by size exclusion 
or gel filtration chromatography. A column packing 
introduced in HPLC, polymerized carbon PGC mimics 
the size separation, and was popular enough to spawn 
HypercarbS and GlycoSep H™ columns, but the com¬ 
mon column packings for normal and reverse phase 
HPLC also take into account the differences in 
hydrodynamic volume given by differences in mono¬ 
saccharides, linkage, and branching. These HPLC col¬ 
umn packings use derivatized oligosaccharides for 
analysis against a library of standards or with online 
MS analysis. All modern applications are with pre- 
rather than and post-column derivatization, although 
it could be considered that another commonly used 
analysis technique, high performance anion exchange 
chromatography with pulsed amperometric detection 
(HPAEC-PAD), at least involves on-column electro¬ 
chemical change (ionization of the free hydroxyl 
groups of mono- and oligosaccharides). Other standard 
anion exchange columns can be used for the acetamido 
group of, for example, GlcNAc and GalNAc, the car¬ 
boxyl group of the uronic acids GlcA and IdA, and the 
sialic acids (► Carbohydrate Nomenclature, ► Glyco¬ 
proteins, ► Proteoglycans). Affinity chromatography, 
based on oligosaccharide antigenicity and carbohy¬ 
drate binding proteins (► Lectins) are also common 
(Hounsell 1998). Also using biological reagents, is 
the application of one-off or sequential enzyme diges¬ 
tion combined with chromatography often with the 
highly increased sensitivity of reductive addition at 
the reducing end of oligosaccharides of a fluorescent 
label (Hounsell 1998). 

GC, particularly with MS detection (using Cl or El), 
is a highly sensitive technique used for monosaccharide 
analysis. The commonest derivatization procedures 
used to give the required volatility and specificity are 
(Hounsell 1994): 

Trimethylsilyl ethers of methyl glycosides 
Alditol acetates 

Partially methylated alditol acetates (PMAA) 

The last method using PMAAs is a particularly 
useful one which distinguishes not only the constituent 
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monosaccharides, but also how they are linked 
together. The method involves methylation (with 
NaH/methyl iodide) of all the free hydroxyl groups in 
an oligosaccharide, hydrolytic cleavage of the 
glycosidic bonds which reveals two more hydroxyl 
groups that were in the glycosidic bond, stabilization 
and volatization of revealed free hydroxyl groups 
by reduction and acetylation. In GC-EI MS analysis 
of the resulting PMAAs, the MS fragmentation gives 
preferential fragments between two methoxylated car¬ 
bons rather than methoxylated-acetoxylated and 
acetoxylated-acetoxylated thus identifying the original 
methylated hydroxyl groups and hence the position of 
linkage across the glycosyl bond. For poly/oligosac¬ 
charides where 0-methyl groups are to be found natu¬ 
rally, ethylation, for example, can be used instead. 

After successful PMAA analysis, two more param¬ 
eters are required to define oligosaccharide structure 
and one of these, the intermolecular sequence of 
monosaccharides, can be obtained by one of the soft 
ionization MS methods (see below) on the intact oli¬ 
gosaccharide with or without derivatization. The 
second is the anomeric configuration of all the mono¬ 
saccharides easily obtained by NMR (► Carbohydrate 
NMR Spectroscopy), but also inherent with proper 
standardization in the chromatographic mobility of 
oligosaccharides. 

Mass Spectrometry 

As discussed by Sinnott (2007) the following 
soft ionization methods are in common use for 
carbohydrates: 

FAB involves directing a high energy beam of Cs"^ or 
Xe atoms at the sample dissolved in a nonvolatile 
solvent such as m-nitrobenzyl alcohol. Matrix ions 
[M -I- H]"^ or [M -I- Na]"^ are commonly observed with 
the upper limit of M -i- of about 2000. Fragmentation 
at acetamido sugars gives sequence information 
(Hounsell 1998). FAB can be combined with TFC 
separation (by cutting out the bands and applying 
directly to the FAB probe), but not online with LC. 
ESI-MS is ideal as an hyphenated technique with 
HPFC as the eluent from the latter can be directly 
introduced into the ES nozzle which ejects eluent 
positive ions. A series of multiply charged ions are 
produced, the mass-to-charge ratios having to be 
deconvoluted to find the original mass. As well as 
for oligosaccharide sequence analysis, this can be 
additionally useful for glycoprotein analysis to 


quite high molecular weights. In-depth structural 
information can be obtained by fragmentation anal¬ 
ysis such as collision-induced dissociation (CID: 
e.g., see ► Glycosphingolipids). 

MAFDI occurs from a solution of the sample in 
a UV-absorbing matrix that is, for example, for 
carbohydrates a-cyano-4-hydroxycinnamic acid, 
2,5-dihydroxybenzoic acid, etc. A pulse of laser 
light directed on the matrix/sample mix gives rise 
to both positive and negative ions that can be further 
analyzed usually by time of flight (TOF) detectors. 
These methods are particular adaptations of 
MS techniques also used for proteins. The equivalent 
cross-peptide bond fragmentation, that is, cross glyco¬ 
sidic bond information described by the nomenclature 
B (from the nonreducing end of oligosaccharides) 
and Y (reducing end), is supplemented by cleavages 
across the glycosidic ring denoted by A (from the 
nonreducing end) and X (from the reducing end) with 
the bonds cleaved indicated by superscripts and the 
residue cleaved (e.g., 1,2,3 from the nonreducing end 
for A, B) indicated by subscripts. C and Z cleavages 
are those joining the glycosidic oxygen to a non- 
anomeric carbon (Sinnott 2007). 


Cross-References 

► Bacterial Lipopolysaccharide, OPS, and Fipid A 

► Bacterial Polysaccharide Structure and Biosynthesis 

► Carbohydrate Circular Dichroism 

► Carbohydrate Enzymology 

► Carbohydrate NMR Spectroscopy 

► Glycosphingolipids 

► Mass Spectrometry of N-Finked Carbohydrates and 
Glycoproteins 

► Molecular Dynamics Simulations of Carbohydrates 

► Mucin Biophysics 

► Pectin Biophysics 

► Polysaccharides: Biophysical Properties 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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TEM 

► Electron Microscopy: Classical Sample Preparation 


Temperature Jump 

► Kinetics: Relaxation Methods 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 

► Transient-State Kinetic Methods 

► Ultraviolet Resonance Raman (UVRR) Spectros¬ 
copy Studies of Structure and Dynamics of Proteins 


Template-Based Modeling 

► Homology Modeling of Protein Structures 

► Protein Structure Prediction and Structural 
Annotation of Proteomes 


Template-Free Modeling 

► Protein Structure Prediction and Structural 
Annotation of Proteomes 


Template-Mediated Polymerization 

► Molecularly Imprinted Polymers 


Terahertz Spectroscopy: Protein 
Dynamics 

Andrea Markelz 

Department of Physics, University at Buffalo, SUNY, 
Buffalo, NY, USA 

Synonyms 

Far-infrared spectroscopy of proteins; Suhmillimeter 
wave spectroscopy of proteins 

Definition 

Far-infrared or THz spectroscopy probes protein vibra¬ 
tional and librational excitations with energies in the 
range of 0.125-20 meV. Advantages and challenges of 
the method are described as well as general features of 
the spectrum. 

Basic Characteristics 

The FIR or THz spectral range extends from 
1-200 cm“\ to 0.125-25 meV, or to 1 cm-50 pm. 
This frequency range is often referred to as quasi 
optical, as the generation, detection, and manipulation 
of the light often use a combination of optical and 
microwave techniques. 

Two primary types of protein dynamics lie in this 
energy range and contribute to the THz dielectric 
response: local librational motions (hindered rotations) 
of side chains and solvent, and large-scale correlated 
motion of the polypeptide backbone (Markelz 2008; 
Markelz et al. 2007; Whitmire and Markelz 2003). 
Bulk water and surface water motions will also con¬ 
tribute to THz dielectric response (see ► Terahertz 
Spectroscopy: Solvation of Biomolecules) (Ebbinghaus 
et al. 2008). Both the local librational motions and the 
global correlated motions are expected to impact 
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biological function (Cozzini et al. 2008). In particular, 
there is some focus on the development of chemical 
computational models to incorporate local flexibility 
in ligand docking calculations, whereas changes in 
the access to global motions are considered a 
possible mechanism for allosteric regulation, where 
remote binding of the promoter/inhibiter can impact 
the large-scale backbone motions at the active site. 
This different impact on function of the protein pico¬ 
second motions motivates current exploration to 
develop techniques that can differentiate the local 
and global contributions in picoseconds dynamics 
measurements. 

Among the techniques used to measure global 
protein picosecond dynamics are neutron scattering, 
Raman scattering, and inelastic X-ray scattering. 
These techniques require on the order of 100 mg 
samples, dehydration, deuteration, or crystallization. 
Raman and most neutron scattering techniques detect 
both global motions and single particle excitations and 
so also are challenged by the need to differentiate the 
librational contributions from the global correlated 
motions (Fitter et al. 2006; Urabe et al. 1998). THz 
spectroscopy is a tabletop complementary technique 
that does not require large samples, deuteration, or 
crystallization. The technique couples to dipole active 
motions. 

Methods of modeling the THz absorption in general 
use molecular mechanics modeling (Brooks et al. 
1988; Zhang et al. 2008). Normal mode analysis and 
quasiharmonic calculations can be used for the global 
structural motions. The calculated spectrum while hav¬ 
ing many overlapping modes often has some structure 
in the THz range. The product of the absorption coef¬ 
ficient and refractive index a( co)n(co) that includes both 
local and global motions can be calculated using the 
dipole-dipole correlation from the calculated 
trajectories. 

Standard THz spectroscopic measurements include 
transmission measurements and attenuated reflection 
measurements (ATR). Measurements can be per¬ 
formed with discretely tunable FIR lasers such as 
molecular gas lasers, free electron lasers, and quantum 
cascade lasers. Broad spectroscopic measurements are 
typically performed using either a fourier transform 
interferometer (FTIR) (Genzel et al. 1984) or terahertz 
time domain spectroscopy (THz TDS). It is noted that 
THz TDS is a particularly strong technique as it allows 
for the complete determination of the dielectric 
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Terahertz Spectroscopy: Protein Dynamics, 

Fig. 1 Terahertz dielectric response for hen egg white lysozyme 
(HEWL) for native state (N-HEWL) and denatured state 
(D-HEWL). (a) Shows absorption coefficient and (b) shows the 
refractive index, both increase with denaturation (He et al. 2008) 

response of the material, that is, both the absorption 
and the refractive index, in a single measurement. 

The key challenges to these measurements are 
multiple reflection artifacts and water background. It 
is essential to test for reflection interference artifacts 
through additional measurements such as sample 
geometry, concentration, and temperature dependence. 
While there have been reports of narrow band reso¬ 
nances from biomolecular samples, the reports associ¬ 
ating the features with resonances from global 
vibrations did not thoroughly test for multiple reflec¬ 
tions and interference effects. For all reports that do 
include careful attention to sample uniformity, concen¬ 
tration, and temperature dependence, no narrow band 
resonances from global protein motions have been 
observed. Water background is addressed by (a) reduc¬ 
ing water content using lyophilized powder pellets or 
controlled hydration films or (b) removal of bulk water 
absorption from solution phase samples by reduction 
of temperature below 273 K. These sample prepara¬ 
tions can change the homogeneity and structure of the 
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protein. Secondary probes such as UV/Vis absorption, 
MIR absorption, circular dichroism, and fluorescence 
are used to determine the integrity of the sample. 

THz spectroscopic results to date include: (1) The 
linear THz dielectric response from unaligned pro¬ 
teins, either dry or hydrated, and over the 4.2-295 K 
temperature range does not have distinct narrow band 
features that one can readily identify with specific 
structural modes. In general, the imaginary part of the 
permittivity has a broad peak centered between 100 
and 200 cm” . (2) The response is strongly dependent 
on hydration, and this dependence is distinct from just 
the additive absorption from the librational motions of 
the water itself. It appears that the picoseconds 
response increases with hydration consistent with an 
increase in flexibility or plasticity with hydration. 

(3) The THz dielectric response increases with protein 
denaturing (see Fig. 1). This is likely due to the 
increase in the so-called biological water, that is 
water adjacent to the polypeptide chain for the random 
coil, or from added response of the librational motion 
of the surface side chains no longer constrained by 
internal hydrogen bonding in the native structure. 

(4) The response is dependent on ligand binding. 
Finally, the THz response is sensitive to the so-called 
dynamical transition at 220 K, associated with the 
onset of anharmonic structural motions. There is con¬ 
siderable evidence emerging that protein structural 
motions are contributing to THz measurements, but 
new techniques need to be developed to isolate these. 


Cross-References 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 

► Protein Folding: Molecular Dynamics Simulations 

► Sum Frequency Generation Vibrational 
Spectroscopy 

► Terahertz Spectroscopy: Solvation of Biomolecules 
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Terahertz Spectroscopy: Solvation of 
Biomolecules 
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Synonyms 

Hydration 


Definition 

Terahertz spectroscopy as a tool to study collective 
water network motions 
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Introduction 

Is there any water in a living cell? This may seem like 
a strange question, but water in the usual sense, that 
hydrogen-bonded bulk liquid melting at 0°C and boil¬ 
ing at 100°C may not exist within cells. With 
a cytoplasmic packing density of up to 400 mg/mL of 
protein, nucleic acids, lipids, carbohydrates, and small 
molecules or ionic compounds, there is not much space 
from any one of these molecules to its nearest neigh¬ 
bors, about 20-30 A only, depending on the molecular 
size. The 10-15 water layers that can fit into these 
spaces may be highly perturbed from the bulk and 
could have entirely different properties. However, the 
properties which are used to describe bulk or non-bulk 
like behavior of water in the environment of a solute, 
e.g., a protein, have to be defined. While the density of 
water may lie near the bulk value, the ever-changing 
orientation of molecular dipoles (negative at the oxy¬ 
gen atom, positive at the hydrogen atoms) and the 
dynamics of hydrogen bond rearrangements in the 
hydrogen bond network of water may still differ in 
fundamental aspects from the bulk. Such profound 
changes of water dynamics are observed when proteins 
are present. Such perturbations of the dynamics of 
water will decrease with increasing distance from the 
biomolecule, but at the same time, the number of 
molecules in a shell around the protein grows as the 
square of that distance. 

A large number of experimental techniques have 
been employed to study the properties of water solvat¬ 
ing biomolecules including: NMR, neutron scattering. 
X-ray crystallography, dielectric relaxation spectros¬ 
copy, and fluorescence spectroscopy. THz spectros¬ 
copy, among the most recent experimental methods 
brought to bear on this problem, is able to detect even 
small solute-induced changes of the collective water 
network dynamics at the biomolecule-water interface 
(Leitner et al. 2008). THz spectra of solvated saccha¬ 
rides reveal that the number of water molecules 
coupled dynamically to a saccharide, forming 
a dynamical hydration shell around it, is directly cor¬ 
related with the number of exposed oxygen atoms on 
the solute surface (Heugen et al. 2006). When studying 
the properties of water in aqueous solutions via THz 
absorption, proteins show an even longer range influ¬ 
ence on the surrounding water network dynamics 
(Ebbinghaus et al. 2007). The THz spectrum of 
a solvated protein is sensitive to mutations and depends 


on the surface charge and flexibility of the protein. 
Long-range coupling with the solvation shell appears 
most pronounced for native wild type proteins and 
decreases upon partial unfolding or mutation (Bom 
et al. 2009). This supports the thesis of a long-range 
dynamic coupling between biomolecules and solvent 
as imminent property of proteins. 

The THz Dance of the Water 

Water molecules in the bulk hydrogen bond with three 
to four other water molecules at any given time, as 
deduced from, e.g., neutron diffraction studies, but 
these hydrogen bonds are in a constant state of flux. 
Water molecules liberate (hindered rotational motion 
of a water molecule in its hydrogen-bonded environ¬ 
ment) in solution on a subpicosecond time scale, so 
that within a picosecond, a hydrogen bond between 
any two molecules may break and reform many times 
(Fig. 1). Diffusion of water molecules occurs on a pico¬ 
second time scale. Over this longer time scale, a given 
hydrogen bond between two water molecules may no 
longer hold, as reorientation and translation of a given 
water molecule favors new bond formation. All these 
motions lead to a fluctuation in the water network 
and thus to fluctuations of the water dipole moments 
on the sub-ps and ps time scale. Terahertz absorption 
measurements and theoretical studies simulating the 
Terahertz spectrum are most sensitive to the dynamical 
reorientation of dipole moments on the picosecond 
time scale - this is precisely what Terahertz 
spectroscopy probes (1 THz = 10*^ Hz = 1 ps“^). 
More precisely, the absorption can then be calculated 
as the Fourier transform of the autocorrelation function 
of the total dipole moment M of the system: 

0 {(a)) = F(®) J dt exp(ifflt) <M(0)M(t)>, 

where the bracket denotes the ensemble average and 
F(ffl) is a frequency dependent pre-factor. The THz 
absorption will increase at high temperatures, where 
the water network is more disorderly and fluxional and 
it will decrease at low temperatures with a rigidly 
imposed tetrahedral coordination in ice. If radiation 
at 1.5 THz penetrates a 100 pm water layer at 370 K, 
only 0.7% of the original radiation will arrive at the 
detector, whereas at 270 K 40% will reach the detector. 
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Rotational HB break 
+ rotational HB reforming 



Terahertz Spectroscopy: Solvation of Biomolecules, 
Fig. 1 In bulk water there is a constant breaking and closing 
of hydrogen bonds on a typical time scale of 1 ps. The underlying 
motions can be characterized as diffusion and rotation 


Therefore the THz absorption is a sensitive tool to 
probe changes in the water network dynamics. 

For water in the vicinity of a biomolecule the 
dynamical reorientation of the water dipole moment 
turns out to be affected over particularly long 
distances, up to several nm from the surface of the 
biomolecule. This reorientation arises as water mole¬ 
cules within the hydrogen-bonding network may be 
affected by hydrophilic or hydrophobic surfaces, by 
charges, or by sterical hindrance in their tumble around 
(Fig. 2), thus affecting the average time for making 
and breaking hydrogen bonds. Couple that with the 
radius-squared increase of the number of water mole¬ 
cules as one moves outward to more remote solvation 
shells, then huge numbers of water molecules can be 
affected by a single biomolecule. A simple picture of 
a solvated protein has to include the protein, nucleic 
acid or carbohydrate, bulk water (if far enough away), 
and hydration water with new physical properties, 
including a propensity for distinct THz absorption. 
The thus-far defined hydration water is not identical 
to the sterically bound water molecules probed by 
X-ray crystallography, NMR, or neutron crystallogra¬ 
phy. Therefore, a “dynamical hydration shell” is 
defined, which includes all water molecules that 
show water network dynamics distinct from the bulk, 
and thus a distinct THz absorbance. The influence of 
the protein on the water dynamics can include much 
more water molecules than the water molecules which 
are actually H-bonded to the protein (“static hydration 
radius”). 



Terahertz Spectroscopy: Solvation of Biomolecules, 
Fig. 2 The protein dynamics is coupled to the water network 
dynamics in its hydration shell 
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Terahertz Spectroscopy: Solvation of Biomolecules, 
Fig. 3 THz difference spectrometer: Displayed is the difference 
p-Ge spectrometer for simultaneous measurements of reference 
and sample absorbance. The emitted THz pulses are separated at 
a mirror chopper previous to probing the absorbing solutions in 
liquid cells. Both beams are reunited at a second mirror chopper 
and focused onto the detector 


Theoretical Concepts 

A more detailed picture of this collective THz dance 
has emerged from computational MD studies. Using an 
extensive set of ab initio MD simulation trajectories 
for bulk water the collective character and the long- 
range coupling over more than 5 A could be revealed 
(Heyden et al. 2010). Using classical MD the interplay 
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Terahertz Spectroscopy: Solvation of Biomolecules 


Terahertz Spectroscopy: Incoming ps THz pulse Outgoing attenuated and 

Solvation of Biomolecules, is focused onto sample shifted THZ pulse 

Fig. 4 KITA setup: THz 
pulses pass through a stopped- 
flow cell, where a mixer 
combines denatured ubiquitin 
with denaturant-free buffer to 
start refolding. The shape of 
the transmitted THz electric 
field is detected using a ZnTe 
crystal and a 800 nm gating 
pulse delayed by At. The 
difference AE of the electric 
field between denaturant-free 
1.5 mM protein solution and 
buffer is shown. For kinetics, 
the THz pulse is detected near 
the maximum electric field, 
and the mixer is scanned in 
time t with respect to the THz 
pulse 


of solutes such as sugars, salts, and proteins with water 
can be described. One way to quantify hydrogen bond 
rearrangements as probed in the THz is the introduction 
of time-dependent survival probabilities for bonds 
between water molecules using hydrogen bond correla¬ 
tion functions. Another way involves the calculation of 
the vibrational density of states (VDOS) in the THz 
range. Recently it could be shown that the solvation of 
a protein causes a blue shift in the vibrational density of 
states of the protein-water system compared to bulk 
water in the THz range. This results in a decrease in 
the VDOS below 50 cm“^ and an increase above 
50 cm“^ in agreement with the experimental observa¬ 
tion (Heyden and Havenith 2010). This blue shift goes 
along with a retardation of the solvation dynamics of the 
water molecules in the proximity of the protein. Accord¬ 
ingly, the average hydrogen bond lifetimes in the dynam¬ 
ical hydration shell, i.e., the average time it takes before 
the H-bond between two water molecules is broken, is 
found to be increased compared to bulk water. 

Terahertz Spectroscopy: Experimental 
Challenges 

THz spectroscopy has evolved considerably during the 
last decade (Schmuttenmaer 2004). For a long time. 


this spectral region between microwave and infrared 
technology was known as the “THz gap,” indicating 
a lack of powerful THz radiation sources. The previous 
inherent lack of bright THz sources can be attributed to 
the fact that it lies in the frequency range where clas¬ 
sical electronic devices end, and photonic devices such 
as diode lasers do not work because the corresponding 
frequency is smaller than the band gap of a semicon¬ 
ductor. Several new radiation sources have been devel¬ 
oped within the last decade including the p-Ge laser, 
which reaches inverse population within a subband; 
THz time domain spectrometers; and new synchrotron 
sources. The state of the art of THz sources has 
recently been reviewed (Tonouchi 2007). The high 
absorption of water in the THz range represents 
a major challenge for precision measurements of sol¬ 
vated biomolecules. Therefore, the initial studies were 
carried out using hydrated (and not completely sol¬ 
vated) samples (Markelz 2008). 

However, precise measurements of solute-solvent 
mixtures are required to detect the subtle changes 
in the THz absorption of few percent. It is there¬ 
fore crucial to maintain reproducible measurement 
conditions in respect of temperature and humidity. 
In a difference setup, the absorption in a sample 
and in the reference is recorded using subsequent 
THz pulses of a p-Ge laser (Fig. 3). This setup 
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allowed precise determinations of the absolute 
changes in the THz absorption by the solute (better 
than 0.5 %). 

Kinetic THz Absorption Spectroscopy (KITA) 

Using a time-domain spectrometer in combination 
with kinetic studies time-resolved measurements of 
changes in the couple protein-solvation dynamics dur¬ 
ing biological function has come into reach. An exper¬ 
imental setup which has been used to measure changes 
in the THz absorption during protein folding is shown 
in Fig. 4. Chemical denaturation of the protein is 
achieved by a surplus of denaturing agents in a stop- 
flow setup (Kim et al. 2008). KITA records the atten¬ 
uation of the THz radiation by the sample as a function 
of time with fs pulses. This allows to record any change 
with ps time resolution, which is beyond anything 
which can be reached with conventional THz-FTIR 
spectrometers. The simultaneous observation of 
changes in the collective water network during struc¬ 
tural changes of the protein will shed new light on the 
interplay between water and the proteins as a major 
constituent for life. 


Cross-References 

► Terahertz Spectroscopy: Protein Dynamics 
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Tethered Particle Microscopy 

Laura Finzi and David Dunlap 
Emory University, Atlanta, GA, USA 


Synonyms 

Single-molecule methods 


Definition 

Tethered particle microscopy (TPM) is a quantitative 
analysis of the Brownian motion of microspheres teth¬ 
ered to a glass surface by single polymer molecules. 

Basic Characteristics 

Tethered particle microscopy or the tethered particle 
motion (TPM) technique is a method used to detect 
conformational changes in single polymer molecules 
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Tethered Particle Microscopy, Fig. 1 Different TPM strate¬ 
gies. (a) The bead is tethered to the glass surface by a single 
DNA molecule by labeling each 5' end of the dsDNA with either 
a biotin or a digoxigenin molecule. The glass surface can then be 
coated with anti-digoxigenin while beads coated with 
streptavidin are commercially available. This type of sample 
preparation is best suited to study DNA-protein interactions 
that induce conformational changes which vary the tether length, 
(b) A motor enzyme-DNA stalled complex is nonspecifically 


adsorbed to the glass surface and the unprocessed DNA end is 
labeled with a bead using either the biotin-streptavidin or the 
digoxigenin-anti-dig pair. The enzyme is then reactivated and its 
processivity may be characterized by following the shortening of 
the tether (reduction in Brownian motion), (c) A streptavidin- 
coated bead is attached to a biotinylated motor protein, while 
the DNA is attached to the glass surface via the digoxigenin- 
anti-dig linkage. This experimental setup has been used to follow 
the activity of helicases 


(usually biopolymers) that tether microspheres to 
a glass surface and thereby restrict their range of 
Brownian motion. 

Applications. TPM has most often been employed 
to study nucleic acid-protein interactions and the 
activity of DNA motor enzymes. Most often, one end 
of a DNA molecule is anchored to the glass surface of 
a microscope flow chamber, and the Brownian motion 
of a microsphere attached at the other end is monitored 
versus time. Conformational or structural changes in 
the DNA induced by the protein(s) in the surrounding 
solution may alter the average end-to-end distance 
(effective length) of the DNA tether and consequently 
the amplitude of Brownian motion of the attached 
bead. Protein-mediated DNA compaction, looping, 
and bending are examples of such structural changes 
(Finzi and Dunlap 2010). In other cases, the activity of 
proteins that translocate on DNA to change the effec¬ 
tive tether length may be monitored. In one example, 
stalled complexes of a DNA-translocating enzyme and 
bead-labeled DNA were prepared and nonspecifically 
adsorbed to glass. Some of the enzymes retain enough 
activity to polymerize new nucleic acids and draw the 
DNA template through which changes the tether length 
and the Brownian motion of the attached bead (Yin 
et al. 1994). TPM can also be used to detect 


translocation in a transient tethering scheme, in 
which proteins attached to free microspheres briefly 
interact with a tethered DNA molecule. Translocation 
then changes the effective tether length and TPM sig¬ 
nal until the protein dissociates and the microsphere 
diffuses out of view (Dohoney and Gelles 2001). Sche¬ 
matic sample setups for these experiments are illus¬ 
trated in Fig. 1. The intrinsic stiffness of DNA 
molecules can be measured by TPM (Brinkers et al. 
2009). For this purpose, the amplitude of the Brownian 
motion of the tethered bead is measured and plotted for 
tethers of different lengths. The resulting data are 
fitted, most often with a worm-like chain model, 
using a single free parameter to determine the flexibil¬ 
ity of the chain. 

Type of signal and observables. Over time a bead 
tethered by a single molecule will visit locations within 
a hemisphere of a radius determined by the maximum 
extension of the tether. TPM data consists of a time 
series of AT-positions established for such a tethered 
bead in a stream of video data analyzed in real time. 
Such an ensemble of coordinates will form a circular 
cloud of points with the anchor point at the center. 
Plotting the two-dimensional distances of each posi¬ 
tion from the anchor point forms a distribution peaked 
at the most likely extension of the tether. If a protein 
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Tethered Particle Microscopy, Fig. 2 Loop formation and 
breakdown by TPM. The time trace of the TPM signal, as the 
time average of the 2-D displacement of the bead, is an indica¬ 
tion of the equilibrium between the unlooped and looped DNA 


intermittently induces a conformational change that 
reduces the effective tether length, a telegraphic-like 
signal results (Fig. 2) and the distribution of distances 
from the anchor point may become bimodal. Any 
long-range interactions between distant DNA seg¬ 
ments and the bound protein can greatly reduce the 
effective length of the tether and restrict the Brownian 
motion of the microsphere much more than the 
unencumbered DNA (without protein). The peaks of 
the distribution mark the average TPM signals for 
the two or more DNA states (with and without the 
induced conformational changes), and the relative 
weights of these populations may be used to determine 
the free energies separating these states. Other reac¬ 
tions and experimental configurations may produce 
gradually increasing or decreasing TPM signals 
for which measurements of rates and pauses in the 
activity of enzymes and lifetimes of different states 
can be made. 

Spatial and temporal resolution. TPM has a spatial 
resolution of about 10-15 nm and therefore may reveal 
conformational changes that cause a shortening of as 
little as 60-70 base pairs. The sensitivity decreases 
with increasing tether length, and useful tether lengths 
range from approximately 100-4,000 base pairs 
(Dunlap et al. 2011). Although the frame rate of the 



-► 

Time (s) 

molecule. In the simplest of cases, a telegraphic-like signal will 
be observed from which thermodynamic and kinetic parameters 
of the looping reaction may be extracted 


video camera is typically 25-30 frames/s, the temporal 
resolution is limited by the time required for a bead to 
adequately explore the accessible hemisphere space 
(Manghi et al. 2010) and by the analysis algorithm 
(Manzo and Finzi 2010). However, the video camera 
exposure time must be limited to prevent blurred 
images of the beads (Han et al. 2009). 

Materials and Methods. Glass microchambers may 
be assembled by placing a slotted film between a slide 
and a coverslip to contain a few tens of microliters of 
solution that can be introduced by capillary force 
(Dunlap et al. 2011). DNA tethers are attached to 
glass and bead surfaces sparsely coated with proteins 
having high affinity for ligands used to label the ends 
of the DNA. Most often, one end of the tether is labeled 
with biotin and the other with digoxigenin, so that 
streptavidin-coated microspheres can be tethered to 
an anti-digoxigenin-coated glass surface, or vice 
versa. Streptavidin- or antidigoxigenin-coated micro¬ 
spheres can be prepared in the lab or purchased directly 
from manufacturers. Differential interference contrast 
(DIG) microscopy is used to visualize and track the 
larger polystyrene microspheres ranging from approx. 
200-800 nm in diameter while dark field microscopy is 
used to visualize and track gold beads 20-50 nm in 
diameter (Milstein et al. 2011). 
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Particle tracking software. Either commercial or 
custom (Dunlap et al. 2011) software may be used 
for tracking single particles in real time. However, 
since the displacements exhibited by the micro¬ 
spheres are on the order of a few tens of nanometers, 
it is critical to prevent high-frequency mechanical 
vibrations in the instrument and eliminate slow 
mechanical drift by subtracting the motion of one or 
more reference marks from the video data (Nelson 
et al. 2006). 

Cost and Calibration. A TPM instrument consisting 
of a microscope with darkfield or differential interfer¬ 
ence contrast optics and a monochrome video camera is 
quite economical compared to instruments employing 
particle manipulation technologies that require 
nanopositioning piezoelectronics and micropositioning 
motors or lasers and associated optics for micromanip¬ 
ulation of the beads. For TPM, the noise level can be 
determined by analyzing the motion of beads firmly 
anchored on the glass, and a plot of the average ampli¬ 
tude of Brownian motion of beads tethered by mole¬ 
cules of different lengths suffices to calibrate the 
instrument. 

Advantages and disadvantages. Perhaps the most 
important feature for biophysical research is that TPM 
permits the detection of conformational changes in 
DNA in physiological solutions without artificial ten¬ 
sion. However, TPM cannot achieve the fine spatial or 
temporal resolution of some of the most sophisticated 
optical tweezers and becomes insensitive to small con¬ 
formational changes in DNA tethers more than several 
kilobases in length. 


Cross-References 

► Atomic Force Microscopy 

► High-Speed Atomic Force Microscopy (AFM) 

► Magnetic Tweezers 

► Optical Tweezers 

► Single-Molecule Methods 

► Worm-Fike Chain (WFC) Model 
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Definition 

Thermodynamics is the branch of physical science that 
deals with the relations between heat and other forms of 
energy, especially chemical energy. Nonidealty refers 
to the phenomenon whereby physical interactions 
between constituent molecules result in the storage of 
potential energy between them, thereby introducing 
a difference between the thermodynamic concentrations 
(activities) and their analytical counterparts. 

Introduction 

The notion of thermodynamic ideality originates in the 
classical theory of a gas comprised of noninteracting 
molecules of infinitesimal size and in the expectation 
of Raoult’s law that the saturated vapor pressure due to 
any volatile component of a liquid solution should 
show a simple dilution effect - it should be reduced 
below the value observed for the pure component by 
a factor corresponding to the mole fraction X of the 
component in the solution. Because it is often the 
behavior of an essentially nonvolatile solute compo¬ 
nent that is of interest, thermodynamic ideality is most 
often expressed in terms of the familiar expression for 
the chemical potential /r of a solute component, 

/2 = +RTlnX (1) 

where is the standard state chemical potential, R is 
the universal gas constant, and T is the absolute tem¬ 
perature. Over a limited range of low concentration, 
most solutes, even biological macromolecules, exhibit 
ideal solution behavior, meaning that experimental 


results conform precisely with Eq. 1. Furthermore, 
macromolecular interactions, hetero- as well as self- 
associative, can often be observed under conditions of 
low solute concentration in which any deviation from 
ideality can be regarded as a complicating factor of 
little practical significance. 

With appropriate adjustment of Eq. 1 is valid 
whether the solute concentration is measured 
on a mole fraction X, molal m (mol/kg), molar C 
(mol/L), or weight-based c (kg/L) scale because for 
sufficiently dilute solutions, all the other concentration 
scales are related to the dimensionless mole fraction in 
approximately linear fashion (Tanford 1961). How¬ 
ever, when measurements are made at higher concen¬ 
trations, there is invariably a difference between the 
actual concentration c and the thermodynamic activity 
z, that is, the effective concentration that gives the 
correct chemical potential through substitution into 
Eq. 1. This discrepancy between the actual and effec¬ 
tive concentration is said to constitute nonideal solu¬ 
tion behavior or thermodynamic nonideality. Positive 
deviations from ideality (z/c >1) stem most simply 
from the phenomenon of excluded volume: Molecules 
are of finite size and preclude the co-occupation of 
space with others. Negative deviations (z/c < 1) are 
symptomatic of cohesion between solute molecules. 
Deviations of either sign can arise due to alterations 
in the interactions of solute molecules with solvent 
molecules. 

Thermodynamic Description of a 
Single Solute 

Beyond a limited range of low solute concentrations, 
the linear relationship between concentration scales 
breaks down, and Eq. 1 loses its generality. Further¬ 
more, Eq. 1 becomes an inadequate description of the 
solute chemical potential because there is no rigorous 
specification of the thermodynamic constraints under 
which solutions of different concentrations are to be 
observed. In spectral and light-scattering methods, the 
dependence of the chemical potential of the solute A 
on its concentration is generally monitored under con¬ 
ditions of constant temperature and pressure. On the 
other hand, in techniques such as osmometry, equilib¬ 
rium dialysis, and gel chromatography, the depen¬ 
dence of on concentration is monitored under 
conditions of constant temperature and chemical 
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potential /t, of the solvent The significance of the 
elementary difference between these two thermody¬ 
namically distinct situations has been overlooked in 
most standard textbooks and a majority of experimen¬ 
tal studies. 

Solute Chemical Partition in Studies Involving 
Solvent Partition 

Under conditions of constant temperature and chemi¬ 
cal potential of the solvent, the thermodynamic activity 
Zj^ of a single solute is defined as 

I^A{T,lis,CA) = tfA{T.)-Ls)+RT\nzA ( 2 ) 

and is naturally a molar quantity because under 
those constraints, Eq. 2 is thermodynamically rigorous 
for an ideal solution with the molar concentration 
Ca replacing 7,4 (Hill 1968) . For that reason, it is usually 
most convenient also to express the activity Za 
as the product of Ca and a corresponding activity 
coefficient 7 , 4 , 

Za = IaCa (3) 

Adopting this convention renders possible the 
expression 

In 7/1 = 2BaaCa + BabCb + • • • (4) 

where Baa is the osmotic second virial coefficient 
reflecting solute nonideality of A as the result of its 
own concentration and where the term containing the 
additional virial coefficient Bab takes into account the 
consequence of a second solute B. These coefficients 
are found in the general expression for the osmotic 
pressure IT of a multicomponent solution 

YljRT — Ca + Cb + BaaC\ + BabCaCb + ■ ■ • (5) 

and are rigorously defined statistical-mechanical 
parameters that may be described in terms of the phys¬ 
ical interactions between pairs of solute molecules 
(McMillan and Mayer 1945). 

Solute Chemical Potential Defined Under the 
Constraint of Constant Pressure 

Under the commonly encountered constraints of con¬ 
stant temperature and pressure, the expression for sol¬ 
ute chemical potential becomes (Hill 1968) 


jij^{T,P,mA) = + RTlnaA ( 6 ) 

and the corresponding thermodynamic activity of sol¬ 
ute qa reduces to the molal concentration ma under 
ideal conditions. On that account, it is usually 
expressed in terms of an appropriate activity coeffi¬ 
cient yA as 

oa = yAtriA (7) 

and the counterpart of Eq. 4 is 

InjA = ^.CAAfnA + Cabhib + • ■ ■ (8) 

where the coefficients Caa and Cab come from the 
expression for the dependence of the solvent chemical 
potential on solute molalities 

- [^l,{T,P,mA,mB ...)- p:iT,P)]/RT 
= niA + mB + CAAm\ + CABmAm + ■■■ 

The coefficients Caa and Cab are not always easily 
evaluated in statistical-mechanical terms, but in prac¬ 
tice, they can usually be related to the corresponding 
osmotic quantities by the expressions 

Caa = (Baa - Mava)p, (10a) 

Cab = {Bab — MaVa — MbVb)Ps ( 10 b ) 

in which M, and v, are the molar mass and partial 
specific volume respectively of solute species i and 
solvent density p^ takes into account the different 
dimensions of Baa (L/mol) and Caa (kg/mol). Strictly 
speaking, Eqs. 10a and 10b are valid only for incom¬ 
pressible systems that show no volume change on 
mixing, meaning that the partial specific volumes v, 
are composition-independent constants - a reasonable 
approximation for most aqueous solutions of macro¬ 
molecules. Under this assumption, the molarity- 
molality relationship, 

Ca = mApJ[l + p,{MAVAmA + MbVbhtb)] (11 a) 

mA = Ca/[p.5(1 -MavaCa - MbVbCb)] (Hb) 

can be derived on the basis that the separate compo¬ 
nents of a solution occupy volume in fixed ratios. 
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Equations 11a and 11b may be used to express and 
7 ^ in terms of molal concentration m^, or a^, and yj^ in 
terms of molar concentration Ca- 

Thus, the concept of an activity coefficient as 
a dimensionless ratio of concentrations, yA = ^aICa 
or Ja = OAlniA, only applies when the experimenter has 
chosen the appropriate concentration scale for the ther¬ 
modynamic activity that is defined by the constraints 
of the experiment. The quadratic concentration depen¬ 
dence of an experimental quantity may be measured 
using any chosen scale and the coefficient treated as an 
operational parameter devoid of any mechanistic sig¬ 
nificance. However, the interpretation of second virial 
coefficients on a statistical-mechanical (molecular) 
basis requires careful consideration to be given to 
(1) the nature of the thermodynamic activity (z^ or 
Qa) that has been determined, (2) the concentration 
scale (Ca or Wa) that has been adopted, and (3) the 
virial expansion that must therefore be employed. 


kT 


1 fiOOZiZj exp [- K (x-R, ) ] 

8TlLlM{l-kKRi)(^l+KRj'^X 


(13) 


where Z, denotes the net charge on species i, k is the 
Debye-Huckel inverse screening length, and is the 
molar ionic strength of the solvent. 

An absence of charge on a solute (Z, and/or Zj zero) 
simplifies greatly the calculation of second virial coef¬ 
ficients in that the magnitude of is then governed 
solely hy the radii of the two species via the first term in 
Eq. 12a. For this simplified situation, scaled particle 
theory (Reiss et al. 1959) affords an equivalent statis¬ 
tical-mechanical approach for the evaluation of the 
second virial coefficient. Scaled particle theory is 
restricted to considerations of uncharged species, 
whereupon its extension to accommodate charged sol¬ 
utes (Minton and Edelhoch 1982) requires empirical 
adjustment of the magnitudes of R, and R, to accom¬ 
modate the additional contribution to Bjj that stems 
from the effects of electrostatic repulsion. 


Statistical-Mechanical Evaluation of Second 
Virial Coefficients 

For two spherical species (/, j) with radii R, and Rj, the 
second virial coefficient can be expressed in terms of 
the potential of mean force Ujj{x) between the mole¬ 
cules when their center-to-center distance is x 
(McMillan and Mayer 1945; Hill 1968) and thereby 
calculated through numerical integration of the second 
term in the expression 

Bij = 2(2 - dij)nL 



R,+Rj 


fij{x) = e\p[-Uij{x)/{kT)] - 1 (12b) 


Thermodynamic Activity Defined in 
Sedimentation Equilibrium and Static Light 
Scattering 

This discussion of thermodynamic nonideality in 
nonassociating systems concludes with considerations 
of the use of a thermodynamic activity for describing 
the concentration dependence of the molecular mass 
that is measured in two commonly employed tech¬ 
niques: sedimentation equilibrium and static light 
scattering. 

Sedimentation Equilibrium 

Thermodynamic nonideality in sedimentation equilib¬ 
rium was traditionally quantified from the concentra¬ 
tion dependence of the apparent molar mass of 
a homogeneous protein by means of the expression 
(Schachman 1959) 


where k is the Boltzmann constant, and the inclusion 
of Avogadro’s number (L) defines the virial coeffi¬ 
cient on a molar basis. The Kronecker delta (5^ is unity 
for identical molecules (/ = j) and zero for dif¬ 
ferent species (; ^ j). The exponent in Eq. 12b is 
readily calculated for molecules with a spherically 
symmetric surface charge distribution as (Scott 
et al. 2010) 


1/iHj^ = (1 /Ma)[ 1 -f CA{d\nyA/dcA)] 

= (1/Ma)[1 + 2B2Ca + ...] (14) 

where the omission of constraints on the differential 
term reflected unawareness of the existence of differ¬ 
ently defined thermodynamic activities and B 2 was an 
operational second virial coefficient derived from the 
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slope of the concentration dependence of . The 

insufficiency of Eq. 14 only came to light some 
30 years later (Wills and Winzor 1992) in a demonstra¬ 
tion that the natural constraints are constant tempera¬ 
ture and solvent chemical potential, whereupon the 
basic sedimentation equilibrium equation for a single 
macromolecular solute with a concentration- 
independent partial specific volume is unequivocally 

— VAPs)(o^rdr = = RTdlnzA (15) 

An improved version of the proof of Eq. 15 appears 
in a subsequent review article (Wills and Winzor 
2005). Equation 15 is readily integrated between radial 
limits I'p and r to give 

ZA{r) = ZA{rf)^Ai.r) (16a) 

i/r^(r) = exp[MA(l - VAP,)o?{r^ - rl)/{2RT)] 

(16b) 

in which ZAirp) is the thermodynamic activity of solute 
at a chosen reference radial distance 

Several important features emerge from Eqs. 15, 
16a and 16b. First, the measured distribution in sed¬ 
imentation equilibrium reflects the molar thermody¬ 
namic activity Za (or its weight-based counterpart 
MaZa), and hence, the constraints on the activity coef¬ 
ficient term of Eq. 14 have been positively identified. 
Although that expression clearly allows characteriza¬ 
tion of the second virial coefficient (now recognized 
as Baa) of a single solute solution from the concen¬ 
tration dependence of apparent molar mass, a more 
rewarding procedure stems from combining Eqs. 3, 4, 
16a and 16b as 

^^(/-) = [Q(r)/y^(/>)] exp[2B^CA(r) + ...] (17) 

The second virial coefficient, as well as the 
reference thermodynamic activity, ZAirp), can thus be 
obtained by nonlinear curve fitting of C. 4 (r)] 

data from a single sedimentation equilibrium 
experiment of meniscus-depletion design (Winzor 
etal. 2001 ). 

Static Light Scattering 

The characterization of a homogeneous protein solu¬ 
tion by using the technique of light scattering is also 


usually based on Eq. 14 with the apparent molar mass 
and thence second virial coefficient A2 obtained 
from the expression 

Kca/R0 = 1 /M7 =1/Ma+ 2 A 2 CA + ... (18) 

where K is an optical constant related to solvent refrac¬ 
tive index «s and wavelength of plane-polarized light 
and Ro is the excess Rayleigh scattering ratio at angle 0 
to the incident beam. Careful analysis (Winzor et al. 
2007) reveals that the second virial coefficient 
extracted from light-scattering measurements 
contains terms that depend on the derivatives 
(first /a and Xaa second) of «s with respect to solute 
concentration in the limit ^ 0 

A2 = [Caa/Ps + A 1 aVa- MAilA^s) 

- 2 Ma{Xaa/Xa)]/MI (19) 

In practice, the refractive contributions to A2 (final 
terms in Eq. 19) are often small (Winzor et al. 2007), 
and when Eqs. 10a, 10b, 11a and 11b are valid, the 
second virial coefficient obtained from light scattering 
from a two-component solution is effectively identical 
to the osmotic second virial coefficient Baa- 

However, at least in studies of biological macro¬ 
molecules, the “solvent” usually contains low molec¬ 
ular weight buffer and supporting electrolyte 
components, as well as the solvent itself (water), 
requiring an analysis of solutions containing more 
than one solute component (Kirkwood and Goldberg 
1950; Stockmayer 1950). Although the weak scatter¬ 
ing due to these secondary low molecular weight 
solutes is easily subtracted, they have a marked effect 
on the results obtained. In the case of a single addi¬ 
tional solute component B, the Debye plot of 
the excess scattering ARg due to macromolecule has 
the form 

KsCa/ ARg = Ai/Ma + ( 2 A 2 -t- Qcb)ca + .. . (20) 

where is modified to take account of component B, 
the intercept has a linear dependence on the concen¬ 
tration of B, 

Al = I + 2 [{Cab/P s + )^AVA)'yo/MB — Xab/Ib\cb 

( 21 ) 
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and with To = Xb^b/Xa^b, under the assumptions of 
Eqs. 10 and 11, the constant Q has the form 

= {Baab - Bis + + 2'1'oAi + A2)/{MaMb) 

( 22 ) 

The constants Ai and A 2 have complicated forms, 
but they make minor contributions to Q compared with 
the dominant thermodynamic part, which is negative 
when molecular interactions are solely of the excluded 
volume kind. Thus, it is dangerous to interpret the 
slope of a Debye plot as a measure of the osmotic 
second virial coefficient of a macromolecule because 
it is likely to have negative contributions, feigning as 
attractive A A macromolecular interactions, due to 
the presence of one or more low molecular weight 
solutes. 

Similar problems do not arise in sedimentation 
equilibrium, where the contributions to nonideality 
due to essentially nonsedimenting low molecular 
weight components are incorporated into the apparent 
buoyant molecular mass of the macromolecule in the 
supplemented “solvent” (Eisenberg 1976; Jacobsen 
et al. 1996). Because no other technique allows the 
separation of solute-solvent effects from those 
reflecting solute-solute interaction(s), sedimentation 
equilibrium becomes the procedure of choice for inter¬ 
pretation of thermodynamic nonideality on the statis¬ 
tical-mechanical basis of the potential of mean force 
between macromolecules. 

Thermodynamic Nonideality and Solute 
Self-association 

A solute undergoing reversible self-association can be 
designated as a single thermodynamic component on 
the grounds that specification of the total concentration 
suffices to specify the composition, which is governed 
by the association equilibrium constant. Therefore, one 
approach to the thermodynamic characterization of 
such systems is to retain the single-solute concept by 
describing the thermodynamic activity as the product 
of total solute concentration and an activity coefficient 
that takes into account the consequences of nonideality 
arising from interactions within clusters of two or more 
macromolecules, including even strong attraction 
involving noncovalent forces (Hill and Chen 1973; 
Wills and Winzor 2005). However, custom dictates 


the consideration of clusters resulting from strong 
short-range “associative” forces as separately identifi¬ 
able molecular species, whereupon the number of 
“components” is increased to incorporate the different 
oligomeric states of the solute. In terms of this 
notional, nonthermodynamic distinction between asso¬ 
ciative and nonassociative macromolecular interac¬ 
tions, the thermodynamic activity of a reversibly 
formed cluster of n monomers is the product of an 
association constant K„ and the thermodynamic activ¬ 
ity of monomer raised to the appropriate power. 

Consideration of Different Oligomeric States as 
Separate Species 

For experimental situations in which two-state solute 
self-association (nA 2 =^ C) is being examined under the 
constraints of constant temperature and solvent chem¬ 
ical potential, the molar thermodynamic association 
constant is defined by the expression 

K„ = zdA = IcCcKuCaT (23) 

where z, and C, refer to the respective molar activity 
and concentration of species i. Alternatively, expres¬ 
sion of concentrations on a weight basis, c,, leads to 
definition of the more commonly evaluated weight 
counterpart of as 

= nK„M''d = IccdbACAT (24) 

Provided that nearest-neighbor interactions suffice 
to describe the thermodynamic nonideality, the activ¬ 
ity coefficients defined on the statistical-mechanical 
basis of the potential of mean force become 

= qiv^IIBaaCa + BacCc] 

= txd2.{BAAlMA)cA + {BacIMc)cc\ (25a) 

yc = exppBccCc +BacCa] 

= exp[2(Bcc/4fc)cc + {BacIMa)ca] (25b) 

where the second virial coefficients are defined in 
terms of species radii (Ra, Rc) by expressions analo¬ 
gous to Eqs. 12a and 13 for each species. 

Such composition dependence of the activity coef¬ 
ficients clearly presents an impediment to ready deter¬ 
mination of the association constant (Eq. 24) from 
experimental data obtained either as the dependence 
of monomer activity M^z^ or concentration Ca upon 
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total solute concentration c. In either event, the exper¬ 
imental results require fitting to the expression 

C = Ca+Cc= MaZa/Ja + Xn{MAZA)"lyc (26) 

where the determination of values for yA and yc is 
contingent upon the adoption of an iterative approach 
in order to ascertain the correct composition and 
hence their correct magnitudes. Analysis of the 
experimental data on the basis of thermodynamic ide¬ 
ality (T/i = Tc =1) affords an initial estimate of X„, 
which is then rehned iteratively with revised estimates 
of activity coefficients based on the consequent depen¬ 
dence of Ca or za upon c (Wills et al. 1980). 

The necessity for this iterative approach has been 
avoided in computer software for the characterization 
of solute self-association from sedimentation equilib¬ 
rium distributions by assuming that the activity coeffi¬ 
cients of the different oligomeric states of the solute 
can be described in terms of total weight concentration 
c by the Adams-Fujita relationship, y, = exp{iMABc), 
where B is an empirical constant to be evaluated from 
the analysis. The limitations of this approach have 
recently been exposed (Scott and Winzor 2009). 

Consideration of a Self-associating Solute as 
a Single Thermodynamic Component 

An alternative to the above approach for solute dimer¬ 
ization (2A C) is to regard the solute self-association 
as a form of thermodynamic nonideality, whereupon 
the dimerization constant becomes incorporated into 
the virial coefficients of the series expansion in con¬ 
centration. In the initial attempt to develop this strat¬ 
egy, Hill and Chen (1973) expressed the monomer 
activity as a series expansion in total solute concentra¬ 
tion, but the approach is restricted to the study of 
extremely weak self-association because of 
a propensity for the virial series expansion to diverge 
with increasing values of X 2 . 

Realization that the activity of monomer is an 
experimentally accessible parameter in sedimentation 
equilibrium studies of self-association has prompted 
derivation of the alternative relationship (Wills et al. 
1996; Wills and Winzor 2005) 

c = MaZa + —X-2\(MaZaY 

+ (3/2) [XzBac/Ma - A(BAA/MA.f+BAAAlMl.]^ (MaZa? + ■■■ 

( 27 ) 


which overcomes the problems of slow series conver¬ 
gence. The parameter the third virial coefficient 
for self-interaction, may be calculated from the 
expression 

Baaa = 1607i2l2r^/9- 

j-oo j-oo rnj+rn 

^ ij I ik I fjk^~jkBj~jk 

Jo Jo J\rij-ni,\ 

(28) 

As before, the Mayer/-function for monomer mol¬ 
ecules i and j separated by distance x,, is given by 
Eq. 12b with Uij(Xjj)l{kT) described by Eq. 13. 
Although extension of Eq. 28 to higher powers of 
monomer activity is possible, its use for the character¬ 
ization of systems with self-association beyond dimer is 
restricted by the unavailability of satisfactory expres¬ 
sions for the higher-order virial coefficients in situations 
where the solute bears net charge (Scott et al. 2010). 

Summary 

Thermodynamic nonideality has been considered in 
the context of its interpretation on the statistical- 
mechanical basis of excluded volume. Quantitative 
allowance for the effects of nonideality is accom¬ 
plished more readily in experiments where the thermo¬ 
dynamic activity is being monitored under the 
constraints of constant temperature and solvent chem¬ 
ical potential, as in techniques such as osmometry, size 
exclusion chromatography, and equilibrium dialysis. 
Under those conditions, the thermodynamic activity is 
a molar parameter that relates directly to the concen¬ 
tration scale (g/L) adopted in most experimental stud¬ 
ies. The molar thermodynamic activity is also pertinent 
to sedimentation equilibrium studies of buffered pro¬ 
tein solutions, but not to static light-scattering mea¬ 
surements because of the necessity to regard 
components of the buffer system as additional solutes. 
Because sedimentation equilibrium is the only tech¬ 
nique that allows separation of the consequences of 
solute-solvent interactions from those reflecting sol¬ 
ute-solute interactions, it is clearly the procedure of 
choice for incorporating allowance for thermodynamic 
nonideality into the characterization of nonideal solute 
self-association on the statistical-mechanical basis of 
excluded volume. 
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Cross-References 

► Sedimentation Equilibrium Analytical 
Ultracentrifugation 
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Thermodynamics of Biomolecular 
Interactions 
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Synonyms 

Application of thermodynamic data to biological 
understanding; Calorimetry; Thermodynamic charac¬ 
terization of binding 

Definition 

Quantification of biomolecular interactions is essential 
for understanding biology. Thermodynamic measure¬ 
ment provides insight into the energetic contribution in 
biomolecular interfaces and can be correlated with 
structural perturbation on going from the free to the 
bound state. 

Introduction 

To fully characterize the nature of a biological equilib¬ 
rium interaction, it is necessary to obtain data on the 
thermodynamic parameters and the structure detail of the 
components in their free (unbound) and complex 
(bound) states. In combination, these data can be assim¬ 
ilated to enable definition of the mechanism of complex 
formation. Thermodynamic data gives quantification of 
the energetic properties of the interaction, and coupled 
with a high-resolution depiction of the structure, the 
individual interactions between atoms in the biomolec¬ 
ular interface can be assessed. Understanding the rela¬ 
tionship between thermodynamic data and structural 
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detail for molecular interactions forms a cornerstone of 
biochemical/biophysical research and has become of 
fundamental importance in areas such as computer- 
aided ligand design and medicinal chemistry in the 
search for new targets and inhibitors. Currently, thermo¬ 
dynamic data can be derived from high-sensitivity calo¬ 
rimetric instraments, and structural detail at high 
resolution is available from both X-ray crystallographic 
and nuclear magnetic resonance spectroscopic sources. 
Thus, using the combination of the energetic data and the 
structural detail of the interacting molecules in both the 
free and the complex states, we should be able to fully 
quantify an interaction at the atomic level. This should 
facilitate an understanding of the key elements which 
drive the coming together of large molecules in biolog¬ 
ical processes. In addition, it should enable the prediction 
of structural and/or thermodynamic ramifications of bio¬ 
molecular complex formation (e.g., predict how 
a molecule would bind to a given binding site or predict 
the energetics of an interaction). In reality, however, our 
understanding and interpretation of biomolecular inter¬ 
actions in this way is somewhat naive. This is largely 
based on the fact that the interaction of two biomolecules 
is complicated by a number of factors. Recently it has 
become increasingly clear that on forming interfaces, 
large molecules undergo stmctural perturbations 
(Ladbury and Williams 2007). These changes range 
from gross conformational rearrangement (i.e., large 
portions of structure change topology and formation of 
new interactions within themselves) to subtle changes in 
dynamics of side chain atoms. In addition to this, the 
effects of solvent rearrangement are not easily observed 
and have potentially dramatic consequences on the quan¬ 
tification of an interaction. Other possible contributions 
to the interaction energetics stem from ionization and 
protonation events and possible chemical modification. 
As a result, we are not yet at a stage where close corre¬ 
lation of thermodynamics and stracture is possible. The 
state of the art in this area is described below in terms of 
the calorimetric instmmentation used to measure the 
thermodynamics and the way in which this can be cor¬ 
related to structural detail. 


Basic Thermodynamics Associated with 
Equilibrium Biomolecular Interactions 

In this entry, a brief, simplistic overview of the ther¬ 
modynamic parameters required for describing 


a biomolecular interaction is summarized. A more 
detailed expose of these are given elsewhere in this 
encyclopedia. It should be emphasized that only inter¬ 
actions which are in equilibrium are described and the 
kinetics of forward or reverse interactions do not need 
to be considered in this entry. 

The affinity or strength of an equilibrium is quanti¬ 
fied by the ratio of concentrations of reactants and 
products stated as the equilibrium constant. Com¬ 
monly, this is represented by the equilibrium constant 
(Keq). Thus, the Kgq for the interaction 
Reactants ^ Products is: 

[Products] 

[Reactants] 


where [Reactants] and [Products] are the molar con¬ 
centrations of the interacting biomolecules. The equi¬ 
librium constant generally used in describing 
biological interactions is the equilibrium dissociation 
constant, K^. For an interaction of two biomolecules, 
A and B forming a complex AB, the Kj is given by: 


Kd 


MM 

[AB] 


( 2 ) 


Typically, the for biologically relevant interac¬ 
tions lies in the range 10 pM (for the weaker interac¬ 
tions) to 10 nM (for the stronger interactions). 


The Change in Gibbs Free Energy 

The change in Gibbs free energy, AG°, provides an 
evaluation of the affinity under standard state condi¬ 
tions. This value is thus comparable to other values 
measured under the same conditions. This term is 
related to the Kgq by the following equation: 

AG° = -RT In Keq (3) 

where R is the gas constant (8.32 J.mol” *.K“ *) and T is 
the absolute temperature. AG° is quoted in units of 
J/mol (or most likely kJ/mol), or cal/mol 
(1.00 cal = 4.18 J) is sometimes adopted. The ° sig¬ 
nifies a standard state quantity. Although for the most 
part affinities are given in terms of the equilibrium 
constant, the AG° provides a term which is comparable 
between biomolecular systems. Interactions with 
higher affinities (i.e., “tighter” binding) have more 
negative AG° values. The value of the thermodynamic 
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terms obtained from the ITC experiment is often not 
fully appreciated and sometimes over-interpreted. The 
AG° term provides a concentration-independent term 
which allows comparison of the affinities of interac¬ 
tions at a given temperature and set of conditions. 

The change in the Gibbs free energy term is made 
up of contributions from change in enthalpy AH° and 
change in entropy AS°. These are related by the 
equation: 

AG° = AH°-TAS° (4) 

The Change in Enthalpy 

The change in enthalpy corresponds to the transfer of 
heat energy to or from the system and is the result of 
the net making or breaking of non-covalent bonds on 
forming the biomolecular complex. Thus, this can be 
thought of as the net heat associated with bond forma¬ 
tion or removal. On going from the free states of the 
interacting molecules to the bound state, bonds (such 
as hydrogen bonds, van der Waals interactions, etc.) 
are formed between these molecules. Forming a non- 
covalent bond is an exothermic event, that is, heat 
energy is given out and the resultant change in 
enthalpy is negative (favorable). It should be empha¬ 
sized that this value includes not just the bonds asso¬ 
ciated with ligand binding in the appropriate site, but 
also those due to solvent rearrangement and any con¬ 
formational change experienced by the components of 
the interaction. Thus, on forming the biomolecular 
complex, the interacting molecules form an interface 
from which the previously interacting solvent mole¬ 
cules are excluded. Removing these solvent molecules 
requires the endothermic breaking of non-covalent 
bonds - giving rise to a positive AH°. The net change 
in enthalpy is thus the sum of the negative and positive 
contributions from making and breaking of non-cova¬ 
lent bonds. As a result of the potentially large number 
of bonds made and broken in the formation of biomo¬ 
lecular interactions the AH measured is complex. This 
term can best be appreciated when high-resolution 
structural detail is available of the interacting mole¬ 
cules in the free and bound states. This is because the 
net change in bonds in the binding site can be visual¬ 
ized. Comparison of the AH terms for interactions with 
a given biomolecule and subtly modified ligands can 
give some insight as to the importance of specific, non- 
covalent bond formation. This will be important in 


terms of understanding of intermolecular binding at 
an atomic level - see below. 

The Change in Entropy 

The change in entropy is far less easy to define in terms 
of tangible energetic terms (a more detailed descrip¬ 
tion of this term is provided elsewhere in this encyclo¬ 
pedia). Entropy is usually considered in simplistic 
terms as the thermodynamic state function that is 
a measure of the tendency of a system to disorder. 
This rather loose definition can be applied on the 
atomic/molecular level in terms of whether the system 
becomes more ordered or disordered on going from 
free to bound state. In other words, it can be considered 
as the contribution to the AG° derived from the net 
change in degrees of freedom of the interacting com¬ 
ponents and the solvent. On an atomic level, this term 
is favorable when there is an increase in vibrational, 
rotational, and translational degrees of freedom. Con¬ 
versely, when atoms of the interacting components 
become restricted, say for example when interacting 
within the context of an intermolecular interface, the 
change in entropy is unfavorable. Solvent molecules 
interact with protein surfaces, forming non-covalent 
bonds with polar and charged groups and/or packing 
in hydrogen-bonded formations to reduce their inter¬ 
action with nonpolar (hydrophobic) surfaces. In many 
biological interactions, a major contribution to the AS 
term is the effect of removal of solvent from the 
interacting interfaces, that is, burying two surfaces in 
a biomolecular complex results in the removal of these 
surfaces from exposure to solvent. Thus, the previ¬ 
ously bonded solvent molecules become liberated 
into bulk solvent. The AS° term is very ambiguous 
with respect to the definition of an interaction. One 
of the most important correlations of this term with 
structural effects on complex formation is based on 
the burial of hydrophobic surface area. However, 
this is conjectural and different opinions prevail as to 
the effect of apolar and polar surface area burial on 
the entropic contribution to complex formation. 
The organization, and hence restriction of the degrees 
of freedom, of water molecules on the surface 
of a biomolecule has been shown to extend to several 
layers of hydration. As a result, the burial of 
this surface in the formation of a complex interface 
results in the liberation of these waters. This can pro¬ 
vide a major contribution in terms of the AS° to 
the AG°. 
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Isothermal Titration Calorimetry (ITC) 

Determination of the thermodynamic parameters of an 
interaction can be achieved using a range of methods in 
which the extent of binding of one molecule to another 
can be probed over the course of a titration by a change 
of property of the system (e.g., spectroscopic signal). 
Unlike any other method used to determine thermody¬ 
namic parameters, isothermal titration calorimetric 
(ITC) methods use the change of heat as a probe. 
Thus, as one molecule is titrated into the other, the 
heat taken in (endothermic) or given out (exothermic) 
allows calculation of the extent of an interaction under 
a given concentration regime. The development over 
the last two decades of high-sensitivity isothermal 
titration calorimetric equipment has meant that 
a dramatic advance in the gathering of accurate 
thermodynamic data is now possible, and thus we 
seem poised for rapid progress in this field. 

The exchange of heat with the environment is 
a ubiquitous property of all equilibrium interactions. 
A modern ITC instrument can measure the heat of 
interaction (or molar enthalpy AH) with great accuracy 
(typically the minimum heat pulse is of the order of 
0.1 peal). As injections are made of one-component 
molecule into another, the concentration of the com¬ 
plex formed can be determined at any point in the 
titration, and thus, the equilibrium binding constant, 
Kb (= 1/Kd) can be determined (see below). If the AH 
and the Kb are known, then a full thermodynamic 
characterization of the interaction at the experimental 
temperature can be determined from the relationship in 
Eqs. 5 and 6. 


AG = 

-RT In Kb 

(5) 


AG-AH 


A5 = 

T 

(6) 


Thus, from the Kb we can determine the change in 
Gibbs free energy, AG, at the experimental tempera¬ 
ture, T, using Eq. 5, where R is the gas constant. If we 
know AG and AH, then the change in entropy, AS, for 
the interaction can be determined from Eq. 6. 

One further term that can be determined from the 
ITC experiment is the change in constant pressure heat 
capacity, ACp, on going from the free to the bound 
state. This term is the temperature derivative of the 
AH. Therefore, performing ITC experiments over 


a range of temperatures will provide a value for this 
term (as shown in Eq. 7). 

ACp = (A//r2 -^HTl)/T2-T, (7) 

where Ti and T 2 are two different experimental 
temperatures. 

The ACp term has been the most widely used term 
in relating the measured thermodynamics of an 
interaction to structure. This term has been shown to 
correlate in some cases with the burial of surface area 
(e.g., Spolar and Record 1994), and several groups 
have optimized this correlation to enable some predic¬ 
tive capability of the structural or thermodynamic 
changes on binding of biomolecules. Nonetheless, 
some discrepancies exist with this correlation and 
it should be used with caution. 

The thermodynamic parameters obtained from ITC 
experiments are often given the subscript “obs” to 
indicate that the value includes all the observable con¬ 
tributions from going from the free to the bound state. 
For example, as previously mentioned, the measured 
AH includes not only the heat of formation and break¬ 
ing of non-covalent bonds from the binding of two 
biomolecules, but also that associated with solvent 
interactions, conformational changes in the biomole¬ 
cules, protonation events, etc. 

There is some debate as to whether the standard 
state notation ° (see above) should be used for thermo¬ 
dynamic terms derived from ITC experiments. 
Generally, the decision on this depends on how the 
user views approximation of standard state conditions 
in a calorimeter cell throughout the course of a titra¬ 
tion. Most in vitro experiments are done at constant 
temperature and pressure in dilute aqueous solution 
near neutral pH. The standard state of water is defined 
as that of pure liquid, which means that the activity of 
water can be set to 1. This is justified by assuming that 
in a typical in vitro experiment the change in concen¬ 
tration of water is negligible. However, experiments 
which more closely approximate conditions in a living 
cell are likely to deviate from these conditions and thus 
from standard state. 

ITC Instrumentation 

In a typical ITC instrument, the change in heat energy 
(and hence change in enthalpy) on titrating two com¬ 
ponents is measured in a small reaction cell (with an 
approximate volume of 0.2-1.5 ml). Two identical 
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cells, one for the interaction (the sample cell) and one 
which acts as a reference (see Fig. 1), are located in 
a jacket (usually called the adiabatic jacket, although it 
is not strictly adiabatic) which is kept at a temperature 
below that at which the experiment is to be conducted 
(typically by about 5-10°C). 

The sample cell has an opening into which a syringe 
is inserted via which aliquots of one of the interacting 
solutions is injected into the other. In some instruments 
the syringe is fashioned into a small paddle which is 
rotated into the cell to enhance mixing. The reference 
cell contains buffer or water (ideally a solution of 
similar heat capacity as the interacting components in 
the sample cell) and is usually sealed to avoid evapo¬ 
ration or the entry of any contaminants. The two cells 
are heated to the experimental temperature. Usually 
this is limited by the stability of the components of the 
interaction. The experimental data can be affected by 
unfolded or nonnative biomolecules, although in some 
cases coupled equilibria involving binding and folding 
can be accommodated in the fitting protocol to 
obtain useful data (Ladbury and Williams 2007). The 
power applied to the cell to maintain constant temper¬ 
ature is measured and provides the baseline value at 
thermal equilibrium. The temperature of the two cells 
is continually monitored by a thermocouple device. 
The sample cell and the reference cell are always 
maintained in thermal equilibrium, that is, the temper¬ 
ature difference (if any) between cells remains con¬ 
stant. The injection of one of the components from the 
syringe into the other component of the interaction in 
the calorimeter sample cell will result in a heat change. 
If this is exothermic (i.e., heat is given out), the sample 
cell will require less power input to maintain thermal 
equilibrium with the reference cell. This change in 
power requirement at a given time is measured. The 
data can be presented as a plot of power against time 
(see Fig. 2). Initially the sample cell is at equilibrium 
with the reference cell receiving a steady power input. 
An initial injection of a small volume is used to remove 
any air bubbles from the end of the syringe and reduce 
any diffusion of interactants into the syringe during the 
equilibration period prior to initiating the titration. On 
injecting the first aliquot of the biomolecular compo¬ 
nent from the syringe, heat is given out, and thus, the 
power requirement to the sample cell is reduced to 
maintain thermal equilibrium with the reference. As 
the interaction returns to equilibrium the power has to 
return to its original value. As a result a peak is 


Syringe 
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Fig. 1 Schematic of the isothermal titration calorimeter inter¬ 
action vessel. The reference and sample cells are sited in an 
“adiabatic jacket.” The syringe is inserted into the sample cell. 
After injections of the biomolecular component from the syringe 
are made into the component in the cell the power required to 
heat the two cells is adjusted so as to maintain constant temper¬ 
ature difference (typically ^0°C) between them 


obtained on the plot. Subsequent injections will have 
the same effect. If the concentration regime of the 
interacting components is set up accordingly, a series 
of injections will gradually saturate the binding sites 
available in the calorimeter cell. Once the available 
binding sites are occupied the only heat that is 
observed is that derived from the component from the 
syringe being dissolved/diluted into the solution in the 
cell. This heat is known as the heat of dilution. Since 
this heat does not relate to the binding interaction it has 
to be determined and subtracted from the binding 
isotherm. As a result a separate control experiment is 
necessary to ascertain this heat (Fig. 2 - left panel). 

The change in molar enthalpy of the interaction can 
be determined by integrating power derived from the 
raw data peaks with respect to time (Fig. 2 - right 
panel). Since we know the total concentration of the 
components of the interaction at any given point in the 
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Fig. 2 Typical data output from an ITC experiment. Left 
panel shows raw data from a titration. The titration consists of 
18 injections (after an initial small injection - see text). An 
injection is made every 5 min. The concentration regime of the 
interacting components is set up such that the number of binding 
sites in the calorimeter sample cell is saturated during the exper¬ 
iment. As a result the heat of binding becomes less {blue peaks) 
as the titration progresses and the amount of unbound compo¬ 
nent in the cell decreases. This is shown by the reduction in the 


peak height as the power required over the course of the binding 
reaching equilibrium (and returning to baseline) for each peak 
becomes less. The “heat of dilution” control experiment is 
shown {magenta peaks). In this experiment, the buffer solution 
used in the binding experiment is titrated into the biomolecule 
which was in the sample cell. Right panel shows the binding 
isotherm derived from the raw data shown in the left panel. The 
power is integrated with respect to time to give the heat (change 
in enthalpy, AH) associated with each peak of the raw data in the 
left panel. Fitting this cun’e provides thermodynamic parameter 


titration, we can plot the AH against the molar ratio of 
the interacting biomolecules. For a simple interaction 
with a stoichiometry of one independent binding site, 
this gives a sigmoidal isotherm. This representation of 
the data is somewhat akin to what is seen in an acid- 
base titration and in a similar way the equivalence 
point corresponds to the stoichiometry of the 
interaction. 

Experimental Setup 

Before beginning an ITC experiment there are several 
points that have to be considered. These are itemized 
below. With the development of different ITC instru¬ 
mentation their principles and protocols may vary, but 
some of the issues raised will be generally applicable 
for all ITC methods. 


Optimizing the Detectable Signal/Heat of Interaction 
As stated above, heat is a ubiquitous signal derived 
from equilibrium interactions; however, the AH of 
interaction is temperature-dependent. For a typical 
(rigid body) interaction the AH varies with temperature 
in a linear-dependent manner. The slope of this linear 
relationship is usually negative and is otherwise known 
as the change in constant pressure heat capacity (ACp) 
(see Eq. 7 in section Basic Thermodynamics Associ¬ 
ated with Equilibrium Biomolecular Interactions). 
Eor most biologically relevant systems the line will 
pass through zero within the physiologically relevant 
temperature range. Thus, at some temperatures the 
AH will be low or zero and, thus, undetectable even 
with the highly sensitive microcalorimeters commer¬ 
cially available. The temperature at which AH is zero 
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is system-dependent; thus, the experimental tempera¬ 
ture has to he chosen such that sufficient heat 
is obtained from the binding event. Typically, an 
experiment should be initially tried at 25° C or at a 
physiologically relevant temperature (e.g., 37°C for 
human biomolecular interactions). If no heat is mea¬ 
surable then the temperature should be adjusted by, 
say, 10°C either up or down. Assuming that the inter¬ 
action has a significant ACp, this should provide 
a measurable AH. Caution has to be exercised in 
adjusting the temperature since the interacting com¬ 
ponents can become denatured and/or aggregated. 
Clearly, since AH is temperature-dependent, it is 
absolutely vital that the experimental temperature is 
provided in any reported data. Furthermore, if two 
or more interactions are to be compared (e.g., 
a protein interacting with a range of drug compounds) 
all the experiments should be carried out at the same 
temperature. 

The heat of an interaction can be enhanced by 
changing conditions other than the temperature. 
For example, if the interaction is accompanied by 
a protonation event, the choice of a buffer system 
with a high AH of ionization will enhance the binding 
enthalpy (see Thomson and Ladbury 2004). 

The Titration Isotherm 

Although the ITC experiment can be used solely to 
determine highly accurate AH values for interactions 
alone, the method is more generally applied to the 
determination of the Kb. The binding constant is deter¬ 
mined by fitting the ITC isotherm to obtain the change 
in free ligand concentration with respect to total ligand 
concentration (see section “Data Analysis”). As a basic 
consideration for the experimental setup it is important 
to obtain an isotherm which provides maximum data 
points for the fitting process. In other words the 
shape of the binding isotherm will dictate how accu¬ 
rately the Kb can be determined, if at all. The shape of 
the isotherm is dependent on the Kb and the concen¬ 
tration of the interacting component in the calorimeter 
cell (the total binding site concentration). The product 
of these two terms gives a number called the C-value 
(Wiseman et al. 1989), that is, 

C = n X Kb 

X [biomolecule in calorimeter sample cell] (8) 


where n is the stoichiometry of the interaction. C is 
a unit-less parameter since Kb is measured in and 
the concentration is in terms of M. Analysis of the data 
requires an appropriate C-value. For the best data 
fitting, as a rule of thumb, the C-value should be 
between 10 and 100. The reason for this is demon¬ 
strated in the data simulations shown in Fig. 3. These 
binding isotherms were simulated using the following: 
n = 1, AH = -10 kcal/mol, [ligand] = 200 pM, 
[macromolecule] = 10 pM, and dissociation constants 
ranging from 100 pM to 1 nM resulting in C-values 
ranging from 0.1 to 10,000. 

Under the concentration regimes usually adopted 
for ITC experiments, the data sets with low C-values 
(i.e., <10) are not sigmoidal and tend toward feature¬ 
less straight lines, as demonstrated in the simulated 
data in Fig. 3. Under these conditions the AH changes 
very little from injection to injection as the interaction 
progresses. Thus, the determination of the free versus 
bound ligand concentrations becomes inherently inac¬ 
curate. In the isotherms with the very high C-values 
(i.e., 1,000-10,000) the sigmoidal shape becomes 
more angular. The data correspond to a situation 
where in the early injections all of the ligand is binding 
to form a complex. At a given concentration of the 
component in the syringe the interaction becomes 
saturated and hence the final injections show no net 
heat of binding. This data set also does not facilitate the 
fitting of the free versus bound concentrations as the 
titration comes to completion. 

It should be home in mind that the useful range in 
terms of Kb of modem instruments using the direct 
titration method is approximately between 10^ 
and 10^ (or Kq between 1 mM and 1 nM). As 
a result Eq. 8 would suggest that concentrations as low 
as 10 nM can be used for titrations. Unfortunately, for 
most biomolecular systems concentrations in the nM 
range are not suitable since they do not provide suffi¬ 
cient heat change on binding. The range of binding 
affinities measurable by ITC can be extended by 
adopting competition-based experiments. 

Choice of Buffer Solution 

The robust nature of materials used in modern calo¬ 
rimeters makes it possible to use most commonly 
adopted solvents for biomolecular experiments 
(always check the ITC manufacturer’s recommenda¬ 
tions for the calorimeter cell). 
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Fig. 3 Effect of the “C-value” on the shape of the binding 
isotherm. The binding isotherms shown were simulated using 
a stoichiometry, n = 1, and a AH = —10 kcal/mol. The compo¬ 
nent from the syringe was set at a concentration of 200 pM and 
the component in the sample cell was 10 pM. The dissociation 
constants, Kj, were set ranging from 100 pM to 1 nM resulting in 
“C-values” ranging from 0.1 to 10,000. The simulation lines 
correspond to the following colors: Kj Black 100 pM, Red 
10 pM, Green 1 pM, Blue 100 nm. Cyan 10 nm. Pink 1 nm, 
and the corresponding C values B/acl: 0.1, Red \,Green W,Blue 
100, Cyan 1,000, Pink 10,000 

The heat of dilution of the solution injected into the 
solution in the calorimeter cell should be kept to 
a minimum. The accuracy of the data will be affected 
adversely if a large heat of dilution has to be subtracted 
from a comparatively small heat of binding. To ensure 
that this heat is minimized, the buffers used in the cell 
and syringe must be carefully matched. This is accom¬ 
plished most effectively by dialyzing both the 
interacting components against the same buffer. If one 
of the components is too small for dialysis, then dialyze 
the macromolecule and then dissolve the other compo¬ 
nent in the dialysate. This does not always work; how¬ 
ever, small molecules are usually synthesized and can 
often be accompanied by residues of the synthetic pro¬ 
cess or the buffers used in their separation/purification. 


Buffer solutions containing reactive ingredients 
such as reducing agents can be problematic in that 
these will often give a background heat effect. For 
example, the heat associated with the constant oxida¬ 
tion of a reducing agent will usually result in a drift in 
the raw data baseline. This can be a problem if this is 
severe. As a result, reducing agents with low heats of 
oxidation are recommended. 

One significant problem in many applications of 
ITC is where one or both of the interacting components 
are insoluble in pure aqueous solvent. This can cause 
problems if, for example if a nonaqueous or organic 
buffer is used substantial heats of dilution can arise. In 
most cases when performing titrations on biomolecular 
samples, a compromise has to be reached in terms of 
the buffer solution adopted. For example, in experi¬ 
ments commonly encountered in the pharmaceutical 
industry, many drug compounds need to be dissolved 
in an organic solvent, however, most proteins are 
unlikely to prevail in a nonaqueous buffer system. In 
these situations ideally the organic buffer should be 
diluted as low as possible with the buffer used to 
dialyze the protein. Serial dilutions can be used to 
reduce the organic solvent down to suitably low con¬ 
centration to have limited effect on the heats of dilu¬ 
tion. This incremental dilution of the stock solution 
should help to avoid precipitation of the protein/bio¬ 
molecule. If for example, the ligand is provided as 
a stock solution (e.g., DMSO), dilute the stock solution 
with the dialysate (ideally to below 1%) and then add 
a corresponding amount of stock solution solvent to the 
macromolecule solution. 

Control Experiments 

As mentioned above, the ITC experiment is set up to 
determine the heat of binding as two biomolecules are 
added to one another in a calorimeter cell. However, in 
a typical experiment there may be a number of sources 
of additional heat that have to be accounted for in 
control experiments. Failure to control these can result 
in gross misinterpretation of the data and even false 
observations of binding events. Thus, these additional 
heat effects have to be determined and removed from 
the raw data for the interaction. The injection of 
any solute into a solution in which it was previously 
absent results in a heat of dilution of this component. 
As a result, an independent control experiment is 
required to obtain this heat per injection. The control 
experiment is performed where the solution identical 
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to the solution in the ITC syringe used for the binding 
experiment is added to the buffer solution which is 
used for the biomolecule in the calorimeter cell (usu¬ 
ally the same as that for the injectant). This control 
experiment will also account for any effects resulting 
from mismatch of the buffers used. 

The heat of dilution control experiment can also 
reveal complications in the experiment which might 
not have previously been expected. For example, at the 
elevated concentrations sometimes required for ITC 
experiments, the component in the syringe could be 
in a self-associated state. If the injection of this com¬ 
ponent into the solution in the cell results in dissocia¬ 
tion of this biomolecule from its associated state, 
a gradually declining heat of dilution will be observed. 
If this is observed, since it is concentration-dependent, 
performing the titration at different concentrations will 
confirm molecular dissociation. Significantly different 
heats of dilution will be obtained under the different 
concentration regimes. These data can actually be used 
to determine the equilibrium dissociation constant in 
some cases (e.g., see Cooper 1998). 

A further contribution to the overall heats measured 
in a binding experiment could come from the 
additional effect of the dilution of the solute in the 
calorimeter cell. As a result, a heat of dilution experi¬ 
ment should be performed to control for this. In this 
case, the experiment requires that heats are determined 
as buffer is injected into the calorimeter cell. This 
effect is usually very small. This is because the actual 
dilution of the solute is very small (i.e., the cell volume 
is diluted by the volume of one injection). For exam¬ 
ple, an injection of 10 pi of buffer into a 1.5 ml cell will 
only effect a dilution of less than one hundredth. 

Because heat measurement in modem ITC instru¬ 
mentation is so highly sensitive, a final control experi¬ 
ment known as the machine blank is also suggested. This 
experiment involves adding buffer into buffer and takes 
account of the (usually very small) heat associated with 
machine effects such as pressure changes encountered. 

Data Analysis 

Detailed description of how ITC data output is handled 
to determine the stoichiometry, n, the AH, and the Kb 
has been provided elsewhere (see, e.g., Wiseman et al. 
1989; Blandamer 1998). The data analysis computer 
software from the different ITC manufacturers vary 
somewhat but they can be broadly described by the 
following general overview. 


If the solute from the syringe is assigned as A and 
that in the cell as B, after making an injection, 
a number of moles of substance A has been added to 
a total amount of substance B, [Btot] in the calorimeter 
cell. As a consequence of binding, the concentration of 
free A, [A] and the complex [AB] change. The inter¬ 
action is accompanied by the production of heat. The 
quantity of heat associated with the injection is a direct 
probe of the amount of binding that occurs. In the data 
fitting, the problem is to relate this heat effect to the 
change in [A] and in [AB]. This requires us to assume 
a model for the binding event. 

For the purpose of showing the required fitting, the 
simplest model for an interaction is adopted, that is, 
there is one independent binding site. The heat derived 
from a single injection Q is related to the change in 
enthalpy by the following Eq. 9: 

Q = [Btot] Vo (nz F 2 AR 2 - ni F,ZlHi) (9) 

where Vq is the volume of the cell and F is the fraction 
of B bound. The numerical subscripts refer to the terms 
in going from injection 1 to injection 2. n is the 
stoichiometry of the interaction (we can neglect the 
stoichiometry term since for our model it is unity). For 
the sake of simplicity, we have ignored the change in 
volume in the calorimeter cell resulting from the injec¬ 
tion of an amount of A. 

Since 

F = [AB]/[B.„.] = [AB]/([B] + [AB]) (10) 

the above Eq. 9 provides a relationship between heat 
and the amount of complex between A and B being 
formed at each injection. For our thermodynamic 
parameterization we need to relate F to Kb. 

Kb = [AB]/([A] [B]) (11) 

Therefore, by substituting Eq. 11 into Eq. 10: 

E = Kb [A]/(l-hKB[A]) (12) 

Erom Eq. 11 and Eq. 12 and knowing that 
[AB] = [A,ot] - [A] we get the equation: 

([A,o,] - [A])/[B,o,] = Kb[A]/(1 + Kb[A]) (13) 
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Rearrangement of Eq. 13 gives the quadratic 
equation: 

-Kb[A] 2 + [A](KB[A,ot] - Kb [B,o,] - 1) + [A,ot] = 0 

(14) 

The real root of this quadratic equation then gives 
a value for [A] in terms of the known concentrations 
[A,ot] and [Bto,]. Thus, fitting of a series of injections in 
a titration provides the value of the equilihrium bind¬ 
ing constant. 

In this entry, the potential value of the ITC meth¬ 
odology has been emphasized along with highlighting 
some of the areas in which the operator should exercise 
due care and attention. On the one hand, it is important 
to realize that as ITC instrumentation becomes more 
widespread it is imperative that users become aware of 
some of the issues which can compromise their data. 
On the other hand, it is also important that users learn 
that the range of experiments is large and the method 
provides a valuable approach to thermodynamic mea¬ 
surement which does not necessitate problems such as 
chemical modification, immobilization and potentially 
hazardous (e.g., radioactive) competing ligands which 
are required in other techniques. 

Examples of Applications of Thermodynamic 
Data to Understanding Biological Problems 

Fundamental biological processes mediated by equi¬ 
librium biomolecular interactions are dehned by the 
interdependence of thermodynamic properties on 
structural detail. Thus, determination of the physico¬ 
chemical parameters, and relation of these to the 
structural consequences of changing the system from 
the free to the bound state is the key to comprehending 
how interactions occur. The binding of two biomole¬ 
cules viewed from the atomic level is highly com¬ 
plex. It involves the formation, or removal of many 
individual non-covalent bonds both between the 
interacting molecules as well as with solvent. These 
all impinge on the overall AG of the interaction 
through their contributions to AH and AS. Currently, 
our understanding of how the thermodynamic quanti¬ 
fication of biomolecular interactions relates to atomic 
structural changes on going from the free to the bound 
state is somewhat naive. Below, a selection of studies 
provides examples whereby the measurement of 


thermodynamic parameters has revealed some level 
of further understanding about a biomolecular inter¬ 
action. In none of these examples (or indeed any other 
published work) is a definitive understanding of all 
the events surrounding binding achieved; however, 
the thermodynamic data does shed light on what 
might be significant in defining the mechanism of 
interaction. 

SH2 Domains and the Determination of AG as 
a Measure of Specificity 

Here the thermodynamic data for binding of peptides 
to a structurally well-characterized SH2 domain pro¬ 
vides an example of how to assess a biologically rele¬ 
vant interaction. The evaluation of the affinity 
expressed as a AG measurement provides useful 
input into an understanding of biology. Comparison 
of AG values provides a way of assessing the specihc- 
ity of a biomolecular ligand for a particular binding 
site. One good example of this is in understanding the 
issue of specificity in protein interactions in tyrosine 
kinase-mediated signal transduction (see, e.g., 
Ladbury 2005). Protein interactions emanating from 
receptor tyrosine kinases (RTKs) in eukaryotic cells 
which are involved in signaling pathways are generally 
mediated through well-defined domains. Commonly 
Src homology 2 (SH2) domains form the recognition 
site for interactions with phosphorylated tyrosines 
(pYs) which have been post-translationally modified 
by tyrosine kinases. The dogma that arose is RTK- 
mediated signaling that signals are processed via 
a linear pathway, thus for signal transduction without 
corruption these interactions have to be specific such 
that a signal from one type of receptor does not inter¬ 
fere with one from another. This level of specificity 
would be expected to be high since a given cell can 
express in excess of 100 different proteins containing 
these highly sequence-conserved, approximately 100 
amino acid-containing domains. Thus, the interaction 
of a particular tyrosylphosphoprotein in a given path¬ 
way with another protein with the appropriate SH2 
domain has to be mutually exclusive of any other 
SH2 domain interactions. For specific interactions it 
is estimated, based on potential concentrations of other 
competing ligands, that the affinity of a specific ligand 
should be approximately three orders of magnitude 
tighter than a nonspecific interaction. One way to 
assess the specificity of an interaction is to compare 
the AGobs of a given molecular complex formation 
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Thermodynamics of Biomolecular Interactions, Table 1 Thermodynamic parameters for the interaction of the Src SH2 domain 
with tyrosylphosphopeptides 


Peptide sequence 

Kb(x10'^ M^') 

Kd (pM) 

AG (kJ.mor') 

AH (kJ.mor') 

TAS (kJ.mor') 

pYEEI 

10.8 

0.09 

-40.1 

-38.7 

1.4 

pYEEV 

6.24 

0.16 

-38.8 

-28.6 

10.2 

pYEEE 

4.87 

0.21 

-38.1 

-32.7 

5.4 

pYEEW 

3.18 

0.31 

-37.1 

-32.2 

4.9 

pYEED 

2.64 

0.38 

-36.6 

-27.5 

9.1 

pYQPG 

0.07 

14.3 

-27.7 

-25.7 

2.0 

Adapted from (Ladbury 2005) 
EPQpYEEXPIYL specific peptide, where X 

is substituted as 

shown except for pY proximal sequences 

VPM and QPG. 


All experiments at 25° C and pH 6.0 using ITC 


with that expected for a nonspecific interaction. SH2 
domains were generally shown to recognize amino 
acid sequences proximal and C-terminal to the pY 
residue. The SH2 domain from the protein Src prefer¬ 
entially recognizes the sequence glutamate, glutamate, 
isoleucine, or EEI (using the single letter code for 
amino acids). Thus, exposing the Src SH2 domain to 
tyrosylphosphopeptides which contain the recognition 
sequences for another SH2 domain should provide 
a signihcant difference in the AGobs- Table 1 shows 
the thermodynamic parameters for the binding of 
the SH2 domains from Src to a series of 
tyrosylphosphopeptides. 

Changing the residue in the pY -i- 3 position to 
different size and polarity leads to approximately 
4 kJ.mol”' difference in AGobs (or only about an 
order of magnitude difference in affinity, Kb). In addi¬ 
tion, the “specific” sequence binds with an affinity only 
approximately 60-fold tighter than a sequence which is 
purported to be specific for a different SH2 domain 
(pYVPM). Eurthermore, a sequence which has no sim¬ 
ilarity to the “specihc” sequence whatsoever (pYQPG) 
binds only two orders of magnitude weaker than the 
pYEEI sequence. 

The thermodynamic data derived from ITC which 
demonstrates the lack of specificity in peptide ligands 
for an SH2 domain, also reveals a clear advantage 
of ITC measurement. On perusal of the binding data, 
it is apparent that despite similar AG values, the under¬ 
lying changes in enthalpy and entropy are quite differ¬ 
ent for each interaction. Although these terms 
compensate to give similar affinities for the peptides, 
the differences in AHobs and TASobs between com¬ 
pounds are striking. These terms represent a secondary 
level of information which is clearly important in 
dehning an interaction. 


The Contribution of Solvent Ions to 
Thermodynamic Parameters 

As mentioned above, the components of the solvent 
can have a signihcant effect on the measured thermo¬ 
dynamic parameters of two biomolecules going from 
the free to the bound state. In this example, the mea¬ 
surement of the Ke.obs provides feedback on the inclu¬ 
sion or removal of ions from the binding interface in 
a protein-DNA interaction. 

The interactions of components of the solvent are of 
fundamental importance in dictating the outcome of 
biomolecular complex formation. In some cases, the 
positions of molecules or atoms derived from the sol¬ 
vent can be observed in high-resolution structures. 
However, their quantitative contribution to the ener¬ 
getics of binding clearly cannot be established merely 
from a pictorial representation of the contact interface. 
ITC provides a method to determine the overall ener¬ 
getics of an interaction; however, the parsing of this 
into the independent effects of the biomolecules 
forming the complex and the binding/release of vari¬ 
ous components of the solvent is more of a challenge. 
The interaction of anions and cations from the solvent 
can have a dramatic effect on the formation of com¬ 
plexes, particularly in cases where charged chemical 
groups play a role in dictating the specihcity of the 
interacting biomolecules. 

By using ITC to explore the relationship between 
the affinity and the salt concentration, it is possible 
to isolate the contribution of anion and cation 
rearrangement to binding. The underlying idea is that 
if, for example, the solvent ions have a lower energy 
in the presence of a complex than in the presence of 
the separate interacting biomolecules, then increasing 
the salt concentration will favor formation of the 
complex. 
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The correlation between the affinity of an interac¬ 
tion and the concentration of salt has been established 
in the case of protein-DNA interactions by Record and 
colleagues (Ha et al. 1992). A modified version of the 
derived empirical relationship has been adopted 
(Eq. 15; O’Brien et al. 1998): 

logKB = logKB,ref - X\og[Salt] + 0.016T[5a/f] (15) 


where the values of X and Y are the number of ions and 
the number of water molecules respectively released to 
bulk solvent (negative number) or bound to the com¬ 
plex (positive number). The effect of ion release is 
manifested in a negative slope and ion binding as 
a positive slope in a plot of logA'e.obs against log 
[salt]. In most systems investigated the value of X is 
negative, highlighting the fact that the formation of the 
protein-DNA complex interface requires the removal 
of ions that were previously bound to the interacting 
biomolecules. 

Investigation of the interaction of the TATA bind¬ 
ing protein (TBP) from Pyrococcus woesei (Pw) with 
a cognate sequence of DNA using ITC revealed the 
effects of high ionic strength on complex formation. 
This halophilic and thermophilic archaeal organism 
prevails in an environment where the salt concentra¬ 
tion is in excess of 0.5 M and the temperature above 
100°C. In mesophilic organisms (e.g., yeast and 
humans) intracellular salt concentrations vary signifi¬ 
cantly from approximately 200 mM in human tissue to 
in excess of 2 M in some halophilic archaea. Thus, 
these different salt environments require modification 
of the way in which biomolecules recognize and inter¬ 
act with each other, particularly in interactions of bio¬ 
molecules in which charge plays an important part 
such as with polyelectrolytic DNA. Titration experi¬ 
ments were performed in increasing concentrations of 
NaCl (0.1-1.9 mM: Bergqvist et al. 2001). The slope of 
the plot of logT^^B.obs against log[salt] was found to be 
positive, that is, the binding becomes tighter in increas¬ 
ing concentrations of salt (see Fig. 4). 

In the case of the PwTBP, fitting the data to Eq. 15 
revealed that approximately two ions were bound to 
the complex (c.f. the free protein and DNA). In the case 
of the PwTBP, the tighter binding at higher salt con¬ 
centration implies that ions interact more favorably 
with the protein-DNA complex and hence at higher 
concentrations the binding is enhanced. 
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log[NaCI] 


Thermodynamics of Biomolecular Interactions, Fig. 4 Plot 
of log A^obs against log [NaCl] for wild type (WT) and mutant 
forms of Pm’TBP. The data were fit to Eq. 15 using a nonlinear 
least squares algorithm where the net change in ions (in this case 
assumed to be cations). A, the net change in water molecules in 
complex formation, B, and A^ref were fit as independent vari¬ 
ables. In the presence of the EDA mutation, the slope of the 
cun’e {Red) becomes less positive indicating that cation(s) have 
been released from the protein-DNA interface. This is also 
observed in the presence of the El28A mutation (Orange) and 
to a greater extent with the double mutation E12AE128A (Blue) 


Support for this interpretation can be derived from 
mutational studies in which negatively charged resi¬ 
dues, hypothesized to enhance the interaction with 
cations, were identified on the protein from X-ray 
crystallographic data and mutated to neutral amino 
acids. The substitution of a glutamate residue (El2) 
of the protein for an alanine (El2A) reduced the slope 
of the logXB,obs against log[salt] plot (Fig. 4). An 
additional mutation, El28A, reduced the positive 
slope of the logXB,obs against log[salt] plot further to 
the point whereby at low salt concentrations (<0.5 M) 
the line is almost horizontal (see Fig. 4), indicating that 
this double mutant protein (E12AE128A) does not 
sequester cations to the complex (extended) interface. 
Thus, the relationship between the affinity and the 
salt concentration provides a direct route to quantify¬ 
ing the sequestration of ions in complex formation, 
a phenomenon which is clearly related to the changes 
in electronegativity upon complexation. 

Thermodynamic reconciliation of this effect is pro¬ 
vided by the AH and AS data measured by ITC. As the 
salt concentration increases, the AH for the wild type 
and the E12A mutant (which does not sequester 
a cation on binding) are similar; however, the TAS 
component of the AG for the mutant is significantly 
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more favorable across the salt concentration range 
(until very high salt concentrations). This is because 
the wild type protein interaction incurs an entropic 
penalty resulting from the pinning down of an extra 
cation and thus restricting the degrees of freedom of 
the system. In the mutant interaction, the cation is 
liberated into bulk solvent and hence has a more favor¬ 
able AS (Bergqvist et al. 2001). 

Interfacial Water Molecules in Protein-Peptide 
Complexes 

Any water molecule observed in a biomolecular com¬ 
plex interface makes a favorable contribution to the 
free energy compared to a void in the same position; 
what is important in understanding the role of water in 
biomolecular interactions is establishing whether sub¬ 
stitution of the water by a ligand, or part thereof, will 
make a more favorable contribution. One approach to 
this is to identify and characterize binding sites where 
water improves the overall binding of an interaction. 
By their very complexity, the accurate measurement of 
the effects of inclusion of water in protein-ligand 
interactions is extremely difficult. The only way this 
can be attempted is by the combination of high- 
resolution structural detail and accurate thermody¬ 
namic data. To define fully any binding event, the 
interacting molecules have to be characterized in 
their free and bound states. Clearly the incorporation 
of water adds another dimension to the definition of 
these states. This requires knowledge of the change in 
hydration states of the protein and ligand on formation 
of an interface. The enthalpic and entropic contribu¬ 
tions of water molecules interacting with the free mol¬ 
ecules have to be assessed to ascertain if they are 
altered on forming the complex. In the examples 
given below, and generally in ligand design, these are 
not easily quantihed and as such will impinge on any 
conclusions drawn. 

The oligopeptide-binding protein, OppA, seques¬ 
ters peptides of two to hve amino acids residues in 
the periplasm of bacteria. These peptides can be made 
up of sequences of any of the naturally occurring 
amino acids. This promiscuous binding is facilitated 
by the protein binding directly to the peptide main 
chain but accommodating the peptide residue side 
chains in binding cavities through interactions largely 
mediated by water (Tame et al. 1996). The thermody¬ 
namic effect (at 25° C) of the inclusion of water mole¬ 
cules into the binding site was explored by taking 



Thermodynamics of Biomolecular Interactions, 
Fig. 5 Superposition of four structures of the central cavity of 
OppA with different side chains displaying water molecule posi¬ 
tions. The four structures contain the following residues in the 
central position of the tripeptide: Yellow alanine. Red trypto¬ 
phan, Green glutamic acid. Blue lysine. The independent struc¬ 
tures of the OppA cavity are color codes corresponding to the 
above. Water molecules are shown as spheres color coded 
according to their respective complex. The clusters of water 
molecules in approximately iso-energetic positions are circled 
{Orange) 


a series of tripeptides of the form LysXxxLys (where 
Xxx is any of the naturally occurring amino acids). The 
central residue of this peptide was shown to bind in 
a buried cavity isolated from bulk solvent. For this 
work this approximates to a “thermodynamically 
closed system.” Insertion of different residues into 
the cavity changed the water content but had 
a negligible effect on the cavity itself and makes no 
direct non-covalent bonding contacts with cavity resi¬ 
dues. All the interactions between protein and peptide 
are mediated by water (Fig. 5). 

This protein cavity, therefore, demonstrates the 
remarkable level of adaptability furnished by the inclu¬ 
sion of water molecules into the binding site since the 
binding constants for the LysXxxLys peptides are 
spread over less than two orders of magnitude. The 
inclusion of water molecules into the binding interface 
could be shown to improve the binding of the peptide 
(Table 2). 

Substitution of a tryptophan residue by an alanine in 
the central residue binding cavity is accompanied by 
the inclusion of three additional water molecules, 
which hll the void left by the removal of the bulky 
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Thermodynamics of Biomolecular Interactions, Table 2 Thermodynamic data for the binding of LysXxxLys tripeptides to 
OppA at 25°C and pH 7.0 


Peptide XXX 

Water“ 

Kb ((lO^lM-') 

AH (kJ.mol ') 

AG° (kJ.moF*) 

AS“ (J.moF'.K-') 

KAK 

7 

17.8 ± 3.0 

20.1 ± 0.2 

-41.4 

206.3 

KWK 

4 

9.0 ± 2.5 

21.3 ± 0.6 

-39.7 

204.6 

KEK 

7 

6.5 ± 1.0 

11.8 ± 0.1 

-38.9 

170.3 

KKK 

6 

0.5 ± 0.1 

27.3 ± 0.6 

-32.3 

200.4 


Adapted from Tame et al. (1996) 

“Water observed in the pocket which accommodates the central residue side chain 


hydrophobic group, and an increase in affinity 
(Kd LysTrpLys 0.11 pM, Kq LysAlaLys 0.06 pM). 
The thermodynamic consequences of this are a small, 
unfavorable entropy change (AAS°Trp^Aia = 1-7 

J.moF'.K”'; TAAS°Trp-Aia = 0.5 kJ.moF*) and 
a more favorable enthalpy change (AAH° xip^Aia = 
— 1.2 kJ.moF*). The same overall effect is observed for 
substitution of the charged residues (Lys ^ Glu) as 
shown in Table 2. In this case, the affinity is increased 
by an order of magnitude resulting from a significant 
decrease in AH° more than compensating for the 
unfavorable entropic effect of including one additional 
water molecule (AAH°Lys^Giu = —15.5 kJ.moF'; 
TAAS°Lys^Giu = —8.9 kJ.moF*). The water appears to 
cushion the effects of inclusion of the oppositely charged 
groups which have no significant effect on the cavity 
structure. 

As emphasized above, the conclusions drawn from 
these types of studies have to be weighed carefully. 
The measured thermodynamic parameters for binding 
can only be attributed to the addition of water mole¬ 
cules to the closed system if the initial state of all the 
molecules is equivalent in all cases which, although 
a reasonable assumption for the protein, may not be for 
the peptides. For example, the initial entropy of the 
different central residues of the tripeptides will differ, 
though these can be derived from theoretical 
principles. Furthermore, in the case of the Trp ^ Ala 
substitution, the free LysTrpLys is likely to have 
a significantly greater number of water molecules 
interacting with it than the LysAlaLys peptide. These 
water molecules will be liberated on binding producing 
a more favorable entropy. It is currently impossible to 
deconvolute all these contributions from measured 
data. 

The high-resolution structural detail (<1.4 A) of the 
water molecules that mediate the OppA-peptide inter¬ 
actions shows that although the amino acid side chains 
of the peptide are accommodated by changing the 


water content of the binding site, the water molecules 
that are retained occupy the same positions from one 
ligand to another (circled in Fig. 5). Although the 
effect of water in a binding interface is to change the 
specificity of ligand binding, this seems to be done in 
such a way that the water acts to extend the protein 
structure. Rather than moving around to change com¬ 
plementarity, water molecules conserved from one 
ligand to another in the binding site appear to adopt 
definite positions. This infers that the water is able to 
make hydrogen bonding arrangements within the bind¬ 
ing interface that are favorable and not greatly affected 
by the insertion of a ligand. It appears that the binding 
surface of OppA provides well-defined, highly favor¬ 
able sites for water such that it will become incorpo¬ 
rated into the interface and that the side chains of the 
interacting ligand have to displace these where 
appropriate. 

Biomolecular interactions can now be routinely 
characterized on an atomic level. The use of high- 
resolution structural information from X-ray crystallo¬ 
graphic and NMR spectroscopic studies permits an 
understanding of the changes in the positioning of 
atoms as a complex is formed. Furthermore knowing 
where individual atoms are likely to be in three- 
dimensional space allows an educated estimation of 
the positioning of short-range non-covalent bonds that 
form between the interacting partners. These structural 
details can be combined with the measurement of 
thermodynamic parameters of biomolecular complex 
formation derived from ITC. 

Calorimetry in Drug Development 

There are two essential steps at the beginning of the 
process of bringing a drug compound to market; 
the identification of a lead compound and the optimi¬ 
zation of this lead compound to hone the required 
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characteristics, including its affinity, for its specific 
target. The modem pharmaceutical industry is involved 
in a continuous search for ways of streamlining these 
two steps to increase yields, decrease time, and reduce 
costs. This has meant that there has been a push to 
develop the capabilities of instrumentation to assist in 
these processes. The recent revolution in the develop¬ 
ment of calorimetric instmmentation, and its applica¬ 
tions, has led to calorimetry now taking a place in the 
front line in the battle to develop pharmaceutical 
products. 

Calorimetry in Compound Screening 

The modern process of screening for hits and ulti¬ 
mately candidate lead compounds involves rapid iden¬ 
tification from among a library of molecules one or 
more that bind to a target molecule. In order to do this 
efficiently, a sensitive signal which indicates the pres¬ 
ence of an interaction is required. Currently popular 
methods include monitoring an interaction using fluo¬ 
rescent or radioactive labeling. An alternative 
approach which has begun to be used involves using 
the heat of interaction being used as the probe for 
a compound binding to the target molecule. With the 
development of high-sensitivity calorimeters, these 
instruments are being adopted as a tool in drug screen¬ 
ing. Currently it seems unlikely that the use of calori¬ 
metric methods will be adopted for frontline, primary 
screening, not least because the sensitivity is not as 
high as, for example spectroscopic methods. However, 
use of calorimetry for secondary or tertiary screening 
of smaller, more focused compound pools is already 
being adopted. 

The ITC instrument provides a highly sensitive 
probe of heat of interaction at a given temperature. 
Injection of an interacting drug into the target molecule 
(which is accommodated in the calorimeter cell) will 
result in an endothermic, or exothermic, enthalpic 
event which will be observed as a heat pulse. Thus, if 
a pool of molecules is injected, the presence of any 
interacting molecules, or “hits,” can be rapidly 
observed by the change in enthalpy. As with other 
screening procedures, by using different matrices of 
molecules from compound libraries it is possible to 
rapidly identify the hits. This method of identifying 
binding compounds is superior to many other tech¬ 
niques since it precludes the necessity for chemical 
modification of the target of drug compound 
(e.g., attaching fluorescent tags, immobilizing one 


binding component or radioactive labeling). Further¬ 
more, since every biomolecular interaction is accom¬ 
panied by a change in enthalpy, it should be 
a ubiquitous probe. Current limitations to the wide¬ 
spread adoption of this methodology are based on the 
perceived material requirements and the lack of instru¬ 
mentation to allow high throughput screening of mol¬ 
ecules. These problems are being addressed currently 
by instrument manufacturers. With regard to the 
amounts of material required, the screening process 
only requires detection of interaction rather than the 
determination of the absolute affinities (or full thermo¬ 
dynamic characterization). Thus, the complete iso¬ 
therm discussed in the ITC instrumentation section is 
not required. In fact, it is only necessary to observe 
a single heat pulse to identify a binding event. Since 
heat can be measured to such high sensitivity, typically 
concentrations down to the picomolar range are 
sufficient to detect binding. 

Correlation of Thermodynamic Data and 
Biomolecular Structural Detail 

The ITC method enables the rapid determination of 
a complete thermodynamic profile of an interaction in 
one experiment. With the ready availability of this 
information, one might have expected that these data 
would have made an impact not only in understanding 
biological systems but in the area of structure-based 
drug development. However, this has yet to be 
realized. 

One cornerstone of biophysical scientific endeavor 
is to be able to correlate the structural perturbation 
which accompanies biomolecular interactions (i.e., 
the making and breaking of non-covalent bonds 
between the interacting macromolecules as well as 
with the solvent, and the accompanying structural con¬ 
formational rearrangements) with the thermodynamic 
quantity associated with these changes. From this cor¬ 
relation the panacea of being able to predict the ther¬ 
modynamic character of an interaction, especially the 
free energy (or affinity), from some knowledge of the 
structure of the binding site and the potential ligand 
was very attractive to the pharmaceutical industry. For 
example, this would enable calculation of thermody¬ 
namic properties of an interaction of a range of small 
molecules with a target molecule if the high-resolution 
structure of these was known, thus avoiding the tem¬ 
porally and financially expensive complete synthesis 
and analysis of these compounds. 
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Unfortunately, this panacea has remained elusive. 
This is due to the fact that although structural pertur¬ 
bation can be observed on an atomic level using X-ray 
crystallographic and NMR spectroscopic structure 
determination methods, thermodynamic measurement 
using ITC reports on the global change of the system 
which includes intractable quantities such as changes 
in solvation. That is to say that, if the formation of 
a biomolecular complex involved only the formation 
of a defined number of hydrogen bonds between atoms 
observable in the structure, then the thermodynamic 
parameters measured for the interaction could be 
attributed to this and correlations easily made. How¬ 
ever, when a ligand binds a number of events are 
possible: (1) non-covalent bonds form between the 
ligand and the biomolecule, (2) non-covalent bonds 
break between the solvent and the biomolecular bind¬ 
ing site which becomes buried by the interacting 
ligand, and (3) similar types of bonds break between 
the solvent and the ligand. In addition there are 
other effects such as conformational change, proton- 
atlon, ion binding which could also contribute to the 
change in thermodynamic parameters for the interac¬ 
tion. As a result there are too many undefined events 
occurring in a single binding event which cannot 
be revealed by the high-resolution structure, but 
which are included in the measured thermodynamic 
parameters. 

The most refined attempts to correlate structure and 
thermodynamic parameters have been based on the 
perceived relationship between the burial of surface 
area and the change in constant pressure heat capacity, 
ACp. The original observations were made on protein 
folding/unfolding equilibria and heats of desolvation 
of small hydrocarbon compounds where it was pre¬ 
sumed that the ACp effect was due to the release of 
organized water molecules on primarily hydrophobic 
surface. This correlation with hydrophobic surface has 
progressively been fine-tuned to include hydrophilic 
surface and increasingly detailed definitions of surface. 
Although there is some value in this approach, there 
are many examples of lack of correlation and indeed 
questions over the whole tenet of the approach. 
This lack of ability to directly correlate structure and 
thermodynamic parameters and hence make useful 
predictions about the binding characteristics of lead 
compounds has in many cases led to pharmaceutical 
scientists ignoring the potential value of these 
parameters. 


Calorimetry in Structure-Based Drug Design 

Structure-based drug design is a general term applied 
to the process whereby a high-resolution structure of 
the target protein (or DNA) is used to provide infor¬ 
mation on how a potential ligand might interact with 
a surface. In its simplest form, this can be done by 
taking a compound and visually inspecting ways in 
which it will fit into a preordained binding site on the 
target molecule. By assessing which interactions in the 
binding interface are likely to be favorable and which 
are unfavorable, suggestions for compound improve¬ 
ment can be rationally made. Quantification of these 
favorable and unfavorable interactions requires knowl¬ 
edge of the thermodynamics of the interaction. The 
rational design process has become increasingly 
sophisticated over the last 10 years, such that now 
complex docking algorithms and molecular dynamics 
simulations can be used to predict binding energies for 
positioning drug compounds in target binding site. In 
an ideal scenario, these data are fed to medicinal 
chemists who can use them to suggest ways to modify 
the compounds to optimize the interactions in the 
interface. The capability to provide useful information 
is seriously hampered by a lack of experimental ther¬ 
modynamic data with which to verify these sophisti¬ 
cated programs and to enable checking of the effects of 
molecular variation in the drug discovery process. 
Furthermore, this form of drug discovery is based 
only on the prediction of the change in free energy 
(AG) of an interaction. This is generally assimilated by 
the addition of all the non-covalent bond terms in the 
interaction derived from the theoretical enthalpy 
changes (AH) based on bond distance. The entropic 
terms, including those derived from the effect of sol¬ 
vent interactions, are usually ignored or at best given 
a generalized, theoretical quantity in such programs. 

The major added value in adopting ITC is in the 
ability to utilize the high-quality thermodynamic data. 
Although ITC provides the ability to quantify the 
affinity of a given compound through the AG term, 
since most hits from screening have affinities within 
one or two orders of magnitude of one another (which 
corresponds to only ^6 kJ.moU* of AG), this infor¬ 
mation may have limited value in the decision-making 
process. Thus, we suggest that added value from ITC 
data in the drug development process can be derived 
from measurement of the AH term. 

AH represents a direct readout of the heat associated 
with the making and/or breaking of non-covalent 
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bonds on complex formation. Thus in its simplest 
form, if the binding of a ligand is accompanied by 
a favorable AH then more, or stronger bonds exist in 
the bound state than in the free state. This is important 
because in the rational development of ligands toward 
drugs it is extremely difficult to engineer novel 
non-covalent bonds. For example, there are very few 
reported cases where a compound has been modified 
successfully to create a novel hydrogen bond which 
contributes in a predictable way to the affinity with the 
protein target. The formation of hydrogen bonds is 
dictated by atomic distances and orientation of donor 
or acceptor atoms within the binding site. To modify 
a small molecule to satisfy these exquisitely sensitive 
atomic requirements is difficult. Thus, for the most 
part, formation of novel hydrogen bonds in structure- 
based compound development occurs serendipitously. 
It is generally considered easier to increase the entro- 
pic component to the AG. This can generally be done 
by increasing the area of surface on the compound 
which can be buried on forming the complex with the 
target. Removal of hydrophobic surface area increases 
the entropy term in line with release of interacting 
water or other solvent atoms or molecules. 

Taken in isolation, the measurement of AH is of 
limited use since the net change in bonds covers not 
only those formed/broken in making the complex, but 
also those derived from solvent. As previously stated 
this is a major complicating factor in thermodynamic/ 
structure correlation (see section “Correlation of 
Thermodynamic Data and Biomolecular Structural 
Detail”). The value of the AH term is rather found 
when comparing similar molecules which bind to the 
same target binding site (i.e., measuring AAHob,s)- This 
is because if the compounds are of a similar class (i.e., 
based on a similar template), the effects of removal of 
solvent from the burial of surface area will largely 
cancel out. Thus if a series of “hits” from a screen 
have been identified, one criteria for a judgment on 
which ones are suitable for further optimization would 
be based on those with the comparatively most favor¬ 
able enthalpic contribution to binding. The adoption of 
the AH term as part of the decision-making process in 
selection of leads can be considered similar to the use 
of “ligand efficiency” which is used by medicinal 
chemists to rate a molecule. Ligand efficiency is 
based on the number of bonding groups/molecular 
weight of the compound. Thus, “enthalpic efficiency” 
(EE) might be a better yardstick by which medicinal 


chemist can compare a molecule’s worth since it pro¬ 
vides a quantification of the ligand’s net bond forming 
capability. The enthalpic efficiency, EE (e.g., 
EE = AH/Q, where Q is non-hydrogen atoms or molec¬ 
ular weight), provides a measure which relates the 
changes in non-covalent bonding with the size of the 
molecule. Perhaps a more relevant evaluation would 
be derived from “the specific EE” (i.e., AH/Npoi, where 
Npoi is the number of polar atoms). This reports directly 
on the actual contributions of hydrogen bond donors 
and acceptors to the binding enthalpy and, therefore, 
reflects on the strength of those bonds. Comparison of 
the EEs of a series of compounds with similar structural 
templates can have a big impact on the decision-making 
process. Clearly it would be foolhardy to adopt these 
data at the expense of other input from tried and tested 
criteria, however, the EE should add to the tools appli¬ 
cable to the decision-making process. 

One potential further attraction of the use of the 
AHobs in aiding compound selection is that it is possi¬ 
ble to derive the data from a single (or ideally small 
number) of injections in the ITC experiment. The AH 
of the interaction can be determined from the binding 
of a known concentration of ligand to the target mol¬ 
ecule. Thus, if the concentration regime is set up such 
that in the first injection all of the ligand binds, then the 
heat from this injection corresponds to the change in 
enthalpy per mole of ligand. This potentially reduces 
the amount of material required particularly in the case 
of higher affinity measurements. 


Cross-References 

► Differential Scanning Calorimetry (DSC), Pressure 
Perturbation Calorimetry (PPC), and Isothermal 
Titration Calorimetry (ITC) of Lipid Bilayers 

► NMR in Drug Discovery - Introduction 

► Thermodynamics and Thermodynamic Nonideality 

► Thermodynamics of Lipid Interactions 
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Introduction 

Lipids and membranes are fundamentally different 
from the other main biological macromolecules and 
macromolecular assemblies in the sense that they do 
not form polymers. Lipids self-organize via the hydro- 
phobic effect into subtle and hierarchical macromolec¬ 
ular aggregates like the lipid bilayer component of cell 
membranes by virtue of physical and thermodynamic, 
often entropy-dominated, driving forces rather than the 
covalent bonding that keeps amino acids together in 
polypeptides (proteins), sugars in polysaccharides 
(carbohydrates), and nucleotides in nucleic acids 
(polynucleotides). Lipids do normally not form poly¬ 
mers among themselves, and there is no such thing as 
polylipids in biological systems (Mouritsen 2005). 

Lipid membranes are fluid under physiological con¬ 
ditions, and the relevant physical and thermodynamic 
states of the lipids are therefore characterized by terms 
like bottom-up, self-organized, self-assembled, versa¬ 
tile, diverse, plastic, adaptable, flexible, complex, 
fluid, length-scale tunable, durable, self-repairing, 
and self-healing. All these messy properties make it 
very difficult to interpret thermodynamic and 
thermomechanical properties of lipid membranes in 
terms of molecular parameters and interactions. 

Lipids are not directly encoded in the genome, and it is 
not written in the genes what sort of stmctures and 
functions lipids exert in the living cells. There is nowhere 
in the genome to be read that the lipid bilayer component 
of cell membranes is about 5 nm thick and that it is 
composed of a bimolecular sheet. Similarly, the genome 
does not predict the range over which the membrane is 
laterally organized in terms of domains and rafts 
(Lingwood and Simons 2010; Bagatolli et al. 2010) or 
how membranes curve in space and time (Lipowsky and 
Sackmann 1995). Properties like membrane structure, 
lateral organization, curvature, etc. are emergent prop¬ 
erties that are consequences of the collective many- 
particle nature of the self-assembled membrane. These 
properties are not simply related to the chemical struc¬ 
ture of the individual lipids but are rather controlled by 
the laws of physics and in particular thermodynamics 
and thermomechanics. Since lipids under biological 
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conditions in water organize in structures that have 
subtle elements of both order and disorder and where 
in fact the functional state often is found as a delicate 
balance between order and disorder, a quantitative study 
of lipid properties is not easy and requires hard-core 
physical and physicochemical techniques and method¬ 
ologies, experimentally as well as theoretically. 

► Lipid asymmetry and lipid diversity (implying 
that any given membrane may have different lipids in 
the two monolayer leaflets and thousands of different 
types of lipids) and apparent complexity may cover 
some generic principles still to be discovered and 
described, such as homeostasis of some kind, e.g., 
regulation of lateral pressure profile, control of intrin¬ 
sic curvature stress, organization in terms of differen¬ 
tiated domains, etc. It is conceivable that the right 
questions have not been asked yet and that the lipid 
complexity of membranes is only apparent and is in 
fact ruled by some collective principle. 

Thermodynamics of Lipid Bilayer Formation 

Due to their amphiphilic character, lipids self- 
assemble in water and form various polymorphous 
aggregates with different topologies of which the 
lamellar lipid bilayer is the most important one in 
a biological context (Marsh 1990; Mouritsen 2005; 
Heimburg 2007; Nag 2008). Many other topologies 
are possible, depending on the lipid species in question 
and the prevailing thermodynamic conditions, and the 
phase space of lipids encompasses both lamellar and 
curved structures. Moreover, membrane topologies of 
high genus number may arise. 

Lipid molecules in a lipid bilayer interact by a range 
of different physical molecular forces, such as van der 
Waals forces, electrostatic forces, and hydrogen bond¬ 
ing (Ninham and Lo Nostro 2010). On a mesoscopic 
and macroscopic level, these forces conspire to form 
thermodynamic and colloidal forces, such as hydro- 
phobic, hydrophilic, steric, and entropic forces. 

Thermodynamics of Lipid and Membrane 
Curvature 

The resulting packing characteristics of a lipid bilayer 
can often be characterized qualitatively in terms of 
effective lipid molecular shapes described by a 


molecular packing parameter (Israelachvili 1992), 
whose deviation from unity reflects a conical shape and 
hence propensity for forming curved and nonlamellar 
stmctures (Lipowsky and Sackmann 1995). Knowledge 
of the shape of lipid molecules can facilitate a prediction 
of what types of aggregates they will form in water. 
Similarly, knowledge of the shape of a molecule and its 
propensity for forming curved structures can lead to 
a prediction of the extent to which the addition of this 
type of molecule to a preexisting lipid bilayer membrane 
will stabilize or destabilize the membrane. 

Membrane curvature is an emergent and collective, 
physicochemical property of a lipid assembly consisting 
of a large number of molecules. Curvature is a key 
concept in biology. Typically, curvature is a dynamic 
property that fluctuates due to entropic forces. Mem¬ 
brane curvature can vary in time due to transient inter¬ 
actions between the membrane and other cellular 
components, such as proteins, solutes, as well as vesi¬ 
cles and other membranes. In fact, the entire cell, its 
membranes, organelles, and transport systems are sub¬ 
ject to stabilizing and destabilizing forces that couple to 
curvature. A prominent example is the phenomenon of 
intracellular trafficking and the dynamical morphogen¬ 
esis and maintenance of the Golgi apparatus. 

Self-assembled lipid bilayers are normally in a ten¬ 
sionless state although in small vesicles a substantial 
tension may be present. A tensionless state does not, 
however, imply that there are no forces and stresses 
acting in the mechanically stable equilibrium state of 
the bilayer. A lipid bilayer has a transverse structure 
which is far from that of an isotropic fluid slab of 
hydrocarbons. It displays a distinct lateral stress- or 
pressure profile with huge built-in stresses that can 
couple to protein function. 

It has not yet proved feasible to measure the lateral 
pressure profile in lipid bilayers and membranes, the 
main problem being that it is very difficult to design 
a pressure-sensitive probe that can gauge pressure dif¬ 
ferences over 5 nm without perturbing the profile. 
However, the profile can be calculated quite accurately 
from ► molecular dynamics simulations. 

Thermodynamics of Lipid Phase Transitions 
and Phase Equilibria 

The most dramatic manifestation in the thermodynam¬ 
ics of lipid-lipid interactions is the occurrence of 
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cooperativity and lipid phase transitions (Mouritsen 
2005). In the planar, lamellar phase, lipids display 
a number of phase transitions of which the so-called 
main phase transition is the most important one in 
relation to the lateral, molecular organization of the 
bilayer, although in some lipid bilayers other transi¬ 
tions separate different solid phases, in particular 
the so-called rippled phase. The main phase transi¬ 
tion takes the bilayer from a low-temperature solid- 
ordered phase to a high-temperature liquid-disordered 
phase. The main phase transition is a first-order tran¬ 
sition, but for certain lipids, the transition is close to 
a critical point characterized by critical fluctuations 
which implies that the transition is associated with 
strong precursor fluctuation effects that manifest 
themselves in anomalies in various thermodynamic 
response functions, such as specific heat, mechanical 
moduli (bending modulus, area-compression- 
expansion modulus), sound velocity, etc. The fluctu¬ 
ation-induced softening of the bilayer observed in the 
transition region correlates with anomalies in, e.g., 
passive permeability and partitioning of solutes such 
as alkanols. The thermodynamics of the lipid phase 
transition and the associated phase equilibria can 
be assessed quantitatively by means of differential 
scanning ► calorimetry, which provides transition 
temperature, enthalpy, entropy, as well as phase dia¬ 
grams for mixtures (LIPIDAT; Marsh 1990; Caffrey 
1993; Cevc 1993) and how these properties vary with 
pH, ionic strength, and other thermodynamic condi¬ 
tions such as pressure. The main transition is strongly 
dominated by entropy due to the acyl-chain melting 
process that underlies this transition (Mouritsen 2005; 
Heimburg 2007). 

Thermomechanics of lipid bilayers are usually 
studied by micromechanics techniques, such as 
vesicle-fluctuation analysis and ► micropipette 
aspiration, leading to the membrane bending mod¬ 
ulus and area-compression modulus. The thermo¬ 
dynamics of solutes and drugs interacting with lipid 
membranes can conveniently be measured by iso¬ 
thermal titration ► calorimetry, leading to data for 
the enthalpy and entropy of interaction as well as 
the membrane partitioning coefficient. In some 
cases, it is possible to interpret the calorimetrically 
obtained data in terms of effective molecular inter¬ 
action parameters by using appropriate models 
(Heimburg 2007). 


Thermodynamics of Lateral Membrane 
Organization 

The main phase transition and the underlying collective 
phenomena are the source of lateral organization and 
density fluctuations of the bilayer, e.g., in the form of 
macroscopic phase separation and lateral compositional 
fluctuations in the case of binary mixtures and 
multicomponent systems. In multicomponent lipid mix¬ 
tures, complex phase equilibria can arise with coexis¬ 
tence between phases of different structure. The more 
nonideal the mixture is in a thermodynamic sense, the 
more complex is the phase behavior. This leads to both 
static and dynamic heterogeneities on various length 
scales, depending on the correlation length of the lipid 
fluctuations, resulting in the occurrence of differentiated 
regions in the plane of the membrane in the form of lipid 
domains, sometimes called rafts. 

The phase equilibria in multicomponent lipid bilayers 
can imply solid and liquid phases, as well as solid-solid, 
solid-liquid, and liquid-liquid phase separation. 
Coexisting fluid phases in a membrane is attractive 
from the point of view of biological activity. Firstly, the 
fluidity provides for a membrane with a high degree of 
lateral, dijfusive mohiiity. Secondly, phase separation 
and domain formation furnish a mechanism for compart- 
mentalization of the bilayer into regions of different 
chain order and molecular composition and hence differ¬ 
ent bilayer thickness. Both of these assets are important 
for the formation of functional rafts in biological mem¬ 
branes (Jacobson et al. 2007). Only very few examples of 
fluid-fluid phase separation are found in lipid bilayer 
systems, and they almost always involve cholesterol. 

Lipid domains have been found in a wide range of 
lipid bilayer mixtures, e.g., by the use of various 
microscopy techniques, in particular ► atomic force 
microscopy and > fluorescence microscopy (Bagatolli 
2010). Examples of the lateral structure in a selection 
of different lipid mono- and bilayers are shown in 
Fig. 1. Lipid domains in biological membranes may 
be very small due to interactions with proteins, mem¬ 
brane curvature, the cytoskeleton, or due to fluxes of 
energy and material to or through the membrane. Sim¬ 
ilarly, in ► lipid hilayers on a solid support, the inter¬ 
action between the lipids and the support can break up 
the thermodynamic lipid phases into smaller domains. 
Super-resolution fluorescence microscopy finds small, 
possibly transient, nanoscopic domains of sizes around 
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Thermodynamics of Lipid Interactions, Fig. 1 Lipid 
domains detected in different lipid mono- and bilayers investi¬ 
gated by different techniques, (a) Multiphoton excitation fluo¬ 
rescence microscopy image, using laurdan, of the lateral 
structure of a DPPC monolayer in the liquid-extended liquid- 
condensed coexistence region at 8.8 mN/m and 21°C (Courtesy 
of Dr. Jonathan Brewer), (b) Atomic force microscopy image of 
the lateral stmcture of a single-supported bilayer of native pul¬ 
monary surfactant (Courtesy of Dr. Jorge Bemadino de la 
Serna), (c) Atomic force microscopy image of the lateral struc¬ 
ture of the top bilayer of a double-supported bilayer system of 

20 nm in plasma membrane in living cells where 

► membrane proteins dwell in periods of 10-20 ms 
(Eggeling et al. 2009). This study also suggests that 
these special domains owe in part their existence to 

► sphingolipids in the cholesterol-enriched rafts. 

Thermodynamics of Lipid-Cholesterol 
Interactions 

► Cholesterol is a lipid and a so-called higher sterol 
that is universally present in large amounts (20-40%) 


DLPC/DPPC 1:1 at 20°C, revealing the occurrence of ripple 
structures with a periodicity of 30 nm (Courtesy of Dr. Uffe 
Bemchou Jensen), (d) Lateral structure of a supported lipid 
bilayer reconstituted with individually discemahle aquaporins. 
The image is 100 nm x 100 nm (Courtesy of Dr. Danielle 
Keller), (e) Atomic force (e:A) and fluorescence microscopy 
(e:B and e:C) images of the lateral structure of single- and 
double-supported lipid bilayer of the raft mixture DOPC/ 
DPPC/Chol 2:2:1 spin coated onto a mica surface (Courtesy of 
Dr. Adam Cohen Simonsen) 

in the plasma membranes of all eukaryotic cells. The 
physical chemistry of cholesterol in lipid membranes 
has played a particularly important role for the under¬ 
standing of the structure and function of biological 
membranes. However, the phase equilibria in lipid 
mixtures with cholesterol have been surprisingly diffi¬ 
cult to unravel. A particularly troublesome case has 
been binary lipid-cholesterol mixtures for which the 
phase diagrams are still in dispute (Mouritsen and 
Zuckermann 2004). 

Early experimental and theoretical work suggested 
the existence of a special so-called liquid-ordered 
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Thermodynamics of Lipid 
Interactions, 

Fig. 2 Schematic phase 
diagram of a binary mixture of 
cholesterol in a phospholipid 
bilayer. Tn, is the transition 
temperature of the pure lipid 
bilayer 
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phase to appear at high cholesterol as illustrated in 
Fig. 2. This phase is intermediate between the solid 
and liquid phases of the lipid bilayer, two phases 
which more appropriately should be termed the solid- 
ordered and liquid-disordered phase, respectively. 
The underlying molecular reason for the occurrence 
of the liquid-ordered phase is cholesterol’s different 
modes of interaction with the lipids in the two lipid 
phases. From the point of view of packing, entropy of 
mixing dictates that cholesterol would be more soluble 
in the liquid-disordered (fluid) phase. From the point of 
view of van der Waals contact, cholesterol would 
prefer to have ordered lipid acyl chains next to it. The 
formation of an intermediate phase, the liquid-ordered 
phase, is the resulting compromise so unique to cho¬ 
lesterol and other higher sterols like ergosterol and 
phytosterols. 

Whereas the phase diagram of two-component bila¬ 
yers with cholesterol is not fully resolved, the situation 
for ternary systems is much more clarified. A large 
body of experimental evidence point to the existence 
of a liquid-ordered phase and liquid-ordered-liquid- 
disordered phase coexistence in ternary lipid bilayers 
involving cholesterol (Quinn 2004; Bagatolli 2010). 
The ternary systems are usually composed of mixtures 
of cholesterol with a high- and low-melting lipid spe¬ 
cies, where the high-melting species in many cases is 
a ► sphingolipid or a saturated long-chain lipid and the 
low-melting lipid is an unsaturated lipid. 


A full blown thermodynamic analysis of the phase 
equilibria in lipid mixtures involving cholesterol has 
been performed on the basis of a quantitative image 
analysis of the sizes of coexisting lipid domains in single 
giant unilamellar vesicles imaged by three-dimensional 
fluorescence microscopy (Bagatolli 2010). This work 
hence establishes a direct quantitative relation between 
the thermodynamically determined equilibrium phase 
diagram for bulk lipid systems and structural, real-space 
microscopy data as illustrated in Fig. 3. 

Thermodynamics of Lipid-Protein 
Interactions 

Proteins interact with lipids in lipid membranes in 
a variety of ways that involve hydrophobic, hydro¬ 
philic, and electrostatic forces (Tamm 2005; Perez- 
Gil 2008). The lipid phase behavior is strongly affected 
by the interaction with the proteins. ► Membrane pro¬ 
teins can bind peripherally to the membrane surface or 
be integral and membrane spanning, e.g., in the form of 
channels, pumps, and receptors. Usually, integral 
membrane proteins tend to stabilize fluid lipid phases, 
but it depends on the kind of physical constraint which 
the lipids put on the part of the protein that traverses the 
hydrophobic core of the bilayer membrane. Lipid-pro¬ 
tein interactions are the underlying molecular reason 
for the >■ functional roles of lipids in membranes. 
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Thermodynamics of Lipid Interactions, Fig. 3 (a) Theoret¬ 
ical phase diagram for the binary lipid mixture DLPC-DPPC. 
The black dots along the tie line represent molar fractions ana¬ 
lyzed in (b). The phase boundaries bounding the liquid- 
disordered and solid-ordered phases at 20° C from this phase 
diagram are indicated by black arrows, (b) Ratio of domain 
areas, Aij/Ajo, versus the lipid fraction x^ppc measured for 
a series of giant unilamellar liposomes. Data points for different 
liposomes are given hy open symbols and the mean value for 
each composition by ■. For each composition, a representative 
image of the surface projection of a liposome is shown 
at Xoppc = 0.4, 0.5, 0.6, 0.7, and 0.8. The different lines repre¬ 
sent various analyses of the data investigating the consistency 
with the theoretical, thermodynamic phase diagram and the lever 
rule. The good accordance between the analyses of the experi¬ 
mental microscopy images and the thermodynamic phase dia¬ 
gram demonstrates that a proper visual analysis of domain 
patterns on giant unilamellar liposomes can be used to extract 
bulk thermodynamic data for lipid mixtures, hence providing 
a link between spatial structure and thermodynamic properties 
(Courtesy of Dr. Luis A. Bagatolli) 


A key concept in the interaction of integral mem¬ 
brane proteins with lipid membranes is the notion of 
hydrophobic matching (Mouritsen 2005), i.e., the fact 
that the hydrophobic stretch of the protein is often 
found to approximately match the hydrophobic thick¬ 
ness of the lipid bilayer component of the membrane as 
illustrated in Fig. 4. The concept of hydrophobic 
matching can be formulated thermodynamically in 
the so-called mattress model which basically is 
a simple theory of the nonideal mixing of lipids and 
proteins. Within the theory, mismatch carries an 
energy penalty that provides a driving force for 
a lipid-mediated indirect interaction between the pro¬ 
teins, typically attractive as illustrated in Fig. 4. Under 
appropriate conditions, dictated by the underlying 
phase equilibria of the mixture, such attractive forces 
can lead to long-range capillary forces and wetting 
phenomena that lead to protein aggregation and ulti¬ 
mately protein crystallization in the plane of the mem¬ 
brane. Details of the energetics of lipid-protein 
interactions, considering the membrane as an elastic 
piece of material, hold the key to describing how 
various solutes and other agents that interact with 
membranes, such as amphiphilic drugs, may affect 
membrane protein structure and function. 

Hydrophobic matching between membrane pro¬ 
teins and lipids may imply the formation of 
a particular local environment, or lipid annulus, around 
the proteins as illustrated in Fig. 4a. This annulus is 
a statistical concept characterized by a particular cor¬ 
relation length. The lipids in this annulus need not be 
strongly bound to the protein, but the occurrence of 
lipid conformations or specific lipids species, that most 
readily conform to the hydrophobic matching condi¬ 
tion incurring the lowest cast in free energy, will sta¬ 
tistically populate the annulus. This leads to a kind of 
local enrichment, selection, and sorting of lipids at the 
protein interface. In this way, a mechanism is provided 
for lipid-controlled modulation and triggering of pro¬ 
tein conformations and function as suggested by the 
schematic illustration in Fig. 4b. 


Nonequilibrium Thermodynamics of Lipid 
Interactions 

Most of the existing knowledge about lipid and mem¬ 
brane thermodynamics and its implications for mem¬ 
brane function has been obtained from physical 
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Thermodynamics of Lipid Interactions, Fig. 4 (a) Sche¬ 
matic illustration of the principle of hydrophobic matching 
between lipid bilayers and integral membrane proteins. In the 
case of a mismatch, the deformation in the lipid matrix may 
induce an indirect, lipid-mediated attraction between the 


proteins, (b) Schematic illustration of a conformational change 
in an integral membrane protein induced by changes in the 
hydrophobic mismatch condition that is modulated by the ability 
of the lipids around the protein to adapt to local curvature and to 
assume matching chain conformations 


chemical studies of simple membrane systems 
under mostly thermodynamic equilibrium conditions 
(Mouritsen 2011). The same is true of most experi¬ 
mental models that typically are carried out as test tube 
experiments. Many membrane systems studied in the 
laboratory are not considered in the full functional 
state, and the experimental measurements usually per¬ 
tain to some kind of equilibrium or near-equilibrium 
state. 

Obviously, real biological membranes are far from 
equilibrium or at best in a driven steady-state situation. 
In any case, the well-known principles from equilib¬ 
rium thermodynamics do not apply. Typical examples 
include membranes with ion pumps that are driven by 
some kind of energy transduction mechanism; mem¬ 
branes being modulated by active enzymes, such as 
phospholipases and sphingomyelinases; and respon¬ 
sive membranes in the dermal barrier being subject to 
changes in the gradient in water chemical potential, 
morphogenesis of the endoplasmatic reticulum, and 
the Golgi apparatus with membranes that owe their 
existence to nonequilibrium conditions of flow of 
energy and matter. It is interesting to note that the 
plasma membrane can undergo phase separation 
when the cell dies (Lingwood and Simons 2010). 


Summary 

Lipids organize in space and time, e.g., in the form of 
lipid bilayer membranes, which display emergent phys¬ 
icochemical properties that are beyond the chemical 
nature of the individual molecules. Thermodynamics of 
lipid interactions characterizes bilayer stability and phase 
equilibria as well as the interactions between different 
lipids, such as phospholipids, sphingolipids, and sterols, 
as well as the interactions of lipids with peptides, pro¬ 
teins, solutes, and drugs. These various molecular inter¬ 
actions are responsible for the lateral organization and the 
mechanical properties of membranes which collectively 
take part in controlling membrane protein function. 
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Definition 

Thioredoxins are a family of small proteins (12 kDa or 
larger) catalyzing thiol-disulfide oxidoreductions by 
using redox-active cysteine residues. 

Basic Characteristics 

Thioredoxins catalyze thiol-disulfide oxidoreductions 
by using redox-active cysteine residues in their active 
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site which are typically present in a Trp-Cys-Gly-Pro- 
Cys sequence motif (Holmgren 1968). In oxidized 
thioredoxin (Trx-S 2 ) the active site is a disulfide and 
in reduced thioredoxin (Trx-(SH )2 it is a corresponding 
dithiol. Trx-(SH )2 is a powerful reductant of protein 
disulfides (Reaction 1) which results in the formation 
of Trx-S2 (Holmgren 1985). Trx-S 2 is reduced by 
thioredoxin reductase (TrxR) using electrons from 
NADPH (Reaction 2). In plant chloroplasts ferre- 
doxin-thioredoxin reductase catalyzes reduction of 
Trx-S 2 (Schurmann and Buchanan 2008). 


Trx-(SH )2 + Protein-S 2 ^ 
Protein-(SH )2 + Trx-S 2 


(Reaction 1) 


Trx-Sz + NADPH + H+ t^r 

-> (Reaction 2) 

Trx-(SH)2 + NADP+ 



Thioredoxin reductase from bacteria, fungi, and plants 
are dimeric FAD containing homodimers of 70 kDa 
with active site cysteine residues. In contrast 
mammalian thioredoxin reductases are larger dimeric 
selenoenzymes (114 kDa or larger) with a catalytically 
active selenocysteine residue located in the C-terminus 
(Zhong et al. 2000). Thioredoxins exist in all living 
cells and in multiple forms such as up to 19 isoforms in 
plants (Schurmann and Buchanan 2008). Man has two 
classical Trx, including Trxl in the cytosol/nucleus 
and Trx2 in mitochondria, both of which are essential 
and embryonically lethal (Lillig and Holmgren 2007). 
Thioredoxins have a large number of functions in 
enzyme-catalyzed reactions like to be an electron 
donor for ► ribonucleotide reductase essential for 
DNA synthesis. Other functions of Trx involve defense 
against ► oxidative stress by being a reductant of 
peroxiredoxins and methionine sulfoxide reductases 
(Lillig and Holmgren 2007). However, the largest 
number of functions of Trx is in redox regulation and 
signaling where Trx is involved in redox regulation of 
many transcription factors like NFkB, API, or p53 
(Lillig and Holmgren 2007). Reduced Trx binds to 
apoptosis signaling kinase (ASKl) to prevent apopto¬ 
sis signaling in cells. 

The structures of thioredoxins from many species 
have been determined by X-ray crystallography and 
multidimensional NMR and defines the thioredoxin 


Thioredoxins, Fig. 1 Structure of human Trxl in the fully 
reduced form. The active sites Cys 32 and Cys 35 ai'e shown as 
well as the structural Cys residues, Cys 62, Cys 69, and Cys 73. 
The helices and sheets are shown in the typical thioredoxin fold 


fold comprising a central core of p-sheet surrounded 
by a-helices (Holmgren 1995), see Fig. 1. The active 
site of E. coli and human Trxl comprises Cys 32 and 
Cys 35 which are located at the end of a the second 
beta-sheet and in the beginning of alpha helix 2. The 
structure has a conserved cA-proline (Pro 76) and 
undergoes a localized conformational change upon 
oxidoreduction (Holmgren 1995). This has been 
observed by 2-D NMR and by tryptophan fluorescence 
in E. coli Trx, where there is a threefold increase in 
fluorescence quantum yield at pH 7 from the strongly 
quenched Trp28 upon reduction. The pKa value of Cys 
32 is lowered to 7 and this is due to hydrogen bonding 
within the active site. The pKa value of Cys 35 is high 
in Trx-(SH) 2 . The redox potential of E. coli 
thioredoxin at pH 7.0 is —0.270 mV. The structure of 
Trx-2 is stabilized compared to that of Trx-(SH )2 
explaining why Trx is a thermodynamically preferred 
reductant of exposed disulfide bonds in proteins 
(Holmgren 1985). The mechanism of Trx involves 
docking and a transient mixed disulfide with the sub¬ 
strate following a nucleophilic attack by the 
depronated N-terminal Cys residue in the active site 
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(Holmgren 1985, 1995). Kinetically, Trx is known to 
be four to six orders of magnitude faster as a dithiol 
reductant compared to a strong chemical reductant like 
dithiothreitol (DTT). Recently the mechanism of 
thioredoxin catalysis has been probed by single mole¬ 
cule techniques (Wiita et al. 2007). 

The thioredoxin fold is found in a large number of 
proteins including glutaredoxins, ► glutathione perox¬ 
idases, glutathione transferases, protein disulfide isom- 
erases, and many other proteins (Martin 1995). 

Human thioredoxin is secreted and found in plasma 
reflecting the level of inflammation and has important 
functions as a growth factor for cells. Trx activity is in 
turn regulated by thioredoxin interacting protein 
(TXNIP or TBP2) (Yoshihara et al. 2010). The struc¬ 
ture and function is regulated by structural SH-groups 
which can be oxidized to an extra disulfide or 
nitrosylated. Thioredoxin plays a major role in protein 
denitrosylation but can also transnitrosylate other pro¬ 
teins (Hashemy and Holmgren 2008). 

Cross-References 

► Electron Transfer Proteins: Overview 

► Glutathione 

► Oxidative Stress 

► Ribonucleotide Reductase 
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Synonyms 

DNA 


Definition 

Topoisomerases are enzymes that modify the topology 
of DNA inside the cell. They are present in all three 
domains of life and are involved in several cellular 
processes, such as transcription, replication, and 
recombination (Champoux 2001; Wang 2002). All 
topoisomerases operate by forming transient 
phosphotyrosine covalent bond(s) with either the 
5' or the 3' end(s) of the broken DNA strand(s), 
followed by passage of the unbroken strand(s) through 
the break, and ultimately resealing the break(s) to 
change the DNA topology. Topoisomerases are classi¬ 
fied into two types, type I and type II enzymes, based 
on whether they cleave a single strand (type I) or both 
strands (type II) of DNA. Type I enzymes use the 
energy stored in supercoiled DNA as torsional strain 
for driving the DNA relaxation reaction and do not 
require any high energy cofactor for their activity. 
Type II enzymes utilize ATP hydrolysis to drive con¬ 
formational changes in the protein. The central role of 
topoisomerases in cellular processes is underscored by 
the fact that they are the target of several chemothera¬ 
peutic agents and antibiotics (Pommier et al. 2010). 

Basic Characteristics 

To perform DNA topological transformations, two 
general mechanisms are possible: enzyme-bridged 
strand passage and DNA swiveling. Enzyme-bridged 
strand passage involves cleaving and separating one or 
two DNA strands and passing other strand(s) through 
the opening before resealing the broken strand(s) 
(Eigs. 1 and 2). Every strand passage cycle involves 


the breaking and resealing of the DNA and leads to 
changes in the linking number strictly in steps of 1 or 2 
for type I and II topoisomerases respectively. Enzyme- 
bridged mechanisms require that the DNA strands are 
captured inside the protein following strand passage 
and hence enzymes that use this mechanism are char¬ 
acterized by the presence of holes that can accept DNA 
strands. This mechanism can be used not only to relax/ 
supercoil DNA, but also to catenate/decatenate and to 
knot/unknot DNA molecules. In DNA swiveling, 
a single DNA strand is transiently cleaved; one of the 
broken ends is attached covalently to the protein while 
the other end is bound non-covalently to the protein. 
The intact second strand along with the non-covalently 
attached part of the first strand rotate or swivel before 
the break is resealed (Fig. 1). Torsional energy stored 
in the supercoiled DNA drives the reaction. As more 
than one rotation can occur before religation occurs, 
the linking number can change by several steps per 
cleavage/religation cycle. The interaction between 
protein and DNA introduces friction during the rota¬ 
tion events and hence swiveling is not as fast as the 
introduction of a nick to allow the supercoils to relax. 

In the last few years, studies on the mechanism of 
topoisomerases have been greatly aided by the intro¬ 
duction of single molecule techniques (Koster et al. 
2010) (Table 1). Traditionally, changes on DNA topol¬ 
ogy have been studied using gel electrophoresis based 
approaches in bulk assays. Whereas bulk assays pro¬ 
vide powerful insights and are still widely used, the 
ability to manipulate single DNA molecules and to 
follow topological changes at the single molecule 
level has revolutionized many aspects of topoisomer- 
ase research. In particular, the use of magnetic and 
optical tweezers and other single molecule techniques 
permit tracking changes in DNA topology by 
topoisomerases in an unprecedented manner. Other 
techniques, such as FRET-based assays, are enabling 
the study of the dynamics of the reactions and promise 
to help provide a more comprehensive understanding 
of the mechanism of these enzymes. 

Type I topoisomerases are subdivided into three 
different subtypes (lA, IB, and IC) based on their 
structure and mechanism (Baker et al. 2009). In gen¬ 
eral, type I enzymes are monomeric and members of 
each subtype are similar to each other in sequence, 
structure, and atomic mechanism. Type lA 
topoisomerases are found in prokaryotes, eukaryotes, 
and archaea. They employ an enzyme-bridged 
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mechanism and form a covalent link to the 5' end of the 
broken DNA strand while non-covalently binding to 
the y end during the reaction. There are two paralogs 
of type lA enzymes: topoisomerases I and III. Topo- 
isomerase I is an efficient relaxation enzyme but not an 
efficient decatenating enzyme, while topoisomerase III 
has the opposite characteristics. In addition, a third 
type lA enzyme, reverse gyrase, is found in some 
bacteria and hyperthermophiles and is unique due to 
its ability to introduce positive supercoils into DNA in 
an ATP-dependent manner. 

The core of type lA enzymes has a characteristic 
flattened toroidal fold formed by four domains: I-IV 
(Fig. 1). This core domain is followed by a variable 
C-terminal region which is implicated in DNA bind¬ 
ing. The active site is located at the interface of two 
domains and is not fully assembled before DNA bind¬ 
ing. A single stranded DNA binding groove leads from 
the C-terminal domain to the active site, and binding of 
DNA to this region triggers assembly of the active 
site. The mechanism for cleavage/religation of the 
DNA is not completely understood, but likely involves 
magnesium. A general mechanism for type lA 
topoisomerases was proposed based on the structure 
of Escherichia coli topoisomerase I and is in agree¬ 
ment with a wealth of biochemical evidence and single 
molecule studies (Koster et al. 2010). 

Type IB topoisomerases have been identified in 
bacteria, poxviruses, and eukaryotes and share 


structural similarities with tyrosine recombinases, 
probably reflecting a common ancestor. Type IB 
enzymes relax both positively and negatively 
supercoiled DNA using a swiveling or “constrained 
rotation” mechanism and form a covalent intermediate 
with the y end of the broken DNA strand. Eukaryotic 
or cellular type IB topoisomerases are large proteins 
formed by four separate domains: a dispensable N- 
terminal domain (NTD), a highly conserved core, 
a short linker, and a C-terminal domain (CTD). The 
NTD contains a nuclear localization signal and is not 
directly involved in topoisomerase activity. The core 
and CTD domains form a “C-shaped clamp” around 
DNA (Fig. 1). The catalytic tyrosine is located in the 
CTD while the remaining active site residues form part 
of the core region. The bacterial and viral type IB 
topoisomerases are smaller proteins displaying the 
same overall architecture and mechanism of DNA 
relaxation as their eukaryotic counterparts. The mech¬ 
anisms of DNA cleavage/religation, as well as 
several aspects of drug binding and action, are well 
characterized. The overall outline of the swiveling 
mechanism is understood, but details on the conforma¬ 
tional changes of the protein during strand rotation are 
unknown. 

Type IC topoisomerases are represented by topo¬ 
isomerase V, present in the archaeal hyperthermo- 
philic Methanopyrus genus. Topoisomerase V is 
unique as it has both DNA relaxation and repair 


< - 

Topoisomerases, Fig. 1 Type I topoisomerases. The figure 
illustrates the three subtypes of type I topoisomerases with 
representative structures and their proposed mechanisms. For 
each subtype the domain arrangement in the primary sequence 
is shown above the structure of a representative member. The 
proposed mechanism of DNA relaxation for each subtype is 
shown on the right hand side, (a) Type lA topoisomerases. 
Crystal structure of Escherichia coli topoisomerase III bound 
to DNA (PDB ID 1I7D). Type lA enzymes have a toroidal fold 
with the active site at the interface of domains I and III. A single 
stranded DNA binding path is at the interface of domains I, III, 
and IV. Type lA topoisomerases employ an enzyme-bridged 
strand passage mechanism that involves opening and closing of 
a gate in the enzyme. When single stranded DNA binds to the 
enzyme, the active site is assembled and cleaves one of the DNA 
strands forming a protein-DNA covalent complex. A protein 
conformational change creates an opening in the protein and 
the cleaved DNA strand allowing the second DNA strand to 
enter into the internal cavity of the protein. The protein then 
closes followed by religation of the broken DNA. Final opening 
and closing of the protein releases the trapped DNA strand. The 
enzyme can continue to another cycle without releasing the 


DNA. (b) Type IB topoisomerases. Crystal structure of human 
topoisomerase I bound to DNA (PDB ID 1A35). The NTD and 
linker region are absent in the structure. Type IB enzymes form 
a C-shaped clamp around the DNA. The active site is formed by 
residues from domain III and the CTD. Type IB enzymes relax 
DNA by swiveling or “constrained rotation.” In this mechanism, 
one of the DNA strands is cleaved and forms a covalent complex 
with protein while the intact and the free end side of the broken 
DNA rotate past the covalently attached end. The protein scaf¬ 
fold controls the rate of rotation by creating friction between DNA 
and protein. The protein clamp may open up partially to accom¬ 
modate the rotating DNA. Each cycle involves several rotations 
leading to a change of linking number in steps of n. (c) Type IC 
topoisomerases. Crystal structure of the N-terminal 60 kDa frag¬ 
ment of topoisomerase V (PDB ID 2CSB). The compactly folded 
topoisomerase domain is followed by 12 (HhH )2 domains; the last 
few (HhFI )2 domains are responsible for the DNA repair activity. 
No structure with DNA is known. Type IC enzymes relax DNA 
by swiveling, but details of the mechanism are still unknown. The 
coloring scheme of the crystal structures matches the primary 
structure coloring scheme, active site residues are shown as 
cyan sticks, and DNA is in gray and black 
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activities in the same polypeptide. Like type IB 
enzymes, topoisomerase V relaxes both positively 
and negatively supercoiled DNA by swiveling, forms 
a covalent complex with the 3' end of the broken DNA 
strand, and does not require ATP or divalent metal ions 
for activity. Topoisomerase activity resides on 
a compact N-terminal topoisomerase domain 
presenting a unique fold that is followed by 12 tandem 
helix-hairpin-helix domains, some of which are 
responsible for apurinic/apyrimidinic lyase and deoxy- 
ribose phosphate lyase activities (Fig. 1). The way the 
enzyme binds DNA and the cleavage/religation are not 
yet fully understood. Furthermore, it is not clear 
whether the topoisomerase and DNA repair domains 
work synergistically or the reason for the presence of 
both activities in the same polypeptide. 

Type II topoisomerases are classified into two sub- 
types, IIA and IIB, based on their sequence and struc¬ 
tural characteristics (Schoeffler and Berger 2008). 
While type IIA topoisomerases are present in all 
domains of life, type IIB enzymes have only been 
found in archaea, plants, and some algae. Mechanisti¬ 
cally, all type II enzymes work through a similar, 
enzyme-bridged strand passage mechanism that 
changes the linking number in steps of 2. This 


mechanism places strong structural constraints on the 
enzyme and all type II enzymes include protein gates 
and internal cavities. In addition to DNA relaxation, 
type 11 enzymes can perform different DNA transfor¬ 
mations. For example, topoisomerase IV is a strong 
catenating/decatenating enzyme while gyrase intro¬ 
duces negative supercoils into DNA. 

Type IIA topoisomerases are multimeric enzymes 
with marked structural and sequence similarities. 
While eukaryotic enzymes are homodimeric, prokary¬ 
otic enzymes are heterotetrameric, and some viral 
enzymes are heterohexameric. All type IIA enzymes 
have the same overall architecture with equivalent 
domains, even if the domains reside in different sub¬ 
units. Interestingly, there are strong similarities 
between type lA and type IIA enzymes that extend to 
the general mechanism of DNA transformation, the 
overall structure, the active site structure, and mecha¬ 
nism of cleavage/religation. In general, type IIA 
topoisomerases are formed by seven domains and use 
ATP to drive protein conformational changes. 
Although the structure of an intact type llA enzyme 
is not known, current models based on structures of 
fragments show a molecule with a characteristic “out¬ 
side caliper” shape dominated by two large holes and 


< - 

Topoisomerases, Fig. 2 Type 11 topoisomerases. The figure 
illustrates the two subtypes of type II topoisomerases with rep¬ 
resentative structures and the proposed mechanisms. The pro¬ 
posed mechanism of DNA relaxation for each subtype is shown 
on the right hand side, (a) Domain arrangement for the different 
type II topoisomerases. Three different type IIA topoisomerases 
are shown to illustrate the differences in subunit stoichiometry 
and domain arrangements. All type II enzymes have several 
domains in common: GHKL, transducer, toprim, and WHD 
domains. The toprim and WHD (containing the catalytic tyro¬ 
sine) make up the breakage-reunion core and the GHKL domain 
contains the ATPase domain. Different type II enzymes may 
have additional, functionally specialized domains. The CTD 
domain is responsible for wrapping DNA in gyrases. The t/s 
domain is the tower-shoulder domain, the pp/nl domain is 
a supposed protein-protein and nuclear localization domain, 
and the Ig domain is a probable immuno-globulin like domain, 
(b) Type IIA topoisomerases. Model of a type IIA enzyme based 
on the crystal structures of different domains of type IIA 
topoisomerases. In the model, the GHKL and transducer 
domains (PDB ID IPVG) sit on top of the DNA breakage- 
reunion core and coiled-coiled domain (PDB ID 2RGR). Gyrase 
and topoisomerase IV have an ordered DNA wrapping 
C-terminal domain (CTD) shown in pink (PDB ID IZIO). Type 
IIA topoisomerases alter DNA topology using an enzyme- 
bridged strand passage mechanism. In the proposed mechanism. 


a G- or gate segment of double stranded DNA binds to the core of 
the enzyme followed by a double stranded break. Binding of two 
ATP cofactors help trap a transport or T-segment of DNA by 
closure of the N-gate. Hydrolysis of one ATP molecule results in 
passage of the T-segment through the DNA gate and broken 
G-segment. The T-segment is then passed through the exit or 
C-gate. A second ATP hydrolysis event allows the enzyme to 
reset and either release the re-ligated G-segment or continue the 
cycle in a processive manner, (c) Type IIB topoisomerases. 
Crystal structure of topoisomerase VI (PDB ID 2ZBK), a type 
IIB topoisomerase. Type IIB enzymes have similar domain 
architecture to type IIA topoisomerases, but they lack the 
coiled-coiled domain. In addition, some type IIB enzymes con¬ 
tain additional small domains, such as the Helix-2-Tum-Helix 
(H2TH) domain shown in yellow. The mechanism of type IIB 
topoisomerases is similar to that of type IIA. The core of the 
enzyme binds to a double stranded DNA or G-segment. The 
G-segment is broken and binding of two ATP cofactors drives 
closure of the N-gate thereby trapping the T-segment of the DNA. 
Type IIB topoisomerases lack the exit gate and so after passage of 
the T-segment through the DNA gate, the second hydrolysis step 
leads to resealing of the G-segment break and resetting of the 
enzyme. In the diagrams, DNA is shown as gray and tan rods 
while the protein is colored according to the domain shown in the 
primary structure diagram. The mechanism diagrams were drawn 
based on the ones described in Schoeffler and Berger (2008) 
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Topoisomerases, Table 1 

Examples of biophysical techniques used to study topoisomerases 


Technique 

Purpose 

System 

X-ray crystallography 

Determine high resolution atomic structures of fragments, and intact 
topoisomerases, as well as their complexes with DNA 

All topoisomerases 

Atomic force microscopy 

Used to visualize the interaction of topoisomerase with DNA by direct 
imaging of the complex 

Type IB enzymes 

Single molecule- magnetic 
tweezers 

Study DNA topological changes at the single DNA molecule level. 
Characterize the mechanism of DNA relaxation by topoisomerases by 
identifying the change in linking number and rate of relaxation on a single 
DNA molecule 

All type I and Type IIA 
topoisomerases 

Electron microscopy 

Study the structure of the larger topoisomerases, both alone and in complex 
with DNA. Determine low resolution molecular envelope to help build models 
of entire assemblies 

Type II enzymes 

Analytical ultra 
centrifugation (AUC) 

Measure hydrodynamic properties of the enzymes. Determine the 
homogeneity of protein-DNA complex and provide general information on 
shape 

Type IIA 
topoisomerases 

Single molecule - FRET 

Used to identify the conformational state and changes of type II enzymes while 
interacting with DNA 

Type IIA enzymes 

Small angle X-ray 
scattering - SAXS 

Characterize the solution structure of large molecular assemblies. Determine 
low resolution molecular envelopes suitable to place high resolution models of 
various domains of the protein 

Type II enzymes 

Small angle neutron 
scattering - SANS 

Characterize the solution structure of large molecular assemblies. Used to 
determine a low resolution molecular envelope 

Gyrase 

Single molecule studies - 
rotor bead tracking 

Study the details of the mechanism of DNA supercoiling by monitoring 
changes in DNA topology by means of a fluorescent bead attached to DNA 

Gyrase 


three gates that open or close to allow DNA strands to 
enter or exit the protein during topological transforma¬ 
tions (Fig. 2). The ATPase domains, which are respon¬ 
sible for some of the conformational changes during 
the reaction form the gate for DNA capture, or N-gate, 
and are located adjacent to the domains responsible for 
DNA cleavage/religation. These DNA cleavage/ 
religation domains open the DNA to form a DNA 
gate. Finally, two long coiled-coiled domains consti¬ 
tute the C-gate needed for strand exit. Functional spe¬ 
cialization leads to variations or additions to this 
general architecture. For example, gyrase and topo- 
isomerase IV have a C-terminal domain that, in the 
case of gyrase, is involved in DNA wrapping and is 
essential to introduce negative supercoils. 

During the reaction, a large, ^26 base pair DNA 
segment is bound by the protein and harshly bent. Type 
IIA topoisomerases undergo a series of large confor¬ 
mational changes involving the opening and closing of 
the protein gates during DNA transformations. In the 
proposed mechanistic models, DNA transformations 
begin when one double stranded DNA, the G- or gate 
segment, binds to the protein (Fig. 2). This is followed 
by the capture of another double stranded DNA seg¬ 
ment, the T- or transport segment, concomitant with an 


ATP-dependent conformational change that leads to 
the cleavage and opening of the G-segment. The 
T-segment can now pass through the G-segment 
followed by the closing and resealing of the DNA 
opening. A final conformational change opens the 
C-gate and allows the T-segment to exit. DNA cleav¬ 
age creates a 4 base pair staggered opening using 
a mechanism similar to the one employed by type lA 
enzymes and involving covalent attachments with the 
5' end of the broken strands. Recent studies indicate 
that the cleavage reaction employs a two-metal ion 
mechanism, but with important differences from the 
general one proposed for many other proteins and 
ribozymes (Schmidt et al. 2010). 

Type IIB topoisomerases, currently represented by 
topoisomerase VI, represent a much smaller group of 
type II enzymes. Unlike the different type I enzymes, 
which share no structural or sequence similarities 
between the subtypes, type IIA and IIB enzymes have 
many commonalities. Their overall mechanism of 
action shows parallels and this is reflected in an overall 
structural resemblance. Type IIB topoisomerases have 
several domains in common with type IIA enzymes 
and may even share a cleavage/religation mechanism 
(Fig. 2). Although they also work through an 
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enzyme-bridged strand passage mechanism, the exit or 
C-gate is missing in type IIB enzymes. Some type IIB 
enzymes have additional domains that are absent in 
other suhtypes, but the role of these additional domains 
is not known, although some may be involved in pro¬ 
tein-protein interactions. Additional differences 
include a dependency on ATP hydrolysis for DNA 
cleavage and a 2 base pair staggered cut. Mechanisti¬ 
cally, type IIB probably go through a very similar cycle 
as type IIA enzymes, including opening and closing of 
gates concomitant with strand passage events, but 
details of the mechanism, the conformational changes 
in the protein, and the way the protein and DNA 
interact are still poorly understood. 

Cross-References 

► ATPase: Overview 

► DNA Repair and Recombination 

► Fi-ATPase: Fundamental Properties and Structure 

► Macromolecular Crystallography: Overview 

► Magnetic Tweezers 

► Single-Molecule Methods 

► Small Angle X-ray Scattering 
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Due to the extent of chemical shift overlap and 
degeneracy, the assignment of side chain chemical 
shifts is tedious and difficult for many structural 
biologists. This technical challenge creates a major 
bottleneck in the pursuit of solution-based protein 
structures using nuclear magnetic resonance (NMR) 
spectroscopy (► Protein NMR - Introduction; 
► NMR-based Structural Proteomics) because of 
incomplete assignment tables, which poses 
a problem at the level of assigning 'H-'H Nuclear 
Overhauser Effect (NOE) distance restraints for sub¬ 
sequent structure calculation. However, access to 
high-field NMR spectrometers as well as major 
improvements in the manner in which NMR data is 
collected, processed, and analyzed has helped some to 
overcome this challenge. Specifically, total correla¬ 
tion spectroscopy (TOCSY), also referred to as homo- 
nuclear Hartmann Hahn (HOHAHA), has been an 
important development in the field whose importance 
is underscored by the ease in which side chain 'H, '^C, 
and '^N resonances are identified. This entry will 
focus on the importance and use of the TOCSY in 
protein nuclear magnetic resonance (NMR) spectros¬ 
copy, particularly in the derivation of experimentally 
derived high-resolution protein structures. 

As opposed to the chemical shift assignment 
of main chain elements, side chain resonances are 
much more challenging to assign because there are 
more 'H and '^C atoms. In this regard, the NMR data 
becomes difficult to interpret, increasing the risk of an 
incorrect assignment. The development of specific 
TOCSY experiments, including the 2D 'H-'H 
TOCSY (Braunschweiler and Ernst 1983; Bax and 
Davis 1985), 3D C(CO)NH-TOCSY (Logan et al. 
1992), 3D H(CCO)NH-TOCSY (Logan et al. 1992), 
3D HCCH-TOCSY (Bax et al. 1990), the complemen¬ 
tary 3D H(C)CH-TOCSY, 3D (H)CCH-TOCSY 
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(Bax et al. 1990), and the 3D '^N-TOCSY-HSQC 
(Marion et al. 1989) are common tools to minimize 
these errors and gaps in the compilation of the final 
assignment list. Of note, in several TOCSY-based 
experiments, especially in the case for the N-rooted 
3D C(CO)NH-TOCSY, 3D H(CCO)NH-TOCSY, and 
3D *^N-TOCSY-HSQC, the observed signal is heavily 
compromised owing to the multiple magnetization 
transfers and relaxation losses. In addition, these 
experiments suffer from a lack of digital resolution in 
the indirect dimension, making the stereospecific 
assignment of specific side chain resonances difficult 
(e.g., HB2 and HB3, HDll and HD21). Although the 
visualization and interpretation of the spectra for the 
2D 'H-^H TOCSY, 3D HCCH-TOCSY, 3D H(C)CH- 
TOCSY, 3D (H)CCH-TOCSY is more complex, they 
are more robust and rich in chemical shift information 
to help in the complete assignment of protein side 
chain resonances ► BMRB). Furthermore, these latter 
experiments work well when the protein is 
reconstituted in deuterium oxide (D 2 O); elimination 
of the water signal and labile protons helps to 
assign the chemical shift of aromatic and Hq( protons 
that reside close to the water signal at around 4.73 ppm. 
The latter is particularly important in the calculation 
of dihedral angle restraints for refinement of regions 
containing secondary structural elements (e.g., a-helix 
and P-sheet). 

Protein structures are built around a nest core of 
hydrophobic interactions between long acyl and aryl 
side chains such as isoleucine and tryptophan, respec¬ 
tively. The identification and tabulation of side chain 
resonance assignments from these TOCSY-based 
NMR experiments assists in the assignment of more 
NOEs and mapping of interproton networks for subse¬ 
quent structure determination. Therefore, the TOCSY, 
in part, improves the convergence and resolution of the 
structural ensemble and associated structure quality 
factors as measured by Molprobity and PROCHECK, 
which are readily accessed through the protein struc¬ 
ture validation software suite (PSVS) (Bhattacharya 
et al. 2007). 

Cross-References 

► BMRB 

► NMR-based Structural Proteomics 

► Protein NMR - Introduction 
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Synonyms 

Evanescent wave microscopy; Single-molecule 
methods; TIRE 


Definitions 

Evanescent field: A thin layer of illumination produced 
in a lower refractive index material (usually a liquid in 
biophysical applications) by a light beam propagating 
in a higher refractive index material (usually a solid 
glass coverslip or tissue culture plastic in biophysical 
applications) toward the solid/liquid interface at an 
angle of incidence (measured from the normal to the 
interface) greater than a critical angle. The evanescent 
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field strength decays exponentially with distance nor¬ 
mal to the interface. 

Critical angle: The incidence angle, measured from 
the normal in the high index solid, at which the 
refracted beam in the low index liquid skims along 
parallel to the interface. At incidence angles greater 
than the critical angle, the refracted beam becomes 
evanescent and does not propagate away from the 
interface. 

Introduction 

Total internal reflection fluorescence (TIRF) micros¬ 
copy (also called “evanescent wave microscopy”) 
provides a means to selectively excite fluorophores in 
an aqueous environment very near a solid surface 
(within <100 nm). Fluorescence excitation by this 
thin zone of electromagnetic energy (called an “eva¬ 
nescent field”) results in images with very low 
background fluorescence from out-of-focus planes. 
TIRF thereby provides the very dark background 
needed to observe single fluorophores. In combination 
with polarized light techniques, TIRF can also provide 
information about surface-bound or surface-proximal 
single-molecule orientation and rotational dynamics. 
The motion of single molecules (or small numbers 
of them) can be detected (and seen visually) as fluo¬ 
rescence fluctuations when fluorescent molecules enter 
and leave the thin evanescent field region. A complete 
physics and biophysics-oriented review is given by 
Axelrod (2007) and a recent review of TIRF applica¬ 
tions to biomolecules is given by Ross and Dixit 
( 2010 ). 

Numerous applications of TIRF are found in molec¬ 
ular, membrane, and cellular biophysics. Very sparse 
labeling of filaments in the submembrane of cells 
produces discrete speckles under TIRF illumination; 
the motion of these speckles can be followed easily 
(“speckle microscopy”) to report on filament subunit 
turnover. TIRF can be used to observe cell/substrate 
contact regions, submembrane filament structure, 
motion and release of secretory granules, and single 
ion channels, with good spatial and temporal resolu¬ 
tion. TIR, combined with fluorescence recovery after 
photobleaching (FRAP), or fluorescence correlation 
spectroscopy (FCS), can examine the specific or 
nonspecific binding kinetics of proteins to cellular 
or artificial membranes, and possible surface diffusion. 


Interactions among channel or receptor proteins at 
surfaces can be studied by TlR/fluorescence resonance 
energy transfer. 

Although TIRF microscopy became widely used 
only in the 1990s and increasingly so thereafter, it 
actually had much earlier origins in surface spectros¬ 
copy. TIR in a nonmicroscopic nonfluorescent form 
had long been used for visible and infrared absorption 
spectroscopy. Nonmicroscopic TIR combined with 
visible fluorescence spectroscopy at various kinds of 
interfaces also has a long history, beginning in the 
early 1960s. The first application of TIR to microscopy 
was nonfluorescent, describing evanescent light scat¬ 
tering from cells in 1961. The combination of all three 
elements - TIR and fluorescence and microscopy - 
was presented first in 1981. 

TIRF is one of many microscopy approaches 
designed to improve the microscope’s longitudinal 
resolution (i.e., along the optical axis), which 
otherwise is intrinsically much worse than its lateral 
resolution. Among these alternative methods are 
confocal, multiphoton, standing-wave, structured illu¬ 
mination, and STED microscopies. These alternatives 
have a clear advantage in that they can work at any 
accessible plane of the sample, not just at an interface 
between dissimilar refractive indices. However, TIRF 
has the advantage for some applications. The depth of 
the optical section in TIRF is typically <0.1 pm, much 
shorter than the ^0.6 pm typical of confocal and 
multiphoton microscopy. TIRF is much less expensive 
than most of these other approaches and can even be 
made with “homemade” components. TIRF is a wide- 
field (rather than point scanning) approach, so all parts 
of the field of view can be viewed simultaneously by 
a digital camera, which leads to much faster data 
acquisition and often more efficient light gathering. 
TIRF requires no post-acquisition image processing, 
unlike structured illumination. In TIRF, extraneous 
photodamage is minimized because only the surface- 
proximal region being imaged is under illumination. 

TIR Excitation 

The thin layer of illumination is an “evanescent field” 
produced by an excitation light beam in a solid 
(e.g., a glass coverslip or tissue culture plastic) that is 
incident at a high angle upon the solid/liquid surface at 
which the sample adheres. The incidence angle 6, 
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Total Internal Reflection 
Fluorescence Microscopy 
for Single-Molecule 
Studies, Fig. 1 TIR 

illumination scheme. 
Refractive index must be 
greater than n ,. The optional 
intermediate layer may consist 
of metal or a dielectric 
material of refractive index n 2 . 
The incidence angle 9 must be 
larger than the critical angle 6^ 
for TIR to occur. The 
exponentially decaying 
evanescent field in the Hi 
material is used to excite 
fluorophores in TIRF 



measured from the normal, must be greater than some 
“critical angle” for the beam to totally internally reflect 
rather than refract through the interface. TIR generates 
a very thin electromagnetic field in the liquid with the 
same frequency as the incident light, exponentially 
decaying in intensity with distance from the surface. 
This field is capable of exciting fluorophores near the 
surface while avoiding excitation of a possibly much 
larger number of fluorophores farther out in the liquid. 

The simplest case of TIR is that of an “infinitely”- 
extended plane wave incident upon a single interface 
(i.e., a beam width many times the wavelength of the 
light, which is a good approximation for unfocused or 
weakly focused light); see Fig. 1 . When a light beam 
propagating through a transparent medium 3 of high 
index of refraction (e.g., glass) encounters a planar 
interface with medium 1 of lower index of refraction 
(e.g., water), it undergoes total internal reflection for 
incidence angles 0 (measured from the normal to the 
interface) greater than the “critical angle” 6^. given by: 



where «i and «3 are the refractive indices of the liquid 
and the solid, respectively. Ratio njrij, must be less 
than unity for TIR to occur. For “subcritical” incidence 
angles 9 < 9^, most of the light propagates through the 
interface into the lower index material with a refraction 


angle (also measured from the normal) given by 
Snell’s Law. But for “supercritical” incidence angles 
9 > all of the light reflects back into the solid. Even 
in this case, some of the incident light energy pene¬ 
trates through the interface and propagates parallel to 
the surface in the plane of incidence. The field in the 
liquid (sometimes called the evanescent “wave”), is 
capable of exciting fluorescent molecules that might be 
present near the surface. 

The intensity 1 of the evanescent field at any posi¬ 
tion (measured as perpendicular distance z from the 
TIR interface) decays with z: 

/(z) = 

where 

d = —— (sin^0 — sin^0£.) 

471123 ^ 

Parameter Iq is the wavelength of the incident light in 
vacuum. Depth d is independent of the polarization of 
the incident light and decreases with increasing 0. 
Except for supercritical 9 ^ 9^ (where d oo), d is 
generally on the order of or smaller. 

The polarization of the evanescent field depends on 
the incident light polarization, which can be either 
“p-pol” (polarized in the plane of incidence formed 
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Total Internal Reflection Fluorescence Microscopy for 
Single-Molecule Studies, Fig. 2 Schematic drawing of the 
evanescent polarization resulting from p-pol incident light. The 
incident light wavefronts (with the intervals from solid to dashed 
wavefront lines representing one half of a wavelength in the 
glass) determine the spacing of the wavefronts in the evanescent 
field. The wavelength in the n^ medium is XJn^. Reflected light 
is not shown. The p-pol evanescent field is elliptically polarized 
in the x-z plane as shown (primarily polarized in the z-direction 
with a weaker x-component at a relative phase of n/2). For 
pictorial clarity, only one full period of evanescent electric 
field oscillation is shown; in reality, the evanescent region is 
much more extended and contains many more periods of oscil¬ 
lation in the x-direction. An s-pol incident beam (polarized 
normal to the page) produces a purely s-pol evanescent field 


by the incident and reflected rays, denoted here as the 
x-z plane) or “s-pol” (polarized normal to the plane of 
incidence). Figure 2 shows these polarizations as they 
appear on each side of the interface. The intensities for 
the two polarizations are not the same, and they are 
both strong functions of incidence angle (see Fig. 3). 
The evanescent intensity for both polarizations 
approaches zero as 6 ^ 90°. On the other hand, for 
supercritical angles within 10° of 6^, the evanescent 
intensity is as great or greater than the incident light 
intensity. 

For a finite-width beam, the evanescent held can be 
pictured as the beam’s partial emergence from the 
solid into the liquid, travel for some finite distance 
along the surface, and then reentrance into the solid. 
The distance of propagation along the surface is 
measurable for a finite-width beam and is called the 
Goos-Hanchen shift. The Goos-Hanchen shift ranges 
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Total Internal Reflection Fluorescence Microscopy for 
Single-Molecule Studies, Fig. 3 Evanescent intensities Ip ,, 
at z = 0 versus 9, assuming the incident intensities in the glass 
are set equal to unity. At angles 6 > 6e. the transmitted light is 
evanescent; at angles 6 < 9,,, it is propagating. Both s- and 
p-polarizations are shown. Refractive indices n^ = 1.46 (fused 
silica) and = 1.33 (water) are assumed here, corresponding to 
0c = 65.7°. Also shown is the evanescent intensity that would be 
obtained with a thin (20 nm) aluminum film coating 



from a fraction of a wavelength at 0 = 90° to infinite at 
9 — 0c, which of course corresponds to the refracted 
beam skimming along the interface. 

In practice, the interface may not be a simple 
interface between two media, but rather a stratified 
multilayer system. Insertion of intermediate layers 
never thwarts TIR, regardless of its refractive index 
of the layers. The only question is at which interface 
TIR takes place. But intermediate layers do affect the 
intensity at the final interface with the medium. 
Regardless of the refractive indices and thicknesses 
of the intermediate layers, the evanescent field’s pro¬ 
file in will be exponentially decaying with 
a characteristic decay distance d given above. 

A particularly interesting kind of intermediate layer 
is a metal film. Such a film will reduce the s-polarized 
evanescent intensity to nearly zero at all incidence 
angles. But the p-polarized behavior is quite different. 
At a certain sharply defined angle of incidence 0p (“the 
surface plasmon angle”), the p-polarized evanescent 
intensity becomes an order of magnitude brighter than 
the incident light at the peak. This strongly peaked 
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effect is due to a resonant excitation of electron oscil¬ 
lations at the metal/water interface. For an aluminum, 
gold, or silver film at a glass/water interface, 0p is 
greater than the critical angle for TIR. The intensity 
enhancement is rather remarkable since a 20 nm thick 
metal film is almost opaque to the eye. 

Fluorescence Emission near Surfaces 

Strictly speaking, “TIR” refers only to the mode of 
excitation. But since the molecules absorbing the eva¬ 
nescent excitation light are those residing very close to 
the substrate surface, the effect of the surface on the 
emission must be considered. A fluorophore very far 
from a surface will emit light energy equally divided 
between the directions away and toward the surface. 
But the closer a fluorophore is to the surface 
(i.e., within a light wavelength), the larger the propor¬ 
tion of its emitted light will enter the substrate. The 
cause of this asymmetry is the interaction of the 
fluorophore’s “near field” with the interface and its 
consequent conversion into light propagating at high 
angles into the higher index material (the solid 
substrate). If a sufficiently high aperture microscope 
objective looking through the substrate is used to 
gather the emission, then the surface selectivity pro¬ 
duced by TIR is further augmented by the greater 
ability of surface-proximal fluorophores to send emis¬ 
sion light into the substrate and the objective. The 
dependence of gathered emission upon z will be 
nonexponential and will depend upon the numerical 
aperture of the objective (as well as upon the orienta¬ 
tional distribution of the fluorophore dipoles, and the 
refractive indices bordering the interface. For very 
high aperture objectives that can capture near-held 
emission, the effective surface selectivity becomes 
even more rapidly decaying in the z-direction than 
the exponential decay of the TIR evanescent held. 

The orientation of the huorophore also affects emis¬ 
sion pattern. This is most evident at the back focal 
plane (BFP) of the objective gathering the emission 
light (rather than the image plane), because the light 
pattern at the BFP is a mapping of the angular depen¬ 
dence of the emission toward the substrate. Figure 4 
shows a theoretical example of the pattern that appears 
at the BFP. 

A metal-coated glass surface strongly affects both 
the intensity and the angular pattern of the emission. 



Total Internal Reflection Fluorescence Microscopy for 
Single-Molecule Studies, Fig. 4 Intensity pattern at the 
BFP of the objective (not the image plane), as produced by 
a single fluorophore residing on the TIR suiface. The 
fluorophore’s dipole moment here is assumed to be parallel to 
the substrate surface and oriented right/left. The bright ring can 
be gathered only by objectives with a numerical aperture (1.49 
assumed here) greater than the refractive index of the medium in 
which the fluorophore resides (1.33 assumed here). The diffuse 
light outside that ring arises entirely from the near field of the 
fluorophore, as captured by the substrate and directed to the 
objective. In general, the pattern contains information about 
the distance of the fluorophore from the surface, its orientation, 
and the index of refraction in which it resides 

Emitted light from a distant fluorophore reflects off the 
metal surface and very little penetrates through. But 
a nearby fluorophore will induce a surface plasmon 
excitation in the metal, which in turn produces 
a thin-walled hollow cone of emission light propagat¬ 
ing into the glass substrate. This hollow cone of 
light can be captured by a high aperture objective. 
The rapid conversion of near-held light into either 
a surface plasmon or heat in the metal leads to 
a much reduced excited state lifetime for fluorophores 
near a surface with a consequent great reduction in 
photobleaching rate. 

Optical Configurations 

Two basic kinds of optical setups for TIRF can been 
employed. “Objective-based” TIRF uses a high 
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numerical aperture (NA > 1.4) microscope objective 
for both TIR illumination and emission. “Prism-based” 
TIRF uses a prism to direct the light toward the 
TIR interface with the objective employed only for 
emission collection. 

Figure 5 shows some schemes for objective-based 
TIRF. To achieve TIR, the numerical aperture of the 
objective must be greater than «i, preferably by 
a substantial margin. By using an objective with 
a sufficiently high NA, supercritical angle incident 
light can be cast upon the sample by illumination 
through the objective. In some variations, a “raw” 
laser beam is focused (by an external focusing lens) 
to an off-axis point at the back focal plane (BFP) of the 
objective so that the light emerges from the objective 
in a collimated form (i.e., the “rays” are parallel to each 
other) at the same angle 0 with respect to the optical 
axis. There is a one-to-one correspondence between 
the off-axis radial distance and the angle 6. At 
a sufficiently off-axis radial distance, angle 6 will 
exceed the TIR critical angle. Further increases in 
radial distance serve to reduce the characteristic eva¬ 
nescent field depth c/ in a smooth and reproducible 
manner. In another variation (common in commercial 
systems), the exit port of a fiber optic cable carrying the 
laser light is refocused off-axis at the BFP. The angle 
of convergence/divergence of the laser beam at the 
BFP is proportional to the diameter of the illuminated 
region subsequently created at the sample plane. 

Commercial arc-lamp TIRF systems use an off-axis 
curved slit cutout at an equivalent BFP upbeam from 
the actual BFP. Arc illumination has the advantages of 
freedom from coherent light interference fringes and 
easy selection of excitation colors with filters, but it is 
somewhat dimmer because much of the arc lamp 
power directed toward the sample at subcritical angles 
is necessarily blocked. 

Figure 6 shows several schematic drawings for 
setting up “prism-based” TIR in an inverted micro¬ 
scope. A commercially available variation of prism- 
based TIR uses a high aperture condenser fed with 
laser light from an optical fiber tip instead of 
a custom-installed prism fed by a free laser beam. 
Although a prism may restrict sample accessibility 
or choice of objectives in some cases, prism-based 
TIR is very inexpensive to set up and produces 
a “cleaner” evanescent-excited fluorescence (i.e., less 
excitation light scattering in the optics) than objective- 
based TIR. 



filter cube/objective 




off-axis crescent silt 


Total Internal Reflection Fluorescence Microscopy for 
Single-Molecule Studies, Fig. 5 Three arrangements for 
objective-based TIRF in an inverted microscope. OBJ refers to 
the objective, SP refers to sample plane, and BFP refers to the 
objective’s back focal plane or its equivalent planes (also called 
“aperture planes”), (a) Laser illumination through a side port. 
The beam is focused at the back focal plane at a radial position 
sufficient to lead to supercritical angle propagation into the 
coverslip. Moving the focusing lens L transversely changes the 
angle of incidence at the sample and allows switching between 
subcritical (EPI) and supercritical (TIR) illumination, (b) Laser 
illumination introduced by an optical fiber through the rear port 
normally used by the arc lamp, (c) Arc-lamp TIR illumination 
with no laser at all. The goal is to produce a shaip-edged image 
of a crescent aperture at the BFP such that only supercritical light 
passes through the objective 
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Total Internal Reflection Fluorescence Microscopy for 
Single-Molecule Studies, Fig. 6 Three arrangements for 
prism-based TIR, all using a laser as a light source. The vertical 
distances are exaggerated for clarity, (a) The TIR prism (a cube 
here) is above the sample. A buffer-filled sample chamber sand¬ 
wich consists of a lower bare glass coverslip, a thin spacer ring, 
and a glass coverslip inverted so that the TIR surface faces down. 
The upper surface of the cell coverslip is put in optical contact 
with the prism lowered from above by a layer of immersion oil or 
glycerol. The lateral position of the prism is fixed but the sample 
can be translated while still maintaining optical contact, (b) The 
TIR prism (a triangle here) is installed below the sample to set up 

TIR from Multiple Directions 

Illumination by two mutually coherent TIR laser 
beams produces a striped interference fringe pattern 


multiple internal reflections in the substrate. This configuration 
thereby allows for complete access to the sample from above. 
However, only air or water immersion objectives may be used 
because oil at the substrate’s lower surface will thwart the 
internal reflections, (c) TIRF for an upright microscope utilizing 
the integral optics in the microscope base and a trapezoidal prism 
on the vertically movable condenser mount. The position of the 
beam is adjustable by moving the external lens. An alternative 
hemispherical prism configuration for variable incidence angle 
is also indicated to the left. An extra set of mirrors can be 
installed to deflect the beam into an epi-illumination light path 
as shown 

in the evanescent field intensity in the sample plane. 
For a typical glass substrate with = 1.5, the fringe 
spacing can be smaller than the Raleigh resolution 
limit of the microscope. These interference stripes 
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can be used in combination with a FRAP protocol to 
produce a periodic stripe bleach pattern, and the 
subsequent fluorescence recovery will then provide 
information about surface diffusion of fluorophores. 
TIR interference stripes can also be used in combina¬ 
tion with structured illumination analysis to improve 
lateral resolution. 

Variable Incidence Angle TIR: Concentration 
Profiles 

It may be of interest to deduce what is the concentra¬ 
tion profile C(z) of fluorophores within the evanescent 
field, to a resolution much shorter than the evanescent 
field depth. The observed fluorescence F(0) is 

OO 

F{e)= I P{2)F^o{B)C{z)e-^l‘^^^Uz 

r=0 

where /l(z) combines all the previously discussed 
effects of z-dependent emission collection efficiency 
through the chosen numerical aperture objective. F can 
be measured at several different values of 9 to deduce 
C(z). The result should be treated with caution, how¬ 
ever, because for a finite number of F(0) measure¬ 
ments, the deduction of C(z) is not mathematically 
unique. Also, the presence of intermediate films can 
significantly alter the 4=o(0) factor in the integrand 
from the form expected for bare surfaces that are 
depicted in Fig. 3. 

Summary 

Total internal reflection fluorescence (TIRF) micros¬ 
copy is a wide-field sample illumination technique 
which selectively excites fluorophores in an aqueous 
environment very near a solid surface, while avoiding 
excitation in more distal regions. TIRF thereby pro¬ 
vides the very dark background needed to observe 
single fluorophores in focus at the surface. The thin 
illumination zone in TIRF can be polarized and the 
resulting images can contain information about sur¬ 
face-bound or surface-proximal single-molecule ori¬ 
entation and rotational dynamics. Since it is a wide- 
field technique (rather than scanning), TIRF also can 
be used on living cells to observe cell/substrate contact 


regions, submembrane filament structure, single ion 
channels, and motion and release of secretory granules, 
with good spatial and temporal resolution. Combined 
with fluorescence recovery after photobleaching 
(FRAP) or fluorescence correlation spectroscopy 
(FCS), TIRF can examine the specific or nonspecific 
binding kinetics of proteins to cellular or artificial 
membranes, and possible surface diffusion. 


Cross-References 

► Fluorescence 

► Fluorescence and FRET in Membranes 

► Fluorescence Correlation Spectroscopy 

► Fluorescence Cross-Correlation Spectroscopy 

► Fluorescence Fluctuation Spectroscopy 

► Fluorescence Imaging with One Nanometer 
Accuracy 

► Fluorescence Techniques for Studying Ion Channel 
Gating: VCF, FRET, and LRET 

► Photoactivated Localization Microscopy (PALM) 

► Polarized Light, Linear Dichroism, and Circular 
Dichroism 

► Quenched-Llow Methods 

► Single Lluorophores Photobleaching 

► Single Particle Tracking 

► Single-Molecule Lluorescence Resonance Energy 
Transfer 

► Single-Molecule High-Resolution Colocalization 
(SHREC) 

► Single-Molecule High-Resolution Imaging with 
Photobleaching (SHRImP) 

► Single-Molecule Methods 

► Single-Molecule Sequencing 

► Single-Molecule Spectroscopy 

► Stochastic Optical Reconstruction Microscopy 

► Structured Illumination Microscopy (SIM) 

► Surface-Tethered Lipid Vesicles 

► Tethered Particle Microscopy 

► Zero-Mode Waveguides 
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Toxicity 

► Glycosylation in Metabolism 


TPC Mitochondrial Thiamine Carrier 

► Mitochondrial Transport Protein Family 


TPM or Tethered Particle Motion 

► Tethered Particle Microscopy 


Transbilayer Movement 

► Lipid Flip-Flop 
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Transcriptional 3'-End Processing 

Ian A. Taylor 

Division of Molecular Structure, National Institute for 
Medical Research, London, UK 

Synonyms 

Cleavage/Polyadenylation reaction; pre-mRNA 
processing 


Definition 

Transcriptional 3'-end processing is the combined pro¬ 
cess of the pre-mRNA transcript cleavage and 
polyadenylation that produces mature mRNA for 
export to the cytoplasm and translation. 

3'-End Processing and Polyadenylation 

Transcriptional 3'-end processing of messenger RNAs 
(mRNA) is a fundamental process carried out hy 
eukaryotic cells. In the first step of the process RNA 
polymerase II (RNAPII) transcribed pre-mRNAs are 
cleaved downstream from the last exon of the gene in 
their 3'-untranslated region (3'-UTR). The nascent 
3'-end is then a substrate for polyA polymerase (PapI) 
that adds a poly-adenylate (polyA) tail to produce the 
mature mRNA. The addition of this polyA tail is 
a critical requirement to allow export of mRNA from 
the nucleus and subsequent translation in the cyto¬ 
plasm, reviewed in (Edmonds 2002; Zhao et al. 1999). 
However, as well as promoting the transport of mRNA 
from nucleus to the cytoplasm, 3'-end processing has 
important regulatory functions in transcription elonga¬ 
tion and splicing (Bentley 2005; Calvo and Manley 
2003; Zorio and Bentley 2004), in directing correct 
transcriptional termination (Buratowski 2005; 
Rosonina et al. 2006) and in the posttranscriptional 
regulation of gene expression through the utilization 
of alternative cleavage sites (Kuersten and Goodwin 
2003; Lutz 2008; Wilkie et al. 2003). 

3'-UTR RNA Recognition Elements 

3'-end cleavage and polyadenylation is directed hy 
sequence elements found within the 3'-UTR of pre- 
mRNA. These sequence elements are found in almost 
every eukaryotic pre-mRNA. The biggest exceptions 
are histone pre-mRNAs that are cleaved but not 
polyadenylated. The RNA recognition sequences 
utilized in 3'-end processing by mammals and 
S. cerevisiae although different share strong similarities 
and can he broken down into the polyA signal (position¬ 
ing element in yeast), the cleavage site, an efficiency 
element together with upstream and downstream auxil¬ 
iary elements. The location and consensus sequences for 
these recognition elements are summarized in Table 1. 
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Transcriptional 3'-End Processing, Table 1 3'-end processing signal sequences 

Signal sequence 

Mammalian 


Yeast 


Consensus 

Position^ 

Consensus 

Position"* 

PolyA signal (PE) 

A(A/U)UAAA 

+(10-30) 

AA(U/A)AAA 

+(10-30) 

Cleavage site 

CA^N 

/ 

(C/OA^'N 

/ 

Efficiency element 

/ 

/ 

UAUAUA 

+(Variable) 

Upstream element 

U(U/G)U(U/A) 

+(Variable) 

Urich 

+(5-10) 

Downstream element 

U/G rich 

-(10-30) 

U rich 

-(5-10) 


“nucleotides, relative to the cleavage site 


In mammals, the polyA signal is highly conserved 
and has the sequence A(A/U)UAAA in 95% of 
polyadenylated transcripts. This strongly conserved 
element is found throughout higher eukaryotes and is 
located 10-30 nucleotides upstream of the transcript 
cleavage site. In yeast, the polyA signal is sometimes 
referred to as the positioning element (PE). It is also 
located 10-30 nucleotides upstream of the cleavage 
site and although the consensus sequence is not as 
conserved as in mammals AAUAAA and AAAAAA 
sequences support the most efficient cleavage/ 
polyadenylation reaction. The cleavage site itself is 
a conserved feature in both mammals and yeast. Tran¬ 
script cleavage most frequently occurs immediately 3' 
to an adenine nucleotide that is preceded hy 
a pyrimidine (PyA^N). In mammals the CA dinucleo¬ 
tide is strongly preferred but this is relaxed to (C/U)A 
in yeast. 

The polyA signal and cleavage site are also accom¬ 
panied by a number of other signal sequences that vary 
significantly between yeast and higher eukaryotes. In 
the mammalian system less conserved G/U or U-rich 
downstream elements are located 10-40 nucleotides 3' 
to the cleavage site, together with a U-rich upstream 
element located a variable distance 5’ to the polyA 
signal. In yeast, U-rich elements are found both just 
upstream and downstream of the cleavage site. There is 
also an efficiency element (EE) with an optimum 
sequence of UAUAUA located at a variable distance 
5 ’ to the PE. 

3'-End Processing Factors 

The conserved RNA sequence elements found in the 
3'-UTR of genes constitute the target recognition 
sequences required to recruit the multisubunit 3'-end 
processing complexes and direct site-specific cleavage 


and polyadenylation reactions. In both yeast and mam¬ 
mals, around 20 individual factors are required to carry 
out the 3'-end processing reaction and many of the 
proteins are conserved in both mammalian and yeast 
systems. However, the distribution of factors within 
processing complexes differs significantly. Table 2 
details the components of mammalian and yeast 
3'-end processing factors, the complex they are 
contained within, and their function in the processing 
reaction. Eor a comprehensive review of all the prop¬ 
erties of individual components of 3'-end processing 
complexes see (Mandel et al. 2008). 

In the yeast system two multiprotein assemblies 
Cleavage factor lA (CEIA) and Cleavage and 
Polyadenylation factor (CPE) together with three aux¬ 
iliary factors Papl, Hrpl (CEIB), and Pabl are 
required to carry out the transcript cleavage/ 
polyadenylation reaction. CEIA is required for site 
selection and the cleavage reaction, but not for 
polyadenylation. It is comprised from four subunits 
Rnal4, Rnal5, Pcfl 1, and Clpl but can be co-purified 
along with Hrpl (CEIB) as holo-CEI. CEIA is associ¬ 
ated with the upstream PE through the Rnal5 subunit 
but only when Hrpl Is bound to an adjacent EE. How¬ 
ever, the exact nature of this RNA-binding specificity 
remains unclear. The Pcfl 1 subunit of CEIA contains 
an RNAPII-CtD /nteracting domain (CID) at its 
N-terminus and couples 3'-end processing directly to 
transcription through interaction with phosphorylated 
YSPTSPS motifs present in the C-terminal region of 
large subunit of RNAPII. Rnal4 is a scaffolding pro¬ 
tein that directs assembly of CEIA, and Clpl is an 
ATP-containing bridging protein that binds to both 
Pcfl 1 and CPE subunits. 

CPE is a larger complex that is required in both the 
cleavage and polyadenylation reactions. It contains 
a core of eight subunits Cftl, Cft2, Yshl, Ptal Mpel, 
Pfs2, Fipl, and Ythl and an additional six subunits 
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Transcriptional 3'-End Processing, Table 2 Composition of S. cerevisiae and mammalian 3'-end processing complexes 

Yeast factor (complex) 

Mammalian factor (complex) 

Structural features/function 

Cftl (CPF) 

CPSF 160 (CPSF) 

P-propeller domains/polyA signal RNA binding 

Cft2 (CPF) 

CPSFIOO (CPSF) 

P-CASP protein/RNA binding, CPF factor recruitment 

Yshl (CPF) 

CPSF73 (CPSF) 

P-CASP protein/3'-end nuclease 

Ythl (CPF) 

CPSF30 (CPSF) 

Zn fingers/RNA binding to U-rich sequences 

Fipl (CPF) 

hFipl (CPSF) 

Recruitment of Papl to polyA site 

Ptal (CPF) 

Symplekin 

Scaffold protein/CPF factor recruitment 

Pfs2 (CPF) 

CstFSO (CstF)? 

WD-40 repeats/bridges CPF to CFIA 

Ptil (CPF) 

CstF64 (CstF) 

RRM/associates CPF activity with snoRNAs“ 

Mpel (CPF) 

/ 

Zinc Knuckle/CPF component but unction unknown 

Ref2 (CPF) 

/ 

Enhancement of processing efficiency at weak sites 

Swd2 (CPF) 

/ 

WD-40 repeats/CPF factor recruitment 

Sycl (CPF) 

/ 

Yshl homology/regulation of processing reaction 

Ssu72 (CPF) 

hSsu72 

Protein phosphatase/dephosphorylation of RNAPII CtD 

Glc7 (CPF) 

/ 

Protein phosphatase/dephosphorylation of Ptal 

Rnal4 (CFIA) 

CstF77 (CstF) 

Hat repeats/hridges CFIA to CPF 

RnalS (CFIA) 

CstF64 (CstF) 

RRM domain/RNA binding G/U- or A-rich sequences? 

Pcf 11 (CFIA) 

hPcfll (CFII„) 

CID domain/couples 3'-end processing to RNAPII 

Clpl (CFIA) 

hClpl (CFII„,) 

Walker-A motif/Nucleotide binding, kinase? 

Hqtl (CFIB) 

/ 

RRM domains, preference for UAUAUA (EE) sequences 

/ 

CFI„25 (CFI„) 

Nudix Motif/RNA binding to upstream element 

/ 

CFl„68(CFim)'’ 

RRMVrecruitment of Fipl and Pap to polyA sites 

Papl 

Pap 

Polymerase domain/adenosyl nucleotidyl transfer 

Pabl 

PABII 

Four RRMs/binds polyA tail, enhances polyadenylation 


^Associates CPF activity with small nucleolar RNAs (snoRNAs) rather than pre-mRNA processing 
'’CFlm59, CFI,„68, or CFIm72 isoforms can heterodimerize with CFIn,25 to form CFIn, 

'^RRM in CFI„,68 is required for interaction with CFIn,25, not RNA binding 


Ref2, Ptil, Swd2, Glc7, Ssu72, and Sycl. CPF is 
associated with the cleavage site and the surrounding 
U-rich sequences through RNA-protein interactions 
mediated by Cftl, Cft2, and Ythl. The Yshl subunit 
is the yeast orthologue of mammalian CPSF73, 
a p-CASP protein and likely the zinc-dependent ribo- 
nuclease that constitutes the catalytic center of the 
cleavage reaction. Other CPF subunits include Fipl 
that recruits Papl to the polyadenylation site and Pfs2 
that together with Yshl bind to Rnal4 and Clpl to 
tether CPF to CFIA during the processing reaction. 
Along with these large multiprotein complexes, Papl 
and Pabl auxiliary factors are also required for effi¬ 
cient addition of the poly(A) tail. 

In mammals, four processing complexes together 
with Symplekin, Papl, and Pab auxiliary factors are 
required to complete the 3'-end processing reaction. 
They are Cleavage and Polyadenylation Specihcity 
Factor (CPSF), Cleavage Stimulation Factor (CstF), 
Cleavage Factor /-(wiammalian) (CFlm), and Cleavage 
Factor //-(mammalian) (CFIIm). 


CPSF is associated with the polyA signal and is 
required for both the cleavage and polyadenylation 
reaction. The complex contains five subunits 
CPSF160, CPSFIOO, CPSF73, CPSF30, and hFipl 
that all have orthologues in yeast CPF. The CPSF160, 
CPSFIOO, and CPSF30 subunits mediate RNA binding 
with sequences around the polyA signal; CPSF73 is the 
3'-end ribonuclease and hFipl recruits Papl to the 
complex. 

The CstF complex is associated with the down¬ 
stream C/U-rich sequence element and is required for 
efficient site selection and cleavage. The complex con¬ 
tains three subunits CstF77 and CstF64, the mamma¬ 
lian orthologues of Rnal4 and Rnal5, together with 
CstFSO that has no known yeast counterpart. CstF64 
mediates RNA binding with the downstream element, 
while CstF77 is a dimeric scaffolding protein that 
directs complex assembly and association with 
CstFSO. 

CFIm is associated with the upstream sequence ele¬ 
ments and is required for both correct site-selection 
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and cleavage reactions. The complex comprises a 
small dimeric subunit, CFIm25, and one of three large 
subunits CFIn,59, CFIm68, or CFIn,72. No yeast 
orthologues of any of the CFI^ subunits have been 
identified. CFIm25 mediates RNA binding through a 
Nudix domain and although the large subunits all con¬ 
tain RRM domains, these are required for assembly 
with CFln,25 rather than RNA recognition. The CFII^ 
complex is also required for site selection and cleavage 
and comprises hPcfll and hClpl, the mammalian 
orthologues of the yeast Clpl and Pcfll. However, 
hPcf 11 is much larger than Pcfl 1 and although it con¬ 
tains the CID and Clpl binding domains found in Pcfl 1 
it might also have other functions in mammalian tran¬ 
scriptional regulation. hClpl performs the same bridg¬ 
ing function as Clpl and links CFIIn, to both CFIn, and 
CPSF during the processing reaction. It also has 5 ’-RNA 
kinase activity, absent in Clpl, and is likely important 
for tRNA splicing as well as 3'-end processing. 

The application of biochemical and genetic 
approaches has identihed a large number of the com¬ 
ponents of the yeast and mammalian 3'-end processing 
complexes and during the past decade, several crystal 
structures of individual 3'-end factors have been 
solved. However, although these studies have 
advanced our understanding of the makeup of 
processing complexes, the molecular details of the 
mechanism of 3'-end processing remain largely 
unknown. The future challenge will be to unravel the 
workings of these supramolecular machines by the 
application of high- and low-resolution methods to 
determine stmctures of entire complexes combined 
with detailed mechanistic studies of in vitro 
reconstituted 3'-end processing reactions. 

Cross-References 

► Alternative Splicing 

► Core Splicing Machinery: Structural and 
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► mRNA Degradation in Prokaryotes 

► mRNA Localization 

► RNA Polymerases and Transcription 
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► RNA-Binding Proteins - Catalytic Domains 

► Splicing 

► X-Ray Diffraction and Crystallography of 
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Transient Grating Spectroscopy: 
Dynamics of Photoreceptors 
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Introduction 

For understanding the molecular mechanisms of 
photoreceptors, not only the atomic resolution static 
structures of the receptors are required, but also infor¬ 
mation describing the dynamics of the photoinduced 
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reaction is essential, because chemical reactions are 
dynamic events. Reaction dynamics include changes 
in; (a) conformation, (b) intermolecular interactions 
(hydrogen bonding, hydrophobic interactions), 
(c) inter-protein interactions (e.g., oligomer formation, 
dissociation reaction), (d) energy, and (e) conforma¬ 
tional fluctuations. Since time-resolved detection is 
essential for revealing such dynamic properties, 
a variety of time-resolved detection techniques have 
been developed. One powerful technique involves 
absorption and emission detection in the UV/visible 
region. This technique is widely used, in particular, 
for studying dynamics of photoreceptors, because 
this method can be applied over a wide time range 
(i.e., femtoseconds to days) with high sensitivity. 
However, absorption changes only reflect changes 
near a chromophore molecule and there are many 
important spectrally silent dynamics that cannot be 
detected by this method. 

For example, firstly, conformational changes are 
always associated with the function of biomolecules. 
If these changes occur distal from the chromophore 
they are quite difficult to detect, i.e., spectrally silent. 
Secondly, to fully characterize intermolecular interac¬ 
tions, such as protein-protein, domain-domain, and pro¬ 
tein-water interactions, measurements of these 
spectrally silent intermolecular interactions should be 
carried out in the time-domain. Thirdly, energetic and 
conformational fluctuations have been recently recog¬ 
nized to be very important factors in facilitating various 
efficient reactions of biomolecules. Fourthly, spatial 
molecular transport properties such as translational dif¬ 
fusion are important for signal transduction or molecu¬ 
lar communications; however, it is quite difficult to 
trace diffusion dynamics in the time-domain (without 
tracer probes). The transient grating (TG) technique 
complements optical absorption detection methods 
and the signal contains information on spectrally silent 
dynamics that are difficult to measure by other methods. 

Origins of the TG Signal 

For the measurement of the TG signal, a sample is 
photoexcited by an optical interference pattern 
between two pulsed laser beams (Eichler et al. 1986; 
Terazima 1996, 1998). The photoexcitation by the 
interference pattern creates a sinusoidal modulation 
in the refractive index (n) and the absorption {k). The 


created refractive index and/or absorption gratings are 
monitored by the diffraction of a continuous wave 
probe beam. For a thick grating, which is defined 
by a grating with a longer optical length than the 
grating fringe length, the probe beam is applied at an 
appropriate angle to satisfy the phase-matching 
condition (Bragg condition) (Fig. 1). In the case of 
weak diffraction, the grating intensity (Itg) is approx¬ 
imately proportional to the square of the amplitude 
of the modulation in the refractive index (5n) and 
absorption (5^): 

Itg = a(§«)^ + 

where a and (3 are constants that represent the sensi¬ 
tivity of the system (Eichler et al. 1986). Because the 
absorption term provides essentially the same infor¬ 
mation as the transient absorption method at the probe 
wavelength, only the refractive index term 5n is 
frequently monitored. 

There are a variety of origins of the light-induced 
refractive-index change (Terazima et al. 2004). Under 
conditions of a relatively weak light field and 
a timescale slower than nanoseconds, there are three 
important contributions. Eirst, after the nonradiative 
transition of the excited states or the enthalpy change 
by the reaction, the temperature of the sample is raised 
by the released energy, and the density of the matrix is 
altered by the heating effect, which creates the thermal 
grating. Second, the absorption spectrum is changed by 
any chemical reaction, so that the refractive index 
change is induced by a molecular polarizability differ¬ 
ence between the reactant and product molecules. 
Third, if molecular volume is altered by the reaction, 
the density of the solution changes, which is called the 
volume grating. The sum of the population and volume 
grating components is called the species grating 
(Terazima et al. 2004). 

Temporal Profile of the TG Signal 

The temporal profile of the TG signal is essentially 
determined by two factors: reaction kinetics and the 
diffusion process (Terazima 1998, 2006, 2009). The 
reaction kinetics includes the heat releasing process, 
the volume change process, and the absorption spec¬ 
trum change process. The time-profile of the TG signal 
should also depend on the molecular diffusion rate. 
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Transient Grating 
Spectroscopy: Dynamics of 
Photoreceptors, 

Fig. 1 (a) Schematic 
illustration of the principle of 
the transient grating 
measurement, (b) Schematic 
illustration of the spatially 
modulated concentration of 
the reactant (open circles) and 
the product (closed circles). 
The dotted line indicates the 
spatially modulated light 
intensity 


a 

Interference 

pattern 

Probe light 


Pulsed 

excitation light 

b 

Concentration 




Spatial coordinate 



because the spatial modulation of concentrations of 
molecules produces the grating signal and this concen¬ 
tration modulation disappears due to translational dif¬ 
fusion processes. 

Information from the TG Signal 

The TG signal is rich in information because it arises 
from various origins. From the amplitude of the signal, 
some thermodynamic properties can be extracted. In 
particular, the enthalpy and volume changes that 
accompany the reaction provide important informa¬ 
tion. Moreover, external pressure or temperature 
dependences of a molecular volume provide us infor¬ 
mation on the compressibility or the thermal expansion 
coefficient, respectively. Such changes can be moni¬ 
tored in time-domain. 

The reaction kinetics and the diffusion coefficient 
of the proteins can be obtained from the rate constant 
of the temporal prohle of the signal. The diffusion 
coefficient, D, is a physical property that represents 


the rate of molecular diffusion. The Stokes-Einstein 
relationship is frequently used as a basis of the molec¬ 
ular interpretation of D\ 

D = kgT/arjr 

where kg, T, rj, a, and r are the Boltzmann constant, 
temperature, viscosity, a constant representing the 
boundary condition between the diffusing molecule 
and the solvent, and the radius of the molecule, respec¬ 
tively. On the basis of this equation, observed changes 
in D during the reaction have been frequently 
interpreted in terms of changes in the radius of the 
solute, i.e., volume change. This interpretation may 
be correct, if an oligomerization (or dissociation) reac¬ 
tion takes place. However, in the absence of such 
a reaction, in general, the volume effect to D may be 
neglected, because the volume change that arises as 
a result of a conformation change is usually very small. 
Another intuitive interpretation of a change in D may 
be a change in the shape of the macromolecule. D of 
a molecule with a nonspherical shape has been 
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theoretically derived (Cussler 1997). However, the 
shape dependence on D is rather small and the effect 
of a molecular shape change on D is usually negligible. 
In contrast, large changes in D during chemical reac¬ 
tions have been frequently observed (Terazima 2009, 
2011). One of the factors giving rise to the observed 
changes in D may be due to changes in intermolecular 
interactions between the molecule and the solvent, 
which affects the friction of the molecular movement 
in solution. There is experimental evidence indicating 
that a cause of reduction in D is the unfolding of the 
a-helices or P-sheets of proteins (Terazima 2006, 
2011) (Fig. 2). Changes in intermolecular interactions 
can be considered as a change in the boundary condi¬ 
tion of the Stokes-Einstein relation. The hydrodynamic 
flow change, due to the conformation change, may be 
another simultaneously important factor. 

Advantages of the TG Method 

The TG method has many advantages over other spec¬ 
troscopic methods. One advantage is the high sensitiv¬ 
ity. A reason for the high sensitivity arises because of 
the background free detection for the homodyne detec¬ 
tion of the signal. Further increases in sensitivity are 
possible using heterodyne detection (Eichler et al. 
1986). A second reason for the high sensitivity is that 
the signal is only created by photoexcited (or 
photoreacted) molecules. This contrasts other tech¬ 
niques that measure an average property. For example, 
the diffusion coefficient can be measured by methods 
such as light scattering or NMR spectroscopy. These 
traditional methods detect the diffusion of all mole¬ 
cules present in the solution. Hence, if the reaction 
quantum yield is low, the signal is a mixture of the 
reacted and nonreacted species, which can drastically 
reduce the sensitivity of detecting a change in the 
diffusion coefficient. On the other hand, the TG signal 
represents diffusion processes of the reactant and the 
product without being affected by nonreacted species. 

An additional advantage of the TG method is the 
high time resolution. If one uses an ultrashort pulsed 
laser for the excitation and probe, enthalpy and/or 
volume changes can be detected in the picoseconds 
time range. Consequently, compared with traditional 
techniques used for measuring thermodynamic prop¬ 
erties and the diffusion coefficient, the TG technique 
provides an “ultrafast” time response approach. 



Transient Grating Spectroscopy: Dynamics of Photorecep¬ 
tors, Fig. 2 A polypeptide adopting an a-helix {left) has less 
intermolecular interactions with water molecules, and therefore 
the diffusion rate is faster than the unfolded polypeptide (right). 
Time-resolved measurement of the diffusion coefficient can be 
used to determine the dynamics of the conformational changes 
that occur to the protein 

Summary 

The transient grating is a powerful time-resolved spec¬ 
troscopy for studying protein reactions in solution. 
In particular, it can probe the conformational changes, 
intermolecular interactions (e.g., protein-protein, 
domain-domain, and protein-water interactions), ener¬ 
getic and conformational fluctuations, and spatial 
molecular transport properties, all of which are diffi¬ 
cult to measure by other methods. There are many 
advantages over other spectroscopic methods, such as 
high sensitivity and high time resolution. 

Cross-References 

► Dynamic Light Scattering 

► Flash Photolysis 

► Protein Dynamics: Time-Resolved Spectroscopic 
Studies 

► Thermodynamics of Biomolecular Interactions 

► Transient-State Kinetic Methods 
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Introduction 

Transient-state kinetics refers to the time course of 
a reaction before it reaches a ► steady-state or equilib¬ 
rium position. The concentrations of species change in 
time according to the ► Law of Mass Action and hence 
analysis of the progress of a reaction can lead to the 
determination of the rate constants of the elementary 
steps that make up the pathway. Such a pathway, in 
conjunction with structural information, allows the 
mechanism of the reaction to be deduced (Fersht 
1985; Goody and Hofmann-Goody 2002; Gutfreund 
1975, 1995). A multistep reaction pathway is normally 
investigated by dissecting it into individual resolvable 
steps. However, coupling between the steps may 


complicate the overall reaction profile. In practice, 
the resolvable phases are hrst analyzed using analytical 
solutions to simple reaction schemes (Capellos and 
Bielski 1980). The reaction schemes are then assem¬ 
bled to give the whole pathway, which may then be 
checked by numerical simulation to allow comparison 
with the original experimental data (Johnson 2009). It 
follows that knowledge of the time courses of simple 
reaction schemes is a key requirement and therefore 
will be considered here. The simplest first-order equa¬ 
tion will be dealt with in detail to illustrate the princi¬ 
ples of analysis, while the solution to other schemes 
will be summarized more briefly. 

Irreversible First-Order Reaction 

For the reaction scheme; 



the corresponding rate equations based on the law of 
mass action are; 

Rate for disappearance of A = d[A]/dt = — ki[A] 
Rate for formation of B = d[B]/dt = ki[A] 
where kj is the rate constant (or rate coefficient). If the 
reaction starts with 100% A and no B, and the initial 
[A] is defined at [A]o, then it follows; 

Conservation equation [A]o = [A] -i- [B] 

Integrating the rate equations above leads to 
[A] = [A]„e-'‘>* and [B] = [A]„(l - 

A plot of [A] with time gives the characteristic expo¬ 
nential decay curve (Fig. 1). The half-time (ti/ 2 ) of the 
reaction (median time) represents the time it takes for 
50% of the population to react (= ln2/ki = 0.693/ki) 
and is independent of the starting concentration. Thus 
successive half-times lead to a [A] of 50%, 25%, 12.5%, 
etc., of the starting concentration at equal intervals. 
There is no precise end point to such a decay, but after 
seven times the ti /2 the reaction would be >99% 
complete. At the ► single molecule level, ti /2 represents 
the time for which there is a 50% chance the molecule 
would have reacted. 

The mean (average) lifetime (x) is longer than the 
half time (ti/ 2 ) because it is weighted by those mole¬ 
cules which, by chance, last more than 2 ti/ 2 . This 
parameter (= l/kQ is also known as the time constant 
or relaxation time of the reaction (Bemasconi 1976), 
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Transient-State Kinetic 
Methods, Fig. 1 Time 
course of a first-order 
irreversible reaction A —> B 
with a rate constant of 1 s“’ 
(i.e., time constant x = 1 s) 
with an initial [A] =100% and 
[B] = 0%. Note the half-time 
(Un) = 0.693 s 



and represents the time for [A] to decay to 1/e 
(i.e., 36.8%) its initial value. 

The dimensions of a first-order rate constant ki are 
s~*, which frequently leads to the confusion between 
a rate and a rate constant. The rate of the reaction is the 
change in concentration of a component with time, and 
is represented by the slope of the exponential decay 
curve at any given instant. At time ti /2 the rate is 50% 
the initial rate because the concentration of A is 0.5 
[A]o, and at infinite time the rate approaches zero. 
Rates can be positive or negative (concentrations of 
states may increase or decrease) but rate constants are 
always positive (as time does not go backward). 

An exponential decay equation can be linearized by 
plotting in the form ln[A] = ln[A]o - kit. This manip¬ 
ulation may be useful for detecting deviations from 
a single exponential (but the plot requires an accurate 
estimate of A [A] if the value of A is not zero at inhnite 
time) but nonlinear least-squares fitting to the expo¬ 
nential function is preferable owing to distortion of 
errors introduced by linearization. 

Reversible First-Order Reaction 

For a reaction of the form 

ki 

A ^ B 
k_ 


and starting with 100% A, the observed approach to the 
equilibrium follows a single exponential according to 
the equation; 

A[A] = [A] - [A], 

= [A]„(l-(k_,/(k,-fk_,))e-'‘°'’- 

where kous = ki -h k_i and [A]e = the equilibrium con¬ 
centration at the end point of the reaction (determined by 
Kgq = ki/k_i). Note that the effect of adding the reverse 
reaction is to make the observed rate constant larger. This 
may seem counter intuitive if the rate constant is mistak¬ 
enly thought of as a rate. In fact the initial rate of the 
reaction depends only on ki (as there is no B present at the 
start) and the initial slope is the same for the irreversible 
and reversible case (Fig. 2). However, the reversible 
reaction has less far to go because [A] reaches an equi¬ 
librium value [A]e, rather than 0, and thus the half-time is 
shorter. The relationship between a reduced observed 
amplitude and increased observed rate constant is 
a recurring theme in transient kinetic equations, and is 
pivotal to the correct interpretation of experimental data. 

Irreversible Sequential Reaction 

For a reaction of the type 


1 


A 
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Transient-State Kinetic Methods, Fig. 2 Time course of 
a first-order reversible reaction A « B {solid lines) with 
a forward rate constant of 1 s^' and reverse rate constant of 
2 s^' with an initial [A] = 100% and [B] = 0%. Note the 
observed rate constant = 3 s^*, giving a time constant 


T = 0.33 s, which is shorter than for the irreversible case 
(dashed line). The initial rates of the reversible and irreversible 
reactions (i.e., the slopes close to time zero) are identical, while 
the equilibrium concentration for the reversible reaction 
[A]e = 66.7% = k_,/(ki-rk_i) 


the decay of [A] will follow a single exponential with 
kobs = ki as A is uncoupled from subsequent reactions 
(if the first step was reversible, then it would be coupled to 
the second step and the reactions kinetics would not 
follow a single exponential unless k2 >> k_i). B will 
show biphasic kinetics as it is first formed and then decays 
to C, while C will have a lag before being formed (Fig. 3 ). 

It is intuitively obvious that if ki >> k2, then B will 
form essentially as a single exponential with rate 
constant ki and on a longer time scale decay exponen¬ 
tially with a rate constant k2. At the end of the first 
phase practically all molecules will be in state B and 
thus [B] = [AJo- Subsequently, C will form exponen¬ 
tially with rate constant, k2, with no discernible lag. 
For cases where ki and k2 are comparable (i.e., within 
an order of magnitude of each other) the following 
analytical solutions are appropriate: 

[B]=([A]„k,/(k,-k2))(e-^^‘-e-^‘') 

[C] = [A]„(l - l/(k, - k 2 )(k.e->^^' - k 2 e-^“)) 

A interesting scenario arises when k2 >> ki, in 
which case very little [B] forms ([B]n,ax ~ [A]o.ki/k2) 
and the reaction would appear to be a single step 
A ^ C with rate constant ki. However, if [B] could 
be detected by some unique spectroscopic signature, 


then the observed rate constant for its formation would 
be ki -I- k 2 « k 2 and it would decay with rate constant 
k2 (ki/(ki -I- k 2 )) « ki, i.e., the rate constant for decay 
controls the observed rate constant for the formation of 
B, while the rate constant for its formation controls the 
observed rate constant for its decay. This is a classic 
example where rate constants couple to give 
a counterintuitive result, and applies when the concen¬ 
tration of B remains small relative to [A]o throughout. 
This trend is even apparent in Fig. 3 , where ki is only 
slightly smaller than k2. 

Second-Order Binding Reaction 

For an irreversible binding reaction of the form 
A -I- B —> AB, the disappearance of A and B, and the 
formation of AB will show ► second-order kinetics, 
i.e.,d[AB]/dt = ki[A][B]. 

For the specific case when [A]o = [B]o it follows 
that 1 /[A] = l/[A]o -I- kp, allowing ki to be determined 
from the slope of a plot of 1 /[A] against t. 

For the more general case, when [A]o ^ [B]o, 
a plot of ln([A]/[B]) against time is linear from the 
equation: 

(l/([AL-[BU)ln([B]„.[A]/[A]„.[B])=kt 
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Transient-State Kinetic Methods, Fig. 3 Kinetic profile for 
a sequential reaction A ^ B ^ C with kj = 1 s“* and k 2 = 2 s~*. 
Note that the half-time for the formation of B is about 0.2 s, but 
[B] only reaches about [A]o/3 = [A]o.k|/(ki + k 2 ). Although this 
curve is not an exact exponential, it is close to one with 


kobs = 3.5 s“’ (fii k|-t k 2 , where k 2 makes the larger contribu¬ 
tion). After 1 s, the decay phase of B has a half-time of about 1 s, 
and approximates to an exponential with kobs = 0-69 s“* fn k|.k 2 / 
(k| + k 2 ). This rate constant also applies to approximate expo¬ 
nential formation of C after the initial lag 


However, it is often more convenient to work under 
conditions where one component (e.g., B) is in large 
excess over [A] so that very little B is used up in 
forming AB. From this it follows that [B] remains 
practically constant at [B]o. The rate equation then 
simplifies to d[AB]/dt = ki[B]o.[A] = ki'[A], where 
k/ represents a pseudo first-order rate constant. Thus, 
the disappearance of A and appearance of AB shows 
first-order exponential kinetics to yield a kob.s = ki [B]o. 
A plot of kobs against [B]o will therefore reveal the true 
second-order rate constant (with dimensions M“*.s“^). 
A pertinent question is how large an excess of B would 
be required to ensure pseudo first-order conditions. 
A tenfold excess would result in a 10% drop in [B] 
during the course of the reaction (or less if the reaction 
is reversible) and only a small deviation in the derived 
rate constant. It is a useful exercise to simulate second- 
order reactions at different ratios using numerical 
integration, and then subject them to analysis by fitting 
to an exponential function to judge the extent of the 
deviation. 

The first-order approximation has the advantage 
that the concentration of A (typically the protein) 
does not need to be known accurately, as the derived 
constant depends on the concentration of B (typically 
a small ligand). A further advantage of using pseudo 


first-order conditions is that it also can reveal the extent 
of the reversibility of binding (Fig. 4) because the 
intercept of a kobs against [B]o provides an estimate 
or limit for k_i (the reverse first-order dissociation 
constant). However, care is required in this extrapola¬ 
tion, as at lower values of [B]o it may not be in suffi¬ 
cient excess over [A]o to give true first-order kinetics. 
Also for points whose kobs are within a factor of 2 of 
the intercept value, the amplitude of the reaction 
should be less than half of the maximum amplitude 
and thus the signal-to-noise of the raw data may be 
poorer in this region (this follows from the fact that at 
[B]o = Kd, the kobs = ki[B]o + k_i = 2 k_i and A will 
be half saturated (i.e., [A]e = [ABjg)). 

The value of k_ i is best checked using a displacement 
reaction in which a competing ligand, L, is added to 
AB in large excess to displace the bound B. 

Reaction Pathways 

With the analytical solutions for the kinetics of these 
basic reactions established, it is now possible to dis¬ 
cuss the behavior of reaction pathways in which two or 
more steps are coupled together and for which, in many 
cases, there is no exact analytical solution available. 
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Transient-State Kinetic 
Methods, Fig. 4 The 

dependence of the observed 
pseudo first-order rate 
constant kobs on [B]o for the 
reaction A -t B AB with 
ki = 1 pM“'.s~' (slope) and 
k_i = 2 s^' (intercept) when 
[B]o >> [A]o. Note that at 
50% saturation ki[B]o = k_| 
and kobs = 2 k^j = 4 s“'. At 
this value [B]o = Kj = k_|/ 
ki = 2 pM (dashed line) 



Two-Step binding reactions are typical of many 
protein-ligand interactions such as that associated 
with the phenomenon of ► induced-fit (Goodrich and 
Kugel 2006; Trentham et al. 1976). A ligand initially 
binds weakly to an open state of the protein active site, 
which then closes to give a more tightly hound form 
(represented by *). 

ki k2 

L -r P s , " LP —^ LP* 

k-i k_2 

Such reactions can often he monitored using intrinsic 
protein ► fluorescence (tryptophan or tyrosine), as 
the occlusion of the ligand may he accompanied by 
the burying of aromatic amino acid residues. The 
dependence of kobs for the formation of LP* on [L] 
depends on the relative value of k_i compared with 
k 2 (it is assumed k 2 is similar or greater than k _2 in 
order that the second step proceeds to a significant 
extent). If k_i > k2, then the kobs for the first step 
under pseudo first-order conditions (= ki[L]o + k_i) 
always exceeds k 2 , and therefore the first step remains in 
equilihrium throughout the reaction. Thus the concentra¬ 
tion of LP is dehned hy the equation: [LP] = [L][P]/Ki, 
where Kj = k_i/ki. Consequently, LP* is formed with 
a single exponential profile with kobs for the formation 
of LP* = k2app + k_2 (i.e., the equation for a reversible 
first-order reaction - see above) where k 2 app = k 2 ([L]o/ 
([L]o -I- Ki)). The latter term gives a hyperbolic depen¬ 
dence on kobs versus [L]o and at saturating [L]o yields 
a limiting value of kobs = k 2 H- k_ 2 . 


If k_i < k 2 , then the reaction approximates more to 
an irreversible sequential reaction (see above). Here at 
low [L]o the reaction is limited by the first step with 
kobs = ki[L]o, while at high [L]o it reaches the same k 2 
-I- k _2 limit as before. At intermediate [L], when ki[L]o 
« k 2 H- k_ 2 , there will be a lag phase in the formation of 
LP*. If this profile is fitted to a single exponential there 
will be a small systematic deviation (which may not be 
obvious if the signal is noisy). However, plotting this 
apparent kobs for 4 single exponential ht against [L]o 
generates a saturation curve which is more sharply bent 
than a hyperbola. In practice, therefore, provided that 
the kobs d4ta approach saturation, a good model- 
independent estimate of k 2 + k _2 can be derived, but 
the interpretation of the [L]o that gives 50% of the 
maximum change in kobs requires more care, since it 
does not necessarily provide a measure of Ki. For 
cases where k_i « k 2 , accurate analysis can only be 
achieved by fitting the set of data curves to simulations 
derived for the model by numerical methods. A further 
complication occurs if part of the signal change occurs 
on the first step which, in general, will yield biphasic 
kinetics. 

Another (but not exclusive) mechanism of two-step 
ligand binding is ► conformational selection (Halford 
1971). Here the ligand binds only to one (rare) confor¬ 
mation of the protein, but “pulls” the reaction toward 
the * state. 

kj 

P . P* + L —^ LP* 

k_4 
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It is assumed that k _3 > k 3 in order that the majority 
of the protein is in the nonbinding form at the start. It is 
clear that as [L]o increases, so does the contribution of 
k 4 [L]o to the rate equations, such that at saturating 
concentrations, the conversion of P to LP* is limited 
by k 3 (there will also be an initial fast phase of LP* 
formation dependent on the [P*] present at the start). 
At lower [L]o the situation is more complex, because 
the profile of the reaction depends on both the rate and 
extent of L binding to P*. If the second step is freely 
reversible (k _4 > k _3 -i- k 3 ), L will bind rapidly but 
may not be sufficient to pull the equilibrium over fully 
to the LP* state. Consequently, there will only be 
a partial conversion of P to LP*. Under these condi¬ 
tions the amplitude will be less than maximum but the 
observed rate constant will be k 3 -i- k _3 (in effect L is 
a perturbant that shifts the equilibrium kinetics of the 
first step to a small degree and the kinetics will follow 
that of a reversible first-order reaction). Increasing [L]o 
such that k 4 [L]o > k 4 /K 3 will pull the reaction over 
fully and kobs becomes limited by k 3 . Thus the 
observed rate constant decreases with increasing [L]o 
because the contribution of k _3 becomes less signifi¬ 
cant. Unfortunately in some literature, it has been 
stated that a diagnostic for this scenario is that the 
rate of the reaction decreases with increasing concen¬ 
tration, which is a misuse of terminology and contra¬ 
dicts the Law of Mass Action. 

The induced-fit model and conformational selection 
mechanisms may operate in parallel, as summarized in 
the following scheme: 


dissociation rate constants is by displacement with 
another ligand, C. For the mechanism in which a pre¬ 
formed complex AB is mixed with C: 

k_j kj 

AB A-rB-rC—^AC 

k-i k_2 

the observed rate constant for the formation of AC will 
be limited by k_i for AB dissociation, as [C] 
approaches infinity. However, at lower concentrations 
of C, displacement may be incomplete and the 
observed rate constant will have contributions from 
all species such that: 


ki[B]„ k2[C]„ 

1 + - 1 + - 

k2[C]„ ki[B]„ 

when [B]o and [C]o > [A]o. 

Again this mechanism can give rise to scenarios in 
which the observed rate constant will decrease with 
increasing concentrations of C. The latter examples 
should serve to emphasize that amplitude information 
should also be considered in the analysis of mechanism 
as it contains information that is complementary to rate 
constants. 

As outlined in the ► stopped-flow section, transient 
kinetic analysis is a powerful way of investigating 
catalytic mechanisms of enzymes through the study 
of single-turnover and pre-steady-state phases 
(Barman et al. 2006; Eccleston et al. 2001; Johnson 
2003). For the mechanism; 


P 



LP 



1 Kj 

LP* 


Independent information is required to analyze such 
a scheme fully, such as the rate constants for the 
interconversion of P P* in the absence of L or 
LP LP* at saturating L, as might be determined by 
► relaxation methods (Bemasconi 1976). Note that the 
overall scheme must balance thermodynamically, 
i.e., K 1 K 2 = K 3 K 4 . 

In some cases the values of the rate constant for the 
dissociation step may be deduced from the intercept of 
kobs versus [L]o plot, although often this is ill-defined 
when [L]o —> 0. A better way of determining 


ki k2 

E -b S s— " ES -" E -b P 

ki 

under single-turnover condition ([E]o > [S]o), the 
kinetic profile resembles that of a sequential reaction, 
in which kiapp = ki[E]o provided ki[E]o > k_i -b k 2 . 
The latter condition is met when [E]o > K^, and 
ensures the rapid and near-complete formation of ES 
before it decays to give P. If this condition is not met, 
then the observed rate constants for the formation and 
decay of ES cannot be assigned directly to ki and k 2 , 
respectively (in effect the reaction then resembles 
a sequential reaction in which a slow step is followed 
by a fast one, which inverts the assignment - see 
above). 

Under multi-turnover conditions ([S]o >> [E]o)the 
transient kinetics of the pre-steady-state phase reveals 
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information about the binding step and, depending 
on the signal being monitored, subsequent isomeriza¬ 
tion and chemical reactions associated with catalysis 
(see ► Kinetics; Overview and ► Quenched-Flow 
Methods). 

Summary 

Transient kinetic analysis is a powerful way of deduc¬ 
ing the mechanisms of chemical interactions which 
complements and extends the traditional steady-state 
approach. 

Cross-References 

► Continuous Flow 

► Flash Photolysis 

► Fluorescence 

► Law of Mass Action 

► Molecular Recognition: Lock-and-Key, Induced Fit, 
and Conformational Selection 

► Order of Reaction 

► Quenched-Flow Methods 

► Single-Molecule Methods 

► Steady-State Enzyme Kinetics 

► Stopped-Flow Techniques 
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► Protein Synthesis: Translational Fidelity 


Translation Initiation: A Eukaryotic 
Perspective 
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Introduction to Eukaryotic Translation 
Initiation 

Protein synthesis, or translation, is the process by 
which genetic information in messenger RNA 
(mRNA) is translated into the amino acid sequence of 
proteins. The initiation phase, the rate-limiting step 
and a major site for regulation of translation, is an 
exceedingly intricate process defined by a complex 
series of timely interactions involving ribosome, 
mRNA, initiator tRNA, and a number of proteins 
termed initiation factors (IFs). Although the general 
mechanism of translation initiation is universal across 
kingdoms, the greater degree of complexity in eukary¬ 
otes reflects the need for a more sophisticated regula¬ 
tion of protein synthesis in response to multiple 
positive and negative stimuli, including nutrients, hor¬ 
mones, and tissue/cell-specific signaling events. In 
eukaryotes, where the majority of mRNAs possesses 
a 7-methylguanosine (m^G) cap structure at the 5'-end, 
the initiation of translation can be described by the 
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“scanning model” (Fig. 1) and begins with the forma¬ 
tion of a ternary complex (TC) consisting of initiator 
Met-tRNAi and the GTP-hound form of the eukaryotic 
initiation factor (elF) 2 (Jackson et al. 2010; Sonenherg 
and Flinnehusch 2009). The TC loads onto the 40S 
small ribosomal subunit, along with elFl, elFlA, 
eIF3 and eIF5 to generate a 43S pre-initiation complex 
(PIC). The 43S PIC is recruited to the capped 5'-end of 
the mRNA in a mechanism promoted hy eIF4F, 
a multifactorial complex comprising the cap-hinding 
protein eIF4E, the scaffolding factor eIF4C (that hinds 
to eIF4E, eIE4A, and eIF3) and eIE4A (a DEAD-box 
RNA helicase). The mRNA probably exists in 
a circularized state through an interaction between 
eIE4C and the polyA-binding protein (PABP) which 
coats the 3'-polyA tail of the mRNA (Jackson et al. 
2010; Sonenberg and Hinnebusch 2009). Subse¬ 
quently, the PIC scans the mRNA in a 5' to 3' direction, 
facilitated by DHX29, an RNA helicase DExH-box 
protein, in search of an AUG start codon (in a favorable 
nucleotide context) to pair with initiator Met-tRNAi 
located in the ribosomal P (peptidyl)-site. Irreversible 
hydrolysis of the eIP2-bound GTP causes the scanning 
PIC to arrest at the initiation codon, triggering the 
release of eIP2-GDP and other elEs from the 40S. 
In a further GTP hydrolysis reaction, the factor 
eIP5B recruits the large ribosomal subunit (60S) to 
form an SOS initiation complex, with the ribosomal 
A (aminoacyl)-site ready to accept an aminoacyl- 
tRNA and enter the elongation phase of translation. 

Apart from the scanning pathway, alternative initi¬ 
ation mechanisms exist in eukaryotes, including poly¬ 
some “reinitiation” (facilitated by the circularized 
mRNA shown in Pig. 1) and internal ribosome entry 
(recently reviewed in Hellen 2009; Jackson et al. 2010; 
Merrick 2010; Sonenberg and Hinnebusch 2009). 
Compared to eukaryotes, the initiation process in pro¬ 
karyotes is similar but less elaborate, with fewer initi¬ 
ation factors involved as mRNA can bind directly to 
the 30S subunit. The better-studied prokaryotic system 
has therefore been used, albeit with caution, to eluci¬ 
date aspects of the initiation mechanism in eukarya. 

Biophysics and Structural Biology of the 
Scanning Model of Translation Initiation 

The translation initiation flow scheme depicted in 
Pig. 1 summarizes the contemporary view of the 


scanning mechanism in eukaryotes. Allied to genetic 
analysis using the well-established yeast system 
(Altmann and Linder 2010), biophysical studies and 
structural biology have been invaluable in unraveling 
the mechanics and dynamics of this highly complex 
apparatus, as described below. 

To begin with, the impressive high-resolution 
crystallographic studies of the bacterial/archaeal 
ribosomes and their subunits, bound to different 
ligands and/or at different stages of translation, have 
illuminated fundamental mechanisms conserved in all 
forms of life (Berk and Cate 2007; Schmeing and 
Ramakrishnan 2009; Steitz 2008). Purthermore, these 
structures have assisted with the construction of 
detailed molecular models of the evolutionarily 
conserved core of the eukaryotic ribosome using 
cryo-EM and homology modeling (Dinman 2009; 
Passmore et al. 2007; Spahn et al. 2004). These 
studies, together with the very recent crystal structure 
of the yeast 808 ribosome (Ben-Shem et al. 2010), 
have started to unveil features that are unique to 
the eukaryotic system. In parallel, a clearer molecular 
view of eukaryotic initiation, in particular 438 
complex formation and start codon recognition, 
recently reviewed by (Lorsch and Dever 2010), 
has been enabled through structural investigations of 
isolated initiation factors and their complexes 
(solved by cryo-EM, X-ray crystallography and/or 
NMR). 

In addition to structural approaches, the sheer num¬ 
ber of coordinated protein-protein and protein-RNA 
interactions makes the biophysics of eukaryotic 
translation initiation a rich area for study, and quanti¬ 
tative thermodynamic and kinetic analyzes of these 
complexes have contributed immensely to a deeper 
understanding of the underlying mechanisms. 

Overall, the application of biophysical and struc¬ 
tural techniques has proved indispensable for: 

• Investigation of the molecular mechanisms that 
accompany the steps of translation, i.e., mRNA 
binding, start codon recognition, release of factors, 
and subunit joining 

• Understanding of the molecular organization of 
initiation complexes 

• Recognition of the role of individual elFs and 
discovery of their modes of action (including 
synergistic effects with other elPs) 

• Identification of interacting surfaces between 
ribosome, RNA, and elPs 
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Translation Initiation: A Eukaryotic Perspective, 
Fig. 1 Schematic of the “scanning model” of eukaryotic cap- 
dependent translation initiation. elFl, elFlA, eIF3, and eIF5 
bind to the 40S ribosomal subunit along with the ternary com¬ 
plex (TC: eIF2-GTP-Met-tRNAi) to form the 43S pre-initiation 
complex (PIC). The PIC associates with mRNA at the 5'-cap 
through interactions with the cap-binding complex eIF4F, 
a multi-protein assembly (consisting of elFG, eIF4E, and 
eIF4A) directing cap binding, removal of RNA secondary struc¬ 
ture in the 5'-untranslated region and ribosome association. 
mRNA circularization, which promotes ribosome recycling 
following translation termination, is achieved through an inter¬ 
action between eIF4G and polyA-binding protein (PABP) (see 
text). The PIC scans along the mRNA in an ATP-dependent 
manner (assisted by helicase DHX29) searching for an AUG 
codon in the right context. Once found, elFl is ejected from the 


PIC, triggering hydrolysis of eIF2-bound GTP and release of Pi, 
together with several of the elFs (see text). Hydrolysis of elFSB- 
bound GTP then directs the removal of elFlA and recruits the 
60S ribosomal subunit to begin the elongation phase of transla¬ 
tion. The initiation of translation is subject to a high degree of 
regulation, with the major effector being protein phosphoryla¬ 
tion. Under conditions of starvation or cell stress, levels of 
available TC are lowered by phosphorylation of the a-subunit 
of eIF2, making it unable to bind the guanine nucleotide 
exchange factor eIF2B and therefore unable to convert to the 
required GTP-bound form. The cap-binding complex is also 
a major site of regulation, as eIF4E can be held in an inactive 
form by the 4E-binding protein (4E-BP); phosphorylation of 
4E-BP leads to eIF4E dissociation, freeing it to bind to the 
5'-cap. The translation regulation pathways shown here are not 
discussed in this chapter 







Translation Initiation: A Eukaryotic Perspective 


2647 


T 


• Uncovering of ribosome conformational changes at 
different stages of the initiation process (including 
changes in the 40S subunit itself and movements of 
elFs within ribosomal pre-initiation complexes) 

• Specihcation of the timing and order of molecular 
events and description of the kinetic properties of 
steps in the process 

• Design of increasingly sophisticated biochemical 
and genetic experiments. 

Despite the great advances brought about by bio¬ 
physical studies, the complexity of the translation 
apparatus presents particular challenges. Translation 
initiation is a dynamic multistep process, in that each 
component establishes a network of contacts with 
other constituents, and these interactions change as 
initiation progresses. Although structural data are 
available, the description of each stage of the cycle 
lacks atomic resolution, even for prokaryotes. It is 
particularly challenging to obtain structural informa¬ 
tion on transition state configurations, which would be 
extremely informative in underpinning the molecular 
basis of initiation. The structures that have been 
obtained represent only a snapshot of the most thermo¬ 
dynamically stable conformation of the ribosome 
in that particular step of the process, whereas it 
appears likely that the ribosome has conformational 
degrees of freedom and an intrinsic capability of 
reversibly sampling a number of native states (Munro 
et al. 2009). 

Furthermore, initiation entails a multitude of weak 
interactions between ribosome and binding partners 
that occur only transiently and in context; as such, 
they may be difficult to detect and a combination of 
structural, kinetic and functional data is therefore 
essential to secure a reliable and accurate picture 
(Marintchev and Wagner 2004). For example, interac¬ 
tions between elFs have been abundantly studied off 
the ribosome but some probably only occur - or are 
substantially tighter - on the 40S surface. Conversely, 
some elFs may interact nonspecifically with one 
another as purified components, but only engage with 
their specific partners in a ribosomal context. An 
exemplary scenario occurs with the MFC (multifactor 
complex), containing eIF2-GTP-Met-tRNAi, eIF3, 
eIF5 and elFl, which has been found using pull¬ 
down assays with purified elFs from yeast. Although 
it has been hypothesized that these elFs bind to 40S as 


MFC complex (and not individually), there is as of yet 
no kinetic data to confirm this (Merrick 2010). 

Start Codon Recognition in Eukaryotes 

As an example of the impact of biophysical investiga¬ 
tions on translation initiation research, some of 
the studies concerning the mechanism of start site 
selection in eukarya will be briefly discussed. 

Start codon recognition is a critical step in the 
initiation process, as mistranslation of the RNA 
message would be an obvious detriment to the cell. 
Although many elFs have been implicated in this 
event, the details remain, as of yet, unclear. The con¬ 
sequence of start site identification is the irreversible 
hydrolysis of eIF2-bound GTP, but how is the signal 
from codon-anticodon pairing propagated to trigger 
this reaction? Biophysical and structural studies have 
unveiled conformational rearrangements of the pre¬ 
initiation complexes at several stages of the initiation 
process; in particular, they have corroborated the 
hypothesis that during ribosomal scanning the 43S 
complex adopts an “open” scanning-conducive con¬ 
formation that changes to a “closed” scanning- 
incompetent form upon start codon recognition 
(Pestova and Kolupaeva 2002). The first evidence for 
this came from a quantitative study of the interaction 
between elFl and elFlA in the context of the 40S in 
a yeast-based system, following the fluorescence 
resonance energy transfer (FRET) from fluorescein- 
labeled elFl to TAMRA-labeled elFlA (Maag et al. 
2005). FRET with a steady-state efficiency of 40% was 
observed when elElA and elEl were stoichiometri- 
cally bound to 40S. In the absence of 40S no ERET 
was observed, indicating that the two factors do not 
interact directly, but are placed sufficiently close in the 
pre-initiation complex for their fluorophores to see one 
another. The FRET efficiency increased slightly to 
^44% when the eIE2-TC was added, slightly 
decreased to 35% in the presence of mRNA(AUG), 
but strikingly decreased to 14% when both the TC and 
mRNA(AUG) were present. Intriguingly, this effect 
was dependent on the presence of an intact AUG 
codon, as even single point mutations here did not 
generate such a dramatic decrease in FRET efficiency 
(Maag et al. 2005). A combination of steady-state 
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FRET anisotropy and kinetic stopped-flow fluorimetry 
experiments carried out by Lorsch and coworkers 
endorsed the idea of a conformational change of the 
43S complex upon AUG codon recognition, which 
would result in an increase in distance between elFl 
and elFlA and promote elFl dissociation from the 
rihosomal complex (Maag et al. 2005). The existence 
of conformational rearrangements at different stages of 
the process was further substantiated by cryo-EM 
structures of the 40S (from S. cerevisiae) in complex 
with elFl and elFlA (Passmore et al. 2007). elFl and 
elFlA bind to the 40S in a cooperative fashion and 
with high affinity in vitro (1.7 nM and 6.1 nM, respec¬ 
tively), as assessed using fluorescently labeled elFs in 
a steady-state anisotropy binding assay (Maag and 
Lorsch 2003). Cryo-EM reconstructions at resolutions 
of 21 and 22 A for empty 40S and 40S-elFl-eIFlA 
complex, respectively, showed remarkable differences 
in the conformation of the 40S itself (Fig. 2). First, in 
complex with elFl-elFlA, the head of the 40S subunit 
appears to establish a new intramolecular interaction 
with the shoulder, mediated by 18S rRNA helix hl6 
and the ribosomal protein rpS3. Second, the entry 
channel of the mRNA cleft is occluded in the empty 
40S by a latch that opens in the 40S-eIFl-eIFlA 
complex (Fig. 2). The latch, formed hy non-covalent 
interactions between 18S rRNA helices hl8 and h34 
of the body and head of the ribosome respectively, 
is proposed to open at the 43 S PIC stage to allow 
mRNA to dock into the mRNA channel, and to 
close following AUG codon recognition to prevent 
mRNA dissociation. Cryo-EM structures consistently 
show that the latch only opens when both elFl 
and elFlA are bound, but it appears closed if only 
one of the two factors is present. Furthermore, 
kinetic and thermodynamic analyses have shown 
that the TC loading onto the 40S is significantly 
accelerated by the conformational changes brought 
about by elFl and elFlA (Passmore et al. 2007), 
adding support to the view that the ribosomal confor¬ 
mation induced by elFl and elFlA is competent for 
scanning. 

Taken together, these studies suggest that: (1) coop¬ 
erative binding of elFl and elFlA to the 40S converts 
it into an open scanning-conducive state that facilitates 
recruitment of the TC; (2) base pairing between the 
AUG start codon and the initiator tRNA anticodon 
results in dissociation of elFl from the ribosome 
(relieving repression of hydrolysis of eIF2-hound 
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Fig. 2 The effect of elFl and elFlA binding to the conforma¬ 
tion of the 40S ribosomal subunit, (a) 21 A resolution cryo-EM 
reconstruction of apo 40S ribosomal subunit from S. cerevisiae 
and (b) 22 A resolution cryo-EM reconstruction of S. cerevisiae 
40S ribosomal subunit in complex with elFl and elFlA. 
Structural features of the 40S subunit, beyond the distinction 
between the head and body, are labeled on the apo model (panel 
A) as follows: beak {hk), neck (n), shoulder {sh), left foot {If}, 
right foot {if), platform {pi). The mRNA entry and exit points are 
also displayed. The latch blocking the mRNA channel is clearly 
visible in the apo model as a connection between helices h34 and 
hl8. Upon elEl/elFlA binding, two major changes occur: The 
latch is seen to open, allowing access for mRNA to bind, and 
a new connection is formed between the head and the shoulder 
(shown by an asterisk in panel B). The structural model of 
40S/elFl/elFlA complex is labeled to show the positions of 
the aminoacyl (A)-, peptidyl (P)-, and exit (E)-sites in 
the mRNA-binding channel. In addition, the rRNA helices 
and ribosomal proteins that form the mRNA entry channel 
(hl8-h34), exit channel (ribosomal proteins rpS5-rpS14), and 
head-shoulder connection (rpS3-hl6) are highlighted 
(Reprinted from Hellen (2009). Copyright, with permission 
from Elsevier. Adapted from Passmore et al. (2007). Copyright 
Elsevier) 


GTP, see below), prompting the latch to close - in 
other words, elFl release from the ribosomal complex 
upon start codon recognition may be the trigger for the 
conversion from open to closed scanning-arrested 
state, and this somehow controls the irreversible 
hydrolysis of the eIF2-bound GTP that commits the 
ribosome to translation. This reaction is catalyzed 
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by the GTPase activating protein eIF5; however, 
quantitative analysis of the kinetic and thermodynamic 
parameters of eIF5-promoted GTP hydrolysis showed 
only a small dependence on the AUG codon (two- to 
fourfold), which is unlikely to account for the fidelity 
of AUG recognition (Algire et al. 2005). Nonetheless, 
employing a fluorescently labeled form of the E. coli 
phosphate binding protein PBP-MDCC, it has been 
possible to monitor, in real time, the rate of release of 
inorganic phosphate (Pi) from the pre-initiation com¬ 
plex upon addition of eIF5. Notably, in these experi¬ 
ments, the rate was found to be highly dependent 
(>100 fold) on the presence of an AUG codon (Algire 
et al. 2005). Since the rate constant of Pi release upon 
addition of eIF5 and mRNA(AUG) to the 43S complex 
was determined to be the same to that of elFl dissoci¬ 
ation (0.54 ± 0.05 s“* and 0.6 ± 0.03 s“* respectively) 
(Algire et al. 2005; Maag et al. 2005), these results 
would argue that, upon AUG codon recognition, elFl 
dissociation is the rate-limiting step controlling Pi 
release from ribosome-bound elF2-GDP/Pi, possibly 
by allowing elF5 to change conformation or position 
within the ribosomal complex (Algire and Lorsch 
2006; Conte et al. 2006). 

Although the resolution of the cryo-EM structures 
prevents the visualization of the binding sites of elFl 
and elFlA onto the 40S, data are consistent with plac¬ 
ing them in the vicinity of the E (exit)- and A-site 
respectively: These are remote from the sites of 
observed conformational changes, suggesting alloste¬ 
ric structural transitions caused by the synergic asso¬ 
ciation of these factors. elElA contains a central 
oligonucleotide/oligosaccharide-binding (OB) domain 
that interacts with the A-site, flanked by two long 
unstructured N-and C-terminal tails that extend toward 
the P-site, as estimated by direct hydroxyl radical 
probing (Yu et al. 2009). It has been hypothesized 
that codon-anticodon base pair formation induces 
a movement of the C-terminal tail of elElA out of 
the P-site and that this may be sensed by elEl, elP2, 
and/or eIP5 triggering the downstream events of elEl 
and Pi release. However this is yet to be demonstrated, 
and an alternative mechanism may imply the presence 
of ribosomal sensors similar to the decoding process 
during elongation, where conserved nucleotides in 
the prokaryotic A-site experience a change in 
conformation to interact with the newly formed 
mRNA-tRNA codon-anticodon base pair (Schmeing 
and Ramakrishnan 2009). 


Summary 

The current view of translation initiation is the result of 
almost five decades of work from a large number of 
laboratories worldwide. The high-resolution structures 
of the 30S and 50S ribosomal subunits (that led to the 
2009 Nobel Prize for Chemistry to Venkatraman 
Ramakrishnan, Thomas A. Steitz, and Ada E. Yonath) 
represent a huge advance in our understanding of 
translation as a whole, and facilitated the determina¬ 
tion and/or interpretation of subsequent ribosomal 
structures. Excitingly, the atomic resolution structure 
of the SOS ribosome from S. cerevisiae has very 
recently been solved, undoubtedly a formidable 
achievement and a new milestone for research in this 
field, which will pave the way to further structural 
studies of eukaryotic ribosomal complexes depicting 
distinct events of the translation process. The wealth of 
information such structural studies will provide, allied 
with the continued efforts of biophysicists, biochem¬ 
ists, cell biologists, and geneticists could mean that 
a complete and detailed molecular description of 
eukaryotic translation initiation is a real possibility in 
the near future. 

Einally, it has proven challenging to summarize the 
landslide of information on translation initiation and 
the contribution of biophysical data to the field, and 
regrettably important work has been omitted due to 
space limitation. With a focus on eukaryotic initiation, 
no attempt has been made to cover in this short review 
the extensive and seminal studies of bacterial transla¬ 
tion initiation. 
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Definition 

Translocation denotes the simultaneous movement of 
two tRNAs together with the mRNA by one codon on 
the ribosome 

Basic Characteristics of Translocation 

Protein synthesis on ribosomes, that is, the translation of 
the nucleotide sequence of mRNA into the amino acid 
sequence of proteins, is a cyclic process. In each round 
of elongation, two tRNA molecules together with the 
mRNA move through the ribosome in a multistep pro¬ 
cess called translocation. Thus, peptidyl-tRNA moves 
from the A site of the ribosome, where decoding takes 
place, to the P site, where the peptidyl-tRNA is placed 
before transferring the nascent peptide onto the 
aminoacyl-tRNA that enters the A site in the next 
round of elongation. In parallel, deacylated tRNA 
moves from the P site to the E (exit) site before disso¬ 
ciating from the ribosome. Translocation is arguably the 
most complex step of ribosomal protein synthesis and 
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involves large-scale conformational changes of the 
ribosome. The physiological reaction is promoted by 
an elongation factor (EF-G in bacteria or eEF2 in 
eukaryotes), which hydrolyzes GTP in the process. 
Mechanistic and structural details of translocation 
have been discussed in recent review articles (Shoji 
et al. 2009; Dunkle and Cate 2010; Frank 2011; Noller 
et al. 2011; Wintermeyer et al. 201 1). 

Spontaneous Translocation 

Translocation can proceed spontaneously, albeit rather 
slowly, in both forward and backward directions, 
implying that its basic features are inherent to the 
ribosome-tRNA complex (Shoji et al. 2009; 
Wintermeyer et al. 2011). The preferential direction 
of spontaneous movement differs for different tRNAs 
and depends on their affinities to the respective ribo- 
somal sites. Cryo-electron microscopy (cryo-EM) 
studies revealed that in the transition from one canon¬ 
ical ribosome-binding site to the other, that is, from the 
A to the P site and from the P to the E site, the tRNAs 
assume intermediate hybrid states, and the ribosome in 
parallel undergoes dynamic structural changes, fluctu¬ 
ating between the PRE and PRE* states (Fischer et al. 
2010; Frank 2011) (Fig. 1). In the PRE* state, the 
anticodon stem-loops of the tRNAs reside in the A and 
P sites of the 30S subunit, while the respective acceptor 
ends are oriented toward the P and E sites of the large 
SOS subunit. At the same time, the subunits undergo 
a rotational (ratchet-like) movement relative to each 
other and the formation of hybrid states correlates with 
the rotated orientation of subunits. Single-molecule 
FRET studies indicated that the ribosome spontane¬ 
ously fluctuates between classic/nonrotated and 
hybrid/rotated states (MacDougall and Gonzalez 
201 1). Furthermore, as indicated by molecular dynam¬ 
ics simulations, parts of the ribosome move upon tran¬ 
sitions between the two states, implying a high degree 
of flexibility and spontaneous motions and suggesting 
that the ribosome moves spontaneously through a whole 
energy landscape (Whitford et al. 201 1). 

EF-G-catalyzed Translocation 

The enzymes promoting translocation in bacteria or 
eukaryotes, EF-G and eEF2, are large GTPases 


(78 kDa) that consist of five domains. Domain 1 (or 
G domain) binds GTP/GDP, domain 2 is homologous 
to similar domains of other translational GTPases, and 
domains 3-5 are specific to EF-G. EF-G exerts its 
function on the ribosome by assuming different con¬ 
formations which are induced by GTP hydrolysis and 
release of inorganic phosphate (Pi). Conformational 
changes of EF-G are coupled to conformational 
changes of the ribosome that promote translocation. 
In the POST state, where peptidyl-tRNA resides in the 
P site, the A site is occupied by EF-G, which is present 
in a conformation that strongly differs from that of 
unbound EF-G, domain 4 reaching into the decoding 
site (Gao et al. 2009) (Fig. 2). 

In the presence of EF-G and GTP, translocation is 
driven to the post-translocation state, independent of 
the tRNAs involved and their intrinsic affinities to the 
different binding sites. Thus, in cases where the PRE 
state is favored thermodynamically, (part of) the free 
energy of GTP hydrolysis is used to drive the reaction 
to the POST state. Furthermore, pre-steady-state 
kinetic studies revealed that GTP hydrolysis precedes, 
and substantially accelerates, tRNA movement, 
suggesting that EF-G additionally couples the free 
energy of GTP hydrolysis into conformational changes 
of the pre-translocation complex that favor the reaction 
kinetically. 

Detailed analyses applying rapid kinetics (stopped- 
flow, quench-flow, single-molecule FRET) have 
provided a quantitative description of the elemental 
steps of translocation, as promoted by EF-G and GTP 
hydrolysis (Fig. 3). EF-G binding to the pre¬ 
translocation ribosome stabilizes the ratchet/hybrid 
conformation; the tRNA anticodons do not move 
at this stage. Upon interaction of EF-G with proteins 
L7/12 and the sarcin-ricin loop of 23S RNA, and possi¬ 
bly other, yet unidentified elements of the ribosome, the 
GTP-binding site closes and GTP is hydrolyzed rapidly. 
The hydrolysis products, GDP and P;, are retained in the 
closed pocket and released only during subsequent 
steps. GTP hydrolysis accelerates a rearrangement of 
the ribosome (frequently referred to as “unlocking”) 
that precedes and limits the rate of tRNA-mRNA move¬ 
ment on the 30S subunit. Unlocking is likely to involve 
a change in the mobility of elements of the 30S subunit, 
possibly by facilitating the swiveling movement of the 
30S head. Unlocking is driven by a conformational 
change of EF-G, presumably involving the switch 
I and/or II regions and changes of inter-domain contacts. 
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Translocation on the Ribosome, Fig. 1 Movement of tRNA 
through the ribosome. The two tRNA molecules bound to the 
A and P sites of the ribosome in the pre-translocation state (PRE) 
enter hybrid A/P and P/E states (PRE*) before they move to the 
P and E sites, respectively. Hybrid state formation is coupled to 
a rotation of the ribosomal subunits relative to each other. 


Following translocation, spontaneous dissociation of deacylated 
tRNA from the E site results in a posttranslocation complex with 
only peptidyl-tRNA in the P site (POST). In the absence of EF- 
G, the reaction can proceed in forward or backward directions, 
albeit slowly, following the thermodynamic gradient 


as the acceleration is abrogated when the conforma¬ 
tional flexibility of EF-G is restricted by an 
inter-domain cross-link between domains 1 and 5. 
Some of the reaction intermediates identified kinetically 
can be assigned to complexes that have been character¬ 
ized by structural and single-molecule fluorescence 
analysis. 

The release of P; from EF-G induces another 
conformational change in EF-G that is required for 
the ribosome to return to the locked state and for EF- 
G to dissociate from the ribosome. This change is 
inhibited by the antibiotic fusidic acid that binds to 
EF-G and freezes it in its post-translocation conforma¬ 
tion on the ribosome (Fig. 2). In the latter conforma¬ 
tion, EF-G has a high affinity for the ribosome. 
Similarly, the use of non-hydrolyzable GTP analogs 
or the addition of vanadate, an analog of Pi, appear 
to stabilize the high-affinity conformation of EF-G 
on the ribosome, thereby strongly inhibiting the turn¬ 
over reaction. 

EF-G as Brownian Ratchet 

The movement of tRNA-mRNA that takes place in the 
unlocked state of the ribosome is rapid and spontaneous. 
As in the reaction without EF-G, during EF-G-catalyzed 
translocation the tRNAs are likely to move through 
a quasi-continuous landscape of intermediate states, 
and some of these intermediate states have been char¬ 
acterized. EF-G promotes translocation in three distinct 
ways. First, EF-G binding to the ribosome contributes to 
the stabilization of the hybrid/rotated PRE* state and 
may, thereby, bias and accelerate the partial movement 
of the tRNAs from the classical/nonrotated toward 


the hybrid/rotated state. Second, EF-G promotes the 
unlocking step and probably also the following confor¬ 
mational changes at the decoding region required for 
rapid tRNA movement. When GTP hydrolysis is 
prevented by using non-hydrolyzable GTP analogs, 
the rate of unlocking is strongly decreased, suggesting 
that at least part of the free energy of GTP hydrolysis is 
coupled to conformational changes of the ribosome. At 
this step, EF-G is present in the GBP-Pj-bound form, 
and the release of P; is not required to drive unlocking; 
on the contrary, the retention of P; is probably necessary 
to prevent the premature collapse of EF-G into an 
unproductive conformation. Third, EF-G is likely to 
bias intrinsically random tRNA movements and 
motions of the ribosome to produce forward movement. 
This involves a conformational change of EF-G, 
brought about by GTP hydrolysis and (partially) stored 
in the system due to delayed P; release. Thus, in addition 
to promoting ribosome rearrangements that facilitate 
tRNA movement, presumably in both forward and 
backward direction, EF-G functions as a Brownian 
ratchet that biases tRNA movement in the forward 
direction by preventing the backward movement of 
peptidyl-tRNA while the unlocked state prevails. Crys¬ 
tal and cryo-EM structures show that in the POST state 
domain 4 of EF-G - which is crucial for translocation - 
occupies the 30S A site (Fig. 2), thus effectively 
preventing back movement. Thus, the ribosome can be 
regarded as a processive Brownian machine with EF-G 
serving the role of the pawl in a ratcheting mechanism. 

Another function of GTP hydrolysis may be to 
provide the energy required for unwinding mRNA 
structure during translocation. It is generally assumed 
that the ability to unwind secondary structure of 
mRNA is inherent in the ribosome, and it has been 
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Translocation on the Ribosome, Fig. 2 Structure of the 
POST state, (a) Crystal structure of the ribosome-EF-G complex 
in the POST state. The SOS and 30S subunits of the ribosome from 
Thermus thermophilus are indicated in different shades of gray. 
The deacylated tRNA in the E site (dissociation is prevented by 
the ionic conditions) is depicted in green, the peptidyl-tRNA in the 
P site in purple. The five domains of EF-G are color-coded in 
purple (domain 1), blue (2), green (3) yellow (4), and red (5). PDB 
entries 2WRI, 2WRJ, 2WRK, 2WRL (Gao et al. 2009). (b) Con¬ 
formations of EF-G in the POST and unbound states. The domains 
of ribosome-bound EF-G are color-coded as in panel A; unbound 
EF-G (PDB entry 2BV3) is depicted in gray. The two structures 
are aligned on domain 1. The tips of domain 4 in the two structures 
are about 30 A apart from one another 


suggested that ribosomal proteins S3, S4, S5 that reside 
at the mRNA entrance site are active in unwinding the 
mRNA. An involvement of cellular helicases in 
mRNA unwinding is considered unlikely because the 


ribosome is able to move through mRNA hairpins in 
the presence of only two factors required for 
translation, EF-Tu and EF-G, and GTP, and an addi¬ 
tional energy source, such as ATP, is not needed. The 
relation of GTP hydrolysis and mRNA unwinding still 
needs to be established. 


Antibiotics Acting on Translocation 

Translocation is inhibited by antibiotics that bind in, or 
are close to, the 30S decoding region. Spectinomycin 
binds to helix 34 of 16S rRNA and, by interfering with 
the swiveling movement of the 30S head, appears to 
lock the 30S subunit in a conformation that is refrac¬ 
tory to translocation (Wintermeyer et al. 2011). 
Paromomycin, which binds to helix 44 at the decoding 
site, seems to inhibit translocation solely by stabilizing 
tRNA binding in the A site, whereas hygromycin B, 
which binds nearby, has a mixed effect, inhibiting 
translocation by both stabilizing the binding of A-site 
tRNA and interfering with a conformational change of 
the 30S subunit required for movement. By contrast, 
streptomycin, which binds to the junction of helices 
18, 27, and 44 of 16S rRNA, stabilizes a structure of 
the 30S subunit that is prone to translocation, an 
effect which almost completely compensates the 
strong increase in affinity of the tRNA in the A site 
caused by the antibiotic. Taken together, these results 
reveal the important role of motions within the 
small ribosomal subunit that follow hybrid-state for¬ 
mation and ribosome unlocking for the movement of 
the tRNAs. 


Conclusion/Perspective 

Translocation is a highly dynamic process that entails 
movements within the ribosome, in particular in the 
small ribosomal subunit, that have been characterized 
by cryo-EM, X-ray crystallography, and single¬ 
molecule fluorescence. Intra-ribosomal movements 
and translocation can take place slowly in a spontane¬ 
ous manner, indicating that the process as a whole is 
due to Brownian motion. As in the spontaneous reac¬ 
tion, during EF-G-catalyzed translocation, the tRNAs 
are likely to move through a quasi-continuous land¬ 
scape of intermediate states. The intermediate states 
of spontaneous and EF-G-promoted translocation 
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Translocation on the Ribosome 


Translocation on the 
Ribosome, Fig. 3 Kinetic 
model of translocation. The 
30S subunit is depicted in two 
colors to indicate the locked 
(gray) and unlocked {green) 
state. Conformational changes 
of EF-G upon binding to the 
PRE ribosome and upon 
transition from the GTP- 
bound {pink) to the GDP Pi- 
bound {red) and GDP-bound 
{yellow’) forms are not 
indicated 


EF-G GTP Flybrid state 

binding hydrolysis formation 



Unlocking 


tRNA movement 
Pi release 


Re-locking 
Dissociation of 
tRNA and EF-G 
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presumably resemble each other structurally; however, 
their relative occupancy will probably be changed by 
the presence of EF-G and/or GTP hydrolysis. Future 
work applying time-resolved cryo-EM and single¬ 
molecule fluorescence is needed to resolve these 
issues. 

GTP hydrolysis by EF-G is required for rapid 
translocation and for driving translocation to the 
POST state, at least in those cases where the PRE 
state is favored thermodynamically. The mechanism 
of energetic coupling is not entirely clear yet; 
most likely it involves rectifying intra-ribosomal 
motions, with EF-G functioning as the pawl in 
a Brownian machine driven by thermal energy. 
Establishing this model will require measurements of 
the force a single ribosome exerts on the mRNA during 
translocation. 

Yet another function of GTP hydrolysis may be to 
provide the energy required for unwinding structured 
regions of the mRNA during translocation. It is gener¬ 
ally assumed that the ability to unwind the secondary 
structure of mRNA is inherent in the translating ribo¬ 
some, because the ribosome is able to move through 
mRNA hairpins in the presence of only two factors 
required for translation, EF-Tu and EF-G, and GTP. 
The role of EF-G and GTP hydrolysis for mRNA 
unwinding by the translating ribosome still needs to 
be established. 


Cross-References 

► Molecular Dynamics Simulations of Lipids 

► Peptide Synthesis on the Ribosome - Computational 
Studies 

► Protein Synthesis: Translational Fidelity 

► Single Particle Tracking 

► Single-Molecule Fluorescence Resonance Energy 
Transfer 

► Stopped-Flow Techniques 
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Transmembrane Helix Orientation and 
Dynamics 
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Synonyms 

Helix tilt; Protein-lipid interaction 


Definition 

The membrane-spanning helices of proteins that 
reside in lipid-bilayer membranes typically consist 
of a largely hydrophobic core sequence of about 
20 amino acid residues, flanked by several charged 
(lysine or arginine) or aromatic (tryptophan or tyro¬ 
sine) residues on either side of the core region 
(Landolt-Marticorena et al. 1993). Such sequences 
usually can be identified by means of hydrophobicity 
analysis of protein primary structure (Kyte and 
Doolittle 1982). 



Transmembrane Helix Orientation and Dynamics, 
Fig. 1 Left. Model for the GWALP23 peptide helix, showing 
side chains for tryptophan 5 (lower) and tryptophan 19 (upper). 
Right. Sample ^H NMR spectrum, showing pairs of ^H reso¬ 
nances from the deuterated methyl groups of two of the core 
alanines. Typically, two alanines are labeled with differing deu¬ 
terium isotope abundance. If the two observed ^H quadrupolar 
splitting magnitudes (distances between symmetric peaks) are 
different, the helix is tilted with respect to the lipid-bilayer 
normal 


The membrane-spanning helices, whether present 
as individual isolated helices or clusters, usually are 
tilted with respect to the lipid-bilayer normal 
(De Angelis et al. 2005). For a single helix, the entropy 
of precession about the bilayer normal affords 
a significant free-energy contribution in favor of helix 
tilting (Lee and Im 2008). 

Solid-state NMR spectroscopy provides a powerful 
means for deducing the structure and behavior of trans¬ 
membrane helical domains of proteins in the native 
lipid environment (De Angelis et al. 2005). NMR 
spectra of specifically ^^N- or ^H-labeled peptides, 
incorporated into oriented lipid-bilayer membranes, 
offer access to the average orientations of membrane- 
spanning helical segments, namely, the magnitude (x) 
and direction (p) of tilt of the helix axis, as well as 
estimates of the whole-body dynamics (Vostrikov 
et al. 2011). Here we consider the properties, in bilayer 
membranes, of neutral peptides in which one or 
more interfacial aromatic residues flank a hydrophobic 
helical core sequence. For this purpose, a particu¬ 
larly attractive model system is the peptide 












T 


2656 


Transmembrane Helix Orientation and Dynamics 


Transmembrane Helix 
Orientation and Dynamics, 
Fig. 2 Separated local field 
“SAMPI4” (or “PISEMA”) 
spectra for GWALP23 {left) 
and WWALP23 (right), each 
labeled with in residues 
13-17. See text for the peptide 
sequences 



acetyl-GGALW^(LA)6LW'‘^LAGA-amide, which 
possesses a poly-(leucine-alanine) core sequence, 
along with only two interfacial tryptophan (W) resi¬ 
dues, and is designated “GWALP23.” GWALP23 
(Fig. 1) is a useful host model peptide for examining 
the impact of additional tryptophan (W) residues or of 
single heterologous guest residues. Observations to 
date indicate that the dynamics of the uncharged pep¬ 
tide are strongly influenced by the number of aromatic 
residues that are located in the membrane/water 
interfaces. 

Basic Characteristics 

Aromatic or charged amino acid residues, such 
as tryptophan, tyrosine, lysine, or arginine, typically 
are located near the aqueous-lipid interface in 
type I single-span membrane proteins, where they 
are believed to help position the transmembrane 
helix within the lipid-bilayer membrane (Landolt- 
Marticorena et al. 1993). Indeed, the tryptophans of 
gramicidin A are important for the folding and assem¬ 
bly of a dimeric cation-selective channel in lipid- 
bilayer membranes (Koeppe and Andersen 1996). 
Other tryptophan-rich helical peptides modulate 
the lipid phase behavior, from bilayer to non-bilayer 
(Killian et al. 1996), as a function of the relative 
lengths of the hydrophobic regions of the peptide and 
lipid molecules (designated as “hydrophobic 
matching”). 

With only one tryptophan residue near each termi¬ 
nal, the properties of GWALP23 show rather system¬ 
atic behavior in lipid-bilayer membranes. For example, 
the measured quadrupolar splittings (Fig. 1) and 
apparent tilt angle of GWALP23 respond quite nicely 


to changes in the lipid bilayer’s hydrophobic thickness 
(Vostrikov et al. 2010a). At the same time, the pre¬ 
ferred direction of tilt for the GWALP23 helix, defined 
by the peptide rotation about the helix axis, is rather 
well defined and remains essentially constant in lipid 
bilayers of different thickness. By contrast, the intro¬ 
duction of additional tryptophan residues leads to dra¬ 
matically more dynamic averaging of the NMR 
signals. For example, the signals from five consecutive 
labels in the peptide backbone (Fig. 2) are 
averaged to a single value when glycine-2 and 
glycine-22 are changed to tryptophan, to give acetyl- 
GWALW^(LA)6LW'‘^LAWA-amide (designated 
WWALP23). The results suggest markedly increased 
whole-body dynamic averaging when tryptophan-2 
and tryptophan-22 are introduced (Vostrikov et al. 
2011). The extra tryptophans also cause a narrowing 
of the observed range of NMR signals, although the 
averaging is incomplete when deuterium is the 
observed nucleus (Vostrikov et al. 2010a). 

A single tyrosine at position 5 in acetyl- 
GGALY^(LA)6LW'^LAGA-amide (designated 
“Y^GWALP23”) confers very similar properties as 
does tryptophan-5 (W^). Indeed the tyrosine-5 and 
tryptophan-5 peptides show essentially the same tilt 
magnitude and (minimal) extent of dynamic averaging 
in three different lipid-bilayer membranes, DOPC, 
DMPC, and DLPC (Gleason et al. 2012). 

Interestingly, just one more tyrosine - namely, 
a second tyrosine at position 4, along with tyrosine-5 
- confers a similar marked increase in whole-body 
dynamic averaging as when the extra tryptophans 
2 and 22 are introduced (Gleason et al. 2012). Indeed, 
the accumulated results to date seem to suggest, 
concerning the preferred number of interfacial aro¬ 
matic tryptophan or tyrosine residues near each end 
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Transmembrane Helix Orientation and Dynamics, 
Fig. 3 Models for the GWALP23 peptide helix, with substitu¬ 
tion of a single arginine residue at position 14 (left) or position 12 
(right) 


of a transmembrane helix, that “one is enough,” and 
“two are too many.” A possible implication is that 
“extra” aromatic residues may compete with each 
other with respect to lipid interactions at the membrane 
interface. 

The GWALP23 peptide offers an appropriate 
framework for testing the effects of guest residue 
substitutions upon the protein-lipid interactions. Par¬ 
ticularly dramatic consequences have been observed 
for arginine substitutions at position 12 or 14 in 
GWALP23 (Fig. 3). The presence of arginine-14 
causes the peptide to rotate about 75° and tilt to 
increase its tilt by about 10°. Arginine-14 furthermore 
induces bilayer thinning and peptide displacement to 
“lift” the charged guanidinium group away from the 
bilayer center and toward the bilayer surface 
(Vostrikov et al. 2010b). The more “trapped” argi¬ 
nine-12 (Fig. 3), on the other hand, causes still more 
dramatic consequences. Indeed arginine-12 induces 
multiple and slowly exchanging peptide orientations, 
including an increased tendency for the peptide to exit 
the interior of the lipid bilayer (Vostrikov et al. 2010b). 

The results to date with tyrosine and arginine 
suggest that further guest residue experiments with 
GWALP23 will be of interest for understanding the 
fundamental membrane biophysics of lipids and 
proteins. 


Cross-References 

► Membrane Protein Structure 

► Membrane Proteins in Phospholipid Bilayers: 
Structure Determination by Solid-State NMR 

► Membrane Proteins; Structure and Organization 

► Molecular Dynamics Simulations of Lipids 
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► Bioelectricity, Ionic Basis of Membrane Potentials 
and Propagation of Voltage Signals 
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Transmission Electron Microscope 


Transmission Electron Microscope 

► Electron Microscopy: Classical Sample Preparation 


Transmission of Light 

► Absorption Spectroscopy and Transition Polariza¬ 
tions: Pictorial Description 

► Absorbance Spectroscopy: Overview 

► Protein Concentration Determination Using Dyes 
and Beer-Lambert Law 


Transverse Relaxation Optimized 
Spectroscopy 

► TROSY 


Triosephosphate Isomerase - 
Computational Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 
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Synonyms 

TIM 


Definition 

An enzyme that catalyzes the interconversion of dihy¬ 
droxy acetone phosphate and D-glyceraldehyde-3- 
phosphate with high efficiency. 

Basic Characteristics 

The efficient conversion of the triosephosphate 
isomers dihydroxyacetone phosphate and o-glyceral- 
dehyde-3-phosphate is important in glycolysis and 


therefore essential for efficient energy production. 
Triosephosphate isomerase (TIM), the enzyme that 
catalyzes this reaction, is therefore found in almost 
all living organisms. It has been (and still is) widely 
studied both experimentally and computationally. Cal¬ 
culations on the reaction pathways in TIM were among 
the earliest applications of ► QM/MM methods (Bash 
et al. 1991) to enzyme reactions. TIM has also been 
investigated by ► empirical valence bond methods 
(Aqvist and Fothergill 1996). Since then, many more 
computational studies into the mechanism of 
triosephosphate isomerase have been published, e.g., 
Cui and Karplus (2001), Lennartz et al. (2002), 
and Guallar et al. (2004). These studies have allowed 
a detailed picture of the catalytic mechanism, includ¬ 
ing the intermediacy of an enediol rather than an 
enediolate species and highlighting the flexible nature 
of the catalytic glutamate in the active site, which 
allows its direct involvement in the formation of the 
enediol intermediate. In addition to elucidating the 
mechanism of this important enzyme, the TIM- 
catalyzed reaction has been used as a test system for 
developing methods for simulation of reactions in 
enzymes; see, e.g., Riccardi et al. (2006), Hu et al. 
(2007), and Xiang and Warshel (2008). 

Cross-References 

► Empirical Valence Bond Methods 

► QM/MM Methods 


References 

Aqvist J, Fothergill M. Computer simulation of the 
triosephosphate isomerase catalyzed reaction. J Biol Chem. 
1996;271(17):10010-6. 

Bash PA, Field MJ, et al. Computer simulation and analysis of 
the reaction pathway of triosephosphate isomerase. 
Biochemistry. 1991;30(24):5826-32. 

Cui Q, Karplus M. Triosephosphate isomerase: a theoretical 
comparison of alternative pathways. J Am Chem 
Soc. 2001;123(10):2284-90. 

Guallar V, Jacobson M, et al. Computational modeling of the 
catalytic reaction in triosephosphate isomerase. J Mol Biol. 
2004;337(l):227-39. 

Flu FI, Lu Z, et al. QM/MM minimum free-energy path: meth¬ 
odology and application to triosephosphate isomerase. 
J Chem Theory Comput. 2007;3(2):390^06. 

Lennartz C, Schafer A, et al. Enzymatic reactions of 
triosephosphate isomerase: a theoretical calibration study. 
J Phys Chem B. 2002;106(7):1758-67. 







Triple Resonance NMR 


2659 


T 


Riccardi D, Schaefer P, et al. Development of effective quantum 
mechanical/molecular mechanical (QM/MM) methods for 
complex biological processes. J Phys Chem B. 
2006;110(13):6458-69. 

Xiang Y, Warshel A. Quantifying free energy profiles of proton 
transfer reactions in solution and proteins by using a diabatic 
FDFT mapping. J Phys Chem B. 2008;112(3):1007-15. 


Triple Resonance NMR 
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Synonyms 

NMR 


Definition 

The application of multidimensional heteronuclear 
nuclear magnetic resonance (NMR) spectroscopy 
using multiple radio frequencies directed to ’H, *^C, 
and *^N nuclei incorporated into biological macromol¬ 
ecules with the aim to obtain the resonance assign¬ 
ments and provide the basis for the identification of 
structural and dynamical parameters accessible by 
heteronuclear NMR spectroscopy. 

Introduction 

The advent of facile methods for the production of 
recombinant proteins in expression hosts that allow for 
random stable-isotope labeling heralded a new era in the 
application of heteronuclear NMR spectroscopy to pro¬ 
teins and nucleic acids. In particular, the ability to obtain 
large amounts of protein from E. coli cell cultures in 
defined media (usually M9 minimal media) that contain 
*^C-glucose (sometimes *^C-acetate) as the sole carbon 
source and a *^N-ammonium salt as the sole nitrogen 
source can yield proteins with effectively 100% isotope 
incorporation. These molecules can be analyzed by the 
application of multidimensional NMR pulse sequences 
that exploit the relatively large one-bond (sometimes 
two-bond) spin-spin (or scalar) couplings either 


between a hydrogen nucleus and either a carbon or 
nitrogen nucleus, between two carbon nuclei, or 
between a carbon and a nitrogen nucleus, in the 
polypeptide chain (Cavanagh et al. 2007; Clore and 
Gronenborn 1998; Kwan et al. 2011; Leopold et al. 
1994; Rule and Hitchens 2006, Sattler et al. 1999). In 
principle, this allows for the recording of a suite of 
heteronuclear NMR spectra that, taken together, can 
directly correlate every atom (including the protons) 
except the oxygen and sulfur atoms in the polypeptide 
chain. This paradigm, whereby the spectroscopist can 
obtain essentially complete 'H, *^C, and *^N resonance 
assignments, as well as structural restraints of different 
types, is known as triple resonance NMR. The sophisti¬ 
cation of triple resonance NMR is now highly devel¬ 
oped and, compared to earlier approaches to resonance 
assignment that relied on the detection of “through 
space” nuclear Overhauser effects (NOEs) between pro¬ 
tons that happen to be close in space within the struc¬ 
ture, is so much more efficient and reliable that now one 
would not normally embark upon an NMR project with¬ 
out recourse to double-'^C, *^N-labeling (Bax and 
Grzesiek 1993; Goto and Kay 2000; Hiller et al. 2007; 
Ikura et al. 1990; Kay et al. 1990; Lian and Barsukov 
2011; Whitehead et al. 1997). For simplicity this entry 
focuses upon triple resonance NMR as applied to pro¬ 
teins, but the reader should note that a similar approach 
is commonly used for isotope-labeled nucleic acids, 
particularly RNA (Cromsigt et al. 2001; Zidek et al. 
2001 ). 

Triple Resonance NMR Exploits Scalar 
Couplings Between Heteronuclei 

The basis of all the triple resonance NMR pulse 
sequence designs that are applicable to proteins is 
depicted in Fig. 1 which summarizes the one-bond 
(*J) and larger two- and three-bond (^J and ^J) scalar 
couplings that exist in the backbone of a double-^^C 
and '^N-isotope-labeled polypeptide chain (Bax and 
Grzesiek 1993; Cavanagh et al. 2007; Rule and 
Hitchens 2006; Sattler et al. 1999). 

The magnitude of the majority of these couplings is 
sufficiently large that the spectroscopist can use the 
rich radio frequency pulse “physics” that modem 
multichannel NMR spectrometers provide to effect 
magnetization transfer from one nucleus to its coupled 
partner in a highly efficient manner. As a consequence. 
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Triple Resonance NMR, Fig. 1 A schematic representation of 
a dipeptide segment of a polypeptide chain comprising residues 
i and i— 1. Numbers given in italics represent typical values (in 
Hz) for one-bond ’j scalar couplings {green), two-bond cou¬ 
plings (orange), and three-bond couplings (green). In general, 
double and triple resonance NMR pulse sequences are designed 
to exploit these scalar couplings to effect magnetization transfer 
between nuclei 


many different NMR pulse sequences have been 
designed to obtain corresponding multidimensional 
NMR spectra (effectively NMR signal correlation 
maps) that allow easy identification of locally 
connected subsets of atoms. The majority of triple 
resonance pulse sequences directed at the resonance 
assignment task take advantage of the relatively high 
signal dispersion and cross peak resolution provided 
by the 2D *H,*^N-correlated spectrum, usually 
recorded with a heteronuclear single-quantum coher¬ 
ence (HSQC) pulse sequence or its transverse relaxa¬ 
tion optimized spectroscopy (TROSY) variant (see 
below). This “fingerprint” spectrum contains a single 
cross peak for each N-H bond in the protein. Since 
most amino acid residues (other than proline) possess 
a backbone amide NH (as well as some NHs in certain 
side chains), there is approximately one cross peak in 
the HSQC spectrum per residue in the polypeptide. 
The pulse sequence for this experiment can be 
described in the following manner: 

Ni(tl) ^ H'^'(t2) 

The first step H^‘ —> Ni describes the transfer of the 
magnetic polarization of the amide proton (in residue i) 
onto the directly bonded nitrogen atom; this step has 
advantages of increasing the sensitivity of the measure¬ 
ment compared to starting with the normal polarization 


of the *^N nuclei. Ni(tl) represents the evolution of the 
transverse magnetization during an incremented time 
period tl that encodes the *^N chemical shift in the 
indirect dimension (denoted FI) of the 2D spectrum. 
Ni(tl) ^ indicates that the magnetization is trans¬ 
ferred from the nitrogen atom back to its attached pro¬ 
ton, exploiting (as before) the *Jnh coupling. Finally, 
(t2) represents the recording of the resultant free 
induction decay of the amide proton magnetization as 
a function of the measurement time t2. 2D Fourier 
transformation of the (tl, t2) time domain data set yields 
the F1(N),F2(H^) frequency spectrum that is visualized 
as 2D contour plot with cross peaks at coordinates that 
connect the Ni,Hi chemical shifts. 

The Wide Scope of Triple Resonance NMR 
Pulse Sequences 

Several of the triple resonance NMR pulse sequences 
build upon that used for the HSQC spectrum to transfer 
magnetization “out-and-back” to carbon atoms in the 
backbone and side chain (see Fig. 2). 

A good example is the 3D HNCO pulse sequence 
that transfers magnetization as follows (C' represents 
the carbonyl or CO carbon atom): 

H^‘^ Ni(tl) ^ C'i_i(t2) ^ Ni ^ H'^‘(t3) 

A typical implementation of the pulse sequence is 
depicted in Fig. 3. 

Note that the spectrometer is required to be able to 
generate pulses that are applied separately to the dif¬ 
ferent types of nuclei, and that for ^^C it is often 
necessary to separately control the magnetization for 
different atom types (here the carbonyl C' atoms and 
the aliphatic Ca atoms). In general, the broad chemical 
dispersion for the different chemical types of carbon 
atoms present in polypeptides (aliphatic, aromatic, 
carbonyl) means that it is relatively straightforward to 
effect selective excitation of these different atom sub¬ 
sets with “shaped” radiofrequency pulses. Compared 
to the 2D *H,*^N-HSQC pulse sequence, the additional 
element is represented by the Ni ^ C'i_ i(t2) ^ Ni step 
that effects magnetization transfer from the Ni atom to 
the directly bonded carbonyl atom of the preceding 
amino acid residue C'i_i where the '^C' chemical 
shift is encoded during the incremented time 
period t2 before transfer back to Ni. The final data 
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Triple Resonance NMR, Fig. 2 A schematic representation of 
a magnetization transfer pathways {orange arrows) effected hy 
different triple resonance 3D NMR pulse sequences: (a) HNCO, 
(b) HNCA, (c) HN(CO)CA, (d) HNCACB, and (e) CBCA(CO) 
NH. Note that the first four pulse sequences are of the “out-and- 
back” type where the magnetization starts and ends on the amide 
proton. The CBCA(CO)NH pulse sequence is a “straight- 


through” experiment in which polarization on the aliphatic pro¬ 
tons of one residue is ultimately transferred onto the of the 
next residue in the chain. In the HN(CO)CA and CBCA(CO)NH 
experiments the chemical shift of the carbonyl carbon (C’) is not 
recorded, and in the latter experiment neither are the shifts of the 
and protons 


matrix (tl,t2,t3) is subjected to 3D Fourier transforma¬ 
tion to yield the spectrum F1(N), F2(C'), and F3(H^) 
that can he visualized either as a 3D contoured box or 
(more conventionally) as a series of stacked 2D planes 
each corresponding to a narrow range of (or '^C) 
chemical shifts. This experiment is denoted the HNCO 
spectrum, reflecting that the output correlates the 


N, and C' resonance frequencies. The concept is read¬ 
ily extended to other sets of nuclei. For example, one of 
the most important triple resonance experiments is the 
HNCA pulse sequence which involves the following 
magnetization transfers 

H'^'^ Ni(tl) ^ C“i(t2) ^ Ni ^ H'^'(t3) 
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Triple Resonance NMR, Fig. 3 Simplified representation of 
the pulse sequence design to obtain a 3D HNCO NMR spectrum. 
Note separate RF pulses (narrow black bars are 90° pulses, 
broad black bars 180° pulses) are applied separately at *H, 
''^N, and '^C' ('^CO) frequencies. For simplicity, the 

scheme shown omits certain practical details such as the use of 
pulse field gradients and shaped flip-back pulse for optimal 
solvent suppression. A, T and 8 represent fixed delay periods; 

and 

Ni(tl) ^ C=‘i_i(t2) ^ Ni ^ H'^'(t3) 

By arranging that appropriate pulses are 
directed to the aliphatic carbon resonances, rather 
than the carbonyl carbons as in the HNCO experiment, 
magnetization is transferred from Nj to the alpha- 
carbon atoms of both of residues i and i—1 for fre¬ 
quency labeling during t2. The bifurcation of the mag¬ 
netization trajectory occurs because of the similarity of 
the one-bond intra-residue *Jnc°^ coupling (7-11 Hz) 
and the two-bond inter-residue coupling (4- 

9 Hz) (see Fig. 1). As a result, following 3D Fourier 
transformation of the dataset, the spectrum yields two 
cross peaks for each amide NH in the polypeptide 
chain: one has (FI, F2, F3) coordinates (N;, 
and the other (N;, C“i_i ,H’^‘). 

Triple Resonance NMR for Backbone 
Resonance Assignment 

In principle the HNCA spectrum should contain suffi¬ 
cient information to perform a “cross peak walk” 


tl, 12 are incremented delays; and 13 represents the experimen¬ 
tally detected free induction decay. Pulses are produced with 
phase X, unless indicated by a symbol above. The blue arrows 
illustrate the sense of the “out-and-back” flow of transverse 
magnetization in the pulse sequence. The chemical shift is 
not encoded the pulses serve to decouple the ’^0“ nuclei 
and direct the relevant magnetization only to the carbonyl 
carbons 

between all residues in the polypeptide chain that pos¬ 
sess an observable backbone NH cross peak, and 
thereby the scope to obtain a complete backbone 
atom resonance identification (“assignment”). How¬ 
ever, in practice further data are required in order to 
obtain unambiguous assignments. Apart from the fact 
that nonterminal proline residues have no backbone 
NH group (that leads inevitably to interruptions in the 
cross peak walk), is it usually the case that the disper¬ 
sion of the chemical shifts is smaller than that 
required to make the cross peak walk connections with 
100% reliability. Chemical shift overlap, or “degener¬ 
acy,” means that it is sensible to obtain further 
supporting information for HNCA-derived sequential 
cross peak connections. The versatility of the spin 
physics underpinning triple resonance NMR provides 
a variety of means to resolve the ambiguities. The 
approaches include specific direction of magnetization 
transfer pathways down one “limb” of an otherwise 
bifurcated trajectory and co-option of more remote 
(e.g., side chain) atom types into the NMR experiment. 
Combined with databases of chemical shift informa¬ 
tion, particularly for *^C resonances, the strategies lead 
to identification of the spin system topology and often 
the specific residue type corresponding to each NH 
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group in the root *H,*^N-correlated (HSQC) spectrum 
that, with independent knowledge of the primary 
sequence of amino acids, can lead to unambiguous 
backbone and (with further experimental elaboration) 
side chain resonance assignments. 

An example of the first approach is the HN(CO)CA 
pulse sequence which has the following magnetization 
transfer pathway: 

Ni(tl) ^ C'i_i ^ C^_i(t2) ^ Ni H'^'(t3) 

The pulse sequence is designed so that magnetiza¬ 
tion is diverted from Ni at the end of the tl period via 
the adjacent carbonyl carbon of the preceding residue 
and thence to its adjacent alpha-carbon. No frequency 
labeling of the C'i_i nucleus is employed, meaning that 
its chemical shift is not encoded in the experiment. 
Thus, compared to the HNCA experiment, the 
HN(CO)CA spectrum provides only the inter-residue 
cross peak connections. This has the advantage of 
removing the scope for ambiguity between the identi¬ 
fication of inter- and intra-cross peak connections that 
arises in the HNCA data. (Note the name of the pulse 
sequence — HN(CO)CA — indicates that the magneti¬ 
zation transfer pathway involves the carbonyl carbon 
without encoding of the corresponding chemical shift 
by placing “CO” in parentheses.) 

An example of the second approach is provided by 
triple resonance NMR experiments that include trans¬ 
fer of magnetization onto, and frequency encoding of, 
the side chain beta-carbon nuclei. Thus the HNCACB 
pulse sequence is a relative of the HNCA experiment 
in which the magnetization generated at the C“i_i and 
C“i nuclei is allowed to evolve under the influence 
of the respective *Jc«cp couplings before frequency 
labeling during t2. In essence, the experiment 
incorporates the following transfer pathways: 

H’^' ^ Ni(tl) ^ C“i ^ C='i/C^i(t2) ^ C“i ^ Ni 

and 

H^.^ N.(tl) ^ C^i-i ^ CVi/cVi(t2) ^ C“i_i 

^ Ni ^ H'^'(t3) 

The outcome of this experiment yields a spectrum 
wherein for each NH cross peak there may be 


correlations to both C'^i and C“i_i resonances along¬ 
side (with inverted phase) correlations with C^i and 
C^i-i. The ambiguity in the inter- and intra-residue 
cross peak connections with C^ atoms can be relieved 
with another triple resonance experiment called CBCA 
(CO)NH. This last experiment has a pulse sequence 
distinct from the so-called “out-and-back” type 
described so far. Thus, the CBCA(CO)NH spectrum 
is recorded with a “straight-through” pulse sequence in 
which the polarization of the H” and H^ protons is first 
transferred to the attached carbon atoms, before relay 
of the magnetization to the backbone NH: 

h7h^ ^ cVi/cVi(ti) ^ cVi ^ c'i_i 

^Ni(t2) ^H'^'(t3) 

The Fourier transformation of a CBCA(CO)NH 
dataset yields a spectrum with cross peaks that corre¬ 
late the C“i_i and chemical shifts in FI with 

those of Nj in F2 and H'^ in F3 (i.e., the inter-residue 
subset of the cross peaks obtained in the HNCACB 
experiment). 

The recording of the C^ chemical shifts represents 
a powerful means to And starting points for complete 
resonance assignment. Specifically: glycine residues 
have no C'^ atom and so lack these correlations (and 
additionally possess a uniquely upheld C“ chemical 
shift); alanine, being the only residue type with 
a methyl group at the C^ position, has a characteristic 
upfield C*^ chemical shift; and serine and threonine, 
which have p-hydroxyl substituents, have characteris¬ 
tically low field C^ chemical shifts that encroach 
on the C™ region. In fact, more generally, the sequences 
of C“/C^ chemical shift pairing revealed in these 
experiments can be interpreted in a probabilistic 
sense to align the connectivity patterns to the 
known amino acid sequence and thereby provide 
confidence in the identified sequence specific assign¬ 
ments (Fig. 4). 

Further Applications of Triple Resonance 
NMR 

What has been described here so far merely scratches 
the surface of the scope of triple resonance NMR 
methodology. For example, as well as applications to 
obtain backbone resonance assignments, there is 
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Triple Resonance NMR, Fig. 4 Example of cross peak walk¬ 
ing (dotted orange line) to assign cross peaks in triple resonance 
NMR spectra. A small selection of 2D contour plot “strips” taken 
from different 3D triple resonance datasets are superimposed. 
Note that the black and green cross peaks corresponding to 
chemical shifts have a relatively poor chemical shift dispersion; 
connections between sequential residues are greatly facilitated 
by the recording of H^(i)-N(i)-C*’(i) and H'^(i)-N(i)-C'’(i-1) 
correlations (red and blue cross peaks) 

a plethora of pulse sequences that are directed to the 
goals of obtaining full side chain assignments 
and extracting NMR parameters - such as NOEs, 
J-couplings, or residual dipolar couplings - that form 
the basis of 3D solution structure determination and 
methods to characterize macromolecular dynamics. 
There are too many of these experiment types to list 
them all here. However, one specific type of experi¬ 
ment is worth highlighting. A primary source of struc¬ 
tural restraints is interproton NOE spectroscopy 
(NOESY) in which correlations between protons that 
are close in space (typically less than 5-6 A apart) are 
revealed as cross peaks in a multidimensional NMR 
experiment. The large number and small size of these 
NOE cross peaks means that traditional 2D homonu- 
clear NOESY spectra are difficult to analyze because 
of chemical shift degeneracy. The advent of *^C/*^N- 
isotope labeling has allowed the elaboration of the 
NOESY experiment into 3D and 4D versions wherein 
the coordinates of an individual NOE cross peak cap¬ 
ture not only the chemical shifts of the protons 
involved but also the directly attached (or '^N) 
atoms (Kay et al. 1990). The effective increase in 
resolution afforded by spreading the NOE information 
over a higher number of spectral dimensions, coupled 


with the higher reliability of signal assignment con¬ 
comitant with triple resonance NMR methods, has 
been a boon to the application of NMR for the structure 
determination of small and medium-sized proteins, 
nucleic acids, and their complexes. 

Each attempt to utilize the triple resonance 
approach for a given sample may have its own advan¬ 
tages and pitfalls, dictated by the nature of the system 
under analysis. Eor example, the larger the protein 
under study (more strictly, the longer the molecular 
rotational correlation time, Tc) the more the efficiency 
of magnetization transfer steps is impacted by the rate 
of transverse nuclear relaxation. A particular bottle¬ 
neck in a subset of the triple resonance NMR experi¬ 
ments is engendered by efficient relaxation of 
coherences involving the nucleus resulting from 
dipolar interaction with the directly attached pro¬ 
ton. To address this and other aspects of working with 
ever larger and more complex proteins and their com¬ 
plexes, spectroscopists have developed means to 
“dilute out” the protons to minimize the effects of 
proton-induced relaxation. The general approach 
involves preparing the target system in a heterologous 
expression host cultured in deuterium oxide (“heavy 
water”) so that the bulk of the protons are replaced with 
deuterons (Gardner and Kay 1998; Goto and Kay 
2000; Tugarinov et al. 2002). The level of 
perdeuteration achieved depends in the precise recipe 
used for the culture medium - for example whether the 
carbon source is also perdeuterated or not - and may 
include more or less sophisticated combinations of 
metabolite intermediates to achieve a particular pattern 
of incorporation in specific chemical 

moieties. A prominent example of the latter situation 
is the use of specifically labeled a-ketoisovalerate and 
a-ketobutyrate to obtain protonated isoleucine, 
leucine, and valine *^CH 3 methyl groups in 
a background of *^C/^H-labeling that permits exploi¬ 
tation of relaxation interference mechanisms to obtain 
high quality ^^C-methyl TROSY spectra in arbitrarily 
large complexes (Kay 2011). 

Because the deuteron has a significantly smaller 
magnetogyric ratio than the proton, the effects of 
intemuclear dipole-dipole relaxation are strongly 
diminished, and these triple resonance experiments 
have better performance in terms of overall sensitivity 
and resolution. (Note: one requires the reprotonation of 
exchangeable protons in order for the H^-directed 
triple resonance experiments to yield a signal. Eor stable 
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proteins, complete reprotonation may require deliberate 
unfolding and refolding of the perdeuterated protein in 
normal H 2 O. This last aspect may be limiting depending 
upon the efficiency of the refolding process.) The com¬ 
bination of double *^C/*^N-isotope labeling with 
perdeuteration is becoming increasingly commonly 
applied as NMR spectroscopists expand the scope of 
their investigation to larger proteins, nucleic acids, and 
their complexes. It is worth pointing out however that 
perdeuteration has the effect of removing a major source 
of structural information, namely the interproton NOEs 
involving the (now deuteriated) side chains. While the 
proton dilution can have a specifically beneficial effect 
of allowing NOEs between protons to be detected 
over a wider distance range than in a fully protonated 
sample, structure determination for perdeuterated sam¬ 
ples will generally require a greater emphasis to be 
applied to alternative sources of structural parameters 
such as residual dipolar couplings (possibly derived in 
multiple dilute liquid crystal alignment systems), rota¬ 
tional diffusion anisotropy measurements from *^N 
nuclear relaxation analysis, or distance measures from 
paramagnetic relaxation effects obtained with modified 
forms of the protein engineered to be specifically 
attached to a paramagnetic metal ion or nitroxide spin 
label. 

Since about 1990, the impact of triple resonance 
NMR on the characterization of biological macromol¬ 
ecules has been dramatic. The well-established para¬ 
digm has been that these experiments are recorded in 
“inverse mode,” meaning that the magnetization is 
ultimately routed by the pulse sequence to (or 
H*^) protons for signal acquisition. The spectrometer 
is equipped with a *H-optimized probehead and the 
spectra only indirectly record the chemical shifts of the 
and ^^N nuclei. In more recent times, and particu¬ 
larly following the advent of cryogenically cooled 
probehead and preamplifier technology, spectrosco¬ 
pists have been exploring the utility of, in effect, 
bypassing the protons and recording directly the cor¬ 
relations between and *^N nuclei. Because both of 
these heteronuclei possess a magnetogyric ratio sub¬ 
stantially smaller than that of the proton, there is 
a significant diminution of the overall nuclear polari¬ 
zation and intrinsic loss of sensitivity. However, in 
certain situations such direct heteronuclear NMR can 
prove advantageous. Eor example, often subsets of 

resonances are lost in the NMR spectra of proteins 
because of dynamical broadening or fast exchange 


with bulk solvent. This is particularly a problem for 
intrinsically disordered polypeptides, a class of 
proteins for which there is increasing interest from 
biophysical and functional perspectives. Moreover, 
for intrinsically disordered proteins (or parts of pro¬ 
teins) the poor chemical shift dispersion can 
become the limiting factor in the discrimination of 
individual signals. It turns out that in these situa¬ 
tions it can prove highly useful to switch to ^^C- 
detected NMR pulse sequences in order to exploit the 
now superior *^C' and *^N spectral dispersion to obtain 
the backbone resonance assignments. Again, a number 
of different pulse sequences have been devised for this 
purpose, and this is an active area of development. One 
strategy (Bermel et al. 2006) involves the combined 
use of 3D CBCACON and COCON pulse sequences, 
referenced to the 2D “heteronuclear fingerprint” CON 
spectrum, where the magnetization transfer pathways 
involved are: 

CON : C'i ^ Ni+i(tl) ^ C'i(t2) 

CBCACON : C“i/C^i(tl) ^ C'i ^ Ni+i(t2) 

^ C'i(t3) 

COCON : C'i(tl) ^ Ni+i(t2) ^ C'i(t3) 

and 

C'i_i(tl)^C'i^Ni+i(t2) 

^ C'i(t3) 

and 

C'i+i(tl)^C'i^Ni+i(t2) 

^ C'i(t3) 

The COCON experiment exploits the three-bond 
^Jc'c' couplings (0.5-2.5 Hz) and provides symmetric 
connectivities between adjacent C' atoms in the poly¬ 
peptide chain. Directionality in the sequential assign¬ 
ment is inferred by reference to the CBCACON 
experiment which aids residue-type identification 
through the pattern of C“/C^ chemical shift pairings. 
Notably in the 2D CON spectrum, proline residues 
form a distinctively resolved group of cross peaks 
because of their unusual downfield '^N chemical 
shift, providing a good set of starting points for cross 
peak walks. Recently *^C-directed NMR pulse 
sequences have been demonstrated for isotope-labeled 
nuclei acids (Richter et al. 2010). 
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Summary 

The development of triple resonance NMR has 
fundamentally expanded the scope of NMR applica¬ 
tions to biological macromolecules. First, it has made 
the process of obtaining resonance assignments for the 
spectra more reliable, and more complete. Second, in 
combination with perdeuteration methods it has wid¬ 
ened the molecular weight range of the targets to which 
NMR spectroscopy can be applied. Nowadays, isotope 
labeling to facilitate triple resonance spectroscopy is 
a prerequisite for NMR investigations, even for rela¬ 
tively small macromolecules. Indeed, the scope of 
triple resonance NMR is arguably only limited by the 
technical challenge to obtain samples in the appropri¬ 
ate isotope-labeled state. Currently this is routine for 
samples that can be prepared in E. coli and some 
simple eukaryote expression systems, but is rarely 
economically viable in systems relying on insect or 
mammalian cell culture. 


Cross-References 

► INEPT 

► Isotope Labeling 

► Nuclear Overhauser Effect 

► Nucleic Acid NMR - Introduction 

► Multidimensional NMR Spectroscopy 

► Protein NMR - Introduction 

► Protein Segmental Labeling 

► Pulsed Eield Gradient NMR 

► Structure Determination by NMR: Overview 

► TROSY 
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Synonyms 

Transverse relaxation optimized spectroscopy 

Definition 

TROSY, or transverse relaxation optimized spectros¬ 
copy is a method used in solution-state nuclear mag¬ 
netic resonance spectroscopy for the acquisition of 
NMR spectra of large biomolecules/ complexes. At 
high magnetic fields, transverse relaxation in *Hn-X 
(n = 1-3) spin systems is dominated by dipole-dipole 
(DD) and chemical shift anisotropy (CSA) interac¬ 
tions. The TROSY experiment utilizes interference 
between dominant relaxation mechanisms (DD/CSA, 
DD/DD, or CSA/CSA) in order to select the slowest 
relaxing component of the detected multiplets in one or 
several spectral dimensions. The advantages of this 
method compared to conventional HSQC include 
increased spectral resolution, increased sensitivity, 
and reduced linewidths. TROSY may be further 
improved using CRINEPT (cross-relaxation-enhanced 
polarization transfer), which improves magnetization 
transfer efficiency in very large biomolecules. 

Initial implementation of TROSY was applied to the 
*H-*^N moieties in *^N-labeled proteins in a 2D fashion. 
Eurther implementations of TROSY have been 
extended to 3D double- /triple-resonance (or higher 
dimensionality) spectroscopy, *^C moieties, such as 
methyl and aromatic groups and for labeled DNA/RNA. 

Cross-References 

► Cross-Correlated Relaxation 

► INEPT 

► Kinetics: Relaxation Methods 

► Multidimensional NMR Spectroscopy 

► Triple Resonance NMR 


Tryptophan 2,3-dioxygenase (TDO) 

► Heme Dioxygenases - Computational Studies 


Tubeless NMR 

► Plow NMR 


Two-Color FIONA 

► Single-Molecule High-Resolution Colocalization 
(SHREC) 


Two-Dimensional Crystals (2D Crystals) 

► Electron Crystallography of 2D Crystals 


Two-Hybrid Screens 

► Protein-Protein Interactions - Interpreting Data 
from Large-Scale Screens 
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Synonyms 

Two-photon excitation microscopy 
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Definition 

It is a nonlinear fluorescence laser scanning micros¬ 
copy method based on two-photon excitation of fluo¬ 
rescence as contrast mechanism. It is a method for 
three-dimensional imaging method of biological spec¬ 
imens in vivo. Compared with confocal laser scanning 
fluorescence microscopy, it has the advantages of 
deeper tissue penetration and less photodamage at the 
expense of a slightly lower resolution. 

Basic Characteristics 

Two-photon excitation (2PE) microscopy uses as 
a basis a phenomenon early utilized in spectroscopy 
(Callis 1997) predicted more than 80 years ago: In 
1931, Maria Goeppert-Mayer in her brilliant doctoral 
dissertation on the theory of two-photon quantum tran¬ 
sitions in atoms and molecules anticipated the 2PE 
possibility (Goppert-Mayer 1931; Masters and So 
2004; Esposito et al. 2004). This photo-physical effect 
was verified experimentally by Kaiser and Garret in 
1961 (Kaiser and Garret 1961). One has to wait until 
1978 to find the description of the first nonlinear scan¬ 
ning optical microscope with depth resolution 
endowed of the possibility of performing 2PE based 
imaging (Sheppard and Kompfner 1978). Denk and 
colleagues clearly demonstrated the capability of 2PE 
imaging in biology (Denk et al. 1990). This brought 
a revolution in fluorescence microscopy stimulating 
original studies from the tracking of individual mole¬ 
cules within living cells to the observation of whole 
organisms (Hell 1996; Diaspro 1999a, b, 2002; Zipfel 
et al. 2003). 

Two-photon excitation of fluorescent molecules is 
a nonlinear process related to the simultaneous absorp¬ 
tion of two photons whose total energy equals the 
energy required for the more familiar one-photon exci¬ 
tation (IPE). The excitation process of a fluorescent 
molecule under IPE typically requires photons in the 
ultraviolet or blue/green spectral range. Under suffi¬ 
ciently intense illumination, as the one provided by 
a laser source, the very same process, i.e., excitation 
of a fluorescent molecule from the ground to the 
excited state, can take place in the infrared spectral 
region. Fig. 1. 

Since the 2PE event requires at least two statisti¬ 
cally “independent” photons for each excitation 



Two-Photon Excitation Fluorescence Microscopy, 
Fig. 1 Simplified Perrin-Jablonski scheme for IPE, 2PE and 
3PE. Once the excited state has been reached the subsequent 
fluorescent emission is the very same for the three different 
modalities of excitation. (Taken from Diaspro et al. (2006), 
open access http://www.biomedical-engineering-online.com/ 
content/5/1/36) 

process, its rate depends on the square power of the 
instantaneous intensity. 

Now, the most popular relationship about 2PE is 
related to the practical situation of a train of beam 
pulses focused through a high numerical aperture 
(NA) objective, with a duration Xp and fp repetition 
rate. Under controlled conditions, the probability, na, 
that a certain fluorophore simultaneously absorbs two 
photons during a single pulse, in the paraxial approx¬ 
imation is given by Denk et al. (1990) na §2 Pave^ 
Tp“* fp“^ NA'*' where Pave is the average power 
of the illumination beam, 82 is the two-photon cross 
section of the fluorescent molecule, and X is the exci¬ 
tation wavelength. Introducing 1 GM (Goppert- 
Mayer) = 10~ [m"^ s], for a 82 of approximately 

10 GM, focusing through an objective of NA >1, an 
average incident laser power of « 1-50 mW, operating 
at a wavelength ranging from 680 to 1,100 nm with 
80-150 fs pulsewidth and 80-100 MHz repetition rate, 
one should get fluorescence without saturation. 
A convenient repetition time of pulses should be on 
the order of typical excited-state lifetime of fluores¬ 
cence, which is a few nanoseconds for commonly used 
molecules. When n^ approaches unity, saturation 
effects start occurring. Such a consideration allows 
one to optimize optical and laser parameters in order 
to maximize the 2PE efficiency (Denk et al. 1990). 

Now, even if the quantum-mechanical selection 
rules for 2PE differ from those for IPE, several 
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common fluorescent molecules can be used (Diaspro 
2002). Unfortunately, the knowledge of IPE cross 
section for a specific fluorescent molecule does not 
allow any quantitative prediction of the two-photon 
trend. The only “rule of thumb” that one can use states 
that a 2PE cross section peak can be expected at 
a twofold wavelength with respect to the linear case 
(Diaspro 2002; Zipfel et al. 2003; Helmchen and Denk 
2005; Cahalan et al. 2002). In terms of optical impli¬ 
cations, the two-photon effect has the important con¬ 
sequence of limiting the excitation region to within 
a subfemtoliter volume. 

Now, the most interesting aspect is that the excita¬ 
tion power falls off as the square of the distance from 
the lens focal point, within the approximation of 
a conical illumination geometry, Eig. 2. 

In practice this means that the square relationship 
between the excitation power and the fluorescence 
intensity brings about the fact that 2PE falls off as the 
fourth power of distance from the focal point of the 
objective. This fact implies that those molecules away 
from the focal region of the objective lens do not 
contribute to the image formation process and are not 
affected hy photobleaching or phototoxicity. Since 
these molecules are not involved in the excitation 
process, a confocal-like effect is obtained without the 
necessity of a confocal pinhole. It is immediately evi¬ 
dent that in this case, the optical sectioning effect is 
obtained in a physically different way with respect to 
the confocal case and that overall photobleaching is 
minimized. In the 2PE scheme, no fluorescence has to 
be removed from the detection pathway since fluores¬ 
cence can exclusively come from a small focal volume 
that has a capacity of the order of fraction of femtoliter. 
In fact, in 2PE over 80% of the total intensity of 
fluorescence comes from a 700-1,000-nm thick region 
about the focal point for objectives with numerical 
apertures >1. The fact that the background signal 
coming from adjacent planes tends to zero produces 
a sort of compensation for the reduced spatial resolu¬ 
tion due to the utilization of a wavelength that is twice 
with respect to the IPE case. On the other hand, the 
utilization of infrared wavelengths instead of 
UV-visible ones allows achieving a deeper penetration 
than in the IPE case. Moreover, multiple fluorescence 
imaging capability have been demonstrated. Using 
a specific 2PE wavelength, it is possible to prime 
fluorescent emission from different fluorophores at 
the very same time, Eig. 3. 



Two-Photon Excitation Fluorescence Microscopy, 
Fig. 2 Confinement of the emission process, in green, under 
red 2PE. Under 2PE the only molecules excited are those 
confined in a small subfemtoliter volume at the illumination 
beam focus position. The capacity of the volume can be 
approximatively calculated by using the resolution parameters 
of the optical system, since they are indicators of the volume 
containing the maximum photon density. This is valid only for 
non-saturated processes. (Taken from Diaspro et al. (2006), open 
access http://www.biomedical-engineering-online.com/content/ 
5/1/36) 

The main elements to realize a 2PE/MPE architec¬ 
ture, including confocal modality, are the following: 
high peak-power laser delivering moderate average 
power (fs or ps pulsed at relatively high repetition 
rate) emitting infrared or near infrared wavelengths 
(650-1,100 nm), laser sources for confocal IPE, 
a laser beam scanning system, high numerical aperture 
objectives (>1), a high-throughput microscope path¬ 
way, a spectral separation module for the emitted sig¬ 
nal discrimination, and a high-sensitivity detection 
system (Diaspro et al. 1999). 

The great impact of 2PE in optical microscopy is 
related to the fact that it couples a three-dimensional 
intrinsic ability with almost five other interesting capa¬ 
bilities (Diaspro et al. 2005): (1) 2PE greatly reduces 
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Two-Photon Excitation Fluorescence Microscopy, 
Fig. 3 2PE multiple fluorescence image from a 16-pm cryostat 
section of mouse intestine stained with a combination of fluo¬ 
rescent stains (F-24631, Molecular Probes). Alexa Fluor 350 
wheat germ agglutinin, a blue-fluorescent lectin, was used to 
stain the mucus of goblet cells. The filamentous actin prevalent 
in the brush border was stained with red-fluorescent Alexa Flu or 
568 phalloidin. Finally, the nuclei were stained with SYTOX® 
Green nucleic acid stain. Imaging has been performed at 780 nm, 
100 X 1.4 NA Leica objective, using a Chameleon XR ultrafast 
Ti-Sapphire laser (Coherent Inc., USA) coupled at LAMBS- 
MicroScoBio with a Spectral Confocal Laser Scanning Micro¬ 
scope, Leica SP2-AOBS. (Taken from Diaspro et al. (2006), 
open access http://www.biomedical-engineering-online.com/ 
content/5/1/3 6) 

photo-interactions and allows imaging of living spec¬ 
imens on longtime periods. (2) 2PE allows operating in 
a high-sensitivity background-free acquisition scheme. 
(3) 2PE microscopy can penetrate turbid and thick 
specimens down to a depth of a few hundred micro¬ 
meters up to 1 mm. (4) Due to the distinct character of 
the absorption spectra of many of the fluorophores, 
2PE allows simultaneous excitation of different fluo¬ 
rescent molecules reducing colocalization errors, 
(5) 2PE can prime photochemical reactions within 
subfemtoliter volumes inside solutions, cells, and tis¬ 
sues. Moreover, the advances in the field of fluorescent 
markers added value and potential to 2PE microscopy. 
It is worth mentioning (Diaspro et al. 2005, 2006) 
among them the design of application tailored 
chromophores, the development and utilization of 
the so-called quantum dots, the use of visible and 
photoactivatable fluorescent proteins from the green 


fluorescent protein (GEP) and its natural homologues 
to specifically engineered variants, the use of 
photoswitchable proteins to break the diffraction bar¬ 
rier in fluorescence microscopy at low light intensities, 
the extension of the method to optical nanoscopy as 
two-photon excitation SEED (cross-reference). 


Cross-References 

► Confocal Laser Scanning Fluorescence Microscopy 

► Fluorescence; General Aspects 

► Fluorescence Correlation Spectroscopy 

► Fluorescence Three-Dimensional Optical Imaging 

► Stimulated Emission Depletion (STED) Microscopy 

► Two-Photon Excitation Fluorescence STED 
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► Two-Photon Excitation Eluorescence Microscopy 


Two-Photon Excitation Stimulated 
Emission Depletion Microscopy 

Alberto Diaspro and Paolo Bianchini 

Department of Nanophysics, Italian Institute of 

Technology (IIT), Genoa, Italy 

Department of Physics, University of Genoa, Genoa, 

Italy 

Synonyms 

2PE-STED; Two-photon excitation fluorescence STED 


Definition 

Two-photon STED is an optical nanoscopy far-field 
method developed within the super-resolution tech¬ 
niques that couples two-photon excitation fluorescence 
microscopy with the stimulated emission depletion pro¬ 
cess. This can be extended to multiphoton excitation. 
Stimulated emission depletion of fluorescent molecules 
allows to increase resolution due to spatial selective 
removal of fluorescence contribution. Two-photon 
excitation (2PE) allows to perform intrinsical three¬ 


dimensional optical section due to the nonlinear mech¬ 
anism of excitation coupled to other advantages, among 
them deep penetration imaging capability due to the use 
of long excitation wavelengths. Unfortunately, optical 
resolution in two-photon excitation imaging is worst 
than in the single-photon excitation case due to the 
general doubling of the excitation wavelength. Moving 
to two-photon STED allows to reach an optical resolu¬ 
tion closer to the single-photon fluorescence excitation 
STED even if hampered by the poorer amount of pho¬ 
tons available. Recently, it was possible to perform 
single-wavelength two-photon excitation STED open¬ 
ing a new perspective toward super-resolution optical 
imaging in deep. The development of two-photon 
STED solutions favored by new laser technologies 
will extend the number of fluorophores and wave¬ 
lengths usable for 2PE-STED microscopy in the future. 

Cross-References 

► Fluorescence Three-Dimensional Optical Imaging 

► Stimulated Emission Depletion (STED) Microscopy 

► Two-Photon Excitation Fluorescence Microscopy 


Type 1 Copper Proteins 

► Cupredoxins 


Type I Interferon Receptor 

Jacob Piehler 

Department of Biology, University of Osnabriick, 
Osnabriick, Lower Saxony, Germany 

Synonyms 

Interferon a/p receptor; IFNAR 

Definition 

Type I interferons are four helical bundle cytokines 
with key functions in the activation of innate 
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Type I Interferon Receptor, 
Fig. 1 Signaling by the type 
I interferon receptor. By 
simultaneous interaction of 
IFNs with the two receptor 
subunits IFNARl and 
IFNAR2 the active signaling 
complex is formed. In turn, the 
kinases Tyk2 and Jakl 
associated with IFNARl and 
IFNAR2, respectively, 
transactivate each other and 
phosphorylate specific 
tyrosine residues of IFNARl 
and IFNAR2 (indicated as red 
dots). These serve as docking 
sites for effector proteins of 
the STAT family. Upon 
phosphorylation, STATl and 
STAT2 form homo- and 
heterodimers, which 
translocate into the nucleus to 
activate transcription 



and adaptive immune responses. Their receptor is 
comprised of the two subunits IFNARl and IFNAR2, 
which belong to the class II cytokine receptor family. 

Introduction 

Type I interferons (IFN) are key cytokines in the early 
immune response upon pathogen infection. As an 
immediate consequence of pathogen recognition by 
the host, IFNs are secreted, which in turn activate 
a cell surface receptor. IFNs elicit numerous innate 
defense responses, which include antiviral, antimicro¬ 
bial, and antiproliferative activity as well as the 
activation of natural killer cells. Moreover, IFNs play 
an important role in the initiation of the adaptive 


immune response. Owing to their fundamental role 
in the activation of immune responses, IFNs have 
numerous medical applications, e.g., for the treatment 
of hepatitis B/C infections, several types of cancers, 
and multiples sclerosis. 

The human IFN family comprises 17 members, 13 
IFNa subtypes, and each one IFNp, IFNco, IFNk, and 
IFNs species, which all bind to the same receptor (Uze 
et al. 2007). The IFN receptor is expressed on all 
nucleated cells and is comprised of two subunits, 
IFNARl and IFNAR2 (Fig. 1). The cytoplasmic 
domains of these receptor subunits are associated 
with the Janus kinases (JAKs) Tyk2 and Jakl, respec¬ 
tively. IFNs simultaneously interact with both 
IFNARl and IFNAR2, thus bringing Jakl and Tyk2 
in close proximity. Tyk2 and Jakl are activated 
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by reciprocal ► trans-phosphorylation and in turn 
phosphorylate specihc tyrosine residues of IFNARl 
and IFNAR2. The ► signal transducers and activators 
of transcription (STATs) proteins STATl and STAT2 
are then recruited to the receptor complex are phos- 
phorylated on specific tyrosine residues and 
heterodimerize to form together with p48/IRF9 the 
► transcription factor ISGF3. This transcription factor 
is responsible for the expression of hundreds of IFN- 
stimulated genes (ISGs). Also, STATl homodimers 
form upon IFNAR activation, which bind to the 
IFNy-activated (GAS) element present in ISG pro¬ 
moters. In addition to these classical IFN pathways, 
other members of the STAT family including STAT3 
and STATS are involved in IFN signaling as well as 
further signaling cascades, which may operate in spe¬ 
cific cellular contexts (Platanias 2005). 

A striking feature of IFN signaling is the high 
redundancy of ligands, which engage the same recep¬ 
tor. Interestingly, differential cellular responses have 
been observed for different IFNs. More specifically, 
IFNa subtypes have very similar antiviral potency 
compared to IFNp. In contrast, IFNP is by ^2 orders 
of magnitude more potent in activating more complex 
cellular responses involving cell proliferation and dif¬ 
ferentiation. Interestingly, these additional responses 
require activation of IFN signaling pathways over sev¬ 
eral days, while antiviral response is already mounted 
after 4 h of IFN stimulation. For medical application, 
designing IFNs with specific activity patterns is highly 
desired, which is particularly important because the 
medical application of IFN is limited by its strong 
side effects. 

Recognition of IFNs by the Receptor 
Subunits 

Since differential activities of IFNs must be encoded in 
the recognition by the receptor, the interactions 
between IFNs and the receptor subunits have been 
characterized in detail by quantitative binding 
and activity assays in combination with mutational 
analysis (Pan et al. 2008; Piehler et al. 2000; Piehler 
and Schreiber 1999; Roisman et al. 2001, 2005; 
Runkel et al. 2000). Moreover, extensive low- and 
high-resolution structural information on these 
complexes is reported (Chill et al. 2003; Li et al. 
2008; Radhakrishnan et al. 1996; Strunk et al. 2008), 



Type I Interferon Receptor, Fig. 2 Crystal structure of IFNco 
in complex with the extracellular domains of IFNARl and 
IFNAR2 (Thomas et al. 2011). The FNIII-like domains (D) of 
IFNARl and IFNAR2 are numbered starting from the N-termi- 
nus. The helices of IFNco are indicated by uppercase letters in 
gray. Integration into the membrane and membrane-proximal 
association sites of Jakl and Tyk2 are indicated schematically 

including crystal structures of binary and ternary IFN- 
receptor complexes (Thomas et al. 2011). The struc¬ 
ture of the IFN-receptor complex and its organization 
in context of the plasma membrane is depicted in 
Fig. 2. The extracellular domains of IFNARl and 
IFNAR2 are comprised of 4 and 2 fibronectin 111 
(FNIII)-like domains, respectively. IFNs are four heli¬ 
cal bundle proteins (helix A, B, C, and E), which com¬ 
prise an additional helix (D). All IFNs bind with high 
affinity to IFNAR2 with dissociation constants ranging 
from the lower nanomolar regime for IFNa subtypes 
down to sub-nanomolar binding affinity of IFNp. 
In contrast, IFNARl is recognized with much lower 
affinity, ranging from several micromolar for IFNa 
subtypes to ^100 nM for IFNP (Lamken et al. 2004). 
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Owing to their very different ligand binding affinity, 
IFNAR2 and IFNARl are often referred to as binding 
and accessory subunit, respectively. The different bind¬ 
ing characteristics are accompanied by differences in 
the architecture of the interaction sites. The IFN/ 
IFNAR2 interface is characterized by a number of 
hotspot amino acid residues on both interaction part¬ 
ners, covering a total buried surface area of ^1,800 A^. 
The IFN binding site of IFNAR2 is located at the hinge 
between D1 and D2, with most residues contributing to 
IFN binding located on D1. Residues of IFN interacting 
with IFNAR2 are located on helix A and C as well as the 
AB loop. The energetic relevance of several interac¬ 
tions between amino acid side chains were confirmed 
by ► double mutant cycle analysis (Roisman et al. 
2001). However, the crystal structure of the IFNa2/ 
IFNAR2 complex revealed that some key interactions 
are mediated by contact with the backbone of IFNa2 
(Thomas et al. 2011). 

In contrast to this very typical architecture of 
a cytokine-receptor complex, an untypical, diffuse 
binding interface of the interaction between IFNs and 
IFNARl is observed (Roisman et al. 2005; Thomas 
et al. 2011). The three N-terminal FNIII-like domains 
are involved in IFN binding (Lamken et al. 2005), 
yielding a relatively large interface of ^2,200 A^. 
IFNs interact with IFNARl through helix B, C, and 
D. However, only two hotspot residues were identified 
for IFNARl and only a single hotspot residue for 
IFNa2 (Pan et al. 2008; Thomas et al. 2011). For 
many interface residues of both IFNARl and IFN, 
a very moderate decrease or even an increase in the 
binding affinity was observed upon mutation into ala¬ 
nine. Strikingly, this was true even for several residues 
highly conserved within the IFN family, suggesting 
that low affinity of IFNa subtypes toward IFNARl is 
an evolutionary optimized feature. For several 
cytokine receptors, cooperative interaction of the two 
receptor subunits with the ligand has been observed. In 
contrast, IFNs interact independently with each of its 
receptor subunits, i.e., binding of 1FNAR2 to IFN does 
not affect its interaction with IFNARl and vice 
versa. Another interesting feature of IFN binding to 
IFNARl is a substantial conformational change of 
IFNARl upon ligand binding (Strunk et al. 2008; 
Thomas et al. 2011), which is propagated into the 
membrane-proximal domain D4 (Strunk et al. 2008). 
The functional role of this conformational change 
remains elusive. However, a critical role of D4 for 


signal activation has been established (Lamken et al. 
2005), suggesting that the conformational change may 
be involved in transmembrane signal activation. 

While no differences in the structure of the signal¬ 
ing complex could be observed for different IFNs, very 
different rate and affinity constants of the interaction 
with the receptor subunits were observed (Jaks et al. 
2007; Lamken et al. 2004; Lavoie et al. 2011). Strik¬ 
ingly, different binding affinities not only resulted into 
different potencies of IFNs, but also correlated with 
differential IFN activities. In particular, the increased 
binding affinity of IFN(3 toward IFNARl was shown to 
play a critical role for additional functions of IFN(3 
compared to IFNa2. Thus, an IFNa2 variant 
engineered for increased binding affinity toward 
IFNARl was shown to be functionally equivalent to 
IFNp (Jaitin et al. 2006). 

Assembly and Dynamics of the Signaling 
Complex 

The consequences of differential binding affinities for 
the formation of the ternary IFN-receptor complex on 
the plasma membrane were explored in vitro with the 
extracellular domains tethered onto artificial mem¬ 
branes (Gavutis et al. 2005). A two-step assembly 
mechanism was confirmed: after IFN binding to 
the high-affinity subunit IFNAR2, IFNARl is subse¬ 
quently recruited on the membrane. The sequence 
of binding events, however, is determined by the 
substantially higher association rate constant of 
the IFN-IFNAR2 complex (k^) rather than its higher 
equilibrium binding affinity (/^£) (Fig. 3). 

No pre-association of the receptor subunits was 
detected under these conditions. This mechanism 
implies a dynamic equilibrium between binary and 
ternary complexes on the plasma membrane, which is 
determined by (a) the affinity of IFN toward IFNARl 
{KJj) and (b) the concentration of the receptor subunits. 
Because the receptor subunits are anchored in the 
membrane, not only the concentrations of the receptor 
subunits, but also the refer to area (e.g., molecules/ 
pm^) rather than volume (e.g., pM) concentrations. 
Thus, KJj cannot be readily inferred from the equilib¬ 
rium constant of the binary complex as determined by 
standard binding assays. However, the equilibrium 
between binary and ternary complex can be assessed 
by ligand binding experiments. As ligand dissociation 
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Type I Interferon Receptor, 
Fig. 3 Two-step assembly of 
the ternary IFN-receptor 
complex in the plasma 
membrane {orange: IFN; blue: 
IFNAR2; green: IFNARl). 
Details in the text 



from the ternary complex is much slower than from the 
binary complex, the equilibrium between binary and 
ternary complexes also affects the total ligand binding 
affinity to the cell surface receptor. Thus, the decrease 
in ligand dissociation kinetics compared to the binary 
IFN-IFNAR2 interaction can be used as a measure for 
ternary complex formation (Gavutis et al. 2005). These 
studies suggest that under physiological conditions 
ternary complex formation in the plasma membrane 
indeed can be limited by the concentration of the 
IFNARl and IFNAR2, for which typically only a few 
100 copies/cell can be found. While this is true for all 
characterized IFNa subtypes, the higher binding 
affinity of IFNp, however, ensures efficient ternary 
complex formation at these receptor surface concen¬ 
trations. It is important to note that the limitation of 
ternary complex formation by cannot be compen¬ 
sated by increasing the IFN concentration. Thus, dif¬ 
ferential efficacy in the recruitment of IFNARl into 
the ternary complex achieved by IFNa subtypes and 
IFNp could be the biophysical bases for regulating 
their differential activities. 


Endocytosis and Negative Feedback 
Regulation 

Specihc functions of IFNp compared to IFNa subtypes 
require long-term activation of the signaling complex. 
Thus, receptor endocytosis and negative feedback 
mechanisms are likely to play a critical role in regulat¬ 
ing differential signaling. 


After IFN stimulation, a rapid decrease of both 
IFNARl and IFNAR2 levels on the cell surface is 
observed (laitin et al. 2006). Moreover, irreversible 
IFN uptake is observed, confirming endocytosis of 
the entire signaling complex, which was shown to be 
clathrin mediated (Marchetti et al. 2006). The possible 
consequences of receptor endocytosis are as manifold 
as the possible endocytic trafficking pathways, and 
only a few should be discussed here, (a) Prima facie, 
the resulting decrease in surface concentration of 
IFNARl and 1FNAR2 on the cell surface probably 
shifts the equilibrium from ternary to binary complex. 
Thus, signaling by IFNP is less affected than signaling 
by IFNa subtype because of the different efficacy of 
IFNARl recruitment, (b) It is very likely that entire 
signaling complexes are taken up by endocytosis. Sig¬ 
naling may proceed in early endosomes, and notably, 
even a critical role of endocytosis for signaling has 
been suggested (Marchetti et al. 2006). In any case, 
endocytosis could well explain why the potency of 
IFNs does not increase with affinity above a certain 
threshold: an increase in binding affinity is in most 
cases accompanied by an increase in complex stability; 
if reversible ternary complex formation is followed by 
irreversible endocytosis, an increase in complex sta¬ 
bility pays of only up to a certain threshold, above 
which complexes are faster endocytosed than they 
dissociate (Fig. 4). 

Another important negative feedback mechanism is 
based on IFN-induced expression of the isopeptidase 
USP18 (UBP43), which specihcally removes the 
ubiquitin-like protein ISG15 from target proteins. 
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Type I Interferon Receptor, Fig. 4 Endocytosis and negative 
feedback in IFN signaling, (a) Reversible IFN binding and 
ternary complex formation is followed by endocytosis of the 
signaling complex. If the rate constant of endocytosis k,, signif¬ 
icantly exceeds the rate constant of complex dissociation k^, 
ternary complex formation can be considered irreversible, 


independent on the IFN binding affinity. While signaling prob¬ 
ably proceeds in early endosomes, endocytosis also leads to 
receptor degradation, (b) Interaction of the negative feedback 
regulator USP18 with the intracellular domain of IFNAR2 shifts 
the equilibrium toward the binary complex 


USP18 was shown to interfere with IFN signaling by 
binding to the cytosolic domain of IFNAR2 
(Malakhova et al. 2006). Interestingly, USP18 
regulates signal activation on the level of ligand 
binding and ternary complex assembly (Francois- 
Newton et al. 2011). Moreover, expression of USP 
results into a strong loss in sensitivity toward IFNa2, 
but not toward IFNP (Francois-Newton et al. 2011). 
This observation is in line with the substantially higher 
receptor binding affinity of IFNP compared to IFNa, 
suggesting that USP18 binding to IFNAR2 interferes 
with the formation of the ternary IFN-receptor com¬ 
plex. However, the important consequence of differ¬ 
ential negative feedback by USP18 is that signaling by 
IFNa subtypes, but not by IFNp is abrogated by the 
first wave of cellular responses. This could explain 
why IFNP can more efficiently elicit responses, 
which require maintaining signaling over extended 
time periods. 


Summary and Conclusions 

IFNs form a network of highly related cytokines, which 
differentially signal through the same receptor. The 
receptor is activated by sequential recruitment of the 
two cell surface receptor subunits IFNAR2 and 
IFNAR1, respectively, to form a ternary signaling com¬ 
plex. Signaling is mediated by tyrosine phosphoryla¬ 
tion through associated JAKs, with the STAT1/STAT2 
heterodimer as a hallmark of IFN signaling. Differen¬ 
tial cellular responses are not based on different struc¬ 
tures of IFN-receptor complexes, but they correlate 
with the binding different affinities toward the receptor 
subunits. In particular, the differential affinity of IFNs 
toward the receptor subunit IFNAR I appears to be the 
biophysical bases for regulating long-term IFN activity, 
which is mediated by receptor endocytosis and by 
destabilization of the ternary complex by the negative 
feedback regulator USP18 as key mechanisms. 
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Synonyms 

Cytochrome hci complex 

Definition 

Ubiquinol-cytochrome c oxidoreductase, commonly 
referred to as complex III or the cytochrome hci com¬ 
plex, is a multifunctional, oligomeric membrane pro¬ 
tein complex localized to the inner mitochondrial 
membrane of eukaryotic cells or the cytoplasmic mem¬ 
brane of prokaryotic organisms. This complex is an 
integral part of the mitochondrial respiratory chain, 
contributing to the generation of electrochemical 
potential (ApH+) by catalyzing the electron transfer 
(ET) reaction from ubiquinol (QH 2 ) to cytochrome c 
coupled to proton translocation across the membrane. 

Historical Perspective 

Complex III was among the earliest discovered ET 
complexes. Cytochrome h was detected as early as 


1922, and the hrst isolation procedure for complex III 
from bovine heart mitochondria was reported in 1962. 
Improvements that yielded more protein and higher 
activity were documented (Yu et al. 2009). Native 
protein crystals were obtained in 1991 (Yue et al. 
1991) and the hrst structure solved in 1997 (Xia et al. 
1997). 

The Structure 

The subunit composition of hcq complexes varies sig- 
nihcantly, from three subunits in P. denitrificans and 
R. capsulatus to as many as 11 subunits in human and 
bovine mitochondria. All hci complexes contain three 
redox subunits: cytochrome h, having two h-type 
hemes, a low potential heme (redox potential or 
Em = —90 mV) bi^ and a high potential heme 
(Em = 50 mV) bn, cytochrome Ci, containing 
a c-type heme (Em = 230 mV); and the iron-sulfur 
protein (ISP), including a high potential 2Fe-2S iron- 
sulfur cluster (ISC, Em = 290 mV) (Fig. 1). All addi¬ 
tional subunits, referred to as supernumerary subunits, 
are believed to contribute to the increased stability of 
these complexes. Purihed complex III also contains 
ubiquinone and various phospholipids. 

Despite its size (^490 kDa for the dimer) and 
complexity (11 different subunits in a monomer), the 
crystal structure of the bovine mitochondrial cyto¬ 
chrome be I complex was the first to be determined 
(Xia et al. 1997). Structures of bci complexes from 
yeast and photosynthetic bacteria R. sphaeroides and 
R. capsulatus were later available (PDB: lEZV, 
2QJP, IZRT). The strong sequence similarity of 
bacterial bc\ complexes to their mitochondrial 
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Ubiquinol-Cytochrome c Oxidoreductase (Complex III), 
Fig.1 Structural model of the dimeric hc^ complex from bovine 
mitochondria. The 11 different subunits are represented as rib¬ 
bons with the color code and subunit designation given on the 
left. Prosthetic groups such as the hemes hi^, h^, Ci and the iron- 
sulfur cluster (ISC) are shown as ball-and-stick models. The two 
black horizontal lines delineate the membrane-embedded por¬ 
tion of the he I complex 


counterparts makes them excellent model systems for 
mitochondrial enzymes. 

At the core of the hci complex is the cytochrome h 
dimer, supporting two pairs of cytochrome Ci and ISP 
subunits on opposite sides. The cytochrome h subunit, 
the only component of the hci complex encoded by the 
mitochondrial genome, is embedded in the membrane 
and consists of eight transmembrane helices. The two 
/7-type hemes are part of the active sites that catalyze 
opposite reactions: The Qp (Qo) site, located near the 
intermembrane space (IMS) in mitochondria or the 
periplasmic side in bacteria, provides access to lipid- 
soluble QH 2 for oxidation, and the Qn (Qi) site, situ¬ 
ated closer to the matrix (mitochondria) or cytoplasm 
(bacteria), carries out the reduction of ubiquinone. The 
two halves of a hc\ dimer are able to communicate 
through ET between the pair of h]^ hemes (Gong et al. 
2005; Swierezek et al. 2010). The catalytic domains for 
cytochrome Ci and ISP subunits are located in the IMS 
(mitochondria) or periplasma (bacteria), each anchored 
to the membrane by a single membrane-spanning helix. 
While the conformation of cytochrome Ci is stable, that 
of the ISP undergoes a controlled domain movement 
thought to be important for hc^ function. 


Functions 

The ET-coupled proton translocation function. The 
cytochrome hcy complex constitutes an essential com¬ 
ponent of the cellular respiratory chain; it catalyzes the 
antimycin-sensitive ET reaction from QH 2 to cyto¬ 
chrome c coupled with proton translocation across 
the membrane. As a result, for every QH 2 molecule 
consumed, four protons are transported and two mole¬ 
cules of cytochrome c are reduced. 

QH 2 + 2c“' -f 2H+^Q + Ic"^' + 4H+ 

The cross-membrane Ape + thus generated serves 
as the energy source for the generation of ATP by ATP 
synthase. The in vitro activity of the bci complex is 
often measured by the spectral change of cytochrome c 
reduction in the presence of ubiquinol analogs. 

The superoxide generation function. Under normal 
respiration conditions, the oxidation of QH 2 by cyto¬ 
chrome c catalyzed by the hci complex is accompanied 
by the generation of a small amount of superoxide 
anions through electron leakage to molecular oxygen 
(see ► Reactive Oxygen Species) (Turrens et al. 1985). 
This superoxide anion generation increases dramati¬ 
cally when the ET within the hcy complex is blocked 
by specific hci inhibitors such as antimycin A or when 
the ET chain becomes over reduced. The most likely 
site of superoxide anion generation under normal ET 
conditions is at the cytochrome bi^. Mutations in the 
cytochrome b subunit result in not only impaired ET 
activity but also in increased amounts of superoxide 
anions (Wenz et al. 2007; Yin et al. 2010). More 
recently, direct participation of molecular oxygen in 
intercepting electrons from the bifurcated oxidation of 
ubiquinol within hydrophobic cavities of the cyto¬ 
chrome h subunit has been suggested (Yin et al. 2010). 

The mitochondrial processing peptidase function. 
The cytochrome bci complexes from higher organisms 
also exhibit mitochondrial processing peptidase (MPP) 
activity (Glaser et al. 1994). Although MPP activity is 
readily detected in the purified hci complex from plant 
mitochondria, it is inactive (but can be activated with 
diluted detergents) in the bc^ complex of bovine mito¬ 
chondria (Deng et al. 2001). The MPP activity occurs 
in the heterodimeric core subunits I and II of the 
mitochondrial hci, which belong to the pitrilysin fam¬ 
ily of zinc metallopeptidases, whose members include 
insulin-degrading enzyme, mitochondrial MPP in 
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Ubiquinol-Cytochrome c Oxidoreductase (Complex III), 
Fig. 2 The Q-cycle mechanism. The Q cycle defines two 
reaction sites: quinol oxidation (Center P or Qp) and quinone 
reduction (Center N or Qn). The first QH 2 moves into the Qp site 
and undergoes bifurcated oxidation either by the concerted 
(a) or sequential (b) mechanism with one electron going to 
cytochrome c via ISP and cyt Cj (high potential chain), and 
another ending in the Qn via hemes fop and bn (low potential 
chain) to form a semiubiquinone, and releasing its two protons to 


the site of the membrane. The second QH 2 is oxidized in 
the same way at the Qp site but its low potential chain electron 
ends up reducing the semiubiquinone and releasing a QH 2 
upon picking up two protons from the *P“ side. As a result of 
the Q cycle, four protons are transferred to the 4'"^ side, 
two protons are picked up from the side and one QH 2 
oxidized. Specific interaction sites are indicated for various 
inhibitors: Sti'g stigmatellin, Myxo myxothiazol, AA antimycin A 


mammals and in fungi. The core I and II subunits of the 
cytochrome hci complex correspond to the P- and a- 
subunits of the MPP, respectively, whose active site 
lies at the interface between the a- and p- subunits. 


The Protonmotive Q-Cycle Mechanism 

Extensive biochemical and biophysical studies have 
led to the formulation of the “protonmotive Q-cycle” 
mechanism (Trumpower 1990) for the proton 
movement-coupled electron transfer in the cytochrome 
hci complex. The critical experiments that led to the 
Q-cycle hypothesis included the oxidant-induced cyto¬ 
chrome h reduction in the presence of hc^ inhibitor 
antimycin (Erecinska et al. 1972) and the reaction 
stoichiometry of two protons translocated across the 
membrane for every electron transferred (Brand et al. 
1976). The identification of the two postulated, sepa¬ 
rate quinone reduction and quinol oxidation sites in the 
crystal structure further validated the “Q-cycle” mech¬ 
anism (Xia et al. 1997). 

The key step in the Q-cycle mechanism is the bifur¬ 
cation of electrons from QH 2 at the Qp site. Two 
mechanisms, termed “sequential” (Crofts 2004) and 
“concerted” (Zhu et al. 2007), have been proposed 
for the bifurcated QH 2 oxidation (Eig. 2). In the 


“sequential oxidation” mechanism, the first electron 
of QH 2 is transferred to the ISC of ISP, then to heme 
Cl, and the resulting ubisemiquinone is used to reduce 
heme then h^. In the “concerted oxidation” mech¬ 
anism, the two electrons from QH 2 are transferred to 
ISC and heme ftp simultaneously. Eailure to detect 
a ubisemiquinone radical at the Qp site (SQp) calls 
into question the validity of the proposed sequential 
oxidation mechanism, even though several plausible 
explanations for the absence of a Q-radical at this site 
have been offered, such as antiferromagnetic coupling 
between SQp and ISC or SQp being rapidly oxidized 
by heme ftp. It should be noted that SQp has been 
detected under abnormal conditions. On the other 
hand, the absence of SQp in a mutant bci complex 
lacking heme (jp further indicates that SQp is not 
involved in bifurcated QH 2 oxidation. In fact, there is 
no evidence, so far, to show that ISC receives an 
electron from QH 2 before heme (jp. Development of 
a method to simultaneously determine the reduction 
rates of the ISC and heme (jp has only recently been 
achieved (Zhu et al. 2007). With known EPR signa¬ 
tures of the ISC, hemes l7p, and cq and the 
ubisemiquinone radical, the pre-steady-state kinetics 
of ISC and heme /jp reduction, by QH 2 , was studied 
using an ultrafast microfluidic mixer and a freeze¬ 
quenching device, coupled with EPR spectroscopy. 
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The results show that heme ISC have similar 

reduction kinetics with a ti /2 of 250 ps. These are 
consistent with the postulated concerted oxidation 
mechanism. 

The Structural Basis of the Mechanism of the 
Bifurcated ET at the Qp Site 

The critical step in the Q-cycle hypothesis is the bifur¬ 
cated ET at the Qp site. The first crystal structure 
revealed that the ISC docks to the cytochrome 
h subunit (ft-position) with the longer than expected 
distance of 31 A between the ISC and heme Ci, which 
does not permit effective ET between these two sites. 
Structures of mitochondrial bcy from various studies 
have shown that the extrinsic domain of ISP subunit 
(ISP-ED) is conformationally mobile; an ET by domain 
movement to bridge the 31 A gap between ISC and 
heme Ci was proposed (Xia et al. 2007). Experimental 
attempts to immobilize ISP-ED by mutations or by cross- 
linking in bacterial and yeast bci complexes have 
invariably inactivated the enzyme (Tian et al. 1998). 

The movement of ISP-ED is, however, by no means 
sufficient for the bifurcated ET, because under normal 
catalytic conditions the rate constant for ET between 
cytochrome Cj and ISP-ED is much faster than the 
turnover rate of the enzyme. A control mechanism, 
called the “binding affinity modulated ISP-ED motion 
switch hypothesis” has been proposed, allowing mod¬ 
ulation of the binding affinity of ISP-ED to the cyto¬ 
chrome h subunit (Xia et al. 2007). This hypothesis is 
supported by a great deal of experimental evidence; 
(1) the conformational switch of ISP-ED upon binding 
of two different types of Qp site inhibitors; (2) the 
simultaneous reduction of cytochrome and ISP by 
ubiquinol; (3) the reduction of b^^ preceding that of 
cytochrome Ci by the pre-steady-state kinetic analysis 
(Zhu et al. 2007); (4) the alterations of ET kinetics 
between cytochrome Ci and the ISP with mutations 
introduced to the ISP-ED binding surface (Rajagukguk 
et al. 2007); and (5) the redox-dependent inhibitory 
efficacy and binding affinity of Pm-type and Pf-type 
inhibitors. These results suggest that the ET from 
hemes fip to fin causes protein conformational changes 
that allow or force the reduced ISP-ED to move from 
the fi-position to the Ci position (Yu et al. 2008) to be 
oxidized. In this way, the second electron of QH 2 
should reach its destination, the quinone/semiquinone 


at the Qn site, before the first electron of QH 2 reaches 
heme Ci. Thus, no short-circuit reaction would occur. 
In other words, reduced ISP-ED moves only when 
heme is in the reduced and heme Ci is in the 
oxidized state. 

Proton Translocation Path 

One important question that remains to be answered 
about complex III function is how protons are trans¬ 
ferred across the membrane. Since the bc\ complex in 
its simplest bacterial form consists of only cyto¬ 
chromes b, Cl and ISP subunits and has full capacity 
for proton translocation, this activity must be achieved 
through the transmembrane helices of these three sub¬ 
units. The van der Waals surface of the matrix side of 
these transmembrane helices, particularly in cyto¬ 
chrome b, features channels, leading to the Qn site 
and serving as possible proton uptake pathways. Con¬ 
formational changes in the Qn site residues have been 
implicated for roles in shuttling protons into the Qn 
pocket (Gao et al. 2003). The van der Waals surface of 
the IMS side, on the other hand, is completely sealed; 
proton exit to IMS must be accompanied by some sort 
of conformational change or gating mechanism. It was 
reported that when the ISC of ISP in bcy is destroyed 
chemically or by mutagenesis, the resulting complex 
leaks protons after being reconstituted into phospho¬ 
lipid vesicles, suggesting that ISC plays an important 
role in proton exit (Gurung et al. 2005). 

It is assumed that the ISC is reduced by receiving an 
electron and a proton from QH 2 at the Qp site and 
subsequently releases the proton to the IMS when 
oxidized by heme Ci. Thus, an H^e ratio of one 
would be achieved. The second proton must exit by 
a different route. Protonable groups at the Qp site, from 
Glu271, Arg80, and the propionates of nearby heme 
have been speculated to play a role in the exit of the 
second proton, which is relayed through these groups 
with the second electron from QH 2 . Based on our 
current understanding, the proton exit is thought to 
occur as follows: The proton of the 1-hydroxyl group 
of QH 2 exits via the His ligand of ISC. The proton of 
the 4-hydroxl group of QH 2 is transferred to Glu271 of 
cytochrome b via protonated superoxide (O 2 H) and 
then to a propionate of heme /?l- The proton is then 
transferred to the IMS from the heme fip propionate via 
Arg80 and/or a proton-bonded water chain. 
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Inhibitors Targeting the bc^ Complex 

As a central component of the cellular respiratory 
chain, the hci complex became an easy target for 
numerous natural antibiotics, as a result of the constant 
battle for survival over the course of evolution. Exam¬ 
ples of natural compounds that specifically target the 
he I complex are antimycin A, strobilurin, 
myxothiazol, and stigmatellin. Many of the earlier 
functional and mechanistic studies of bci complex 
were facilitated by the use of these inhibitors, which 
block ET at selected points (von Jagow and Link 
1986). Structural analysis of hci complexes also 
benefited from these inhibitors, as they stabilize certain 
conformations at a given state. Structures of hci in 
complex with specific inhibitors are available. By 
manipulating differences in uptake between the host 
and pathogens, some of the inhibitors or their deriva¬ 
tives have been developed into antifungal and 
antiparasitic agents and are used in agriculture to con¬ 
trol fungal or bacterial diseases and in medicine to treat 
infections. 

Summary 

The cytochrome hci complex is the mid-segment of the 
cellular respiratory chain; it catalyzes the reaction of 
relaying electrons from ubiquinol molecules to cyto¬ 
chrome c and concomitantly transfers protons across 
the membrane, contributing to the proton-motive force 
essential for a variety of cellular activities such as ATP 
synthesis. Unlike other electron transfer complexes of 
the respiratory chain, the hci complex uses ubiquinol 
to shuttle protons across the membrane and does so via 
the Q-cycle mechanism that obligates the hci complex 
to have a quinol oxidation site, a quinone reduction 
site, and a concerted bifurcated electron transfer at the 
quinol oxidation site. 
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Synonyms 

Ubiquitinylation; Ubiquitylation 


Introduction: Defining Ubiquitination 

► Post-translational modifications are used in cells to 
alter or to regulate protein function. Ubiquitination 


describes the covalent attachment of one or more moi¬ 
eties of the 76 amino acid protein ubiquitin to a protein 
substrate. Discovered for its ability to regulate protein 
turnover in cells through the proteasome (Finley 
2009), ubiquitin signaling is now known to also func¬ 
tion in a wide array of proteasome-independent events, 
such as endocytosis and intracellular trafficking, tran¬ 
scription, and ► DNA repair. The versatility of 
ubiquitination in cell signaling is in part due to the 
diversity of the modification (Fig. la), which includes 
monoubiquitination, the attachment of just one ubiquitin 
to a substrate, multiubiquitination, in which more than 
one substrate amino acid is modified by ubiquitin, and 
polyubiquitination, in which a ubiquitin polymer is 
attached to substrate. Further variability is contributed 
by polyubiquitination through divergent chain length 
and linkage mechanism connecting the ubiquitin moie¬ 
ties within a chain (Fig. la). Such connections are made 
by isopeptide bonds between the C-terminal Gly76 of 
one ubiquitin and a primary amine of another that 
belongs to one of its seven lysines or to its N-terminal 
end. Ubiquitin chains can be of only one linkage type or 
heterotypic with many linkage types. 

Coordinated interactions among a large number of 
proteins determine whether a protein is targeted for 
ubiquitination, the type of ubiquitin modification 
performed on the substrate, and the functional outcome 
of this signaling mechanism (Liu and Walters 2010). In 
humans, >700 enzymes function to ubiquitinate or 
deubiquitinate a protein substrate, and the ubiquitin 
modification is recognized by over 150 receptors, 
which characteristically are multi-domain proteins in 
which one or more ubiquitin binding domain (UBD) 
represents just a portion of the protein’s many 
interacting regions. The integrated use of the many 
participants of the ubiquitinome enables ubiquitin to 
be used with great versatility and stringent specificity. 
Moreover, ubiquitin belongs to a family of ubiquitin- 
like (UBL) proteins that resemble it structurally and 
mechanistically, but perform their own distinct signal¬ 
ing. The UBL protein NEDD8 (neuronal precursor 
cell-expressed developmentally down-regulated pro¬ 
tein 8) plays essential roles in cell cycle control and 
embryogenesis (Dye and Schulman 2007) whereas 
SUMO (small ubiquitin-related modifier) proteins 
function in transcriptional regulation, maintenance of 
genome integrity, intracellular transport, protein sta¬ 
bility, and other biological processes (Geoffroy and 
Hay 2009). 
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Fig. 1 

(a) 

Substrates can 


monoubiquitinated with a single ubiquitin, multiubiquitinated 
with monoubiquitin at multiple residues, or polyubiquitinated 
with ubiquitin chains of closed or extended conformations 
formed by one or more linkage type. Chains can also be forked 
with multiple ubiquitins attached to a common moiety, 
(b) Enzymatic cascade of substrate ubiquitination. An El 


Ubiquitin and Its Enzymes 

Ubiquitin is highly conserved among eukaryotes, and 
a prokaryotic ubiquitin-like protein (Fhip) has been iden¬ 
tified that is functionally analogous to it and discussed 
further in (Bums and Darwin 2010). Most commonly, 
ubiquitin is attached to substrates by an isopeptide bond 
between the carboxyl group of its C-terminal glycine 
(Gly76) and the s-amino group of a substrate Lys resi¬ 
due. A substrate’s N-terminal end, serine hydroxyl or 
cysteine thiol group can also be modified by ubiquitin, 
as reviewed in (Liu and Walters 2010). 

El activating enzymes. Three classes of enzymes 
support ubiquitination including El activating 
enzymes, E2 conjugating enzymes, and E3 ligases 
(Eig. lb). The human genome appears to encode two 
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activates the carboxyl terminus of ubiquitin in an ATP- 
dependent manner to form a thioester bond with its catalytic 
cysteine. It is next transferred to the catalytic cysteine of an E2, 
which either directly ubiquitinates a substrate by using 
a scaffolding RING domain E3 ligase or transfers ubiquitin to 
the catalytic cysteine of a HECT domain E3 for subsequent 
transfer to a substrate 


Els (Ubal and Uba6), ^40 E2s, and greater than 600 
E3s for ubiquitination (Deshaies and Joazeiro 2009). 
Els activate ubiquitin for conjugation to a protein sub¬ 
strate by using a multi-domain architecture (Eig. 2a) 
and ATP to catalyze the adenylation of ubiquitin’s 
C-terminus. The acyl-phosphate linkage is converted 
to a thioester bond between ubiquitin and the El’s 
catalytic cysteine thereby releasing AMP (Pig. lb). 
Prior to the passage of ubiquitin to an E2, the El 
adenylates a second ubiquitin molecule to form an 
asymmetric, double loaded complex with ubiquitin, 
as discussed further in (Schulman and Harper 2009). 
The uhiquitin-loaded El binds to an uncharged E2 for 
a thioester transfer reaction, in which the C-terminus of 
the ubiquitin engaged in a thioester bond with the El is 
transferred to a conserved catalytic cysteine in 
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Ubiquitination, Fig. 2 (a) Structure of the S. cerevisiae 
ubiquitin-activating enzyme 1 (Ubal)-ubiquitin complex (PDB 
3CMM). Ubal contains six structural domains, the inactive 
adenylation domain (lAD, light green), active adenylation domain 
(AAD, dark green), first catalytic cysteine half-domain (FCCH, 
light blue), second catalytic cysteine half-domain (SCCH, dark 
blue) with the catalytic cysteine sidechain atoms rendered as 
purple spheres, four helix bundle (4HB) domain (orange), and 
C-terminal ubiquitin fold domain (UFD, red). Ubiquitin is 
displayed in dark grey and non-covalently associated at the 

a shallow cleft of the E2’s ubiquitin-conjugating 
(UBC) domain (Fig. 2b), as reviewed in (Dye and 
Schulman 2007; Schulman and Harper 2009). 

E2 ubiquitin conjugating enzymes. After being 
charged with ubiquitin, E2s generally interact with an 
E3 to catalyze the ubiquitination of a substrate 
(Fig. lb). Structural studies of ubiquitin and UBL 
enzymes indicate that the E2-E3 interaction involves 
the UBC domain’s N-terminal a-helix (Fig. 3a), which 
is also used to bind the El (Fig. 2b). This shared E2 
binding surface most likely ensures that the E1 enzyme 
is released prior to E2^ubiquitin interaction with an 
E3 (Schulman and Harper 2009). Some E2s contain 
only a UBC domain whereas others also contain 
N-and/or C-terminal extensions. Sequence outside the 
E2 UBC domain can modulate E3 activity. UbcHlO’s 
N-terminal extension provides an additional interac¬ 
tion region for the E3 APC (anaphase-promoting com¬ 
plex) that is required for its substrate selectivity 
(Summers et al. 2008). 

Some E2s generate ubiquitin chains of a distinct 
linkage type. Ubcl3-Mms2 assembles Fys63-linked 
chains, whereas E2-25K/Ubcl and Cdc34 assemble 
Fys48-linked chains, both when free and when 
bound to an E3 (Deshaies and Joazeiro 2009). By 
contrast, members of the UBE2D family, which are 
the smallest E2s and contain only a UBC domain, have 


N'temMnal 



adenylation active site of Ubal; its N- and C-termini are labeled 
Nt and Ct, respectively, (b) Model of the Ubcl UBC domain 
(orange) bound to Ubal’s UFD (red) with Ubcl’s catalytic cys¬ 
teine sidechain atoms rendered in purple. Three conserved acidic 
residues (Glul004, Aspl014, and Glul016) in the last two C- 
terminal (1 strands of Ubal’s UFD form a negatively charged 
surface and appear to contact conserved lysines (Lys5 and Lys9) 
from the UBC domain’s a-helix HI. This model is generated from 
structures assigned PDB accession codes ITTE, 3CMM, and 
1Y8X and as described in Lee and Schindelin (2008) 

no intrinsic linkage specificity and instead synthesize 
ubiquitin chains of all linkage types in vitro (Ye and 
Rape 2009). 

E3 ubiquitin ligases. E3s bind substrates and are 
typically required to link them with an E2~ubiquitin 
intermediate. There are two major classes of ubiquitin 
E3s, those with RING (Really Interesting New Gene)/ 
U-box (Ufd2-box) domains and those with HECT 
(Homologous to E6AP C-Terminus) domains. RING 
and U-box domain E3s serve as scaffolds that bridge 
E2s to substrates and thereby facilitate the transfer of 
ubiquitin from the E2 catalytic cysteine to the substrate 
(Fig. lb). HECT domain E3s accept ubiquitin from the 
E2^ubiquitin thioester adduct to form their own 
ubiquitin thioester intermediate from which ubiquitin 
is passed to a substrate (Fig. lb). HECT E3s have an 
^350 amino acid HECT domain at their C-terminus, 
which contains an N-terminal region (the N-lobe) that 
binds E2s and a C-terminal region (the C-lobe) that 
houses a catalytic cysteine (Fig. 3b). Substrates are 
recognized by protein-protein or protein-lipid interac¬ 
tion regions that are not part of the HECT domain 
(Rotin and Kumar 2009). HECT domain E3s can con¬ 
trol substrate specihcity and chain linkage type, as in 
the case of E6AP (E6-Associated Protein), which pref¬ 
erentially forms Lys48-lirLked chains (Rotin and 
Kumar 2009). 
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Ubiquitination, Fig. 3 (a) Structure of the c-Cbl, UbcH7, and 
phosphorylated ZAP-70 peptide complex (PDB IFBV). The 
TKB domain, linker helix, and RING domain of c-Cbl are 
displayed in grey, yellow, and green, respectively. UbcH7 and 
ZAP-70 peptide are colored in dark and light blue, respectively. 
The catalytic cysteine of UbcH7 and the two Zn^"^ ions in c-CbTs 
RING domain are represented as purple and red spheres, respec¬ 
tively. N- and C-termini are labeled Nt and Ct, respectively, 
(b) Ribbon diagram of the E6AP HECT domain bound to E2 
UbcH7 (blue) (PDB 1C4Z) illustrating its bimodal organization 
of an N- (orange) and C- (pink) lobe. The sidechain atoms of 
catalytic cysteines are rendered as purple spheres, (c) Ribbon 
diagram of Prpl9’s U-box domain (PDB 2BAY) with residues 

Over 95% of human E3s are RING/U-box domain 
E3s (Deshaies and Joazeiro 2009). RING domains 
contain eight buried cysteine or histidine residues that 
coordinate a pair of zinc ions (Fig. 3a). U-boxes lack 
metal ions but adopt the same fold by using salt bridges 
and hydrogen bonds between charged or polar residues 
or a coordinated water molecule (Fig. 3c). Substrates 
are recruited to RING domain proteins either by an 
additional region within the same polypeptide, such as 
the tyrosine kinase binding (TKB) domain in c-Cbl 
(Fig. 3a), or through a common binding partner, as is 


that stabilize the structure by forming hydrogen bonds shown in 
stick representation and labeled. Coordinated water molecules 
are shown as orange spheres, (d) The common catalytic core of 
CRTs consists of a RING domain protein (in orange) and 
a cullin-family member (Cull (in blue), 2, 3, 4A, 4B, 5, and 7). 
The C-terminal cullin homology domain of cullins binds to the 
RING domain protein, which recruits an E2 (in green), while its 
N-terminus binds a specific adaptor protein, such as Skpl (in 
purple) for Cull CRTs, also known as Skpl-Cull-F-box (SCF) 
proteins. Skpl binds to F-box motifs in F-box protein substrate 
receptors (in yellow), thus recruiting them to the SCF complex. 
F-box proteins recruit substrates through a substrate-interaction 
motif (SIM) 

the case for Cullin-RING Ligases (CRLs). CRLs are 
expected to account for approximately half of the 
known ubiquitin E3s and to contribute to ^20% of all 
proteasome-dependent degradation (Soucy et al. 
2009). They recognize their substrates by forming 
complexes with substrate-recognition subunits, such 
as F- or SOCS boxes, DDBl, or BTB domains 
(Fig. 3d). Some RING E3s adopt heterodimeric 
(BRCAI/BARDI, Bmil/RINGIb, and MDM2/ 
MDMX) or homodimeric (PrpI9 U-box and CHIP 
U-box) structures, which can be essential for their 
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activity (Deshaies and Joazeiro 2009; Dye and 
Schulman 2007); the RING E3 BRCAl is activated 
by the RING domain of BARDl. Recently, RBR 
(RING-in-between-RING) family members parkin 
and HHARI (human homologue of ariadne) were 
reclassified as RING/HECT hybrids. Despite lacking 
HECT domains, these E3s form a thioester bond with 
ubiquitin at a conserved cysteine prior to catalyzing its 
transfer to substrate via an isopeptide bond (Wenzel 
et al. 2011). 

Deuhiquitinating enzymes. The ubiquitination sta¬ 
tus of a protein substrate is also influenced by 
deubiquitinating enzymes (DUBs), which play essen¬ 
tial roles in ubiquitin processing and signaling. 
Ubiquitin is expressed in cells as linear polyubiquitin 
precursors, with an additional amino acid following 
the last ubiquitin monomer or fused with ribosomal 
proteins. DUBs are required to generate monoubiquitin 
from these precursors, as further discussed in (Reyes- 
Turcu and Wilkinson 2009). DUBs also act subsequent 
to or in conjunction with the ubiquitination machinery 
to deubiquitinate a protein substrate and thereby shut 
down a ubiquitin signaling cascade, or to reconfigure 
a ubiquitin chain and in turn influence the outcome of 
ubiquitination. Through such roles, they can also main¬ 
tain ubiquitin homeostatis by disassembling chains to 
recycle ubiquitin prior to substrate degradation by 
proteasome or lysosome. 

In humans, there are ^95 DUBs, which are classi¬ 
fied as ubiquitin C-terminal hydrolases (UCHs), 
ubiquitin-specific proteases (USPs/UBPs), ovarian 
tumor domain proteases (OTUs), Josephin/Machado- 
loseph disease proteases (MJDs), or JABl/MPN/ 
Mov34 ► metalloenzymes (JAMMs). Two excellent 
review articles describe how each of these families 
recognize and disassemble ubiquitin chains 
(Komander et al. 2009; Reyes-Turcu and Wilkinson 
2009). The first four subfamilies are cysteine prote¬ 
ases, which form an acyl-cysteine intermediate with 
ubiquitin’s C-terminal carboxyl group prior to hydro¬ 
lysis of the isopeptide bond. By contrast, lAMM 
domains coordinate two zinc ions, one of which acti¬ 
vates a water molecule to attack the carbonyl carbon of 
the isopeptide bond thereby releasing the amino group 
(Komander et al. 2009). 

DUBs can exhibit unique specificity for a modified 
substrate or ubiquitin chain type. Some contain 
ubiquitin-binding domains or other protein-protein 
interaction domains that contribute to DUB 


recognition of substrates or specificity toward particu¬ 
lar ubiquitin chain types. Eor example, the 
polyglutamine disease protein Ataxin-3 binds Lys48- 
and Lys63-linked chains, but preferentially cleaves the 
latter. The three UIMs of Ataxin-3 regulate its cleav¬ 
age specificity by restricting what the protease domain 
can cleave, as reviewed in (Komander et al. 2009). 

DUBs and ubiquitin ligases can function together 
to edit ubiquitin chains, as exemplified by A20. Its 
N-terminal end has an OTU-type DUB domain 
whereas its C-terminal end harbors E3 ligase activity. 
These dual activities enable A20 to convert Lys63- 
linked ubiquitin chains attached to the receptor¬ 
interacting protein (RIP) kinase to Lys48-linked ones, 
thus shutting down tumor necrosis factor (TNE)- 
induced NE-kB activation and signaling for the 
proteasome-mediated degradation of RIP, as discussed 
further in (Reyes-Turcu et al. 2009). 

Ubiquitin and Its Binding Partners 

UBDs continue to be discovered each year. Currently, 
over 20 different families (Table 1) have been identi¬ 
fied in over 150 human receptor proteins for ubiquitin. 
Mechanisms of UBD interaction and their selectivities 
for particular chain types have been described else¬ 
where (Dikic et al. 2009). Most UBDs recognize 
a hydrophobic patch on the ubiquitin surface that is 
formed by Leu8, Ile44, and Val70. Structural biology 
studies found that ubiquitin adopts slightly different 
configurations with its various UBDs, which select 
their preferred ubiquitin configuration from an ensem¬ 
ble of dynamic structures, as reviewed in (Dikic et al. 
2009). 

UBDs typically bind weakly to monoubiquitin and 
exhibit increased affinity for ubiquitin chains. Such 
increased affinity and selectivity for chains of specific 
linkage can arise from the UBD interacting with more 
than one of the ubiquitin moieties within a chain or 
from interactions with the region that links ubiquitin 
moieties together. The C-terminal UBA domain of 
hHR23A (human homologue of Rad23 protein A) 
binds to Lys48-linked chains with 3.6-fold higher 
affinity than to Lys63-linked chains, and 70-fold 
higher affinity than to monoubiquitin. This UBA 
domain contains a unique binding epitope for Lys48- 
linked diubiquitin that is achieved by forming 
a sandwich-like diubiquitin-UBA complex (Fig. 4a). 
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Ubiquitination, Table 1 Ubiquitin binding domains 


Structure 

Ubiquitin binding domain 

a-helical 

UIM, lUIM/MIU, DUIM, UBM, 

UBAN, UBA, CUE, GAT, VHS, UMI 
ubiquitin-like binding domain of ufd2 

Zinc finger 

UBZ, NZF, ZnF A20, ZnF UBP 

Pleckstrin-homology 

domain 

PRU, GLUE 

Ubiquitin- 

conjugating-like 

domain 

UEV, UBC 

Others 

SH3, PFU, JABl/MPN, N-terminal 
domain of DC-UbP, WD40 fi-propellers 


Remarkably, the UBAN domain of NEMO can distin¬ 
guish between Metl- and Lys63-linked chains, two 
chain types that are highly similar. Like hHR23A’s 
C-terminal UBA domain, this discrimination arises 
from the UBAN domain interacting with neighboring 
ubiquitin moieties and their linker region (Fig. 4b); 
both binding modes are reviewed in (Dikic et al. 2009). 

The presence of multiple UBDs also increases 
a receptor’s affinity for ubiquitin chains, as in the 
case of proteasome component S5a, which has two 
ubiquitin interacting motifs (UIMs) that bind simulta¬ 
neously to ubiquitin moieties in a chain (Fig. 4c). 
Moreover, multiple UBDs can be used to generate 
ubiquitin linkage specificity, as in the case of Rap80. 
The UIMs of Rap80 and their intervening linker region 
form a continuous helix that is perfectly suited for 
Lys63-linked chains (Fig. 4d). 

Ubiquitin binding domains are typically just one of 
multiple functional domains in ubiquitin receptors. 
Multivalent interactions involving ubiquitin chains, 
multiple ubiquitin receptors, and domains that localize 
the receptor to specific cellular complexes strongly 
influence the outcome of ubiquitination, as discussed 
further below and reviewed in (Liu and Walters 2010). 

Functional Outcomes of Ubiquitination 

Ubiquitin signaling is used diversely in cells to signal 
for protein degradation, ► DNA repair and transcrip¬ 
tion, innate immunity, endosomal sorting, and cellular 
trafficking. The localization of ubiquitinated cargo to 
specific subcellular assemblies is conferred by interac¬ 
tion surfaces that are distinct from the ubiquitin- 
binding region. The N-terminal Pru domain of 


proteasome component Rpnl3 uses a surface opposite 
to its ubiquitin-binding loops to assemble into the 
proteasome, and the other ubiquitin receptors that 
associate with proteasomes use distinct structural 
domains that are far in sequence from their ubiquitin- 
binding regions (Dikic et al. 2009; Finley 2009). In the 
endosome, ubiquitinated cargo is captured and sorted 
by endosomal sorting complex for transport (ESCRT) 
complexes, which contain interaction domains for 
ubiquitin, DUBs, phosphatidylinositol 3-phosphate 
(PtdIns(3)P), and clathrin. The assembly of these 
ubiquitin-dependent and ubiquitin-independent inter¬ 
actions recruits and concentrates ubiquitinated cargo at 
clathrin-coated microdomains of early endosomes 
(Raiborg and Stenmark 2009). 

The binding affinities of existing interactions can be 
enhanced through ubiquitination. Proliferating cell 
nuclear antigen (PCNA) acts as a “sliding clamp” for 
► DNA polymerases and is monoubiquitinated in 
response to DNA damage and stalled replication. 
This modification increases the affinity of UBD- 
containing TLS (trans-lesion synthesis) polymerases 
for PCNA to promote the error-prone TLS mode of 
replication (Bergink and Jentsch 2009). 

Ubiquitin chains can also serve as a scaffold to 
promote protein interactions, as observed for kinase 
activation in the NF-kB pathway. A Lys63-linked 
ubiquitin chain localizes the protein kinase complexes 
TAK1/TAB1/TAB2 (or TAB3) and NEMO/IKK-a/ 
IKK-p through their UBDs, namely, TAB2 or TAB3 
and NEMO. The nucleation of these two complexes is 
thought to promote TAKl autophosphorylation and in 
turn, its phosphorylation of IKK-P, as described fur¬ 
ther in (Bhoj and Chen 2009). 

Cross Talk with Other Post-translational 
Events 

Ubiquitination is regulated by other post-translational 
modifications. ► Phosphorylation of E2s, E3s, and 
DUBs can regulate substrate ubiquitination positively 
or negatively (Hunter 2007). Phosphorylation of 
a conserved serine in E2 Ubc2/hHR6A promotes its 
ubiquitination of ► histone H2B and the phosphoryla¬ 
tion status of the budding yeast E3 APC/C and that of 
its activator Hctl/Cdhl controls APC/C activity, as 
reviewed in (Deshaies and Joazeiro 2009). Phosphor¬ 
ylation of the tumor suppressor CYLD at Ser418 
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Ubiquitination, Fig. 4 Multivalent interactions between 
UBDs and ubiquitin lead to increased specificity and affinity, 
(a) The hHR23A UBA2 domain (orange) is sandwiched 
between the two ubiquitin subunits of Lys48-linked diubiquitin 
(PDB 1Z06). (b) The UBAN domains of NEMO (light and dark 
blue) form a ► coiled coil dimer and bind two linear diubiquitins 
(PDB 2ZVO). (c) S5a uses its two UIMs (red) to bind ubiquitin 


moieties of Lys48-linked diubiquitin (PDB 2KDE). (d) RAP80’s 
two UIMs (red) and their intervening region (orange) forms 
a contiguous helix to bind selectively to Lys63-linked 
diubiquitin (PDB 3A1Q). In Figs, (a)-(d), the proximal (Ub-P) 
and distal (Ub-D) moieties of diubiquitin are colored in dark and 
light grey, respectively 


causes it to lose its DUB activity and suppresses 
NF-kB signaling (Komander et al. 2009). Other mod¬ 
ifications also regulate the key enzymes in the 
ubiquitination pathway. The conjugating activity of 
E2 Ubcl3 can be suppressed by its ISGylation, which 
blocks Ubcl3’s El and E3 binding surface (van Wijk 
and Timmers 2010). CRLs exhibit maximal E3 activity 
when their C-terminal cullin homology domain is 
neddylated at a conserved lysine. This modification 
induces a conformational change that frees the 
Rbxl’s RING domain (Deshaies and Joazeiro 2009). 

Post-translational modifications on substrate pro¬ 
teins can influence their ubiquitination status by 


changing the exposure of their ubiquitination sites. 
Acetyl transferase p300 acetylates Srnad? at two lysine 
residues that are critical for its ubiquitination and 
thereby protects it from TGEP-induced degradation 
(Deribe et al. 2010). PCNA can be SUMOylated or 
ubiquitinated at a common lysine (Lysl64), and these 
two modifications are mutually exclusive (Bergink and 
Jentsch 2009). Phosphorylation of an E3 ligase substrate 
can promote its ubiquitination by creating an E3 recog¬ 
nition site, as reviewed in (Hunter 2007). Most of CRL 
substrates are targeted to their respective ligase by 
a covalent modification including phosphorylation, N- 
glycosylation, and proline hydroxylation (Deshaies and 
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Joazeiro 2009). Other modifiers can also regulate 
ubiquitination by controlling a substrate’s subcellular 
localization. Phosphorylation of p27Kipl at SerlO trig¬ 
gers its nuclear export and allows it to be degraded by 
cytoplasmic E3 ligases (Hunter 2007). Similarly, 
ubiquitination can regulate other modifications. For 
example, it regulates phosphorylation through degrada¬ 
tion or activation of protein kinases, as reviewed in 
(Hunter 2007). 

Summary 

Ubiquitin is a highly variable modification that is used 
widely in cells. A large number of enzymes support the 
use of ubiquitin as a signaling modifier and these can 
be divided into four classes, ubiquitin El activating 
enzymes, ubiquitin E2 conjugating enzymes, ubiquitin 
E3 ligases, and deubiquitinating enzymes. The activity 
of these enzymes is often regulated by other post- 
translational modifiers and E3s and DUBs can work 
together to achieve a specific cellular outcome. 
Ubiquitin receptors continue to be discovered and 
are typically multi-domain proteins that help control 
the outcome of ubiquitination through their coordi¬ 
nated interactions with ubiquitin and other cellular 
components. 

Cross-References 

► Coiled Coils 

► DNA Polymerases 

► DNA Repair and Recombination 

► Histone-Lysine A-methyltransferase 

► Lysozyme - Computational Studies 

► Machinery of DNA Replication 

► Post-translational Modifications 

► Post-Translational Modification of Histone Proteins 

► Protein Phosphorylation 

► Zinc-Dependent Metalloenzymes - Computational 
Studies 
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2D infrared spectroscopy of proteins; Multi¬ 
dimensional IR spectroscopy of proteins; Nonlinear 
IR spectroscopy of proteins 


Definition 

The principles of two-dimensional infrared spectros¬ 
copy are introduced. Applications of two-dimensional 
infrared spectroscopy to proteins are discussed, includ¬ 
ing studies of protein structures and their ultrafast 
dynamics, both in equilibrium and nonequilibrium. 

Introduction 

2D IR spectroscopy is a novel spectroscopic tool that 
uniquely combines ultrafast time resolution with 
appreciable structure resolution power. The transfer- 
ability of the concepts of multidimensional NMR 
spectroscopy to IR spectroscopy has been postulated 
already in the earliest publications on two-dimensional 
NMR (Aue et al. 1976); however, the first 
two-dimensional IR spectrum was measured only 
relatively recently (Hamm et al. 1998). In analogy to 
2D-NMR spectroscopy (► Multidimensional NMR 
Spectroscopy), the connectivity of various vibrational 
states can be related to local contacts of the corresponding 
molecular groups, which is the basic principle of structure 
determination in both cases. Equivalents of all fundamen¬ 
tal 2D-NMR experiments, COSY, NOESY, and EXSY 
have been demonstrated in the IR range, albeit on orders 
of magnitudes faster timescales than what NMR spec¬ 
troscopy can achieve. 2D IR spectroscopy has provided 
unprecedented insights into the structure, ultrafast 
dynamics, and energy transfer processes of a wide variety 
of molecular systems, ranging from neat liquids such as 
water, DNA (► Infrared Spectroscopy of DNA), 
to peptides and proteins in membrane environment. 

In this Chapter, the basic principles of 2D IR spec¬ 
troscopy will be introduced on a very qualitative level. 
For more theoretical accounts that also discuss many of 
the practical issues of 2D IR spectroscopy, see Hamm 
and Zanni (2011) and Cho (2009). The various modes 
of 2D IR spectroscopy will he illustrated with concrete 
examples of experiments that have been performed to 
learn about structure and dynamics of proteins. This 
list of examples cannot he exhaustive here; for more 
complete overview of the applications of 2D IR spec¬ 
troscopy, the reader is referred to recent special issues 
(Hochstrasser 2007; Mukamel et al. 2009) and review 
articles (Cho 2006; Zheng et al. 2007; Bredenbeck 
et al. 2007; Finkelstein et al. 2007; Hamm et al. 
2008; Ganim et al. 2008; Kim and Hochstrasser 2009). 
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Note that 2D IR spectroscopy discussed here is 
fundamentally different from 2D IR correlation spec¬ 
troscopy also common in the study of protein systems 
(► Infrared Spectroscopy: Data Analysis). While 2D 
IR spectroscopy disturbs the molecular system through 
ultrafast laser pulses, 2D IR correlation spectroscopy 
does so by an external perturbation such as 
temperature. 


Basic Principles of 2D IR Spectroscopy 

2D IR spectra are measured by interrogating 
a molecular system with a sequence of ultrashort 
IR laser pulses, which, in simple words, can be 
classified as a pump and a probe process. The essen¬ 
tial idea of 2D IR spectroscopy is to correlate the 
pumped spectrum (one frequency axis) with the 
probed spectrum (second frequency axis). There 
are many mechanisms that cause such correlations, 
as discussed below, and they may also change as 
a function of time between pump and probe process. 
For the pump-process, a spectrally selective IR 
pump pulse can be used directly, which then is 
relatively long in time (typically 1 ps), or a sequence 
of two ultrashort IR laser pulses (typically 50- 
100 fs) in which case the frequency information is 
regained by a post-processing Fourier transforma¬ 
tion. Both methods reveal very similar spectra. 
While the second method has conceptual advan¬ 
tages, it is in fact more intuitive to think in terms 
of the first pump-probe picture. 

All one needs to construct a 2D IR spectrum is 
a scheme of vibrational states together with the transi¬ 
tion dipoles connecting them. As the simplest possible 
case, consider the 2D IR spectrum of a single, slightly 
anharmonic oscillator (Fig. la, b) (► Molecular Vibra¬ 
tions and Their Interaction with Electromagnetic Radi¬ 
ation). Imagine scanning the pump-process across the 
resonance frequency of the vibrator. When resonance 
with the dipole allowed 0-1 transition is achieved, 
Mpump = ffloi, then a certain fraction of molecules in 
the laser focus will be excited from their ground state 
|0) into their first vibrationally excited state |1) (solid 
arrow in Fig. la). The subsequent probe pulse will test 
the two possible transitions from that excited state, i.e., 
the stimulated emission back into the ground-state and 
the excited-state absorption into the second excited 
state |2) (dotted arrows in Fig. la). In addition, since 





Ultrafast Two-Dimensional Infrared Spectroscopy of 
Proteins, Fig. 1 (a) Level scheme of an anharmonic oscillator 
with the dipole-allowed transitions depicted. The solid arrow 
represents the pump-process, the dotted arrow the probe process, 
(b) Resulting 2D-IR spectrum. Signals with opposite signs are 
depicted in red and blue, respectively, (c, d) The same for two 
coupled oscillators. The nomenclature \kl) labels a state with k 
quanta of excitation in the first mode and / quanta of excitation in 
the second mode. The dotted line in panel (d) indicates the 
pump frequency discussed in the text (Adapted from Hamm 
and Zanni (2011)) 


the difference of absorption between pump pulse 
switched on minus pump pulse switched off is typi¬ 
cally measured, the probe pulse will experience less 
absorption of ground-state molecules, an effect called 
a bleach. Both bleach and stimulated emission occur at 
the original 0-1 frequency cooi with identical sign. The 
excited-states absorption is typically red shifted with 
respect to the original 0-1 frequency due to 
the anharmonicity of the vibrator, and occurs with 
opposite sign due to the additional absorption that 
would not be present without a pump pulse. As 
a result, a doublet of peaks in the 2D-IR spectrum 
with opposite signs will be observed. Both will be 
situated at cOpump = ffloi since both are the outcome of 
the same pump-process. With respect to the probe 
frequency, however, they are resolved: the bleach/ 
stimulated emission signal appears at frequencies 
®probe = ®on whereas the excited-state absorption 
signal appears at (flprobe = ®i 2 - Both peaks have 
roughly the same intensities. 


u 



























u 


2694 


Ultrafast Two-Dimensional Infrared Spectroscopy of Proteins 


Cross-peaks 

Cross-peaks are the hallmark of multidimensional 
spectroscopy. They are a measure of the coupling 
between molecular vibrations and thus contain impor¬ 
tant information on molecular structure. Figure Ic 
shows the level scheme for two oscillators with all 
dipole-allowed transitions depicted (assuming the 
selection rules of harmonic oscillators, which typically 
is the case if anharmonicity is small). Again, imagine 
scanning the pump frequency across the resonances of 
the two oscillators. If the pump-process is resonant 
with, e.g., the first oscillator, state 110) will be excited 
(see dotted line in Fig. Id). The subsequent probe pulse 
now can interrogate three possible transitions, labeled 
(1), (3), and (4) in Fig. Ic. In addition, the probe pulse 
will observe a bleach of both oscillators, giving rise to 
transitions (8) and (2), since the number of molecules 
in the common ground state |00) is diminished. 
Transitions (8), (4), and (3) are the same as for 
a single oscillator (Fig. la, b), whereas transitions 
(1) and (2) are new. Transition (1) excites the second 
oscillator by one quantum from its ground state to its 
first excited state in the presence of already one quan¬ 
tum of excitation in the first oscillator: 110)^111). 
If the two oscillators were not coupled, then the exci¬ 
tation frequency of the second oscillator would not 
depend on the number of quanta of the first oscillator, 
and one would have exactly the same frequency for 
the |00)^I01) and the 110)^111) transitions. In 
that case, peaks (1) and (2) would exactly coincide 
and cancel each other due to their identical transition 
strength but opposite sign. If the two oscillators 
are coupled, however, then the frequencies of the 
two transitions are in fact slightly different and 
the two peaks do not cancel, so a doublet is obtained 
as a cross-peak in the off-diagonal region (Fig. Id). 

The coupling between two vibrational modes is 
closely related to the spatial vicinity of the 
corresponding molecular groups. The most common 
description is the exciton model, which has been used 
extensively to describe 2D IR spectra of the 
amide I band of peptides and proteins. In this case, 
the C=0 groups of the various peptide units along the 
backbone are thought to be coupled mostly through 
electrostatic interactions, which in the most simple 
approximation can be calculated as transition dipole¬ 
coupling (TDC) (Krimm and Bandekar 1986; 
Torii and Tasumi 1992, more sophisticated models 


for the coupling have been worked out, see Hamm 
and Zanni 2011) 

^ {nj ■ A) {nj ■ fij) 

rl /- 5 . 

. 'j ‘1 J 

where /ly is the strength of the coupling, p, are the 
transition dipoles of the local C=0 stretch modes, 
and fij a vector connecting two peptide units. In the 
weak coupling limit (|/iy| << |cu,'— my|), one can 
show that the off-diagonal anharmonicities jr,, (see 
Fig. Id) scale as: 


{hojj — hcoiY 

where A is the intrinsic (diagonal) anharmonicity 
of the C=0 vibrator, and m, its local mode frequency. 
The strength of the coupling, /iy, and hence the size of the 
cross-peak, is given by the distance and relative orienta¬ 
tion of the C=0 groups. This is the very idea behind the 
capability of 2D-IR spectroscopy to determine the struc¬ 
ture and/or structure distribution of peptides and proteins. 

The local mode frequency may be modified by 
isotope labeling, which allows one to spectrally isolate 
certain parts of a protein from the rest. Isotope labeling 
turns out to be a crucial tool to study more complicated 
molecular systems in a site-selective manner. 

Examples of Applications to Proteins 

Exciton coupling leads to 2D IR lineshapes of the amide 
I band that are a characteristic measure of the overall 
secondary structure of a protein, in particular, in the case 
of P-sheet rich proteins (Ganim et al. 2008). Site- 
selective information, in contrast, is obtained by isotope 
labeling. For example, Remorino et al. (2010) 
have determined the structure of a transmembrane 
helix-homodimer. Several amino acids have been 
'^C'^O-labeled (one at a time), and the coupling from 
one helix to the other has been measured. This produced 
structural constraints that were sufficient to elucidate the 
structure in connection with molecular modelling tools. 


PiJ = 


1 

47t£o 


Inhomogeneous Broadening and Spectral 
Diffusion 

It is very common that not all molecules in a sample 
are identical; they may differ in structure, hydrogen 
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Ultrafast Two-Dimensional Infrared Spectroscopy of 
Proteins, Fig. 2 (a) An inhomogeneously broadened vibra¬ 
tional transition and (b) resulting 2D-IR spectrum. Panels 
(c) and (d) show the same at a later time-delay, when spectral 
diffusion has occurred (Adapted from Hamm and Zanni (201 1)) 


bonding, or their environments. Because an ensemble 
of many molecules is measured, these differences will 
usually broaden the lineshape, which is referred to as 
inhomogeneous broadening. Hence, the overall 
absorption line is a superposition of many absorption 
lines, each of which has an intrinsic line width 
(Fig. 2a). In this case, only a sub-ensemble of mole¬ 
cules will be resonant with a given pump frequency, 
the homogeneous line width of which might be signif¬ 
icantly narrower than the overall line shape. As 
a result, a doublet corresponding to bleach/stimulated 
emission and excited state absorption contribution will 
be observed in the 2D-IR spectrum (the same way as 
Fig. lb) only of that sub-ensemble of molecules. As the 
pump-process is scanned across the vibrational transi¬ 
tion, a different sub-ensemble will be selected, and the 
overall 2D-IR lineshape will be tilted along the diago¬ 
nal (Fig. 2b). Pumped and probed molecules are cor¬ 
related. This is the hole-burning picture of 2D-IR 
spectroscopy, in which the pump-process burns 
a hole into an inhomogeneously broadened line. In 
simple words, the anti-diagonal width of the diagonal 
0-1 peak represents the homogeneous line width of the 
transition, whereas the diagonal width represents 
the total line width (i.e., inhomogeneous width convo¬ 
luted with the homogeneous width). 


In liquid-phase systems, the environment of 
a molecular group will not be static; hence, the term 
“inhomogeneous broadening” really depends of the 
timescale on which it is investigated. If the environ¬ 
ment in the vicinity of a particular reporter group 
diffuse around, then its vibrational frequency will 
change and alter the observed distribution of frequen¬ 
cies (Fig. 2c). This process is called spectral diffusion, 
and can be measured with 2D IR spectroscopy by 
varying the time-delay between pump and probe pro¬ 
cess. At small enough time-delays, the environment 
will not have enough time to move, and so one mea¬ 
sures a seemingly static inhomogeneous distribution of 
frequencies for the reporter group, as discussed above 
(Fig. 2b). As the time-delay is increased, however, the 
environment is given time to diffuse and, thus, the 
molecular group under study change its frequency 
(Fig. 2c). In the 2D IR spectrum, this results in a shift 
of intensity away from the diagonal, as indicated by the 
arrows in Fig. 2d. Consequently, pumped and probed 
molecules lose their correlation, and the 2D IR 
lineshape becomes more and more round. The 
timescale on which that happens reflects directly 
the timescale on which the environment of the reporter 
group changes. 

Examples of Applications to Proteins 

Manor et al. (2009) have determined the structure of 
the influenza A M2 channel by making use of the fact 
that those C=0 units (again, *^C**0-labeled) pointing 
into the channel experience a larger inhomogeneous 
broadening due to interactions with the channel water, 
whereas those pointing outward sense the more homo¬ 
geneous environment of the membrane. Another inter¬ 
esting example has been reported by Bandaria et al. 
(2010) who showed that the active site reveals only 
a short-lived inhomogeneity, but is otherwise structur¬ 
ally very homogeneous when the enzyme adopts the 
transition state-structure of a proton-transport reaction. 
Both reactant and product structures, on the other hand, 
exhibit a wide distribution of structures. 

Chemical Exchange 

The previous example had considered a continuous dis¬ 
tribution of vibrational frequencies due to a continuum 
of environments around a given molecular group. 
But distributions are often bimodal rather than 
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Ultrafast Two-Dimensional Infrared Spectroscopy of 
Proteins, Fig. 3 2D IR spectrum of a system undergoing 
chemical exchange with rate k; (a) no pump-probe time-delay 
and (b) a delay that is roughly that of the exchange kinetics 
(Adapted from Hamm and Zarmi (201 1)) 

continuous (Fig. 3a). For example, a hydrogen bond 
either exists or it does not, but the two states are sepa¬ 
rated by a reaction barrier, so the probability of finding 
molecules in the transition state is very low. In the 2D IR 
spectrum of a bimodal distribution, each structure 
creates a pair of diagonal peaks. At early pump-probe 
time-delays, there are no cross-peaks because one mol¬ 
ecule cannot exist in both states (see Fig. 3a). However, 
if the molecule changes between the pump and the 
probe process, then the molecule that was initially 
pumped in the one state is now probed in the other. 
Thus, the dynamics creates cross-peaks on both side of 
the diagonal (since the system is in equilibrium, mole¬ 
cules react in both directions). The cross-peak intensity 
rises with a time-constant that directly reflects the chem¬ 
ical exchange rate k between both species (see Fig. 3b). 
What is unique about 2D IR spectroscopy as compared 
to NMR is the ultrafast timescale it may resolve, ranging 
from femtoseconds to picoseconds. 

Examples of Applications to Proteins 

Ishikawa et al. (2008) used ultrafast exchange spec¬ 
troscopy of a mutant of CO-myoglobin to directly 
observe the ultrafast switching between two conforma¬ 
tional substates of the protein on a 50 ps timescale. 
Both substates give rise to a small frequency shift of 
the CO vibration, so the time-dependence of the cross¬ 
peaks mirror directly the transition between substates. 


Transient 2D IR Spectroscopy 

Chemical exchange is an equilibrium process such 
that the dynamics are set by the forward and backward 
rate constants. A second mode of measuring chemical 
or structural dynamics is often referred to as transient 
2D IR spectroscopy, in which some sort of 
nonequilibrium dynamics are initiated, for instance, 
by a temperature jump or a photochemical reaction. 
The resulting intermediates are probed with 2D IR 
spectroscopy as a function of time-delay between 
the actinic pump and the 2D IR probe. To the extent 
2D IR spectroscopy can elucidate the structure of 
molecule systems, transient 2D IR spectroscopy 
reveals information on transient structures during 
such nonequilibrium processes with ultrafast time 
resolution. 

Examples of Applications to Proteins 

Kolano et al. (2006) used transient 2D-IR spectroscopy 
to observe the unfolding of a p-turn on a 100 ps time- 
scale after photocleavage of a disulflde bridge. Upon 
unfolding, a hydrogen-bond breaks, which causes the 
intensity of a cross-peak across that hydrogen bond to 
change. Chung et al. (2007) investigated the unfolding 
of ubiquitin after an ultrafast temperature jump, 
observing characteristic changes of the amide I 2D IR 
spectrum. On a slower timescale. Shim et al. 2009 
mapped the aggregation pathway of amyloid fibrils 
involved in type-2 diabetes. Using isotope labeling, 
they obtained residue-specific structural information 
that provided the most detailed structural mechanism 
of fiber formation so far obtained. Work by Ihalainen 
et al. (2008) on microsecond peptide folding also 
illustrates the powerful combination of isotope 
labeling and transient IR spectroscopy (► Infrared 
Spectroscopy of Protein Dynamics: Ultrafast Kinetics) 
for elucidating protein kinetics with bond-by-bond 
structural resolution. 


Cross-References 

► Infrared Spectroscopy of DNA 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 

► Multidimensional NMR Spectroscopy 
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Ultramicroscopy 

► Selective Plane Illumination Microscopy (SPIM) 


Ultraviolet Resonance Raman (UVRR) 
Spectroscopy Studies of Structure and 
Dynamics of Proteins 

Gurusamy Balakrishnan and Thomas G. Spiro 
Department of Chemistry, University of Washington, 
Seattle, WA, USA 

Synonyms 

Allostery - allosterism; Aromatic side chain - aromatic 
amino acids; Hydrogen bonding; H-honding; Peptide 
backbone - amide backbone; Photolysis - photodisso¬ 
ciation; Temperature jump 

Definition 

Excitation in the deep UV region leads to selectively 
enhanced Raman spectra of various biological chro- 
mophores including peptide backbones (192-206 nm), 
proline (ca. 210 nm), aromatic amino acid side chains 
(192-260 nm), and nucleic acids (240-280 nm). 
The application of UVRR spectroscopy to study the 
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structure and dynamics of proteins is illustrated by 
studies of the allosteric transition in hemoglobin, the 
helix-coil transition in a model peptide, and the 
unfolding dynamics of apomyglohin. 

Introduction 

Vibrational spectroscopy (► Molecular Vibrations and 
Their Interaction with Electromagnetic Radiation) has 
proved to be a useful structural probe of biological 
systems as measured by infrared (IR) absorption and 
Raman scattering. Although the fundamental princi¬ 
ples and instrumentation of IR (► Protein Dynamics: 
Time-Resolved Spectroscopic Studies) and Raman 
spectroscopy (► Raman Spectroscopic Instrumenta¬ 
tion, Experimental Considerations) differ greatly, 
they provide complementary structural information. 
Various aspects of IR spectroscopic applications to 
protein structure and dynamics can he found in the 
articles ► Infrared Spectroscopy: Data Analysis, 

► Infrared Spectroscopy of Protein Structure and 

► Infrared Spectroscopy of Protein holding, 
Misfolding and Aggregation. The advent of reliable 
tunable lasers expanded applications of Raman spec¬ 
troscopy to a wide variety of biological chromophores 
including prosthetic groups, cofactors, aromatic amino 
acids, nucleic acids, and the peptide backbone of 
proteins. Advances in instrumentation and nanotech¬ 
nology have led to the development of different 
types of Raman spectroscopy including CARS 
(► Nonlinear Raman Spectroscopy: Coherent 
Anti-Stokes Raman Scattering (CARS)), SERS 
(► Surface-Enhanced Resonance Raman Spectros¬ 
copy in Electron Transfer Studies, ► Surface- 
Enhanced Raman Spectroscopy for Bioanalytics), 
fsSRS (► Ultrafast Raman Spectroscopy), TERS (Tip 
enhanced Raman spectroscopy), and ROA (► Raman 
Optical Activity Studies of Structure and Behavior of 
Biomolecules). 

Laser excitation in resonance with allowed elec¬ 
tronic transitions gives strong enhancement of vibra¬ 
tional Raman bands and provides high sensitivity and 
selectivity for reporters of chromophore stmctural fea¬ 
tures (Harada and Takeuchi 1986; Asher 1988; 
Kitagawa 1992; Austin et al. 1993). Resonance 
Raman (RR) enhancement depends on the degree to 


which the vibrational mode carries the molecule 
toward the geometry of the excited state. In protein 
studies, UV excitation selectively enhances Raman 
bands of the peptide backbone (192-206 nm), and of 
proline (ca. 210 nm), histidine (ca. 220 nm), and aro¬ 
matic amino acid residues (192-260 nm), as discussed 
below. Excitation in the 240-280 nm regions reso¬ 
nantly enhances Raman bands of nucleic acids, per¬ 
mitting the characterization of structure and dynamics 
in DNA and RNA. 

Dynamical features of protein structure can be 
studied with pump-probe techniques, using UV 
laser pulses to probe Raman signals following the 
initiation of dynamics through photoexcitation with 
visible wavelength pulses, or through temperature 
jumps via solvent heating with infrared pulses. Illus¬ 
trations are provided by studies of the allosteric 
transition in hemoglobin, the helix-coil transition 
in a model peptide, and the unfolding dynamics of 
apomyglohin. 


UVRR Spectra of Aromatic Acid Side Chains 

Phenylalanine: The UVRR spectmm with 197 nm 
excitation (Fig. 1) is in resonance with the strong Ba^b 
absorption band (Fig. 1 inset, J^^ax = 188 nm, 
£ = 60.3 mM“* cm”*). It contains strongly enhanced 
ring modes of the phenyl side chain (assignments in 
Table 1). The Bab band arises from the first allowed 
benzene 7i—7i* transition, and is preceded by symme¬ 
try-forbidden transitions, which appear as weak 
absorption bands. La and Lb, at 206 nm (£ = 10.5 
mM“* cm”*) and 257 nm (e = 0.2 mM”* cm”*) 
(Fodor et al. 1989). Excitation at the latter wavelengths 
gives weakly enhanced UVRR spectra, dominated by 
the F8a, F8b, and F9a modes, which vibronically mix 
the La and Ba b transitions. 

Although the UVRR band positions are essentially 
invariant, the intensities are sensitive to environment 
and are useful monitors of phenylalanine solvent expo¬ 
sure in proteins. 

Tyrosine: UVRR spectra are selectively enhanced at 
229 nm (Fig. 2), in resonance with the La electronic 
transition (Fig. 2 inset), which is red-shifted relative to 
phenylalanine (La at 223 nm, s = 8.4 mM”* cm”*; Lb 
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Ultraviolet Resonance 
Raman (UVRR) 
Spectroscopy Studies of 
Structure and Dynamics of 
Proteins, Fig. 1 197 nm - 
excited UVRR spectrum of 
L-phenylalanine in water. 
Inset - absorption spectrum. 



at 275 nm, e = 1.4 mM * cm Bat, at 193 nm, 
£ = 46.5 mM“^ cm”') (Fodor et al. 1989). 
The vibronically active modes Y8a, Y8b, and Y9a 
(Table 2) are dominant. 

Deprotonation of the -OH group red-shifts the elec¬ 
tronic transitions, and downshifts the vibrational mode 
wave numbers, especially Y8a, Y8b, and Y9a 
(Table 2). H-bond donation from the O-H produces 
intermediate shifts, making UVRR a useful probe for 
tyrosine H-bonding in proteins. 

Tryptophan: 229 nm is also a useful excitation 
wavelength for tryptophan, enhancing several indole 
ring modes (Table 3), near resonance with the strong 
Bb electronic transition (Fig. 3). 

The W3 band has proven to be a particularly useful 
structure marker, because its frequency varies with the 
orientation of the indole ring, following the empirical 
equation vW3 =1,542 + 6.7(cos 3lx^’'l + 1)*'^, where 
X^’* is the torsional angle around the Cp-Ca bond 
(Takeuchi 2003). The orientation dependence can 
sometimes be used to distinguish contributions from 
different tryptophan residues in a protein. 

H-bonding from the indole ring shifts the frequen¬ 
cies of the W4, W6, and W17 modes. W4 and W6 
bands are frequently obscured in protein UVRR 


Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Table 1 Funda¬ 
mental ring vibrations of phenylalanine observed in UVRR 
spectra 


v/cm ' 

Label 

Assignment 

1,606 

F8a 

Ring C-C stretch (aj) 

1,586 

F8b 

Asymmetric ring C-C stretch (b 2 ) 

1,207 

F7a 

Symmetric Cnng-Cext stretch (aj) 

1,182 

F9a 

C-H bend + ring C-C stretch (aj) 

1,030 

F18a 

C-H bend (aO 

1,000 

F12 

Symmetric ring stretching (a,) 


spectra, but W17 is better resolved (Austin et al. 
1993). Intensification of the strong W3 band has also 
been used to monitor tryptophan H-bonding. 

The intensity ratio, Ii 36 o/It 342 of the Fermi doublet 
band associated with the W7 mode is an indicator of 
the polarity of the tryptophan environment (Harada 
and Takeuchi 1986). The ratio is high for buried and 
low for water-exposed tryptophan residues. 

Histidine: UVRR spectra of histidine contain a rich 
assortment of bands (Fig. 4 and Table 4) but enhancement 
is modest, reflecting the weakness of the absorption band 
(207 nm, e = 5.8 mM“' cm“') (Fasman 1976), 
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Ultraviolet Resonance 
Raman (UVRR) 
Spectroscopy Studies of 
Structure and Dynamics of 
Proteins, Fig. 2 229 nm - 
excited UVRR spectrum of 
L-tyrosine in water. Inset - 
absorption spectrum 



Ultraviolet Resonance Raman (UVRR) Spectroscopy Studies of Structure and Dynamics of Proteins, Table 2 Fundamental 
ring vibrations of tyrosine and tyrosinate bands observed in UVRR spectra 


Tyrosine v/cm * 

Tyrosinate v/cm * 

Label 

Assignment 

1,617 

1,601 

Y8a 

Ring C-C stretch 

1,601 

1,558 

Y8b 

Ring C-C stretch 

1,263 

1,266 

Y7a’ 

Vj^’ C-0 stretch 

1,210 

1,208 

Y7a 

^7a Crinp;~Cext StrCtch 

1,180 

1,174 

Y9a 

V 9 „ C-H bend + COH bend 

852 

850 

Y16a 

Fermi doublet: interaction of Vi (ring breathing) 832 cm“* with 
2vi6a, an out-of-plane ring deformation 


associated with the ti-ti* transition of the imidazole ring. 
Band positions depend on which imidazole N atom 
carries the proton, and can be used to distinguish n and 
I tautomers. Complexation by metal ions increases the 
frequencies of the € 4=05 stretching bands (Takeuchi 
2003). 

Protonation or deprotonation of the imidazole pro¬ 
duces UVRR spectra with fewer and stronger bands, 
owing to the symmetrization of the ring. In D 2 O, the 
N-deuterated histidinium (HisD 2 '^) gives rise to 


a single strong band at 1,408 cm“* corresponding to 
the Nt:-C 2 -N^ symmetric stretch. This band is useful 
for quantitating the degree of protonation and for 
determining the pKa of histidine residues. 

UVRR Spectra of Peptide Bonds 

Deep UV wavelengths are needed to access the 7 t- 7 i* 
transitions (^180 nm, s ^ 9 mM“* cm”*) of 
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the -C(0)-NH- peptide bond (Austin et al. 1993). 
(The ^210 nm n-7i* transition is weak, and does not 
produce UVRR enhancement.) 

The main UVRR bands are sensitive to the peptide 
conformation, and the contributions from different 

Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Table 3 Funda¬ 
mental ring vibrations of tryptophan bands observed in UVRR 
spectra 


v/cm ' 

Label 

Assignment 

1,622 

W1 

Benzene-localized mode similar to v 8 a with 
some N i-Cg stretch 

1,579 

W2 

Benzene-localized mode similar to v 8 b 

1,552 

W3 

C 2 -C 3 pyrrole-localized mode 

1,496 

W4 

Similar to benzene vl9b 

1,462 

W5 

Similar to benzene vl9a 

1,435 

W 6 

N 1 -C 2 -C 3 stretch coupled to Nj-Fl bend 

1,360/ 

1,342 

W7 

Split due to Fermi resonance 

1,305 

W 8 

C 3 -C 9 stretch + N i-H bend 

1,238 

WIO 

C-H + C 3 -Cext stretch 

1,127 

W13 

Similar to benzene v9b C-H bend 

1,012 

W16 

Similar to benzene ring breathing C-C stretch 

879 

W17 

Similar to benzene V 12 -r NiH bend 

759 

W18 

Indole ring breathing 


elements of secondary structure have been evaluated 
through least squares fitting of proteins with known 
and varied structures (Huang et al. 2006). Figure 5 
compares the fitted spectra for a-helix and (l-sheet 
structures with spectra of poly-L-lysine under condi¬ 
tions favoring these structures, and also shows htted 
spectra for P-turn and for unordered segments. 

The strongly enhanced modes are associated with 
CO stretching (amide I, ^1,654 cm“*), and with out- 
of-phase (amide II, ^1565 cm“*) and in-phase (amide 
III, ^ 1,250 cm”*) combinations of CN stretching and 
NH bending coordinates. NH/D exchange in D 2 O has 
little effect on amide I, but lowers amide II and III to 
-1,510 and -960 cm”*. 

Frequency ranges for the amide I, II, and III bands 
are given in Table 5. Amide III, which contains con¬ 
tributions from several internal coordinates, is partic¬ 
ularly sensitive to conformation (Mikhonin et al. 
2006). Another useful band is amide S (—1,400 
cm”*), also known as 5CotH. It involves bending of 
the Cq(H bond, and disappears in the a-helix conforma¬ 
tion making it particularly useful for monitoring helix 
transformations. 

Proline residues are a special case, because the cyclic 
side-chain eliminates the N-H bond (secondary amide). 


Ultraviolet Resonance 
Raman (UVRR) 
Spectroscopy Studies of 
Structure and Dynamics of 
Proteins, Fig. 3 229 nm - 
excited UVRR spectrum 
of L-tryptophan in water. 

Inset - absorption spectrum 
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Ultraviolet Resonance 
Raman (UVRR) 
Spectroscopy Studies of 
Structure and Dynamics of 
Proteins, Fig. 4 229 nm - 
excited UVRR spectrum of 
L-histidine in water. Inset - 
absorption spectrum 



Raman Shift (cm ’') 


Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Table 4 Funda¬ 
mental ring vibrations of aqueous histidine observed in UVRR 
spectra 


N,:H v/cm ' 

N,iH v/cm ' 

Assignment 

1,575 

1,593 

C 4 =C 5 stretch 

1,453 

1,435 

N-C 2,5 Stretch -r NH bend -l- 
Cext-H bend 

1,324 

1,360 

N 71 -C 4 stretch - 1 - N. 1 -C 5 stretch 

1,286 

1,266 

Nx-C 4 stretch + C-Cext stretch 

940 


Ring breathing 


thereby removing the amide II interaction between C-N 
stretching and N-H bending. The proline amide II 
occurs at ^1,460 cm”*, and is sensitive to H-bonding 
with solvent or other amino acid residues to its CO 
group (Jordan et al. 1996). Also the electronic absorp¬ 
tion of proline amide is somewhat red-shifted, relative to 
primary peptide bonds, so that the proline amide II band 
can be selectively detected by excitation at 210 nm. 

Most UVRR studies of proteins have been carried 
out at 197-206 nm to monitor backbone structure and 
the solvation status of phenylalanine residues, and at 


229-244 nm, to investigate the environment of tyro¬ 
sine and tryptophan residues. Histidine studies are best 
carried out at 220 nm, while proline can be monitored 
at 210 nm. 


Protein Dynamics from Time-Resolved UVRR 
Spectroscopy 

Dynamics of Allostery in Hemoglobin 

Hemoglobin (Hb) serves to illustrate the use of 
visible pump-UVRR probe techniques in studying 
how proteins change their shape in response 
to a photochemical event. When CO is bound to Hb, 
it can be dissociated with ^100% quantum efficiency 
by visible light. A visible laser pulse produces 
photodissociated Hb in the R quaternary state, charac¬ 
teristic of ligated Hb, from which it relaxes to 
the T quaternary state characteristic of unligated 
Hb, before CO rebinds from solution (Sawicki and 
Gibson 1976). Subsequent to the pump pulse, UV 
probe pulses generate UVRR spectra that evolve in 
time (Fig. 6), monitoring motion of specific groups in 
the protein. 
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Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Fig. 5 197 nm - 

excited UVRR spectrum of aqueous poly-L-lysine in ot-helical 
(pH 12.0,10°C, 40% TFE) or (f-sheet (pH 11.6 at 50“C) structure 
(gray)(JiJi et al. 2006), and consensus spectra of ot-helix (black), 
P-sheet (black), P-tum (blue), and coil (red) structures from 
least-squares fitting to a set of 12 structurally characterized pro- 
teins(Huang et al. 2006). The 1,625-1,575 cm^’ spectral seg¬ 
ment was removed because of variable contributions of aromatic 
side chain bands to the protein spectra 


Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Table 5 Amide 
I, II, and III wave number ranges for secondary structure types 



Wave number/cm ’ 


Structure type 

Amide I 

Amide II 

Amide III 

a-helix 

1,645-1,660 

1,545-1,552 

1,260-1,300 

P-sheet 

1,665-1,680 

1,551-1,562 

1,230-1,240 

Unordered 

1,660-1,665 

1,552-1,560 

1,240-1,250 


With 229 nm excitation, negative tyrosine and tryp¬ 
tophan difference bands are initially seen, signaling the 
breaking of H-bonds that anchor the E and F helices. 
These helices hold the heme like a clamshell (Fig. 7) 


LO 



1600 1400 1200 1000 

Raman Shift (cm-t) 

Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Fig. 6 229 nm - 

excited difference UVRR spectra at the indicated time intervals 
(ps) following photodissociation of HbCO, compared with static 
HbCO and deoxyHb (unligated Hb) spectra (Balakrishnan et al. 
2004) 

and move in response to forces generated by ligand 
dissociation. At later times, new difference signals 
arise, which have been shown to arise from quaternary 
contacts formed in the T state, at intersubunit contacts 
(Fig. 7). The UVRR spectra reveal that the “hinge” 
contact is formed before (3 ps) the “switch” contact 
(20 ps), and establish that the quaternary transition is 
not concerted, but occurs in stages. After 20 ps, the 
spectra are the same as the static difference spectrum 
between deoxyHb and HbCO, but diminish after 50 ps 
as CO rebinds to complete the photocycle 
(Balakrishnan et al. 2004). 

Similar techniques have been applied to bacterio- 
rhodopsin (Ames et al. 1992) and other photoactive 
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Ultraviolet Resonance 
Raman (UVRR) 
Spectroscopy Studies of 
Structure and Dynamics of 
Proteins, Fig. 7 Ribbon 
diagram of the aiP 2 subunits 
in deoxy Hb (T-state), 
showing the critical 
intersubunit contacts that form 
during the R to T structural 
transition (rotation of one aP 
dimer against the other) 
(Perutz 1990) in Hb 


'switch contact’ 



proteins. The timescale can be from picoseconds to 
milliseconds, depending on laser sources and sample 
characteristics. 

Helix-Coil Transition 

A study of helix melting in a synthetic alanine-based 
polypeptide (Balakrishnan et al. 2007; Mikhonin et al. 
2007) illustrates how secondary structure can be mon¬ 
itored in response to a laser-induced temperature jump, 
using deep UV excitation of the Raman spectrum 
(Fig. 8). 

Nanosecond T-jumps (6—>37°C), produced by 
heating the solvent with 1.9 pm IR laser pulses, 
were used to perturb the helix-coil equilibrium. 
Structural relaxation to the new equilibrium position 
was monitored by Raman spectra generated with 
197 nm probe pulses. The difference spectra reveal 
intensity losses and gains for characteristic helix 
and coil bands, and evolve on the sub-microsecond 
timescale toward a response closely resembling the 
static difference spectrum for a similar temperature 
interval. 

Analysis of the time dependence revealed two 
phases for the helix-coil transition, a <40 ns phase 


and a ca. 120 ns phase. The latter is independent of 
the T-jump magnitude, while the former increases in 
amplitude with increasing temperature. These phases 
are attributed to an enthalpic process, involving dis¬ 
ruption of H-bonds, followed by an entropic process, 
involving reorientation of peptide links as the chain is 
extended (Balakrishnan et al. 2007). 

Unfolding of Apomyoglobin 

Unfolding mechanisms in proteins can also be studied 
by T-jump initiation, using 229 nm UVRR probe 
pulses to monitor the environment of structurally 
important tyrosine or tryptophan residues, as illus¬ 
trated by apomyoglobin. 

Upon heme removal and lowering of the pH to 4, 
several helices in the myoglobin structure become 
disordered, but a tightly packed core is retained, 
consisting of the A, G, and H helices, and the 
C-terminal part of the B helix (Fig. 9). The tyrosine 
residues are all exposed to solvent, and their UVRR 
bands are almost insensitive to temperature (Fig. 10). 
However, the tryptophan bands show strong intensity 
losses in response to a 45^60° T-jump, a temperature 
change that spans the global melting transition 
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1800 1600 1400 1200 


Raman Shift (cm-i) 

Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Fig. 8 197 nm - 

excited UVRR difference spectra (pump + probe minus probe 
only) following a 6 —► 37°C T-jump for a helical peptide, Ac- 
GSPEA 3 KA 4 KA 4 -CO-D-Arg-CONH 2 (Balakrishnan et al. 
2007), compared with a static UVRR spectrum at 10°C and 
a static difference spectrum between 42° C and 10°C 


(Tn, = 50°C) of apomyoglobin, as determined by CD 
measurements, and from the 197 nm-excited amide 
UVRR bands. 

Myoglobin has two tryptophan residues, one of 
which (TRP7) is exposed to solvent in the 
apomyoglobin structure, while the other (TRP14) is 
directed into the AGH helical core (Fig. 9). Exposure 
of the latter to solvent upon protein melting accounts 
for the intensity loss. Analysis of the time dependence 
of both tryptophan and amide bands, obtained at 229 
and 197 nm excitation, respectively, showed that the 



Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Fig. 9 Ribbon 
diagram of myoglobin, showing the core remaining when the 
heme is removed and the pH is lowered to 4 


tryptophan response was significantly faster than helix 
melting (7 vs. 14 ps). This observation suggests that 
loosening of tertiary contacts, as signaled by the tryp¬ 
tophan exposure, is a distinct step in the mechanism of 
protein unfolding (Huang et al. 2005). 


Summary 


UVRR spectroscopy offers structural probes of proteins 
with high sensitivity and selectivity. Excitation in the 
deep UV (206-197 nm) enhances amide bands of the 
protein backbone, which are sensitive to secondary struc¬ 
ture. Phenylalanine bands are also strongly enhanced and 
report on solvent exposure. Excitation at 229 nm selec¬ 
tively enhances tyrosine and tryptophan bands, which are 
sensitive to specific H-bonding as well as solvent expo¬ 
sure. In favorable cases, proline and histidine can also be 
probed, using 210 and 220 nm excitation. 

Because of its structural information content, 
UVRR is a particularly useful method of studying 
protein dynamics, and the mechanism of conforma¬ 
tion change. These changes can be initiated 
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ID 

T-ex 229 nm g 






Raman Shift (cm '') 

Ultraviolet Resonance Raman (UVRR) Spectroscopy Stud¬ 
ies of Structure and Dynamics of Proteins, Fig. 10 229 nm 

- excited UVRR difference spectra of apoMb at pH 4 following 
a 45 ^ 60°C T-jump (Huang et al. 2005), compared with a static 
UVRR spectrum at 45°C and a static difference spectrum 
between 45 and 61°C 

photochemically or via temperature jumps, using vis¬ 
ible or infrared laser pump pulses, and followed by 
Raman-generating UV probe pulses. 

Cross-References 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Infrared Spectroscopy of Protein Structure 

► Infrared Spectroscopy: Data Analysis 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 


► Nonlinear Raman Spectroscopy: Coherent Anti- 
Stokes Raman Scattering (CARS) 

► Raman Optical Activity Studies of Structure and 
Behavior of Biomolecules 

► Raman Spectroscopic Instrumentation, 
Experimental Considerations 

► Surface Enhanced Raman Spectroscopy in Electron 
Transfer Studies (Hildebrandt) 

► Surface-Enhanced Raman Spectroscopy for 
Bioanalytics 

► Ultrafast Raman Spectroscopy 
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Synonyms 

Hydrodynamics of Macromolecules 


Definition 

Macromolecular hydrodynamics concerns the mea¬ 
surement of biophysical parameters that describe the 
conformation of large biopolymers in solution; model¬ 
ing of these parameters permits the testing of putative 
structural models. 


Introduction 


The hydrodynamics of hiomacromolecules (e.g., pro¬ 
teins, nucleic acids, polysaccharides), as described 
by observables such as the translational diffusion 
coefficient (► Dynamic Light Scattering), sedimenta¬ 
tion coefficient (► Sedimentation Velocity Analytical 
Ultracentrifugation), rotational relaxation time 
(► NMR), and intrinsic viscosity, can be measured 
accurately. However, the interpretation of these 
parameters in terms of solution macromolecular struc¬ 
ture is not a straightforward task. This entry concerns 
a method for the construction of bead models to permit 
the computation of hydrodynamic (and related) 
parameters. 

Bead modeling methods were pioneered in the late 
1960s - early 1970s hy Bloomfield and collaborators 
(Garcia de la Torre and Bloomfield 1981). Simply put, 
in bead modeling, a macromolecule is represented by 
a collection of n beads of a certain radius, appropriately 
positioned in space. The frictional properties of the 
ensemble are calculated by solving a system of n linear 
equations with 3n unknowns (Byron 2008) 
(► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules). With the explo¬ 
sion in the number of high-resolution protein structures 
determined beginning in the late 1980s, the interest in 
biomacromolecular hydrodynamics shifted from low- 
resolution modeling (e.g., Garcia de la Torre 1989; 
Spotorno et al. 1997) to methods able to accurately 
compute the solution properties indirectly from the 
atomic coordinates. Hydrodynamic models are gener¬ 
ated from atomic Cartesian coordinates using at least 
two different approaches. The grid-based (AtoB 
(Byron 1997)) method encloses the high-resolution 
structure in a rectangular “box,” subdivided into 
cubes (the dimension of which is the user-defined 
model resolution) in each of which is placed a bead 
that represents the mass and volume of the atoms 
contained in that cube. The resultant model then 
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provides input for programs that compute hydrody¬ 
namic parameters, for example, HYDRO (► HYDRO 
Suite of Computer Programs for Solution Properties of 
Rigid Macromolecules). The shell-modeling (Garcia 
de la Torre et al. 2000) method covers the structure 
with small beads. In subsequent computations, the 
hydrodynamic parameters are determined for shells 
of beads of decreasing radii with a final extrapolation 
to zero bead radius. The shell-modeling approach has 
culminated in a very popular public-domain program, 
HYDROPRO (► HYDRO Suite of Computer Pro¬ 
grams for Solution Properties of Rigid Macromole¬ 
cules). Both methods, however, have their 
drawbacks: for instance, neither realistically represents 
hydrodynamic hydration (see below), and the increas¬ 
ingly large number of beads required for extrapolation 
somewhat limits the shell-modeling procedure when 
dealing with large structures/ensembles. 

For these reasons, in 1997, we began to develop an 
alternative method to reduce the atomic coordinates to 
fewer beads and hydrate the model realistically. In 
SOMO (Solution MOdeler) pre-defined segments of 
a biomacromolecule (e.g., main- and side-chain 
regions of each amino acid in a protein) are represented 
as appropriately positioned beads. The volume of each 
bead is initially set equal to the volume of the atoms it 
represents, plus that of the theoretically bound water of 
hydration. This generates an ensemble of overlapping 
beads of different radii. Appropriate methods of bead 
overlap removal were then developed, since the hydro- 
dynamic interaction tensor employed to compute fric¬ 
tional properties is valid for beads of different radii 
only when they do not overlap (Garcia de la Torre and 
Bloomheld 1981). The original SOMO method was 
validated against a small set of very well-characterized 
proteins (Rai et al. 2005), and was made freely avail¬ 
able as a collection of command-driven executables 
for the Linux operating system. 

More recently, SOMO was completely rewritten 
with a user-friendly graphical user interface (GUI), 
further validated against an extended range of proteins, 
and included as an additional tool within the popular 
public domain, open source analytical ultracentrifuga¬ 
tion data analysis, and modeling program UltraScan 
(US, Demeler (2005); http://www.ultrascan.uthscsa. 
edu/). This US-SOMO implementation, which 
includes an improved version of AtoB (Brookes 
et al. 2010a, b), and its ongoing development, is 
described here. 


Description of the Basic Principles: Direct 
Correspondence (Original SOMO) Method 

At the heart of the direct correspondence method lies 
an informed choice of the structural elements to be 
represented by beads of appropriate volume and spatial 
location. For proteins, the default choice is to represent 
each main-chain and side-chain segment with a bead. 
For the main-chain segment, two alternatives are avail¬ 
able, the main-chain of each nth amino acid (N-CA- 
C-CO)„, or the peptide bond between the nth and 
(nth + 1) residue (CA-C-CO)„N(„ + i); the second 
option reduces the chances of overlap between the 
main- and side-chain beads. The initial spatial location 
of each bead is chosen taking into account the nature of 
the segment it represents. For the main-chain (or pep¬ 
tide bond) and for the hydrophobic and non-polar side- 
chains, the bead is usually placed at the center of mass 
of the atoms involved, while for polar and charged 
side-chains the bead is located toward the end of the 
side-chain. Similar rules are applied for the sugar units 
in carbohydrates, and for the sugars and bases forming 
the nucleotides in RNA/DNA and co-factors such as 
NAD. The anhydrous volumes of these beads are set 
equal to those of the segments they represent, taken 
from literature analyses of crystallographic data (Per¬ 
kins 1986; Tsai et al. 1999; Nadassy et al. 2001; Voss 
and Gerstein 2005), or they can be calculated from 
structural models using dedicated software (e.g., the 
3'^ Contact Volume Calculator (http://www. 
molmovdb.org/cgi-bin/3v.cgi)). 

The second innovation introduced by the SOMO 
method is a more realistic treatment of the water of 
hydrodynamic hydration; in shell-modeling, and in 
the original grid-based (AtoB) approaches, hydration 
was implemented by uniform expansion of the model 
or by addition of a uniform shell on the model surface. 
In the direct correspondence method, the number of 
water molecules assigned to each bead is based on the 
theoretical, statistical hydration values determined by 
Kuntz and Kauzmann for each residue using NMR 
freezing (Kuntz and Kauzmann 1974). Importantly, 
on the basis of crystallographic studies (Gerstein and 
Chothia 1996) the volume of these water molecules is 
considered to be different from that of bulk water 
molecules. Traditionally, hydrodynamic hydration 
has been conceptualized as a layer of strongly bound 
water (with a density higher than that of bulk water) 
on the macromolecular surface. But this is at odds 
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with the very short residence times measured by solu¬ 
tion NMR spectroscopy for water on protein surfaces, 
that is, the water is not actually strongly bound. To 
precisely describe the frictional interaction between 
the protein surface and this rapidly diffusing water 
would be immensely complex. Fortuitously, it tran¬ 
spires (Halle and Davidovic 2003) that assigning 
strongly bound water molecules, although perhaps 
being not correct in principle, in practice compensates 
for these altered solvent properties at well-defined 
locations along the protein surface. A similar mecha¬ 
nism is evidently operating for other types of 
biomacromolecules (e.g., nucleic acids, carbohy¬ 
drates, etc.), and therefore, this approach can be 
applied in general. Operationally, the volume of the 
theoretically bound water molecules is then added to 
each corresponding bead, thereby accounting for the 
local variation in hydration. 

The third innovation is a pre-screening of the 
biomacromolecule to identify buried and exposed seg¬ 
ments. This information is maintained when the struc¬ 
ture is converted into beads, which are then labeled as 
being either buried or exposed. A further distinction is 
also made between exposed main- and side-chain seg¬ 
ments. This information is important to greatly reduce 
the number of beads that are subsequently included in 
hydrodynamic computations since it is only the beads 
that contribute to the surface frictional interaction with 
solvent that need to be considered. 


Overview of UltraScan-SOMO (US-SOMO) 
Function 

The GUI-based US-SOMO program generates bead 
models from atomic structures and computes hydrody¬ 
namic parameters for the resultant models. In Fig. 1, 
the main GUI interface is presented, showing how the 
various operations are subdivided into three areas; 
PDB Functions, Bead Model Functions, and Hydrody¬ 
namic Calculations. In addition, several pull-down 
menus are accessible from the top bar, while the 
right-hand-side window reports specific information 
and the process flow. In US-SOMO, bead model gen¬ 
eration can proceed using the direct correspondence 
(hereafter called SoMo) method (Rai et al. 2005) or via 
the upgraded grid (AtoB) procedure (Byron 1997; 
Brookes et al. 2010a, b). In both cases, the input atomic 
structures must be in protein data bank (PDB) format. 


Reliable, accurate results can be obtained if the struc¬ 
tures are complete, in the sense that there are no miss¬ 
ing residues; each residue contains complete main- and 
side-chains and the residues present in the macromol¬ 
ecule used for solution experimental studies are pre¬ 
sent in the input coordinate file. Since in US-SOMO 
the translation of an atomic-level structure into a bead 
model relies on the proper recognition of atoms and 
residues as coded in the PDB-formatted file, a number 
of look-up, user-editable tables have been defined 
(“Select Lookup Table” pull-down menu in Fig. 1). 
A first, basic table (default: somo.hybrid) contains the 
definition of the “hybridization” (see Tsai et al. 1999) 
of each “atomic group,” that is, a non-hydrogen atom 
plus the hydrogens bound to it (which are not explicitly 
considered in bead model generation). Each entry in 
the table, for example, C4H3 (tetrahedral carbon with 
three hydrogens bound), C3H0 (trigonal carbon with 
no hydrogen bound), etc., is associated with a radius, 
molecular mass, neutron scattering length, number of 
exchangeable protons, and total number of electrons. 
In addition, each entry in this table is linked to an entry 
in a small angle X-ray scattering (SAXS) coefficients 
table (default: somo.saxs_atoms) taken from the Inter¬ 
national Tables of Crystallography. A third table 
(default: somo.atom) contains the definitions of the 
atomic groups as found in the PDB files (e.g., N, CA, 
CO, and CB) with their associated radii and molecular 
masses taken from the hybridization table, plus the 
SAXS excluded volume and a link to the SAXS coef¬ 
ficient table. The last table (default: somo.residue) is at 
the heart of the model building procedure, as it con¬ 
tains the definition of each residue present in a PBD- 
formatted file (e.g., ARG, LYS, MAN, DA, DT, and 
NAD) with its global properties (anhydrous molecular 
volume, partial specific volume, accessible surface 
area), the atoms comprising it (picked from the atom 
definition table) with their hydration number, the num¬ 
ber of beads used to represent it, the assignment of 
each atom to a particular bead, and the rules for posi¬ 
tioning each bead. Each bead anhydrous volume must 
be defined here, and the final, hydrated volume is 
then computed using the atom hydration numbers 
or a directly entered value. Each bead molecular 
weight is also automatically calculated as the sum of 
masses of atoms for which it accounts. Currently 
(December 2011), about 550 and 100 entries are 
included in the distributed versions of the somo.atom 
and somo.residue tables, respectively. Additional 
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US-SOMO: Methods for Construction and Hydration of 
Macromolecular Hydrodynamic Models, Fig. 1 The main 
US-SOMO GUI interface. The left-hand side of the window is 
divided into the three sequential steps in construction and 


evaluation of hydrodynamic bead models: PDB Functions, 
Bead Model Functions, and Hydrodynamic Calculations. The 
right-hand panel reports on modeling progress 


residues, atoms, and hybridizations can be user- 
defined using the residue, atom, and hybridization edi¬ 
tors, but this is not a straightforward operation, as it 
requires detailed physicochemical and structural 
inputs. To partially alleviate this problem, approximate 
methods are available within US-SOMO to deal with 
non-defined residues and for cases where it is not 
possible to complete the structure, with the caveat 
that the quality of the result will be diminished some¬ 
what in proportion to the number of non-defined and/or 
missing (exposed) residues. PDB files containing mul¬ 
tiple models, such as those produced by NMR spec¬ 
troscopy, are also acceptable. In such cases, US- 
SOMO can process the complete set or a subset of 
structures. 

Once a PDB-formatted file has been successfully 
uploaded, bead model generation can proceed 
according to the two methods available. Both the 
SoMo and AtoB methods work in a stepwise manner, 
schematically depicted in Fig. 2, and each step has 


many user-selectable options controlled from pull¬ 
down menus (accessible from the “SOMO” menu in 
the top bar, see Fig. 1). 

In SoMo, the accessible surface area (ASA) is first 
determined, labeling each main- and side-chain as 
being either buried or exposed. The beads 
corresponding to exposed side-chains are then placed 
(A ^ B) and their overlaps removed (B ^ C); this 
procedure involves first fusing together (“popping”) 
beads that overlap by more than a pre-set threshold, 
and then proportionally reducing the bead radii either 
hierarchically (the couple with the largest overlap first, 
then the others) or synchronously (the radii of all 
overlapping beads are reduced by a percentage of 
their original radius, and the procedure is repeated 
until no overlaps remain). In addition, to preserve the 
topography of the original surface as much as possible, 
the bead centers are moved outwardly along a line 
connecting them to the center of mass of the macro¬ 
molecule by the same amount as their radii are being 
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SoMo Method 


US-SOMO: Methods for Construction and Hydration of 
Macromolecular Hydrodynamic Models, Fig. 2 Schematic 
representation of the two algorithms for bead model construc¬ 
tion: SoMo {left-hand side) and AtoB {right-hand side). The grid 
superimposed on the space filling representation in the AtoB 
panel is conceptual only and is in practice a three-dimensional 

reduced (“outward translation”). In the next step (C ^ 
D), the main-chain exposed beads (blue) are placed 
and their overlaps removed using one of the procedures 
described above, but without the outward translation. 
Finally (D ^ E), buried beads (orange) are placed and 
their overlaps removed, again without outward trans¬ 
lation. The entire bead model is then re-screened for 
ASA, because some beads might have changed their 
exposed/buried status during the overlap removal 
procedure. 

In AtoB, after an appropriate grid is defined 
(A B), all beads are generated in a single step 
(B —> C) and are then screened for ASA (C —> D; 
orange, buried; red, exposed). Overlaps are then 
removed in the exposed subset (D ^ E) using either 
the hierarchical or the synchronous procedure, with 
outward translation. Then, the same procedure is 
applied to the buried subset (E ^ F), without outward 
translation, and the entire set is re-screened for ASA, 
resulting in many more beads becoming exposed. 

Figure 3 shows a comparison of two starting struc¬ 
tures and their bead models produced by SoMo and 
AtoB. Both methods provide excellent hydrodynamic 
results (see below) (Brookes et al. 2010b). In general, 
the SoMo method should be used for structures up to 



AtoB Method 

“cage.” Key steps in the algorithm (A —> B, etc.) are descrihed in 
detail in the text. The color-coding in SoMo steps B ^ E is: blue, 
main-chain; cyan, hydrophobic; magenta, non-polar; yellow, 
basic; green, acid; white, fused beads; orange, buried beads. 
The color-coding in AtoB steps D —► F is; orange, buried 
beads; red, exposed beads 

300-400 kDa and the AtoB method, with an appropri¬ 
ate grid spacing, should be used with larger structures 
to limit the total number of beads created, thus reduc¬ 
ing computational time. Previously generated bead 
models can also be re-loaded in US-SOMO, in which 
case their pre-recorded generation procedures will be 
listed in the progress window (Fig. 1 , right-hand-side 
window). Besides bead models produced via the SoMo 
or AtoB methods, other external formats can be read by 
US-SOMO, including BEAMS (Spotomo et al. 1997) 
and those produced by DAMMIN/DAMMIE (Eranke 
and Svergun 2009) (► Wide and Small Angle X-Ray 
Scattering). The repertoire of accepted input formats 
will continue to expand as US-SOMO is developed. 

Once a bead model is either generated or loaded, 
US-SOMO can compute a large number of rigid-body 
hydrodynamic parameters, including translational and 
rotational diffusion and frictional coefficients; sedi¬ 
mentation coefficients; Stokes radius; relaxation 
times; and intrinsic viscosity. The computed parame¬ 
ters are listed in a text format file. When multiple 
models are generated and analyzed, the resultant 
parameters include averages and standard deviations. 
Sets of parameters can be selected for saving in 
a spreadsheet compatible format (CSV). 
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US-SOMO: Methods for Construction and Hydration of 
Macromolecular Hydrodynamic Models, Fig. 3 A compar¬ 
ison of input coordinates (shown in space-filling representation, 
middle row) and the resultant SoMo {top row) and AtoB {bottom 
row) models for PDB file 1A4V (human a-lactalbumin, left- 
hand side) and 2CGA (bovine chymotrypsinogen, right-hand 
side). Color coding is as for Fig. 2 

US-SOMO provides a multitude of configuration 
options. The default options are set to values that 
should provide good results for the majority of ana¬ 
lyses. Besides the aforementioned residue, atom and 
hybridization editors, the user can modify configura¬ 
tion options for: the parsing of the PDB, including 
switches for the automatic bead builder and the 
approximate method used for incomplete structures; 
the ASA computations; the SoMo and AtoB bead 
model building settings; and the hydrodynamic com¬ 
putations, including solvent type and conditions. The 
current configuration is automatically saved to file 
upon exiting the program and a full facility is available 
to save and load various configurations. In addition, 
US-SOMO includes a powerful GUI-based batch 
mode option. The batch mode can handle processing 
of thousands of PDB and/or bead models. For instance, 
one use of the batch mode could be to screen a large 
collection of structures against experimentally 
obtained hydrodynamic data. 


US-SOMO Performance and Comparison 
with Other Available Programs 

The original SOMO method was tested against a set of 
extremely well-characterized, small proteins, whose 
solution data were critically reevaluated (Rai et al. 
2005). More recently (Brookes et al. 2010a, b), the 
test was extended to a larger set of proteins spanning 
a molecular mass range of 6.5-235 kDa, again after 
critical re-evaluation of the experimental data. For the 
translational diffusion properties, the experimental 
data were matched by US-SOMO to within 1-2%, 
with just a few exceptions (3 out of 14 proteins). 
More difficult to assess is the performance for other 
parameters, such as the rotational relaxation time and 
the intrinsic viscosity. For the former, no data are 
available for proteins above 65 kDa, and the relaxation 
processes of smaller proteins, often measured by NMR 
spectroscopy, are greatly affected by local flexibility - 
therefore, rigid body hydrodynamic computations are 
unlikely to satisfactorily reproduce experimental data. 
In any case, on average, US-SOMO can match exper¬ 
imental rotational relaxation times to within 10%, not 
far from the usual reported experimental errors. Values 
computed for the intrinsic viscosity are more erratic, 
differing from experimentally determined data by any¬ 
thing from a few percent to more than 10%. This less 
than satisfactory outcome probably stems from two 
factors: (1) the approximation inherent in using rigid 
body hydrodynamic computations for molecules that 
include flexibly linked domains, as for the rotational 
correlation times and (2) most intrinsic viscosity data 
found in the literature are quite old and may be inac¬ 
curate. It is, therefore, difficult to fully evaluate the 
accuracy of the US-SOMO-computed values for intrin¬ 
sic viscosity data. Further improvements are also 
expected with a better treatment of the so-called 
“volume correction” affecting only the rotational 
relaxation and intrinsic viscosity (see Brookes et al. 
2010a, b for a more detailed account). 

Finally, we can compare how US-SOMO compares 
with HYDROPRO (► HYDRO Suite of Computer 
Programs for Solution Properties of Rigid Macromol¬ 
ecules), currently the most widely used program for 
computing solution hydrodynamics of biomacro¬ 
molecules. Comparison of the values computed for 
the extended set of test proteins by HYDROPRO 
(► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules) (Garcia de la 
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Torre et al. 2000; Garcia de la Torre 2001) and US- 
SOMO with the carefully re-evaluated literature exper¬ 
imental data (Brookes et al. 2010a, b) generated % 
differences indicating that US-SOMO appears to be on 
average more accurate (see, e.g.. Table 5 of Brookes 
et al. 2010b). In our opinion, this increase in accuracy 
results from a more faithful description of the overall 
macromolecular shape, and from a more realistic treat¬ 
ment of the water of hydration. However, the price to 
pay for this, justified by the corresponding ever- 
increasing precision of experimental data, is the require¬ 
ment for a very precise definition of the basic structural 
elements in US-SOMO. Indeed, while HYDROPRO 
(► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules) will analyze 
directly any biomacromolecular structure without 
requiring any additional definition, the best results are 
obtained with US-SOMO when every element is prop¬ 
erly defined. For “standard” biomacromolecules (pro¬ 
teins or nucleic acids), this should not be a problem, as 
the most common residues are already defined in the 
US-SOMO tables, but it could become more compli¬ 
cated when non-standard residues or prosthetic groups 
are an integral part of the structure under examination. 

Ongoing Implementations and Future 
Developments in US-SOMO 

US-SOMO is under active development. Much pro¬ 
gress has been made on the implementation of 
a comprehensive small angle X-ray (► X-Ray Scatter¬ 
ing of Lipid Membranes) and neutron (► Neutron 
Scattering of Membranes) scattering (SAXS/SANS) 
module. Presently, this module computes the pair¬ 
wise distance distribution function, p(r) versus r, 
from PDB coordinate files and bead models. The com¬ 
putation of the scattered intensity versus momentum 
transfer, I(q) versus q, is being (2011) installed offer¬ 
ing multiple I(q) methods including a native full Debye 
procedure and an interface to Crysol (► Wide and 
Small Angle X-Ray Scattering), and will allow direct 
comparison of the original structure or a bead model 
with the experimental data. The SAXS/SANS module 
calculations have will also been extended to batch 
mode. The other principal area of development deals 
with structures presenting substantial local or large- 
scale flexibility for which hydrodynamic computations 
are currently undertaken using either dedicated 


programs for segmentally flexible structures with two 
subunits (Spotomo et al. 1997) or Monte Carlo 
methods within the rigid-body approximation (e.g., 
MONTEHYDRO ►HYDRO Suite of Computer 
Programs for Solution Properties of Rigid Macromol¬ 
ecules). However, more appropriate methods, directly 
simulating the local or large-scale motions, should 
provide better results (Garcia de la Torre et al. 2009). 
To this end, we are extending US-SOMO to produce 
collections of structures using particle dynamics 
methods such as discrete molecular dynamics (DMD 
Ding et al. (2008)), which could be then analyzed using 
rigid-body computations. Alternatively, the DMD sets 
will be used to derive constraints (spring forces, range 
of displacements) that could be parameterized in 
meso-scale simulations such as Brownian dynamics 
(BD) (e.g., SIMUFLEX ► HYDRO Suite of Computer 
Programs for Solution Properties of Rigid Macromol¬ 
ecules) utilizing the US-SOMO-derived bead models 
as the starting structures. 

Summary 

In hydrodynamic bead modeling, a macromolecule is 
represented by an assembly of spheres, for which it is 
possible to accurately compute a series of hydrody¬ 
namic, and associated, solution parameters. This facili¬ 
tates the testing of structural models based on the 
agreement, or otherwise, of the computed parameters 
with those measured experimentally. This approach has 
particular utility for systems that do not readily crystal¬ 
lize, are too large for structure determination via NMR 
spectroscopy, comprise many non-covalently associ¬ 
ated components, or all of these. US-SOMO is software 
that generates bead models from high-resolution coor¬ 
dinates (of the molecule of interest or a homology 
model thereof) via a choice of two algorithms: SoMo 
or AtoB. SoMo retains the explicit chemical character¬ 
istics of the constituent molecular building blocks 
(e.g., residues or nucleotides, etc.) while AtoB is able 
to dramatically reduce the number of beads within 
a model - useful when computing the hydrodynamics 
of very large complexes. In its latest incarnation, 
US-SOMO realistically hydrates bead models generated 
by both the SoMo and AtoB algorithms. The method has 
performed well when extensively and critically tested 
against a number of well-characterized proteins, and its 
development is ongoing. 
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Cross-References 

► Dynamic Light Scattering 

► HYDRO Suite of Computer Programs for Solution 
Properties of Rigid Macromolecules 

► Neutron Scattering of Membranes 

► NMR 

► Sedimentation Velocity Analytical 
Ultracentrifugation 

► Wide and Small Angle X-Ray Scattering 

► X-Ray Scattering of Lipid Membranes 
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Synonyms 

DNA; Protein absorbance spectroscopy; RNA 


Definition 

Proteins, DNAs and RNAs are linear polymers where 
a limited set of residues are joined together by. 
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respectively, the amide or phosphodiester bonds. The 
situation is similar for carbohydrates though the 
linking options are more varied here. To a first approx¬ 
imation, the absorbance spectrum expected for 
a biomolecule is the sum of the component parts. 
However, the polymerization process and the packing 
of the units perturb the spectra as discussed below 
(Berova et al. in press; Norden and Rodger 2010; 
Wallace and Janes 2009). 

Basic Characteristics 

Electronic Spectroscopy of Nucleic Acid Bases 

The absorption spectra of nucleic acids in the 
easily accessible region of the spectrum (down to 
^180 nm) are dominated by n ^ n* transitions, 
which are, by symmetry, all polarized in the plane of 
the bases, so approximately perpendicular to the 
helix axis in B-DNA. The absorbance spectra of the 
DNA bases thymine (T), adenine (A), guanine (G), 
and cytosine (C) and the current best estimates 
of their transition polarizations are illustrated in 

Fig. 1. 

Electronic Spectroscopy of DNA and RNA 

In the case of nucleic acids (DNA and RNA), the UV 
absorbance from 200 to 300 nm is due exclusively to 
transitions of the planar purine and pyrimidine bases. 
The backbone begins to contribute at about 190 nm. 
The accessible region of the spectrum (nitrogen purg¬ 
ing is required below ^200 nm as oxygen absorption 
interferes with the spectrum) is therefore dominated 
by 71 —> Tt* transitions of the bases. The UV absor¬ 
bance spectra of the bases (Fig. 1) look as if there are 
two simple bands; however, each “simple” band 
observed is a composite of more than one transition. 
This makes detailed analysis of DNA absorption dif¬ 
ficult, but usually ensures that the absorption spec¬ 
trum changes when the system is perturbed in any 
way, thus making absorption spectroscopy a useful 
qualitative or empirical probe of structural changes. 
A typical DNA UV absorption spectrum is illustrated 
in Fig. 2. The base transitions are significantly 
perturbed by the so-called n-n stacking interactions, 
and so both wavelength maxima and transition inten¬ 
sities vary depending on the base sequence and struc¬ 
ture adopted (cf. Table 1). 


Electronic Spectroscopy of Proteins 

In the case of peptides and proteins, the spectroscopy 
of the amide bonds, the side chains, and any 
prosthetic groups (such as hemes) determines the 
observed UV/visible spectrum. However, as with 
DNA, intensities and wavelengths can be perturbed 
by the local environment of the groups. UV spectra 
of proteins are usually divided into the “near” and “far” 
UV regions. The near UV in this context means 
250-300 nm and is also described as the aromatic 
region, though transitions of disulfide bonds (cystines) 
contribute to the total absorption intensity in this 
region. The far UV (<250 nm) is dominated by tran¬ 
sitions of the peptide backbone of the protein, but 
transitions from some side chains also contribute in 
this region. 

Side Chain Absorption 

The aromatic side chains tryptophan, tyrosine, and 
phenylalanine have transitions in the near-UV region 
(Fig. 3). The indole of tryptophan has three transitions 
absorbing in the 240-290 nm region with 
maximum extinction coefficient 8279 nm 5,000 
moU* dm^ cm”*; tyrosine has one transition with 
£274 nm "^1,400 mol”* dm^ cm”*; phenylalanine also 
has one transition with £258 nm "^190 mol”* dm^ cm”*; 
and a cystine disulfide bond absorbs somewhere 
between 250 and 270 nm with e ^ 300 
mol”* dm^ cm”*. Although tryptophans have by far 
the most intense transitions in the aromatic region 
(characterized by the largest extinction coefficients), 
many proteins have few tryptophans compared with 
the other aromatic groups, so the aromatic absorbance 
is not necessarily dominated by tryptophan transitions. 

Chromophores with far UV transitions include the 
aromatic side chains, disulfide cystines, arginine, 
asparagine, aspartic acid, glutamine, glutamic acid, 
and histidine. 

Protein Backbone Absorption 

The peptide chromophore (Fig. 4) which gives rise to 
the transitions observed in the far-UV region (180- 
240 nm) has a-bonding electrons, nonbonding elec¬ 
trons on the oxygen and on the nitrogen atoms, and 
7i-electrons which are delocalized to some extent over 
the carbon, oxygen, and nitrogen atoms. The lowest 
energy transition of the peptide chromophore is one 
from a nonbonding orbital, n, to an anti-bonding n 
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UV Absorbance Spectroscopy of Biological Macromole¬ 
cules, Fig. 1 (a) UV spectra of the DNA nucleotides 
deoxyadenosine 5'-monophosphate (A), deoxyguanosine 5'- 
monophosphate (G), deoxycytidine 5'-monophosphate (C), and 
thymidine 5'-monophosphate (T) (that of uracil is very similar) 



UV Absorbance Spectroscopy of Biological Macromole¬ 
cules, Fig. 2 Absorption spectrum of calf thymus DNA 
(100 pM) in phosphate buffer. Path length = 1 cm 

orbital: an n ^ n* transition analogous to that in 
ketones. The next lowest transition is a 7t ^ 7t* tran¬ 
sition. As for carbonyls, the n ^ n* transition is of 
low intensity (e ^ 100 mol”' dm^ cm”'), though it is 


(Rodger and Norden 1997). (b) Probable transition polarizations 
for UV transitions of adenine, guanine, cytosine, and thymine 
from crystal and stretched film data (Chou and Johnson 1993; 
Holmen et al. 1997; Zaloudek et al. 1985; Clark 1977; Novros 
and Clark 1986) 

UV Absorbance Spectroscopy of Biological Macromole¬ 
cules, Table 1 Long wavelength absorbance maxima and 
approximate extinction coefficients for some DNAs. Calf thy¬ 
mus DNA is ~60% A-T. A adenine, T thymine, G guanine, and C 
cytosine 


DNA 

Wavelength 
of Arnax/nm 

Emax/(mor'dm^ cm) 

Calf thymus 

260 

6,600 

Poly[d(A-T) 2 ] 

262 

6,600 

Poly(dA).poly(dT) 

260 

6,000 

Poly[d(G-C) 2 ] 

254 

8,400 

Poly(dG).poly(dC) 

253 

7,400 


not as low as for a simple ketone; it occurs at about 
210-230 nm (depending mainly upon the extent of 
hydrogen bonding of the oxygen lone pairs) and 
its electric character is polarized more or 
less along the carbonyl bond. The n ^ n* transition 
(e ^ 7,000 mol”' dm^ cm”') is dominated by the car¬ 
bonyl 71-bond but is also affected by the involvement of 
the nitrogen in the n orbitals; its electric dipole transition 
moment is polarized somewhere near the line between 
the carbon and oxygen and it is centered at ^195 nm. 
These transitions are schematically illustrated in Fig. 4. 
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UV Absorbance Spectroscopy of Biological Macromole- 
cules. Fig. 3 Ai'omatic absorption spectra of tryptophan, tyro¬ 
sine, and phenylalanine scaled to give similar long wavelength 
maximum absorbances. The concentrations to achieve these 
spectra are indicated on the figure 


According to Kelly et al. 2005, the absorbance of a 1 mg/ 
mL protein in a 1-cm path length cell is ^1 at 280 nm 
(depending on the aromatic chromophores present), 5 at 
230 nm, 15 at 215 nm, 45 at 200 nm, and 68 at 193 nm. 
Of course one cannot measure absorbances of 68, so this 
has been determined from a measurement in a shorter 
path length cell or with a lower concentration sample. 

As noted above, a number of amino acid side chains 
also have transitions in the peptide region. Although 
these transitions are often stronger than the peptide 
71 ^ n* transitions, since the peptide chromophores 
are usually in significant excess, the side chain transi¬ 
tions are generally nearly impossible to detect. How¬ 
ever, the presence of these side chain transitions can be 
sufficient to confuse attempts to empirically determine 
the percentage of a given structural unit from CD or to 
use the Beer-Lambert law to determine concentrations. 

Carbohydrate Electronic Spectroscopy 

Carbohydrate UV/visible absorbance spectroscopy 
that can be measured by a normal laboratory spectrom¬ 
eter is essentially that of any substituents that have 



UV Absorbance Spectroscopy of Biological Macromole- « —> ti* and (b) ti —> ti* transitions indicated by the arrows. 
cules. Fig. 4 The peptide chromophore and the n and 7t orbitals The net transition moments in (c) an a-helix, (d) (l-sheet and (e) 
with predominant movement of electron density for the (a) polyproline II helix are illustrated 
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spectroscopy, since a simple sugar system such as 
starch has no spectroscopy above 200 nm. Carbohy¬ 
drates are often derivatized with thiols or aromatic 
chromophores which have spectra that can be 
measured. 

Cross-References 

► Absorbance Spectroscopy: Overview 

► Absorbance Spectroscopy: Spectral Artifacts and 
Other Sources of Error 

► Absorption Spectroscopy and Transition 
Polarizations: Pictorial Description 

► Absorption Spectroscopy: Practical Aspects 

► Absorption Spectroscopy: Relationship of 
Transition Type to Molecular Structure 

► Beer-Lambert Law Derivation 

► Circular Dichroism and Chirality 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► DNA-Ligand Circular Dichroism 

► Polarized Light, Linear Dichroism and Circular 
Dichroism 

► Protein Circular Dichroism Analysis 
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Vacuolar-Type ATPases in Animal 
and Plant Cells 

Hamko Okamoto and Masamitsu Futai 
Department of Biochemistry, Iwate Medical 
University, Yahaba, Iwate, Japan 

Synonyms 

ATPase complex that forms a proton gradient across 
vacuolar membrane; Molecular motors; Rotary Motor 

Definition 

The F-type ATP synthase and V- type ATPase are 
rotating enzymes in which proton transport across the 
membrane is coupled to ATP synthesis and, con¬ 
versely, the energy generated by ATP hydrolysis is 
coupled to proton transport. 

Introduction 

Protons (H"^) play vital roles in bioenergetics and ion 
homeostasis, as evidenced by the presence of unique 
acidic compartments both inside and outside of cells. 
The acidification of compartments results in an elec¬ 
trochemical proton gradient being generated across the 
membranes. The initial step in forming acidic com¬ 
partments is the transport of protons by ATPases. 
Three classes of ATPases are known: the P-ATPase 
(P-type ATPase), the F-ATPase (F-type ATPase), and 
the V-ATPase (Vacuolar-type ATPase); their 


nomenclature being derived from a phosphorylated 
intermediate, /actors of oxidative phosphorylation 
and a pump found initially in plant or fungal vacuoles, 
respectively. The P-ATPase family includes animal 
Na'^/K'^ ATPases and the gastric proton pump, as well 
as plant or yeast plasma membrane pumps. 

F-ATPases and V-ATPases share structural and 
enzymatic similarities, although they have physiolog¬ 
ically different roles: F-ATPases are ATP synthases 
found in mitochondria, chloroplasts, and bacterial 
membranes, whereas V-ATPases are proton pumps 
localized in various endomembrane organelles includ¬ 
ing the Golgi, the trans Golgi network (TGN), lyso- 
somes, synaptic vesicles, and plant vacuoles as well as 
in the plasma membrane of cells such as osteoclasts, 
cancer cells, and plant cells (Forgac 2007). The 
F-ATPase is constituted by the Fj (ocspa y5s) 
membrane extrinsic catalytic complex and the 
Fo {a\h 2 C\Q) transmembrane proton pathway, each 
complex containing defined subunits. The F-ATPase 
synthesizes ATP from ADP and Pi (phosphate) 
coupling with an electrochemical proton gradient 
formed by an electron transport chain, and can also 
reversibly hydrolyze ATP to form the gradient. The 
yscio subunit complex located in the center of the a 3 P 3 
hexamer rotates during energy coupling (Fig. 1). This 
rotary mechanism suggested by evidence from numer¬ 
ous biochemical studies was finally shown by single 
molecule observation (Futai et al. 2004; Nakanishi- 
Matsui and Futai 2008). 

The V-ATPase, a complicated multi-subunit 
ATPase similar to the F-ATPase (Fig. 1), has both 
a membrane extrinsic V i catalytic complex and trans¬ 
membrane Vo proton pathway. The V-ATPase hydro¬ 
lyzes ATP to ADP and Pi coupling this to transport 


G.C.K. Roberts (ed.). Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 




V 


2720 


Vacuolar-Type ATPases in Animal and Plant Cells 


F-ATPase 



Vacuolar-Type ATPases in Animal and Plant Cells, 
Fig. 1 Schematic models of F- and V-ATPases are shown side 
by side with the structures of the membrane extrinsic Fi and Vj 
complexes with the ATP synthesis-hydrolysis catalytic hexamer, 
rotor, and stator domains. The membrane intrinsic Fq and Vo 
sectors are shown with the direction of proton movement. The 


through rotation of a rotor stalk. This proton pump, 
expressed ubiquitously in yeast, mammalian, insect, 
nematode, and plant cells, has vital roles in intracellu¬ 
lar acidic compartments. Localization to these intra¬ 
cellular compartments is likely determined by 
isoforms of the membrane intrinsic Vq subunit. In 
addition, tissue and organ specific V-ATPase activity 
is determined by Vj subunit isoform combinations 
unique to each cell type. 

ATPase Structure and Function 

The yeast V-ATPase is formed from 13 subunits that 
are essentially conserved in multicellular organisms 
(Fig. 1). The Vi sector is composed of eight different 
subunits with a stoichiometry of A 3 BsiC)DEsFG 3 H 
(Zhang et al. 2008). Three-dimensional structures of 
isolated subunits C and H were solved recently (Shao 
and Forgac 2004; Drory et al. 2004). Subunit A is 
functionally homologous to the F-ATPase (3 subunit 
thus forming a catalytic site. In contrast to the Fq (fli 
hiCio) of the F-ATPase, the Vq sector is formed from 
six subunits (ac^c’c”de), although the c’ subunit is not 
found in mammals, nematodes, and plants. In a similar 


V-ATPase 



A 


ATP 

, ATP-hPI 



F-ATPase rotates clockwise as protons move through the Fq a 
subunit with rotation energy coupled to ATP synthesis. Con¬ 
versely, the V-ATPase rotates counterclockwise as the Vi cata¬ 
lytic subunit hydrolyses ATP and this energy is coupled to 
proton movement through the Vq sector 


way to the asPs hexamer of the F-ATPase, A 3 B 3 is 
connected to Vq by a rotor and a stator (components 
that stabilize the rotor), structurally identified central 
and peripheral stalks, respectively. The rotor is formed 
from the D and F subunits partly inside the A 3 B 3 
hexamer and attached to a transmembrane ring of c, 
c’, and c”. The stator, formed from a, C, E, G and H 
subunits, is connected to the Ajilj hexamer. 

A V-ATPase, not synthesizing ATP physiologically 
in eukaryotic cells, has been proven capable of synthe¬ 
sizing ATP in yeast vacuoles that are artificially 
expressing a proton pumping pyrophosphatase from 
the model plant, Amhidopsis thaliana (Sun-Wada 
et al. 2004). The engineered vacuoles synthesized 
ATP from ADP and Pi, coupling with an electrochem¬ 
ical proton gradient established by the hydrolysis of 
pyrophosphate. It is noteworthy that, an ATP synthase 
homologous to the yeast V-ATPase, but without the 
C and H subunits, is found in thermophilic bacterial 
membranes, where an electrochemical proton gradient 
is established by an electron transport chain. 

The V-ATPase is a rotating enzyme and generates 
torque in a similar way to the F-ATPase: An actin 
filament probe connected to subunit G could be rotated 
by a yeast V-ATPase immobilized on a glass surface 
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via c subunit (Futai et al. 2004). The forementioned 
bacterial ATP synthase homologous to the yeast 
V-ATPase has also been shown to rotate. The torque 
calculated from these two studies was quite similar and 
details of the mechanism has been addressed using 
probes with lower viscous drag than actin (Okamoto 
and Futai manuscript in preparation). 

V-ATPase, an Essential Proton Pump in 
Eukaryotic Cells 

Yeast mutants defective of any one of the V-ATPase 
subunits do not grow at neutral pFl, a vacuolar mem¬ 
brane ATPase (VMA) phenotype, indicating that V- 
ATPases are essential for acidifying compartments 
such as vacuoles, Golgi, etc. However, mutant cells 
can grow in acidic pH medium possibly by lowering 
organelle pH through pinocytosis of the medium. The 
V-ATPase is also essential for Caenorhahditis 
elegans: nematodes became embryonic lethal when 
the V-ATPase c subunit (encoded by a single gene) 
expression was reduced upon injection of the 
corresponding double stranded RNA (by RNA silenc¬ 
ing). RNAs corresponding to three of the four a 
subunit isoforms also caused a lethal phenotype at 
either the embryonic stage {unc-32) or at a specific 
larval stage {vha-5 or vha- 6 ), whereas of vha7 RNA 
had no effect, indicating that it is a dispensable 
subunit. 

Mouse embryos lacking the c subunit encoded by 
a single gene grow up to the blastocysts, but fail to be 
implanted, leading to embryonic lethal phenotype. As 
expected, the mutant blastocysts cultured in vitro had 
no lumenal acidic organelles and, thus, showed no 
endocytosis of dextran. In plants, a male gametophyte 
lethal phenotype of the A. thaliana vha-A (vacuolar 
/T^-ATPase-A), Vi subunit A null mutant, has demon¬ 
strated the essential role of V-ATPases for haploid 
gametogenesis (Gaxiola et al. 2007). These results 
indicate that acidic compartments formed by 
V-ATPases are essential for the early development of 
mammals, worms, and plants. 

V-lsoforms for Diverse Organelles 

Consistent with the diverse functions of V-ATPases, 
the mammalian V i complex is formed from a variety 
of combinations of the following isoforms: two each 
for the B (Bj, B 2 ), E {E,, £ 3 ), H {Hi, H 2 ), and d (di, d 2 ) 
subunits; three for the C (Cj, € 2 - 0 , C 2 .h) and G (Gi, G 2 , 
G 3 ) subunits (Smith et al. 2003). Each isoform affects 


the V-ATPase either an ATPase activity, level of 
energy coupling, and/or organelle specific localization. 

The BI isoform is specific for the kidney and inner 
ear and thus its mutation causes renal acidosis with 
hearing loss. V-ATPases with C 2 -a are specifically 
found in the lamellar bodies of lung alveolar epithelial 
cells responsible for surfactant secretion, while those 
with C 2 -h are found in the plasma membrane of renal 
intercalated cells responsible for ion homeostasis. The 
localization of the V-ATPase E 1 isoform is restricted to 
the acrosomes of developing spermatid and mature 
sperm, indicating that the sperm specific acidic pH is 
necessary for fertilization (Sun-Wada et al. 2004). The 
G 2 and Gi isoforms are found in neuronal synaptic 
vesicles and the kidney, respectively. The additional 
V 1 isoforms that are not discussed above are expressed 
ubiquitously. It is noteworthy that biochemical studies 
of mammalian V-ATPases are not easy because of the 
presence of V-ATPases with different isoforms in the 
same cells. 

Studies of yeast V-ATPase complex with a subunit 
replaced by mammalian isoforms in yeast are informa¬ 
tive, since most of the Vi subunit could complement 
yeast mutants lacking the corresponding genes. Func¬ 
tional differences between mouse Ei (acrosome spe¬ 
cific) and £2 (ubiquitous) isoforms were studied using 
a yeast chimeric V-ATPase with mouse Ei and £ 2 , 
respectively. Although the mouse £ 2 /yeast chimera 
V-ATPase was active at 37°C upon exposure of the 
Vo subunit a epitope to the corresponding antibodies, 
the Ejlyeast chimera V-ATPase was inactivated indi¬ 
cating that the £ subunit has a role in assembly with the 
Vo subunit a (Hayashi et al. 2008). It is of interest to 
reconstruct an active V-ATPase formed from all the 
mammalian subunits in yeast. 

Similarly to the mammalians, Vi isoforms, three 
B (Bi, B 2 , Bs), E {El, £ 2 , £ 5 ), G (G;, G 2 , G 5 ), and 
two d{di, d 2 ) subunits, have been found in A. thaliana 
(Sze et al. 2002). None of these subunit isoforms are 
orthologous to those of the corresponding mammalian 
subunit, indicating that these plant V-ATPase subunit 
isoforms are likely to have formed after plants 
diverged from metazoans. Promoter-GUS and -GFP 
reporter fusions of these subunit isoforms have 
revealed their specific and ubiquitous expression pat¬ 
terns during plant growth and development (Table 1). 

Knock-out mutation of the Ei isoform resulted in 
defective somatic embryo development, whereas £3 
expression is specific for the haploid male 
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Vacuolar-Type ATPases in Animal and Plant Cells, Table 1 Organ specific and ubiquitous expression of V-ATPase subunit 
isoforms in mouse and Arabidopsis 

Yeast 

Mouse 

Expressed tissue (cell) 

Arabidopsis 

Expressed tissue (cell) 

Null mutant phenotype 

Vi A 

A 

N.D. 

A 

Ubiquitous 

Male gametopbyte letbal 

B 

B, 

Kidney, ear, lung 

Bi 

N.D. 

N.D. 


B2 

Ubiquitous 

B2 

N.D. 

N.D. 




Bs 

N.D. 

N.D. 

C 

c, 

Ubiquitous 

C 

Ubiquitous 

Male gametopbyte letbal 


Cs-a 

Lung 





C2-h 

Kidney 




D 

D 

N.D. 

D 

N.D. 

N.D. 

E 

Ej 

Testis (acrosome) 

El 

Ubiquitous 

Embryo letbal 


£2 

Ubiquitous 

£2 

Pollen 

N.D. 




Es 

Epidermis 

N.D. 

F 

F 

N.D. 

F 

N.D. 

N.D. 

G 

Gi 

Ubiquitous 

Gi 

Endosperm/maternal Tissue 

N.D. 


G2 

Neural (synaptic vesicles) 

G2 

N.D. 

N.D. 


Gs 

Kidney 

Gs 

Pollen/flower 

N.D. 

H 

H 

N.D. 

H 

N.D. 

N.D. 

Vo c 

c 

N.D. 

Cl 

Ubiquitous 

N.D. 




C 2 

N.D. 

N.D. 




Cs 

Root cap 

N.D. 




C4 

N.D. 

N.D. 




C5 

N.D. 

N.D. 

c' 

No homolog 

- 

No homolog 

- 

- 

c” 

c” 

N.D. 

c" I 

N.D. 

N.D. 




c" 2 

N.D. 

N.D. 

d 

d, 

Ubiquitous 

d, 

N.D. 

N.D. 


^2 

Kidney, osteoclasts 

d2 

N.D. 

N.D. 

a (STVl) 

aj 

Neural 

ai 

Leaf (TON) 

N.D. 

a (VPHl) 

02 

Endotbelial 

02 

N.D. 

N.D. 


as 

osteoclasts 

as 

Root 

N.D. 


04 

kidney 





N.D. Not determined 

None of the mouse and Arabidopsis subunit isoforms are orthologous to that of each other 


gametophyte, suggesting the critical role of 
V-ATPases at both the haploid and diploid develop¬ 
mental stages (Gaxiola et al. 2007). 

Isoforms of Vq Subunits and Their Function in 
Membrane Trafficking 

Vo subunit a isoforms were found to be localized in the 
specific organelles. Unique isoforms are found for 
yeast Vo subunit a with the isoform STVl being 
found in prevacuolar vesicles while VPHl is localized 
in the vacuole. The amino terminal domains of these 
isoforms are responsible for the specific targeting of 
V-ATPases to vacuolar and Golgi compartments, 
respectively. As described earlier, four isoforms of 
subunit a are found in C. elegans, although they do 
not correspond to mammalian isoforms. 


Human genes for V-ATPase subunits have 
been summarized with their aliases and synonyms. 
Mouse and human V-ATPases have four isoforms 
of subunit a (oi, a 2 , < 23 , 04 ), and they are found in 
different organelles and plasma membranes of special¬ 
ized cells. The isoform is specifically localized in 
the plasma membrane of osteoclasts that are directly 
facing the bone, while aj and 122 are found in cytoplas¬ 
mic organelles of the same cells. Bone resorption and 
formation by osteoclasts and osteoblasts, respectively, 
maintain bone homeostasis. Consistent with the local¬ 
ization of the Os isoform, mice lacking the 
corresponding gene have osteopetrosis. 

The V-ATPase as isoform is localized to the late 
endosomes and the lysosomes in non-osteoclast cells 
including osteoclast projenitor and NIH3T3 cells. 
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Kinetic studies of osteoclast-like RAW264.7 cells after 
stimulation indicated that lysosomes that have the 
V-ATPase isoform are first transported to the cell 
periphery that later forms the osteoclast plasma mem¬ 
brane. Thus, as Is a key factor for plasma membrane 
localization of V-ATPases in osteoclast cells. In con¬ 
trast, the plasma membrane V-ATPase in kidney inter¬ 
calated cells has the isoform. Thus, the targeting 
function resulting from the specificity of a subunit 
isoform is expected to be different between cell 
types. The intracellular localization of a subunit is 
also isoform dependent: as, lysosomes; a^, Golgi; aj, 
coated vesicles. V-ATPases with a unique combination 
of isoforms are found in the kidney: V-ATPases with 
Bj, Gs, a 4 and Cs-h, are specific for the kidney, while 
those with Bs, Cj, Gj and aj, as, or as are also present 
in the kidney but also found in other cell types. Pres¬ 
ence of V-ATPases with diverse isoforms in kidney 
cells may be related to their role in renal ion 
homeostasis. 

The unique properties of A. thaliana Vq subunits a 
and c have also been investigated. Similar to the situ¬ 
ation for yeast STVl and VPHl, the localization of 
Wo aj was found predominantly in the TGN, whereas 
as and as were found in the vacuole. The localization 
of Vo a subunits is regulated by the amino terminal 
domain. An essential role of c subunit isoforms has 
been demonstrated by RNAi studies. These studies 
indicate that membrane intrinsic subunits have critical 
roles in intracellular and tissue-specific localization of 
V-ATPases (Gaxiola et al. 2007). 

Active Transport Driven by V-ATPases 

The transport by V-ATPases is often coupled with 
chloride channels which dissipate membrane potential 
formed by the V-ATPase. Furthermore, an electro¬ 
chemical proton gradient established by V-ATPase 
activity drives active transport into endomembrane 
organelles. Examples of this are neurotransmitter 
uptake into synaptic vesicles including glutamate, nor¬ 
epinephrine, serotonin, and y-amino butyric acid 
(GABA) coupled with either the membrane potential 
or protein gradient. The neurotransmitter is released 
into the synaptic cleft upon fusion with the presynaptic 
plasma membrane. 

Plant vacuoles accumulate sugars, amino acids, 
organic acids, and inorganic ions such as Ca^"^, 


through coupling with the proton gradient. The 
quantity of these compounds to be transported is 
dependent on photosynthesis and the metabolic 
state of the plant cells and this is likely to be 
influenced by the environment, responding to 
changes in CO 2 , light, temperature, and H 2 O. Active 
proton transport level is, at least in part, dependent 
on the number of pumps per unit surface of the 
membrane and this is regulated by both transcrip¬ 
tion and protein degradation (Maeshima and 
Nakanishi 2002). V-ATPases can be inactivated by 
low temperature in both animals and plants due to 
a diminishing number of V 1 complexes at low tem¬ 
perature. A high C02-dependent induction of 
V-ATPase B subunit has been reported in green 
algae. The transcript levels of A. thaliana 
V-ATPases are regulated by heat and drought stress 
indicating further roles of V-ATPases in the regula¬ 
tion of water balance in a changing environment. 
There is considerable evidence suggesting that the 
environmental signal transduction pathways regulat¬ 
ing V-ATPases are mediated via plant hormones 
(Binzel and Ratajczak 2005). 

Endocytosis and Exocytosis 

Exocytosis and endocytosis, for secretion and uptake 
of ions, proteins, viruses, etc., both require acidic pH 
in the lumen. Acidic pH is also necessary for the 
pathway leading to protein modifications, protein 
uptake, and recycling of receptors. In exocytosis, car¬ 
rier vesicles bud from early endosomes and transfer 
their cargo to late endosomes. Recruitment of Arf6 
(ADP - ribosylation /actor 6) and ARNO (ADP 
/ibosylation /actor nucleotide site opener), a GTP 
binding protein and its cognate GDP/GTP exchange 
factor, respectively, is necessary for carrier vesicle for¬ 
mation, and this is driven by endosomal-acidic pH. This 
pH-sensitive mechanism involves interactions of Arf6 
and ARNO with the as and c subunits, respectively 
(Marshansky and Eutai 2008). Insulin secretion also 
requires the presence of V-ATPases, in this case with 
the as isoform, in the membranes of secretory vesicles. 
These results suggest that subunit a isoforms of 
V-ATPases play essential roles in mammalian mem¬ 
brane trafficking. The predominant localization of 
A. thaliana subunit in the TGN and studies on the 
effect of V-ATPase inhibitors on membrane trafficking 
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have also indicated an important role for plant 
V-ATPases in endo- and exocytosis in plants (Gaxiola 
et al. 2007). 

Perspectives 

We have discussed the present status of our understand¬ 
ing of V-ATPases including their structure, function, 
and roles in membrane trafficking. Suhunit assembly 
and primary structure homology predict a similar mech¬ 
anism to that of F-ATPases. The two enzymes couple 
chemistry and proton transport through subunit rotation. 
Flowever, the detailed mechanisms need to be addressed 
biophysically because the transmembrane rotors of the 
F- and V-ATPase, Cw and C 4 c’c”, respectively, have 
different numbers of proton transporting carboxyl resi¬ 
dues. Thus, the precise rotation mechanism of the two 
enzymes could be different. A detailed mechanism for 
the role of V-ATPases in membrane trafficking in ani¬ 
mal and plant cells also awaits further analysis. 

In addition to the tissue-specific role of V-ATPase 
subunit isoforms, they have also been found to be 
regulated by environmental stresses such as salinity, 
heat, and drought, responses that help plants to main¬ 
tain their water balance. This unique feature would 
have been developed as a consequence of the sessile 
nature of plant life. The most significant finding in 
recent years would be the pathophysiological function 
of the vacuole that is essential for plant defense mech¬ 
anism against bacterial pathogens and that is akin to 
the mammalian defense mechanism. Further analysis 
of V-ATPase function in this response should reveal 
important new aspects of plant V-ATPases. 

Summary 

The V-ATPase, comprising multiple subunits forming 
membrane extrinsic V i and a transmembrane Vq com¬ 
plexes, is a rotating enzyme that hydrolyzes ATP to 
ADP and Pi, coupling this to Ff^ transport through 
rotation of a rotor stalk. This proton pump, expressed 
ubiquitously in yeast and diverse multicellular organ¬ 
isms across the phyla, has vital roles in intracellular 
acidic compartments, such as the Golgi, trans 
Golgi network, lysosome, synaptic vesicles, and 
the vacuole. Intracellular and tissue and organ specific 
V-ATPase localization is determined by isoforms 


of the V-ATPase subunits generated by subunit com¬ 
binations unique to each cell type. 

Cross-References 

► ATPase: Overview 

► Electron Transport Chains in Bacteria 

► Rotary Motor 
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Timothy A. Keiderling 

Department of Chemistry, University of Illinois at 
Chicago, Chicago, IL, USA 


Synonyms 

Coupled oscillator vibrational CD; Infrared Circular 
Dichroism (CD); Vibrational optical activity 


Definition 

Vibrational circular dichroism is experimentally 
measured as the differential absorption of left and 
right circularly polarized light caused by excitation of 
molecular vibrational transitions in a chiral environ¬ 
ment. In biopolymers, this can be used for secondary 
structure determinations and arises primarily from 
coupling of repeating (locally achiral) units in the 
polymer (e.g., amides in proteins). 


Basic Characteristics 

Vibrational circular dichroism (VCD) is 
a manifestation of chirality or handedness of 


molecular vibrational transitions, which is expressed 
as AA = Al — Ar and measured in the infrared (IR) 
region. Its dominant application at present is determi¬ 
nation of absolute configuration of small to 
medium sized chiral organic molecules by correlating 
experimental determinations of VCD spectra with 
spectral simulations using density functional theory 
(DFT) computations based on proposed structural 
models (Stephens et al. 2008). Additionally, VCD has 
historically been used for determining dominant sec¬ 
ondary structure types in proteins and nucleic acids in 
a manner that complements and extends techniques 
that use IR or electronic CD (BCD) alone (Keiderling 
et al. 2006). However, the highest impact biophysical 
applications of VCD have been for peptide structure 
and folding studies where theoretical simulation and 
isotopic labeling can provide structural sensitivity not 
possible with other optical techniques (Keiderling 
2002). Theoretical VCD has advanced to a stage 
where spectra can be predicted with very good accu¬ 
racy for conformationally restricted molecules. This 
capability has been extended to biopolymers by use 
of various approximations that allow spectral model¬ 
ing of stable peptide and nucleic acid conformers for 
selected modes (Kubelka et al. 2009). 

VCD can be measured with a modified dispersive 
IR or FTIR spectrometer that incorporates polarization 
modulation and detection of the resultant modulated 
intensities or with dedicated commercial instrumenta¬ 
tion that can produce excellent spectra (Keiderling 
et al. 2006). Water is an obstacle to obtaining high 
quality VCD for modest concentration samples, due 
to spectral overlap of intense bands, but use of 
D 20 -based solvents can avoid much of the interfer¬ 
ence. Band shape and frequency shifts result from H/D 
exchange, but these are well established and can be 
accounted for in the analyses. A complementary light 
scattering technique, Raman optical activity (ROA), 
has also developed many applications for determining 
average or qualitative protein and nucleic acid struc¬ 
tural characteristics (Barron 2006). Theoretical model¬ 
ing and measurement of ROA are more difficult than 
for VCD, which arises from the complex fundamental 
basis of ROA, but reliable methods for both are becom¬ 
ing more available. 

In biopolymers, absolute configuration is generally 
known (e.g., L-amino acid residues in proteins), so that 
determination of the relative conformation of succes¬ 
sive residues in a chain, or secondary structure. 
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Vibrational Circular Dichroism of Biopolymers, 
Fig. 1 Secondary structure variation in peptide VCD. IR and 
VCD spectra of the amide T band (amide C=0 stretch) for 
N-deuterated poly-L-lysine (in D 2 O) at (a) high pH and low 


temperature (ot-helical, with some residual sheet), (b) high pH 
after heating (P-sheet aggregate), and (c) neutral pH (coil or 
disordered) 


dominates VCD structural studies. The relative orien¬ 
tation of amides in peptides (similarly for bases or - 
P02~ groups in DNA) affects the mechanical and 
dipolar coupling of localized vibrations resulting in 
coupled polymer modes characteristic of the backbone 
geometry. Contributions from different secondary 
structural types to the VCD tend to spectrally overlap, 
so evaluation of their fractional components is the 
usual outcome (such as % helix, etc.). IR and VCD 
have both advantages and disadvantages as compared 
to other spectral techniques for secondary structure 
analyses, but are best used in conjunction with BCD, 
Raman, and magnetic resonance (NMR) spectra to 
exploit the complementary structural sensitivities 
available with each. 

The focus of IR and Raman spectra of proteins and 
peptides has been the assignment of component 
frequencies to various secondary structural types 
(Haris 2000). Most analyses of proteins are done for 
samples in aqueous solution and focus on the amide 
I (C=0 stretch) band (^1610-1690 cm”') with some 
application of the amide II (N-H deformation plus C-N 
stretch, ^1500-1550 cm”') in IR spectra and 


especially the amide III (oppositely phased N-H defor¬ 
mation plus C-N stretch, ^1330-1220 cm”') with 
Raman spectral methods. The same is true of VCD, 
but since it is a differential technique, like BCD, the 
band shape carries conformational information giving 
the spectra an extra dimension which allows more 
discrimination between structures whose BCD or IR 
spectra are less distinct. A comparison of the IR and 
VCD spectra for N-deuterated poly-L-lysine in a-heli- 
cal, P-sheet, and coil (unordered) forms is shown in 
Big. 1. These patterns are quite stable, having been 
identified in a wide variety of peptides. With proteins 
that have one dominant secondary structure type (e.g., 
a-helical myoglobin), the same patterns are detected, 
but are broadened due to the less homogenous struc¬ 
ture. If the protein has segments with a variety of 
secondary structures, their spectral contributions over¬ 
lap and must be deconvolved, and this process can be 
used to develop measures of average secondary struc¬ 
ture content (Keiderling et al. 2006). 

Some notable successes of VCD are determination 
of secondary structure for peptides rich in aromatic 
residues where the BCD is not useful due to overlap 
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Vibrational Circular Dichroism of Biopolymers, 
Fig. 2 VCD discrimination of 3 jq- and or.-helices. Comparison 
of experimental and computed VCD spectra in amide I and 
amide II region for three model 3io- and ot-helical peptides 
having different fractions of a-methylated residues. {Left) spec¬ 
tra measured in CDCI 3 solution: 3 10 helical, weak couplet amide 
I and strong negative amide II, Aib 5 -Leu-Aib 2 (top) and 



1700 1600 1500 


Wavenumber [cm 

(Aib-Ala )4 {middle, also for TFE, black dash line), with a- 
helical, strong couplet amide I and broad weak negative amide 
II, (Met 2 -Leu )8 {bottom). {Right) simulated 3 10 -helical Ac- 
(Aib) 8 -NH 2 {top) and Ac-(Aib-Ala) 4 -NH 2 {middle), with a- 
helical Ac-(Ala) 6 -NH 2 {bottom). Vertical lines indicate DFT 
computed rotational (R) strengths, and spectral envelops are 
their broadened sum 


of aromatic transitions with those of the amides, dis¬ 
crimination of 3 10 -helices from a-helices hy use of 
relative intensity and band shapes for amide I and II, 
both theoretically and experimentally with variation in 
the sequence of related octapeptides as shown in Fig. 2 
(Kubelka et al. 2002), and demonstration of the con¬ 
formational similarity of “random coil” peptides and 
disordered proteins with poly-L-proline II conformers 
(PPII) (Dukor and Keiderling 1991). These empirical 
correlations have been confirmed by DFT computation 
of the spectral properties, whose variation with struc¬ 
ture correlated closely with the experimental observa¬ 
tions. By use of isotopic labeling (*^C on the amide 
C = O), model a-helical peptides were shown with 
VCD to be disordered in a specific location (C-termi- 
nal), and to unfold from the ends toward the middle 
(Silva et al. 2000). These methods also enabled 


evaluation of the site-specific coupling constants 
between specific labeled residues by matching theoret¬ 
ical simulation of the spectra to experimental results 
(Huang et al. 2004). 

For nucleic acids, A, B, and Z forms are distinguish¬ 
able, especially by use of VCD of PO 2 modes, as seen 
in Fig. 3. A systematic dependence on the GC-AT ratio 
is found for base-centered VCD, and characteristic 
VCD for triplex formation has been demonstrated 
(Keiderling et al. 2006). 

Due to the longer wavelengths in the IR, VCD is 
less influenced by scattering and other polarization 
degrading properties of condensed phase samples. 
Measurements of VCD for films and detection of fibril 
formation in amyloidgenic systems from their charac¬ 
teristic intensified VCD have been recently reported 
(Ma et al. 2007). Since VCD, like IR, samples the 
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Vibrational Circular Dichroism of Biopolymers, 
Fig. 3 DNA VCD forms. VCD {left) for nucleic acid duplexes 
in {a) A-form RNA, (r(I)r(C)), {h) B-form DNA (d(G-C)), and 
(c) Z-form DNA (d(G-C) in alcohol) in the base deformation 
region and {right) for B- {dashed line) and Z-form {solid line) 
DNA, d(G-C) in the symmetric -PO 2 stretching modes 


Structure on a rapid time scale and can be applied to 
any biopolymeric structure, uniform or disordered, 
these spectra reflect the distribution of structures in 
the conformational ensemble during a transition. 
Study of protein and peptide folding processes with 
IR and VCD has made a significant contribution and 
has offered new insight, facilitated by the short 
range and potential site-resolved structure sensitivity 
of VCD, in contrast to BCD and fluorescence, for 
example (Silva et al. 2000). 


Cross-References 

► Amyloid Formation 

► Circular Dichroism Spectroscopy of 
Biomacromolecules 

► Far UV Protein Circular Dichroism 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications: Technical Aspects 

► H/D Bxchange 

► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Infrared Spectroscopy of Protein Folding, 
Misfolding and Aggregation 

► Protein Circular Dichroism: Theoretical Aspects 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Raman Optical Activity Studies of Structure and 
Behavior of Biomolecules 
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Vibrational Spectroscopy with Neutrons 
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Synonyms 

Hydration; Incoherent inelastic neutron scattering; 
Inelastic neutron scattering; Neutron scattering of 
membranes 

Definition 

Inelastic neutron scattering provides information 
on the vibrational spectrum of biomolecules that 
is complementary to infrared and Raman spectros¬ 
copy. The fundamentals of the technique are 
outlined and selected applications discussed, cover¬ 
ing bone mineral, proteins, and water associated 
with biomaterials. 

Introduction 

Vibrational spectroscopy is frequently used for the 
investigation of biological systems because it provides 
detailed information about the local structure of the 
material. Most studies are carried out with infrared or 
Raman spectroscopies. A complementary technique is 
inelastic neutron scattering (INS) spectroscopy 
(Mitchell et al. 2005), where the nature of INS spectra 
emphasizes a “whole spectrum” approach to vibra¬ 
tional analysis. 

The major difference between vibrational neutron 
spectroscopy and infrared and Raman spectroscopies 
is that the neutron has mass, thus an inelastic scattering 
event results in a significant transfer of both energy 
(£, cm”*) and momentum {Q, A” ). Since both Q and 
E are experimentally and independently accessible, it 
follows that INS spectroscopy is intrinsically a two- 
dimensional form of spectroscopy. 

INS spectroscopy can be either coherent, so gives 
information on collective motions of the system, or 
incoherent, which only involves the correlation 
between the position of the same nucleus at different 


times, hence the motions of a single particle are 
probed. Whether the scattering is primarily coherent 
or incoherent depends on the relative size of the coher¬ 
ent and incoherent scattering cross section’s (cr) of the 
scattering nuclei. For incoherent scattering the INS 
intensity, S(Q, co,), of the /th vibrational mode is pro¬ 
portional to (Mitchell et al. 2005) 

5(G, m,) cx exp(-e2t/2^^^j)^ (i) 

where t/, is the amplitude of vibration of the atoms 
undergoing the particular mode, cu,. Since the ampli¬ 
tude of vibration is greatest for *H and its incoherent 
cross section is at least a factor of ten times larger than 
that of most elements, for hydrogenous materials the 
scattering is dominated by hydrogen. The exponential 
term in Eq. 1 is the Debye-Waller factor, t/Totai is the 
mean square displacement (averaged over time) of 
the atoms undergoing mode m, summed over all the 
modes. It is dominated by the whole body (external) 
modes of the molecule and its magnitude is in large 
part determined by the thermal motion of the molecule. 
This can be reduced by cooling the sample and so 
spectra are typically recorded below 30 K to increase 
the scattering intensity. 

The key feature of Eq. 1 is that it is purely dynamic, 
the intensity does not depend on the electronic proper¬ 
ties of the system. Thus any method that can provide 
the Ui values can be used to generate a calculated INS 
spectrum. For small to medium size systems (up to 
a few hundred atoms) ab initio methods (Johnson 
et al. 2006; Hudson 2009) generally provide the best 
results, for large systems (thousands of atoms), molec¬ 
ular mechanics has proven effective (Goupil-Lamy 
et al. 1997). 

Since the scattering event is between the neutron 
and the atomic nucleus, neutrons are highly penetrat¬ 
ing. This means that sample presentation is straight¬ 
forward, for air stable materials wrapping the sample 
in aluminum foil is often sufficient. Where samples are 
air sensitive or require controlled humidity, thin- 
walled, sealed aluminum cans are used. A corollary 
of the high penetration is that scattering events 
are relatively rare, thus the sample size for INS spec¬ 
troscopy is in the 0.5-5.0 g range and measurement 
times range from a few hours to 24 h. A variety 
of instrumentation is used for INS measurements, 
these are described in detail elsewhere (Mitchell 
et al. 2005). 
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Vibrational Spectroscopy with Neutrons, Fig. 1 Left. Infra¬ 
red, Raman, and INS spectra of purine. (The Raman spectrum in 
green was recorded with two different instruments in order to 
observe the low energy part of the spectrum, hence the discon¬ 
tinuity and change of color at 250 cm“'). Right. Comparison of 



Wavenumber/cm '' 

the observed INS spectrum (center) with that calculated for an 
isolated, i.e., gas phase molecule (top) and for the solid state 
based on the experimental crystal structure (bottom). 
Reproduced from Parker et al. (2004). Copyright (2004), with 
permission from Elsevier 


Complementarity of INS Spectroscopy 

The left part of Fig. 1 shows the infrared, Raman, and 
INS spectra of purine (Parker et al. 2004), which is the 
parent compound of the purine bases whose deriva¬ 
tives include the biologically important molecules ade¬ 
nine and adenosine. Similarities and differences are 
apparent between all three types of spectra and empha¬ 
size the complementarity of the three techniques. The 
similarities arise because the vibrational energy levels 
are an intrinsic property of the system; hence, the 
energy differences between them (the vibrational tran¬ 
sition energies) are independent of the method of 
observation. In contrast, the intensities depend on the 
technique used to observe the transition. Note that INS 
spectroscopy allows routine access to the terahertz 
region (<400 cm”*). While it is possible to access 
this region with infrared or Raman spectroscopy, it 
becomes progressively more difficult (and expensive) 
to do so as the wavenumber decreases. The advantages 
of INS spectroscopy are that there are no selection 
rules (so all modes are allowed), the specificity to 
hydrogen (although non-hydrogenous materials can 
be measured albeit with tenfold reduced sensitivity), 
and the ease of simulation of the spectra. The major 


advantages of infrared and Raman spectroscopies over 
INS spectroscopy are sensitivity (mg sample size 
rather than g), speed (measurement time in seconds 
rather than hours), and availability of instrumentation 
(there are less than 30 operational INS spectrometers 
worldwide in contrast to the hundreds of thousands of 
infrared and Raman spectrometers). Surprisingly, the 
resolution of state-of-the-art INS spectrometers is 
effectively comparable to that of most laboratory infra¬ 
red and Raman spectrometers because bandwidths in 
the solid state are dominated by intermolecular inter¬ 
actions that are typically ^10 cm”* or more and high 
resolution is unnecessary. 

The right part of Fig. 1 shows a comparison of the 
experimental INS spectrum with that calculated for the 
isolated molecule and for the solid state. It is apparent 
that the hydrogen bonding and factor group splitting 
present in the solid state structure of purine has to be 
explicitly considered in order to correctly assign the 
spectrum. The ability to easily generate calculated 
spectra from ab initio calculations is one of the great 
strengths of INS spectroscopy. The comparison with 
the experimental spectrum is synergistic: it simulta¬ 
neously provides a stringent test of the calculation and 
provides unambiguous assignments. 
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Vibrational Spectroscopy 
with Neutrons, 

Fig. 2 Comparison of 
infrared {red lines) and INS 
(blue lines) spectra of (a) 
highly crystalline, 
stoichiometric hydroxyapatite 
and (b) defatted, 
deproteinated, and dried bone 
in the OH stretch (left) and 
0-H libration {right) regions. 
The vertical dashed lines 
indicate the OH-specific 
modes 


OH stretch 

+ libration OH stretch OH libration 



Bone Mineral 

Bone is a natural composite consisting of a hard, load 
bearing material, the mineral hydroxyapatite, 
Ca 5 (P 04 ) 30 H, in a softer, stress transmitting, extracel¬ 
lular protein matrix that is largely collagen. Many 
aspects concerning the composition of bone mineral 
are controversial and substitution of carbonate for 
phosphate and/or hydroxyl readily occurs. Hydroxyls 
in stoichiometric hydroxyapatite exhibit two infrared 
bands: an 0-H stretch at 3,575 cm“* and a libration at 
633 cm“\ Fig. 2a red traces. The same modes are seen 
by INS plus a combination mode of (stretch -i- libra¬ 
tion) Fig. 2a blue traces. In bone mineral these modes 
are usually either very weak or absent. Fig. 2b red 
traces, and on this basis, it has been suggested that 
there is complete substitution by carbonate at the 
hydroxyl site. INS spectroscopy of the same samples 
shows the characteristic modes to be present. Fig. 2b 
blue traces. From the INS spectra it was concluded that 
in ox femur bone 40-50% of the hydroxyls are still 
present even when the infrared spectra indicate com¬ 
plete substitution has occurred (Taylor et al. 2001). 
The extent of hydroxyl replacement is crucial since 


substitution results in a charge imbalance that causes 
calcium loss, with its attendant consequences for the 
mechanical properties of the bone. 


Molecular Mechanics Studies of Proteins 


As shown earlier for purine, small to medium sized 
molecules are accurately described by periodic-DFT 
calculations. In the near future improved computa¬ 
tional methods and hardware will allow thousands of 
atoms to be treated, meaning that small proteins 
become feasible. However, the flexibility of the struc¬ 
ture, the lack of symmetry, and the very large number 
of degrees of freedom mean that there are many min¬ 
ima of similar energy which will render finding the 
ground state extremely difficult. This is likely to be 
a significant problem with no obvious solution. The 
parameterized force field methods do not suffer from 
this difficulty and the purpose of the analysis of small 
molecules by molecular mechanics is to parameterize 
the force field so as to be able to analyze large systems. 
Alanine dipeptide (Baudry et al. 1997), see Fig. 3, is 
a useful model compound in that it contains the basic 
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Vibrational Spectroscopy with Neutrons, Fig. 3 Experi¬ 
mental INS spectrum of alanine dipeptide. The spectral regions 
that have major contributions from the methyl groups torsions, as 
shown by CHARMM, are indicated by the colored arrows 

constituents of a peptide chain. Thus there are sp^-sp^ 
backbone C-C bonds, backbone N-C bonds, and side- 
chain C-C bonds. The spectrum is complex but has 
been successfully modeled by molecular mechanics 
using the CHARMM force field. The results show 
a uniform mismatch in frequencies of ^35 cm”*, but 
the overall pattern is reproduced. Inspection of the 
eigenvectors shows that many of the peaks contain 
contributions from more than one methyl group, con¬ 
sistent with extensive mixing of the modes caused by 
the Cl site symmetry in the crystal. 

An example where this approach has been used is 
the enzyme Staphylococcal nuclease (Goupil-Lamy 
et al. 1997; Smith et al. 1998). A comparison of the 
INS spectrum of the dried protein after exchange of 
labile hydrogens for deuterium and the calculated 
spectrum is shown in Fig. 4. The agreement is gener¬ 
ally good, although some details are incorrect. Exam¬ 
ination of the displacement vectors showed that most 
of the modes below 1,500 cm”* are heavily mixed. 
Qualitatively, they follow the usual group frequency 
correlations: 350-500 cm”* skeletal bending modes 
and torsions, 700-1,000 cm”* C-C stretch, CH2 
and CH3 rock, 1,000-1,500 cm”* CH2 rock, wag, 
twist, scissors, and CH3 symmetric and asymmetric 
bend. The methyl torsion is calculated at 235 cm”* 
and observed at 269 cm” * showing that the parame¬ 
ters can be further refined. Staphylococcal nuclease 
has 2,395 atoms, and thus there are 7,179 internal 
modes, and is the largest molecule to be analyzed 
to date. 



2000 1000 0 
Wavenumber / cm^'' 

Vibrational Spectroscopy with Neutrons, Fig. 4 Experi¬ 
mental (top) and calculated (bottom) INS spectrum of Staphylo¬ 
coccal nuclease. Reproduced from Smith et al. (1998). Copyright 
(1998), with permission from Elsevier 

Water 

The interaction between water and biomacromolecules 
is very important in stabilizing their structure. For 
some functional polymer materials, many of their 
physical properties depend on their structure at differ¬ 
ent hydration levels. Water has been studied in a wide 
range of biological materials (Ford and Fi 2007), from 
living cells (yeast, Escherichia coli, green algae, 
bovine heart muscle) to vegetable matter (beetroot 
tuber, spinach leaves) and DNA. Typical results are 
shown in Fig. 5 (left part) for gelatin (Wu et al. 2002), 
which is obtained by controlled hydrolysis of the pro¬ 
tein collagen. As the water content is increased, there is 
a continuous increase in intensity in the regions 
corresponding to the water translational 
(0-400 cm”*) and librational (400-1,200 cm”*). The 
difference spectra, shown in Fig. 5 (right part, colored), 
progressively evolve from a disordered form of ice 
[(22% water) - (Dry)] that resembles high density 
amorphous (HDA) ice to a form [(300% water) - 
(22% water)] that is very similar to ordinary crystalline 
ice, 4 . This suggests that the very first molecules of 
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Vibrational Spectroscopy with Neutrons, Fig. 5 Left-. INS 
spectra of gelatin as a function of hydration. Right'. Difference 
spectra of the water in gelatin showing a progression from a form 

water are adsorbed on the hydrophilic OH, NH, and 
CO groups of the biopolymer and further increase in 
hydration up to 100% water leads to the formation of 
a disordered form of water, similar to high density 
amorphous ice (Li 1996). Beyond this level, the 
water behaves as /j,. 

Bacteria have been shown to have a bound water 
ratio of ^0.5 g/g dry weight and a total cytoplasmic 
water content of ^2.3 g/g dry weight under normal 
conditions. This suggests that bound water corre¬ 
sponds to about 22% of the total cell water. This 
value is close to those determined hy INS measure¬ 
ments of cardiac tissue (28%) and yeast (24-31%). 
Interfacial water molecules are also observed in 
X-ray crystallography studies of proteins and other 
hiomolecules. These waters occupy discrete positions 
in the crystal lattice, i.e., they are not disordered hut 
cannot be considered as bulk water. The mass of bound 
water can vary, but typically is about 10-20% of the 
protein mass in the crystals. Such a level does not agree 
with the interfacial water levels observed by INS. Thus 
interfacial water will probably include both ordered 
water as well as disordered water in the hydration 
shells surrounding proteins. The conclusion from sev¬ 
eral studies is that the interfacial water is about 30% of 
total water in the cell and is likely to be highly signif¬ 
icant for cellular reactions and processes. The 



Wavenumber/cm 

that is similar to HDA ice to one that is very similar to ice 1^. 
Reproduced from Wu et al. (2002). Copyright (2002), with 
permission from Springer-Verlag 

work indicates that tissues still contain a significant 
fraction of hulk water that is indistinguishable in terms 
of its vibrational properties from ice (Ford and 
Li 2007). 

Summary 

Inelastic neutron scattering is a complementary vibra¬ 
tional spectroscopy technique. Proton motion is 
highlighted in the spectra, thus it emphasizes modes 
that are often weak or absent in infrared or Raman 
spectra, especially at low energy. It is particularly 
straightforward to generate INS spectra from empirical 
force field calculations or from ab initio work. Thus it 
enables a stringent test of the calculation. This is syn¬ 
ergistic, the calculation provides unambiguous assign¬ 
ments, having validated the calculation, properties 
derived from it such as electron distributions, bulk 
moduli or sound velocities can be used with some 
degree of confidence. 

The examples shown here indicate some of the 
range of applications. Studies of small molecules 
such as purine or alanine dipeptide provide infor¬ 
mation about the dynamics of the molecule and 
generate parameters for molecular mechanics calcu¬ 
lations of very large systems such as Staphylococcal 
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nuclease. At the other extreme, water is an essential 
component of all living matter but its interactions 
with complex materials such as proteins or poly¬ 
saccharides are still poorly understood. INS spec¬ 
troscopy has provided evidence that the water most 
strongly associated with the biopolymer is distinct 
from bulk water. 

Cross-References 

► Computational Studies of Enzyme Motions 

► Far-Infrared Spectroscopy of Proteins 

► Molecular Vibrations and Their Interaction with 
Electromagnetic Radiation 

► Quantum Mechanical Simulations of Biopolymer 
Vibrational Spectra 

► Raman Spectroscopy 

► Terahertz Spectroscopy: Solvation of Biomolecules 
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Synonyms 

History of infrared spectroscopy; History of Raman 
spectroscopy 

Definition 

This entry gives a personal account of the recent his¬ 
tory of vibrational spectroscopy of biological 
molecules. 

Why should one actually bother writing about the 
history of any scientific subject when new discoveries 
and developments are taking place at a breathtaking 
rate? It is because the history behind many scientific 
endeavors can provide the context for future discover¬ 
ies, emphasizing the fact that as a product of human 
activity, learning is a cumulative process. Indeed, 
learning more about the circumstances of how, when, 
and where these scientific discoveries were made helps 
us to be aware of the motivations that guided and drove 
the evolution of that subject matter. And yet not all 
discoveries are demand-driven as scientists wandering 
along the path of curiosity sometimes stumble across 
new findings simply by accident or serendipity; of 
course, these findings must then be recognized as true 
discoveries. A case in point is the discovery of the 
infrared radiation by Sir William Herschel in 1800. 

Vibrational spectroscopy, as it is practiced today, 
stands on two legs, ► infrared spectroscopy and 
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► Raman spectroscopy, each with its own history. It is 
not within the scope here to delve deeply into the 
history of the discovery of infrared radiation (interest¬ 
ing as it may be), nor to discuss work of the early 
pioneers Melloni, Nobili, or Langley; neither will it 
be attempted to revisit the discovery by Sir Raman in 
1928 of the effect named after him. A comprehensive 
review of the history of infrared spectroscopy has 
recently been published by Andreas Barth and Parvez 
Haris (Barth and Haris 2009) and makes excellent 
reading which is recommended highly to those inter¬ 
ested. The history of Raman spectroscopy also was 
reviewed by Derek Long (Long 1988), with a recent 
publication dedicated to the 80th anniversary of the 
discovery of the Raman effect (Long 2008). So, as the 
author is someone having practiced vibrational spec¬ 
troscopy for the last 50 years, it is intended here to put 
a personal spin on this and emphasize the more recent, 
modern-day history. 

If pressed to come up with a date on when modern- 
day vibrational spectroscopy began, one could say that 
it only came into its own in the middle of the last 
century with the seminal treaty by Gerhard Herzberg 
“Infrared and Raman Spectra of Polyatomic Mole¬ 
cules” (Herzberg 1945). This is the second volume in 
his seminal trilogy “Molecular Spectra and Molecular 
Structure,” a work often referred to as the “Bible” of 
molecular spectroscopy, that guided many students 
(including this author). For a life-long dedication to 
molecular spectroscopy, GH (this is how we affection¬ 
ately referred to him) was given the 1971 Nobel Prize. 
It is worthwhile pointing out here that a young 
Herzberg (just 25) personally met Chandrasekhara 
Venkata Raman in 1929 at the Faraday Society meet¬ 
ing on Molecular Spectra and Molecular Structure in 
Bristol. In his memoirs, GH remembers this “as a most 
memorable meeting largely because of the presence of 
Raman who spoke clearly and lucidly in beautiful 
English and his enthusiasm was infectious.” Raman 
was awarded the Nobel Prize the following year. 

Major technical advances in present-day vibrational 
spectroscopy were achieved in the 1960s and 1970s 
with the advent of laser Raman and Fourier transform 
infrared spectroscopy, technical developments on 
which there is no need to elaborate here. 

Much of contemporary vibrational spectroscopy 
carries the label “Made in Europe.” Shortly after the 
dreadful conflict of WWII ended, three wise men, 
Harold (Tommy) Thompson from Oxford/UK, Jean 


Lecompte from Paris/France, and Reinhard Mecke of 
Freiburg/Germany, took on the task of uniting and 
bringing together spectroscopists across Europe, 
a continent soon to be divided again along ideological 
lines. Their efforts led to the establishment of what 
later became known as the European Congress on 
Molecular Spectroscopy, in short the EUCMOS con¬ 
ference. The first EUCMOS conference was organized 
in 1951 by Ernst Miescher on neutral Swiss territory in 
Basel, followed every 2 years by meetings in different 
European cities. While the conference continued under 
its European name, it soon became a truly international 
event. The name Molecular Spectroscopy was also 
retained, though the vast majority of presentations 
were dedicated to infrared and Raman spectroscopic 
studies. On a more personal note, the author has to 
admit that the single most significant event in his 
career as vibrational spectroscopist was his good for¬ 
tune to attend the 7th EUCMOS conference, held in 
1963 in Budapest. The three wise men from Europe 
were there, as were other prominent practitioners of 
molecular spectroscopy from all over the world: 
Herzberg, Pimentel, Lord, Jones, Bernstein, 
Stammreich, Shimanouchi, Longuett-Higgins, Miller, 
Orville-Thomas, Sheppard, Mangini, Bellanato, 
Josien, Tsuboi, Hadzi, Lippincott, Morcillo, Connes, 
Zerbi, and many more - the list reads like a who is who 
in spectroscopy. It was not so much the science, but the 
fascinating personalities that swayed young partici¬ 
pants like the author to pursue a career in vibrational 
spectroscopy. These grandmasters were pioneers in 
one of the most exciting emerging fields of their time 
and their legacy has been embraced by many men and 
women of science who strive to continue the tradition. 
While other vibrational spectroscopists have also made 
a name for themselves, it might be too soon to look at 
these from the perspective of history. It would however 
be remiss of the author not to mention here the series 
Advances in Infrared and Raman Spectroscopy, 
volumes 1-12 (retitled to Advances in Spectroscopy 
from volume 13 onward), edited by Clark and Hester, 
published initially by Heyden & Son, London, and 
later by John Wiley & Sons, Chichester; they contain 
numerous relevant review articles on modern-day 
infrared and Raman spectroscopy. 

Over time both infrared and Raman spectroscopy 
shifted their focus from simple to more complex mol¬ 
ecules. And as more vibrational spectroscopists turned 
their attention to the world of biomolecules, the need to 
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create a forum where its practitioners could meet and 
share their experiences became apparent. An opportu¬ 
nity presented itself at the 1983 NATO Advanced 
Study Institute on the Spectroscopy of Biological Mol¬ 
ecules, a meeting organized by two good Canadians, 
Camille Sandorfy (born in Hungary) and Theo 
Theophanides (born in Greece). The success of this 
NATO summer school, held in Aquafredda di Maratea 
in Southern Italy, led to a follow-up meeting in 1985 in 
Reims, organized by Michel Manfait and Alain Alix. 
The famous Cathedral and the free-flowing champagne 
in Reims contributed to the success of this meeting 
which henceforth became known as the first 
European Conference on the Spectroscopy of Biolog¬ 
ical Molecules, now a biennial meeting that continues 
to this day. The published proceedings of these meet¬ 
ings bear witness to the evolution of the field and to the 
advancements made over the last 25 years in vibra¬ 
tional biospectroscopy. These days, more than half of 
all vibrational spectroscopic studies on biomolecules 
involve proteins, with fewer studies dedicated to lipids 
or nucleic acids and only very few dealing with carbo¬ 
hydrates (Mantsch 2001). Interestingly, not only has 
the focus of vibrational spectroscopy evolved, but so 
has the language of science; while at EUCMOS 1963 
in Budapest only half of the 250 presentations were in 
English (the others used Erench, German, or Russian), 
in today’s meetings, English is the universal lingua 
franca. 

It is a well-known fact that throughout their exis¬ 
tence, infrared and Raman spectroscopy competed for 
the attention of scientists interested in molecular struc¬ 
tures. Today both techniques have evolved to the point 
where they play important roles in a staggering variety 
of scientific endeavors, though for the readership of 
this encyclopedia, the relevant question must be: “How 
useful is vibrational spectroscopy for biophysics?” 
Hopefully, this question will be adequately answered 
by the various contributions that follow in this section. 

It must also be recognized that over the last 
25 years, the center of gravity for vibrational 
biospectroscopy has shifted away from Europe as 
more and more studies are now being carried out in 
North America, Australia, and other non-European 
countries. The annual meetings of the (American) Bio¬ 
physical Society bear witness to the many new appli¬ 
cations of infrared and Raman spectroscopy which are 
currently used to address a multitude of questions in 
biophysics. Meanwhile many biophysicists have 


turned their attention to biomedical issues and now 
apply their infrared and Raman spectroscopic tools to 
the study of animal and human cells and tissues. To 
accommodate these developments, a meeting dedi¬ 
cated uniquely to this subject was started in 2000 in 
Winnipeg under the name “Shedding Light on Dis¬ 
ease ” and since has become a cycle of biennial 
conferences. 

This modern-day history cannot be finished without 
mentioning the efforts that are being made to exploit 
vibrational spectroscopy for molecular imaging. While 
both infrared and Raman spectroscopy are already 
used to image individual cells and tissues, the great 
challenge faced by practitioners of vibrational spec¬ 
troscopy is to come up with viable biomedical appli¬ 
cations. As vibrational biospectroscopy attempts to 
create its own niche, such as in functional imaging of 
skin, it will take time and a great deal of imagination to 
catch up with the by now well-established field of 
magnetic resonance imaging. 

Cross-References 

► Fourier Transform Infrared Spectroscopy for 
Biophysical Applications; Technical Aspects 

► Infrared Spectroscopy of Cells, Tissues, and 
Biofluids 

► Infrared Spectroscopy of Protein Structure 

► Raman Spectroscopic Instrumentation, 
Experimental Considerations 

► Raman Spectroscopy and Microscopy of Cells and 
Tissues 
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Introduction 

channels come in several types with distinct 
characteristics that enable them to accomplish diverse 


biological tasks, and mutations in them may lead to 
genetic diseases. Arguably the best known type of 
channel is the voltage-gated (Ky) channel. 
Most Kv channels open or close, that is, are “gated,” 
in response to decreased or increased negativity of the 
transmembrane voltage, conditions called depolariza¬ 
tion and hyperpolarization, respectively. Passive net 
efflux through open Kv channels repolarizes cell 
membranes, producing the sharp descending phase of 
action potentials. Another major type of channel is 
the inward-rectiher (Kir) channel. Ki^ channels 
exhibit voltage sensitivity with polarity opposite to 
that of classical Kv channels. That is, their effective 
conductance increases with hyperpolarization but 
diminishes with depolarization, a behavior called 
anomalous or inward rectihcation. Their characteristic 
voltage sensitivity enables Kj^ channels both to help 
establish or regulate the negative resting membrane 
potential without impeding the generation of 
depolarizing action potentials, and to accelerate 
the descending phase of action potentials. Although 
Kv and Kj^ channels both exhibit voltage sensitivity, 
the underlying mechanisms are completely different. 

Voltage Sensing in Channels 

The open probability of the ion pore of Kv channels is 
controlled by membrane voltage. The mechanism 
underlying this process is commonly referred to as 
voltage(-dependent) gating. Biophysical studies of 
voltage-gated ion currents performed in the 1950s led 
to the prediction that movement of electric charges 
within the membrane underlies the observed voltage 
sensitivity (Hodgkin and Huxley 1952). At least four 
charged particles must move before a channel can 
conduct ions. These charges whose movement causes 
the channel’s open-closed transitions are termed 
“gating charges.” Movement of gating charges within 
the membrane generates a capacitive current called 
gating current, experimentally first recorded in the 
1970s (Armstrong and Bezanilla 1973). In principle, 
the observed gating current may reflect movement of 
gating charges within an electric field, or a change of 
electric field, or both. In any case the channel must 
have a sensor to sense voltage changes, and the sensor 
is predicted to be a charged protein element. The 
voltage-driven conformational change or motion of 
the voltage sensor needs to be coupled to the 
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Voltage-Dependent Gating 
of Channels: Structural 
Basis, Fig. 1 An early 
conceptual model for the role 
of S4 in voltage sensing. S4 
drawn after that of (Catterall 
1986) is a transmembrane 
helix that undergoes 
translational and rotational 
movement upon membrane 
voltage changes (AV). The 
positively charged residues of 
S4 are paired with some 
negative charges. The blue 
trace outlines S4’s 
surroundings. 




“mechanical” opening-closing process of the gate 
which in turn controls ion flow through the ion pore. 

Cloning of a voltage-gated Na"^ (Nav) channel in the 
1980s revealed that a protein segment with a-helical 
propensity, now commonly known as the fourth trans¬ 
membrane segment (S4), harbors a large number of the 
positively charged amino acid residues: arginine and 
lysine (Noda et al. 1984). The charged residues in S4 
reside at every third position and the intervening pairs 
of residues are generally hydrophobic. This striking 
feature led to the suggestion that S4 participates in 
voltage sensing. Hydrophobicity analysis would sug¬ 
gest that a segment as highly charged as S4 must be 
located outside the cell membrane. However, to func¬ 
tion as voltage sensor it should be located within the 
membrane. Initial conceptual molecular models of 
voltage sensing were thus proposed, in which S4 is 
a transmembrane helix that moves normal to the mem¬ 
brane plane and twists in response to voltage changes, 
and the positively charged residues in S4 are paired 
with some nearby negative charges (Catterall 1986) 
(Fig. 1). Molecular cloning of Nay and Ky channels 
made investigation of S4’s role in voltage gating 
possible. Early studies showed that mutations at not 
only S4’s charged residues but also its non-charged 
hydrophobic residues perturbed voltage gating. 

Investigations of the molecular and biophysical 
mechanism of voltage sensing and other features of 
Ky channels took off following the molecular cloning 
of the Shaker Ky channel from Drosophila (e.g., 
Tempel et al. 1987). A Ky channel consists of four 
identical or very similar subunits, each subunit 
comprising six transmembrane segments (S1-S6). 


Four copies of the peptide segment encompassing 
S5 through S6, one from each subunit, surround the 
ion conduction pore (see Fig. 2; further details follow). 
In contrast, channels have only two transmembrane 
segments; they correspond to the S5 and S6 helices of 
Ky channels and likewise form the transmembrane ion 
conduction pore. Comparison of these two types of K"^ 
channel led to the notion that Ky channels have 
a modular architecture: an ion conduction module 
surrounded by four voltage-sensing modules. The lat¬ 
ter’s gating function can be specifically perturbed by 
certain small proteinaceous toxins from venomous 
animals such as scorpions and spiders. The modular 
nature of the Ky channel architecture was experimen¬ 
tally demonstrated by replacing the pore-forming 
region of Shaker Ky channels with that from the bac¬ 
terial KcsA K"^ channel, a two-transmembrane-helix, 
non-voltage-gated type of K"^ channel. In other words, 
the Shaker channel voltage-sensing module can be 
spliced onto the pore of the bacterial KcsA K"^ channel 
to confer voltage gating onto the latter. Such inter¬ 
changeability implies that the voltage-sensing and 
ion-conduction modules share a relatively loose inter¬ 
face. The concept of a voltage-sensing module was 
further supported by the discovery of such a module 
in a voltage-gated phosphatase (Murata et al. 2005), 
and by the finding that the module by itself can behave 
as a voltage-gated proton channel. 

After replacing individual positively charged resi¬ 
dues in S4 with a cysteine residue, one at a time, one 
can determine how the accessibility of these cysteines 
to hydrophilic sulfhydryl reagents added to one or the 
other side of the membrane varies with membrane 
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Voltage-Dependent Gating 
of Channels: Structural 
Basis, Fig. 2 Crystal 
structure from an extracellular 
viewpoint of a chimeric Kv 
channel, where the paddle (the 
C-terminal half of S3, the S3- 
S4 linker and the N-terminal 
half of S4) of Kv 1.2 is replaced 
by thatofKv2.1 (PDB ID: 
2R9R). The four subunits are 
colored differently to reveal 
their structural relation. The 
purple sphere represents 
a ion 



voltage. The accessibility pattern reveals that the intra¬ 
cellular (C-terminal) end of S4 becomes exposed to the 
intracellular solution upon hyperpolarization, whereas 
its extracellular (N-terminal) end “sees” the extracel¬ 
lular solution upon depolarization (Yang et al. 1996). 
Such findings provided the initial evidence that S4 
moves with respect to the membrane following voltage 
changes. The kinetics of fluorescence intensity varia¬ 
tions of a fluorophore attached to a residue in S4 
approximately tracks those of gating current, further 
evidence for S4 being the voltage sensor (Mannuzzu 
et al. 1996). Other studies have emphasized the 
rotational aspect of the S4 motion. 

Experimental evidence supports a model where 
each of the four voltage sensors moves independently 
toward the activated state upon membrane depolariza¬ 
tion. These four independent gating transitions involve 
movement of the majority of gating charges. The 
voltage-gating apparatus then undergoes a concerted 
transition with much lower voltage sensitivity prior to 
the opening of the activation gate. The Shaker 
Kv channel exhibits very strong voltage sensitivity, 
equivalent to the transfer of more than 12 elementary 


charges across the entire electric field (Schoppa et al. 
1992). The movement of each S4 in a given channel 
must then cause the transfer of more than three charges 
across the transmembrane electric field. The first four 
positively charged residues (usually arginines) serve as 
the main voltage-sensing residues. To produce more 
than three effective gating charges these four residues 
would have to move, on average, across more than 
80% of the electric field. The field across the mem¬ 
brane thickness is not uniform, but is concentrated over 
a small fraction of it. Simultaneous substitution of 
histidine (which is titratable) at two adjacent positively 
charged residues in S4 yield a proton-conducting cur¬ 
rent, which suggests that the hydrophobic seal separat¬ 
ing the intercellular and extracellular compartments is 
very narrow (Starace and Bezanilla 2004). Implicitly, 
the space between mobile S4 and the region around it 
must be narrow enough to prevent ions from passing 
(Fig. 1). In fact, substituting smaller residues for the 
first arginine at the external end of S4 does create 
a pathway for cations. This design, a short constricted 
path between vestibules continuous with the intracel¬ 
lular or extracellular solutions, minimizes the distance 
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S4 must move to expose its voltage-sensing residues to 
a sufficiently large fraction of the electric field. 

The crystal structure of a bacterial Kv channel 
(KvAP) was determined in 2003, soon followed by 
those of native and mutant versions of eukaryotic 
Kyi. 2 channel (Long et al. 2007). These eukaryotic 
Ky structures appear to represent activated states. 
Numerous structural studies not only validate certain 
key mechanisms already postulated from functional 
studies, but also reveal otherwise unattainable detailed 
structural features that in turn trigger conceptual 
advances. Ky channel architecture is indeed modular 
with an ion-conduction module surrounded by 
four voltage-sensing modules (Fig. 2). Remarkably, 
each voltage-sensing module (S1-S4) adjoins the 
pore module-forming portion (S5-S6) of another 
subunit, not its own. This structural feature is consis¬ 
tent with an earlier finding that, in the activated state, 
a cysteine substituted near the extracellular end of S4 
in one subunit can form a disulfide bond with another 
one substituted near the extracellular end of S5 in 
a different subunit (Laine et al. 2003). This structural 
arrangement may provide the inter-subunit interac¬ 
tions underlying the concerted transition that precedes 
gate opening. 

In all crystal structures to date, S4 is not a 
free-standing helix but forms, together with the 
C-terminal half of S3 (termed S3b) and the interven¬ 
ing (S3-S4) linker, a hairpin- or paddle-shaped 
secondary structure termed the “voltage-sensor 
paddle” (Fig. 3). The S4 helix is exposed not only 
to the channel residues but also to membrane lipid 
molecules, as suggested by electron paramagnetic 
resonance studies (Cuello et al. 2004). Such exposure 
raises the issue of how S4’s charged residues are 
energetically stabilized. As speculated in the 1980s, 
these charged residues must be paired with 
countercharges. Specifically, in the activated state, 
the first two (as viewed from the extracellular side) 
voltage-sensing residues are located in a 
water-containing “crevice” and poised to interact 
with the phospho-head group of membrane lipids. 
The existence of a water crevice, anticipated by 
functional studies, is also supported by solid-state 
NMR evidence (Krepkiy et al. 2009). The third and 
fourth positively charged residues interact with a set 
of negatively charged residues (termed the external 
negative cluster), whereas the fifth and sixth 
ones interact with another, internal negative cluster. 



Voltage-Dependent Gating of Channels: Structural 
Basis, Fig. 3 Crystal structure of a single subunit of the 
Kyi.2-2.1 chimera in an open state (PDB ID: 2R9R). The ion 
conduction pore module is colored green and the voltage¬ 
sensing module cyan, except for the paddle motif which is 
colored blue. The four residues in S4 that correspond to the 
main voltage-sensing residues in Shaker are shown as sticks. 
The S4—S5 linker, colored orange, is poised to push onto S6 
(the inner longer pore helix) to close the pore 

Functional studies have implicated some of 
these negatively charged residues in pairing with 
the positively charged residues. While the first three 
positively charged residues are in an a helix, the next 
three are in a 3io helix so that the side chains of the 
fifth and sixth points in the same angular direction 
with respect to the helix axis interact with the inter¬ 
nal negative cluster. 

The above structural features, and other functional 
studies, led to the following proposal regarding S4’s 
transition from activated to deactivated state, as 
triggered by membrane hyperpolarization. While S4 
moves toward the intracellular side, its N-terminal, 
previously ot-helical region, turns into a (tighter) 3io 
helix and assumes the position previously occupied in 
the activated state by its C-terminal, 3io-helical region. 
Consequently, the first and second positively charged 
residues would replace the fifth and sixth to interact 
with the internal negative cluster. By implication, the 
N-terminal half of S4 must not interact with S3b with 
such high complementarity that it is prevented from 
altering its secondary structure. And indeed, the chan¬ 
nels remain well voltage gated even after deletion of 
individual residue triplets (helical turns) from S3b or 
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from the N-terminal half of S4 that harbors the four key 
voltage-sensing residues. In fact, only the intracellular 
end of the paddle, comprising fewer than ten residues 
including the third and fourth voltage-sensing residues, 
is essential for basic voltage sensing (Xu et al. 2010). 
These two positively charged residues allow the chan¬ 
nel to be closed by physiological hyperpolarization. 
They presumably interact with the external and inter¬ 
nal negative clusters to help stabilize the voltage sen¬ 
sor in the activated or deactivated state, respectively. 
The remaining bulk of the paddle functions as 
a “modulator,” conferring specific gating features 
upon individual channel types. Consequently, the pad¬ 
dle motif in a Kv channel can be functionally replaced 
by its counterpart from another voltage-gated channel 
(or enzyme) or even from a ligand-gated channel with 
similar architecture, provided the key voltage-sensing 
residues are preserved. 

The crystal structure of the Kyi.2 channel further 
offers clues to what separates intracellular from extra¬ 
cellular solutions. In it, a phenylalanine side chain 
midway S2 extends toward S4 near the level of the 
fifth positively charged residue. This structural pecu¬ 
liarity led to the notion that this side chain forms part of 
a short hydrophobic region that electrically insulates 
the intracellular solution from the extracellular one. 
In the activated state, all four main voltage-sensing 
residues are located extracellular to the phenylalanine 
ring and exposed to a relatively hydrophilic environ¬ 
ment. Hyperpolarization would then drive most or all 
of them to the intracellular side of the phenylalanine 
residues to assume the deactivated conformation. If 
such a mechanism holds, the N-terminal half of S4 is 
likely to reside in a more hydrophobic environment in 
the deactivated state than in the activated state. Indeed, 
substituting more hydrophobic residues for those 
separating the four main voltage-sensing residues ener¬ 
getically favors the deactivated state (Xu et al. 2010). 
It appears as if the hydrophobic residues between 
the voltage-sensing ones help keep S4 poised between 
the activated and the deactivated states, whereas 
physiological hyperpolarization provides the necessary 
additional energy to bias the equilibrium toward the 
deactivated state. 

Many lipid molecules are present in and around the 
voltage-sensing module in the crystal structure (Long 
et al. 2007). That the phospho-head group of mem¬ 
brane lipids is important for Ky channel gating is 
underscored by the demonstration that Ky channels 


reconstituted in artificial membranes exhibit voltage¬ 
gated current only if the artificial membrane is made of 
phospholipids. Additionally, in a biological mem¬ 
brane, removal of the head group of the phospholipid 
sphingomyelin in the outer leaflet with one type of 
sphingomyelinase prevents the voltage sensors of Ky 
channels from entering the activated state (Xu et al. 
2008). However, if with another type of 
sphingomyelinase only the choline and not the phos- 
phoryl group is removed, the channels become acti¬ 
vated at hyperpolarized potentials where they 
otherwise remain deactivated (Xu et al. 2008). 
It appears that removal of the positively charged 
choline group from lipid molecules in the outer 
membrane leaflet makes it energetically easier for the 
positively charged S4 to move in the extracellular 
direction to assume the activated conformation. 
These findings suggest ways in which voltage-gated 
ion channels can be turned on or off experimentally, 
and possibly naturally, by means other than changing 
the membrane voltage. 

Gates of Kv Channels 

In the absence of membrane voltage, the voltage sensor 
of most Ky channels is in the activated state, and the gate 
it controls, commonly called the activation gate, is open. 
Hyperpolarization forces the voltage sensor to enter the 
deactivated state and thereby closes the activation gate. 
The activation gate is located near the intracellular end of 
the ion conduction pore, a conclusion reached in the 
1970s on the basis of the observation that a quaternary- 
ammonium ion, applied to the intracellular solution, 
becomes trapped in the pore following membrane hyper¬ 
polarization (Armstrong 1971) (Fig. 4, left panel). Muta¬ 
genesis studies performed in the 1990s led to the notion 
that the C-terminal part of S6 lines the intracellular end of 
the ion conduction pore. Cysteine substituted at sites near 
the C-terminal, intracellular end of S6 is accessible to 
intracellularly applied sulfhydryl reagents, whereas when 
substituted at more N-terminal or extracellular sites it is 
accessible only when the gate is open (Liu et al. 1997). 
While these findings locate the gate to S6, other studies 
suggest how S6 gates the pore. Furthermore, comparison 
of the crystal structure of the closed state of KcsA with 
that of the open state of another bacterial channel, 
MthK (Fig. 5), strongly implies that the intracellular 
part of S6 swings on a “hinge” away from or toward 
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Voltage-Dependent Gating of K'^ Channels: Structural 
Basis, Fig. 4 Early conceptual models about activation and 
inactivation gates. Left. The cartoon illustrates the observation 
that a TEA ion becomes trapped in the ion conduction pore by 
the activation gate located at the intracellular end of the pore. 
Right'. Illustration of the idea that the inactivation ball (gate), 
which is attached to the channel via a “chain,” occludes the pore 
from the intracellular side 


the central axis, respectively, to open or close the 
pore. In these channels, the hinge appears to be 
a glycine residue. 

It is noteworthy that some Kv channels have 
additional gates whose action causes inactivation. 
Once inactivated, Kv channels no longer conduct ions 
even when depolarized. There are at least two types of 
inactivation: The first type works by a ball-and-chain- 
like mechanism (Fig. 4, right panel) where the 
N-terminal end of the Ky channel, or that of its auxiliary 
P subunit, acts as a “ball” to occlude the ion 
conduction pore from the intracellular side, provided 
the activation gate is open. The ball-and-chain 
hypothesis was originally proposed to account for inac¬ 
tivation of Nav channels (Armstrong et al. 1973). The 
N-terminus-based inactivation of Kv channels is often 
called N-type inactivation (Hoshi et al. 1990). A second 
type of inactivation is independent of the 
N-terminus and is called C-type inactivation. Functional 
and structural studies support the notion that C-type 
inactivation is related to conformational changes in 
the extracellular, K'^-selective part of the ion 
conduction pore. 


Coupling Between Voltage Sensor and 
Activation Gate 

To confer voltage sensitivity upon Ky channel conduc¬ 
tance, the conformational state of voltage sensors must 
be coupled to that of the activation gate. The move¬ 
ment of gating charges is so tightly coupled to the 
channel’s gating transitions that the activation gate is 
practically under obligatory control of the voltage 
sensors. Several studies have adduced evidence for 
the following hypothesis regarding how this is accom¬ 
plished. Characteristics of the voltage sensors and the 
activation gate can be investigated by examining gat¬ 
ing and/or ionic currents. However, the coupling itself 
cannot be readily probed experimentally. Fortunately, 
the ability to use the voltage-sensing module of the 
Shaker channel to control the activation gate of the 
KcsA channel afforded a unique opportunity to iden¬ 
tify and delineate the requisite peptide sequences for 
functional coupling between the voltage-sensing and 
ion-conduction modules. In the construct consisting of 
the voltage-sensing module of a Ky channel and the 
conduction module of KcsA, two complementary 
sequences from the Shaker channel are required to 
confer full-voltage gating upon the chimeric channel 
(Lu et al. 2002). One sequence, about ten residues long, 
is within the so-called S4-S5 linker that trails the 
voltage-sensing S4. The other sequence, about equally 
long, encompasses the C-terminal end of S6 plus the 
beginning of the C-terminus, part of which forms the 
actual activation gate. This and other hndings led to the 
following proposal regarding the crux of the coupling 
mechanism: As membrane hyperpolarization causes 
S4 to descend inwardly and rotate, the S4-S5 linker 
“pushes” onto the sequence containing the gate¬ 
forming residues to close the gate. The open-state 
crystal structure of Kyi.2 shows how the former 
sequence is poised to push onto the latter (Long et al. 
2007) (Fig. 3). It is noteworthy that some plant Ky 
channels exhibit reversed coupling such that depolari¬ 
zation closes, and hyperpolarization opens them 
(Latorre et al. 2003). 

Voltage Sensitivity of Kjr Channels 

Kjf channels, unlike Ky channels, have no voltage¬ 
sensing domains. Their voltage sensitivity derives 
from voltage-dependent occlusion of the ion conduction 
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Voltage-Dependent Gating of K'^ Channels: Structural 
Basis, Fig. 5 Comparison of crystal structures of KcsA and 
MthK channels (PDB ID: 1K4D and ILNQ). Two diagonally 
opposite subunits are shown for each channel; the extracellular 
side is on top. The inner helix corresponds to M2 or S6 in Kj, or 


Kv channels. In the KcsA channel (left) the inner helices cross, 
thereby closing the ion pore. In the MthK channel (right) they 
move away from the central axis, opening the pore. Glycine 
residues within the shown inner helices that act as hinges in 
these channels are colored red 


pore by intracellular cationic blockers such as and 
polyamines. The voltage sensitivity associated with the 
long tetravalent polyamine spermine may be as strong 
as that resulting from transfer of about five charges 
across the electric field, that is, a valence of five. 
In principle, voltage sensitivity of pore block may 
arise either from the charged blocker moving across 
the electric field and/or from “displacement” of 
permeant ions by a blocker across the electric field. In 
the case of block by spermine, the valence of about five 
primarily reflects spermine displacing five ions 
across the electric field that drops steeply along the 
narrow ion selectivity filter, while spermine itself 
encounters little of the field. 

The transmembrane portion of the ion conduction 
pore of all channels has the same general architec¬ 
ture. The extracellular, narrow, selective part of the 
pore is lined by the so-called signature sequence within 
the linker between the two a helices that form the ion 
conduction module (labeled Ml and M2 in chan¬ 
nels, and S5 and S6 in Kv channels). This filter region 
contains four ion sites, of which the first and third, or 
the second and fourth, are alternately occupied hy two 
ions (Fig. 6). The remaining, intracellular part of 
the pore is lined by the inner pore-forming helix, which 


in Kv channels would be S6. The wide part of the pore 
immediately intracellular to the selectivity filter 
contains another ion site. In Kj^ channels, the cytoplas¬ 
mic termini form a wide aqueous pore that extends the 
transmembrane pore into the cytoplasm and increases 
its capacity to harbor ions. There are two ion-binding 
sites at the junction between the transmemhrane and 
the cytoplasmic parts of the pore, and four additional 
sites in the cytoplasmic pore. Of the eleven available 
ion sites, no more than nine may be simultaneously 
occupied by K"^. 

Four loops near the cytoplasmic end of the pore, 
one from each subunit, constrict the pore to about 
10 A. As the leading end of spermine reaches this 
site and displaces any ion occupying it the pore 
becomes blocked. Spermine penetrates further into 
the pore and eventually reaches the cavity, 
displacing as many as five K"^ ions across the selec¬ 
tivity filter to produce voltage sensitivity. This sen¬ 
sitivity increases with depolarization, as outward 
movement of K"^ ions across the filter is favored. 
The constricting loops near the cytoplasmic end of 
the pore, mentioned above, serve as a gasket to 
prevent K"^ ions present in the pore from bypassing 
spermine and leaking back into the intracellular 
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Voltage-Dependent Gating 
of Channels: Structural 
Basis, Fig. 6 Crystal 
structure of a chimeric Kj, 
channel (PDB ID: 2KQS). 
Only two diagonal subunits 
are shown, and the 
extracellular side is on top. In 
the chimera, the cytoplasmic 
pore of a bacterial Kj, channel 
is replaced by that of 
eukaryotic Ki,3.1. The spheres 
indicate the location of eleven 
ion sites (Sl-Sl 1) which have 
been observed in Kj, structures 
(PDB ID: 2KQS and 3K6N) 



solution. The gasket, therefore, maximizes the num¬ 
ber of displaceable ions and hence voltage 
sensitivity. 

Summary 

In Kv channels, the paddle-shaped motif effectively 
consists of two parts: a minimal core essential for 
voltage gating per se, and a comparatively much 
larger, modulatory part. The core, which contains two 
of the four main voltage-sensing arginines in S4, suf¬ 
fices to produce voltage gating in the physiological 
voltage range. The modulatory part confers certain 
specialized gating characteristics upon individual 
Kv subtypes. While the hydrophobic residue pairs 
that separate the voltage-sensing residues help keep 
the voltage sensor poised between the activated 
and deactivated states, hyperpolarization apparently 
provides the necessary additional energy for the 
voltage-sensing part of S4 to tilt, twist, and/or move 
inwardly from a (water-exposed) hydrophilic environ¬ 
ment to a (protein- and lipid-lined and on average) 
more hydrophobic region, a motion that is then prop¬ 
agated through the S4-S5 linker to S6, closing the 
channel gate. 


In Kir channels, the following principles effectively 
confer strong voltage sensitivity (Xu et al. 2009): First, 
an intracellular cationic blocker and several extracel¬ 
lular permeant (K"^) ions (electrostatically) displace 
one another and/or bind competitively to the inner 
pore from opposite sides. This process energetically 
couples blocker binding to the movement of K"^ ions. 
Second, as the blocker traverses the relatively wide 
inner pore, K"^ ions move through the narrow selec¬ 
tivity filter across which the electric field drops 
steeply. Therefore, the movement of K"^ ions, not of 
the blocker, is primarily voltage dependent. Third, the 
number of K"^ ions that reside in the inner pore sets an 
upper bound to the steepness of the voltage depen¬ 
dence. Fourth, a narrowing near the intracellular end 
of the pore functions as a gasket to prevent K"^ ions 
from bypassing the blocker and escaping into the 
intracellular solution. This structural constraint 
underlies the nearly obligatory coupling between 
intracellular blocker binding and outward K"^ ion 
movement in Kj^ channels, by which the blocker¬ 
binding reaction acquires strong voltage sensitivity. 
While the voltage sensitivity of Kv channels is intrin¬ 
sic, arising from the movement of their voltage sen¬ 
sors with respect to the electric field, the voltage 
sensitivity of Kj^ channels should be termed 
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“extrinsic” because it primarily derives from the dis¬ 
placement, by a cytoplasmic blocker, of ions 
across the electric held. 
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Synonyms 

Sodium channels 


Introduction 

Electrical signals control contraction of muscle, secretion 
of hormones, sensation of the environment, processing of 
information in the brain, and output from the brain to 
peripheral tissues (Hille 2001). In excitable cells, 
electrical signals also have an important influence on 
intracellular metabolism and signal transduction, gene 
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expression, protein synthesis and targeting, and protein 
degradation. In all of these cellular contexts, electrical 
signals are initiated hy voltage-gated sodium channels, 
and the resulting intracellular regulatory events are 
initiated by calcium entry through voltage-gated calcium 
channels. 

Voltage-Gated Sodium and Calcium Channel 
Proteins 

The ion channel superfamily is composed of eight major 
families of channels that have a common pore architec¬ 
ture, including voltage-gated sodium and calcium 
channels; voltage-gated, inwardly rectifying and 
two-pore-motif potassium channels; transient receptor 
channels; cyclic nucleotide-gated channels; and hyper¬ 
polarization activated, cyclic nucleotide-gated ion 
channels (Yu and Catterall 2004). The founding member 
of this superfamily in terms of its discovery as a separate 
protein is the voltage-gated sodium channel. Neurotoxin 
labeling, purification, and functional reconstitution 
showed that sodium channels from mammalian brain 
contain voltage-sensing and pore-forming elements in a 
single protein complex of one principal a subunit of 
220-260 kDa and one or two auxiliary P subunits of 
approximately 36 kDa (Catterall 2000a). Na"^ channel 
proteins in mammalian brain are complexes of a 
260 kD a-subunit in association with one or two auxiliary 
P-subunits of 33^0 kD in size. The primary sequence 
predicts that the sodium channel a-subunit folds into four 
internally repeated domains (I-IV), each of which 
contains six a-helical transmembrane segments (S1-S6) 
(Fig. la). In each domain, the SI through S4 segments 
serve as the voltage-sensing module, and the S5 and S 6 
segments and the reentrant P loop that folds into the 
transmembrane region of the channel between them 
serve as the pore-forming module. A large extracellular 
loop connects the S5 or S 6 transmembrane segments 
to the P loop in each domain, whereas the other 
extracellular loops are small. Large intracellular loops 
link the four homologous domains, and the large 
N-terminal and C-terminal domains also contribute 
substantially to the mass of the internal face of sodium 
channels. 

The voltage-gated Ca^"^ channel protein was 
discovered by a similar strategy using radiolabeled Ca^"^ 
channel-blocking drugs as molecular probes (Striessnig 
1999; Catterall 2000b). The voltage-gated Ca^"^ channels 


that have been characterized biochemically are complex 
proteins composed of four or five distinct subunits, which 
are encoded by multiple genes (Fig. lb). The aj subunit 
of 190-250 kDa is the largest subunit, and it incorporates 
the conduction pore, the voltage sensors and gating 
apparatus, and most of the known sites of channel regu¬ 
lation by second messengers, drugs, and toxins. Like the 
sodium channel a subunit, the ai subunit of calcium 
channels is composed of about 2,000 amino acid residues 
organized in four homologous domains (I-IV) with six 
transmembrane segments (S1-S6) in each. The SI 
through S4 segments serve as the voltage sensor module, 
whereas transmembrane segments S5 and S 6 in 
each domain and the P loop between them form the 
pore module. 

The ai subunits of calcium channels are associated 
with four distinct auxiliary protein subunits (Fig. lb; 
Striessnig 1999; Catterall 2000b). The intracellular (3 
subunit is a hydrophilic protein of 50-65 kDa. The 
transmembrane, disulfide-linked a 28 subunit complex 
is encoded by a single gene (see below) but the 
resulting pre-polypeptide is posttranslationally 
cleaved and disulfide bonded to yield the mature 
covalently linked 0.2 and 5 subunits, and recent 
evidence indicates that the 5 subunit is further 
processed by proteolytic cleavage near its C-terminus 
and attachment to the membrane by 
a glycophosphatidylinositol anchor (Bauer et al. 
2010). A y subunit having four transmembrane seg¬ 
ments is a component of skeletal muscle Ca^"^ chan¬ 
nels, and related subunits are expressed in heart and 
brain. Although these four auxiliary subunits modulate 
the functional properties of the Ca^"^ channel 
complex, the pharmacological and electrophysiologi- 
cal diversity of Ca^"^ channels arises primarily from the 
existence of multiple ai subunits. 

Sodium Channel Genes 

In evolution, the four-domain sodium and calcium chan¬ 
nels were last among the voltage-gated ion channels to 
appear and are only found in multicellular organisms 
(Hille 2001). It is thought that the eukaryotic four- 
domain sodium and calcium channels evolved by two 
rounds of gene duplication from ancestral single-domain 
bacterial sodium channels like NaChBac found in 
Baccillus halodurans which have characteristics of 
both eukaryotic sodium and calcium channels. 
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Voltage-Gated Sodium and Calcium Channels, 
Fig. 1 Transmemhrane organization of sodium and calcium 
channel subunits, (a) The primary structures of the suhunits of 
the voltage-gated ion channels are illustrated as transmemhrane 
folding diagrams. Cylinders represent alpha helical segments. 
Bold lines represent the polypeptide chains of each subunit with 
length approximately proportional to the number of amino acid 
residues. The extracellular domains of the (11 and (32 subunits are 
shown as immunoglobulin-like folds. T*, sites of probable 
N-linked glycosylation; P, sites of demonstrated protein 
phosphorylation by PKA (circles) and PKC (diamonds)', white 
circles, the outer (EEDD) and inner (DEKA) rings of amino 
residues that form the ion selectivity filter and the tetrodotoxin- 


binding site; -H-, S4 voltage sensors; h in a blue circle, inactiva¬ 
tion particle in the inactivation gate loop; open blue circles, sites 
implicated in forming the inactivation gate receptor. Transmem¬ 
brane segments are color-coded as in the structural model in 
Fig. 3: bluelgreen, voltage-sensing module; yellow/orange/red, 
pore-forming module. Sites of binding of a- and P-scorpion 
toxins and a site of interaction between a and pi subunits are 
also shown, (b) The subunit composition and structure of Ca^"^ 
channels purified from skeletal muscle are illustrated in the same 
format as in panel A. Predicted helices are depicted as cylinders. 
The lengths of lines correspond approximately to the lengths of 
the polypeptide segments represented 


Voltage-gated sodium channel genes are present in 
a variety of metazoan species including fly, leech, 
squid, and jellyfish. The biophysical properties, 
pharmacology, gene organization, and even intron-splice 
sites of these invertebrate sodium channels are largely 
similar to the mammalian sodium channels, providing 
support for the idea that the primordial sodium channel 
was established before the evolution of vertebrates. 

Ten related sodium channel genes are found in 
mammals (Fig. 2a), and each has more than 20 exons 


(Goldin 2001). Voltage-gated sodium channels have 
primarily a single physiological role - initiation and 
conduction of action potentials. However, the sodium 
channel subtypes are differentially expressed in tissues 
and differentially localized within individual cells. 
Navl.3 is highly expressed in fetal nervous tissues, 
whereas Nay 1.1, Nay 1.2, and Na„1.6 are abundant in 
juvenile and adult CNS. Generally, Nayl.l and Nayl.3 
are localized to neuronal cell bodies and initial segments 
where they may control the neuronal excitability through 
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NaJ.4 

NaJ.6 

Na^1.5 

Na„1.8 

NaJ.9 



Amino Acid Sequence Identity (%) 


75 80 85 90 95 100 

Amino Acid Sequence Identity (%) 

Voltage-Gated Sodium and Calcium Channels, 
Fig. 2 Amino acid sequence similarity of voltage-gated sodium 
and calcium channel subunits, (a) Molecular relationships 
sodium channel sequences Navl.l-Navl.9. The amino acid 
sequences for all of the isoforms were aligned using Clustal W. 

integration of synaptic impulses to set the threshold for 
action potential propagation to the dendritic and axonal 
compartments. Navl.2 is localized in unmyelinated 
axons. During development, Navi.6 replaces Navl.2 in 
maturing nodes of Ranvier in myelinated axons. Navl.l, 
Navl.2, and Nay 1.6 are also expressed in the peripheral 
nervous system. However, the three isoforms that have 
been cloned from sympathetic and dorsal root ganglion 
neurons, Navl.7, Nayl.S, and Navi.9, are more abundant 
in the peripheral nervous system. The Navl.7 channel is 
localized in axons, where it functions in initiating and 
conducting the action potential. More restricted expres¬ 
sion patterns are observed for Navi.8 and Navl.9; these 
channels are highly expressed in small sensory neurons 
of dorsal root ganglia and trigeminal ganglia where they 
have a key role in nociception. Nav 1.4 and Nay 1.5 are the 
primary muscle sodium channels that control the excit¬ 
ability of the skeletal and cardiac myocytes, respectively. 
Nay 1.5 is transiently expressed in developing skeletal 
muscle but is replaced by Nay 1.4 in the adult. 

The NavP subunits are encoded by four distinct genes 
in mammals (Catterall 2000a; Patino and Isom 2010). 
All four P subunits have a single transmembrane 
segment, a large N-terminal extracellular domain and 
a small intracellular C-terminal domain (Fig. la). The 
NavP subunits are dual-function proteins (Catterall 
2000a; Patino and Isom 2010). Co-expression of NavP 
subunits alters the kinetics and voltage dependence of 
activation and inactivation of sodium channel 
complexes. Moreover, their extracellular domain forms 


Divergent portions of the terminal regions and the cytoplasmic 
loops between domains I-II and II-III were excluded from the 
analysis, (b) Molecular relationships of Cay chamrels presented 
as in panel A 

an immunoglobulin-like fold, similar to many cell 
adhesion molecules, and the P subunits are involved in 
interactions of sodium channels with proteins in the 
extracellular matrix, with other cell adhesion molecules, 
and with intracellular regulatory proteins like protein 
kinases and phosphoprotein phosphatases. The p 
subunits associate broadly with multiple a subunits. 
The importance of the functions of the Nayp subunits 
is underscored by the strong effects of deletion of the 
genes encoding NayPl and Navp2, which include 
failure of myelination and axonal conduction, epileptic 
seizures, and premature death. 

Diversity of Calcium Currents in Excitable 
Cells 

In contrast to sodium chatmels, ionic currents conducted 
by Ca^"^ channels in different cell types have diverse 
physiological roles and pharmacological properties, 
and an alphabetical nomenclature has evolved for the 
distinct classes of Ca^"^ currents (Table 1; Catterall et al. 
2005b). L-type Ca^"^ currents typically require 
a strong depolarization for activation, have slow 
voltage-dependent inactivation, and are blocked by 
the organic L-type Ca^"^ channel antagonists, 
including drhydropyridmes, phenylalkylamines, and 
benzothiazepines. N-type, P/Q-type, and R-type Ca^"^ 
currents also require strong depolarization for activation. 
They are relatively unaffected by L-type Ca^"^ channel 
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Voltage-Gated Sodium and Calcium Channels, Table 1 Physiological function and pharmacology of calcium channels 


Channel 

Current 

Localization antagonists 

Specific 

Cellular functions 

Cavl.l 

L 

Skeletal muscle transverse tubules 

Dihydropyridines, 

phenylalkylamines, 

benzothiazepines 

Excitation-contraction coupling 

Cavl.2 

L 

Cardiac and smooth muscle myocytes, 
endocrine cells, neuronal cell bodies, and 
proximal dendrites 

Dihydropyridines, 

phenylalkylamines, 

benzothiazepines 

Excitation-contraction coupling, hormone 
release, regulation of transcription, synaptic 
integration 

Cavl.3 

L 

Endocrine cells, neuronal cell bodies and 
dendrites; Cardiac atrial myocytes and 
pacemaker cells; 

Cochlear hair cells 

Dihydropyridines, 

phenylalkylamines, 

benzothiazepines 

Hormone release, regulation of transcription, 
synaptic integration, cardiac pacemaking, 
hearing; neurotransmitter release from 
ribbon synapses in sensory cells 

Cavl.4 

L 

Retinal rod and bipolar cells, 
lymphocytes 

Not established 

Neurotransmitter release from ribbon 
synapses in photoreceptors 

Cav2.1 

P/Q 

Nerve terminals and dendrites, 
neuroendocrine cells 

ta-agatoxin IVA 

Neurotransmitter release, dendrites, Ca^"^ 
transients, hormone release 

Cav2.2 

N 

Nerve terminals and dendrites, 
neuroendocrine cells 

co-CTx-GVIA 

Neurotransmitter release, dendritic Ca^"^ 
transients, hormone release 

Ca.Y2.3 

R 

Neuronal cell bodies and dendrites 

SNX-482 

Repetitive firing, dendritic calcium 
transients 

Ca.v3-1 

T 

Neuronal cell bodies and dendrites, 
cardiac and smooth muscle myocytes 

None 

Pacemaking, repetitive firing 

Ca.v3.2 

T 

Neuronal cell bodies and dendrites, 
cardiac and smooth muscle myocytes 

None 

Pacemaking, repetitive firing 

Cav3.3 

T 

Neuronal cell bodies and dendrites 

None 

Pacemaking, repetitive firing 


antagonist drags but are blocked by specific polypeptide 
toxins from snail and spider venoms (Table 1). T-type 
Ca^"^ currents are activated by weak depolarization and 
are transient due to rapid voltage-dependent inactivation. 
They are resistant to both organic antagonists and to 
most snake and spider toxins used to define the L-, N-, 
P/Q-, and R-type Ca^"^ currents (Table 1). 


Calcium Channel Genes 

Ca^"^ channel ai subunits are encoded by ten distinct 
genes in mammals, which fall into three subfamilies 
distinguished by amino acid sequence, physiological 
function, and pharmacology (Fig. 2b). Each gene has 
more than 40 exons. The Cayl subfamily (Cay 1.1 to 
Cay 1.4) conduct L-type currents (Table 1; 

Catterall et al. 2005b). The Cay2 subfamily (Cay2.1 
to Cay2.3) conduct P/Q-type, N-type, and R-type Ca^"^ 
currents, respectively (Table 1). The Cay3 subfamily 
(Cay3.1 to Cay3.3) conducts T-type Ca^"^ currents. 
The amino acid sequences of these (Xi subunits are 
more than 70% identical within a subfamily but less 


than 40% identical among the three subfamilies. These 
family relationships are illustrated for the more con¬ 
served transmembrane and pore domains in Fig. 2. 
Division of Ca^"^ channels into these three subfamilies 
is phylogenetically ancient, as single representatives of 
each are found in the Caenorhahditis elegans genome. 

Each of the auxiliary subunits of Ca^"^ channels 
(Fig. lb) is encoded by a small gene family. Four genes 
encode a2§ subunits, four genes encode (3 subunits, and 
eight genes encode y subunits (Arikkath and Campbell 
2003; Klugbauer et al. 2003). The specificity of the 
assembly of Ca^"^ channel auxiliary subunits with the 
principal, pore-forming al subunits is not fully deter¬ 
mined. However, in neurons, it appears that multiple 
subtypes of auxiliary subunits can assemble successfully 
with al subunits. Therefore, it is likely that control of 
expression and localization of the principal and auxiliary 
subunits determines the subunit partners present in Ca^"^ 
channel complexes. Although the Cayyl subunit is asso¬ 
ciated specifically with skeletal muscle Cay 1.1 channels, 
the other related y subunits interact with other Ca^"^ 
channels, glutamate receptors, and possibly other mem¬ 
brane signaling proteins (Nicoll et al. 2006). Thus, the y 
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subunits discovered as auxiliary subunits of Ca^"^ chan¬ 
nels have a more widespread role in assembly and cell 
surface expression of other membrane signaling proteins. 

Physiological Roles of Calcium Channels 

Unlike sodium channels, which primarily initiate 
action potentials, Ca^"^ channels have diverse roles in 
cells because they both contribute to electrical events 
during action potential generation and conduction at 
the cell surface and conduct Ca^"^ into the cell to serve 
as a chemical messenger and initiate many intracellu¬ 
lar events (Hille 2001). 

Ionic current carried by Ca^"^ ions entering neurons 
through voltage-gated Ca^"^ channels helps to shape the 
action potential and has an important role in determin¬ 
ing the frequency of firing of action potentials (Hille 
2001). Depolarization of neurons by activation of 
sodium channels during the action potential activates 
Cayl and Cav2 channels. The depolarization caused by 
Ca^"^ current flowing inward through voltage-gated 
Cayl and Cav2 channels broadens the action potential 
and modulates the shape of the repolarizing phase of the 
action potential in cardiac myocytes, endocrine cells, 
and in the dendrites, cell bodies, and nerve terminals of 
neurons. CayS channels conduct T-type Ca^"^ currents 
that activate in the same potential range or even more 
negatively than sodium channels and thereby influence 
the threshold and frequency of firing of neurons (Perez- 
Reyes 2003). These Ca^"^ channels are prominent in 
repetitively firing cells, where they are activated follow¬ 
ing repolarization from one action potential and boost 
the membrane potential to the threshold for activating 
sodium channels and firing the next action potential. 
This function of CayS channels is important in the 
sinoatrial node of the heart and in the thalamus, where 
repetitive firing of neurons is required for determining 
sleep and wakefulness, as well as controlling other 
aspects of cortical function. 

In addition to their role in electrical signaling, 
Ca^"^ channels are also the key transducers of 
electrical signals (Hille 2001). They convert 
depolarization of the cell surface membrane into 
a transient local increase in intracellular Ca^"^, 
which serves as a second messenger to initiate 
many physiological events including muscle 
contraction, endocrine secretion, synaptic integration, 
synaptic transmission, and gene regulation in neurons 


(Catterall et al. 2005b). All three subfamilies of Ca^"^ 
channels participate in Ca^"^ signaling in these 
diverse physiological contexts. 

L-type Ca^"^ currents conducted by Cavl.2 chan¬ 
nels are the main Ca^"^ currents in cardiac and 
smooth muscle cells, where they initiate contraction 
(Hofmann et al. 1999; Striessnig 1999). In skeletal 
muscle. Cay 1.1 channels initiate excitation- 
contraction coupling by direct physical interaction 
with ryanodine-sensitive Ca^"^ channels in the sarco¬ 
plasmic reticulum, which open in response to the 
voltage-driven conformational change in Cay 1.1 
channels and release Ca^"^ from the intracellular 
store to initiate contraction (Franzini-Armstrong 
et al. 1998). L-type Ca^"^ currents conducted by 
Cayl.2 and Cayl.3 channels initiate secretion in 
endocrine cells (Catterall et al. 2005b). L-type Ca^"^ 
currents conducted by Cay 1.2 and Cay 1.3 channels 
are also prominent in neurons, where they are impor¬ 
tant for integration of synaptic inputs in dendrites, 
regulation of many intracellular events, and initiation 
of excitation-transcription coupling, which regulates 
expression of many neural genes. L-type Ca^"^ cur¬ 
rents conducted by Cay 1.3 and Cay 1.4 channels also 
trigger synaptic transmission in specialized ribbon 
synapses in photoreceptors in the retina and hair 
cells in the cochlea. Ca^"^ entering neurons through 
Cayl channels is particularly effective in activating 
expression of neuronal genes in response to electrical 
activity, in comparison to Ca^"^ entering through 
other permeability pathways. The molecular basis 
for this preferential effect of Cayl channels on 
gene expression is not known, but one current idea 
is that proteins bound to the Cayl channels, or 
even the C-terminus of the Cayl channels itself, 
serves as a Ca^’^-regulated molecular signal to control 
gene expression. 

Ca^"^ entering neurons through Cay2 channels is 
primarily responsible for initiating synaptic transmis¬ 
sion at fast synapses (Dunlap et al. 1995; Tsien et al. 
1995). Cay2.2 channels, which conduct N-type Ca^"^ 
current, are most important at synapses formed by 
neurons of the peripheral nervous system. In contrast, 
Cay2.1 channels, which conduct P/Q-type Ca^"^ cur¬ 
rents, are most important at synapses formed by neu¬ 
rons of the central nervous system. Both Cay2.2 
channels and Cay2.3 channels also have a signihcant 
role in initiating transmitter release at central synapses. 
In some cases, including a subset of inhibitory 




Voltage-Gated Sodium and Calcium Channels 


2751 


V 


interneurons of the hippocampus, Cay2.2 channels are 
predominant in neurotransmitter release. On the other 
hand, Cav2.3 channels seem always to play 
a secondary role at central synapses compared to 
Cav2.1 and/or Cav2.2 channels. The molecular basis 
for the prominent role of Cav2 channels in neurotrans¬ 
mission is not completely understood, but one factor is 
their direct interaction with the neurotransmitter 
release machinery in Ca^"^ channel signaling 
complexes (Catterall and Few 2008). Regulation of 
presynaptic Ca^"^ channels in signaling complexes is 
an important contributor to short-term synaptic 
plasticity of neurotransmitter release and synaptic 
transmission (Catterall and Few 2008). 

In addition to their role in shaping the frequency and 
form of action potentials, Ca^"^ entry through CayS 
channels is an important regulator of secretion of aldo¬ 
sterone in the adrenal cortex. Ca^’^-dependent regula¬ 
tion of Cav3.2 channels by angiotensin II via Ca^"^/ 
calmodulin-dependent protein kinase II exerts physio¬ 
logically important control over the level of aldoste¬ 
rone secretion and therefore of kidney function (Yao 
et al. 2006). The possible role of Cav3 channels in 
other forms of secretion from endocrine cells deserves 
further study. 


Molecular Basis of Sodium and Calcium 
Channel Function 

Classical work by Hodgkin and Huxley defined the 
three key features of ion channels: (1) voltage- 
dependent activation, (2) rapid inactivation, and 
(3) selective ion conductance (Hille 2001). Building 
on this foundation, structure-function studies 
employing molecular, biochemical, structural, and 
electrophysiological techniques have begun to provide 
a clear understanding of the molecular basis of ion 
channel function. The mechanism of activation is sim¬ 
ilar for voltage-gated Na"^, Ca^"^, and channels, 
whereas the molecular mechanisms of ion selectivity 
and inactivation are distinct. 

Activation and pore opening. The voltage depen¬ 
dence of activation of the voltage-gated ion channels 
derives from outward movement of gating charges as 
a consequence of altered membrane electric field, and 
this process is likely to be similar for Nay, Ky, and Cay 
channels (Bezanilla 2000; Catterall 2010b). The S4 
segments of each homologous domain serve as the 


primary voltage sensors for activation (Fig. la). They 
contain repeated motifs of a positively charged amino 
acid residue followed by two hydrophobic residues in 
S4 segments creating a transmembrane spiral of posi¬ 
tive charges. Upon depolarization, outward movement 
and rotation of S4 is thought to initiate 
a conformational change that opens the sodium chan¬ 
nel pore. This “sliding helix” model is supported by 
strong evidence (Catterall 2010b). For example, neu¬ 
tralization of the positively charged residues in S4 
reduces the voltage-dependence of gating. The out¬ 
ward and rotational gating movement of S4 segment 
has been detected directly by reaction of substituted 
cysteine residues in S4 segments with extracellular 
sulfhydryl reagents following channel activation. 
A structural model of the voltage-sensing and pore¬ 
forming modules of sodium channels in the resting and 
activated states is illustrated in Fig. 3. As the S4 seg¬ 
ment moves outward and rotates, it exerts a torque on 
the S4-S5 linker, which in turn pulls on the intracellu¬ 
lar end of S5 and opens the pore. 

Ion selectivity. The selectivity filter of the voltage¬ 
gated ion channels is located at the extracellular end of 
the pore (Yu and Catterall 2004). The narrow outer 
pore is formed by the reentrant P loops between 
transmembrane segments S5 and S6 of each domain 
(Figs. 1 and 3). Mutation of a set of four amino 
acid residues in analogous positions in each domain 
(aspartate in domain I, glutamate in domain II, lysine in 
domain III, and alanine in domain IV, DEKA) 
to glutamates changes sodium-selectivity to calcium- 
selectivity, indicating that the side chains of these 
amino acid residues are likely to interact with sodium 
and calcium ions as they are conducted through the ion 
selectivity filter of the pore (Sather et al. 1994). This 
mechanism differs in a fundamental way from ion selec¬ 
tivity in Ky channels, where backbone carbonyl groups 
of amino acid interact directly with ions. Determining 
the structural basis for ion selectivity in Nay and Cay 
charmels is an important goal in ion channel research. 

Inactivation. Fast inactivation of Nay channels is 
a critical process that ends the Na"^ conductance within 
a few milliseconds of channel opening, which is 
required to generate trains of action potentials in elec¬ 
trically excitable cells (Hille 2001). This process 
involves a conserved inactivation gate formed by the 
intracellular loop connecting domains III and IV, 
which serves as a hinged lid that binds to the 
intracellular end of the pore and blocks it (Fig. 1; 
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a 



Voltage-Gated Sodium and Calcium Channels, 
Fig. 3 Three-dimensional model of sodium channels. 
(a) Models of the closed and open states shown in cylinder 
representation. Only a single voltage-sensing module is shown 
attached to the tetramer of the pore-forming module for clarity. 
Transmembrane segments SI through S6, andP-loop are colored 
by a rainbow scheme from blue to red as in Fig. 1. The S4-S5 
linker colored purple. Approximate positions of the alpha carbon 
atoms of the first and fourth gating-charge-carrying arginines in 
the S4 segment (labeled as R1 and R4 and colored in blue) and 
E226 in the S2 (labeled as El and colored in red) shown in 
sphere representation. All intracellular and extracellular loops, 
except for the S4-S5 linker, are represented by curved lines for 
simplicity. Vertical translation of the R4 between the closed and 



OPEN 


c 



open states along the membrane normal vector and relative to the 
plane of the membrane indicated by arrows. The S3 
represented by ribbon to show clearly the positions of the 
gating-charge-carrying arginines in the S4 segments, 
(b) Model of the fast inactivation mechanism. IFM, Isoleucine 
1488, phenylalanine 1489, and methionine 1490 in the intracel¬ 
lular loop connecting domains III and IV (numbering from 
Navl.2). (c) Three-dimensional structure of the central segment 
of the inactivation gate as determined by multidimensional 
NMR. Isoleucine 1488, phenylalanine 1489, and methionine 
1490 (lEM) are illustrated in yellow. Threonine 1491, which is 
important for inactivation, and serine 1506, which is a site of 
phosphorylation and modulation by protein kinase C, are also 
indicated 


Catterall 2000a). Intracellular perfusion of proteases or 
intracellular application of antibodies directed to this 
loop prevents fast inactivation. The latch of this fast 
inactivation gate is formed by three key hydrophobic 
residues, IFM, and an adjacent threonine (T). Muta¬ 
tions of these amino acid residues destabilize the 
inactivated state, and peptides harboring the IFM 
motif can restore inactivation to sodium channels hav¬ 
ing mutated inactivation gates. The closed inactivation 
gate is thought to make multiple interactions with 
hydrophobic amino acid residues near the intracellular 
mouth of the pore that may constitute the inactivation 
gate receptor. Scanning mutagenesis experiments 
implicate hydrophobic residues in intracellular S4-S5 


loops of domains III and IV, as well as the intracellular 
end of the S6 transmembrane segment in domain IV in 
forming the inactivation gate receptor. The structure of 
the inactivation gate in the intracellular loop 
connecting domains III and IV has been determined 
by NMR analysis of this peptide expressed in bacteria 
(Fig. 3b). This structural analysis reveals a rigid alpha 
helical structure, preceded by two turns that position 
the key hydrophobic sequence motif IFMT such that it 
can interact with and block the inner mouth of the pore. 

In addition to fast inactivation. Nay channels have 
a slow inactivation process that is engaged during 
prolonged depolarizations of hundreds of milliseconds 
to seconds or during long trains of action potentials. 
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Slow inactivation is a separate process with a distinct 
mechanism and structural basis from fast inactivation. 
It is likely that it works by movements of the P loop and 
S6 segments that together close the pore. 

Cav channels have both voltage-dependent and 
calcium-dependent inactivation (Catterall 2000b). 
Voltage-dependent inactivation of Cay channels is 
not yet understood at the molecular level, but it is 
likely that it has a similar molecular basis to slow 
inactivation of Nay channels, which takes place on 
a similar timescale. Ca^'^-dependent inactivation of 
Cayl and Cay2 channels by Ca^"^ entering through 
the pore is faster than voltage-dependent inactivation 
of Cay channels. Entering Ca^"^ binds to calmodulin 
tethered to the C-terminal domain of Cayl and Cay2 
channels, which initiates interactions with an IQ-like 
motif that increases the rate of inactivation. The 
interaction of calmodulin with this regulatory site is 
biphasic in Cay2.1 channels, leading to facilitation of 
channel activity followed by inactivation (Catterall 
and Few 2008). Ca^'^-dependent inactivation of 
Cay 1.2 channels is an important regulator of 
excitation-contraction coupling in the heart, and facil¬ 
itation and inactivation of presynaptic Cay2.1 channels 
via this mechanism contributes to short-term synaptic 
plasticity at synapses (Catterall and Few 2008). 

Signaling complexes of sodium and calcium chan¬ 
nels. Although voltage-dependent activation, ion con¬ 
ductance and selectivity, and voltage-dependent 
inactivation of ion channels are intrinsic and autono¬ 
mous, these processes are subject to tight regulation by 
interacting proteins that form ion channel signaling 
complexes. Such signaling complexes confer local 
regulatory properties on ion channels and contribute 
greatly to the diversity of function of Nay and Cay 
channels (Catterall 2010a). Nay channels in brain neu¬ 
rons are regulated by a signaling complex with cAMP- 
dependent protein kinase and A Kinase Anchoring 
Protein-15 (AKAP15), which enhances slow inactiva¬ 
tion in response to dopamine and other neurotransmit¬ 
ters and reduces the frequency and duration of action 
potential firing during trains. Cay 1.2 channels in the 
heart are also regulated by a signaling complex with 
cAMP-dependent protein kinase and AKAP15, which 
increases their activity in response to beta-adrenergic 
stimulation in the fight-or-flight response. As noted 
above, regulation of Cay2.1 channels by a signaling 
complex with calmodulin and related Ca^"^ sensor pro¬ 
teins contributes to short-term synaptic plasticity. This 


presynaptic Cay channel signaling complex also con¬ 
tains SNARE proteins and many additional interacting 
proteins that are important for localization, tethering, 
anterograde signaling, and retrograde regulation of 
presynaptic Cay channels (Catterall 2010a). 

Molecular Pharmacology of Sodium and 
Calcium Channels 

Sodium channels are the molecular targets for drugs 
used in prevention of acute pain and in treatment of 
cardiac arrhythmias, epilepsy, and bipolar disorder 
(Catterall 2000a; Catterall et al. 2005a). Sodium chan¬ 
nel-blocking drugs that are selective for the sensory 
neuron Nay channels. Nay 1.7 and Nay 1.8, are 
also being developed for treatment of chronic pain 
(Dib-Hajj et al. 2007). Local anesthetics bind to 
a specific receptor site within the pore of sodium 
channels, formed by the S6 segments in domains I, II, 
and IV (Fig. 4a; Catterall 2000a). Their binding blocks 
ion movement through the pore and stabilizes the 
inactivated state of sodium channels. Antiarrhythmic 
drugs and antiepileptic drugs share similar, 
overlapping receptor sites. Complete block of sodium 
channels would be lethal. However, these drugs selec¬ 
tively block sodium channels in depolarized and/or 
rapidly firing cells, such as axons carrying high- 
intensity pain information and rapidly firing nerve 
and muscle cells that drive epileptic seizures or cardiac 
arrhythmias. This selective block arises because the 
drugs can reach their binding site in the pore of the 
sodium channel more rapidly when the pore is repeti¬ 
tively opened and they bind with high affinity to 
inactivated sodium channels that are generated in rap¬ 
idly bring or depolarized cells (Hille 2001). This use- 
dependent action of the sodium channel blocking drugs 
is essential for their therapeutic efficacy (Table 2). 

In addition to these important pharmaceutical 
agents, sodium channels are the molecular targets for 
a large number of neurotoxins that paralyze prey by 
preventing neuromuscular function (Table 2; Cestele 
and Catterall 2000). These toxins act at five or more 
distinct receptor sites and either block the pore of the 
channel or alter the kinetics or voltage dependence of 
gating. The pore blockers tetrodotoxin and saxitoxin 
bind to neurotoxin receptor site 1, which is formed by 
the P loops in the four domains. Alkaloids and other 
organic toxins from plants, frogs, and other species 
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Voltage-Gated Sodium and Calcium Channels, 
Fig. 4 Receptor sites for No* and Ca^* channel-blocking 
drugs, (a) Model of etidocaine binding to the local anesthetic 
receptor site formed by transmembrane segments IS6, IIIS6, and 
IVS6 of the Navl.2 channel (Catterall 2000a). Only transmem¬ 
brane segments IS6 (red), IIIS6 (green), and IVS6 (blue) are 
shown. Residues important for etidocaine binding are shown in 
space-filling representation, (b) The phenylalkylamine receptor 


site on Cavl.2 channels. Binding of desmethoxyverapamil 
(D888) in the half-folded conformation inside of the pore and 
the interface crevice between domains III and IV is illustrated. 
The aromatic rings on the ends of D888 interact with the side 
chains of Tyr-1463 and Ile-1470, correspondingly. The central 
amine interacts directly with the carhoxylate of the side chain of 
Glu-1118 of the selectivity filter, thereby blocking permeation 
(Lipkind and Fozzard 2003) 



bind at neurotoxin receptor site 2 and cause persistent 
activation of sodium channels, which depolarizes 
nerves and muscles and causes repetitive firing and 
paralysis. Polypeptide toxins from scorpions and spi¬ 
ders bind at neurotoxin receptors sites 3 and 4 and 
either slow the inactivation or enhance the activation 
of sodium channels, which also causes repetitive firing 
and paralysis. Brevetoxins and ciguatoxins, which are 
large polyether compounds made by marine dinoflag- 
gellates, bind to neurotoxin receptor site 5 and cause 
persistent activation of sodium channels. The diversity 
of organisms that have developed toxins that target 
sodium channels is remarkable, highlighting the phys¬ 
iological importance of these channels in electrical 
signaling and neuromuscular function. 

The pharmacology of the three subfamilies of Ca^"^ 
channels is quite different, and relates directly to their 
distinct physiological roles (Hockerman et al. 1997; 
Hofmann et al. 1999; Striessnig 1999). The Cayl 
channels are the molecular targets of the organic Ca^"^ 
channel blockers used widely in the therapy of 
cardiovascular diseases. These drugs are thought to act 
at three separate, but allosterically coupled, receptor sites 
(Table 1). Phenylalkylamines are intracellular pore 
blockers, which are thought to enter the pore from the 
cytoplasmic side of the channel and block it (Fig. 4b). 


Voltage-Gated Sodium and Calcium Channels, 
Table 2 Receptor sites on sodium channels 


Receptor site 

Toxin or drug 

Domains 

Neurotoxin 

Tetrodotoxin 

ISS2-S6, IISS2-S6 

receptor site 1 

Saxitoxin 

IP, IIP, HIP, IVP 


p-conotoxin 


Neurotoxin 

Veratridine 

IS6, IVS6 

receptor site 2 

Batrachotoxin 



Grayanotoxin 


Neurotoxin 

a-scorpion toxins 

IS5-IS6, IVS3-S4 

receptor site 3 

Sea anemone toxins 

IVS3-S4 

Neurotoxin 
receptor site 4 

P-scorpion toxins 

IIS1-S2, IIS3-S4 

Neurotoxin 

Brevetoxins 

IS6, IVS5 

receptor site 5 

Ciguatoxins 


Neurotoxin 
receptor site 6 

5-conotoxins 

IVS3-S4 

Local anesthetic 

Local anesthetic drugs 

IS6, IIIS6, IVS6 

receptor site 

Antiarrhythmic drugs 



Antiepileptic drugs 



Their receptor site is formed by amino acid residues in 
the S6 segments in domains 111 and IV, in close anal¬ 
ogy to the local anesthetic receptor site on sodium 
channels (Hockerman et al. 1997). Dihydropyridines 
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can be channel activators or inhibitors, and therefore 
are thought to act allosterically to shift the channel 
toward the open or closed state, rather than by occlud¬ 
ing the pore. Their receptor site includes amino acid 
residues in the S6 segments of domains III and IV and 
the S5 segment of domain III. The dihydropyridine 
receptor site is closely apposed to the 
phenylalkylamine receptor site and shares some com¬ 
mon amino acid residues. Diltiazem and related 
benzothiazepines are thought to bind to a third recep¬ 
tor site, but the amino acid residues that are required 
for their binding overlap extensively with those 
required for phenylalkylamine binding. 

The Ca.y2 subfamily of channels is relatively insen¬ 
sitive to dihydropyridines and other organic Ca^"^ 
channel blockers, but these Ca^"^ channels are 
specifically blocked with high affinity by peptide 
toxins from spiders and marine snails (Table 2; Olivera 
et al. 1994). The Cav2.1 channels are blocked 
specifically by co-agatoxin IVA from funnel web spider 
venom. The Cav2.2 channels are blocked specifically 
by (B-conotoxin GVIA and related cone snail toxins. 
The Cav2.3 channels are blocked specifically by the 
synthetic peptide toxin SNX-482 derived from 
tarantula venom. These peptide toxins are potent 
blockers of synaptic transmission because of their 
specihc effects on the Cav2 family of Ca^"^ channels. 
Block of Cav2.2 channels by a synthetic analog of 
a conotoxin, ziconotide, provides relief from chronic 
pain when infused into the spinal column in animal 
models and in man, and a new generation of 
small molecule organic Ca^"^ channel blockers with 
specihcity for Cav2.2 channels is being evaluated in 
clinical trials. 

The Cav3 subfamily of Ca^"^ channels are insensi¬ 
tive to both the dihydropyridines that block Cayl 
channels and the spider and cone snail toxins that 
block the Cav2 channels, and there are no widely 
used pharmacological agents that block T-type Ca^"^ 
currents specifically with high affinity (Catterall et al. 
2005b). The organic Ca^"^ channel blocker mibefradil 
is somewhat selective for T-type versus L-type Ca^"^ 
currents (three- to fivefold). High concentrafions of 
ethosuximide block T-fype Ca^"^ currents specifically, 
and this effect of the drug may be responsible for its 
effectiveness in treatment of absence seizures. The 
peptide kurtoxin inhibits the activation gating of 
Cav3.1 and Cav3.2 channels. Development of more 


specific and high-affinity blockers of the Cav3 subfam¬ 
ily of Ca^"^ channels would be useful for more detailed 
analysis of their physiological roles and potentially for 
therapy of pain, epilepsy, sleep disorders, and cardiac 
arrhythmias. 
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Water Vapor 

Worm-Like Chain (WLC) Model 

► Infrared Spectroscopy: Data Analysis 

Joshua N. Milstein and Jens-Christian Meiners 
Department of Physics and Department of Biophysics, 
University of Michigan, Ann Arbor, Ml, USA 

Water:Plastoquinone Oxidoreductase 

Synonyms 

► Photosystem II 

DNA; Kratky-Porod worm-like chain model; 
Persistent chain model; Single-molecule methods; 
Worm-like chain model 

Weakly Polar Interaction 

Definition 

► XH/tt Interactions 

Worm-Like Chain Model (WLC) is a continuous model 
used to characterize the behavior of a semi-flexible 
polymer. 

Wide and Small Angle X-Ray Scattering 

Introduction and Background 

► Small Angle X-ray Scattering 

► X-ray Fiber Diffraction 

Many biological molecules have an intrinsic elasticity 
that causes them to remain rigid at small length scales 
yet display significant flexibility over longer lengths. 
One of the simplest theories able to account for this 
characteristic behavior - and that has been successful 

Wide-Field Optical Fluorescence 
Microscopy 

in describing the elastic properties of a variety of bio¬ 
molecules, such as ssDNA, dsDNA, RNA and poly¬ 
peptide chains - is the worm-like chain (WLC) model 

► Optical Fluorescence Microscopy 

of a semiflexible polymer (Kratky and Porod 1949). 


G.C.K. Roberts (ed.), Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
© European Biophysical Societies’ Association (EBSA) 2013 
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Worm-Like Chain (WLC) Model 


Derivation of the WLC Model 


The WLC model assumes that the polymer is 
inextensible, has a linear elastic bending energy and 
is subjected to thermal fluctuations. A generic polymer 
may be modeled as a chain consisting of N segments rj, 
each of length |f/| = £, such that the total contour 
length is L = N£ (see Fig. la). By allowing all seg¬ 
ments to freely rotate about their vertices, this model 
already yields the freely-joined chain (FJC) model of 
a polymer. However, to account for intrinsic stiffness 
requires the addition of an energy cost associated with 
bending the polymer. A simple model for the bending 
energy can be written as follows: 

N-l 

H =■ n, (1) 

/-I 

which imposes an energy cost of e£^ times the cosine of 
the angle between neighboring segments. The right- 
hand side of (1) can be rewritten using the relation 

• r, =- —— ■ Moving from a discrete model 

to a continuous model requires taking the limits 
A —> oo and £ ^ 0. Likewise: 


lim 

e^o 



dT{s) 
ds ’ 


( 2 ) 


where t{s) is a tangent vector at location along the 
contour of the polymer (see Fig. lb). Finally, 
converting the sum to an integral yields: 


H 



(3) 


which is the Hamiltonian for the WLC model assuming 
that the contour length L is very large (Doi and 
Edwards 1988). Note the introduction of the persis¬ 
tence length ^ The persistence length serves as 

a characteristic length scale over which two tangent 
vectors along the polymer remain correlated, i.e.: 

(F(s) • t{s')) = (’’V*. (4) 

The WLC describes a polymer that is stiff over 
short distances and flexible at longer ones, with the 
persistence length setting the length scale of this 
transition in characteristic behavior. Moreover, while 



Worm-Like Chain (WLC) Model, Fig. 1 (a) Discrete model of 
a polymer composed of a chain of segments i) each of length £. 
(b) Continuous WLC polymer parameterized by the tangent 
vector f{s) = along the contour 

the WLC clearly excludes long-range interactions 
between distant segments, short-range interactions 
are assembled into the persistence length. For 
instance, the canonical value for the persistence 
length of dsDNA near physiological ionic concentra¬ 
tions is i^dsDNA = 50 nm, while for ssDNA it is 
(^dsDNA = L3 nm, which is consistent since dsDNA 
should be much stiffer than ssDNA due to the hydro¬ 
gen bonding between base pairs. 


Mean Square End-to-End Distance and 
Radius-of-Gyration 

The mean squared end-to-end distance and the radius- 
of-gyration are useful quantities to calculate as 
they both serve as a measure of the size of the poly¬ 
mer. They can both be obtained starting from 
the definition of the end-to-end distance, which is 
given by integrating a tangent along the length of the 
polymer, i.e.: 


~R = f f{s)ds. (5) 

Jo 

The mean squared end-to-end distance may now be 
written as: 


(R^) = f ds f ds'{T{s) ■ T{s')). (6) 

Jo Jo 

Substituting (4) into the right-hand side of (6) and 
performing the integrals yields: 

{R^) = 2e(^-l + efy (7) 
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Equation (7) can be related to the radius-of-gyration 
by the usual relationship for an entropically governed 
polymer displaying random walk behavior 

(r^\ 

(Rg^) = ^6 ■ When the contour length is much longer 
than the persistence length, L ^ (7) reduces to 

= 2L^ and the mean square radius-of-gyration 
is (Rg^) = y. It is interesting to calculate the radius- 
of-gyration for a bacterial chromosome, such as that of 
E. coli. With over 4.6 million base pairs to its DNA, the 
E. coli genome would have a radius-of-gyration of 
around 5 pm, which is more than double the size of 
the longest dimension of the cell (^2 pm) and implies 
that the bacterium must use additional methods to 
package its DNA. 


WLC Force Extension Relationship 


An inextensible polymer driven by thermal fluctua¬ 
tions will tend to cluster in order to maximize its 
entropy. This behavior will result in an entropic force 
that is resistant to attempts to stretch the polymer to its 
full contour length (see Fig. 2). The WLC model can be 
extended to incorporate a force along the z direction as 
follows: 


H 



^ / dT{s) 
2 \ & 


2 


£_ 

ksT 


?-T{s) 


ds. 


( 8 ) 


Equation (8) can be numerically solved for the force 
extension relationship; however, an interpolation for¬ 
mula that is commonly employed is given by (Marko 
and Siggia 1995): 


/ 


keT ( 1 1 Z\ 

C Ul-ff 4 l )' 


(9) 


where z is the end-to-end extension. At low force, the 
WLC model yields the standard Hookean linear rela¬ 
tion, but as the polymer begins to extend to its full 
contour length, the extension deviates from the linear 
behavior. The difference between the end-to-end 
extension and the total contour length approach zero 
as 

The first DNA stretching experiments, which were 
performed in the mid-1990s by applying a small force 



Worm-Like Chain (WLC) Modei, Fig. 2 Force extension 
curve for k-DNA fit with the WLC model. The insert shows 
the optical trap setup used to perform this measurement. Two 
dielectric microspheres, tethered to opposite ends of a strand of 
DNA, were confined by dual optical tweezers. The microspheres 
were then moved apart to stretch the DNA 


to a magnetic microsphere conjugated to a single 
dsDNA molecule, showed that the intrinsic elasticity 
of the DNA had to be accounted for as the simpler FJC 
model failed to reproduce the force extension data 
(Smith et al. 1992; Bustamante et al. 1994). The FJC 
model, which neglects all interactions between mono¬ 
mers, always displays a linear force extension and so 
is only valid for DNA at very weak stretching forces. 
In fact, the WLC results of (8) and (9) have been so 
successful at accounting for the elasticity of long 
stretches of dsDNA that they are now often used to 
calibrate the strength of magnetic and optical twee¬ 
zers (Bustamante et al. 2003; Neuman and Block 
2004). At forces of around 60 pN, the DNA extends 
to close to its full contour length and additional 
force results in an “overstretch transition.” At this 
point, it is necessary to extend the WLC model since 
the generic WLC model assumes an inextensible 
polymer. 


Cross-References 


W 


► Infrared Spectroscopy of Protein Dynamics: 
Ultrafast Kinetics 

► Magnetic Tweezers 

► Optical Tweezers 

► Thermodynamics of Biomolecular Interactions 
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Mechanisms 


XH/tt Interactions 

Michael Plevin 

Laboratoire de RMN, Institut de Biologie Structurale, 
Grenoble, France 

Synonyms 

Noncanonical hydrogen bonds; Weakly polar interaction 

Definition 

XH/ti interactions are a type of weak non-covalent 
bond in which the acceptor is a chemical group 
containing a delocalized system of electrons, for exam¬ 
ple, aromatic rings, peptide bonds, etc. The donor 
group consists of a hydrogen atom covalently bonded 
to a heavy atom (e.g., oxygen, nitrogen, sulfur, carbon, 
etc.) and therefore, XFI/ti interactions are frequently 
described as weak hydrogen bonds. The interaction 
energy is dependent on the polarity of the donor 
group: Ab initio studies have estimated that more 
polar donor groups (e.g., 0-Fl or N-H) form stronger 
interactions (up to 4 kcal mol”*), whereas more apolar 
donor groups (e.g., C-H) form weaker interactions 
(1 kcal mol”*). These bond energies are comparable 


to those of canonical hydrogen bonds, which are in the 
range of 1-10 kcal mol”*. Computational approaches 
have also been used to explore the affect of donor and 
acceptor group geometry on XH/tt interaction ener¬ 
gies. These studies have allowed several geometric 
definitions of XFI/tt interactions to be proposed. 

XH/tt interactions are commonly found in nature 
and they contribute on many levels to the function and 
tertiary and quaternary structure of proteins and 
nucleic acids. Examples of weak polar interactions 
can be readily identified in the 3D structures of 
biomolecules by searching coordinate files for the 
characteristic arrangement of the donor and acceptor 
groups. Analysis of protein sequence and structure 
within protein fold families has shown that XFI/ti inter¬ 
actions are often conserved structural motifs that can 
feature in functionally important regions of proteins. 

Experimentally analysis of the functional or struc¬ 
tural role of an individual XH/ti interaction in 
a complex molecule such as a protein or nucleic acid 
can be challenging. NMR spectroscopy has proven 
a useful tool for analyzing XH/ti interactions in their 
biological contexts, either by monitoring the perturba¬ 
tion of the donor proton signal caused by an aromatic 
acceptor (the so-called ring current shift), or by using 
2D heteronuclear NMR experiments to directly trans¬ 
fer magnetization via the weak scalar couplings that 
exist between the participating nuclei. 

Cross-References 

► J Coupling 

► Macromolecular Crystallography: Overview 

► Protein NMR - Introduction 
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X-Ray Absorption Fine Structure (XAFS) 

► X-Ray Absorption Spectroscopy of Metals in Biology 

X-Ray Absorption Near-Edge Structure 
(XANES) 

► X-Ray Absorption Spectroscopy of Metals in Biology 

X-Ray Absorption Spectroscopy of 
Metals in Biology 

Graham N. George and Ingrid J. Pickering 
Department of Geological Sciences, University of 
Saskatchewan, Saskatoon, SK, Canada 

Synonyms 

Extended X-ray absorption fine structure (EXAFS); 
Near-edge X-ray absorption fine structure (NEXAFS); 
X-ray absorption fine sfructure (XAFS); X-ray absorp- 
fion near-edge sfructure (XANES) 

Definition 

X-ray absorption spectroscopy provides information 
on the local physical and electronic structure of 
a metal in any biological system. 

Introduction 

Transition metal ions are found in approximately 30% 
of proteins and are key components of the catalytic 
active sites of metalloenzymes. Metalloenzymes cata¬ 
lyze many of the most industrially relevant and biolog¬ 
ically important chemical reactions, and the detailed 
structure of the metal site is generally key to their 
function. X-ray absorption spectroscopy (XAS) pro¬ 
vides a powerful probe of local molecular and elec¬ 
tronic structure that has been applied to metals in 
biology. Originating in the early 1970s, it has been 
essential in the development of our understanding of 


the metalloprotein active site structure. Increasingly, it 
is also applied to investigate metals in complex bio¬ 
logical systems including intact cells and tissues. 

X-Ray Absorption Spectroscopy 

X-ray absorption spectroscopy (XAS) arises from 
core-level excitation by absorption of X-rays (Fig. 1) 
and is thus associated with an absorption edge. In an 
XAS experiment. X-ray absorption is measured while 
sweeping the energy of a monochromatic incident 
X-ray beam. Almost all XAS are carried out using 
synchrotron light, and while some work on concen¬ 
trated samples can be done using conventional sources, 
this is not practical for biological samples. XAS is 
usually separated into two different regions: the 
extended X-ray absorption fine strucfure or EXAFS, 
which occurs at energies higher than the absorption 
edge, and the near-edge region, often called the 
XANES (X-ray absorption near-edge structure) 
which consists of features before the major inflection, 
and any after the inflection which are not part of the 
analyzable EXAFS (Fig. 1). XAS is element specific in 
thaf the absorption edges are well defined in energy 
and there is no interference from chemically similar 
elements. It can be used to investigate solids, liquids 
(including solutions), gaseous materials, and any mix¬ 
tures thereof. It probes all of an element within 
a sample with moderate sensitivity and is applicable 
to a very wide range of elements. XAS theory is well 
developed, and the methods of analysis of the EXAFS 
part of the spectrum to give a local radial structure are 
also well established. XAS is applied in many scien¬ 
tific disciplines, with a large number of applications in 
materials science, industrial catalysis research, medi¬ 
cine, the environmental sciences, and in structural 
molecular biology. The content included here is not 
intended to be a comprehensive treatment across all 
fields but instead seeks to illustrate the capabilities of 
the method relevant to Biophysics. 

X-Ray Absorption Near-Edge Spectroscopy 

XAS provides a probe both of electronic and of phys¬ 
ical structure. The features observed in near-edge spec¬ 
tra arise from predominantly dipole-allowed (i.e., 
A/ = ±1) transitions to bound state molecular orbitals; 
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a Emitted photo-electron 
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X-Ray Absorption Spectroscopy of Metals in Biology, 
Fig. 1 (a) Schematic diagram of the underlying physics of 
X-ray fluorescence. The incident photon causes ejection of 
a core electron resulting in a core hole which is filled by 

analysis of near-edge spectra can thus yield informa¬ 
tion on electronic structure including oxidation state 
and, in some cases, geometry. Generally, the near-edge 
spectrum shifts to higher energies with increasing 
metal oxidation state, although equally large changes 
can he observed from changes in the ligand environ¬ 
ment. In general, increased coordination from elec¬ 
tron-rich ligands such as sulfur shifts the near-edge to 
lower energy relative to ligands such as oxygen which 
would shift to higher energies (Fig. 2). For transition 
metal ion, K-edges core-hole lifetime broadening can 
be limiting, whereas for K-edges of ligand atoms such 
as sulfur, the much longer core-hole lifetime translates 
to sharper near-edge spectra with a correspondingly 
richer spectroscopy. Relative orbital energies mea¬ 
sured by separation of features in the near-edge spectra 
correspond to excited states, and the so-called Z -i- 1 
approximation states that, because of the presence of 
a core hole, the various transition energies correspond 
more closely to those of the element in the next group 
(i.e., atomic number increased by one, e.g., Co in the 
case of Fe). For first transition ions, one of the most 
informative features of the near-edge spectrum is 
sometimes the smallest - the formally dipole- 
forbidden and quadrupole-allowed Is ^ 3d transition 
(A/ = 2) has been productively used in many cases. 
While methods for analysis of near-edge spectra are 
increasingly common, the most common and the most 
productive method of interpretation of near-edge spec¬ 
tra is by simple “fingerprint” comparison of the spec¬ 
trum of an unknown with small-molecule coordination 
compounds of known structure. 


dipole-allowed decay of another electron with concomitant 
emission of X-ray fluorescence, (b) Regions of the X-ray absorp¬ 
tion spectrum, showing both the near-edge and the EXAFS 



Energy (eV) 

X-Ray Absorption Spectroscopy of Metals in Biology, 
Fig. 2 Iron K-edge X-ray absorption near-edge spectra showing 
the sensitivity to ligand type and to formal metal oxidation state. 
In all cases, the iron is approximately tetrahedrally coordinated 


Extended X-Ray Absorption Fine Structure 
Spectroscopy 

At incident X-ray energies E above an absorption edge 
£o, a core electron will be excited and leave the atom as 
a photoelectron wave with a de Broglie wavelength 
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and a kinetic energy of {E — Eq) = nieV^ jl where is 
the electron rest mass, and v the electron velocity. The 
extended X-ray absorption fine structure or EXAFS are 
generally plotted as a function of photoelectron wave- 
vector k which, using Ti as the reduced Planck’s con¬ 
stant, we can define as 

k = In/Xe = {E-^) 

EXAFS has its roots in the scattering of the emitted 
photoelectron wave by neighboring atoms. Because 
the initial state wave function (Is for a K-edge) is 
very compact and centered at the absorber atom 
nucleus, the absorption depends to a good approxima¬ 
tion upon the final state wave function at the nucleus of 
the absorber atom. The X-ray absorption coefficient 
will be at a maximum when the backscattered wave is 
in phase with the outgoing wave at the nucleus of the 
absorber and at a minimum when it is out of phase. 
This can be conceptualized as being due to construc¬ 
tive and destructive interference, respectively. As the 
X-ray energy increases above the absorption edge, the 
de Broglie wavelength decreases and the X-ray absorp¬ 
tion will go through successive minima and maxima 
yielding an oscillatory modulation of the absorption. 
These oscillations are known as the EXAFS. The fre¬ 
quency of the EXAFS depends upon the atomic sepa¬ 
ration, while the EXAFS amplitude, which is 
a reflection of the effectiveness of the backscatterer 
atoms, depends upon the size (atomic number) and the 
number of backscatterer atoms. Because the 
backscatterer atom modulates the phase of the photo¬ 
electron, the phase of the EXAFS will also depend 
upon the nature of the backscatterer. Thus, both 
phase and amplitude of the EXAFS depend on the 
backscatterer, and this gives some information on 
which element is backscattering. In general, it is not 
possible to discriminate atoms with similar atomic 
number; for example, Cu-S and Cu-Cl give nearly 
identical EXAFS, while that from Cu-S and Cu-0 is 
quite different. 

The EXAFS is described by the simple 
equation which is summed over all EXAFS scattering 
paths i: 

sin[2^/?,. -f 


Here A, is the number of backscatterers of type i for 
single scattering (equal to the path degeneracy with 
multiple scattering), 7?, is the absorber-backscatterer 
distance, A,(^) the total amplitude function, cr? is the 
mean-square deviation inf?,, Xi{k) is the photoelectron 
mean free path, and the total phase function. The 
EXAFS spectrum is extracted from the raw XAS spec¬ 
tra by what are now standard techniques. It is common 
practice to weight the EXAFS by k^ in order to coun¬ 
teract the damping of the spectra due to thermal vibra¬ 
tions and from the falloff in scattering amplitude. In 
addition, the theory used to analyze the spectra is more 
accurate at higher k, increasing approximately as k^, 
and the k weighting thus emphasizes the part of the 
spectrum that can be most accurately analyzed. 
EXAFS spectra are usually Fourier transformed in 
order to visualize the frequency contributions to the 
EXAFS corresponding to individual interactions as 
peaks verses R. A simple Fourier transform will 
give asymmetric peaks shifted from the interatomic 
distances by half the average phase-shift slope, and 
depending on the complexity of ip{k), multiple peaks 
can result. To simplify the transform, it is common 
practice to apply phase correction using a theoretical 
(p{k) for the most dominant backscatterer which gives 
more symmetrical peaks at values close to the actual 
absorber-backscatterer distances. 

In most cases, quantitative information is obtained 
by analyzing the EXAFS oscillations using a theoret¬ 
ical model. Some software packages perform the anal¬ 
ysis in k space while with others prefer the strategy of 
fitting transformed data in R space. Typically, ab initio 
theory is used to compute values for the total phase and 
amplitude functions and then values for A, R, and are 
refined (Fig. 3). A major strength of EXAFS is the 
accuracy with which it can determine R, the average 
interatomic distance for a given type; for directly coor¬ 
dinated ligands, this is approximately ±0.02 A. The 
precision is generally an order of magnitude better 
than this, so that relative changes in similar species can 
be very well defined. The EXAFS resolution, the mini¬ 
mum separation at which similar ligands can be discrim¬ 
inated with individual /?, , is directly related to the extent 
of the data and is approximately given by AR « n/2k. 
For typical data ranges, this separation AR is rather poor 
at around 0.15 A, and this is one major limitation of 
EXAFS analysis. In some cases, this can be improved by 
extending the range of the data, but for first transition 
metal ions, often, this is not practical as the K-edges of 
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X-Ray Absorption Spectroscopy of Metals in Biology, 
Fig. 3 Iron K-edge extended X-ray absorption fine structure 
(EXAFS) and EXAFS Fourier transform for a [Fe 4 S 4 ] ferre- 
doxin. Experimental data are shown as solid lines, while the 
results of EXAFS curve-fitting analysis are shown as broken 
lines. The EXAFS oscillations are shown in A, and the 


corresponding EXAFS Fourier transforms, phase-coiTected for 
Fe-S backscattering are shown in B. The fit shows four Fe-S at 
2.28 A (a^ = 0.0027 A^) and three Fe-Fe at 2.72 
(a^ = 0.0038 A^). The inset to B shows the crystal structure of 
the [Fe 4 S 4 ] cluster 


transition metals with higher atomic number truncate 
the ^-range of the data. Thus, Cu K-edge measurements 
(edge energy 8,980.3 eV) are typically truncated to ^ « 
13 by traces of zinc (Zn K-edge 9,654 eV) present 
in samples or equipment. Unresolved bonds to similar or 
the same atom type manifest as an increase in the effec¬ 
tive value for which can be written as the sum of static 
and vibrational components, + ^vibrational- 

Unresolved differences in bond length result in an 
increased static component, while the vibrational com¬ 
ponent depends upon the chemical bonding. 

Credible EXAFS analyses must employ chemically 
reasonable bounds for all values of N, R, and a. When 
multiple bonds of a similar type are present, an upper 
bound for is not trivial to dehne, because of uncer¬ 
tainties related to however, the use of overly 

large values, greater than about 0.01 A^, will effec¬ 
tively remove contributions from the EXAFS by 
damping all but the lowest k oscillations. Minimum 
values for (T^are simple to define as they cannot be less 
than which can be calculated. For example, 

for EXAFS of molybdenum enzymes, minimum 
values of 0.0020 A^ and 0.0015 A^ have been esti¬ 
mated for Mo-S and Mo = O bonds, respectively, at 
a sample temperature of 10 K (George et al. 1999). 
A zero or negative value is physically unreasonable. 
Typically determined interatomic distances and coor¬ 
dination numbers can be verified by comparison with 
accurately known structures in databases such as the 
Cambridge Crystallographic Database. Additionally, 


coordination numbers determined from EXAFS anal¬ 
ysis should be consistent with geometry information 
from the near-edge spectrum. 

Experimental Aspects of Biological X-Ray 
Absorption Spectroscopy 

The major components of the biological X-ray absorp¬ 
tion spectroscopy experiment are shown in Fig. 4. 
Metals and other heteroatoms in biological samples 
tend to be very dilute and therefore require the use of 
X-ray fluorescence for their detection. The most com¬ 
monly used detector systems are solid-state detector 
arrays which are sensitive to saturation at high fluores¬ 
cence intensities, although this can be relieved by pref¬ 
erential rejection of scattered X-rays using X-ray 
fluorescence filters and Seller slits (Cramer et al. 
1988). The use of a cryostat is generally considered to 
be essential not only to preserve the chemical integrity 
and anaerobicity of biological samples and to minimize 
X-ray photodamage to the sample but also to minimize 
the thermal molecular vibrations thereby increasing the 
EXAFS amplitudes at higher k. Liquid helium cryostats 
operating at 4-15 K give superior results to liquid nitro¬ 
gen devices. The intensity of incident and transmitted X- 
rays are generally measured using gas ionization cham¬ 
bers. The X-ray energy is calibrated by using a foil of the 
metal under investigation and standard values for the 
lowest energy inflection. 
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X-Ray Absorption 
Spectroscopy of Metals in 
Biology, Fig. 4 Schematic of 
experimental setup for X-ray 
absoiption spectroscopy (plan 
view). The flight path of the 
monochromatic X-ray beam 
enters through the gas 
ionization detector 


cryostat 



X-Ray Absorption Spectroscopy of 
Metalloproteins 

X-ray absorption spectroscopy was applied to 
metalloproteins very soon after the development of 
the technique. Early workers sought to exploit the 
high accuracy of EXAFS-determined bond lengths to 
provide additional structural information on crystallo- 
graphically characterized samples, since errors in 
determined metal-ligand hond-lengths from EXAFS 
are less than 0.02 A, while those from protein crystal¬ 
lography are closer to 0.1 A. The hrst reported EXAFS 
study of a biological molecule was of rubredoxin 
which showed that there were errors in the bond 
lengths determined by early crystallography; subse¬ 
quent crystallographic analyses confirmed the conclu¬ 
sions from EXAFS (e.g., see review hy George and 
Pickering 2007). While the role of XAS in providing 
ancillary information to crystallography is increas¬ 
ingly important, a large number of metalloprotein 
EXAFS applications comprise studies of systems for 
which there is little or no structural information. The 
enzyme nitrogenase provides an excellent early exam¬ 
ple. This enzyme is the key to the global nitrogen cycle 
and catalyzes the six-electron reduction of the very 
stable N 2 molecule to ammonia. Early EXAFS defini¬ 
tively showed that the active site of this important 
protein was a novel molyhdenum-iron-sulfur cluster 
(Cramer et al. 1978), at the time without chemical 
precedent. Later, crystal structures revealed a novel 
elliptically shaped molybdenum-iron-sulfur cluster 
with the molybdenum positioned at one end (Einsle 
et al. 2002). XAS has made major contributions to our 
understanding of metalloprotein active sites, in many 
cases complementing information available from crys¬ 
tallography. The radiation dose required for 
a diffraction experiment is generally much higher 


than that required for an XAS measurement (George 
et al. 1999). Thus, while photoreduction of metal sites 
in XAS is still a problem, it is less so than with crys¬ 
tallography. Moreover, the sensitivity of the near-edge 
portion of the spectrum to changes in oxidation states 
means that experimenters can monitor successive 
scans for changes, thereby detecting photoreduction 
or photooxidation if these do occur. 

The major strength of XAS is that it can be used to 
provide information on active site structure in solu¬ 
tion, and without the need for crystalline material, and 
it is therefore particularly suitable for study of sys¬ 
tems for which structural information is available but 
not on the metal binding site. An interesting example 
of this comes from the copper metallochaperones 
(Pushie et al. 2012), which are proteins that can he 
thought of as molecular taxicahs; they carry metal 
ions with great specificity within the cell to a 
particular destination molecule. They must hind the 
metal ion with great tenacity but must also readily 
give it up upon arrival at the destination. Copper 
metallochaperones have been extensively studied by 
X-ray crystallography, but despite considerable 
effort, crystals containing the native metal ion are 
almost impossible to make, perhaps because the pro¬ 
cess of crystallization triggers chaperones to give up 
bound metal. Thus, techniques such as XAS are the 
primary tool for understanding the metal ion coordi¬ 
nation in these cases. Particularly elegant has been the 
combination of XAS and nuclear magnetic resonance 
spectroscopy to build up a detailed picture of the 
solution structure of a copper metallochaperone 
(Arnesano et al. 2003). 

A recent and highly promising application of XAS 
is in the study of metals and metalloids in intact tissues 
(George et al. 2008; George and Pickering 2007). 
These include toxic, therapeutic, and functional 
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metals. Advances in synchrotron beamline and X-ray 
fluorescence detector technology are both key to this 
new application of the method, because the physiolog¬ 
ical levels of metals and metalloids are generally quite 
low. Because of this, applications typically use the 
near-edge region of the spectrum as this can be 
obtained at adequate signal to noise with lower con¬ 
centrations than can the EXAFS. Techniques for using 
the near-edge spectrum to speciate mixtures have been 
developed (George and Pickering 2007). 

Summary 

X-ray absorption spectroscopy is a proven technique 
that can give detailed structural information on metals 
in biological systems. The method has been success¬ 
fully applied both to purified metalloproteins and to 
intact tissues. 

Cross-References 

► X-Ray Fluorescence Imaging: Elemental and 
Chemical Speciation Mapping of Biological 
Systems 
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gen bonding; X-ray diffraction 

Definition 

These diffraction methods (X-ray, neutron, electron) 
provide information about the three-dimensional 
crystalline structures of oligosaccharide and 
polysaccharide 
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Introduction 

Diffraction by single crystals is the most powerful 
experimental method for the characterization of the 
atomic arrangements in molecules. X-ray and neutron 
are used since their wavelengths are of the same order 
of magnitude as the interatomic distances, typically 
around an angstrom unit. The electron and nuclei are 
the scattering elements of the incident X-ray and 
neutron-incident radiation, respectively. The single 
crystals are usually grown by slow evaporation of 
saturation under controlled environments. They have 
three-dimensional periodicity among the molecules. 
Ideally, their dimensions should be in the order of 
0.2-0.5 mm and over 1.0 mm, in the respective case 
of studies from X-ray and neutron diffraction. Crystals 
having smaller dimensions (in the order of 20-30 pm) 
can be studied using synchrotron X-ray diffraction 
intensities. The diffraction patterns are recorded on 
electronic devices; they consist of a series of Bragg 
reflections. Their positions and intensities, respec¬ 
tively, contain the details on the unit cell dimensions, 
space group symmetry, and atomic positions. Unit 
cells are the building blocks of the crystals. In order 
to determine the crystal structure, both the amplitude 
and phase of the diffracted wave in every Bragg reflec¬ 
tion are required. The resolution of the crystal 
structure, that is, the location of the atomic positions 
within the unit cell, follows a series of well-established 
computational procedures. From the final atomic coor¬ 
dinates, the molecular shape is revealed; structural 
features such as bond lengths, bond angles, torsion 
angles, hydrogen bonds, intermolecular distances, 
and packing features are directly computed. 

In nature, carbohydrates form an important family 
of biomolecules, as simple or complex carbohydrates, 
either alone or covalently linked to proteins or lipids. 
Most of the earlier studies on carbohydrates focused 
on plant polysaccharides, such as cellulose, starch, - 
pectins, etc., largely because of their wide range of 
applications. The role of carbohydrates in biological 
events has been recognized and glycobiology 
has emerged as a new and challenging research 
area at the interface of biology and chemistry. Oligo¬ 
saccharides play structural and physical roles but are 
also specifically recognized by lectin receptors. 
Of special interest are the carbohydrate-mediated rec¬ 
ognition events that are important in biological 
phenomena. 


Molecular Crystals of Oligosaccharides 

The Cambridge Structural Data Base (CSDB) contains 
over 400,000 entries in the form of structural data 
related to molecular crystals and organometallic struc¬ 
tures (Allen and Motherwell 2002). The number of 
entries dealing with carbohydrate crystal structures is 
over 4,000. There are 55 crystal structures of 
unsubstituted disaccharides. There are 24 trisaccharide 
crystal structures and only four tetrasaccharides. The 
largest linear oligosaccharide that has been crystallized 
and analyzed is p-nitrophenyl a-maltohexopyranoside. 
The largest underivatized oligosaccharide structure 
ever solved as a molecular crystal is that of tricolorin 
A (Fig. 1). 

There is an obvious reluctance of carbohydrates 
to crystallize in a form suitable for X-ray or neutron 
diffraction studies. This is particularly true for aldose- 
and ketose-containing carbohydrates as 
a configurational mixture of four isomers - a- and 
P-pyranoses, a- and P-furanoses - is likely to occur. 
Configurational heterogeneity in solution tends to 
inhibit crystallization. For this reason l-O-methyl 
derivatives, which cannot anomerize, are likely to 
crystallize more readily. The a and P ratio may 
be dependent on the temperature and solvent of crys¬ 
tallization, and may not be reproducible between 
independent investigators. The reluctance of carbohy¬ 
drates to crystallize in a form suitable for X-ray 
diffraction studies is more pronounced for compounds 
having molecular weights ranging from 1,000 to 5,000. 
This is true but for the exception of cyclic compounds 
such as cyclodextrins and cyclo-amyloses. One of the 
reasons may be the lack of sufficient amount of mate¬ 
rial available for crystal growth. The other reason is 
that the techniques of growing organic crystals of 
medium-sized biomolecules have not paralleled the 
revolution of growing protein and virus crystals. 

Important in their own right, disaccharides take 
great importance as the shortest components of the 
family of oligo- and polysaccharides and complex 
carbohydrates. As such, attention is given to their 
conformational properties because their molecular 
shapes are considered to be important determinants of 
their properties and those of their larger parents. For 
disaccharide moieties, the main conformational deter¬ 
minants are the ring shapes, the orientations of the 
hydroxyl groups, and the relative orientations of 
the monosaccharide units at the glycosidic linkage 
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X-Ray Diffraction and Crystallography of Oligosaccha¬ 
rides and Polysaccharides, Fig. 1 Tricolorin. (a) Schematic 
representation of the tricolorin A molecule, (b) along with the 
molecular conformation derived from the structural resolution 
from X-ray diffraction data recorded from a needle-shaped 


single crystal (0.5 x 0.01 x 0.01 mm^) on beam line ID29 at 
the ESRF (Grenoble). The size of the crystal unit cell indicated 
the presence of four independent tricolorin A molecules per 
asymmetric unit. Each unit contains a total of 284 nonhydrogen 
atoms (Rencurosi et al. 2004) 


(Imberty and Perez 2000). Ring shapes can be defined 
in terms of reference conformations (chair, C; twist, T; 
boat, B; envelope, E; skew, S) or by the puckering 
parameters. The exocyclic primary alcohol groups 
can adopt a number of low-energy conformations. 
They are in staggered arrangements that correspond 
to local minima. The relative orientation of two con¬ 
secutive monosaccharide units in a disaccharide is 
described by the torsion angles <D and 'P around the 
glycosidic bonds. $ represents the torsion angle about 
the C (anomeric)-O bond. The consideration of the 
axial/equatorial nature at the glycosidic linkage pro¬ 
vides a useful framework for the classification of the 
disaccharide moieties, independently of the remaining 
and of the surrounding of the oligosaccharidic 
molecule (Fig. 2). As for six-membered ring- 
containing disaccharides, axial-axial, axial-equato¬ 
rial, and equatorial-equatorial are found. The families 
involving axial-furanose and equatorial-furanose 
linkage to a hexopyranose are also indicated, along 
with the furanose-furanose cases. There is no 
representative of the equatorial-axial class, even 
though such type of glycosidic linkage can be found 
in several carbohydrate-containing molecules and 
macromolecules. 

Using this classification, all the unsubstituted com¬ 
ponents of the linear oligosaccharide structures have 


been identified, reported, and analyzed (Perez 2007a). 
For each class, the nature of the glycosidic linkage, the 
magnitude of the angles (<D, 'P, Q) at the glycosidic 
linkage, and the occurrence (if any) of inter-residual 
hydrogen bonds were reported. All the crystalline con¬ 
formations lie within the 5 kcal mol“* energy contour 
of the corresponding potential energy surface (Imberty 
and Perez 2000). There is a limited dispersion about 
the glycosidic <1> angles compared to that observed 
about 'P, as an expression of the influence of the exo- 
anomeric effect on the establishment of preferred 
conformation in crystalline oligosaccharides. The 
exo-anomeric effect arises due to the particular bond¬ 
ing sequence C-5-0-5-C-l-0-x-C-x in glycopyr- 
anosides and disaccharides and influences the 
conformation about the glycosidic torsion angle O. In 
the case of an axial type of linkage, only one staggered 
conformation is preferred with O being in the vicinity 
of 60°. As for the equatorial type of linkage, two 
staggered conformations are preferred (d? = 60° and 
—60°) of which that corresponding to O = —60° is 
favored due to further stabilization occurring from 
nonbonded interactions. The values of O generally 
vary between 40° and 120° for l^x axial linkages, 
and between —100° and —65° for l^x equatorial 
linkages. The magnitude of the exo-anomeric effect 
is not strong enough to drive the angle to one single 
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X-Ray Diffraction and 
Crystallography of 
Oligosaccharides and 
Polysaccharides, 

Fig. 2 Glycosidic linkages. 
Schematic representation of 
the different types of 
glycosidic linkages observed 
in crystal structures of 
disaccharides and 
oligosaccharides. For a 1 —> x 
glycosidic linkage: O = 05n- 
C 1 n“0^(n+ 1 )~CX(n+ 1 ), 

4^ = Cln-OX(n+|)-CX(n+l)-C 
(x -t l)n+i; for a 1 —► 6 linkage 
Cl = 01n-C6(n+l)-C5(n+i)- 
05(n+l), X 05n Cln OX(n+l) 
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conformation. It can be counterbalanced by the occur¬ 
rence of inter-residue hydrogen bond or other 
favorable interatomic interactions. The dispersion of 
the observed conformations about the torsion angle 
is much wider (Perez et al. 2000). 


Hydrogen Bonding in Crystalline Oligosaccha¬ 
rides. The carbohydrates offer rich examples to 
analyze the hydrogen bonding in crystals, from which 
rules can be extracted to be further used in molecular 
modeling situations. Most of the basic rules have been 
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established throughout the analysis of high-accuracy 
X-ray analysis and most evidently from those crystal¬ 
line structures that have been derived from neutron 
diffraction investigations. A systematic investigation 
of the differences in bond lengths from X-ray and 
neutron diffraction analyses of the same crystal struc¬ 
ture gave significant differences for dX-dN, for C-H 
and 0-H bonds, respectively, —0.096(7) and —0.155 
(10) A. This is why the X-ray X-H bond lengths have 
to be normalized to standard, neutron diffraction, 
values for the interpretation of hydrogen bonding in 
molecular crystals. The rules governing the establish¬ 
ment of hydrogen bonds obey two basic concepts 
(Jeffrey and Saenger 1994). 

1. Maximize the hydrogen-bond interactions through¬ 
out the participation of all hydroxyl groups and as 
many rings and, to a lesser extent, glycosidic 
oxygen atoms as possible. 

2. Maximize cooperativity by forming as many finite 
and infinite chains of hydrogen bonds as possible. 
Hydration is a fairly common feature of ciystalline 

oligosaccharides, which may result from their molec¬ 
ular shapes that do not provide efficient packing. 
Therefore, voids from the water molecules are left in 
the crystalline structures. The analysis of crystalline 
oligosaccharides can only provide a limited insight 
into the preferred hydration effect. Whereas water 
molecules may influence the occurrence of a given 
conformation, these water molecules may be expelled 
upon formation of the crystalline arrangement. In the 
hydrate structures, the water molecules always donate 
two hydrogen bonds; they may accept one or two. 

Packing Features. As for their crystalline structure, 
the carbohydrates crystallize in noncentro-symmetric 
structures. Only four space groups, that is, P2i2i2i, 
P2i, Pi, and C 2 , account for more than 80% of the 
observed space groups. There is a lack of comprehen¬ 
sive investigation of the molecular principles underly¬ 
ing the formation of the oligosaccharide crystals. Few 
reports have tried to analyze the anisotropy of 
intermolecular interactions in these crystals. A way 
of exploring the packing arrangement is to evaluate 
the intermolecular energy between one molecule, that 
is, the reference molecule, and all its neighbors. This 
energy is evaluated by taking into account the 
intermolecular hydrogen bonds as well as the 
nonbonded interactions. The packing arrangements 
found in crystal structures of oligosaccharides are con¬ 
sistent with a high packing density, as each molecule is 


usually surrounded by 12 neighbors. Among these 
neighbors, which occur in pairs, certain anisotropy 
exists, which in many cases can he correlated to the 
dimensions of the unit cell, and/or the morphology of 
the crystals. From the differences in the magnitude of 
the intermolecular energies, supramolecular elements 
such as molecular chains and/or molecular layers can 
be identified that constitute the basis of the molecular 
layers. From the limited number of packing of oligo¬ 
saccharide structures that have been investigated so 
far, the low-energy molecular layers correspond to 
the few, which underline the formation of the space 
groups cited above. The issues related to the under¬ 
standing of crystalline polymorphism along with the 
relationship between crystal packing and crystalline 
morphology should certainly represent a future line 
of research. 

Crystalline Conformations of 
Oligosaccharides Complexed with Proteins 

Since the beginning of the 1990s, an increasing 
number of crystal structures have been reported for 
glycoproteins and protein-carbohydrate complexes. 
Significant and rapid progresses arising from the use 
of synchrotron radiation are yielding highly resolved 
structures. For example, the protein-carbohydrate 
literature provides structural data for several 
cellobiose-protein crystal structures, lactose-protein 
crystal structures, maltose-protein crystal structures, 
sucrose-protein crystal structures, a,a-trehalose- 
protein crystal structures, etc. Forces in protein-car¬ 
bohydrate complexes should be more varied than in 
the fairly homogenous small-molecule oligosaccha¬ 
rides crystals. The protein geometry reduces the influ¬ 
ence of preferred packing modes; this is particularly 
true for hydrolytic enzymes that induce significant 
distortions both of the ring geometry and the confor¬ 
mations at the glycosidic linkages. Other bias may 
occur, resulting from the low resolution of the struc¬ 
ture, or from the fact that the crystallographic refine¬ 
ments are often guided with force fields. For these 
reasons, the presentation is restricted to two classes of 
proteins: the lectins and those interacting with gly- 
cosaminoglycans. In these two cases, there is no dis¬ 
tortion to the carbohydrates occurring from 
hydrolysis; besides the crystallographic resolution is 
generally very high. 
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X-Ray Diffraction and Crystallography of Oligosaccha¬ 
rides and Polysaccharides, Fig. 3 Lectin-Oligosaccharide 
interactions, (a) Three-dimensional structure of a lectin of Pseu¬ 
domonas aeruginosa interacting with the human milk 

Oligosaccharide-Lectin Complexes. Among 
proteins that interact noncovalently with carbohy¬ 
drates, lectins bind mono- and oligosaccharides revers¬ 
ibly and specifically, while displaying no catalytic or 
immunologic activity. More than 900 crystal struc¬ 
tures, corresponding to about 190 different lectins, 
have been solved; more than 60% of them as com¬ 
plexes with carbohydrate ligands. A database of the 
three-dimensional structures of lectin is maintained 
and is freely available at: http://www.cermav.cnrs.fr). 
An extraordinary wealth of information about oligo¬ 
saccharides is available in this database and detailed 
information about the conformation of more than 100 
oligosaccharides (from disaccharides to tetradecasac- 
charides) can be found here (Perez 2007a) (Fig. 3). 

Glycosaminoglycan-Protein Complexes. Glycos- 
aminoglycans (GAGs) are linear polysaccharides 
present on all animal cell surfaces and in the extracel¬ 
lular matrix, where they are usually found to be 
attached covalently to core proteins to form the pro¬ 
teoglycan family. GAGs can be classified into four 
groups: the hyaluronic acid type, the chondroitin/ 
dermatan sulfate type, the heparan/heparin type, and 
the keratan type. They are acidic polysaccharides made 
of disaccharide repeating units (ranging from 40 to 
100) which consist of uronic acids and hexosamines. 
Except for the hyaluronic acid, epimerization at the 
C-5 position of uronic acids, and N- and O-sulfation 
provide numerous sources of micro-heterogeneities. 
GAGs assume extended structures in aqueous solu¬ 
tions because of their strong hydrophilic nature based 
on their extensive sulfation, which is further exagger¬ 
ated when they are covalently linked to core proteins. 
They hold a large number of water molecules in 
their molecular domain and occupy enormous 


oligosaccharide; (b) atomic details of the molecular interactions; 
water molecules are shown along with the hydrogen-bonding 
pattern (Ferret et al. 2005) 


hydrodynamic space in solution. A complementary 
property of the GAGs is their capability to specifically 
interact with a number of important growth factors and 
functional proteins. Such interactions are often crucial 
to the biological functions of these proteins. 

Recent crystallographic elucidations have provided 
structural information about the binding of some low 
molecular weight oligosaccharides from GAGs in 
interactions with proteins. These are chemokines, 
complement proteins, extracellular matrix proteins, 
enzymes, growth factors, and viruses. A data base of 
the three-dimensional structures, (Perez 2007a; 
Imberty et al. 2007) has been set; it summarizes 
the type of glycoaminoglycan fragments that 
have been co-crystallized. Most of them are heparin 
oligosaccharides. Such a fragment as long as 
hexadecasaccharide could be co-crystallized as 
complexed with antithrombin and thrombin in a crystal 
solved at 2.5 A resolution. This is by far the largest 
oligosaccharide structure ever solved by X-ray diffrac¬ 
tion method (Fig. 4). 

Fiber Diffraction of Polysaccharides 

The most important method for the structure determi¬ 
nation of crystalline polysaccharides is X-ray fiber 
diffraction. Diffraction from well-ordered three- 
dimensional single crystals provides the most detailed 
information; it should be recognized that fibers do have 
certain advantages over single crystals. It has been 
observed that linear polysaccharides prefer to exist as 
long helices rather than more convoluted structures. 
A fibrillar sample may be a more relevant environment 
for biological macromolecules, as it can accommodate, 
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Oligosaccharides and 
Polysaccharides, 

Fig. 4 Antithromhin _16mers. 
Three-dimensional structure 
of a heparin 

hexadecasaccharide in the 
crystalline tertiary complex 
antithrombin-thrombin- 
heparin (Li et al. 2004) 



without loss of order, changes in polymer structure and 
conformational flexibility that would destroy order in 
a single crystal. Besides, the native organization of 
several polysaccharides occurs in a fibrillar form and 
it is, therefore, quite relevant to investigate the struc¬ 
tural results of concomitant biosynthesis and crystalli¬ 
zation. As for other types of polysaccharides, one can 
usually, after dissolution, produce samples, in which 
helical macromolecules are aligned with their long 
axes, parallel or antiparallel (by convention, this is 
the c-direction). Further lateral organization may 
occur, but rarely to the degree of a three-dimensionally 
ordered single crystal. 

X-Ray Fiber Diffraction and the Use of Synchrotron 
and Neutron Radiation. X-ray diffraction patterns 
obtained from fibrillar samples do not provide 
sufficient experimental information to resolve the crys¬ 
tallographic structure unambiguously. A fiber is 
composed of an assembly of crystallites having 
a common axis but random orientation. To this source 
of disorder there needs to be added one arising from the 
disorientation of the chains in the interior of the crys¬ 
tallite domains along with their small dimensions. 
These various sources of disorder are the origin of the 
low number of reflections found in fiber diagrams. 
Many polysaccharides yield no more than 50 indepen¬ 
dent X-ray reflections that can be used to determine the 
molecular geometry of the crystallographic asymmet¬ 
ric unit. On these diffraction patterns, the reflections 
are distributed in horizontal rows the spacing of which 
corresponds to the fiber repeat when the polymer axis 
is parallel to the fiber axis. Thus, this periodicity is 


a geometric parameter that can be determined unam¬ 
biguously from a fiber diagram and this usually 
corresponds to the c-dimension of the unit cell. 
Systematic absences of (001) reflections provide infor¬ 
mation about the helical symmetry of the polymer 
chain. The possibility of unambiguous determination 
of the other unit cell parameters as well as systematic 
absences in all the reciprocal space, and, therefore, the 
assignment of the space group, depends on the ability 
to index the observed reflections which, in turn, 
depends on the quality of the samples. The orientations 
of the individual chains (parallel or antiparallel) 
sometimes cannot be determined unambiguously. 
In the crystalline state as well as in the gel state, 
polusaccharides can occur in the form of single, double 
or triple helices (Fig. 5). 

These multiple helices make the interpretation of 
the diffraction patterns even more difficult; the same 
pattern might be explained by both single- and double¬ 
helical models. In contrast to other macromolecules, 
the diffraction data that can be obtained from 
polysaccharides are not sufficient to permit crystal 
structure determination based on the data alone. 
A modeling technique must be used which allows the 
calculation of diffraction intensities from various 
models for comparison with the observed intensities. 
The first, computer-assisted, and reasonably detailed 
structure analysis of a polysaccharide was completed 
more than 20 years ago. The number of apparently 
successful structure solutions of polysaccharides 
exceeds 100, counting all polymorphs, variants, deriv¬ 
atives, and complexes (Chandrasekaran 1997). The 
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X-Ray Diffraction and 
Crystallography of 
Oligosaccharides and 
Polysaccharides, 

Fig. 5 Helices. Three- 
dimensional representations of 
polysaccharide chains. Single¬ 
helical structure as in the case 
of cellulose; double-helical 
parallel stranded structure as 
in the case of starch; and triple 
helical parallel stranded 
structure as in the case of 
curdlan 



joint use of molecular modeling and diffraction 
techniques has been invaluable in the quantitative 
elucidation of crystal and molecular structures (Perez 
2007b). 

New structural tools became available throughout 
the development of large-scale infrastructures such as 
neutron reactors and synchrotron radiation facilities. 
High resolution of the X-ray has been obtained for 
some polysaccharides. Despite such achievements the 
reliable determination of hydrogen atoms is not possi¬ 
ble. Hydrogen atoms are relatively weak scatterers of 
X-rays but not of neutrons. Neutrons are also scattered 
in a significantly different way by hydrogen and by its 
isotope, deuterium. There exists a possibility to use 
neutron diffraction in combination with specific deu- 
teration for locating hydrogen atoms and investigating 
hydrogen bonding in solid-state structure of polysac¬ 
charides. To achieve this, methods have been devel¬ 
oped for deuteration of intracrystalline regions of 
native polysaccharides without affecting the overall 
structural integrity. Such a methodology has been 
applied successfully to the resolution of the crystalline 
structures of two allomorphs of native cellulose (Perez 
and Samain 2010). The possibility to locate the posi¬ 
tions of hydrogen atoms involved in hydrogen bonding 
was from Fourier-difference analysis using neutron 
diffraction data collected from hydrogenated and 
deuterated samples (Fig. 6). 


Electron Diffraction of Polysaccharides. As with 
many other stereoregular polymers, simple linear poly¬ 
saccharides, once dissolved and recrystallized, can 
yield single crystals. In other instances, crystalline 
arrangement of polysaccharides occurs as concomitant 
to biosynthesis, as in cellulose, starch, chitin, etc. In 
many instances, polysaccharides crystallize with the 
incorporation of water or other solvent molecules. In 
most cases, a well-dehned morphology is obtained, the 
most common being plate-like. These thin lamellar 
surfaces have lateral dimensions of several microme¬ 
ters for only a few tenths of an angstrom in thickness. 
Crystalline domains of such dimensions are well suited 
for examination by transmission electron microscopy 
in both imaging and diffraction modes. A problem 
frequently encountered when studying crystalline bio¬ 
polymers with the electron microscope relates to the 
vacuum dehydration of the specimen when inserted in 
the instrument column. This is critical when water or 
solvent is part of the crystalline structure. In such 
instances, total or partial de-crystallization takes 
place, in a matter of minutes, accompanied by drastic 
distortion of the sample. Several methods have been 
developed whereby the sample is either viewed inside 
a hydration chamber or quenched in a cryogenic bath 
prior to insertion into the electron microscope. The 
observations are performed at a temperature close to 
that of boiling liquid nitrogen, where the water of 
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X-Ray Diffraction and Crystallography of Oligosaccha¬ 
rides and Polysaccharides, Fig. 6 Cellulose, (a) Synchrotron 
X-ray diffraction data collected from fibers of Halocynthia 
roretzi cellulose ip on station ID2A at the ESRF, Grenoble, 
France, (b) Composite fiber diffraction pattern of deuterated 
cellulose ip as recorded from neutron and X-ray diffraction, 
(c) Details of some geometrical features found in the crystal 
structure of cellulose ip by combining observed structure factor 



amplitude from deuterated-ip and hydrogenous cellulose ip, 
respectively. The skeletal model represents the cellulose chains. 
The difference density is shown at two contour levels in blue and 
pink. The position of the deuterium atom associated with the 
secondary hydroxyl group at O 3 is clearly indicated. The deute¬ 
rium atoms associated with primary Oe and secondary O 2 
hydroxyl groups are also located, albeit with less definition 
(Nishiyama et al. 2002) 


crystallization is stable in high vacuum. With such 
a technique, frozen wet electron diffractograms are 
readily recorded. 

In nature, plant polysaccharides are commonly 
found as part of complex architecture whose ultrastruc- 
tural organization depends on the source organism. In 
a material such as wood, the cell walls are composed of 
cohesive, interlaced crystalline microfibrils that are 
themselves composed of cellulose. In the case of 
starch, a complex alternation of crystalline, semicrys¬ 
talline, and amorphous layers constitutes the granule. 
Applications of novel approaches using synchrotron 
radiation have made a momentous contribution toward 
characterization of the stmctural organization of such 
complex structural organization. X-ray diffraction dia¬ 
grams can be recorded on such samples as a wood 
section of about 10 mm thick or an isolated starch 
granule X-ray beam of 2 pm full width, having flux 
of about 1,010 photons/s/pm^, as available at European 
Synchrotron Radiation Facility (ESRF) can be used to 
map the occurrence of the crystalline regions within 
such untreated sample (Perez and Bertoft 2010) 
(Fig. 7). Oriented 2D-fiber diffraction patterns can be 
obtained yielding key information about the nature of 
the crystalline stmcture along with their locations and 
orientation. The most recent advent of synchrotron 


radiation machines and the development of the 
micro-focus beamline at the ESRF have greatly 
improved the micro-diffraction method. The technique 
has now been applied to a number of complex assem¬ 
blies of polysaccharides. X-ray probes with a diameter 
as low as 0.5 pm are currently available, with the 
brilliance of the synchrotron, and 0.1 pm probes will 
be available in the near future. 

Under the name PolysacDB an annotated database 
on three-dimensional stmctures of polysaccharides is 
available: http://www.cermav.cnrs.fr. It provides 
information regarding the structure (determined by 
X-ray crystallography, electron diffraction, and molec¬ 
ular modeling), linkage, repeating units, diffraction 
patterns, and reference of the publication of each of 
the respective polysaccharide entries. The user shall 
benefit from an overview (incorporated in the 
resource) of how these structure-determination 
methods are employed to study polysaccharide struc¬ 
tures. It shall also include an inbuilt tool that would 
allow researchers to build their polysaccharides struc¬ 
tures and allow them to generate and optimize 
polysaccharide helices. PolySacDB shall serve as 
a valuable resource as a database and building tool 
for the complex carbohydrates to the scientific 
community. 
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X-Ray Diffraction and Crystallography of Oligosaccha¬ 
rides and Polysaccharides, Fig. 7 Starch, (a) Fiber diffrac¬ 
tion diagram of starch, along with the location of the (10 0) 
reflection; (b) scanning electron micrograph of a batch of Phajus 
grandifolius starch granules; (c) composite image composed of 
X-ray patterns recorded during a mesh scan on the ESRF ID 13 


micro-focus beamline; (d) display of five selected diffraction 
diagrams showing the (1 0 0) reflection {d = 1.58 nm); 
(e) intensity distribution of the crystalline fraction of the granule 
overlaid with arrows reflecting the orientation of the local fiber 
axes. The length of the arrows is proportional to local (1 0 0) 
intensity (Chanzy et al. 2006) 


Summary 

The content of this entry provides essential informa¬ 
tion to initiate the general reader into the world of 
three-dimensional structures of oligosaccharide and 
polysaccharide as derived from diffraction methods. 
The relevant bibliographical information is provided, 
which may be useful to those directly involved in 
structural glycoscience. Some conformational features 
of oligosaccharides are described using the disaccha¬ 
ride moiety as a basic reference entity. Some rules 
which dictate the formation of hydrogen bonding and 
packing patterns in crystalline oligosaccharides are 
briefly presented, with the recognition that there is 
a lack of comprehensive investigation of the molecular 
principles underlying the formation of oligosaccharide 
crystals. There has been a stagnation of the number of 
oligosaccharide structure analyses reported over the 
last decade. Usually, the minute amount of available 
pure oligosaccharides along with their natural 
reluctance to crystallize in a form suitable for X-ray 
diffraction studies may explain such stagnation. 
Advances in the synthesis or biosynthesis of carbohy¬ 
drates, along with their large-scale productions, as well 
as development in X-ray diffraction sources will be 


required to overcome the paucity of structural and 
crystalline data on oligosaccharides. Interest is 
focusing on the more complex carbohydrate mole¬ 
cules, as diffraction methods extend to the held of 
protein-carbohydrate complex. Such a field benefits 
from the development of methods for crystallizing 
these protein-based complexes as well as from the 
rapid progresses arising from the use of synchrotron 
radiation which are yielding highly resolved structures. 
The conjunction of these new developments is 
generating new results as the number of complex 
oligosaccharide fragments as part of macromolecular 
complexes is overgrowing those derived from conven¬ 
tional molecular crystallography. As for the class of 
polysaccharides, conventional X-ray fiber diffraction 
and electron diffraction methods are also benefiting 
from methodological developments. The area of 
solid-state deuteration and its application to neutron 
scattering yields a unique type of structural informa¬ 
tion. The extreme brightness of synchrotron X-ray 
beams and the concomitant advances in micro-focus 
technology opens up completely new field of applica¬ 
tions in the area of structures and dynamics of 
biological reactions. Glycosciences would benefit 
from such developments as glycobiological materials 
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would be closely scrutinized at several levels of struc¬ 
tural organization to reveal the secrets behind their 
remarkable functions and properties. 

Cross-References 

► Cellulose 

► CNS (Crystallography and NMR System) 

► Electron Diffraction 

► Glycosaminoglycan (GAG) 

► Glycosylated Natural Products 

► Hydrodynamic Modeling of Carbohydrate Polymers 

► Lectins 

► Macromolecular Neutron Diffraction 

► Polysaccharide Biophysics 

► Polysaccharides: Biophysical Properties 

► X-ray Fiber Diffraction 
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X-ray Fiber Diffraction 

Louise Serpell 

School of Life Sciences, University of Sussex, Palmer, 
Brighton, East Sussex, UK 

Synonyms 

Wide and small angle X-ray scattering; X-ray 
diffraction 


Definition 

X-ray fiber diffraction is a technique that enables struc¬ 
ture determination of fibrous or polymeric molecules 
based on fhe ordered arrangement of molecular struc¬ 
ture along a fiber axis. Diffraction patterns are gener¬ 
ated by the X-ray scattering from the molecular 
organization of the fibrous molecule. The fibrous struc- 
fures are generally cylindrically symmetric around an 
uniaxial direction, which results in diffraction data with 
a characteristic four quadrant symmetry, with diffrac¬ 
tion signals arranged symmetrically across the “merid¬ 
ian” (usually vertical) and “equator” (horizontal) 
(Fig. 1). Unlike ► single crystal diffraction, diffraction 
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X-ray Fiber Diffraction, Fig. 1 X-ray fiber diffraction pattern. 
Left panel shows a schematic image highlighting the meridional 
and equatorial axes of the pattern. The pattern is quadrant sym¬ 
metrical and shows arcs rather than discreet spots. The right 
panel shows the diffraction pattern from an amyloid fibril, 


showing regularly spaced layer lines. [001] refers to the indexing 
of the meridional reflections resulting from repeats along the 
fiber axis (1). [hkO] is the indexing in the two directions around 
the cylindrically symmetric fiber 


signals are often overlapping and, depending on the 
quality of the diffracting sample, may show diffraction 
peaks arranged along “layer lines.” Layer lines arise 
from the helical pitch along the fiber axis, while dif¬ 
fraction signals appearing on the meridian give infor¬ 
mation regarding spacing of individual scattering 
entities along the fiber. 


Basic Characteristics 
Fibers 

X-ray fiber diffraction requires the use of molecules 
that have ordered organization of molecules around 
a central fiber axis. The amount of information obtain¬ 
able from diffraction data is directly related to the order 
within the diffracting sample (King and Alexander 
1974). Single crystals show the highest degree of 
order, whereby molecules are arranged on a three- 
dimensional lattice. In fibers, the molecules are 
arranged along a uniaxial direction. In this case, the 
scattering appears on layer lines, and the layer planes 
have a separation of the reciprocal of the periodic 
repeat along the fiber axis (Fig. 2a). Layer lines are 
arranged perpendicular to the meridian and partially 
parallel to the equator and arise from helical repeats 
along the fiber axis. Since a single chain molecule has 
no periodicity in the x- and y-directions, the scatter is 



X-ray Fiber Diffraction, Fig. 2 Continuous and discontinuous 
helix showing the coiresponding diffraction arising from P, the 
helical pitch, and p, the distance between stacked units 

a continuous function. In practice however, a fiber 
specimen is composed of several chain molecules 
with rotational averaging. The molecules may be axi¬ 
ally in register, but arranged at all possible azimuthal 
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orientations. These molecules are related to one 
another, but not by a crystallographic lattice. 
A collection of chain molecules give the scattering 
effect for the isolated molecules plus the effects caused 
by the orientation of the molecules, the packing, size, 
and mutual arrangement of ordered regions (Marvin 
and Nave 1982). 

In general, fiber samples are mounted with the fiber 
bundle vertical to the X-ray beam. The meridian of the 
resulting diffraction pattern is in the same orientation 
as the fiber and gives information about the crystalline 
order along the fiber axis. If the helical pitch is 
also interspersed by regularly arranged molecules 
as in a discontinuous helix (Fig. 2b), then the recipro¬ 
cal of the separation along the fiber axis will be appar¬ 
ent as meridional reflections. The equatorial 
diffraction signals give information from the lateral 
association of molecules within and between fiber 
molecules and are made up of cylindrically averaged 
information. 

In general, fibrous samples are weakly 
diffracting. In order to generate a fiber diffraction 
pattern from a sample composed of fibrous mole¬ 
cules, it is necessary to orient the fibers relative to 
one another to create a diffracting sample. If the 
fibers are not oriented, then the diffraction pattern 
will consist of rings, much like a powder diffraction 
pattern. If some orientation of the molecules relative 
to one another is possible, then the diffraction pat¬ 
tern will consist of reflection arcs that arise from the 
disorientation of the fiber molecules relative to 
a vertical axis. These patterns will allow distinction 
between the meridional and equatorial axes, reveal¬ 
ing information about the structure along the fiber 
axis and perpendicular to it. Very highly oriented 
samples will give rise to well-oriented diffraction 
patterns that can show separation of Bragg peaks 
onto layer lines. Occasionally, fiber molecules may 
arrange themselves in a preferred orientation, which 
can give rise to a diffraction pattern with some 
characteristics of a single crystal diffraction pattern 
and three-dimensional structural information. How¬ 
ever, fibers are generally cylindrically symmetric 
and therefore the reflections are spread into a disk, 
which intercepts the Ewald sphere at two symmet¬ 
rical points for an untilted sample. This results in 
a diffraction pattern that is symmetrical above and 
below the equator. Tilting of the sample may 
resolve the resolution of other reflections. 


Data Collection 

X-ray fiber diffraction data may be collected using in- 
house X-ray diffractometers that are traditionally 
used for single crystal X-ray crystallography, using 
CuKa radiation. Wide angle diffraction data will gen¬ 
erally require shorter specimen-to-detector distances 
that can be accommodated on in-house X-ray kits. 
However, low angle diffraction may require specifi¬ 
cally made geometry that allows for lower wave¬ 
lengths and/or longer specimen-to-detector distances 
to allow collection of low-resolution diffraction sig¬ 
nals. Long distances can also introduce the problem of 
air scatter at the CuKa wavelength and therefore 
helium chambers may be used to reduce this addi¬ 
tional scatter. This is particularly important for long- 
range ordered samples such as muscle fibers (Squire 
et al. 2005). 

Data Analysis 

Several fiber diffraction processing packages are 
available to aid analysis of the diffraction patterns 
(Table 1). Each contains modules to help with mea¬ 
surement of diffraction signal positions and back¬ 
ground subtraction. Simple interpretation of 
diffraction data can be made by considering that the 
diffraction signals found on the meridian arise from 
units stacked along a discontinuous helix, while layer 
line spacing arises from the helical repeat. Equatorial 
reflections can arise from the size of the fiber and also 
the association of subunits or individual components 


X-ray Fiber Diffraction, Table 1 Some examples of fiber 
diffraction data analysis packages 


FibreFix 

http://www.small-angle. 
ac.uk/small-angle/ 
Software/FibreFix.html 
(Squire et al. 2003) 

Pattern analysis; unit cell 
determination; 
background subtraction; 
quadrant averaging; ; 
reciprocal space mapping 

Clearer 

(Makin et al. 2007) L.C. 
Seipell@sussex.ac.uk 

Pattern analysis; unit cell 
determination; 
background subtraction; 
diffraction calculation 

WCEN 

http://fibemet. Vanderbilt, 
edu/softwai'e/wcen/ 

Pattern analysis: 
diffraction parameter 
determination; data 
correction; reciprocal 
space mapping 

Fit2D 

http://www.esrf.eu/ 

computing/scientific/ 

FIT2D/ 

Pattern analysis and 
manipulation; Calibration 
and correction of detector 
distortions 
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X-ray Fiber Diffraction, 

Fig. 3 Diffraction from 
B-DNA (Watson and Crick 
1953). Showing structure of 
B-DNA in right panel and the 
comparison between 
experimental diffraction from 
(Watson and Crick 1953) with 
calculated pattern (calculated 
using CLEARER (Makin et al. 
2007)) 
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of the fibrous molecule structure. However, interpre¬ 
tation of fiber diffraction data is often hampered by 
overlapping reflections and difficulties in measuring 
precise diffraction intensities. Therefore, molecular 
models can be built that may fit with constraints of 
the diffraction analysis, such as the helical pitch, rise 
of molecules on the axis, and diameter of the mole¬ 
cules. In some cases, it can be possible to determine 
possible unit cells into which the model coordinates 
may be built (Makin et al. 2007). To validate any 
model, it must give rise to a diffraction pattern with 
similar features to the experimental data. Therefore, 
facilities exist to calculate fiber diffraction patterns 
from model coordinates. These calculations must take 
into account disorientation and disorder of a fibrous 
molecule. Figure 3 shows an example of calculation 
of diffraction from B-DNA (Fuller et al. 1965) to 
produce a diffraction pattern that can be compared 
to the experimental data. 

Fibrous Structures 

X-ray fiber diffraction has been successfully used to 
produce structural models for a broad range of fibrous 
molecules from synthetic polymers such as nylons to 
fibrous proteins. Most famously, fiber diffraction was 
used to decipher the structure for DNA (Fuller et al. 
1965; Watson and Crick 1953) (Fig. 3). Significant 
advances have been made in structure determination 
of fibrous proteins such as muscle fibers, (actin, 
myosin, tropomyosin), collagen, and the bacterial fla¬ 
gella (Namba et al. 1989). Often, diffraction data is 


complemented by additional biophysical and/or imag¬ 
ing techniques that contribute information about the 
conformation or macromolecular structure. X-ray fiber 
diffraction remains a powerful method capable of 
resolving long-range order that can often be used to 
complement the short-range measurements obtained 
from other methods. 

Cross-References 

► Macromolecular Crystallography: Overview 

► X-Ray Diffraction and Crystallography of 
Oligosaccharides and Polysaccharides 
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X-Ray Fluorescence (XRF) 

► X-Ray Fluorescence Imaging: Elemental and Chem¬ 
ical Speciation Mapping of Biological Systems 


X-Ray Fluorescence Imaging: Elemental 
and Chemical Speciation Mapping of 
Biological Systems 

Graham N. George and Ingrid J. Pickering 
Department of Geological Sciences, University of 
Saskatchewan, Saskatoon, SK, Canada 

Synonyms 

Micro-X-ray fluorescence (p-XRF); Rapid scan X-ray 
fluorescence (RSXRF); Synchrotron-induced X-ray 
fluorescence (SXRF); X-ray fluorescence (XRF); X- 
ray fluorescence mapping (XFM) 

Definition 

X-ray fluorescence imaging is a technique that can be 
applied to determine the microscopic localization of 
trace elements in biological samples. 

Introduction 

Metal ions play a number of key roles in biology, 
functioning in the active sites of metalloenzymes, as 
structural elements of essential proteins, and 


additionally play a variety of physiological roles such 
as neurotransmitters and in the immune response. X- 
ray fluorescence imaging is a method that can be 
applied to map the microscopic localization of metals 
and other elements in a complex biological sample. 
This article is not intended to be a comprehensive 
treatment across all fields, but instead seeks to illus¬ 
trate the capabilities of the method relevant to 
biophysics. 

X-Ray Fluorescence 

When high-energy X-rays are used to illuminate 
a sample, core-level electrons are excited. Each core¬ 
level excitation creates a core hole which is filled by 
dipole-allowed transitions from higher levels with 
concomitant emission of X-ray fluorescence. The 
major fluorescence lines arise from atomically deep 
levels (e.g., the Kai and Ko ,2 spectral lines arise from 
2 p 3/2 ^ Is and 2 pi /2 ^ Is transitions, respectively) 
and thus show only very slight chemical sensitivity. 
Because X-ray fluorescence depends upon atomic 
physics the lines are at characteristic X-ray energies 
and cannot readily be confused. Figure 1 shows both 
a schematic of the X-ray physics and an example of an 
X-ray fluorescence spectrum. 

X-Ray Fluorescence Imaging 

Nearly all modern applications of biological X-ray 
fluorescence imaging (XFI) utilize synchrotron light 
as a source of X-rays. The advantages of synchrotron 
light include the ability to produce a bright, highly 
focused beam and the option to tune the incident 
energy to obtain chemical characterization, as 
described below. The technique of X-ray fluorescence 
imaging employs a small X-ray beam of fixed energy 
to illuminate the sample, which is raster scanned so 
that the beam interrogates different regions of the 
sample with X-ray fluorescence recorded at each 
beam position. The detector is most often a solid- 
state device, either Si or Ge based, which can provide 
rapid readout of the entire fluorescence spectrum with 
sufficient energy resolution (ca. 150-300 eV) to 
discriminate most elements (Fig. lb). Germanium- 
based detectors have the advantage of higher efficien¬ 
cies at high photon energies, but can lose information 
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X-Ray Fluorescence Imaging: Elemental and Chemical 
Speciation Mapping of Biological Systems, Fig. 1 

(a) Schematic diagram of the underlying physics of X-ray fluo¬ 
rescence. The incident photon causes ejection of a core electron 
resulting in a core hole which is filled by dipole-allowed decay of 
another electron with concomitant emission of X-ray fluores¬ 
cence. (b) Typical X-ray fluorescence spectrum from 

on lower atomic number elements due to overlap with 
escape peaks when the excitation energy is greater than 
the Ge K-edge. The sample scanning uses a precision 
mechanical stage and is either performed continuously 
with data taken “on the fly,” or with a “move-stop- 
measure, repeat” strategy. For large samples or 
samples in which the concentrations are high, the con¬ 
tinuous scan method is often best, whereas for very low 
concentrations requiring long integration times, the 
“move-stop-measure” method may be best. The spatial 
resolution of an XFI experiment depends upon the 
nature of the microfocus optics. In the hard X-ray 
regime (i.e., >7 keV), the best resolutions are gener¬ 
ally achieved by Fresnel zone plate type optics and can 
be as low as 30 nm, although 100-200 nm is more 
generally used. Intermediate resolutions of 1-15 pm 
can be achieved using Kirkpatrick-Baez mirrors or 
tapered capillary optics, and for large samples at 
macro resolutions, a simple aperture is often used. 
Figure 2a shows a schematic of a typical XFI experi¬ 
mental setup. In the figure, the orientation of the 
sample to the incident beam is 45° which allows the 
fluorescence detector to be positioned at 90°. The 
advantage of this geometry is that scattered radiation 
(see Fig. lb) is at a minimum at 90°, minimizing this 
unwanted background signal. The disadvantages of the 
45° geometry are that the beam spot is horizontally 
magnified by v/2 on the surface of the sample and 


b 



a biological sample (human cerebral cortex; Korbas et al. 
2010 ) measured with an energy dispersive solid-state detector, 
showing the characteristic X-ray fluorescence emission lines 
plus X-ray scattering arising from both inelastic and elastic 
processes. The dilute nature of biological samples means that 
the scatter peaks are normally the largest signals 


that the problems of shadowing and edge-effects can 
be more significant, especially for thicker samples. 
Figure 2b shows a typical data set with simultaneously 
measured images for a variety of elements. In most 
cases, a monochromatic beam of X-rays is used to 
excite the fluorescence, although in some experiments 
“pink beam” (poorly monochromatic or polychro¬ 
matic) conditions are used with dramatically increased 
photon fluxes. For monochromatic sources, 
a commonly used illumination energy is 13,450 eV, 
which is high enough to excite K fluorescence from all 
of the first transition elements as well as other elements 
of biological interest such as Se, and is just below the 
K-edge of bromine (13,474 eV), which is a common 
contaminant in plastics. The time taken to build up an 
image by raster scanning is a function of the number of 
pixels, the effective dwell time per pixel, which can 
range from 10 ms to 10 s., plus the latency which is 
inevitable in any system involving mechanical motion. 
In most cases, the total time taken to acquire an image 
is generally a matter of hours. 

With X-ray fluorescence imaging experiments, 
positions of interest on the sample can be further 
interrogated by recording an X-ray absorption spec¬ 
trum using the microfocused beam. With the beam 
fixed on a point of interest, the incident energy is 
scanned through the absorption edge of an element 
and its fluorescence recorded to measure the 
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X-Ray Fluorescence Imaging: Elemental and Chemical 
Speciation Mapping of Biological Systems, Fig. 2 

(a) Schematic plan view of an X-ray fluorescence-imaging 
experimental setup. The optical microscope is typically used 
for beam and sample alignment. The upstream detector may 


not always be present because of space constraints with some 
microfocusing optics (e.g., tapered capillary optics with short 
focal lengths), (b) X-ray fluorescence imaging of a section of 
a rodent (Chinese hamster) showing localizations of different 
elements 


spectrum. This will give the same chemical informa¬ 
tion as X-ray absorption spectroscopy but on 
a microscopic scale. 

Sample preparation for ordinary two-dimensional 
XFI can be similar to conventional microscopy, 
although it should be noted the fixing and staining 
may alter the localization of some elements (e.g., Ca 
and Zn). Moreover, conventional microscopy mate¬ 
rials such as glass slides and cover slips are considered 
undesirable because commercial glasses usually con¬ 
tain relatively high levels of zinc and iron. Samples are 
thus typically prepared either on plastic cover slips, 
which are typically made of polyethylene terephthal- 
ate, or on silicon nitride wafers, which offer more 
acceptable alternatives free of biologically relevant 
contaminants. 

Chemically Selective X-Ray Fluorescence 
Imaging 

The possibility of chemically selective X-ray fluores¬ 
cence imaging was first discussed by Pickering et al. 
2000. A simple X-ray fluorescence imaging experi¬ 
ment such as that shown in Fig. 2b typically uses 


a hxed single excitation energy well above the absorp¬ 
tion edges of the elements of interest. Chemically 
selective fluorescence imaging uses the same experi¬ 
mental apparatus but takes advantage of the chemical 
sensitivity of X-ray absorption near-edge spectra by 
using multiple X-ray energies to excite the fluores¬ 
cence with chemical sensitivity. As described by Pick¬ 
ering et al. (2000), the number of energies required is 
the same as the number of species being discriminated, 
and care must be taken to remove any background 
from scattered radiation. Figure 3 shows an example 
of chemically specihc imaging, where the technique 
was used to study a fern that accumulates arsenate 
from soil and transforms it to the more toxic arsenite 
(Pickering et al. 2006). 

Three-Dimensional X-Ray Fluorescence 
Imaging 

Three-dimensional elemental localizations within 
a sample can be determined by using X-ray fluores¬ 
cence tomography or confocal X-ray fluorescence 
imaging. The former method uses a similar experimen¬ 
tal setup to Fig. 2a, but with the addition of a (/j-stage 
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optical absorbance arsenite arsenate 



X-Ray Fluorescence Imaging: Elemental and Chemical 
Speciation Mapping of Biological Systems, Fig. 3 (a) The 

X-ray absorption K near-edge spectra of arsenite and arsenate, 
showing the two X-ray energies used for chemically specific 


imaging, (b) Comparison of optical micrograph, absorbance, 
and chemically specific images for arsenite and arsenate 
(Pickering et al. 2006) 



X-Ray Fluorescence Imaging: Elemental and Chemical 
Speciation Mapping of Biological Systems, Fig. 4 

(a) Schematic diagram of a sample mounted in a X-ray fluores¬ 
cence tomography setup, showing the cp-stage and horizontal 
scan direction (normal to the incident beam), (b) Experimental 
fluorescence tomography of a plant seedling mounted in a plastic 


tube, showing the transmittance sineogram corresponding to 
a horizontal cross-section, the computed tomograph (CT) in 
transmittance and in selenium fluorescence. The circular outer 
ring in the CT transmittance is the mounting tube, and the faint 
ray-like structures radiating from the center of both CT images 
are artifacts of CT reconstruction 


that is used to rotate the sample about its vertical axis. 
Sinograms are collected as a function of (p and hori¬ 
zontal sample position x (Fig. 4) and computed tomog¬ 
raphy (CT) is used to reconstruct slices. 

Confocal X-ray fluorescence imaging again uses 
a similar experimental setup to Fig. 2a, but with the 
addition of a microfocusing optic mounted on the 
fluorescence detector. The optic is usually a tapered 
polycapillary and is used to restrict the fluorescence 
accepted by the detector to that from the focal point of 
the incident X-ray beam which will be within the 
sample. Slices of X-ray fluorescence can then be col¬ 
lected directly, translating the sample relative to the 
detector for each slice. This experimental setup has the 


significant advantage over tomographic methods that 
micro XAS spectra can be collected from any accessi¬ 
ble voxel within the sample. In both methods, attenu¬ 
ation of the X-ray beams (both incident and 
fluorescent) must be corrected. Moreover, the exten¬ 
sive X-ray exposure of the sample means that radiation 
damage is an issue and that cryoprotection may be 
required. 

Despite the fact that XFI is a relatively new method, 
biophysical applications are already very diverse. 
These range from studies of plant physiology (e.g., 
Pickering et al. 2000,2006, Fig. 3b), to high-resolution 
studies of individual mammalian cells in cell cultures 
(e.g., Carmona et al. 2010) and can include intact small 











X-Ray Scattering of Lipid Membranes 


2785 


X 


animals (e.g., Korbas et al. 2008), mammalian organs 
(e.g., Zhang et al. 2009), sections of whole small ani¬ 
mals (e.g.. Fig. 2b) and sections of human organs such 
as human brain (Popescu et al. 2009). 

Summary 

X-ray fluorescence imaging is an emerging technique 
that can give detailed micro- and macroscopic infor¬ 
mation on the localization of elements within biologi¬ 
cal systems. The method has been successfully applied 
to a range of samples, from single mammalian cells to 
complete sections of human brain, with insights into 
physiology and disease. 

Cross-References 

► X-ray Absorption Spectroscopy of Metals in 
Biology 
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Synonyms 

Structure and elasticity of lipid/water aggregates 

Definition 

X-ray scattering probes positional correlations on 
molecular to supramolecular length scales. Here, we 
focus on its application to lipid membranes. 

Introduction 

Understanding the function of biological membranes is 
intimately coupled to determining their structure and 
their related dynamics. Lipid membranes, that is, bio¬ 
logical membranes devoid of protein and carbohydrate 
networks, are highly useful models in understanding 
certain aspects of biological membranes. Dispersed in 
an aqueous solution, membrane lipids form an array of 
structures, which are highly dependent on their molec¬ 
ular chemistry (shape), as well as on the properties of 
their surrounding solution (e.g., pH, ionic strength, 
hydration, temperature, and pressure). X-ray tech¬ 
niques offer a probe-free and noninvasive toolset to 
study positional correlations of lipid aggregates on 
various length scales. These positional correlations 
may either originate from long-range order in 
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nonlamellar phases (hexagonal, cubic), quasi long- 
range order in lamellar phases, or the short-range 
(fluid-like) order of hydrocarbon chains in the plane 
of the bilayer above the melting transition. 

Starting in the late 1950s, Luzzati and coworkers 
systematically characterized the structures of a number 
of lipid/water systems, culminating in the X-ray study 
by Tardieu et al., in which almost all of the lamellar 
phases known today were identified (Tardieu ef al. 
1973). X-rays were crucial in unraveling the different 
2D crystalline packing of lamellar gel phases 
(i.e., tilted nearest neighbor, next nearest neighbor, 
etc.), and also in understanding the packing of lipid 
head groups on a superlattice in the subgel phase (Pabst 
et al. 2010). Additionally, high photon fluxes available 
at synchrotron radiation sources have enabled 
millisecond time-resolved experiments, such as those 
employing temperature or pressure jumps, thus 
allowing for the elucidation of structural kinetics 
associated with phase transitions. 

Besides solving the structures of lipid/water aggre¬ 
gates, X-rays have been used to probe a membrane’s 
mechanical properties. It follows from a theoretical 
treatment that membrane bending fluctuations result 
in Bragg peaks with a power-law decay (quasi long- 
range order). As a result, analysis of the diffraction 
peak shapes has allowed for the determination of mem¬ 
brane bending fluctuations (i.e., bilayer elasticity) 
(Pabst et al. 2010). When combined with osmotic 
stress measurements, one can effectively probe the 
forces acting between membranes. 

X-Ray Scattering Techniques and Sample 
Geometries 

In principle, lipid membranes can be studied in aligned 
or unaligned (“powder”) geometries. The preparation 
of powder membranes is technically simple and 
involves the hydration of dry lipid. Liposomes 
composed of one or more shells of concentric bilayers 
(e.g., unilamellar, oligolamellar, multilamellar 
vesicles (MLVs), or the nonlamellar inverted hexago¬ 
nal and cubic phases), are by far the most frequently 
studied unaligned systems. Because of their random 
orientation. X-rays scatter isotropically, that is, in con¬ 
centric rings with respect to the straight through X-ray 
beam, as in the case of standard powder diffraction 
experiments (Fig. lA). Fully hydrated unilamellar 


vesicles (ULVs) with no net charge do not interact 
with each other, and exhibit a diffuse modulation of 
the scattered intensity in the small-angle regime orig¬ 
inating from a nonuniform bilayer electron density. 
Additionally, the crystalline order of an MLV sample 
may be so low (e.g., due to electrostatic repulsion) that 
the resultant scattering is diffuse. In other words, scat¬ 
tering by itself is not able to distinguish between ULVs 
and poor “crystalline” membranes (i.e., positionally 
uncorrelated). In all other cases, diffraction patterns 
exhibit Bragg peaks or quasi-Bragg peaks (not instru¬ 
ment resolution limited), whose amplitude and shape 
depend on the extent of the positional correlations. 
From the Bragg peak positions the system’s gross 
morphology (lamellar, hexagonal, cubic, etc.) can be 
determined. However, the peak shapes of membranes 
in the fluid lamellar phase contain additional informa¬ 
tion regarding elastic fluctuations (Pabst et al. 2010). 

It is always advantageous (though not always pos¬ 
sible) to study aligned lipid systems. However, some 
X-ray techniques make use only of aligned systems. 
For example, in the case of X-ray reflectometry and 
grazing incidence scattering, lipid monolayers are 
spread on an aqueous interface allowing for the 
compositional control of the aqueous subphase 
(e.g., addition of lipophilic agents) to study the effects 
of membrane active compounds. For more realistic 
models of biological membranes, bilayers can be 
adsorbed to solid substrates or soft polymer cushions 
either through vesicle rupture or Langmuir-Blodgett/ 
Langmuir-Schaeffer techniques, respectively. Further, 
aligned multihilayer systems can be simply prepared 
by spreading an organic lipid solution on a solid sub¬ 
strate, followed by evaporation of the solvent. The 
signal-to-noise ratio from aligned multibilayers is 
superior to that from single bilayers, with the signal 
manifesting itself (for the most part) as Bragg peaks. 
However, the single most prominent advantage of 
aligned membranes, compared to liposomal disper¬ 
sions, is the unambiguous determination of in-plane 
structural information. Namely, the signal from 
positional correlations that occur in the plane of the 
membrane is not powder-averaged, but occurs along 
the off-equatorial axis, while scattering from posi¬ 
tional correlations along the bilayer normal (out-of- 
plane) occurs along the equatorial axis (Fig. IB). 
This differentiation of scattering information from dif¬ 
ferent parts of the membrane has been fundamental to 
studies of rippled and subgel phase bilayers (Katsaras 
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X-Ray Scattering of Lipid Membranes, Fig. 1 Typical diffraction patterns of unaligned {panel A) and aligned lipid systems 
{panel B) (Adapted from S. Karmakar, PhD Thesis, Jawaharlal Nehru University, India) 


and Raghunathan 2000). In the case of fluid lamellar 
stacks of bilayers, the anisotropic scattering patterns 
allow for the determination of the membrane’s bending 
rigidity and the bulk modulus of membrane interac¬ 
tions (Lyatskaya et al. 2000; Salditt 2005). In compar¬ 
ison, the fluctuation parameter derived from 
corresponding unaligned systems is a convolution of 
the two (i.e., the bending rigidity and the bulk modulus 
of interaction). 

In addition to providing structural information, 
multibilayers are invaluable systems for studying 
forces that act between adjacent membranes, 
which can then be related to fundamental biological 
phenomena, such as cell adhesion. Such studies can 
be performed on both aligned and unaligned 
multibilayers. Interactions are probed by coupling the 
system to an external osmotic pressure that is created 
either through the use of large neutral polymers 
(e.g., polyethylene glycol) in solution, or by control¬ 
ling the relative humidity (RH) environment in which 
the lipid layers reside (Parsegian and Rand 1995). 
Although achieving 100% RH is not trivial (Katsaras 
1998), appropriate humidity chambers have been 
developed so that it is now possible to study 
multibilayers in their fully hydrated state (Pabst et al. 
2010 ). 

Single-Component Membranes 

Unit cell parameters can be derived directly from the 
position of Bragg peaks. Additional structural infor¬ 
mation of fully hydrated lipid/water systems is found 
in the one-dimensional (ID) electron density lipid 
bilayer profile. For MLVs whose X-ray scattering pat¬ 
terns exhibit a number of Bragg reflections, electron 
density profiles can be calculated by standard Fourier 
synthesis techniques using the amplitudes (square root 


of the area) of the Bragg peaks. In cases where there is 
no Bragg scattering, such as for ULVs, or when MLVs 
exhibit less than four distinct diffraction orders, one 
needs to resort to global analysis techniques, which 
include modeling of the diffuse scattering. In the latter 
case, a model for the ID electron density profile needs 
to be formulated. The latest advance in membrane 
structure determination involves the simultaneous 
refinement of neutron and X-ray scattering data 
(Kucerka et al. 2008). A model that is based on volume 
probability distributions, and their spatial conserva¬ 
tion, was derived by molecular dynamics (MD) simu¬ 
lation (Fig. 2). Importantly, results have shown that the 
areas of some lipids have been significantly 
overestimated (Kucerka et al. 2008). 

The lateral area per lipid describes the packing of 
lipid molecules and is needed to accurately describe 
membrane structure, as well as lipid-lipid and lipid- 
protein interactions. Additionally, the area per lipid is 
an important guide, and even an input parameter, for 
MD simulations, depending on whether the 
GROMACS or CHARMM potentials are used. 
Another important structural parameter for lipid bila¬ 
yers is membrane thickness, as it can be directly related 
to the hydrophobic matching of transmembrane pro¬ 
teins. However, because the lipid/water interface is not 
a sharp boundary, a clear definition of membrane 
thickness is not possible. Two definitions are most 
commonly used: The phosphate-phosphate (partial 
headgroup-headgroup) distance is one that closely 
resembles the Luzzati definition of membrane thick¬ 
ness, and can be determined directly from higher 
resolution electron density profiles. Alternatively, the 
steric definition of membrane thickness includes the 
entire (and more diffuse) lipid headgroups. From this, 
it is then easy to see how reported membrane thickness 
values can vary significantly. More meaningful are the 
reports of the relative changes to membrane thickness. 
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X-Ray Scattering of Lipid Membranes, Fig. 2 Determina¬ 
tion of lipid bilayer structure by the simultaneous analysis of 
X-ray and neutron scattering data. Graphs on the left-hand side 
show the fits to experimental X-ray and neutron scattering data 
from ULVs. The right-hand side panels show electron 


densities (ED) (top left) and neutron scattering length 
densities (NSLD) (top right) of the various lipid component 
distributions. The bottom panel shows volume probability dis¬ 
tributions (Figure reproduced from Pabst et al. (2010) with 
permission) 


due, for example, to temperature or the addition of 
membrane active compounds. A parameter closely 
related to membrane thickness is, hydrocarbon thick¬ 
ness, and it provides a measure of the hydrophobic 
portion of lipid bilayers (Nagle and Tristram-Nagle 
2000). This parameter can be directly compared to 
values obtained from deuterium nuclear magnetic res¬ 
onance (NMR) measurements. Finally, it should be 
noted that the number of water molecules per lipid 
can be accurately determined from X-ray experiments 
(Nagle and Tristram-Nagle 2000). 

Multicomponent Membranes 

Cell membranes are complex systems typically com¬ 
posed of hundreds of lipid species and proteins. Recent 
evidence suggests that plasma membrane rafts 
enriched in cholesterol and high melting-temperature 


(Tm) lipids, like sphingomyelin, are involved in diverse 
membrane functions, including protein sorting and cell 
signaling. Significant progress in understanding the 
physicochemical basis of rafts has been made through 
the study of simplified model systems containing two 
or three lipid species - it is now thought that rafts are 
a manifestation of coexisting fluid phases, that is, 
and Lo (liquid-ordered). While the coexistence of -h 
Lq phases in binary mixtures of diacylphosphati- 
dylcholine (PC) and cholesterol has proven controver¬ 
sial, ternary mixtures combining a low- and high-TM 
lipid with cholesterol offer unambiguous proof of 
fluid-fluid coexistence - in many such mixtures, 
micron-sized phase domains are readily imaged with 
fluorescence microscopy. Flowever, fluorescent probes 
can introduce unwanted and potentially strong interac¬ 
tions, including the photo-oxidation of lipids, thereby 
changing the membrane composition, and potentially 
inducing phase separation in otherwise uniformly 
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X-Ray Scattering of Lipid 
Membranes, 

Fig. 3 Comparison of (a) Lp' 
phase, (b) L„ phase, and (c) Lo 
phase diffraction patterns of 
aligned DPPC multibilayers 
(Figure reproduced from Mills 
et al. (2008a) with permission) 
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mixed bilayers. The small size and fleeting nature of 
rafts necessitates the use of experimental techniques 
that are sensitive to molecular length scales and which 
minimize perturbation of the native lipid interactions. 
For these reasons. X-ray scattering has recently 
emerged as an important tool for studying lateral 
phase separation in biomimetic membranes. 

In addition to diacylglycerophospholipids and 
sphingomyelins, all animal cells contain substantial 
amounts of cholesterol. The simplest models of animal 
cell membranes are therefore binary mixtures of PC 
and cholesterol. The addition of cholesterol to either 
gel-phase or fluid-phase PC increases the hydrocarbon 
thickness, and in the case of gel PC bilayers, the wide- 
angle X-ray scattering (WAXS) peak(s) shifts in posi¬ 
tion and broadens in width (Engelman and Rothman 
1972). The position of the WAXS peak reflects the 
average distance between chains in the lattice, while 
its width is inversely related to in-plane positional 
correlations. At high cholesterol (>30 mol%) content, 
WAXS peak position and width are consistent with 
neat fluid-phase bilayers (Engelman and Rothman 
1972). At the same time, NMR measurements indicate 
substantial ordering of the hydrocarbon chains, i.e., 
comparable to gel-phase PC bilayers (Huang et al. 
1993). At cholesterol levels typical of concentrations 
found in mammalian cells (i.e., 10-30 mol%), lateral 
phase separation over a wide range of temperatures has 
been reported using a variety of techniques, including 
X-ray scattering. The conventional criterion for 
establishing phase coexistence with X-ray scattering 
is the presence of more than one set of lamellar {D) 
Bragg reflections or wide-angle {d) spacings. Multiple 
D-spacings arise from a 3D phase separation - that is, 
a lateral separation of phases with different D-spacings 
occurs within single bilayers, and the domains are 
aligned in a manner such that they are positionally 
correlated across adjacent bilayers in the multilamellar 


stack. In contrast, multiple wide-angle peaks are the 
direct result of phase domains in single bilayers, and no 
positional correlation across bilayers is required. 

WAXS from oriented samples offers distinct advan¬ 
tages for examining phase separation. While 
unoriented samples can reveal phase coexistence 
through the criterion of multiple scattering peaks, the 
absence of multiple peaks does not imply the absence 
of phase separation. Eor example, coexisting phases 
with similar D or d may fail to be resolved (it should be 
noted that both Lq and phases exhibit broad WAXS 
peaks at, or near 4.5 A). At the other extreme, large 
differences in D between coexisting phases may intro¬ 
duce a packing frustration that favors anti-alignment of 
phase domains in neighboring bilayers, in which case, 
only a single average D is observed. Alternatively, 
coexisting domains may stack with nearly no distortion 
by forming an inclination angle. Aligned samples pro¬ 
vide additional off-axis scattering that is obscured in 
unoriented samples and from which the distribution of 
chain orientations can be extracted (Eig. 3). Such infor¬ 
mation is particularly important for raft mixtures of 
coexisting Lq and Lc phases, as it is largely chain 
orientational order that distinguishes these phases. In 
oriented fluid-phase bilayers, the angular extent of off- 
axis scattering intensity is related to the distribution of 
chain tilt angles, and the width of this distribution gives 
rise to an X-ray order parameter Sx-ray that is markedly 
different for Lot and Lq phases (Mills et al. 2008a). 
While differing in absolute magnitude from the 
carbon-deuterium (Scd) order parameter derived 
from NMR of chain-perdeuterated lipids, identical 
trends in the NMR and X-ray order parameters have 
nevertheless, been observed for binary mixtures of 
DPPC and cholesterol. 

Applying this analysis to the well-studied ternary 
raft mixture DPPC/DOPC/cholesterol, it was shown 
that two distributions were required to model the 
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scattering intensity for samples in the + Lq coexis¬ 
tence region, whereas a single distribution adequately 
fit the scattering curves at higher temperatures where 
the ordered phase has presumably melted (Mills et al. 
2008b). These findings are in good agreement with 
phase diagrams constructed from NMR and fluores¬ 
cence microscopy data. X-ray scattering has also 
shown potential for establishing the direction of 
a tieline tilt in phase-separated regions of ternary mix¬ 
tures, by matching £)-spacing curves (as a function of 
sample hydration) from samples prepared throughout 
the coexistence region (Uppamoochikkal et al. 2010). 

Membrane Active Compounds 

The active role of membrane lipids in cell processes 
has been largely ignored. There is, however, ample 
evidence that the biophysical properties of lipid 
membranes couple to cellular function. Recently, it 
was reported that the biological activity of 
sphingomyelinase, a key enzyme in sphingolipid sig¬ 
naling, is coupled to changes in lipid structure 
(Boulgaropoulos et al. 2010) (Fig. 4). X-ray experi¬ 
ments of lipid membranes offer insight regarding the 
structural and mechanical changes to the membrane 
induced by other molecules interacting with it. In con¬ 
junction with other techniques. X-rays can provide 
information on the conformation and location of mem¬ 
brane active compounds. In principle, three different 
scenarios can be distinguished: (1) the compound 
adsorbs to, or inserts into the membrane without 
perturbing the membrane’s integrity or morphology; 
(2) the insertion of molecules into the membrane 
results in structural change, for example, lamellar to 
hexagonal; and (3) the interaction leads to a partial 
disintegration of the membrane, for example, pore 
formation or complete lysis (detergent-like effect). 
A unifying description of these scenarios can be 
achieved in terms of the lateral pressure profile given 
by depth-dependent repulsive and attractive forces. In 
the first case, the adsorption or insertion of compounds 
into the membrane may cause a shift of lateral pres¬ 
sures toward the lipid/water interface, or the bilayer 
center, that need not be associated with any structural 
change to the membrane. For example, in the case of 
the amphipathic peptide alamethicin, membrane thin¬ 
ning and a distinct softening of the lipid bilayer were 
observed (Pabst et al. 2010). On the other hand, the 



t (min) 

X-Ray Scattering of Lipid Membranes, Fig. 4 Coupling of 
enzyme activity to membrane properties. Sphingomyelinase 
degrades sphingomyelin (5M) to ceramide. This induces a gel 
phase that coexists with fluid domains. Panel A shows the num¬ 
ber of lipids per gel-phase domain determined from WAXS. 
Panel B shows the hydrolysis of SM determined by thin layer 
chromatography. The insert shows the crystallographic structure 
of bacterial sphingomyelinase (RCSB protein data bank entry 
N57A). Data shows a dramatic change of kinetics when half of 
SM is converted to ceramide. This can be rationalized in terms of 
preferred interactions between these two lipid species 
(Data from Boulgaropoulus et al. (2010)) 


anesthetic drug ketamine did not affect membrane 
structure, although the shifts in lateral pressures were 
calculated to he strong enough to influence the confor¬ 
mational equilibrium of ion channels (Jerabek et al. 
2010). If the changes in lateral pressure profile take 
place asymmetrically, and if the lipids have a high 
propensity of forming nonlamellar phases, inverted 
hexagonal or cubic phases may result (Shearman 
et al. 2006), as was observed, for example, in lipid/ 
peptide interaction studies (Lohner et al. 2008). If on 
the other hand the changes in the lateral pressure dis¬ 
tribution are so large that the membrane forces cannot 
be balanced (i.e., sum of lateral pressures ^ 0), the 
membrane breaks up into micelles (Lohner et al. 
2008), something that is easily observed with X-rays 
(Sevcsik et al. 2008). 
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Summary 

X-ray scattering has proven to be a versatile technique 
capable of addressing a diverse number of scientibc 
questions related to biological membranes. These 
range from different topologies and mechanical prop¬ 
erties of lipid/water aggregates, to complex lipid mix¬ 
tures or interactions with lipophilic agents/drugs. 
However, several scientific challenges remain. One 
such challenge relates to the concept of functional 
domains in biological membranes (i.e., rafts). 
Although X-ray scattering is able to detect phase coex¬ 
istence without resorting to any molecular probe, its 
application to fluid-fluid-phase coexistence problems 
remains presently underdeveloped. The determination 
of tielines in ternary lipid mixtures was certainly one 
important step taken recently in addressing this issue. 
However, more will have to follow in order to establish 
X-ray scattering as a valuable complementary 
technique in the studies of lipid domains. 
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Zero-Mode Waveguides 

Jonas Korlach and Stephen W. Turner 
Pacific Biosciences, Menlo Park, CA, USA 

Synonyms 

Nanometer-scale apertures 

Definition 

Zero-mode waveguides are photonic nanostructures 
that create highly confined optical observation 
volumes, thereby allowing single-molecule-resolved 
biophysical studies at relatively high concentrations 
of fluorescent molecules. 

Basic Characteristics 

Conventional single-molecule optical spectroscopy is 
characterized by observation volumes limited by the 
diffraction limit of light, on the order of ^1 fl (10“^^ 1). 
This restricts the upper concentration limit at which 
individual fluorescent molecules can be distinguished 
to the pico- to nanomolar range. However, most 
physiological processes, enzyme turnovers, and ligand 


interactions occur at higher concentrations of at least 
one participating biomolecule. Zero-mode waveguides 
(ZMWs) are nanostructures which drastically reduce 
the effective optical observation volume beyond the 
diffraction limit, thereby permitting the use of higher 
concentrations of fluorescently tagged molecules 
for single-molecule studies. The name “zero-mode 
waveguide” describes the structure as a metal-clad 
waveguide in the cutoff regime - that is, in the size 
range where the waveguide provides zero guided 
modes for the propagation of light. 

ZMWs consist of holes in an opaque, metallic clad¬ 
ding film deposited on a transparent substrate (Fig. 1). 
Typically, aluminium is used as the metal and fused 
silica as the substrate, although gold and chromium 
have also been described as alternative cladding mate¬ 
rials. ZMWs were first described in 2003 (Levene et al. 
2003) as cylindrical or conical holes with dimensions 
that vary from ^30 to ^300 nm in hole diameter and 
^50 to ^200 nm in metallic film thickness. Rectangu¬ 
lar and c-shaped geometries have also been described 
(Wenger et al. 2005; Fore et al. 2007). 

ZMWs can be fabricated using a variety of tech¬ 
niques, such as electron beam lithography (dry etching 
or metal liftoff), direct writing onto the metal film by 
focused ion beam milling, or by photolithographic 
techniques (Foquet et al. 2008). While all of these 
techniques allow the manufacture of many ZMWs on 
a single substrate, photolithographic methods are par¬ 
ticularly attractive due to the reduced cost of achieving 
a high multiplex factor. 

When light from the visible spectrum is directed 
through the transparent substrate onto ZMWs, the light 
cannot penetrate through the holes, but instead the 
light’s electromagnetic energy decays exponentially 


G.C.K. Roberts (ed.). Encyclopedia of Biophysics, DOI 10.1007/978-3-642-16712-6, 
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Zero-Mode Waveguides 



— Emission 

Illumination 



Zero-Mode Waveguides, Fig. 1 Zero-mode waveguides 
(ZMWs) (a) Basic ZMW geometry and principle of operation. 
Although a relatively high concentration (micromolar range) of 
fluorescent molecules is applied, the extremely small observa¬ 
tion volume provided by the ZMW enables single fluorophore 
detection, in this example real-time DNA synthesis by single 
DNA polymerase molecules using fluorescently labeled nucleo¬ 
tides (Eid et al. (2009)) (b) Scanning electron micrograph of the 
top side and (c) transmission electron micrograph of the center 
cross section of a ZMW. Scale bars = 100 nm (Reprinted from 
Korlach et al. 2010) 


as it enters the ZMW. Incidentally, the same physical 
principle that governs this behavior occurs in the 
metallic screen of a microwave oven door. 
The door’s metallic screen is perforated with holes 
that are much smaller than the wavelength of the 
microwaves. Because of their relative size, the holes 
prevent the much longer microwaves from passing 
through. For ZMWs, this same principle is applied at 
the nanoscale, with visible light instead of microwave 
radiation. The wavelength cutoff above which 
ZMWs with hole diameter d do not propagate optical 
modes is given by waveguide theory as Zc = IJd. 
Instead of propagating through the ZMW, light with 
wavelengths greater than is attenuated to form an 


evanescent field at the bottom of the ZMW. In this 
respect, the ZMWs differ fundamentally from the con¬ 
finement that is afforded by nanoscale apertures, for 
which objects or analytes are imaged beyond the exit of 
the aperture, and which exploit the z~ power-law 
decay of field strength that occurs in this space. The 
ZMW principle relies unavoidably on the finite length 
of an opening, as opposed to an aperture which as 
a theoretical construct in optics has zero thickness. 
For this reason, the term “ZMW” is preferred to “nano¬ 
scale aperture” when observation is confined in the 
exponential decay region of the breach rather than 
the z~° decay region outside it. The effective observa¬ 
tion volume created within an illuminated ZMW is on 
the order of ^100 zeptoliters (100 x 10”^* 1), and is 
a function of the incident wavelength and hole 
diameter. 

The presence of the metallic cladding affects 
the behavior of fluorescent molecules in ZMWs 
(Rigneault et al. 2005). Fluorescence emission rates 
can be increased by local enhancements of the excita¬ 
tion intensity inside the ZMW cavity. In addition, the 
proximity of the metal reduces the molecular excited 
state lifetime. Both effects contribute to improved 
signal-to-noise ratios for single-molecule detection. 
Surface plasmon resonances can act beneficially to 
further enhance these effects, exemplified by the 
observation that ZMWs fabricated from gold 
cladding exhibited significantly higher fluorescence 
enhancement factors compared to aluminum ZMWs 
(Gerard et al. 2008). 

The extremely small physical size of the observa¬ 
tion volume created by ZMWs additionally facilitates 
improved temporal resolution of biological events of 
interest against a background of diffusion events. 
Diffusion in the ZMW-confined nanoscale volume 
occurs on very fast time scales - within only a few 
microseconds, labeled molecules travel into the ZMW, 
reach the observation volume, diffuse back up and exit 
the hole. In contrast, enzymatic turnover events or 
macromolecular binding events commonly have 
durations in the millisecond timescale, at least 2-3 
orders of magnitude longer. The improved temporal 
discrimination facilitates the detection of relatively 
weak biomolecular interactions and of rapid kinetic 
processes. 

ZMWs have been used in a variety of research 
studies, including freely diffusing fluorescent mole¬ 
cules in conjunction with fluorescence correlation 
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spectroscopy, biomolecular interactions with an 
adsorbed ligand or enzyme, protein oligomerization 
dynamics, and diffusion analysis of lipids and recep¬ 
tors in model and cell membranes (reviewed in Moran- 
Mirabal and Craighead (2008)). Improved surface 
chemistries have allowed the selective and specihc 
immobilization of biomolecules to just the bottom 
surface of ZMWs (Korlach et al. 2010). Recent studies 
have included combining ZMW confinement with 
a continuous flow mixer for rapid kinetic measure¬ 
ments (Liao et al. 2008), real-time detection of trans¬ 
lation elongation by single ribosomes (Uemura et al. 
2010), protein-protein interactions of chaperonin 
GroEL and cochaperonin GroES (Sameshima et al. 
2010), and the real-time observation of DNA polymer¬ 
ase with base-pair resolution (Eid et al. 2009; Korlach 
et al. 2010). The last application is being commercial¬ 
ized by Pacific Biosciences for single-molecule, 
real-time (SMRT’'''^) DNA sequencing. 

Cross-References 

► DNA Polymerase 

► Eluorescence Correlation Spectroscopy 

► Single-Molecule Methods 

► Single-Molecule Sequencing 
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Zinc-Dependent Metalloenzymes - 
Computational Studies 

Marc Willem van der Kamp 

Centre for Computational Chemistry, School of 

Chemistry, University of Bristol, Bristol, UK 

Definition 

A broad group of enzymes that require one or more 
zinc ions for their catalytic activity. 

Basic Characteristics 

Metalloenzymes that require zinc for their catalytic 
activity include many different enzymes. Computa¬ 
tional molecular modeling has proved very useful in 
understanding the mechanisms and other aspects 
of zinc-dependent metalloenzymes (Mujika et al. 
2009). These have included modeling studies with 
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quantum-mechanical (QM) electronic structure calcu¬ 
lations (e.g., on small “cluster” models of the zinc- 
hinding sites), combined ► QM/MM methods and 

► empirical valence bond (EVB) methods. Examples 
include: Class B ► P-lactamases (Park et al. 2005; Xu 
et al. 2007; Wu et al. 2010b); carbonic anhydrase 
(Aqvist et al. 1993; Riccardi et al. 2010) (also 
a popular test system for methods for reaction model¬ 
ing in enzymes); histone deacetylase (Wu et al. 2010a); 
and ► carboxypeptidase A (Kilshtain and Warshel 
2009; Szeto et al. 2009). An example of a quantum- 
mechanical cluster model calculation is a study of the 
reaction in the trinuclear zinc-dependent phospholi¬ 
pase C (Liao et al. 2010). 

Cross-References 

► Beta-Lactamases: Computational Studies 

► Famesyltransferase - Computational Studies 

► QM/MM Methods 

► Quantum Chemical Cluster Approach to Modeling 
Enzyme-Catalyzed Reactions 
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ZZ-Exchange 

Anthony Mittermaier 

McGill University, Montreal, QC, Canada 


Synonyms 

EXSY (exchange spectroscopy) 


Definition 

The ZZ-exchange NMR experiment is used to measure 
the kinetics of interconversion between different 
molecular states, such as between the folded and 
unfolded, or ligand-free and ligand-bound forms of 
a protein. This method is based on multidimensional 
NMR chemical shift correlation experiments, with 
the addition of a mixing delay between the indirect 
and direct detection periods. During the mixing delay, 
magnetization is aligned along the z-axis, parallel to 
the static magnetic field, hence the name of the 
experiment. For example, a protein may exist in two 
exchanging states, such that a given nucleus resonates 
at a frequency coa in state A and cOb in state B. Pro¬ 
vided that the exchange rate is slower than the coales¬ 
cence point (i.e., states A and B give rise to separate 
peaks in NMR spectra), a two-dimensional ZZ- 
exchange spectrum will contain two auto-peaks at 
(coa> ®a) and (cOb; ®b) in the (Fi, F 2 ) frequency dimen¬ 
sions. In addition, A ^ B and B —> A conversions 
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during the mixing delay will lead to the appearance of 
two cross-peaks located at (cda, cOb) and (cOb, 
respectively. When the mixing delay is short, the 
auto-peaks are intense and the cross-peaks are weak 
or absent. When the system is allowed to exchange 
during the mixing delay for a greater period of time, 
the intensities of the cross-peaks increase. When even 
longer delays are used, the intensities of both cross¬ 
peaks and auto-peaks are weak due to longitudinal spin 
relaxation. In order to obtain kinetic information, 
a series of ZZ-exchange spectra are recorded with 
a range of mixing delays. The dependences of the 
peak intensities on the length of the mixing delay are 
then analyzed to yield the first-order rate constants kAB 
and kflA and the longitudinal spin relaxation rates of 
the two states, Ria and Rib. The simplest ZZ- 
exchange experiment is based on homonuclear 
NOESY, such that the same nucleus is detected in the 
Fi and F 2 dimensions. This has the disadvantage that 
all auto-peaks are crowded along the diagonal of the 


spectmm. The problem can be resolved by adapting 
heteronuclear correlation experiments. For example, 
one of the most commonly used ZZ-exchange 
experiments for protein applications is a variant of 
a HSQC, and yields auto-peaks located at 

(coa^^'^, Ca*^) and (cob*^^, and cross-peaks 

located at ((Da*^^, and tOA'^)- 

Cross-References 
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